BLASTX nr result
ID: Akebia22_contig00012194
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia22_contig00012194 (1930 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr... 485 e-134 ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255... 464 e-128 ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608... 451 e-125 ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204... 453 e-124 ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma... 453 e-124 ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm... 438 e-120 ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310... 427 e-117 emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] 426 e-116 ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu... 423 e-115 gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] 412 e-112 ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600... 402 e-109 ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma... 389 e-105 ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [... 389 e-105 ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab... 387 e-105 ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261... 387 e-104 ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma... 386 e-104 ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun... 385 e-104 ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ... 382 e-103 ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phas... 379 e-102 ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutr... 374 e-101 >ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910083|ref|XP_006447355.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910085|ref|XP_006447356.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910087|ref|XP_006447357.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|568831767|ref|XP_006470130.1| PREDICTED: uncharacterized protein LOC102608093 isoform X1 [Citrus sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED: uncharacterized protein LOC102608093 isoform X2 [Citrus sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED: uncharacterized protein LOC102608093 isoform X3 [Citrus sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED: uncharacterized protein LOC102608093 isoform X4 [Citrus sinensis] gi|557549965|gb|ESR60594.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549966|gb|ESR60595.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549967|gb|ESR60596.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549968|gb|ESR60597.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] Length = 523 Score = 485 bits (1248), Expect = e-134 Identities = 268/526 (50%), Positives = 336/526 (63%), Gaps = 36/526 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA RRELGFPK ++LR+Q AR TL VR +GH VE+REDG FIFFC C APCYSD Sbjct: 1 MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF N E++KQ + S+ + S Sbjct: 61 LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120 Query: 1486 NSNGNDNVALERVGDNENLDSHK----------CATVTIEKSLNGENCNMVIPGVLCKDV 1337 N + N+A+ + G++ ++ ++ C T + + E+C+ VIPGV KD Sbjct: 121 YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180 Query: 1336 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVT 1157 I L VRFIG G+IAAR+ + E +I+RIWC WLGK DP ED ++ HDF IVT Sbjct: 181 IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGK-KDPE-DEDIVEIPDHDFAIVT 238 Query: 1156 FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1004 F YNY LGR+ L DD+ + +NGEG RK RKKSFSDPED+S+S Sbjct: 239 FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297 Query: 1003 -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 875 L+L +YGD+ HA R+A+ER+CDIC+ KILP+ Sbjct: 298 GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357 Query: 874 KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR 695 KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ K S+R Sbjct: 358 KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417 Query: 694 KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 515 KN SKR + D + I Q SS+ CPECQG+G+N+E +LE+PTI ++F Y IK + Sbjct: 418 KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476 Query: 514 EACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377 +A AWMK+PE LQN STG FP SEE QEKV PLKLLHFY A+ Sbjct: 477 DARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSAE 522 >ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera] Length = 520 Score = 464 bits (1194), Expect = e-128 Identities = 271/532 (50%), Positives = 337/532 (63%), Gaps = 41/532 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA R ELGF K +LR+Q AR TLR VR++GH VE+REDG FIFFC C APCYS+ Sbjct: 1 MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLS---GSSLIVVN 1496 S L+DHLKGNLH ERYAAAK+TLL S+PWPFNDGVLF N E DK LS G+ ++ Sbjct: 61 SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120 Query: 1495 SGKNSNGNDNVALERVGD------NENLDSHK-----CATVTIEKSLN--GENCNMVIPG 1355 + KN N N+A+ GD N +++ H C +SLN G NC+M+IPG Sbjct: 121 THKNDN---NLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPG 177 Query: 1354 VLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGH 1175 V+ KD ++ LEVRF+GFG+IAAR E + K I++IWC W GK +P E + H Sbjct: 178 VMIKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKE-EPGDGETVM-VPDH 235 Query: 1174 DFGIVTFSYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL-- 1001 DF +VTF+Y+Y LGR+ L D L EG RK RKKSFSDPEDIS+SL Sbjct: 236 DFAVVTFNYHYNLGRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSN 289 Query: 1000 -------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDICKH 890 +L +Y D+ +R S + VA+ER+CDIC+H Sbjct: 290 QYDSSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQH 349 Query: 889 KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGT 710 K+LP KDV+TL+NMKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL K Sbjct: 350 KMLPGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLR 409 Query: 709 HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 530 S+RK+ SK + + + T Q SV CPECQG+GI +ED +LE P IP E+F Y Sbjct: 410 RSSRRKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKY 468 Query: 529 NIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374 IK ++A AWMK+PE L++ STG FP S ET+QEKV LKLLHFY ADE Sbjct: 469 KIKVSDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520 >ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus sinensis] Length = 508 Score = 451 bits (1159), Expect(2) = e-125 Identities = 250/499 (50%), Positives = 317/499 (63%), Gaps = 36/499 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA RRELGFPK ++LR+Q AR TL VR +GH VE+REDG FIFFC C APCYSD Sbjct: 1 MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF N E++KQ + S+ + S Sbjct: 61 LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120 Query: 1486 NSNGNDNVALERVGDNENLDSHK----------CATVTIEKSLNGENCNMVIPGVLCKDV 1337 N + N+A+ + G++ ++ ++ C T + + E+C+ VIPGV KD Sbjct: 121 YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180 Query: 1336 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVT 1157 I L VRFIG G+IAAR+ + E +I+RIWC WLGK DP ED ++ HDF IVT Sbjct: 181 IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGK-KDPE-DEDIVEIPDHDFAIVT 238 Query: 1156 FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1004 F YNY LGR+ L DD+ + +NGEG RK RKKSFSDPED+S+S Sbjct: 239 FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297 Query: 1003 -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 875 L+L +YGD+ HA R+A+ER+CDIC+ KILP+ Sbjct: 298 GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357 Query: 874 KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR 695 KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ K S+R Sbjct: 358 KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417 Query: 694 KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 515 KN SKR + D + I Q SS+ CPECQG+G+N+E +LE+PTI ++F Y IK + Sbjct: 418 KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476 Query: 514 EACLAWMKDPELLQNRSTG 458 +A AWMK+PE LQN STG Sbjct: 477 DARKAWMKNPEALQNCSTG 495 Score = 25.8 bits (55), Expect(2) = e-125 Identities = 10/13 (76%), Positives = 13/13 (100%) Frame = -3 Query: 425 SGKGIASKVASFL 387 +GKG+ASK+ASFL Sbjct: 494 TGKGVASKIASFL 506 >ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus] gi|449475785|ref|XP_004154550.1| PREDICTED: uncharacterized LOC101204451 [Cucumis sativus] Length = 525 Score = 453 bits (1166), Expect = e-124 Identities = 254/530 (47%), Positives = 319/530 (60%), Gaps = 41/530 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA R ELGFPK Y+LR+Q AR LR VR +GH VE+RE+G FIFFC C APCYSD Sbjct: 1 MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523 S LF HLKG LH ER +AAKLTLLG NPWPF+DGVLF + E D Q+ Sbjct: 61 SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120 Query: 1522 ---SGSSLIVVNSGKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGV 1352 + ++L +V NS GN N E G+ N++ C+ + GE+C +VIPGV Sbjct: 121 YNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVED--CSFENLND--GGESCPLVIPGV 176 Query: 1351 LCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHD 1172 L K+ IS ++VR +G+G+IAAR E I ++RIWC WLGK D E+ K+ H+ Sbjct: 177 LIKEEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGI--ENMVKVPEHN 234 Query: 1171 FGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPED------- 1016 + I+TF+YN LGR+ LDD+ E N E R+ +RKKSFSDPED Sbjct: 235 YAIITFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDE-NRQVKRKKSFSDPEDGSLSMSP 293 Query: 1015 --------------ISKSLVLGQYGDESRHAI----SXXXXXXXXXXRVASERICDICKH 890 + SL L Y D+ R+A+ER+CDIC+ Sbjct: 294 QYDSSGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQ 353 Query: 889 KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGT 710 KIL KDV+TLLNMKTGRLACSSRNVNG FH+FHTSCLIHWILLC++EI L K Sbjct: 354 KILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVR 413 Query: 709 HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 530 +RK +K ++ + + + R K Q SV CP CQG+GI ++ LE+PT+P EIF Y Sbjct: 414 RRYRRKKKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKY 473 Query: 529 NIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRA 380 IK ++A AWMK PE+LQN STG +FPY +ETIQE V PLKLLHFY A Sbjct: 474 KIKVSDARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523 >ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508707510|gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 517 Score = 453 bits (1165), Expect = e-124 Identities = 252/528 (47%), Positives = 327/528 (61%), Gaps = 34/528 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAERRELG P+ +L++Q AR TL VR +GH +E+REDG FIFFC C APCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 S L DHLKG+LH R AAAK+TLLG+NPWPFNDGVLF E++K+L+G Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111 Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322 +GN N LE +++NL + + NC +++IPGVL KD IS L+ Sbjct: 112 --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169 Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142 VRFIGFG+IAAR E + +I+RIWC WLGK + ++D K H F +VTF YN Sbjct: 170 VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227 Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001 LGR+ LDD+ ++NG+ RK RKKSFSDPEDIS+SL Sbjct: 228 DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286 Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869 L +Y D+ S AI +A+ER+CDIC+ K+LPEKD Sbjct: 287 ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343 Query: 868 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689 V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E N N K S+RKN Sbjct: 344 VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403 Query: 688 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509 +K +++ + + + T SSV+CPECQG+GI+VE +LE+P + ++F Y IK ++A Sbjct: 404 GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDA 463 Query: 508 CLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE*KS 365 AWMK PE+L+N STG F S E +QEK+LPLKLLHFY AD+ +S Sbjct: 464 RRAWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADKYES 511 >ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis] gi|223542914|gb|EEF44450.1| conserved hypothetical protein [Ricinus communis] Length = 509 Score = 438 bits (1127), Expect = e-120 Identities = 246/519 (47%), Positives = 315/519 (60%), Gaps = 28/519 (5%) Frame = -1 Query: 1846 MAERRELGFPK-GGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYS 1670 MA R ELGF K GG +L++Q AR TL VR +GH VE+REDG FIFFC C APCYS Sbjct: 1 MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60 Query: 1669 DSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSG 1490 D+ LFDHLKGNLH ER + A LTLL NPWPF+DGV F E +KQL +I ++ Sbjct: 61 DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQL----VIKNDNE 116 Query: 1489 KNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRFI 1310 NGN ++A+ + G + + + + NG +++I GVL KD IS L+ RF+ Sbjct: 117 SRGNGNSSLAIVKYGGSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQARFM 176 Query: 1309 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGR 1130 G+G I AR+ E I+RIWC WLGK + D +K+ H+F +VTF+YNY LGR Sbjct: 177 GYGRIGARLIEKDGNSNDISRIWCEWLGK--NTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234 Query: 1129 R-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS----------------- 1004 + LDD+ E DN G RK RKKSFSDPED+S+S Sbjct: 235 KGLLDDVKLLLSSSPVQESDNQGGTNRK-RKKSFSDPEDVSESFSNQYDSSGEESLTSIG 293 Query: 1003 -----LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPEKDVSTLLN 851 L+L ++ D+ H+ +A+ER+CDIC+ KILPEKDV+TL+N Sbjct: 294 GPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVN 353 Query: 850 MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSE 671 M TG+LACSSRN G +H+FHTSCLIHWILL ++E+ NQ + KG S+RKN +K S Sbjct: 354 MNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKNGTKSSH 413 Query: 670 ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 491 + +K + Q SSV CPECQG+G +E + E PTIP E+F Y IK + AWMK Sbjct: 414 V---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDGRRAWMK 470 Query: 490 DPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374 PE+L+N S G FP SE +Q KVLPLKLLHFYRADE Sbjct: 471 SPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509 >ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca subsp. vesca] Length = 525 Score = 427 bits (1099), Expect = e-117 Identities = 249/534 (46%), Positives = 317/534 (59%), Gaps = 43/534 (8%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA R ++G PK +LR+Q R LR VR +GH VEVREDG FIFFC C APCYSD Sbjct: 1 MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523 LFDHLKGNLH ER AAAK+TLL NPWPFNDGV+F N E DK + Sbjct: 61 KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120 Query: 1522 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIP 1358 + ++L +V G N +NG D+ ++ + NE +D + + + +G ++VIP Sbjct: 121 SHDNENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSVVIP 180 Query: 1357 GVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTG 1178 G++ +D I+ LEVR +G GEIAAR I RIWC WLG S ED + Sbjct: 181 GIVVRDEITDLEVREVGLGEIAARFLGK----DGIGRIWCEWLGVKSIDS--EDLCNVPE 234 Query: 1177 HDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL 1001 HDF +VTFSYN LGR+ LDD+ +E NGEG K RKKSFSDPEDIS SL Sbjct: 235 HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCK-RKKSFSDPEDISDSL 293 Query: 1000 ---------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDIC 896 +L Y D+ +R ++ + +AS R+CDIC Sbjct: 294 SNQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDIC 353 Query: 895 KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLK 716 + ++LP KDV+TL+N+KTG+LACSSRNVNGAFH+FHTSCLIHWILLC+ E+ TNQ K Sbjct: 354 QQRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQNTGSK 413 Query: 715 GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 536 S+RK +K + + + + PQ SV CPECQG+GI V+ LE+P +P ++F Sbjct: 414 ARRRSRRKTAAKCNG--KDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMF 471 Query: 535 NYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374 Y IK ++A AWMK PE+LQN STG FP + IQEKV LKLL FYRA E Sbjct: 472 RYKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525 >emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] Length = 896 Score = 426 bits (1096), Expect = e-116 Identities = 250/501 (49%), Positives = 315/501 (62%), Gaps = 41/501 (8%) Frame = -1 Query: 1801 NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALFDHLKGNLHKER 1622 +LR+Q AR TLR VR++GH VE+REDG FIFFC C APCYS+S L+DHLKGNLH ER Sbjct: 352 SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411 Query: 1621 YAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLS---GSSLIVVNSGKNSNGNDNVALER 1451 YAAAK+TLL S+PWPFNDGVLF N E DK LS G+ ++ + KN N N+A+ Sbjct: 412 YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKNDN---NLAIVC 468 Query: 1450 VGD------NENLDSHK-----CATVTIEKSLN--GENCNMVIPGVLCKDVISSLEVRFI 1310 GD N +++ H C +SLN G NC+M+IPGV+ KD ++ LEVRF+ Sbjct: 469 HGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFL 528 Query: 1309 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGR 1130 GFG+IAAR E + K I++IWC W GK +P E + HDF +VTF+Y+Y LGR Sbjct: 529 GFGQIAARFFEKDGVSKGISKIWCEWFGKE-EPGDGETVM-VPDHDFAVVTFNYHYNLGR 586 Query: 1129 RTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL----------------- 1001 + L D L EG RK RKKSFSDPEDIS+SL Sbjct: 587 KGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSNQYDSSGEDSLISNSP 640 Query: 1000 ----VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDICKHKILPEKDVSTLLNMK 845 +L +Y D+ +R S + VA+ER+CDIC+HK+LP KDV+TL NMK Sbjct: 641 SPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNMK 700 Query: 844 TGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEIL 665 TG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL K S+RK+ SK + Sbjct: 701 TGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNGKG 760 Query: 664 MNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKDP 485 + + T Q SV CPECQG+GI +ED +LE P IP E+F Y IK ++A AWMK+P Sbjct: 761 KDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKYKIKVSDAHRAWMKNP 819 Query: 484 ELLQNRSTGLRFPYNSEETIQ 422 E L++ STG FP S ET+Q Sbjct: 820 EELKHCSTGFNFPSQSGETVQ 840 >ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa] gi|550325787|gb|EEE95821.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa] Length = 513 Score = 423 bits (1087), Expect = e-115 Identities = 232/525 (44%), Positives = 317/525 (60%), Gaps = 34/525 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA RE+GFPK +LR+Q AR TL +VR GH +E+REDG FIFFC C +PCYSD Sbjct: 1 MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 + L DHL+GNLH ER +AAK TLL NPWPF+DG+ F ++QL+ + GK Sbjct: 61 TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLA------IKDGK 114 Query: 1486 NSN-------GNDNVALERVGDNENLDSHKCATVTIEKSLNG--ENCNMVIPGVLCKDVI 1334 S+ +DN+A+ + +N C TV ++++L+G E ++VIP V K+ + Sbjct: 115 ESSRFLKFEENSDNLAIVKYVENLKPG---CDTV-VDENLSGSDEGSDLVIPSVRLKEEV 170 Query: 1333 SSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTF 1154 S L+ +G G+IAAR++E + +I+RIWC WLGK S ED K+ HDFG+VTF Sbjct: 171 SDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGKKS--SNDEDKVKVLDHDFGVVTF 228 Query: 1153 SYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL--------- 1001 +Y+Y LG+ L D + + + +RK+S S+PED+S+SL Sbjct: 229 AYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEE 288 Query: 1000 ------------VLGQYGDESRH----AISXXXXXXXXXXRVASERICDICKHKILPEKD 869 VL +Y D+ H + R+A+E++CDIC+ K+LPEKD Sbjct: 289 ESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKD 348 Query: 868 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689 V+TL N KTG+LACSSRNV GAFH+FHTSCLIHWIL C+FEI NQ + KG S++KN Sbjct: 349 VATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKKN 408 Query: 688 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509 +K + + + SV CP+CQG+G+N+E + E+P P E+F Y IK +E Sbjct: 409 GTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSEG 468 Query: 508 CLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374 WMK+PE+L+N STG FP S E +QEKVLPLKLLHFYR +E Sbjct: 469 HRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513 >gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] Length = 638 Score = 412 bits (1060), Expect = e-112 Identities = 245/539 (45%), Positives = 318/539 (58%), Gaps = 54/539 (10%) Frame = -1 Query: 1846 MAERRELGFPKGGVY--------NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIW 1691 MA R LGFPK +L+ Q R LR VR +GH VE+REDG IFFC Sbjct: 1 MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60 Query: 1690 CRAPCYSDSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEED------- 1532 C APCYSD LFDHLKGNLH +R + AK+TLLG NPWPFNDGV+F N E D Sbjct: 61 CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120 Query: 1531 --------KQLSGSSLIVVNSGKN--SNGNDNVALERVG-DNENLDSHKCATVTIEKSLN 1385 Q S ++L +V G+N S N ++ ++ +G NEN DS + + Sbjct: 121 GNQSRLLESQDSENNLAIVTYGENLESCANGHIMVDELGHQNENPDSAG------NLAGS 174 Query: 1384 GENCNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSF 1205 GENC ++IPGV D I+++EVR +G+G I+ R E + I+RIWC WLGK Sbjct: 175 GENCAVLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIED- 233 Query: 1204 HEDASKLTGHDFGIVTFSYN-YTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSF 1031 ED K+ HDF IVTFSYN ++LGR L DD+ E+ NG+ RK R+KSF Sbjct: 234 -EDFLKVPEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRK-RRKSF 291 Query: 1030 SDPEDISK-------------------SLVLGQYGDE-------SRHAISXXXXXXXXXX 929 SDPED S+ SL+L QY D+ S AI Sbjct: 292 SDPEDSSENLSNQYDSCGEDSSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQR-- 349 Query: 928 RVASERICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDF 749 +A+ER+CDIC+HK+LP KDV+TL+N+KTGRLACSSRN NGAFHLFHTSCLIHW+LLC+ Sbjct: 350 -IAAERMCDICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEV 408 Query: 748 EIWTNQLDNLKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQL 569 E TNQ + K S+RK SK +E+L + + + + + VICPECQG+G + DG+ Sbjct: 409 EKCTNQSEAPKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMI-DGED 467 Query: 568 EEPTIPPFEIFNYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLH 392 E+PT+P ++F Y IK ++A AWMK PE+L N STG FP +EETIQ ++ + +H Sbjct: 468 EKPTVPLSKMFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAEETIQVHLVYIAEIH 526 >ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum] Length = 521 Score = 402 bits (1032), Expect = e-109 Identities = 238/527 (45%), Positives = 310/527 (58%), Gaps = 41/527 (7%) Frame = -1 Query: 1834 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1655 R+L FP+ NL++Q R TL+ VR +GHI VE+REDG +FFC C +PCYSDS LF Sbjct: 4 RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63 Query: 1654 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGKNS-- 1481 +HLKGNLH E AAAK TLL NPWPFNDGVLF +N E+DK VN GK+ Sbjct: 64 NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKHSPN-----VNVGKSRLV 117 Query: 1480 ----NGNDNVALERVGDN--ENLDSH----KCATVTIEKSLNGENCNMVIPGVLCKDVIS 1331 ++A+ DN N D++ + + E + NGE+ +VIPGVLCKD +S Sbjct: 118 DTCLEDESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDELS 177 Query: 1330 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFS 1151 LEV+ IG G+IAARI KKI RIWC WL K D S + HDF +VTF Sbjct: 178 DLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDM--DTSVVPDHDFAVVTFP 235 Query: 1150 YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1001 YNY LGR+ L D E + G R+++RK SFSDPED S+SL Sbjct: 236 YNYNLGRKPLLDDRFLLPSSPYSESEETSGTRKRKRK-SFSDPEDFSESLSNHCDSSGEE 294 Query: 1000 -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 866 +LG D+ S IS + VASER+CDIC+ K+LP KDV Sbjct: 295 SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 354 Query: 865 STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NLKGTHGSK 698 +TLL+ K+G+L CSSRN+ GAFHLFH SCLIHWIL C+ + + +D K SK Sbjct: 355 ATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSK 414 Query: 697 RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 518 RK +K + D+ + + + +SV CPECQG+GI +E +LE+P + E++ + IK Sbjct: 415 RKTGTKHNAKEKEDEIKSAR-RINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKL 473 Query: 517 NEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377 ++A AWMK+PE+LQN STG P ++ +QE V PLKLLHFYRA+ Sbjct: 474 SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520 >ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508707513|gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 478 Score = 389 bits (998), Expect = e-105 Identities = 220/481 (45%), Positives = 289/481 (60%), Gaps = 34/481 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAERRELG P+ +L++Q AR TL VR +GH +E+REDG FIFFC C APCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 S L DHLKG+LH R AAAK+TLLG+NPWPFNDGVLF E++K+L+G Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111 Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322 +GN N LE +++NL + + NC +++IPGVL KD IS L+ Sbjct: 112 --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169 Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142 VRFIGFG+IAAR E + +I+RIWC WLGK + ++D K H F +VTF YN Sbjct: 170 VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227 Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001 LGR+ LDD+ ++NG+ RK RKKSFSDPEDIS+SL Sbjct: 228 DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286 Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869 L +Y D+ S AI +A+ER+CDIC+ K+LPEKD Sbjct: 287 ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343 Query: 868 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689 V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E N N K S+RKN Sbjct: 344 VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403 Query: 688 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509 +K +++ + + + T SSV+CPECQG+GI+VE +LE+P + ++ ++K Sbjct: 404 GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463 Query: 508 C 506 C Sbjct: 464 C 464 >ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508707511|gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 481 Score = 389 bits (998), Expect = e-105 Identities = 220/481 (45%), Positives = 289/481 (60%), Gaps = 34/481 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAERRELG P+ +L++Q AR TL VR +GH +E+REDG FIFFC C APCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 S L DHLKG+LH R AAAK+TLLG+NPWPFNDGVLF E++K+L+G Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111 Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322 +GN N LE +++NL + + NC +++IPGVL KD IS L+ Sbjct: 112 --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169 Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142 VRFIGFG+IAAR E + +I+RIWC WLGK + ++D K H F +VTF YN Sbjct: 170 VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227 Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001 LGR+ LDD+ ++NG+ RK RKKSFSDPEDIS+SL Sbjct: 228 DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286 Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869 L +Y D+ S AI +A+ER+CDIC+ K+LPEKD Sbjct: 287 ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343 Query: 868 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689 V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E N N K S+RKN Sbjct: 344 VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403 Query: 688 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509 +K +++ + + + T SSV+CPECQG+GI+VE +LE+P + ++ ++K Sbjct: 404 GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463 Query: 508 C 506 C Sbjct: 464 C 464 >ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] Length = 517 Score = 387 bits (995), Expect = e-105 Identities = 226/525 (43%), Positives = 311/525 (59%), Gaps = 35/525 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAE++ELG PK + NL++Q AR TL+ +RL+GH +E+REDG F+FFC C APCYSD Sbjct: 1 MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQ---LSGSSLIVVN 1496 + L HL GNLHKER A A+LTLLG+NPWPF+DGVLF + E+++ +SG + + Sbjct: 60 TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119 Query: 1495 SGKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVR 1316 G S+ +D A+ + +N+ ++ A VT ++ + + +++I GVL K+ +E + Sbjct: 120 LGHCSD-DDRFAIVKYDNNKANGGNQPAAVTDDEPSHSTD-DLLISGVLIKERTLDVEAK 177 Query: 1315 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTL 1136 FIGFG IAAR+ E+ I+++WC WLG G PS E A+ + HDF IVTFSY Y L Sbjct: 178 FIGFGRIAARLFETKGRTTWIDKLWCEWLGDEG-PSDEEKAT-IPEHDFAIVTFSYFYNL 235 Query: 1135 GRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQY 986 GR L D E NGE RK RKKSFSDPED S+SL G Sbjct: 236 GRLGLLDDPSRLLTTSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHN 294 Query: 985 GDESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLN 851 + SR I+ V SERIC++CK K+LP KD + +LN Sbjct: 295 SNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILN 354 Query: 850 MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR--KNVSKR 677 MKTG LAC SRN+ GAFHLFH SC++HW L C+ EI N++ + K G KR K+ S + Sbjct: 355 MKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGK---GKKRCTKHSSGQ 411 Query: 676 SEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAW 497 + + N+ Q SV CPECQG+GIN+E G +E T P + + + +K +E AW Sbjct: 412 TGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRKAW 471 Query: 496 MKDPELLQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 377 +K+PE L+N STG FP ++E+ Q E+V +KL+ FYR + Sbjct: 472 VKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYRVE 516 >ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum lycopersicum] Length = 526 Score = 387 bits (993), Expect = e-104 Identities = 235/527 (44%), Positives = 304/527 (57%), Gaps = 41/527 (7%) Frame = -1 Query: 1834 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1655 ++L P+ NL++Q R TL+ VR +GHI VE+REDG IFFC C +PCYSDS LF Sbjct: 4 KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63 Query: 1654 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGKNS-- 1481 +HLKGNLH E AAAK TLL NPWPFNDGVLF +N E+DKQ S VN GK+ Sbjct: 64 NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKQDKQSPN--VNVGKSRLV 120 Query: 1480 ----NGNDNVALERVGDN--ENLDSH----KCATVTIEKSLNGENCNMVIPGVLCKDVIS 1331 +VA+ DN N D++ + + E N E+ +VIPGVLCKD +S Sbjct: 121 DTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCKDELS 180 Query: 1330 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFS 1151 LEV+ IG G+IAARI K I RIWC WL K D S + HDF +VTF Sbjct: 181 DLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDM--DTSVVPDHDFAVVTFP 238 Query: 1150 YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1001 YNY LGR L D E + K+++KSFSDPED S+SL Sbjct: 239 YNYNLGRSPLLDDRFLLPSSPYSESEE-TSVTGKRKRKSFSDPEDFSESLSNHCDSSGEE 297 Query: 1000 -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 866 +LG D+ S IS + VASER+CDIC+ K+LP KDV Sbjct: 298 SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 357 Query: 865 STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NLKGTHGSK 698 +TLL+ K+G+L CSSRN++GAFHLFH SCLIHWIL C+ + +D K SK Sbjct: 358 ATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSK 417 Query: 697 RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 518 +K +K + D+ + + + +SV CPECQG+GI +E +LE+P + E++ IK Sbjct: 418 KKTGTKHNAKEKEDETKSAR-RINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKL 476 Query: 517 NEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377 ++A AWMK+PE+LQN STG P ++ +QE V PLKLLHFYRA+ Sbjct: 477 SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 523 >ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508707512|gb|EOX99408.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 470 Score = 386 bits (992), Expect = e-104 Identities = 218/465 (46%), Positives = 283/465 (60%), Gaps = 34/465 (7%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAERRELG P+ +L++Q AR TL VR +GH +E+REDG FIFFC C APCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 S L DHLKG+LH R AAAK+TLLG+NPWPFNDGVLF E++K+L+G Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111 Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322 +GN N LE +++NL + + NC +++IPGVL KD IS L+ Sbjct: 112 --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169 Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142 VRFIGFG+IAAR E + +I+RIWC WLGK + ++D K H F +VTF YN Sbjct: 170 VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227 Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001 LGR+ LDD+ ++NG+ RK RKKSFSDPEDIS+SL Sbjct: 228 DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286 Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869 L +Y D+ S AI +A+ER+CDIC+ K+LPEKD Sbjct: 287 ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343 Query: 868 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689 V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E N N K S+RKN Sbjct: 344 VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403 Query: 688 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 554 +K +++ + + + T SSV+CPECQG+GI+VE +LE+P + Sbjct: 404 GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDV 448 >ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica] gi|462394196|gb|EMJ00100.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica] Length = 493 Score = 385 bits (989), Expect = e-104 Identities = 235/539 (43%), Positives = 296/539 (54%), Gaps = 49/539 (9%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA R ELGFPK +LR+Q R LR VR +GH VE+REDG FIFFC C APCYSD Sbjct: 1 MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523 LFDHLKGNLHK+R AAAK+TLL NPWPFNDGV F +N E DK L Sbjct: 61 KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120 Query: 1522 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLD------SHKCATVTIEKSLNGEN 1376 ++L +V G+N SNGN++V + + N +LD + K + + N N Sbjct: 121 SPDDENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEVN 180 Query: 1375 CNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHED 1196 ++VIP VL +D ++ +E + +G G+IAAR E ++ K I RIWC WLGK +E Sbjct: 181 SSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGK--KAIGNEY 238 Query: 1195 ASKLTGHDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPE 1019 K+ HDF +VTFSYN LGRR LDD+ +E +NGEG K RKKSFSDPE Sbjct: 239 HLKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSK-RKKSFSDPE 297 Query: 1018 DISKS---------------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASE 914 DIS+S L+L +Y D+ H R+A Sbjct: 298 DISESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALG 357 Query: 913 RICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTN 734 R+CDIC+ +++P KDVS L+N+KTGRLACSSRNVNGAFH+FHTSCLIHWILLC+ EI N Sbjct: 358 RMCDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEI-AN 416 Query: 733 QLDNLKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 554 Q N K S+RKN +K + + + Q SV CPECQG+G ++ LE+P + Sbjct: 417 QSTNSKVRRRSRRKNAAKCNG--QDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNL 474 Query: 553 PPFEIFNYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377 P QEKV PLKL+HFYRAD Sbjct: 475 P----------------------------------------LSQEKVKPLKLMHFYRAD 493 >ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] gi|145334149|ref|NP_001078455.1| uncharacterized protein [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1| putative protein [Arabidopsis thaliana] gi|110742700|dbj|BAE99261.1| hypothetical protein [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] gi|332660061|gb|AEE85461.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] Length = 516 Score = 382 bits (981), Expect = e-103 Identities = 221/522 (42%), Positives = 304/522 (58%), Gaps = 32/522 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAE++ELG PK + NL++Q AR TL+ +RL+GH +E+REDG F+FFC C APCYSD Sbjct: 1 MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCC--EEDKQLSGSSLIVVNS 1493 + L HL GNLHKER A A++TLLG+NPWPF+DGVLF + EE+K V ++ Sbjct: 60 TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119 Query: 1492 GKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRF 1313 ++ + ++ A+ + +N+ + A VT ++ + + +++I GVL K+ +E +F Sbjct: 120 LEHCSDDERFAIVKYDNNKTNGDNVPAAVTDDEPSHAAD-DLLISGVLIKERTLDVEAKF 178 Query: 1312 IGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLG 1133 IGFG IAAR+ E+ I+++WC WLG G PS E A+ + HDF IVTFSY Y LG Sbjct: 179 IGFGRIAARLFETKGRTTWIDKLWCEWLGDEG-PSDEEKAT-IPEHDFAIVTFSYFYNLG 236 Query: 1132 RRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQYG 983 R L D E NGE RK RKKSFSDPED S+SL G Sbjct: 237 RLGLLDDPGRLLTSSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHNS 295 Query: 982 DESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLNM 848 + SR I+ V SERIC++CK K+LP KD + +LNM Sbjct: 296 NSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILNM 355 Query: 847 KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEI 668 KTG LAC SRN+ GAFHLFH SC++HW L C+ EI N++ + KG + S ++ + Sbjct: 356 KTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKH--SGQTGV 413 Query: 667 LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 488 N+ Q SV CPECQG+GIN+E +E T P + + + +K +E AW+K+ Sbjct: 414 KWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGRKAWVKN 473 Query: 487 PELLQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 377 PE L+N STG FP +EET Q E+V +KL+ FYR + Sbjct: 474 PERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYRVE 515 >ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris] gi|561023122|gb|ESW21852.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris] Length = 498 Score = 379 bits (972), Expect = e-102 Identities = 215/517 (41%), Positives = 293/517 (56%), Gaps = 27/517 (5%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MA + ELG K V N ++Q AR L+ VR +GH VE+RE+G FI+FC C APCYSD Sbjct: 1 MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487 LFDHLKGNLHKER +AAK+TLLG PWPFNDG++F E D+ L + K Sbjct: 61 DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120 Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRFIG 1307 +N ++++A+ + + ++ C+T + + C +VIP +L +D I ++V +G Sbjct: 121 FNNNDNSLAIVKFDEGVQSNAEPCST----DGMPNDECGLVIPHLLIRDEIFDVKVSEVG 176 Query: 1306 FGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGRR 1127 G+IAAR E I RIWC WLGK G+ +D ++ HDF IV F+YNY LGR Sbjct: 177 LGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQ--QDGVEILEHDFAIVNFAYNYDLGRS 234 Query: 1126 -TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAISXXX 950 LDD+ + + G R+ K+S SD +DIS SL QY + + Sbjct: 235 GLLDDVKSL--------LPSASGGRKG--KRSLSDSDDISDSLC-NQYDSSAEESSDSNN 283 Query: 949 XXXXXXXR--------------------------VASERICDICKHKILPEKDVSTLLNM 848 +A+E++C+IC+ K+LP KDV+ LLN+ Sbjct: 284 SSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNL 343 Query: 847 KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEI 668 T R+ACSSRN GAFH+FHTSCLIHWI+LC+FEI TN L KRK S +I Sbjct: 344 NTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIASDGEKI 403 Query: 667 LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 488 K++ + +V CPECQG+G+ ++ +E+P ++F + IKA +A WMK Sbjct: 404 ---GKEKDIEKHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARREWMKS 460 Query: 487 PELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377 PE+LQN STG FP SEE +EKV P+ LLHFYRAD Sbjct: 461 PEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497 >ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum] gi|557114148|gb|ESQ54431.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum] Length = 514 Score = 374 bits (961), Expect = e-101 Identities = 217/525 (41%), Positives = 296/525 (56%), Gaps = 35/525 (6%) Frame = -1 Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667 MAE +ELG PK + +L++Q AR TLR +R +GH +E+REDG F+FFC C APCYSD Sbjct: 1 MAESKELGLPKTAI-SLKEQLARTTLRNLRSQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCC-EEDKQLSGSSLIVVNSG 1490 + L HL GNLHKER + A++TLLG NPWPFNDGVLF + EE+K L V Sbjct: 60 AILLGHLNGNLHKERLSCARITLLGENPWPFNDGVLFFDSSTGEEEKTLISDGEGVTGPL 119 Query: 1489 KNSNGNDNVALERVGDNENLDSH--KCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVR 1316 + + N+ A+ +N +S I+ N N+VI +L K+ +E + Sbjct: 120 HHCSDNERFAIVTYDENRTCESQGDNQPAAGIDDEPNHCAENLVISNLLIKEKTLDVEAK 179 Query: 1315 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTL 1136 FIGFG IAAR+ E+ I+++WC WLG+ P E+ + + HDF IVTFSY Y L Sbjct: 180 FIGFGRIAARLFETKGRTTWIDKLWCEWLGEESPPD--EEKATVPEHDFAIVTFSYFYNL 237 Query: 1135 GRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAIS 959 GR L D + E NGE RK RKKSFSDPED S+SL QY +S +S Sbjct: 238 GRLGLLADPSRLLTLSQSAESGNGEDNGRK-RKKSFSDPEDTSESLC-NQY--DSSEEVS 293 Query: 958 XXXXXXXXXXRVA----------------------------SERICDICKHKILPEKDVS 863 +A S+RIC++CK K+LP KD + Sbjct: 294 SARNSNSSRALIADYDDHLVNKRVIKNKSVRRELRKQQRIFSDRICEVCKQKMLPGKDAA 353 Query: 862 TLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVS 683 +LNMKTG+LACSSRN GAFHLFH SC++HW L C+ EI +++ + KG +K + Sbjct: 354 AILNMKTGKLACSSRNRLGAFHLFHVSCVVHWFLFCETEILGSKMVSGKG-----KKRCT 408 Query: 682 KRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACL 503 K+S + N+ Q SV CPECQG+GIN+E +E T P + + + +K +E Sbjct: 409 KQSGVKWNELVGDVSWQIFSVFCPECQGTGINIEGDVIERDTFPLSQTWRFGVKVSEGRK 468 Query: 502 AWMKDPELLQNRSTGLRFPYNSEETI---QEKVLPLKLLHFYRAD 377 AW+K+PE L+N STG FP EE + +++V +KL+ FYR + Sbjct: 469 AWVKNPEKLENCSTGFHFPQQDEELVKGQEDRVQSMKLVRFYRVE 513