BLASTX nr result
ID: Catharanthus23_contig00000111
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000111 (2768 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Score E
Sequences producing significant alignments: (bits) Value

ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr... 569 e-159
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600... 564 e-158
ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255... 558 e-156
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261... 548 e-153
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204... 543 e-151
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310... 537 e-150
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608... 535 e-149
gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao] 534 e-149
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm... 525 e-146
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] 524 e-146
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu... 511 e-142
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] 506 e-140
gb|EMJ00100.1| hypothetical protein PRUPE_ppa004741mg [Prunus pe... 488 e-135
gb|ESW21852.1| hypothetical protein PHAVU_005G104500g [Phaseolus... 478 e-132
ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807... 478 e-132
ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab... 474 e-130
ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779... 473 e-130
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ... 
470 e-129 gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theob... 469 e-129 gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao] 468 e-129 >ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910083|ref|XP_006447355.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910085|ref|XP_006447356.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|567910087|ref|XP_006447357.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|568831767|ref|XP_006470130.1| PREDICTED: uncharacterized protein LOC102608093 isoform X1 [Citrus sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED: uncharacterized protein LOC102608093 isoform X2 [Citrus sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED: uncharacterized protein LOC102608093 isoform X3 [Citrus sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED: uncharacterized protein LOC102608093 isoform X4 [Citrus sinensis] gi|557549965|gb|ESR60594.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549966|gb|ESR60595.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549967|gb|ESR60596.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] gi|557549968|gb|ESR60597.1| hypothetical protein CICLE_v10014904mg [Citrus clementina] Length = 523 Score = 569 bits (1466), Expect = e-159 Identities = 294/528 (55%), Positives = 360/528 (68%), Gaps = 8/528 (1%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA RRELGFPK SL+EQ+AR TL NVR QGHTYV+LR+DGKR +FFCTLCLAPCYSD Sbjct: 1 MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLHTERL+ A+ TLL PNPWPFNDG++FF++ +++K V++ LD Sbjct: 61 LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120 Query: 1279 TQSNLENPLAIVSWQKNL--------GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIP 1434 +N N LAIV + +++ G DE H N + ++ + + D+V IP Sbjct: 121 
YHNNDSN-LAIVKYGEDMKVNGNEHSGLDEVHFDCE-NGTQVRDIYSESCDKV-----IP 173 Query: 1435 GVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHD 1614 GV K+E+ L V ++G+ QIAAR ++KD + RIWCEW G+ D +E + +HD Sbjct: 174 GVFLKDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHD 233 Query: 1615 FAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQ 1794 FA+VTF YNY+LGRKGL DD+K LL SSP +SE+ G+ +++KSFSDPEDVSE + Q Sbjct: 234 FAIVTFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ 293 Query: 1795 YDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQK 1974 YD +++LLD Y DQLLHAR + SK RRE+RRQQ +AAERMCDICQQK Sbjct: 294 YDSCGEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQK 353 Query: 1975 MLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXX 2154 +LP KDVAALLN KTG LACSSRNL G FHVFH+SCLIHWILLCE E+ Q P Sbjct: 354 ILPDKDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKR 413 Query: 2155 XXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKI 2334 + E QI S FCPECQGTG+NIE DELEKPT+ LS++FKYKI Sbjct: 414 RSRRKNGSKRVQARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKI 473 Query: 2335 KANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRA 2478 K +DA KAW K+PE LQNCS GFYFP +SE QEKVSPLKLL FY A Sbjct: 474 KVSDARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSA 521 >ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum] Length = 521 Score = 564 bits (1454), Expect = e-158 Identities = 294/521 (56%), Positives = 364/521 (69%), Gaps = 4/521 (0%) Frame = +1 Query: 931 RELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLF 1110 R+L FP+ +LKEQ+ R+TL+NVR QGH YV+LR+DGKR VFFCTLC +PCYSDSVLF Sbjct: 4 RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63 Query: 1111 KHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILDTQSN 1290 HL GNLHTE LA A+ATLLKPNPWPFNDG++FFND P++DK P + + ++DT Sbjct: 64 NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFND-PEQDKHSPNVNVGKSRLVDTCLE 122 Query: 1291 LENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSSLE 1470 E+ LAIV NL + + +L 
N + +LVIPGVL K+E+S LE Sbjct: 123 DESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGE--SEYLVIPGVLCKDELSDLE 180 Query: 1471 VTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNYNL 1650 V ++G+ +IAAR + RIWCEW + DS + + VV +HDFAVVTF YNYNL Sbjct: 181 VKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDMDTSVVPDHDFAVVTFPYNYNL 240 Query: 1651 GRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXXXX 1830 GRK L+DD ++LLPSSP+SESE+ SG+R RKRKSFSDPED SE + N D Sbjct: 241 GRKPLLDD-RFLLPSSPYSESEETSGTRKRKRKSFSDPEDFSESLSNHCDSSGEESQSTN 299 Query: 1831 XLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAALLN 2010 N K++L DDQL+ +R++ SKTMRRELR+QQ VA+ERMCDICQQKMLPGKDVA LL+ Sbjct: 300 NSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATLLS 359 Query: 2011 RKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAP----XXXXXXXXXXXX 2178 K+G+L CSSRN+TGAFH+FHVSCLIHWIL CEL+ Y K +D P Sbjct: 360 WKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSKRKTGT 419 Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358 +EIK+ R+I S FCPECQGTGI IE DELEKP V LSE++++KIK +DA KA Sbjct: 420 KHNAKEKEDEIKSARRINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKLSDARKA 479 Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481 W K+PE+LQNCS GF PP+ + + QE VSPLKLL FYRAN Sbjct: 480 WMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520 >ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera] Length = 520 Score = 558 bits (1437), Expect = e-156 Identities = 292/528 (55%), Positives = 358/528 (67%), Gaps = 6/528 (1%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA R ELGF K SL+EQ AR TLRNVR+QGH YV+LR+DGKR +FFCTLCLAPCYS+ Sbjct: 1 MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 SVL+ HL GNLH+ER A A+ TLLK +PWPFNDG++FF++ + DK L + + + +L Sbjct: 61 SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDEN-----HSQISLNRGVLHEVPNVNEDRVGYHLVIPGVL 1443 T 
N +N LAIV +L N HS + + V ++N ++IPGV+ Sbjct: 121 THKN-DNNLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVM 179 Query: 1444 WKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAV 1623 K+EV+ LEV ++G QIAARF EKDG +IWCEWFG+ + G+ + +V +HDFAV Sbjct: 180 IKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAV 239 Query: 1624 VTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDX 1803 VTF+Y+YNLGRKGL DD+ +L SSP GS +++KSFSDPED+SE + NQYD Sbjct: 240 VTFNYHYNLGRKGLFDDVISMLSSSP------TEGSGRKRKKSFSDPEDISESLSNQYDS 293 Query: 1804 XXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLP 1983 + ++LLD YDDQLL R + SKT+RRELRRQQ VAAERMCDICQ KMLP Sbjct: 294 SGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLP 353 Query: 1984 GKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXX 2163 GKDVA L+N KTG+L CSSRN+ GAFHVFH SCLIHWILLCE EI+ QL P Sbjct: 354 GKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSR 413 Query: 2164 XXXXXXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKA 2340 + IK QICS FCPECQGTGI IE DELE P +PLSE+FKYKIK Sbjct: 414 RKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIE-DELEIPNIPLSEMFKYKIKV 472 Query: 2341 NDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 +DA +AW K+PE L++CS GF FP QS QEKVS LKLL FY A+E Sbjct: 473 SDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520 >ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum lycopersicum] Length = 526 Score = 548 bits (1413), Expect = e-153 Identities = 287/525 (54%), Positives = 360/525 (68%), Gaps = 6/525 (1%) Frame = +1 Query: 931 RELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLF 1110 ++L P+ +LKEQ+ R+TL+NVR QGH YV+LR+DGKR +FFCTLC +PCYSDSVLF Sbjct: 4 KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63 Query: 1111 KHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDE--DKSLPVTSADQIPILDTQ 1284 HL GNLHTE LA A+ATLLKPNPWPFNDG++FFND + DK P + + ++DT Sbjct: 64 NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFNDPEQDKQDKQSPNVNVGKSRLVDTC 123 Query: 1285 
SNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSS 1464 E+ +AIV + NL +E+ G+L NE+ +LVIPGVL K+E+S Sbjct: 124 LEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEE--SDYLVIPGVLCKDELSD 181 Query: 1465 LEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNY 1644 LEV ++G+ +IAAR + RIWCEW + DS + + VV +HDFAVVTF YNY Sbjct: 182 LEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDMDTSVVPDHDFAVVTFPYNY 241 Query: 1645 NLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXX 1824 NLGR L+DD ++LLPSSP+SESE+ S + RKRKSFSDPED SE + N D Sbjct: 242 NLGRSPLLDD-RFLLPSSPYSESEETSVTGKRKRKSFSDPEDFSESLSNHCDSSGEESQS 300 Query: 1825 XXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAAL 2004 N K++L DDQL+ +R++ SKTMRRELR+QQ VA+ERMCDICQQKMLPGKDVA L Sbjct: 301 TNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATL 360 Query: 2005 LNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLD----APXXXXXXXXXX 2172 L+ K+G+L CSSRN++GAFH+FHVSCLIHWIL CEL+ K +D P Sbjct: 361 LSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSKKKT 420 Query: 2173 XXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352 +E K+ R+I S FCPECQGTGI IE DELEKP V LSE+++ KIK +DA Sbjct: 421 GTKHNAKEKEDETKSARRINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKLSDAR 480 Query: 2353 KAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANES 2487 KAW K+PE+LQNCS GF PP+ + + QE VSPLKLL FYRAN S Sbjct: 481 KAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRANVS 525 >ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus] gi|449475785|ref|XP_004154550.1| PREDICTED: uncharacterized LOC101204451 [Cucumis sativus] Length = 525 Score = 543 bits (1399), Expect = e-151 Identities = 282/524 (53%), Positives = 353/524 (67%), Gaps = 4/524 (0%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA R ELGFPK SL+EQ AR LRNVR QGHTYV+LR++GK+ +FFCTLCLAPCYSD Sbjct: 1 MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 SVLF HL G LHTERL+ A+ TLL 
PNPWPF+DG++FF+ + D + +++ + +L+ Sbjct: 61 SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120 Query: 1279 TQSNLENPLAIVSWQKNL-GSDENHSQISLNRGVLHEVP--NVNEDRVGYHLVIPGVLWK 1449 +N +N LAIV + N G+ + + N + + N+N+ LVIPGVL K Sbjct: 121 YNNN-DNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDCSFENLNDGGESCPLVIPGVLIK 179 Query: 1450 NEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVT 1629 E+S ++V +G QIAARF EKDG RIWCEW G+++ G E V EH++A++T Sbjct: 180 EEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGIENMVKVPEHNYAIIT 239 Query: 1630 FSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXX 1809 F+YN +LGRKGL+DD+K LL SSP +ES+++ + +++KSFSDPED S M QYD Sbjct: 240 FTYNVDLGRKGLLDDVKLLLSSSPGAESQNDENRQVKRKKSFSDPEDGSLSMSPQYDSSG 299 Query: 1810 XXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGK 1989 + + + LDGYDDQ+L V+ +K +RRELRRQQ +AAERMCDICQQK+L K Sbjct: 300 EDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQKILTHK 359 Query: 1990 DVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXX 2169 DVA LLN KTGRLACSSRN+ G FHVFH SCLIHWILLCE EI K L Sbjct: 360 DVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVRRRYRRK 419 Query: 2170 XXXXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKAND 2346 + E + QI S FCP CQGTGI I+ D+LEKPTVPLSEIFKYKIK +D Sbjct: 420 KKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKYKIKVSD 479 Query: 2347 ACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRA 2478 A +AW KSPE+LQNCS GF FP Q + QE V PLKLL FY A Sbjct: 480 ARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523 >ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca subsp. 
vesca] Length = 525 Score = 537 bits (1384), Expect = e-150 Identities = 278/533 (52%), Positives = 364/533 (68%), Gaps = 11/533 (2%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA R ++G PK CSL+EQ R LRNVR QGH+YV++R+DGK+ +FFCTLCLAPCYSD Sbjct: 1 MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLH ERLA A+ TLL+PNPWPFNDG++FFN+ + DK + ++ +L+ Sbjct: 61 KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120 Query: 1279 TQSNLENPLAIVSWQKNLGSD----------ENHSQISLNRGVLHEVPNVNEDRVGYHLV 1428 + N EN LAIV + NL ++ E + I L +G+ V + D +V Sbjct: 121 SHDN-ENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDL-QGLQSNVGDSTADGAKSSVV 178 Query: 1429 IPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLE 1608 IPG++ ++E++ LEV VG+ +IAARF+ KDG RIWCEW G +E V E Sbjct: 179 IPGIVVRDEITDLEVREVGLGEIAARFLGKDGI----GRIWCEWLGVKSIDSEDLCNVPE 234 Query: 1609 HDFAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMG 1788 HDFAVVTFSYN +LGRKGL+DD++ LL SSP ES + G+ +++KSFSDPED+S+ + Sbjct: 235 HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCKRKKSFSDPEDISDSLS 294 Query: 1789 NQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQ 1968 NQY+ +++LLD YDDQLL+ R + +K++RRELRRQQ +A+ RMCDICQ Sbjct: 295 NQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDICQ 354 Query: 1969 QKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXX 2148 Q+MLPGKDVA L+N KTG+LACSSRN+ GAFHVFH SCLIHWILLCE+E+ Q Sbjct: 355 QRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQ--NTGS 412 Query: 2149 XXXXXXXXXXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFK 2325 + ++K+ + QI S FCPECQGTGI ++ D+LEKP +PLS++F+ Sbjct: 413 KARRRSRRKTAAKCNGKDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMFR 472 Query: 2326 YKIKANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 YKIK +DA +AW KSPE+LQNCS GF+FP + QEKV LKLL FYRA+E Sbjct: 473 YKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525 >ref|XP_006470134.1| PREDICTED: 
uncharacterized protein LOC102608093 isoform X5 [Citrus sinensis] Length = 508 Score = 535 bits (1378), Expect = e-149 Identities = 275/502 (54%), Positives = 340/502 (67%), Gaps = 8/502 (1%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA RRELGFPK SL+EQ+AR TL NVR QGHTYV+LR+DGKR +FFCTLCLAPCYSD Sbjct: 1 MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLHTERL+ A+ TLL PNPWPFNDG++FF++ +++K V++ LD Sbjct: 61 LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120 Query: 1279 TQSNLENPLAIVSWQKNL--------GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIP 1434 +N N LAIV + +++ G DE H N + ++ + + D+V IP Sbjct: 121 YHNNDSN-LAIVKYGEDMKVNGNEHSGLDEVHFDCE-NGTQVRDIYSESCDKV-----IP 173 Query: 1435 GVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHD 1614 GV K+E+ L V ++G+ QIAAR ++KD + RIWCEW G+ D +E + +HD Sbjct: 174 GVFLKDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHD 233 Query: 1615 FAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQ 1794 FA+VTF YNY+LGRKGL DD+K LL SSP +SE+ G+ +++KSFSDPEDVSE + Q Sbjct: 234 FAIVTFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ 293 Query: 1795 YDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQK 1974 YD +++LLD Y DQLLHAR + SK RRE+RRQQ +AAERMCDICQQK Sbjct: 294 YDSCGEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQK 353 Query: 1975 MLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXX 2154 +LP KDVAALLN KTG LACSSRNL G FHVFH+SCLIHWILLCE E+ Q P Sbjct: 354 ILPDKDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKR 413 Query: 2155 XXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKI 2334 + E QI S FCPECQGTG+NIE DELEKPT+ LS++FKYKI Sbjct: 414 RSRRKNGSKRVQARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKI 473 Query: 2335 KANDACKAWFKSPELLQNCSLG 2400 K +DA KAW K+PE LQNCS G Sbjct: 474 KVSDARKAWMKNPEALQNCSTG 495 >gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma 
cacao] Length = 517 Score = 534 bits (1376), Expect = e-149 Identities = 279/523 (53%), Positives = 350/523 (66%), Gaps = 1/523 (0%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MAERRELG P+ CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 SVL HL G+LH+ RLA A+ TLL NPWPFNDG++FF L +++K L +Q +L+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 +N +N LAIV + S++S R NVN L+IPGVL K+E+ Sbjct: 121 FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 S L+V ++G +IAARF EKDG L++ RIWCEW G+ N+ +H FAVVTF Y Sbjct: 166 SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 N +LGRKGL+DD+K LL S + E+ + +++KSFSDPED+SE + NQYD Sbjct: 226 NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 ++++ LD YDDQLL R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA Sbjct: 286 SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345 Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 L+N TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E P Sbjct: 346 TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405 Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACK 2355 + E KA I S CPECQGTGI++E DELEKP V LS++F+YKIK +DA + Sbjct: 406 KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDARR 465 Query: 2356 AWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 AW KSPE+L+NCS GF+F QS + QEK+ PLKLL FY A++ Sbjct: 466 AWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADK 508 >ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis] 
gi|223542914|gb|EEF44450.1| conserved hypothetical protein [Ricinus communis] Length = 509 Score = 525 bits (1352), Expect = e-146 Identities = 278/525 (52%), Positives = 352/525 (67%), Gaps = 3/525 (0%) Frame = +1 Query: 919 MAERRELGFPKHGVC-SLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYS 1095 MA R ELGF K G SLKEQ+AR TL NVR +GH YV+LR+DGKR +FFCTLCLAPCYS Sbjct: 1 MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60 Query: 1096 DSVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPIL 1275 D+VLF HL GNLHTERL+ A TLLK NPWPF+DG+ FF+ + +K L + + ++ Sbjct: 61 DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQLVIKNDNE---- 116 Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455 ++ N + LAIV + +L N+ + N++ L+I GVL K++ Sbjct: 117 -SRGNGNSSLAIVKYGGSL-KPTGDEDTGCNK-------DANDNGRISDLLIQGVLVKDD 167 Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635 +S L+ ++G +I AR +EKDG +D RIWCEW G+ + VL+H+FAVVTF+ Sbjct: 168 ISDLQARFMGYGRIGARLIEKDGNSNDISRIWCEWLGKNTPCDLDKAKVLDHEFAVVTFA 227 Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYD-XXXX 1812 YNY+LGRKGL+DD+K LL SSP ES++ G+ +++KSFSDPEDVSE NQYD Sbjct: 228 YNYDLGRKGLLDDVKLLLSSSPVQESDNQGGTNRKRKKSFSDPEDVSESFSNQYDSSGEE 287 Query: 1813 XXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKD 1992 ++LLD +DDQ LH++V+ SKT+RRELRRQ +AAERMCDICQQK+LP KD Sbjct: 288 SLTSIGGPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKD 347 Query: 1993 VAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXX 2172 VA L+N TG+LACSSRN G +HVFH SCLIHWILL E E+ Q +P Sbjct: 348 VATLVNMNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKN 407 Query: 2173 XXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDA 2349 E++KA N QI S FCPECQGTG +E DE E PT+PLSE+FKYKIK D Sbjct: 408 GTKSSHV---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDG 464 Query: 2350 CKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 +AW KSPE+L+NCS+GF+FP QSE Q KV PLKLL FYRA+E Sbjct: 465 
RRAWMKSPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509 >emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera] Length = 896 Score = 524 bits (1350), Expect = e-146 Identities = 271/492 (55%), Positives = 335/492 (68%), Gaps = 6/492 (1%) Frame = +1 Query: 964 SLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLFKHLHGNLHTER 1143 SL+EQ AR TLRNVR+QGH YV+LR+DGKR +FFCTLCLAPCYS+SVL+ HL GNLH+ER Sbjct: 352 SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411 Query: 1144 LAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILDTQSNLENPLAIVSWQ 1323 A A+ TLLK +PWPFNDG++FF++ + DK L + + + +L T N +N LAIV Sbjct: 412 YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKN-DNNLAIVCHG 470 Query: 1324 KNLGSDEN-----HSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSSLEVTYVGV 1488 +L N HS + + V ++N ++IPGV+ K+EV+ LEV ++G Sbjct: 471 DDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFLGF 530 Query: 1489 AQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNYNLGRKGLI 1668 QIAARF EKDG +IWCEWFG+ + G+ + +V +HDFAVVTF+Y+YNLGRKGL Sbjct: 531 GQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAVVTFNYHYNLGRKGLF 590 Query: 1669 DDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXXXXXLNAKV 1848 DD+ +L SSP GS +++KSFSDPED+SE + NQYD + ++ Sbjct: 591 DDVISMLSSSP------TEGSGRKRKKSFSDPEDISESLSNQYDSSGEDSLISNSPSPRL 644 Query: 1849 LLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAALLNRKTGRL 2028 LLD YDDQLL R + SKT+RRELRRQQ VAAERMCDICQ KMLPGKDVA L N KTG+L Sbjct: 645 LLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNMKTGKL 704 Query: 2029 ACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXXXXXXXXXNEE 2208 CSSRN+ GAFHVFH SCLIHWILLCE EI+ QL P + Sbjct: 705 VCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNGKGKDGV 764 Query: 2209 IK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKAWFKSPELLQ 2385 IK QICS FCPECQGTGI IE DELE P +PLSE+FKYKIK +DA +AW K+PE L+ Sbjct: 765 IKPTTLQICSVFCPECQGTGIMIE-DELEIPNIPLSEMFKYKIKVSDAHRAWMKNPEELK 823 Query: 2386 NCSLGFYFPPQS 2421 +CS GF FP QS Sbjct: 824 HCSTGFNFPSQS 835 >ref|XP_002319898.2| hypothetical 
protein POPTR_0013s13670g [Populus trichocarpa] gi|550325787|gb|EEE95821.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa] Length = 513 Score = 511 bits (1315), Expect = e-142 Identities = 267/526 (50%), Positives = 341/526 (64%), Gaps = 4/526 (0%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA RE+GFPK SL+EQ+AR TL VR +GH Y++LR+DGKR +FFCTLCL+PCYSD Sbjct: 1 MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIP-IL 1275 ++L HL GNLHTERL+ A+ATLLKPNPWPF+DG+ FF+ ++ L + + L Sbjct: 61 TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLAIKDGKESSRFL 120 Query: 1276 DTQSNLENPLAIVSWQKNL--GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWK 1449 + N +N LAIV + +NL G D V+ E N++ G LVIP V K Sbjct: 121 KFEENSDN-LAIVKYVENLKPGCDT----------VVDE--NLSGSDEGSDLVIPSVRLK 167 Query: 1450 NEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVT 1629 EVS L+ T VG QIAAR EK ++ RIWCEW G+ S +E VL+HDF VVT Sbjct: 168 EEVSDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGKKSSNDEDKVKVLDHDFGVVT 227 Query: 1630 FSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXX 1809 F+Y+Y LG+ GL DD+K LL SS + +E++ ++++S S+PEDVS + NQY Sbjct: 228 FAYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCE 287 Query: 1810 XXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGK 1989 ++ ++LD YDDQL+H R + +KT+RRE+R+QQ +AAE+MCDICQQKMLP K Sbjct: 288 EESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEK 347 Query: 1990 DVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXX 2169 DVA L NRKTG+LACSSRN+ GAFHVFH SCLIHWIL CE EI Q + Sbjct: 348 DVATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKK 407 Query: 2170 XXXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKAND 2346 + + I S FCP+CQGTG+NIE DE EKP PLSE+FKYKIK ++ Sbjct: 408 NGTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSE 467 Query: 2347 ACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 + W K+PE+L+NCS GF+FP QS QEKV PLKLL FYR E 
Sbjct: 468 GHRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513 >gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis] Length = 638 Score = 506 bits (1303), Expect = e-140 Identities = 270/516 (52%), Positives = 341/516 (66%), Gaps = 14/516 (2%) Frame = +1 Query: 919 MAERRELGFPKHGV--------CSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTL 1074 MA R LGFPK CSLK+Q R LRNVR QGHTYV+LR+DGK+ +FFCTL Sbjct: 1 MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60 Query: 1075 CLAPCYSDSVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTS 1254 CLAPCYSD VLF HL GNLH +RL+ A+ TLL PNPWPFNDG++FFN+ + D +++ Sbjct: 61 CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120 Query: 1255 ADQIPILDTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVG----YH 1422 +Q +L++Q + EN LAIV++ +NL S N + G +E P+ + G Sbjct: 121 GNQSRLLESQDS-ENNLAIVTYGENLESCANGHIMVDELGHQNENPDSAGNLAGSGENCA 179 Query: 1423 LVIPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVV 1602 ++IPGV +E++++EV VG I+ RF EKDG +D RIWCEW G+ +E V Sbjct: 180 VLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIEDEDFLKV 239 Query: 1603 LEHDFAVVTFSY-NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSE 1779 EHDFA+VTFSY N++LGR GL DD+K LL SSP +E ++ S ++RKSFSDPED SE Sbjct: 240 PEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRKRRKSFSDPEDSSE 299 Query: 1780 FMGNQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCD 1959 + NQYD ++LD YDDQLL R + +K +RRELRRQQ +AAERMCD Sbjct: 300 NLSNQYDSCGEDSSASAV--TSLMLDQYDDQLLQTRFISNKAIRRELRRQQRIAAERMCD 357 Query: 1960 ICQQKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDA 2139 ICQ KMLPGKDVA L+N KTGRLACSSRN GAFH+FH SCLIHW+LLCE+E Q +A Sbjct: 358 ICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEVEKCTNQSEA 417 Query: 2140 PXXXXXXXXXXXXXXXXXXXNEEIKANR-QICSAFCPECQGTGINIESDELEKPTVPLSE 2316 P + E+KA R I CPECQGTG I+ ++ EKPTVPLS+ Sbjct: 418 PKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMIDGED-EKPTVPLSK 476 Query: 2317 IFKYKIKANDACKAWFKSPELLQNCSLGFYFPPQSE 2424 +FKYKIK +DA +AW KSPE+L NCS GF+FP +E Sbjct: 477 
MFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAE 512 >gb|EMJ00100.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica] Length = 493 Score = 488 bits (1256), Expect = e-135 Identities = 263/536 (49%), Positives = 333/536 (62%), Gaps = 15/536 (2%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA R ELGFPK SL+EQ R LRNVR QGHTYV+LR+DGK+ +FFCTLCLAPCYSD Sbjct: 1 MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLH +RLA A+ TLL+PNPWPFNDG+ FF++ + DK L +T ++ +L+ Sbjct: 61 KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHS---------------QISLNRGVLHEVPNVNEDRV 1413 + + EN LAIV + +NL S+ N ++ N N + V Sbjct: 121 SPDD-ENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEV 179 Query: 1414 GYHLVIPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGS 1593 +VIP VL +++V+ +E VG+ QIAARF+EKD RIWCEW G+ GNE Sbjct: 180 NSSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGKKAIGNEYH 239 Query: 1594 HVVLEHDFAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDV 1773 V EHDFAVVTFSYN +LGR+GL+DD+K LL SSP E+E+ GS ++++KSFSDPED+ Sbjct: 240 LKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSKRKKSFSDPEDI 299 Query: 1774 SEFMGNQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERM 1953 SE + NQYD ++K+LLD YDDQLLH R + +K++RRELRRQQ +A RM Sbjct: 300 SESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALGRM 359 Query: 1954 CDICQQKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQL 2133 CDICQQ+M+PGKDV+AL+N KTGRLACSSRN+ GAFHVFH SCLIHWILLCE+EI + Sbjct: 360 CDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEIANQST 419 Query: 2134 DAPXXXXXXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLS 2313 ++ + + QI S FCPECQGTG I+ D+LEKP +PL Sbjct: 420 NS--KVRRRSRRKNAAKCNGQDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNLPL- 476 Query: 2314 EIFKYKIKANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481 SQEKV PLKL+ FYRA+ Sbjct: 477 
---------------------------------------SQEKVKPLKLMHFYRAD 493 >gb|ESW21852.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris] Length = 498 Score = 478 bits (1231), Expect = e-132 Identities = 255/521 (48%), Positives = 339/521 (65%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA + ELG K V + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD Sbjct: 1 MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+ + D+ L V + + +L Sbjct: 61 DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 +N +N LAIV + + + S+ +PN D G LVIP +L ++E+ Sbjct: 121 FNNN-DNSLAIVKFDEGVQSNAEPCSTD-------GMPN---DECG--LVIPHLLIRDEI 167 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 ++V+ VG+ +IAARF+EK L RIWCEW G+ + + +LEHDFA+V F+Y Sbjct: 168 FDVKVSEVGLGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQQDGVEILEHDFAIVNFAY 227 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 NY+LGR GL+DD+K LLPS+ SG R KR S SD +D+S+ + NQYD Sbjct: 228 NYDLGRSGLLDDVKSLLPSA--------SGGRKGKR-SLSDSDDISDSLCNQYDSSAEES 278 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 +A + LD +++ + R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA Sbjct: 279 SDSNNSSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVA 338 Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 ALLN T R+ACSSRN TGAFHVFH SCLIHWI+LCE EI L P Sbjct: 339 ALLNLNTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIAS 398 Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358 ++I+ + I + FCPECQGTG+ I+ D +E+P LS++FK+KIKA DA + Sbjct: 399 DGEKIGKEKDIE--KHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARRE 456 Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481 W KSPE+LQNCS GF+FP QSE I +EKV P+ LL FYRA+ Sbjct: 457 
WMKSPEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497 >ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807746 [Glycine max] Length = 500 Score = 478 bits (1230), Expect = e-132 Identities = 254/522 (48%), Positives = 337/522 (64%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA + ELG PK + + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD Sbjct: 1 MAGKLELGPPKSDISNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+ + DK L V + + +L Sbjct: 61 DVLFDHLKGNLHRERLSAAKVTLLGPKPWPFNDGLVFFDTSTESDKELEVADSYRNRLLK 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 + ++ LAIV + + + S+ I + +D LVIP +L +E+ Sbjct: 121 FNDD-DSSLAIVKFGEGVQSNAKPCSIE----------GMQDDECA--LVIPNLLIGDEI 167 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 L+V VG+ +IAARF+EK L+ RIWCEW G+ +G VLEHDFAVV F+Y Sbjct: 168 FDLKVKEVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAY 227 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 NY+LGR GL+DD+K LLP S + + + S SD +DVS+F+ NQYD Sbjct: 228 NYDLGRSGLLDDVKTLLPVS----------AGQKGKTSLSDSDDVSDFLCNQYDSSAEES 277 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 ++++ LD +++ L R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA Sbjct: 278 SDSNNSSSRLTLDQFNNHLC-TRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVA 336 Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 ALLN KT R+ACSSRN TGAFHVFH SCLIHWI+LCE EI L P Sbjct: 337 ALLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIIINHLVRPNIRRVVKRKVAS 396 Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358 ++I + I + FCPECQGTG+ I+ D +E+P LS++FK+KIKA DA + Sbjct: 397 DGDKMGKEKDI--GKHIRTVFCPECQGTGMIIDGDGVEQPEFSLSQMFKFKIKACDARRD 454 Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 W KSPE+LQNCS GF+FP QSE I +EKV P+ LL FYRA++ Sbjct: 455 
WIKSPEVLQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRADD 496 >ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata] Length = 517 Score = 474 bits (1219), Expect = e-130 Identities = 245/525 (46%), Positives = 334/525 (63%), Gaps = 6/525 (1%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MAE++ELG PK + +LKEQ+AR TL+N+RLQGHTY++LR+DGKR VFFCTLCLAPCYSD Sbjct: 1 MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLP-DEDKSLPVTSADQIPIL 1275 ++L HL+GNLH ERLA AR TLL NPWPF+DG++FF+ +E++ PV+ +P Sbjct: 60 TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119 Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455 + ++ AIV + N + N V + P+ + D L+I GVL K Sbjct: 120 LGHCSDDDRFAIVKYDNNKANGGNQPA-----AVTDDEPSHSTD----DLLISGVLIKER 170 Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635 +E ++G +IAAR E G ++WCEW G +E + EHDFA+VTFS Sbjct: 171 TLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSDEEKATIPEHDFAIVTFS 230 Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXX 1815 Y YNLGR GL+DD LL +S SES + S +++KSFSDPED SE + NQYD Sbjct: 231 YFYNLGRLGLLDDPSRLLTTS-QSESGNGEDSGRKRKKSFSDPEDTSESLCNQYDSSEEV 289 Query: 1816 XXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDV 1995 +++ L+ YDD L+ RV+K+KT+RRELRRQQ + +ER+C++C+QKMLPGKD Sbjct: 290 SSGHNSNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDA 349 Query: 1996 AALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXX 2175 AA+LN KTG LAC SRNL GAFH+FHVSC++HW L CE EI ++ + Sbjct: 350 AAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKHSS 409 Query: 2176 XXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACK 2355 + QI S FCPECQGTGINIE +E+ T PLS+ +++++K ++ K Sbjct: 410 GQTGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRK 469 Query: 
2356 AWFKSPELLQNCSLGFYFPPQSE-----IISQEKVSPLKLLPFYR 2475 AW K+PE L+NCS GF+FP Q++ + +E+V +KL+ FYR Sbjct: 470 AWVKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYR 514 >ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779572 isoform X1 [Glycine max] gi|571494415|ref|XP_006592839.1| PREDICTED: uncharacterized protein LOC100779572 isoform X2 [Glycine max] Length = 501 Score = 473 bits (1218), Expect = e-130 Identities = 256/522 (49%), Positives = 335/522 (64%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MA + ELG PK V + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD Sbjct: 1 MAGKLELGPPKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+ + K L V + Q +L Sbjct: 61 DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSTESHKELEVADSYQNRLLK 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 N + LAIV + + S+ I + +D Y LVIP +L +E+ Sbjct: 121 FNDN-DVSLAIVKFGDGVQSNAKPRSID----------GMQDDE--YALVIPNLLIGDEI 167 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 ++V VG+ +IAARF+EK L+ RIWCEW G+ +G VLEHDFAVV F+Y Sbjct: 168 FDVKVREVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAY 227 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 NY+LGR GL+DD+ LLPS+ SG + K S SD +DVS+ + NQYD Sbjct: 228 NYDLGRSGLLDDVNTLLPSA--------SGGQKGK-SSLSDFDDVSDSVCNQYDSSAEES 278 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 ++++ LD +++ L R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA Sbjct: 279 SDSNNSSSRLTLDQFNNHLC-TRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVA 337 Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 ALLN KT R+ACSSRN TGAFHVFH SCLIHWI+LCE EI L P Sbjct: 338 ALLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIITNHLVCPNVRRVVKRKVAS 397 Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358 ++I + I + FCPECQGTG+ I+ D +E+P 
LS++FK+KIKA DA + Sbjct: 398 DGNKIGKEKDI--GKHIRTVFCPECQGTGMIIDGDGVEQPEFSLSQMFKFKIKACDARRD 455 Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484 W KSPE+L+NCS GF+FP QSE I +EKV P+ LL FYRA++ Sbjct: 456 WIKSPEVLKNCSTGFHFPSQSEEIFEEKVEPINLLHFYRADD 497 >ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] gi|145334149|ref|NP_001078455.1| uncharacterized protein [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1| putative protein [Arabidopsis thaliana] gi|110742700|dbj|BAE99261.1| hypothetical protein [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] gi|332660061|gb|AEE85461.1| uncharacterized protein AT4G28260 [Arabidopsis thaliana] Length = 516 Score = 470 bits (1209), Expect = e-129 Identities = 248/526 (47%), Positives = 337/526 (64%), Gaps = 7/526 (1%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MAE++ELG PK + +LKEQ+AR TL+N+RLQGHTY++LR+DGKR VFFCTLCLAPCYSD Sbjct: 1 MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLP-DEDKSLPVTSADQIPIL 1275 ++L HL+GNLH ERLA AR TLL NPWPF+DG++FF+ +E++ PV+ + +P Sbjct: 60 TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119 Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455 + + AIV + N + +N + V + P+ D L+I GVL K Sbjct: 120 LEHCSDDERFAIVKYDNNKTNGDN-----VPAAVTDDEPSHAAD----DLLISGVLIKER 170 Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635 +E ++G +IAAR E G ++WCEW G +E + EHDFA+VTFS Sbjct: 171 TLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSDEEKATIPEHDFAIVTFS 230 Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXX 1815 Y YNLGR GL+DD LL SS SES + S +++KSFSDPED SE + NQYD Sbjct: 231 YFYNLGRLGLLDDPGRLLTSS-QSESGNGEDSGRKRKKSFSDPEDTSESLCNQYDSSEEV 289 Query: 1816 XXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDV 1995 +++ L+ YDD L+ RV+K++T+RRELRRQQ + +ER+C++C+QKMLPGKD Sbjct: 290 
SSGHNSNSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDA 349 Query: 1996 AALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXX 2175 AA+LN KTG LAC SRNL GAFH+FHVSC++HW L CE EI ++ + Sbjct: 350 AAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVS--GKGKKRCTKH 407 Query: 2176 XXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352 NE + QI S FCPECQGTGINIE +E+ T PLS+ +++++K ++ Sbjct: 408 SGQTGVKWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGR 467 Query: 2353 KAWFKSPELLQNCSLGFYFPPQSE-----IISQEKVSPLKLLPFYR 2475 KAW K+PE L+NCS GF+FP Q+E + +E+V +KL+ FYR Sbjct: 468 KAWVKNPERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYR 513 >gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 481 Score = 469 bits (1207), Expect = e-129 Identities = 252/496 (50%), Positives = 315/496 (63%), Gaps = 20/496 (4%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MAERRELG P+ CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 SVL HL G+LH+ RLA A+ TLL NPWPFNDG++FF L +++K L +Q +L+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 +N +N LAIV + S++S R NVN L+IPGVL K+E+ Sbjct: 121 FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 S L+V ++G +IAARF EKDG L++ RIWCEW G+ N+ +H FAVVTF Y Sbjct: 166 SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 N +LGRKGL+DD+K LL S + E+ + +++KSFSDPED+SE + NQYD Sbjct: 226 NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 ++++ LD YDDQLL R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA Sbjct: 
286 SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345 Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 L+N TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E P Sbjct: 346 TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405 Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEI------------ 2319 + E KA I S CPECQGTGI++E DELEKP V LS++ Sbjct: 406 KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRCCC 465 Query: 2320 -------FKYKIKAND 2346 F+YKIK +D Sbjct: 466 TRKLAGMFRYKIKVSD 481 >gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 478 Score = 468 bits (1204), Expect = e-129 Identities = 248/479 (51%), Positives = 310/479 (64%), Gaps = 1/479 (0%) Frame = +1 Query: 919 MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098 MAERRELG P+ CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD Sbjct: 1 MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60 Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278 SVL HL G+LH+ RLA A+ TLL NPWPFNDG++FF L +++K L +Q +L+ Sbjct: 61 SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120 Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458 +N +N LAIV + S++S R NVN L+IPGVL K+E+ Sbjct: 121 FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165 Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638 S L+V ++G +IAARF EKDG L++ RIWCEW G+ N+ +H FAVVTF Y Sbjct: 166 SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225 Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818 N +LGRKGL+DD+K LL S + E+ + +++KSFSDPED+SE + NQYD Sbjct: 226 NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285 Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998 ++++ LD YDDQLL R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA Sbjct: 286 SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345 Query: 1999 
ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178 L+N TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E P Sbjct: 346 TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405 Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352 + E KA I S CPECQGTGI++E DELEKP V LS++ +K C Sbjct: 406 KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRCC 464