BLASTX nr result

ID: Catharanthus23_contig00000111 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000111
         (2768 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   569   e-159
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   564   e-158
ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   558   e-156
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   548   e-153
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   543   e-151
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   537   e-150
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   535   e-149
gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao]    534   e-149
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   525   e-146
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   524   e-146
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   511   e-142
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     506   e-140
gb|EMJ00100.1| hypothetical protein PRUPE_ppa004741mg [Prunus pe...   488   e-135
gb|ESW21852.1| hypothetical protein PHAVU_005G104500g [Phaseolus...   478   e-132
ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807...   478   e-132
ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab...   474   e-130
ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779...   473   e-130
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ...   470   e-129
gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theob...   469   e-129
gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao]    468   e-129

>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  569 bits (1466), Expect = e-159
 Identities = 294/528 (55%), Positives = 360/528 (68%), Gaps = 8/528 (1%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA RRELGFPK    SL+EQ+AR TL NVR QGHTYV+LR+DGKR +FFCTLCLAPCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLHTERL+ A+ TLL PNPWPFNDG++FF++  +++K   V++      LD
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1279 TQSNLENPLAIVSWQKNL--------GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIP 1434
              +N  N LAIV + +++        G DE H     N   + ++ + + D+V     IP
Sbjct: 121  YHNNDSN-LAIVKYGEDMKVNGNEHSGLDEVHFDCE-NGTQVRDIYSESCDKV-----IP 173

Query: 1435 GVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHD 1614
            GV  K+E+  L V ++G+ QIAAR ++KD    +  RIWCEW G+ D  +E    + +HD
Sbjct: 174  GVFLKDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHD 233

Query: 1615 FAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQ 1794
            FA+VTF YNY+LGRKGL DD+K LL SSP  +SE+  G+  +++KSFSDPEDVSE +  Q
Sbjct: 234  FAIVTFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ 293

Query: 1795 YDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQK 1974
            YD             +++LLD Y DQLLHAR + SK  RRE+RRQQ +AAERMCDICQQK
Sbjct: 294  YDSCGEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQK 353

Query: 1975 MLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXX 2154
            +LP KDVAALLN KTG LACSSRNL G FHVFH+SCLIHWILLCE E+   Q   P    
Sbjct: 354  ILPDKDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKR 413

Query: 2155 XXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKI 2334
                           + E     QI S FCPECQGTG+NIE DELEKPT+ LS++FKYKI
Sbjct: 414  RSRRKNGSKRVQARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKI 473

Query: 2335 KANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRA 2478
            K +DA KAW K+PE LQNCS GFYFP +SE   QEKVSPLKLL FY A
Sbjct: 474  KVSDARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSA 521


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  564 bits (1454), Expect = e-158
 Identities = 294/521 (56%), Positives = 364/521 (69%), Gaps = 4/521 (0%)
 Frame = +1

Query: 931  RELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLF 1110
            R+L FP+    +LKEQ+ R+TL+NVR QGH YV+LR+DGKR VFFCTLC +PCYSDSVLF
Sbjct: 4    RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63

Query: 1111 KHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILDTQSN 1290
             HL GNLHTE LA A+ATLLKPNPWPFNDG++FFND P++DK  P  +  +  ++DT   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFND-PEQDKHSPNVNVGKSRLVDTCLE 122

Query: 1291 LENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSSLE 1470
             E+ LAIV    NL  + +         +L      N +    +LVIPGVL K+E+S LE
Sbjct: 123  DESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGE--SEYLVIPGVLCKDELSDLE 180

Query: 1471 VTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNYNL 1650
            V ++G+ +IAAR   +        RIWCEW  + DS +  + VV +HDFAVVTF YNYNL
Sbjct: 181  VKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDMDTSVVPDHDFAVVTFPYNYNL 240

Query: 1651 GRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXXXX 1830
            GRK L+DD ++LLPSSP+SESE+ SG+R RKRKSFSDPED SE + N  D          
Sbjct: 241  GRKPLLDD-RFLLPSSPYSESEETSGTRKRKRKSFSDPEDFSESLSNHCDSSGEESQSTN 299

Query: 1831 XLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAALLN 2010
              N K++L   DDQL+ +R++ SKTMRRELR+QQ VA+ERMCDICQQKMLPGKDVA LL+
Sbjct: 300  NSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATLLS 359

Query: 2011 RKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAP----XXXXXXXXXXXX 2178
             K+G+L CSSRN+TGAFH+FHVSCLIHWIL CEL+ Y K +D P                
Sbjct: 360  WKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSKRKTGT 419

Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358
                    +EIK+ R+I S FCPECQGTGI IE DELEKP V LSE++++KIK +DA KA
Sbjct: 420  KHNAKEKEDEIKSARRINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKLSDARKA 479

Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481
            W K+PE+LQNCS GF  PP+ + + QE VSPLKLL FYRAN
Sbjct: 480  WMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520


>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  558 bits (1437), Expect = e-156
 Identities = 292/528 (55%), Positives = 358/528 (67%), Gaps = 6/528 (1%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA R ELGF K    SL+EQ AR TLRNVR+QGH YV+LR+DGKR +FFCTLCLAPCYS+
Sbjct: 1    MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
            SVL+ HL GNLH+ER A A+ TLLK +PWPFNDG++FF++  + DK L + + +   +L 
Sbjct: 61   SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDEN-----HSQISLNRGVLHEVPNVNEDRVGYHLVIPGVL 1443
            T  N +N LAIV    +L    N     HS  + +  V     ++N       ++IPGV+
Sbjct: 121  THKN-DNNLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVM 179

Query: 1444 WKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAV 1623
             K+EV+ LEV ++G  QIAARF EKDG      +IWCEWFG+ + G+  + +V +HDFAV
Sbjct: 180  IKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAV 239

Query: 1624 VTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDX 1803
            VTF+Y+YNLGRKGL DD+  +L SSP        GS  +++KSFSDPED+SE + NQYD 
Sbjct: 240  VTFNYHYNLGRKGLFDDVISMLSSSP------TEGSGRKRKKSFSDPEDISESLSNQYDS 293

Query: 1804 XXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLP 1983
                       + ++LLD YDDQLL  R + SKT+RRELRRQQ VAAERMCDICQ KMLP
Sbjct: 294  SGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLP 353

Query: 1984 GKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXX 2163
            GKDVA L+N KTG+L CSSRN+ GAFHVFH SCLIHWILLCE EI+  QL  P       
Sbjct: 354  GKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSR 413

Query: 2164 XXXXXXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKA 2340
                        +  IK    QICS FCPECQGTGI IE DELE P +PLSE+FKYKIK 
Sbjct: 414  RKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIE-DELEIPNIPLSEMFKYKIKV 472

Query: 2341 NDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
            +DA +AW K+PE L++CS GF FP QS    QEKVS LKLL FY A+E
Sbjct: 473  SDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  548 bits (1413), Expect = e-153
 Identities = 287/525 (54%), Positives = 360/525 (68%), Gaps = 6/525 (1%)
 Frame = +1

Query: 931  RELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLF 1110
            ++L  P+    +LKEQ+ R+TL+NVR QGH YV+LR+DGKR +FFCTLC +PCYSDSVLF
Sbjct: 4    KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63

Query: 1111 KHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDE--DKSLPVTSADQIPILDTQ 1284
             HL GNLHTE LA A+ATLLKPNPWPFNDG++FFND   +  DK  P  +  +  ++DT 
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLFFNDPEQDKQDKQSPNVNVGKSRLVDTC 123

Query: 1285 SNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSS 1464
               E+ +AIV +  NL  +E+        G+L      NE+    +LVIPGVL K+E+S 
Sbjct: 124  LEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEE--SDYLVIPGVLCKDELSD 181

Query: 1465 LEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNY 1644
            LEV ++G+ +IAAR   +        RIWCEW  + DS +  + VV +HDFAVVTF YNY
Sbjct: 182  LEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDMDTSVVPDHDFAVVTFPYNY 241

Query: 1645 NLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXX 1824
            NLGR  L+DD ++LLPSSP+SESE+ S +  RKRKSFSDPED SE + N  D        
Sbjct: 242  NLGRSPLLDD-RFLLPSSPYSESEETSVTGKRKRKSFSDPEDFSESLSNHCDSSGEESQS 300

Query: 1825 XXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAAL 2004
                N K++L   DDQL+ +R++ SKTMRRELR+QQ VA+ERMCDICQQKMLPGKDVA L
Sbjct: 301  TNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDVATL 360

Query: 2005 LNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLD----APXXXXXXXXXX 2172
            L+ K+G+L CSSRN++GAFH+FHVSCLIHWIL CEL+   K +D     P          
Sbjct: 361  LSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSKKKT 420

Query: 2173 XXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352
                      +E K+ R+I S FCPECQGTGI IE DELEKP V LSE+++ KIK +DA 
Sbjct: 421  GTKHNAKEKEDETKSARRINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKLSDAR 480

Query: 2353 KAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANES 2487
            KAW K+PE+LQNCS GF  PP+ + + QE VSPLKLL FYRAN S
Sbjct: 481  KAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRANVS 525


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  543 bits (1399), Expect = e-151
 Identities = 282/524 (53%), Positives = 353/524 (67%), Gaps = 4/524 (0%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA R ELGFPK    SL+EQ AR  LRNVR QGHTYV+LR++GK+ +FFCTLCLAPCYSD
Sbjct: 1    MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
            SVLF HL G LHTERL+ A+ TLL PNPWPF+DG++FF+   + D  + +++ +   +L+
Sbjct: 61   SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120

Query: 1279 TQSNLENPLAIVSWQKNL-GSDENHSQISLNRGVLHEVP--NVNEDRVGYHLVIPGVLWK 1449
              +N +N LAIV +  N  G+     + + N   + +    N+N+      LVIPGVL K
Sbjct: 121  YNNN-DNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDCSFENLNDGGESCPLVIPGVLIK 179

Query: 1450 NEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVT 1629
             E+S ++V  +G  QIAARF EKDG      RIWCEW G+++ G E    V EH++A++T
Sbjct: 180  EEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGIENMVKVPEHNYAIIT 239

Query: 1630 FSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXX 1809
            F+YN +LGRKGL+DD+K LL SSP +ES+++   + +++KSFSDPED S  M  QYD   
Sbjct: 240  FTYNVDLGRKGLLDDVKLLLSSSPGAESQNDENRQVKRKKSFSDPEDGSLSMSPQYDSSG 299

Query: 1810 XXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGK 1989
                    + + + LDGYDDQ+L   V+ +K +RRELRRQQ +AAERMCDICQQK+L  K
Sbjct: 300  EDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQKILTHK 359

Query: 1990 DVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXX 2169
            DVA LLN KTGRLACSSRN+ G FHVFH SCLIHWILLCE EI  K L            
Sbjct: 360  DVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVRRRYRRK 419

Query: 2170 XXXXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKAND 2346
                      + E +    QI S FCP CQGTGI I+ D+LEKPTVPLSEIFKYKIK +D
Sbjct: 420  KKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKYKIKVSD 479

Query: 2347 ACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRA 2478
            A +AW KSPE+LQNCS GF FP Q +   QE V PLKLL FY A
Sbjct: 480  ARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  537 bits (1384), Expect = e-150
 Identities = 278/533 (52%), Positives = 364/533 (68%), Gaps = 11/533 (2%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA R ++G PK   CSL+EQ  R  LRNVR QGH+YV++R+DGK+ +FFCTLCLAPCYSD
Sbjct: 1    MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLH ERLA A+ TLL+PNPWPFNDG++FFN+  + DK +     ++  +L+
Sbjct: 61   KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120

Query: 1279 TQSNLENPLAIVSWQKNLGSD----------ENHSQISLNRGVLHEVPNVNEDRVGYHLV 1428
            +  N EN LAIV +  NL ++          E +  I L +G+   V +   D     +V
Sbjct: 121  SHDN-ENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDL-QGLQSNVGDSTADGAKSSVV 178

Query: 1429 IPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLE 1608
            IPG++ ++E++ LEV  VG+ +IAARF+ KDG      RIWCEW G     +E    V E
Sbjct: 179  IPGIVVRDEITDLEVREVGLGEIAARFLGKDGI----GRIWCEWLGVKSIDSEDLCNVPE 234

Query: 1609 HDFAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMG 1788
            HDFAVVTFSYN +LGRKGL+DD++ LL SSP  ES +  G+  +++KSFSDPED+S+ + 
Sbjct: 235  HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCKRKKSFSDPEDISDSLS 294

Query: 1789 NQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQ 1968
            NQY+             +++LLD YDDQLL+ R + +K++RRELRRQQ +A+ RMCDICQ
Sbjct: 295  NQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDICQ 354

Query: 1969 QKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXX 2148
            Q+MLPGKDVA L+N KTG+LACSSRN+ GAFHVFH SCLIHWILLCE+E+   Q      
Sbjct: 355  QRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQ--NTGS 412

Query: 2149 XXXXXXXXXXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFK 2325
                             + ++K+ + QI S FCPECQGTGI ++ D+LEKP +PLS++F+
Sbjct: 413  KARRRSRRKTAAKCNGKDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMFR 472

Query: 2326 YKIKANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
            YKIK +DA +AW KSPE+LQNCS GF+FP  +    QEKV  LKLL FYRA+E
Sbjct: 473  YKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  535 bits (1378), Expect = e-149
 Identities = 275/502 (54%), Positives = 340/502 (67%), Gaps = 8/502 (1%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA RRELGFPK    SL+EQ+AR TL NVR QGHTYV+LR+DGKR +FFCTLCLAPCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLHTERL+ A+ TLL PNPWPFNDG++FF++  +++K   V++      LD
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1279 TQSNLENPLAIVSWQKNL--------GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIP 1434
              +N  N LAIV + +++        G DE H     N   + ++ + + D+V     IP
Sbjct: 121  YHNNDSN-LAIVKYGEDMKVNGNEHSGLDEVHFDCE-NGTQVRDIYSESCDKV-----IP 173

Query: 1435 GVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHD 1614
            GV  K+E+  L V ++G+ QIAAR ++KD    +  RIWCEW G+ D  +E    + +HD
Sbjct: 174  GVFLKDEIVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPEDEDIVEIPDHD 233

Query: 1615 FAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQ 1794
            FA+VTF YNY+LGRKGL DD+K LL SSP  +SE+  G+  +++KSFSDPEDVSE +  Q
Sbjct: 234  FAIVTFVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRKRKKSFSDPEDVSESLSKQ 293

Query: 1795 YDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQK 1974
            YD             +++LLD Y DQLLHAR + SK  RRE+RRQQ +AAERMCDICQQK
Sbjct: 294  YDSCGEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQK 353

Query: 1975 MLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXX 2154
            +LP KDVAALLN KTG LACSSRNL G FHVFH+SCLIHWILLCE E+   Q   P    
Sbjct: 354  ILPDKDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKR 413

Query: 2155 XXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKI 2334
                           + E     QI S FCPECQGTG+NIE DELEKPT+ LS++FKYKI
Sbjct: 414  RSRRKNGSKRVQARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKI 473

Query: 2335 KANDACKAWFKSPELLQNCSLG 2400
            K +DA KAW K+PE LQNCS G
Sbjct: 474  KVSDARKAWMKNPEALQNCSTG 495


>gb|EOX99406.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 517

 Score =  534 bits (1376), Expect = e-149
 Identities = 279/523 (53%), Positives = 350/523 (66%), Gaps = 1/523 (0%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MAERRELG P+   CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
            SVL  HL G+LH+ RLA A+ TLL  NPWPFNDG++FF  L +++K L     +Q  +L+
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
              +N +N LAIV +          S++S  R       NVN       L+IPGVL K+E+
Sbjct: 121  FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
            S L+V ++G  +IAARF EKDG L++  RIWCEW G+    N+      +H FAVVTF Y
Sbjct: 166  SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            N +LGRKGL+DD+K LL S   +  E+   +  +++KSFSDPED+SE + NQYD      
Sbjct: 226  NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  ++++ LD YDDQLL  R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA
Sbjct: 286  SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
             L+N  TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E        P            
Sbjct: 346  TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405

Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACK 2355
                   + E KA    I S  CPECQGTGI++E DELEKP V LS++F+YKIK +DA +
Sbjct: 406  KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDARR 465

Query: 2356 AWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
            AW KSPE+L+NCS GF+F  QS  + QEK+ PLKLL FY A++
Sbjct: 466  AWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADK 508


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  525 bits (1352), Expect = e-146
 Identities = 278/525 (52%), Positives = 352/525 (67%), Gaps = 3/525 (0%)
 Frame = +1

Query: 919  MAERRELGFPKHGVC-SLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYS 1095
            MA R ELGF K G   SLKEQ+AR TL NVR +GH YV+LR+DGKR +FFCTLCLAPCYS
Sbjct: 1    MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60

Query: 1096 DSVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPIL 1275
            D+VLF HL GNLHTERL+ A  TLLK NPWPF+DG+ FF+   + +K L + + ++    
Sbjct: 61   DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQLVIKNDNE---- 116

Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455
             ++ N  + LAIV +  +L           N+       + N++     L+I GVL K++
Sbjct: 117  -SRGNGNSSLAIVKYGGSL-KPTGDEDTGCNK-------DANDNGRISDLLIQGVLVKDD 167

Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635
            +S L+  ++G  +I AR +EKDG  +D  RIWCEW G+    +     VL+H+FAVVTF+
Sbjct: 168  ISDLQARFMGYGRIGARLIEKDGNSNDISRIWCEWLGKNTPCDLDKAKVLDHEFAVVTFA 227

Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYD-XXXX 1812
            YNY+LGRKGL+DD+K LL SSP  ES++  G+  +++KSFSDPEDVSE   NQYD     
Sbjct: 228  YNYDLGRKGLLDDVKLLLSSSPVQESDNQGGTNRKRKKSFSDPEDVSESFSNQYDSSGEE 287

Query: 1813 XXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKD 1992
                      ++LLD +DDQ LH++V+ SKT+RRELRRQ  +AAERMCDICQQK+LP KD
Sbjct: 288  SLTSIGGPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKD 347

Query: 1993 VAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXX 2172
            VA L+N  TG+LACSSRN  G +HVFH SCLIHWILL E E+   Q  +P          
Sbjct: 348  VATLVNMNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKN 407

Query: 2173 XXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDA 2349
                      E++KA N QI S FCPECQGTG  +E DE E PT+PLSE+FKYKIK  D 
Sbjct: 408  GTKSSHV---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDG 464

Query: 2350 CKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
             +AW KSPE+L+NCS+GF+FP QSE   Q KV PLKLL FYRA+E
Sbjct: 465  RRAWMKSPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  524 bits (1350), Expect = e-146
 Identities = 271/492 (55%), Positives = 335/492 (68%), Gaps = 6/492 (1%)
 Frame = +1

Query: 964  SLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSDSVLFKHLHGNLHTER 1143
            SL+EQ AR TLRNVR+QGH YV+LR+DGKR +FFCTLCLAPCYS+SVL+ HL GNLH+ER
Sbjct: 352  SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411

Query: 1144 LAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILDTQSNLENPLAIVSWQ 1323
             A A+ TLLK +PWPFNDG++FF++  + DK L + + +   +L T  N +N LAIV   
Sbjct: 412  YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKN-DNNLAIVCHG 470

Query: 1324 KNLGSDEN-----HSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEVSSLEVTYVGV 1488
             +L    N     HS  + +  V     ++N       ++IPGV+ K+EV+ LEV ++G 
Sbjct: 471  DDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFLGF 530

Query: 1489 AQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSYNYNLGRKGLI 1668
             QIAARF EKDG      +IWCEWFG+ + G+  + +V +HDFAVVTF+Y+YNLGRKGL 
Sbjct: 531  GQIAARFFEKDGVSKGISKIWCEWFGKEEPGDGETVMVPDHDFAVVTFNYHYNLGRKGLF 590

Query: 1669 DDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXXXXXXXLNAKV 1848
            DD+  +L SSP        GS  +++KSFSDPED+SE + NQYD            + ++
Sbjct: 591  DDVISMLSSSP------TEGSGRKRKKSFSDPEDISESLSNQYDSSGEDSLISNSPSPRL 644

Query: 1849 LLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVAALLNRKTGRL 2028
            LLD YDDQLL  R + SKT+RRELRRQQ VAAERMCDICQ KMLPGKDVA L N KTG+L
Sbjct: 645  LLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNMKTGKL 704

Query: 2029 ACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXXXXXXXXXNEE 2208
             CSSRN+ GAFHVFH SCLIHWILLCE EI+  QL  P                   +  
Sbjct: 705  VCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNGKGKDGV 764

Query: 2209 IK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKAWFKSPELLQ 2385
            IK    QICS FCPECQGTGI IE DELE P +PLSE+FKYKIK +DA +AW K+PE L+
Sbjct: 765  IKPTTLQICSVFCPECQGTGIMIE-DELEIPNIPLSEMFKYKIKVSDAHRAWMKNPEELK 823

Query: 2386 NCSLGFYFPPQS 2421
            +CS GF FP QS
Sbjct: 824  HCSTGFNFPSQS 835


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  511 bits (1315), Expect = e-142
 Identities = 267/526 (50%), Positives = 341/526 (64%), Gaps = 4/526 (0%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA  RE+GFPK    SL+EQ+AR TL  VR +GH Y++LR+DGKR +FFCTLCL+PCYSD
Sbjct: 1    MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIP-IL 1275
            ++L  HL GNLHTERL+ A+ATLLKPNPWPF+DG+ FF+     ++ L +    +    L
Sbjct: 61   TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLAIKDGKESSRFL 120

Query: 1276 DTQSNLENPLAIVSWQKNL--GSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWK 1449
              + N +N LAIV + +NL  G D           V+ E  N++    G  LVIP V  K
Sbjct: 121  KFEENSDN-LAIVKYVENLKPGCDT----------VVDE--NLSGSDEGSDLVIPSVRLK 167

Query: 1450 NEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVT 1629
             EVS L+ T VG  QIAAR  EK    ++  RIWCEW G+  S +E    VL+HDF VVT
Sbjct: 168  EEVSDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGKKSSNDEDKVKVLDHDFGVVT 227

Query: 1630 FSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXX 1809
            F+Y+Y LG+ GL DD+K LL SS  + +E++     ++++S S+PEDVS  + NQY    
Sbjct: 228  FAYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCE 287

Query: 1810 XXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGK 1989
                     ++ ++LD YDDQL+H R + +KT+RRE+R+QQ +AAE+MCDICQQKMLP K
Sbjct: 288  EESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEK 347

Query: 1990 DVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXX 2169
            DVA L NRKTG+LACSSRN+ GAFHVFH SCLIHWIL CE EI   Q  +          
Sbjct: 348  DVATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKK 407

Query: 2170 XXXXXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKAND 2346
                      +  +      I S FCP+CQGTG+NIE DE EKP  PLSE+FKYKIK ++
Sbjct: 408  NGTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSE 467

Query: 2347 ACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
              + W K+PE+L+NCS GF+FP QS    QEKV PLKLL FYR  E
Sbjct: 468  GHRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  506 bits (1303), Expect = e-140
 Identities = 270/516 (52%), Positives = 341/516 (66%), Gaps = 14/516 (2%)
 Frame = +1

Query: 919  MAERRELGFPKHGV--------CSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTL 1074
            MA R  LGFPK           CSLK+Q  R  LRNVR QGHTYV+LR+DGK+ +FFCTL
Sbjct: 1    MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60

Query: 1075 CLAPCYSDSVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTS 1254
            CLAPCYSD VLF HL GNLH +RL+ A+ TLL PNPWPFNDG++FFN+  + D    +++
Sbjct: 61   CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120

Query: 1255 ADQIPILDTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVG----YH 1422
             +Q  +L++Q + EN LAIV++ +NL S  N   +    G  +E P+   +  G      
Sbjct: 121  GNQSRLLESQDS-ENNLAIVTYGENLESCANGHIMVDELGHQNENPDSAGNLAGSGENCA 179

Query: 1423 LVIPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVV 1602
            ++IPGV   +E++++EV  VG   I+ RF EKDG  +D  RIWCEW G+    +E    V
Sbjct: 180  VLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIEDEDFLKV 239

Query: 1603 LEHDFAVVTFSY-NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSE 1779
             EHDFA+VTFSY N++LGR GL DD+K LL SSP +E ++   S  ++RKSFSDPED SE
Sbjct: 240  PEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRKRRKSFSDPEDSSE 299

Query: 1780 FMGNQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCD 1959
             + NQYD               ++LD YDDQLL  R + +K +RRELRRQQ +AAERMCD
Sbjct: 300  NLSNQYDSCGEDSSASAV--TSLMLDQYDDQLLQTRFISNKAIRRELRRQQRIAAERMCD 357

Query: 1960 ICQQKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDA 2139
            ICQ KMLPGKDVA L+N KTGRLACSSRN  GAFH+FH SCLIHW+LLCE+E    Q +A
Sbjct: 358  ICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEVEKCTNQSEA 417

Query: 2140 PXXXXXXXXXXXXXXXXXXXNEEIKANR-QICSAFCPECQGTGINIESDELEKPTVPLSE 2316
            P                   + E+KA R  I    CPECQGTG  I+ ++ EKPTVPLS+
Sbjct: 418  PKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMIDGED-EKPTVPLSK 476

Query: 2317 IFKYKIKANDACKAWFKSPELLQNCSLGFYFPPQSE 2424
            +FKYKIK +DA +AW KSPE+L NCS GF+FP  +E
Sbjct: 477  MFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAE 512


>gb|EMJ00100.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  488 bits (1256), Expect = e-135
 Identities = 263/536 (49%), Positives = 333/536 (62%), Gaps = 15/536 (2%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA R ELGFPK    SL+EQ  R  LRNVR QGHTYV+LR+DGK+ +FFCTLCLAPCYSD
Sbjct: 1    MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLH +RLA A+ TLL+PNPWPFNDG+ FF++  + DK L +T  ++  +L+
Sbjct: 61   KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHS---------------QISLNRGVLHEVPNVNEDRV 1413
            +  + EN LAIV + +NL S+ N                 ++  N        N   + V
Sbjct: 121  SPDD-ENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEV 179

Query: 1414 GYHLVIPGVLWKNEVSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGS 1593
               +VIP VL +++V+ +E   VG+ QIAARF+EKD       RIWCEW G+   GNE  
Sbjct: 180  NSSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGKKAIGNEYH 239

Query: 1594 HVVLEHDFAVVTFSYNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDV 1773
              V EHDFAVVTFSYN +LGR+GL+DD+K LL SSP  E+E+  GS ++++KSFSDPED+
Sbjct: 240  LKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSKRKKSFSDPEDI 299

Query: 1774 SEFMGNQYDXXXXXXXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERM 1953
            SE + NQYD            ++K+LLD YDDQLLH R + +K++RRELRRQQ +A  RM
Sbjct: 300  SESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALGRM 359

Query: 1954 CDICQQKMLPGKDVAALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQL 2133
            CDICQQ+M+PGKDV+AL+N KTGRLACSSRN+ GAFHVFH SCLIHWILLCE+EI  +  
Sbjct: 360  CDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEIANQST 419

Query: 2134 DAPXXXXXXXXXXXXXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLS 2313
            ++                     +    + QI S FCPECQGTG  I+ D+LEKP +PL 
Sbjct: 420  NS--KVRRRSRRKNAAKCNGQDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNLPL- 476

Query: 2314 EIFKYKIKANDACKAWFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481
                                                   SQEKV PLKL+ FYRA+
Sbjct: 477  ---------------------------------------SQEKVKPLKLMHFYRAD 493


>gb|ESW21852.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris]
          Length = 498

 Score =  478 bits (1231), Expect = e-132
 Identities = 255/521 (48%), Positives = 339/521 (65%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA + ELG  K  V + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD
Sbjct: 1    MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+   + D+ L V  + +  +L 
Sbjct: 61   DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
              +N +N LAIV + + + S+               +PN   D  G  LVIP +L ++E+
Sbjct: 121  FNNN-DNSLAIVKFDEGVQSNAEPCSTD-------GMPN---DECG--LVIPHLLIRDEI 167

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
              ++V+ VG+ +IAARF+EK   L    RIWCEW G+  +  +    +LEHDFA+V F+Y
Sbjct: 168  FDVKVSEVGLGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQQDGVEILEHDFAIVNFAY 227

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            NY+LGR GL+DD+K LLPS+        SG R  KR S SD +D+S+ + NQYD      
Sbjct: 228  NYDLGRSGLLDDVKSLLPSA--------SGGRKGKR-SLSDSDDISDSLCNQYDSSAEES 278

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  +A + LD +++  +  R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA
Sbjct: 279  SDSNNSSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVA 338

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
            ALLN  T R+ACSSRN TGAFHVFH SCLIHWI+LCE EI    L  P            
Sbjct: 339  ALLNLNTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIAS 398

Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358
                    ++I+  + I + FCPECQGTG+ I+ D +E+P   LS++FK+KIKA DA + 
Sbjct: 399  DGEKIGKEKDIE--KHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARRE 456

Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRAN 2481
            W KSPE+LQNCS GF+FP QSE I +EKV P+ LL FYRA+
Sbjct: 457  WMKSPEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497


>ref|XP_003541913.1| PREDICTED: uncharacterized protein LOC100807746 [Glycine max]
          Length = 500

 Score =  478 bits (1230), Expect = e-132
 Identities = 254/522 (48%), Positives = 337/522 (64%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA + ELG PK  + + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD
Sbjct: 1    MAGKLELGPPKSDISNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+   + DK L V  + +  +L 
Sbjct: 61   DVLFDHLKGNLHRERLSAAKVTLLGPKPWPFNDGLVFFDTSTESDKELEVADSYRNRLLK 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
               + ++ LAIV + + + S+     I            + +D     LVIP +L  +E+
Sbjct: 121  FNDD-DSSLAIVKFGEGVQSNAKPCSIE----------GMQDDECA--LVIPNLLIGDEI 167

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
              L+V  VG+ +IAARF+EK   L+   RIWCEW G+  +G      VLEHDFAVV F+Y
Sbjct: 168  FDLKVKEVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAY 227

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            NY+LGR GL+DD+K LLP S          +  + + S SD +DVS+F+ NQYD      
Sbjct: 228  NYDLGRSGLLDDVKTLLPVS----------AGQKGKTSLSDSDDVSDFLCNQYDSSAEES 277

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  ++++ LD +++ L   R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA
Sbjct: 278  SDSNNSSSRLTLDQFNNHLC-TRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVA 336

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
            ALLN KT R+ACSSRN TGAFHVFH SCLIHWI+LCE EI    L  P            
Sbjct: 337  ALLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIIINHLVRPNIRRVVKRKVAS 396

Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358
                    ++I   + I + FCPECQGTG+ I+ D +E+P   LS++FK+KIKA DA + 
Sbjct: 397  DGDKMGKEKDI--GKHIRTVFCPECQGTGMIIDGDGVEQPEFSLSQMFKFKIKACDARRD 454

Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
            W KSPE+LQNCS GF+FP QSE I +EKV P+ LL FYRA++
Sbjct: 455  WIKSPEVLQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRADD 496


>ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp.
            lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein
            ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata]
          Length = 517

 Score =  474 bits (1219), Expect = e-130
 Identities = 245/525 (46%), Positives = 334/525 (63%), Gaps = 6/525 (1%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MAE++ELG PK  + +LKEQ+AR TL+N+RLQGHTY++LR+DGKR VFFCTLCLAPCYSD
Sbjct: 1    MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLP-DEDKSLPVTSADQIPIL 1275
            ++L  HL+GNLH ERLA AR TLL  NPWPF+DG++FF+    +E++  PV+    +P  
Sbjct: 60   TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119

Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455
                + ++  AIV +  N  +  N         V  + P+ + D     L+I GVL K  
Sbjct: 120  LGHCSDDDRFAIVKYDNNKANGGNQPA-----AVTDDEPSHSTD----DLLISGVLIKER 170

Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635
               +E  ++G  +IAAR  E  G      ++WCEW G     +E    + EHDFA+VTFS
Sbjct: 171  TLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSDEEKATIPEHDFAIVTFS 230

Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXX 1815
            Y YNLGR GL+DD   LL +S  SES +   S  +++KSFSDPED SE + NQYD     
Sbjct: 231  YFYNLGRLGLLDDPSRLLTTS-QSESGNGEDSGRKRKKSFSDPEDTSESLCNQYDSSEEV 289

Query: 1816 XXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDV 1995
                   +++ L+  YDD L+  RV+K+KT+RRELRRQQ + +ER+C++C+QKMLPGKD 
Sbjct: 290  SSGHNSNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDA 349

Query: 1996 AALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXX 2175
            AA+LN KTG LAC SRNL GAFH+FHVSC++HW L CE EI   ++ +            
Sbjct: 350  AAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKHSS 409

Query: 2176 XXXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACK 2355
                          + QI S FCPECQGTGINIE   +E+ T PLS+ +++++K ++  K
Sbjct: 410  GQTGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRK 469

Query: 2356 AWFKSPELLQNCSLGFYFPPQSE-----IISQEKVSPLKLLPFYR 2475
            AW K+PE L+NCS GF+FP Q++      + +E+V  +KL+ FYR
Sbjct: 470  AWVKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYR 514


>ref|XP_003540357.1| PREDICTED: uncharacterized protein LOC100779572 isoform X1 [Glycine
            max] gi|571494415|ref|XP_006592839.1| PREDICTED:
            uncharacterized protein LOC100779572 isoform X2 [Glycine
            max]
          Length = 501

 Score =  473 bits (1218), Expect = e-130
 Identities = 256/522 (49%), Positives = 335/522 (64%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MA + ELG PK  V + KEQ ARK L+ VR QGH YV+LR++GK+ ++FCTLCLAPCYSD
Sbjct: 1    MAGKLELGPPKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
             VLF HL GNLH ERL+ A+ TLL P PWPFNDG++FF+   +  K L V  + Q  +L 
Sbjct: 61   DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSTESHKELEVADSYQNRLLK 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
               N +  LAIV +   + S+     I            + +D   Y LVIP +L  +E+
Sbjct: 121  FNDN-DVSLAIVKFGDGVQSNAKPRSID----------GMQDDE--YALVIPNLLIGDEI 167

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
              ++V  VG+ +IAARF+EK   L+   RIWCEW G+  +G      VLEHDFAVV F+Y
Sbjct: 168  FDVKVREVGLGKIAARFLEKCHALNGIKRIWCEWLGKESNGERDGVEVLEHDFAVVIFAY 227

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            NY+LGR GL+DD+  LLPS+        SG +  K  S SD +DVS+ + NQYD      
Sbjct: 228  NYDLGRSGLLDDVNTLLPSA--------SGGQKGK-SSLSDFDDVSDSVCNQYDSSAEES 278

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  ++++ LD +++ L   R + SK +R+ELRR+Q +AAE++C+ICQQKMLPGKDVA
Sbjct: 279  SDSNNSSSRLTLDQFNNHLC-TRFISSKALRKELRRKQRLAAEKVCNICQQKMLPGKDVA 337

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
            ALLN KT R+ACSSRN TGAFHVFH SCLIHWI+LCE EI    L  P            
Sbjct: 338  ALLNLKTRRVACSSRNRTGAFHVFHTSCLIHWIILCEFEIITNHLVCPNVRRVVKRKVAS 397

Query: 2179 XXXXXXXNEEIKANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDACKA 2358
                    ++I   + I + FCPECQGTG+ I+ D +E+P   LS++FK+KIKA DA + 
Sbjct: 398  DGNKIGKEKDI--GKHIRTVFCPECQGTGMIIDGDGVEQPEFSLSQMFKFKIKACDARRD 455

Query: 2359 WFKSPELLQNCSLGFYFPPQSEIISQEKVSPLKLLPFYRANE 2484
            W KSPE+L+NCS GF+FP QSE I +EKV P+ LL FYRA++
Sbjct: 456  WIKSPEVLKNCSTGFHFPSQSEEIFEEKVEPINLLHFYRADD 497


>ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145334149|ref|NP_001078455.1| uncharacterized protein
            [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1|
            putative protein [Arabidopsis thaliana]
            gi|110742700|dbj|BAE99261.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1|
            uncharacterized protein AT4G28260 [Arabidopsis thaliana]
            gi|332660061|gb|AEE85461.1| uncharacterized protein
            AT4G28260 [Arabidopsis thaliana]
          Length = 516

 Score =  470 bits (1209), Expect = e-129
 Identities = 248/526 (47%), Positives = 337/526 (64%), Gaps = 7/526 (1%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MAE++ELG PK  + +LKEQ+AR TL+N+RLQGHTY++LR+DGKR VFFCTLCLAPCYSD
Sbjct: 1    MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLP-DEDKSLPVTSADQIPIL 1275
            ++L  HL+GNLH ERLA AR TLL  NPWPF+DG++FF+    +E++  PV+  + +P  
Sbjct: 60   TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119

Query: 1276 DTQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNE 1455
                + +   AIV +  N  + +N     +   V  + P+   D     L+I GVL K  
Sbjct: 120  LEHCSDDERFAIVKYDNNKTNGDN-----VPAAVTDDEPSHAAD----DLLISGVLIKER 170

Query: 1456 VSSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFS 1635
               +E  ++G  +IAAR  E  G      ++WCEW G     +E    + EHDFA+VTFS
Sbjct: 171  TLDVEAKFIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSDEEKATIPEHDFAIVTFS 230

Query: 1636 YNYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXX 1815
            Y YNLGR GL+DD   LL SS  SES +   S  +++KSFSDPED SE + NQYD     
Sbjct: 231  YFYNLGRLGLLDDPGRLLTSS-QSESGNGEDSGRKRKKSFSDPEDTSESLCNQYDSSEEV 289

Query: 1816 XXXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDV 1995
                   +++ L+  YDD L+  RV+K++T+RRELRRQQ + +ER+C++C+QKMLPGKD 
Sbjct: 290  SSGHNSNSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDA 349

Query: 1996 AALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXX 2175
            AA+LN KTG LAC SRNL GAFH+FHVSC++HW L CE EI   ++ +            
Sbjct: 350  AAILNMKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVS--GKGKKRCTKH 407

Query: 2176 XXXXXXXXNEEIK-ANRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352
                    NE     + QI S FCPECQGTGINIE   +E+ T PLS+ +++++K ++  
Sbjct: 408  SGQTGVKWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGR 467

Query: 2353 KAWFKSPELLQNCSLGFYFPPQSE-----IISQEKVSPLKLLPFYR 2475
            KAW K+PE L+NCS GF+FP Q+E      + +E+V  +KL+ FYR
Sbjct: 468  KAWVKNPERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYR 513


>gb|EOX99407.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  469 bits (1207), Expect = e-129
 Identities = 252/496 (50%), Positives = 315/496 (63%), Gaps = 20/496 (4%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MAERRELG P+   CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
            SVL  HL G+LH+ RLA A+ TLL  NPWPFNDG++FF  L +++K L     +Q  +L+
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
              +N +N LAIV +          S++S  R       NVN       L+IPGVL K+E+
Sbjct: 121  FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
            S L+V ++G  +IAARF EKDG L++  RIWCEW G+    N+      +H FAVVTF Y
Sbjct: 166  SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            N +LGRKGL+DD+K LL S   +  E+   +  +++KSFSDPED+SE + NQYD      
Sbjct: 226  NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  ++++ LD YDDQLL  R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA
Sbjct: 286  SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
             L+N  TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E        P            
Sbjct: 346  TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405

Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEI------------ 2319
                   + E KA    I S  CPECQGTGI++E DELEKP V LS++            
Sbjct: 406  KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRCCC 465

Query: 2320 -------FKYKIKAND 2346
                   F+YKIK +D
Sbjct: 466  TRKLAGMFRYKIKVSD 481


>gb|EOX99409.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 478

 Score =  468 bits (1204), Expect = e-129
 Identities = 248/479 (51%), Positives = 310/479 (64%), Gaps = 1/479 (0%)
 Frame = +1

Query: 919  MAERRELGFPKHGVCSLKEQVARKTLRNVRLQGHTYVDLRKDGKREVFFCTLCLAPCYSD 1098
            MAERRELG P+   CSLKEQ+AR TL NVR QGHTY++LR+DGKR +FFCTLCLAPCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1099 SVLFKHLHGNLHTERLAVARATLLKPNPWPFNDGMIFFNDLPDEDKSLPVTSADQIPILD 1278
            SVL  HL G+LH+ RLA A+ TLL  NPWPFNDG++FF  L +++K L     +Q  +L+
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGLHGNQNRLLE 120

Query: 1279 TQSNLENPLAIVSWQKNLGSDENHSQISLNRGVLHEVPNVNEDRVGYHLVIPGVLWKNEV 1458
              +N +N LAIV +          S++S  R       NVN       L+IPGVL K+E+
Sbjct: 121  FHNNDDN-LAIVEYVG--------SEVSSYR------KNVNCRAGDSDLLIPGVLIKDEI 165

Query: 1459 SSLEVTYVGVAQIAARFVEKDGFLDDFHRIWCEWFGRMDSGNEGSHVVLEHDFAVVTFSY 1638
            S L+V ++G  +IAARF EKDG L++  RIWCEW G+    N+      +H FAVVTF Y
Sbjct: 166  SDLKVRFIGFGKIAARFCEKDGVLNEISRIWCEWLGKEVPRNDDKLKAPKHGFAVVTFVY 225

Query: 1639 NYNLGRKGLIDDIKYLLPSSPHSESEDNSGSRNRKRKSFSDPEDVSEFMGNQYDXXXXXX 1818
            N +LGRKGL+DD+K LL S   +  E+   +  +++KSFSDPED+SE + NQYD      
Sbjct: 226  NCDLGRKGLLDDVKSLLTSGSPTGLENGDSASRKRKKSFSDPEDISESLSNQYDSSGEDS 285

Query: 1819 XXXXXLNAKVLLDGYDDQLLHARVLKSKTMRRELRRQQSVAAERMCDICQQKMLPGKDVA 1998
                  ++++ LD YDDQLL  R + SK +RRELRRQQ +AAERMCDICQQKMLP KDVA
Sbjct: 286  SASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQRIAAERMCDICQQKMLPEKDVA 345

Query: 1999 ALLNRKTGRLACSSRNLTGAFHVFHVSCLIHWILLCELEIYAKQLDAPXXXXXXXXXXXX 2178
             L+N  TG+L CSSRN+ GAFHVFH SCLIHWILLCE+E        P            
Sbjct: 346  TLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKNGA 405

Query: 2179 XXXXXXXNEEIKA-NRQICSAFCPECQGTGINIESDELEKPTVPLSEIFKYKIKANDAC 2352
                   + E KA    I S  CPECQGTGI++E DELEKP V LS++    +K    C
Sbjct: 406  KSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRCC 464

