BLASTX nr result

ID: Rheum21_contig00017500 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00017500
         (1780 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   363   2e-97
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   362   2e-97
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   353   1e-94
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   345   3e-92
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   345   5e-92
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   344   8e-92
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   341   5e-91
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     340   2e-90
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              320   2e-84
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   319   2e-84
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   318   5e-84
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   315   4e-83
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   311   4e-82
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   311   4e-82
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   303   1e-79
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   302   4e-79
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   301   5e-79
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   296   2e-77
ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791...   286   2e-74
ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791...   284   1e-73

>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  363 bits (931), Expect = 2e-97
 Identities = 202/421 (47%), Positives = 258/421 (61%), Gaps = 6/421 (1%)
 Frame = -3

Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXX 1548
            SQK+RW GC S  WCF  QKH+KRIG A+LVPEP+++   A+ A N +    I   F   
Sbjct: 38   SQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAP 97

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
                               AGLVS  SI+ N++SPGGP S+FAIGPYAHETQLVSPP FS
Sbjct: 98   PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXY 1191
            T+TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD S   G    +F            +
Sbjct: 158  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011
            PGSP G LISP S ISGSGTSSP  D E   AG  +PD   G PP++LNL  L   +WGS
Sbjct: 218  PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277

Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDN----QESVSQRVSFELISEDVVR 843
             + SG+LTPD       +GF  N Q +  +L    +N     + V  RVSFEL +EDVVR
Sbjct: 278  RQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVR 337

Query: 842  CVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKP-QAPADAEEGQ 666
            CVEK+PT L +A+ + +L++    E+     ++  +    + E A ++P + P D EE  
Sbjct: 338  CVEKKPTTLAEAV-SESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAP 396

Query: 665  RCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQ 486
            R Q+Q+S +LGS KEFNFD+ + G+  +P +  +WW N+KV GK++    KNW+FFPV Q
Sbjct: 397  RHQKQQSITLGSTKEFNFDSAD-GDSHEPTIASDWWANEKVVGKDS-GAIKNWAFFPVIQ 454

Query: 485  P 483
            P
Sbjct: 455  P 455


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  362 bits (930), Expect = 2e-97
 Identities = 201/421 (47%), Positives = 258/421 (61%), Gaps = 6/421 (1%)
 Frame = -3

Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXX 1548
            SQK+RW GC +  WCF  QKH+KRIG A+LVPEP+++   A+ A N +    I   F   
Sbjct: 38   SQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAP 97

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
                               AGLVS  SI+ N++SPGGP S+FAIGPYAHETQLVSPP FS
Sbjct: 98   PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXY 1191
            T+TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD S   G    +F            +
Sbjct: 158  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011
            PGSP G LISP S ISGSGTSSP  D E   AG  +PD   G PP++LNL  L   +WGS
Sbjct: 218  PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277

Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDN----QESVSQRVSFELISEDVVR 843
             + SG+LTPD       +GF  N Q +  +L    +N     + V  RVSFEL +EDVVR
Sbjct: 278  RQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVR 337

Query: 842  CVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKP-QAPADAEEGQ 666
            CVEK+PT L +A+ + +L++    E+     ++  +    + E A ++P + P D EE  
Sbjct: 338  CVEKKPTTLAEAV-SESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAP 396

Query: 665  RCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQ 486
            R Q+Q+S +LGS KEFNFD+ + G+  +P +  +WW N+KV GK++    KNW+FFPV Q
Sbjct: 397  RHQKQQSITLGSTKEFNFDSAD-GDSHEPTIASDWWANEKVVGKDS-GAIKNWAFFPVIQ 454

Query: 485  P 483
            P
Sbjct: 455  P 455


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  353 bits (907), Expect = 1e-94
 Identities = 208/427 (48%), Positives = 254/427 (59%), Gaps = 10/427 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW    S YWCF  Q+HKKRIG A+LVPE +     A  A N   PI   +      
Sbjct: 38   QKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAEN---PIQTPSIVLPFV 94

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                                   F S+ A+++SP GP S+FAIGPYAHETQLVSPP FST
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTSIFAIGPYAHETQLVSPPVFST 154

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD    +G    RF            YP
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYP 214

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP GQLISP S ISGSGTSSP  D E  A G ++ + R G PP++LNL +L   DWGS 
Sbjct: 215  GSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDILSTRDWGSR 274

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE-----SVSQRVSFELISEDVVR 843
              SGS+TPDG    S DGF +  Q     L    +N+      S++ RVSFEL SE+V+R
Sbjct: 275  LGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIR 334

Query: 842  CVEKEPTVLPKALQTVALED----KDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675
            CVEK+P  L +A+ T +LED    + +++ S V   S    G+TS  DA EK  A AD E
Sbjct: 335  CVEKKPVALAEAVST-SLEDTEKAQSKEDPSKVVSSSICPVGETS-NDAAEK--AVADGE 390

Query: 674  EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495
            E Q   +QRS +LGS+KEFNFDN + G+     +G +WW N+KV  KE   P KNWSFFP
Sbjct: 391  EAQLHPKQRSITLGSVKEFNFDNPDGGDSGN-SIGSDWWANEKVDAKE-NGPTKNWSFFP 448

Query: 494  VAQPGAT 474
            + QPG +
Sbjct: 449  MMQPGVS 455


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  345 bits (886), Expect = 3e-92
 Identities = 208/446 (46%), Positives = 252/446 (56%), Gaps = 29/446 (6%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW  C   YWCFRS K K RIG A+L PE  +   G   A NL+    I   F    
Sbjct: 38   QKRRWGSCWGEYWCFRSPKDK-RIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPP 96

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              +GL+S TSI ANI+SPGGP S+FAIGPYAHETQLVSPP FST
Sbjct: 97   SSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFST 156

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA   D +  +G   HRF+           YP
Sbjct: 157  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYP 216

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAG-LYYPDARFGVPPRILNLKLLPACDWGS 1011
            GSP G LISP S ISGSGTSSP  D +   +G   + + R G PP++L L  L   +WGS
Sbjct: 217  GSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGS 276

Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQ--------NTTDSLTDEH---------------DN 900
               SGS+TPD   P S DG  ++ Q        +  DS+ D                 +N
Sbjct: 277  RIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNN 336

Query: 899  QESVSQRVSFELISEDVVRCVEKEPTVLPKA----LQTVALEDKDEDERSSVDDKSTTMT 732
            +  V  RVSFEL +EDVVRCVEK+   L KA    LQ  A  + DE+ R  V D S    
Sbjct: 337  EIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVD-SEGRV 395

Query: 731  GKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTN 552
            G+T+  +  EK    A+ EEGQ   +QRS +LGS KEFNFDN + G   KP +  +WW N
Sbjct: 396  GETA-NNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWAN 454

Query: 551  DKVSGKEAEQPAKNWSFFPVAQPGAT 474
            +KV GKE    +KNWS F + QP  +
Sbjct: 455  EKVVGKEV-GASKNWSIFHMMQPSVS 479


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  345 bits (884), Expect = 5e-92
 Identities = 196/427 (45%), Positives = 251/427 (58%), Gaps = 10/427 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW  C S Y CF  QKHKK+IG A+L PEPS+   GA  + N +    +   F    
Sbjct: 38   QKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPP 97

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AGLVS TSI+A+++SP GP S+FAIGPYAHETQLVSPP FST
Sbjct: 98   SSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSHRFMXXXXXXXXXXXYPG 1185
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA FLD S  +G +   +           +PG
Sbjct: 158  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTG--LRFPFDFQSYQFHPG 215

Query: 1184 SPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSGR 1005
            SP GQLISP S ISGSGTSSP  D E    G ++P+ R G PP++LNL  L  C+WGS +
Sbjct: 216  SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQ 275

Query: 1004 ESGSLTPDGYLPKSHDGFPVNHQ----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCV 837
             SG+LTP+  + +    F ++ Q     +     + H N + V+ RVSFEL +ED  RCV
Sbjct: 276  GSGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 334

Query: 836  EKEPTVLPKALQ------TVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675
            E++P    K +       T A E+K+  E     +    +T   S E       A  D E
Sbjct: 335  EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPE------MASTDGE 388

Query: 674  EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495
               + ++Q+S +LGS+KEFNFDN ++G+ +KP    NWW N  V GKE E   KNWSFFP
Sbjct: 389  AAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGE-TTKNWSFFP 446

Query: 494  VAQPGAT 474
            + Q G +
Sbjct: 447  MVQSGVS 453


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  344 bits (882), Expect = 8e-92
 Identities = 203/427 (47%), Positives = 254/427 (59%), Gaps = 12/427 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW GC S YWCF S K KKRIG A+L  E S +      A N +    I   F    
Sbjct: 38   QKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPP 97

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AGLVS TSI+A+++SPG P S+FAIGPYAHETQLVSPP FST
Sbjct: 98   SSPASFLPSEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQLVSPPVFST 156

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFS-HRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA  L  +   G    RF            +P
Sbjct: 157  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHP 216

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP GQLISP S ISGSGTSSP  D E FAA L++P+ R G PP++LNL    +C+WGS 
Sbjct: 217  GSPVGQLISPSSGISGSGTSSPFRDGE-FAASLHFPEFRMGDPPKLLNLDKHSSCEWGSH 275

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEH-------DNQESVSQRVSFELISEDV 849
              SG+LTPD       +GF ++HQ  ++  +  H       ++Q + + RVSFEL +E+V
Sbjct: 276  HGSGTLTPDATRSTPRNGFLLDHQ-ISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEV 334

Query: 848  VRCVEKEPTVLPKA----LQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPAD 681
            VR +E E     +A    LQ  A  + +E +   VDD    + G+TS E      +A AD
Sbjct: 335  VRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRV-GETSNE---RPEKALAD 390

Query: 680  AEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501
             E   +  + +S +LGS KEFNFDNV+ G+  KP L  +WW NDKV+GK    P +NWSF
Sbjct: 391  REGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVP-RNWSF 449

Query: 500  FPVAQPG 480
            FP+ QPG
Sbjct: 450  FPMMQPG 456


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  341 bits (875), Expect = 5e-91
 Identities = 194/426 (45%), Positives = 250/426 (58%), Gaps = 10/426 (2%)
 Frame = -3

Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXXX 1542
            ++RW  C S Y CF  QKHKK+IG A+L PEPS+   GA  + N +    +   F     
Sbjct: 38   QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97

Query: 1541 XXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFSTY 1362
                             AGLVS TSI+A+++SP GP S+FAIGPYAHETQLVSPP FST+
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 1361 TTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSHRFMXXXXXXXXXXXYPGS 1182
            TTEPSTAPFTPPPESVH+TTPSSPEVPFA FLD S  +G +   +           +PGS
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTG--LRFPFDFQSYQFHPGS 215

Query: 1181 PAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSGRE 1002
            P GQLISP S ISGSGTSSP  D E    G ++P+ R G PP++LNL  L  C+WGS + 
Sbjct: 216  PVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKLSTCEWGSYQG 275

Query: 1001 SGSLTPDGYLPKSHDGFPVNHQ----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCVE 834
            SG+LTP+  + +    F ++ Q     +     + H N + V+ RVSFEL +ED  RCVE
Sbjct: 276  SGALTPES-VRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVE 334

Query: 833  KEPTVLPKALQ------TVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEE 672
            ++P    K +       T A E+K+  E     +    +T   S E       A  D E 
Sbjct: 335  EKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPE------MASTDGEA 388

Query: 671  GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPV 492
              + ++Q+S +LGS+KEFNFDN ++G+ +KP    NWW N  V GKE E   KNWSFFP+
Sbjct: 389  APQHRKQQSITLGSVKEFNFDNADEGDSRKPS-SSNWWANGSVIGKEGE-TTKNWSFFPM 446

Query: 491  AQPGAT 474
             Q G +
Sbjct: 447  VQSGVS 452


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  340 bits (871), Expect = 2e-90
 Identities = 197/425 (46%), Positives = 250/425 (58%), Gaps = 8/425 (1%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            +K+RW GCLS YWCF + K++ RIG  +LVPE +     A  A N +    +   F    
Sbjct: 40   RKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFIAPP 99

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AGL+S TS++A+++SPGGP S+FAIGPYAHETQLVSPP FST
Sbjct: 100  SSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFST 159

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGF-SHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD + H+G    RF             P
Sbjct: 160  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQP 219

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP GQLISP S ISGSGTSSP  D E  A G ++ + R G PP++LNL  L   DWGS 
Sbjct: 220  GSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKLSKFDWGSR 279

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKE 828
            + SGSLTPD   P S   F V      +      +N     +RVSF++ +EDV+R VEK+
Sbjct: 280  QGSGSLTPDSVKPIS--TFEVAPHLKPNGRCRNAEN--VADRRVSFDVSTEDVIRYVEKK 335

Query: 827  PTVLPKALQTVALEDKDEDERSSVDDKS-------TTMTGKTSQEDATEKPQAPADAEEG 669
               L +A+ T +L+D    +R    D +           G+TS E   E  +AP   EE 
Sbjct: 336  TVPLAEAMLT-SLKDTTMGQREENSDSNKVEEIGCENRVGETSNE---EPDKAPTSGEEV 391

Query: 668  QRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVA 489
             + Q+ RS +LGS KEFNFDN + G+  K     +WW N KV+GKE   P++NWSFFP+ 
Sbjct: 392  LQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEG-APSQNWSFFPMI 450

Query: 488  QPGAT 474
            QPG +
Sbjct: 451  QPGVS 455


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  320 bits (819), Expect = 2e-84
 Identities = 195/422 (46%), Positives = 231/422 (54%), Gaps = 5/422 (1%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW  C   YWCFRS K K RIG A+L PE  +   G   A NL+    I   F    
Sbjct: 38   QKRRWGSCWGEYWCFRSPKDK-RIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPP 96

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              +GL+S TSI ANI+SPGGP S+FAIGPYAHETQLVSPP FST
Sbjct: 97   SSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFST 156

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA   D +  +G   HRF+           YP
Sbjct: 157  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYP 216

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP G LISP S ISGSGTSSP            +PD                       
Sbjct: 217  GSPVGHLISPSSGISGSGTSSP------------FPD----------------------- 241

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKE 828
              SGS+TPD   P S DG  ++H           +N+  V  RVSFEL +EDVVRCVEK+
Sbjct: 242  -RSGSITPDALGPPSRDGSVLDHSGCP-------NNEIMVDHRVSFELTAEDVVRCVEKD 293

Query: 827  PTVLPKA----LQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRC 660
               L KA    LQ  A  + DE+ R  V D S    G+T+  +  EK    A+ EEGQ  
Sbjct: 294  SAALVKAVSASLQNPATVEIDENSREVVVD-SEGRVGETA-NNPPEKAPEDANGEEGQPH 351

Query: 659  QRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQPG 480
             +QRS +LGS KEFNFDN + G   KP +  +WW N+KV GKE    +KNWS F + QP 
Sbjct: 352  HKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEV-GASKNWSIFHMMQPS 410

Query: 479  AT 474
             +
Sbjct: 411  VS 412


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  319 bits (818), Expect = 2e-84
 Identities = 191/429 (44%), Positives = 242/429 (56%), Gaps = 12/429 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGAT-PARNLSLPIPIGAHFXXX 1548
            QK+RW GC S YWCF SQK  KRIG A+ +PE  +TA GA  P+ N S   P  +     
Sbjct: 38   QKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPE--TTASGADRPSSNTSSQAP--SIVLPF 93

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
                                  V    ++ + +SP GP S+FAIGPYAHETQLVSPP FS
Sbjct: 94   IAPPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFS 153

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-PHSGFSHRFMXXXXXXXXXXXY 1191
             +TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD +  +    HR+             
Sbjct: 154  AFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQ 213

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011
            PGSP   LISPGS IS SGTSSP LD E      Y P       P+ LNL+ +   +WGS
Sbjct: 214  PGSPVSNLISPGSAISVSGTSSPFLDRE------YTPGR-----PQFLNLEKIAPHEWGS 262

Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNT-----TDSLTDEHDNQESVSQRVSFELISEDVV 846
             + SG+LTP+   PK HD F +N+QN+             ++   V  RVSFE+ +EDVV
Sbjct: 263  RQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVV 322

Query: 845  RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQ-----APAD 681
            RCVEK+PT++ +   +V+L+D    ERS+   ++             E  +     +  D
Sbjct: 323  RCVEKKPTMMMRT-GSVSLQD---TERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTD 378

Query: 680  AEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501
             E+GQR Q+ RS +LGS KEFNFDNV+ G P K  +G +WW N+KV GKE   P  NW  
Sbjct: 379  GEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE---PCNNW-I 434

Query: 500  FPVAQPGAT 474
            FP+ QPG +
Sbjct: 435  FPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  318 bits (815), Expect = 5e-84
 Identities = 188/431 (43%), Positives = 241/431 (55%), Gaps = 14/431 (3%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW  C S YWCF SQK  KRIG A+ +PE +++A    P+ N S   P  +      
Sbjct: 38   QKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADR-PSSNTSSQAP--SIVLPFI 94

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                                 V    ++ + +SP GP S+FAIGPYAHETQLVSPP FS 
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSA 154

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-PHSGFSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPPPESVH+TTPSSPEVPFA  LD +  +    HR+             P
Sbjct: 155  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQP 214

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP   LISPGS IS SGTSSP L+ E      Y P       P+ LNL+ +   +WGS 
Sbjct: 215  GSPVSNLISPGSAISVSGTSSPFLERE------YTPGR-----PQFLNLEKIAPHEWGSR 263

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNT-----TDSLTDEHDNQESVSQRVSFELISEDVVR 843
            + SG+LTP+   PK HD F +N+QNT             ++   V  RVSFE+ +EDVVR
Sbjct: 264  QGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVR 323

Query: 842  CVEKEPTVLPKALQTVALEDKDED--------ERSSVDDKSTTMTGKTSQEDATEKPQAP 687
            CVEK+PT++ +   +V+L+D +          E S+  D S     +   E ++      
Sbjct: 324  CVEKKPTMMMRT-GSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSS------ 376

Query: 686  ADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNW 507
             D E+GQR Q+ RS +LGS KEFNFDNV+ G P K  +G +WW N+KV GKE   P  NW
Sbjct: 377  TDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKE---PCNNW 433

Query: 506  SFFPVAQPGAT 474
              FP+ QPG +
Sbjct: 434  -IFPMMQPGVS 443


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  315 bits (807), Expect = 4e-83
 Identities = 190/424 (44%), Positives = 247/424 (58%), Gaps = 10/424 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIP-IGAHFXXX 1548
            QK+RW  C S YWCF   +H+KRIG A+LVPE S+    ++ A N +   P I   F   
Sbjct: 41   QKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAP 100

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
                               AG++S TS++A+++SP GP S+FAIGPYAHETQLVSPP FS
Sbjct: 101  PSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFS 160

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSGFSH-RFMXXXXXXXXXXXY 1191
            T+TTEPSTAPFTPPPESV +TTPSSPEVPFA  L+ S  +G +  RF            Y
Sbjct: 161  TFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFY 220

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGS 1011
            PGSP GQLISP S ISGSGTSSP  D E  AAG  + + +  VPP++LNL  L   + GS
Sbjct: 221  PGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKLSVHECGS 280

Query: 1010 GRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQESVSQ----RVSFELISEDVVR 843
             + SG+LTPD     S   FP++ Q +  +     DN+    Q    RVSF+L +ED +R
Sbjct: 281  RQGSGTLTPDAVRATS-CSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALR 339

Query: 842  CVEKEPT----VLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAE 675
              E +P     ++P++++     +K + + S +        G+TS        QA    E
Sbjct: 340  YAEPKPASPVKIMPESMKNEIAAEKVQ-KSSEIRHNFECRVGETSNGIL---EQASTGGE 395

Query: 674  EGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFP 495
            +  R Q+ R+ +LG+ KEFNFDN + G P KP  GP+WW N    GKE +  AKNWSFFP
Sbjct: 396  KTPRHQKHRTLTLGTFKEFNFDNAD-GVP-KPSAGPDWWDNGSDVGKE-DFTAKNWSFFP 452

Query: 494  VAQP 483
            V QP
Sbjct: 453  VMQP 456


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  311 bits (798), Expect = 4e-82
 Identities = 188/427 (44%), Positives = 250/427 (58%), Gaps = 12/427 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW+     YWCF  Q+H+KRIG A+++PE +S       A NL+    I   F    
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 61

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                                  +F S++A+++SPG P S+FAIGPYAHETQLVSPP FST
Sbjct: 62   SSPASFLQSEPPSAMQSPG--FNF-SLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFST 117

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPP ESVH+T PSSPEVPFA  LD +   G    R+            YP
Sbjct: 118  FTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYP 177

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP GQLISP S ISGSGTSSP LD E  + G ++ + R G  P++LNL +L   DWGS 
Sbjct: 178  GSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSR 237

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE------SVSQRVSFELISEDVV 846
              SGS+TPD     S +GF +    T + + +   N        S+  RVSFEL +E+VV
Sbjct: 238  LCSGSVTPDAAKSTSSEGFTLK-PYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 296

Query: 845  RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGK----TSQEDATEKPQAPADA 678
            RCVEK+P  L +A+ T +L+  ++ ER    ++  + + +     +  D++EK     DA
Sbjct: 297  RCVEKKPVALAEAVST-SLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEK-AVGGDA 354

Query: 677  EE-GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501
            EE   R Q++RS +LGS KEFNFDN + G+     +  +WW N+KV  KE  + +KNWSF
Sbjct: 355  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGE-SKNWSF 413

Query: 500  FPVAQPG 480
            FP+ QPG
Sbjct: 414  FPMIQPG 420


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  311 bits (798), Expect = 4e-82
 Identities = 188/427 (44%), Positives = 250/427 (58%), Gaps = 12/427 (2%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW+     YWCF  Q+H+KRIG A+++PE +S       A NL+    I   F    
Sbjct: 39   QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 98

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                                  +F S++A+++SPG P S+FAIGPYAHETQLVSPP FST
Sbjct: 99   SSPASFLQSEPPSAMQSPG--FNF-SLSASMYSPG-PSSIFAIGPYAHETQLVSPPVFST 154

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLSPHSG-FSHRFMXXXXXXXXXXXYP 1188
            +TTEPSTAPFTPP ESVH+T PSSPEVPFA  LD +   G    R+            YP
Sbjct: 155  FTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYP 214

Query: 1187 GSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDWGSG 1008
            GSP GQLISP S ISGSGTSSP LD E  + G ++ + R G  P++LNL +L   DWGS 
Sbjct: 215  GSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSR 274

Query: 1007 RESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE------SVSQRVSFELISEDVV 846
              SGS+TPD     S +GF +    T + + +   N        S+  RVSFEL +E+VV
Sbjct: 275  LCSGSVTPDAAKSTSSEGFTLK-PYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVV 333

Query: 845  RCVEKEPTVLPKALQTVALEDKDEDERSSVDDKSTTMTGK----TSQEDATEKPQAPADA 678
            RCVEK+P  L +A+ T +L+  ++ ER    ++  + + +     +  D++EK     DA
Sbjct: 334  RCVEKKPVALAEAVST-SLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEK-AVGGDA 391

Query: 677  EE-GQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSF 501
            EE   R Q++RS +LGS KEFNFDN + G+     +  +WW N+KV  KE  + +KNWSF
Sbjct: 392  EELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGE-SKNWSF 450

Query: 500  FPVAQPG 480
            FP+ QPG
Sbjct: 451  FPMIQPG 457


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  303 bits (777), Expect = 1e-79
 Identities = 191/461 (41%), Positives = 233/461 (50%), Gaps = 47/461 (10%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QKKRW  C   YWCF SQK+ KRIG A+LVPEP       + A N+S P  I   F    
Sbjct: 31   QKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPP 90

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AGL+S TS++ N +SP GP S+FAIGPYAHETQLV+PP FS 
Sbjct: 91   SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSA 150

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200
             TTEPSTAPFTPPPESV +TTPSSPEVPFA  L  S      +SG + +F          
Sbjct: 151  LTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSY 210

Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACD 1020
              YPGSP G LISPGS IS SGTSSP  D           + R G  P++L  +      
Sbjct: 211  QIYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTTRK 264

Query: 1019 WGSGRESGSLTPDGYL--------------------------------PKSHDGFPVNHQ 936
            WGS   SGSLTPDG                                  P S DGF V  Q
Sbjct: 265  WGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQ 324

Query: 935  NTTDSLTDEHDN-----QESVSQRVSFELISEDVVRCVEKEPTVLPKALQT-----VALE 786
             +  +L     N     +  V  RVSFEL  EDV  C+E +  +  +A+       VA  
Sbjct: 325  ISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEG 384

Query: 785  DKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDN 606
             K+ D      + S  +  + +  +  EK  A  +AEE    Q+ RS +LGSIKEFNFDN
Sbjct: 385  RKERDGIKKDLESSCELFIRETSNETVEK--ASGEAEEEHSYQKHRSVTLGSIKEFNFDN 442

Query: 605  VNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483
                   KP +   WW N+KV+GKEA +P  +W+FFP+ QP
Sbjct: 443  TKGEASDKPTIRSEWWANEKVAGKEA-RPGNSWTFFPMLQP 482


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  302 bits (773), Expect = 4e-79
 Identities = 194/475 (40%), Positives = 237/475 (49%), Gaps = 61/475 (12%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW GC S YWCF S K  KRIG A+L PEP       T A N S    I   F    
Sbjct: 45   QKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVPFIAPP 103

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AGL+S TS++ N +SPGGP S+FAIGPYAHETQLV+PP FS 
Sbjct: 104  SSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAFSA 163

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200
            +TTEPSTAPFTPPPESV +TTPSSPEVPFA  L  S      +SG + +F          
Sbjct: 164  FTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQSY 223

Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYP--DARFGVPPRILNLKLLPA 1026
              YPGSP GQLISPGSVIS SGTSSP  D         YP  + R G  P++L  +    
Sbjct: 224  PLYPGSPGGQLISPGSVISNSGTSSPFPDR--------YPILEFRMGEAPKLLGFEHFTT 275

Query: 1025 CDWGSGRES------------------------------------------------GSL 990
              WGS   S                                                GSL
Sbjct: 276  RKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSL 335

Query: 989  TPDGYLPKSHDGFPVNHQ-----NTTDSLTDEHDNQESVSQRVSFELISEDVVRCVEKEP 825
            TPD   P S DGF + +Q     +  +S      ++  V  RVSFEL  E+V RC+E + 
Sbjct: 336  TPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKS 395

Query: 824  TVLPKALQTVALEDKDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQ-R 648
                +A      +   ED+  S     T     T  E + E P+ P+   E + C R+ R
Sbjct: 396  LASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTG-ETSGETPEKPSGEMEEEHCYRKHR 454

Query: 647  SASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483
            S +LGSIKEFNFDN +K  P KP +   WW N+ ++GKEA +PA NW+FFP+ QP
Sbjct: 455  SITLGSIKEFNFDN-SKEVPDKPSINSEWWANETIAGKEA-RPANNWTFFPLLQP 507


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  301 bits (772), Expect = 5e-79
 Identities = 190/460 (41%), Positives = 232/460 (50%), Gaps = 47/460 (10%)
 Frame = -3

Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXXX 1542
            KKRW  C   YWCF SQK+ KRIG A+LVPEP       + A N+S P  I   F     
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95

Query: 1541 XXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFSTY 1362
                             AGL+S TS++ N +SP GP S+FAIGPYAHETQLV+PP FS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 1361 TTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXXX 1197
            TTEPSTAPFTPPPESV +TTPSSPEVPFA  L  S      +SG + +F           
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 1196 XYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNLKLLPACDW 1017
             YPGSP G LISPGS IS SGTSSP  D           + R G  P++L  +      W
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDRRPIL------EFRMGEAPKLLGFENFTTRKW 269

Query: 1016 GSGRESGSLTPDGYL--------------------------------PKSHDGFPVNHQN 933
            GS   SGSLTPDG                                  P S DGF V  Q 
Sbjct: 270  GSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQI 329

Query: 932  TTDSLTDEHDN-----QESVSQRVSFELISEDVVRCVEKEPTVLPKALQT-----VALED 783
            +  +L     N     +  V  RVSFEL  EDV  C+E +  +  +A+       VA   
Sbjct: 330  SEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGR 389

Query: 782  KDEDERSSVDDKSTTMTGKTSQEDATEKPQAPADAEEGQRCQRQRSASLGSIKEFNFDNV 603
            K+ D      + S  +  + +  +  EK  A  +AEE    Q+ RS +LGSIKEFNFDN 
Sbjct: 390  KERDGIKKDLESSCELFIRETSNETVEK--ASGEAEEEHSYQKHRSVTLGSIKEFNFDNT 447

Query: 602  NKGEPQKPCLGPNWWTNDKVSGKEAEQPAKNWSFFPVAQP 483
                  KP +   WW N+KV+GKEA +P  +W+FFP+ QP
Sbjct: 448  KGEASDKPTIRSEWWANEKVAGKEA-RPGNSWTFFPMLQP 486


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  296 bits (758), Expect = 2e-77
 Identities = 186/434 (42%), Positives = 230/434 (52%), Gaps = 19/434 (4%)
 Frame = -3

Query: 1724 QKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLSLPIPIGAHFXXXX 1545
            QK+RW  CLS YWCF S +H KRIG A+LVPEP      A  + NL+L   I   F    
Sbjct: 31   QKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFIAPP 90

Query: 1544 XXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFST 1365
                              AG +S T+++ N +SP GP SMFAIGPYAHETQLVSPP FST
Sbjct: 91   SSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFST 150

Query: 1364 YTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLDLS-----PHSGFSHRFMXXXXXXXXX 1200
            + TEPSTAPFTPPPESV +TTPSSPEVPFA  L  S      +SG + +           
Sbjct: 151  FPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPY 210

Query: 1199 XXYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGV-PPRILNLKLLPAC 1023
              YP SP G LISP   IS SGTSSP            +PD R  V  P++L  +     
Sbjct: 211  QLYPESPVGHLISP---ISNSGTSSP------------FPDRRPIVEAPKLLGFEHFSTR 255

Query: 1022 DWGSGRESGSLTPDGYLPKSHDGFPVNHQ-----NTTDSLTDEHDNQESVSQRVSFELIS 858
             WGS   SGSLTPDG  P S D F + +Q     +  +S +   + +  +  RVSFEL  
Sbjct: 256  RWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAG 315

Query: 857  EDVVRCVEKEPT----VLPKALQTVALEDKDEDERSSVDDKSTT---MTGKTSQEDATEK 699
            EDV  CVEK+P      +   LQ +  E + E ER  + + +          + + A+EK
Sbjct: 316  EDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEK 375

Query: 698  PQAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKP-CLGPNWWTNDKVSGKEAEQ 522
              A A+ EE Q  ++      GSIKEFNFDN       KP  +G  WW N+KV GK    
Sbjct: 376  --ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGK-GTG 432

Query: 521  PAKNWSFFPVAQPG 480
            P  NW+FFP+ QPG
Sbjct: 433  PQTNWTFFPLLQPG 446


>ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine
            max]
          Length = 461

 Score =  286 bits (732), Expect = 2e-74
 Identities = 186/434 (42%), Positives = 235/434 (54%), Gaps = 16/434 (3%)
 Frame = -3

Query: 1727 SQKKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLS-LPIP-IGAHFX 1554
            +QKKRW   L    CF  +K +KRIG A+LVPEP++   GA PA   S +  P I   F 
Sbjct: 39   TQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTN--GADPAAAASSIQAPSITLPFV 96

Query: 1553 XXXXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPT 1374
                                  G VS T ++A+I+SPGGP S+FAIGPYAHETQLVSPP 
Sbjct: 97   APPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPV 156

Query: 1373 FSTYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLD-LSPHSGFSHRFMXXXXXXXXXX 1197
            FS      STAPFTPPPESVHMTTPSSPEVPFA  LD  + +S    RF           
Sbjct: 157  FSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQ 212

Query: 1196 XYPGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNL--KLLPAC 1023
             +PGSP GQLISP S IS SGTSSPL D E  A   +  D +   PP++LNL  KL    
Sbjct: 213  FHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCE 272

Query: 1022 DWGSGRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE----SVSQRVSFELISE 855
            +  S   SGSLTPD     +  GF  NH  +   ++    N      S++ RVSFEL ++
Sbjct: 273  NQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRLNEISINHRVSFELSAQ 332

Query: 854  DVVRCVEKEP------TVLPKALQTVALEDKDE-DERSSVDDKSTTMTGKTSQEDATEKP 696
             V++ +E +P       VLPK        DK+E  E S++DDK         Q   T   
Sbjct: 333  KVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLG 392

Query: 695  QAPADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPA 516
               A        ++ +S +L S KEFNFDN + G+   P +  +WW N+KV+GKE E  +
Sbjct: 393  GDKATTVH----EKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKERE-AS 447

Query: 515  KNWSFFPVAQPGAT 474
            K+WSFFP+ QPG +
Sbjct: 448  KDWSFFPMIQPGVS 461


>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine
            max]
          Length = 441

 Score =  284 bits (726), Expect = 1e-73
 Identities = 185/432 (42%), Positives = 233/432 (53%), Gaps = 16/432 (3%)
 Frame = -3

Query: 1721 KKRWSGCLSAYWCFRSQKHKKRIGRAILVPEPSSTAIGATPARNLS-LPIP-IGAHFXXX 1548
            KKRW   L    CF  +K +KRIG A+LVPEP++   GA PA   S +  P I   F   
Sbjct: 21   KKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTN--GADPAAAASSIQAPSITLPFVAP 78

Query: 1547 XXXXXXXXXXXXXXXXXXXAGLVSFTSIAANIHSPGGPRSMFAIGPYAHETQLVSPPTFS 1368
                                G VS T ++A+I+SPGGP S+FAIGPYAHETQLVSPP FS
Sbjct: 79   PSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFS 138

Query: 1367 TYTTEPSTAPFTPPPESVHMTTPSSPEVPFAHFLD-LSPHSGFSHRFMXXXXXXXXXXXY 1191
                  STAPFTPPPESVHMTTPSSPEVPFA  LD  + +S    RF            +
Sbjct: 139  A----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFH 194

Query: 1190 PGSPAGQLISPGSVISGSGTSSPLLDHELFAAGLYYPDARFGVPPRILNL--KLLPACDW 1017
            PGSP GQLISP S IS SGTSSPL D E  A   +  D +   PP++LNL  KL    + 
Sbjct: 195  PGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQ 254

Query: 1016 GSGRESGSLTPDGYLPKSHDGFPVNHQNTTDSLTDEHDNQE----SVSQRVSFELISEDV 849
             S   SGSLTPD     +  GF  NH  +   ++    N      S++ RVSFEL ++ V
Sbjct: 255  KSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRLNEISINHRVSFELSAQKV 314

Query: 848  VRCVEKEP------TVLPKALQTVALEDKDE-DERSSVDDKSTTMTGKTSQEDATEKPQA 690
            ++ +E +P       VLPK        DK+E  E S++DDK         Q   T     
Sbjct: 315  LKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLGGD 374

Query: 689  PADAEEGQRCQRQRSASLGSIKEFNFDNVNKGEPQKPCLGPNWWTNDKVSGKEAEQPAKN 510
             A        ++ +S +L S KEFNFDN + G+   P +  +WW N+KV+GKE E  +K+
Sbjct: 375  KATTVH----EKDQSLTLSSAKEFNFDNADGGDSLAPNIVADWWANEKVAGKERE-ASKD 429

Query: 509  WSFFPVAQPGAT 474
            WSFFP+ QPG +
Sbjct: 430  WSFFPMIQPGVS 441


Top