BLASTX nr result

ID: Rehmannia25_contig00012072 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00012072
         (1895 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   441   e-121
gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus pe...   433   e-118
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   418   e-114
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     415   e-113
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   414   e-113
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   411   e-112
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   409   e-111
gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [...   405   e-110
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   402   e-109
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   401   e-109
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              379   e-102
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   379   e-102
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   370   e-100
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   363   1e-97
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   359   2e-96
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   353   2e-94
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   345   5e-92
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   342   3e-91
ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791...   325   3e-86
ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791...   322   4e-85

>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  441 bits (1133), Expect = e-121
 Identities = 241/452 (53%), Positives = 295/452 (65%), Gaps = 19/452 (4%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVI 1513
            R P  +VQKRRWGSC   Y CF S K KRIGHA + PE+        P +E+ +Q P+++
Sbjct: 31   RVPQPTVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAP-GSGVPAAENLTQAPTIV 89

Query: 1512 XXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLV 1333
                               SATQSP+GLLS+TS++AN+YSPGGP SIFAIGPYAHETQLV
Sbjct: 90   LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149

Query: 1332 SPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEF 1153
            SPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L +PN RNGEAG R+ L+QYEF
Sbjct: 150  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209

Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDF-AAGYPFFLEFRTGNPPKLLDLDKI 976
            QSYQL PGSPV HL              PFPDRDF  +G   FLEFR G PPKLL LDK+
Sbjct: 210  QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKL 269

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLP------------NTGSYRLA 832
               EW S  GSG++TPDA+GP SRD  +L+RQ SDV   P            +  S+ L+
Sbjct: 270  SNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLS 329

Query: 831  -----NDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTA 670
                 N+E ++DHRVSFE+TAE+VVRCVEK       K+   S++N   ++ +E   +  
Sbjct: 330  DSGCPNNEIMVDHRVSFELTAEDVVRCVEKDS-AALVKAVSASLQNPATVEIDENSREVV 388

Query: 669  NGVDHPSGETSNITSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIA 490
               +   GET+N   EK     +G+  + HHK R+ITLGS KEFNFD+ DGG+ D+P+I 
Sbjct: 389  VDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNI- 447

Query: 489  SSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394
            SSDWW NEKVV ++   S  WS F +MQ  VS
Sbjct: 448  SSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479


>gb|EMJ10328.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  433 bits (1113), Expect = e-118
 Identities = 237/435 (54%), Positives = 289/435 (66%), Gaps = 2/435 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  +VQKRRWGS  S+Y CFG  +  KRIGHA ++PETT  R  +AP +E+P Q PS+
Sbjct: 31   RVPQATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTD-RGGDAPRAENPIQTPSI 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
            +                   SATQSP G  S+T   A+MYSP GP SIFAIGPYAHETQL
Sbjct: 90   VLPFVAPPSSPASFLQSEPPSATQSPAGFFSLT---ASMYSPSGPTSIFAIGPYAHETQL 146

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+ RNGE GQR+PL+ YE
Sbjct: 147  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYE 206

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQL PGSPV  L              PFPD +FAA    FLEFRTG+PPKLL+LD +
Sbjct: 207  FQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGDPPKLLNLDIL 266

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
              R+W S  GSG+VTPD     S D  LL  Q  +V   P + + R  N++  ++HRVSF
Sbjct: 267  STRDWGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNN-RGRNNDISINHRVSF 325

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTANGVDHPSGETSNITSEK 619
            E+++EEV+RCVEKKP V   ++   S+E+ E  + +E P K  +    P GETSN  +EK
Sbjct: 326  ELSSEEVIRCVEKKP-VALAEAVSTSLEDTEKAQSKEDPSKVVSSSICPVGETSNDAAEK 384

Query: 618  DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439
                 DG+  + H K R+ITLGS KEFNFD+ DGG  D  +   SDWW NEKV A++  P
Sbjct: 385  --AVADGEEAQLHPKQRSITLGSVKEFNFDNPDGG--DSGNSIGSDWWANEKVDAKENGP 440

Query: 438  SNQWSFFPLMQTGVS 394
            +  WSFFP+MQ GVS
Sbjct: 441  TKNWSFFPMMQPGVS 455


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  418 bits (1074), Expect = e-114
 Identities = 231/435 (53%), Positives = 287/435 (65%), Gaps = 2/435 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  +VQKRRWGSC S+Y CFG  K K+ IGHA + PE +    + AP SE+P+Q P+V
Sbjct: 31   RVPQATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAP-GNGAPASENPTQAPAV 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
                                S TQSP GL+S+TS+SA+MYSP GP SIFAIGPYAHETQL
Sbjct: 90   TLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQL 149

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+ L+P+LRNG+ G R+P   ++
Sbjct: 150  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FD 206

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQ  PGSPV  L              PFPD +FA G   F EFR G PPKLL+LDK+
Sbjct: 207  FQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKL 266

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               EW S QGSGA+TP++V  R   + LL+RQ SDV   P +G+     +  V++HRVSF
Sbjct: 267  STCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGH--KNGQVVNHRVSF 323

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619
            E+TAE+  RCVE+KP   S K+ PE VEN    KEEK   ++    +   G TSN + E 
Sbjct: 324  ELTAEDASRCVEEKPAF-SIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEM 382

Query: 618  DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439
                TDG+   QH K ++ITLGS KEFNFD+ D G+  +PS  SS+WW N  V+ ++G  
Sbjct: 383  --ASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS--SSNWWANGSVIGKEGET 438

Query: 438  SNQWSFFPLMQTGVS 394
            +  WSFFP++Q+GVS
Sbjct: 439  TKNWSFFPMVQSGVS 453


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  415 bits (1067), Expect = e-113
 Identities = 230/439 (52%), Positives = 285/439 (64%), Gaps = 6/439 (1%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  +V+KRRWG CLS+Y CFG+ K + RIGH  ++PET     ++AP +E+ +Q  +V
Sbjct: 33   RVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQP-GNSAPRAENSTQTHAV 91

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
            I                   SATQSP GLLS+TSVSA+MYSPGGP SIFAIGPYAHETQL
Sbjct: 92   ILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQL 151

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN+ NGE GQR+P+   E
Sbjct: 152  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNE 211

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSY  QPGSP+  L              PFPD +FAA  P FLEFRTG+PPKLL+LDK+
Sbjct: 212  FQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKL 271

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAP--LPNTGSYRLANDETVLDHRV 802
             + +W S QGSG++TPD+V P S           +VAP   PN    R  N E V D RV
Sbjct: 272  SKFDWGSRQGSGSLTPDSVKPIS---------TFEVAPHLKPNG---RCRNAENVADRRV 319

Query: 801  SFEITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKT---ANGVDHPSGETSNI 631
            SF+++ E+V+R VEKK V  +        +     +EE          G ++  GETSN 
Sbjct: 320  SFDVSTEDVIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN- 378

Query: 630  TSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAE 451
              E D   T G+   QH K R+ITLGS+KEFNFD+ D G+  + S + SDWW N+KV  +
Sbjct: 379  -EEPDKAPTSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHK-SDSVSDWWANQKVAGK 436

Query: 450  DGSPSNQWSFFPLMQTGVS 394
            +G+PS  WSFFP++Q GVS
Sbjct: 437  EGAPSQNWSFFPMIQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  414 bits (1063), Expect = e-113
 Identities = 226/424 (53%), Positives = 285/424 (67%), Gaps = 2/424 (0%)
 Frame = -3

Query: 1671 QKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXX 1495
            QKRRWG C S+  CFG  K  KRIGHA ++PE T +R+ NA  + + +Q  ++       
Sbjct: 39   QKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRS-NASEAVNSTQAAAISLPFVAP 97

Query: 1494 XXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1315
                         SATQSP GL+S+ S+S NMYSPGGP+SIFAIGPYAHETQLVSPPVFS
Sbjct: 98   PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157

Query: 1314 TFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQ 1135
            TFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+LR GE GQ++P + YEFQSY L 
Sbjct: 158  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217

Query: 1134 PGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWES 955
            PGSPV +L              PFPD +FA   P F +F  G+PPKLL+LDK+  REW S
Sbjct: 218  PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277

Query: 954  CQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEV 775
             QGSG +TPDAVG   R+    NRQ S+VA  P++ +  L  D+ ++DHRVSFE+T E+V
Sbjct: 278  RQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSEN-GLRKDQ-IVDHRVSFELTTEDV 335

Query: 774  VRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHP-SGETSNITSEKDHIHTDG 598
            VRCVEKKP     ++  ES++N   +++E+    A  V H  +GE +N    K  +  D 
Sbjct: 336  VRCVEKKPTT-LAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPV--DV 392

Query: 597  DNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFF 418
            +   +H K ++ITLGSTKEFNFDS D G+  EP+IA SDWW NEKVV +D      W+FF
Sbjct: 393  EEAPRHQKQQSITLGSTKEFNFDSAD-GDSHEPTIA-SDWWANEKVVGKDSGAIKNWAFF 450

Query: 417  PLMQ 406
            P++Q
Sbjct: 451  PVIQ 454


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  411 bits (1057), Expect = e-112
 Identities = 230/435 (52%), Positives = 286/435 (65%), Gaps = 2/435 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  +VQ RRWGSC S+Y CFG  K K+ IGHA + PE +    + AP SE+P+Q P+V
Sbjct: 31   RVPQATVQ-RRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAP-GNGAPASENPTQAPAV 88

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
                                S TQSP GL+S+TS+SA+MYSP GP SIFAIGPYAHETQL
Sbjct: 89   TLPFAAPPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQL 148

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+ L+P+LRNG+ G R+P   ++
Sbjct: 149  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFP---FD 205

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQ  PGSPV  L              PFPD +FA G   F EFR G PPKLL+LDK+
Sbjct: 206  FQSYQFHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGEPPKLLNLDKL 265

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               EW S QGSGA+TP++V  R   + LL+RQ SDV   P +G+     +  V++HRVSF
Sbjct: 266  STCEWGSYQGSGALTPESV-RRGSPNFLLHRQFSDVPSRPRSGNGH--KNGQVVNHRVSF 322

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619
            E+TAE+  RCVE+KP   S K+ PE VEN    KEEK   ++    +   G TSN + E 
Sbjct: 323  ELTAEDASRCVEEKPAF-SIKTVPEYVENGTQAKEEKNSGESIQSFECRVGVTSNDSPEM 381

Query: 618  DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439
                TDG+   QH K ++ITLGS KEFNFD+ D G+  +PS  SS+WW N  V+ ++G  
Sbjct: 382  --ASTDGEAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPS--SSNWWANGSVIGKEGET 437

Query: 438  SNQWSFFPLMQTGVS 394
            +  WSFFP++Q+GVS
Sbjct: 438  TKNWSFFPMVQSGVS 452


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  409 bits (1052), Expect = e-111
 Identities = 224/424 (52%), Positives = 284/424 (66%), Gaps = 2/424 (0%)
 Frame = -3

Query: 1671 QKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXX 1495
            QKRRWG C ++  CFG  K  KRIGHA ++PE T +R+ NA  + + +Q  ++       
Sbjct: 39   QKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRS-NASEAVNSTQATAISLPFVAP 97

Query: 1494 XXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFS 1315
                         SATQSP GL+S+ S+S NMYSPGGP+SIFAIGPYAHETQLVSPPVFS
Sbjct: 98   PSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFS 157

Query: 1314 TFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQ 1135
            TFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P+LR GE GQ++P + YEFQSY L 
Sbjct: 158  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLH 217

Query: 1134 PGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWES 955
            PGSPV +L              PFPD +FA   P F +F  G+PPKLL+LDK+  REW S
Sbjct: 218  PGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGDPPKLLNLDKLSIREWGS 277

Query: 954  CQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEV 775
             QGSG +TPDAV    R+    NRQ S+VA  P++ +  L  D+ ++DHRVSFE+T E+V
Sbjct: 278  RQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSEN-GLRKDQ-IVDHRVSFELTTEDV 335

Query: 774  VRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHP-SGETSNITSEKDHIHTDG 598
            VRCVEKKP     ++  ES++N   +++E+    A  V H  +GE +N    K  +  D 
Sbjct: 336  VRCVEKKPTT-LAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPV--DV 392

Query: 597  DNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFF 418
            +   +H K ++ITLGSTKEFNFDS D G+  EP+IA SDWW NEKVV +D      W+FF
Sbjct: 393  EEAPRHQKQQSITLGSTKEFNFDSAD-GDSHEPTIA-SDWWANEKVVGKDSGAIKNWAFF 450

Query: 417  PLMQ 406
            P++Q
Sbjct: 451  PVIQ 454


>gb|EOY24784.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  405 bits (1042), Expect = e-110
 Identities = 227/435 (52%), Positives = 279/435 (64%), Gaps = 3/435 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  +VQKRRWG C S+Y CFGS K K RIG A +  ET+ + A N P +E+P+Q P++
Sbjct: 31   RVPQATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGA-NVPAAENPTQAPAI 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
                                SATQSP GL+S+TS+SA+MYSPG P SIFAIGPYAHETQL
Sbjct: 90   ALPFVAPPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPG-PASIFAIGPYAHETQL 148

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL PNL+ GE  QR+P++ YE
Sbjct: 149  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYE 208

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQL PGSPV  L              PF D +FAA    F EFR G+PPKLL+LDK 
Sbjct: 209  FQSYQLHPGSPVGQLISPSSGISGSGTSSPFRDGEFAASL-HFPEFRMGDPPKLLNLDKH 267

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               EW S  GSG +TPDA     R+  LL+ Q S++   P+  +  + ND+   +HRVSF
Sbjct: 268  SSCEWGSHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSF 327

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK--EEKPIKTANGVDHPSGETSNITSE 622
            E+T EEVVR +E +    +P  A      +E  +  EE   K  +  +   GETSN   E
Sbjct: 328  ELTTEEVVRSLEME--TATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNERPE 385

Query: 621  KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442
            K     D + + QHHK ++ITLGS KEFNFD+VDGG+  +P I +SDWW N+KV  + G 
Sbjct: 386  K--ALADREGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKP-ILTSDWWANDKVAGKGGG 442

Query: 441  PSNQWSFFPLMQTGV 397
                WSFFP+MQ GV
Sbjct: 443  VPRNWSFFPMMQPGV 457


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  402 bits (1034), Expect = e-109
 Identities = 232/436 (53%), Positives = 280/436 (64%), Gaps = 3/436 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  S+QKRRWG C S+Y CFGS K TKRIGHA  IPETT + AD  P+S   SQ PS+
Sbjct: 31   RVPQASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADR-PSSNTSSQAPSI 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
            +                   SAT SP G      +S + YSP GP SIFAIGPYAHETQL
Sbjct: 90   VLPFIAPPSSPASFLPSEPPSATHSPVG---SKCLSMSTYSPSGPASIFAIGPYAHETQL 146

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFS FTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN +N  AG RYP  QYE
Sbjct: 147  VSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYE 206

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQLQPGSPVS+L              PF DR++  G P F           L+L+KI
Sbjct: 207  FQSYQLQPGSPVSNLISPGSAISVSGTSSPFLDREYTPGRPQF-----------LNLEKI 255

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               EW S QGSG +TP+AV P+  D+ LLN Q+S V  LP   +    ND TV+DHRVSF
Sbjct: 256  APHEWGSRQGSGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFN-GWKNDLTVVDHRVSF 314

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHI--KEEKPIKTANGVDHPSGETSNITSE 622
            EITAE+VVRCVEKKP +   ++   S+++ E    ++E   + +NG DH   E S    E
Sbjct: 315  EITAEDVVRCVEKKPTM-MMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHE 373

Query: 621  KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442
                 TDG++ ++  K R+ITLGS+KEFNFD+VDGG  D+ +I  SDWW NEKV+ ++  
Sbjct: 374  GS--STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATI-GSDWWANEKVLGKE-- 428

Query: 441  PSNQWSFFPLMQTGVS 394
            P N W  FP+MQ GVS
Sbjct: 429  PCNNW-IFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  401 bits (1030), Expect = e-109
 Identities = 231/436 (52%), Positives = 280/436 (64%), Gaps = 3/436 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  S+QKRRWGSC S+Y CFGS K TKRIGHA  IPETT + AD  P+S   SQ PS+
Sbjct: 31   RVPQASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADR-PSSNTSSQAPSI 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
            +                   SAT SP G      +S + YSP GP SIFAIGPYAHETQL
Sbjct: 90   VLPFIAPPSSPASFLPSEPPSATHSPVG---SKCLSMSTYSPSGPASIFAIGPYAHETQL 146

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFS FTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN +N  AG RYP  QYE
Sbjct: 147  VSPPVFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYE 206

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQLQPGSPVS+L              PF +R++  G P F           L+L+KI
Sbjct: 207  FQSYQLQPGSPVSNLISPGSAISVSGTSSPFLEREYTPGRPQF-----------LNLEKI 255

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               EW S QGSG +TP+AV P+  DS LLN Q++ V  LP   +    ND TV+DHRVSF
Sbjct: 256  APHEWGSRQGSGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFN-GWKNDLTVVDHRVSF 314

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHI--KEEKPIKTANGVDHPSGETSNITSE 622
            EITAE+VVRCVEKKP +   ++   S+++ E    ++E   + +N  DH   E S    E
Sbjct: 315  EITAEDVVRCVEKKPTM-MMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHE 373

Query: 621  KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442
                 TDG++ ++  K R+ITLGS+KEFNFD+VDGG  D+ +I  SDWW NEKV+ ++  
Sbjct: 374  GS--STDGEDGQRQQKHRSITLGSSKEFNFDNVDGGYPDKATI-GSDWWANEKVLGKE-- 428

Query: 441  PSNQWSFFPLMQTGVS 394
            P N W  FP+MQ GVS
Sbjct: 429  PCNNW-IFPMMQPGVS 443


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  379 bits (973), Expect = e-102
 Identities = 212/434 (48%), Positives = 261/434 (60%), Gaps = 1/434 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVI 1513
            R P  +VQKRRWGSC   Y CF S K KRIGHA + PE+        P +E+ +Q P+++
Sbjct: 31   RVPQPTVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAP-GSGVPAAENLTQAPTIV 89

Query: 1512 XXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLV 1333
                               SATQSP+GLLS+TS++AN+YSPGGP SIFAIGPYAHETQLV
Sbjct: 90   LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149

Query: 1332 SPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEF 1153
            SPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L +PN RNGEAG R+ L+QYEF
Sbjct: 150  SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209

Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973
            QSYQL PGSPV HL              PFPDR                           
Sbjct: 210  QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR--------------------------- 242

Query: 972  RREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFE 793
                     SG++TPDA+GP SRD  +L+                  N+E ++DHRVSFE
Sbjct: 243  ---------SGSITPDALGPPSRDGSVLDHSG-------------CPNNEIMVDHRVSFE 280

Query: 792  ITAEEVVRCVEKKPVVGSPKSAPESVENVEHIK-EEKPIKTANGVDHPSGETSNITSEKD 616
            +TAE+VVRCVEK       K+   S++N   ++ +E   +     +   GET+N   EK 
Sbjct: 281  LTAEDVVRCVEKDS-AALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKA 339

Query: 615  HIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPS 436
                +G+  + HHK R+ITLGS KEFNFD+ DGG+ D+P+I SSDWW NEKVV ++   S
Sbjct: 340  PEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNI-SSDWWANEKVVGKEVGAS 398

Query: 435  NQWSFFPLMQTGVS 394
              WS F +MQ  VS
Sbjct: 399  KNWSIFHMMQPSVS 412


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  379 bits (973), Expect = e-102
 Identities = 220/435 (50%), Positives = 274/435 (62%), Gaps = 3/435 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P  ++QKRRWGSC S+Y CFG ++  KRIGHA ++PE +    D++      +Q P++
Sbjct: 34   RVPQATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTI 93

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
                                SA+QSP G+LS+TSVSA+MYSP GP SIFAIGPYAHETQL
Sbjct: 94   TLPFVAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQL 153

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPP FSTFTTEPSTAPFTPPPES+ LTTPSSPEVPFA+LLEP+ RNGEAG R+P + YE
Sbjct: 154  VSPPAFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYE 213

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQ  PGSPV  L              PFPD +FAA  P FLEF+   PPKLL+LDK+
Sbjct: 214  FQSYQFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMAVPPKLLNLDKL 273

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
               E  S QGSG +TPDAV   S  S  L+RQ SD+A   N  S     D+ V D RVSF
Sbjct: 274  SVHECGSRQGSGTLTPDAVRATS-CSFPLDRQCSDIA--SNRHSDNENKDDQVADLRVSF 330

Query: 795  EITAEEVVRCVEKKPVVGSP-KSAPESVEN-VEHIKEEKPIKTANGVDHPSGETSNITSE 622
            +++AE+ +R  E KP   SP K  PES++N +   K +K  +  +  +   GETSN   E
Sbjct: 331  DLSAEDALRYAEPKP--ASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSNGILE 388

Query: 621  KDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGS 442
            +    T G+   +H K RT+TLG+ KEFNFD+ DG    +PS A  DWW N   V ++  
Sbjct: 389  Q--ASTGGEKTPRHQKHRTLTLGTFKEFNFDNADG--VPKPS-AGPDWWDNGSDVGKEDF 443

Query: 441  PSNQWSFFPLMQTGV 397
             +  WSFFP+MQ  +
Sbjct: 444  TAKNWSFFPVMQPSI 458


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  370 bits (951), Expect = e-100
 Identities = 228/471 (48%), Positives = 283/471 (60%), Gaps = 10/471 (2%)
 Frame = -3

Query: 1776 LTMRRGANGTDXXXXXXXXXXXXXXXXA---RGPHDSVQKRRWGSCLSLYSCFGSNK-TK 1609
            + MRRG NG D                A   R P  +VQKRRW     +Y CFG  +  K
Sbjct: 1    MMMRRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRK 60

Query: 1608 RIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGL 1429
            RIGHA ++PETT +   N P +E+ +Q  S++                   SA QSP   
Sbjct: 61   RIGHAVILPETT-SPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFN 119

Query: 1428 LSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTT 1249
             S+   SA+MYSPG P+SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPP ES+HLT 
Sbjct: 120  FSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTR 175

Query: 1248 PSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXX 1069
            PSSPEVPFA+LL+ N R GE GQRYPL+ YEFQSYQ  PGSPV  L              
Sbjct: 176  PSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSS 235

Query: 1068 PFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVGPRSRDSRLL 889
            PF D +FA+G   FLEFRTG  PK+L+LD +  R+W S   SG+VTPDA    S +   L
Sbjct: 236  PFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTL 295

Query: 888  NRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPV--VGSPKSAPESV 715
                 +   L    + R  ND   + HRVSFE++AEEVVRCVEKKPV    +  ++ +S 
Sbjct: 296  KPYTPE-GVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSA 354

Query: 714  ENVEHIKEEKP-IKTANGVDHPSGETSNITSEKDHIHTDGDNEK---QHHKTRTITLGST 547
            E  E  +EE P  + ++  + P  +TSN +SEK      GD E+   ++ K R+ITLGS 
Sbjct: 355  EKAE--REEGPNQEVSSSHECPVVDTSNDSSEK---AVGGDAEELSYRYQKERSITLGSA 409

Query: 546  KEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394
            KEFNFD+ DGG+    SI S+DWW NEKVV ++   S  WSFFP++Q G+S
Sbjct: 410  KEFNFDNADGGDSGTSSI-STDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  363 bits (933), Expect = 1e-97
 Identities = 217/434 (50%), Positives = 271/434 (62%), Gaps = 7/434 (1%)
 Frame = -3

Query: 1674 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 1498
            +QKRRW     +Y CFG  +  KRIGHA ++PETT +   N P +E+ +Q  S++     
Sbjct: 1    MQKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETT-SPGHNDPRAENLTQASSIVLPFAA 59

Query: 1497 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 1318
                          SA QSP    S+   SA+MYSPG P+SIFAIGPYAHETQLVSPPVF
Sbjct: 60   PPSSPASFLQSEPPSAMQSPGFNFSL---SASMYSPG-PSSIFAIGPYAHETQLVSPPVF 115

Query: 1317 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQL 1138
            STFTTEPSTAPFTPP ES+HLT PSSPEVPFA+LL+ N R GE GQRYPL+ YEFQSYQ 
Sbjct: 116  STFTTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQW 175

Query: 1137 QPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWE 958
             PGSPV  L              PF D +FA+G   FLEFRTG  PK+L+LD +  R+W 
Sbjct: 176  YPGSPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGEAPKVLNLDILFTRDWG 235

Query: 957  SCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEE 778
            S   SG+VTPDA    S +   L     +   L    + R  ND   + HRVSFE++AEE
Sbjct: 236  SRLCSGSVTPDAAKSTSSEGFTLKPYTPE-GVLNARSNSRRRNDGASIGHRVSFELSAEE 294

Query: 777  VVRCVEKKPV--VGSPKSAPESVENVEHIKEEKP-IKTANGVDHPSGETSNITSEKDHIH 607
            VVRCVEKKPV    +  ++ +S E  E  +EE P  + ++  + P  +TSN +SEK    
Sbjct: 295  VVRCVEKKPVALAEAVSTSLQSAEKAE--REEGPNQEVSSSHECPVVDTSNDSSEK---A 349

Query: 606  TDGDNEK---QHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPS 436
              GD E+   ++ K R+ITLGS KEFNFD+ DGG+    SI S+DWW NEKVV ++   S
Sbjct: 350  VGGDAEELSYRYQKERSITLGSAKEFNFDNADGGDSGTSSI-STDWWANEKVVLKENGES 408

Query: 435  NQWSFFPLMQTGVS 394
              WSFFP++Q G+S
Sbjct: 409  KNWSFFPMIQPGMS 422


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  359 bits (922), Expect = 2e-96
 Identities = 214/443 (48%), Positives = 262/443 (59%), Gaps = 15/443 (3%)
 Frame = -3

Query: 1677 SVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501
            +VQKRRWGSCLSLY CFGS++ +KRIGHA ++PE     A  AP SE+ +   S++    
Sbjct: 29   TVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAV-APASENLNLSTSIVLPFI 87

Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321
                           S+TQSP G LS+T++S N YSP GP S+FAIGPYAHETQLVSPPV
Sbjct: 88   APPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPV 147

Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEF 1153
            FSTF TEPSTAPFTPPPES+ LTTPSSPEVPFA+LL  +L    RN    Q+  L+ YEF
Sbjct: 148  FSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEF 207

Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973
            Q YQL P SPV HL                P  +     PF         PKLL  +   
Sbjct: 208  QPYQLYPESPVGHLIS--------------PISNSGTSSPFPDRRPIVEAPKLLGFEHFS 253

Query: 972  RREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFE 793
             R W S  GSG++TPD  GP SRDS LL  Q S+VA L N+ S    N ETV+DHRVSFE
Sbjct: 254  TRRWGSRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSES-GSQNGETVIDHRVSFE 312

Query: 792  ITAEEVVRCVEKKPVVGSPKSAPESVEN-VEHIKEEKPIK---------TANGVDHPSGE 643
            +  E+V  CVEKKPV     ++ E+V+N ++ I EE  I+         T N  +   GE
Sbjct: 313  LAGEDVAVCVEKKPV-----ASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGE 367

Query: 642  TSNITSEKDHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEK 463
                 SEK     +G+ E+ H K   I  GS KEFNFD+  G    +P+I  S+WWVNEK
Sbjct: 368  ALKAASEK--ASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEK 425

Query: 462  VVAEDGSPSNQWSFFPLMQTGVS 394
            VV +   P   W+FFPL+Q G+S
Sbjct: 426  VVGKGTGPQTNWTFFPLLQPGIS 448


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  353 bits (905), Expect = 2e-94
 Identities = 214/424 (50%), Positives = 262/424 (61%), Gaps = 2/424 (0%)
 Frame = -3

Query: 1692 RGPHDSVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSV 1516
            R P   VQK+RW S  S+Y CFG  K+KR IGHA + PE++      AP +E+ +Q P V
Sbjct: 31   RVPQAMVQKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAP-GSGAPAAENSAQAPEV 89

Query: 1515 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 1336
                                S TQSP GL+S TS+SA+MYSP GP SIFAIGPYAHETQL
Sbjct: 90   TFPFVAPPSSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQL 149

Query: 1335 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 1156
            VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L++P LRNG  G R+P   ++
Sbjct: 150  VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFP---FD 206

Query: 1155 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 976
            FQSYQ  PGS V  L              PFPD +FA G P   EFR G  PKLL+LDK+
Sbjct: 207  FQSYQFHPGSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG--PKLLNLDKL 264

Query: 975  VRREWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSF 796
              REW S Q SGA+TPD+V   S  + LL+RQ SDVA  P + +    +D+ V++HR SF
Sbjct: 265  STREWGSYQDSGALTPDSVRHGS-PNFLLHRQFSDVASHPRSENGH--DDDQVVNHRFSF 321

Query: 795  EITAEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPI-KTANGVDHPSGETSNITSEK 619
            E++ ++  RCVE+KP   S K+ PE VEN    KEE+   +     +  SG+TSN T E 
Sbjct: 322  ELSVKDASRCVEEKPAC-SIKTVPEYVENGTKAKEEENYGELIQSFERRSGDTSNDTPET 380

Query: 618  DHIHTDGDNEKQHHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSP 439
                TDG+   QH K + ITLGS  EFNFD+ D G+   PS  SS+W     V      P
Sbjct: 381  P--STDGE-APQHRKQQPITLGSVNEFNFDNADEGDSHNPS--SSNW-----VKQPRTGP 430

Query: 438  SNQW 427
            S+ W
Sbjct: 431  SSLW 434


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  345 bits (884), Expect = 5e-92
 Identities = 215/471 (45%), Positives = 271/471 (57%), Gaps = 43/471 (9%)
 Frame = -3

Query: 1677 SVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501
            +VQK+RWGSC  LY CFGS K +KRIGHA ++PE     A +  T+E+ S P  +I    
Sbjct: 29   TVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGA-SVSTAENVSNPTGIILPFI 87

Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321
                           SATQSP GLLS+TS+S N YSP GP SIFAIGPYAHETQLV+PPV
Sbjct: 88   APPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPV 147

Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEF 1153
            FS  TTEPSTAPFTPPPES+ LTTPSSPEVPFA+LL  +L    RN    Q++ L+ YEF
Sbjct: 148  FSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEF 207

Query: 1152 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIV 973
            QSYQ+ PGSP  +L              PFPDR         LEFR G  PKLL  +   
Sbjct: 208  QSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP------ILEFRMGEAPKLLGFENFT 261

Query: 972  RREWESCQGSGA----------------VT----------------PDAVGPRSRDSRLL 889
             R+W S  GSG+                VT                PD +GP SRD  L+
Sbjct: 262  TRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLV 321

Query: 888  NRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVEN 709
              Q S+VA L N  +    NDET++DHRVSFE++ E+V  C+E K ++ S ++  E  ++
Sbjct: 322  GSQISEVALLANPAN-GPKNDETIVDHRVSFELSGEDVAPCLESKSLLPS-RAVSEYPKD 379

Query: 708  V--EHIKEEKPIK--TANGVDHPSGETSNITSEKDHIHTDGDNEKQH--HKTRTITLGST 547
            +  E  KE   IK    +  +    ETSN T EK      G+ E++H   K R++TLGS 
Sbjct: 380  LVAEGRKERDGIKKDLESSCELFIRETSNETVEK----ASGEAEEEHSYQKHRSVTLGSI 435

Query: 546  KEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394
            KEFNFD+  G   D+P+I  S+WW NEKV  ++  P N W+FFP++Q  VS
Sbjct: 436  KEFNFDNTKGEASDKPTI-RSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  342 bits (877), Expect = 3e-91
 Identities = 214/470 (45%), Positives = 269/470 (57%), Gaps = 43/470 (9%)
 Frame = -3

Query: 1674 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 1498
            V K+RWGSC  LY CFGS K +KRIGHA ++PE     A +  T+E+ S P  +I     
Sbjct: 34   VYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGA-SVSTAENVSNPTGIILPFIA 92

Query: 1497 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 1318
                          SATQSP GLLS+TS+S N YSP GP SIFAIGPYAHETQLV+PPVF
Sbjct: 93   PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVF 152

Query: 1317 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNL----RNGEAGQRYPLTQYEFQ 1150
            S  TTEPSTAPFTPPPES+ LTTPSSPEVPFA+LL  +L    RN    Q++ L+ YEFQ
Sbjct: 153  SALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQ 212

Query: 1149 SYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR 970
            SYQ+ PGSP  +L              PFPDR         LEFR G  PKLL  +    
Sbjct: 213  SYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRP------ILEFRMGEAPKLLGFENFTT 266

Query: 969  REWESCQGSGA----------------VT----------------PDAVGPRSRDSRLLN 886
            R+W S  GSG+                VT                PD +GP SRD  L+ 
Sbjct: 267  RKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVG 326

Query: 885  RQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEEVVRCVEKKPVVGSPKSAPESVENV 706
             Q S+VA L N  +    NDET++DHRVSFE++ E+V  C+E K ++ S ++  E  +++
Sbjct: 327  SQISEVALLANPAN-GPKNDETIVDHRVSFELSGEDVAPCLESKSLLPS-RAVSEYPKDL 384

Query: 705  --EHIKEEKPIK--TANGVDHPSGETSNITSEKDHIHTDGDNEKQH--HKTRTITLGSTK 544
              E  KE   IK    +  +    ETSN T EK      G+ E++H   K R++TLGS K
Sbjct: 385  VAEGRKERDGIKKDLESSCELFIRETSNETVEK----ASGEAEEEHSYQKHRSVTLGSIK 440

Query: 543  EFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSFFPLMQTGVS 394
            EFNFD+  G   D+P+I  S+WW NEKV  ++  P N W+FFP++Q  VS
Sbjct: 441  EFNFDNTKGEASDKPTI-RSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_003525388.1| PREDICTED: uncharacterized protein LOC100791666 isoform X1 [Glycine
            max]
          Length = 461

 Score =  325 bits (834), Expect = 3e-86
 Identities = 185/432 (42%), Positives = 250/432 (57%), Gaps = 4/432 (0%)
 Frame = -3

Query: 1677 SVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 1501
            S QK+RWGS L    CFG  KT KRIGHA ++PE T   AD A  +    Q PS+     
Sbjct: 38   STQKKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASS-IQAPSITLPFV 96

Query: 1500 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 1321
                           S  QSP G +S T VSA++YSPGGP SIFAIGPYAHETQLVSPPV
Sbjct: 97   APPSSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPV 156

Query: 1320 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQ 1141
            FS      STAPFTPPPES+H+TTPSSPEVPFA+LL+PN +N E  QR+ ++ Y+FQSYQ
Sbjct: 157  FSA----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQ 212

Query: 1140 LQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR--R 967
              PGSPV  L              P PD +F A +   L+F+  +PPKLL+LD  +    
Sbjct: 213  FHPGSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCE 272

Query: 966  EWESCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEIT 787
              +S  GSG++TPDA    ++   L N   S++   P+  + RL  +E  ++HRVSFE++
Sbjct: 273  NQKSNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRL--NEISINHRVSFELS 330

Query: 786  AEEVVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHPSGETSNITSEKDHIH 607
            A++V++ +E KP   +  +    ++N     +++     + +D     +     +     
Sbjct: 331  AQKVLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETT 390

Query: 606  TDGDNEKQ-HHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQ 430
              GD     H K +++TL S KEFNFD+ DGG+   P+I  +DWW NEKV  ++   S  
Sbjct: 391  LGGDKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIV-ADWWANEKVAGKEREASKD 449

Query: 429  WSFFPLMQTGVS 394
            WSFFP++Q GVS
Sbjct: 450  WSFFPMIQPGVS 461


>ref|XP_006580574.1| PREDICTED: uncharacterized protein LOC100791666 isoform X2 [Glycine
            max]
          Length = 441

 Score =  322 bits (825), Expect = 4e-85
 Identities = 183/429 (42%), Positives = 248/429 (57%), Gaps = 4/429 (0%)
 Frame = -3

Query: 1668 KRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXXXX 1492
            K+RWGS L    CFG  KT KRIGHA ++PE T   AD A  +    Q PS+        
Sbjct: 21   KKRWGSWLGKIGCFGYKKTRKRIGHAVLVPEPTTNGADPAAAASS-IQAPSITLPFVAPP 79

Query: 1491 XXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVFST 1312
                        S  QSP G +S T VSA++YSPGGP SIFAIGPYAHETQLVSPPVFS 
Sbjct: 80   SSPASFFQSEPPSTAQSPIGKVSHTCVSASIYSPGGPASIFAIGPYAHETQLVSPPVFSA 139

Query: 1311 FTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQLQP 1132
                 STAPFTPPPES+H+TTPSSPEVPFA+LL+PN +N E  QR+ ++ Y+FQSYQ  P
Sbjct: 140  ----SSTAPFTPPPESVHMTTPSSPEVPFAQLLDPNNKNSETFQRFQISHYDFQSYQFHP 195

Query: 1131 GSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVR--REWE 958
            GSPV  L              P PD +F A +   L+F+  +PPKLL+LD  +      +
Sbjct: 196  GSPVGQLISPRSAISVSGTSSPLPDSEFNATFAHILDFQRADPPKLLNLDNKLSSCENQK 255

Query: 957  SCQGSGAVTPDAVGPRSRDSRLLNRQDSDVAPLPNTGSYRLANDETVLDHRVSFEITAEE 778
            S  GSG++TPDA    ++   L N   S++   P+  + RL  +E  ++HRVSFE++A++
Sbjct: 256  SNHGSGSLTPDAARSTTQSGFLSNHWVSEIKMSPHPSNNRL--NEISINHRVSFELSAQK 313

Query: 777  VVRCVEKKPVVGSPKSAPESVENVEHIKEEKPIKTANGVDHPSGETSNITSEKDHIHTDG 598
            V++ +E KP   +  +    ++N     +++     + +D     +     +       G
Sbjct: 314  VLKSLENKPAASAWTNVLPKLKNDAPTTDKEEKSEESALDDKQVVSEAHNDQPLETTLGG 373

Query: 597  DNEKQ-HHKTRTITLGSTKEFNFDSVDGGNCDEPSIASSDWWVNEKVVAEDGSPSNQWSF 421
            D     H K +++TL S KEFNFD+ DGG+   P+I  +DWW NEKV  ++   S  WSF
Sbjct: 374  DKATTVHEKDQSLTLSSAKEFNFDNADGGDSLAPNIV-ADWWANEKVAGKEREASKDWSF 432

Query: 420  FPLMQTGVS 394
            FP++Q GVS
Sbjct: 433  FPMIQPGVS 441


Top