BLASTX nr result

ID: Mentha23_contig00015373 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00015373
         (1314 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   313   8e-83
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   299   2e-78
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   296   1e-77
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   294   5e-77
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   290   1e-75
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   288   3e-75
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   287   6e-75
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   281   3e-73
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     275   4e-71
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   263   1e-67
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   263   1e-67
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   258   3e-66
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   253   1e-64
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   253   1e-64
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              250   1e-63
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   242   3e-61
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   237   7e-60
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   237   7e-60
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   233   1e-58
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   233   1e-58

>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  313 bits (803), Expect = 8e-83
 Identities = 202/399 (50%), Positives = 226/399 (56%), Gaps = 34/399 (8%)
 Frame = -3

Query: 1309 SFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSP-QPPSIXXXXXXXXXXXXXX 1133
            SFWSLYWCF     KRIGHA+LV ETSSS    T     P QPPSI              
Sbjct: 42   SFWSLYWCFRPNNNKRIGHAVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASF 101

Query: 1132 XXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEPS 956
                        TG+L               + +FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 102  IPSEPPSSTQSPTGLLSLSSPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPS 161

Query: 955  TAPYTPPPE-SVHMTTPSSPEVPFARLLEPNLQNGQRYPFSQYEFQSYQLQPGSPVSHLX 779
            TAPYTPPPE S H+TTPSSPEVPFARLLEPN    QRYP SQYEFQSYQLQPGSPVSHL 
Sbjct: 162  TAPYTPPPEFSAHLTTPSSPEVPFARLLEPN----QRYPLSQYEFQSYQLQPGSPVSHLI 217

Query: 778  XXXXXXXXXXXXXPFPE------HPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTAT 617
                         PF +      HPFFLE   GN P +          +WES Q SG  T
Sbjct: 218  SPCSGISGSGASSPFLDRDFAAVHPFFLEFGGGNPPRR---------DQWESCQESGVVT 268

Query: 616  P----DPRSRDN-FLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKK 479
            P     PRSRD+  LLNRQ+SD++P+ +         +A+ HRVSFEIT E+V+RCVEKK
Sbjct: 269  PTDAVGPRSRDSCVLLNRQNSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEKK 328

Query: 478  EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE--NV 305
               T  E       SVG+                   RHQKNRTITLGS+KEFNFE  N 
Sbjct: 329  SLETAQE-------SVGKKPIELINREEDQTEIVNEKRHQKNRTITLGSTKEFNFEGGNC 381

Query: 304  DE----DSDWFVGE-----EGAGHSEKWSFFPMMQTGVS 215
            DE     S+W+V E     EG G SE WSFFP++Q GVS
Sbjct: 382  DEPCVDSSEWWVNEKKVPKEGGGSSENWSFFPILQPGVS 420


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  299 bits (765), Expect = 2e-78
 Identities = 194/410 (47%), Positives = 228/410 (55%), Gaps = 44/410 (10%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            G  WS+YWCFGS K TKRIGHA+ +PET++SGAD  ++  S Q PSI             
Sbjct: 43   GGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFIAPPSSPAS 102

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPS 956
                          G                 S+FAIGPYAHETQLVSPPVFS FTTEPS
Sbjct: 103  FLPSEPPSATHSPVG--SKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPS 160

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQLQPGSPVS+
Sbjct: 161  TAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSN 220

Query: 784  LXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD-- 611
            L              PF +     E   G   P+ L L+KI   EW SRQGSGT TP+  
Sbjct: 221  LISPGSAISVSGTSSPFLDR----EYTPGR--PQFLNLEKIAPHEWGSRQGSGTLTPEAV 274

Query: 610  -PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKKEP---N 470
             P+  DNFLLN Q+S V  + +         + V HRVSFEIT E+VVRCVEKK      
Sbjct: 275  NPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMR 334

Query: 469  TG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXRHQKNRTITLG 335
            TG       ERS   + ++ E SN          ++             R QK+R+ITLG
Sbjct: 335  TGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQKHRSITLG 394

Query: 334  SSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 215
            SSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 395  SSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  296 bits (758), Expect = 1e-77
 Identities = 193/410 (47%), Positives = 228/410 (55%), Gaps = 44/410 (10%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS WS+YWCFGS K TKRIGHA+ +PET++S AD  ++  S Q PSI             
Sbjct: 43   GSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFIAPPSSPAS 102

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPS 956
                          G                 S+FAIGPYAHETQLVSPPVFS FTTEPS
Sbjct: 103  FLPSEPPSATHSPVG--SKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVFSAFTTEPS 160

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQLQPGSPVS+
Sbjct: 161  TAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQLQPGSPVSN 220

Query: 784  LXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD-- 611
            L              PF E     E   G   P+ L L+KI   EW SRQGSGT TP+  
Sbjct: 221  LISPGSAISVSGTSSPFLER----EYTPGR--PQFLNLEKIAPHEWGSRQGSGTLTPEAV 274

Query: 610  -PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKKEP---N 470
             P+  D+FLLN Q++ V  + +         + V HRVSFEIT E+VVRCVEKK      
Sbjct: 275  NPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVEKKPTMMMR 334

Query: 469  TG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXRHQKNRTITLG 335
            TG       ERS   + ++ E SN          ++             R QK+R+ITLG
Sbjct: 335  TGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQKHRSITLG 394

Query: 334  SSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 215
            SSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 395  SSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  294 bits (753), Expect = 5e-77
 Identities = 188/421 (44%), Positives = 228/421 (54%), Gaps = 55/421 (13%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS+WS+YWCFG  +  KRIGHA+LVPET+  G D+  A +  Q PSI             
Sbjct: 43   GSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFVAPPSSPAS 102

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPS 956
                          G                  +FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 103  FLQSEPPSATQSPAGFFSLTASMYSPSGPTS--IFAIGPYAHETQLVSPPVFSTFTTEPS 160

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+LL+P+ +N   GQR+P S YEFQSYQL PGSPV  
Sbjct: 161  TAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQLYPGSPVGQ 220

Query: 784  LXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGT 623
            L              PFP+  F      FLE RTG+ PPKLL LD +  R+W SR GSG+
Sbjct: 221  LISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGD-PPKLLNLDILSTRDWGSRLGSGS 279

Query: 622  ATPD---PRSRDNFLLNRQDSDVA--PVSES-------AVSHRVSFEITNEEVVRCVEKK 479
             TPD     S D FLL  Q  +V   P S +       +++HRVSFE+++EEV+RCVEKK
Sbjct: 280  VTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEEVIRCVEKK 339

Query: 478  ----------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXR 365
                                  +P+  +  S+     VGE+SN                 
Sbjct: 340  PVALAEAVSTSLEDTEKAQSKEDPSKVVSSSI---CPVGETSN--DAAEKAVADGEEAQL 394

Query: 364  HQKNRTITLGSSKEFNFENVDE-------DSDWFVGE----EGAGHSEKWSFFPMMQTGV 218
            H K R+ITLGS KEFNF+N D         SDW+  E    +  G ++ WSFFPMMQ GV
Sbjct: 395  HPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSFFPMMQPGV 454

Query: 217  S 215
            S
Sbjct: 455  S 455


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  290 bits (741), Expect = 1e-75
 Identities = 189/438 (43%), Positives = 225/438 (51%), Gaps = 72/438 (16%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXX 1133
            GS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I              
Sbjct: 43   GSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASF 102

Query: 1132 XXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEPS 956
                        +G+L               + +FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 103  LQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPS 162

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL PGSPV H
Sbjct: 163  TAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGH 222

Query: 784  LXXXXXXXXXXXXXXPFPEHPF-------FLELRTGNYPPKLLELDKIVLREWESRQGSG 626
            L              PFP+  F       FLE R G  PPKLL LDK+   EW SR GSG
Sbjct: 223  LISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGG-PPKLLTLDKLSNHEWGSRIGSG 281

Query: 625  TATPD---PRSRDNFLLNRQDSDV---------------------------APVSESAVS 536
            + TPD   P SRD  +L+RQ SDV                            P +E  V 
Sbjct: 282  SITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVD 341

Query: 535  HRVSFEITNEEVVRCVEKKEPN--TGIERSL-------IEKTS----------VGESSNR 413
            HRVSFE+T E+VVRCVEK        +  SL       I++ S          VGE++N 
Sbjct: 342  HRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANN 401

Query: 412  KQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDE--------DSDWFVGE----EG 269
                            H K R+ITLGS+KEFNF+N D          SDW+  E    + 
Sbjct: 402  PPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKE 461

Query: 268  AGHSEKWSFFPMMQTGVS 215
             G S+ WS F MMQ  VS
Sbjct: 462  VGASKNWSIFHMMQPSVS 479


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  288 bits (738), Expect = 3e-75
 Identities = 186/413 (45%), Positives = 233/413 (56%), Gaps = 51/413 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            G  WS+ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I             
Sbjct: 44   GGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLPFVAPPSSPAS 103

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G++               S +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 104  FLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEP 163

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVS 788
            STAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQSY L PGSPV 
Sbjct: 164  STAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVG 223

Query: 787  HLXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSG 626
            +L              PFP+  F      F +   G+ PPKLL LDK+ +REW SRQGSG
Sbjct: 224  NLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSIREWGSRQGSG 282

Query: 625  TATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRCVEKK 479
            T TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T E+VVRCVEKK
Sbjct: 283  TLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKK 342

Query: 478  EPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXXXXXXXRHQKN 353
             P T    +  SL   T+V      GE+ N                         RHQK 
Sbjct: 343  -PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQ 401

Query: 352  RTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFPMMQ 227
            ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP++Q
Sbjct: 402  QSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQ 454


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  287 bits (735), Expect = 6e-75
 Identities = 185/413 (44%), Positives = 233/413 (56%), Gaps = 51/413 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            G  W++ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I             
Sbjct: 44   GGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLPFVAPPSSPAS 103

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G++               S +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 104  FLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSPPVFSTFTTEP 163

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVS 788
            STAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQSY L PGSPV 
Sbjct: 164  STAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQSYHLHPGSPVG 223

Query: 787  HLXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSG 626
            +L              PFP+  F      F +   G+ PPKLL LDK+ +REW SRQGSG
Sbjct: 224  NLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSIREWGSRQGSG 282

Query: 625  TATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRCVEKK 479
            T TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T E+VVRCVEKK
Sbjct: 283  TLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTTEDVVRCVEKK 342

Query: 478  EPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXXXXXXXRHQKN 353
             P T    +  SL   T+V      GE+ N                         RHQK 
Sbjct: 343  -PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVDVEEAPRHQKQ 401

Query: 352  RTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFPMMQ 227
            ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP++Q
Sbjct: 402  QSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFPVIQ 454


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  281 bits (720), Expect = 3e-73
 Identities = 188/418 (44%), Positives = 223/418 (53%), Gaps = 53/418 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            G  WS+YWCFGS K K RIG A+L  ETS SGA+   A +  Q P+I             
Sbjct: 43   GGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFVAPPSSPAS 102

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPS 956
                          G++               S+FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 103  FLPSEPPSATQSPAGLVSLTSISASMYSPGPASIFAIGPYAHETQLVSPPVFSTFTTEPS 162

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG---QRYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+LL PNLQ G   QR+P S YEFQSYQL PGSPV  
Sbjct: 163  TAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQLHPGSPVGQ 222

Query: 784  LXXXXXXXXXXXXXXPFPEHPF-----FLELRTGNYPPKLLELDKIVLREWESRQGSGTA 620
            L              PF +  F     F E R G+ PPKLL LDK    EW S  GSGT 
Sbjct: 223  LISPSSGISGSGTSSPFRDGEFAASLHFPEFRMGD-PPKLLNLDKHSSCEWGSHHGSGTL 281

Query: 619  TPD---PRSRDNFLLNRQDSDV----------APVSESAVSHRVSFEITNEEVVRCVEKK 479
            TPD      R+ FLL+ Q S++              + A +HRVSFE+T EEVVR +E +
Sbjct: 282  TPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEEVVRSLEME 341

Query: 478  --EPNTGIERSL-IEKT----------------SVGESSNRKQXXXXXXXXXXXXXRHQK 356
               P+  +  SL IE T                 VGE+SN +              +H K
Sbjct: 342  TATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNER--PEKALADREGKPQHHK 399

Query: 355  NRTITLGSSKEFNFENVDE--------DSDWF----VGEEGAGHSEKWSFFPMMQTGV 218
            +++ITLGS+KEFNF+NVD          SDW+    V  +G G    WSFFPMMQ GV
Sbjct: 400  HQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFPMMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  275 bits (702), Expect = 4e-71
 Identities = 173/414 (41%), Positives = 223/414 (53%), Gaps = 48/414 (11%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            G   S+YWCFG+PK + RIGH +LVPET+  G  +  A +S Q  ++             
Sbjct: 45   GGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFIAPPSSPAS 104

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G+L               + +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 105  FLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEP 164

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQLQPGSPVS 788
            STAP+TPPPESVH+TTPSSPEVPFA+LL+PN+ N   GQR+P    EFQSY  QPGSP+ 
Sbjct: 165  STAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYYFQPGSPIG 224

Query: 787  HLXXXXXXXXXXXXXXPFPE------HPFFLELRTGNYPPKLLELDKIVLREWESRQGSG 626
             L              PFP+       P FLE RTG+ PPKLL LDK+   +W SRQGSG
Sbjct: 225  QLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGD-PPKLLNLDKLSKFDWGSRQGSG 283

Query: 625  TATPD---PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK-------- 479
            + TPD   P S      + + +     +E+    RVSF+++ E+V+R VEKK        
Sbjct: 284  SLTPDSVKPISTFEVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKKTVPLAEAM 343

Query: 478  ----------EPNTGIERSLIE----KTSVGESSNRKQXXXXXXXXXXXXXRHQKNRTIT 341
                      +     + + +E    +  VGE+SN  +             +HQK+R+IT
Sbjct: 344  LTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN--EEPDKAPTSGEEVLQHQKHRSIT 401

Query: 340  LGSSKEFNFENVDED--------SDWFVGEEGAGH----SEKWSFFPMMQTGVS 215
            LGSSKEFNF+N D          SDW+  ++ AG     S+ WSFFPM+Q GVS
Sbjct: 402  LGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPGVS 455


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  263 bits (672), Expect = 1e-67
 Identities = 177/419 (42%), Positives = 221/419 (52%), Gaps = 53/419 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++             
Sbjct: 42   GSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPAS 101

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G++               + +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 102  FFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEP 161

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPVS 788
            STAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ  PGSPV 
Sbjct: 162  STAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVG 218

Query: 787  HLXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSG 626
             L              PFP+  F      F E R G  PPKLL LDK+   EW S QGSG
Sbjct: 219  QLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSYQGSG 277

Query: 625  TATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCVE--- 485
              TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+  RCVE   
Sbjct: 278  ALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKP 337

Query: 484  -----------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQK 356
                             K+E N+G      E   VG +SN                +H+K
Sbjct: 338  AFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAPQHRK 394

Query: 355  NRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQTGVS 215
             ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFPM+Q+GVS
Sbjct: 395  QQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQSGVS 452


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  263 bits (672), Expect = 1e-67
 Identities = 177/419 (42%), Positives = 221/419 (52%), Gaps = 53/419 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++             
Sbjct: 43   GSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPSSPAS 102

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G++               + +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 103  FFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEP 162

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPVS 788
            STAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ  PGSPV 
Sbjct: 163  STAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPGSPVG 219

Query: 787  HLXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSG 626
             L              PFP+  F      F E R G  PPKLL LDK+   EW S QGSG
Sbjct: 220  QLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSYQGSG 278

Query: 625  TATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCVE--- 485
              TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+  RCVE   
Sbjct: 279  ALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCVEEKP 338

Query: 484  -----------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQK 356
                             K+E N+G      E   VG +SN                +H+K
Sbjct: 339  AFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAPQHRK 395

Query: 355  NRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQTGVS 215
             ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFPM+Q+GVS
Sbjct: 396  QQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQSGVS 453


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  258 bits (660), Expect = 3e-66
 Identities = 174/416 (41%), Positives = 219/416 (52%), Gaps = 51/416 (12%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVH-SPQPPSIXXXXXXXXXXXX 1139
            GS WS+YWCFG  +  KRIGHA+LVPE S+ G DS+ A + + Q P+I            
Sbjct: 46   GSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPFVAPPSSPA 105

Query: 1138 XXXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTE 962
                           GIL               + +FAIGPYAHETQLVSPP FSTFTTE
Sbjct: 106  SFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPPAFSTFTTE 165

Query: 961  PSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPV 791
            PSTAP+TPPPESV +TTPSSPEVPFA+LLEP+ +NG+   R+PFS YEFQSYQ  PGSPV
Sbjct: 166  PSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSYQFYPGSPV 225

Query: 790  SHLXXXXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGS 629
              L              PFP+  F      FLE +    PPKLL LDK+ + E  SRQGS
Sbjct: 226  GQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMA-VPPKLLNLDKLSVHECGSRQGS 284

Query: 628  GTATPDP--RSRDNFLLNRQDSDVAP--------VSESAVSHRVSFEITNEEVVRCVEKK 479
            GT TPD    +  +F L+RQ SD+A           +     RVSF+++ E+ +R  E K
Sbjct: 285  GTLTPDAVRATSCSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAEDALRYAEPK 344

Query: 478  EPN----------TGIERSLIEKTS---------VGESSNRKQXXXXXXXXXXXXXRHQK 356
              +            I    ++K+S         VGE+SN                RHQK
Sbjct: 345  PASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSN--GILEQASTGGEKTPRHQK 402

Query: 355  NRTITLGSSKEFNFENVD------EDSDWFVGEEGAGH----SEKWSFFPMMQTGV 218
            +RT+TLG+ KEFNF+N D         DW+      G     ++ WSFFP+MQ  +
Sbjct: 403  HRTLTLGTFKEFNFDNADGVPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVMQPSI 458


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  253 bits (647), Expect = 1e-64
 Identities = 169/417 (40%), Positives = 210/417 (50%), Gaps = 54/417 (12%)
 Frame = -3

Query: 1303 WSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXXXX 1127
            W +YWCFG  +  KRIGHA+++PET+S G +   A +  Q  SI                
Sbjct: 10   WGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQ 69

Query: 1126 XXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPSTAP 947
                                         S+FAIGPYAHETQLVSPPVFSTFTTEPSTAP
Sbjct: 70   SEPPSAMQSPG---FNFSLSASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP 126

Query: 946  YTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVSHLXX 776
            +TPP ESVH+T PSSPEVPFA+LL+ N    + GQRYP S YEFQSYQ  PGSPV  L  
Sbjct: 127  FTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLIS 186

Query: 775  XXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATP 614
                        PF +  F      FLE RTG   PK+L LD +  R+W SR  SG+ TP
Sbjct: 187  PSSGISGSGTSSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTP 245

Query: 613  D---PRSRDNF---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK--- 479
            D     S + F         +LN + +       +++ HRVSFE++ EEVVRCVEKK   
Sbjct: 246  DAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVA 305

Query: 478  -----------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKNR 350
                             E     E S   +  V ++SN                R+QK R
Sbjct: 306  LAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKER 365

Query: 349  TITLGSSKEFNFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 215
            +ITLGS+KEFNF+N D          +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 366  SITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 422


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  253 bits (647), Expect = 1e-64
 Identities = 169/417 (40%), Positives = 210/417 (50%), Gaps = 54/417 (12%)
 Frame = -3

Query: 1303 WSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXXXX 1127
            W +YWCFG  +  KRIGHA+++PET+S G +   A +  Q  SI                
Sbjct: 47   WGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQ 106

Query: 1126 XXXXXXXXXXTGILXXXXXXXXXXXXXXXSMFAIGPYAHETQLVSPPVFSTFTTEPSTAP 947
                                         S+FAIGPYAHETQLVSPPVFSTFTTEPSTAP
Sbjct: 107  SEPPSAMQSPG---FNFSLSASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAP 163

Query: 946  YTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVSHLXX 776
            +TPP ESVH+T PSSPEVPFA+LL+ N    + GQRYP S YEFQSYQ  PGSPV  L  
Sbjct: 164  FTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLIS 223

Query: 775  XXXXXXXXXXXXPFPEHPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATP 614
                        PF +  F      FLE RTG   PK+L LD +  R+W SR  SG+ TP
Sbjct: 224  PSSGISGSGTSSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTP 282

Query: 613  D---PRSRDNF---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK--- 479
            D     S + F         +LN + +       +++ HRVSFE++ EEVVRCVEKK   
Sbjct: 283  DAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVA 342

Query: 478  -----------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKNR 350
                             E     E S   +  V ++SN                R+QK R
Sbjct: 343  LAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKER 402

Query: 349  TITLGSSKEFNFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 215
            +ITLGS+KEFNF+N D          +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 403  SITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  250 bits (638), Expect = 1e-63
 Identities = 166/404 (41%), Positives = 200/404 (49%), Gaps = 38/404 (9%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXX 1133
            GS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I              
Sbjct: 43   GSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASF 102

Query: 1132 XXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEPS 956
                        +G+L               + +FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 103  LQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPS 162

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL PGSPV H
Sbjct: 163  TAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGH 222

Query: 784  LXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD-- 611
            L              PFP+                                SG+ TPD  
Sbjct: 223  LISPSSGISGSGTSSPFPDR-------------------------------SGSITPDAL 251

Query: 610  -PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKKEPN--TGIERSL--- 449
             P SRD  +L   D    P +E  V HRVSFE+T E+VVRCVEK        +  SL   
Sbjct: 252  GPPSRDGSVL---DHSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNP 308

Query: 448  ----IEKTS----------VGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFE 311
                I++ S          VGE++N                 H K R+ITLGS+KEFNF+
Sbjct: 309  ATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFD 368

Query: 310  NVDE--------DSDWFVGE----EGAGHSEKWSFFPMMQTGVS 215
            N D          SDW+  E    +  G S+ WS F MMQ  VS
Sbjct: 369  NADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 412


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  242 bits (617), Expect = 3e-61
 Identities = 167/402 (41%), Positives = 208/402 (51%), Gaps = 48/402 (11%)
 Frame = -3

Query: 1309 SFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXX 1133
            S WS+YWCFG  K+KR IGHA+L PE+S+ G+ +  A +S Q P +              
Sbjct: 44   SHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPPSSPASF 103

Query: 1132 XXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEPS 956
                         G++               + +FAIGPYAHETQLVSPPVFSTFTTEPS
Sbjct: 104  FQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPS 163

Query: 955  TAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPGSPVSH 785
            TAP+TPPPESVH+TTPSSPEVPFA+L++P L+NG    R+PF   +FQSYQ  PGS V  
Sbjct: 164  TAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHPGSSVGQ 220

Query: 784  LXXXXXXXXXXXXXXPFPEHPFFL------ELRTGNYPPKLLELDKIVLREWESRQGSGT 623
            L              PFP+  F +      E R G   PKLL LDK+  REW S Q SG 
Sbjct: 221  LISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG---PKLLNLDKLSTREWGSYQDSGA 277

Query: 622  ATPDP--RSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRCVEKK-- 479
             TPD       NFLL+RQ SDVA  P SE+       V+HR SFE++ ++  RCVE+K  
Sbjct: 278  LTPDSVRHGSPNFLLHRQFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRCVEEKPA 337

Query: 478  ------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKN 353
                              E N G      E+ S G++SN                +H+K 
Sbjct: 338  CSIKTVPEYVENGTKAKEEENYGELIQSFERRS-GDTSN---DTPETPSTDGEAPQHRKQ 393

Query: 352  RTITLGSSKEFNFENVDE-------DSDWFVGEEGAGHSEKW 248
            + ITLGS  EFNF+N DE        S+W V +   G S  W
Sbjct: 394  QPITLGSVNEFNFDNADEGDSHNPSSSNW-VKQPRTGPSSLW 434


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  237 bits (605), Expect = 7e-60
 Identities = 163/451 (36%), Positives = 210/451 (46%), Gaps = 85/451 (18%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I             
Sbjct: 40   GSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPAS 99

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G+L               + +FAIGPYAHETQLV+PPVFS  TTEP
Sbjct: 100  FLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEP 159

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEFQSYQLQPG 800
            STAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQSYQ+ PG
Sbjct: 160  STAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPG 219

Query: 799  SPVSHLXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSG-- 626
            SP  +L              PFP+    LE R G   PKLL  +    R+W SR GSG  
Sbjct: 220  SPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLGSGSL 278

Query: 625  --------------TATPD-------------------PRSRDNFLLNRQDSDVAPVS-- 551
                          + TPD                   P SRD FL+  Q S+VA ++  
Sbjct: 279  TPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANP 338

Query: 550  -------ESAVSHRVSFEITNEEVVRCVEKK--------------------EPNTGIERS 452
                   E+ V HRVSFE++ E+V  C+E K                    +   GI++ 
Sbjct: 339  ANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKD 398

Query: 451  LIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDED-------- 296
            L     +       +              +QK+R++TLGS KEFNF+N   +        
Sbjct: 399  LESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIR 458

Query: 295  SDWFVGEEGAGHSEK----WSFFPMMQTGVS 215
            S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 459  SEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  237 bits (605), Expect = 7e-60
 Identities = 163/451 (36%), Positives = 210/451 (46%), Gaps = 85/451 (18%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I             
Sbjct: 36   GSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPAS 95

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G+L               + +FAIGPYAHETQLV+PPVFS  TTEP
Sbjct: 96   FLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEP 155

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEFQSYQLQPG 800
            STAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQSYQ+ PG
Sbjct: 156  STAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPG 215

Query: 799  SPVSHLXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSG-- 626
            SP  +L              PFP+    LE R G   PKLL  +    R+W SR GSG  
Sbjct: 216  SPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLGSGSL 274

Query: 625  --------------TATPD-------------------PRSRDNFLLNRQDSDVAPVS-- 551
                          + TPD                   P SRD FL+  Q S+VA ++  
Sbjct: 275  TPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANP 334

Query: 550  -------ESAVSHRVSFEITNEEVVRCVEKK--------------------EPNTGIERS 452
                   E+ V HRVSFE++ E+V  C+E K                    +   GI++ 
Sbjct: 335  ANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKD 394

Query: 451  LIEKTSVGESSNRKQXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENVDED-------- 296
            L     +       +              +QK+R++TLGS KEFNF+N   +        
Sbjct: 395  LESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIR 454

Query: 295  SDWFVGEEGAGHSEK----WSFFPMMQTGVS 215
            S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 455  SEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  233 bits (594), Expect = 1e-58
 Identities = 161/437 (36%), Positives = 208/437 (47%), Gaps = 71/437 (16%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS WSLYWCFGS K +KRIGHA+LVPE  + G       +     +I             
Sbjct: 36   GSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPAS 95

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G+L               + +FAIGPYAHETQLVSPPVFSTFTTEP
Sbjct: 96   FLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEP 155

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQRY-------PFSQYEFQSYQLQPG 800
            STA +TPPPE VHMTTP SPEVPFA+LL  +L   +RY       P SQYEF  YQ  PG
Sbjct: 156  STANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ-DPG 214

Query: 799  SPVSHLXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTA 620
            SP S+L              PFP     +E R G  PPK L  +    R+W SR GSG+ 
Sbjct: 215  SPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGE-PPKFLGYEHFSTRKWGSRVGSGSV 273

Query: 619  TP-------------------------------DPRSRDNFLLNRQDSDVAP-------- 557
            TP                               +P SRD++LL  Q S+VA         
Sbjct: 274  TPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDNGS 333

Query: 556  -VSESAVSHRVSFEITNEEVVRCVEKK------EPNTGIERSLIEKTSVGESSN----RK 410
             + E+ + HRVSFE+T E+V  C EK+      +P   ++ S +  + +   S+    + 
Sbjct: 334  EIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKT 393

Query: 409  QXXXXXXXXXXXXXRHQKNRTITLGSSKEFNFENV--------DEDSDWFVGEEGA---- 266
                           H+K+R IT GSSK+F+F+NV          D +W+  ++ A    
Sbjct: 394  YGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKES 453

Query: 265  GHSEKWSFFPMMQTGVS 215
            G    W+FFP++Q GVS
Sbjct: 454  GIQNNWTFFPVLQPGVS 470


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  233 bits (594), Expect = 1e-58
 Identities = 164/422 (38%), Positives = 203/422 (48%), Gaps = 56/422 (13%)
 Frame = -3

Query: 1312 GSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXX 1136
            GS  SLYWCFGS + +KRIGHA+LVPE    GA +  + +     SI             
Sbjct: 36   GSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPAS 95

Query: 1135 XXXXXXXXXXXXXTGILXXXXXXXXXXXXXXXS-MFAIGPYAHETQLVSPPVFSTFTTEP 959
                          G L               + MFAIGPYAHETQLVSPPVFSTF TEP
Sbjct: 96   FLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEP 155

Query: 958  STAPYTPPPESVHMTTPSSPEVPFARLLEPNLQ-------NGQRYPFSQYEFQSYQLQPG 800
            STAP+TPPPESV +TTPSSPEVPFA+LL  +L          Q+   S YEFQ YQL P 
Sbjct: 156  STAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPE 215

Query: 799  SPVSHLXXXXXXXXXXXXXXPFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTA 620
            SPV HL              PFP+    +E       PKLL  +    R W SR GSG+ 
Sbjct: 216  SPVGHL---ISPISNSGTSSPFPDRRPIVE------APKLLGFEHFSTRRWGSRLGSGSL 266

Query: 619  TPD---PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFEITNEEVVRCVEKKE 476
            TPD   P SRD+FLL  Q S+VA ++         E+ + HRVSFE+  E+V  CVEKK 
Sbjct: 267  TPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDVAVCVEKKP 326

Query: 475  PNTG----------IERSLIEKTSVGESSNR------------KQXXXXXXXXXXXXXRH 362
              +           +E   IE+   G S +             K               H
Sbjct: 327  VASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASAEGEEEQCH 386

Query: 361  QKNRTITLGSSKEFNFENVDED---------SDWFVGE----EGAGHSEKWSFFPMMQTG 221
            +K+  I  GS KEFNF+N   +         S+W+V E    +G G    W+FFP++Q G
Sbjct: 387  KKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPG 446

Query: 220  VS 215
            +S
Sbjct: 447  IS 448

