BLASTX nr result

ID: Mentha26_contig00035246 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00035246
         (1636 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   335   4e-89
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   314   8e-83
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   311   5e-82
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   308   4e-81
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   305   5e-80
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   303   1e-79
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   300   1e-78
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   294   9e-77
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     285   3e-74
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   276   2e-71
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   272   4e-70
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   271   5e-70
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   269   2e-69
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   266   2e-68
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              260   1e-66
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   251   9e-64
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   251   9e-64
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   248   6e-63
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   248   6e-63
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   248   6e-63

>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  335 bits (859), Expect = 4e-89
 Identities = 216/440 (49%), Positives = 239/440 (54%), Gaps = 35/440 (7%)
 Frame = +1

Query: 172  MRRGAN-GPDXXXXXXXXXXXXXXXXXRGDHASSAQKRRWGSFWSLYWCFGSPKTKRIGH 348
            MRRG N G D                  G HASS QKRRW SFWSLYWCF     KRIGH
Sbjct: 1    MRRGVNNGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNKRIGH 60

Query: 349  AILVPETSSSGADST-TAVHSSQPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILXXX 525
            A+LV ETSSS    T TA    QPPSI                           G+L   
Sbjct: 61   AVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSLS 120

Query: 526  XXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE-SVHMTTPSS 699
                          +FAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE S H+TTPSS
Sbjct: 121  SPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPSS 180

Query: 700  PEVPFARLLEPNLQNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE-- 873
            PEVPFARLLEPN    QRYP SQYEFQSYQLQPGSPVSHL               F +  
Sbjct: 181  PEVPFARLLEPN----QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPFLDRD 236

Query: 874  ----HPFFLEFRTGNYPPKLLELDTIMLREWESRQGSGTATP----DPRLRDN-FLLNRQ 1026
                HPFFLEF  GN P +          +WES Q SG  TP     PR RD+  LLNRQ
Sbjct: 237  FAAVHPFFLEFGGGNPPRR---------DQWESCQESGVVTPTDAVGPRSRDSCVLLNRQ 287

Query: 1027 DSDVAPVSE---------STVSHRVSFEITNEEVVRCVEKKEPNTGIERSLVEKTSVGES 1179
            +SD++P+ +         + + HRVSFEIT E+V+RCVEKK   T  E       SVG+ 
Sbjct: 288  NSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEKKSLETAQE-------SVGKK 340

Query: 1180 SNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFE--NVDE----DSDWFVGE----- 1326
                               HQKNRTITLGS+KEFNFE  N DE     S+W+V E     
Sbjct: 341  PIELINREEDQTEIVNEKRHQKNRTITLGSTKEFNFEGGNCDEPCVDSSEWWVNEKKVPK 400

Query: 1327 EGAGHSEKWSFFPMMQSGVS 1386
            EG G SE WSFFP++Q GVS
Sbjct: 401  EGGGSSENWSFFPILQPGVS 420


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  314 bits (804), Expect = 8e-83
 Identities = 197/418 (47%), Positives = 233/418 (55%), Gaps = 44/418 (10%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            +S QKRRWG  WS+YWCFGS K TKRIGHA+ +PET++SGAD  ++  SSQ PSI     
Sbjct: 35   ASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFI 94

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVF 621
                                  G                  +FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLPSEPPSATHSPVG--SKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVF 152

Query: 622  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQL 792
            S FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQL
Sbjct: 153  SAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQL 212

Query: 793  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWESRQGS 972
            QPGSPVS+L               F +     E+  G   P+ L L+ I   EW SRQGS
Sbjct: 213  QPGSPVSNLISPGSAISVSGTSSPFLDR----EYTPGR--PQFLNLEKIAPHEWGSRQGS 266

Query: 973  GTATPD---PRLRDNFLLNRQDSDVAPVSE---------STVSHRVSFEITNEEVVRCVE 1116
            GT TP+   P+  DNFLLN Q+S V  + +         + V HRVSFEIT E+VVRCVE
Sbjct: 267  GTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVE 326

Query: 1117 KKEP---NTG------IERSLVEKTSVGESSN---------RKQXXXXXXXXXXXXXXHQ 1242
            KK      TG       ERS   + ++ E SN          ++               Q
Sbjct: 327  KKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQQ 386

Query: 1243 KNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQSGVS 1386
            K+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 387  KHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  311 bits (797), Expect = 5e-82
 Identities = 196/418 (46%), Positives = 233/418 (55%), Gaps = 44/418 (10%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            +S QKRRWGS WS+YWCFGS K TKRIGHA+ +PET++S AD  ++  SSQ PSI     
Sbjct: 35   ASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFI 94

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVF 621
                                  G                  +FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLPSEPPSATHSPVG--SKCLSMSTYSPSGPASIFAIGPYAHETQLVSPPVF 152

Query: 622  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQL 792
            S FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQL
Sbjct: 153  SAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQL 212

Query: 793  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWESRQGS 972
            QPGSPVS+L               F E     E+  G   P+ L L+ I   EW SRQGS
Sbjct: 213  QPGSPVSNLISPGSAISVSGTSSPFLER----EYTPGR--PQFLNLEKIAPHEWGSRQGS 266

Query: 973  GTATPD---PRLRDNFLLNRQDSDVAPVSE---------STVSHRVSFEITNEEVVRCVE 1116
            GT TP+   P+  D+FLLN Q++ V  + +         + V HRVSFEIT E+VVRCVE
Sbjct: 267  GTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCVE 326

Query: 1117 KKEP---NTG------IERSLVEKTSVGESSN---------RKQXXXXXXXXXXXXXXHQ 1242
            KK      TG       ERS   + ++ E SN          ++               Q
Sbjct: 327  KKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQQ 386

Query: 1243 KNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQSGVS 1386
            K+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 387  KHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  308 bits (790), Expect = 4e-81
 Identities = 192/429 (44%), Positives = 234/429 (54%), Gaps = 55/429 (12%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            ++ QKRRWGS+WS+YWCFG  +  KRIGHA+LVPET+  G D+  A +  Q PSI     
Sbjct: 35   ATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFV 94

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVF 621
                                  G                  +FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSLTASMYSPSGPTS--IFAIGPYAHETQLVSPPVF 152

Query: 622  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQL 792
            STFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+ +N   GQR+P S YEFQSYQL
Sbjct: 153  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQL 212

Query: 793  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIMLREW 954
             PGSPV  L               FP+  F      FLEFRTG+ PPKLL LD +  R+W
Sbjct: 213  YPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGD-PPKLLNLDILSTRDW 271

Query: 955  ESRQGSGTATPD---PRLRDNFLLNRQDSDVA--PVSES-------TVSHRVSFEITNEE 1098
             SR GSG+ TPD       D FLL  Q  +V   P S +       +++HRVSFE+++EE
Sbjct: 272  GSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSEE 331

Query: 1099 VVRCVEKK----------------------EPNTGIERSLVEKTSVGESSNRKQXXXXXX 1212
            V+RCVEKK                      +P+  +  S+     VGE+SN         
Sbjct: 332  VIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSI---CPVGETSN--DAAEKAV 386

Query: 1213 XXXXXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWFVGE----EGAGHSEKWSF 1359
                    H K R+ITLGS KEFNF+N D         SDW+  E    +  G ++ WSF
Sbjct: 387  ADGEEAQLHPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWSF 446

Query: 1360 FPMMQSGVS 1386
            FPMMQ GVS
Sbjct: 447  FPMMQPGVS 455


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  305 bits (780), Expect = 5e-80
 Identities = 189/423 (44%), Positives = 240/423 (56%), Gaps = 51/423 (12%)
 Frame = +1

Query: 259  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXX 435
            H +++QKRRWG  WS+ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S+Q  +I   
Sbjct: 34   HQATSQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLP 93

Query: 436  XXXXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSP 612
                                    G++                 +FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 613  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 783
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 784  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIML 945
            Y L PGSPV +L               FP+  F      F +F  G+ PPKLL LD + +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 946  REWESRQGSGTATPD---PRLRDNFLLNRQDSDVA--PVSES------TVSHRVSFEITN 1092
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1093 EEVVRCVEKKEPNT---GIERSLVEKTSV------GESSN---------RKQXXXXXXXX 1218
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1219 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1365
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1366 MMQ 1374
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  303 bits (777), Expect = 1e-79
 Identities = 188/423 (44%), Positives = 240/423 (56%), Gaps = 51/423 (12%)
 Frame = +1

Query: 259  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXX 435
            H +++QKRRWG  W++ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S+Q  +I   
Sbjct: 34   HQATSQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLP 93

Query: 436  XXXXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSP 612
                                    G++                 +FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 613  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 783
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 784  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIML 945
            Y L PGSPV +L               FP+  F      F +F  G+ PPKLL LD + +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 946  REWESRQGSGTATPD---PRLRDNFLLNRQDSDVA--PVSES------TVSHRVSFEITN 1092
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1093 EEVVRCVEKKEPNT---GIERSLVEKTSV------GESSN---------RKQXXXXXXXX 1218
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1219 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1365
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1366 MMQ 1374
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  300 bits (769), Expect = 1e-78
 Identities = 191/445 (42%), Positives = 227/445 (51%), Gaps = 72/445 (16%)
 Frame = +1

Query: 268  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXX 447
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A + +Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 448  XXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFS 624
                                G+L                 +FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 625  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 795
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 796  PGSPVSHLXXXXXXXXXXXXXXXFPEHPF-------FLEFRTGNYPPKLLELDTIMLREW 954
            PGSPV HL               FP+  F       FLEFR G  PPKLL LD +   EW
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGG-PPKLLTLDKLSNHEW 274

Query: 955  ESRQGSGTATPD---PRLRDNFLLNRQDSDV---------------------------AP 1044
             SR GSG+ TPD   P  RD  +L+RQ SDV                            P
Sbjct: 275  GSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCP 334

Query: 1045 VSESTVSHRVSFEITNEEVVRCVEK-------------KEPNT------GIERSLVEKTS 1167
             +E  V HRVSFE+T E+VVRCVEK             + P T        E  +  +  
Sbjct: 335  NNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGR 394

Query: 1168 VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVG 1323
            VGE++N                 H K R+ITLGS+KEFNF+N D          SDW+  
Sbjct: 395  VGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWAN 454

Query: 1324 E----EGAGHSEKWSFFPMMQSGVS 1386
            E    +  G S+ WS F MMQ  VS
Sbjct: 455  EKVVGKEVGASKNWSIFHMMQPSVS 479


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  294 bits (752), Expect = 9e-77
 Identities = 191/426 (44%), Positives = 228/426 (53%), Gaps = 53/426 (12%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            ++ QKRRWG  WS+YWCFGS K K RIG A+L  ETS SGA+   A + +Q P+I     
Sbjct: 35   ATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFV 94

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVF 621
                                  G++                +FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPGPASIFAIGPYAHETQLVSPPVF 154

Query: 622  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG---QRYPFSQYEFQSYQL 792
            STFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL PNLQ G   QR+P S YEFQSYQL
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQL 214

Query: 793  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPF-----FLEFRTGNYPPKLLELDTIMLREWE 957
             PGSPV  L               F +  F     F EFR G+ PPKLL LD     EW 
Sbjct: 215  HPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLHFPEFRMGD-PPKLLNLDKHSSCEWG 273

Query: 958  SRQGSGTATPDPRL---RDNFLLNRQDSDVA--------PVSESTV--SHRVSFEITNEE 1098
            S  GSGT TPD      R+ FLL+ Q S++          V    V  +HRVSFE+T EE
Sbjct: 274  SHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEE 333

Query: 1099 VVRCVEKK--EPNTGIERSL-VEKT----------------SVGESSNRKQXXXXXXXXX 1221
            VVR +E +   P+  +  SL +E T                 VGE+SN +          
Sbjct: 334  VVRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNER--PEKALADR 391

Query: 1222 XXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWF----VGEEGAGHSEKWSFFP 1365
                 H K+++ITLGS+KEFNF+NVD          SDW+    V  +G G    WSFFP
Sbjct: 392  EGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFP 451

Query: 1366 MMQSGV 1383
            MMQ GV
Sbjct: 452  MMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  285 bits (730), Expect = 3e-74
 Identities = 178/430 (41%), Positives = 227/430 (52%), Gaps = 56/430 (13%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            ++ +KRRWG   S+YWCFG+PK + RIGH +LVPET+  G  +  A +S+Q  ++     
Sbjct: 37   ATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFI 96

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPV 618
                                  G+L                 +FAIGPYAHETQLVSPPV
Sbjct: 97   APPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPV 156

Query: 619  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 789
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN+ N   GQR+P    EFQSY 
Sbjct: 157  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYY 216

Query: 790  LQPGSPVSHLXXXXXXXXXXXXXXXFPE------HPFFLEFRTGNYPPKLLELDTIMLRE 951
             QPGSP+  L               FP+       P FLEFRTG+ PPKLL LD +   +
Sbjct: 217  FQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGD-PPKLLNLDKLSKFD 275

Query: 952  WESRQGSGTATPD-----------PRLRDNFLLNRQDSDVAPVSESTVSHRVSFEITNEE 1098
            W SRQGSG+ TPD           P L+ N             +E+    RVSF+++ E+
Sbjct: 276  WGSRQGSGSLTPDSVKPISTFEVAPHLKPNGRCRN--------AENVADRRVSFDVSTED 327

Query: 1099 VVRCVEKK------------------EPNTGIERSLVE----KTSVGESSNRKQXXXXXX 1212
            V+R VEKK                  +     + + VE    +  VGE+SN  +      
Sbjct: 328  VIRYVEKKTVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN--EEPDKAP 385

Query: 1213 XXXXXXXXHQKNRTITLGSSKEFNFENVDED--------SDWFVGEEGAGH----SEKWS 1356
                    HQK+R+ITLGSSKEFNF+N D          SDW+  ++ AG     S+ WS
Sbjct: 386  TSGEEVLQHQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEGAPSQNWS 445

Query: 1357 FFPMMQSGVS 1386
            FFPM+Q GVS
Sbjct: 446  FFPMIQPGVS 455


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  276 bits (706), Expect = 2e-71
 Identities = 182/427 (42%), Positives = 227/427 (53%), Gaps = 53/427 (12%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            ++ QKRRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + + +Q P++     
Sbjct: 35   ATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFA 94

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPV 618
                                  G++                 +FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPV 154

Query: 619  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQ 789
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ
Sbjct: 155  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQ 211

Query: 790  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIMLRE 951
              PGSPV  L               FP+  F      F EFR G  PPKLL LD +   E
Sbjct: 212  FHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCE 270

Query: 952  WESRQGSGTATPDP--RLRDNFLLNRQDSDVAPVSES--------TVSHRVSFEITNEEV 1101
            W S QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+ 
Sbjct: 271  WGSYQGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDA 330

Query: 1102 VRCVE--------------------KKEPNTGIERSLVEKTSVGESSNRKQXXXXXXXXX 1221
             RCVE                    K+E N+G E     +  VG +SN            
Sbjct: 331  SRCVEEKPAFSIKTVPEYVENGTQAKEEKNSG-ESIQSFECRVGVTSN--DSPEMASTDG 387

Query: 1222 XXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFP 1365
                 H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFP
Sbjct: 388  EAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFP 446

Query: 1366 MMQSGVS 1386
            M+QSGVS
Sbjct: 447  MVQSGVS 453


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  272 bits (695), Expect = 4e-70
 Identities = 180/423 (42%), Positives = 224/423 (52%), Gaps = 53/423 (12%)
 Frame = +1

Query: 277  KRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXXXX 453
            +RRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + + +Q P++         
Sbjct: 38   QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97

Query: 454  XXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFSTF 630
                              G++                 +FAIGPYAHETQLVSPPVFSTF
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 631  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPG 801
            TTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ  PG
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPG 214

Query: 802  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIMLREWESR 963
            SPV  L               FP+  F      F EFR G  PPKLL LD +   EW S 
Sbjct: 215  SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSY 273

Query: 964  QGSGTATPDP--RLRDNFLLNRQDSDVAPVSES--------TVSHRVSFEITNEEVVRCV 1113
            QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+  RCV
Sbjct: 274  QGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 333

Query: 1114 E--------------------KKEPNTGIERSLVEKTSVGESSNRKQXXXXXXXXXXXXX 1233
            E                    K+E N+G E     +  VG +SN                
Sbjct: 334  EEKPAFSIKTVPEYVENGTQAKEEKNSG-ESIQSFECRVGVTSN--DSPEMASTDGEAAP 390

Query: 1234 XHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQS 1377
             H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFPM+QS
Sbjct: 391  QHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQS 449

Query: 1378 GVS 1386
            GVS
Sbjct: 450  GVS 452


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  271 bits (694), Expect = 5e-70
 Identities = 178/424 (41%), Positives = 223/424 (52%), Gaps = 51/424 (12%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVH-SSQPPSIXXXX 438
            ++ QKRRWGS WS+YWCFG  +  KRIGHA+LVPE S+ G DS+ A + ++Q P+I    
Sbjct: 38   ATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPF 97

Query: 439  XXXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPP 615
                                   GIL                 +FAIGPYAHETQLVSPP
Sbjct: 98   VAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPP 157

Query: 616  VFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSY 786
             FSTFTTEPSTAP+TPPPESV +TTPSSPEVPFA+LLEP+ +NG+   R+PFS YEFQSY
Sbjct: 158  AFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSY 217

Query: 787  QLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIMLR 948
            Q  PGSPV  L               FP+  F      FLEF+    PPKLL LD + + 
Sbjct: 218  QFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMA-VPPKLLNLDKLSVH 276

Query: 949  EWESRQGSGTATPDP--RLRDNFLLNRQDSDVAP--------VSESTVSHRVSFEITNEE 1098
            E  SRQGSGT TPD       +F L+RQ SD+A           +     RVSF+++ E+
Sbjct: 277  ECGSRQGSGTLTPDAVRATSCSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAED 336

Query: 1099 VVRCVEKKEPN----------TGIERSLVEKTS---------VGESSNRKQXXXXXXXXX 1221
             +R  E K  +            I    V+K+S         VGE+SN            
Sbjct: 337  ALRYAEPKPASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSN--GILEQASTGG 394

Query: 1222 XXXXXHQKNRTITLGSSKEFNFENVD------EDSDWFVGEEGAGH----SEKWSFFPMM 1371
                 HQK+RT+TLG+ KEFNF+N D         DW+      G     ++ WSFFP+M
Sbjct: 395  EKTPRHQKHRTLTLGTFKEFNFDNADGVPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVM 454

Query: 1372 QSGV 1383
            Q  +
Sbjct: 455  QPSI 458


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  269 bits (688), Expect = 2e-69
 Identities = 178/461 (38%), Positives = 220/461 (47%), Gaps = 56/461 (12%)
 Frame = +1

Query: 172  MRRGANGPDXXXXXXXXXXXXXXXXXRGDHASSA--QKRRWGSFWSLYWCFGSPK-TKRI 342
            MRRG NG D                        A  QKRRW   W +YWCFG  +  KRI
Sbjct: 3    MRRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRKRI 62

Query: 343  GHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILXX 522
            GHA+++PET+S G +   A + +Q  SI                                
Sbjct: 63   GHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPG---FN 119

Query: 523  XXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHMTTPSSP 702
                          +FAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPP ESVH+T PSSP
Sbjct: 120  FSLSASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSP 179

Query: 703  EVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE 873
            EVPFA+LL+ N    + GQRYP S YEFQSYQ  PGSPV  L               F +
Sbjct: 180  EVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLD 239

Query: 874  HPF------FLEFRTGNYPPKLLELDTIMLREWESRQGSGTATPDPRLRDNF-------- 1011
              F      FLEFRTG   PK+L LD +  R+W SR  SG+ TPD     +         
Sbjct: 240  SEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPY 298

Query: 1012 ----LLNRQDSDVAPVSESTVSHRVSFEITNEEVVRCVEKK------------------- 1122
                +LN + +       +++ HRVSFE++ EEVVRCVEKK                   
Sbjct: 299  TPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAE 358

Query: 1123 -EPNTGIERSLVEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVD 1299
             E     E S   +  V ++SN                 +QK R+ITLGS+KEFNF+N D
Sbjct: 359  REEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNAD 418

Query: 1300 E--------DSDWFVGEEGA----GHSEKWSFFPMMQSGVS 1386
                      +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 419  GGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  266 bits (681), Expect = 2e-68
 Identities = 170/425 (40%), Positives = 212/425 (49%), Gaps = 54/425 (12%)
 Frame = +1

Query: 274  QKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXXX 450
            QKRRW   W +YWCFG  +  KRIGHA+++PET+S G +   A + +Q  SI        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 61

Query: 451  XXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXXMFAIGPYAHETQLVSPPVFSTF 630
                                                  +FAIGPYAHETQLVSPPVFSTF
Sbjct: 62   SSPASFLQSEPPSAMQSPG---FNFSLSASMYSPGPSSIFAIGPYAHETQLVSPPVFSTF 118

Query: 631  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPG 801
            TTEPSTAP+TPP ESVH+T PSSPEVPFA+LL+ N    + GQRYP S YEFQSYQ  PG
Sbjct: 119  TTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPG 178

Query: 802  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLEFRTGNYPPKLLELDTIMLREWESR 963
            SPV  L               F +  F      FLEFRTG   PK+L LD +  R+W SR
Sbjct: 179  SPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSR 237

Query: 964  QGSGTATPDPRLRDNF------------LLNRQDSDVAPVSESTVSHRVSFEITNEEVVR 1107
              SG+ TPD     +             +LN + +       +++ HRVSFE++ EEVVR
Sbjct: 238  LCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVR 297

Query: 1108 CVEKK--------------------EPNTGIERSLVEKTSVGESSNRKQXXXXXXXXXXX 1227
            CVEKK                    E     E S   +  V ++SN              
Sbjct: 298  CVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEEL 357

Query: 1228 XXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMM 1371
               +QK R+ITLGS+KEFNF+N D          +DW+  E+      G S+ WSFFPM+
Sbjct: 358  SYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMI 417

Query: 1372 QSGVS 1386
            Q G+S
Sbjct: 418  QPGMS 422


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  260 bits (665), Expect = 1e-66
 Identities = 168/411 (40%), Positives = 202/411 (49%), Gaps = 38/411 (9%)
 Frame = +1

Query: 268  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXX 447
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A + +Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 448  XXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFS 624
                                G+L                 +FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 625  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 795
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 796  PGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWESRQGSG 975
            PGSPV HL               FP+                                SG
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDR-------------------------------SG 244

Query: 976  TATPD---PRLRDNFLLNRQDSDVAPVSESTVSHRVSFEITNEEVVRCVEK--------- 1119
            + TPD   P  RD  +L   D    P +E  V HRVSFE+T E+VVRCVEK         
Sbjct: 245  SITPDALGPPSRDGSVL---DHSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 301

Query: 1120 ----KEPNT------GIERSLVEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGS 1269
                + P T        E  +  +  VGE++N                 H K R+ITLGS
Sbjct: 302  SASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGS 361

Query: 1270 SKEFNFENVDE--------DSDWFVGE----EGAGHSEKWSFFPMMQSGVS 1386
            +KEFNF+N D          SDW+  E    +  G S+ WS F MMQ  VS
Sbjct: 362  AKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 412


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  251 bits (640), Expect = 9e-64
 Identities = 167/459 (36%), Positives = 216/459 (47%), Gaps = 85/459 (18%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            ++ QK+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA + S P  I     
Sbjct: 28   TTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFI 87

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPV 618
                                  G+L                 +FAIGPYAHETQLV+PPV
Sbjct: 88   APPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPV 147

Query: 619  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEF 777
            FS  TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEF
Sbjct: 148  FSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEF 207

Query: 778  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWE 957
            QSYQ+ PGSP  +L               FP+    LEFR G   PKLL  +    R+W 
Sbjct: 208  QSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWG 266

Query: 958  SRQGSG----------------TATPD-------------------PRLRDNFLLNRQDS 1032
            SR GSG                + TPD                   P  RD FL+  Q S
Sbjct: 267  SRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQIS 326

Query: 1033 DVAPVS---------ESTVSHRVSFEITNEEVVRCVEKK--------------------E 1125
            +VA ++         E+ V HRVSFE++ E+V  C+E K                    +
Sbjct: 327  EVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRK 386

Query: 1126 PNTGIERSLVEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED 1305
               GI++ L     +       +              +QK+R++TLGS KEFNF+N   +
Sbjct: 387  ERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 446

Query: 1306 --------SDWFVGEEGAGHSEK----WSFFPMMQSGVS 1386
                    S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 447  ASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  251 bits (640), Expect = 9e-64
 Identities = 168/405 (41%), Positives = 210/405 (51%), Gaps = 45/405 (11%)
 Frame = +1

Query: 274  QKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXXX 450
            QK+RW S WS+YWCFG  K+KR IGHA+L PE+S+ G+ +  A +S+Q P +        
Sbjct: 38   QKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPP 97

Query: 451  XXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFST 627
                               G++                 +FAIGPYAHETQLVSPPVFST
Sbjct: 98   SSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 628  FTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQP 798
            FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L++P L+NG    R+PF   +FQSYQ  P
Sbjct: 158  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHP 214

Query: 799  GSPVSHLXXXXXXXXXXXXXXXFPEHPFFL------EFRTGNYPPKLLELDTIMLREWES 960
            GS V  L               FP+  F +      EFR G   PKLL LD +  REW S
Sbjct: 215  GSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG---PKLLNLDKLSTREWGS 271

Query: 961  RQGSGTATPDPRLRD--NFLLNRQDSDVA--PVSES------TVSHRVSFEITNEEVVRC 1110
             Q SG  TPD       NFLL+RQ SDVA  P SE+       V+HR SFE++ ++  RC
Sbjct: 272  YQDSGALTPDSVRHGSPNFLLHRQFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRC 331

Query: 1111 VEKKEPNTGIER---------SLVEKTSVGE--------SSNRKQXXXXXXXXXXXXXXH 1239
            VE+K P   I+             E+ + GE        S +                 H
Sbjct: 332  VEEK-PACSIKTVPEYVENGTKAKEEENYGELIQSFERRSGDTSNDTPETPSTDGEAPQH 390

Query: 1240 QKNRTITLGSSKEFNFENVDE-------DSDWFVGEEGAGHSEKW 1353
            +K + ITLGS  EFNF+N DE        S+W V +   G S  W
Sbjct: 391  RKQQPITLGSVNEFNFDNADEGDSHNPSSSNW-VKQPRTGPSSLW 434


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  248 bits (633), Expect = 6e-63
 Identities = 165/445 (37%), Positives = 212/445 (47%), Gaps = 71/445 (15%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            S+ QKRRWGS WSLYWCFGS K +KRIGHA+LVPE ++ G       + +   +I     
Sbjct: 28   STVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPAAPGPAVPVTENPNHSATIVIPFI 87

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPV 618
                                  G+L                 +FAIGPYAHETQLVSPPV
Sbjct: 88   APPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYSPGGTASIFAIGPYAHETQLVSPPV 147

Query: 619  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQRY-------PFSQYEF 777
            FSTFTTEPSTA +TPPPE VHMTTP SPEVPFA+LL  +L   +RY       P SQYEF
Sbjct: 148  FSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEF 207

Query: 778  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWE 957
              YQ  PGSP S+L               FP     +EFR G  PPK L  +    R+W 
Sbjct: 208  VPYQ-DPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGE-PPKFLGYEHFSTRKWG 265

Query: 958  SRQGSGTATP-------------------------------DPRLRDNFLLNRQDSDVAP 1044
            SR GSG+ TP                               +P  RD++LL  Q S+VA 
Sbjct: 266  SRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLEYQISEVAS 325

Query: 1045 ---------VSESTVSHRVSFEITNEEVVRCVEKKEPNTGIERSL----------VEKTS 1167
                     + E  + HRVSFE+T E+V  C EK+   +  +++L            K+ 
Sbjct: 326  LANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKEPVMSHSQQTLPMDVSNLLANEMKSG 385

Query: 1168 VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENV--------DEDSDWFVG 1323
               +  +                H+K+R IT GSSK+F+F+NV          D +W+  
Sbjct: 386  SSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTS 445

Query: 1324 EEGAGH----SEKWSFFPMMQSGVS 1386
            ++ AG        W+FFP++Q GVS
Sbjct: 446  DKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  248 bits (633), Expect = 6e-63
 Identities = 166/455 (36%), Positives = 213/455 (46%), Gaps = 85/455 (18%)
 Frame = +1

Query: 277  KRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXXXXXX 453
            K+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA + S P  I         
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95

Query: 454  XXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPVFSTF 630
                              G+L                 +FAIGPYAHETQLV+PPVFS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 631  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEFQSYQ 789
            TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQSYQ
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 790  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWESRQG 969
            + PGSP  +L               FP+    LEFR G   PKLL  +    R+W SR G
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLG 274

Query: 970  SG----------------TATPD-------------------PRLRDNFLLNRQDSDVAP 1044
            SG                + TPD                   P  RD FL+  Q S+VA 
Sbjct: 275  SGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVAL 334

Query: 1045 VS---------ESTVSHRVSFEITNEEVVRCVEKK--------------------EPNTG 1137
            ++         E+ V HRVSFE++ E+V  C+E K                    +   G
Sbjct: 335  LANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDG 394

Query: 1138 IERSLVEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED---- 1305
            I++ L     +       +              +QK+R++TLGS KEFNF+N   +    
Sbjct: 395  IKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDK 454

Query: 1306 ----SDWFVGEEGAGHSEK----WSFFPMMQSGVS 1386
                S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 455  PTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  248 bits (633), Expect = 6e-63
 Identities = 168/445 (37%), Positives = 213/445 (47%), Gaps = 71/445 (15%)
 Frame = +1

Query: 265  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSSQPPSIXXXXX 441
            S+ QKRRWGS WSLYWCFGS K +KRIGHA+LVPE  + G       + +   +I     
Sbjct: 28   STVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSATIVIPFI 87

Query: 442  XXXXXXXXXXXXXXXXXXXXXXGILXXXXXXXXXXXXXXXX-MFAIGPYAHETQLVSPPV 618
                                  G+L                 +FAIGPYAHETQLVSPPV
Sbjct: 88   APPSSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPV 147

Query: 619  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQRY-------PFSQYEF 777
            FSTFTTEPSTA +TPPPE VHMTTP SPEVPFA+LL  +L   +RY       P SQYEF
Sbjct: 148  FSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEF 207

Query: 778  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLEFRTGNYPPKLLELDTIMLREWE 957
              YQ  PGSP S+L               FP     +EFR G  PPK L  +    R+W 
Sbjct: 208  VPYQ-DPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGE-PPKFLGYEHFSTRKWG 265

Query: 958  SRQGSGTATP-------------------------------DPRLRDNFLLNRQDSDVAP 1044
            SR GSG+ TP                               +P  RD++LL  Q S+VA 
Sbjct: 266  SRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVAS 325

Query: 1045 ---------VSESTVSHRVSFEITNEEVVRCVEKK------EPNTGIERS--LVEKTSVG 1173
                     + E+ + HRVSFE+T E+V  C EK+      +P   ++ S  L  +   G
Sbjct: 326  LANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSG 385

Query: 1174 ES--SNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENV--------DEDSDWFVG 1323
             S    +                H+K+R IT GSSK+F+F+NV          D +W+  
Sbjct: 386  SSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTS 445

Query: 1324 EEGA----GHSEKWSFFPMMQSGVS 1386
            ++ A    G    W+FFP++Q GVS
Sbjct: 446  DKAAVKESGIQNNWTFFPVLQPGVS 470

