BLASTX nr result

ID: Mentha22_contig00013269 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00013269
         (1717 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   363   2e-97
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   333   2e-88
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   332   4e-88
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   331   5e-88
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   328   4e-87
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   327   8e-87
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   326   2e-86
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   324   7e-86
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     316   2e-83
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   306   2e-80
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   301   5e-79
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   301   6e-79
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   292   3e-76
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              291   5e-76
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   290   2e-75
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   283   2e-73
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   280   1e-72
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   277   1e-71
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   274   8e-71
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   273   2e-70

>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  363 bits (931), Expect = 2e-97
 Identities = 226/440 (51%), Positives = 252/440 (57%), Gaps = 35/440 (7%)
 Frame = +2

Query: 260  MRRGAN-GPDXXXXXXXXXXXXXXXXXRVDHASSAQKRRWGSFWSLYWCFGSPKTKRIGH 436
            MRRG N G D                    HASS QKRRW SFWSLYWCF     KRIGH
Sbjct: 1    MRRGVNNGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNKRIGH 60

Query: 437  AILVPETSSSGADSTTAVHSP-QPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILSMA 613
            A+LV ETSSS    T     P QPPSI                           G+LS++
Sbjct: 61   AVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSLS 120

Query: 614  SASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE-SVHMTTPSS 787
            S S N+YSP GPAS+FAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE S H+TTPSS
Sbjct: 121  SPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPSS 180

Query: 788  PEVPFARLLEPNLQNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE-- 961
            PEVPFARLLEPN    QRYP SQYEFQSYQLQPGSPVSHL               F +  
Sbjct: 181  PEVPFARLLEPN----QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPFLDRD 236

Query: 962  ----HPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATP----DPRSRDN-FLLNRQ 1114
                HPFFLE   GN P +          +WES Q SG  TP     PRSRD+  LLNRQ
Sbjct: 237  FAAVHPFFLEFGGGNPPRR---------DQWESCQESGVVTPTDAVGPRSRDSCVLLNRQ 287

Query: 1115 DSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKKEPNTGIERSLIEKTSVGES 1267
            +SD++P+ +         +A+ HRVSFEIT E+V+RCVEKK   T  E       SVG+ 
Sbjct: 288  NSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEKKSLETAQE-------SVGKK 340

Query: 1268 SNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFE--NVDE----DSDWFVGE----- 1414
                               HQKNRTITLGS+KEFNFE  N DE     S+W+V E     
Sbjct: 341  PIELINREEDQTEIVNEKRHQKNRTITLGSTKEFNFEGGNCDEPCVDSSEWWVNEKKVPK 400

Query: 1415 EGAGHSEKWSFFPMMQTGVS 1474
            EG G SE WSFFP++Q GVS
Sbjct: 401  EGGGSSENWSFFPILQPGVS 420


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  333 bits (853), Expect = 2e-88
 Identities = 200/423 (47%), Positives = 252/423 (59%), Gaps = 51/423 (12%)
 Frame = +2

Query: 347  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXX 523
            H +++QKRRWG  WS+ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I   
Sbjct: 34   HQATSQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLP 93

Query: 524  XXXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSP 700
                                    G++S+ S S NMYSPG P+S+FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 701  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 871
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 872  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVL 1033
            Y L PGSPV +L               FP+  F      F +   G+ PPKLL LDK+ +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 1034 REWESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITN 1180
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1181 EEVVRCVEKKEPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXX 1306
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1307 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1453
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1454 MMQ 1462
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  332 bits (850), Expect = 4e-88
 Identities = 199/423 (47%), Positives = 252/423 (59%), Gaps = 51/423 (12%)
 Frame = +2

Query: 347  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXX 523
            H +++QKRRWG  W++ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I   
Sbjct: 34   HQATSQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLP 93

Query: 524  XXXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSP 700
                                    G++S+ S S NMYSPG P+S+FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 701  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 871
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 872  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVL 1033
            Y L PGSPV +L               FP+  F      F +   G+ PPKLL LDK+ +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 1034 REWESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITN 1180
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1181 EEVVRCVEKKEPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXX 1306
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1307 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1453
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1454 MMQ 1462
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  331 bits (849), Expect = 5e-88
 Identities = 204/445 (45%), Positives = 242/445 (54%), Gaps = 72/445 (16%)
 Frame = +2

Query: 356  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXX 535
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 536  XXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVFS 712
                                G+LS+ S +AN+YSPG PAS+FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 713  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 883
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 884  PGSPVSHLXXXXXXXXXXXXXXXFPEHPF-------FLELRTGNYPPKLLELDKIVLREW 1042
            PGSPV HL               FP+  F       FLE R G  PPKLL LDK+   EW
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGG-PPKLLTLDKLSNHEW 274

Query: 1043 ESRQGSGTATPD---PRSRDNFLLNRQDSDV---------------------------AP 1132
             SR GSG+ TPD   P SRD  +L+RQ SDV                            P
Sbjct: 275  GSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCP 334

Query: 1133 VSESAVSHRVSFEITNEEVVRCVEKKEPN--TGIERSL-------IEKTS---------- 1255
             +E  V HRVSFE+T E+VVRCVEK        +  SL       I++ S          
Sbjct: 335  NNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGR 394

Query: 1256 VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVG 1411
            VGE++N                 H K R+ITLGS+KEFNF+N D          SDW+  
Sbjct: 395  VGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWAN 454

Query: 1412 E----EGAGHSEKWSFFPMMQTGVS 1474
            E    +  G S+ WS F MMQ  VS
Sbjct: 455  EKVVGKEVGASKNWSIFHMMQPSVS 479


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  328 bits (842), Expect = 4e-87
 Identities = 205/419 (48%), Positives = 242/419 (57%), Gaps = 45/419 (10%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            +S QKRRWG  WS+YWCFGS K TKRIGHA+ +PET++SGAD  ++  S Q PSI     
Sbjct: 35   ASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFI 94

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G   +   S + YSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLVSPPV 151

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 877
            FS FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQ
Sbjct: 152  FSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQ 211

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 1057
            LQPGSPVS+L               F +     E   G   P+ L L+KI   EW SRQG
Sbjct: 212  LQPGSPVSNLISPGSAISVSGTSSPFLDR----EYTPGR--PQFLNLEKIAPHEWGSRQG 265

Query: 1058 SGTATPD---PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCV 1201
            SGT TP+   P+  DNFLLN Q+S V  + +         + V HRVSFEIT E+VVRCV
Sbjct: 266  SGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCV 325

Query: 1202 EKKEP---NTG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXXH 1327
            EKK      TG       ERS   + ++ E SN          ++               
Sbjct: 326  EKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQ 385

Query: 1328 QKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 1474
            QK+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 386  QKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  327 bits (839), Expect = 8e-87
 Identities = 201/430 (46%), Positives = 246/430 (57%), Gaps = 56/430 (13%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ QKRRWGS+WS+YWCFG  +  KRIGHA+LVPET+  G D+  A +  Q PSI     
Sbjct: 35   ATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFV 94

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G  S+   +A+MYSP GP S+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSL---TASMYSPSGPTSIFAIGPYAHETQLVSPPV 151

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 877
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+ +N   GQR+P S YEFQSYQ
Sbjct: 152  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQ 211

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLRE 1039
            L PGSPV  L               FP+  F      FLE RTG+ PPKLL LD +  R+
Sbjct: 212  LYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGD-PPKLLNLDILSTRD 270

Query: 1040 WESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSES-------AVSHRVSFEITNE 1183
            W SR GSG+ TPD     S D FLL  Q  +V   P S +       +++HRVSFE+++E
Sbjct: 271  WGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSE 330

Query: 1184 EVVRCVEKK----------------------EPNTGIERSLIEKTSVGESSNRKQXXXXX 1297
            EV+RCVEKK                      +P+  +  S+     VGE+SN        
Sbjct: 331  EVIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSI---CPVGETSN--DAAEKA 385

Query: 1298 XXXXXXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWFVGE----EGAGHSEKWS 1444
                     H K R+ITLGS KEFNF+N D         SDW+  E    +  G ++ WS
Sbjct: 386  VADGEEAQLHPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWS 445

Query: 1445 FFPMMQTGVS 1474
            FFPMMQ GVS
Sbjct: 446  FFPMMQPGVS 455


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  326 bits (835), Expect = 2e-86
 Identities = 204/419 (48%), Positives = 242/419 (57%), Gaps = 45/419 (10%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            +S QKRRWGS WS+YWCFGS K TKRIGHA+ +PET++S AD  ++  S Q PSI     
Sbjct: 35   ASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFI 94

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G   +   S + YSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLVSPPV 151

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 877
            FS FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQ
Sbjct: 152  FSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQ 211

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 1057
            LQPGSPVS+L               F E     E   G   P+ L L+KI   EW SRQG
Sbjct: 212  LQPGSPVSNLISPGSAISVSGTSSPFLER----EYTPGR--PQFLNLEKIAPHEWGSRQG 265

Query: 1058 SGTATPD---PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCV 1201
            SGT TP+   P+  D+FLLN Q++ V  + +         + V HRVSFEIT E+VVRCV
Sbjct: 266  SGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCV 325

Query: 1202 EKKEP---NTG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXXH 1327
            EKK      TG       ERS   + ++ E SN          ++               
Sbjct: 326  EKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQ 385

Query: 1328 QKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 1474
            QK+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 386  QKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  324 bits (831), Expect = 7e-86
 Identities = 203/426 (47%), Positives = 241/426 (56%), Gaps = 53/426 (12%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ QKRRWG  WS+YWCFGS K K RIG A+L  ETS SGA+   A +  Q P+I     
Sbjct: 35   ATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFV 94

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPGPASMFAIGPYAHETQLVSPPVF 709
                                  G++S+ S SA+MYSPGPAS+FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPGPASIFAIGPYAHETQLVSPPVF 154

Query: 710  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG---QRYPFSQYEFQSYQL 880
            STFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL PNLQ G   QR+P S YEFQSYQL
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQL 214

Query: 881  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPF-----FLELRTGNYPPKLLELDKIVLREWE 1045
             PGSPV  L               F +  F     F E R G+ PPKLL LDK    EW 
Sbjct: 215  HPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLHFPEFRMGD-PPKLLNLDKHSSCEWG 273

Query: 1046 SRQGSGTATPD---PRSRDNFLLNRQDSDV----------APVSESAVSHRVSFEITNEE 1186
            S  GSGT TPD      R+ FLL+ Q S++              + A +HRVSFE+T EE
Sbjct: 274  SHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEE 333

Query: 1187 VVRCVEKK--EPNTGIERSL-IEKT----------------SVGESSNRKQXXXXXXXXX 1309
            VVR +E +   P+  +  SL IE T                 VGE+SN +          
Sbjct: 334  VVRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNER--PEKALADR 391

Query: 1310 XXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWF----VGEEGAGHSEKWSFFP 1453
                 H K+++ITLGS+KEFNF+NVD          SDW+    V  +G G    WSFFP
Sbjct: 392  EGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFP 451

Query: 1454 MMQTGV 1471
            MMQ GV
Sbjct: 452  MMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  316 bits (810), Expect = 2e-83
 Identities = 188/422 (44%), Positives = 241/422 (57%), Gaps = 48/422 (11%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ +KRRWG   S+YWCFG+PK + RIGH +LVPET+  G  +  A +S Q  ++     
Sbjct: 37   ATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFI 96

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPV 706
                                  G+LS+ S SA+MYSPG PAS+FAIGPYAHETQLVSPPV
Sbjct: 97   APPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPV 156

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 877
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN+ N   GQR+P    EFQSY 
Sbjct: 157  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYY 216

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPE------HPFFLELRTGNYPPKLLELDKIVLRE 1039
             QPGSP+  L               FP+       P FLE RTG+ PPKLL LDK+   +
Sbjct: 217  FQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGD-PPKLLNLDKLSKFD 275

Query: 1040 WESRQGSGTATPD---PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK 1210
            W SRQGSG+ TPD   P S      + + +     +E+    RVSF+++ E+V+R VEKK
Sbjct: 276  WGSRQGSGSLTPDSVKPISTFEVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKK 335

Query: 1211 ------------------EPNTGIERSLIE----KTSVGESSNRKQXXXXXXXXXXXXXX 1324
                              +     + + +E    +  VGE+SN  +              
Sbjct: 336  TVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN--EEPDKAPTSGEEVLQ 393

Query: 1325 HQKNRTITLGSSKEFNFENVDED--------SDWFVGEEGAGH----SEKWSFFPMMQTG 1468
            HQK+R+ITLGSSKEFNF+N D          SDW+  ++ AG     S+ WSFFPM+Q G
Sbjct: 394  HQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPG 453

Query: 1469 VS 1474
            VS
Sbjct: 454  VS 455


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  306 bits (783), Expect = 2e-80
 Identities = 193/427 (45%), Positives = 239/427 (55%), Gaps = 53/427 (12%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ QKRRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++     
Sbjct: 35   ATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFA 94

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G++S+ S SA+MYSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPV 154

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQ 877
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ
Sbjct: 155  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQ 211

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLRE 1039
              PGSPV  L               FP+  F      F E R G  PPKLL LDK+   E
Sbjct: 212  FHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCE 270

Query: 1040 WESRQGSGTATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEV 1189
            W S QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+ 
Sbjct: 271  WGSYQGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDA 330

Query: 1190 VRCVE--------------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXX 1309
             RCVE                    K+E N+G      E   VG +SN            
Sbjct: 331  SRCVEEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDG 387

Query: 1310 XXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFP 1453
                 H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFP
Sbjct: 388  EAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFP 446

Query: 1454 MMQTGVS 1474
            M+Q+GVS
Sbjct: 447  MVQSGVS 453


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  301 bits (772), Expect = 5e-79
 Identities = 191/423 (45%), Positives = 236/423 (55%), Gaps = 53/423 (12%)
 Frame = +2

Query: 365  KRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXX 541
            +RRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++         
Sbjct: 38   QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97

Query: 542  XXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTF 718
                              G++S+ S SA+MYSP GPAS+FAIGPYAHETQLVSPPVFSTF
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 719  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPG 889
            TTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ  PG
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPG 214

Query: 890  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLREWESR 1051
            SPV  L               FP+  F      F E R G  PPKLL LDK+   EW S 
Sbjct: 215  SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSY 273

Query: 1052 QGSGTATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCV 1201
            QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+  RCV
Sbjct: 274  QGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 333

Query: 1202 E--------------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXX 1321
            E                    K+E N+G      E   VG +SN                
Sbjct: 334  EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAP 390

Query: 1322 XHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQT 1465
             H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFPM+Q+
Sbjct: 391  QHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQS 449

Query: 1466 GVS 1474
            GVS
Sbjct: 450  GVS 452


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  301 bits (771), Expect = 6e-79
 Identities = 189/424 (44%), Positives = 237/424 (55%), Gaps = 51/424 (12%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVH-SPQPPSIXXXX 526
            ++ QKRRWGS WS+YWCFG  +  KRIGHA+LVPE S+ G DS+ A + + Q P+I    
Sbjct: 38   ATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPF 97

Query: 527  XXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPP 703
                                   GILS+ S SA+MYSP GPAS+FAIGPYAHETQLVSPP
Sbjct: 98   VAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPP 157

Query: 704  VFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSY 874
             FSTFTTEPSTAP+TPPPESV +TTPSSPEVPFA+LLEP+ +NG+   R+PFS YEFQSY
Sbjct: 158  AFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSY 217

Query: 875  QLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLR 1036
            Q  PGSPV  L               FP+  F      FLE +    PPKLL LDK+ + 
Sbjct: 218  QFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMA-VPPKLLNLDKLSVH 276

Query: 1037 EWESRQGSGTATPDP--RSRDNFLLNRQDSDVAP--------VSESAVSHRVSFEITNEE 1186
            E  SRQGSGT TPD    +  +F L+RQ SD+A           +     RVSF+++ E+
Sbjct: 277  ECGSRQGSGTLTPDAVRATSCSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAED 336

Query: 1187 VVRCVEKKEPN----------TGIERSLIEKTS---------VGESSNRKQXXXXXXXXX 1309
             +R  E K  +            I    ++K+S         VGE+SN            
Sbjct: 337  ALRYAEPKPASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSN--GILEQASTGG 394

Query: 1310 XXXXXHQKNRTITLGSSKEFNFENVD------EDSDWFVGEEGAGH----SEKWSFFPMM 1459
                 HQK+RT+TLG+ KEFNF+N D         DW+      G     ++ WSFFP+M
Sbjct: 395  EKTPRHQKHRTLTLGTFKEFNFDNADGVPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVM 454

Query: 1460 QTGV 1471
            Q  +
Sbjct: 455  QPSI 458


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  292 bits (748), Expect = 3e-76
 Identities = 189/461 (40%), Positives = 233/461 (50%), Gaps = 56/461 (12%)
 Frame = +2

Query: 260  MRRGANGPDXXXXXXXXXXXXXXXXXRVDHASSA--QKRRWGSFWSLYWCFGSPK-TKRI 430
            MRRG NG D                        A  QKRRW   W +YWCFG  +  KRI
Sbjct: 3    MRRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRKRI 62

Query: 431  GHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILSM 610
            GHA+++PET+S G +   A +  Q  SI                              S+
Sbjct: 63   GHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSL 122

Query: 611  ASASANMYSPGPASMFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHMTTPSSP 790
               SA+MYSPGP+S+FAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPP ESVH+T PSSP
Sbjct: 123  ---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSP 179

Query: 791  EVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE 961
            EVPFA+LL+ N    + GQRYP S YEFQSYQ  PGSPV  L               F +
Sbjct: 180  EVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLD 239

Query: 962  HPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNF----- 1099
              F      FLE RTG   PK+L LD +  R+W SR  SG+ TPD     S + F     
Sbjct: 240  SEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPY 298

Query: 1100 ----LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK------------------- 1210
                +LN + +       +++ HRVSFE++ EEVVRCVEKK                   
Sbjct: 299  TPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAE 358

Query: 1211 -EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVD 1387
             E     E S   +  V ++SN                 +QK R+ITLGS+KEFNF+N D
Sbjct: 359  REEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNAD 418

Query: 1388 E--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 1474
                      +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 419  GGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  291 bits (746), Expect = 5e-76
 Identities = 181/411 (44%), Positives = 217/411 (52%), Gaps = 38/411 (9%)
 Frame = +2

Query: 356  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXX 535
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 536  XXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVFS 712
                                G+LS+ S +AN+YSPG PAS+FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 713  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 883
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 884  PGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSG 1063
            PGSPV HL               FP+                                SG
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDR-------------------------------SG 244

Query: 1064 TATPD---PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKKEPN--TGI 1228
            + TPD   P SRD  +L   D    P +E  V HRVSFE+T E+VVRCVEK        +
Sbjct: 245  SITPDALGPPSRDGSVL---DHSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 301

Query: 1229 ERSL-------IEKTS----------VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGS 1357
              SL       I++ S          VGE++N                 H K R+ITLGS
Sbjct: 302  SASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGS 361

Query: 1358 SKEFNFENVDE--------DSDWFVGE----EGAGHSEKWSFFPMMQTGVS 1474
            +KEFNF+N D          SDW+  E    +  G S+ WS F MMQ  VS
Sbjct: 362  AKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 412


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  290 bits (741), Expect = 2e-75
 Identities = 181/425 (42%), Positives = 225/425 (52%), Gaps = 54/425 (12%)
 Frame = +2

Query: 362  QKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXX 538
            QKRRW   W +YWCFG  +  KRIGHA+++PET+S G +   A +  Q  SI        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 61

Query: 539  XXXXXXXXXXXXXXXXXXXGILSMASASANMYSPGPASMFAIGPYAHETQLVSPPVFSTF 718
                                  S+   SA+MYSPGP+S+FAIGPYAHETQLVSPPVFSTF
Sbjct: 62   SSPASFLQSEPPSAMQSPGFNFSL---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTF 118

Query: 719  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPG 889
            TTEPSTAP+TPP ESVH+T PSSPEVPFA+LL+ N    + GQRYP S YEFQSYQ  PG
Sbjct: 119  TTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPG 178

Query: 890  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLREWESR 1051
            SPV  L               F +  F      FLE RTG   PK+L LD +  R+W SR
Sbjct: 179  SPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSR 237

Query: 1052 QGSGTATPD---PRSRDNF---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVR 1195
              SG+ TPD     S + F         +LN + +       +++ HRVSFE++ EEVVR
Sbjct: 238  LCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVR 297

Query: 1196 CVEKK--------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXX 1315
            CVEKK                    E     E S   +  V ++SN              
Sbjct: 298  CVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEEL 357

Query: 1316 XXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMM 1459
               +QK R+ITLGS+KEFNF+N D          +DW+  E+      G S+ WSFFPM+
Sbjct: 358  SYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMI 417

Query: 1460 QTGVS 1474
            Q G+S
Sbjct: 418  QPGMS 422


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  283 bits (723), Expect = 2e-73
 Identities = 184/472 (38%), Positives = 228/472 (48%), Gaps = 98/472 (20%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXX 532
            ++ QKRRWG  WSLYWCFGS KTKRIGHA+L PE    GA  T+A +  Q  +I      
Sbjct: 42   TTVQKRRWGGCWSLYWCFGSHKTKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVPFIA 101

Query: 533  XXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVF 709
                                 G+LS+ S S N YSPG PAS+FAIGPYAHETQLV+PP F
Sbjct: 102  PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAF 161

Query: 710  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQ-------NGQRYPFSQYEFQ 868
            S FTTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQ
Sbjct: 162  SAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQ 221

Query: 869  SYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWES 1048
            SY L PGSP   L               FP+    LE R G   PKLL  +    R+W S
Sbjct: 222  SYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGE-APKLLGFEHFTTRKWGS 280

Query: 1049 RQGSGTATPD-------------------------------------------------- 1078
            R GSGT TPD                                                  
Sbjct: 281  RLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDAV 340

Query: 1079 -PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFEITNEEVVRCVEKKE----- 1213
             P SRD F L  Q S+VA ++         E+ V HRVSFE++ EEV RC+E K      
Sbjct: 341  GPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLASCR 400

Query: 1214 ------PNTGIERSL--------IEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITL 1351
                  P++  E  +         E    GE+S   +              ++K+R+ITL
Sbjct: 401  AFSECPPDSMAEDQIKSGKMLMTDENLPTGETSG--ETPEKPSGEMEEEHCYRKHRSITL 458

Query: 1352 GSSKEFNFENVDE-------DSDWFVGEEGAGH----SEKWSFFPMMQTGVS 1474
            GS KEFNF+N  E       +S+W+  E  AG     +  W+FFP++Q  VS
Sbjct: 459  GSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  280 bits (717), Expect = 1e-72
 Identities = 182/408 (44%), Positives = 223/408 (54%), Gaps = 48/408 (11%)
 Frame = +2

Query: 362  QKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXX 538
            QK+RW S WS+YWCFG  K+KR IGHA+L PE+S+ G+ +  A +S Q P +        
Sbjct: 38   QKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPP 97

Query: 539  XXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFST 715
                               G++S  S SA+MYSP GPAS+FAIGPYAHETQLVSPPVFST
Sbjct: 98   SSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 716  FTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQP 886
            FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L++P L+NG    R+PF   +FQSYQ  P
Sbjct: 158  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHP 214

Query: 887  GSPVSHLXXXXXXXXXXXXXXXFPEHPFFL------ELRTGNYPPKLLELDKIVLREWES 1048
            GS V  L               FP+  F +      E R G   PKLL LDK+  REW S
Sbjct: 215  GSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG---PKLLNLDKLSTREWGS 271

Query: 1049 RQGSGTATPDP--RSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRC 1198
             Q SG  TPD       NFLL+RQ SDVA  P SE+       V+HR SFE++ ++  RC
Sbjct: 272  YQDSGALTPDSVRHGSPNFLLHRQFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRC 331

Query: 1199 VEKK--------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXX 1318
            VE+K                    E N G      E+ S G++SN               
Sbjct: 332  VEEKPACSIKTVPEYVENGTKAKEEENYGELIQSFERRS-GDTSN---DTPETPSTDGEA 387

Query: 1319 XXHQKNRTITLGSSKEFNFENVDE-------DSDWFVGEEGAGHSEKW 1441
              H+K + ITLGS  EFNF+N DE        S+W V +   G S  W
Sbjct: 388  PQHRKQQPITLGSVNEFNFDNADEGDSHNPSSSNW-VKQPRTGPSSLW 434


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  277 bits (708), Expect = 1e-71
 Identities = 177/459 (38%), Positives = 227/459 (49%), Gaps = 85/459 (18%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ QK+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I     
Sbjct: 28   TTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFI 87

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G+LS+ S S N YSP GPAS+FAIGPYAHETQLV+PPV
Sbjct: 88   APPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPV 147

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEF 865
            FS  TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEF
Sbjct: 148  FSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEF 207

Query: 866  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWE 1045
            QSYQ+ PGSP  +L               FP+    LE R G   PKLL  +    R+W 
Sbjct: 208  QSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWG 266

Query: 1046 SRQGSG----------------TATPD-------------------PRSRDNFLLNRQDS 1120
            SR GSG                + TPD                   P SRD FL+  Q S
Sbjct: 267  SRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQIS 326

Query: 1121 DVAPVS---------ESAVSHRVSFEITNEEVVRCVEKK--------------------E 1213
            +VA ++         E+ V HRVSFE++ E+V  C+E K                    +
Sbjct: 327  EVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRK 386

Query: 1214 PNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED 1393
               GI++ L     +       +              +QK+R++TLGS KEFNF+N   +
Sbjct: 387  ERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 446

Query: 1394 --------SDWFVGEEGAGHSEK----WSFFPMMQTGVS 1474
                    S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 447  ASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  274 bits (701), Expect = 8e-71
 Identities = 176/455 (38%), Positives = 224/455 (49%), Gaps = 85/455 (18%)
 Frame = +2

Query: 365  KRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXX 541
            K+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I         
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95

Query: 542  XXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTF 718
                              G+LS+ S S N YSP GPAS+FAIGPYAHETQLV+PPVFS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 719  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEFQSYQ 877
            TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQSYQ
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 878  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 1057
            + PGSP  +L               FP+    LE R G   PKLL  +    R+W SR G
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLG 274

Query: 1058 SG----------------TATPD-------------------PRSRDNFLLNRQDSDVAP 1132
            SG                + TPD                   P SRD FL+  Q S+VA 
Sbjct: 275  SGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVAL 334

Query: 1133 VS---------ESAVSHRVSFEITNEEVVRCVEKK--------------------EPNTG 1225
            ++         E+ V HRVSFE++ E+V  C+E K                    +   G
Sbjct: 335  LANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDG 394

Query: 1226 IERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED---- 1393
            I++ L     +       +              +QK+R++TLGS KEFNF+N   +    
Sbjct: 395  IKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDK 454

Query: 1394 ----SDWFVGEEGAGHSEK----WSFFPMMQTGVS 1474
                S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 455  PTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  273 bits (697), Expect = 2e-70
 Identities = 178/430 (41%), Positives = 220/430 (51%), Gaps = 56/430 (13%)
 Frame = +2

Query: 353  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 529
            ++ QKRRWGS  SLYWCFGS + +KRIGHA+LVPE    GA +  + +     SI     
Sbjct: 28   TTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFI 87

Query: 530  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 706
                                  G LS+ + S N YSP GPASMFAIGPYAHETQLVSPPV
Sbjct: 88   APPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPV 147

Query: 707  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQ-------NGQRYPFSQYEF 865
            FSTF TEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L          Q+   S YEF
Sbjct: 148  FSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEF 207

Query: 866  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWE 1045
            Q YQL P SPV HL               FP+    +E       PKLL  +    R W 
Sbjct: 208  QPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPIVE------APKLLGFEHFSTRRWG 258

Query: 1046 SRQGSGTATPD---PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFEITNEEV 1189
            SR GSG+ TPD   P SRD+FLL  Q S+VA ++         E+ + HRVSFE+  E+V
Sbjct: 259  SRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDV 318

Query: 1190 VRCVEKKEPNTG----------IERSLIEKTSVGESSNR------------KQXXXXXXX 1303
              CVEKK   +           +E   IE+   G S +             K        
Sbjct: 319  AVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASA 378

Query: 1304 XXXXXXXHQKNRTITLGSSKEFNFENVDED---------SDWFVGE----EGAGHSEKWS 1444
                   H+K+  I  GS KEFNF+N   +         S+W+V E    +G G    W+
Sbjct: 379  EGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWT 438

Query: 1445 FFPMMQTGVS 1474
            FFP++Q G+S
Sbjct: 439  FFPLLQPGIS 448


Top