BLASTX nr result

ID: Mentha24_contig00032254 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00032254
         (1584 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus...   363   2e-97
ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626...   333   2e-88
ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citr...   332   4e-88
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   331   5e-88
ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660...   328   3e-87
ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prun...   327   7e-87
ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254...   326   2e-86
ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family prot...   324   6e-86
gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]     316   2e-83
ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Popu...   306   2e-80
ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Popu...   301   4e-79
ref|XP_002509822.1| conserved hypothetical protein [Ricinus comm...   301   5e-79
ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309...   292   3e-76
emb|CBI34651.3| unnamed protein product [Vitis vinifera]              291   4e-76
ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309...   290   2e-75
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   283   2e-73
ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Popu...   280   1e-72
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   277   1e-71
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   274   7e-71
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   273   2e-70

>gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Mimulus guttatus]
          Length = 420

 Score =  363 bits (931), Expect = 2e-97
 Identities = 226/440 (51%), Positives = 252/440 (57%), Gaps = 35/440 (7%)
 Frame = +2

Query: 149  MRRGAN-GPDXXXXXXXXXXXXXXXXXRVDHASSAQKRRWGSFWSLYWCFGSPKTKRIGH 325
            MRRG N G D                    HASS QKRRW SFWSLYWCF     KRIGH
Sbjct: 1    MRRGVNNGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNKRIGH 60

Query: 326  AILVPETSSSGADSTTAVHSP-QPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILSMA 502
            A+LV ETSSS    T     P QPPSI                           G+LS++
Sbjct: 61   AVLVTETSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSLS 120

Query: 503  SASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE-SVHMTTPSS 676
            S S N+YSP GPAS+FAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPE S H+TTPSS
Sbjct: 121  SPSGNIYSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPSS 180

Query: 677  PEVPFARLLEPNLQNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE-- 850
            PEVPFARLLEPN    QRYP SQYEFQSYQLQPGSPVSHL               F +  
Sbjct: 181  PEVPFARLLEPN----QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPFLDRD 236

Query: 851  ----HPFFLELRTGNYPPKLLELDKIVLREWESRQGSGTATP----DPRSRDN-FLLNRQ 1003
                HPFFLE   GN P +          +WES Q SG  TP     PRSRD+  LLNRQ
Sbjct: 237  FAAVHPFFLEFGGGNPPRR---------DQWESCQESGVVTPTDAVGPRSRDSCVLLNRQ 287

Query: 1004 DSDVAPVSE---------SAVSHRVSFEITNEEVVRCVEKKEPNTGIERSLIEKTSVGES 1156
            +SD++P+ +         +A+ HRVSFEIT E+V+RCVEKK   T  E       SVG+ 
Sbjct: 288  NSDISPLPDNCTGLENDVAAIDHRVSFEITAEKVIRCVEKKSLETAQE-------SVGKK 340

Query: 1157 SNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFE--NVDE----DSDWFVGE----- 1303
                               HQKNRTITLGS+KEFNFE  N DE     S+W+V E     
Sbjct: 341  PIELINREEDQTEIVNEKRHQKNRTITLGSTKEFNFEGGNCDEPCVDSSEWWVNEKKVPK 400

Query: 1304 EGAGHSEKWSFFPMMQTGVS 1363
            EG G SE WSFFP++Q GVS
Sbjct: 401  EGGGSSENWSFFPILQPGVS 420


>ref|XP_006476541.1| PREDICTED: uncharacterized protein LOC102626793 [Citrus sinensis]
          Length = 460

 Score =  333 bits (853), Expect = 2e-88
 Identities = 200/423 (47%), Positives = 252/423 (59%), Gaps = 51/423 (12%)
 Frame = +2

Query: 236  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXX 412
            H +++QKRRWG  WS+ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I   
Sbjct: 34   HQATSQKRRWGGCWSISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQAAAISLP 93

Query: 413  XXXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSP 589
                                    G++S+ S S NMYSPG P+S+FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 590  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 760
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 761  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVL 922
            Y L PGSPV +L               FP+  F      F +   G+ PPKLL LDK+ +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 923  REWESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITN 1069
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVGSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1070 EEVVRCVEKKEPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXX 1195
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1196 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1342
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1343 MMQ 1351
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_006439523.1| hypothetical protein CICLE_v10020073mg [Citrus clementina]
            gi|557541785|gb|ESR52763.1| hypothetical protein
            CICLE_v10020073mg [Citrus clementina]
          Length = 460

 Score =  332 bits (850), Expect = 4e-88
 Identities = 199/423 (47%), Positives = 252/423 (59%), Gaps = 51/423 (12%)
 Frame = +2

Query: 236  HASSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXX 412
            H +++QKRRWG  W++ WCFG  K  KRIGHA+LVPE ++S ++++ AV+S Q  +I   
Sbjct: 34   HQATSQKRRWGGCWNISWCFGFQKHRKRIGHAVLVPEPTASRSNASEAVNSTQATAISLP 93

Query: 413  XXXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSP 589
                                    G++S+ S S NMYSPG P+S+FAIGPYAHETQLVSP
Sbjct: 94   FVAPPSSPASFLQSEPPSATQSPAGLVSLNSISGNMYSPGGPSSIFAIGPYAHETQLVSP 153

Query: 590  PVFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQS 760
            PVFSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+L   + GQ++PFS YEFQS
Sbjct: 154  PVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSLRFGEQGQKFPFSYYEFQS 213

Query: 761  YQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVL 922
            Y L PGSPV +L               FP+  F      F +   G+ PPKLL LDK+ +
Sbjct: 214  YHLHPGSPVGNLISPSSGISGSGTSSPFPDGEFATAGPQFPDFHRGD-PPKLLNLDKLSI 272

Query: 923  REWESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITN 1069
            REW SRQGSGT TPD      R+ F  NRQ S+VA  P SE+       V HRVSFE+T 
Sbjct: 273  REWGSRQGSGTLTPDAVRSTPRNGFFQNRQISEVALRPHSENGLRKDQIVDHRVSFELTT 332

Query: 1070 EEVVRCVEKKEPNT---GIERSLIEKTSV------GESSN---------RKQXXXXXXXX 1195
            E+VVRCVEKK P T    +  SL   T+V      GE+ N                    
Sbjct: 333  EDVVRCVEKK-PTTLAEAVSESLQNGTTVEKEESSGEAENVHHSCAGEAANDEPLKTPVD 391

Query: 1196 XXXXXXHQKNRTITLGSSKEFNFENVDED-------SDWFVGE----EGAGHSEKWSFFP 1342
                  HQK ++ITLGS+KEFNF++ D D       SDW+  E    + +G  + W+FFP
Sbjct: 392  VEEAPRHQKQQSITLGSTKEFNFDSADGDSHEPTIASDWWANEKVVGKDSGAIKNWAFFP 451

Query: 1343 MMQ 1351
            ++Q
Sbjct: 452  VIQ 454


>ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera]
          Length = 479

 Score =  331 bits (849), Expect = 5e-88
 Identities = 204/445 (45%), Positives = 242/445 (54%), Gaps = 72/445 (16%)
 Frame = +2

Query: 245  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXX 424
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 425  XXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVFS 601
                                G+LS+ S +AN+YSPG PAS+FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 602  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 772
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 773  PGSPVSHLXXXXXXXXXXXXXXXFPEHPF-------FLELRTGNYPPKLLELDKIVLREW 931
            PGSPV HL               FP+  F       FLE R G  PPKLL LDK+   EW
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGG-PPKLLTLDKLSNHEW 274

Query: 932  ESRQGSGTATPD---PRSRDNFLLNRQDSDV---------------------------AP 1021
             SR GSG+ TPD   P SRD  +L+RQ SDV                            P
Sbjct: 275  GSRIGSGSITPDALGPPSRDGSVLDRQVSDVIHPPSGDDSVLDRQISDVASHSLSDSGCP 334

Query: 1022 VSESAVSHRVSFEITNEEVVRCVEKKEPN--TGIERSL-------IEKTS---------- 1144
             +E  V HRVSFE+T E+VVRCVEK        +  SL       I++ S          
Sbjct: 335  NNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAVSASLQNPATVEIDENSREVVVDSEGR 394

Query: 1145 VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVG 1300
            VGE++N                 H K R+ITLGS+KEFNF+N D          SDW+  
Sbjct: 395  VGETANNPPEKAPEDANGEEGQPHHKQRSITLGSAKEFNFDNADGGHSDKPNISSDWWAN 454

Query: 1301 E----EGAGHSEKWSFFPMMQTGVS 1363
            E    +  G S+ WS F MMQ  VS
Sbjct: 455  EKVVGKEVGASKNWSIFHMMQPSVS 479


>ref|XP_006343965.1| PREDICTED: uncharacterized protein At1g76660-like [Solanum tuberosum]
          Length = 443

 Score =  328 bits (842), Expect = 3e-87
 Identities = 205/419 (48%), Positives = 242/419 (57%), Gaps = 45/419 (10%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            +S QKRRWG  WS+YWCFGS K TKRIGHA+ +PET++SGAD  ++  S Q PSI     
Sbjct: 35   ASIQKRRWGGCWSMYWCFGSQKQTKRIGHAVFIPETTASGADRPSSNTSSQAPSIVLPFI 94

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G   +   S + YSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLVSPPV 151

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 766
            FS FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQ
Sbjct: 152  FSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQ 211

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 946
            LQPGSPVS+L               F +     E   G   P+ L L+KI   EW SRQG
Sbjct: 212  LQPGSPVSNLISPGSAISVSGTSSPFLDR----EYTPGR--PQFLNLEKIAPHEWGSRQG 265

Query: 947  SGTATPD---PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCV 1090
            SGT TP+   P+  DNFLLN Q+S V  + +         + V HRVSFEIT E+VVRCV
Sbjct: 266  SGTLTPEAVNPKYHDNFLLNYQNSGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCV 325

Query: 1091 EKKEP---NTG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXXH 1216
            EKK      TG       ERS   + ++ E SN          ++               
Sbjct: 326  EKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNGHDHGGHEPSREIHEGSSTDGEDGQRQ 385

Query: 1217 QKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 1363
            QK+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 386  QKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007209129.1| hypothetical protein PRUPE_ppa005552mg [Prunus persica]
            gi|462404864|gb|EMJ10328.1| hypothetical protein
            PRUPE_ppa005552mg [Prunus persica]
          Length = 455

 Score =  327 bits (839), Expect = 7e-87
 Identities = 201/430 (46%), Positives = 246/430 (57%), Gaps = 56/430 (13%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ QKRRWGS+WS+YWCFG  +  KRIGHA+LVPET+  G D+  A +  Q PSI     
Sbjct: 35   ATVQKRRWGSWWSMYWCFGFQRHKKRIGHAVLVPETTDRGGDAPRAENPIQTPSIVLPFV 94

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G  S+   +A+MYSP GP S+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLQSEPPSATQSPAGFFSL---TASMYSPSGPTSIFAIGPYAHETQLVSPPV 151

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 766
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+P+ +N   GQR+P S YEFQSYQ
Sbjct: 152  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHFRNGEGGQRFPLSHYEFQSYQ 211

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLRE 928
            L PGSPV  L               FP+  F      FLE RTG+ PPKLL LD +  R+
Sbjct: 212  LYPGSPVGQLISPSSGISGSGTSSPFPDLEFAARGHHFLEFRTGD-PPKLLNLDILSTRD 270

Query: 929  WESRQGSGTATPD---PRSRDNFLLNRQDSDVA--PVSES-------AVSHRVSFEITNE 1072
            W SR GSG+ TPD     S D FLL  Q  +V   P S +       +++HRVSFE+++E
Sbjct: 271  WGSRLGSGSVTPDGAKSTSSDGFLLKPQTPEVVLNPRSNNRGRNNDISINHRVSFELSSE 330

Query: 1073 EVVRCVEKK----------------------EPNTGIERSLIEKTSVGESSNRKQXXXXX 1186
            EV+RCVEKK                      +P+  +  S+     VGE+SN        
Sbjct: 331  EVIRCVEKKPVALAEAVSTSLEDTEKAQSKEDPSKVVSSSI---CPVGETSN--DAAEKA 385

Query: 1187 XXXXXXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWFVGE----EGAGHSEKWS 1333
                     H K R+ITLGS KEFNF+N D         SDW+  E    +  G ++ WS
Sbjct: 386  VADGEEAQLHPKQRSITLGSVKEFNFDNPDGGDSGNSIGSDWWANEKVDAKENGPTKNWS 445

Query: 1334 FFPMMQTGVS 1363
            FFPMMQ GVS
Sbjct: 446  FFPMMQPGVS 455


>ref|XP_004245591.1| PREDICTED: uncharacterized protein LOC101254118 [Solanum
            lycopersicum]
          Length = 443

 Score =  326 bits (835), Expect = 2e-86
 Identities = 204/419 (48%), Positives = 242/419 (57%), Gaps = 45/419 (10%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            +S QKRRWGS WS+YWCFGS K TKRIGHA+ +PET++S AD  ++  S Q PSI     
Sbjct: 35   ASIQKRRWGSCWSMYWCFGSQKQTKRIGHAVFIPETTASAADRPSSNTSSQAPSIVLPFI 94

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G   +   S + YSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFLPSEPPSATHSPVGSKCL---SMSTYSPSGPASIFAIGPYAHETQLVSPPV 151

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 766
            FS FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN QN   G RYPF+QYEFQSYQ
Sbjct: 152  FSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNYQNVAAGHRYPFAQYEFQSYQ 211

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 946
            LQPGSPVS+L               F E     E   G   P+ L L+KI   EW SRQG
Sbjct: 212  LQPGSPVSNLISPGSAISVSGTSSPFLER----EYTPGR--PQFLNLEKIAPHEWGSRQG 265

Query: 947  SGTATPD---PRSRDNFLLNRQDSDVAPVSE---------SAVSHRVSFEITNEEVVRCV 1090
            SGT TP+   P+  D+FLLN Q++ V  + +         + V HRVSFEIT E+VVRCV
Sbjct: 266  SGTLTPEAVNPKYHDSFLLNYQNTGVHRLPKPFNGWKNDLTVVDHRVSFEITAEDVVRCV 325

Query: 1091 EKKEP---NTG------IERSLIEKTSVGESSN---------RKQXXXXXXXXXXXXXXH 1216
            EKK      TG       ERS   + ++ E SN          ++               
Sbjct: 326  EKKPTMMMRTGSVSLQDTERSTKRQENLAEMSNAHDHSGHEPSREIHEGSSTDGEDGQRQ 385

Query: 1217 QKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGAGHS--EKWSFFPMMQTGVS 1363
            QK+R+ITLGSSKEFNF+NVD          SDW+  E+  G      W  FPMMQ GVS
Sbjct: 386  QKHRSITLGSSKEFNFDNVDGGYPDKATIGSDWWANEKVLGKEPCNNW-IFPMMQPGVS 443


>ref|XP_007040283.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
            gi|508777528|gb|EOY24784.1| Hydroxyproline-rich
            glycoprotein family protein [Theobroma cacao]
          Length = 458

 Score =  324 bits (831), Expect = 6e-86
 Identities = 203/426 (47%), Positives = 241/426 (56%), Gaps = 53/426 (12%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ QKRRWG  WS+YWCFGS K K RIG A+L  ETS SGA+   A +  Q P+I     
Sbjct: 35   ATVQKRRWGGCWSIYWCFGSYKQKKRIGPAVLTSETSFSGANVPAAENPTQAPAIALPFV 94

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPGPASMFAIGPYAHETQLVSPPVF 598
                                  G++S+ S SA+MYSPGPAS+FAIGPYAHETQLVSPPVF
Sbjct: 95   APPSSPASFLPSEPPSATQSPAGLVSLTSISASMYSPGPASIFAIGPYAHETQLVSPPVF 154

Query: 599  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG---QRYPFSQYEFQSYQL 769
            STFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL PNLQ G   QR+P S YEFQSYQL
Sbjct: 155  STFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLGPNLQYGEGVQRFPISHYEFQSYQL 214

Query: 770  QPGSPVSHLXXXXXXXXXXXXXXXFPEHPF-----FLELRTGNYPPKLLELDKIVLREWE 934
             PGSPV  L               F +  F     F E R G+ PPKLL LDK    EW 
Sbjct: 215  HPGSPVGQLISPSSGISGSGTSSPFRDGEFAASLHFPEFRMGD-PPKLLNLDKHSSCEWG 273

Query: 935  SRQGSGTATPD---PRSRDNFLLNRQDSDV----------APVSESAVSHRVSFEITNEE 1075
            S  GSGT TPD      R+ FLL+ Q S++              + A +HRVSFE+T EE
Sbjct: 274  SHHGSGTLTPDATRSTPRNGFLLDHQISEITSHPHLKNKEVQNDQVAHNHRVSFELTTEE 333

Query: 1076 VVRCVEKK--EPNTGIERSL-IEKT----------------SVGESSNRKQXXXXXXXXX 1198
            VVR +E +   P+  +  SL IE T                 VGE+SN +          
Sbjct: 334  VVRSLEMETATPSEAVSGSLQIEATRESEEHDTKVVDDYECRVGETSNER--PEKALADR 391

Query: 1199 XXXXXHQKNRTITLGSSKEFNFENVDE--------DSDWF----VGEEGAGHSEKWSFFP 1342
                 H K+++ITLGS+KEFNF+NVD          SDW+    V  +G G    WSFFP
Sbjct: 392  EGKPQHHKHQSITLGSAKEFNFDNVDGGDAHKPILTSDWWANDKVAGKGGGVPRNWSFFP 451

Query: 1343 MMQTGV 1360
            MMQ GV
Sbjct: 452  MMQPGV 457


>gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis]
          Length = 455

 Score =  316 bits (810), Expect = 2e-83
 Identities = 188/422 (44%), Positives = 241/422 (57%), Gaps = 48/422 (11%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPKTK-RIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ +KRRWG   S+YWCFG+PK + RIGH +LVPET+  G  +  A +S Q  ++     
Sbjct: 37   ATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQPGNSAPRAENSTQTHAVILPFI 96

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPV 595
                                  G+LS+ S SA+MYSPG PAS+FAIGPYAHETQLVSPPV
Sbjct: 97   APPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQLVSPPV 156

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQN---GQRYPFSQYEFQSYQ 766
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+LL+PN+ N   GQR+P    EFQSY 
Sbjct: 157  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNEFQSYY 216

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPE------HPFFLELRTGNYPPKLLELDKIVLRE 928
             QPGSP+  L               FP+       P FLE RTG+ PPKLL LDK+   +
Sbjct: 217  FQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGD-PPKLLNLDKLSKFD 275

Query: 929  WESRQGSGTATPD---PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK 1099
            W SRQGSG+ TPD   P S      + + +     +E+    RVSF+++ E+V+R VEKK
Sbjct: 276  WGSRQGSGSLTPDSVKPISTFEVAPHLKPNGRCRNAENVADRRVSFDVSTEDVIRYVEKK 335

Query: 1100 ------------------EPNTGIERSLIE----KTSVGESSNRKQXXXXXXXXXXXXXX 1213
                              +     + + +E    +  VGE+SN  +              
Sbjct: 336  TVPLAEAMLTSLKDTTMGQREENSDSNKVEEIGCENRVGETSN--EEPDKAPTSGEEVLQ 393

Query: 1214 HQKNRTITLGSSKEFNFENVDED--------SDWFVGEEGAGH----SEKWSFFPMMQTG 1357
            HQK+R+ITLGSSKEFNF+N D          SDW+  ++ AG     S+ WSFFPM+Q G
Sbjct: 394  HQKHRSITLGSSKEFNFDNADAGDLHKSDSVSDWWANQKVAGKEGAPSQNWSFFPMIQPG 453

Query: 1358 VS 1363
            VS
Sbjct: 454  VS 455


>ref|XP_002298027.2| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346901|gb|EEE82832.2| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 453

 Score =  306 bits (783), Expect = 2e-80
 Identities = 193/427 (45%), Positives = 239/427 (55%), Gaps = 53/427 (12%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ QKRRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++     
Sbjct: 35   ATVQKRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFA 94

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G++S+ S SA+MYSP GPAS+FAIGPYAHETQLVSPPV
Sbjct: 95   APPSSPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPV 154

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQ 766
            FSTFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ
Sbjct: 155  FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQ 211

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLRE 928
              PGSPV  L               FP+  F      F E R G  PPKLL LDK+   E
Sbjct: 212  FHPGSPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCE 270

Query: 929  WESRQGSGTATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEV 1078
            W S QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+ 
Sbjct: 271  WGSYQGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDA 330

Query: 1079 VRCVE--------------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXX 1198
             RCVE                    K+E N+G      E   VG +SN            
Sbjct: 331  SRCVEEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDG 387

Query: 1199 XXXXXHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFP 1342
                 H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFP
Sbjct: 388  EAAPQHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFP 446

Query: 1343 MMQTGVS 1363
            M+Q+GVS
Sbjct: 447  MVQSGVS 453


>ref|XP_006368761.1| hypothetical protein POPTR_0001s09590g [Populus trichocarpa]
            gi|550346902|gb|ERP65330.1| hypothetical protein
            POPTR_0001s09590g [Populus trichocarpa]
          Length = 452

 Score =  301 bits (772), Expect = 4e-79
 Identities = 191/423 (45%), Positives = 236/423 (55%), Gaps = 53/423 (12%)
 Frame = +2

Query: 254  KRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXX 430
            +RRWGS WS+Y CFG  K K+ IGHA+L PE S+ G  +  + +  Q P++         
Sbjct: 38   QRRWGSCWSIYLCFGYQKHKKQIGHAVLFPEPSAPGNGAPASENPTQAPAVTLPFAAPPS 97

Query: 431  XXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTF 607
                              G++S+ S SA+MYSP GPAS+FAIGPYAHETQLVSPPVFSTF
Sbjct: 98   SPASFFQSEPPSVTQSPAGLVSLTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFSTF 157

Query: 608  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQPG 778
            TTEPSTAP+TPPPESVH+TTPSSPEVPFA+ L+P+L+NG    R+PF   +FQSYQ  PG
Sbjct: 158  TTEPSTAPFTPPPESVHLTTPSSPEVPFAQFLDPSLRNGDTGLRFPF---DFQSYQFHPG 214

Query: 779  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLREWESR 940
            SPV  L               FP+  F      F E R G  PPKLL LDK+   EW S 
Sbjct: 215  SPVGQLISPSSGISGSGTSSPFPDGEFAVGGAHFPEFRIGE-PPKLLNLDKLSTCEWGSY 273

Query: 941  QGSGTATPDP--RSRDNFLLNRQDSDVAPVSES--------AVSHRVSFEITNEEVVRCV 1090
            QGSG  TP+   R   NFLL+RQ SDV     S         V+HRVSFE+T E+  RCV
Sbjct: 274  QGSGALTPESVRRGSPNFLLHRQFSDVPSRPRSGNGHKNGQVVNHRVSFELTAEDASRCV 333

Query: 1091 E--------------------KKEPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXX 1210
            E                    K+E N+G      E   VG +SN                
Sbjct: 334  EEKPAFSIKTVPEYVENGTQAKEEKNSGESIQSFE-CRVGVTSN--DSPEMASTDGEAAP 390

Query: 1211 XHQKNRTITLGSSKEFNFENVDE-------DSDWF-----VGEEGAGHSEKWSFFPMMQT 1354
             H+K ++ITLGS KEFNF+N DE        S+W+     +G+EG   ++ WSFFPM+Q+
Sbjct: 391  QHRKQQSITLGSVKEFNFDNADEGDSRKPSSSNWWANGSVIGKEGE-TTKNWSFFPMVQS 449

Query: 1355 GVS 1363
            GVS
Sbjct: 450  GVS 452


>ref|XP_002509822.1| conserved hypothetical protein [Ricinus communis]
            gi|223549721|gb|EEF51209.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 459

 Score =  301 bits (771), Expect = 5e-79
 Identities = 189/424 (44%), Positives = 237/424 (55%), Gaps = 51/424 (12%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVH-SPQPPSIXXXX 415
            ++ QKRRWGS WS+YWCFG  +  KRIGHA+LVPE S+ G DS+ A + + Q P+I    
Sbjct: 38   ATIQKRRWGSCWSVYWCFGYHRHRKRIGHAVLVPENSAPGNDSSAAENPTTQAPTITLPF 97

Query: 416  XXXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPP 592
                                   GILS+ S SA+MYSP GPAS+FAIGPYAHETQLVSPP
Sbjct: 98   VAPPSSPASFLQSEPPSASQSPAGILSLTSVSASMYSPSGPASIFAIGPYAHETQLVSPP 157

Query: 593  VFSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSY 763
             FSTFTTEPSTAP+TPPPESV +TTPSSPEVPFA+LLEP+ +NG+   R+PFS YEFQSY
Sbjct: 158  AFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLEPSNRNGEAGLRFPFSNYEFQSY 217

Query: 764  QLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLR 925
            Q  PGSPV  L               FP+  F      FLE +    PPKLL LDK+ + 
Sbjct: 218  QFYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAAGPRFLEFQMA-VPPKLLNLDKLSVH 276

Query: 926  EWESRQGSGTATPDP--RSRDNFLLNRQDSDVAP--------VSESAVSHRVSFEITNEE 1075
            E  SRQGSGT TPD    +  +F L+RQ SD+A           +     RVSF+++ E+
Sbjct: 277  ECGSRQGSGTLTPDAVRATSCSFPLDRQCSDIASNRHSDNENKDDQVADLRVSFDLSAED 336

Query: 1076 VVRCVEKKEPN----------TGIERSLIEKTS---------VGESSNRKQXXXXXXXXX 1198
             +R  E K  +            I    ++K+S         VGE+SN            
Sbjct: 337  ALRYAEPKPASPVKIMPESMKNEIAAEKVQKSSEIRHNFECRVGETSN--GILEQASTGG 394

Query: 1199 XXXXXHQKNRTITLGSSKEFNFENVD------EDSDWFVGEEGAGH----SEKWSFFPMM 1348
                 HQK+RT+TLG+ KEFNF+N D         DW+      G     ++ WSFFP+M
Sbjct: 395  EKTPRHQKHRTLTLGTFKEFNFDNADGVPKPSAGPDWWDNGSDVGKEDFTAKNWSFFPVM 454

Query: 1349 QTGV 1360
            Q  +
Sbjct: 455  QPSI 458


>ref|XP_004298812.1| PREDICTED: uncharacterized protein LOC101309729 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 459

 Score =  292 bits (748), Expect = 3e-76
 Identities = 189/461 (40%), Positives = 233/461 (50%), Gaps = 56/461 (12%)
 Frame = +2

Query: 149  MRRGANGPDXXXXXXXXXXXXXXXXXRVDHASSA--QKRRWGSFWSLYWCFGSPK-TKRI 319
            MRRG NG D                        A  QKRRW   W +YWCFG  +  KRI
Sbjct: 3    MRRGVNGGDGNNALDTINAAASAIAAAESRVPQATVQKRRWAKGWGVYWCFGFQRHRKRI 62

Query: 320  GHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXGILSM 499
            GHA+++PET+S G +   A +  Q  SI                              S+
Sbjct: 63   GHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPPSSPASFLQSEPPSAMQSPGFNFSL 122

Query: 500  ASASANMYSPGPASMFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHMTTPSSP 679
               SA+MYSPGP+S+FAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPP ESVH+T PSSP
Sbjct: 123  ---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPAESVHLTRPSSP 179

Query: 680  EVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPE 850
            EVPFA+LL+ N    + GQRYP S YEFQSYQ  PGSPV  L               F +
Sbjct: 180  EVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPGSPVGQLISPSSGISGSGTSSPFLD 239

Query: 851  HPF------FLELRTGNYPPKLLELDKIVLREWESRQGSGTATPD---PRSRDNF----- 988
              F      FLE RTG   PK+L LD +  R+W SR  SG+ TPD     S + F     
Sbjct: 240  SEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSRLCSGSVTPDAAKSTSSEGFTLKPY 298

Query: 989  ----LLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKK------------------- 1099
                +LN + +       +++ HRVSFE++ EEVVRCVEKK                   
Sbjct: 299  TPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVRCVEKKPVALAEAVSTSLQSAEKAE 358

Query: 1100 -EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVD 1276
             E     E S   +  V ++SN                 +QK R+ITLGS+KEFNF+N D
Sbjct: 359  REEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEELSYRYQKERSITLGSAKEFNFDNAD 418

Query: 1277 E--------DSDWFVGEEGA----GHSEKWSFFPMMQTGVS 1363
                      +DW+  E+      G S+ WSFFPM+Q G+S
Sbjct: 419  GGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMIQPGMS 459


>emb|CBI34651.3| unnamed protein product [Vitis vinifera]
          Length = 412

 Score =  291 bits (746), Expect = 4e-76
 Identities = 181/411 (44%), Positives = 217/411 (52%), Gaps = 38/411 (9%)
 Frame = +2

Query: 245  SAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXX 424
            + QKRRWGS W  YWCF SPK KRIGHA+L PE+ + G+    A +  Q P+I       
Sbjct: 36   TVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAPGSGVPAAENLTQAPTIVLPFVAP 95

Query: 425  XXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVFS 601
                                G+LS+ S +AN+YSPG PAS+FAIGPYAHETQLVSPPVFS
Sbjct: 96   PSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLVSPPVFS 155

Query: 602  TFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQ 772
            TFTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L +PN +NG+   R+  SQYEFQSYQL 
Sbjct: 156  TFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEFQSYQLY 215

Query: 773  PGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQGSG 952
            PGSPV HL               FP+                                SG
Sbjct: 216  PGSPVGHLISPSSGISGSGTSSPFPDR-------------------------------SG 244

Query: 953  TATPD---PRSRDNFLLNRQDSDVAPVSESAVSHRVSFEITNEEVVRCVEKKEPN--TGI 1117
            + TPD   P SRD  +L   D    P +E  V HRVSFE+T E+VVRCVEK        +
Sbjct: 245  SITPDALGPPSRDGSVL---DHSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSAALVKAV 301

Query: 1118 ERSL-------IEKTS----------VGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGS 1246
              SL       I++ S          VGE++N                 H K R+ITLGS
Sbjct: 302  SASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQRSITLGS 361

Query: 1247 SKEFNFENVDE--------DSDWFVGE----EGAGHSEKWSFFPMMQTGVS 1363
            +KEFNF+N D          SDW+  E    +  G S+ WS F MMQ  VS
Sbjct: 362  AKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 412


>ref|XP_004298813.1| PREDICTED: uncharacterized protein LOC101309729 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 422

 Score =  290 bits (741), Expect = 2e-75
 Identities = 181/425 (42%), Positives = 225/425 (52%), Gaps = 54/425 (12%)
 Frame = +2

Query: 251  QKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXX 427
            QKRRW   W +YWCFG  +  KRIGHA+++PET+S G +   A +  Q  SI        
Sbjct: 2    QKRRWAKGWGVYWCFGFQRHRKRIGHAVILPETTSPGHNDPRAENLTQASSIVLPFAAPP 61

Query: 428  XXXXXXXXXXXXXXXXXXXGILSMASASANMYSPGPASMFAIGPYAHETQLVSPPVFSTF 607
                                  S+   SA+MYSPGP+S+FAIGPYAHETQLVSPPVFSTF
Sbjct: 62   SSPASFLQSEPPSAMQSPGFNFSL---SASMYSPGPSSIFAIGPYAHETQLVSPPVFSTF 118

Query: 608  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNL---QNGQRYPFSQYEFQSYQLQPG 778
            TTEPSTAP+TPP ESVH+T PSSPEVPFA+LL+ N    + GQRYP S YEFQSYQ  PG
Sbjct: 119  TTEPSTAPFTPPAESVHLTRPSSPEVPFAQLLDSNFRFGEGGQRYPLSHYEFQSYQWYPG 178

Query: 779  SPVSHLXXXXXXXXXXXXXXXFPEHPF------FLELRTGNYPPKLLELDKIVLREWESR 940
            SPV  L               F +  F      FLE RTG   PK+L LD +  R+W SR
Sbjct: 179  SPVGQLISPSSGISGSGTSSPFLDSEFASGGHHFLEFRTGE-APKVLNLDILFTRDWGSR 237

Query: 941  QGSGTATPD---PRSRDNF---------LLNRQDSDVAPVSESAVSHRVSFEITNEEVVR 1084
              SG+ TPD     S + F         +LN + +       +++ HRVSFE++ EEVVR
Sbjct: 238  LCSGSVTPDAAKSTSSEGFTLKPYTPEGVLNARSNSRRRNDGASIGHRVSFELSAEEVVR 297

Query: 1085 CVEKK--------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXX 1204
            CVEKK                    E     E S   +  V ++SN              
Sbjct: 298  CVEKKPVALAEAVSTSLQSAEKAEREEGPNQEVSSSHECPVVDTSNDSSEKAVGGDAEEL 357

Query: 1205 XXXHQKNRTITLGSSKEFNFENVDE--------DSDWFVGEEGA----GHSEKWSFFPMM 1348
               +QK R+ITLGS+KEFNF+N D          +DW+  E+      G S+ WSFFPM+
Sbjct: 358  SYRYQKERSITLGSAKEFNFDNADGGDSGTSSISTDWWANEKVVLKENGESKNWSFFPMI 417

Query: 1349 QTGVS 1363
            Q G+S
Sbjct: 418  QPGMS 422


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  283 bits (723), Expect = 2e-73
 Identities = 184/472 (38%), Positives = 228/472 (48%), Gaps = 98/472 (20%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPKTKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXX 421
            ++ QKRRWG  WSLYWCFGS KTKRIGHA+L PE    GA  T+A +  Q  +I      
Sbjct: 42   TTVQKRRWGGCWSLYWCFGSHKTKRIGHAVLAPEPEVQGAVVTSAENQSQSTAITVPFIA 101

Query: 422  XXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSPG-PASMFAIGPYAHETQLVSPPVF 598
                                 G+LS+ S S N YSPG PAS+FAIGPYAHETQLV+PP F
Sbjct: 102  PPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQLVTPPAF 161

Query: 599  STFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQ-------NGQRYPFSQYEFQ 757
            S FTTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQ
Sbjct: 162  SAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFALSHYEFQ 221

Query: 758  SYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWES 937
            SY L PGSP   L               FP+    LE R G   PKLL  +    R+W S
Sbjct: 222  SYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGE-APKLLGFEHFTTRKWGS 280

Query: 938  RQGSGTATPD-------------------------------------------------- 967
            R GSGT TPD                                                  
Sbjct: 281  RLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGSLTPDAV 340

Query: 968  -PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFEITNEEVVRCVEKKE----- 1102
             P SRD F L  Q S+VA ++         E+ V HRVSFE++ EEV RC+E K      
Sbjct: 341  GPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESKSLASCR 400

Query: 1103 ------PNTGIERSL--------IEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITL 1240
                  P++  E  +         E    GE+S   +              ++K+R+ITL
Sbjct: 401  AFSECPPDSMAEDQIKSGKMLMTDENLPTGETSG--ETPEKPSGEMEEEHCYRKHRSITL 458

Query: 1241 GSSKEFNFENVDE-------DSDWFVGEEGAGH----SEKWSFFPMMQTGVS 1363
            GS KEFNF+N  E       +S+W+  E  AG     +  W+FFP++Q  VS
Sbjct: 459  GSIKEFNFDNSKEVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>ref|XP_002304504.1| hypothetical protein POPTR_0003s12950g [Populus trichocarpa]
            gi|222841936|gb|EEE79483.1| hypothetical protein
            POPTR_0003s12950g [Populus trichocarpa]
          Length = 441

 Score =  280 bits (717), Expect = 1e-72
 Identities = 182/408 (44%), Positives = 223/408 (54%), Gaps = 48/408 (11%)
 Frame = +2

Query: 251  QKRRWGSFWSLYWCFGSPKTKR-IGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXX 427
            QK+RW S WS+YWCFG  K+KR IGHA+L PE+S+ G+ +  A +S Q P +        
Sbjct: 38   QKQRWRSHWSIYWCFGYQKSKRQIGHAVLFPESSAPGSGAPAAENSAQAPEVTFPFVAPP 97

Query: 428  XXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFST 604
                               G++S  S SA+MYSP GPAS+FAIGPYAHETQLVSPPVFST
Sbjct: 98   SSPASFFQSEPPSVTQSPAGLVSRTSISASMYSPSGPASIFAIGPYAHETQLVSPPVFST 157

Query: 605  FTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNGQ---RYPFSQYEFQSYQLQP 775
            FTTEPSTAP+TPPPESVH+TTPSSPEVPFA+L++P L+NG    R+PF   +FQSYQ  P
Sbjct: 158  FTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLIDPTLRNGVTGLRFPF---DFQSYQFHP 214

Query: 776  GSPVSHLXXXXXXXXXXXXXXXFPEHPFFL------ELRTGNYPPKLLELDKIVLREWES 937
            GS V  L               FP+  F +      E R G   PKLL LDK+  REW S
Sbjct: 215  GSSVGQLISPSSGISGSGTSSPFPDGEFAVGGPHSPEFRMG---PKLLNLDKLSTREWGS 271

Query: 938  RQGSGTATPDP--RSRDNFLLNRQDSDVA--PVSESA------VSHRVSFEITNEEVVRC 1087
             Q SG  TPD       NFLL+RQ SDVA  P SE+       V+HR SFE++ ++  RC
Sbjct: 272  YQDSGALTPDSVRHGSPNFLLHRQFSDVASHPRSENGHDDDQVVNHRFSFELSVKDASRC 331

Query: 1088 VEKK--------------------EPNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXX 1207
            VE+K                    E N G      E+ S G++SN               
Sbjct: 332  VEEKPACSIKTVPEYVENGTKAKEEENYGELIQSFERRS-GDTSN---DTPETPSTDGEA 387

Query: 1208 XXHQKNRTITLGSSKEFNFENVDE-------DSDWFVGEEGAGHSEKW 1330
              H+K + ITLGS  EFNF+N DE        S+W V +   G S  W
Sbjct: 388  PQHRKQQPITLGSVNEFNFDNADEGDSHNPSSSNW-VKQPRTGPSSLW 434


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich
            glycoprotein family protein isoform 1 [Theobroma cacao]
          Length = 485

 Score =  277 bits (708), Expect = 1e-71
 Identities = 177/459 (38%), Positives = 227/459 (49%), Gaps = 85/459 (18%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ QK+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I     
Sbjct: 28   TTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFI 87

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G+LS+ S S N YSP GPAS+FAIGPYAHETQLV+PPV
Sbjct: 88   APPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPV 147

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEF 754
            FS  TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEF
Sbjct: 148  FSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEF 207

Query: 755  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWE 934
            QSYQ+ PGSP  +L               FP+    LE R G   PKLL  +    R+W 
Sbjct: 208  QSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWG 266

Query: 935  SRQGSG----------------TATPD-------------------PRSRDNFLLNRQDS 1009
            SR GSG                + TPD                   P SRD FL+  Q S
Sbjct: 267  SRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQIS 326

Query: 1010 DVAPVS---------ESAVSHRVSFEITNEEVVRCVEKK--------------------E 1102
            +VA ++         E+ V HRVSFE++ E+V  C+E K                    +
Sbjct: 327  EVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRK 386

Query: 1103 PNTGIERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED 1282
               GI++ L     +       +              +QK+R++TLGS KEFNF+N   +
Sbjct: 387  ERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 446

Query: 1283 --------SDWFVGEEGAGHSEK----WSFFPMMQTGVS 1363
                    S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 447  ASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich
            glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 489

 Score =  274 bits (701), Expect = 7e-71
 Identities = 176/455 (38%), Positives = 224/455 (49%), Gaps = 85/455 (18%)
 Frame = +2

Query: 254  KRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXXXXXX 430
            K+RWGS W LYWCFGS K +KRIGHA+LVPE    GA  +TA +   P  I         
Sbjct: 36   KKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPS 95

Query: 431  XXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPVFSTF 607
                              G+LS+ S S N YSP GPAS+FAIGPYAHETQLV+PPVFS  
Sbjct: 96   SPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSAL 155

Query: 608  TTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQNG-------QRYPFSQYEFQSYQ 766
            TTEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L+         Q++  S YEFQSYQ
Sbjct: 156  TTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQ 215

Query: 767  LQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWESRQG 946
            + PGSP  +L               FP+    LE R G   PKLL  +    R+W SR G
Sbjct: 216  IYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGE-APKLLGFENFTTRKWGSRLG 274

Query: 947  SG----------------TATPD-------------------PRSRDNFLLNRQDSDVAP 1021
            SG                + TPD                   P SRD FL+  Q S+VA 
Sbjct: 275  SGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGSQISEVAL 334

Query: 1022 VS---------ESAVSHRVSFEITNEEVVRCVEKK--------------------EPNTG 1114
            ++         E+ V HRVSFE++ E+V  C+E K                    +   G
Sbjct: 335  LANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDG 394

Query: 1115 IERSLIEKTSVGESSNRKQXXXXXXXXXXXXXXHQKNRTITLGSSKEFNFENVDED---- 1282
            I++ L     +       +              +QK+R++TLGS KEFNF+N   +    
Sbjct: 395  IKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDK 454

Query: 1283 ----SDWFVGEEGAGHSEK----WSFFPMMQTGVS 1363
                S+W+  E+ AG   +    W+FFPM+Q  VS
Sbjct: 455  PTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  273 bits (697), Expect = 2e-70
 Identities = 178/430 (41%), Positives = 220/430 (51%), Gaps = 56/430 (13%)
 Frame = +2

Query: 242  SSAQKRRWGSFWSLYWCFGSPK-TKRIGHAILVPETSSSGADSTTAVHSPQPPSIXXXXX 418
            ++ QKRRWGS  SLYWCFGS + +KRIGHA+LVPE    GA +  + +     SI     
Sbjct: 28   TTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEPMVPGAVAPASENLNLSTSIVLPFI 87

Query: 419  XXXXXXXXXXXXXXXXXXXXXXGILSMASASANMYSP-GPASMFAIGPYAHETQLVSPPV 595
                                  G LS+ + S N YSP GPASMFAIGPYAHETQLVSPPV
Sbjct: 88   APPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYSPSGPASMFAIGPYAHETQLVSPPV 147

Query: 596  FSTFTTEPSTAPYTPPPESVHMTTPSSPEVPFARLLEPNLQ-------NGQRYPFSQYEF 754
            FSTF TEPSTAP+TPPPESV +TTPSSPEVPFA+LL  +L          Q+   S YEF
Sbjct: 148  FSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLDRSRRNSGTNQKLSLSNYEF 207

Query: 755  QSYQLQPGSPVSHLXXXXXXXXXXXXXXXFPEHPFFLELRTGNYPPKLLELDKIVLREWE 934
            Q YQL P SPV HL               FP+    +E       PKLL  +    R W 
Sbjct: 208  QPYQLYPESPVGHL---ISPISNSGTSSPFPDRRPIVE------APKLLGFEHFSTRRWG 258

Query: 935  SRQGSGTATPD---PRSRDNFLLNRQDSDVAPVS---------ESAVSHRVSFEITNEEV 1078
            SR GSG+ TPD   P SRD+FLL  Q S+VA ++         E+ + HRVSFE+  E+V
Sbjct: 259  SRLGSGSLTPDGAGPASRDSFLLENQISEVASLANSESGSQNGETVIDHRVSFELAGEDV 318

Query: 1079 VRCVEKKEPNTG----------IERSLIEKTSVGESSNR------------KQXXXXXXX 1192
              CVEKK   +           +E   IE+   G S +             K        
Sbjct: 319  AVCVEKKPVASAETVQNTLQDIVEEGEIERERDGISESTENCCEFCVGEALKAASEKASA 378

Query: 1193 XXXXXXXHQKNRTITLGSSKEFNFENVDED---------SDWFVGE----EGAGHSEKWS 1333
                   H+K+  I  GS KEFNF+N   +         S+W+V E    +G G    W+
Sbjct: 379  EGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPNIIGSEWWVNEKVVGKGTGPQTNWT 438

Query: 1334 FFPMMQTGVS 1363
            FFP++Q G+S
Sbjct: 439  FFPLLQPGIS 448

