BLASTX nr result

ID: Rauwolfia21_contig00016002 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00016002
         (1032 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002321853.1| hydroxyproline-rich glycoprotein [Populus tr...   105   2e-20
ref|XP_006490259.1| PREDICTED: uncharacterized protein LOC102625...   103   1e-19
gb|EMJ20811.1| hypothetical protein PRUPE_ppa023224mg, partial [...   103   1e-19
ref|XP_006421765.1| hypothetical protein CICLE_v10007148mg, part...   102   2e-19
emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]   102   2e-19
gb|EOY22987.1| Hydroxyproline-rich glycoprotein family protein, ...   101   4e-19
ref|XP_004234156.1| PREDICTED: uncharacterized protein LOC101245...   100   9e-19
ref|XP_006348055.1| PREDICTED: uncharacterized protein LOC102604...    98   5e-18
ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus c...    96   3e-17
gb|EXC20892.1| hypothetical protein L484_012968 [Morus notabilis]      92   3e-16
ref|XP_004308157.1| PREDICTED: uncharacterized protein LOC101314...    92   3e-16
dbj|BAB11237.1| unnamed protein product [Arabidopsis thaliana]         91   7e-16
ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein...    91   7e-16
ref|XP_006280760.1| hypothetical protein CARUB_v10026727mg [Caps...    90   1e-15
gb|ESW15322.1| hypothetical protein PHAVU_007G063100g [Phaseolus...    88   6e-15
ref|XP_002864122.1| hydroxyproline-rich glycoprotein family prot...    86   2e-14
gb|ACU21406.1| unknown [Glycine max]                                   86   3e-14
ref|XP_006401924.1| hypothetical protein EUTSA_v10014011mg [Eutr...    83   2e-13
gb|AAM19117.1|AC104427_15 Hypothetical protein [Oryza sativa Jap...    68   5e-09
gb|EAY88365.1| hypothetical protein OsI_09820 [Oryza sativa Indi...    67   9e-09

>ref|XP_002321853.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222868849|gb|EEF05980.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 333

 Score =  105 bits (263), Expect = 2e-20
 Identities = 79/210 (37%), Positives = 97/210 (46%), Gaps = 12/210 (5%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE +PG         + + +P  LP         PVKLIASVPF WEEKPGKPL C
Sbjct: 40   VPFLWEVRPGVAKRDWKPEVSSVTPVQLP---------PVKLIASVPFNWEEKPGKPLSC 90

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCT-GDNQIGMNDQD------GDGTEMLESDTE 776
            F                 +  +LP +  C+ GD+     D D      GD   M  SD E
Sbjct: 91   F----SQSPESAFITPQANLLALPWHVTCSQGDDNHKQEDGDSGEENFGDEQVMFNSDLE 146

Query: 777  TCEYETDDGSFSSVPSLSAN----SLAPTMPLPSQ-QSPLKDPNSRQLQSPDSPXXXXXX 941
            +  +ETD+ SFSS  SL AN    S+A +  +P Q  SP  D N +Q      P      
Sbjct: 147  SFSFETDE-SFSSAQSLLANCMVSSVAISTAVPVQTTSPTDDSNGQQETPSSPPSETDSS 205

Query: 942  XXXXXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                      L GA+FLEWLFPL  P+S F
Sbjct: 206  TSSYATGVSSLEGAAFLEWLFPLYTPKSGF 235



 Score = 62.4 bits (150), Expect = 3e-07
 Identities = 33/70 (47%), Positives = 39/70 (55%), Gaps = 1/70 (1%)
 Frame = +3

Query: 138 KMAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDWNXXXXXXXXXXXX-VKLFASVPF 314
           KMA  +    S    I+Q  +VPF+WE+ PG  K+DW              VKL ASVPF
Sbjct: 19  KMAGLEVIDSSRKKHIRQPPSVPFLWEVRPGVAKRDWKPEVSSVTPVQLPPVKLIASVPF 78

Query: 315 GWEEKPGKPL 344
            WEEKPGKPL
Sbjct: 79  NWEEKPGKPL 88


>ref|XP_006490259.1| PREDICTED: uncharacterized protein LOC102625222 [Citrus sinensis]
          Length = 296

 Score =  103 bits (256), Expect = 1e-19
 Identities = 73/203 (35%), Positives = 96/203 (47%), Gaps = 5/203 (2%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE+KPG P         + SP V+          PVKLIAS+PF WEEKPG PLP 
Sbjct: 20   VPFLWEQKPGIPKKDWKPKDSSVSPIVVTP--------PVKLIASIPFDWEEKPGTPLPS 71

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETD 797
            F                    +LP           G+ + D    +  +   ++ +++TD
Sbjct: 72   F------SQPAVLPNPPEKLLALPPPPPMYSQGYYGIFNNDEASDDDHDKQNDSFDFDTD 125

Query: 798  DGSFSSVPSLSANSLAPTMPL----PSQQSPLKDPNSRQLQSPDSP-XXXXXXXXXXXXX 962
            D SFSS PSL AN L P++ +    P Q+S   D  + +L+ P SP              
Sbjct: 126  D-SFSSAPSLLANCLVPSVAISSAVPVQRSLSSDTTTDELEIPSSPASEAESSTSSYETG 184

Query: 963  XXXLVGASFLEWLFPLLAPRSSF 1031
               LVGASFLE LFPLL P++SF
Sbjct: 185  TSSLVGASFLECLFPLLPPKTSF 207


>gb|EMJ20811.1| hypothetical protein PRUPE_ppa023224mg, partial [Prunus persica]
          Length = 284

 Score =  103 bits (256), Expect = 1e-19
 Identities = 77/215 (35%), Positives = 102/215 (47%), Gaps = 17/215 (7%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLP--VKLIASVPFKWEEKPGKPL 611
            VPF WEE+PG P      P+ +S+         S+F  P  VKL+ASVPFKWEEKPG PL
Sbjct: 16   VPFLWEERPGIPKKDWKPPVVSSN---------SSFPAPHIVKLVASVPFKWEEKPGTPL 66

Query: 612  PCF-XXXXXXXXXXXXXXESNSFPSLP---------GNSKCTGDNQIGMNDQDGDGTEML 761
            P F               +  +FPS P         G ++  GD+  G  D +     M 
Sbjct: 67   PSFSEPTLESACPSSLPLQLITFPSPPISSHQYDYDGENEDYGDDISGNGDGEDGAPSMF 126

Query: 762  ESDTETCEYETDDGSFSSVPSLSANSLAPTMPL----PSQQSPLKDPNSRQLQSPDSP-X 926
              + E  ++ETDD SF S P+L AN L P++ +    P+ +S   +  S   ++P SP  
Sbjct: 127  NLELEAFDFETDD-SFISAPALLANCLVPSIAISTAVPADKSTPTEDKSAWPETPSSPAS 185

Query: 927  XXXXXXXXXXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                           LVGASFLE LFPL+   S F
Sbjct: 186  EAGSSTSSYATGVSSLVGASFLECLFPLIPANSGF 220


>ref|XP_006421765.1| hypothetical protein CICLE_v10007148mg, partial [Citrus clementina]
            gi|557523638|gb|ESR35005.1| hypothetical protein
            CICLE_v10007148mg, partial [Citrus clementina]
          Length = 273

 Score =  102 bits (254), Expect = 2e-19
 Identities = 76/203 (37%), Positives = 97/203 (47%), Gaps = 5/203 (2%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE+KPG P         + SP V+          PVKLIAS+PF WEEKPG PLP 
Sbjct: 9    VPFLWEQKPGIPKKDWKPEDSSVSPIVVTP--------PVKLIASIPFDWEEKPGTPLPS 60

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETD 797
            F               S   P +       G   I  ND+  D     +   ++ +++TD
Sbjct: 61   FSQPAVLPNPPEKLLASPPPPPMYSQ----GYYGIFNNDEASDDDH--DKRNDSFDFDTD 114

Query: 798  DGSFSSVPSLSANSLAPTMPL----PSQQSPLKDPNSRQLQSPDSP-XXXXXXXXXXXXX 962
            D SFSS PSL AN L P++ +    P Q+S   D  + +L+ P SP              
Sbjct: 115  D-SFSSAPSLLANCLVPSVAISSAVPVQRSLSSDTTTDELEIPSSPASEAESSTSSYETG 173

Query: 963  XXXLVGASFLEWLFPLLAPRSSF 1031
               LVGASFLE LFPLL P++SF
Sbjct: 174  TSSLVGASFLECLFPLLPPKTSF 196


>emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]
          Length = 341

 Score =  102 bits (254), Expect = 2e-19
 Identities = 86/248 (34%), Positives = 108/248 (43%), Gaps = 5/248 (2%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEEKPG P                  E   +  P     P P      P  P P 
Sbjct: 18   SVPFLWEEKPGIP------------KKDWKPEVTAVNPPPPPPPPPP------PPPPPPP 59

Query: 483  CSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXXXXXXXXXX 662
               P P   P   P         P+KLIAS+PF WEEKPGKPLP F              
Sbjct: 60   PPPPPPPPPPPPPP---------PIKLIASIPFTWEEKPGKPLPFFSGTPHDDSLLLFPP 110

Query: 663  ESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGSFSSVPSLSANSL 842
            +      +  +S    D++   +D D +   + ESD E   +ETDD SFSS PSL AN L
Sbjct: 111  KK----LVCCSSLSDADSKDYEDDGDDEHDGIFESDFEAFGFETDD-SFSSAPSLLANRL 165

Query: 843  APTMPL----PSQQSPLKDPNSRQLQSPDSP-XXXXXXXXXXXXXXXXLVGASFLEWLFP 1007
              T+ +    P Q++ L + ++ Q +SP SP                 LVG+SFL+ LFP
Sbjct: 166  MSTVAISTAVPVQKTSLNEDSNDQPESPSSPASETNSSTSXYATGTTSLVGSSFLDCLFP 225

Query: 1008 LLAPRSSF 1031
            L  P S F
Sbjct: 226  LFPPNSGF 233


>gb|EOY22987.1| Hydroxyproline-rich glycoprotein family protein, putative [Theobroma
            cacao]
          Length = 313

 Score =  101 bits (252), Expect = 4e-19
 Identities = 77/207 (37%), Positives = 97/207 (46%), Gaps = 9/207 (4%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE +PG         + + +P + P+        P+KLIASVPF WEEKPG PLP 
Sbjct: 21   VPFLWEVRPGIAKKDWKPGVSSVTPTLPPRT-------PIKLIASVPFNWEEKPGTPLPR 73

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTE----MLESDTETCE 785
            F                 + P  P  +     N    ND  GDG++    + E D ET  
Sbjct: 74   FSQPPVEPAAVPLSANLMTLPPRPVYTPAY-FNGYDNNDDRGDGSDEQDVVPEMDLETFG 132

Query: 786  YETDDGSFSSVPSLSANSLAPTMPL----PSQQSPLKDPNSRQLQSPDSP-XXXXXXXXX 950
            +ETDD SFSS PSL AN L  +  +    P Q++   D +S   ++P SP          
Sbjct: 133  FETDD-SFSSAPSLLANCLVASTAICTAVPVQKTYHADNSSDHPETPSSPASETESSTSS 191

Query: 951  XXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                   LVGASFLE LFPLL P S F
Sbjct: 192  YATGTSSLVGASFLECLFPLLPPNSGF 218



 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 31/71 (43%), Positives = 42/71 (59%), Gaps = 3/71 (4%)
 Frame = +3

Query: 141 MAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDWN---XXXXXXXXXXXXVKLFASVP 311
           MAE++P++ S+  + +   +VPF+WE+ PG  KKDW                +KL ASVP
Sbjct: 1   MAEKEPTEHSNKRKTRLPPSVPFLWEVRPGIAKKDWKPGVSSVTPTLPPRTPIKLIASVP 60

Query: 312 FGWEEKPGKPL 344
           F WEEKPG PL
Sbjct: 61  FNWEEKPGTPL 71


>ref|XP_004234156.1| PREDICTED: uncharacterized protein LOC101245523 [Solanum
            lycopersicum]
          Length = 328

 Score =  100 bits (249), Expect = 9e-19
 Identities = 81/218 (37%), Positives = 104/218 (47%), Gaps = 20/218 (9%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            +PF WEE+PG P+    +P P ++      +    F  PVKLIASVPF+WEEKPG PLP 
Sbjct: 28   IPFIWEERPGIPIKDW-KPKPVATATT---SGAFTFTPPVKLIASVPFEWEEKPGTPLPF 83

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCT---GD----------NQIGMNDQDGDGTEM 758
            F                 +   LP   +     GD          +Q G +++D    EM
Sbjct: 84   F----------SQTSPHENIVGLPSTVRAVHEGGDDFWAGIGEYIDQRGNHEED----EM 129

Query: 759  LESDTETCEYETDDGSFSSVP-SLSANSLAPTMPLPS-----QQSPLKDPNSRQLQSPDS 920
             ES+ E  + E+   SFSS P SL AN   PT+ + S     Q SP  D +  QLQSP S
Sbjct: 130  TESEVEASDSESIYESFSSAPSSLLANGFIPTVDISSAVPVEQTSPTADIHHTQLQSPLS 189

Query: 921  P-XXXXXXXXXXXXXXXXLVGASFLEWLFPLLAPRSSF 1031
            P                 LVG +FLE LFPLL+P +SF
Sbjct: 190  PTSEAGSSVLSYATGTTSLVGTAFLEKLFPLLSPNTSF 227



 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 33/73 (45%), Positives = 40/73 (54%), Gaps = 6/73 (8%)
 Frame = +3

Query: 144 AEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDW------NXXXXXXXXXXXXVKLFAS 305
           AE +PS  S+  + +Q I++PF+WE  PG P KDW                   VKL AS
Sbjct: 9   AENNPSVNSTCKKERQQISIPFIWEERPGIPIKDWKPKPVATATTSGAFTFTPPVKLIAS 68

Query: 306 VPFGWEEKPGKPL 344
           VPF WEEKPG PL
Sbjct: 69  VPFEWEEKPGTPL 81


>ref|XP_006348055.1| PREDICTED: uncharacterized protein LOC102604397 [Solanum tuberosum]
          Length = 329

 Score = 98.2 bits (243), Expect = 5e-18
 Identities = 80/217 (36%), Positives = 104/217 (47%), Gaps = 19/217 (8%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            +PF WEE+PG P+    +P P    A+   +    F  PVKLIASVPF+WEEKPG PLP 
Sbjct: 30   IPFIWEERPGIPIKDW-KPKPV---AMATTSGAFTFTPPVKLIASVPFEWEEKPGTPLPF 85

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMN----------DQDG--DGTEML 761
            F              +++   ++ G      D   G +          DQ G  +  EM 
Sbjct: 86   F-------------SQTSPHGNIVGLPSIVRDVHEGRDDFWAGIGEYIDQHGSHEEDEMS 132

Query: 762  ESDTETCEYETDDGSFSSVP-SLSANSLAPTMPLPS-----QQSPLKDPNSRQLQSPDSP 923
            ES+ E  + E+   SFSS P SL AN   PT+ + S     Q SP  D +  QLQ+P SP
Sbjct: 133  ESEVEASDSESIYESFSSAPSSLLANGFIPTVDISSAVPVEQTSPTADIHHSQLQTPLSP 192

Query: 924  -XXXXXXXXXXXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                             LVG +FLE LFPLL+P +SF
Sbjct: 193  TSEAGSSVLSYATGTTSLVGTAFLEKLFPLLSPDTSF 229



 Score = 65.1 bits (157), Expect = 4e-08
 Identities = 34/74 (45%), Positives = 41/74 (55%), Gaps = 6/74 (8%)
 Frame = +3

Query: 141 MAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDWN------XXXXXXXXXXXXVKLFA 302
           MAE +PS  S+  + +Q I++PF+WE  PG P KDW                   VKL A
Sbjct: 10  MAENNPSGNSTCKKERQQISIPFIWEERPGIPIKDWKPKPVAMATTSGAFTFTPPVKLIA 69

Query: 303 SVPFGWEEKPGKPL 344
           SVPF WEEKPG PL
Sbjct: 70  SVPFEWEEKPGTPL 83


>ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus communis]
            gi|223550015|gb|EEF51502.1| hypothetical protein
            RCOM_1498790 [Ricinus communis]
          Length = 278

 Score = 95.5 bits (236), Expect = 3e-17
 Identities = 74/205 (36%), Positives = 94/205 (45%), Gaps = 7/205 (3%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WEE+PG         + + +   LP         PVKLIASVPF WEEKPGKPLPC
Sbjct: 21   VPFLWEERPGIAKKDWKPVVSSVTTLALPP--------PVKLIASVPFNWEEKPGKPLPC 72

Query: 618  FXXXXXXXXXXXXXXESNSFPSLP------GNSKCTGDNQIGMNDQDGDGTEMLESDTET 779
            F                NS PS P       + +   +N+ G ++       + + D E+
Sbjct: 73   FSQPPMESPPATL----NSLPSPPMYYQRCDDCEFNNENRAGHDNYGEKEEGIFDLDIES 128

Query: 780  CEYETDDGSFSSVPSLSANSLAPTMPLPSQQSPLKDPNSRQLQSPDSP-XXXXXXXXXXX 956
              +ETDD S SS PSL AN L  ++ + S   P+       L++P SP            
Sbjct: 129  FSFETDD-SLSSAPSLLANCLVSSVAV-SDAVPVD-----HLETPSSPASDTDSSTSSYA 181

Query: 957  XXXXXLVGASFLEWLFPLLAPRSSF 1031
                 L GAS LE LFPL AP S F
Sbjct: 182  TGISSLTGASLLECLFPLYAPDSGF 206



 Score = 62.4 bits (150), Expect = 3e-07
 Identities = 33/70 (47%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
 Frame = +3

Query: 141 MAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDWNXXXXXXXXXXXX--VKLFASVPF 314
           M E +  + S    I+Q   VPF+WE  PG  KKDW               VKL ASVPF
Sbjct: 1   MTENEIIEASKRKHIRQPPFVPFLWEERPGIAKKDWKPVVSSVTTLALPPPVKLIASVPF 60

Query: 315 GWEEKPGKPL 344
            WEEKPGKPL
Sbjct: 61  NWEEKPGKPL 70


>gb|EXC20892.1| hypothetical protein L484_012968 [Morus notabilis]
          Length = 322

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 76/207 (36%), Positives = 92/207 (44%), Gaps = 9/207 (4%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPT-SSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLP 614
            VPF WE KPG          P+ SS  ++P         PVKLIASVPFKWEEKPG PLP
Sbjct: 23   VPFLWEVKPGIAKKDWKPEFPSVSSVPIVPLP-------PVKLIASVPFKWEEKPGTPLP 75

Query: 615  CFXXXXXXXXXXXXXXES-NSFPSLPGNSKCTGDNQIGMNDQDGDGTEM--LESDTETCE 785
             F                 +++P    N           N+ DG   E    + D  T  
Sbjct: 76   SFSQPSQESASPLLPLPPIDNYPYEGVNVYQDSSEDSSSNEGDGQDEEQRGFKLDLGTFG 135

Query: 786  YETDDGSFSSVPSLSAN----SLAPTMPLPSQQSPLKDPNSRQLQSPDSP-XXXXXXXXX 950
             E DD SF S PSL AN    S+A +  +P+Q   L +  S  L+SP SP          
Sbjct: 136  SEADD-SFCSAPSLLANCLVSSVAISTAVPAQNVSLPEDKSGPLESPSSPASETEISTSS 194

Query: 951  XXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                   LVG+S LE LFPL  P+S F
Sbjct: 195  YETGTSSLVGSSLLECLFPLFPPKSGF 221



 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 34/74 (45%), Positives = 42/74 (56%), Gaps = 6/74 (8%)
 Frame = +3

Query: 141 MAEEDPSQVSSTVQ--IKQSITVPFVWEMVPGTPKKDWNXXXXXXXXXXXX----VKLFA 302
           MAE +  Q S+T +  ++Q  +VPF+WE+ PG  KKDW                 VKL A
Sbjct: 1   MAEIELIQTSTTSKKHVRQPPSVPFLWEVKPGIAKKDWKPEFPSVSSVPIVPLPPVKLIA 60

Query: 303 SVPFGWEEKPGKPL 344
           SVPF WEEKPG PL
Sbjct: 61  SVPFKWEEKPGTPL 74


>ref|XP_004308157.1| PREDICTED: uncharacterized protein LOC101314801 [Fragaria vesca
            subsp. vesca]
          Length = 308

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 70/207 (33%), Positives = 101/207 (48%), Gaps = 9/207 (4%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WEE+PG P       + +++ A +P         PVKLIASVPF WEEKPG PLP 
Sbjct: 15   VPFLWEERPGIPKKDWKPTVSSNNVAPIP---------PVKLIASVPFIWEEKPGTPLPY 65

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCT---GDNQIGMNDQDGDGTEMLES----DTE 776
            F                 ++PS P  S+     G+     ++ + DG + ++S    D +
Sbjct: 66   FMESSSESATTEPMM-LITYPSPPICSQHNDHGGEEYSDASNGNDDGEDEIQSVFKLDMQ 124

Query: 777  TCEYETDDGSFSSVPSLSANSLAPTMPLPSQ-QSPLKDPNSRQLQSPDSP-XXXXXXXXX 950
              ++ETDD SFSS PSL AN L  ++ + +   +P  + +  +  +P SP          
Sbjct: 125  AFDFETDD-SFSSAPSLLANCLVSSLAISTAVPAPEDESDQTETDTPSSPLSEAGSSTSS 183

Query: 951  XXXXXXXLVGASFLEWLFPLLAPRSSF 1031
                   LVG +FLE LFPLL  ++ F
Sbjct: 184  YATGTSSLVGGAFLECLFPLLPAKAGF 210



 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 29/56 (51%), Positives = 34/56 (60%), Gaps = 1/56 (1%)
 Frame = +3

Query: 180 QIKQSITVPFVWEMVPGTPKKDWN-XXXXXXXXXXXXVKLFASVPFGWEEKPGKPL 344
           Q+++  +VPF+WE  PG PKKDW              VKL ASVPF WEEKPG PL
Sbjct: 8   QVREPPSVPFLWEERPGIPKKDWKPTVSSNNVAPIPPVKLIASVPFIWEEKPGTPL 63


>dbj|BAB11237.1| unnamed protein product [Arabidopsis thaliana]
          Length = 325

 Score = 90.9 bits (224), Expect = 7e-16
 Identities = 81/249 (32%), Positives = 103/249 (41%), Gaps = 10/249 (4%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEE+PG P  +                   +  P + +  VPF WEE PGKPLP 
Sbjct: 21   SVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPP-IPVPVKLVTSVPFRWEETPGKPLPA 79

Query: 483  CSQ--------PLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
             S         PL T++P  LP        +PVK + SVPF WEE PG+P PCF      
Sbjct: 80   SSNDPPQLPHPPLETATPTPLPPP----VPVPVKQVTSVPFDWEETPGQPYPCFV----- 130

Query: 639  XXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGSFSSV 818
                       S P L        D  +      GD    +E+ ++  +  + D SFSSV
Sbjct: 131  ---------DTSPPELL-------DQPLPPPPMYGD----VETSSDIFDDASSD-SFSSV 169

Query: 819  PSLSANSLAPTMPLPSQQSPLKDP-NSRQLQSPDSP-XXXXXXXXXXXXXXXXLVGASFL 992
            PSL A + + ++          D  N+     P SP                 LVGASFL
Sbjct: 170  PSLLATNRSVSISGAVAVDEFDDNLNTVTSSMPTSPAYESDDSTSSYMTGASSLVGASFL 229

Query: 993  EWLFPLLAP 1019
            E LFP L P
Sbjct: 230  EKLFPRLLP 238


>ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
            thaliana] gi|332008731|gb|AED96114.1| hydroxyproline-rich
            glycoprotein family protein [Arabidopsis thaliana]
          Length = 343

 Score = 90.9 bits (224), Expect = 7e-16
 Identities = 81/249 (32%), Positives = 103/249 (41%), Gaps = 10/249 (4%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEE+PG P  +                   +  P + +  VPF WEE PGKPLP 
Sbjct: 21   SVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPP-IPVPVKLVTSVPFRWEETPGKPLPA 79

Query: 483  CSQ--------PLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
             S         PL T++P  LP        +PVK + SVPF WEE PG+P PCF      
Sbjct: 80   SSNDPPQLPHPPLETATPTPLPPP----VPVPVKQVTSVPFDWEETPGQPYPCFV----- 130

Query: 639  XXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGSFSSV 818
                       S P L        D  +      GD    +E+ ++  +  + D SFSSV
Sbjct: 131  ---------DTSPPELL-------DQPLPPPPMYGD----VETSSDIFDDASSD-SFSSV 169

Query: 819  PSLSANSLAPTMPLPSQQSPLKDP-NSRQLQSPDSP-XXXXXXXXXXXXXXXXLVGASFL 992
            PSL A + + ++          D  N+     P SP                 LVGASFL
Sbjct: 170  PSLLATNRSVSISGAVAVDEFDDNLNTVTSSMPTSPAYESDDSTSSYMTGASSLVGASFL 229

Query: 993  EWLFPLLAP 1019
            E LFP L P
Sbjct: 230  EKLFPRLLP 238


>ref|XP_006280760.1| hypothetical protein CARUB_v10026727mg [Capsella rubella]
            gi|482549464|gb|EOA13658.1| hypothetical protein
            CARUB_v10026727mg [Capsella rubella]
          Length = 341

 Score = 90.1 bits (222), Expect = 1e-15
 Identities = 80/250 (32%), Positives = 98/250 (39%), Gaps = 11/250 (4%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEE+PG P                      +  P + +  VPF WEE PGKPLP 
Sbjct: 18   SVPFIWEERPGYPKKDWQPSLATFVPSPPPLPPP-VPVPVKLVTSVPFRWEETPGKPLPA 76

Query: 483  CSQ--------PLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
             S         PL T++   LP        +PVKL+ SVPF WEE PG+P PCF      
Sbjct: 77   SSNNQPQLPHPPLETATTTSLPPP----VPVPVKLVTSVPFDWEETPGQPYPCFVDFN-- 130

Query: 639  XXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGSFSSV 818
                         P  P +         G  + + D  +   SD           SFSSV
Sbjct: 131  -------------PREPLDQPLPPPPMYGEVETNSDIFDDASSD-----------SFSSV 166

Query: 819  PSLSANSLAPTMP-LPSQQSPLKDPNSRQLQS-PDSP-XXXXXXXXXXXXXXXXLVGASF 989
            PSL A + + ++           D   R+  S P SP                 LVGASF
Sbjct: 167  PSLLATNRSVSISNTVVAMDEFDDKQHRETSSTPSSPTYESDDSTSSYMTGASSLVGASF 226

Query: 990  LEWLFPLLAP 1019
            LE LFP L P
Sbjct: 227  LEKLFPRLLP 236



 Score = 75.1 bits (183), Expect = 4e-11
 Identities = 70/272 (25%), Positives = 95/272 (34%), Gaps = 24/272 (8%)
 Frame = +3

Query: 180 QIKQSITVPFVWEMVPGTPKKDWN--------XXXXXXXXXXXXVKLFASVPFGWEEKPG 335
           Q++Q  +VPF+WE  PG PKKDW                     VKL  SVPF WEE PG
Sbjct: 12  QLRQPPSVPFIWEERPGYPKKDWQPSLATFVPSPPPLPPPVPVPVKLVTSVPFRWEETPG 71

Query: 336 KPLLHCYXXXXXXXXXXXXXESNC-----LQSPGRFIAPVPFEWEEKPGKPLPCCSQPLP 500
           KPL                  +       +  P + +  VPF+WEE PG+P PC      
Sbjct: 72  KPLPASSNNQPQLPHPPLETATTTSLPPPVPVPVKLVTSVPFDWEETPGQPYPC------ 125

Query: 501 TSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPL--PCFXXXXXXXXXXXXXXESNS 674
                                   V F   E   +PL  P                 S+S
Sbjct: 126 -----------------------FVDFNPREPLDQPLPPPPMYGEVETNSDIFDDASSDS 162

Query: 675 FPSLP-----GNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGSFSSVPSLS--- 830
           F S+P       S    +  + M++ D        S   +  YE+DD + S +   S   
Sbjct: 163 FSSVPSLLATNRSVSISNTVVAMDEFDDKQHRETSSTPSSPTYESDDSTSSYMTGASSLV 222

Query: 831 -ANSLAPTMPLPSQQSPLKDPNSRQLQSPDSP 923
            A+ L    P       +K  +S  +Q P  P
Sbjct: 223 GASFLEKLFPRLLPAEKVKAADSEDVQVPTHP 254


>gb|ESW15322.1| hypothetical protein PHAVU_007G063100g [Phaseolus vulgaris]
          Length = 300

 Score = 87.8 bits (216), Expect = 6e-15
 Identities = 71/203 (34%), Positives = 85/203 (41%), Gaps = 5/203 (2%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE KPG P          SS    P       Q P+KLIASVPF WEEKPGKPLP 
Sbjct: 16   VPFIWEVKPGIPKKDWKAEAEVSSLGHFP-------QTPLKLIASVPFVWEEKPGKPLPN 68

Query: 618  FXXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDG-----DGTEMLESDTETC 782
            F                    S  G S        G +D+D      D   +   D E  
Sbjct: 69   FSDVSVDPVLPKPEKTLIHIASSSGFSVAC---NFGHDDKDKGSCSYDSESITSLDLEAF 125

Query: 783  EYETDDGSFSSVPSLSANSLAPTMPLPSQQSPLKDPNSRQLQSPDSPXXXXXXXXXXXXX 962
             ++ D+ SF  VPSL AN L P+  + S     + P+S      DS              
Sbjct: 126  TFDADE-SFGLVPSLLANCLVPSAKVSSAIPLAETPSSPASSETDSSISSYATGRSSP-- 182

Query: 963  XXXLVGASFLEWLFPLLAPRSSF 1031
                +GA+FLE LFPL AP+S F
Sbjct: 183  ----IGATFLESLFPLYAPQSGF 201



 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 28/57 (49%), Positives = 34/57 (59%), Gaps = 3/57 (5%)
 Frame = +3

Query: 183 IKQSITVPFVWEMVPGTPKKDWNXXXXXXXXXXXX---VKLFASVPFGWEEKPGKPL 344
           +++   VPF+WE+ PG PKKDW                +KL ASVPF WEEKPGKPL
Sbjct: 10  VREPPAVPFIWEVKPGIPKKDWKAEAEVSSLGHFPQTPLKLIASVPFVWEEKPGKPL 66


>ref|XP_002864122.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297309957|gb|EFH40381.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 343

 Score = 85.9 bits (211), Expect = 2e-14
 Identities = 81/252 (32%), Positives = 99/252 (39%), Gaps = 13/252 (5%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEE+PG P  +                   +  P + +  VPF WEE PGKPLP 
Sbjct: 21   SVPFIWEERPGFPKKNWQPSLATFVPSPPLLPPP-VPVPVKLVTSVPFRWEETPGKPLPP 79

Query: 483  CSQ--------PLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
             S         PL T++   LP        +PVK + SVPF WEE PG+P PCF      
Sbjct: 80   SSNDPPQLPHPPLETATTTPLPPP----VPVPVKQVTSVPFDWEETPGQPYPCF------ 129

Query: 639  XXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDG---SF 809
                                     N   + DQ      M   + ET     DD    SF
Sbjct: 130  ----------------------VDTNPPELLDQPLPPPPMY-GEVETSSDIFDDASSDSF 166

Query: 810  SSVPSLSANSLAPTMPLPSQQSPLKDPNSRQLQS-PDSP-XXXXXXXXXXXXXXXXLVGA 983
            SSVPSL A + + ++          D  +R  +S P SP                 LVGA
Sbjct: 167  SSVPSLLATNRSVSISGAVAVDEFDDNLNRVTRSMPTSPAYESDDSTSSYMTGASSLVGA 226

Query: 984  SFLEWLFPLLAP 1019
            SFLE LFP L P
Sbjct: 227  SFLEKLFPRLLP 238



 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 70/283 (24%), Positives = 107/283 (37%), Gaps = 22/283 (7%)
 Frame = +3

Query: 141 MAEEDPSQVSST-VQIKQSITVPFVWEMVPGTPKKDWN--------XXXXXXXXXXXXVK 293
           M+E +P +      Q++Q  +VPF+WE  PG PKK+W                     VK
Sbjct: 1   MSEMEPKETKPPRKQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPLLPPPVPVPVK 60

Query: 294 LFASVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNC-----LQSPGRFIAPVPFEWEE 458
           L  SVPF WEE PGKPL                  +       +  P + +  VPF+WEE
Sbjct: 61  LVTSVPFRWEETPGKPLPPSSNDPPQLPHPPLETATTTPLPPPVPVPVKQVTSVPFDWEE 120

Query: 459 KPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
            PG+P PC    + T+ P +L                       ++P  P P +      
Sbjct: 121 TPGQPYPCF---VDTNPPELL-----------------------DQPLPPPPMY-GEVET 153

Query: 639 XXXXXXXXESNSFPSLPG----NSKCTGDNQIGMNDQDGDGTEMLESDTETCEYETDDGS 806
                    S+SF S+P     N   +    + +++ D +   +  S   +  YE+DD +
Sbjct: 154 SSDIFDDASSDSFSSVPSLLATNRSVSISGAVAVDEFDDNLNRVTRSMPTSPAYESDDST 213

Query: 807 FSSVPSLS----ANSLAPTMPLPSQQSPLKDPNSRQLQSPDSP 923
            S +   S    A+ L    P       +K  +S  +Q    P
Sbjct: 214 SSYMTGASSLVGASFLEKLFPRLLPLEKVKSADSEDVQVSTHP 256


>gb|ACU21406.1| unknown [Glycine max]
          Length = 222

 Score = 85.5 bits (210), Expect = 3e-14
 Identities = 72/199 (36%), Positives = 86/199 (43%), Gaps = 1/199 (0%)
 Frame = +3

Query: 438  VPFEWEEKPGKPLPCCSQPLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPC 617
            VPF WE KPG P        P   P V PK        P+KLIASVPF WEEKPGKPLP 
Sbjct: 16   VPFIWEVKPGIPKKDWK---PEPEPEV-PKT-------PLKLIASVPFVWEEKPGKPLPN 64

Query: 618  F-XXXXXXXXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEMLESDTETCEYET 794
            F                S+S  S   N     D   G +    D   +   D E   ++ 
Sbjct: 65   FSVDHPVPPKPLLIHVASSSAFSFACNFGHDHDKDKG-SLSSSDNESITTLDLEAFSFDE 123

Query: 795  DDGSFSSVPSLSANSLAPTMPLPSQQSPLKDPNSRQLQSPDSPXXXXXXXXXXXXXXXXL 974
            D+   SSVPSL AN L P+  + S   PL++       SP S                  
Sbjct: 124  DESFVSSVPSLLANCLVPSAKV-STAIPLRETTP---SSPASSSETDSGTSSYATGMSSP 179

Query: 975  VGASFLEWLFPLLAPRSSF 1031
            +GA+FLE LFPL  P+S F
Sbjct: 180  IGATFLECLFPLFPPKSGF 198



 Score = 62.0 bits (149), Expect = 4e-07
 Identities = 28/54 (51%), Positives = 35/54 (64%)
 Frame = +3

Query: 183 IKQSITVPFVWEMVPGTPKKDWNXXXXXXXXXXXXVKLFASVPFGWEEKPGKPL 344
           +++  +VPF+WE+ PG PKKDW             +KL ASVPF WEEKPGKPL
Sbjct: 10  VREPPSVPFIWEVKPGIPKKDWKPEPEPEVPKTP-LKLIASVPFVWEEKPGKPL 62


>ref|XP_006401924.1| hypothetical protein EUTSA_v10014011mg [Eutrema salsugineum]
            gi|557103014|gb|ESQ43377.1| hypothetical protein
            EUTSA_v10014011mg [Eutrema salsugineum]
          Length = 343

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 79/251 (31%), Positives = 97/251 (38%), Gaps = 12/251 (4%)
 Frame = +3

Query: 303  SVPFGWEEKPGKPLLHCYXXXXXXXXXXXXXESNCLQSPGRFIAPVPFEWEEKPGKPLPC 482
            SVPF WEE+PG P  +                   +  P + +  VPF WE+ PGKPLP 
Sbjct: 18   SVPFIWEERPGLPKKNWQPSLATFVPSPPPLPPP-IPVPVKLVTSVPFRWEQTPGKPLPS 76

Query: 483  CSQ--------PLPTSSPAVLPKAEISNFQLPVKLIASVPFKWEEKPGKPLPCFXXXXXX 638
             S         PL T++   LP        +PVKL+ SVPF  EE PG+P PCF      
Sbjct: 77   SSNDPPQLPHPPLETATAPPLPPP----VPVPVKLVTSVPFVREETPGQPYPCF------ 126

Query: 639  XXXXXXXXESNSFPSLPGNSKCTGDNQIGMNDQDGDGTEML-ESDTETCEY-ETDDGSFS 812
                                     NQ    DQ      M  E +T +  Y +    SFS
Sbjct: 127  ----------------------VDTNQTEPLDQPLPPPPMYGEVETNSDIYDDASSDSFS 164

Query: 813  SVPS-LSANSLAPTMPLPSQQSPLKDPNSRQLQSPDSP-XXXXXXXXXXXXXXXXLVGAS 986
            SVPS L+ N   P     +     ++ N      P SP                 LVGAS
Sbjct: 165  SVPSLLTGNRSVPVSGAVTVDEFDENLNRETSSVPTSPGYESDDSTSSYMTGASSLVGAS 224

Query: 987  FLEWLFPLLAP 1019
            FLE LFP L P
Sbjct: 225  FLEKLFPRLLP 235


>gb|AAM19117.1|AC104427_15 Hypothetical protein [Oryza sativa Japonica Group]
           gi|108705974|gb|ABF93769.1| hypothetical protein
           LOC_Os03g03580 [Oryza sativa Japonica Group]
           gi|125584777|gb|EAZ25441.1| hypothetical protein
           OsJ_09257 [Oryza sativa Japonica Group]
          Length = 231

 Score = 68.2 bits (165), Expect = 5e-09
 Identities = 34/73 (46%), Positives = 45/73 (61%), Gaps = 2/73 (2%)
 Frame = +3

Query: 132 ELKMAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDW--NXXXXXXXXXXXXVKLFAS 305
           EL+++E +P +     ++KQ I+VPF+WE+ PG PKKDW  +             KL  S
Sbjct: 3   ELELSEFNPRE-----RVKQQISVPFLWEVKPGAPKKDWAISNPVPSAISCPSPAKLVVS 57

Query: 306 VPFGWEEKPGKPL 344
           VPF WEEKPGKPL
Sbjct: 58  VPFQWEEKPGKPL 70


>gb|EAY88365.1| hypothetical protein OsI_09820 [Oryza sativa Indica Group]
          Length = 231

 Score = 67.4 bits (163), Expect = 9e-09
 Identities = 34/73 (46%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
 Frame = +3

Query: 132 ELKMAEEDPSQVSSTVQIKQSITVPFVWEMVPGTPKKDW--NXXXXXXXXXXXXVKLFAS 305
           EL++ E +P +     ++KQ I+VPF+WE+ PG PKKDW  +             KL  S
Sbjct: 3   ELELPEFNPRE-----RVKQQISVPFLWEVKPGAPKKDWAISNPVPSAISCPSPAKLVVS 57

Query: 306 VPFGWEEKPGKPL 344
           VPF WEEKPGKPL
Sbjct: 58  VPFQWEEKPGKPL 70


Top