BLASTX nr result

ID: Mentha29_contig00020565 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00020565
         (594 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28733.1| hypothetical protein MIMGU_mgv1a010039mg [Mimulus...   288   1e-75
gb|EPS73514.1| hypothetical protein M569_01239, partial [Genlise...   223   3e-56
ref|XP_004228397.1| PREDICTED: uncharacterized protein LOC101253...   222   7e-56
ref|XP_006364945.1| PREDICTED: uncharacterized protein LOC102583...   221   1e-55
ref|XP_006361122.1| PREDICTED: uncharacterized protein LOC102585...   220   3e-55
ref|XP_007215731.1| hypothetical protein PRUPE_ppa009082mg [Prun...   219   5e-55
ref|XP_007222980.1| hypothetical protein PRUPE_ppa008967mg [Prun...   216   5e-54
ref|XP_003518719.2| PREDICTED: uncharacterized protein LOC100775...   213   3e-53
ref|XP_006573104.1| PREDICTED: uncharacterized protein LOC100806...   210   3e-52
ref|XP_003517783.1| PREDICTED: uncharacterized protein LOC100806...   210   3e-52
ref|XP_007157641.1| hypothetical protein PHAVU_002G086800g [Phas...   209   6e-52
ref|XP_002875178.1| hypothetical protein ARALYDRAFT_904548 [Arab...   208   1e-51
ref|XP_003613679.1| Mucin-like protein [Medicago truncatula] gi|...   207   1e-51
gb|AGV54769.1| mucin-like protein [Phaseolus vulgaris]                206   3e-51
ref|XP_007046137.1| Mucin-related protein [Theobroma cacao] gi|5...   206   4e-51
ref|NP_178391.1| mucin-related protein [Arabidopsis thaliana] gi...   205   9e-51
ref|XP_004298438.1| PREDICTED: uncharacterized protein LOC101307...   204   2e-50
ref|XP_004144093.1| PREDICTED: uncharacterized protein LOC101222...   204   2e-50
ref|XP_002273560.1| PREDICTED: uncharacterized protein LOC100256...   203   3e-50
ref|XP_006395751.1| hypothetical protein EUTSA_v10004617mg [Eutr...   202   4e-50

>gb|EYU28733.1| hypothetical protein MIMGU_mgv1a010039mg [Mimulus guttatus]
          Length = 324

 Score =  288 bits (736), Expect = 1e-75
 Identities = 145/198 (73%), Positives = 162/198 (81%), Gaps = 1/198 (0%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           K+GE+EWNDAWETAWLPDDLSGKS+RA WE+DVSF+LP A  +NPQ  ++S PEEID ET
Sbjct: 38  KSGEDEWNDAWETAWLPDDLSGKSARASWESDVSFALPAAD-QNPQ--LSSLPEEIDAET 94

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFVE+MN+NWDQRKGKS KK  D+N                   LENIKRDYRLTKQKI
Sbjct: 95  KAFVEDMNDNWDQRKGKSVKKDDDKNESPPGPASSSSSTSLYS--LENIKRDYRLTKQKI 152

Query: 363 HASLWVKEIEKLEEAKLGNSISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNKPD 542
           HA LWVKEIEKLEEAKLGNSISG DDIEKLLDSASEIFDSANND G+PKI GS+FKNKPD
Sbjct: 153 HAGLWVKEIEKLEEAKLGNSISGGDDIEKLLDSASEIFDSANNDFGDPKIPGSEFKNKPD 212

Query: 543 GWETMSKNPD-GNVWDMS 593
           GWET SK+PD G++WDMS
Sbjct: 213 GWETTSKSPDGGSIWDMS 230


>gb|EPS73514.1| hypothetical protein M569_01239, partial [Genlisea aurea]
          Length = 276

 Score =  223 bits (568), Expect = 3e-56
 Identities = 120/200 (60%), Positives = 138/200 (69%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           K G+++WN AWETAWLPDDLSGK  RAPWE DV+F   P+   NP          +D ET
Sbjct: 16  KAGDDDWNVAWETAWLPDDLSGKGPRAPWETDVAFP-SPSHGNNPTEL------PMDAET 68

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           QAFVEEMN+NW+QRKGKSAKK    +                   LENIK+DYRLTKQ+I
Sbjct: 69  QAFVEEMNDNWEQRKGKSAKKEAPNDT---------ETATPALYSLENIKKDYRLTKQRI 119

Query: 363 HASLWVKEIEKLEEAKLGNSISGA--DDIEKLLDSASEIFDSANNDLGNPKISGSDFKNK 536
           HA LWVKEIEKLEEAKLG +ISGA  DDI+K LDSASEIFDS NN+L     S S+ KNK
Sbjct: 120 HAGLWVKEIEKLEEAKLGEAISGAHDDDIDKFLDSASEIFDSGNNEL-KAAGSSSELKNK 178

Query: 537 PDGWETMSKNPD-GNVWDMS 593
           PDGWE  SK+ D G++WDMS
Sbjct: 179 PDGWEATSKSSDGGSIWDMS 198


>ref|XP_004228397.1| PREDICTED: uncharacterized protein LOC101253611 [Solanum
           lycopersicum]
          Length = 314

 Score =  222 bits (565), Expect = 7e-56
 Identities = 116/205 (56%), Positives = 140/205 (68%), Gaps = 8/205 (3%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPP-EEIDVE 179
           K+G +EWNDAWE AWLPDDLSGK+ RAPWEADV+F+LP  ++   +     P   E+D E
Sbjct: 32  KSGNDEWNDAWEAAWLPDDLSGKN-RAPWEADVNFALPDDTSNTTEITQIEPRVSEVDAE 90

Query: 180 TQAFVEEMNENWDQRKGKSAKK----VPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRL 347
           T+AFVE+MNENW  RKGK        V + N                   LENIK+DYRL
Sbjct: 91  TKAFVEDMNENWHLRKGKQKNSSEGIVMNENGSSLYS-------------LENIKKDYRL 137

Query: 348 TKQKIHASLWVKEIEKLEEAKLGNSISGA---DDIEKLLDSASEIFDSANNDLGNPKISG 518
            KQ++HA LW+KEIEK+EEAKLG+SI G+   DDIEKLLDS SEIFDS N+D  N   + 
Sbjct: 138 KKQRVHAGLWLKEIEKMEEAKLGDSIGGSGNGDDIEKLLDSCSEIFDSPNDDSNNSNTT- 196

Query: 519 SDFKNKPDGWETMSKNPDGNVWDMS 593
           S+FKNKPDGWET SK  DGN+W+MS
Sbjct: 197 SEFKNKPDGWETTSKTQDGNIWEMS 221


>ref|XP_006364945.1| PREDICTED: uncharacterized protein LOC102583834 [Solanum tuberosum]
          Length = 315

 Score =  221 bits (563), Expect = 1e-55
 Identities = 117/203 (57%), Positives = 141/203 (69%), Gaps = 6/203 (2%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPAS---AENPQAAVASPPEEID 173
           K+G +EWNDAWE AWLPDDLSGK+ R+PWEADV+F+LP  +   AEN Q  + S   E+D
Sbjct: 32  KSGNDEWNDAWEAAWLPDDLSGKN-RSPWEADVNFALPDDTSNAAENTQ--IESRVSEVD 88

Query: 174 VETQAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTK 353
            ET+AFVE+MNENW  RKGK  K   +                     LENIK+DYRL K
Sbjct: 89  AETKAFVEDMNENWQLRKGKQQKNGSEGT--------NMNENGSSLYSLENIKKDYRLKK 140

Query: 354 QKIHASLWVKEIEKLEEAKLGNSISGA---DDIEKLLDSASEIFDSANNDLGNPKISGSD 524
           Q++HA LW+KEIEK+EEAKLG+SI G+   DDIEKLLDS SEIFDS  +D  N   + S+
Sbjct: 141 QRVHAGLWLKEIEKMEEAKLGDSIGGSGNGDDIEKLLDSCSEIFDSPYDDSSNSNTT-SE 199

Query: 525 FKNKPDGWETMSKNPDGNVWDMS 593
           FKNKPDGWET SK  DGN+W+MS
Sbjct: 200 FKNKPDGWETTSKTQDGNIWEMS 222


>ref|XP_006361122.1| PREDICTED: uncharacterized protein LOC102585548 [Solanum tuberosum]
          Length = 311

 Score =  220 bits (560), Expect = 3e-55
 Identities = 120/203 (59%), Positives = 139/203 (68%), Gaps = 6/203 (2%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPP---ASAENPQAAVASPPEEID 173
           K+G +EWNDAWE AWLPDDLSGK+ RAPWEADV+F+       S EN Q  + S   E+D
Sbjct: 28  KSGNDEWNDAWEAAWLPDDLSGKN-RAPWEADVNFAHSDDTIGSTENIQ--IESRVPEVD 84

Query: 174 VETQAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTK 353
            ET+AFVE+MNENW  RKGK  K     N G                 LENIKRDYRL K
Sbjct: 85  TETKAFVEDMNENWHLRKGKQQK-----NGGEGININESESSLYS---LENIKRDYRLKK 136

Query: 354 QKIHASLWVKEIEKLEEAKLGNSISG---ADDIEKLLDSASEIFDSANNDLGNPKISGSD 524
           Q++HA LW+KEIEK+EE KLG+SI G   ADDIEKLLDS SEIFDS N+D  N   + S+
Sbjct: 137 QRVHAGLWLKEIEKMEETKLGDSIGGSGNADDIEKLLDSCSEIFDSPNDDSSNSNTT-SE 195

Query: 525 FKNKPDGWETMSKNPDGNVWDMS 593
           FKNKPDGWET SK  DGN+W+MS
Sbjct: 196 FKNKPDGWETTSKTQDGNIWEMS 218


>ref|XP_007215731.1| hypothetical protein PRUPE_ppa009082mg [Prunus persica]
           gi|462411881|gb|EMJ16930.1| hypothetical protein
           PRUPE_ppa009082mg [Prunus persica]
          Length = 307

 Score =  219 bits (558), Expect = 5e-55
 Identities = 114/200 (57%), Positives = 136/200 (68%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           K G+++WNDAWETAWLP DLSG SSRAPWEADV+FS   +S   P  A        D+ET
Sbjct: 39  KKGDDDWNDAWETAWLPPDLSGSSSRAPWEADVNFSSSESSVVLPSDA--------DLET 90

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFVE+MNENW++R+    +K    N                   L++IK+DYR+ KQ+I
Sbjct: 91  KAFVEDMNENWNERRKPKEEKQQSENGS-------------SLYSLDSIKKDYRIKKQRI 137

Query: 363 HASLWVKEIEKLEEAKL--GNSISGADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKN 533
           HA LW+KEIEK EEAKL   NS  G DDIE+LLDS S+IFDSANNDL N K  S SDFKN
Sbjct: 138 HAGLWMKEIEKQEEAKLADSNSFGGGDDIERLLDSCSDIFDSANNDLENSKAPSASDFKN 197

Query: 534 KPDGWETMSKNPDGNVWDMS 593
           KPDGWET SK  DGNVW+M+
Sbjct: 198 KPDGWETTSKAKDGNVWEMT 217


>ref|XP_007222980.1| hypothetical protein PRUPE_ppa008967mg [Prunus persica]
           gi|462419916|gb|EMJ24179.1| hypothetical protein
           PRUPE_ppa008967mg [Prunus persica]
          Length = 312

 Score =  216 bits (549), Expect = 5e-54
 Identities = 112/200 (56%), Positives = 139/200 (69%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           K G+++WNDAW+TAWLP DLSG SSRAPWE DV+FS   +S   P  A        D+ET
Sbjct: 39  KKGDDDWNDAWDTAWLPPDLSGSSSRAPWETDVNFSSSESSVVLPSDA--------DLET 90

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFVE+MNENW++R+ K  ++ P +                    L++IK+DYR+ KQ+I
Sbjct: 91  KAFVEDMNENWNERR-KPKEENPQKQ-------QQQSENGSSLYSLDSIKKDYRVKKQRI 142

Query: 363 HASLWVKEIEKLEEAKL--GNSISGADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKN 533
           HA LW+KEIEK EEAKL   NS+ G DDIE+LLDS S+IFDSANNDL N K+ S SDFKN
Sbjct: 143 HAGLWMKEIEKQEEAKLADSNSVGGGDDIERLLDSCSDIFDSANNDLENSKVPSASDFKN 202

Query: 534 KPDGWETMSKNPDGNVWDMS 593
           KPDGWET SK  DGNVW+M+
Sbjct: 203 KPDGWETTSKAKDGNVWEMT 222


>ref|XP_003518719.2| PREDICTED: uncharacterized protein LOC100775402 [Glycine max]
          Length = 327

 Score =  213 bits (542), Expect = 3e-53
 Identities = 108/199 (54%), Positives = 145/199 (72%), Gaps = 2/199 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GE+EWN+AWETAWLPDDL+ K+ +APWE+DV+F   P+S+ +  AA A+  +  D ET
Sbjct: 59  RSGEDEWNEAWETAWLPDDLTPKT-QAPWESDVNF---PSSSSSSSAAAANDGDG-DEET 113

Query: 183 QAFVEEMNENWDQR-KGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQK 359
           +AFV EMNENW++R KG   K+  + N                   +EN+K+DYRL KQ+
Sbjct: 114 KAFVAEMNENWNERRKGSKEKEKKEENGALYS--------------VENMKKDYRLKKQR 159

Query: 360 IHASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNK 536
           +HA LW+KEIEKLEEAKLG+S ++G DDI++LLDS S+IFD  NNDL N ++  S+FKN 
Sbjct: 160 MHAGLWMKEIEKLEEAKLGDSDVAGDDDIQRLLDSCSDIFDPGNNDLNNVQVQTSEFKNM 219

Query: 537 PDGWETMSKNPDGNVWDMS 593
           PDGWET+SKN +GNVW+MS
Sbjct: 220 PDGWETISKNQEGNVWEMS 238


>ref|XP_006573104.1| PREDICTED: uncharacterized protein LOC100806758 isoform X2 [Glycine
           max]
          Length = 262

 Score =  210 bits (534), Expect = 3e-52
 Identities = 112/199 (56%), Positives = 141/199 (70%), Gaps = 2/199 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GE+EWN+AWETAWLPDDL+ K+ RAPWE+DV+F  P  SA  P A      E+ D ET
Sbjct: 43  RSGEDEWNEAWETAWLPDDLTPKT-RAPWESDVNF--PSYSA--PAA------EDGDEET 91

Query: 183 QAFVEEMNENWDQR-KGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQK 359
           +AFV EMNENW++R KG   K+  + N                   LEN+K+DYRL KQ+
Sbjct: 92  KAFVAEMNENWNERRKGSKEKEKREENGALYS--------------LENMKKDYRLKKQR 137

Query: 360 IHASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNK 536
           +HA LW+KEIEKLEEAKLG+S I+G DDI++LLDS S+IFD  NN+L N  +  SDFKN 
Sbjct: 138 MHAGLWMKEIEKLEEAKLGDSDIAGGDDIQRLLDSCSDIFDPGNNNLNNAHVQTSDFKNM 197

Query: 537 PDGWETMSKNPDGNVWDMS 593
           PDGWET+SKN +GNVW+MS
Sbjct: 198 PDGWETISKNQEGNVWEMS 216


>ref|XP_003517783.1| PREDICTED: uncharacterized protein LOC100806758 isoform X1 [Glycine
           max]
          Length = 305

 Score =  210 bits (534), Expect = 3e-52
 Identities = 112/199 (56%), Positives = 141/199 (70%), Gaps = 2/199 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GE+EWN+AWETAWLPDDL+ K+ RAPWE+DV+F  P  SA  P A      E+ D ET
Sbjct: 43  RSGEDEWNEAWETAWLPDDLTPKT-RAPWESDVNF--PSYSA--PAA------EDGDEET 91

Query: 183 QAFVEEMNENWDQR-KGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQK 359
           +AFV EMNENW++R KG   K+  + N                   LEN+K+DYRL KQ+
Sbjct: 92  KAFVAEMNENWNERRKGSKEKEKREENGALYS--------------LENMKKDYRLKKQR 137

Query: 360 IHASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNK 536
           +HA LW+KEIEKLEEAKLG+S I+G DDI++LLDS S+IFD  NN+L N  +  SDFKN 
Sbjct: 138 MHAGLWMKEIEKLEEAKLGDSDIAGGDDIQRLLDSCSDIFDPGNNNLNNAHVQTSDFKNM 197

Query: 537 PDGWETMSKNPDGNVWDMS 593
           PDGWET+SKN +GNVW+MS
Sbjct: 198 PDGWETISKNQEGNVWEMS 216


>ref|XP_007157641.1| hypothetical protein PHAVU_002G086800g [Phaseolus vulgaris]
           gi|561031056|gb|ESW29635.1| hypothetical protein
           PHAVU_002G086800g [Phaseolus vulgaris]
          Length = 320

 Score =  209 bits (531), Expect = 6e-52
 Identities = 104/198 (52%), Positives = 142/198 (71%), Gaps = 1/198 (0%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GE+EWN+AWETAWLP+DL  K+ RAPWE+DV+F   P+S+ +P AA A    E D ET
Sbjct: 50  RSGEDEWNEAWETAWLPEDLRPKT-RAPWESDVNF---PSSSSSPVAA-ADAVAEADEET 104

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFV EMNENW++R+  S +K  ++                    +EN+K+DYRL KQ++
Sbjct: 105 KAFVAEMNENWNERRRGSKEKEKEKRE-----------ENGALYSVENMKKDYRLKKQRM 153

Query: 363 HASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNKP 539
           HA LW+KEIEKLEEAKL +S ++  DDI++L+DS S+IFD  NNDL N ++  ++FKN P
Sbjct: 154 HAGLWMKEIEKLEEAKLADSDVAAGDDIQRLIDSCSDIFDPGNNDLNNAQVQTAEFKNMP 213

Query: 540 DGWETMSKNPDGNVWDMS 593
           DGWET+SKN +GNVW+MS
Sbjct: 214 DGWETISKNQEGNVWEMS 231


>ref|XP_002875178.1| hypothetical protein ARALYDRAFT_904548 [Arabidopsis lyrata subsp.
           lyrata] gi|297321016|gb|EFH51437.1| hypothetical protein
           ARALYDRAFT_904548 [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  208 bits (529), Expect = 1e-51
 Identities = 109/197 (55%), Positives = 139/197 (70%), Gaps = 3/197 (1%)
 Frame = +3

Query: 12  EEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVETQAF 191
           E++WNDAWE+AWLPDDL+ K  RAPWE DV+FS+  ++A           EEIDVE +AF
Sbjct: 46  EDKWNDAWESAWLPDDLTDKI-RAPWETDVNFSVKESTATT---------EEIDVEAKAF 95

Query: 192 VEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKIHAS 371
           VE+MNE+W++R+GKS K V  R                    LE +K+DYRL KQ++HAS
Sbjct: 96  VEDMNEHWNERRGKSGK-VEKREEKNKKEIGDGEESSSSLYSLETMKKDYRLKKQRVHAS 154

Query: 372 LWVKEIEKLEEAKLGNSIS--GADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKNKPD 542
           LWVKEIEKLEEAKLG+S S  GADDI++LLDS SEIFDS ++D    ++ SGS+ KNKPD
Sbjct: 155 LWVKEIEKLEEAKLGDSGSGGGADDIDRLLDSCSEIFDSVDHDFDKLEVSSGSELKNKPD 214

Query: 543 GWETMSKNPDGNVWDMS 593
           GWE+ +K  DGN+W+MS
Sbjct: 215 GWESTAKEQDGNLWEMS 231


>ref|XP_003613679.1| Mucin-like protein [Medicago truncatula]
           gi|355515014|gb|AES96637.1| Mucin-like protein [Medicago
           truncatula]
          Length = 299

 Score =  207 bits (528), Expect = 1e-51
 Identities = 111/200 (55%), Positives = 133/200 (66%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GEEEWN+AWE+AWLP DL+ K+ RAPWE DV+F+                 E  D ET
Sbjct: 39  RSGEEEWNEAWESAWLPQDLTPKT-RAPWEGDVNFA---------------SEEVADAET 82

Query: 183 QAFVEEMNENWDQR-KGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQK 359
           +AFVEEMNENW++R KG   +KV + N                   LENIK+DYRL KQK
Sbjct: 83  KAFVEEMNENWNERRKGLKKEKVEEENV------------KGGIYSLENIKKDYRLKKQK 130

Query: 360 IHASLWVKEIEKLEEAKLGN--SISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKN 533
           +HA LW KEIEKLEEAKLG     +G DDI+KLLDS S+IFDS NNDL N K   S+FKN
Sbjct: 131 LHAGLWSKEIEKLEEAKLGGCGGGNGDDDIQKLLDSCSDIFDSHNNDLNNAKDPTSEFKN 190

Query: 534 KPDGWETMSKNPDGNVWDMS 593
            PDGWET+SKN DGN+W+MS
Sbjct: 191 MPDGWETISKNQDGNIWEMS 210


>gb|AGV54769.1| mucin-like protein [Phaseolus vulgaris]
          Length = 319

 Score =  206 bits (525), Expect = 3e-51
 Identities = 105/198 (53%), Positives = 141/198 (71%), Gaps = 1/198 (0%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           ++GE+EWN+AWETAWLP+DL  K+ RAPWE+DV+F   P+S+ +P AA A    E D ET
Sbjct: 50  RSGEDEWNEAWETAWLPEDLRPKT-RAPWESDVNF---PSSSSSPVAA-ADAVAEADEET 104

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFV EMNENW++R+  S +K  ++                    +EN+K+DYRL KQK+
Sbjct: 105 KAFVAEMNENWNERRRGSKEKEKEKRE-----------ENGALYSVENMKKDYRLKKQKM 153

Query: 363 HASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKISGSDFKNKP 539
           HA LW+KEIEKLEEAKL +S ++  DDI++L+DS S+IFD  NND  N ++  +DFKN P
Sbjct: 154 HAGLWMKEIEKLEEAKLADSDVAAGDDIQRLIDSCSDIFDPGNNDSHNAQVQTADFKNMP 213

Query: 540 DGWETMSKNPDGNVWDMS 593
           DGWET+SKNP+GNV +MS
Sbjct: 214 DGWETISKNPEGNVLEMS 231


>ref|XP_007046137.1| Mucin-related protein [Theobroma cacao] gi|508710072|gb|EOY01969.1|
           Mucin-related protein [Theobroma cacao]
          Length = 319

 Score =  206 bits (524), Expect = 4e-51
 Identities = 112/195 (57%), Positives = 137/195 (70%), Gaps = 1/195 (0%)
 Frame = +3

Query: 12  EEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVETQAF 191
           ++ WNDAWE AWLPDD+S K+ RAPWEADV+F  P    ++    V S   ++D ET+AF
Sbjct: 47  DDSWNDAWEAAWLPDDISPKN-RAPWEADVNF--PSNDEQSATKMVLS--SDVDAETKAF 101

Query: 192 VEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKIHAS 371
           VE+MNENW++R+ KS K+  +  A                  LEN+K DYRL KQ+IHA 
Sbjct: 102 VEDMNENWNERR-KSPKQKQNEEA-----EKERKGEGGGLYSLENMKNDYRLKKQRIHAG 155

Query: 372 LWVKEIEKLEEAKLGNSISGADDIEKLLDSASEIFDSANNDLGNPK-ISGSDFKNKPDGW 548
           LW+KEI+KLEEAKLG+S   ADDI+KLLDS SEIFDS N DL N K +S S+FKNKPDGW
Sbjct: 156 LWMKEIDKLEEAKLGDS---ADDIDKLLDSCSEIFDSGNADLENSKLLSTSEFKNKPDGW 212

Query: 549 ETMSKNPDGNVWDMS 593
           ET SK PDGNVW+MS
Sbjct: 213 ETTSKAPDGNVWEMS 227


>ref|NP_178391.1| mucin-related protein [Arabidopsis thaliana]
           gi|3461815|gb|AAC32909.1| predicted by genscan
           [Arabidopsis thaliana] gi|26451250|dbj|BAC42727.1|
           unknown protein [Arabidopsis thaliana]
           gi|29824133|gb|AAP04027.1| unknown protein [Arabidopsis
           thaliana] gi|330250545|gb|AEC05639.1| mucin-related
           protein [Arabidopsis thaliana]
          Length = 314

 Score =  205 bits (521), Expect = 9e-51
 Identities = 108/197 (54%), Positives = 138/197 (70%), Gaps = 3/197 (1%)
 Frame = +3

Query: 12  EEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVETQAF 191
           E++WNDAWE+AWLPDDL+ K  RAPWE DV+F +        + + A+  EEIDVE +AF
Sbjct: 43  EDKWNDAWESAWLPDDLTDKI-RAPWERDVNFPV--------KESTATAIEEIDVEAKAF 93

Query: 192 VEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKIHAS 371
           VE+MNE+WD+R+GKS K V  R                    LE +K+DYRL KQ++HAS
Sbjct: 94  VEDMNEHWDERRGKSGK-VEKREEKKKEIGDGEDESSSSLYSLETMKKDYRLKKQRVHAS 152

Query: 372 LWVKEIEKLEEAKLGNSIS--GADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKNKPD 542
           LWVKEIEKLEEAKL +S S  GADDI++LLDS SEIFDS ++D    ++ SGS+ KNKPD
Sbjct: 153 LWVKEIEKLEEAKLDDSGSGGGADDIDRLLDSCSEIFDSVDHDFDKLEVSSGSELKNKPD 212

Query: 543 GWETMSKNPDGNVWDMS 593
           GWE+ +K  DGN+W+MS
Sbjct: 213 GWESTAKEQDGNLWEMS 229


>ref|XP_004298438.1| PREDICTED: uncharacterized protein LOC101307692 [Fragaria vesca
           subsp. vesca]
          Length = 312

 Score =  204 bits (519), Expect = 2e-50
 Identities = 112/200 (56%), Positives = 134/200 (67%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           K  ++EWNDAWETAWLP D   ++SRAPWE D S S P A+AE   A V   P + D ET
Sbjct: 43  KNADDEWNDAWETAWLPPDPPSENSRAPWETDESLS-PAAAAE---AVVI--PSDADPET 96

Query: 183 QAFVEEMNENWDQR-KGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQK 359
           +AFVE+MN NWD+R K K  KK     +G                 LENIK+DYRL KQ+
Sbjct: 97  KAFVEDMNANWDERRKPKEEKKQEQSGSGELYS-------------LENIKKDYRLKKQR 143

Query: 360 IHASLWVKEIEKLEEAKLGNS-ISGADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKN 533
           IHA LW+KEIEK +EAKL +S I G DDIE+LLDS S IFD+ NNDL N K+ S  DFKN
Sbjct: 144 IHAGLWMKEIEKQQEAKLADSGIGGGDDIERLLDSCSNIFDTTNNDLQNAKVPSADDFKN 203

Query: 534 KPDGWETMSKNPDGNVWDMS 593
           KPDGWET SK  +G+VW+M+
Sbjct: 204 KPDGWETTSKAQEGSVWEMT 223


>ref|XP_004144093.1| PREDICTED: uncharacterized protein LOC101222958 [Cucumis sativus]
           gi|449525482|ref|XP_004169746.1| PREDICTED:
           uncharacterized LOC101222958 [Cucumis sativus]
          Length = 333

 Score =  204 bits (518), Expect = 2e-50
 Identities = 102/197 (51%), Positives = 138/197 (70%), Gaps = 2/197 (1%)
 Frame = +3

Query: 9   GEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVETQA 188
           G++EWND WE+AWLP+DLS K+ +APWE+DV+F      + NP       P ++D +T+A
Sbjct: 59  GDDEWNDTWESAWLPEDLSPKN-KAPWESDVNFP-----SGNPTVVF---PSDVDADTKA 109

Query: 189 FVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKIHA 368
           FVE+M ENWD+R+   A +   +                    LENIK+DYRL KQ+IHA
Sbjct: 110 FVEDMTENWDERR--KASQTQKQGVNQEEKGGDGGGGGGSLYSLENIKKDYRLQKQRIHA 167

Query: 369 SLWVKEIEKLEEAKLGNSISGA-DDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKNKPD 542
           +LW+KEIEK EEA+LG+SI+G+ DDIE+L+DS SEIF+S N+DL   K+ S S+F+NKPD
Sbjct: 168 NLWMKEIEKQEEARLGDSIAGSGDDIERLMDSCSEIFESVNDDLNRSKVPSSSEFRNKPD 227

Query: 543 GWETMSKNPDGNVWDMS 593
           GWET+SK  DGNVW+M+
Sbjct: 228 GWETLSKGHDGNVWEMT 244


>ref|XP_002273560.1| PREDICTED: uncharacterized protein LOC100256690 [Vitis vinifera]
          Length = 299

 Score =  203 bits (516), Expect = 3e-50
 Identities = 103/197 (52%), Positives = 139/197 (70%), Gaps = 2/197 (1%)
 Frame = +3

Query: 9   GEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVETQA 188
           G++EWN+AWE+AWLP+DLS K+ RAPWE DV+FS       + ++A+  P  ++D ET+A
Sbjct: 39  GDDEWNNAWESAWLPEDLSAKN-RAPWETDVNFS-------SSESAIVLP-SDVDAETKA 89

Query: 189 FVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKIHA 368
           FVE+M ENW++R+    K+   R++                  LEN+K+DYRL KQ++HA
Sbjct: 90  FVEDMTENWNERRKGQQKQEEKRDS---------------LYNLENMKKDYRLKKQRLHA 134

Query: 369 SLWVKEIEKLEEAKLGNSIS-GADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKNKPD 542
            LW+KEIEK EEAKLG+ ++ G DDI++LLDS SEIFD+ NND  N  I S S+FKNKPD
Sbjct: 135 GLWMKEIEKQEEAKLGDLVAGGGDDIDRLLDSYSEIFDTGNNDFNNSNIPSSSEFKNKPD 194

Query: 543 GWETMSKNPDGNVWDMS 593
           GWET+SK  DGN+W+MS
Sbjct: 195 GWETISKAQDGNIWEMS 211


>ref|XP_006395751.1| hypothetical protein EUTSA_v10004617mg [Eutrema salsugineum]
           gi|557092390|gb|ESQ33037.1| hypothetical protein
           EUTSA_v10004617mg [Eutrema salsugineum]
          Length = 317

 Score =  202 bits (515), Expect = 4e-50
 Identities = 105/200 (52%), Positives = 136/200 (68%), Gaps = 3/200 (1%)
 Frame = +3

Query: 3   KTGEEEWNDAWETAWLPDDLSGKSSRAPWEADVSFSLPPASAENPQAAVASPPEEIDVET 182
           +  E++WNDAWE+AWLP+DL+ KS RAPWE DV+F            A  S  EEIDVE 
Sbjct: 44  RNDEDKWNDAWESAWLPEDLTDKS-RAPWETDVNFP------STESTAAISSSEEIDVEA 96

Query: 183 QAFVEEMNENWDQRKGKSAKKVPDRNAGXXXXXXXXXXXXXXXXXLENIKRDYRLTKQKI 362
           +AFVE+MNE+W +R+GKS K+                        LE +K+DYRL KQ++
Sbjct: 97  KAFVEDMNEHWSERRGKSGKEEKKERK---------EIEGESLYSLETMKKDYRLKKQRV 147

Query: 363 HASLWVKEIEKLEEAKLGNSIS--GADDIEKLLDSASEIFDSANNDLGNPKI-SGSDFKN 533
           HASLWVKEIEKLEEAKLG+S +  GADDI++LLDS SEIFDS ++D    ++ SG++ KN
Sbjct: 148 HASLWVKEIEKLEEAKLGDSGAGGGADDIDRLLDSCSEIFDSVDHDFDKLEVSSGTELKN 207

Query: 534 KPDGWETMSKNPDGNVWDMS 593
           KPDGWE+ +K  DGN+W+MS
Sbjct: 208 KPDGWESTAKEQDGNLWEMS 227