BLASTX nr result

ID: Mentha26_contig00038007 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00038007
         (1023 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI37358.3| unnamed protein product [Vitis vinifera]              236   1e-59
ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum]      226   2e-56
ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266...   219   2e-54
emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera]   219   2e-54
ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252...   214   6e-53
emb|CAN74654.1| hypothetical protein VITISV_022993 [Vitis vinifera]   211   4e-52
ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citr...   209   1e-51
ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627...   206   2e-50
ref|XP_007209070.1| hypothetical protein PRUPE_ppa000035mg [Prun...   199   1e-48
ref|XP_007157291.1| hypothetical protein PHAVU_002G057800g [Phas...   197   4e-48
ref|XP_007039813.1| G2484-1 protein, putative isoform 6 [Theobro...   193   1e-46
ref|XP_007039812.1| G2484-1 protein, putative isoform 5 [Theobro...   193   1e-46
ref|XP_007039811.1| G2484-1 protein, putative isoform 4 [Theobro...   193   1e-46
ref|XP_007039808.1| G2484-1 protein, putative isoform 1 [Theobro...   193   1e-46
ref|XP_002530649.1| conserved hypothetical protein [Ricinus comm...   193   1e-46
ref|XP_002868073.1| hypothetical protein ARALYDRAFT_329795 [Arab...   192   2e-46
ref|XP_006573722.1| PREDICTED: uncharacterized protein LOC100792...   192   2e-46
ref|XP_006573716.1| PREDICTED: uncharacterized protein LOC100792...   192   2e-46
ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max]            191   3e-46
ref|XP_007155669.1| hypothetical protein PHAVU_003G221300g [Phas...   191   3e-46

>emb|CBI37358.3| unnamed protein product [Vitis vinifera]
          Length = 1979

 Score =  236 bits (602), Expect = 1e-59
 Identities = 133/263 (50%), Positives = 180/263 (68%), Gaps = 9/263 (3%)
 Frame = +2

Query: 209 TYFFWDTQMDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVE 385
           TYFFWDT MDY+DND++ QNL LA E S+K   VL P+ALPKFDFDD+L GHLRFDSLVE
Sbjct: 40  TYFFWDTPMDYDDNDFQSQNLRLAGEGSAKFPPVLGPYALPKFDFDDSLQGHLRFDSLVE 99

Query: 386 NEVFLGISSQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKA 565
            EVFLGI SQED+ WIEDFSRG +GIEF SSAAESC++ R  NVWSEATSSESVEMLLK+
Sbjct: 100 TEVFLGIESQEDNQWIEDFSRGSSGIEFSSSAAESCSISRRNNVWSEATSSESVEMLLKS 159

Query: 566 VGQEEMVPGENMIEESDPGNQLGSSTRVVD-----NNSGDARKTDDVDDG--IAPADAVG 724
           VGQEE+VPG+  +++S   ++LGS T+ ++     +NS  +   + +D G  I P + +G
Sbjct: 160 VGQEEIVPGQTTVKDSGACDELGSITKQMEHNLKPDNSNLSNVGNVIDSGPTIRPDEFLG 219

Query: 725 ISFTSCQTSAVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLV-MKEADSSQ 901
            SF     S +  +  +   Q+++T  +  G  +  + S+    TE + L+  K+ D++Q
Sbjct: 220 -SF-----SVLNKDAGKELPQIEDTSQTREGDSLAYRSSTDLPVTEGNMLIDSKDDDANQ 273

Query: 902 GETCGLVDESLSHQMQEELPLHG 970
           GE   LV+ESL++  Q++    G
Sbjct: 274 GEIDTLVNESLNNNTQDDFSASG 296


>ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum]
          Length = 2181

 Score =  226 bits (575), Expect = 2e-56
 Identities = 129/241 (53%), Positives = 160/241 (66%), Gaps = 7/241 (2%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDYNDNDY+    HLA E+SSK+S VL P+ALPKFDFDD+L GHLRFDSLVENEVFLGI 
Sbjct: 1   MDYNDNDYQS---HLAGEDSSKVSSVLHPYALPKFDFDDSLQGHLRFDSLVENEVFLGIP 57

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           +QED+HWIEDFSRG +GIEF SSA +SC++PR  NVWSEATS+ESVEMLLK+V QEEMVP
Sbjct: 58  TQEDNHWIEDFSRGSSGIEFSSSATDSCSIPRRNNVWSEATSTESVEMLLKSVRQEEMVP 117

Query: 590 GENMIEESDPGNQLGSSTRVVDNNSGDARKTDDVDD--GIAPAD---AVGISFTSCQTSA 754
           G+ +IEESD GN+LG   +  +++     K DDV D    APAD       SF+ C+ + 
Sbjct: 118 GDTIIEESDAGNELGCLIQPAESSLKLDDKRDDVKDSSSAAPADESVEFSGSFSRCERTK 177

Query: 755 VESEQAECTLQVQETKLSSFGVG-IDNKDSSLALATENSNLVMKEADSSQGETCGLVDES 931
           +E     C  + QE +  + G   I  +  S     E     +K  D + GE    + ES
Sbjct: 178 IEGIHIVCAPERQEVEPIADGCSDIAGETYSGFNTEEKLQTEIKSIDENLGEVKTSLSES 237

Query: 932 L 934
           L
Sbjct: 238 L 238


>ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266068 [Vitis vinifera]
          Length = 2292

 Score =  219 bits (557), Expect = 2e-54
 Identities = 126/255 (49%), Positives = 173/255 (67%), Gaps = 9/255 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNL LA E S+K   VL P+ALPKFDFDD+L GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLRLAGEGSAKFPPVLGPYALPKFDFDDSLQGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           SQED+ WIEDFSRG +GIEF SSAAESC++ R  NVWSEATSSESVEMLLK+VGQEE+VP
Sbjct: 61  SQEDNQWIEDFSRGSSGIEFSSSAAESCSISRRNNVWSEATSSESVEMLLKSVGQEEIVP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD-----NNSGDARKTDDVDDG--IAPADAVGISFTSCQT 748
           G+  +++S   ++LGS T+ ++     +NS  +   + +D G  I P + +G SF     
Sbjct: 121 GQTTVKDSGACDELGSITKQMEHNLKPDNSNLSNVGNVIDSGPTIRPDEFLG-SF----- 174

Query: 749 SAVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLV-MKEADSSQGETCGLVD 925
           S +  +  +   Q+++T  +  G  +  + S+    TE + L+  K+ D++QGE   LV+
Sbjct: 175 SVLNKDAGKELPQIEDTSQTREGDSLAYRSSTDLPVTEGNMLIDSKDDDANQGEIDTLVN 234

Query: 926 ESLSHQMQEELPLHG 970
           ESL++  Q++    G
Sbjct: 235 ESLNNNTQDDFSASG 249


>emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera]
          Length = 2321

 Score =  219 bits (557), Expect = 2e-54
 Identities = 126/255 (49%), Positives = 173/255 (67%), Gaps = 9/255 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNL LA E S+K   VL P+ALPKFDFDD+L GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLRLAGEGSAKFPPVLGPYALPKFDFDDSLQGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           SQED+ WIEDFSRG +GIEF SSAAESC++ R  NVWSEATSSESVEMLLK+VGQEE+VP
Sbjct: 61  SQEDNQWIEDFSRGSSGIEFSSSAAESCSISRRNNVWSEATSSESVEMLLKSVGQEEIVP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD-----NNSGDARKTDDVDDG--IAPADAVGISFTSCQT 748
           G+  +++S   ++LGS T+ ++     +NS  +   + +D G  I P + +G SF     
Sbjct: 121 GQTTVKDSGACDELGSITKQMEHNLKPDNSNLSNVGNVIDSGPTIRPDEFLG-SF----- 174

Query: 749 SAVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLV-MKEADSSQGETCGLVD 925
           S +  +  +   Q+++T  +  G  +  + S+    TE + L+  K+ D++QGE   LV+
Sbjct: 175 SVLNKDAGKELPQIEDTSQTREGDSLAYRSSTDLPVTEGNMLIDSKDDDANQGEIDTLVN 234

Query: 926 ESLSHQMQEELPLHG 970
           ESL++  Q++    G
Sbjct: 235 ESLNNNTQDDFSASG 249


>ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252108 [Solanum
           lycopersicum]
          Length = 2155

 Score =  214 bits (544), Expect = 6e-53
 Identities = 125/252 (49%), Positives = 163/252 (64%), Gaps = 8/252 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDYNDNDY+    HLA E+SSK+S VL P+ALPKFDFDD      RFDSLVENEVFLGI 
Sbjct: 1   MDYNDNDYQS---HLAGEDSSKVSSVLHPYALPKFDFDD------RFDSLVENEVFLGIP 51

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           +QED+HWIEDFSRG +GIEF SSA +SC++PR  NVWSEATS+ESVEMLLK+VGQE+MVP
Sbjct: 52  TQEDNHWIEDFSRGSSGIEFSSSATDSCSIPRRNNVWSEATSTESVEMLLKSVGQEDMVP 111

Query: 590 GENMIEESDPGNQLGSSTRVVDNNSGDARKTDDVDDGIAPADAV-----GISFTSCQTSA 754
           G+ +IEESD GN+LG   +  +++     K DDV + I+   AV       SF+ C+ + 
Sbjct: 112 GDTIIEESDAGNELGCLIQPAESSLKLDDKQDDVKNSISATPAVESVELSGSFSRCERTK 171

Query: 755 VESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKEADSSQGETCGLVDESL 934
           +E+  + C  + QE  +     G    ++   L TE     +K  D + GE      ESL
Sbjct: 172 IEAIHSVCAPERQE--VGPIADGCSGVNTEEKLQTE-----VKSIDENLGEVRTAQSESL 224

Query: 935 --SHQMQEELPL 964
             ++  Q  +P+
Sbjct: 225 PDNYNRQPSIPV 236


>emb|CAN74654.1| hypothetical protein VITISV_022993 [Vitis vinifera]
          Length = 644

 Score =  211 bits (537), Expect = 4e-52
 Identities = 124/257 (48%), Positives = 172/257 (66%), Gaps = 11/257 (4%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNL LA E S+K   VL P+ALPKFDFDD+L GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLRLAGEGSAKFPPVLGPYALPKFDFDDSLQGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           SQED+ WIEDFSRG +GIEF SSAAESC++ R  NVWSEATSSESVE+LLK+VGQEE+VP
Sbjct: 61  SQEDNQWIEDFSRGSSGIEFSSSAAESCSISRRNNVWSEATSSESVEILLKSVGQEEIVP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD-----NNSGDARKTDDVDDG--IAPADAVGISFTSCQT 748
           G+  +++S   ++LGS T+ ++     +NS  +   + +D G  I P + +G       +
Sbjct: 121 GQTTVKDSGACDELGSITKQMEHNLKPDNSNLSNVGNVIDSGPTIRPDEFLG-------S 173

Query: 749 SAVESEQAECTL-QVQETKLSSFGVGIDNKDSSLALATENSNLV--MKEADSSQGETCGL 919
            +V +E AE  L Q+++T  +  G  +  + S+     E + L+    + D++Q E   L
Sbjct: 174 FSVLNEDAEKELPQIEDTSQTREGDSLAYRSSTDLPVIEGNMLIDSKDDDDANQREIDTL 233

Query: 920 VDESLSHQMQEELPLHG 970
           V+ESL++  Q++    G
Sbjct: 234 VNESLNNNTQDDFSASG 250


>ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citrus clementina]
           gi|567895620|ref|XP_006440298.1| hypothetical protein
           CICLE_v10018443mg [Citrus clementina]
           gi|567895622|ref|XP_006440299.1| hypothetical protein
           CICLE_v10018443mg [Citrus clementina]
           gi|557542559|gb|ESR53537.1| hypothetical protein
           CICLE_v10018443mg [Citrus clementina]
           gi|557542560|gb|ESR53538.1| hypothetical protein
           CICLE_v10018443mg [Citrus clementina]
           gi|557542561|gb|ESR53539.1| hypothetical protein
           CICLE_v10018443mg [Citrus clementina]
          Length = 2155

 Score =  209 bits (532), Expect = 1e-51
 Identities = 119/253 (47%), Positives = 162/253 (64%), Gaps = 7/253 (2%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDYNDN+++ QNL LA E ++K   VLRP+ALPKFDFDD+L+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYNDNEFQSQNLQLAGEGNTKFPPVLRPYALPKFDFDDSLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIE++SRGG+GIEFR+SAAESC++ RHINVWSEATSSESVEMLLK+VGQEE +P
Sbjct: 61  SNEDNQWIEEYSRGGSGIEFRTSAAESCSISRHINVWSEATSSESVEMLLKSVGQEENIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD----NNSGDARKTDDVDD--GIAPADAVGISFTSCQTS 751
           G+ ++ ESD  ++LG   + ++    +N  +  K  DV D   I P D VG         
Sbjct: 121 GKTIMRESDACDELGCVVKQMELGPKHNDDNLSKGGDVVDIRPIVPPDGVG--------- 171

Query: 752 AVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKEADSSQGETCGLVDES 931
                Q +     Q+ K  S   G  +  +S  ++ +   ++ KE+ +          ES
Sbjct: 172 ---GGQPQADASFQKNKCESSVDGGLSDPASDGISGKGDIVLSKESYTVDQRKVDTFIES 228

Query: 932 LSHQMQEELPLHG 970
           L+++ +E+    G
Sbjct: 229 LNNRTEEDSSASG 241


>ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627454 isoform X1 [Citrus
           sinensis] gi|568846679|ref|XP_006477175.1| PREDICTED:
           uncharacterized protein LOC102627454 isoform X2 [Citrus
           sinensis] gi|568846681|ref|XP_006477176.1| PREDICTED:
           uncharacterized protein LOC102627454 isoform X3 [Citrus
           sinensis]
          Length = 2155

 Score =  206 bits (523), Expect = 2e-50
 Identities = 119/253 (47%), Positives = 162/253 (64%), Gaps = 7/253 (2%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDYNDN+++ QNL LA E ++K   VLRP+ALPKFDFDD+L+G+LRFDSLVE EVFLGI 
Sbjct: 1   MDYNDNEFQSQNLQLAGEGNTKFPPVLRPYALPKFDFDDSLHGNLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIE++SRGG+GIEFR+SAAESC++ RHINVWSEATSSESVEMLLK+VGQEE +P
Sbjct: 61  SNEDNQWIEEYSRGGSGIEFRTSAAESCSISRHINVWSEATSSESVEMLLKSVGQEENIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD----NNSGDARKTDDVDD--GIAPADAVGISFTSCQTS 751
           G+ ++ ESD  ++LG   + ++    +N  +  K  DV D   I P D VG        S
Sbjct: 121 GKTIMRESDACDELGCVVKQMELGPKHNDDNLSKGGDVVDIRPIVPPDGVGGGQPQADAS 180

Query: 752 AVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKEADSSQGETCGLVDES 931
               ++ +C   V          GI  K   + L+ E+  +  ++ D+          ES
Sbjct: 181 ---FQKNKCESSVDGGLSDPVSDGISGK-GDIVLSKESFTVDQRKVDT--------FIES 228

Query: 932 LSHQMQEELPLHG 970
           L+++ +E+    G
Sbjct: 229 LNNRTEEDSSASG 241


>ref|XP_007209070.1| hypothetical protein PRUPE_ppa000035mg [Prunus persica]
           gi|462404805|gb|EMJ10269.1| hypothetical protein
           PRUPE_ppa000035mg [Prunus persica]
          Length = 2263

 Score =  199 bits (507), Expect = 1e-48
 Identities = 109/213 (51%), Positives = 143/213 (67%), Gaps = 7/213 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++    VLRP+ALPKF+FDD+L+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHLAGEGNTNYPPVLRPYALPKFEFDDSLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S E +HWIEDFSRG +GIEF SSAAESC++ R  NVWSEATSSESVEMLLK+VGQEE++P
Sbjct: 61  SSETNHWIEDFSRGSSGIEFNSSAAESCSISRRNNVWSEATSSESVEMLLKSVGQEEIIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD----NNSGDARKTDDVDD--GIAPADAVGISFTSCQTS 751
            + + EE D   +L   T+ ++    N+     + +DV D     P D +  + +  +  
Sbjct: 121 PQTIFEELDACKELHCLTKQMEPSFNNDDNILSQMEDVTDLQPTLPQDDIPENISGIEDV 180

Query: 752 AVESEQAECTLQVQETKLSSFGVGIDNKDSSLA 850
            V+  + E   Q  E KLS  G   D   ++L+
Sbjct: 181 GVDQLRVEDASQTHEGKLSVAGNSGDLDPNALS 213


>ref|XP_007157291.1| hypothetical protein PHAVU_002G057800g [Phaseolus vulgaris]
           gi|593788506|ref|XP_007157292.1| hypothetical protein
           PHAVU_002G057800g [Phaseolus vulgaris]
           gi|561030706|gb|ESW29285.1| hypothetical protein
           PHAVU_002G057800g [Phaseolus vulgaris]
           gi|561030707|gb|ESW29286.1| hypothetical protein
           PHAVU_002G057800g [Phaseolus vulgaris]
          Length = 2169

 Score =  197 bits (502), Expect = 4e-48
 Identities = 109/221 (49%), Positives = 145/221 (65%), Gaps = 3/221 (1%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E S+K   VLRP+ALPKFDFD+NL  +LRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQNQNLHLAGEGSAKFPPVLRPYALPKFDFDENLQANLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WI+ FSRGG+GIEF S+AAESC++ RH NVWSEATSSESVEMLLK+VGQE+ +P
Sbjct: 61  SNEDNQWIDAFSRGGSGIEFSSTAAESCSISRHGNVWSEATSSESVEMLLKSVGQEDYIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNNS--GDARKTDDVDDGIAPADAVGISFTSCQTSAVES 763
            + +I+ESD  ++L    + +D N    D  +  D    + P+     SF+  +      
Sbjct: 121 RQTVIQESDACDELACLAKQMDTNPKFEDRNEFKDSISDVHPSGGTHASFSGLKEDVGMD 180

Query: 764 EQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKE 886
           +  +   Q  E +LS  G    + +  L+    N++L M E
Sbjct: 181 KSEDGLSQGHEGELSFDGA---SSNPELSDIHGNNDLPMSE 218


>ref|XP_007039813.1| G2484-1 protein, putative isoform 6 [Theobroma cacao]
           gi|508777058|gb|EOY24314.1| G2484-1 protein, putative
           isoform 6 [Theobroma cacao]
          Length = 2138

 Score =  193 bits (490), Expect = 1e-46
 Identities = 112/252 (44%), Positives = 164/252 (65%), Gaps = 11/252 (4%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++K   VLRP+ALP+FDFDDNL+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHLAGEGNNKFPPVLRPYALPRFDFDDNLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIEDFSRG TGI F SSAAE C++ R  NVWSEA SSESVEMLLK+VGQ+E +P
Sbjct: 61  SSEDNQWIEDFSRGSTGIVFSSSAAEPCSISRRNNVWSEAASSESVEMLLKSVGQDETIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN--SGDARKTDDVDDGIAPADAVGI---SFTSCQTS- 751
           G+ + ++SD  ++LG   + ++ +   GD+  + +  DG+ PA   G     F+  + + 
Sbjct: 121 GQIISKDSDACDELGCIIKQMEPSLKHGDSGLSKE-GDGLRPALQAGEIPGKFSGLKGNV 179

Query: 752 AVESEQAECTLQVQETKLSSFGVGID----NKDSSLALATENSNLVMKEADSSQGETCGL 919
             +    E   Q+ E + +  G   D    ++++ L +   + +   ++   ++ +   L
Sbjct: 180 GGDHPLVEDVSQMHEGEPTVDGAFKDPNTISRNTDLPVTERDKSKDCEQIVVNENQVDAL 239

Query: 920 VDESLSHQMQEE 955
           VD+S+ ++ QE+
Sbjct: 240 VDQSVDNRGQED 251


>ref|XP_007039812.1| G2484-1 protein, putative isoform 5 [Theobroma cacao]
           gi|508777057|gb|EOY24313.1| G2484-1 protein, putative
           isoform 5 [Theobroma cacao]
          Length = 2151

 Score =  193 bits (490), Expect = 1e-46
 Identities = 112/252 (44%), Positives = 164/252 (65%), Gaps = 11/252 (4%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++K   VLRP+ALP+FDFDDNL+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHLAGEGNNKFPPVLRPYALPRFDFDDNLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIEDFSRG TGI F SSAAE C++ R  NVWSEA SSESVEMLLK+VGQ+E +P
Sbjct: 61  SSEDNQWIEDFSRGSTGIVFSSSAAEPCSISRRNNVWSEAASSESVEMLLKSVGQDETIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN--SGDARKTDDVDDGIAPADAVGI---SFTSCQTS- 751
           G+ + ++SD  ++LG   + ++ +   GD+  + +  DG+ PA   G     F+  + + 
Sbjct: 121 GQIISKDSDACDELGCIIKQMEPSLKHGDSGLSKE-GDGLRPALQAGEIPGKFSGLKGNV 179

Query: 752 AVESEQAECTLQVQETKLSSFGVGID----NKDSSLALATENSNLVMKEADSSQGETCGL 919
             +    E   Q+ E + +  G   D    ++++ L +   + +   ++   ++ +   L
Sbjct: 180 GGDHPLVEDVSQMHEGEPTVDGAFKDPNTISRNTDLPVTERDKSKDCEQIVVNENQVDAL 239

Query: 920 VDESLSHQMQEE 955
           VD+S+ ++ QE+
Sbjct: 240 VDQSVDNRGQED 251


>ref|XP_007039811.1| G2484-1 protein, putative isoform 4 [Theobroma cacao]
           gi|508777056|gb|EOY24312.1| G2484-1 protein, putative
           isoform 4 [Theobroma cacao]
          Length = 2110

 Score =  193 bits (490), Expect = 1e-46
 Identities = 112/252 (44%), Positives = 164/252 (65%), Gaps = 11/252 (4%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++K   VLRP+ALP+FDFDDNL+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHLAGEGNNKFPPVLRPYALPRFDFDDNLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIEDFSRG TGI F SSAAE C++ R  NVWSEA SSESVEMLLK+VGQ+E +P
Sbjct: 61  SSEDNQWIEDFSRGSTGIVFSSSAAEPCSISRRNNVWSEAASSESVEMLLKSVGQDETIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN--SGDARKTDDVDDGIAPADAVGI---SFTSCQTS- 751
           G+ + ++SD  ++LG   + ++ +   GD+  + +  DG+ PA   G     F+  + + 
Sbjct: 121 GQIISKDSDACDELGCIIKQMEPSLKHGDSGLSKE-GDGLRPALQAGEIPGKFSGLKGNV 179

Query: 752 AVESEQAECTLQVQETKLSSFGVGID----NKDSSLALATENSNLVMKEADSSQGETCGL 919
             +    E   Q+ E + +  G   D    ++++ L +   + +   ++   ++ +   L
Sbjct: 180 GGDHPLVEDVSQMHEGEPTVDGAFKDPNTISRNTDLPVTERDKSKDCEQIVVNENQVDAL 239

Query: 920 VDESLSHQMQEE 955
           VD+S+ ++ QE+
Sbjct: 240 VDQSVDNRGQED 251


>ref|XP_007039808.1| G2484-1 protein, putative isoform 1 [Theobroma cacao]
           gi|590676695|ref|XP_007039809.1| G2484-1 protein,
           putative isoform 1 [Theobroma cacao]
           gi|590676698|ref|XP_007039810.1| G2484-1 protein,
           putative isoform 1 [Theobroma cacao]
           gi|508777053|gb|EOY24309.1| G2484-1 protein, putative
           isoform 1 [Theobroma cacao] gi|508777054|gb|EOY24310.1|
           G2484-1 protein, putative isoform 1 [Theobroma cacao]
           gi|508777055|gb|EOY24311.1| G2484-1 protein, putative
           isoform 1 [Theobroma cacao]
          Length = 2123

 Score =  193 bits (490), Expect = 1e-46
 Identities = 112/252 (44%), Positives = 164/252 (65%), Gaps = 11/252 (4%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++K   VLRP+ALP+FDFDDNL+GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHLAGEGNNKFPPVLRPYALPRFDFDDNLHGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WIEDFSRG TGI F SSAAE C++ R  NVWSEA SSESVEMLLK+VGQ+E +P
Sbjct: 61  SSEDNQWIEDFSRGSTGIVFSSSAAEPCSISRRNNVWSEAASSESVEMLLKSVGQDETIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN--SGDARKTDDVDDGIAPADAVGI---SFTSCQTS- 751
           G+ + ++SD  ++LG   + ++ +   GD+  + +  DG+ PA   G     F+  + + 
Sbjct: 121 GQIISKDSDACDELGCIIKQMEPSLKHGDSGLSKE-GDGLRPALQAGEIPGKFSGLKGNV 179

Query: 752 AVESEQAECTLQVQETKLSSFGVGID----NKDSSLALATENSNLVMKEADSSQGETCGL 919
             +    E   Q+ E + +  G   D    ++++ L +   + +   ++   ++ +   L
Sbjct: 180 GGDHPLVEDVSQMHEGEPTVDGAFKDPNTISRNTDLPVTERDKSKDCEQIVVNENQVDAL 239

Query: 920 VDESLSHQMQEE 955
           VD+S+ ++ QE+
Sbjct: 240 VDQSVDNRGQED 251


>ref|XP_002530649.1| conserved hypothetical protein [Ricinus communis]
           gi|223529782|gb|EEF31718.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 2104

 Score =  193 bits (490), Expect = 1e-46
 Identities = 115/249 (46%), Positives = 156/249 (62%), Gaps = 9/249 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           M+Y+DND++ QNLHLA E S+K S VLRP+ALPKFDFDD+L+G LRFDSLVE EVFLGI 
Sbjct: 1   MEYDDNDFQSQNLHLAGEGSNKFSPVLRPYALPKFDFDDSLHGSLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S E+S WIED+SRG +GI+F SSAAESCA+ R  NVWSEATSSESVEMLLK+VGQEE++P
Sbjct: 61  SNENSQWIEDYSRGSSGIQFSSSAAESCAISRRNNVWSEATSSESVEMLLKSVGQEELIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNNSGDARKTDDVDDGIAPADAVGISFTSCQT-SAVESE 766
            +   +ES+  ++LG   + ++ +      T      +A   +  +     +  S ++  
Sbjct: 121 AQTNTKESNACDELGCIIKPMEPSLKQESNTPARVGDVANLQSTLLPGEFPENFSMLDES 180

Query: 767 QAECTLQVQETKLSSFG-VGIDNKDSSLALATENSNLVM------KEADSSQGETCGLVD 925
             E   Q++++ L+  G V +D   S L+       L +      K  D +Q E      
Sbjct: 181 GGEQQAQLEDSLLTHKGDVSVDQSLSDLSAVNVEVRLPISGLIDGKSDDVNQREVNITNS 240

Query: 926 ESLSHQMQE 952
           ESL  +MQE
Sbjct: 241 ESLDTRMQE 249


>ref|XP_002868073.1| hypothetical protein ARALYDRAFT_329795 [Arabidopsis lyrata subsp.
           lyrata] gi|297313909|gb|EFH44332.1| hypothetical protein
           ARALYDRAFT_329795 [Arabidopsis lyrata subsp. lyrata]
          Length = 1744

 Score =  192 bits (488), Expect = 2e-46
 Identities = 105/242 (43%), Positives = 152/242 (62%), Gaps = 1/242 (0%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E ++K   VL+P+ALPKFDFDD LN HLRFDSL E+E FLGI 
Sbjct: 1   MDYDDNDFQNQNLHLAGEANNKFPPVLQPYALPKFDFDDTLNTHLRFDSLGESEAFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
             ED++WIEDFSRG +GI F S A ESCA+PRH NVWSEATSSESVEMLL +VGQ+E++ 
Sbjct: 61  GNEDNNWIEDFSRGSSGIVFSSGATESCAIPRHNNVWSEATSSESVEMLLNSVGQDEVIV 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNNSGDARKTDDVDDGIAPADAVGISFTSCQTSAVESEQ 769
            E+ I++SD  ++LG +   ++       K+   ++ +       +  TS + S V+++ 
Sbjct: 121 REDTIKKSDTSDELGCTMEPMEPGQTSHEKSPSKEETVNLQPNPSVDDTSGEFSVVKTDD 180

Query: 770 AECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKEADSSQGETCGLVDESLSHQMQ 949
            +  + V++   S   V   + +   A+ T N+  V     +   +      ++L HQ +
Sbjct: 181 GQEQVLVKDD--SPTAVEEASVEEKNAILTSNTATVEAVQTAGLDKIGPESTDNLRHQTE 238

Query: 950 EE 955
           E+
Sbjct: 239 EK 240


>ref|XP_006573722.1| PREDICTED: uncharacterized protein LOC100792961 isoform X7 [Glycine
           max]
          Length = 2102

 Score =  192 bits (487), Expect = 2e-46
 Identities = 93/144 (64%), Positives = 117/144 (81%), Gaps = 1/144 (0%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E S+K   VLRP+ALPKFDFD++L  +LRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQNQNLHLAGEGSAKFPPVLRPYALPKFDFDESLQANLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WI+ FSRGG+GIEF S+AAESC++ RH NVWSEATSSESVEMLLK+VGQE+ +P
Sbjct: 61  SNEDNQWIDTFSRGGSGIEFSSTAAESCSISRHGNVWSEATSSESVEMLLKSVGQEDYIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN 661
            + +I+ESD  ++L    + +D N
Sbjct: 121 RQTVIQESDACDELACLAKQMDTN 144


>ref|XP_006573716.1| PREDICTED: uncharacterized protein LOC100792961 isoform X1 [Glycine
           max] gi|571436299|ref|XP_006573717.1| PREDICTED:
           uncharacterized protein LOC100792961 isoform X2 [Glycine
           max] gi|571436301|ref|XP_006573718.1| PREDICTED:
           uncharacterized protein LOC100792961 isoform X3 [Glycine
           max] gi|571436303|ref|XP_006573719.1| PREDICTED:
           uncharacterized protein LOC100792961 isoform X4 [Glycine
           max] gi|571436305|ref|XP_006573720.1| PREDICTED:
           uncharacterized protein LOC100792961 isoform X5 [Glycine
           max] gi|571436307|ref|XP_006573721.1| PREDICTED:
           uncharacterized protein LOC100792961 isoform X6 [Glycine
           max]
          Length = 2142

 Score =  192 bits (487), Expect = 2e-46
 Identities = 93/144 (64%), Positives = 117/144 (81%), Gaps = 1/144 (0%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E S+K   VLRP+ALPKFDFD++L  +LRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQNQNLHLAGEGSAKFPPVLRPYALPKFDFDESLQANLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WI+ FSRGG+GIEF S+AAESC++ RH NVWSEATSSESVEMLLK+VGQE+ +P
Sbjct: 61  SNEDNQWIDTFSRGGSGIEFSSTAAESCSISRHGNVWSEATSSESVEMLLKSVGQEDYIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN 661
            + +I+ESD  ++L    + +D N
Sbjct: 121 RQTVIQESDACDELACLAKQMDTN 144


>ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max]
          Length = 2135

 Score =  191 bits (486), Expect = 3e-46
 Identities = 93/144 (64%), Positives = 117/144 (81%), Gaps = 1/144 (0%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLHLA E S+K   VLRP+ALPKFDFD++L  +LRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQNQNLHLAGEGSAKFPPVLRPYALPKFDFDESLQANLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WI+ FSRGG+GIEF S+AAESC++ RH NVWSEATSSESVEMLLK+VGQE+ +P
Sbjct: 61  SNEDNQWIDAFSRGGSGIEFSSTAAESCSISRHGNVWSEATSSESVEMLLKSVGQEDYIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVDNN 661
            + +I+ESD  ++L    + +D N
Sbjct: 121 RQTVIQESDACDELACLAKQMDTN 144


>ref|XP_007155669.1| hypothetical protein PHAVU_003G221300g [Phaseolus vulgaris]
           gi|561029023|gb|ESW27663.1| hypothetical protein
           PHAVU_003G221300g [Phaseolus vulgaris]
          Length = 2281

 Score =  191 bits (486), Expect = 3e-46
 Identities = 115/254 (45%), Positives = 154/254 (60%), Gaps = 8/254 (3%)
 Frame = +2

Query: 233 MDYNDNDYEGQNLHLASEESSKIS-VLRPFALPKFDFDDNLNGHLRFDSLVENEVFLGIS 409
           MDY+DND++ QNLH+  E S+K   VLRP+ALPKFD D++L GHLRFDSLVE EVFLGI 
Sbjct: 1   MDYDDNDFQSQNLHITGEGSTKFPPVLRPYALPKFDLDESLQGHLRFDSLVETEVFLGIE 60

Query: 410 SQEDSHWIEDFSRGGTGIEFRSSAAESCALPRHINVWSEATSSESVEMLLKAVGQEEMVP 589
           S ED+ WI+ +SRG +GIEF S+AAESC++ RH NVWSEATSSESVEMLLK+VGQEE +P
Sbjct: 61  SNEDNQWIDAYSRGSSGIEFGSTAAESCSISRHNNVWSEATSSESVEMLLKSVGQEEFIP 120

Query: 590 GENMIEESDPGNQLGSSTRVVD---NNSGDARKTDDVDDGIAPA----DAVGISFTSCQT 748
            E  I+ES+  ++L    + ++   N +      D V D   P     +  G+     + 
Sbjct: 121 RETDIQESNAFDELACLAKQMEPGPNPNNRNEYKDGVTDLQPPCFIHENLAGLKEAEREQ 180

Query: 749 SAVESEQAECTLQVQETKLSSFGVGIDNKDSSLALATENSNLVMKEADSSQGETCGLVDE 928
           S     Q E ++    + L    + + N D  L +A   S    K  D++QG+   + D 
Sbjct: 181 SQAVVSQGELSIDGSLSTLQPHDI-LGNVD--LPVARGFSFTDDKSDDANQGKVEIVADG 237

Query: 929 SLSHQMQEELPLHG 970
           SL  + QEE    G
Sbjct: 238 SLEEKTQEESAASG 251

