BLASTX nr result

ID: Mentha24_contig00029125 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00029125
         (1008 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU29092.1| hypothetical protein MIMGU_mgv1a0109492mg, partia...   311   4e-82
ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606...   281   2e-73
ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264...   279   1e-72
gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlise...   266   1e-68
ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266...   263   8e-68
ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211...   260   7e-67
ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302...   258   4e-66
ref|XP_007209323.1| hypothetical protein PRUPE_ppa008010mg [Prun...   257   5e-66
ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Popu...   247   6e-63
gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis]     244   3e-62
ref|XP_007040826.1| Uncharacterized protein isoform 2 [Theobroma...   244   4e-62
ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citr...   243   1e-61
ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Popu...   241   4e-61
emb|CBI37457.3| unnamed protein product [Vitis vinifera]              239   1e-60
ref|XP_007040825.1| Uncharacterized protein isoform 1 [Theobroma...   230   6e-58
ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycin...   229   2e-57
ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813...   228   4e-57
ref|XP_007158154.1| hypothetical protein PHAVU_002G128700g [Phas...   225   2e-56
ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511...   223   7e-56
ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutr...   216   9e-54

>gb|EYU29092.1| hypothetical protein MIMGU_mgv1a0109492mg, partial [Mimulus
           guttatus]
          Length = 221

 Score =  311 bits (796), Expect = 4e-82
 Identities = 153/221 (69%), Positives = 171/221 (77%), Gaps = 15/221 (6%)
 Frame = +1

Query: 196 VDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHYDYYWASVFEVEYIDHSGQARSALAEA 375
           VDLRSSKVCELGLLNYKAK+VFYP ERKKYRCHYDYYWASVF+VEY DHSGQAR ALAEA
Sbjct: 1   VDLRSSKVCELGLLNYKAKYVFYPFERKKYRCHYDYYWASVFKVEYTDHSGQARFALAEA 60

Query: 376 PNEALPDNCRPSFGAAWLTKDKFK------------XXXXXXXXXXXXXXXAEDPSTVEM 519
           PNEALPD+CRPSFG AWLTKDKFK                           AE+PSTVEM
Sbjct: 61  PNEALPDDCRPSFGTAWLTKDKFKVNGTYECWYTLGISKVNINHKGLFNCQAEEPSTVEM 120

Query: 520 LKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGVISGFVTAFLSITLIGILYPFIAIIRRL 699
           LKRYSILF R+LKSWF +W S+  WRWD+IAG+I+GF+T+  SI  +G+L+PF+A IRRL
Sbjct: 121 LKRYSILFIRILKSWFYNWDSINQWRWDVIAGLITGFLTSLFSIAFVGLLHPFLAFIRRL 180

Query: 700 FA---SAAYPSTILLKRVCFFAVYFSFMGWVTIQYVKRLGL 813
            A   S  YPST+LLKR CFFA+YFS M WVTIQYVKRLGL
Sbjct: 181 VASWMSTPYPSTVLLKRACFFALYFSCMSWVTIQYVKRLGL 221


>ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606400 [Solanum tuberosum]
          Length = 301

 Score =  281 bits (720), Expect = 2e-73
 Identities = 140/249 (56%), Positives = 174/249 (69%), Gaps = 15/249 (6%)
 Frame = +1

Query: 112 MYLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRC 291
           +YLAV +GNLSI SPIS+ S+C+IVSS VDLRSSKVCELGLLNYKAKHV YP ERKK+RC
Sbjct: 44  LYLAVFLGNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRC 103

Query: 292 HYDYYWASVFEVEYIDHSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           HYDYYWASVF+VEY+DHSGQARSALAEAPNEALP +CRP+F  AWLTKDKF+        
Sbjct: 104 HYDYYWASVFKVEYMDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYECW 163

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAG 615
                              A+DPST+EM  RY ILF R+LKSW+ S    + WRW+ +AG
Sbjct: 164 YTLGISKVHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGVLYWHWRWEAVAG 223

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP---STILLKRVCFFAVYFSFMGWVT 786
           VI+GF T+ +++ L  +L  F + I +L     +    + I L+RVCF   Y SF  W+ 
Sbjct: 224 VIAGFCTSIMTVILFALLRKFFSCIYQLSVVRRFTLPFNKIRLRRVCFLLAYVSFTSWLA 283

Query: 787 IQYVKRLGL 813
           +QY++R+GL
Sbjct: 284 VQYLRRIGL 292


>ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264543 [Solanum
           lycopersicum]
          Length = 301

 Score =  279 bits (713), Expect = 1e-72
 Identities = 138/249 (55%), Positives = 172/249 (69%), Gaps = 15/249 (6%)
 Frame = +1

Query: 112 MYLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRC 291
           +YLAV +GNLSI SPIS+ S+C+IVSS VDLRSSKVCELGLLNYKAKHV YP ERKK+RC
Sbjct: 44  LYLAVFLGNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRC 103

Query: 292 HYDYYWASVFEVEYIDHSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           HYDYYWASVF+VEY+DHSGQARSALAEAPNEALP +CRP+F  AWLTKDKF+        
Sbjct: 104 HYDYYWASVFKVEYVDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYKCW 163

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAG 615
                              A+DPST+EM  RY ILF R+LKSW+ S    + WRW+ +AG
Sbjct: 164 YTLGISKVHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGVLYWHWRWEAVAG 223

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP---STILLKRVCFFAVYFSFMGWVT 786
           VI+GF T+ +++ L  +L    + I +L     +    + + L+RVCF   Y SF  W+ 
Sbjct: 224 VIAGFCTSIMTVILFALLRKLFSCIYQLSVVRRFTLPFNKVRLRRVCFLLAYVSFTSWLA 283

Query: 787 IQYVKRLGL 813
           +QY +R+GL
Sbjct: 284 VQYFRRIGL 292


>gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlisea aurea]
          Length = 260

 Score =  266 bits (679), Expect = 1e-68
 Identities = 131/247 (53%), Positives = 162/247 (65%), Gaps = 15/247 (6%)
 Frame = +1

Query: 112 MYLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRC 291
           M LA+   +L +WSPISV   CR+VSS VDLRSSKVC +G+LNY AK+V YPLE+ KYRC
Sbjct: 14  MNLAIYFESLRLWSPISVRCLCRVVSSSVDLRSSKVCAIGVLNYNAKNVLYPLEKNKYRC 73

Query: 292 HYDYYWASVFEVEYIDHSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           HYDYYWA++ +VE+ DH G  R ALAEAPNEALP NCRPSF  AWLTK KF         
Sbjct: 74  HYDYYWAAILKVEFTDHLGHERFALAEAPNEALPYNCRPSFSGAWLTKSKFMINETYDCW 133

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAG 615
                              A+DPST+EM+K +S L  R+LKS FS WGS+  WRWD+IAG
Sbjct: 134 YTLGISKVNINHEGLFNCRADDPSTLEMMKLHSTLLIRILKSSFSGWGSLMHWRWDVIAG 193

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFAS---AAYPSTILLKRVCFFAVYFSFMGWVT 786
           + +GF+TA L I L+ +++P I    RL  S     YP T+ LKR   F VY  FM W+T
Sbjct: 194 LTTGFITALLVIALVSLIWPLIQSTTRLLGSWLFIRYPITLFLKRAFVFTVYLMFMCWIT 253

Query: 787 IQYVKRL 807
           +QY++RL
Sbjct: 254 LQYLRRL 260


>ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266685 [Vitis vinifera]
          Length = 291

 Score =  263 bits (672), Expect = 8e-68
 Identities = 133/247 (53%), Positives = 170/247 (68%), Gaps = 14/247 (5%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           +AV +GNLS+ SP+SV S+C+IVSS VDLRSSKVCELGLLNYKAKHVFYPLE++K+RCHY
Sbjct: 37  VAVFVGNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHY 96

Query: 298 DYYWASVFEVEYIDHSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK---------- 447
           DYYWASVF+VEY D  GQ R  L EAPNEALP +CRP+FGAAWLTKDKFK          
Sbjct: 97  DYYWASVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYA 156

Query: 448 --XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGVI 621
                            A++PST+EM++RYSIL TR+L+SW +S G    WR + +AGVI
Sbjct: 157 SGISKVSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGKYWRLETVAGVI 216

Query: 622 SGFVTAFLSITLIGILYPFIAIIRRLFASAAY--PSTILLKRVCFFAVYFSFMGWVTIQY 795
           +GF T+ +SI+L+ IL+   + + R+          ++  +R  F   Y  FMGW+ I+Y
Sbjct: 217 TGFSTSLISISLVRILHQVKSWLPRILKRQLLLAVKSVRFRRAFFLVTYVIFMGWLAIEY 276

Query: 796 VKRLGLS 816
            KRLG+S
Sbjct: 277 GKRLGIS 283


>ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211843 [Cucumis sativus]
          Length = 303

 Score =  260 bits (664), Expect = 7e-67
 Identities = 133/245 (54%), Positives = 166/245 (67%), Gaps = 13/245 (5%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           +AV + N SI SPIS+ S+C+IVSS VDLRSSKVCELGLLNYKAK+VFYP ER K+RC Y
Sbjct: 47  VAVFVANSSITSPISLRSQCKIVSSSVDLRSSKVCELGLLNYKAKNVFYPYERNKFRCRY 106

Query: 298 DYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VE  DH SG+AR ALAEAPNEALP  CRP+FGAAWL K KFK         
Sbjct: 107 DYYWASVFKVEMKDHFSGKARVALAEAPNEALPHKCRPNFGAAWLAKYKFKVNETYDCWY 166

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A++P+T+EM+KRY  L T++L SW+SS      WRWD++ G+
Sbjct: 167 SSGISKVSLDYDGFSGCQAQEPTTIEMIKRYYFLCTKILLSWYSSKEKAIFWRWDMLGGL 226

Query: 619 ISGFVTAFLSITLIGILYPFIAIIRRLFASAAYPSTILLKRVCFFAVYFSFMGWVTIQYV 798
           ++GF T+ ++IT++ IL P I  + R F +  +   I L R CF   YFSF+GW+ IQY 
Sbjct: 227 VTGFSTSLITITVLRILQPLIPWMLRYFTTRFF---IHLNRACFLVAYFSFVGWLIIQYG 283

Query: 799 KRLGL 813
           KRL L
Sbjct: 284 KRLSL 288


>ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302166 [Fragaria vesca
           subsp. vesca]
          Length = 290

 Score =  258 bits (658), Expect = 4e-66
 Identities = 136/251 (54%), Positives = 167/251 (66%), Gaps = 19/251 (7%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           + V + N S+  PISV S+CRIVSS VDL+S+KVCELGLLNYKAK+VFYPLE +++RC Y
Sbjct: 34  VTVFVSNSSVPGPISVSSQCRIVSSSVDLKSAKVCELGLLNYKAKNVFYPLEGRRFRCRY 93

Query: 298 DYYWASVFEVEYID-HSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY D  SGQ R ALAEAP+EALP NCRP+FGAAWLTKDKFK         
Sbjct: 94  DYYWASVFKVEYQDLSSGQTRVALAEAPSEALPLNCRPNFGAAWLTKDKFKVNKTYDCWY 153

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A+DPST EM++RY IL T++L+SWF S      WRW+ + GV
Sbjct: 154 TYGVSQVSLYEDGFFSCQAKDPSTFEMIRRYFILLTKILQSWFLSQEPAMFWRWETMVGV 213

Query: 619 ISGFVTAFLSITLIGIL------YPFIAIIRRLFASAAYPSTILLKRVCFFAVYFSFMGW 780
           ++GF TA +SIT I ++       P I+  R L  S    S +L +R CF   Y SFMGW
Sbjct: 214 VTGFSTAMISITFIRLMQLLKSQLPQISARRMLTQSV---SAVLFRRTCFLVAYISFMGW 270

Query: 781 VTIQYVKRLGL 813
           +TI+Y KRLGL
Sbjct: 271 LTIEYGKRLGL 281


>ref|XP_007209323.1| hypothetical protein PRUPE_ppa008010mg [Prunus persica]
           gi|462405058|gb|EMJ10522.1| hypothetical protein
           PRUPE_ppa008010mg [Prunus persica]
          Length = 349

 Score =  257 bits (657), Expect = 5e-66
 Identities = 132/247 (53%), Positives = 166/247 (67%), Gaps = 14/247 (5%)
 Frame = +1

Query: 115 YLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCH 294
           ++A+ + N SI SPISV S+CRI+SS VDL+SSKVCELGL NYKAKHVFYP E +++RC 
Sbjct: 94  FVAIFVANSSIPSPISVSSQCRILSSSVDLKSSKVCELGLFNYKAKHVFYPFEGRRFRCR 153

Query: 295 YDYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           YDYYWAS+F+VEY D  SGQ + ALAEAPNEALP +CRP+FGAAWLTKDKFK        
Sbjct: 154 YDYYWASIFKVEYKDQSSGQTQLALAEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCW 213

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAG 615
                              A+DPST EM++RY IL T++L SWF +      WRW+ +AG
Sbjct: 214 YTYGISKVSLYHDGFFSCQAKDPSTFEMIRRYFILATKILHSWFVAQERAGFWRWETVAG 273

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP-STILLKRVCFFAVYFSFMGWVTIQ 792
           VI+GF T+ +SI+ I +L    + + +LFA+   P   I  +R CF   Y SFM W+ IQ
Sbjct: 274 VIAGFSTSLISISFIRLLQQMKSRLPQLFAARVLPLYMIRFRRTCFLVAYISFMSWLVIQ 333

Query: 793 YVKRLGL 813
           Y KRLGL
Sbjct: 334 YGKRLGL 340


>ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Populus trichocarpa]
           gi|118483148|gb|ABK93480.1| unknown [Populus
           trichocarpa] gi|550346193|gb|ERP64857.1| hypothetical
           protein POPTR_0001s01320g [Populus trichocarpa]
          Length = 306

 Score =  247 bits (630), Expect = 6e-63
 Identities = 132/252 (52%), Positives = 165/252 (65%), Gaps = 19/252 (7%)
 Frame = +1

Query: 115 YLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCH 294
           +LA+ +G+LSI +P+S+ S+C+I+SS VDLRSSK+CE G LNYKAKHVFYP  R K+RC 
Sbjct: 49  FLAIFMGHLSITTPLSLPSQCKILSSSVDLRSSKICEPGFLNYKAKHVFYPYNRSKFRCR 108

Query: 295 YDYYWASVFEVEYIDHS-GQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           YDYYWASVFEVEY D+S GQ + ALAEAPNEALP NCRP+FGAAWLTKDKFK        
Sbjct: 109 YDYYWASVFEVEYKDYSLGQTQFALAEAPNEALPLNCRPNFGAAWLTKDKFKVNKTYDCW 168

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLL-KSWFSSWGSMYLWRWDLIA 612
                              A+DPS VEM+KR+ IL   +L  S     G    WRW+ IA
Sbjct: 169 YTSGILKVSLYRDDLFSCQAKDPSQVEMIKRFFILSKEMLHSSLVQKKGKAGYWRWETIA 228

Query: 613 GVISGFVTAFLSITLIGILYPF-----IAIIRRLFASAAYPSTILLKRVCFFAVYFSFMG 777
           GVI+GF T+ ++I+ I IL        +  + R+F   ++ + +  KR CF   Y SFMG
Sbjct: 229 GVIAGFSTSIITISFIRILQHIKSWFRLPSVARMF---SHTNIVFFKRACFLVAYISFMG 285

Query: 778 WVTIQYVKRLGL 813
           W+TIQY KRLGL
Sbjct: 286 WLTIQYGKRLGL 297


>gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis]
          Length = 298

 Score =  244 bits (624), Expect = 3e-62
 Identities = 133/253 (52%), Positives = 160/253 (63%), Gaps = 21/253 (8%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           +A+ + N S+ SPISV S+C+IVSS VDLRSSKVCELGLLNYKAKHVFYP  + K+RC Y
Sbjct: 44  VAIFLANSSVSSPISVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPFGKNKFRCRY 103

Query: 298 DYYWASVFEVEYID-HSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY D  SG  R A AEAPNEALP NCRP+FGAAWL KDKFK         
Sbjct: 104 DYYWASVFKVEYKDLSSGVNRFASAEAPNEALPLNCRPNFGAAWLNKDKFKVNETYDCWY 163

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A DPST EM+KRYSIL  ++L+SW  S      WRWD++AGV
Sbjct: 164 THGIPKVSLPDDGFFSCQANDPSTFEMIKRYSILSVKVLQSWLLSREKSKHWRWDVLAGV 223

Query: 619 ISGFVTAFLSITLIGILYPFIAIIRRLFASAAYPSTILLK--------RVCFFAVYFSFM 774
             GF T+ +SIT       F+  +R+L +S    +  LL+        R CF  VYFS M
Sbjct: 224 FVGFSTSLISIT-------FVVFLRQLKSSLFSAAKSLLRAFLRITFTRACFLVVYFSVM 276

Query: 775 GWVTIQYVKRLGL 813
            W+ +QY KR+GL
Sbjct: 277 AWLAVQYGKRIGL 289


>ref|XP_007040826.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508778071|gb|EOY25327.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 259

 Score =  244 bits (623), Expect = 4e-62
 Identities = 125/246 (50%), Positives = 163/246 (66%), Gaps = 13/246 (5%)
 Frame = +1

Query: 115 YLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCH 294
           ++AV IG LSI + I++ ++C+IVSS VD+RSSK+CELGLLNYKAKHV Y  ER K+RC 
Sbjct: 12  FVAVFIGELSIPNSITIPTQCKIVSSSVDIRSSKICELGLLNYKAKHVLYHFERSKFRCR 71

Query: 295 YDYYWASVFEVEYIDHS-GQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           YDYYW SVFEVEY DHS GQ R A  EAPNEALP +CRP+FGAAWLTKDKFK        
Sbjct: 72  YDYYWTSVFEVEYRDHSLGQTRLAFTEAPNEALPLSCRPNFGAAWLTKDKFKVNETYDCW 131

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAG 615
                              A+DPST+EM+KRY ++ ++++ SW SS G    WRW+ IAG
Sbjct: 132 YTSGISKVKLYNDGFFSCQAKDPSTIEMIKRYLMISSKIVYSWLSSKGRGIYWRWETIAG 191

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFASAAYPSTILLKRVCFFAVYFSFMGWVTIQY 795
           V++GF T+ ++I+ I IL    + + +        +T+ +KRVCF  VY S MGW+  QY
Sbjct: 192 VVTGFSTSIITISFIRILQHMKSWLPQAL------NTVHIKRVCFLLVYVSVMGWLVSQY 245

Query: 796 VKRLGL 813
            +RL +
Sbjct: 246 WRRLNI 251


>ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citrus clementina]
           gi|557534982|gb|ESR46100.1| hypothetical protein
           CICLE_v10002048mg [Citrus clementina]
          Length = 292

 Score =  243 bits (619), Expect = 1e-61
 Identities = 126/246 (51%), Positives = 163/246 (66%), Gaps = 14/246 (5%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           +AVL G  S+ S I V S+C+IVSS VD+RSSKVCELG+LNYKAK VFYP E  K+RC Y
Sbjct: 46  VAVLFGESSVSSSIFVPSQCKIVSSSVDIRSSKVCELGVLNYKAKRVFYPFEASKFRCRY 105

Query: 298 DYYWASVFEVEYIDHS-GQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWAS+F+VEY+DHS GQ R A AEAPNEALP +CRP+FGAAWLTKDKFK         
Sbjct: 106 DYYWASIFKVEYLDHSLGQTRLAFAEAPNEALPHSCRPNFGAAWLTKDKFKVNETYGCWY 165

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A+DPS  EM++RYSIL  ++L+SWF+S        W+++AG+
Sbjct: 166 TIGMSKVSLYRDGFFSCQAKDPSMAEMIRRYSILSVKILQSWFTSKKKAKYLSWEIVAGL 225

Query: 619 ISGFVTAFLSITLIGILYPFIA-IIRRLFASAAYPSTILLKRVCFFAVYFSFMGWVTIQY 795
            +GF+T+ ++I+++GIL      ++ R+F      + I  KR CF  VY S MGWV I Y
Sbjct: 226 TTGFLTSLITISVVGILQQMKPWMLARVF------TRICFKRACFLVVYLSVMGWVAILY 279

Query: 796 VKRLGL 813
            ++LGL
Sbjct: 280 GEKLGL 285


>ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Populus trichocarpa]
           gi|550342886|gb|EEE78447.2| hypothetical protein
           POPTR_0003s10200g [Populus trichocarpa]
          Length = 364

 Score =  241 bits (614), Expect = 4e-61
 Identities = 133/254 (52%), Positives = 163/254 (64%), Gaps = 21/254 (8%)
 Frame = +1

Query: 115 YLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCH 294
           +LA+L+G+ SI +P S+  +CRI+SS VDLRSSK+CELGLLNYKAKHVFYP  R K+RC 
Sbjct: 107 FLAILMGHFSITTPPSLPFQCRILSSSVDLRSSKICELGLLNYKAKHVFYPNNRSKFRCR 166

Query: 295 YDYYWASVFEVEYIDHS-GQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           YDYYWASVFEVEY D+S GQ + ALAEAPNEALP NCRP+FGAAWL KDKFK        
Sbjct: 167 YDYYWASVFEVEYEDYSLGQTQFALAEAPNEALPLNCRPNFGAAWLAKDKFKVNKTYDCW 226

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKS---WFSSWGSMYLWRWDL 606
                              A+DPS  EM+KRY IL   +L S   W    G    W W+ 
Sbjct: 227 YTSGISKVSLYRDDLFSCQAKDPSQAEMIKRYFILSKEMLHSSPVW--KKGKASYWGWET 284

Query: 607 IAGVISGFVTAFLSITLIGIL-----YPFIAIIRRLFASAAYPSTILLKRVCFFAVYFSF 771
           IAGVI+GF T+ ++I+ I IL     +  +  + R+F+ A   + +  KR CF   YFSF
Sbjct: 285 IAGVITGFSTSIITISFIKILQYIKSWLRLTSVARMFSRA---NVVFFKRACFLVAYFSF 341

Query: 772 MGWVTIQYVKRLGL 813
           MGW+TIQ  KR GL
Sbjct: 342 MGWLTIQCGKRFGL 355


>emb|CBI37457.3| unnamed protein product [Vitis vinifera]
          Length = 310

 Score =  239 bits (610), Expect = 1e-60
 Identities = 118/197 (59%), Positives = 146/197 (74%), Gaps = 12/197 (6%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           +AV +GNLS+ SP+SV S+C+IVSS VDLRSSKVCELGLLNYKAKHVFYPLE++K+RCHY
Sbjct: 37  VAVFVGNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHY 96

Query: 298 DYYWASVFEVEYIDHSGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK---------- 447
           DYYWASVF+VEY D  GQ R  L EAPNEALP +CRP+FGAAWLTKDKFK          
Sbjct: 97  DYYWASVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYA 156

Query: 448 --XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGVI 621
                            A++PST+EM++RYSIL TR+L+SW +S G    WR + +AGVI
Sbjct: 157 SGISKVSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGKYWRLETVAGVI 216

Query: 622 SGFVTAFLSITLIGILY 672
           +GF T+ +SI+L+ IL+
Sbjct: 217 TGFSTSLISISLVRILH 233


>ref|XP_007040825.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508778070|gb|EOY25326.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 313

 Score =  230 bits (587), Expect = 6e-58
 Identities = 125/271 (46%), Positives = 163/271 (60%), Gaps = 38/271 (14%)
 Frame = +1

Query: 115 YLAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCH 294
           ++AV IG LSI + I++ ++C+IVSS VD+RSSK+CELGLLNYKAKHV Y  ER K+RC 
Sbjct: 41  FVAVFIGELSIPNSITIPTQCKIVSSSVDIRSSKICELGLLNYKAKHVLYHFERSKFRCR 100

Query: 295 YDYYWASVFEVEYIDHS-GQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK-------- 447
           YDYYW SVFEVEY DHS GQ R A  EAPNEALP +CRP+FGAAWLTKDKFK        
Sbjct: 101 YDYYWTSVFEVEYRDHSLGQTRLAFTEAPNEALPLSCRPNFGAAWLTKDKFKVNETYDCW 160

Query: 448 ----XXXXXXXXXXXXXXXAEDPSTVEMLKRYSIL------------------------- 540
                              A+DPST+EM+KRY ++                         
Sbjct: 161 YTSGISKVKLYNDGFFSCQAKDPSTIEMIKRYLMIEQYSGTSRTALSETWEERIRISECR 220

Query: 541 FTRLLKSWFSSWGSMYLWRWDLIAGVISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP 720
            ++++ SW SS G    WRW+ IAGV++GF T+ ++I+ I IL    + + +        
Sbjct: 221 SSKIVYSWLSSKGRGIYWRWETIAGVVTGFSTSIITISFIRILQHMKSWLPQAL------ 274

Query: 721 STILLKRVCFFAVYFSFMGWVTIQYVKRLGL 813
           +T+ +KRVCF  VY S MGW+  QY +RL +
Sbjct: 275 NTVHIKRVCFLLVYVSVMGWLVSQYWRRLNI 305


>ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycine max]
           gi|255642352|gb|ACU21440.1| unknown [Glycine max]
          Length = 305

 Score =  229 bits (583), Expect = 2e-57
 Identities = 117/248 (47%), Positives = 159/248 (64%), Gaps = 16/248 (6%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           L ++  + SI +PIS+ S+C+IVS+GVD+RSSK+CELGLLNYKAK VF+  ER K+RC Y
Sbjct: 49  LVLMFADFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFHHFERSKFRCRY 108

Query: 298 DYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY DH SGQ + A AEAPNEALP  CRP+FGAAW T+ KFK         
Sbjct: 109 DYYWASVFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWFTQYKFKVNESYDCWY 168

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A + ST+E  ++YS +   +  SW SS G    WRW+ +AGV
Sbjct: 169 TSGNSKVHLHQDNLFGCDAHEQSTLEKSRQYSTMAMEMAISWLSSRGRTKHWRWETLAGV 228

Query: 619 ISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP---STILLKRVCFFAVYFSFMGWVTI 789
           ++GF+T+ +SIT I  L  F++ + + F +  +    + +L++R CF   Y SF+ W+ I
Sbjct: 229 VTGFLTSLISITFIRFLQLFLSSLHQSFTTWIFSWRVNAVLIRRACFLLAYLSFVAWLVI 288

Query: 790 QYVKRLGL 813
           +Y KRLGL
Sbjct: 289 EYGKRLGL 296


>ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813000 [Glycine max]
          Length = 303

 Score =  228 bits (580), Expect = 4e-57
 Identities = 117/248 (47%), Positives = 160/248 (64%), Gaps = 16/248 (6%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           L ++  + SI +PIS+ S+C+IVS+GVD+RSSK+CELGLL+YKAK VF+  ER K+RC Y
Sbjct: 47  LVLMFADFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLDYKAKDVFHHFERSKFRCRY 106

Query: 298 DYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY DH SGQ + A AEAPNEALP  CRP+FGAAWLT+ KFK         
Sbjct: 107 DYYWASVFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYDCWY 166

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A + ST+E  ++YS +   ++ SWFSS G    WRW+ +AGV
Sbjct: 167 TSGISKVRLHQDSLFGCDAHEQSTLEKSRQYSTMAMEMVISWFSSRGRTKHWRWETLAGV 226

Query: 619 ISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP---STILLKRVCFFAVYFSFMGWVTI 789
           ++GF+T+ +SIT I  L   +  + + F +  +    + +L++R CF   Y SF+ W+ I
Sbjct: 227 VTGFLTSLISITFIRFLQLLLPSLYQSFTTWIFSWRVNAVLIRRACFLLAYLSFVAWLAI 286

Query: 790 QYVKRLGL 813
           +Y KRLGL
Sbjct: 287 EYGKRLGL 294


>ref|XP_007158154.1| hypothetical protein PHAVU_002G128700g [Phaseolus vulgaris]
           gi|561031569|gb|ESW30148.1| hypothetical protein
           PHAVU_002G128700g [Phaseolus vulgaris]
          Length = 305

 Score =  225 bits (574), Expect = 2e-56
 Identities = 117/248 (47%), Positives = 156/248 (62%), Gaps = 16/248 (6%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           L ++  + SI +PIS+ S+C+IVS+GVD+RSSK+CELGLLNYKAK VF   ER K+RC Y
Sbjct: 49  LILMFADFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFQHFERSKFRCRY 108

Query: 298 DYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY DH SGQ + A AEAPNEALP  CRP+FGAAWLT+ KFK         
Sbjct: 109 DYYWASVFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYNCWY 168

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGV 618
                             A   ST+E  ++YS +   +  SW S  G    WRW+ +AGV
Sbjct: 169 TSGISKVRLRQDNLFGCHAHQQSTLEKSRQYSTMAMEMAISWLSGRGRTKHWRWETLAGV 228

Query: 619 ISGFVTAFLSITLIGILYPFIAIIRRLFASAAYP---STILLKRVCFFAVYFSFMGWVTI 789
           +SGF+T+ +SIT I   +  ++ I + F +  +P   + + ++R CF   Y SF+ W+ I
Sbjct: 229 VSGFLTSLISITFIRFAHILLSSIYQSFTTWIFPWRVNAVFIRRSCFLLAYLSFVAWLAI 288

Query: 790 QYVKRLGL 813
           +Y KRLGL
Sbjct: 289 EYGKRLGL 296


>ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511402 [Cicer arietinum]
          Length = 305

 Score =  223 bits (569), Expect = 7e-56
 Identities = 117/249 (46%), Positives = 160/249 (64%), Gaps = 17/249 (6%)
 Frame = +1

Query: 118 LAVLIGNLSIWSPISVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHY 297
           L +++ + S+ +PIS+ S CRIVS+GVD+RSSK+CELGL NYKAK +F   ER K+RC Y
Sbjct: 48  LLLMLADFSVPNPISLPSHCRIVSTGVDIRSSKICELGLSNYKAKDIFRHFERSKFRCRY 107

Query: 298 DYYWASVFEVEYIDH-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK--------- 447
           DYYWASVF+VEY DH SGQ + A AEAP+EALP  CRP+FGAAWLT+ KFK         
Sbjct: 108 DYYWASVFKVEYKDHFSGQRQFAFAEAPSEALPLYCRPNFGAAWLTQYKFKVNETYDCWY 167

Query: 448 ---XXXXXXXXXXXXXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWG-SMYLWRWDLIAG 615
                             A++ ST+E + +YS     ++  W S  G  +  WRW++I G
Sbjct: 168 TSGISKVHLYQDNLFGCRADEQSTIEKIIQYSTQAMEMINYWISDIGRRVKYWRWEVIVG 227

Query: 616 VISGFVTAFLSITLIGILYPFIAIIRRLFAS---AAYPSTILLKRVCFFAVYFSFMGWVT 786
           V+SGF T+ +SIT I  L   ++ +R+ FA+   +   + +L++R CFF  Y SF+GW+ 
Sbjct: 228 VVSGFATSLISITFIMFLKLLLSSLRQSFAAWILSWRVNAVLIRRTCFFFAYLSFVGWLA 287

Query: 787 IQYVKRLGL 813
           I+Y KRLGL
Sbjct: 288 IEYGKRLGL 296


>ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutrema salsugineum]
           gi|557115174|gb|ESQ55457.1| hypothetical protein
           EUTSA_v10025843mg [Eutrema salsugineum]
          Length = 299

 Score =  216 bits (551), Expect = 9e-54
 Identities = 114/234 (48%), Positives = 148/234 (63%), Gaps = 18/234 (7%)
 Frame = +1

Query: 160 SVHSRCRIVSSGVDLRSSKVCELGLLNYKAKHVFYPLERKKYRCHYDYYWASVFEVEYID 339
           S+ SRC+IVSS VDLRSSKVC +GLLN KA+HVFYP ER K+RC YDYYWASVF+VEY D
Sbjct: 62  SLASRCKIVSSSVDLRSSKVCGIGLLNIKAQHVFYPFERDKFRCRYDYYWASVFKVEYKD 121

Query: 340 H-SGQARSALAEAPNEALPDNCRPSFGAAWLTKDKFK------------XXXXXXXXXXX 480
           H  GQ R A +EAPNEALP  CRP+FGAA LTKD FK                       
Sbjct: 122 HLMGQTRLAFSEAPNEALPPECRPNFGAALLTKDNFKVNETYDCWYTLGIPKIKLYQDGF 181

Query: 481 XXXXAEDPSTVEMLKRYSILFTRLLKSWFSSWGSMYLWRWDLIAGVISGFVTAFLSITLI 660
               A D S  ++ K+Y++LF+RLL+SWF+  G    WR+D+IAG++SGF T+ +++ ++
Sbjct: 182 FGCQANDRSFTDIFKQYAVLFSRLLQSWFNGKGRPKYWRYDVIAGIVSGFSTSIITVFVM 241

Query: 661 GILYPFIAIIRRLFAS-----AAYPSTILLKRVCFFAVYFSFMGWVTIQYVKRL 807
            IL    + + R F S     +     + +KR C   VYFS +GW+  QY+K L
Sbjct: 242 RILRHAKSWVPRAFCSVKSQYSKVNLVVQMKRACLVLVYFSALGWMATQYLKIL 295


Top