BLASTX nr result

ID: Mentha29_contig00030608 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00030608
         (864 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU41709.1| hypothetical protein MIMGU_mgv1a025324mg, partial...   282   1e-73
emb|CBI30576.3| unnamed protein product [Vitis vinifera]              180   7e-43
ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261...   177   4e-42
ref|XP_007205809.1| hypothetical protein PRUPE_ppa010713mg [Prun...   172   2e-40
emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]   169   1e-39
ref|XP_007027056.1| Uncharacterized protein isoform 3 [Theobroma...   165   2e-38
ref|XP_007027055.1| Uncharacterized protein isoform 2 [Theobroma...   165   2e-38
ref|XP_007027054.1| Uncharacterized protein isoform 1 [Theobroma...   165   2e-38
ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292...   160   8e-37
ref|XP_007027057.1| Uncharacterized protein isoform 4 [Theobroma...   159   2e-36
ref|XP_007027058.1| Uncharacterized protein isoform 5 [Theobroma...    88   4e-15

>gb|EYU41709.1| hypothetical protein MIMGU_mgv1a025324mg, partial [Mimulus
           guttatus]
          Length = 317

 Score =  282 bits (722), Expect = 1e-73
 Identities = 149/253 (58%), Positives = 181/253 (71%), Gaps = 1/253 (0%)
 Frame = -1

Query: 825 MADGDGEANGSAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLY-ENNSQEMITDKQ 649
           MAD  G  N S  ND+V+LEAMRKDS SWHPC+VSLCSRGLGLI+ + +N  +E+ITD+Q
Sbjct: 1   MADTGG--NNSTGNDVVQLEAMRKDSFSWHPCKVSLCSRGLGLILQFGDNYMEEIITDQQ 58

Query: 648 EIFARIRVRSTPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRC 469
           E+ ARIRVRSTPLQGDDCSS+RQG+ VLA +SS    VF DA +E+   VRHSKRIHCRC
Sbjct: 59  EVMARIRVRSTPLQGDDCSSLRQGDRVLATRSSHAKSVFCDALVEEAMRVRHSKRIHCRC 118

Query: 468 SFTIKWLHRALPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAE 289
           +F IKWLH+   EE LTVPA AIMKL+T+SI+LHPTIS +FS  E SN LD +P    A+
Sbjct: 119 TFKIKWLHQ---EETLTVPAGAIMKLSTESINLHPTISTYFSMLESSNDLDKSPYSIAAD 175

Query: 288 GNNWEMDINVLLEQQIKKISTSHVPQDDFLKDYVSEHKVDLGGQSHGWEIKAXXXXXXXX 109
             N EMDINVLLE+QI++I  S        KD+V   +VDLGGQSHGWEI A        
Sbjct: 176 ITNLEMDINVLLEKQIEEIRNSTNVSQKISKDFVLGLEVDLGGQSHGWEIDASLKEPCVT 235

Query: 108 XXXLDKLNSFNGS 70
               + + ++ GS
Sbjct: 236 IPFPNNIKAYTGS 248


>emb|CBI30576.3| unnamed protein product [Vitis vinifera]
          Length = 693

 Score =  180 bits (456), Expect = 7e-43
 Identities = 102/234 (43%), Positives = 145/234 (61%), Gaps = 2/234 (0%)
 Frame = -1

Query: 852 FAHKFISQSMADGDGEANGSAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS 673
           F +K +  SM  G G+A        VELEAMRKD  SWHPC+VSL S G GLIV + +  
Sbjct: 116 FPYKLVC-SMGTGTGDAT-------VELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQD 167

Query: 672 -QEMITDKQEIFARIRVRSTPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVR 496
            +++I++++E  AR+R+RS PLQG+DCS I +GE VLA   S    + +DA +EK   VR
Sbjct: 168 LEDIISNEEEALARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVR 227

Query: 495 HSKRIHCRCSFTIKWLHRALPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALD 316
           HS RI CRC+F IKWLH+ L      VP+S+IMKLAT+SI +HP ++AF    +  N   
Sbjct: 228 HSTRISCRCTFVIKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSA 287

Query: 315 AAPCLEIAEGNNWEMDINVLLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQ 157
           A     + E  + E+D++ LLE+QI++IS  +   + +  +D +   K D+  Q
Sbjct: 288 APSFSTVFEDVDCEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKADIKEQ 341


>ref|XP_002269847.1| PREDICTED: uncharacterized protein LOC100261386 [Vitis vinifera]
          Length = 552

 Score =  177 bits (450), Expect = 4e-42
 Identities = 99/225 (44%), Positives = 140/225 (62%), Gaps = 2/225 (0%)
 Frame = -1

Query: 825 MADGDGEANGSAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQ 649
           M  G G+A        VELEAMRKD  SWHPC+VSL S G GLIV + +   +++I++++
Sbjct: 1   MGTGTGDAT-------VELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQDLEDIISNEE 53

Query: 648 EIFARIRVRSTPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRC 469
           E  AR+R+RS PLQG+DCS I +GE VLA   S    + +DA +EK   VRHS RI CRC
Sbjct: 54  EALARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKALRVRHSTRISCRC 113

Query: 468 SFTIKWLHRALPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAE 289
           +F IKWLH+ L      VP+S+IMKLAT+SI +HP ++AF    +  N   A     + E
Sbjct: 114 TFVIKWLHQDLKGATSIVPSSSIMKLATQSITVHPMVAAFLKPIKTLNCSAAPSFSTVFE 173

Query: 288 GNNWEMDINVLLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQ 157
             + E+D++ LLE+QI++IS  +   + +  +D +   K D+  Q
Sbjct: 174 DVDCEVDLHKLLEKQIEEISNLADASKKEISEDILFGIKADIKEQ 218


>ref|XP_007205809.1| hypothetical protein PRUPE_ppa010713mg [Prunus persica]
           gi|462401451|gb|EMJ07008.1| hypothetical protein
           PRUPE_ppa010713mg [Prunus persica]
          Length = 238

 Score =  172 bits (435), Expect = 2e-40
 Identities = 92/194 (47%), Positives = 127/194 (65%), Gaps = 3/194 (1%)
 Frame = -1

Query: 795 SAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRS 619
           S A ++ ELEAM K+  SWHPCQVSL S    LIV +     ++M+ +  E   R+R R 
Sbjct: 5   SEAENVTELEAMCKEDSSWHPCQVSLSSTKDSLIVDFGGQELEDMVLNTDEALTRLRFRC 64

Query: 618 TPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRA 439
            PLQGDDC+ I +GEHVLA   SQ    F+DA++EKV  VRHS R++CRC+F IKWLH+ 
Sbjct: 65  APLQGDDCTRI-EGEHVLAINKSQSKSHFFDAKVEKVLRVRHSTRVYCRCTFMIKWLHQD 123

Query: 438 LPEEALTVPASAIMKLATKSIDLHPTISAFFSS--QEPSNALDAAPCLEIAEGNNWEMDI 265
           L  + +TVP+S+IMKL  K+I++HPT+SAF  S  Q   ++  + P +   E    E+D+
Sbjct: 124 LKGQMVTVPSSSIMKLTGKNINVHPTVSAFLKSVKQMGLDSASSVPVMLEVEDFAVELDL 183

Query: 264 NVLLEQQIKKISTS 223
           N  LE+QI+ I+ S
Sbjct: 184 NKFLEKQIEDITVS 197


>emb|CAN79695.1| hypothetical protein VITISV_023936 [Vitis vinifera]
          Length = 1508

 Score =  169 bits (428), Expect = 1e-39
 Identities = 114/311 (36%), Positives = 164/311 (52%), Gaps = 27/311 (8%)
 Frame = -1

Query: 852 FAHKFISQSMADGDGEANGSAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS 673
           F ++ +  SM  G G+A        VELEAMRKD  SWHPC+VSL S G GLIV + +  
Sbjct: 20  FEYEELVCSMGTGTGDAT-------VELEAMRKDDSSWHPCRVSLSSTGFGLIVDFGSQD 72

Query: 672 -QEMITDKQEIFARIRVRSTPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEK----- 511
            +++I++++E  AR+R+RS PLQG+DCS I +GE VLA   S    + +DA +EK     
Sbjct: 73  LEDIISNEEEALARLRIRSVPLQGEDCSLIEEGERVLATHKSHFKTLSFDAMVEKEMSHE 132

Query: 510 -------------VN----HVRHSKRIHCRCSFTIKWLHRALPEEALTVPASAIMKLATK 382
                        VN     VRHS RI CRC+F IKWLH+ L      VP+S+IMKLAT+
Sbjct: 133 FXIECDLIDWGIXVNVVALRVRHSTRISCRCTFVIKWLHQDLKGATSIVPSSSIMKLATQ 192

Query: 381 SIDLHPTISAFFSSQEPSNALDAAPCLEIAEGNNWEMDINVLLEQQIKKIST-SHVPQDD 205
           SI +HP ++AF    +  N   A     + E  + E+D++ LLE+QI++IS  +   + +
Sbjct: 193 SITVHPMVAAFLKPIKTLNCSAAPSFSTVFEDVDCEVDLHKLLEKQIEEISNLADASKKE 252

Query: 204 FLKDYVSEHKVDLGGQSHGWEIKAXXXXXXXXXXXLDKLNSFNGSEKSQPVASI-LETST 28
             +D +   K D+  Q     +              ++ N F  S +S     + +E   
Sbjct: 253 ISEDILFGIKADIKEQMDCSPVAESKITSSHFQVPHEQENHFKRSTRSSSKLRVNMEVKD 312

Query: 27  VCTP--SIQEE 1
              P  SIQEE
Sbjct: 313 PLPPDSSIQEE 323


>ref|XP_007027056.1| Uncharacterized protein isoform 3 [Theobroma cacao]
           gi|508715661|gb|EOY07558.1| Uncharacterized protein
           isoform 3 [Theobroma cacao]
          Length = 409

 Score =  165 bits (417), Expect = 2e-38
 Identities = 94/216 (43%), Positives = 134/216 (62%), Gaps = 2/216 (0%)
 Frame = -1

Query: 795 SAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRS 619
           + +++ VELEA RK+  SWHPC+V L S G  LIV +      +M+  K+E+   +R RS
Sbjct: 5   TVSDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRS 64

Query: 618 TPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRA 439
            PLQ DDC  I +GE VLA + SQ   +F+DA + KV+ VRHSKR  CRC+F IKWL + 
Sbjct: 65  MPLQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQD 123

Query: 438 LPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAEGNNWEMDINV 259
           L  +  T+P+S+IMKLATKSI  HP I+     ++      ++P L I EG + E+D+N 
Sbjct: 124 LEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNK 183

Query: 258 LLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQS 154
           LL++QI++IS  +   + D  +D    +K    GQS
Sbjct: 184 LLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 219


>ref|XP_007027055.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508715660|gb|EOY07557.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 611

 Score =  165 bits (417), Expect = 2e-38
 Identities = 94/216 (43%), Positives = 134/216 (62%), Gaps = 2/216 (0%)
 Frame = -1

Query: 795 SAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRS 619
           + +++ VELEA RK+  SWHPC+V L S G  LIV +      +M+  K+E+   +R RS
Sbjct: 5   TVSDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRS 64

Query: 618 TPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRA 439
            PLQ DDC  I +GE VLA + SQ   +F+DA + KV+ VRHSKR  CRC+F IKWL + 
Sbjct: 65  MPLQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQD 123

Query: 438 LPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAEGNNWEMDINV 259
           L  +  T+P+S+IMKLATKSI  HP I+     ++      ++P L I EG + E+D+N 
Sbjct: 124 LEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNK 183

Query: 258 LLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQS 154
           LL++QI++IS  +   + D  +D    +K    GQS
Sbjct: 184 LLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 219


>ref|XP_007027054.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508715659|gb|EOY07556.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 567

 Score =  165 bits (417), Expect = 2e-38
 Identities = 94/216 (43%), Positives = 134/216 (62%), Gaps = 2/216 (0%)
 Frame = -1

Query: 795 SAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRS 619
           + +++ VELEA RK+  SWHPC+V L S G  LIV +      +M+  K+E+   +R RS
Sbjct: 5   TVSDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRS 64

Query: 618 TPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRA 439
            PLQ DDC  I +GE VLA + SQ   +F+DA + KV+ VRHSKR  CRC+F IKWL + 
Sbjct: 65  MPLQVDDCFHIEEGERVLADRKSQFKILFHDAVVVKVDRVRHSKR-GCRCTFMIKWLDQD 123

Query: 438 LPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAEGNNWEMDINV 259
           L  +  T+P+S+IMKLATKSI  HP I+     ++      ++P L I EG + E+D+N 
Sbjct: 124 LEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNK 183

Query: 258 LLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQS 154
           LL++QI++IS  +   + D  +D    +K    GQS
Sbjct: 184 LLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 219


>ref|XP_004302736.1| PREDICTED: uncharacterized protein LOC101292719 [Fragaria vesca
           subsp. vesca]
          Length = 580

 Score =  160 bits (404), Expect = 8e-37
 Identities = 95/213 (44%), Positives = 129/213 (60%), Gaps = 3/213 (1%)
 Frame = -1

Query: 789 ANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRSTP 613
           A +  ELEA+ K   SW+PC VSL S    LIV +     ++M+ +K E   R+R RS P
Sbjct: 7   AENATELEALCKQDSSWYPCHVSLSSTEDSLIVDFGRQELEDMVLNKDEALMRLRFRSGP 66

Query: 612 LQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRALP 433
           LQGDDCS I +GEHVLA   S      YDA++EKV  VRHS R++CRCSF I WLH    
Sbjct: 67  LQGDDCSHI-EGEHVLAIHKSPFKSYLYDAKVEKVTRVRHSTRVYCRCSFMILWLHPDFK 125

Query: 432 EEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIA-EGNNWEMDINVL 256
            + +T+ +S+IMKLA+KSI+ HPT++A F S +    L  AP L I  E  + E D+N L
Sbjct: 126 GQMVTITSSSIMKLASKSINSHPTVAALFKSVK-QMGLYTAPLLPIMHEDIDVEFDLNKL 184

Query: 255 LEQQIKKISTS-HVPQDDFLKDYVSEHKVDLGG 160
           L +QI++I+ S +   ++   D +   K D  G
Sbjct: 185 LGKQIEEINISANRVTNEITVDIIEGVKADSSG 217


>ref|XP_007027057.1| Uncharacterized protein isoform 4 [Theobroma cacao]
           gi|508715662|gb|EOY07559.1| Uncharacterized protein
           isoform 4 [Theobroma cacao]
          Length = 565

 Score =  159 bits (401), Expect = 2e-36
 Identities = 93/216 (43%), Positives = 133/216 (61%), Gaps = 2/216 (0%)
 Frame = -1

Query: 795 SAANDIVELEAMRKDSLSWHPCQVSLCSRGLGLIVLYENNS-QEMITDKQEIFARIRVRS 619
           + +++ VELEA RK+  SWHPC+V L S G  LIV +      +M+  K+E+   +R RS
Sbjct: 5   TVSDNSVELEAKRKEDSSWHPCRVYLSSSGDSLIVNFGRQELDDMLLQKEEVLMHLRFRS 64

Query: 618 TPLQGDDCSSIRQGEHVLAAKSSQVDGVFYDARMEKVNHVRHSKRIHCRCSFTIKWLHRA 439
            PLQ DDC  I +GE VLA + SQ   +F+DA +  V+ VRHSKR  CRC+F IKWL + 
Sbjct: 65  MPLQVDDCFHIEEGERVLADRKSQFKILFHDAVV--VDRVRHSKR-GCRCTFMIKWLDQD 121

Query: 438 LPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEPSNALDAAPCLEIAEGNNWEMDINV 259
           L  +  T+P+S+IMKLATKSI  HP I+     ++      ++P L I EG + E+D+N 
Sbjct: 122 LEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKHRGLSYSSPLLTILEGTDSEIDLNK 181

Query: 258 LLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQS 154
           LL++QI++IS  +   + D  +D    +K    GQS
Sbjct: 182 LLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 217


>ref|XP_007027058.1| Uncharacterized protein isoform 5 [Theobroma cacao]
           gi|508715663|gb|EOY07560.1| Uncharacterized protein
           isoform 5 [Theobroma cacao]
          Length = 468

 Score = 88.2 bits (217), Expect = 4e-15
 Identities = 51/120 (42%), Positives = 74/120 (61%), Gaps = 1/120 (0%)
 Frame = -1

Query: 510 VNHVRHSKRIHCRCSFTIKWLHRALPEEALTVPASAIMKLATKSIDLHPTISAFFSSQEP 331
           V+ VRHSKR  CRC+F IKWL + L  +  T+P+S+IMKLATKSI  HP I+     ++ 
Sbjct: 2   VDRVRHSKR-GCRCTFMIKWLDQDLEGQTFTLPSSSIMKLATKSISAHPIINKLLKPEKH 60

Query: 330 SNALDAAPCLEIAEGNNWEMDINVLLEQQIKKIST-SHVPQDDFLKDYVSEHKVDLGGQS 154
                ++P L I EG + E+D+N LL++QI++IS  +   + D  +D    +K    GQS
Sbjct: 61  RGLSYSSPLLTILEGTDSEIDLNKLLQKQIEQISNLADASKKDIPEDIPWRNKGVNKGQS 120

