BLASTX nr result

ID: Mentha26_contig00006200

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00006200
         (653 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU32569.1| hypothetical protein MIMGU_mgv1a003227mg [Mimulus...   102   1e-19
ref|XP_007224114.1| hypothetical protein PRUPE_ppa015679mg [Prun...    97   6e-18
ref|XP_007035162.1| RNA-binding (RRM/RBD/RNP motifs) family prot...    86   8e-15
ref|XP_004297831.1| PREDICTED: uncharacterized protein LOC101296...    83   9e-14
ref|XP_006297030.1| hypothetical protein CARUB_v10013033mg [Caps...    79   9e-13
ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis ly...    78   3e-12
ref|XP_004247930.1| PREDICTED: uncharacterized protein LOC101262...    75   2e-11
ref|XP_002314482.2| hypothetical protein POPTR_0010s08070g [Popu...    74   5e-11
ref|XP_006354470.1| PREDICTED: dentin sialophosphoprotein-like [...    73   9e-11
ref|NP_188979.2| RNA recognition motif-containing protein [Arabi...    69   1e-09
ref|XP_006406035.1| hypothetical protein EUTSA_v10020149mg [Eutr...    65   2e-08
ref|XP_006857341.1| hypothetical protein AMTR_s00067p00095130 [A...    59   1e-06

>gb|EYU32569.1| hypothetical protein MIMGU_mgv1a003227mg [Mimulus guttatus]
          Length = 598

 Score =  102 bits (253), Expect = 1e-19
 Identities = 77/221 (34%), Positives = 102/221 (46%), Gaps = 4/221 (1%)
 Frame = +1

Query: 1   SGELKKQKQRPKVSLPTSNVKRSNEQVEGEETRKSGFDSG--ELKEQKQGPKVSLPTSDV 174
           +GE++  K R          + SNEQ   E+  K    SG  +     + P+ S  +S V
Sbjct: 204 TGEVEYNKPRSFAFTHQPMTQNSNEQPSTEDADKRSVISGLIDFMRLAEDPEPSSSSSKV 263

Query: 175 QRSDKRIEGEETRKSGFDSGDLKEQKQRPKVLLSTXXXXXXXXXXXISAITESEENKVLV 354
               K    E                QRP     T                +S+ENKVL+
Sbjct: 264 SVVVKENSSEANL-------------QRPGPTKKTHFDGQ-----------KSKENKVLI 299

Query: 355 RFLSHSVTKNQIFNFFKKCGEILKIECPDVQKPLLKTACVHFKTRDGLENAISKSGLCV- 531
           RFL  + T   IF  F+ CGEI K+E P  +  L K+  ++FKTR+G   A+ K+ L V 
Sbjct: 300 RFLRSNATDAHIFQCFESCGEISKVEIPYAEASLFKSGYIYFKTREGFNKALKKTSLLVA 359

Query: 532 GGPVIVESATS-ENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
           GG V VESA+S         +PSL GD + PAALVKNPTRT
Sbjct: 360 GGIVTVESASSTRKRNVKTPIPSLIGDHNTPAALVKNPTRT 400


>ref|XP_007224114.1| hypothetical protein PRUPE_ppa015679mg [Prunus persica]
            gi|462421050|gb|EMJ25313.1| hypothetical protein
            PRUPE_ppa015679mg [Prunus persica]
          Length = 835

 Score = 96.7 bits (239), Expect = 6e-18
 Identities = 74/225 (32%), Positives = 105/225 (46%), Gaps = 11/225 (4%)
 Frame = +1

Query: 10   LKKQKQRPKVSLPTSN----VKRSNEQVEGEETRKSGFDSGELKEQKQGPKVSLPTSDVQ 177
            +K+     KV++P  +     KRS     G    +      E +EQ      SL T  V 
Sbjct: 428  IKRGGSVKKVTVPADSHKHDAKRSESSFRGRAMERETNTIDESEEQPCRGIASLNTDSVS 487

Query: 178  RSDKRIEGEETRKSGFDSGDLKEQKQR-----PKVLLSTXXXXXXXXXXXISAITESEEN 342
            +SD              SGDLK   Q+     PK+ L T            S    S E+
Sbjct: 488  KSD--------------SGDLKVASQKKSKLSPKIHLLTSKEDLNKIPITFSQKEGSTES 533

Query: 343  KVLVRFLSHSVTKNQIFNFFKKCGEILKIECPDVQK-PLLKTACVHFKTRDGLENAISKS 519
            KVLVRFL  +V  + + N    CGEI+KI+   V +    + A VHFKT +  + A+ K+
Sbjct: 534  KVLVRFLHKNVKDDAVVNALNDCGEIVKIQLLSVSEGSNFRDAWVHFKTSNESQRALRKT 593

Query: 520  GLCVGGPVIVESATS-ENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
             L +G   +V  ATS E+    V +P++ GD ++P AL+KNPTRT
Sbjct: 594  DLIIGNSEVVVVATSLEDVLNKVSIPNVIGDSELPVALIKNPTRT 638


>ref|XP_007035162.1| RNA-binding (RRM/RBD/RNP motifs) family protein [Theobroma cacao]
           gi|508714191|gb|EOY06088.1| RNA-binding (RRM/RBD/RNP
           motifs) family protein [Theobroma cacao]
          Length = 245

 Score = 86.3 bits (212), Expect = 8e-15
 Identities = 57/199 (28%), Positives = 97/199 (48%), Gaps = 4/199 (2%)
 Frame = +1

Query: 67  SNEQVEGEETRKSGFDSGELKEQKQGPKVSLPTSDVQRSDKR--IEGEETRKSGFDSGDL 240
           S  ++  ++  +S   +  + E+++G  ++LP SD  +S     +    +R+    S  +
Sbjct: 3   SETKISLDKKEESCIPTESISEREEGSSLNLPNSDHVKSGPNYPVPPSISREELNSSSPM 62

Query: 241 KEQKQRPKVLLSTXXXXXXXXXXXISAITESEENKVLVRFLSHSVTKNQIFNFFKKCGEI 420
           +  K+                         S+EN VL RFL+ ++ K+ I   F  C  I
Sbjct: 63  RSTKEG------------------------SKENMVLDRFLTQNIEKHSILAAFCDCWPI 98

Query: 421 LKIECPDVQKP-LLKTACVHFKTRDGLENAISKSGLCV-GGPVIVESATSENWTTTVHVP 594
           + +E   + K  + K   VHF+TR+G +N + K+ L V      VE+++SE+    + +P
Sbjct: 99  VNVEEVSLTKQSMFKDFVVHFETREGYQNTLKKTDLMVLNAEAFVEASSSEDMDDAISIP 158

Query: 595 SLFGDPDVPAALVKNPTRT 651
            L GDPD P ALVKNPT+T
Sbjct: 159 DLIGDPDAPVALVKNPTKT 177


>ref|XP_004297831.1| PREDICTED: uncharacterized protein LOC101296092 [Fragaria vesca
           subsp. vesca]
          Length = 736

 Score = 82.8 bits (203), Expect = 9e-14
 Identities = 45/109 (41%), Positives = 66/109 (60%), Gaps = 2/109 (1%)
 Frame = +1

Query: 331 SEENKVLVRFLSHSVTKNQIFNFFKKCGEILKIEC-PDVQKPLLKTACVHFKTRDGLENA 507
           S E+KV+VRFL   V ++ I+  F  CG I +I+  P ++  + +   VHFKT +G   A
Sbjct: 425 STESKVMVRFLHKFVQESSIYKAFDDCGCITRIQLLPLIEGSIFRAGYVHFKTAEGSHKA 484

Query: 508 ISKSGLCVGGPVIVESATS-ENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
           + KSG+   G  +V  A S E+    + +P+L GDP+VP  LVK+PTRT
Sbjct: 485 LRKSGIVSEGHTVVVDANSLEDVPNKIAIPNLIGDPEVPLMLVKSPTRT 533


>ref|XP_006297030.1| hypothetical protein CARUB_v10013033mg [Capsella rubella]
           gi|482565739|gb|EOA29928.1| hypothetical protein
           CARUB_v10013033mg [Capsella rubella]
          Length = 764

 Score = 79.3 bits (194), Expect = 9e-13
 Identities = 44/110 (40%), Positives = 64/110 (58%), Gaps = 2/110 (1%)
 Frame = +1

Query: 328 ESEENKVLVRFLSHSVTKNQIFNFFKKCGEILKI-ECPDVQKPLLKTACVHFKTRDGLEN 504
           E   NKVL+RFL  S  KN I   F   G +L + E P ++  + K A + F+T+  +++
Sbjct: 456 EHSPNKVLLRFLQESFNKNDIVEVFSGFGTVLDVQEIPSLEGCIYKDALLTFETKTAVKD 515

Query: 505 AISKSGLCVGG-PVIVESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
           A+ K  + V    V VE+A+ ++   T+ +P L GDPDVP ALVK P RT
Sbjct: 516 ALKKVSVMVKNYSVCVEAASQKDMVETICIPDLIGDPDVPIALVKEPART 565


>ref|XP_002883427.1| nucleic acid binding protein [Arabidopsis lyrata subsp. lyrata]
           gi|297329267|gb|EFH59686.1| nucleic acid binding protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 785

 Score = 77.8 bits (190), Expect = 3e-12
 Identities = 59/215 (27%), Positives = 99/215 (46%), Gaps = 4/215 (1%)
 Frame = +1

Query: 19  QKQRPKVSLPTSNVKRSNEQVEGEETRKSGFDSGELKEQKQGPKVSLPTSDVQRSDKRIE 198
           +K    ++LPT N         G + + +  ++   +E+     +S   S  Q+SD    
Sbjct: 389 KKLMDSLNLPTDN---------GMDAQANSLNASSSEEKS----ISKSNSAFQKSDGFCA 435

Query: 199 GEETRKSGFDSGDLKEQKQRPKVLLSTXXXXXXXXXXXISAITESEE--NKVLVRFLSHS 372
            EE    G ++  ++ Q    +  L+            + A++  E   NKVL+RFL  S
Sbjct: 436 TEEEESKG-ETLVMENQSLCSQATLAATTANPKVTKKSLFALSAGEHSPNKVLLRFLQES 494

Query: 373 VTKNQIFNFFKKCGEILKI-ECPDVQKPLLKTACVHFKTRDGLENAISKSGLCV-GGPVI 546
             K  I   F + G +L + E P  +  + K A + F+T   ++ A+ K  + V     +
Sbjct: 495 CQKKHIVEVFSQFGAVLHVQEIPSFEGCIYKDALLTFETNTAVKKALEKGRVTVMNNNAV 554

Query: 547 VESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
           VE+ + E+    + +P L GDPDVP ALVK P+RT
Sbjct: 555 VEATSQEDMVERICIPDLIGDPDVPVALVKEPSRT 589


>ref|XP_004247930.1| PREDICTED: uncharacterized protein LOC101262563 [Solanum
           lycopersicum]
          Length = 851

 Score = 75.1 bits (183), Expect = 2e-11
 Identities = 35/107 (32%), Positives = 64/107 (59%)
 Frame = +1

Query: 331 SEENKVLVRFLSHSVTKNQIFNFFKKCGEILKIECPDVQKPLLKTACVHFKTRDGLENAI 510
           S+ENK+ ++F++   T++ + + FK CG I K+  P V+    K A ++F+++ G + A+
Sbjct: 546 SDENKLTIKFVNVKATEDDVRDCFKSCGAITKVVFPSVKSTNYKVAHIYFESKKGRQKAL 605

Query: 511 SKSGLCVGGPVIVESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
             S + +   V+VE+         + +P L G P+VP +LVK+P+RT
Sbjct: 606 EWSDVVIKNIVVVEATFPPKGRERMCIPDLIGYPEVPTSLVKHPSRT 652


>ref|XP_002314482.2| hypothetical protein POPTR_0010s08070g [Populus trichocarpa]
           gi|550329344|gb|EEF00653.2| hypothetical protein
           POPTR_0010s08070g [Populus trichocarpa]
          Length = 696

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 56/187 (29%), Positives = 95/187 (50%), Gaps = 7/187 (3%)
 Frame = +1

Query: 112 DSGELKEQKQGPKVSLPTSDVQRSDKR---IEGEETRKSGF--DSGDLKEQKQRPKVLLS 276
           D+  LK   +  K  L    V +++K+   I+ E++ K+    + G+L   ++ P+  L 
Sbjct: 319 DNAILKSNAESRKTVLKDRAVSKNNKKRSKIKKEQSPKTMTKDEGGNLDIAEKTPQAPLE 378

Query: 277 TXXXXXXXXXXXISAITESE-ENKVLVRFLSHSVTKNQIFNFFKKCGEILKIE-CPDVQK 450
           T           ++++ + + ENK+L+RFL   V    I + F+ CG I KIE    V+ 
Sbjct: 379 TSEKDSNQTP--LTSLADGDTENKLLLRFLHKDVGDGDIISCFRNCGPISKIEKVSSVKG 436

Query: 451 PLLKTACVHFKTRDGLENAISKSGLCVGGPVIVESATSENWTTTVHVPSLFGDPDVPAAL 630
             L  A +HF+TR GL  A+ K  + +       +A   +  + + +P+L GD D+  AL
Sbjct: 437 SNLFDAFLHFETRQGLHKALEKPEVLIKN----SNAFIHDTASRISIPNLIGDIDISVAL 492

Query: 631 VKNPTRT 651
           VK+PTRT
Sbjct: 493 VKHPTRT 499


>ref|XP_006354470.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 928

 Score = 72.8 bits (177), Expect = 9e-11
 Identities = 35/107 (32%), Positives = 61/107 (57%)
 Frame = +1

Query: 331 SEENKVLVRFLSHSVTKNQIFNFFKKCGEILKIECPDVQKPLLKTACVHFKTRDGLENAI 510
           S+ENK+ ++F++   T+  + + FK CG I K+  P V     K A ++F+++ G + A+
Sbjct: 623 SDENKMTIKFVNVKATEQDVCDGFKGCGAITKVVFPSVISTNYKVAHIYFESKKGKQKAL 682

Query: 511 SKSGLCVGGPVIVESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
             S   +   V+VE+         + +P L G P+VP +LVK+P+RT
Sbjct: 683 KWSDTVIRNVVVVEATFPPKGRERMCIPDLIGHPEVPTSLVKHPSRT 729


>ref|NP_188979.2| RNA recognition motif-containing protein [Arabidopsis thaliana]
           gi|11994322|dbj|BAB02281.1| unnamed protein product
           [Arabidopsis thaliana] gi|332643236|gb|AEE76757.1| RNA
           recognition motif-containing protein [Arabidopsis
           thaliana]
          Length = 811

 Score = 68.9 bits (167), Expect = 1e-09
 Identities = 41/111 (36%), Positives = 60/111 (54%), Gaps = 3/111 (2%)
 Frame = +1

Query: 328 ESEENKVLVRFLSHSVTKNQIFNFFK-KCGEILKI-ECPDVQKPLLKTACVHFKTRDGLE 501
           E   NKVL+RFL  S  K  I   F  + G +L + E P ++  + K A + F+T   ++
Sbjct: 505 EHSPNKVLLRFLPESSMKKHIVKAFSSQFGAVLHVQEIPSIEGCIYKDALLTFETNTAVK 564

Query: 502 NAISKSGLCVGG-PVIVESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
            A+ K  + V     +VE+ + E+    + +P L GDPDVP ALVK P RT
Sbjct: 565 KALKKGHVTVMNYNTVVEATSQEDMVERICIPDLIGDPDVPVALVKEPART 615


>ref|XP_006406035.1| hypothetical protein EUTSA_v10020149mg [Eutrema salsugineum]
           gi|557107181|gb|ESQ47488.1| hypothetical protein
           EUTSA_v10020149mg [Eutrema salsugineum]
          Length = 730

 Score = 64.7 bits (156), Expect = 2e-08
 Identities = 41/113 (36%), Positives = 60/113 (53%), Gaps = 2/113 (1%)
 Frame = +1

Query: 319 AITESEENKVLVRFLSHSVTKNQIFNFFKKCGEILKI-ECPDVQKPLLKTACVHFKTRDG 495
           ++ E   NKV +RFL     K +I   F + G +L   E P       K A + F+T   
Sbjct: 425 SVGEHSPNKVCLRFLPR-FDKEEIVKRFSEFGAVLDFQEIPSFDGCYYKDAVLTFETHSA 483

Query: 496 LENAISKSGLCVGG-PVIVESATSENWTTTVHVPSLFGDPDVPAALVKNPTRT 651
           ++ A+ K+ + V    VIVE+ + E+    + +P L GDPDVP AL+K PTRT
Sbjct: 484 VKKALKKAVVMVKNYSVIVEATSQEDNVEKICIPDLIGDPDVPIALLKEPTRT 536


>ref|XP_006857341.1| hypothetical protein AMTR_s00067p00095130 [Amborella trichopoda]
           gi|548861434|gb|ERN18808.1| hypothetical protein
           AMTR_s00067p00095130 [Amborella trichopoda]
          Length = 773

 Score = 58.9 bits (141), Expect = 1e-06
 Identities = 39/109 (35%), Positives = 54/109 (49%), Gaps = 4/109 (3%)
 Frame = +1

Query: 337 ENKVLVRFLSHSVTKNQIFNFFKKCGEILKIECPDVQKPLLK--TACVHFKTRDGLENAI 510
           +N + V+FL  S T   I   F  CGEI ++      K   +   A V F   +GL+ A+
Sbjct: 467 QNTLFVKFLPKSATDVDIRKAFGGCGEIEELCIIPSLKATARFNNAYVSFLRGEGLQRAL 526

Query: 511 SKSGLCVGGPVIVESATSE--NWTTTVHVPSLFGDPDVPAALVKNPTRT 651
            KS L + G  +V  A S     T  V + +L GDPD P  L++NP RT
Sbjct: 527 EKSNLVINGADVVVEADSPLLKITNMVSISNLIGDPDAPLPLLENPVRT 575

