BLASTX nr result

ID: Mentha22_contig00036127 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00036127
         (367 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...    73   4e-11
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    72   6e-11
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...    71   2e-10
gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LT...    70   4e-10
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...    69   5e-10
gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]              67   3e-09
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...    67   3e-09
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]              66   6e-09
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]            66   6e-09
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...    65   7e-09
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...    65   1e-08
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...    64   2e-08
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...    64   2e-08
ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664...    63   4e-08
gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thali...    62   8e-08
gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea]        62   8e-08
gb|ABK28152.1| unknown [Arabidopsis thaliana]                          62   8e-08
dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606...    62   1e-07
gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcrip...    62   1e-07
ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arab...    61   2e-07

>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 2/79 (2%)
 Frame = +1

Query: 16   TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISR--RCM 189
            TK  + + R    ++ W+K +W  Y  PK+S  LW  +Q RL + DR+K  +  +   C 
Sbjct: 1338 TKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCT 1397

Query: 190  LCNNADETNDHTFFQCERT 246
            LCNNA+ET DH FF C+ T
Sbjct: 1398 LCNNAEETRDHLFFSCQYT 1416


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 34/109 (31%), Positives = 54/109 (49%)
 Frame = +1

Query: 28  YEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISRRCMLCNNAD 207
           Y++ R       W   IW+  IP K S  LW A + RL + DR  F +    C LC N  
Sbjct: 275 YDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEA 334

Query: 208 ETNDHTFFQCERTVEI*SEICLWLKIQNQMTTIPSTIHRFQREKAGSGI 354
           E++ H FF C  ++ + + I  W+ ++ Q  ++  +I    R +A SG+
Sbjct: 335 ESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHSISALIRRRATSGV 383


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 32/86 (37%), Positives = 48/86 (55%), Gaps = 2/86 (2%)
 Frame = +1

Query: 16   TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFAD--ISRRCM 189
            TK+ + H R    ++ WHK +W ++  PKFS   W AI+ RL + DR+   +      C+
Sbjct: 762  TKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCV 821

Query: 190  LCNNADETNDHTFFQCERTVEI*SEI 267
             C++  ET DH FFQC  + EI + I
Sbjct: 822  FCSSPMETRDHLFFQCCYSSEIWTSI 847


>gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LTR retroelement reverse
           transcriptase At2g23880 gi|3738337 from Arabidopsis
           thaliana BAC F27L4 gb|AC005170 [Arabidopsis thaliana]
          Length = 206

 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFAD--ISRRCM 189
           TK+ + H R    ++ WH  +W ++  PKFS   W A++ RL   DR+   +      C+
Sbjct: 49  TKDTWNHIRTSSNQRAWHTGVWFAHATPKFSFCAWLAVRNRLSMVDRMMTWNNGTPTTCV 108

Query: 190 LCNNADETNDHTFFQCERTVEI*SEI 267
            C++  ET DH FFQC  + EI + I
Sbjct: 109 FCSSPMETRDHLFFQCHYSSEIWTSI 134


>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 30/82 (36%), Positives = 46/82 (56%), Gaps = 2/82 (2%)
 Frame = +1

Query: 16   TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCM 189
            TKE + + R  G +  WHK +W ++  PKFS  +W A+  RL + D++      +   C+
Sbjct: 1266 TKETWNNTRTMGIEVPWHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCL 1325

Query: 190  LCNNADETNDHTFFQCERTVEI 255
            LC NA E+ DH FF C  + E+
Sbjct: 1326 LCRNATESRDHLFFSCSFSSEV 1347


>gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]
          Length = 236

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCM 189
           TKE   H R       W+K +W  +  PK+S  +W A+  RL + DR+       S  C+
Sbjct: 86  TKETLNHMRTISMDVDWYKGVWFGHSTPKYSFCVWLAVLNRLSTGDRMTHWNGGQSAACV 145

Query: 190 LCNNADETNDHTFFQCE 240
           LC+NA ET DH FF C+
Sbjct: 146 LCHNAPETRDHLFFSCD 162


>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1|
           putative protein [Arabidopsis thaliana]
          Length = 473

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCM 189
           TK+ + H R    K  W+K +W +   PK +  +W A+  RL + DR+      +   C+
Sbjct: 289 TKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCI 348

Query: 190 LCNNADETNDHTFFQCERTVEI 255
           LCN A E+ DH FF C    EI
Sbjct: 349 LCNKALESRDHLFFSCPFATEI 370


>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
          Length = 653

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 37/120 (30%), Positives = 58/120 (48%), Gaps = 5/120 (4%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFAD--ISRRCM 189
           T++ +   R       WH  IW ++  PKFS   W A+Q RL + D++   +  +S  C+
Sbjct: 470 TRDTWNQTRNTSTPVTWHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCV 529

Query: 190 LCNNADETNDHTFFQCERTVEI*SEICLWL---KIQNQMTTIPSTIHRFQREKAGSGILR 360
           LCNN  ET +H FF C  T EI   +   +   K     +TI +++    R +  S + R
Sbjct: 530 LCNNNIETRNHLFFSCCYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRNRTESFLAR 589


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score = 65.9 bits (159), Expect = 6e-09
 Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCM 189
           TKE +   R    K  W+K +W S+  PK+SV  W AI+ RL + DR+    A     C+
Sbjct: 296 TKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCV 355

Query: 190 LCNNADETNDHTFFQCERTVEI*SEI 267
           LC++  ET DH FF C  + E+ S +
Sbjct: 356 LCHHLVETRDHLFFTCPYSAEVWSTL 381


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
           [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
           RNA-directed DNA polymerase (reverse
           transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 2/82 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCM 189
           TKE +   R    K  W+K +W S+  PK+SV  W AI+ RL + DR+    A     C+
Sbjct: 268 TKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCV 327

Query: 190 LCNNADETNDHTFFQCERTVEI 255
           LC++  ET DH FF C  + E+
Sbjct: 328 LCHHLVETRDHLFFTCPYSAEV 349


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 31/79 (39%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
 Frame = +1

Query: 16   TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDR-LKFAD-ISRRCM 189
            T++ + H R+   +  WHK IW S+  PK+S   W A  GRL + DR + +A+ I+  C+
Sbjct: 1040 TRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCI 1099

Query: 190  LCNNADETNDHTFFQCERT 246
             C    ET DH FF C  T
Sbjct: 1100 FCQGTLETRDHLFFTCSFT 1118


>ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max]
          Length = 477

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 31/110 (28%), Positives = 54/110 (49%)
 Frame = +1

Query: 22  EAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISRRCMLCNN 201
           +AY++ R       W+  +W+  IP K S  LW A +  L + DR  F +    C LC  
Sbjct: 302 KAYDYIRGVKPAVNWNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRT 361

Query: 202 ADETNDHTFFQCERTVEI*SEICLWLKIQNQMTTIPSTIHRFQREKAGSG 351
             +++ H FF C  ++++ + I  W+ +  Q  ++  TI+     +A SG
Sbjct: 362 KAKSHAHLFFSCRISLQVWANIRDWIPLHRQTISLQCTINSRICGRATSG 411


>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana]
            gi|20197043|gb|AAM14892.1| putative reverse transcriptase
            [Arabidopsis thaliana]
          Length = 1412

 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 35/101 (34%), Positives = 49/101 (48%), Gaps = 2/101 (1%)
 Frame = +1

Query: 22   EAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKF--ADISRRCMLC 195
            E +   R +G  K WHKAIW S   PKF+   W A   RL + D++      IS  C+LC
Sbjct: 1225 EIWHQIREQGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLC 1284

Query: 196  NNADETNDHTFFQCERTVEI*SEICLWLKIQNQMTTIPSTI 318
            N + E+ DH FF C  +  I   +   L +    T  P+ +
Sbjct: 1285 NISAESRDHLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALL 1325


>ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664837 [Glycine max]
          Length = 97

 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 2/78 (2%)
 Frame = +1

Query: 64  WHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRL-KFADISRR-CMLCNNADETNDHTFFQC 237
           W    + +Y  P+ S T W A  GRL + DRL +F  I  + C LCN  DE++DH FF C
Sbjct: 10  WRHLFYRNYARPRASHTTWLACHGRLATKDRLCRFGLIQEKICSLCNEVDESHDHLFFAC 69

Query: 238 ERTVEI*SEICLWLKIQN 291
             + ++ SE+  W+  Q+
Sbjct: 70  SESKKVWSEVLNWIDCQH 87


>gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thaliana]
          Length = 195

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISRR--CM 189
           TK  + +      +  W+K +W  Y  PK+S  LW  IQ RL + D +K  +  ++  C 
Sbjct: 5   TKVTWNNIGMHQPQTNWYKGVWFPYSTPKYSFLLWLTIQNRLSTGDHIKAWNSGQQVTCT 64

Query: 190 LCNNADETNDHTFFQCERTVEI 255
           LC NA+ET +  FF C  T E+
Sbjct: 65  LCGNAEETRNLLFFSCHYTSEV 86


>gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea]
          Length = 458

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 4/87 (4%)
 Frame = +1

Query: 70  KAIWHSYIPPKFSVTLWFAIQGRLKSFDRL-KFADI---SRRCMLCNNADETNDHTFFQC 237
           + +W   +PP+  +  WF + GR+ + DRL +F  I     RC+LC+ A+ET  H F +C
Sbjct: 306 RTVWKGLVPPRVELLTWFVLVGRVNTKDRLCRFRVIPQQDNRCVLCDKAEETVFHLFLEC 365

Query: 238 ERTVEI*SEICLWLKIQNQMTTIPSTI 318
           E T ++    C WL+   +  ++P T+
Sbjct: 366 ETTWKV---WCAWLRALGRQWSLPGTL 389


>gb|ABK28152.1| unknown [Arabidopsis thaliana]
          Length = 196

 Score = 62.0 bits (149), Expect = 8e-08
 Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISRR--CM 189
           TK  + +      +  W+K +W  Y  PK+S  LW  IQ RL + D +K  +  ++  C 
Sbjct: 5   TKVTWNNIGMHQPQTNWYKGVWFPYSTPKYSFLLWLTIQNRLSTGDHIKAWNSGQQVTCT 64

Query: 190 LCNNADETNDHTFFQCERTVEI 255
           LC NA+ET +  FF C  T E+
Sbjct: 65  LCGNAEETRNLLFFSCHYTSEV 86


>dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606753|dbj|BAD66776.1|
           orf147a [Beta vulgaris subsp. vulgaris]
          Length = 147

 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 32/71 (45%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
 Frame = +1

Query: 82  HSYIPPKFSVTLWFAIQGRLKSFDRL-KFADI-SRRCMLCNNADETNDHTFFQCERTVEI 255
           +++ PPK +   W  I  RL + DRL KF  +  + C+LC N DET DH FF CE + EI
Sbjct: 7   NNFSPPKCTFITWLTILDRLATCDRLQKFGIVCDQLCVLCGNVDETRDHLFFVCEFSYEI 66

Query: 256 *SEICLWLKIQ 288
            S +  WL IQ
Sbjct: 67  WSSLLCWLGIQ 77


>gb|AAC13599.1| similar to reverse transcriptase (Pfam: transcript_fact.hmm, score:
            72.31) [Arabidopsis thaliana]
          Length = 928

 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 35/93 (37%), Positives = 50/93 (53%), Gaps = 2/93 (2%)
 Frame = +1

Query: 46   KGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADI--SRRCMLCNNADETND 219
            K +K+ WHK +W ++  PK S  +W AI  +L +  R++  ++  S  C+LCNN  ET D
Sbjct: 755  KPKKEAWHKGVWFAHETPKHSFCVWLAIWNKLSTGQRMQHWNLQSSVGCVLCNNNLETRD 814

Query: 220  HTFFQCERTVEI*SEICLWLKIQNQMTTIPSTI 318
            H FF C  T  I   +   L +Q   TT   TI
Sbjct: 815  HLFFSCAYTSGIWEALAKNL-LQRSYTTDWQTI 846


>ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp.
           lyrata] gi|297311371|gb|EFH41795.1| hypothetical protein
           ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata]
          Length = 227

 Score = 60.8 bits (146), Expect = 2e-07
 Identities = 24/84 (28%), Positives = 43/84 (51%)
 Frame = +1

Query: 16  TKEAYEHFRAKGEKKFWHKAIWHSYIPPKFSVTLWFAIQGRLKSFDRLKFADISRRCMLC 195
           ++  +   R    K  WH ++W     P++S  +W A++ +L +  R++   + + C+ C
Sbjct: 65  SRHTWNLLRKAKHKVLWHNSVWFPQRVPRYSFIVWLAVKDQLSTGTRMRAWGVEQPCVFC 124

Query: 196 NNADETNDHTFFQCERTVEI*SEI 267
              DE+ DH FF C  T  I SE+
Sbjct: 125 RERDESRDHLFFACPFTYSIWSEL 148


Top