BLASTX nr result

ID: Mentha23_contig00046563 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00046563
         (320 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   112   7e-23
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   106   4e-21
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   102   4e-20
ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein A...   101   1e-19
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...    97   3e-18
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...    95   9e-18
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]        95   1e-17
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...    94   1e-17
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...    92   6e-17
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...    92   1e-16
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...    91   2e-16
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...    91   2e-16
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...    90   3e-16
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...    90   4e-16
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]            90   4e-16
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...    89   6e-16
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    89   8e-16
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...    86   5e-15
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...    86   5e-15
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]            86   5e-15

>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score =  112 bits (279), Expect = 7e-23
 Identities = 56/110 (50%), Positives = 70/110 (63%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL +++L   HY PLL +IT  I  WS   LS AG+ EL+++V+QG+  FWI   PLP 
Sbjct: 97  VPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQ 156

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
           S+ DRIN+  R FLW     G K  LVAW  VC PK EGGLGL +L  WN
Sbjct: 157 SVLDRINASCRNFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWN 206


>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 239

 Score =  106 bits (264), Expect = 4e-21
 Identities = 52/110 (47%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL +++L   HY PLL +IT  I  WS   LS AG+ EL+++V+QG+  FW+K  PL  
Sbjct: 97  VPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQ 156

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
           S+ DRIN+    FLW     G    L+AW  VC PK EGGLGL +L  WN
Sbjct: 157 SVLDRINASCCNFLWGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWN 206


>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 316

 Score =  102 bits (255), Expect = 4e-20
 Identities = 51/109 (46%), Positives = 66/109 (60%), Gaps = 5/109 (4%)
 Frame = +3

Query: 6   PLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVS 185
           PL +++L   HY PLL +I   I  W+   LS  G+ EL+K+V+QG+  FW++  PLP S
Sbjct: 131 PLLSSRLNVCHYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQS 190

Query: 186 ISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
           + DRIN+    FLW     G    LVAW  VC PK EGGLGL +L  WN
Sbjct: 191 VLDRINASCCNFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWN 239


>ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine
           max]
          Length = 192

 Score =  101 bits (251), Expect = 1e-19
 Identities = 51/110 (46%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL +++L   HY  LL +IT  I  WS   LS AG+ EL+++V+QG+  FW++   LP 
Sbjct: 54  VPLLSSRLNVCHYALLLSKITGLIQGWSKKSLSYAGKLELIRAVIQGIVNFWMEIFSLPQ 113

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
           S+ D IN+  R FLW     G    LVAW  VC PK EGGLGL +L  WN
Sbjct: 114 SVMDWINASCRNFLWGKADIGKNKPLVAWSVVCSPKKEGGLGLLNLKDWN 163


>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca
           subsp. vesca]
          Length = 958

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL  +KL     +PLLD+I + I  W N  LS AGR +L++SVL  ++ +W   L LP 
Sbjct: 596 IPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPK 655

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            +   I  +LR FLW     G     VAW  +C+PK EGGLG++DL  WNK
Sbjct: 656 KVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNK 706


>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score = 95.1 bits (235), Expect = 9e-18
 Identities = 45/111 (40%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +P+ + KL  IHY+PL+D+I   I  W+   LS AGR +LV SV+  +  +W+   P P 
Sbjct: 145 VPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPK 204

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
           S+  +I +  R FLW     GS+   VAWK +C P+  GGL + D+  WNK
Sbjct: 205 SVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNK 255


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 94.7 bits (234), Expect = 1e-17
 Identities = 49/111 (44%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL   KL    Y PLL++IT+    W N CLS AGR +L+ SV+ G   FW+ +  LP 
Sbjct: 767  LPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPK 826

Query: 183  SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
                RI S   +FLW      +K   V+W  +C+PK EGGLGL+ L  WNK
Sbjct: 827  GCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877


>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum
           lycopersicum]
          Length = 717

 Score = 94.4 bits (233), Expect = 1e-17
 Identities = 47/111 (42%), Positives = 67/111 (60%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL++ KL  I + PL++++ + IN W+   LS AGRA+LVK+VL GV+  W +   +P 
Sbjct: 567 VPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPA 626

Query: 183 SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            I   I    R +LW      +K  L+AW  VC PK EGGLGL +L  WN+
Sbjct: 627 KIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNR 677


>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 48/111 (43%), Positives = 66/111 (59%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL++ KL   HY  L+D+I   I  WS   LS AGR +L++SV+     FW++ LPLP 
Sbjct: 587 IPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPK 646

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            +  RIN+  R FLW      S+   +AW+ VC PK  GGL + +LA WNK
Sbjct: 647 FVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNK 697


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
           [Arabidopsis thaliana]
          Length = 1164

 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 47/111 (42%), Positives = 64/111 (57%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL + KL    Y PL+++IT+  N W    LS AGR +L+ SV+ G+  FWI S  LP+
Sbjct: 664 LPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFILPL 723

Query: 183 SISDRINSQLRKFLWGSK-----YCLVAWKNVCMPKDEGGLGLQDLATWNK 320
               +I S   +FLW S+        VAW  VC+PK EGG+GL+  A  N+
Sbjct: 724 GCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNR 774


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 90.9 bits (224), Expect = 2e-16
 Identities = 45/111 (40%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL   KL    Y+ L+D+I +  N W+   LS AGR +L+ SV+     FW+ S  LP 
Sbjct: 766  LPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPK 825

Query: 183  SISDRINSQLRKFLWGSKY-----CLVAWKNVCMPKDEGGLGLQDLATWNK 320
                 I     +FLWG+         V+W+N C+PK EGGLGL++  TWNK
Sbjct: 826  CCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNK 876


>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score = 90.5 bits (223), Expect = 2e-16
 Identities = 39/110 (35%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL + KL   +Y PL+D+IT+ I  W++  L++ GR ++V   +  +  FW++ LP+P+
Sbjct: 587 VPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPM 646

Query: 183 SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
           S+  +I+S  R F+W      ++   +AW +VC PK +GGL + +L  WN
Sbjct: 647 SVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWN 696


>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score = 90.1 bits (222), Expect = 3e-16
 Identities = 43/110 (39%), Positives = 64/110 (58%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           +PL+  KL   HY PL+++I   I  WS+  LS+AGR +LV+S++  +  +W+   P+P 
Sbjct: 248 VPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPK 307

Query: 183 SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            +  +I+S  R F+W       +  LVAWK VC P   GGL L +L  WN
Sbjct: 308 KVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWN 357


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
           thaliana]
          Length = 1072

 Score = 89.7 bits (221), Expect = 4e-16
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL   KL    Y PLL+++++ +  W +  LS AGR +L+ SV+ G+  FW+ +  LP 
Sbjct: 627 LPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPK 686

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
               +I S   KFLW     G K   V+W + C+PK EGGLG +    WNK
Sbjct: 687 GCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNK 737


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 89.7 bits (221), Expect = 4e-16
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL   KL    Y PLL+++++ +  W +  LS AGR +L+ SV+ G+  FW+ +  LP 
Sbjct: 627 LPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPK 686

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
               +I S   KFLW     G K   V+W + C+PK EGGLG +    WNK
Sbjct: 687 GCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNK 737


>ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis
           sativus]
          Length = 382

 Score = 89.0 bits (219), Expect = 6e-16
 Identities = 44/110 (40%), Positives = 65/110 (59%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL   +L +   +PL+ +ITS I  WS   LS AGR +LV+SVL+ ++ +W     LP+
Sbjct: 41  LPLLFGRLQSCDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPM 100

Query: 183 SISDRINSQLRKFLWGSKY-----CLVAWKNVCMPKDEGGLGLQDLATWN 317
            +   ++  LR +LW  K        VAW  VC+P DEGGL ++D ++WN
Sbjct: 101 KVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWN 150


>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 88.6 bits (218), Expect = 8e-16
 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 5/89 (5%)
 Frame = +3

Query: 66  SFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVSISDRINSQLRKFLWGSK--- 236
           S  +RWS   LS AG+ EL+++V+QG+  FW+   PLP S+ D I +  R FLWG     
Sbjct: 106 SISSRWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGG 165

Query: 237 --YCLVAWKNVCMPKDEGGLGLQDLATWN 317
               LVAW  VC PK EGGLGL +L  WN
Sbjct: 166 KIKPLVAWSEVCTPKKEGGLGLFNLKDWN 194


>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein
           [Arabidopsis thaliana] gi|332643359|gb|AEE76880.1|
           RNA-directed DNA polymerase (reverse
           transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL   K+    Y PL+++I   I +W+   LS AGR +L+ SV+  +  FW+ +  LP 
Sbjct: 30  LPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPS 89

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
           +    I+S    FLW      +K   VAW +VC PKDEGGLG++ L   NK
Sbjct: 90  ACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANK 140


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
            thaliana]
          Length = 1253

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 43/106 (40%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
 Frame = +3

Query: 15   ANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVSISD 194
            A KL    Y PLL+++      WS  CLS AGR +L+ SV+ G+  FWI +  LP     
Sbjct: 704  ARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWISTFILPKGCVK 763

Query: 195  RINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            RI +   +FLW       K   VAW  VC+PK+EGG+GL+     N
Sbjct: 764  RIEALCARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLN 809


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3   LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
           LPL   K+    Y PL+++I   I +W+   LS AGR +L+ SV+  +  FW+ +  LP 
Sbjct: 30  LPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPS 89

Query: 183 SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
           +    I+S    FLW      +K   VAW +VC PKDEGGLG++ L   NK
Sbjct: 90  ACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANK 140


Top