BLASTX nr result
ID: Mentha23_contig00046563
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00046563
         (320 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660...   112   7e-23
ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein A...   106   4e-21
ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein A...   102   4e-20
ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein A...   101   1e-19
ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298...    97   3e-18
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...    95   9e-18
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]        95   1e-17
ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268...    94   1e-17
ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664...    92   6e-17
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...    92   1e-16
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...    91   2e-16
ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661...    91   2e-16
ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663...    90   3e-16
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...    90   4e-16
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]            90   4e-16
ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232...    89   6e-16
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...    89   8e-16
ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcript...    86   5e-15
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...    86   5e-15
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]            86   5e-15

>ref|XP_006590026.1| PREDICTED: uncharacterized protein LOC102660482 [Glycine max]
          Length = 303

 Score = 112 bits (279), Expect = 7e-23
 Identities = 56/110 (50%), Positives = 70/110 (63%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL +++L HY PLL +IT I WS LS AG+ EL+++V+QG+ FWI PLP
Sbjct: 97   VPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWIGIFPLPQ 156

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            S+ DRIN+ R FLW G K LVAW VC PK EGGLGL +L WN
Sbjct: 157  SVLDRINASCRNFLWGKADIGKKKPLVAWSVVCSPKREGGLGLFNLKDWN 206

>ref|XP_006584200.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max]
          Length = 239

 Score = 106 bits (264), Expect = 4e-21
 Identities = 52/110 (47%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL +++L HY PLL +IT I WS LS AG+ EL+++V+QG+ FW+K PL
Sbjct: 97   VPLLSSRLNVCHYAPLLSKITGLIQGWSRKSLSYAGKLELIRAVIQGIVNFWMKIFPLSQ 156

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            S+ DRIN+ FLW G L+AW VC PK EGGLGL +L WN
Sbjct: 157  SVLDRINASCCNFLWGKADIGKNKSLIAWSVVCSPKKEGGLGLFNLKDWN 206

>ref|XP_006584390.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max]
          Length = 316

 Score = 102 bits (255), Expect = 4e-20
 Identities = 51/109 (46%), Positives = 66/109 (60%), Gaps = 5/109 (4%)
 Frame = +3

Query: 6    PLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVS 185
            PL +++L HY PLL +I I W+ LS G+ EL+K+V+QG+ FW++ PLP S
Sbjct: 131  PLLSSRLNVCHYAPLLYKIVGLIQGWNKKSLSYVGKLELIKAVIQGIMNFWMRIFPLPQS 190

Query: 186  ISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            + DRIN+ FLW G LVAW VC PK EGGLGL +L WN
Sbjct: 191  VLDRINASCCNFLWSKADIGKNKPLVAWPVVCSPKQEGGLGLFNLKDWN 239

>ref|XP_006586426.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Glycine max]
          Length = 192

 Score = 101 bits (251), Expect = 1e-19
 Identities = 51/110 (46%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL +++L HY LL +IT I WS LS AG+ EL+++V+QG+ FW++ LP
Sbjct: 54   VPLLSSRLNVCHYALLLSKITGLIQGWSKKSLSYAGKLELIRAVIQGIVNFWMEIFSLPQ 113

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            S+ D IN+ R FLW G LVAW VC PK EGGLGL +L WN
Sbjct: 114  SVMDWINASCRNFLWGKADIGKNKPLVAWSVVCSPKKEGGLGLLNLKDWN 163

>ref|XP_004293181.1| PREDICTED: uncharacterized protein LOC101298394 [Fragaria vesca subsp. vesca]
          Length = 958

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL +KL +PLLD+I + I W N LS AGR +L++SVL ++ +W L LP
Sbjct: 596  IPLITSKLRMQDCSPLLDRIETRIKSWENKVLSFAGRLQLIQSVLSSIQVYWASHLILPK 655

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            + I +LR FLW G VAW +C+PK EGGLG++DL WNK
Sbjct: 656  KVLKDIEKRLRCFLWAGNCSGRAATKVAWSEICLPKCEGGLGIKDLHCWNK 706

>ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max]
          Length = 506

 Score = 95.1 bits (235), Expect = 9e-18
 Identities = 45/111 (40%), Positives = 65/111 (58%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +P+ + KL IHY+PL+D+I I W+ LS AGR +LV SV+ + +W+ P P
Sbjct: 145  VPVTSKKLSTIHYSPLIDKIVGKIKHWTARLLSYAGRLQLVNSVMFALTNYWLNCFPFPK 204

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            S+ +I + R FLW GS+ VAWK +C P+ GGL + D+ WNK
Sbjct: 205  SVLQKIEAICRIFLWTGGFEGSRKSPVAWKQICSPRSCGGLNIIDIDIWNK 255

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 94.7 bits (234), Expect = 1e-17
 Identities = 49/111 (44%), Positives = 63/111 (56%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL KL Y PLL++IT+ W N CLS AGR +L+ SV+ G FW+ + LP
Sbjct: 767  LPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPK 826

Query: 183  SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            RI S +FLW +K V+W +C+PK EGGLGL+ L WNK
Sbjct: 827  GCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNK 877

>ref|XP_004253224.1| PREDICTED: uncharacterized protein LOC101268376 [Solanum lycopersicum]
          Length = 717

 Score = 94.4 bits (233), Expect = 1e-17
 Identities = 47/111 (42%), Positives = 67/111 (60%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL++ KL I + PL++++ + IN W+ LS AGRA+LVK+VL GV+ W + +P
Sbjct: 567  VPLSSKKLNTIQWYPLIEKVMARINSWTAKKLSYAGRAQLVKTVLFGVQALWAQLFIIPA 626

Query: 183  SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            I I R +LW +K L+AW VC PK EGGLGL +L WN+
Sbjct: 627  KIIKLIEGLCRSYLWSGVGYVTKKALIAWDKVCSPKYEGGLGLINLKIWNR 677

>ref|XP_006584238.1| PREDICTED: uncharacterized protein LOC102664824 [Glycine max]
          Length = 939

 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 48/111 (43%), Positives = 66/111 (59%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL++ KL HY L+D+I I WS LS AGR +L++SV+ FW++ LPLP
Sbjct: 587  IPLSSKKLNIKHYQVLIDKIVGRITHWSAGLLSYAGRVQLIQSVIFATINFWMQCLPLPK 646

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            + RIN+ R FLW S+ +AW+ VC PK GGL + +LA WNK
Sbjct: 647  FVIMRINAICRSFLWIGNSNISRKSPIAWEKVCSPKINGGLNIINLAIWNK 697

>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana]
          Length = 1164

 Score = 91.7 bits (226), Expect = 1e-16
 Identities = 47/111 (42%), Positives = 64/111 (57%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL + KL Y PL+++IT+ N W LS AGR +L+ SV+ G+ FWI S LP+
Sbjct: 664  LPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISGIVNFWISSFILPL 723

Query: 183  SISDRINSQLRKFLWGSK-----YCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            +I S +FLW S+ VAW VC+PK EGG+GL+ A N+
Sbjct: 724  GCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFAVSNR 774

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score = 90.9 bits (224), Expect = 2e-16
 Identities = 45/111 (40%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL KL Y+ L+D+I + N W+ LS AGR +L+ SV+ FW+ S LP
Sbjct: 766  LPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYSTVNFWLSSFILPK 825

Query: 183  SISDRINSQLRKFLWGSKY-----CLVAWKNVCMPKDEGGLGLQDLATWNK 320
            I +FLWG+ V+W+N C+PK EGGLGL++ TWNK
Sbjct: 826  CCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFWTWNK 876

>ref|XP_006579104.1| PREDICTED: uncharacterized protein LOC102661523 [Glycine max]
          Length = 947

 Score = 90.5 bits (223), Expect = 2e-16
 Identities = 39/110 (35%), Positives = 68/110 (61%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL + KL +Y PL+D+IT+ I W++ L++ GR ++V + + FW++ LP+P+
Sbjct: 587  VPLTSKKLNIKYYLPLIDKITTRIRHWTSKLLNMTGRVQMVNCTITAIVQFWMQCLPIPM 646

Query: 183  SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            S+ +I+S R F+W ++ +AW +VC PK +GGL + +L WN
Sbjct: 647  SVIKKIDSMCRSFVWSRSTEITRKSPIAWNSVCRPKGQGGLNIFNLKVWN 696

>ref|XP_006605183.1| PREDICTED: uncharacterized protein LOC102663533 [Glycine max]
          Length = 514

 Score = 90.1 bits (222), Expect = 3e-16
 Identities = 43/110 (39%), Positives = 64/110 (58%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            +PL+ KL HY PL+++I I WS+ LS+AGR +LV+S++ + +W+ P+P
Sbjct: 248  VPLSCKKLNVHHYLPLVEKIVGKIRHWSSKLLSIAGRIQLVRSIITAIAQYWMSVFPMPK 307

Query: 183  SISDRINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            + +I+S R F+W + LVAWK VC P GGL L +L WN
Sbjct: 308  KVIQKIDSICRSFIWSGSAEVKRKSLVAWKQVCKPARCGGLNLINLELWN 357

>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana]
          Length = 1072

 Score = 89.7 bits (221), Expect = 4e-16
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL KL Y PLL+++++ + W + LS AGR +L+ SV+ G+ FW+ + LP
Sbjct: 627  LPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPK 686

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            +I S KFLW G K V+W + C+PK EGGLG + WNK
Sbjct: 687  GCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNK 737

>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 89.7 bits (221), Expect = 4e-16
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL KL Y PLL+++++ + W + LS AGR +L+ SV+ G+ FW+ + LP
Sbjct: 627  LPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPK 686

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            +I S KFLW G K V+W + C+PK EGGLG + WNK
Sbjct: 687  GCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNK 737

>ref|XP_004173733.1| PREDICTED: uncharacterized protein LOC101232446, partial [Cucumis sativus]
          Length = 382

 Score = 89.0 bits (219), Expect = 6e-16
 Identities = 44/110 (40%), Positives = 65/110 (59%), Gaps = 5/110 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL +L + +PL+ +ITS I WS LS AGR +LV+SVL+ ++ +W LP+
Sbjct: 41   LPLLFGRLQSCDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLRSLQVYWASVFMLPM 100

Query: 183  SISDRINSQLRKFLWGSKY-----CLVAWKNVCMPKDEGGLGLQDLATWN 317
            + ++ LR +LW K VAW VC+P DEGGL ++D ++WN
Sbjct: 101  KVHRDVDKILRSYLWRGKEEGRGGAKVAWDEVCLPFDEGGLAIRDGSSWN 150

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
          Length = 383

 Score = 88.6 bits (218), Expect = 8e-16
 Identities = 43/89 (48%), Positives = 54/89 (60%), Gaps = 5/89 (5%)
 Frame = +3

Query: 66   SFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVSISDRINSQLRKFLWGSK--- 236
            S +RWS LS AG+ EL+++V+QG+ FW+ PLP S+ D I + R FLWG
Sbjct: 106  SISSRWSRKSLSYAGKVELIRAVIQGIANFWMSIFPLPQSVLDTIIATCRNFLWGKADGG 165

Query: 237  --YCLVAWKNVCMPKDEGGLGLQDLATWN 317
            LVAW VC PK EGGLGL +L WN
Sbjct: 166  KIKPLVAWSEVCTPKKEGGLGLFNLKDWN 194

>ref|NP_189068.2| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana]
 gi|332643359|gb|AEE76880.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana]
          Length = 746

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL K+ Y PL+++I I +W+ LS AGR +L+ SV+ + FW+ + LP
Sbjct: 30   LPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPS 89

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            + I+S FLW +K VAW +VC PKDEGGLG++ L NK
Sbjct: 90   ACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANK 140

>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis thaliana]
          Length = 1253

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 43/106 (40%), Positives = 57/106 (53%), Gaps = 5/106 (4%)
 Frame = +3

Query: 15   ANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPVSISD 194
            A KL Y PLL+++ WS CLS AGR +L+ SV+ G+ FWI + LP
Sbjct: 704  ARKLRIAEYGPLLEKLAKRFRSWSVKCLSFAGRVQLIASVISGIINFWISTFILPKGCVK 763

Query: 195  RINSQLRKFLWG-----SKYCLVAWKNVCMPKDEGGLGLQDLATWN 317
            RI + +FLW K VAW VC+PK+EGG+GL+ N
Sbjct: 764  RIEALCARFLWSGNIDVKKGAKVAWSEVCLPKEEGGVGLRRFTVLN 809

>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 5/111 (4%)
 Frame = +3

Query: 3    LPLAANKLLNIHYNPLLDQITSFINRWSNSCLSLAGRAELVKSVLQGVECFWIKSLPLPV 182
            LPL K+ Y PL+++I I +W+ LS AGR +L+ SV+ + FW+ + LP
Sbjct: 30   LPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAFRLPS 89

Query: 183  SISDRINSQLRKFLW-----GSKYCLVAWKNVCMPKDEGGLGLQDLATWNK 320
            + I+S FLW +K VAW +VC PKDEGGLG++ L NK
Sbjct: 90   ACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANK 140