BLASTX nr result
ID: Mentha22_contig00038044
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha22_contig00038044 (345 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670... 73 4e-11 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 72 8e-11 ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664... 69 5e-10 dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] 67 3e-09 ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668... 67 3e-09 ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arab... 67 3e-09 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 67 3e-09 gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LT... 66 4e-09 emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678... 65 1e-08 ref|XP_004163799.1| PREDICTED: putative ribonuclease H protein A... 64 2e-08 gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea] 64 2e-08 gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali... 64 2e-08 gb|AAC67331.1| putative non-LTR retroelement reverse transcripta... 64 2e-08 gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub... 64 3e-08 ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665... 63 4e-08 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 63 4e-08 gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] 63 5e-08 gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 62 8e-08 dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 62 1e-07 ref|XP_004977924.1| PREDICTED: putative ribonuclease H protein A... 62 1e-07 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 73.2 bits (178), Expect = 4e-11 Identities = 34/97 (35%), Positives = 49/97 (50%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPRQCLLCNAAEESNDHLFFQCPR 143 W IW P K S LWLA + RL DR F + C LC ES+ HLFF C Sbjct: 287 WSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRT 346 Query: 142 TVEIWSGICSWLKIKHRMSTIPSAIRRFQREKAGSGI 32 ++ +W+ I W+ +K + ++ +I R +A SG+ Sbjct: 347 SLRVWAHIRDWIPLKRQSISLQHSISALIRRRATSGV 383 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 72.0 bits (175), Expect = 8e-11 Identities = 37/90 (41%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Frame = -3 Query: 331 KKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF--SDIPRQCLLCNAAEESNDHLF 158 ++ WHK +W ++ KFS WLA+R RL T DR+ + P C+ C++ E+ DHLF Sbjct: 775 QRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLF 834 Query: 157 FQCPRTVEIWSGICSWLKIKHRMSTIPSAI 68 FQC + EIW+ I + K R ST SA+ Sbjct: 835 FQCCYSSEIWTSIAKNV-YKDRFSTKWSAV 863 >ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664837 [Glycine max] Length = 97 Score = 69.3 bits (168), Expect = 5e-10 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 2/78 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRL-KFSDIPRQ-CLLCNAAEESNDHLFFQC 149 W +R+Y + S T WLA GRL T DRL +F I + C LCN +ES+DHLFF C Sbjct: 10 WRHLFYRNYARPRASHTTWLACHGRLATKDRLCRFGLIQEKICSLCNEVDESHDHLFFAC 69 Query: 148 PRTVEIWSGICSWLKIKH 95 + ++WS + +W+ +H Sbjct: 70 SESKKVWSEVLNWIDCQH 87 >dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana] Length = 478 Score = 67.0 bits (162), Expect = 3e-09 Identities = 30/78 (38%), Positives = 46/78 (58%), Gaps = 2/78 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF--SDIPRQCLLCNAAEESNDHLFFQC 149 W+K +W S+ K+SV W+A++ RL T DR+ + C+LC+ E+ DHLFF C Sbjct: 312 WYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVETRDHLFFTC 371 Query: 148 PRTVEIWSGICSWLKIKH 95 P + E+WS + L +H Sbjct: 372 PYSAEVWSTLTRKLLSQH 389 >ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max] Length = 477 Score = 66.6 bits (161), Expect = 3e-09 Identities = 32/106 (30%), Positives = 51/106 (48%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPRQCLLCNAAEESNDHLFFQCPR 143 W+ +W P K S LWLA + L T DR F + C LC +S+ HLFF C Sbjct: 316 WNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFSCRI 375 Query: 142 TVEIWSGICSWLKIKHRMSTIPSAIRRFQREKAGSGIVRKAKWIVL 5 ++++W+ I W+ + + ++ I +A SG K + + L Sbjct: 376 SLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLAL 421 >ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata] gi|297311371|gb|EFH41795.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata] Length = 227 Score = 66.6 bits (161), Expect = 3e-09 Identities = 28/75 (37%), Positives = 43/75 (57%) Frame = -3 Query: 331 KKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPRQCLLCNAAEESNDHLFFQ 152 K WH ++W ++S +WLA++ +L T R++ + + C+ C +ES DHLFF Sbjct: 78 KVLWHNSVWFPQRVPRYSFIVWLAVKDQLSTGTRMRAWGVEQPCVFCRERDESRDHLFFA 137 Query: 151 CPRTVEIWSGICSWL 107 CP T IWS + S L Sbjct: 138 CPFTYSIWSELTSRL 152 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 66.6 bits (161), Expect = 3e-09 Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 2/70 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPR--QCLLCNAAEESNDHLFFQC 149 W+K +W Y K+S LWL ++ RL T DR+K + + C LCN AEE+ DHLFF C Sbjct: 1354 WYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSC 1413 Query: 148 PRTVEIWSGI 119 T +W + Sbjct: 1414 QYTSYVWEAL 1423 >gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LTR retroelement reverse transcriptase At2g23880 gi|3738337 from Arabidopsis thaliana BAC F27L4 gb|AC005170 [Arabidopsis thaliana] Length = 206 Score = 66.2 bits (160), Expect = 4e-09 Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 2/73 (2%) Frame = -3 Query: 331 KKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF--SDIPRQCLLCNAAEESNDHLF 158 ++ WH +W ++ KFS WLA+R RL DR+ + P C+ C++ E+ DHLF Sbjct: 62 QRAWHTGVWFAHATPKFSFCAWLAVRNRLSMVDRMMTWNNGTPTTCVFCSSPMETRDHLF 121 Query: 157 FQCPRTVEIWSGI 119 FQC + EIW+ I Sbjct: 122 FQCHYSSEIWTSI 134 >emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana] Length = 473 Score = 65.1 bits (157), Expect = 1e-08 Identities = 30/70 (42%), Positives = 40/70 (57%), Gaps = 2/70 (2%) Frame = -3 Query: 331 KKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSD--IPRQCLLCNAAEESNDHLF 158 K W+K +W + K + +WLA+ RL T DR+ + + C+LCN A ES DHLF Sbjct: 302 KVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILCNKALESRDHLF 361 Query: 157 FQCPRTVEIW 128 F CP EIW Sbjct: 362 FSCPFATEIW 371 >ref|XP_004163799.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Cucumis sativus] Length = 288 Score = 64.3 bits (155), Expect = 2e-08 Identities = 35/80 (43%), Positives = 42/80 (52%), Gaps = 2/80 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF--SDIPRQCLLCNAAEESNDHLFFQC 149 W +W K S WLA++ RL T DRL S IP C+LC ES DHLFF C Sbjct: 201 WSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYESRDHLFFPC 260 Query: 148 PRTVEIWSGICSWLKIKHRM 89 P EIWS I ++ HR+ Sbjct: 261 PFGWEIWSRILLFMSSSHRI 280 >gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea] Length = 458 Score = 63.9 bits (154), Expect = 2e-08 Identities = 30/88 (34%), Positives = 51/88 (57%), Gaps = 4/88 (4%) Frame = -3 Query: 316 KAIWRSYFPLKFSVTLWLAMRGRLKTFDRL-KFSDIPRQ---CLLCNAAEESNDHLFFQC 149 + +W+ P + + W + GR+ T DRL +F IP+Q C+LC+ AEE+ HLF +C Sbjct: 306 RTVWKGLVPPRVELLTWFVLVGRVNTKDRLCRFRVIPQQDNRCVLCDKAEETVFHLFLEC 365 Query: 148 PRTVEIWSGICSWLKIKHRMSTIPSAIR 65 T ++W C+WL+ R ++P ++ Sbjct: 366 ETTWKVW---CAWLRALGRQWSLPGTLK 390 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 63.9 bits (154), Expect = 2e-08 Identities = 35/93 (37%), Positives = 48/93 (51%), Gaps = 2/93 (2%) Frame = -3 Query: 340 KGEKKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSD--IPRQCLLCNAAEESND 167 +G K WHKAIW S KF+ WLA RL T D++ + I C+LCN + ES D Sbjct: 1233 QGLVKQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRD 1292 Query: 166 HLFFQCPRTVEIWSGICSWLKIKHRMSTIPSAI 68 HLFF C + IW + L + + P+ + Sbjct: 1293 HLFFSCNFSSHIWDRLTRRLLLCRYTTNFPALL 1325 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 63.9 bits (154), Expect = 2e-08 Identities = 30/67 (44%), Positives = 40/67 (59%), Gaps = 2/67 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPRQ--CLLCNAAEESNDHLFFQC 149 WHK +W ++ KFS +WLA+ RL T D++ + Q CLLC A ES DHLFF C Sbjct: 1282 WHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCLLCRNATESRDHLFFSC 1341 Query: 148 PRTVEIW 128 + E+W Sbjct: 1342 SFSSEVW 1348 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata] Length = 441 Score = 63.5 bits (153), Expect = 3e-08 Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 2/78 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF--SDIPRQCLLCNAAEESNDHLFFQC 149 W++ IW S+ K+S WLA + RL T DR+ + + C+ C E+ +HLFF C Sbjct: 275 WYQGIWFSHATPKYSFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQTETRNHLFFTC 334 Query: 148 PRTVEIWSGICSWLKIKH 95 + E+WSG+ S L +H Sbjct: 335 RYSREVWSGLTSKLLTRH 352 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 63.2 bits (152), Expect = 4e-08 Identities = 34/103 (33%), Positives = 53/103 (51%), Gaps = 2/103 (1%) Frame = -3 Query: 337 GEKKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRL-KFSDIP-RQCLLCNAAEESNDH 164 G++K W ++ + + + LWLA GRL T DRL K+ I + C C+ EES +H Sbjct: 338 GQRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCFCSE-EESMNH 396 Query: 163 LFFQCPRTVEIWSGICSWLKIKHRMSTIPSAIRRFQREKAGSG 35 LFF C + +W + W++I+H S P+ + G G Sbjct: 397 LFFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTKGKG 439 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 63.2 bits (152), Expect = 4e-08 Identities = 32/69 (46%), Positives = 41/69 (59%), Gaps = 3/69 (4%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLK---FSDIPRQCLLCNAAEESNDHLFFQ 152 WHKA+W K + W+ RL T DRL+ FS IP C+LCN +ES +HLFF+ Sbjct: 986 WHKAVWFKDHVPKQAFICWVVAHNRLHTRDRLRRWGFS-IPPTCVLCNDLDESREHLFFR 1044 Query: 151 CPRTVEIWS 125 C + EIWS Sbjct: 1045 CQFSSEIWS 1053 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 62.8 bits (151), Expect = 5e-08 Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 5/104 (4%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSD--IPRQCLLCNAAEESNDHLFFQC 149 WH IW ++ KFS WLA++ RL T D++ + + C+LCN E+ +HLFF C Sbjct: 486 WHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSC 545 Query: 148 PRTVEIWSGICSWL---KIKHRMSTIPSAIRRFQREKAGSGIVR 26 T EIW + + K STI +++ R + S + R Sbjct: 546 CYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRNRTESFLAR 589 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 62.0 bits (149), Expect = 8e-08 Identities = 32/90 (35%), Positives = 49/90 (54%), Gaps = 4/90 (4%) Frame = -3 Query: 340 KGEKKFWHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKF----SDIPRQCLLCNAAEES 173 K + W+K +W S+ K+ WLA+R RL T R++ SD+ +C C+ + E+ Sbjct: 456 KSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDV--KCTFCSTSIET 513 Query: 172 NDHLFFQCPRTVEIWSGICSWLKIKHRMST 83 DHLFF C IW+ I + ++HR ST Sbjct: 514 RDHLFFSCSYASAIWTAIAKNV-LQHRFST 542 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 61.6 bits (148), Expect = 1e-07 Identities = 30/67 (44%), Positives = 36/67 (53%), Gaps = 2/67 (2%) Frame = -3 Query: 322 WHKAIWRSYFPLKFSVTLWLAMRGRLKTFDRL--KFSDIPRQCLLCNAAEESNDHLFFQC 149 WHK IW S+ K+S WLA GRL T DR+ + I C+ C E+ DHLFF C Sbjct: 1056 WHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTC 1115 Query: 148 PRTVEIW 128 T IW Sbjct: 1116 SFTSVIW 1122 >ref|XP_004977924.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Setaria italica] Length = 117 Score = 61.6 bits (148), Expect = 1e-07 Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 2/80 (2%) Frame = -3 Query: 316 KAIWRSYFPLKFSVTLWLAMRGRLKTFDRLKFSDIPRQ--CLLCNAAEESNDHLFFQCPR 143 K IWR++ PLK +WLA++ RL T DR + C LC E+ DH+F +C Sbjct: 38 KTIWRAWAPLKIKFFMWLAIKDRLWTADRRHRQGLQDHTACALCEQERETTDHIFVRCSY 97 Query: 142 TVEIWSGICSWLKIKHRMST 83 T ++W I S L I++ T Sbjct: 98 TQQVWQEISSILNIQNHAPT 117