BLASTX nr result
ID: Mentha23_contig00035216
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00035216 (374 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...  84   2e-14
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...  77   2e-12
ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...  74   2e-11
gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LT...  73   4e-11
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]            71   1e-10
gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea]      71   2e-10
ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665...  70   3e-10
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...  69   9e-10
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...  68   1e-09
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...  68   1e-09
ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664...  67   2e-09
ref|XP_004239564.1| PREDICTED: uncharacterized protein LOC101259...  67   2e-09
ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arab...  67   2e-09
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...  67   2e-09
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...  67   2e-09
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...  67   3e-09
ref|XP_004153592.1| PREDICTED: uncharacterized protein LOC101219...  66   4e-09
ref|XP_006381710.1| hypothetical protein POPTR_0006s16215g [Popu...  66   6e-09
ref|XP_006300939.1| hypothetical protein CARUB_v10021318mg, part...
66 6e-09 ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204... 66 6e-09 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 84.0 bits (206), Expect = 2e-14 Identities = 45/124 (36%), Positives = 65/124 (52%), Gaps = 2/124 (1%) Frame = -3 Query: 369 KKFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLF 196 ++ WHK +W ++ PKFS WLA+R RL T DR+ + P C+ CS+ E+ +HLF Sbjct: 775 QRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLF 834 Query: 195 FQCPRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWY 16 FQC + EIWT I + K+R ST SA+ + + I ++ IW Sbjct: 835 FQCCYSSEIWTSIAKNV-YKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHSIWR 893 Query: 15 ARNS 4 RNS Sbjct: 894 ERNS 897 >ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max] Length = 477 Score = 77.0 bits (188), Expect = 2e-12 Identities = 39/120 (32%), Positives = 57/120 (47%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSDIPRQCLLCSAAEESNEHLFFQCPR 181 W+ +W IP K S LWLA + L T DR F + C LC +S+ HLFF C Sbjct: 316 WNSVVWNPAIPSKMSFILWLATKNHLLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFSCRI 375 Query: 180 TEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYARNSL 1 + ++W I W+ + + ++ I +A SG K + + L AV W +RN L Sbjct: 376 SLQVWANIRDWIPLHRQTISLQCTINSRICGRATSGTWGKFRCLALAIAVYCTWISRNLL 435 >ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max] Length = 383 Score = 74.3 bits (181), Expect = 2e-11 Identities = 35/97 (36%), Positives = 49/97 (50%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSDIPRQCLLCSAAEESNEHLFFQCPR 181 W IW IP K S LWLA + RL DR F + C LC+ ES+ HLFF C Sbjct: 287 WSSIIWNPVIPSKMSFILWLATKNRLLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRT 346 Query: 180 TEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGI 70 + +W I W+ +K + ++ +I R +A SG+ Sbjct: 347 SLRVWAHIRDWIPLKRQSISLQHSISALIRRRATSGV 383 >gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LTR retroelement reverse transcriptase At2g23880 gi|3738337 from 
Arabidopsis thaliana BAC F27L4 gb|AC005170 [Arabidopsis thaliana] Length = 206 Score = 73.2 bits (178), Expect = 4e-11 Identities = 36/90 (40%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Frame = -3 Query: 369 KKFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLF 196 ++ WH +W ++ PKFS WLA+R RL DR+ + P C+ CS+ E+ +HLF Sbjct: 62 QRAWHTGVWFAHATPKFSFCAWLAVRNRLSMVDRMMTWNNGTPTTCVFCSSPMETRDHLF 121 Query: 195 FQCPRTEEIWTGICSWLKIKNRMSTIPSAI 106 FQC + EIWT I + K+ ST SA+ Sbjct: 122 FQCHYSSEIWTSIAKNV-YKDGFSTDWSAV 150 >gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana] Length = 653 Score = 71.2 bits (173), Expect = 1e-10 Identities = 40/123 (32%), Positives = 61/123 (49%), Gaps = 5/123 (4%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSD--IPRQCLLCSAAEESNEHLFFQC 187 WH IW ++ PKFS WLA++ RL T D++ + + C+LC+ E+ HLFF C Sbjct: 486 WHMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSC 545 Query: 186 PRTEEIWTGICSWL---KIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWY 16 T EIW + + K STI +++ R + S + R + A + IW+ Sbjct: 546 CYTAEIWENLAKNIYKAKFSTNWSTILTSVSTTWRNRTESFLAR----YIFQATIHTIWH 601 Query: 15 ARN 7 RN Sbjct: 602 ERN 604 >gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea] Length = 458 Score = 70.9 bits (172), Expect = 2e-10 Identities = 39/123 (31%), Positives = 65/123 (52%), Gaps = 7/123 (5%) Frame = -3 Query: 354 KAIWRSYIPPKFSITLWLAMRGRLKTFDRL-KFSDIPRQ---CLLCSAAEESNEHLFFQC 187 + +W+ +PP+ + W + GR+ T DRL +F IP+Q C+LC AEE+ HLF +C Sbjct: 306 RTVWKGLVPPRVELLTWFVLVGRVNTKDRLCRFRVIPQQDNRCVLCDKAEETVFHLFLEC 365 Query: 186 PRTEEIWTGICSWLKIKNRMSTIPSAIR-RFQR-EKAGSGIVRKAKWIV-LGAAVSYIWY 16 T ++W C+WL+ R ++P ++ F+ K V + +W + A + W Sbjct: 366 ETTWKVW---CAWLRALGRQWSLPGTLKDHFESWTKLSVRKVDRKRWFLGFFAVIWTTWL 422 Query: 15 ARN 7 RN Sbjct: 423 ERN 425 >ref|XP_006582615.1| PREDICTED: uncharacterized protein LOC102665746 [Glycine max] Length = 506 Score = 70.1 bits (170), Expect = 3e-10 Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 2/125 (1%) Frame = -3 Query: 
372 EKKFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRL-KFSDIP-RQCLLCSAAEESNEHL 199 ++K W ++ + P+ + LWLA GRL T DRL K+ I + C CS EES HL Sbjct: 339 QRKEWKNLLYGNTARPRANFILWLACHGRLSTKDRLCKYGMIDDKSCCFCSE-EESMNHL 397 Query: 198 FFQCPRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIW 19 FF C ++ +W + W++I++ S P+ + G G + + + IW Sbjct: 398 FFVCDNSKRVWMEVLQWVQIRHDPSDWPNELHWLTHHTKGKGTRAAVLKMAIAETIYEIW 457 Query: 18 YARNS 4 RN+ Sbjct: 458 NIRNN 462 >gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 1412 Score = 68.6 bits (166), Expect = 9e-10 Identities = 41/122 (33%), Positives = 57/122 (46%), Gaps = 2/122 (1%) Frame = -3 Query: 366 KFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLFF 193 K WHKAIW S PKF+ WLA RL T D++ I C+LC+ + ES +HLFF Sbjct: 1237 KQWHKAIWFSGATPKFTFISWLAAHDRLTTGDKMASWNRGISSVCVLCNISAESRDHLFF 1296 Query: 192 QCPRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYA 13 C + IW + L + + P+ + + SG R V A + +W Sbjct: 1297 SCNFSSHIWDRLTRRLLLCRYTTNFPALLLLLSGQDF-SGTKRFLLRYVFQATIHTLWRE 1355 Query: 12 RN 7 RN Sbjct: 1356 RN 1357 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 68.2 bits (165), Expect = 1e-09 Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 5/123 (4%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRL--KFSDIPRQCLLCSAAEESNEHLFFQC 187 WHK IW S+ PK+S WLA GRL T DR+ + I C+ C E+ +HLFF C Sbjct: 1056 WHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTC 1115 Query: 186 PRTEEIWTGICSWL---KIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWY 16 T IW + + + + +I AI Q + + R V A + +W Sbjct: 1116 SFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRR----YVFQATIYIVWR 1171 Query: 15 ARN 7 RN Sbjct: 1172 ERN 1174 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 68.2 bits (165), Expect = 1e-09 Identities = 31/78 (39%), Positives = 44/78 
(56%), Gaps = 2/78 (2%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSDIPR--QCLLCSAAEESNEHLFFQC 187 W+K +W Y PK+S LWL ++ RL T DR+K + + C LC+ AEE+ +HLFF C Sbjct: 1354 WYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSC 1413 Query: 186 PRTEEIWTGICSWLKIKN 133 T +W + L N Sbjct: 1414 QYTSYVWEALTQRLLSTN 1431 >ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664837 [Glycine max] Length = 97 Score = 67.4 bits (163), Expect = 2e-09 Identities = 30/74 (40%), Positives = 47/74 (63%), Gaps = 2/74 (2%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRL-KFSDIPRQ-CLLCSAAEESNEHLFFQC 187 W +R+Y P+ S T WLA GRL T DRL +F I + C LC+ +ES++HLFF C Sbjct: 10 WRHLFYRNYARPRASHTTWLACHGRLATKDRLCRFGLIQEKICSLCNEVDESHDHLFFAC 69 Query: 186 PRTEEIWTGICSWL 145 ++++W+ + +W+ Sbjct: 70 SESKKVWSEVLNWI 83 >ref|XP_004239564.1| PREDICTED: uncharacterized protein LOC101259935 [Solanum lycopersicum] Length = 189 Score = 67.4 bits (163), Expect = 2e-09 Identities = 43/127 (33%), Positives = 61/127 (48%), Gaps = 6/127 (4%) Frame = -3 Query: 369 KKFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLF 196 K W ++++ PK TLW+ M +L T DRL + R C++C AEES EHLF Sbjct: 32 KPEWRCLMFKNAARPKAGFTLWILMNRKLATVDRLTKWGMALHRDCVMCKRAEESMEHLF 91 Query: 195 FQCPRTEEIWTGICSWLKIKNRMSTIPSAIRRFQR--EKAGSGIVRKAKWI--VLGAAVS 28 QC E IW + W+ + + M P + F + K G G +A+ VL V Sbjct: 92 IQCHYAEAIWERLLRWINVHSNM---PKSWTEFIQWCVKNGKGKTVRAQVFKGVLAEGVY 148 Query: 27 YIWYARN 7 +W RN Sbjct: 149 GLWSERN 155 >ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata] gi|297311371|gb|EFH41795.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. 
lyrata] Length = 227 Score = 67.4 bits (163), Expect = 2e-09 Identities = 27/75 (36%), Positives = 44/75 (58%) Frame = -3 Query: 369 KKFWHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSDIPRQCLLCSAAEESNEHLFFQ 190 K WH ++W P++S +WLA++ +L T R++ + + C+ C +ES +HLFF Sbjct: 78 KVLWHNSVWFPQRVPRYSFIVWLAVKDQLSTGTRMRAWGVEQPCVFCRERDESRDHLFFA 137 Query: 189 CPRTEEIWTGICSWL 145 CP T IW+ + S L Sbjct: 138 CPFTYSIWSELTSRL 152 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata] Length = 441 Score = 67.4 bits (163), Expect = 2e-09 Identities = 38/124 (30%), Positives = 63/124 (50%), Gaps = 5/124 (4%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLFFQC 187 W++ IW S+ PK+S WLA + RL T DR+ + + C+ C E+ HLFF C Sbjct: 275 WYQGIWFSHATPKYSFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQTETRNHLFFTC 334 Query: 186 PRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSG---IVRKAKWIVLGAAVSYIWY 16 + E+W+G+ S L ++ + + ++ + G+ ++R A I+ V IW Sbjct: 335 RYSREVWSGLTSKLLTRHYSTDWTTILKLLTDKTLGNNRLFLLRYAFQIL----VYSIWK 390 Query: 15 ARNS 4 RNS Sbjct: 391 ERNS 394 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 67.4 bits (163), Expect = 2e-09 Identities = 39/122 (31%), Positives = 58/122 (47%), Gaps = 4/122 (3%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF----SDIPRQCLLCSAAEESNEHLFF 193 W+K +W S+ PK+ WLA+R RL T R++ SD+ +C CS + E+ +HLFF Sbjct: 462 WYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDV--KCTFCSTSIETRDHLFF 519 Query: 192 QCPRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYA 13 C IWT I + +++R ST I + E I + V +W Sbjct: 520 SCSYASAIWTAIAKNV-LQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLTVHTVWKE 578 Query: 12 RN 7 RN Sbjct: 579 RN 580 >gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1449 Score = 67.0 bits (162), Expect = 3e-09 Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 2/67 (2%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLFFQC 187 WHK +W ++ PKFS +WLA+ RL T 
D++ + CLLC A ES +HLFF C Sbjct: 1282 WHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCLLCRNATESRDHLFFSC 1341 Query: 186 PRTEEIW 166 + E+W Sbjct: 1342 SFSSEVW 1348 >ref|XP_004153592.1| PREDICTED: uncharacterized protein LOC101219214 [Cucumis sativus] Length = 152 Score = 66.2 bits (160), Expect = 4e-09 Identities = 40/109 (36%), Positives = 53/109 (48%), Gaps = 2/109 (1%) Frame = -3 Query: 327 PKFSITLWLAMRGRLKTFDRLKFSD--IPRQCLLCSAAEESNEHLFFQCPRTEEIWTGIC 154 PK S WLA+R RL T DRL + D IP CLLC ES +HLFF C EIW+ I Sbjct: 7 PKRSFCAWLAIRDRLGTKDRLSWWDRLIPLSCLLCGWNYESRDHLFFSCHFGWEIWSRIL 66 Query: 153 SWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYARN 7 + R+ + + G + RK ++ A + +IW RN Sbjct: 67 LLMSSSQRIGYWGVELSWICNQGIGKSVRRKLWHLLWCATIYFIWQERN 115 >ref|XP_006381710.1| hypothetical protein POPTR_0006s16215g [Populus trichocarpa] gi|550336461|gb|ERP59507.1| hypothetical protein POPTR_0006s16215g [Populus trichocarpa] Length = 155 Score = 65.9 bits (159), Expect = 6e-09 Identities = 35/101 (34%), Positives = 55/101 (54%), Gaps = 1/101 (0%) Frame = -3 Query: 306 WLAMRGRLKTFDRLKFSDIPRQCLLCSAAEESNEH-LFFQCPRTEEIWTGICSWLKIKNR 130 +L ++GRL+T DRL+F C+LC ++++ H LFF C T +W I +WL++ R Sbjct: 18 YLRVKGRLRTRDRLRFIGTETHCVLCRHHDDNHSHQLFFACNWTSILWRKIRAWLRMNRR 77 Query: 129 MSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYARN 7 M+T+ SA R K + + + + L V IW RN Sbjct: 78 MATLNSATRGLSTRK--KNLEARMRRVSLSITVYLIWEERN 116 >ref|XP_006300939.1| hypothetical protein CARUB_v10021318mg, partial [Capsella rubella] gi|482569649|gb|EOA33837.1| hypothetical protein CARUB_v10021318mg, partial [Capsella rubella] Length = 290 Score = 65.9 bits (159), Expect = 6e-09 Identities = 38/123 (30%), Positives = 63/123 (51%), Gaps = 4/123 (3%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKF--SDIPRQCLLCSAAEESNEHLFFQC 187 W K++W PK + W+ +R RL T DRL+ ++P +CLLC+++ ES HLFF+C Sbjct: 113 WFKSVWFKERIPKHAFISWVVIRNRLTTRDRLRGWGMNVPSECLLCTSSAESRLHLFFEC 172 Query: 186 PRTEEIWTGICSW--LKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYA 13 + E+W+ + L I + +R + +K 
+R ++ A V +W Sbjct: 173 AYSHEVWSSFFTHPSLSPPAMFEDIVAWVRSSRSKK-----LRTICKLIFQAVVYGLWRE 227 Query: 12 RNS 4 RNS Sbjct: 228 RNS 230 >ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204314 [Cucumis sativus] Length = 282 Score = 65.9 bits (159), Expect = 6e-09 Identities = 41/120 (34%), Positives = 53/120 (44%), Gaps = 2/120 (1%) Frame = -3 Query: 360 WHKAIWRSYIPPKFSITLWLAMRGRLKTFDRLKFSD--IPRQCLLCSAAEESNEHLFFQC 187 W +W PK S WLA+R RL T RL D IP CLLC ES +HLFF C Sbjct: 126 WSGLLWGGGNIPKHSFCAWLAIRDRLGTRGRLSRWDRSIPLSCLLCGGNYESRDHLFFSC 185 Query: 186 PRTEEIWTGICSWLKIKNRMSTIPSAIRRFQREKAGSGIVRKAKWIVLGAAVSYIWYARN 7 EIW+ I +R + + G + RK ++ A + +IW RN Sbjct: 186 HFGWEIWSRILLLKSSSHRTGYWGVELSWIYNQGIGKSVRRKLWRLLWCATIYFIWQERN 245