BLASTX nr result
ID: Mentha22_contig00036126
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00036126 (446 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670...   75   1e-11
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   72   6e-11
ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664...   72   1e-10
dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606...   71   1e-10
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   71   1e-10
ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668...   70   4e-10
gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LT...   70   4e-10
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...   66   6e-09
ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, part...   65   7e-09
gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea]       65   7e-09
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   65   1e-08
gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thali...   64   2e-08
ref|XP_004977924.1| PREDICTED: putative ribonuclease H protein A...   63   4e-08
ref|XP_006381710.1| hypothetical protein POPTR_0006s16215g [Popu...   63   5e-08
emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulga...   63   5e-08
ref|XP_006598659.1| PREDICTED: uncharacterized protein LOC102659...   62   6e-08
gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]             62   6e-08
gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]             62   8e-08
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           62   8e-08
gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thali...   62   1e-07

>ref|XP_006579213.1| PREDICTED: uncharacterized protein LOC102670237 [Glycine max]
Length = 383
Score = 74.7 bits (182), Expect = 1e-11
Identities = 41/132 (31%), Positives = 60/132 (45%)
Frame = +1

Query: 16 NLIEAQSKLDSWFAGNNKGTKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGR 195
N+ A+ L+SW Y++ R I IP K S LWLA + R
Sbjct: 252 NIEVAKLMLNSWGCNEQTLAGKMYDYIRGTRPVVHWSSIIWNPVIPSKMSFILWLATKNR 311

Query: 196 LKTFDRLKFTNVPRRCMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQMTTIPSAI 375
L DR F N C LC N E++ HLFF C ++ +W+ I W+ ++ Q ++ +I
Sbjct: 312 LLALDRAAFLNKGFLCPLCTNEAESHAHLFFSCRTSLRVWAHIRDWIPLKRQSISLQHSI 371

Query: 376 RRFQREIAGSGI 411
R A SG+
Sbjct: 372 SALIRRRATSGV 383

>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1216
Score = 72.4 bits (176), Expect = 6e-11
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN--VPRRCM 246
TK + H R ++ HK + ++ PKFS WLAI+ RL T DR+ N P C+
Sbjct: 762 TKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCV 821

Query: 247 LCNNADETNDHLFFKCERTVEIWSEI 324
C++ ET DHLFF+C + EIW+ I
Sbjct: 822 FCSSPMETRDHLFFQCCYSSEIWTSI 847

>ref|XP_006586315.1| PREDICTED: uncharacterized protein LOC102664837 [Glycine max]
Length = 97
Score = 71.6 bits (174), Expect = 1e-10
Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 2/72 (2%)
Frame = +1

Query: 139 RSYIPPKFSVTLWLAIQGRLKTFDRL-KFTNVPRR-CMLCNNADETNDHLFFKCERTVEI 312
R+Y P+ S T WLA GRL T DRL +F + + C LCN DE++DHLFF C + ++
Sbjct: 16 RNYARPRASHTTWLACHGRLATKDRLCRFGLIQEKICSLCNEVDESHDHLFFACSESKKV 75

Query: 313 WSEICTWLKIQN 348
WSE+ W+ Q+
Sbjct: 76 WSEVLNWIDCQH 87

>dbj|BAD66732.1| orf147a [Beta vulgaris subsp. vulgaris] gi|54606753|dbj|BAD66776.1| orf147a [Beta vulgaris subsp. vulgaris]
Length = 147
Score = 71.2 bits (173), Expect = 1e-10
Identities = 34/70 (48%), Positives = 43/70 (61%), Gaps = 2/70 (2%)
Frame = +1

Query: 142 SYIPPKFSVTLWLAIQGRLKTFDRLKFTNV--PRRCMLCNNADETNDHLFFKCERTVEIW 315
++ PPK + WL I RL T DRL+ + + C+LC N DET DHLFF CE + EIW
Sbjct: 8 NFSPPKCTFITWLTILDRLATCDRLQKFGIVCDQLCVLCGNVDETRDHLFFVCEFSYEIW 67

Query: 316 SEICTWLKIQ 345
S + WL IQ
Sbjct: 68 SSLLCWLGIQ 77

>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1529
Score = 71.2 bits (173), Expect = 1e-10
Identities = 35/86 (40%), Positives = 49/86 (56%), Gaps = 2/86 (2%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTNVPR--RCM 246
TK + + R ++ +K + Y PK+S LWL +Q RL T DR+K N + C
Sbjct: 1338 TKVTWNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCT 1397

Query: 247 LCNNADETNDHLFFKCERTVEIWSEI 324
LCNNA+ET DHLFF C+ T +W +
Sbjct: 1398 LCNNAEETRDHLFFSCQYTSYVWEAL 1423

>ref|XP_006595311.1| PREDICTED: uncharacterized protein LOC102668530 [Glycine max]
Length = 477
Score = 69.7 bits (169), Expect = 4e-10
Identities = 41/141 (29%), Positives = 64/141 (45%)
Frame = +1

Query: 16 NLIEAQSKLDSWFAGNNKGTKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGR 195
N+ A+ L+SW + AY++ R + + IP K S LWLA +
Sbjct: 281 NVEAAKQTLNSWNSNEQLLAGKAYDYIRGVKPAVNWNSVVWNPAIPSKMSFILWLATKNH 340

Query: 196 LKTFDRLKFTNVPRRCMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQMTTIPSAI 375
L T DR F N C LC +++ HLFF C ++++W+ I W+ + Q ++ I
Sbjct: 341 LLTLDRAAFLNKGLLCPLCRTKAKSHAHLFFSCRISLQVWANIRDWIPLHRQTISLQCTI 400

Query: 376 RRFQREIAGSGILRKGKWIAL 438
A SG K + +AL
Sbjct: 401 NSRICGRATSGTWGKFRCLAL 421

>gb|AAF81336.1|AC007767_16 Strong similarity to a putative non-LTR retroelement reverse transcriptase At2g23880 gi|3738337 from Arabidopsis thaliana BAC F27L4 gb|AC005170 [Arabidopsis thaliana]
Length = 206
Score = 69.7 bits (169), Expect = 4e-10
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 2/86 (2%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN--VPRRCM 246
TK + H R ++ H + ++ PKFS WLA++ RL DR+ N P C+
Sbjct: 49 TKDTWNHIRTSSNQRAWHTGVWFAHATPKFSFCAWLAVRNRLSMVDRMMTWNNGTPTTCV 108

Query: 247 LCNNADETNDHLFFKCERTVEIWSEI 324
C++ ET DHLFF+C + EIW+ I
Sbjct: 109 FCSSPMETRDHLFFQCHYSSEIWTSI 134

>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana]
Length = 473
Score = 65.9 bits (159), Expect = 6e-09
Identities = 39/116 (33%), Positives = 54/116 (46%), Gaps = 6/116 (5%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN--VPRRCM 246
TK + H R K +K + + PK + +WLA+ RL T DR+ N V C+
Sbjct: 289 TKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCI 348

Query: 247 LCNNADETNDHLFFKCERTVEIWSEICTWLK----IQNQMTTIPSAIRRFQREIAG 402
LCN A E+ DHLFF C EIW + + + T I + R + IAG
Sbjct: 349 LCNKALESRDHLFFSCPFATEIWEPLAKTIYNTCFYTDWQTIINNVSRNWPDRIAG 404

>ref|XP_006378155.1| hypothetical protein POPTR_0010s04250g, partial [Populus trichocarpa] gi|550329025|gb|ERP55952.1| hypothetical protein POPTR_0010s04250g, partial [Populus trichocarpa]
Length = 112
Score = 65.5 bits (158), Expect = 7e-09
Identities = 35/113 (30%), Positives = 56/113 (49%)
Frame = +1

Query: 40 LDSWFAGNNKGTKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLK 219
L SW + T AY F K + + + P+ + +LWL G+L+T DRL+
Sbjct: 2 LSSWHSRPGSFTANAYHFFTYKVDHVQWASVVWEQWFLPRHNFSLWLL--GKLRTRDRLQ 59

Query: 220 FTNVPRRCMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQMTTIPSAIR 378
F + LC+N+ E++ HLFF C + +W + WL+ + M T+ IR
Sbjct: 60 FISTDPLYPLCHNSSESHAHLFFSCAWSSSLWGKARYWLEFHSSMPTLNRVIR 112

>gb|AEL30369.1| hypothetical protein 205D04_9 [Arachis hypogaea]
Length = 458
Score = 65.5 bits (158), Expect = 7e-09
Identities = 29/88 (32%), Positives = 52/88 (59%), Gaps = 4/88 (4%)
Frame = +1

Query: 127 KAI*RSYIPPKFSVTLWLAIQGRLKTFDRL-KFTNVPR---RCMLCNNADETNDHLFFKC 294
+ + + +PP+ + W + GR+ T DRL +F +P+ RC+LC+ A+ET HLF +C
Sbjct: 306 RTVWKGLVPPRVELLTWFVLVGRVNTKDRLCRFRVIPQQDNRCVLCDKAEETVFHLFLEC 365

Query: 295 ERTVEIWSEICTWLKIQNQMTTIPSAIR 378
E T ++W C WL+ + ++P ++
Sbjct: 366 ETTWKVW---CAWLRALGRQWSLPGTLK 390

>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1449
Score = 65.1 bits (157), Expect = 1e-08
Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN--VPRRCM 246
TK + + R G + HK + ++ PKFS +WLA+ RL T D++ N + C+
Sbjct: 1266 TKETWNNTRTMGIEVPWHKGVWFTHSTPKFSFCVWLAVYDRLSTGDKMLLWNRGLQGTCL 1325

Query: 247 LCNNADETNDHLFFKCERTVEIW 315
LC NA E+ DHLFF C + E+W
Sbjct: 1326 LCRNATESRDHLFFSCSFSSEVW 1348

>gb|AAD08951.1| putative reverse transcriptase [Arabidopsis thaliana] gi|20197043|gb|AAM14892.1| putative reverse transcriptase [Arabidopsis thaliana]
Length = 1412
Score = 63.9 bits (154), Expect = 2e-08
Identities = 42/128 (32%), Positives = 57/128 (44%), Gaps = 5/128 (3%)
Frame = +1

Query: 7 CRGNLIEAQSKLDSWFAGNNKG---TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLW 177
CRG E L G K + + R +G K HKAI S PKF+ W
Sbjct: 1198 CRGICTEQDRSLCRSIGGQFKAKFFSPEIWHQIREQGLVKQWHKAIWFSGATPKFTFISW 1257

Query: 178 LAIQGRLKTFDRLKFTN--VPRRCMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQ 351
LA RL T D++ N + C+LCN + E+ DHLFF C + IW + L +
Sbjct: 1258 LAAHDRLTTGDKMASWNRGISSVCVLCNISAESRDHLFFSCNFSSHIWDRLTRRLLLCRY 1317

Query: 352 MTTIPSAI 375
T P+ +
Sbjct: 1318 TTNFPALL 1325

>ref|XP_004977924.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Setaria italica]
Length = 117
Score = 63.2 bits (152), Expect = 4e-08
Identities = 37/106 (34%), Positives = 48/106 (45%), Gaps = 2/106 (1%)
Frame = +1

Query: 49 WFAGNNKGTKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN 228
W + AY F K K I R++ P K +WLAI+ RL T DR
Sbjct: 12 WDLSGIYSARSAYRAFFQGATKFAAAKTIWRAWAPLKIKFFMWLAIKDRLWTADRRHRQG 71

Query: 229 VPRR--CMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQMTT 360
+ C LC ET DH+F +C T ++W EI + L IQN T
Sbjct: 72 LQDHTACALCEQERETTDHIFVRCSYTQQVWQEISSILNIQNHAPT 117

>ref|XP_006381710.1| hypothetical protein POPTR_0006s16215g [Populus trichocarpa] gi|550336461|gb|ERP59507.1| hypothetical protein POPTR_0006s16215g [Populus trichocarpa]
Length = 155
Score = 62.8 bits (151), Expect = 5e-08
Identities = 28/69 (40%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Frame = +1

Query: 175 WLAIQGRLKTFDRLKFTNVPRRCMLCNNADETNDH-LFFKCERTVEIWSEICTWLKIQNQ 351
+L ++GRL+T DRL+F C+LC + D+ + H LFF C T +W +I WL++ +
Sbjct: 18 YLRVKGRLRTRDRLRFIGTETHCVLCRHHDDNHSHQLFFACNWTSILWRKIRAWLRMNRR 77

Query: 352 MTTIPSAIR 378
M T+ SA R
Sbjct: 78 MATLNSATR 86

>emb|CCA65979.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1110
Score = 62.8 bits (151), Expect = 5e-08
Identities = 39/129 (30%), Positives = 56/129 (43%), Gaps = 2/129 (1%)
Frame = +1

Query: 58 GNNKGTKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTNVP- 234
G+ K AY+ GE+ + I +Y PK LW+ + RL T DR+ V
Sbjct: 936 GDKFSMKKAYKKISENGERVRWRRLICNNYATPKSKFILWMMLHERLPTVDRISRWGVQC 995

Query: 235 -RRCMLCNNADETNDHLFFKCERTVEIWSEICTWLKIQNQMTTIPSAIRRFQREIAGSGI 411
LC N ET HLFF C + +WS+IC ++ N + I + G
Sbjct: 996 DLNYRLCRNDGETIQHLFFSCSYSAGVWSKICYIMRFPNSGVSHQEII----SSVCGQAR 1051

Query: 412 LRKGKWIAL 438
+KGK I +
Sbjct: 1052 KKKGKLIVM 1060

>ref|XP_006598659.1| PREDICTED: uncharacterized protein LOC102659749 [Glycine max]
Length = 686
Score = 62.4 bits (150), Expect = 6e-08
Identities = 34/87 (39%), Positives = 43/87 (49%), Gaps = 2/87 (2%)
Frame = +1

Query: 154 PKFSVTLWLAIQGRLKTFDRLKFTNVPR--RCMLCNNADETNDHLFFKCERTVEIWSEIC 327
P+ +VTLWLA Q RL T RLK N+ + C LC DE DHL F C T IW E+
Sbjct: 538 PRANVTLWLACQNRLATKTRLKNMNLIQCSLCSLCKEQDEDLDHLMFSCRVTKAIWLEVL 597

Query: 328 TWLKIQNQMTTIPSAIRRFQREIAGSG 408
W+ I + +R + G G
Sbjct: 598 KWMDIDHTPQMWRDEVRWVMQYTKGKG 624

>gb|AAF87143.1|AC002423_8 T23E23.16 [Arabidopsis thaliana]
Length = 653
Score = 62.4 bits (150), Expect = 6e-08
Identities = 31/69 (44%), Positives = 40/69 (57%), Gaps = 2/69 (2%)
Frame = +1

Query: 124 HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTN--VPRRCMLCNNADETNDHLFFKCE 297
H I ++ PKFS WLA+Q RL T D++ N + C+LCNN ET +HLFF C
Sbjct: 487 HMGIWFAHATPKFSFCAWLAVQNRLSTGDKMLQWNRRLSPTCVLCNNNIETRNHLFFSCC 546

Query: 298 RTVEIWSEI 324
T EIW +
Sbjct: 547 YTAEIWENL 555

>gb|AAF79357.1|AC007887_16 F15O4.34 [Arabidopsis thaliana]
Length = 236
Score = 62.0 bits (149), Expect = 8e-08
Identities = 32/86 (37%), Positives = 45/86 (52%), Gaps = 2/86 (2%)
Frame = +1

Query: 73 TKGAYEHFRAKGEKKF*HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTNVPRR--CM 246
TK H R +K + + PK+S +WLA+ RL T DR+ N + C+
Sbjct: 86 TKETLNHMRTISMDVDWYKGVWFGHSTPKYSFCVWLAVLNRLSTGDRMTHWNGGQSAACV 145

Query: 247 LCNNADETNDHLFFKCERTVEIWSEI 324
LC+NA ET DHLFF C+ +WS +
Sbjct: 146 LCHNAPETRDHLFFSCDFASIVWSNL 171

>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
Length = 478
Score = 62.0 bits (149), Expect = 8e-08
Identities = 31/77 (40%), Positives = 45/77 (58%), Gaps = 2/77 (2%)
Frame = +1

Query: 124 HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTNV--PRRCMLCNNADETNDHLFFKCE 297
+K + S+ PK+SV W+AI+ RL T DR+ N C+LC++ ET DHLFF C
Sbjct: 313 YKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVETRDHLFFTCP 372

Query: 298 RTVEIWSEICTWLKIQN 348
+ E+WS + L Q+
Sbjct: 373 YSAEVWSTLTRKLLSQH 389

>gb|ABE65413.1| hypothetical protein At1g62890 [Arabidopsis thaliana]
Length = 195
Score = 61.6 bits (148), Expect = 1e-07
Identities = 30/66 (45%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Frame = +1

Query: 124 HKAI*RSYIPPKFSVTLWLAIQGRLKTFDRLKFTNVPRR--CMLCNNADETNDHLFFKCE 297
+K + Y PK+S LWL IQ RL T D +K N ++ C LC NA+ET + LFF C
Sbjct: 22 YKGVWFPYSTPKYSFLLWLTIQNRLSTGDHIKAWNSGQQVTCTLCGNAEETRNLLFFSCH 81

Query: 298 RTVEIW 315
T E+W
Sbjct: 82 YTSEVW 87