BLASTX nr result
ID: Mentha22_contig00052857
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha22_contig00052857 (404 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Ar... 74 2e-11 pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrot... 71 2e-10 emb|CAA31653.1| polyprotein [Arabidopsis thaliana] 69 7e-10 dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis t... 67 3e-09 emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera] 66 6e-09 ref|XP_003303164.1| hypothetical protein PTT_15280 [Pyrenophora ... 65 7e-09 emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] 65 1e-08 gb|ABD63150.1| Putative retroelement pol polyprotein, related [A... 64 3e-08 ref|XP_003381524.1| putative integrase core domain protein [Tric... 63 4e-08 gb|EFN71322.1| Retrovirus-related Pol polyprotein from transposo... 63 4e-08 emb|CAN72704.1| hypothetical protein VITISV_016225 [Vitis vinifera] 63 4e-08 dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 62 6e-08 emb|CAA20201.1| putative transposable element [Arabidopsis thali... 62 6e-08 ref|XP_006588085.1| PREDICTED: uncharacterized protein LOC102667... 61 2e-07 ref|XP_004515992.1| PREDICTED: uncharacterized protein LOC101498... 61 2e-07 gb|AAF79369.1|AC007887_28 F15O4.39 [Arabidopsis thaliana] 61 2e-07 ref|XP_003634412.1| PREDICTED: uncharacterized protein LOC100853... 61 2e-07 emb|CAN76346.1| hypothetical protein VITISV_024039 [Vitis vinifera] 61 2e-07 gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsi... 61 2e-07 gb|EFA06256.1| hypothetical protein TcasGA2_TC009115 [Tribolium ... 60 2e-07 >gb|AAF19226.1|AC007505_2 Highly similar to Ta1-3 polyprotein [Arabidopsis thaliana] Length = 1356 Score = 73.9 bits (180), Expect = 2e-11 Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 1/124 (0%) Frame = -2 Query: 370 ENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKK-NGTVMIANGDEIPVFG 194 E +A +V+ + WILDSGCT+HMTSR+D F S +K N T+++ + + G Sbjct: 289 EKLVFSEALSVNEQMVKDLWILDSGCTSHMTSRRDWFISFQEKGNTTILLGDDHSVESQG 348 Query: 193 VGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNG 14 G +R+D H + +V + P L+ NLIS L++ GY +G+ R KN Sbjct: 349 QGTIRID----THGGTIKILENVKYVPHLR-RNLISTGTLDKLGYRHEGGEGKVRYFKNN 403 Query: 13 KEYL 2 K L Sbjct: 404 KTAL 407 >pir||S23319 hypothetical protein 2 - Arabidopsis thaliana retrotransposon Ta1-2 (strain Landsberg) (fragment) gi|16384|emb|CAA37924.1| unnamed protein product [Arabidopsis thaliana] Length = 1084 Score = 70.9 bits (172), Expect = 2e-10 Identities = 40/124 (32%), Positives = 64/124 (51%), Gaps = 1/124 (0%) Frame = -2 Query: 370 ENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGD-EIPVFG 194 E +A +V+ + W+LDSGCT+HM++RKD F + K GT ++ D + G Sbjct: 199 EKLVFSEALSVNDLAVRDIWVLDSGCTSHMSARKDWFCNFRKDGGTTILLGDDHSVKSQG 258 Query: 193 VGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNG 14 G +++D H + +V + P+L+ NLIS L++ GY+ G+ R KN Sbjct: 259 QGSIKID----THGGTITVLENVKYVPELR-RNLISTGTLDKRGYKHEGGDGKVRYFKNQ 313 Query: 13 KEYL 2 K L Sbjct: 314 KTAL 317 >emb|CAA31653.1| polyprotein [Arabidopsis thaliana] Length = 1291 Score = 68.9 bits (167), Expect = 7e-10 Identities = 37/124 (29%), Positives = 66/124 (53%), Gaps = 1/124 (0%) Frame = -2 Query: 370 ENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNG-TVMIANGDEIPVFG 194 E +A +V+ + W+LDSGCT+HM++R+D F S + G T+++ + + G Sbjct: 299 EKLVFSEALSVNDLAVRDIWVLDSGCTSHMSARRDWFCSFREDGGPTILLGDDHSVKSQG 358 Query: 193 VGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNG 14 G ++++ H + + +V + P+L+ NLIS L++ GY+ G+ R KN Sbjct: 359 QGSIKIE----THGGTIIGLENVKYVPELR-RNLISTGTLDKRGYKHEGGDGKVRYFKNQ 413 Query: 13 KEYL 2 K L Sbjct: 414 KTAL 417 >dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1338 Score = 66.6 bits (161), Expect = 3e-09 Identities = 41/116 (35%), Positives = 63/116 (54%), Gaps = 1/116 (0%) Frame = -2 Query: 364 FTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLT-KKNGTVMIANGDEIPVFGVG 188 F +A A ++ WI D+GCT+HM+SRK+ F L ++G V +AN + V G+G Sbjct: 304 FMVSEAAARASKGSSEEWICDTGCTSHMSSRKEWFEDLVFSESGNVSMANDTTLQVKGIG 363 Query: 187 QVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLK 20 VR+ N TV + +V++ P + NLIS LE +G SK G +++K Sbjct: 364 SVRI----LNDDGTTVLLTNVMYIPGM-SKNLISLGTLENKGCWFKSKNGILKVIK 414 >emb|CAN80490.1| hypothetical protein VITISV_004703 [Vitis vinifera] Length = 777 Score = 65.9 bits (159), Expect = 6e-09 Identities = 37/119 (31%), Positives = 61/119 (51%), Gaps = 1/119 (0%) Frame = -2 Query: 373 DENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKN-GTVMIANGDEIPVF 197 D+ + + + + W+LDSGCT HM R+D F+S + N G +++ N V Sbjct: 158 DDGYDSTEVLTIRLNPNHEEWVLDSGCTYHMCPRRDWFSSYQEVNGGKLLLGNNMSCNVV 217 Query: 196 GVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLK 20 G+G + +++ H T + +V PDLK NLIS L++ GY +K G+ + K Sbjct: 218 GIGTMAINM----HDGKTRTLKEVRHVPDLK-RNLISLGTLDKSGYNFKAKNGKLTISK 271 >ref|XP_003303164.1| hypothetical protein PTT_15280 [Pyrenophora teres f. teres 0-1] gi|311320987|gb|EFQ88742.1| hypothetical protein PTT_15280 [Pyrenophora teres f. teres 0-1] Length = 309 Score = 65.5 bits (158), Expect = 7e-09 Identities = 35/111 (31%), Positives = 54/111 (48%) Frame = -2 Query: 337 STPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGDEIPVFGVGQVRLDLLTSN 158 S + + W+ DSG + HM+ D F +L GT+ +A +IP+ G G VRL L+ + Sbjct: 167 SGSSLSNSWLFDSGTSRHMSGNIDDFVTLEPARGTITVAGQQKIPIEGQGTVRLTLVLPD 226 Query: 157 HSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNGKEY 5 S+ + +VLF LK L S + +GYE+ + L K Y Sbjct: 227 GSTRFSELTNVLFSEQLKSTRLFSWPYVRNKGYEVQATGDHLYLTKLDGRY 277 >emb|CAA37918.1| unnamed protein product [Arabidopsis thaliana] Length = 560 Score = 64.7 bits (156), Expect = 1e-08 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 1/124 (0%) Frame = -2 Query: 370 ENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGD-EIPVFG 194 E +A +V+ + W LDSGC +HM++RKD F + + GT ++ D + G Sbjct: 287 EKLVFSEALSVNDLAVRDIWELDSGCPSHMSARKDWFCNFREDGGTTILLGDDHSVKSQG 346 Query: 193 VGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNG 14 G +++D H + +V + P+L+ NLIS L++ GY+ G+ R KN Sbjct: 347 QGSIKID----THGGTITVLENVKYVPELR-RNLISTGTLDKRGYKHEGGDGKVRYFKNQ 401 Query: 13 KEYL 2 K L Sbjct: 402 KTAL 405 >gb|ABD63150.1| Putative retroelement pol polyprotein, related [Asparagus officinalis] Length = 168 Score = 63.5 bits (153), Expect = 3e-08 Identities = 36/101 (35%), Positives = 58/101 (57%), Gaps = 1/101 (0%) Frame = -2 Query: 319 SHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGDE-IPVFGVGQVRLDLLTSNHSSAT 143 S+W+LDSG T H+T R+D F+S + +G V+I D+ + +G VR+ + H Sbjct: 7 SYWMLDSGATYHVTPRRDWFSSFERLDGGVIIMENDQSCGICTIGTVRIRM----HDGVV 62 Query: 142 VHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLK 20 + DV F P LK NLIS LE +G++++ + E ++ K Sbjct: 63 RELKDVRFVPQLK-KNLISVGALEAKGFKVTFEDAEAKVTK 102 >ref|XP_003381524.1| putative integrase core domain protein [Trichinella spiralis] gi|316979667|gb|EFV62422.1| putative integrase core domain protein [Trichinella spiralis] Length = 1133 Score = 63.2 bits (152), Expect = 4e-08 Identities = 36/102 (35%), Positives = 56/102 (54%) Frame = -2 Query: 313 WILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGDEIPVFGVGQVRLDLLTSNHSSATVHI 134 W +D G T+HMT ++ F SL ++ TV +A+ I G+G L +T + + +H+ Sbjct: 224 WYVDFGATSHMTCNRNFFESLERRKSTVYLADNTAIQAEGIGHGWLFCVTPDGTIEEIHV 283 Query: 133 NDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNGKE 8 DVLF P L+ G L+S R+ GY+I Q + L+ KE Sbjct: 284 KDVLFIPSLETG-LLSAQRITNNGYKIMF-QDDTSLISYQKE 323 >gb|EFN71322.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Camponotus floridanus] Length = 491 Score = 63.2 bits (152), Expect = 4e-08 Identities = 36/114 (31%), Positives = 61/114 (53%) Frame = -2 Query: 358 TEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGDEIPVFGVGQVR 179 TE A + P W LDSGCT+H+ + K+ F T + + +AN D V G VR Sbjct: 262 TETALLTTNPAKSKEWCLDSGCTSHLCNNKELFVETTNISSGLKLANNDTTKVEAKGDVR 321 Query: 178 LDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKN 17 + L N + + + L+ P+L+ NL+S +++ +EI+ KQ +R +++N Sbjct: 322 ITALV-NKKEKPLRLMNTLYVPNLR-TNLLSVAKIVDNNHEITFKQ-DRAIVRN 372 >emb|CAN72704.1| hypothetical protein VITISV_016225 [Vitis vinifera] Length = 817 Score = 63.2 bits (152), Expect = 4e-08 Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 1/125 (0%) Frame = -2 Query: 373 DENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKN-GTVMIANGDEIPVF 197 D+ + + K ++ Q W+LDSGCT HM R D F+S + N G V++ N V Sbjct: 28 DDGYDSAKVLTITLNQNHEEWVLDSGCTYHMCPRIDWFSSYQEFNGGNVLLGNNMSCSVV 87 Query: 196 GVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKN 17 G+G V +++ S+ + +V PDLK NLIS L+ GY ++ + + K Sbjct: 88 GIGTVAINMFDGMKST----LKEVRHVPDLK-INLISLGTLDESGYNFKAENRKLTISKG 142 Query: 16 GKEYL 2 L Sbjct: 143 AMSNL 147 >dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana] Length = 1342 Score = 62.4 bits (150), Expect = 6e-08 Identities = 42/130 (32%), Positives = 61/130 (46%), Gaps = 1/130 (0%) Frame = -2 Query: 403 AAQFTLDYSDDENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTK-KNGTVM 227 A+ T +D +A P WILD+GC+ HMT RKD + +G V Sbjct: 282 ASTVTARVTDAAALVVSRALLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVR 341 Query: 226 IANGDEIPVFGVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISS 47 + N V G+G VR+ N +T+ + DV + P++ NLIS LE +G S Sbjct: 342 MGNDTYSEVKGIGDVRI----KNEDGSTILLTDVRYIPEM-SKNLISLGTLEDKGCWFES 396 Query: 46 KQGERRLLKN 17 K+G + KN Sbjct: 397 KKGILTIFKN 406 >emb|CAA20201.1| putative transposable element [Arabidopsis thaliana] gi|7268932|emb|CAB79135.1| putative transposable element [Arabidopsis thaliana] Length = 1308 Score = 62.4 bits (150), Expect = 6e-08 Identities = 41/123 (33%), Positives = 61/123 (49%) Frame = -2 Query: 370 ENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKNGTVMIANGDEIPVFGV 191 E +A ++ + W++DSGCT HMTSR D F+ +N T MI GD+ V Sbjct: 290 EKLVYSEALSMYDQEAKDKWVIDSGCTYHMTSRMDWFSEF-NENETTMILLGDDHTVESK 348 Query: 190 GQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNGK 11 G + + T H + + +V F P+L+ NLIS L++ GY+ G+ R K K Sbjct: 349 GSGTVKVNT--HGGSIRVLKNVRFVPNLR-RNLISTGTLDKLGYKHEGGDGKVRFYKENK 405 Query: 10 EYL 2 L Sbjct: 406 TAL 408 >ref|XP_006588085.1| PREDICTED: uncharacterized protein LOC102667204 [Glycine max] Length = 529 Score = 60.8 bits (146), Expect = 2e-07 Identities = 36/130 (27%), Positives = 62/130 (47%), Gaps = 1/130 (0%) Frame = -2 Query: 397 QFTLDYSDDENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKK-NGTVMIA 221 Q D D N A S + W LD+GC+ HMT ++ F ++ K + A Sbjct: 295 QLAQDEDSDSNKVLLMATTNSEEDNVNLWYLDTGCSNHMTGHREWFVNIDDKVKSKIKFA 354 Query: 220 NGDEIPVFGVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQ 41 + + + G+G+V + HS INDVL+ P++K NL+S +L +GY + + Sbjct: 355 DNNSVTTKGIGKVMIQRKDGQHS----FINDVLYVPNMK-NNLLSLGQLLEKGYPMQIED 409 Query: 40 GERRLLKNGK 11 + ++ + K Sbjct: 410 SQMKIFDSNK 419 >ref|XP_004515992.1| PREDICTED: uncharacterized protein LOC101498930 [Cicer arietinum] Length = 324 Score = 60.8 bits (146), Expect = 2e-07 Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 1/123 (0%) Frame = -2 Query: 400 AQFTLDYSDDENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKK-NGTVMI 224 AQ + SD + A S P W LD+GC+ HMT K+ F S+ K + Sbjct: 208 AQMAAEDSDSDEMLL-MATTKSDDDCPEQWYLDTGCSNHMTDHKEWFVSIDDKVKREIGF 266 Query: 223 ANGDEIPVFGVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSK 44 A+ + G+G+V + N S I DVL+ P++K NL+S +L +GY + + Sbjct: 267 ADNSSVKAEGIGKVLIQRRDGNQS----FICDVLYVPNMK-NNLLSLGQLLEKGYSMKME 321 Query: 43 QGE 35 QG+ Sbjct: 322 QGK 324 >gb|AAF79369.1|AC007887_28 F15O4.39 [Arabidopsis thaliana] Length = 893 Score = 60.8 bits (146), Expect = 2e-07 Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 1/105 (0%) Frame = -2 Query: 313 WILDSGCTTHMTSRKDQFTSLTK-KNGTVMIANGDEIPVFGVGQVRLDLLTSNHSSATVH 137 WI+D+GC+ HMT RK+ + K+G V + N + V G+G+V+ +N ++ Sbjct: 264 WIMDTGCSFHMTPRKEYLIDFEEAKSGKVRMTNNSFLEVKGIGKVKF----TNQDGTSII 319 Query: 136 INDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNGKEYL 2 ++ V + P++ NLIS L+ EGYE G R+++ ++ Sbjct: 320 LHGVRYIPEM-SKNLISMGTLDSEGYEFRGNNGTLRVMQGSNVFM 363 >ref|XP_003634412.1| PREDICTED: uncharacterized protein LOC100853086 [Vitis vinifera] Length = 558 Score = 60.8 bits (146), Expect = 2e-07 Identities = 40/113 (35%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Frame = -2 Query: 388 LDYSDDENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKN-GTVMIANGD 212 +DY +E FA + W++DSGCT HMT+ +D F L + V I NG+ Sbjct: 259 VDYCQEEQLFAATCFANKSTS--ESWLVDSGCTNHMTNNQDLFRELDRTTISKVRIGNGE 316 Query: 211 EIPVFGVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEI 53 IPV G G V ++ T I DVLF PD+ NL+S +L E +++ Sbjct: 317 YIPVKGKGTVAIESQT-----GLKLIYDVLFVPDI-DQNLLSVGQLVEEEFKV 363 >emb|CAN76346.1| hypothetical protein VITISV_024039 [Vitis vinifera] Length = 571 Score = 60.8 bits (146), Expect = 2e-07 Identities = 40/113 (35%), Positives = 57/113 (50%), Gaps = 1/113 (0%) Frame = -2 Query: 388 LDYSDDENFTTEKAFAVSTPQIPSHWILDSGCTTHMTSRKDQFTSLTKKN-GTVMIANGD 212 +DY +E FA + W++DSGCT HMT+ +D F L + V I NG+ Sbjct: 292 VDYCQEEQLFAATCFANKSTS--ESWLVDSGCTNHMTNNQDLFRELDRTTISKVRIGNGE 349 Query: 211 EIPVFGVGQVRLDLLTSNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEI 53 IPV G G V ++ T I DVLF PD+ NL+S +L E +++ Sbjct: 350 YIPVKGKGTVAIESQT-----GLKLIYDVLFVPDI-DQNLLSVGQLVEEEFKV 396 >gb|AAD19773.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1335 Score = 60.8 bits (146), Expect = 2e-07 Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 1/108 (0%) Frame = -2 Query: 340 VSTPQIPSHWILDSGCTTHMTSRKDQFTSLTK-KNGTVMIANGDEIPVFGVGQVRLDLLT 164 V T I + W+LD+GC+ HMT RKD F + +G V + N PV G+G +++ Sbjct: 276 VVTDSIANEWVLDTGCSFHMTPRKDWFKDFKELSSGYVKMGNDTYSPVKGIGSIKI---- 331 Query: 163 SNHSSATVHINDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLK 20 N + V + DV + P++ NLIS LE G S+ G +++K Sbjct: 332 RNSDGSQVILTDVRYMPNMT-RNLISLGTLEDRGCWFKSQDGILKIVK 378 >gb|EFA06256.1| hypothetical protein TcasGA2_TC009115 [Tribolium castaneum] Length = 1267 Score = 60.5 bits (145), Expect = 2e-07 Identities = 33/101 (32%), Positives = 60/101 (59%), Gaps = 1/101 (0%) Frame = -2 Query: 313 WILDSGCTTHMTSRKDQFTSLTKKNG-TVMIANGDEIPVFGVGQVRLDLLTSNHSSATVH 137 WILDSG + HMTSR+D F+++ + + +V + NG E+ V G G + ++ N Sbjct: 269 WILDSGASAHMTSRRDMFSTIQEVDEFSVKLGNGSELKVKGKGTIEIECWLENEWIKN-K 327 Query: 136 INDVLFEPDLKGGNLISESRLEREGYEISSKQGERRLLKNG 14 + DV + P+LK NL SE ++ ++G I+ + + +++ +G Sbjct: 328 MTDVWYIPNLK-RNLFSEGQITKKGMTITKENNKAKIVCDG 367