BLASTX nr result
ID: Cephaelis21_contig00018538
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00018538
         (1441 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   114   9e-36
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    95   6e-34
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...    99   8e-31
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1...    99   1e-30
gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...    83   3e-30

>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase
          [Arabidopsis thaliana]
          Length = 1715

 Score = 114 bits (284), Expect(2) = 9e-36
 Identities = 91/332 (27%), Positives = 153/332 (46%), Gaps = 1/332 (0%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNC+GLG P T+ +L+E R++ D +FL ETKQ+ + + K+ F I+ P+
Sbjct: 368  WNCQGLGQPLTVRRLEEVQRVYFLDMLFLIETKQQDNYTRDLGVKM-GFEDMCIISPRGL 426

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
            W + I QV+S+D V+ V +N + + +Y P R W LQ
Sbjct: 427  SGGLVVYWKKHLSI-QVISHDVRL-VDLYVEYKNFNFYLSCIYGHPIPSERHHLWEKLQR 484

Query: 1079 -QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTW 903
            H G + GD N+I++ + K+GGR RS GSL+ F + I M D++ KG P++W
Sbjct: 485  VSAHRSGPW-MMCGDFNEILNLNEKKGGRRRSIGSLQNFTNMINCCNMKDLKSKGNPYSW 543

Query: 902 ANNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRF 723
           R E +E LD F + DW +P + SDH +I++ +F
Sbjct: 544 VGKRQNE-TIESCLDRVFINSDWQASFPAFETEFLPIAGSDHAPVIIDIAEEVCTKRGQF 602

Query: 722 CFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAG 543
           +D+ +++ N+ +S + EK+ R L K K + N+ I
Sbjct: 603 RYDRRHFQFEDFVDSVQRGWNRGRSDSH-GGYYEKLHCCRQELAKWKRRTKTNTAEKIET 661

Query: 542 IKSKLEKMQNE*GTRNWRLWNQLQAQLGQEYK 447
           +K +++ + + T + +L+ L Q Y+
Sbjct: 662 LKYRVDAAERD-HTLPHQTILRLRQDLNQAYR 692

 Score = 64.3 bits (155), Expect(2) = 9e-36
 Identities = 33/96 (34%), Positives = 48/96 (50%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
           E +W KSR +W+ GDRNT FF A RK +N I+ + QGIE I +
Sbjct: 695 ELYWHLKSRNRWMLLGDRNTMFFYASTKLRKSRNRIKAITDAQGIENFRDDTIGKVAENY 754

Query: 259 YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPV 152
           + LFT+++ S W + + + +T MN L V
Sbjct: 755 FADLFTTTQTSDWEEIISGIAPKVTEQMNHELLQSV 790

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase
          [Arabidopsis thaliana]
          Length = 1374

 Score = 95.1 bits (235), Expect(2) = 6e-34
 Identities = 85/330 (25%), Positives = 143/330 (43%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNC+G+G T+ L+E L+ + +FL ETK++ ++ +V L F VEP
Sbjct: 6    WNCQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHL-GFFDLHTVEPIGK 64

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
            W D + IK VL +D ++ + ++ + +Y P + R + W L
Sbjct: 65   SGGLALMWKDSVQIK-VLQSDKRL-IDALLIWQDKEFYLTCIYGEPVQAERGELWERLTR 122

Query: 1079 QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWA 900
            GD N+++D S K GG R S +F + + ++ + G ++W
Sbjct: 123  LGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHSGYQFSWY 182

Query: 899 NNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRFC 720
           NR E V+ RLD + W L+P A ++ K SDH LI N F
Sbjct: 183 GNRNDE-LVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWRKWAGFK 241

Query: 719 FDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAGI 540
           +DK ++ G + + +Q + T + EK+ S R + K K + +S I +
Sbjct: 242 YDKRWVQREGFKDLLCNFWSQQSTKTNALMM-EKIASCRREISKWKRVSKPSSAVRIQEL 300

Query: 539 KSKLEKMQNE*GTRNWRLWNQLQAQLGQEY 450
           + KL+ + L +L+ +L QEY
Sbjct: 301 QFKLDAATKQIPFDRREL-ARLKKELSQEY 329

 Score = 77.0 bits (188), Expect(2) = 6e-34
 Identities = 35/98 (35%), Positives = 56/98 (57%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
           E+FWQ+KSRI W+ GDRNTK+F A R+ QN I++L+ ++G E + +D+
Sbjct: 333 EQFWQEKSRIMWMRNGDRNTKYFHAATKNRRAQNRIQKLIDEEGREWTSDEDLGRVAEAY 392

Query: 259 YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPVVK 146
           + +LF S + ++NL ++ MN L P+ K
Sbjct: 393 FKKLFASEDVGYTVEELENLTPLVSDQMNNNLLAPITK 430

>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score = 99.4 bits (246), Expect(2) = 8e-31
 Identities = 80/317 (25%), Positives = 132/317 (41%), Gaps = 7/317 (2%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNCRG+G P T+ QL++ + D +FLSET ++ +L F V +
Sbjct: 6    WNCRGVGNPRTVRQLRKWSTFYAPDIMFLSETMINKTESEALKSRL-GFANAFGVSSRGR 64

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
            W +++ V + + + + GI+ +A ++ + W ++
Sbjct: 65   AGGLCVFWREELSFSLVSFSQHHICGDIDDGAKKWRFVGIYGWAKEEE--KHHTWSLMRF 122

Query: 1079 QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWA 900
            + GGD N+IM K+GG R + QF + + + + D+ Y G TW
Sbjct: 123  LCEDLSRPILMGGDFNEIMSYEEKEGGADRVRRGMYQFRETMDDLFLRDLGYNGVWHTWE 182

Query: 899 NNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSK--R 726
           + ERLD F SP WA +YPN +V H ++ SDH + + +P SK R
Sbjct: 183 RGNSLSTCIRERLDRFVCSPSWATMYPNTIVDHSMRYKSDHLAICLRSNRTRRPTSKQRR 242

Query: 725 FCFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLK-----SSHRMNS 561
           F F+ +L P E I + + R+ LL LK S N
Sbjct: 243 FFFETSWLLDPTCEETIRDAWTDSAGDSL---------TGRLDLLALKLKSWSSEKGGNI 293

Query: 560 GAAIAGIKSKLEKMQNE 510
           G + ++S L ++Q +
Sbjct: 294 GKQLGRVESDLCRLQQQ 310

 Score = 62.4 bits (150), Expect(2) = 8e-31
 Identities = 37/102 (36%), Positives = 52/102 (50%), Gaps = 2/102 (1%)
 Frame = -1

Query: 445 K*EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIY 266
           K E W +SR + +GDRNTK+F A+QRK++N ++ L G C E DIE
Sbjct: 333 KQEARWYLRSRAMEVRDGDRNTKYFHHKASQRKKRNFVKGLFDASGTWCEEVDDIECVFT 392

Query: 265 DSYTQLFTSSKPS--CWGDAVDNLQSSITSSMNQRLTTPVVK 146
           D +T +FTS+ PS D + + +T N L P K
Sbjct: 393 DYFTSIFTSTNPSDVQLNDVLCCVDPVVTEECNTWLLKPFSK 434

>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) -
          Arabidopsis thaliana retrotransposon Ta11-1
 gi|976278|gb|AAA75254.1| reverse transcriptase [Arabidopsis thaliana]
          Length = 1333

 Score = 99.4 bits (246), Expect(2) = 1e-30
 Identities = 85/320 (26%), Positives = 140/320 (43%), Gaps = 11/320 (3%)
 Frame = -3

Query: 1439 WNCRGLGGP--FTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWK---IV 1275
            WNC+GLG TI +L E H + +FL ETK +V L ++G++ V
Sbjct: 6    WNCQGLGWSQDLTIPRLMEMRLSHFPEVLFLMETKN----CSNVVVDLQEWLGYERVFTV 61

Query: 1274 EPQXXXXXXXXGWSD--KIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQ 1101
            P W I+IK N FQ++F +H + VY +P +
Sbjct: 62   NPIGLSGGLALFWKKGVDIVIKYADKNLIDFQIQFG----SHEFYVSCVYGNPAFSDKHL 117

Query: 1100 QWLYLQ----HQKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMD 933
            W + ++K W GD N I+ + K+GG R S F D + +M++
Sbjct: 118  VWEKITRIGINRKEPWCML----GDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLE 173

Query: 932 IRYKGRPWTWANNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDK 753
           + G P+TW + E +++ RLD FG+ +W +P + + K+ SDH +++
Sbjct: 174 LPSIGNPFTW-GGKTNEMWIQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLT 232

Query: 752 PPNKPPSKRFCFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSH 573
           + F FDK + P ++ I + N Q + V +K+K R AL + K +
Sbjct: 233 KTKEEYRGNFRFDKRLFNQPNVKETIVQAWNGSQRNENLL-VLDKLKHCRSALSRWKKEN 291

Query: 572 RMNSGAAIAGIKSKLEKMQN 513
           +NS I ++ LE Q+
Sbjct: 292 NINSSTRITQARAALELEQS 311

 Score = 61.6 bits (148), Expect(2) = 1e-30
 Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGI--ECSEHKDIESEIY 266
           E FW QKSR +W+ GD+NT FF A + + I++L G+ + +K +E Y
Sbjct: 335 EVFWSQKSRAKWMHSGDKNTSFFHASVKDNRGKQHIDQLCDVNGLFHKDEMNKGAIAEAY 394

Query: 265 DSYTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPVVK 146
           ++ LF S+ PS + D ++ Q +T SMN L V K
Sbjct: 395 --FSDLFKSTDPSSFVDLFEDYQPRVTESMNNTLIAAVSK 432

>gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1656

 Score = 83.2 bits (204), Expect(2) = 3e-30
 Identities = 65/293 (22%), Positives = 118/293 (40%)
 Frame = -3

Query: 1397 LKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXXXXXXXXGWSDKIII 1218
            L+ + H + +FL ET+Q+ + R L F +V+P W D + +
Sbjct: 624  LRRICKKHNPEILFLMETRQQEGIIKEWKRNL-KFTDHHVVDPIATGRGLALFWGDAVQV 682

Query: 1217 KQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQHQKHLWGQY*FPGGD 1038
            + S+ ++Y +P ++ W + + + GD
Sbjct: 683  SILDSSPNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSRFPVQSLPWLVLGD 742

Query: 1037 LNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWANNRVGEGFVEERLD 858
            N+++D S K GG +K F DF+ + D+ +KG ++W R G F++ERLD
Sbjct: 743  FNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRVFIKERLD 802

Query: 857 WFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRFCFDKHFLDLPGIE*E 678
           G+ W+ PN + H+ K SDH L+++ P ++ F F++ +
Sbjct: 803 RALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTTHEEYSDV 862

Query: 677 IEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAGIKSKLEKM 519
           I++ G+ M + S AL N +A + S +EK+
Sbjct: 863 IQRSWPPAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKL 915

 Score = 76.6 bits (187), Expect(2) = 3e-30
 Identities = 34/96 (35%), Positives = 56/96 (58%)
 Frame = -1

Query: 439  EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
            E +W Q+SR+ WL GD+N+ FF QR++ N I RL D G D+ + D
Sbjct: 942  EMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNKIVRLKDDHGNWLDSEADVALQFLDY 1001

Query: 259  YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPV 152
            +T L+ S+ P W + +D + +++T+ MN+ L++PV
Sbjct: 1002 FTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSPV 1037