BLASTX nr result
ID: Cephaelis21_contig00026217
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00026217 (1948 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ... 335 9e-94 gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana] 337 5e-90 gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana] 337 5e-90 pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1... 336 2e-89 gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam... 324 6e-89 >gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa] Length = 1656 Score = 335 bits (860), Expect(2) = 9e-94 Identities = 183/527 (34%), Positives = 280/527 (53%) Frame = -3 Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428 W++ GDFN++ P EK GG P P F DF++ + + F G +W + Sbjct: 737 WLVLGDFNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRVF 796 Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248 I+ RLD+ G W + I H+ K SDH L+LD+ P + F F+Q + Sbjct: 797 IKERLDRALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTTH 856 Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068 + +VI+R+W F GS M + +C + W++++ N ++Q+ +L ++E+L Sbjct: 857 EEYSDVIQRSWPPAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKLH 916 Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888 + L Q+ + + +E +W Q+ RV WLK GD+N+ FFH T+QRR+ N+ Sbjct: 917 QSNPPDAHHQINILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNK 976 Query: 887 LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNSSLIRP 708 + RL+ G W E ++ + WE+ L + +T MN L P Sbjct: 977 IVRLKDDHGNWLDSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSP 1036 Query: 707 VEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXFNHT 528 V E+K+AVF + KAPG DG S F+Q+ W VQ + ++ N T Sbjct: 1037 VSLLEVKKAVFDLGATKAPGPDGFSGIFYQNQWEWVQSIIHESALQHQTSSSLLQVMNRT 1096 Query: 527 LISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGRKIL 348 ++ IPK++ PT S +RPI+LCN YKI++KI+ RL+ + IS+NQSAF+ R+I Sbjct: 1097 HLALIPKVKAPTHPSHYRPIALCNFSYKILTKIIASRLQPFMSELISDNQSAFVSNRQIQ 1156 Query: 347 DNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS*IS 168 DNV+IAHE HHL R LKLDM KA+DRVEW FL ++ +MGF ++ + Sbjct: 1157 DNVIIAHEIYHHLKLTRSCNNGAFGLKLDMNKAYDRVEWNFLEAVLRKMGFVDSWIGLVM 1216 Query: 167 KCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27 C+ ++S S ING+ P RG++QGDPLSP+LFL ++ LS + Sbjct: 1217 SCVTTSSLSVLINGKPGPSFLPSRGLRQGDPLSPFLFLFVNDVLSRM 1263 Score = 37.0 bits (84), Expect(2) = 9e-94 Identities = 25/107 (23%), Positives = 50/107 (46%) Frame = -1 Query: 1948 LKESLRLFKPEITFLCETKRKSGFVKTVCKKLGFSSRFSIVDPTGMSGGLLLG*DESVTT 1769 L+ + PEI FL ET+++ G +K + L F+ +VDP GL L ++V Sbjct: 624 LRRICKKHNPEILFLMETRQQEGIIKEWKRNLKFTDH-HVVDPIATGRGLALFWGDAVQV 682 Query: 1768 YQIITTSFSIEVEFESPSSAGRMWAVFIYASTNEKVRLAQWKELLSK 1628 + ++ ++ S A ++Y + ++ + A W+ + S+ Sbjct: 683 SILDSSPNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSR 729 >gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana] Length = 1270 Score = 337 bits (865), Expect = 5e-90 Identities = 192/532 (36%), Positives = 296/532 (55%), Gaps = 3/532 (0%) Frame = -3 Query: 1613 NNWILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDE 1434 + W + GDFNDI EK GG + FN+ I D+ ++P G TWA D Sbjct: 92 DKWCMFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKGCDLVEMPAHGNGFTWAGRRGDH 151 Query: 1433 GYIEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*I 1254 +I+ RLD+ FG W + T + + SDH +++ ++ + +F FD+R + Sbjct: 152 -WIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPVLIKLMSSQDSYRGQFRFDKRFL 210 Query: 1253 SKQGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQ 1074 K+ ++E I R W G+ + +A +++ACR + +W ++ N N+ +I L+ +E+ Sbjct: 211 FKEDVKEAIIRTWSRGKHGTNI-SVADRLRACRKSLSSWKKQNNLNSLDKINQLEAALEK 269 Query: 1073 LADLGGQRNWETWHN---LQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQR 903 L W + L+ L +AY+ EE +W+QK R +WL+ G+RN+ +FHA Q Sbjct: 270 EQSLV----WPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWLRSGNRNSKYFHAAVKQN 325 Query: 902 RKSNRLERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNS 723 R+ R+E+L+ +G E + G+ D ++E MN Sbjct: 326 RQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNLFKSSNPSGFTDWFSGLVPRVSEVMNE 385 Query: 722 SLIRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXX 543 SL+ V EIKEAVFS+ P APG DGMS FFQ +W V V V+ Sbjct: 386 SLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYWSTVGNQVTSEVKKFFADGIMPA 445 Query: 542 XFNHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLE 363 +N+T + IPK Q PT++ RPISLC+V+YKIISKI+ +RL+ LP +S+ QSAF+ Sbjct: 446 EWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKIMAKRLQPWLPEIVSDTQSAFVS 505 Query: 362 GRKILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQF 183 R I DN+++AHE +H L R +F+A+K DM+KA+DRVEW +L +++ +GF L++ Sbjct: 506 ERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKAYDRVEWSYLRSLLLSLGFHLKW 565 Query: 182 VS*ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27 V+ I C+ S ++S IN I QRG++QGDPLSP+LF++C+E L+HL Sbjct: 566 VNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLSPFLFVLCTEGLTHL 617 >gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana] Length = 1254 Score = 337 bits (865), Expect = 5e-90 Identities = 192/532 (36%), Positives = 296/532 (55%), Gaps = 3/532 (0%) Frame = -3 Query: 1613 NNWILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDE 1434 + W + GDFNDI EK GG + FN+ I D+ ++P G TWA D Sbjct: 95 DKWCMFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKGCDLVEMPAHGNGFTWAGRRGDH 154 Query: 1433 GYIEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*I 1254 +I+ RLD+ FG W + T + + SDH +++ ++ + +F FD+R + Sbjct: 155 -WIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPVLIKLMSSQDSYRGQFRFDKRFL 213 Query: 1253 SKQGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQ 1074 K+ ++E I R W G+ + +A +++ACR + +W ++ N N+ +I L+ +E+ Sbjct: 214 FKEDVKEAIIRTWSRGKHGTNI-SVADRLRACRKSLSSWKKQNNLNSLDKINQLEAALEK 272 Query: 1073 LADLGGQRNWETWHN---LQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQR 903 L W + L+ L +AY+ EE +W+QK R +WL+ G+RN+ +FHA Q Sbjct: 273 EQSLV----WPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWLRSGNRNSKYFHAAVKQN 328 Query: 902 RKSNRLERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNS 723 R+ R+E+L+ +G E + G+ D ++E MN Sbjct: 329 RQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNLFKSSNPSGFTDWFSGLVPRVSEVMNE 388 Query: 722 SLIRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXX 543 SL+ V EIKEAVFS+ P APG DGMS FFQ +W V V V+ Sbjct: 389 SLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYWSTVGNQVTSEVKKFFADGIMPA 448 Query: 542 XFNHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLE 363 +N+T + IPK Q PT++ RPISLC+V+YKIISKI+ +RL+ LP +S+ QSAF+ Sbjct: 449 EWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKIMAKRLQPWLPEIVSDTQSAFVS 508 Query: 362 GRKILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQF 183 R I DN+++AHE +H L R +F+A+K DM+KA+DRVEW +L +++ +GF L++ Sbjct: 509 ERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKAYDRVEWSYLRSLLLSLGFHLKW 568 Query: 182 VS*ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27 V+ I C+ S ++S IN I QRG++QGDPLSP+LF++C+E L+HL Sbjct: 569 VNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLSPFLFVLCTEGLTHL 620 >pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1| reverse transcriptase [Arabidopsis thaliana] Length = 1333 Score = 336 bits (861), Expect = 2e-89 Identities = 190/527 (36%), Positives = 288/527 (54%) Frame = -3 Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428 W + GDFN I EKRGG +SF F D +D DM ++P +G TW +E + Sbjct: 133 WCMLGDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLELPSIGNPFTWGGK-TNEMW 191 Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248 I+ RLD+ FG W + + K+ SDH +++ T++ + F FD+R ++ Sbjct: 192 IQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLTKTKEEYRGNFRFDKRLFNQ 251 Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068 ++E I +AW + L K+K CR + W ++ N N++ +I + +E L Sbjct: 252 PNVKETIVQAWNGSQRNENLLVLD-KLKHCRSALSRWKKENNINSSTRITQARAALE-LE 309 Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888 G + +L+ L +A EE FW QK R +W+ GD+NT FFHA R Sbjct: 310 QSSGFPRADLVFSLKNDLCKANHDEEVFWSQKSRAKWMHSGDKNTSFFHASVKDNRGKQH 369 Query: 887 LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNSSLIRP 708 +++L +G + KDE + D D+ +TESMN++LI Sbjct: 370 IDQLCDVNGLFHKDEMNKGAIAEAYFSDLFKSTDPSSFVDLFEDYQPRVTESMNNTLIAA 429 Query: 707 VEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXFNHT 528 V EI+EAVF++ + APG+DG + FFQ +W I+ V K ++ +N T Sbjct: 430 VSKNEIREAVFAIRSSSAPGVDGFTGFFFQKYWSIICLQVTKEIQNFFLLGYFPKSWNFT 489 Query: 527 LISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGRKIL 348 + +PK + P K++ RPISLC+V+YKIISKI+ RL+ LP +S NQSAF+ R I Sbjct: 490 HLCLLPKKKKPDKMTDLRPISLCSVLYKIISKIMVRRLQPFLPDLVSPNQSAFVAERLIF 549 Query: 347 DNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS*IS 168 DN++IAHE +H L + K F+A+K +M+KAFDRVEW ++ ++ +GF ++V I Sbjct: 550 DNILIAHEVVHGLRTHKSVSKGFIAIKSNMSKAFDRVEWNYVRALLDALGFHQKWVGWIM 609 Query: 167 KCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27 + S S+S IN +A I P RG++QGDPLSP+LF++CSE L+HL Sbjct: 610 FMISSVSYSVLINDKAFGNIVPSRGLRQGDPLSPFLFVLCSEGLTHL 656 >gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score: 42.57) [Arabidopsis thaliana] Length = 1662 Score = 324 bits (831), Expect(2) = 6e-89 Identities = 190/530 (35%), Positives = 282/530 (53%), Gaps = 3/530 (0%) Frame = -3 Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428 WIL GDFN+I EK GG +F F + + D++ I +G +W Sbjct: 512 WILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERHSHT- 570 Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248 ++ LD+ F S A + + SDH L L E T R + F FD+R + Sbjct: 571 VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRLLEV 630 Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068 + +K W G L +++ CR + K N N+ ++I L+ +++ Sbjct: 631 PHFKTYVKAGWNKAINGQRK-HLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAALDKAM 689 Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888 + T ++Q +L AY+ EE++W+QK R QW+KEGDRNT FFHACT R NR Sbjct: 690 SSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDRNTEFFHACTKTRFSVNR 749 Query: 887 LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPS---TITESMNSSL 717 L ++ +G + + E+ G ++IDF +TE +N L Sbjct: 750 LVTIKDEEGMIYRGDKEIGVHAQEFFTKVYESN---GRPVSIIDFAGFKPIVTEQINDDL 806 Query: 716 IRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXF 537 + + D EI A+ + +KAPG DG++ F++S W IV DV K V+ Sbjct: 807 TKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSI 866 Query: 536 NHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGR 357 NHT I IPKI P +S +RPI+LCNV+YKIISK L ERLK L +S++Q+AF+ GR Sbjct: 867 NHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGR 926 Query: 356 KILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS 177 + DNV+IAHE +H L +R + ++A+K D++KA+DRVEW FL M GF ++ Sbjct: 927 LVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIK 986 Query: 176 *ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27 I ++S ++S +NG I+PQRGI+QGDPLSPYLF++C++ L+HL Sbjct: 987 WIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHL 1036 Score = 32.0 bits (71), Expect(2) = 6e-89 Identities = 29/107 (27%), Positives = 46/107 (42%) Frame = -1 Query: 1948 LKESLRLFKPEITFLCETKRKSGFVKTVCKKLGFSSRFSIVDPTGMSGGLLLG*DESVTT 1769 L ++FK ++ FL ET K + + LGF + + P G SGGL L +SV Sbjct: 401 LSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVIT-QPPQGHSGGLALLWKDSVRL 459 Query: 1768 YQIITTSFSIEVEFESPSSAGRMWAVFIYASTNEKVRLAQWKELLSK 1628 + I+V + + V+ + +E+ L E LSK Sbjct: 460 SNLYQDDRHIDVHISINNINFYLSRVYGHPCQSERHSLWTHFENLSK 506