BLASTX nr result

ID: Cephaelis21_contig00026217 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00026217
         (1948 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...   335   9e-94
gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana]             337   5e-90
gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana]              337   5e-90
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1...   336   2e-89
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   324   6e-89

>gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1656

 Score =  335 bits (860), Expect(2) = 9e-94
 Identities = 183/527 (34%), Positives = 280/527 (53%)
 Frame = -3

Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428
            W++ GDFN++  P EK GG P  P     F DF++   +  + F G   +W        +
Sbjct: 737  WLVLGDFNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRVF 796

Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248
            I+ RLD+  G   W  +     I H+ K  SDH  L+LD+ P      + F F+Q   + 
Sbjct: 797  IKERLDRALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTTH 856

Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068
            +   +VI+R+W   F GS M      + +C   +  W++++  N ++Q+ +L  ++E+L 
Sbjct: 857  EEYSDVIQRSWPPAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKLH 916

Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888
                       + L  Q+ + +  +E +W Q+ RV WLK GD+N+ FFH  T+QRR+ N+
Sbjct: 917  QSNPPDAHHQINILTDQVTKLWTQDEMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNK 976

Query: 887  LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNSSLIRP 708
            + RL+   G W   E ++  +                WE+ L    + +T  MN  L  P
Sbjct: 977  IVRLKDDHGNWLDSEADVALQFLDYFTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSP 1036

Query: 707  VEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXFNHT 528
            V   E+K+AVF +   KAPG DG S  F+Q+ W  VQ  + ++              N T
Sbjct: 1037 VSLLEVKKAVFDLGATKAPGPDGFSGIFYQNQWEWVQSIIHESALQHQTSSSLLQVMNRT 1096

Query: 527  LISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGRKIL 348
             ++ IPK++ PT  S +RPI+LCN  YKI++KI+  RL+  +   IS+NQSAF+  R+I 
Sbjct: 1097 HLALIPKVKAPTHPSHYRPIALCNFSYKILTKIIASRLQPFMSELISDNQSAFVSNRQIQ 1156

Query: 347  DNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS*IS 168
            DNV+IAHE  HHL   R        LKLDM KA+DRVEW FL  ++ +MGF   ++  + 
Sbjct: 1157 DNVIIAHEIYHHLKLTRSCNNGAFGLKLDMNKAYDRVEWNFLEAVLRKMGFVDSWIGLVM 1216

Query: 167  KCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27
             C+ ++S S  ING+      P RG++QGDPLSP+LFL  ++ LS +
Sbjct: 1217 SCVTTSSLSVLINGKPGPSFLPSRGLRQGDPLSPFLFLFVNDVLSRM 1263



 Score = 37.0 bits (84), Expect(2) = 9e-94
 Identities = 25/107 (23%), Positives = 50/107 (46%)
 Frame = -1

Query: 1948 LKESLRLFKPEITFLCETKRKSGFVKTVCKKLGFSSRFSIVDPTGMSGGLLLG*DESVTT 1769
            L+   +   PEI FL ET+++ G +K   + L F+    +VDP     GL L   ++V  
Sbjct: 624  LRRICKKHNPEILFLMETRQQEGIIKEWKRNLKFTDH-HVVDPIATGRGLALFWGDAVQV 682

Query: 1768 YQIITTSFSIEVEFESPSSAGRMWAVFIYASTNEKVRLAQWKELLSK 1628
              + ++   ++      S A      ++Y + ++  + A W+ + S+
Sbjct: 683  SILDSSPNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSR 729


>gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana]
          Length = 1270

 Score =  337 bits (865), Expect = 5e-90
 Identities = 192/532 (36%), Positives = 296/532 (55%), Gaps = 3/532 (0%)
 Frame = -3

Query: 1613 NNWILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDE 1434
            + W + GDFNDI    EK GG   +      FN+ I   D+ ++P  G   TWA    D 
Sbjct: 92   DKWCMFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKGCDLVEMPAHGNGFTWAGRRGDH 151

Query: 1433 GYIEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*I 1254
             +I+ RLD+ FG   W      +  T +  + SDH  +++    ++   + +F FD+R +
Sbjct: 152  -WIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPVLIKLMSSQDSYRGQFRFDKRFL 210

Query: 1253 SKQGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQ 1074
             K+ ++E I R W     G+ +  +A +++ACR  + +W ++ N N+  +I  L+  +E+
Sbjct: 211  FKEDVKEAIIRTWSRGKHGTNI-SVADRLRACRKSLSSWKKQNNLNSLDKINQLEAALEK 269

Query: 1073 LADLGGQRNWETWHN---LQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQR 903
               L     W  +     L+  L +AY+ EE +W+QK R +WL+ G+RN+ +FHA   Q 
Sbjct: 270  EQSLV----WPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWLRSGNRNSKYFHAAVKQN 325

Query: 902  RKSNRLERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNS 723
            R+  R+E+L+  +G     E    +                G+ D        ++E MN 
Sbjct: 326  RQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNLFKSSNPSGFTDWFSGLVPRVSEVMNE 385

Query: 722  SLIRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXX 543
            SL+  V   EIKEAVFS+ P  APG DGMS  FFQ +W  V   V   V+          
Sbjct: 386  SLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYWSTVGNQVTSEVKKFFADGIMPA 445

Query: 542  XFNHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLE 363
             +N+T +  IPK Q PT++   RPISLC+V+YKIISKI+ +RL+  LP  +S+ QSAF+ 
Sbjct: 446  EWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKIMAKRLQPWLPEIVSDTQSAFVS 505

Query: 362  GRKILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQF 183
             R I DN+++AHE +H L    R   +F+A+K DM+KA+DRVEW +L  +++ +GF L++
Sbjct: 506  ERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKAYDRVEWSYLRSLLLSLGFHLKW 565

Query: 182  VS*ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27
            V+ I  C+ S ++S  IN      I  QRG++QGDPLSP+LF++C+E L+HL
Sbjct: 566  VNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLSPFLFVLCTEGLTHL 617


>gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana]
          Length = 1254

 Score =  337 bits (865), Expect = 5e-90
 Identities = 192/532 (36%), Positives = 296/532 (55%), Gaps = 3/532 (0%)
 Frame = -3

Query: 1613 NNWILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDE 1434
            + W + GDFNDI    EK GG   +      FN+ I   D+ ++P  G   TWA    D 
Sbjct: 95   DKWCMFGDFNDILHNGEKNGGPRRSDLDCKAFNEMIKGCDLVEMPAHGNGFTWAGRRGDH 154

Query: 1433 GYIEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*I 1254
             +I+ RLD+ FG   W      +  T +  + SDH  +++    ++   + +F FD+R +
Sbjct: 155  -WIQCRLDRAFGNKEWFCFFPVSNQTFLDFRGSDHRPVLIKLMSSQDSYRGQFRFDKRFL 213

Query: 1253 SKQGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQ 1074
             K+ ++E I R W     G+ +  +A +++ACR  + +W ++ N N+  +I  L+  +E+
Sbjct: 214  FKEDVKEAIIRTWSRGKHGTNI-SVADRLRACRKSLSSWKKQNNLNSLDKINQLEAALEK 272

Query: 1073 LADLGGQRNWETWHN---LQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQR 903
               L     W  +     L+  L +AY+ EE +W+QK R +WL+ G+RN+ +FHA   Q 
Sbjct: 273  EQSLV----WPIFQRVSVLKKDLAKAYREEEAYWKQKSRQKWLRSGNRNSKYFHAAVKQN 328

Query: 902  RKSNRLERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNS 723
            R+  R+E+L+  +G     E    +                G+ D        ++E MN 
Sbjct: 329  RQRKRIEKLKDVNGNMQTSEAAKGEVAAAYFGNLFKSSNPSGFTDWFSGLVPRVSEVMNE 388

Query: 722  SLIRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXX 543
            SL+  V   EIKEAVFS+ P  APG DGMS  FFQ +W  V   V   V+          
Sbjct: 389  SLVGEVSAQEIKEAVFSIKPASAPGPDGMSALFFQHYWSTVGNQVTSEVKKFFADGIMPA 448

Query: 542  XFNHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLE 363
             +N+T +  IPK Q PT++   RPISLC+V+YKIISKI+ +RL+  LP  +S+ QSAF+ 
Sbjct: 449  EWNYTHLCLIPKTQHPTEMVDLRPISLCSVLYKIISKIMAKRLQPWLPEIVSDTQSAFVS 508

Query: 362  GRKILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQF 183
             R I DN+++AHE +H L    R   +F+A+K DM+KA+DRVEW +L  +++ +GF L++
Sbjct: 509  ERLITDNILVAHELVHSLKVHPRISSEFMAVKSDMSKAYDRVEWSYLRSLLLSLGFHLKW 568

Query: 182  VS*ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27
            V+ I  C+ S ++S  IN      I  QRG++QGDPLSP+LF++C+E L+HL
Sbjct: 569  VNWIMVCVSSVTYSVLINDCPFGLIILQRGLRQGDPLSPFLFVLCTEGLTHL 620


>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis
            thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1|
            reverse transcriptase [Arabidopsis thaliana]
          Length = 1333

 Score =  336 bits (861), Expect = 2e-89
 Identities = 190/527 (36%), Positives = 288/527 (54%)
 Frame = -3

Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428
            W + GDFN I    EKRGG     +SF  F D +D  DM ++P +G   TW     +E +
Sbjct: 133  WCMLGDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLELPSIGNPFTWGGK-TNEMW 191

Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248
            I+ RLD+ FG   W      +    + K+ SDH  +++    T++  +  F FD+R  ++
Sbjct: 192  IQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLTKTKEEYRGNFRFDKRLFNQ 251

Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068
              ++E I +AW        +  L  K+K CR  +  W ++ N N++ +I   +  +E L 
Sbjct: 252  PNVKETIVQAWNGSQRNENLLVLD-KLKHCRSALSRWKKENNINSSTRITQARAALE-LE 309

Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888
               G    +   +L+  L +A   EE FW QK R +W+  GD+NT FFHA     R    
Sbjct: 310  QSSGFPRADLVFSLKNDLCKANHDEEVFWSQKSRAKWMHSGDKNTSFFHASVKDNRGKQH 369

Query: 887  LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPSTITESMNSSLIRP 708
            +++L   +G + KDE                      + D   D+   +TESMN++LI  
Sbjct: 370  IDQLCDVNGLFHKDEMNKGAIAEAYFSDLFKSTDPSSFVDLFEDYQPRVTESMNNTLIAA 429

Query: 707  VEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXFNHT 528
            V   EI+EAVF++  + APG+DG +  FFQ +W I+   V K ++           +N T
Sbjct: 430  VSKNEIREAVFAIRSSSAPGVDGFTGFFFQKYWSIICLQVTKEIQNFFLLGYFPKSWNFT 489

Query: 527  LISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGRKIL 348
             +  +PK + P K++  RPISLC+V+YKIISKI+  RL+  LP  +S NQSAF+  R I 
Sbjct: 490  HLCLLPKKKKPDKMTDLRPISLCSVLYKIISKIMVRRLQPFLPDLVSPNQSAFVAERLIF 549

Query: 347  DNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS*IS 168
            DN++IAHE +H L   +   K F+A+K +M+KAFDRVEW ++  ++  +GF  ++V  I 
Sbjct: 550  DNILIAHEVVHGLRTHKSVSKGFIAIKSNMSKAFDRVEWNYVRALLDALGFHQKWVGWIM 609

Query: 167  KCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27
              + S S+S  IN +A   I P RG++QGDPLSP+LF++CSE L+HL
Sbjct: 610  FMISSVSYSVLINDKAFGNIVPSRGLRQGDPLSPFLFVLCSEGLTHL 656


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  324 bits (831), Expect(2) = 6e-89
 Identities = 190/530 (35%), Positives = 282/530 (53%), Gaps = 3/530 (0%)
 Frame = -3

Query: 1607 WILGGDFNDIRCPEEKRGGRPCTPASFWNFNDFIDQMDMEKIPFLGKN*TWANNWEDEGY 1428
            WIL GDFN+I    EK GG      +F  F + +   D++ I  +G   +W         
Sbjct: 512  WILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERHSHT- 570

Query: 1427 IEVRLDKFFGASTWLVTHSTAVITHVRKQASDHSLLVLDTEPTRKRIKQRFCFDQR*ISK 1248
            ++  LD+ F  S        A +  +    SDH  L L  E T  R  + F FD+R +  
Sbjct: 571  VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRLLEV 630

Query: 1247 QGLEEVIKRAWEADFVGSPMFRLAAKIKACRLGILAWNRKQNFNAALQIQNLKDEMEQLA 1068
               +  +K  W     G     L  +++ CR  +     K N N+ ++I  L+  +++  
Sbjct: 631  PHFKTYVKAGWNKAINGQRK-HLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAALDKAM 689

Query: 1067 DLGGQRNWETWHNLQGQLNQAYQAEEKFWRQKLRVQWLKEGDRNTHFFHACTLQRRKSNR 888
                +    T  ++Q +L  AY+ EE++W+QK R QW+KEGDRNT FFHACT  R   NR
Sbjct: 690  SSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDRNTEFFHACTKTRFSVNR 749

Query: 887  LERLEKADGTWTKDEDELLDEIXXXXXXXXXXXXSWGWEDALIDFPS---TITESMNSSL 717
            L  ++  +G   + + E+                  G   ++IDF      +TE +N  L
Sbjct: 750  LVTIKDEEGMIYRGDKEIGVHAQEFFTKVYESN---GRPVSIIDFAGFKPIVTEQINDDL 806

Query: 716  IRPVEDGEIKEAVFSMNPNKAPGMDGMSPCFFQSFWHIVQFDVCKAVRXXXXXXXXXXXF 537
             + + D EI  A+  +  +KAPG DG++  F++S W IV  DV K V+            
Sbjct: 807  TKDLSDLEIYNAICHIGDDKAPGPDGLTARFYKSCWEIVGPDVIKEVKIFFRTSYMKQSI 866

Query: 536  NHTLISFIPKIQLPTKISQFRPISLCNVIYKIISKILTERLKLCLPFCISENQSAFLEGR 357
            NHT I  IPKI  P  +S +RPI+LCNV+YKIISK L ERLK  L   +S++Q+AF+ GR
Sbjct: 867  NHTNICMIPKITNPETLSDYRPIALCNVLYKIISKCLVERLKGHLDAIVSDSQAAFIPGR 926

Query: 356  KILDNVVIAHEYIHHLNKMRRGRKKFVALKLDMAKAFDRVEWRFLYFIMIRMGFDLQFVS 177
             + DNV+IAHE +H L   +R  + ++A+K D++KA+DRVEW FL   M   GF   ++ 
Sbjct: 927  LVNDNVMIAHEMMHSLKTRKRVSQSYMAVKTDVSKAYDRVEWNFLETTMRLFGFSETWIK 986

Query: 176  *ISKCLQSASFSFNINGEAK*YIRPQRGIKQGDPLSPYLFLICSEALSHL 27
             I   ++S ++S  +NG     I+PQRGI+QGDPLSPYLF++C++ L+HL
Sbjct: 987  WIMGAVKSVNYSVLVNGIPHGTIQPQRGIRQGDPLSPYLFILCADILNHL 1036



 Score = 32.0 bits (71), Expect(2) = 6e-89
 Identities = 29/107 (27%), Positives = 46/107 (42%)
 Frame = -1

Query: 1948 LKESLRLFKPEITFLCETKRKSGFVKTVCKKLGFSSRFSIVDPTGMSGGLLLG*DESVTT 1769
            L    ++FK ++ FL ET  K   +  +   LGF +  +   P G SGGL L   +SV  
Sbjct: 401  LSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVIT-QPPQGHSGGLALLWKDSVRL 459

Query: 1768 YQIITTSFSIEVEFESPSSAGRMWAVFIYASTNEKVRLAQWKELLSK 1628
              +      I+V     +    +  V+ +   +E+  L    E LSK
Sbjct: 460  SNLYQDDRHIDVHISINNINFYLSRVYGHPCQSERHSLWTHFENLSK 506


Top