BLASTX nr result

ID: Cephaelis21_contig00018538 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00018538
         (1441 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   114   9e-36
gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...    95   6e-34
emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulga...    99   8e-31
pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW1...    99   1e-30
gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa ...    83   3e-30

>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score =  114 bits (284), Expect(2) = 9e-36
 Identities = 91/332 (27%), Positives = 153/332 (46%), Gaps = 1/332 (0%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNC+GLG P T+ +L+E  R++  D +FL ETKQ+  +   +  K+  F    I+ P+  
Sbjct: 368  WNCQGLGQPLTVRRLEEVQRVYFLDMLFLIETKQQDNYTRDLGVKM-GFEDMCIISPRGL 426

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
                   W   + I QV+S+D    V+  V  +N + +   +Y  P    R   W  LQ 
Sbjct: 427  SGGLVVYWKKHLSI-QVISHDVRL-VDLYVEYKNFNFYLSCIYGHPIPSERHHLWEKLQR 484

Query: 1079 -QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTW 903
               H  G +    GD N+I++ + K+GGR RS GSL+ F + I    M D++ KG P++W
Sbjct: 485  VSAHRSGPW-MMCGDFNEILNLNEKKGGRRRSIGSLQNFTNMINCCNMKDLKSKGNPYSW 543

Query: 902  ANNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRF 723
               R  E  +E  LD  F + DW   +P      +    SDH  +I++          +F
Sbjct: 544  VGKRQNE-TIESCLDRVFINSDWQASFPAFETEFLPIAGSDHAPVIIDIAEEVCTKRGQF 602

Query: 722  CFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAG 543
             +D+           +++  N+ +S +      EK+   R  L K K   + N+   I  
Sbjct: 603  RYDRRHFQFEDFVDSVQRGWNRGRSDSH-GGYYEKLHCCRQELAKWKRRTKTNTAEKIET 661

Query: 542  IKSKLEKMQNE*GTRNWRLWNQLQAQLGQEYK 447
            +K +++  + +  T   +   +L+  L Q Y+
Sbjct: 662  LKYRVDAAERD-HTLPHQTILRLRQDLNQAYR 692



 Score = 64.3 bits (155), Expect(2) = 9e-36
 Identities = 33/96 (34%), Positives = 48/96 (50%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
           E +W  KSR +W+  GDRNT FF A    RK +N I+ +   QGIE      I     + 
Sbjct: 695 ELYWHLKSRNRWMLLGDRNTMFFYASTKLRKSRNRIKAITDAQGIENFRDDTIGKVAENY 754

Query: 259 YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPV 152
           +  LFT+++ S W + +  +   +T  MN  L   V
Sbjct: 755 FADLFTTTQTSDWEEIISGIAPKVTEQMNHELLQSV 790


>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score = 95.1 bits (235), Expect(2) = 6e-34
 Identities = 85/330 (25%), Positives = 143/330 (43%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNC+G+G   T+  L+E   L+  + +FL ETK++  ++ +V   L  F     VEP   
Sbjct: 6    WNCQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHL-GFFDLHTVEPIGK 64

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
                   W D + IK VL +D    ++  +  ++   +   +Y  P +  R + W  L  
Sbjct: 65   SGGLALMWKDSVQIK-VLQSDKRL-IDALLIWQDKEFYLTCIYGEPVQAERGELWERLTR 122

Query: 1079 QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWA 900
                        GD N+++D S K GG  R   S  +F   +    + ++ + G  ++W 
Sbjct: 123  LGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHSGYQFSWY 182

Query: 899  NNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRFC 720
             NR  E  V+ RLD    +  W  L+P A   ++ K  SDH  LI      N      F 
Sbjct: 183  GNRNDE-LVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWRKWAGFK 241

Query: 719  FDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAGI 540
            +DK ++   G +  +    +Q  + T    + EK+ S R  + K K   + +S   I  +
Sbjct: 242  YDKRWVQREGFKDLLCNFWSQQSTKTNALMM-EKIASCRREISKWKRVSKPSSAVRIQEL 300

Query: 539  KSKLEKMQNE*GTRNWRLWNQLQAQLGQEY 450
            + KL+    +       L  +L+ +L QEY
Sbjct: 301  QFKLDAATKQIPFDRREL-ARLKKELSQEY 329



 Score = 77.0 bits (188), Expect(2) = 6e-34
 Identities = 35/98 (35%), Positives = 56/98 (57%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
           E+FWQ+KSRI W+  GDRNTK+F A    R+ QN I++L+ ++G E +  +D+       
Sbjct: 333 EQFWQEKSRIMWMRNGDRNTKYFHAATKNRRAQNRIQKLIDEEGREWTSDEDLGRVAEAY 392

Query: 259 YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPVVK 146
           + +LF S       + ++NL   ++  MN  L  P+ K
Sbjct: 393 FKKLFASEDVGYTVEELENLTPLVSDQMNNNLLAPITK 430


>emb|CCA66044.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1355

 Score = 99.4 bits (246), Expect(2) = 8e-31
 Identities = 80/317 (25%), Positives = 132/317 (41%), Gaps = 7/317 (2%)
 Frame = -3

Query: 1439 WNCRGLGGPFTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXX 1260
            WNCRG+G P T+ QL++    +  D +FLSET        ++  +L  F     V  +  
Sbjct: 6    WNCRGVGNPRTVRQLRKWSTFYAPDIMFLSETMINKTESEALKSRL-GFANAFGVSSRGR 64

Query: 1259 XXXXXXGWSDKIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQH 1080
                   W +++    V  +      + +   +     GI+ +A  ++  +   W  ++ 
Sbjct: 65   AGGLCVFWREELSFSLVSFSQHHICGDIDDGAKKWRFVGIYGWAKEEE--KHHTWSLMRF 122

Query: 1079 QKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWA 900
                  +    GGD N+IM    K+GG  R    + QF + +  + + D+ Y G   TW 
Sbjct: 123  LCEDLSRPILMGGDFNEIMSYEEKEGGADRVRRGMYQFRETMDDLFLRDLGYNGVWHTWE 182

Query: 899  NNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSK--R 726
                    + ERLD F  SP WA +YPN +V H ++  SDH  + +      +P SK  R
Sbjct: 183  RGNSLSTCIRERLDRFVCSPSWATMYPNTIVDHSMRYKSDHLAICLRSNRTRRPTSKQRR 242

Query: 725  FCFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLK-----SSHRMNS 561
            F F+  +L  P  E  I          +          + R+ LL LK     S    N 
Sbjct: 243  FFFETSWLLDPTCEETIRDAWTDSAGDSL---------TGRLDLLALKLKSWSSEKGGNI 293

Query: 560  GAAIAGIKSKLEKMQNE 510
            G  +  ++S L ++Q +
Sbjct: 294  GKQLGRVESDLCRLQQQ 310



 Score = 62.4 bits (150), Expect(2) = 8e-31
 Identities = 37/102 (36%), Positives = 52/102 (50%), Gaps = 2/102 (1%)
 Frame = -1

Query: 445 K*EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIY 266
           K E  W  +SR   + +GDRNTK+F   A+QRK++N ++ L    G  C E  DIE    
Sbjct: 333 KQEARWYLRSRAMEVRDGDRNTKYFHHKASQRKKRNFVKGLFDASGTWCEEVDDIECVFT 392

Query: 265 DSYTQLFTSSKPS--CWGDAVDNLQSSITSSMNQRLTTPVVK 146
           D +T +FTS+ PS     D +  +   +T   N  L  P  K
Sbjct: 393 DYFTSIFTSTNPSDVQLNDVLCCVDPVVTEECNTWLLKPFSK 434


>pir||S65812 RNA-directed DNA polymerase (EC 2.7.7.49) (clone DW15) - Arabidopsis
            thaliana retrotransposon Ta11-1 gi|976278|gb|AAA75254.1|
            reverse transcriptase [Arabidopsis thaliana]
          Length = 1333

 Score = 99.4 bits (246), Expect(2) = 1e-30
 Identities = 85/320 (26%), Positives = 140/320 (43%), Gaps = 11/320 (3%)
 Frame = -3

Query: 1439 WNCRGLGGP--FTISQLKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWK---IV 1275
            WNC+GLG     TI +L E    H  + +FL ETK       +V   L  ++G++    V
Sbjct: 6    WNCQGLGWSQDLTIPRLMEMRLSHFPEVLFLMETKN----CSNVVVDLQEWLGYERVFTV 61

Query: 1274 EPQXXXXXXXXGWSD--KIIIKQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQ 1101
             P          W     I+IK    N   FQ++F     +H  +   VY +P    +  
Sbjct: 62   NPIGLSGGLALFWKKGVDIVIKYADKNLIDFQIQFG----SHEFYVSCVYGNPAFSDKHL 117

Query: 1100 QWLYLQ----HQKHLWGQY*FPGGDLNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMD 933
             W  +     ++K  W       GD N I+ +  K+GG  R   S   F D +   +M++
Sbjct: 118  VWEKITRIGINRKEPWCML----GDFNPILHNGEKRGGPRRGDSSFLPFTDMLDSCDMLE 173

Query: 932  IRYKGRPWTWANNRVGEGFVEERLDWFFGSPDWALLYPNALVYHILKQASDHCLLIMEDK 753
            +   G P+TW   +  E +++ RLD  FG+ +W   +P +    + K+ SDH  +++   
Sbjct: 174  LPSIGNPFTW-GGKTNEMWIQSRLDRCFGNKNWFRFFPISNQEFLDKRGSDHRPVLVRLT 232

Query: 752  PPNKPPSKRFCFDKHFLDLPGIE*EIEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSH 573
               +     F FDK   + P ++  I +  N  Q    +  V +K+K  R AL + K  +
Sbjct: 233  KTKEEYRGNFRFDKRLFNQPNVKETIVQAWNGSQRNENLL-VLDKLKHCRSALSRWKKEN 291

Query: 572  RMNSGAAIAGIKSKLEKMQN 513
             +NS   I   ++ LE  Q+
Sbjct: 292  NINSSTRITQARAALELEQS 311



 Score = 61.6 bits (148), Expect(2) = 1e-30
 Identities = 35/100 (35%), Positives = 53/100 (53%), Gaps = 2/100 (2%)
 Frame = -1

Query: 439 EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGI--ECSEHKDIESEIY 266
           E FW QKSR +W+  GD+NT FF A     + +  I++L    G+  +   +K   +E Y
Sbjct: 335 EVFWSQKSRAKWMHSGDKNTSFFHASVKDNRGKQHIDQLCDVNGLFHKDEMNKGAIAEAY 394

Query: 265 DSYTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPVVK 146
             ++ LF S+ PS + D  ++ Q  +T SMN  L   V K
Sbjct: 395 --FSDLFKSTDPSSFVDLFEDYQPRVTESMNNTLIAAVSK 432


>gb|AFP55574.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
          Length = 1656

 Score = 83.2 bits (204), Expect(2) = 3e-30
 Identities = 65/293 (22%), Positives = 118/293 (40%)
 Frame = -3

Query: 1397 LKEELRLHLSDFVFLSETKQKPAFVHSVCRKL*HFVGWKIVEPQXXXXXXXXGWSDKIII 1218
            L+   + H  + +FL ET+Q+   +    R L  F    +V+P          W D + +
Sbjct: 624  LRRICKKHNPEILFLMETRQQEGIIKEWKRNL-KFTDHHVVDPIATGRGLALFWGDAVQV 682

Query: 1217 KQVLSNDFCFQVEFEVTGRNHSSWGIFVYASPDKQIRKQQWLYLQHQKHLWGQY*FPGGD 1038
              + S+                    ++Y +P    ++  W  +  +  +        GD
Sbjct: 683  SILDSSPNYVDTVVSFLSDAFVCKITWMYGNPHDNEKRAFWRLMYSRFPVQSLPWLVLGD 742

Query: 1037 LNDIMDSSNKQGGRVRSAGSLKQFNDFILGMEMMDIRYKGRPWTWANNRVGEGFVEERLD 858
             N+++D S K GG       +K F DF+    + D+ +KG  ++W   R G  F++ERLD
Sbjct: 743  FNEVLDPSEKWGGGPPLPWRIKLFRDFLNNGHLRDLHFKGPGFSWFAMRHGRVFIKERLD 802

Query: 857  WFFGSPDWALLYPNALVYHILKQASDHCLLIMEDKPPNKPPSKRFCFDKHFLDLPGIE*E 678
               G+  W+   PN  + H+ K  SDH  L+++  P     ++ F F++ +         
Sbjct: 803  RALGNIAWSSSQPNTQILHLPKIGSDHRPLLLDSNPKMLNKTRLFRFEQMWTTHEEYSDV 862

Query: 677  IEKV*NQPQSGTFMFQVAEKVKSSRVALLKLKSSHRMNSGAAIAGIKSKLEKM 519
            I++       G+ M      + S   AL         N    +A + S +EK+
Sbjct: 863  IQRSWPPAFGGSAMRSWNRNLLSCGKALKMWSKEKFSNPSVQVADLLSDIEKL 915



 Score = 76.6 bits (187), Expect(2) = 3e-30
 Identities = 34/96 (35%), Positives = 56/96 (58%)
 Frame = -1

Query: 439  EKFWQQKSRIQWLAEGDRNTKFFQAYATQRKRQNCIERLVTDQGIECSEHKDIESEIYDS 260
            E +W Q+SR+ WL  GD+N+ FF     QR++ N I RL  D G       D+  +  D 
Sbjct: 942  EMYWHQRSRVNWLKLGDQNSSFFHQTTIQRRQYNKIVRLKDDHGNWLDSEADVALQFLDY 1001

Query: 259  YTQLFTSSKPSCWGDAVDNLQSSITSSMNQRLTTPV 152
            +T L+ S+ P  W + +D + +++T+ MN+ L++PV
Sbjct: 1002 FTALYQSNGPQQWEEVLDFVDTAVTAEMNKILSSPV 1037


Top