BLASTX nr result

ID: Cephaelis21_contig00020932 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00020932
         (1046 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAB82639.1| putative non-LTR retroelement reverse transcripta...   162   1e-37
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...   154   5e-35
emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|72677...   153   8e-35
gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam...   152   1e-34
pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   152   2e-34

>gb|AAB82639.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1374

 Score =  162 bits (411), Expect = 1e-37
 Identities = 109/351 (31%), Positives = 179/351 (50%), Gaps = 3/351 (0%)
 Frame = +1

Query: 1    CRGLGGPSTISQLREELRFHLPNIVFLCETKKK-SFVHSVCTKLKMLSMWRVVEPRGLSG 177
            C+G+G   T+  LRE    + P ++FLCETKK+ +++ +V   L    +   VEP G SG
Sbjct: 8    CQGVGNTPTVRHLREIRGLYFPEVIFLCETKKRRNYLENVVGHLGFFDL-HTVEPIGKSG 66

Query: 178  GLMIGWSDRIVVKQVVLNEFCIQ--IEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQ 351
            GL + W D + +K +  ++  I   + ++D E    C    +Y    +A R   W  L +
Sbjct: 67   GLALMWKDSVQIKVLQSDKRLIDALLIWQDKEFYLTC----IYGEPVQAERGELWERLTR 122

Query: 352  QRHFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWA 531
                    W L GD N++ D SEK GG AR E S   FR  +    + E+  +G  ++W 
Sbjct: 123  LGLSRSGPWMLTGDFNELVDPSEKIGGPARKESSCLEFRQMLNSCGLWEVNHSGYQFSWY 182

Query: 532  NNRDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFL 711
             NR+ E  V+ RLDR  A+  W+   P+A   ++ K  SDH+ LI +    +  +   F 
Sbjct: 183  GNRNDE-LVQCRLDRTVANQAWMELFPQAKATYLQKICSDHSPLINNLVGDNWRKWAGFK 241

Query: 712  FDKRLLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAI 891
            +DKR +     ++ +   W++    T    ++ +I +CR  + K K  ++ +S   IQ +
Sbjct: 242  YDKRWVQREGFKDLLCNFWSQQSTKTNAL-MMEKIASCRREISKWKRVSKPSSAVRIQEL 300

Query: 892  KSKMMAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
            + K+ A   K    D +E A LK +L +EY  EE +W +KSR+ W++ GD+
Sbjct: 301  QFKLDA-ATKQIPFDRRELARLKKELSQEYNNEEQFWQEKSRIMWMRNGDR 350


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score =  154 bits (388), Expect = 5e-35
 Identities = 106/352 (30%), Positives = 167/352 (47%), Gaps = 6/352 (1%)
 Frame = +1

Query: 7    GLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVE--PRGLSGG 180
            G+G P T S+L    R +  +I+FL ET  +     VC     L    V+   P G SGG
Sbjct: 391  GIGMPLTQSRLFRLFRMYNYDILFLVETLNQC--DKVCKLAYDLGFPNVITQPPNGRSGG 448

Query: 181  LMIGWSDRIVVKQVVLNEFCIQ--IEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQ 354
            L + W + + +  +  +E  I   + F +      C    VY    ++ R   W  L+  
Sbjct: 449  LALMWKNNVSLSLISQDERLIDSHVTFNNKSFYLSC----VYGHPTQSERHQLWQTLEHI 504

Query: 355  RHFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWAN 534
                   W L GD N+I   +EK GG  R E +FR FR+ +   ++ +++  G  ++W  
Sbjct: 505  SDNRNAEWLLVGDFNEILSNAEKIGGPMREEWTFRNFRNMVSHCDIEDMRSKGDRFSWVG 564

Query: 535  NRDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLF 714
             R     V+  LDR F +  W    P A +  +    SDH  +++        R++ F F
Sbjct: 565  ERHTHT-VKCCLDRVFINSAWTATFPYAEIEFLDFTGSDHKPVLVHFNESFPRRSKLFRF 623

Query: 715  DKRLLNLPQCEETVAVAW--NKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQA 888
            D RL+++P  +  V  +W  N+    TP   +  RI +CR A+ +LK  + LNS + I+ 
Sbjct: 624  DNRLIDIPTFKRIVQTSWRTNRNSRSTP---ITERISSCRQAMARLKHASNLNSEQRIKK 680

Query: 889  IKSKMMAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
            ++S +    E   + D Q    L+  L + +  EE+YW QKSR QW+KEGDQ
Sbjct: 681  LQSSLNRAMESTRRVDRQLIPQLQESLAKAFSDEEIYWKQKSRNQWMKEGDQ 732


>emb|CAB40051.1| putative protein [Arabidopsis thaliana] gi|7267781|emb|CAB81184.1|
            putative protein [Arabidopsis thaliana]
          Length = 1294

 Score =  153 bits (386), Expect = 8e-35
 Identities = 105/347 (30%), Positives = 164/347 (47%)
 Frame = +1

Query: 4    RGLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVEPRGLSGGL 183
            +G+G P T SQL    +    +++FL ET  K  V S    +          P+G SGGL
Sbjct: 370  KGIGVPLTQSQLSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVITQPPQGHSGGL 429

Query: 184  MIGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQRHF 363
             + W D + +  +  ++  I +    + I    +   VY    ++ R S W   +     
Sbjct: 430  ALLWKDSVRLSNLYQDDRHIDVHISINNIN--FYLSRVYGHPCQSERHSLWTHFENLSKT 487

Query: 364  WGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANNRD 543
                W L GD N+I   +EK GG  R E +FRGFR+ +   ++ +I+  G  ++W   R 
Sbjct: 488  RNDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERH 547

Query: 544  GEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFDKR 723
                V+  LDR F + E     P A +  +    SDH  L L  +     + R F FDKR
Sbjct: 548  SHT-VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKR 606

Query: 724  LLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKSKM 903
            LL +P  +  V   WNKA I      +  +++ CR A+ KLK  + LNS   I  +++ +
Sbjct: 607  LLEVPHFKTYVKAGWNKA-INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAAL 665

Query: 904  MAMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
                    + + +  ++++ +L   YR EE YW QKSR QW+KEGD+
Sbjct: 666  DKAMSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 712


>gb|AAC33961.1| contains similarity to reverse trancriptase (Pfam: rvt.hmm, score:
            42.57) [Arabidopsis thaliana]
          Length = 1662

 Score =  152 bits (384), Expect = 1e-34
 Identities = 105/346 (30%), Positives = 163/346 (47%)
 Frame = +1

Query: 7    GLGGPSTISQLREELRFHLPNIVFLCETKKKSFVHSVCTKLKMLSMWRVVEPRGLSGGLM 186
            G+G P T SQL    +    +++FL ET  K  V S    +          P+G SGGL 
Sbjct: 391  GIGVPLTQSQLSNLCKVFKFDVLFLIETLNKCEVISNLASVLGFPNVITQPPQGHSGGLA 450

Query: 187  IGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQRHFW 366
            + W D + +  +  ++  I +    + I    +   VY    ++ R S W   +      
Sbjct: 451  LLWKDSVRLSNLYQDDRHIDVHISINNIN--FYLSRVYGHPCQSERHSLWTHFENLSKTR 508

Query: 367  GKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANNRDG 546
               W L GD N+I   +EK GG  R E +FRGFR+ +   ++ +I+  G  ++W   R  
Sbjct: 509  NDPWILIGDFNEILSNNEKIGGPQRDEWTFRGFRNMVSTCDLKDIRSIGDRFSWVGERHS 568

Query: 547  EGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFDKRL 726
               V+  LDR F + E     P A +  +    SDH  L L  +     + R F FDKRL
Sbjct: 569  HT-VKCCLDRAFINSEGAFLFPFAELEFLEFTGSDHKPLFLSLEKTETRKMRPFRFDKRL 627

Query: 727  LNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKSKMM 906
            L +P  +  V   WNKA I      +  +++ CR A+ KLK  + LNS   I  +++ + 
Sbjct: 628  LEVPHFKTYVKAGWNKA-INGQRKHLPDQVRTCRQAMAKLKHKSNLNSRIRINQLQAALD 686

Query: 907  AMQEKGGQRDWQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
                   + + +  ++++ +L   YR EE YW QKSR QW+KEGD+
Sbjct: 687  KAMSSVNRTERRTISHIQRELTVAYRDEERYWQQKSRNQWMKEGDR 732


>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
            (fragment)
          Length = 1365

 Score =  152 bits (383), Expect = 2e-34
 Identities = 113/350 (32%), Positives = 169/350 (48%), Gaps = 2/350 (0%)
 Frame = +1

Query: 1    CRGLGGPSTISQLREELRFHLPNIVFLCETKK-KSFVHSVCTKLKMLSMWRVVEPRGLSG 177
            C+GL  P TI  L+E  + H P+I+FL ETK  + FV+ V   L        VEP G SG
Sbjct: 7    CQGLRNPWTIRYLKEMKKDHFPDILFLMETKNSQDFVYKVFCWLGY-DFIHTVEPEGRSG 65

Query: 178  GLMIGWSDRIVVKQVVLNEFCIQIEFEDSEIQQVCWGIFVYASIDKAVRRSQWVFLQQQR 357
            GL I W   + ++ +  ++  + ++   S   +V +   VY      +R   W  L    
Sbjct: 66   GLAIFWKSHLEIEFLYADKNLMDLQV--SSRNKVWFISCVYGLPVTHMRPKLWEHLNSIG 123

Query: 358  HFWGKFWFLGGDLNDIRDRSEKQGGTARSEGSFRGFRSFIQDMEMAEIQFNGALWTWANN 537
                + W L GD NDIR   EK GG  RS  SF+ F   + +  M E+   G  +TW  N
Sbjct: 124  LKRAEAWCLIGDFNDIRSNDEKLGGPRRSPSSFQCFEHMLLNCSMHELGSTGNSFTWGGN 183

Query: 538  RDGEGYVEERLDRFFASPEWLLQSPRAVVHHILKQTSDHAMLILDSQPPSKPRARRFLFD 717
            R+ + +V+ +LDR F +P W    P A    + K  SDH  +++     ++    +F +D
Sbjct: 184  RNDQ-WVQCKLDRCFGNPAWFSIFPNAHQWFLEKFGSDHRPVLVKFTNDNELFRGQFRYD 242

Query: 718  KRLLNLPQCEETVAVAWNKAQIGTPMFQVVSRIKACRVALLKLKGNAQLNSGKAIQAIKS 897
            KRL + P C E +  +WN A          S I+ CR A+   K ++  N+   I+ ++ 
Sbjct: 243  KRLDDDPYCIEVIHRSWNSAMSQGTHSSFFSLIE-CRRAISVWKHSSDTNAQSRIKRLRK 301

Query: 898  KMMAMQEKGGQRD-WQEWANLKLQLGEEYRKEEVYWCQKSRVQWLKEGDQ 1044
             + A  EK  Q   W     +K QL   Y  EE++W QKSR +WL  GD+
Sbjct: 302  DLDA--EKSIQIPCWPRIEYIKDQLSLAYGDEELFWRQKSRQKWLAGGDK 349


Top