BLASTX nr result

ID: Cephaelis21_contig00031245 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00031245
         (1502 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]              159   2e-36
sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr...   159   2e-36
gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas...   148   3e-33
gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis ...   138   4e-30
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta...   134   5e-29

>gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]
          Length = 1055

 Score =  159 bits (401), Expect = 2e-36
 Identities = 109/371 (29%), Positives = 172/371 (46%), Gaps = 4/371 (1%)
 Frame = +1

Query: 364  GAQDQVWWGLSNDGRFSIKSVMIFLXXXXXXXXXXXXLWKLVWAWKGPEKLRTFLWLIVH 543
            GA+D++ W  S DG+FS++S    L             +  +W  + PE+++TFLWL+ +
Sbjct: 359  GARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNMASFFNCLWKVRVPERVKTFLWLVGN 418

Query: 544  RRLPTAKLCFDRKVIQSSRCHRCGIDPENIIHVLRDCSFAKGVWLQLVHQNKWESFFSLT 723
            + + T +    R +  S+ C  C    E+++HVLRDC    G+W+++V Q + + FFS +
Sbjct: 419  QAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQQGFFSKS 478

Query: 724  VQDWIRQNLVDDYGSEPGLQSPWKLRFVVACWSLWIWRNNFIFSEDAVSQEQPLXXXXXX 903
            + +W+  NL D  G E     PW   F V  W  W WR   IF E+   +++        
Sbjct: 479  LFEWLYDNLGDRSGCE---DIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVKFVKEWA 535

Query: 904  XXXXXXSSTTLGKIPGICKHKINGWCAWTPPELGWIKVNCNGAVDWVRGIATIGGVLRDS 1083
                   S  +  + GI + ++     W  P +GW+KVN +GA     G+A+ GGVLRD 
Sbjct: 536  VEVYRAHSGNV--LVGITQPRVERMIGWVSPCVGWVKVNTDGASRGNPGLASAGGVLRDC 593

Query: 1084 EGN*ISGFSQVIEFGDVLSTKLKTLMMGFSLARRMGYVKIQLEPDSQSAVDLVKRRIEGW 1263
             G    GFS  I        +L  +  G   A      +++LE DS+  V  +K  I   
Sbjct: 594  TGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIVGFLKTGI--- 650

Query: 1264 DTDMHILEEVGSVWESRLG----LNISHVVRECNSIADWLTRRGSVGSQGLRI*PSPPIQ 1431
             +D H L  +  +    L     + I HV RE N +AD L       S G       P  
Sbjct: 651  -SDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLANYAFSLSLGFHSFDLVPDA 709

Query: 1432 VVEVVCKETLG 1464
            +  ++ ++TLG
Sbjct: 710  MSSLLREDTLG 720


>sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750
          Length = 620

 Score =  159 bits (401), Expect = 2e-36
 Identities = 109/371 (29%), Positives = 172/371 (46%), Gaps = 4/371 (1%)
 Frame = +1

Query: 364  GAQDQVWWGLSNDGRFSIKSVMIFLXXXXXXXXXXXXLWKLVWAWKGPEKLRTFLWLIVH 543
            GA+D++ W  S DG+FS++S    L             +  +W  + PE+++TFLWL+ +
Sbjct: 250  GARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNMASFFNCLWKVRVPERVKTFLWLVGN 309

Query: 544  RRLPTAKLCFDRKVIQSSRCHRCGIDPENIIHVLRDCSFAKGVWLQLVHQNKWESFFSLT 723
            + + T +    R +  S+ C  C    E+++HVLRDC    G+W+++V Q + + FFS +
Sbjct: 310  QAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQQGFFSKS 369

Query: 724  VQDWIRQNLVDDYGSEPGLQSPWKLRFVVACWSLWIWRNNFIFSEDAVSQEQPLXXXXXX 903
            + +W+  NL D  G E     PW   F V  W  W WR   IF E+   +++        
Sbjct: 370  LFEWLYDNLGDRSGCE---DIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVKFVKEWA 426

Query: 904  XXXXXXSSTTLGKIPGICKHKINGWCAWTPPELGWIKVNCNGAVDWVRGIATIGGVLRDS 1083
                   S  +  + GI + ++     W  P +GW+KVN +GA     G+A+ GGVLRD 
Sbjct: 427  VEVYRAHSGNV--LVGITQPRVERMIGWVSPCVGWVKVNTDGASRGNPGLASAGGVLRDC 484

Query: 1084 EGN*ISGFSQVIEFGDVLSTKLKTLMMGFSLARRMGYVKIQLEPDSQSAVDLVKRRIEGW 1263
             G    GFS  I        +L  +  G   A      +++LE DS+  V  +K  I   
Sbjct: 485  TGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIVGFLKTGI--- 541

Query: 1264 DTDMHILEEVGSVWESRLG----LNISHVVRECNSIADWLTRRGSVGSQGLRI*PSPPIQ 1431
             +D H L  +  +    L     + I HV RE N +AD L       S G       P  
Sbjct: 542  -SDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLANYAFSLSLGFHSFDLVPDA 600

Query: 1432 VVEVVCKETLG 1464
            +  ++ ++TLG
Sbjct: 601  MSSLLREDTLG 611


>gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase); Polynucleotidyl
            transferase, Ribonuclease H fold [Medicago truncatula]
          Length = 729

 Score =  148 bits (374), Expect = 3e-33
 Identities = 93/331 (28%), Positives = 147/331 (44%), Gaps = 1/331 (0%)
 Frame = +1

Query: 235  VFSQLSVEHFFNQTEDRRREYFQELLPMDVCHRIEKFSELLNPGAQDQVWWGLSNDGRFS 414
            V + +SV    N +      +  + L +D+ ++I       +    D + WG +N  +F+
Sbjct: 386  VDTTISVRDAINTSGVWDLNFLMDNLHVDIVNQILALPTPSDFDGPDTIGWGGTNTLKFT 445

Query: 415  IKSVMIFLXXXXXXXXXXXXLWKLVWAWKGPEKLRTFLWLIVHRRLPTAKLCFDRKVIQS 594
            ++S                  WK +W WKGP +++TF+WL  H R+ T        V  S
Sbjct: 446  VQSAYNLQQENPFAVGGD---WKTLWNWKGPHRIQTFIWLAAHGRILTNYRRSKWGVGIS 502

Query: 595  SRCHRCGIDPENIIHVLRDCSFAKGVWLQLVHQNKWESFFSLTVQDWIRQNL-VDDYGSE 771
              C  C  + E +IHVLRDC  +  VWL+L+  N   +FFS   ++W+  NL     G  
Sbjct: 503  PTCPCCAREDETVIHVLRDCVHSTQVWLRLIPHNYITNFFSFDCREWVFNNLNKKGIGDN 562

Query: 772  PGLQSPWKLRFVVACWSLWIWRNNFIFSEDAVSQEQPLXXXXXXXXXXXXSSTTLGKIPG 951
            P   + W+  F+  CW LW WRN  IF         P             ++  + K   
Sbjct: 563  P---ATWQTTFMTTCWYLWNWRNKSIFEIGFQRPSNPTLVIQKFTREIEDNTKLVHKSS- 618

Query: 952  ICKHKINGWCAWTPPELGWIKVNCNGAVDWVRGIATIGGVLRDSEGN*ISGFSQVIEFGD 1131
                K   +  W  P  GW+K+NC+GA      +A  GG+LRDS+G  I G+ + I   D
Sbjct: 619  --HQKETIYIGWMRPPFGWVKLNCDGAWKGSGTLAGCGGLLRDSDGRWIKGYFKKIGMCD 676

Query: 1132 VLSTKLKTLMMGFSLARRMGYVKIQLEPDSQ 1224
                ++  + +G  +A R     + +E DS+
Sbjct: 677  AFHAEMWGMYLGLDMAWRENTTHLIVESDSK 707


>gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis cebennensis]
          Length = 799

 Score =  138 bits (347), Expect = 4e-30
 Identities = 100/347 (28%), Positives = 156/347 (44%), Gaps = 6/347 (1%)
 Frame = +1

Query: 364  GAQDQVWWGLSNDGRFSIKSVMIFLXXXXXXXXXXXXLWKLVWAWKGPEKLRTFLWLIVH 543
            G +D++ WG S DG+F++KS    L            ++  VW    PE+++ FLWL VH
Sbjct: 429  GVRDRLSWGESLDGKFTVKSAYSSLMKDNTPRSDLSRMYDRVWKVIAPERVKIFLWLGVH 488

Query: 544  RRLPTAKLCFDRKVIQSSRCHRCGIDPENIIHVLRDCSFAKGVWLQLVHQNKWESFFSLT 723
            + + T      R +  +  C  C    E IIH+LRDC    G+W +++ + K  +FF+ T
Sbjct: 489  QVIMTNMERQRRHLSDTGICQVCKGGDETIIHILRDCPAMYGIWTRIIPRRKRGTFFTQT 548

Query: 724  VQDWIRQNLVDDYGSEPGLQSPWKLRFVVACWSLWIWRNNFIFSEDAVSQEQP--LXXXX 897
            + +W+  NL  D G   G   PW        W  W WR   +F  +   ++    +    
Sbjct: 549  LLEWVYDNL-GDSGKVDG--CPWS----CVRWG-WKWRCGNVFGVNGKCRDMVKFVRDRA 600

Query: 898  XXXXXXXXSSTTLGKIPGICKHKINGWCAWTPPELGWIKVNCNGAVDWVRGIATIGGVLR 1077
                         GKI G    +I    +W  P  GW+K+N +GA     G+AT GG+LR
Sbjct: 601  SEVIQAHLVEGNSGKIRG----RIERMVSWVKPAEGWLKLNTDGASKGNPGLATAGGILR 656

Query: 1078 DSEGN*ISGFSQVIEFGDVLSTKLKTLMMGFSLARRMGYVKIQLEPDSQSAVDLVKRRIE 1257
              +G+ I GF+  I        +L  +  G  +A      ++++E DS    +LV   + 
Sbjct: 657  QQDGSWIGGFAVNIGICSAPLAELWRVYYGLYIAWERKITRLEVEVDS----ELVVGFLT 712

Query: 1258 GWDTDMHILEEVGSVWESRLG----LNISHVVRECNSIADWLTRRGS 1386
             W +D H L  +  +    +     + ISHV RE N +AD L    S
Sbjct: 713  TWISDSHPLSFLVRLCYGFISRDWIVRISHVYREANRLADGLANYAS 759


>gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 321

 Score =  134 bits (338), Expect = 5e-29
 Identities = 94/325 (28%), Positives = 146/325 (44%)
 Frame = +1

Query: 505  PEKLRTFLWLIVHRRLPTAKLCFDRKVIQSSRCHRCGIDPENIIHVLRDCSFAKGVWLQL 684
            PE++R FLWL+V + + T    + R +  +  C  C    E I+HVLRDC    G+W +L
Sbjct: 3    PERVRVFLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSRL 62

Query: 685  VHQNKWESFFSLTVQDWIRQNLVDDYGSEPGLQSPWKLRFVVACWSLWIWRNNFIFSEDA 864
            V +++   FF+ ++ +WI +NL +        +  W   FV+A W  W WR   IF  + 
Sbjct: 63   VPRDQIRQFFTASLLEWIYKNLRE--------RGSWPTVFVMAVWWGWKWRCGNIFGGNG 114

Query: 865  VSQEQPLXXXXXXXXXXXXSSTTLGKIPGICKHKINGWCAWTPPELGWIKVNCNGAVDWV 1044
              +++              ++   G    +   ++    +W  PE GW+K+N +GA    
Sbjct: 115  KCRDRVKFIKDLAEEVAIANAFVKGN--EVRVSRVERLVSWVSPEDGWVKLNTDGASRGN 172

Query: 1045 RGIATIGGVLRDSEGN*ISGFSQVIEFGDVLSTKLKTLMMGFSLARRMGYVKIQLEPDSQ 1224
             G AT GGVLRD  G  I GF+  I        +L  +  G  +A   G  +++LE DS+
Sbjct: 173  PGFATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVDSK 232

Query: 1225 SAVDLVKRRIEGWDTDMHILEEVGSVWESRLGLNISHVVRECNSIADWLTRRGSVGSQGL 1404
              V  +   I        +L            + ISHV RE N +AD L       S GL
Sbjct: 233  MVVGFLTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLANYAFSLSLGL 292

Query: 1405 RI*PSPPIQVVEVVCKETLGLQYLR 1479
             +  S P  V  ++  +  G+ Y R
Sbjct: 293  HLLESRPDVVSSILLDDVAGVSYPR 317