BLASTX nr result

ID: Coptis21_contig00036546 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00036546
         (735 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   162   5e-38
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       154   2e-35
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           152   7e-35
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   152   7e-35
dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ...   152   7e-35

>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  162 bits (411), Expect = 5e-38
 Identities = 83/207 (40%), Positives = 128/207 (61%), Gaps = 2/207 (0%)
 Frame = -3

Query: 727  DCLPLIEKITSRISNWKNKVLNRAGRVQLVQSVLSSFQTYWAKTFVLPKAVLEKVGKICN 548
            D LPL+EKI +RI++W N+ L+ AGR+QL++SVLSS   +W   F LPKA L+++ K+ +
Sbjct: 932  DYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFS 991

Query: 547  RFIWAGPNMERKMHHSSHATLRVDKEQGGLGLIDPKCWNQAAYCGLVFKLVQREDSLWAK 368
             F+W+GP++  K    + + +   KE+GGLGL   K  N+ +   L+++++   DSLW K
Sbjct: 992  AFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVK 1051

Query: 367  WSWEHHIKNKHFWTTKTPQDC-SWVWRGILKHREVAWRFVRHSIANGKEMSFWHDPWCS- 194
            W  +H I+ + FW+ K      SW+WR ILK R+ A  F R  + +G   SFWHD WC  
Sbjct: 1052 WVNKHLIRKETFWSVKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPL 1111

Query: 193  GPLYLNAQAKSLIANLVPQEAKVADVI 113
            G L+ +  ++  I   +P  A VA+V+
Sbjct: 1112 GRLHQHMGSRGTIDLGIPNNATVAEVM 1138


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  154 bits (388), Expect = 2e-35
 Identities = 81/237 (34%), Positives = 125/237 (52%), Gaps = 2/237 (0%)
 Frame = -3

Query: 718  PLIEKITSRISNWKNKVLNRAGRVQLVQSVLSSFQTYWAKTFVLPKAVLEKVGKICNRFI 539
            PL+EKIT+R  +W NK L+ AGR+QL+ SV+     +W  TF+LPK  ++++  +C+RF+
Sbjct: 781  PLLEKITARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFL 840

Query: 538  WAGPNMERKMHHSSHATLRVDKEQGGLGLIDPKCWNQAAYCGLVFKLVQREDSLWAKWSW 359
            W+G   + K    S A L + K +GGLGL     WN+     L+++L   +DSLWA W  
Sbjct: 841  WSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQH 900

Query: 358  EHHIKNKHFWTTKTPQDCSWVWRGILKHREVAWRFVRHSIANGKEMSFWHDPWCS-GPLY 182
             HH+    FW  +  Q  SW W+ +L  R +A +F+   + NG +  +W+D W S GPL+
Sbjct: 901  LHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLF 960

Query: 181  LNAQAKSLIANLVPQEAKVADVITNGCWSRTIT-ELPETEVKQWIMETEINSYLTED 14
                     +  VP  AKVA   +   W   ++   P   +   +    + S   ED
Sbjct: 961  RIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQED 1017


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  152 bits (384), Expect = 7e-35
 Identities = 81/224 (36%), Positives = 121/224 (54%), Gaps = 7/224 (3%)
 Frame = -3

Query: 733  IRDCLPLIEKITSRISNWKNKVLNRAGRVQLVQSVLSSFQTYWAKTFVLPKAVLEKVGKI 554
            I D  PL+EK+++R+ +W +K L+ AGR QL+ SV+     +W  TF+LPK  ++K+  +
Sbjct: 636  IADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESL 695

Query: 553  CNRFIWAGPNMERKMHHSSHATLRVDKEQGGLGLIDPKCWNQAAYCGLVFKLVQREDSLW 374
            C++F+WAG    RK    S     + K +GGLG      WN+     L++ L  R+ SLW
Sbjct: 696  CSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLW 755

Query: 373  AKWSWEHHIKNKHFWTTKTPQDCSWVWRGILKHREVAWRFVRHSIANGKEMSFWHDPWCS 194
            A+W   H + +  FW     Q   W W+ +L  R +A +F++  + NG  +SFW D W S
Sbjct: 756  AQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTS 815

Query: 193  -GPL--YLNAQAKSLIANLVPQEAKVADVITNGCW----SRTIT 83
             GPL  YL       +   +P  AKVAD I    W    SR++T
Sbjct: 816  LGPLIKYLGDVGSRPLR--IPFSAKVADAIDGSGWRLPLSRSLT 857


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  152 bits (384), Expect = 7e-35
 Identities = 81/224 (36%), Positives = 121/224 (54%), Gaps = 7/224 (3%)
 Frame = -3

Query: 733  IRDCLPLIEKITSRISNWKNKVLNRAGRVQLVQSVLSSFQTYWAKTFVLPKAVLEKVGKI 554
            I D  PL+EK+++R+ +W +K L+ AGR QL+ SV+     +W  TF+LPK  ++K+  +
Sbjct: 636  IADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESL 695

Query: 553  CNRFIWAGPNMERKMHHSSHATLRVDKEQGGLGLIDPKCWNQAAYCGLVFKLVQREDSLW 374
            C++F+WAG    RK    S     + K +GGLG      WN+     L++ L  R+ SLW
Sbjct: 696  CSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLW 755

Query: 373  AKWSWEHHIKNKHFWTTKTPQDCSWVWRGILKHREVAWRFVRHSIANGKEMSFWHDPWCS 194
            A+W   H + +  FW     Q   W W+ +L  R +A +F++  + NG  +SFW D W S
Sbjct: 756  AQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTS 815

Query: 193  -GPL--YLNAQAKSLIANLVPQEAKVADVITNGCW----SRTIT 83
             GPL  YL       +   +P  AKVAD I    W    SR++T
Sbjct: 816  LGPLIKYLGDVGSRPLR--IPFSAKVADAIDGSGWRLPLSRSLT 857


>dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis
            thaliana]
          Length = 1223

 Score =  152 bits (384), Expect = 7e-35
 Identities = 86/247 (34%), Positives = 128/247 (51%), Gaps = 6/247 (2%)
 Frame = -3

Query: 727  DCLPLIEKITSRISNWKNKVLNRAGRVQLVQSVLSSFQTYWAKTFVLPKAVLEKVGKICN 548
            DCLPL+E++  RI +W ++ L+ AGR+ L+ SVL S   +W   F LP+  + ++ K+C+
Sbjct: 785  DCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKCIRELEKMCS 844

Query: 547  RFIWAGPNMERKMHHSSHATLRVDKEQGGLGLIDPKCWNQAAYCGLVFKLVQREDSLWAK 368
             F+W+G  M       S   +   K++GGLGL   K  N      LV+K+V   +SLW K
Sbjct: 845  AFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVK 904

Query: 367  WSWEHHIKNKHFWTTK-TPQDCSWVWRGILKHREVAWRFVRHSIANGKEMSFWHDPWCS- 194
            W  +H ++N  FW  K T    SW+W+ +LK+REVA    +  + NGK+ SFW+D W   
Sbjct: 905  WVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDL 964

Query: 193  GPLYLNAQAKSLIANLVPQEAKVADVITNGCWSR----TITELPETEVKQWIMETEINSY 26
            G L      + LI   + +   V +  TN    R        + +   K W   TE    
Sbjct: 965  GQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSWDTRTE---- 1020

Query: 25   LTEDKVV 5
             TEDKV+
Sbjct: 1021 -TEDKVL 1026


Top