BLASTX nr result

ID: Coptis25_contig00030602 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00030602
         (1086 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...    76   2e-11
ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ...    72   2e-10
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...    71   5e-10
gb|ABD65056.1| hypothetical protein 27.t00122 [Brassica oleracea]      65   4e-08
gb|AAD17398.1| putative non-LTR retroelement reverse transcripta...    60   7e-07

>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 15/282 (5%)
 Frame = +1

Query: 1    LPTNKILHDRHIKDQDICPRCLQHCETIEHALFSCPKLQILWYTGPLSLRPETWNPHLTT 180
            L T + L  R ++   ICPRC +  E+I HALF+CP   + W+    SL        L +
Sbjct: 1223 LATTERLTTRGMRIDPICPRCHRENESINHALFTCPFATMAWWLSDSSL----IRNQLMS 1278

Query: 181  KDLIISIIT-QNHTKDTVIQNLSHLLNI--AHFIWTDRNNIVYKSNLHPIDVPRLLSQAQ 351
             D   +I    N  +DT + +   LL +     IW  RNN+V+           L ++A+
Sbjct: 1279 NDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAE 1338

Query: 352  ----------HSMLPTKPPPLIQYFIPNNLTPNLHLIAT-DGSFDPTTQKSGIGFTI-NK 495
                      H   P+    + +  I     P  ++    D  FD    ++  G+ I N 
Sbjct: 1339 THDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNH 1398

Query: 496  WNDDLLFAGSKQSDVHGAEEAEFQALK*ALQKTSQEGLSCVLVCSDCRSLVNGVNGRSDD 675
            +   + +   K +      EAE +AL  ALQ+T   G + V +  DC++L+N +NG S  
Sbjct: 1399 YGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFH 1458

Query: 676  VSWQLESTLLELVDLKMSFAFCQVVFCPRNLLQHAHLLAELG 801
             S  L + L ++      FA  Q  F  R   + AH+LA+ G
Sbjct: 1459 SS--LANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYG 1498


>ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana]
            gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR
            reverse transcriptase [Arabidopsis thaliana]
            gi|332641254|gb|AEE74775.1| RNase H domain-containing
            protein [Arabidopsis thaliana]
          Length = 484

 Score = 72.0 bits (175), Expect = 2e-10
 Identities = 77/282 (27%), Positives = 121/282 (42%), Gaps = 15/282 (5%)
 Frame = +1

Query: 1    LPTNKILHDRHIKDQDICPRCLQHCETIEHALFSCPKLQILWYTGPLSLRPETWNPHLTT 180
            L T + L  R ++    CPRC +  E+I HALF+CP   + W     SL        L +
Sbjct: 183  LATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSL----IRNQLMS 238

Query: 181  KDLIISIIT-QNHTKDTVIQNLSHLLNI--AHFIWTDRNNIVYKSNLHPIDVPRLLSQAQ 351
             D   +I    N  +DT + +   LL +     IW  RNN+V+           L ++A+
Sbjct: 239  NDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAE 298

Query: 352  ----------HSMLPTKPPPLIQYFIPNNLTPNLHLIAT-DGSFDPTTQKSGIGFTI-NK 495
                      H   P+    + +  I     P  ++    D  FD    ++  G+ I N 
Sbjct: 299  THDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNH 358

Query: 496  WNDDLLFAGSKQSDVHGAEEAEFQALK*ALQKTSQEGLSCVLVCSDCRSLVNGVNGRSDD 675
            +   + +   K +      EAE +AL  ALQ+T   G + V +  DC++L+N +NG S  
Sbjct: 359  YGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFH 418

Query: 676  VSWQLESTLLELVDLKMSFAFCQVVFCPRNLLQHAHLLAELG 801
             S  L + L ++      FA  Q  F  R   + AH+LA+ G
Sbjct: 419  SS--LANHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKYG 458


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score = 70.9 bits (172), Expect = 5e-10
 Identities = 76/282 (26%), Positives = 121/282 (42%), Gaps = 15/282 (5%)
 Frame = +1

Query: 1    LPTNKILHDRHIKDQDICPRCLQHCETIEHALFSCPKLQILWYTGPLSLRPETWNPHLTT 180
            L T + L  R ++    CPRC +  E+I HALF+CP   + W     SL        L +
Sbjct: 1449 LATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSL----IRNQLMS 1504

Query: 181  KDLIISIIT-QNHTKDTVIQNLSHLLNI--AHFIWTDRNNIVYKSNLHPIDVPRLLSQAQ 351
             D   +I    N  +DT + +   LL +     IW  RNN+V+           L ++A+
Sbjct: 1505 NDFEENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAE 1564

Query: 352  ----------HSMLPTKPPPLIQYFIPNNLTPNLHLIAT-DGSFDPTTQKSGIGFTI-NK 495
                      H   P+    + +  I     P  ++    D  FD    ++  G+ I N 
Sbjct: 1565 THDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNH 1624

Query: 496  WNDDLLFAGSKQSDVHGAEEAEFQALK*ALQKTSQEGLSCVLVCSDCRSLVNGVNGRSDD 675
            +   + +   K +      EAE +AL  ALQ+T   G + V +  DC++L+N +NG S  
Sbjct: 1625 YGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFH 1684

Query: 676  VSWQLESTLLELVDLKMSFAFCQVVFCPRNLLQHAHLLAELG 801
             S  L + L ++      FA  Q  F  +   + AH+LA+ G
Sbjct: 1685 SS--LANHLEDISFWANKFASIQFGFIRKKGNKLAHVLAKYG 1724


>gb|ABD65056.1| hypothetical protein 27.t00122 [Brassica oleracea]
          Length = 239

 Score = 64.7 bits (156), Expect = 4e-08
 Identities = 56/218 (25%), Positives = 87/218 (39%), Gaps = 8/218 (3%)
 Frame = +1

Query: 19  LHDRHIKDQDICPRCLQHCETIEHALFSCPKLQILWYTGPLSLRPETWNPHLTTKDLIIS 198
           L DRH      CPRC QH ET+ H LF CP     W    L + P    P  +  D    
Sbjct: 15  LADRHCHPNRTCPRCGQHEETVNHMLFECPFATQTWSLETLPIEPREL-PRPSIFDNFDY 73

Query: 199 IITQNHTKDTVIQNLSHLLNIAHFIWTDRNNIVYKS-NLHPIDVPRLLSQAQHS-----M 360
           ++ + H ++   + L+ +  I  F+W  RN  V+ + ++ P++V +  +    S     +
Sbjct: 74  LLHRIHKRNGTEECLARIPWILWFLWKARNEKVFNNKDISPLEVFQSAASEAASWRVAQI 133

Query: 361 LPTKPP--PLIQYFIPNNLTPNLHLIATDGSFDPTTQKSGIGFTINKWNDDLLFAGSKQS 534
           +P  P     +    P    P  H    D S+     + G GF +   +   LF     +
Sbjct: 134 IPEAPEVNDNLSVLEPQYRPPQRHFFRVDASWKEDDARYGGGFVMENEDGSTLFGSFPSN 193

Query: 535 DVHGAEEAEFQALK*ALQKTSQEGLSCVLVCSDCRSLV 648
            V     AEF  L  A++     G   +   SD   LV
Sbjct: 194 RVLPPLHAEFGTLLWAMKSLLTLGHVSMAFESDRMQLV 231


>gb|AAD17398.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1225

 Score = 60.5 bits (145), Expect = 7e-07
 Identities = 61/257 (23%), Positives = 106/257 (41%), Gaps = 12/257 (4%)
 Frame = +1

Query: 1    LPTNKILHDRHIKDQDICPRCLQHCETIEHALFSCPKLQILWYTGPLSLRPETW--NPHL 174
            L TN+ L  RHI  + +CPRC    E+I H LF CP  + +W   P+      +  N   
Sbjct: 947  LATNQRLFSRHIGTEKVCPRCGAEEESINHLLFLCPPSRQIWALSPIPSSEYIFPRNSLF 1006

Query: 175  TTKDLIISIITQNHTKDTVIQNLSHLLNIAHFIWTDRNNIVYKSNLH--------PIDVP 330
               D ++S   +    + +++    +L    +IW  RN  ++++ +          I   
Sbjct: 1007 YNFDFLLSRGKEFDIAEDIMEIFPWIL---WYIWKSRNRFIFENVIESPQVILDFAIQEA 1063

Query: 331  RLLSQAQHSMLPTK-PPPLIQYFIPNNLTPNLHLIATDGSFDPTTQKSGIGFTINKWNDD 507
             +  QA    + T+ PPP +   +P NL P  ++   D S+      SG G+ +   +  
Sbjct: 1064 NVWKQANSKEVATEYPPPQV---VPANLPPTRNVCQFDASWHLKDTLSGHGWVLVDQDIV 1120

Query: 508  LLFAGSKQSDVHGAEEAEFQALK*ALQKTSQEGLSCVLVCSDCRSLVNGVNGRSDDVSWQ 687
            LL              AE  +L  A++     G+S     SD    ++ +   S+  ++ 
Sbjct: 1121 LLLGLKSARKSLSPLHAEVDSLLWAMECMISLGVSDCSFASDSADFISLLENPSEWPTFV 1180

Query: 688  LE-STLLELVDLKMSFA 735
             E +T   LV    SF+
Sbjct: 1181 AELATFSSLVCFFPSFS 1197


Top