BLASTX nr result

ID: Coptis21_contig00012050 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00012050
         (1084 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...   100   6e-19
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    99   2e-18
ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ...    97   9e-18
gb|EEE69144.1| hypothetical protein OsJ_28268 [Oryza sativa Japo...    96   2e-17
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...    96   2e-17

>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score =  100 bits (249), Expect = 6e-19
 Identities = 104/389 (26%), Positives = 161/389 (41%), Gaps = 35/389 (8%)
 Frame = -3

Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894
            W+ ++I+          I++I+L  +   +  IW     G++TV+S Y W +   P  N 
Sbjct: 1128 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 1186

Query: 893  PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738
            P+  P      +   IWN  I+ KLK       F+W+ L   L T   LT R M  + IC
Sbjct: 1187 PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPIC 1240

Query: 737  PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561
            PRC +  ESI+HAL TCP     W+ S  SL R Q           +S + +     + N
Sbjct: 1241 PRCHRENESINHALFTCPFATMAWWLSDSSLIRNQ----------LMSNDFEENISNILN 1290

Query: 560  LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444
                  +SDF           IW  RN ++F    +  S   L A++ T           
Sbjct: 1291 FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 1350

Query: 443  FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273
              PS  + + +  I     P  ++    D G+  + L  TGG  +    +   I +   K
Sbjct: 1351 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 1409

Query: 272  HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93
                S P E E +ALL  L++        V    DC+++++ +NG    SS  L +    
Sbjct: 1410 LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 1467

Query: 92   IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6
            I F    F   +F +  R   + AH+LA+
Sbjct: 1468 ISFWANKFASIQFGFIRRKGNKLAHVLAK 1496


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 94/372 (25%), Positives = 163/372 (43%), Gaps = 15/372 (4%)
 Frame = -3

Query: 1076 NNSWNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLPN 897
            N+ WN   +N+LF    +T I +I +      + W+W   +NG+FTV+S Y  Y +LL +
Sbjct: 978  NDRWNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAY--YHELLED 1035

Query: 896  ---SPSSTPIP-ANIWNIIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAICPRC 729
                PS++  P   +W  I K K P +++LF W+ +H+ L     +  R M  +  CPRC
Sbjct: 1036 RKTGPSTSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRC 1095

Query: 728  TQAPESIDHALLTCPSLATTWFTSPLSLRTQSHTAFLDTFLKISMETQPKDDILYNLTHF 549
             +  E+ +H +  C   +  W+ SPL + T +  A        S+    KD   + L  F
Sbjct: 1096 GEKEETTEHLIWGCDESSRAWYISPLRIHTGNIEAGSFRIWVESLLDTHKDTEWWAL--F 1153

Query: 548  ANLSDFIWLDRNQLIFTPNHKPLSSIKLVAQSTTSFLPSKPKPLCKYYIPKIFL------ 387
              +   IWL RN+ +F    K L+  ++V ++    +  + +  C +  P   L      
Sbjct: 1154 WMICWNIWLGRNKWVF--EKKKLAFQEVVERAVRGVMEFEEE--CAHTSPVETLNTHENG 1209

Query: 386  ---PELFLITIDGGYESLSKTG-GIGLTICKWSCDILFAGSKHCIA-SGPEEMEFQALLW 222
               P + ++ ++         G G+G  +     D+L A      A   P   E  +L +
Sbjct: 1210 WSVPPVGMVKLNVDAAVFKHVGIGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSLRY 1269

Query: 221  GLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSKIQFIRQSFGFCRFVYCS 42
            GL+ A +   + +V   DCK +   + G+ A        +   I ++        F +  
Sbjct: 1270 GLKVAYEAGFRNLVVEMDCKKLFLQLRGK-ASDVTPFGRVVDDILYLASKCSNVVFEHVK 1328

Query: 41   RSETQHAHLLAE 6
            R   + AHLLA+
Sbjct: 1329 RHCNKVAHLLAQ 1340


>ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana]
            gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR
            reverse transcriptase [Arabidopsis thaliana]
            gi|332641254|gb|AEE74775.1| RNase H domain-containing
            protein [Arabidopsis thaliana]
          Length = 484

 Score = 96.7 bits (239), Expect = 9e-18
 Identities = 103/389 (26%), Positives = 159/389 (40%), Gaps = 35/389 (8%)
 Frame = -3

Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894
            W+ ++I+          I++I+L  +   +  IW     G++TV+S Y W +   P  N 
Sbjct: 88   WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 146

Query: 893  PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738
            P+  P      +   IWN  I+ KLK       F+W+ L   L T   LT R M  +  C
Sbjct: 147  PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPSC 200

Query: 737  PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561
            PRC +  ESI+HAL TCP     W  S  SL R Q           +S + +     + N
Sbjct: 201  PRCHRENESINHALFTCPFATMAWRLSDSSLIRNQ----------LMSNDFEENISNILN 250

Query: 560  LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444
                  +SDF           IW  RN ++F    +  S   L A++ T           
Sbjct: 251  FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 310

Query: 443  FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273
              PS  + + +  I     P  ++    D G+  + L  TGG  +    +   I +   K
Sbjct: 311  KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 369

Query: 272  HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93
                S P E E +ALL  L++        V    DC+++++ +NG    SS  L +    
Sbjct: 370  LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 427

Query: 92   IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6
            I F    F   +F +  R   + AH+LA+
Sbjct: 428  ISFWANKFASIQFGFIRRKGNKLAHVLAK 456


>gb|EEE69144.1| hypothetical protein OsJ_28268 [Oryza sativa Japonica Group]
          Length = 1256

 Score = 95.9 bits (237), Expect = 2e-17
 Identities = 94/393 (23%), Positives = 159/393 (40%), Gaps = 37/393 (9%)
 Frame = -3

Query: 1076 NNSWNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYM---QL 906
            + SW++++I+  F    A  I  IHL      +   W  +R G+F+V+S Y+  +   Q+
Sbjct: 843  SGSWDNDKISHHFLPMDAEAILNIHLSSRLEEDFIAWHPDRLGRFSVRSAYHLAVALAQV 902

Query: 905  LPNSPSSTPIPANIWNIIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAICPRCT 726
               S SS    +   N + K  +P ++++F W+ + + L T      R +  + IC  C 
Sbjct: 903  NDGSSSSGNGSSRACNALWKCNAPQKVKIFAWRAITNSLTTLENKKKRKLEVSDICTICG 962

Query: 725  QAPESIDHALLTCPSLATTW--FTSPLSLRTQS--HTAFLDTFLKISMETQPKDDILYNL 558
               E + HAL  CP     W   +  + L   S  H       L +  ++QP+D ++  +
Sbjct: 963  VESEYVVHALFHCPHARQLWEAMSDDMQLNPLSNIHNGDSKVILDLLEQSQPEDQVMLLM 1022

Query: 557  THFANLSDFIWLDRNQLIFTPNHKPLSSIKLVAQSTTSFL-------------PSKPKPL 417
              +      IW  RN+++   + KP   I +  +   S++             P K K +
Sbjct: 1023 VLWR-----IWHTRNEIV---HGKPAPGILVSKRFIESYVLSLAEIKQHPQANPEKGKHV 1074

Query: 416  CKYYIPKIF----------------LPELFLITIDGGYESLSKTGGIGLTICKWSCDILF 285
                + K                  LP    + +DG ++     GGIG  +   + +++F
Sbjct: 1075 VDVVLKKSHSIKRSREPAPDKWSKPLPGSMKLNVDGSFQESEGKGGIGAVLRNCTGEVIF 1134

Query: 284  AGSKHC-IASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELE 108
            A   H    S   EME  A   GL  AL   +  +V  +DC +MV          S EL 
Sbjct: 1135 AACGHVDHCSSALEMELLASRDGLALALQWTLLPIVIETDCLAMVHLFRDATGAKS-ELA 1193

Query: 107  DLRSKIQFIRQSFGFCRFVYCSRSETQHAHLLA 9
             L ++I  +        F  C RS+   +H LA
Sbjct: 1194 FLITEIDSLLVGNRDISFNKCLRSQNLISHCLA 1226


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score = 95.5 bits (236), Expect = 2e-17
 Identities = 102/389 (26%), Positives = 159/389 (40%), Gaps = 35/389 (8%)
 Frame = -3

Query: 1067 WNHNRINSLFTAEIATEIYKIHLDHNCVSEPWIWTTERNGKFTVKSTYNWYMQLLP--NS 894
            W+ ++I+          I++I+L  +   +  IW     G++TV+S Y W +   P  N 
Sbjct: 1354 WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGY-WLLTHDPSTNI 1412

Query: 893  PSSTP------IPANIWN--IIRKLKSPLRIQLFIWQLLHSILPTKLFLTHRHMATNAIC 738
            P+  P      +   IWN  I+ KLK       F+W+ L   L T   LT R M  +  C
Sbjct: 1413 PAINPPHGSIDLKTRIWNLPIMPKLKH------FLWRALSQALATTERLTTRGMRIDPSC 1466

Query: 737  PRCTQAPESIDHALLTCPSLATTWFTSPLSL-RTQSHTAFLDTFLKISMETQPKDDILYN 561
            PRC +  ESI+HAL TCP     W  S  SL R Q           +S + +     + N
Sbjct: 1467 PRCHRENESINHALFTCPFATMAWRLSDSSLIRNQ----------LMSNDFEENISNILN 1516

Query: 560  LTHFANLSDF-----------IWLDRNQLIFTPNHKPLSSIKLVAQSTT----------S 444
                  +SDF           IW  RN ++F    +  S   L A++ T           
Sbjct: 1517 FVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 1576

Query: 443  FLPSKPKPLCKYYIPKIFLPELFL-ITIDGGY--ESLSKTGGIGLTICKWSCDILFAGSK 273
              PS  + + +  I     P  ++    D G+  + L  TGG  +    +   I +   K
Sbjct: 1577 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGG-WIIRNHYGTPISWGSMK 1635

Query: 272  HCIASGPEEMEFQALLWGLEKALDLDIKMVVFVSDCKSMVDAVNGQIACSSWELEDLRSK 93
                S P E E +ALL  L++        V    DC+++++ +NG    SS  L +    
Sbjct: 1636 LAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSS--LANHLED 1693

Query: 92   IQFIRQSFGFCRFVYCSRSETQHAHLLAE 6
            I F    F   +F +  +   + AH+LA+
Sbjct: 1694 ISFWANKFASIQFGFIRKKGNKLAHVLAK 1722

