BLASTX nr result

ID: Coptis25_contig00010210 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00010210
         (1085 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    88   4e-15
ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ...    79   2e-12
gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indi...    79   2e-12
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...    79   2e-12
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    78   3e-12

>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
            gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis
            thaliana] gi|7269807|emb|CAB79667.1| putative protein
            [Arabidopsis thaliana] gi|67633766|gb|AAY78807.1|
            putative reverse transcriptase/RNA-dependent DNA
            polymerase [Arabidopsis thaliana]
            gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein
            [Arabidopsis thaliana]
          Length = 575

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 86/346 (24%), Positives = 135/346 (39%), Gaps = 12/346 (3%)
 Frame = -1

Query: 1028 LNVNSLFSPALPTWNQNRLG*LFPFQIIQSINRVHIRPLNLPDQII*PYAKNGNISVNLA 849
            L V+ L   +   W ++ +  LFP    + I  +      + D     Y  +G+ +V   
Sbjct: 169  LKVSDLIDESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSG 228

Query: 848  YFFLSKQDNLSFSS------SFNPLIVQKLWKLKIDAATQLFIWKLYSGGLPTGDVLHKC 687
            Y+ L++  N   S       S NP I QK+WK +     Q F+WK  S  LP    L   
Sbjct: 229  YWVLTQIINKRSSPQEVSEPSLNP-IYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYR 287

Query: 686  KFKGDITCSFCQKCIETALHALFSYYWVKLMWNVSNAMLNISSHDDWEVND*LNLLISWC 507
                +  C  C  C ET  H LF   + +L W +S+  + +    +W  +  +NL   W 
Sbjct: 288  HLSKESACIRCPSCKETVNHLLFKCTFARLTWAISSIPIPLGG--EWADSIYVNLY--WV 343

Query: 506  F---DKDEDISRRGYFCIQFLYELWLTRNNARMEQKPINTKLILNLSA--LKRNRNHSAF 342
            F   + +    +        L+ LW  RN      +  N + +L  +   L+  R  +  
Sbjct: 344  FNLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEA 403

Query: 341  LTCPPFGNIALASQPLCFNFNTWSSPSVPWVKVNFDTALDKSHYSGAAGAIARDSEGVIF 162
             +C   G     ++  C     W  P   WVK N D   ++ +     G + R+ +G + 
Sbjct: 404  ESC---GTKPQVNRSSC---GRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVK 457

Query: 161  GAAFRNF-FANDHEQAKAIGAEVAVLLALRLQLNIVIFEGDCQTLI 27
                R         +A+      AVL   R Q N VIFE D Q LI
Sbjct: 458  WMGARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLI 503


>ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana]
            gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR
            reverse transcriptase [Arabidopsis thaliana]
            gi|332641254|gb|AEE74775.1| RNase H domain-containing
            protein [Arabidopsis thaliana]
          Length = 484

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 80/322 (24%), Positives = 137/322 (42%), Gaps = 17/322 (5%)
 Frame = -1

Query: 938  INRVHIRPLNLPDQII*PYAKNGNISVNLAYFFLSKQDNLSFSSSFNPL----IVQKLWK 771
            I+R+++     PD+II  Y   G  +V   Y+ L+   + +  +   P     +  ++W 
Sbjct: 105  IHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWN 164

Query: 770  LKIDAATQLFIWKLYSGGLPTGDVLHKCKFKGDITCSFCQKCIETALHALFSYYWVKLMW 591
            L I    + F+W+  S  L T + L     + D +C  C +  E+  HALF+  +  + W
Sbjct: 165  LPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAW 224

Query: 590  NVSNAML--NISSHDDWEVND*LNLLISWCFDKDEDISR-RGYFCIQFLYELWLTRNNA- 423
             +S++ L  N    +D+E N  ++ +++  F +D  +S       +  ++ +W  RNN  
Sbjct: 225  RLSDSSLIRNQLMSNDFEEN--ISNILN--FVQDTTMSDFHKLLPVWLIWRIWKARNNVV 280

Query: 422  --RMEQKPINTKLILNLSALKRNRNHSAFLTCPPFGNIALASQPLCFNFNTWSSPSVPWV 249
              +  + P  +K +L+  A      H               ++ +  N   W +P   +V
Sbjct: 281  FNKFRESP--SKTVLSAKA----ETHDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYV 334

Query: 248  KVNFDTALDKSHYSGAAGAIARDSEG--VIFGAAFRNFFANDHEQAKAIGAEVAVLLALR 75
            K NFD   D        G I R+  G  + +G+      +N  E      AE   LLA  
Sbjct: 335  KCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLE------AETKALLAAL 388

Query: 74   LQLNI-----VIFEGDCQTLIS 24
             Q  I     V  EGDCQTLI+
Sbjct: 389  QQTWIRGYTQVFMEGDCQTLIN 410


>gb|EEC83100.1| hypothetical protein OsI_28249 [Oryza sativa Indica Group]
          Length = 1300

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 83/344 (24%), Positives = 139/344 (40%), Gaps = 23/344 (6%)
 Frame = -1

Query: 989  WNQNRLG*LFPFQIIQSINRVHIRPLNLPDQII*PYAKNGNISVNLAYFFLSKQDNLSFS 810
            W+ +++  +F    ++ I  +H    +  D +     K G  SV  AY       N+  S
Sbjct: 892  WDSHKIQQIFLPIDVEKILSIHTSRFHENDFVAWHSDKLGRFSVRSAYHLALSLSNVVAS 951

Query: 809  SSFNPLIVQK----LWKLKIDAATQLFIWKLYSGGLPTGDVLHKCKFKGDITCSFCQKCI 642
            SS +   + K    LW   +    ++FIW+  S  L T     K + +    CS C    
Sbjct: 952  SSSSGQELSKAWNQLWSCHVPQKVRIFIWRAASNSLATMVNKKKKRLEHCSMCSICGTEE 1011

Query: 641  ETALHALFSYYWVKLMWNVSN--AMLNISSHDDWEVND*LNLLISWCFDKDEDISRRGY- 471
            E   HAL      K +W V      + + +  +W   D       W FD  E IS+    
Sbjct: 1012 EDVAHALCRCPHAKYLWEVMRRAKAITVQADRNWTGAD-------WIFDISERISKEERP 1064

Query: 470  FCIQFLYELWLTRNN---------ARMEQKPINTKLILNLSALKRNRNHSA-----FLTC 333
              +  L+ +W  RN          A + Q+ I++  I +L  +++  + +       + C
Sbjct: 1065 TLLMMLWRIWYVRNEITHGKAAVPAEVSQRFISS-YITSLLEIRQFPDANLCKGKHVIRC 1123

Query: 332  PPFGNIALASQPLCFNFNT-WSSPSVPWVKVNFDTALDKSHYSGAAGAIARDSEG-VIFG 159
               G  A  + P   +    W  P   W+K+N D + D    SG  GA+ R+SEG +IF 
Sbjct: 1124 AAAG--AQVNHPRVNSVPVRWVRPQAGWMKLNVDGSYDPRDGSGGIGAVLRNSEGKLIFA 1181

Query: 158  AAFRNFFANDHEQAKAIGAEVAVLLALRLQLNIVIFEGDCQTLI 27
            A           +A+ +  +  ++LAL+     +I E DC  L+
Sbjct: 1182 ACGSMCRPVSALEAELVACKEGIILALQWTFLPIIVETDCLELV 1225


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 80/322 (24%), Positives = 137/322 (42%), Gaps = 17/322 (5%)
 Frame = -1

Query: 938  INRVHIRPLNLPDQII*PYAKNGNISVNLAYFFLSKQDNLSFSSSFNPL----IVQKLWK 771
            I+R+++     PD+II  Y   G  +V   Y+ L+   + +  +   P     +  ++W 
Sbjct: 1371 IHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWN 1430

Query: 770  LKIDAATQLFIWKLYSGGLPTGDVLHKCKFKGDITCSFCQKCIETALHALFSYYWVKLMW 591
            L I    + F+W+  S  L T + L     + D +C  C +  E+  HALF+  +  + W
Sbjct: 1431 LPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAW 1490

Query: 590  NVSNAML--NISSHDDWEVND*LNLLISWCFDKDEDISR-RGYFCIQFLYELWLTRNNA- 423
             +S++ L  N    +D+E N  ++ +++  F +D  +S       +  ++ +W  RNN  
Sbjct: 1491 RLSDSSLIRNQLMSNDFEEN--ISNILN--FVQDTTMSDFHKLLPVWLIWRIWKARNNVV 1546

Query: 422  --RMEQKPINTKLILNLSALKRNRNHSAFLTCPPFGNIALASQPLCFNFNTWSSPSVPWV 249
              +  + P  +K +L+  A      H               ++ +  N   W +P   +V
Sbjct: 1547 FNKFRESP--SKTVLSAKA----ETHDWLNATQSHKKTPSPTRQIAENKIEWRNPPATYV 1600

Query: 248  KVNFDTALDKSHYSGAAGAIARDSEG--VIFGAAFRNFFANDHEQAKAIGAEVAVLLALR 75
            K NFD   D        G I R+  G  + +G+      +N  E      AE   LLA  
Sbjct: 1601 KCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLE------AETKALLAAL 1654

Query: 74   LQLNI-----VIFEGDCQTLIS 24
             Q  I     V  EGDCQTLI+
Sbjct: 1655 QQTWIRGYTQVFMEGDCQTLIN 1676


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 78.2 bits (191), Expect = 3e-12
 Identities = 78/324 (24%), Positives = 124/324 (38%), Gaps = 4/324 (1%)
 Frame = -1

Query: 989  WNQNRLG*LFPFQIIQSINRVHIRPLNLPDQII*PYAKNGNISVNLAYFFLSKQDNL--- 819
            WN   L  LF      +I R+ +     PDQ +   +KNG  +V  AY+    +D     
Sbjct: 981  WNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYHELLEDRKTGP 1040

Query: 818  SFSSSFNPLIVQKLWKLKIDAATQLFIWKLYSGGLPTGDVLHKCKFKGDITCSFCQKCIE 639
            S S   N  + QK+WK KI    +LF WK    GL     + K     D  C  C +  E
Sbjct: 1041 STSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKEE 1100

Query: 638  TALHALFSYYWVKLMWNVSNAMLNISSHDDWEVND*LNLLISWCFDKDEDISRRGYFCIQ 459
            T  H ++        W +S   ++  + +         + +    D  +D      F + 
Sbjct: 1101 TTEHLIWGCDESSRAWYISPLRIHTGNIEAGS----FRIWVESLLDTHKDTEWWALFWM- 1155

Query: 458  FLYELWLTRNNARMEQKPINTKLILNLSALKRNRNHSAFLTCPPFGNIALASQPLCFNFN 279
              + +WL RN    E+K +  + ++  +               P        + L  + N
Sbjct: 1156 ICWNIWLGRNKWVFEKKKLAFQEVVERAVRGVMEFEEECAHTSPV-------ETLNTHEN 1208

Query: 278  TWSSPSVPWVKVNFDTALDKSHYSGAAGAIARDSEG-VIFGAAFRNFFANDHEQAKAIGA 102
             WS P V  VK+N D A+ K H     G + RD+EG V+       +   D   A+A   
Sbjct: 1209 GWSVPPVGMVKLNVDAAVFK-HVGIGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSL 1267

Query: 101  EVAVLLALRLQLNIVIFEGDCQTL 30
               + +A       ++ E DC+ L
Sbjct: 1268 RYGLKVAYEAGFRNLVVEMDCKKL 1291

