BLASTX nr result

ID: Coptis24_contig00016661 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00016661
         (1402 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thalia...    91   9e-16
ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ...    87   8e-15
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...    87   8e-15
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...    87   1e-14
gb|EEE66057.1| hypothetical protein OsJ_22054 [Oryza sativa Japo...    79   2e-14

>ref|NP_194638.1| Ribonuclease H-like protein [Arabidopsis thaliana]
            gi|4972055|emb|CAB43923.1| putative protein [Arabidopsis
            thaliana] gi|7269807|emb|CAB79667.1| putative protein
            [Arabidopsis thaliana] gi|67633766|gb|AAY78807.1|
            putative reverse transcriptase/RNA-dependent DNA
            polymerase [Arabidopsis thaliana]
            gi|332660185|gb|AEE85585.1| Ribonuclease H-like protein
            [Arabidopsis thaliana]
          Length = 575

 Score = 90.5 bits (223), Expect = 9e-16
 Identities = 93/388 (23%), Positives = 154/388 (39%), Gaps = 11/388 (2%)
 Frame = +3

Query: 129  LKVNSLFSPVLLSWNQNRLGELFPFXXXXXXXXXXXXPSGSPDHIIWPATKNRELTVKSA 308
            LKV+ L       W ++ +  LFP                  D   W  T + + TVKS 
Sbjct: 169  LKVSDLIDESGREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSG 228

Query: 309  YVFLTKQDNISFSS------SFNPLSMKNLWKLHLDAAT*LFIWKLYIGGLPTGDVLHKF 470
            Y  LT+  N   S       S NP+  K +WK         F+WK     LP    L   
Sbjct: 229  YWVLTQIINKRSSPQEVSEPSLNPIYQK-IWKSQTSPKIQHFLWKCLSNSLPVAGALAYR 287

Query: 471  KFKGDISCSFCQKCIETASHVFFSCTWIKMMWYVSNAMLDINVHPDWNVKEWINQF-IFW 647
                + +C  C  C ET +H+ F CT+ ++ W +S+  + I +  +W    ++N + +F 
Sbjct: 288  HLSKESACIRCPSCKETVNHLLFKCTFARLTWAISS--IPIPLGGEWADSIYVNLYWVFN 345

Query: 648  CSSKDEDICRRGYLCLFILYELWLARNKARMECRPIELKSILNFSDMKRE---ITSLAFL 818
              + +    +   L  ++L+ LW  RN+     R    + +L  ++   E   I + A  
Sbjct: 346  LGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAES 405

Query: 819  SITYPGLPLSFNIIXXXXXXXXXVKVNFNVAFDRSHFSAAVGVVVRNAEGVILGAAFRRF 998
              T P +  S +           VK N +  ++R +    +G V+RN +G +     R  
Sbjct: 406  CGTKPQVNRS-SCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARAL 464

Query: 999  -SANDPEQAELIGAEVAILLALRLNLCFVILEGDCQTLMSALKTCNSSLLG*NSFFVFQH 1175
                   +AEL     A+L   R    +VI E D Q L+  L   N+  +  +     Q 
Sbjct: 465  PKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEIL---NNDEIWPSLKPTIQD 521

Query: 1176 IFALAAGLDKFVFSWVSRTGNGFAHGLA 1259
            +  L +   +  F ++ R GN  A  +A
Sbjct: 522  LQRLLSQFTEVKFVFIPREGNTLAERVA 549


>ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana]
            gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR
            reverse transcriptase [Arabidopsis thaliana]
            gi|332641254|gb|AEE74775.1| RNase H domain-containing
            protein [Arabidopsis thaliana]
          Length = 484

 Score = 87.4 bits (215), Expect = 8e-15
 Identities = 90/395 (22%), Positives = 162/395 (41%), Gaps = 15/395 (3%)
 Frame = +3

Query: 126  DLKVNSLFSPV--LLSWNQNRLGELFPFXXXXXXXXXXXXPSGSPDHIIWPATKNRELTV 299
            ++ +N+LF        W+ +++ +                 S  PD IIW      E TV
Sbjct: 72   EMTINNLFERKGSYYFWDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTV 131

Query: 300  KSAYVFLTKQDNISFSSSFNP---LSMKN-LWKLHLDAAT*LFIWKLYIGGLPTGDVLHK 467
            +S Y  LT   + +  +   P   + +K  +W L +      F+W+     L T + L  
Sbjct: 132  RSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWNLPIMPKLKHFLWRALSQALATTERLTT 191

Query: 468  FKFKGDISCSFCQKCIETASHVFFSCTWIKMMWYVSNAMLDINVHPDWNVKEWINQFIFW 647
               + D SC  C +  E+ +H  F+C +  M W +S++ L  N     + +E I+  + +
Sbjct: 192  RGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNF 251

Query: 648  CSSKDEDICRRGYLCLFILYELWLARNKARM-ECRPIELKSILNFSDMKRE--ITSLAFL 818
                      +  L +++++ +W ARN     + R    K++L+      +    + +  
Sbjct: 252  VQDTTMSDFHK-LLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 310

Query: 819  SITYPGLPLSFNIIXXXXXXXXXVKVNFNVAFDRSHFSAAVGVVVRNAEGV-ILGAAFRR 995
                P   ++ N I         VK NF+  FD     A  G ++RN  G  I   + + 
Sbjct: 311  KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKL 370

Query: 996  FSANDPEQAELIGAEVAILLALRLNLCFVILEGDCQTLMSALK--TCNSSLLG*NSFFVF 1169
               ++P +AE      A+          V +EGDCQTL++ +   + +SSL         
Sbjct: 371  AHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSSLA-------- 422

Query: 1170 QHIFALAAGLDKFV---FSWVSRTGNGFAHGLASW 1265
             H+  ++   +KF    F ++ R GN  AH LA +
Sbjct: 423  NHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKY 457


>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1369

 Score = 87.4 bits (215), Expect = 8e-15
 Identities = 93/380 (24%), Positives = 148/380 (38%), Gaps = 6/380 (1%)
 Frame = +3

Query: 168  WNQNRLGELFPFXXXXXXXXXXXXPSGSPDHIIWPATKNRELTVKSAYVFLTKQDNISFS 347
            WN   L  LF                  PD  +W  +KN + TV+SAY     +D  +  
Sbjct: 981  WNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQFTVRSAYYHELLEDRKTGP 1040

Query: 348  SSFNPLSMK---NLWKLHLDAAT*LFIWKLYIGGLPTGDVLHKFKFKGDISCSFCQKCIE 518
            S+    ++K    +WK  +     LF WK    GL     + K     D +C  C +  E
Sbjct: 1041 STSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNGLAVYTNMRKRGMNIDGACPRCGEKEE 1100

Query: 519  TASHVFFSCTWIKMMWYVSNAMLDINVHPDWNVKEWINQFIFWCSSKDEDICRRGYLCLF 698
            T  H+ + C      WY+S     + +H   N++     F  W  S  +      +  LF
Sbjct: 1101 TTEHLIWGCDESSRAWYIS----PLRIHTG-NIE--AGSFRIWVESLLDTHKDTEWWALF 1153

Query: 699  --ILYELWLARNKARMECRPIELKSILNFSDMKREITSLAFLSITYPGLPLSFNIIXXXX 872
              I + +WL RNK   E + +  + ++  + ++  +      + T P   L+ +      
Sbjct: 1154 WMICWNIWLGRNKWVFEKKKLAFQEVVERA-VRGVMEFEEECAHTSPVETLNTHENGWSV 1212

Query: 873  XXXXXVKVNFNVAFDRSHFSAAVGVVVRNAEG-VILGAAFRRFSANDPEQAELIGAEVAI 1049
                 VK+N + A  + H    +G VVR+AEG V+L      ++  DP  AE       +
Sbjct: 1213 PPVGMVKLNVDAAVFK-HVGIGMGGVVRDAEGDVLLATCCGGWAMEDPAMAEACSLRYGL 1271

Query: 1050 LLALRLNLCFVILEGDCQTLMSALKTCNSSLLG*NSFFVFQHIFALAAGLDKFVFSWVSR 1229
             +A       +++E DC+ L   L+   S +       V   I  LA+     VF  V R
Sbjct: 1272 KVAYEAGFRNLVVEMDCKKLFLQLRGKASDVTPFGR--VVDDILYLASKCSNVVFEHVKR 1329

Query: 1230 TGNGFAHGLASWASKQIPGR 1289
              N  AH LA      +  R
Sbjct: 1330 HCNKVAHLLAQMCKNAMEKR 1349


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 89/395 (22%), Positives = 162/395 (41%), Gaps = 15/395 (3%)
 Frame = +3

Query: 126  DLKVNSLFSPV--LLSWNQNRLGELFPFXXXXXXXXXXXXPSGSPDHIIWPATKNRELTV 299
            ++ +N+LF        W+ +++ +                 S  PD IIW      E TV
Sbjct: 1112 EMTINNLFERKGSYYFWDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTV 1171

Query: 300  KSAYVFLTKQDNISFSSSFNP---LSMKN-LWKLHLDAAT*LFIWKLYIGGLPTGDVLHK 467
            +S Y  LT   + +  +   P   + +K  +W L +      F+W+     L T + L  
Sbjct: 1172 RSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWNLPIMPKLKHFLWRALSQALATTERLTT 1231

Query: 468  FKFKGDISCSFCQKCIETASHVFFSCTWIKMMWYVSNAMLDINVHPDWNVKEWINQFIFW 647
               + D  C  C +  E+ +H  F+C +  M W++S++ L  N     + +E I+  + +
Sbjct: 1232 RGMRIDPICPRCHRENESINHALFTCPFATMAWWLSDSSLIRNQLMSNDFEENISNILNF 1291

Query: 648  CSSKDEDICRRGYLCLFILYELWLARNKARM-ECRPIELKSILNFSDMKRE--ITSLAFL 818
                      +  L +++++ +W ARN     + R    K++L+      +    + +  
Sbjct: 1292 VQDTTMSDFHK-LLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHK 1350

Query: 819  SITYPGLPLSFNIIXXXXXXXXXVKVNFNVAFDRSHFSAAVGVVVRNAEGV-ILGAAFRR 995
                P   ++ N I         VK NF+  FD     A  G ++RN  G  I   + + 
Sbjct: 1351 KTPSPTRQIAENKIEWRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKL 1410

Query: 996  FSANDPEQAELIGAEVAILLALRLNLCFVILEGDCQTLMSALK--TCNSSLLG*NSFFVF 1169
               ++P +AE      A+          V +EGDCQTL++ +   + +SSL         
Sbjct: 1411 AHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLINGISFHSSLA-------- 1462

Query: 1170 QHIFALAAGLDKFV---FSWVSRTGNGFAHGLASW 1265
             H+  ++   +KF    F ++ R GN  AH LA +
Sbjct: 1463 NHLEDISFWANKFASIQFGFIRRKGNKLAHVLAKY 1497


>gb|EEE66057.1| hypothetical protein OsJ_22054 [Oryza sativa Japonica Group]
          Length = 940

 Score = 79.0 bits (193), Expect(2) = 2e-14
 Identities = 93/421 (22%), Positives = 170/421 (40%), Gaps = 20/421 (4%)
 Frame = +3

Query: 78   KPIILKDLLDPGPSLFDLKVNSLFSPVLLSWNQNRLGELFPFXXXXXXXXXXXXPSGSPD 257
            KP +++ LL   P   D+ V+ L +  +  W+++++   F                G  D
Sbjct: 517  KPSMVRPLL---PMPDDVTVDFLVNAAIGEWDEDKVFSFFDETTAQQILQIPVSAHGGED 573

Query: 258  HIIWPATKNRELTVKSAYVFLTKQDNISFSSSFNPLSM-----------KNLWKLHLDAA 404
             I WP  K    +V+SAY  L + +    + S N   M           K LW+++    
Sbjct: 574  FISWPHDKRGVFSVRSAYN-LARSEIFMAAQSENGRGMLSGLQESANRWKELWRINAPGK 632

Query: 405  T*LFIWKLYIGGLPTGDVLHKFKFKGDISCSFCQKCIETASHVFFSCTWIKMMWYVSNAM 584
                +W++    LP+G  L +        C FC++  +   H+F  C +   +W      
Sbjct: 633  MLTNLWRIVHDCLPSGFQLRRRHIPATDGCCFCER-DDRIEHIFLLCPFAVCIWDSIKQH 691

Query: 585  LDINVHPD--WNVKEWINQFIFWCSSKDEDICRRGYLCLFILYELWLARNKARME---CR 749
             D+ +      N+K+W+  F+   S+  +            L+ +W ARN +R       
Sbjct: 692  FDLKLCMTDLSNMKQWVFDFLGRSSNIQKT------ALAVTLWHIWEARNHSRNNPTLAN 745

Query: 750  PIE-LKSILNFSDMKREITSLAFLSITYPGLPLSFNIIXXXXXXXXXVKVNFNVAFDRSH 926
            P + ++ IL + +M  +    A  ++    L     +          + +N + A  +S 
Sbjct: 746  PRQVIQKILAYVEMIEQHCCCAVQAVRGDALR---PVPRWRPPPEGTILINTDAAVFQSV 802

Query: 927  FSAAVGVVVRNAEGVILGAAFRRFS-ANDPEQAELIGAEVAILLALRLNLCFVILEGDCQ 1103
             S  +G + R+  G+ L AA  R S    PE AE +    A+  A+      ++L  DC 
Sbjct: 803  NSFGLGFLFRDHSGLCLFAANERHSGCIQPEMAEALAIRCALRTAMEEGHQKIVLASDCL 862

Query: 1104 TLMSALKT--CNSSLLG*NSFFVFQHIFALAAGLDKFVFSWVSRTGNGFAHGLASWASKQ 1277
             ++  +++   + S++G     +   I  LAAG     F  V+R  N  AH LA  + + 
Sbjct: 863  AIIQKIQSGARDRSMVG----ALVSDINFLAAGFLDCSFIHVNRVTNAAAHLLAQCSEQT 918

Query: 1278 I 1280
            +
Sbjct: 919  V 919



 Score = 27.7 bits (60), Expect(2) = 2e-14
 Identities = 10/20 (50%), Positives = 12/20 (60%)
 Frame = +2

Query: 2   GFLWNIGAGSSISIFNDPWL 61
           G  W IG GSS+ I  D W+
Sbjct: 494 GVRWGIGNGSSVKILKDHWI 513


Top