BLASTX nr result

ID: Coptis24_contig00009393

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00009393
         (1104 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_189164.1| Ribonuclease H-like protein [Arabidopsis thalia...    87   7e-15
ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis ...    74   8e-11
ref|XP_002466618.1| hypothetical protein SORBIDRAFT_01g011130 [S...    73   1e-10
gb|AAD24831.1| putative non-LTR retroelement reverse transcripta...    73   1e-10
gb|AAD20714.1| putative non-LTR retroelement reverse transcripta...    73   1e-10

>ref|NP_189164.1| Ribonuclease H-like protein [Arabidopsis thaliana]
            gi|9294184|dbj|BAB02086.1| reverse transcriptase-like
            protein [Arabidopsis thaliana]
            gi|332643482|gb|AEE77003.1| Ribonuclease H-like protein
            [Arabidopsis thaliana]
          Length = 343

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 80/316 (25%), Positives = 138/316 (43%), Gaps = 3/316 (0%)
 Frame = -3

Query: 1039 KMWKTKVEASIQLLVWKLYNGGLLTGDRLRRQKFKGDISCVFCQKEIETDEHLFLQCAWI 860
            K+WK K    I+  +WKL +G L TGD L+R+  +    C  C +E ET +HLF  C + 
Sbjct: 17   KIWKLKTAPKIKHFLWKLLSGALATGDNLKRRHIRNHPQCHRCCQEDETSQHLFFDCFYA 76

Query: 859  RCLWFGSSLSIRMEEHEGKTLHEWIGEAVCWNSLEDFAETEVQRFASTYFLVMINEIWRC 680
            + +W  S +  +     G T+ E   E +  + L   A  + Q F    +++    +W+ 
Sbjct: 77   QQVWRASGIPHQELRTTGITM-ETKMELLLSSCL---ANRQPQLFNLAIWILW--RLWKS 130

Query: 679  RNQLRFEKIKPNMEQMIRTVARKV--GSTLEAYKKDVRSSGQTSRKENNPIEYHMALNKV 506
            RNQL F++   + +  ++     V        Y + +     +SR +    +  MA  K 
Sbjct: 131  RNQLVFQQKSISWQNTLQRARNDVQEWEDTNTYVQSLNQQVHSSRHQ----QPTMARTKW 186

Query: 505  GIDFYSSWPKISFDGAFDRMTKKGGAAAICRDADGKLLGSVYR-RFHAETPYEAECHGAE 329
                 S+W K ++DGAF+  T+   A  + RD +G  +GS            E+E     
Sbjct: 187  QRP-PSTWIKYNYDGAFNHQTRNAKAGWLMRDENGVYMGSGQAIGSTTSDSLESEFQALI 245

Query: 328  LAAILLYKLKLDKVLIMGDCRDLMQTLRSNGVGLEQDDFISRIEVLFKQSYLMFSCNLCF 149
            +A    +     KV+  GD + + + + +  +   + ++I   E  F Q          F
Sbjct: 246  IAMQHAWSQGYRKVIFEGDSKQVEELMNNEKLNFGRFNWIR--EGRFWQKRFE---EAVF 300

Query: 148  YWFRRAHNREADELSK 101
             W  R +N+ AD L+K
Sbjct: 301  KWVPRTNNQPADILAK 316


>ref|NP_187562.1| RNase H domain-containing protein [Arabidopsis thaliana]
            gi|6682231|gb|AAF23283.1|AC016661_8 putative non-LTR
            reverse transcriptase [Arabidopsis thaliana]
            gi|332641254|gb|AEE74775.1| RNase H domain-containing
            protein [Arabidopsis thaliana]
          Length = 484

 Score = 73.6 bits (179), Expect = 8e-11
 Identities = 71/334 (21%), Positives = 132/334 (39%), Gaps = 6/334 (1%)
 Frame = -3

Query: 1039 KMWKTKVEASIQLLVWKLYNGGLLTGDRLRRQKFKGDISCVFCQKEIETDEHLFLQCAWI 860
            ++W   +   ++  +W+  +  L T +RL  +  + D SC  C +E E+  H    C + 
Sbjct: 161  RIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFA 220

Query: 859  RCLWFGSSLSIRMEEHEGKTLHEWIGEAVCWNSLEDFAETEVQRFASTYFLVMINEIWRC 680
               W  S  S+   +       E I      N L    +T +  F     + +I  IW+ 
Sbjct: 221  TMAWRLSDSSLIRNQLMSNDFEENIS-----NILNFVQDTTMSDFHKLLPVWLIWRIWKA 275

Query: 679  RNQLRFEKIKPNMEQMIRTVARKVGSTLEAYKKDVRSSGQTSRKENNPIEYHMALNKVGI 500
            RN + F K + +  + + +   +    L A +   ++   T +   N IE+         
Sbjct: 276  RNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTRQIAENKIEWR-------- 327

Query: 499  DFYSSWPKISFDGAFDRMTKKGGAAAICRDADGKLL--GSVYRRFHAETPYEAECHGAEL 326
            +  +++ K +FD  FD    +     I R+  G  +  GS+ +  H   P EAE      
Sbjct: 328  NPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSM-KLAHTSNPLEAETKALLA 386

Query: 325  AAILLYKLKLDKVLIMGDCRDLMQTLRSNGVGLEQDDFISRIEVLFKQSYLMFSCNLCFY 146
            A    +     +V + GDC+ L+  +  NG+           ++ F  +      ++ F 
Sbjct: 387  ALQQTWIRGYTQVFMEGDCQTLINLI--NGISFHSSLANHLEDISFWANKF---ASIQFG 441

Query: 145  WFRRAHNREADELSKWALNYAAIYPP----PCWI 56
            + RR  N+ A  L+K+   Y+  Y      P W+
Sbjct: 442  FIRRKGNKLAHVLAKYGCTYSTFYSGSGSLPIWL 475


>ref|XP_002466618.1| hypothetical protein SORBIDRAFT_01g011130 [Sorghum bicolor]
            gi|241920472|gb|EER93616.1| hypothetical protein
            SORBIDRAFT_01g011130 [Sorghum bicolor]
          Length = 463

 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 81/349 (23%), Positives = 144/349 (41%), Gaps = 13/349 (3%)
 Frame = -3

Query: 1099 QNHISDCQGETSTANEQAV---KKMWKTKVEASIQLLVWKLYNGGLLTGDRLRRQKFKGD 929
            +NH S  QG+   +N   +   K++WK      I+  +W+  +      D L R+  +  
Sbjct: 123  RNHTS--QGQQGGSNPGPLSLWKRVWKLSCPNKIKHFLWRFLHNSHPLRDNLIRRGMEIV 180

Query: 928  ISCVFCQKEIETDEHLFLQCAWIRCLWFGSSLSIRMEEHEGKTLHEWIGEAVCWNSLEDF 749
              C  C +  E   HLF +C   R +W    LS   E              V  N     
Sbjct: 181  PRCPVCNQVGEDGGHLFFKCGMARQVWELLGLSTERE--------------VLANFYTPI 226

Query: 748  AETE-VQRFASTYFLVMINEIWRC---RNQLRFEKIKPNMEQMIRTVARKVGSTLEAYKK 581
               E + R + +  L+MI  +W     RN +R E  + + + + R V        E Y +
Sbjct: 227  DVVEFILRASESRKLMMIVALWYTWSERNAIREEDRRRSPQTLARCV--------ELYVQ 278

Query: 580  DVRSSGQTSRKENNPIEYHMALNKVGIDFYSSWPKISFDGAFDRMTKKGGAAAICRDADG 401
            ++R++  T+    N  +     +K  +D      K++ DG+F   T+ G    + RD +G
Sbjct: 279  EMRTTETTANPTANQEQQQYKWSKPPVDIL----KLNCDGSFSPETRAGSWGVLIRDHEG 334

Query: 400  KLLGSVYRRF-HAETPYEAECHGAELAAILLYKLKLDKVLIMGDCRDLMQTLRSN----- 239
             ++ S   R  H  TP +AE         L   L + ++++  D  ++++ ++++     
Sbjct: 335  DVIMSGRGRVNHLMTPMQAELIACLQGVQLAANLGIGRLILETDALEVVKAIKTSAYNYA 394

Query: 238  GVGLEQDDFISRIEVLFKQSYLMFSCNLCFYWFRRAHNREADELSKWAL 92
             VG   ++  S IE+ F     +F+C +C        NR A EL+   L
Sbjct: 395  AVGYLVEEIKSLIELNFISVECVFACRIC--------NRAAHELAALGL 435


>gb|AAD24831.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1524

 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 70/334 (20%), Positives = 132/334 (39%), Gaps = 6/334 (1%)
 Frame = -3

Query: 1039 KMWKTKVEASIQLLVWKLYNGGLLTGDRLRRQKFKGDISCVFCQKEIETDEHLFLQCAWI 860
            ++W   +   ++  +W+  +  L T +RL  +  + D  C  C +E E+  H    C + 
Sbjct: 1201 RIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPICPRCHRENESINHALFTCPFA 1260

Query: 859  RCLWFGSSLSIRMEEHEGKTLHEWIGEAVCWNSLEDFAETEVQRFASTYFLVMINEIWRC 680
               W+ S  S+   +       E I      N L    +T +  F     + +I  IW+ 
Sbjct: 1261 TMAWWLSDSSLIRNQLMSNDFEENIS-----NILNFVQDTTMSDFHKLLPVWLIWRIWKA 1315

Query: 679  RNQLRFEKIKPNMEQMIRTVARKVGSTLEAYKKDVRSSGQTSRKENNPIEYHMALNKVGI 500
            RN + F K + +  + + +   +    L A +   ++   T +   N IE+         
Sbjct: 1316 RNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTRQIAENKIEWR-------- 1367

Query: 499  DFYSSWPKISFDGAFDRMTKKGGAAAICRDADGKLL--GSVYRRFHAETPYEAECHGAEL 326
            +  +++ K +FD  FD    +     I R+  G  +  GS+ +  H   P EAE      
Sbjct: 1368 NPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSM-KLAHTSNPLEAETKALLA 1426

Query: 325  AAILLYKLKLDKVLIMGDCRDLMQTLRSNGVGLEQDDFISRIEVLFKQSYLMFSCNLCFY 146
            A    +     +V + GDC+ L+  +  NG+           ++ F  +      ++ F 
Sbjct: 1427 ALQQTWIRGYTQVFMEGDCQTLINLI--NGISFHSSLANHLEDISFWANKF---ASIQFG 1481

Query: 145  WFRRAHNREADELSKWALNYAAIYPP----PCWI 56
            + RR  N+ A  L+K+   Y+  Y      P W+
Sbjct: 1482 FIRRKGNKLAHVLAKYGCTYSTFYSGSGSLPIWL 1515


>gb|AAD20714.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1750

 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 70/334 (20%), Positives = 132/334 (39%), Gaps = 6/334 (1%)
 Frame = -3

Query: 1039 KMWKTKVEASIQLLVWKLYNGGLLTGDRLRRQKFKGDISCVFCQKEIETDEHLFLQCAWI 860
            ++W   +   ++  +W+  +  L T +RL  +  + D SC  C +E E+  H    C + 
Sbjct: 1427 RIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFA 1486

Query: 859  RCLWFGSSLSIRMEEHEGKTLHEWIGEAVCWNSLEDFAETEVQRFASTYFLVMINEIWRC 680
               W  S  S+   +       E I      N L    +T +  F     + +I  IW+ 
Sbjct: 1487 TMAWRLSDSSLIRNQLMSNDFEENIS-----NILNFVQDTTMSDFHKLLPVWLIWRIWKA 1541

Query: 679  RNQLRFEKIKPNMEQMIRTVARKVGSTLEAYKKDVRSSGQTSRKENNPIEYHMALNKVGI 500
            RN + F K + +  + + +   +    L A +   ++   T +   N IE+         
Sbjct: 1542 RNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTRQIAENKIEWR-------- 1593

Query: 499  DFYSSWPKISFDGAFDRMTKKGGAAAICRDADGKLL--GSVYRRFHAETPYEAECHGAEL 326
            +  +++ K +FD  FD    +     I R+  G  +  GS+ +  H   P EAE      
Sbjct: 1594 NPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSM-KLAHTSNPLEAETKALLA 1652

Query: 325  AAILLYKLKLDKVLIMGDCRDLMQTLRSNGVGLEQDDFISRIEVLFKQSYLMFSCNLCFY 146
            A    +     +V + GDC+ L+  +  NG+           ++ F  +      ++ F 
Sbjct: 1653 ALQQTWIRGYTQVFMEGDCQTLINLI--NGISFHSSLANHLEDISFWANKF---ASIQFG 1707

Query: 145  WFRRAHNREADELSKWALNYAAIYPP----PCWI 56
            + R+  N+ A  L+K+   Y+  Y      P W+
Sbjct: 1708 FIRKKGNKLAHVLAKYGCTYSTFYSDSGSLPIWL 1741

