BLASTX nr result

ID: Coptis25_contig00018126 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00018126
         (1270 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABW74566.1| integrase [Boechera divaricarpa]                       273   e-114
emb|CAN79061.1| hypothetical protein VITISV_024577 [Vitis vinifera]   270   e-112
gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabi...   256   e-106
gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabi...   256   e-106
gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thal...   256   e-106

>gb|ABW74566.1| integrase [Boechera divaricarpa]
          Length = 1165

 Score =  273 bits (699), Expect(2) = e-114
 Identities = 136/257 (52%), Positives = 174/257 (67%)
 Frame = +3

Query: 384  KYLEDLLKRFSMSNCKPISSPAALNEKLQLNDGALKVDEKVYRSLVGSLLYLTHTRPDIM 563
            KY  DLLKRF M+ C  + +P  +NEKL   DG  K D   +RSLVG L+YLTHTRPDI 
Sbjct: 898  KYACDLLKRFDMAGCNAVETPMNVNEKLLAGDGTEKADATKFRSLVGGLIYLTHTRPDIC 957

Query: 564  YTVSLLSRFLSEPSKLHLAAAKRVLRYLQGTKNFGLKYVKEEVNSLISFTDSD*AGSLDD 743
            + VS +SRF+  P+K H  AAKR+LRY+  T  +GL Y       L+ FTDSD AG + D
Sbjct: 958  FAVSAISRFMHGPTKQHFGAAKRLLRYIARTAEYGLWYCSVSKFKLVGFTDSDWAGCVQD 1017

Query: 744  KKSTSGYAFCLGSKLIAWSSKKQKIVALSSTEAEYIAVTDAICEGIWLRRILKDLQQEVK 923
            +KSTSG+ F LGS  + WSSKKQ + ALSS+EAEY A T A C+ +WLRRIL D++QE +
Sbjct: 1018 RKSTSGHVFNLGSGAVCWSSKKQNVTALSSSEAEYTAATAAACQAVWLRRILADIKQEQE 1077

Query: 924  TPTTIFCDNMSAIATTKNPVFQSRTKHIELRYHYIRHKVEKKKVELKFINTNEQLADMLT 1103
              TTIFCDN + IA  KNP +  RTKHI ++ H+IR  V +  V L++ +TNEQ AD+LT
Sbjct: 1078 KATTIFCDNKATIAMNKNPAYHGRTKHISIKVHFIRDLVSEGSVTLEYCSTNEQSADVLT 1137

Query: 1104 KAVSMDKFV*ARGTMNV 1154
            KA+S +KF   R  + V
Sbjct: 1138 KALSRNKFDYFRSKLGV 1154



 Score =  165 bits (418), Expect(2) = e-114
 Identities = 74/129 (57%), Positives = 105/129 (81%), Gaps = 1/129 (0%)
 Frame = +2

Query: 2    EEVYVEQPLGYVKEGAENKVYKLQKALYSLKQAPRTWNSKIDGYFRQTGFSKSRSEPSLY 181
            EEVYV+QP G++ EG E  VY+L KALY LKQAPR W +KID YF +TGF +S+SEP+LY
Sbjct: 770  EEVYVDQPEGFIVEGREGFVYRLYKALYGLKQAPRAWYNKIDSYFAETGFERSKSEPTLY 829

Query: 182  IKKEG-NELLIVCLYVDDLIYTGTSMKILQDFKEAMMKEYEMTDLGLMKYFLGIQVQQSK 358
            IKK+G  ++L+VCLYVDD+IY G+S  ++ +FK +MM+++EMTDLGL+ +FLG++V+Q +
Sbjct: 830  IKKQGAGDILVVCLYVDDMIYMGSSASLVSEFKASMMEKFEMTDLGLLYFFLGLEVKQVE 889

Query: 359  GNIFISQEE 385
              +F+SQ +
Sbjct: 890  DGVFVSQHK 898


>emb|CAN79061.1| hypothetical protein VITISV_024577 [Vitis vinifera]
          Length = 1424

 Score =  270 bits (691), Expect(2) = e-112
 Identities = 137/259 (52%), Positives = 180/259 (69%)
 Frame = +3

Query: 378  KKKYLEDLLKRFSMSNCKPISSPAALNEKLQLNDGALKVDEKVYRSLVGSLLYLTHTRPD 557
            +K+Y+E +LK+F M+ C  +S+P  +NEKL+  DG   VDE  +RSLVG+LLYLT TRPD
Sbjct: 1155 QKRYVEHILKKFGMAGCNXVSTPLVVNEKLRKEDGGKMVDETHFRSLVGNLLYLTATRPD 1214

Query: 558  IMYTVSLLSRFLSEPSKLHLAAAKRVLRYLQGTKNFGLKYVKEEVNSLISFTDSD*AGSL 737
            IM+  SLLSRF+  PS LHL AAKRVLRYLQGT   G+KY +     LI   DSD  G +
Sbjct: 1215 IMFAASLLSRFMHYPSHLHLGAAKRVLRYLQGTVELGIKYFRNIEVKLIGHCDSDWGGCI 1274

Query: 738  DDKKSTSGYAFCLGSKLIAWSSKKQKIVALSSTEAEYIAVTDAICEGIWLRRILKDLQQE 917
            DD KSTSGYAF LGS +I+W SKKQ  VA SS EAEYI+ + A  + IWLRRIL+D++++
Sbjct: 1275 DDMKSTSGYAFSLGSGVISWVSKKQGSVAQSSAEAEYISASLATSQAIWLRRILEDIKEK 1334

Query: 918  VKTPTTIFCDNMSAIATTKNPVFQSRTKHIELRYHYIRHKVEKKKVELKFINTNEQLADM 1097
                T + CDN SAIA  KN VF SRT+HI ++YH+I+  +   +V+L +  + EQ AD+
Sbjct: 1335 QNEATYLLCDNKSAIAIAKNXVFHSRTRHIAVKYHFIKEVISDGEVQLMYCKSEEQXADI 1394

Query: 1098 LTKAVSMDKFV*ARGTMNV 1154
             TKA+ ++K V  R  + V
Sbjct: 1395 FTKALPLEKLVHFRKLLGV 1413



 Score =  161 bits (408), Expect(2) = e-112
 Identities = 74/127 (58%), Positives = 102/127 (80%), Gaps = 1/127 (0%)
 Frame = +2

Query: 5    EVYVEQPLGYVKEGAENKVYKLQKALYSLKQAPRTWNSKIDGYFRQTGFSKSRSEPSLYI 184
            E+YVEQP G+V +G ENKVYKL+KALY LKQAPR W ++ID YF + GF +S+SEP+LY+
Sbjct: 1030 EIYVEQPQGFVVDGEENKVYKLKKALYGLKQAPRAWYTQIDSYFIENGFIRSKSEPTLYV 1089

Query: 185  K-KEGNELLIVCLYVDDLIYTGTSMKILQDFKEAMMKEYEMTDLGLMKYFLGIQVQQSKG 361
            K K+ +++LIV LYVDDLI+TG   K+++ F+  MMK+YEM+D+GL+ YFLGI+V Q + 
Sbjct: 1090 KSKDNSQILIVALYVDDLIFTGNDEKMVEKFRNEMMKKYEMSDMGLLHYFLGIEVYQEED 1149

Query: 362  NIFISQE 382
             +FI Q+
Sbjct: 1150 GVFICQK 1156


>gb|AAG60117.1|AC073555_1 copia-type polyprotein, putative [Arabidopsis thaliana]
          Length = 1352

 Score =  256 bits (655), Expect(2) = e-106
 Identities = 123/259 (47%), Positives = 169/259 (65%)
 Frame = +3

Query: 387  YLEDLLKRFSMSNCKPISSPAALNEKLQLNDGALKVDEKVYRSLVGSLLYLTHTRPDIMY 566
            Y +++LK+F M +  P+ +P     KL   +    VD   ++SLVGSL YLT TRPDI+Y
Sbjct: 1086 YAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILY 1145

Query: 567  TVSLLSRFLSEPSKLHLAAAKRVLRYLQGTKNFGLKYVKEEVNSLISFTDSD*AGSLDDK 746
             V ++SR++  P+  H  AAKR+LRY++GT NFGL Y       L+ ++DSD  G +DD+
Sbjct: 1146 AVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDR 1205

Query: 747  KSTSGYAFCLGSKLIAWSSKKQKIVALSSTEAEYIAVTDAICEGIWLRRILKDLQQEVKT 926
            KSTSG+ F +G     W SKKQ IV LS+ EAEY+A T  +C  IWLR +LK+L    + 
Sbjct: 1206 KSTSGFVFYIGDTAFTWMSKKQPIVVLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEE 1265

Query: 927  PTTIFCDNMSAIATTKNPVFQSRTKHIELRYHYIRHKVEKKKVELKFINTNEQLADMLTK 1106
            PT IF DN SAIA  KNPVF  R+KHI+ RYHYIR  V KK V+L+++ T++Q+AD+ TK
Sbjct: 1266 PTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTK 1325

Query: 1107 AVSMDKFV*ARGTMNVAAS 1163
             +  + F+  R  + VA S
Sbjct: 1326 PLKREDFIKMRSLLGVAKS 1344



 Score =  157 bits (398), Expect(2) = e-106
 Identities = 72/127 (56%), Positives = 98/127 (77%)
 Frame = +2

Query: 2    EEVYVEQPLGYVKEGAENKVYKLQKALYSLKQAPRTWNSKIDGYFRQTGFSKSRSEPSLY 181
            EEVY+EQP GY+ +G E+KV +L+KALY LKQAPR WN++ID YF++  F K   E +LY
Sbjct: 958  EEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALY 1017

Query: 182  IKKEGNELLIVCLYVDDLIYTGTSMKILQDFKEAMMKEYEMTDLGLMKYFLGIQVQQSKG 361
            IK +  ++LI CLYVDDLI+TG +  + ++FK+ M KE+EMTD+GLM Y+LGI+V+Q   
Sbjct: 1018 IKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDN 1077

Query: 362  NIFISQE 382
             IFI+QE
Sbjct: 1078 GIFITQE 1084


>gb|AAG50698.1|AC079604_5 copia-type polyprotein, putative [Arabidopsis thaliana]
            gi|12321387|gb|AAG50765.1|AC079131_10 copia-type
            polyprotein, putative [Arabidopsis thaliana]
          Length = 1320

 Score =  256 bits (655), Expect(2) = e-106
 Identities = 123/259 (47%), Positives = 169/259 (65%)
 Frame = +3

Query: 387  YLEDLLKRFSMSNCKPISSPAALNEKLQLNDGALKVDEKVYRSLVGSLLYLTHTRPDIMY 566
            Y +++LK+F M +  P+ +P     KL   +    VD   ++SLVGSL YLT TRPDI+Y
Sbjct: 1054 YAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILY 1113

Query: 567  TVSLLSRFLSEPSKLHLAAAKRVLRYLQGTKNFGLKYVKEEVNSLISFTDSD*AGSLDDK 746
             V ++SR++  P+  H  AAKR+LRY++GT NFGL Y       L+ ++DSD  G +DD+
Sbjct: 1114 AVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDR 1173

Query: 747  KSTSGYAFCLGSKLIAWSSKKQKIVALSSTEAEYIAVTDAICEGIWLRRILKDLQQEVKT 926
            KSTSG+ F +G     W SKKQ IV LS+ EAEY+A T  +C  IWLR +LK+L    + 
Sbjct: 1174 KSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEE 1233

Query: 927  PTTIFCDNMSAIATTKNPVFQSRTKHIELRYHYIRHKVEKKKVELKFINTNEQLADMLTK 1106
            PT IF DN SAIA  KNPVF  R+KHI+ RYHYIR  V KK V+L+++ T++Q+AD+ TK
Sbjct: 1234 PTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADIFTK 1293

Query: 1107 AVSMDKFV*ARGTMNVAAS 1163
             +  + F+  R  + VA S
Sbjct: 1294 PLKREDFIKMRSLLGVAKS 1312



 Score =  157 bits (398), Expect(2) = e-106
 Identities = 72/127 (56%), Positives = 98/127 (77%)
 Frame = +2

Query: 2    EEVYVEQPLGYVKEGAENKVYKLQKALYSLKQAPRTWNSKIDGYFRQTGFSKSRSEPSLY 181
            EEVY+EQP GY+ +G E+KV +L+KALY LKQAPR WN++ID YF++  F K   E +LY
Sbjct: 926  EEVYIEQPQGYIVKGEEDKVLRLKKALYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALY 985

Query: 182  IKKEGNELLIVCLYVDDLIYTGTSMKILQDFKEAMMKEYEMTDLGLMKYFLGIQVQQSKG 361
            IK +  ++LI CLYVDDLI+TG +  + ++FK+ M KE+EMTD+GLM Y+LGI+V+Q   
Sbjct: 986  IKIQKEDILIACLYVDDLIFTGNNPSMFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDN 1045

Query: 362  NIFISQE 382
             IFI+QE
Sbjct: 1046 GIFITQE 1052


>gb|AAD50001.1|AC007259_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1352

 Score =  256 bits (655), Expect(2) = e-106
 Identities = 123/259 (47%), Positives = 168/259 (64%)
 Frame = +3

Query: 387  YLEDLLKRFSMSNCKPISSPAALNEKLQLNDGALKVDEKVYRSLVGSLLYLTHTRPDIMY 566
            Y +++LK+F M +  P+ +P     KL   +    VD   ++SLVGSL YLT TRPDI+Y
Sbjct: 1086 YAKEVLKKFKMDDSNPVCTPMECGIKLSKKEEGEGVDPTTFKSLVGSLRYLTCTRPDILY 1145

Query: 567  TVSLLSRFLSEPSKLHLAAAKRVLRYLQGTKNFGLKYVKEEVNSLISFTDSD*AGSLDDK 746
             V ++SR++  P+  H  AAKR+LRY++GT NFGL Y       L+ ++DSD  G +DD+
Sbjct: 1146 AVGVVSRYMEHPTTTHFKAAKRILRYIKGTVNFGLHYSTTSDYKLVGYSDSDWGGDVDDR 1205

Query: 747  KSTSGYAFCLGSKLIAWSSKKQKIVALSSTEAEYIAVTDAICEGIWLRRILKDLQQEVKT 926
            KSTSG+ F +G     W SKKQ IV LS+ EAEY+A T  +C  IWLR +LK+L    + 
Sbjct: 1206 KSTSGFVFYIGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELSLPQEE 1265

Query: 927  PTTIFCDNMSAIATTKNPVFQSRTKHIELRYHYIRHKVEKKKVELKFINTNEQLADMLTK 1106
            PT IF DN SAIA  KNPVF  R+KHI+ RYHYIR  V KK V+L+++ T++Q+AD  TK
Sbjct: 1266 PTKIFVDNKSAIALAKNPVFHDRSKHIDTRYHYIRECVSKKDVQLEYVKTHDQVADFFTK 1325

Query: 1107 AVSMDKFV*ARGTMNVAAS 1163
             +  + F+  R  + VA S
Sbjct: 1326 PLKRENFIKMRSLLGVAKS 1344



 Score =  157 bits (397), Expect(2) = e-106
 Identities = 72/127 (56%), Positives = 97/127 (76%)
 Frame = +2

Query: 2    EEVYVEQPLGYVKEGAENKVYKLQKALYSLKQAPRTWNSKIDGYFRQTGFSKSRSEPSLY 181
            EEVY+EQP GY+ +G E+KV +L+K LY LKQAPR WN++ID YF++  F K   E +LY
Sbjct: 958  EEVYIEQPQGYIVKGEEDKVLRLKKVLYGLKQAPRAWNTRIDKYFKEKDFIKCPYEHALY 1017

Query: 182  IKKEGNELLIVCLYVDDLIYTGTSMKILQDFKEAMMKEYEMTDLGLMKYFLGIQVQQSKG 361
            IK +  ++LI CLYVDDLI+TG +  I ++FK+ M KE+EMTD+GLM Y+LGI+V+Q   
Sbjct: 1018 IKIQKEDILIACLYVDDLIFTGNNPSIFEEFKKEMTKEFEMTDIGLMSYYLGIEVKQEDN 1077

Query: 362  NIFISQE 382
             IFI+QE
Sbjct: 1078 GIFITQE 1084

