BLASTX nr result

ID: Coptis24_contig00004012

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00004012
         (3811 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002305691.1| predicted protein [Populus trichocarpa] gi|2...   323   2e-85
ref|XP_002331921.1| predicted protein [Populus trichocarpa] gi|2...   316   4e-83
ref|XP_003549171.1| PREDICTED: uncharacterized protein LOC100817...   313   3e-82
ref|XP_003620160.1| hypothetical protein MTR_6g077930 [Medicago ...   301   7e-79
emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]   285   9e-74

>ref|XP_002305691.1| predicted protein [Populus trichocarpa] gi|222848655|gb|EEE86202.1|
            predicted protein [Populus trichocarpa]
          Length = 1132

 Score =  323 bits (829), Expect = 2e-85
 Identities = 193/511 (37%), Positives = 281/511 (54%), Gaps = 6/511 (1%)
 Frame = -3

Query: 1886 DMKISYEDNTRQDFNSQVSSQKWIPVGRKDSEVLKNIDTVYTCNNREDVLKPCKIKDDVD 1707
            + +I+  ++ +Q+ +S    QKWIP+G ++SE+  +     +  +  D  +P +  +D  
Sbjct: 629  EKEITLAEHCKQNHSSGSVMQKWIPIGVRESELATSARFGNSLPDPSD--RPAR--EDFT 684

Query: 1706 LEEYIVPVSAPSVGPEKTCLVSRHDMSKEATMEIEMLQLNSVCKN-TEHNFRKDRPRDYI 1530
            L       S  S     + L+     S  A+   +    +    N T   F  ++     
Sbjct: 685  LRNVQENASFDSQDLVSSSLLGTCQGSGNASCSPKEDDHSQKLNNSTGWMFELNKKHVEA 744

Query: 1529 SASTKGNDTDLLSIGSEMAVKALNA---SYQLQILCESIQQATGSPLAEFEKLLQXXXXX 1359
             +ST        S   + ++K + A   + ++Q+ CE+IQ +TGSP+AEFE+ L      
Sbjct: 745  DSSTSEYSDQQFSAFEDKSIKIIQAVKDACRVQMECEAIQMSTGSPVAEFERFLHFSSPV 804

Query: 1358 XXXXXXVLLQCRACLGNPHPFSSLCKHQLANISLRDVWSWYEKPGNYGLEVRAEDTKTLK 1179
                   L  C+ CL +    +  C+H++  I L  +W WYE+ GNYGLEVRAED +  K
Sbjct: 805  ISQLPG-LSCCQTCLCDRLVGARPCRHEIPYIPLGCLWKWYEEHGNYGLEVRAEDFENSK 863

Query: 1178 GMNIDSLSFNAHFVPFLSAIQLFGHSQKSRHSESVVHLSPEFSKNGEAENNYFHVEESID 999
             + +D +SF  +FVPFLSAIQLF       H+   ++ +P+    G  E +    +    
Sbjct: 864  SLGLDCVSFRGYFVPFLSAIQLF-----KNHTSQPINKAPDHGIFGTHEASESSEDSKAG 918

Query: 998  ELASSPGIISADGRDFSGLSSAPSCSNDMNLIFEFFESEQPQQRKPLYAKVREIVKVGTS 819
             L     +I       +  S   +CS+D  L+FE+FE EQPQQR+P Y K++E+V+   S
Sbjct: 919  RLPIFSVLIPKPRTTAAAQSVDVACSDDAELLFEYFEPEQPQQRQPFYEKIQELVRGNAS 978

Query: 818  NH-QVFGDPSGLQCMQLHNIHPASWYSVAWYPIYRIPEGNFRASFLTYHSLGHLVQRCIA 642
            +  +++GDP+ L  + LH++HP SWYSVAWYPIYRIP+GNFR +FLTYHSLGHLV R   
Sbjct: 979  SRCKMYGDPTNLASLNLHDLHPRSWYSVAWYPIYRIPDGNFRTAFLTYHSLGHLVHRSAK 1038

Query: 641  EPLKGSQYCVISPVLGLQSYNAQGECWFYPKVPTESFSKIITP-FSSSEVLKERLRTLEE 465
                    CV+SPV+GLQSYNAQGECWF P+      +   TP    S ++KERLRTL E
Sbjct: 1039 FDSPSKNECVVSPVVGLQSYNAQGECWFQPRHSVNQTTG--TPSLDPSVIMKERLRTLAE 1096

Query: 464  NALLFARGCVDKGHAKVTNRQPDYEFFLSRK 372
             A L AR  V+KG+    NR PDYEFFLSR+
Sbjct: 1097 TASLMARAVVNKGNQTSVNRHPDYEFFLSRR 1127


>ref|XP_002331921.1| predicted protein [Populus trichocarpa] gi|222874593|gb|EEF11724.1|
            predicted protein [Populus trichocarpa]
          Length = 1150

 Score =  316 bits (809), Expect = 4e-83
 Identities = 196/520 (37%), Positives = 293/520 (56%), Gaps = 14/520 (2%)
 Frame = -3

Query: 1886 DMKISYEDNTRQDFNSQVSSQKWIPVGRKDSEVLKNIDTVYTCNNRED-------VLKPC 1728
            + +++  +  +Q+ +S    QKWIP+G KD E+  +     +  +  D        L+  
Sbjct: 645  EKEVTVAEYCKQNHSSVTVMQKWIPIGVKDPELTTSARFGNSSPDPSDGPAGEDLTLRNV 704

Query: 1727 KIKDDVDLEEYIVPVSAPSVGPEKTCLVSRHDMS-KEATMEIEMLQLNSVCKNTEHNFRK 1551
            + K + D ++    VS+  +G   TC  S + +   +    I+ L+ NS     E N +K
Sbjct: 705  QDKANFDSQDL---VSSLMLG---TCQDSGNAVCFPQEDDRIQKLK-NSTLWMDELN-KK 756

Query: 1550 DRPRDYISASTKGNDTDLLSIGSEMAVKALNASYQLQILCESIQQATGSPLAEFEKLLQX 1371
                D +++ +           S   ++A+  + ++Q+  E+IQ A G P+AEFE+ L  
Sbjct: 757  HVAADALTSESSYQQFSAFEDESIKIIQAVKDTCRVQMESEAIQMAAGGPIAEFERFLHL 816

Query: 1370 XXXXXXXXXXVLLQCRACLGNPHPFSSLCKHQLANISLRDVWSWYEKPGNYGLEVRAEDT 1191
                       L  C+ CL +    +SLC+H++ NI L  +W WYE+ GNYGLEVRAE+ 
Sbjct: 817  SSPVINFPS--LSCCQTCLDDRLVGASLCRHEIPNIPLGCIWKWYEEHGNYGLEVRAEEC 874

Query: 1190 KTLKGMNIDSLSFNAHFVPFLSAIQLF-GHSQK---SRHSESVVHLSPEFSKNGEAENNY 1023
            +     + D  SF+ +FVPFLSA+QLF  HS +   +++S     +S  +  +  +EN+ 
Sbjct: 875  ENSNSGSFDHFSFHGYFVPFLSAVQLFKNHSSQPINNKNSAPDHEISDTYKASESSENS- 933

Query: 1022 FHVEESIDELASSPGIISADGRDFSGLSSAPSCSNDMNLIFEFFESEQPQQRKPLYAKVR 843
                 ++  L     +I          S   +CS+   L+FE+FESEQPQQR+PLY K++
Sbjct: 934  -----NVGRLPIFSLLIPQPRTTAVAQSVNLTCSDGAELLFEYFESEQPQQRRPLYEKIQ 988

Query: 842  EIVKV-GTSNHQVFGDPSGLQCMQLHNIHPASWYSVAWYPIYRIPEGNFRASFLTYHSLG 666
            E+ +   +S ++++GDP+ L  + LH++HP SWYSVAWYPIYRIP+G+FRA+FLTYHSLG
Sbjct: 989  ELARGDASSRYKMYGDPTNLASLNLHDLHPRSWYSVAWYPIYRIPDGHFRAAFLTYHSLG 1048

Query: 665  HLVQRCIAEPLKGSQYCVISPVLGLQSYNAQGECWFYPKVPTESFSKIITPFSS-SEVLK 489
            HLV +           C++SPV+GLQSYNAQGECWF  +      +   TP S+ S +LK
Sbjct: 1049 HLVHKSAEVDYASKDACIVSPVVGLQSYNAQGECWFQLRHSVNQAAG--TPISNPSVILK 1106

Query: 488  ERLRTLEENALLFARGCVDKGHAKVTNRQPDYEFFLSRKR 369
            ERLRTL E A L AR  V+KG+    NR PDYEFFLSR R
Sbjct: 1107 ERLRTLGETASLIARAVVNKGNQTSINRHPDYEFFLSRGR 1146


>ref|XP_003549171.1| PREDICTED: uncharacterized protein LOC100817088 [Glycine max]
          Length = 1181

 Score =  313 bits (801), Expect = 3e-82
 Identities = 202/554 (36%), Positives = 301/554 (54%), Gaps = 25/554 (4%)
 Frame = -3

Query: 1955 TEHVSVESLNASLDKIQRNTFGSDMKISYEDNTRQDFNSQVSSQKWIPVGRKDSEVLKNI 1776
            +E +S +S N   D++ +N    + ++S  D   Q+ +S  +  KWIPVG+KD  + K+ 
Sbjct: 646  SEELSPDSCNLEGDEVGQN----EKEVSSADYNAQNHSSGTTLWKWIPVGKKDRGLEKSE 701

Query: 1775 DTVYTCNNREDVLKPCKIKDDVDLEEYIVPVSAPSVGPE-----KTCLVSRHDMSKEATM 1611
                   N +        +++ + E  + P  A S  P+     + C    +D       
Sbjct: 702  SNSAPPENSD-----ASSRNNSNSESSVEPEVASSENPDSLNASRACNGQIYD-KVSCLD 755

Query: 1610 EIEMLQLNSVCKNT--EHNFRKDRPRDYISASTKGNDTDLLSIGSEMAVKALNASYQLQI 1437
            E E  ++ S    T  EH   +D+         +  + D+L   S    +A+N + + Q+
Sbjct: 756  EGENHKMGSQVARTLTEH---RDKHEAANHMFYECENQDMLENYSYRIAQAVNDACKAQL 812

Query: 1436 LCESIQQATGSPLAEFEKLLQXXXXXXXXXXXVLLQCRACLGNPHPFSSLCKHQLANISL 1257
             CE++  ATG P+AEFE+LL                C AC  N    +SLC+H++ ++SL
Sbjct: 813  ACEAVHMATGGPVAEFERLLHFCSPVICKSLSSH-SCSACSHNHGGGASLCRHEIPDLSL 871

Query: 1256 RDVWSWYEKPGNYGLEVRAEDTKTLKGMN-IDSLSFNAHFVPFLSAIQLFGHSQK----- 1095
              +W WYEK G+YGLE+RA+  +  K    +    F A+FVP LSA+QLF + +      
Sbjct: 872  GCLWQWYEKHGSYGLEIRAQGHENPKRQGGVADFPFRAYFVPSLSAVQLFKNHENLCVNN 931

Query: 1094 ------SRHSESVVHLSPEFSKNGEAENNYFHV----EESIDELASSPG-IISADGRDFS 948
                  S  SE+   +    + +  ++++ F V      + D+ + +P    S +     
Sbjct: 932  GDRLPNSEVSEACEMVDISANSSTASQHSIFSVLFPQPRNQDKSSQTPKETASINNASIP 991

Query: 947  GLSSAPSCSNDMNLIFEFFESEQPQQRKPLYAKVREIVKVGTS-NHQVFGDPSGLQCMQL 771
             ++S  +CS D+ L+FE+FE EQPQQR+PLY K++E+V+         +GDP+ L  + L
Sbjct: 992  SINS--TCSGDLELLFEYFEFEQPQQRQPLYEKIQELVRGHIPIESSTYGDPTKLDSINL 1049

Query: 770  HNIHPASWYSVAWYPIYRIPEGNFRASFLTYHSLGHLVQRCIAEPLKGSQYCVISPVLGL 591
             ++HP SW+SVAWYPIYRIP+GNFRASFLTYHSLGHLV+R  ++ L     C++SP +GL
Sbjct: 1050 RDLHPRSWFSVAWYPIYRIPDGNFRASFLTYHSLGHLVRRRTSD-LSTVGSCIVSPTVGL 1108

Query: 590  QSYNAQGECWFYPKVPTESFSKIITPFSSSEVLKERLRTLEENALLFARGCVDKGHAKVT 411
            QSYNAQGECWF  K    +   +      S +LKERLRTLEE A L AR  V+KG+   T
Sbjct: 1109 QSYNAQGECWFQLKHSAPAAEMV--NLEPSLLLKERLRTLEETASLMARAVVNKGNLTCT 1166

Query: 410  NRQPDYEFFLSRKR 369
            NR PDYEFFLSR+R
Sbjct: 1167 NRHPDYEFFLSRRR 1180


>ref|XP_003620160.1| hypothetical protein MTR_6g077930 [Medicago truncatula]
            gi|355495175|gb|AES76378.1| hypothetical protein
            MTR_6g077930 [Medicago truncatula]
          Length = 1107

 Score =  301 bits (772), Expect = 7e-79
 Identities = 196/522 (37%), Positives = 280/522 (53%), Gaps = 7/522 (1%)
 Frame = -3

Query: 1913 KIQRNTFGSDMK-ISYEDNTRQDFNS-QVSSQKWIPVGRKDSEVLKNIDTVYTCN-NRED 1743
            K+  N  G  +K +S  D   Q+ +S   +  KWIPVG+KD+ + K+     +   + E 
Sbjct: 612  KLLDNQVGQTVKEVSSADYNGQNHSSGSTALWKWIPVGKKDAGMAKSESNSSSSQYSDEP 671

Query: 1742 VLKPCKIKDDVDLEEYIVPVSAPSVGPEKTCLVSRHDMSKEATMEIEMLQLNSVCKNTEH 1563
              K   +++ ++ +   +  +  S    +T  + R +       E        +  +   
Sbjct: 672  TSKIIDMENGLEPKSDSLSQNQDSSPDTRTTSIGRIEGENHKLGE-------EIAGSLTE 724

Query: 1562 NFRKDRPRDYISASTKGNDTDLLSIGSEMAVKALNASYQLQILCESIQQATGSPLAEFEK 1383
               K +  ++I    +     LL   S    +A+N + ++Q+ C+ + + TG+P+AEFEK
Sbjct: 725  RMDKHQVDNHIIYECESQC--LLENDSYRIAQAVNDACRVQLACDVVHKVTGAPVAEFEK 782

Query: 1382 LLQXXXXXXXXXXXVLLQCRACLGNPHPFSSLCKHQLANISLRDVWSWYEKPGNYGLEVR 1203
            LL             L  C  C  N      LC+H++  +SL  +W WYEK G+YGLE+R
Sbjct: 783  LLHFCSPVICRSPDSL-GCFTCAKNHLIGVPLCRHEIPEVSLGCLWEWYEKHGSYGLEIR 841

Query: 1202 A---EDTKTLKGMNIDSLSFNAHFVPFLSAIQLFGHSQKSRHSESVVHLSPEFSKNGEAE 1032
            A   ED KTL G  +    F A+FVP LSA+QLF + +    + SV  L+ + S+  E  
Sbjct: 842  AWDYEDPKTLGG--VGHFPFRAYFVPSLSAVQLFKNRESRCVNNSVSFLNCKVSEACEMI 899

Query: 1031 NNYFHVEESIDELASSPGIISADGRDFSGLSSAPSCSNDMNLIFEFFESEQPQQRKPLYA 852
            +N    E+S     S+    S D           +CS D  L+FE+FE EQPQQR+PLY 
Sbjct: 900  DNS---EDSFIGRFSNASNPSTDS----------TCSGDSELLFEYFECEQPQQRRPLYE 946

Query: 851  KVREIVKVGTS-NHQVFGDPSGLQCMQLHNIHPASWYSVAWYPIYRIPEGNFRASFLTYH 675
            +++E+V+       + +GD + L+ + L ++HP SWYSVAWYPIYRIP+GNFRASFLTYH
Sbjct: 947  RIQELVRGDVQIQSKTYGDATKLESINLRDLHPRSWYSVAWYPIYRIPDGNFRASFLTYH 1006

Query: 674  SLGHLVQRCIAEPLKGSQYCVISPVLGLQSYNAQGECWFYPKVPTESFSKIITPFSSSEV 495
            SLGHLV R           CV+SP +GLQSYNAQGECWF     T     +    + S  
Sbjct: 1007 SLGHLVCRSSNSDSPTLDSCVVSPAVGLQSYNAQGECWFQLNQSTRRTEML--GINPSVF 1064

Query: 494  LKERLRTLEENALLFARGCVDKGHAKVTNRQPDYEFFLSRKR 369
            L+ERLRTLEE A L AR  V+KG+   TNR PDYEFFLSR+R
Sbjct: 1065 LQERLRTLEETASLMARADVNKGNQTCTNRHPDYEFFLSRRR 1106


>emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]
          Length = 1239

 Score =  285 bits (728), Expect = 9e-74
 Identities = 189/526 (35%), Positives = 273/526 (51%), Gaps = 20/526 (3%)
 Frame = -3

Query: 1886 DMKISYEDNTRQDFNSQVSSQKWIPVGRKDSEV--LKNIDTVYTCNNREDVLKPCKIKDD 1713
            D ++   +N++Q+ +S    +KW PV +K+S    L   D     +  E   +    K+ 
Sbjct: 731  DKEVYLSENSKQEHSSASVMKKWKPVAKKNSGFASLGRSDISLLAHADEPAAEGWTPKNS 790

Query: 1712 VDLEEYIVPVSAPSVGPEKTCLVSRHDMSKEATMEIEMLQLNSVCKNTEHNFRKDRPR-D 1536
            V+ +         S    +   V     +   +   +   + + C  T        P  +
Sbjct: 791  VEEKASSNSHKPISSNDSEIMCVDHSFGNANCSSPEDKSPIQNTC--TPKQLXNKHPAVN 848

Query: 1535 YISASTKGNDTDLLSIGSEMAVKALNASYQLQILCESIQQATGSPLAEFEKLLQXXXXXX 1356
              + S K          S     AL+ +Y++Q L ES+Q ATG P+A+FE+LL       
Sbjct: 849  CFTHSCKEKHIYAFGADSSKISGALHDAYRVQQLSESVQLATGCPIADFERLLHAASPII 908

Query: 1355 XXXXXVLLQCRACL----GNPHPFSSLCKHQLANISLRDVWSWYEKPGNYGLEVRAEDTK 1188
                 V + C+ C+    G P     LC+H+  NI+LR +W WYEK G+YGLEVR ED +
Sbjct: 909  CRSNSVKI-CQTCVRDEVGRP-----LCRHEAPNITLRSLWKWYEKHGSYGLEVRLEDCE 962

Query: 1187 TLKGMNIDSLSFNAHFVPFLSAIQLFGHSQKSRHSES--VVHLSPEFSKNGEAENNYFHV 1014
              K +     +F A+FVP LSA+QLF    +S H ++  VV  + E SK  ++  N   +
Sbjct: 963  YSKRLGFYHSAFRAYFVPSLSAVQLF-KKPRSHHMDNGPVVSRACEMSKTSQSSFNIGQL 1021

Query: 1013 --------EESIDELASSPGIISADGRDFSGLSSA--PSCSNDMNLIFEFFESEQPQQRK 864
                        +E + SP          S +S +   + ++D  L+FE+FES+QPQ RK
Sbjct: 1022 PIFSILFPRPCTEETSFSPLENQMHSSQVSSMSQSVDTTITDDSELLFEYFESDQPQLRK 1081

Query: 863  PLYAKVREIVKV-GTSNHQVFGDPSGLQCMQLHNIHPASWYSVAWYPIYRIPEGNFRASF 687
            PL+ K++E+V   G S ++V+GDP+ L  M L  +H +SWYSVAWYPIYRIP+G FRA+F
Sbjct: 1082 PLFEKIKELVSGDGPSWNKVYGDPTKLDSMNLDELHHSSWYSVAWYPIYRIPDGEFRAAF 1141

Query: 686  LTYHSLGHLVQRCIAEPLKGSQYCVISPVLGLQSYNAQGECWFYPKVPTESFSKIITPFS 507
            LTYHS GHLV R           C++SPV+GLQSYNAQ         P  S ++      
Sbjct: 1142 LTYHSFGHLVHRSSTFDSHRKDACIVSPVVGLQSYNAQ---------PILSQTEETXNLK 1192

Query: 506  SSEVLKERLRTLEENALLFARGCVDKGHAKVTNRQPDYEFFLSRKR 369
             SE+L++RL+TLE  A L AR  V KG+ K  NR PDYEFFLSR+R
Sbjct: 1193 PSEILRKRLKTLEXTASLMARAEVSKGNLKSVNRHPDYEFFLSRQR 1238

