BLASTX nr result

ID: Coptis21_contig00021721 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00021721
         (2226 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN71330.1| hypothetical protein VITISV_031551 [Vitis vinifera]   188   5e-45
ref|XP_002309551.1| predicted protein [Populus trichocarpa] gi|2...   141   6e-31
ref|XP_002324808.1| predicted protein [Populus trichocarpa] gi|2...   140   2e-30
emb|CAX46395.1| putative EMF1 protein [Rosa lucieae]                  139   4e-30
ref|XP_002516553.1| hypothetical protein RCOM_0802530 [Ricinus c...   112   3e-22

>emb|CAN71330.1| hypothetical protein VITISV_031551 [Vitis vinifera]
          Length = 1388

 Score =  188 bits (478), Expect = 5e-45
 Identities = 148/478 (30%), Positives = 216/478 (45%), Gaps = 18/478 (3%)
 Frame = -2

Query: 1763 QNMEKPSSAAQLLIDSSRSYCGNQNP----------TFSQNHEKQSSGPQFLLDNP-RIC 1617
            +N  +  ++ QL    +R++     P           F    EK SSG QF    P R  
Sbjct: 916  ENSHRQKTSVQLWKKYNRNHFNMAQPEGTHGSTGVIAFPVCQEKPSSGVQFSCPGPSRHN 975

Query: 1616 VDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQNSGRDDRRVWSRMIPNCFPFGTST 1437
                  W    V  R S T+L   EAY    N   Q+   +   VWS M PN   FG S 
Sbjct: 976  GASNCKWSRDMVGQRSSHTSLHAFEAYNACYNAPQQSE--EAAHVWSAMTPNHMAFGFSI 1033

Query: 1436 PQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDSLANHLETQKRNLELETSKRAHPE 1257
            PQ+ +  SN  +       +  K  M G   LK ++S A  LE Q R++  ET  RAH E
Sbjct: 1034 PQECATHSNNMDMISHSSNMLHKRKMTGEQDLKFLNSNAFDLEKQNRSIGSETLNRAHVE 1093

Query: 1256 YPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLMDAGFSPSTPINLEDSQRFGKQPF 1080
            YPFAC+D  ++ +P  V  +DLY+ E   A HLL LMD+G    TP +++   +F K+  
Sbjct: 1094 YPFACKDNGIKLHPKLVGSLDLYSNENIPAMHLLSLMDSGMQSRTPFSMDGDSKFLKKSA 1153

Query: 1079 LARDNHQQDMR--RFGYSKNKEGLMLPPPPRCSSRNPHPVKFTGNFPSSSDVNAVGSSIP 906
              RD   ++      G  K +     P    C  +N    +      + + V A  SS  
Sbjct: 1154 FPRDYDSREFSGLEIGAYKARNSSRQPSSDHC-GKNHLAERSCACSLAVTSVGAFASSSQ 1212

Query: 905  KNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAKMKEAPIQTN--TLRXXXXXXXXXXXXX 732
            K+  F  K    +D +       PR+     +K+   PIQ    T R             
Sbjct: 1213 KDGNF--KPAGLIDQVL------PRSRGKEKSKVLPPPIQNRGCTSRKSSSTCGDYSGTN 1264

Query: 731  NLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIKHVAMVNKV--GTASLGTSTRRREFC 558
            + E +P +   K F    + M FP     I++ I+ +A+      GT     S    E C
Sbjct: 1265 H-ESIPIHDTQKRFPGASNSMRFPLQPHTIENSIECIALETHCNGGTFWPINSRSETEIC 1323

Query: 557  SMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRTGLVEPVGNKRQKMVKLTAIKD 384
            S+N+NPADF+VP AGN YMIG  DLK  +++ S +R GL+   G+K++++VKLT++K+
Sbjct: 1324 SINKNPADFSVPVAGNIYMIGAEDLKFGKSISSENRNGLINVDGHKKKRVVKLTSVKE 1381


>ref|XP_002309551.1| predicted protein [Populus trichocarpa] gi|222855527|gb|EEE93074.1|
            predicted protein [Populus trichocarpa]
          Length = 1309

 Score =  141 bits (356), Expect = 6e-31
 Identities = 130/443 (29%), Positives = 187/443 (42%), Gaps = 11/443 (2%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQFLLDNPRIC-VDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            F Q+ EK SS  Q       +  + Q     G  V +R    N  T     T  ++  Q+
Sbjct: 878  FLQHQEKPSSRVQHSACISNVQNISQNCKQIGDVVGNRSCYANFHTPGPCNTCHSIPQQS 937

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFP-------KGSMNGNH 1347
              ++   +WS M+ N  PF  + P K   QS   N       +FP       K +MNG+ 
Sbjct: 938  --KEANHLWSSMMSNHMPFVYTIPPKCVTQSTNVN-------VFPHSSGSNLKENMNGDR 988

Query: 1346 GLKSMDSLANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SA 1170
             LK ++  A +L  Q RN   ET  RA  EYPFA +   ++ N   +  +DLY+ E   A
Sbjct: 989  ELKFLNKNAANLGKQNRNFGSETLIRARSEYPFAGKHNGIELNQKPIGSLDLYSNETIPA 1048

Query: 1169 THLLRLMDAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKE-GLMLPPPPR 993
             HLL LMDAG   S PIN++ + +F K+P +  +   ++  R      K    +  PPP 
Sbjct: 1049 MHLLSLMDAGVQSSAPINMDVNSKFLKRPSITHNPEPKEFSRLDTGAFKAVNTVKHPPPN 1108

Query: 992  CSSRNPHPVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGN 813
               +N     F  + P         SS   +D   RK        A   P      +   
Sbjct: 1109 HHGKNQLAENFRDHIPVIQTTAGASSSSILHDKGIRK--------ATDFPIQV---VQDK 1157

Query: 812  AKMKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDP 633
             K K +  +T                 N   +P +++   F    D   FP     ++ P
Sbjct: 1158 DKRKGSDSRTQNKVNRSQKSAYGGFGTNCGSIPAHNMQTMFYGASDSSMFPLPFRALEKP 1217

Query: 632  IKH-VAMVNKVGTASLGTSTRRREFCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSR 456
             KH +       T     S+   E CS+NRNPADFTVPEAGN YMI   DLK ++ V   
Sbjct: 1218 NKHKLESPANNRTVHAHKSSSETEVCSVNRNPADFTVPEAGNMYMIVGEDLKFEKEVPFV 1277

Query: 455  DRTGLVEPVGNKRQKMVKLTAIK 387
            + +  ++  G KRQ+  KL A+K
Sbjct: 1278 NGSRSLKLDGPKRQR--KLPAVK 1298


>ref|XP_002324808.1| predicted protein [Populus trichocarpa] gi|222866242|gb|EEF03373.1|
            predicted protein [Populus trichocarpa]
          Length = 540

 Score =  140 bits (352), Expect = 2e-30
 Identities = 130/441 (29%), Positives = 195/441 (44%), Gaps = 24/441 (5%)
 Frame = -2

Query: 1634 DNPRICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQNSGRDDRRVWSRMIPNCF 1455
            + P+IC  +     G+ V +R   T  QT  A  T Q++  Q+  ++  ++WS M+PN  
Sbjct: 134  NTPQICKQR-----GEVVGNRSCHTRFQTPGACNTCQSIPQQS--KEANQLWSSMMPNHM 186

Query: 1454 PFGTSTPQKFSAQSNTTNRPYQCLELFP-------KGSMNGNHGLKSMDSLANHLETQKR 1296
            PF  S P K    S         +++FP       K +MNG+  LK  +  A +L  Q R
Sbjct: 187  PFVYSIPPKCVTPSTN-------VDVFPHSPGTVLKENMNGDRVLKFPNKNAANLGKQNR 239

Query: 1295 NLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLMDAGFSPSTPI 1119
            NL  ET  RAH EYPFA +   ++ N   +  ++LY+ E   A HLL LMDAG   S PI
Sbjct: 240  NLGSETLLRAHAEYPFAGKHNGIELNHKPMGSLELYSNETIPAMHLLSLMDAGVQSSAPI 299

Query: 1118 NLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKEGLMLPPPPRC---------SSRNPHPV 966
            N++ + +F K+P +  +   ++  R      K    +  PPR          SSR+  P+
Sbjct: 300  NMDVNPKFLKRPAIIHNAEPKEFSRLDTGAYKVISSVKHPPRNHNGKNQLAESSRDLIPI 359

Query: 965  KFTGNFPSSSDV-------NAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAK 807
              T    SS  +         V    P   + +R+ GS  D   ++     +   NG   
Sbjct: 360  MQTTAGASSLSIRHDKRIRKPVDLPSPVIQYKERRKGS--DSRTQNKANRSQTSANGGFG 417

Query: 806  MKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIK 627
                 I  +++R                       P  F      +PF     P KD +K
Sbjct: 418  TNCGSIPAHSMRIMSFGAPD---------------PSVF-----SLPFRALENPNKDKLK 457

Query: 626  HVAMVNKVGTASLGTSTRRREFCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRT 447
             +     V      + T   E CS+NRNPADFT+PEAGN YMI    L+ +++V   + +
Sbjct: 458  SLDNNRIVHPHKSSSET---EVCSVNRNPADFTIPEAGNMYMIAGEALRFEKDVPFANGS 514

Query: 446  GLVEPVGNKRQKMVKLTAIKD 384
              ++  G KRQ+  KL A+KD
Sbjct: 515  HSLKLDGRKRQR--KLPAMKD 533


>emb|CAX46395.1| putative EMF1 protein [Rosa lucieae]
          Length = 483

 Score =  139 bits (349), Expect = 4e-30
 Identities = 115/396 (29%), Positives = 169/396 (42%), Gaps = 9/396 (2%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQF-LLDNPRICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            FSQ+ +K S   QF    N +    Q+  WDG  + HR+S +NLQ+  A  T Q+V    
Sbjct: 105  FSQSQKKPSPRGQFPAAGNSKCSCAQSCKWDGNMMGHRFSNSNLQSFAACNTCQSV--PQ 162

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDS 1326
            S  +   +WS +I    PF    PQK  AQS+      Q      KG+  G+  L S++ 
Sbjct: 163  SKEEAAHLWSPVISAHMPFAYENPQKGPAQSSNVKMVSQSPGSLQKGNATGDCDL-SLNL 221

Query: 1325 LANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLM 1149
             A + E +   +  ET  R +PEY F C+    + +  ++  +DLY+ E   A HLL LM
Sbjct: 222  NAPNFEKRNEAVGSETISRTNPEYSFTCKRNGTEPHQNSLGSLDLYSNETIPAMHLLSLM 281

Query: 1148 DAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKEGL------MLPPPPRCS 987
            DAG      +N+  + +F K+PF        D+   GY     GL      +  P   C 
Sbjct: 282  DAGMRSGASLNMGGNPKFPKRPF------PNDLNSKGYPGLDIGLYKAADTVNHPSSNCY 335

Query: 986  SRNPHPVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAK 807
             +N    K    FP++    A  SS   +  F R     +D ++ S             +
Sbjct: 336  GKNHLSEKSLDLFPTNPTFGASSSSFEHSKSFGRA-TDFMDQVSSSQKKE-------KIQ 387

Query: 806  MKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIK 627
               +P Q    R             N   +P + +PKGFL     + FP +   I +  K
Sbjct: 388  RSHSPAQNRGPRSQKSLAADGGFGNNRTTIPVHSIPKGFLPVSGPLMFPLHYHTIANSRK 447

Query: 626  H-VAMVNKVGTASLGTSTRRREFCSMNRNPADFTVP 522
            H +   N  GT     ++     C MNRNPADFT+P
Sbjct: 448  HNLETPNANGTMKPPKTSSESSICCMNRNPADFTIP 483


>ref|XP_002516553.1| hypothetical protein RCOM_0802530 [Ricinus communis]
            gi|223544373|gb|EEF45894.1| hypothetical protein
            RCOM_0802530 [Ricinus communis]
          Length = 1310

 Score =  112 bits (281), Expect = 3e-22
 Identities = 113/433 (26%), Positives = 167/433 (38%), Gaps = 4/433 (0%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQFLLDNP-RICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            F Q+ EK S G Q    +  R    Q   W G  V  R S + LQTS    T Q +  ++
Sbjct: 883  FFQHQEKPSYGIQHSASSSGRQNTAQDCKWIGDLVGKRSSHSCLQTSGTCNTCQGIPQKS 942

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDS 1326
              ++   +WS ++PN  PF  S  Q  S    + +          K ++NG+   K ++ 
Sbjct: 943  --KETNHLWSSVMPNHMPFVYSISQNCSTLPTSMDVLSNSPSSMNKENVNGHREFKFLNQ 1000

Query: 1325 LANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLM 1149
             A +   Q R    +  K      PFAC+   +      +   DLY+ E   A HLL LM
Sbjct: 1001 SAANFGKQNRAFGSDVLKTCAD--PFACKHNGIDLTQKPMGSFDLYSNETIPAMHLLSLM 1058

Query: 1148 DAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNK-EGLMLPPPPRCSSRNPH 972
            DAG     PINL+ + +F K+P    D   ++  R      K    M   P  C  +N  
Sbjct: 1059 DAGLQSGAPINLDMTPKFFKRPSATHDQDPKEFSRLDSGAYKVTNTMKHTPYECHGKNQA 1118

Query: 971  PVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAKMKEAP 792
                 G   +   V    +S   +D   +K       +++              K  ++ 
Sbjct: 1119 AEDSHGRLSTIPVVVGPSASSFSHDTCFKKATDFTCQVSQE---------KVKGKGSDSR 1169

Query: 791  IQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIKHVAMV 612
             Q +  R             N   +P + +   F    D   FP     ++   KH   V
Sbjct: 1170 TQNSGYRSQKSVSPSGNFGTNCGSIPVHRMQTMFFGASDSRMFPLQFRGLETSTKHKFKV 1229

Query: 611  NKVGTASLGTSTRRRE-FCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRTGLVE 435
               GT  + +     +  CS+NRNPADF+ P  GN YMI   DLK  E V   + +    
Sbjct: 1230 PS-GTRPVHSHKSSSDGICSVNRNPADFSTPGPGNLYMISGEDLKVGELVPLMNGSVSTR 1288

Query: 434  PVGNKRQKMVKLT 396
              G KRQK +  T
Sbjct: 1289 LFGQKRQKKLPTT 1301

