BLASTX nr result
ID: Coptis21_contig00021721
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00021721
         (2226 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CAN71330.1| hypothetical protein VITISV_031551 [Vitis vinifera]   188   5e-45
ref|XP_002309551.1| predicted protein [Populus trichocarpa] gi|2...   141   6e-31
ref|XP_002324808.1| predicted protein [Populus trichocarpa] gi|2...   140   2e-30
emb|CAX46395.1| putative EMF1 protein [Rosa lucieae]                  139   4e-30
ref|XP_002516553.1| hypothetical protein RCOM_0802530 [Ricinus c...   112   3e-22

>emb|CAN71330.1| hypothetical protein VITISV_031551 [Vitis vinifera]
          Length = 1388

 Score = 188 bits (478), Expect = 5e-45
 Identities = 148/478 (30%), Positives = 216/478 (45%), Gaps = 18/478 (3%)
 Frame = -2

Query: 1763 QNMEKPSSAAQLLIDSSRSYCGNQNP----------TFSQNHEKQSSGPQFLLDNP-RIC 1617
            +N + ++ QL +R++ P F EK SSG QF P R
Sbjct: 916  ENSHRQKTSVQLWKKYNRNHFNMAQPEGTHGSTGVIAFPVCQEKPSSGVQFSCPGPSRHN 975

Query: 1616 VDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQNSGRDDRRVWSRMIPNCFPFGTST 1437
            W V R S T+L EAY N Q+ + VWS M PN FG S
Sbjct: 976  GASNCKWSRDMVGQRSSHTSLHAFEAYNACYNAPQQSE--EAAHVWSAMTPNHMAFGFSI 1033

Query: 1436 PQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDSLANHLETQKRNLELETSKRAHPE 1257
            PQ+ + SN + + K M G LK ++S A LE Q R++ ET RAH E
Sbjct: 1034 PQECATHSNNMDMISHSSNMLHKRKMTGEQDLKFLNSNAFDLEKQNRSIGSETLNRAHVE 1093

Query: 1256 YPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLMDAGFSPSTPINLEDSQRFGKQPF 1080
            YPFAC+D ++ +P V +DLY+ E A HLL LMD+G TP +++ +F K+
Sbjct: 1094 YPFACKDNGIKLHPKLVGSLDLYSNENIPAMHLLSLMDSGMQSRTPFSMDGDSKFLKKSA 1153

Query: 1079 LARDNHQQDMR--RFGYSKNKEGLMLPPPPRCSSRNPHPVKFTGNFPSSSDVNAVGSSIP 906
            RD ++ G K + P C +N + + + V A SS
Sbjct: 1154 FPRDYDSREFSGLEIGAYKARNSSRQPSSDHC-GKNHLAERSCACSLAVTSVGAFASSSQ 1212

Query: 905  KNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAKMKEAPIQTN--TLRXXXXXXXXXXXXX 732
            K+ F K +D + PR+ +K+ PIQ T R
Sbjct: 1213 KDGNF--KPAGLIDQVL------PRSRGKEKSKVLPPPIQNRGCTSRKSSSTCGDYSGTN 1264

Query: 731  NLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIKHVAMVNKV--GTASLGTSTRRREFC 558
            + E +P + K F + M FP I++ I+ +A+ GT S E C
Sbjct: 1265 H-ESIPIHDTQKRFPGASNSMRFPLQPHTIENSIECIALETHCNGGTFWPINSRSETEIC 1323

Query: 557  SMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRTGLVEPVGNKRQKMVKLTAIKD 384
            S+N+NPADF+VP AGN YMIG DLK +++ S +R GL+ G+K++++VKLT++K+
Sbjct: 1324 SINKNPADFSVPVAGNIYMIGAEDLKFGKSISSENRNGLINVDGHKKKRVVKLTSVKE 1381

>ref|XP_002309551.1| predicted protein [Populus trichocarpa]
 gi|222855527|gb|EEE93074.1| predicted protein [Populus trichocarpa]
          Length = 1309

 Score = 141 bits (356), Expect = 6e-31
 Identities = 130/443 (29%), Positives = 187/443 (42%), Gaps = 11/443 (2%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQFLLDNPRIC-VDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            F Q+ EK SS Q + + Q G V +R N T T ++ Q+
Sbjct: 878  FLQHQEKPSSRVQHSACISNVQNISQNCKQIGDVVGNRSCYANFHTPGPCNTCHSIPQQS 937

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFP-------KGSMNGNH 1347
            ++ +WS M+ N PF + P K QS N +FP K +MNG+
Sbjct: 938  --KEANHLWSSMMSNHMPFVYTIPPKCVTQSTNVN-------VFPHSSGSNLKENMNGDR 988

Query: 1346 GLKSMDSLANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SA 1170
            LK ++ A +L Q RN ET RA EYPFA + ++ N + +DLY+ E A
Sbjct: 989  ELKFLNKNAANLGKQNRNFGSETLIRARSEYPFAGKHNGIELNQKPIGSLDLYSNETIPA 1048

Query: 1169 THLLRLMDAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKE-GLMLPPPPR 993
            HLL LMDAG S PIN++ + +F K+P + + ++ R K + PPP
Sbjct: 1049 MHLLSLMDAGVQSSAPINMDVNSKFLKRPSITHNPEPKEFSRLDTGAFKAVNTVKHPPPN 1108

Query: 992  CSSRNPHPVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGN 813
            +N F + P SS +D RK A P +
Sbjct: 1109 HHGKNQLAENFRDHIPVIQTTAGASSSSILHDKGIRK--------ATDFPIQV---VQDK 1157

Query: 812  AKMKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDP 633
            K K + +T N +P +++ F D FP ++ P
Sbjct: 1158 DKRKGSDSRTQNKVNRSQKSAYGGFGTNCGSIPAHNMQTMFYGASDSSMFPLPFRALEKP 1217

Query: 632  IKH-VAMVNKVGTASLGTSTRRREFCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSR 456
            KH + T S+ E CS+NRNPADFTVPEAGN YMI DLK ++ V
Sbjct: 1218 NKHKLESPANNRTVHAHKSSSETEVCSVNRNPADFTVPEAGNMYMIVGEDLKFEKEVPFV 1277

Query: 455  DRTGLVEPVGNKRQKMVKLTAIK 387
            + + ++ G KRQ+ KL A+K
Sbjct: 1278 NGSRSLKLDGPKRQR--KLPAVK 1298

>ref|XP_002324808.1| predicted protein [Populus trichocarpa]
 gi|222866242|gb|EEF03373.1| predicted protein [Populus trichocarpa]
          Length = 540

 Score = 140 bits (352), Expect = 2e-30
 Identities = 130/441 (29%), Positives = 195/441 (44%), Gaps = 24/441 (5%)
 Frame = -2

Query: 1634 DNPRICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQNSGRDDRRVWSRMIPNCF 1455
            + P+IC + G+ V +R T QT A T Q++ Q+ ++ ++WS M+PN
Sbjct: 134  NTPQICKQR-----GEVVGNRSCHTRFQTPGACNTCQSIPQQS--KEANQLWSSMMPNHM 186

Query: 1454 PFGTSTPQKFSAQSNTTNRPYQCLELFP-------KGSMNGNHGLKSMDSLANHLETQKR 1296
            PF S P K S +++FP K +MNG+ LK + A +L Q R
Sbjct: 187  PFVYSIPPKCVTPSTN-------VDVFPHSPGTVLKENMNGDRVLKFPNKNAANLGKQNR 239

Query: 1295 NLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLMDAGFSPSTPI 1119
            NL ET RAH EYPFA + ++ N + ++LY+ E A HLL LMDAG S PI
Sbjct: 240  NLGSETLLRAHAEYPFAGKHNGIELNHKPMGSLELYSNETIPAMHLLSLMDAGVQSSAPI 299

Query: 1118 NLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKEGLMLPPPPRC---------SSRNPHPV 966
            N++ + +F K+P + + ++ R K + PPR SSR+ P+
Sbjct: 300  NMDVNPKFLKRPAIIHNAEPKEFSRLDTGAYKVISSVKHPPRNHNGKNQLAESSRDLIPI 359

Query: 965  KFTGNFPSSSDV-------NAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAK 807
            T SS + V P + +R+ GS D ++ + NG
Sbjct: 360  MQTTAGASSLSIRHDKRIRKPVDLPSPVIQYKERRKGS--DSRTQNKANRSQTSANGGFG 417

Query: 806  MKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIK 627
            I +++R P F +PF P KD +K
Sbjct: 418  TNCGSIPAHSMRIMSFGAPD---------------PSVF-----SLPFRALENPNKDKLK 457

Query: 626  HVAMVNKVGTASLGTSTRRREFCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRT 447
            + V + T E CS+NRNPADFT+PEAGN YMI L+ +++V + +
Sbjct: 458  SLDNNRIVHPHKSSSET---EVCSVNRNPADFTIPEAGNMYMIAGEALRFEKDVPFANGS 514

Query: 446  GLVEPVGNKRQKMVKLTAIKD 384
            ++ G KRQ+ KL A+KD
Sbjct: 515  HSLKLDGRKRQR--KLPAMKD 533

>emb|CAX46395.1| putative EMF1 protein [Rosa lucieae]
          Length = 483

 Score = 139 bits (349), Expect = 4e-30
 Identities = 115/396 (29%), Positives = 169/396 (42%), Gaps = 9/396 (2%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQF-LLDNPRICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            FSQ+ +K S QF N + Q+ WDG + HR+S +NLQ+ A T Q+V
Sbjct: 105  FSQSQKKPSPRGQFPAAGNSKCSCAQSCKWDGNMMGHRFSNSNLQSFAACNTCQSV--PQ 162

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDS 1326
            S + +WS +I PF PQK AQS+ Q KG+ G+ L S++
Sbjct: 163  SKEEAAHLWSPVISAHMPFAYENPQKGPAQSSNVKMVSQSPGSLQKGNATGDCDL-SLNL 221

Query: 1325 LANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLM 1149
            A + E + + ET R +PEY F C+ + + ++ +DLY+ E A HLL LM
Sbjct: 222  NAPNFEKRNEAVGSETISRTNPEYSFTCKRNGTEPHQNSLGSLDLYSNETIPAMHLLSLM 281

Query: 1148 DAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNKEGL------MLPPPPRCS 987
            DAG +N+ + +F K+PF D+ GY GL + P C
Sbjct: 282  DAGMRSGASLNMGGNPKFPKRPF------PNDLNSKGYPGLDIGLYKAADTVNHPSSNCY 335

Query: 986  SRNPHPVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAK 807
            +N K FP++ A SS + F R +D ++ S +
Sbjct: 336  GKNHLSEKSLDLFPTNPTFGASSSSFEHSKSFGRA-TDFMDQVSSSQKKE-------KIQ 387

Query: 806  MKEAPIQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIK 627
            +P Q R N +P + +PKGFL + FP + I + K
Sbjct: 388  RSHSPAQNRGPRSQKSLAADGGFGNNRTTIPVHSIPKGFLPVSGPLMFPLHYHTIANSRK 447

Query: 626  H-VAMVNKVGTASLGTSTRRREFCSMNRNPADFTVP 522
            H + N GT ++ C MNRNPADFT+P
Sbjct: 448  HNLETPNANGTMKPPKTSSESSICCMNRNPADFTIP 483

>ref|XP_002516553.1| hypothetical protein RCOM_0802530 [Ricinus communis]
 gi|223544373|gb|EEF45894.1| hypothetical protein RCOM_0802530 [Ricinus communis]
          Length = 1310

 Score = 112 bits (281), Expect = 3e-22
 Identities = 113/433 (26%), Positives = 167/433 (38%), Gaps = 4/433 (0%)
 Frame = -2

Query: 1682 FSQNHEKQSSGPQFLLDNP-RICVDQTSNWDGKSVAHRYSQTNLQTSEAYETFQNVSGQN 1506
            F Q+ EK S G Q + R Q W G V R S + LQTS T Q + ++
Sbjct: 883  FFQHQEKPSYGIQHSASSSGRQNTAQDCKWIGDLVGKRSSHSCLQTSGTCNTCQGIPQKS 942

Query: 1505 SGRDDRRVWSRMIPNCFPFGTSTPQKFSAQSNTTNRPYQCLELFPKGSMNGNHGLKSMDS 1326
            ++ +WS ++PN PF S Q S + + K ++NG+ K ++
Sbjct: 943  --KETNHLWSSVMPNHMPFVYSISQNCSTLPTSMDVLSNSPSSMNKENVNGHREFKFLNQ 1000

Query: 1325 LANHLETQKRNLELETSKRAHPEYPFACRDKEVQFNPLAVRPVDLYNKEQ-SATHLLRLM 1149
            A + Q R + K PFAC+ + + DLY+ E A HLL LM
Sbjct: 1001 SAANFGKQNRAFGSDVLKTCAD--PFACKHNGIDLTQKPMGSFDLYSNETIPAMHLLSLM 1058

Query: 1148 DAGFSPSTPINLEDSQRFGKQPFLARDNHQQDMRRFGYSKNK-EGLMLPPPPRCSSRNPH 972
            DAG PINL+ + +F K+P D ++ R K M P C +N
Sbjct: 1059 DAGLQSGAPINLDMTPKFFKRPSATHDQDPKEFSRLDSGAYKVTNTMKHTPYECHGKNQA 1118

Query: 971  PVKFTGNFPSSSDVNAVGSSIPKNDFFQRKHGSRVDPLAKSMPTSPRAHLNGNAKMKEAP 792
            G + V +S +D +K +++ K ++
Sbjct: 1119 AEDSHGRLSTIPVVVGPSASSFSHDTCFKKATDFTCQVSQE---------KVKGKGSDSR 1169

Query: 791  IQTNTLRXXXXXXXXXXXXXNLELVPRNHLPKGFLSTPDRMPFPTNSLPIKDPIKHVAMV 612
            Q + R N +P + + F D FP ++ KH V
Sbjct: 1170 TQNSGYRSQKSVSPSGNFGTNCGSIPVHRMQTMFFGASDSRMFPLQFRGLETSTKHKFKV 1229

Query: 611  NKVGTASLGTSTRRRE-FCSMNRNPADFTVPEAGNEYMIGYGDLKPKENVLSRDRTGLVE 435
            GT + + + CS+NRNPADF+ P GN YMI DLK E V + +
Sbjct: 1230 PS-GTRPVHSHKSSSDGICSVNRNPADFSTPGPGNLYMISGEDLKVGELVPLMNGSVSTR 1288

Query: 434  PVGNKRQKMVKLT 396
            G KRQK + T
Sbjct: 1289 LFGQKRQKKLPTT 1301