BLASTX nr result
ID: Coptis24_contig00008500
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00008500 (3189 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241... 482 e-133 ref|XP_003544160.1| PREDICTED: uncharacterized protein LOC100799... 423 e-115 ref|XP_003542703.1| PREDICTED: uncharacterized protein LOC100800... 421 e-115 ref|XP_002513834.1| conserved hypothetical protein [Ricinus comm... 417 e-113 ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago ... 401 e-109 >ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera] Length = 665 Score = 482 bits (1240), Expect = e-133 Identities = 313/665 (47%), Positives = 398/665 (59%), Gaps = 58/665 (8%) Frame = +2 Query: 980 MERSEPTLVPQWLKGTPNVTGAAXXXXXXXXXXXXXDEPGVTLPTRNRSSLGIADHD--- 1150 M+++EP LVP+WLK + +VTG D+ P R + + DHD Sbjct: 1 MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPAR-KLMVNSNDHDTGR 59 Query: 1151 SSSYSFASLDRFRKSSSSNGSIGHDKDNTSHLXXXXXXXXXXXXXXWDKDTLGFREKDRS 1330 SS+ + FR+SSSSNGS GH + +S W+KD +R+KD+S Sbjct: 60 SSNLERTTSSYFRRSSSSNGS-GHPRSFSSF-------GRTNREREWEKDIHDYRDKDKS 111 Query: 1331 IPGDKKSRAYSD--ADVFTSRIEKDMFKRSQSMISGKRDEVRPRRVVADVXXXXXXXXXX 1504 + D + R YSD ++ R+E+DM +RSQSMI+GKR ++ PR+V ADV Sbjct: 112 VLSDHRHRDYSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSN 171 Query: 1505 XXXXXXKVI---SVHKATFERDFPSLGVEEKQV---VERVGSPGLNIAAQNLPMGTSAMI 1666 I SV KA F+R+FPSLG E+KQ + RV SPGL A Q+LP+G + +I Sbjct: 172 GDGQLASGIVTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVI 231 Query: 1667 MGNGWTSALAEVPVISGKNSLGLSSVAQTAASSSGSLAPSTTTGLNMAETLAQAPSRAR- 1843 G+GWTSALAEVPVI G N+ G+SSV Q+ ++SS S+APSTT+GLNMAETL Q P+RAR Sbjct: 232 GGDGWTSALAEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARA 291 Query: 1844 -TTPQLSVETQRLEELAIKQSRQLIPMTPSMTKSSVLNSEKQKPKASARSEMSLASKIAQ 2020 TPQLSV TQRLEELA+KQSRQLIPMTPSM K +++ S KPK SKI Sbjct: 292 NATPQLSVGTQRLEELALKQSRQLIPMTPSMPK-TLVPSPSDKPK----------SKIGL 340 Query: 2021 QQLTSTQPVNSLRGGTARSDVPKTSHGGKLLVLKAGRE-NGTSHPTKDSSSPTNASRIAS 2197 Q L +S RGG ARSDV KTS+ GKL VLK RE NG S KDS SPT SR+A+ Sbjct: 341 QPLHLVN--HSQRGGPARSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVAN 398 Query: 2198 NPLAVVAPTVGLAPMKS-QNSPKVAVAAERKATVLPANNTSSIERRPINSQAQSRNDFFN 2374 +PLAV G A ++S +N+P +A A R + VL +S+E+RP SQAQSRNDFFN Sbjct: 399 SPLAVTPSAAGSASLRSPRNNPTLASAERRPSVVL-----TSVEKRP-TSQAQSRNDFFN 452 Query: 2375 LMRKKTVTNNSSAAPDPGPVPSPNILEKSGELMTE-VTNDVSIQGNDITASDHSSL---N 2542 LMRKK+ TN SA P+ GP S ++ EKS EL+TE VT V+ +G DI +SD+S L N Sbjct: 453 LMRKKSSTNPPSAVPESGPAVSSSVSEKSDELITEVVTAPVTPKGRDILSSDNSGLDWSN 512 Query: 2543 ADHG---------------------------------------AVVASNGDVYEESQSSI 2605 + G V NGD + SQ + Sbjct: 513 ENRGDKTENGNNEACGVSQNDRDDEIDNVNGDACDVSQRDQGDEVHDGNGDACDVSQKFL 572 Query: 2606 NNGVKHSSSDAILYPDEEEAAFLRSLGWDEGAGDEEGLTEEEINAFYEEYMKLRPSAKLC 2785 +NG KHSS D +LYPDEEEAAFLRSLGW+E G++EGLTEEEINAFY+E MKL+PS+ L Sbjct: 573 DNGEKHSSPDEVLYPDEEEAAFLRSLGWEEN-GEDEGLTEEEINAFYKECMKLKPSSNLL 631 Query: 2786 QGKMP 2800 Q +P Sbjct: 632 QRMLP 636 >ref|XP_003544160.1| PREDICTED: uncharacterized protein LOC100799079 [Glycine max] Length = 618 Score = 423 bits (1087), Expect = e-115 Identities = 286/626 (45%), Positives = 365/626 (58%), Gaps = 19/626 (3%) Frame = +2 Query: 980 MERSEPTLVPQWLKGTPNVTGAAXXXXXXXXXXXXXDEPGVTLPTRNRSSLGIADHDSSS 1159 MERSEP LVP+WL+ +V GA D +L RN+SS +D DS+ Sbjct: 1 MERSEPALVPEWLRSAGSVAGAGSSAQQFASSSAHTD----SLSVRNKSSKNGSDFDSAR 56 Query: 1160 YSFASLDRFRKS----SSSNGSIGH--DKDNTSHLXXXXXXXXXXXXXXWDKDTLGFREK 1321 F L+R S SS NGS H N +H DKD REK Sbjct: 57 SVF--LERTSSSNSRRSSMNGSAKHAYSSFNRNHR---------------DKDR--DREK 97 Query: 1322 DRSIPGDKKSRAYSD--ADVFTSRIEKDMFKRSQSMISGKRDEVRPRRVVADVXXXXXXX 1495 DRS GD SD A++F R+E+D +RS SM+S K++EV PRRVV D Sbjct: 98 DRSSFGDHWDCDGSDPLANIFPGRMERDTLRRSHSMVSRKQNEVIPRRVVVDTKSGGSHQ 157 Query: 1496 XXXXXXXXXKVIS--VHKATFERDFPSLGVEEKQVVE---RVGSPGLNIAA-QNLPMGTS 1657 +S + KA F++DFPSL EEKQ + RV SP L AA Q+LP+G+S Sbjct: 158 NNSNGILSGSNVSNSIQKAVFDKDFPSLSTEEKQGIADVVRVSSPALGAAASQSLPVGSS 217 Query: 1658 AMIMGNGWTSALAEVPVISGKNSLGLSSVAQTAASSSGSLAPSTTTGLNMAETLAQAPSR 1837 A+I G GWTSALAEVP I G +S G SV QT ++SGS+A STT GLNMAE LAQ PSR Sbjct: 218 ALIGGEGWTSALAEVPAIIGSSSTGSLSVQQTVNTTSGSVASSTTAGLNMAEALAQTPSR 277 Query: 1838 ARTTPQLSVETQRLEELAIKQSRQLIPMTPSMTKSSVLNSEKQKPKASAR-SEMSLASKI 2014 AR+ PQ+ V+TQRLEELAIKQSRQLIP+TPSM K+SV NSEK KPK + R ++M++ +K Sbjct: 278 ARSAPQVLVKTQRLEELAIKQSRQLIPVTPSMPKASVHNSEKSKPKTAIRNADMNVVTKS 337 Query: 2015 AQQQLTSTQPVN-SLRGGTARSDVPKTSHGGKLLVLKA-GRENGTSHPTKDSSSPTNASR 2188 QQ + N S+R ++ D PKTS GK LK+ ENGTS +KD S+PTN S Sbjct: 338 VPQQPPALHIANQSVRSVNSKVDAPKTS--GKFTDLKSVVWENGTSPTSKDVSNPTNYSN 395 Query: 2189 IASNPLAVVAPTVGLAPMKSQNSPKVAVAAERKATVLPANNTSSIERRPINSQAQSRNDF 2368 VA AP+++ N+ K ERK T + S++E++ SQ QSRNDF Sbjct: 396 SKPGNQHAVALGAASAPLRNPNNLK--SPTERKPTSMDLKLGSNLEKKHSISQVQSRNDF 453 Query: 2369 FNLMRKKTVTNNSSAAPDPGPVPSPNILEKSGELMTEVTNDVSIQGNDITASDHSSLNAD 2548 FNL++KKT+ N+S+ PD GP+ S +EKSGE+ V I + S + Sbjct: 454 FNLIKKKTLMNSSAVLPDSGPMVSSPAMEKSGEVNRGV----------IVSPSASPQSHG 503 Query: 2549 HGAVVASNG-DVYEESQSSINNGVKHSSSDAILYPDEEEAAFLRSLGWDEGAGDEEGLTE 2725 +G + SNG +EE +N K S+ +YPDEEEAAFLRSLGW+E + ++EGLTE Sbjct: 504 NGTELTSNGTHAHEEVHRLSDNEEKESNPSVTIYPDEEEAAFLRSLGWEENSDEDEGLTE 563 Query: 2726 EEINAFYEEYMKLRPSA-KLCQGKMP 2800 EEINAFY+E KL P+ KLCQGK P Sbjct: 564 EEINAFYQECKKLDPTTFKLCQGKQP 589 >ref|XP_003542703.1| PREDICTED: uncharacterized protein LOC100800475 [Glycine max] Length = 621 Score = 421 bits (1081), Expect = e-115 Identities = 287/626 (45%), Positives = 364/626 (58%), Gaps = 19/626 (3%) Frame = +2 Query: 980 MERSEPTLVPQWLKGTPNVTGAAXXXXXXXXXXXXXDEPGVTLPTRNRSSLGIADHDSSS 1159 MERSEP LVP+WL+ +V GA D V +RNRSS +D DS+ Sbjct: 1 MERSEPALVPEWLRSAGSVAGAGSSAQQFASSSGHTDSLSVAHHSRNRSSKNGSDFDSAR 60 Query: 1160 YSFASLDRFRKS----SSSNGSIGH--DKDNTSHLXXXXXXXXXXXXXXWDKDTLGFREK 1321 F L+R S SS NGS H N SH DKD REK Sbjct: 61 SVF--LERTSSSNSRRSSINGSAKHAYSSFNRSHR---------------DKDR--DREK 101 Query: 1322 DRSIPGDKKSRAYSD--ADVFTSRIEKDMFKRSQSMISGKRDEVRPRRVVADVXXXXXXX 1495 DRS GD SD A++F R+E+D +RS SM+S K+ EV PRRV D Sbjct: 102 DRSSFGDHWDCDGSDPLANLFPGRMERDTLRRSHSMVSRKQSEVIPRRVAVDTKSGGSHQ 161 Query: 1496 XXXXXXXXXKVIS--VHKATFERDFPSLGVEEKQ---VVERVGSPGLNIA-AQNLPMGTS 1657 +S + KA F++DFPSL EEKQ V RV SPGL A +Q+LP+G+S Sbjct: 162 NNSNGILSGSNVSSSIQKAVFDKDFPSLSTEEKQGIAEVVRVSSPGLGAAVSQSLPVGSS 221 Query: 1658 AMIMGNGWTSALAEVPVISGKNSLGLSSVAQTAASSSGSLAPSTTTGLNMAETLAQAPSR 1837 A+I G GWTSALAEVP I G +S G SV QT ++SGS+APSTT GLNMAE LAQ PSR Sbjct: 222 ALIGGEGWTSALAEVPAIIGSSSTGSLSVQQTVNTTSGSVAPSTTAGLNMAEALAQTPSR 281 Query: 1838 ARTTPQLSVETQRLEELAIKQSRQLIPMTPSMTKSSVLNSEKQKPKASAR-SEMSLASKI 2014 AR+ PQ+ V+TQRLEELAIKQSRQLIP+TPSM K+SV NSEK KPK + R ++M++ +K Sbjct: 282 ARSAPQVLVKTQRLEELAIKQSRQLIPVTPSMPKASVHNSEKSKPKTAIRNADMNVVTKT 341 Query: 2015 AQQQLTSTQPVN-SLRGGTARSDVPKTSHGGKLLVLKA-GRENGTSHPTKDSSSPTNASR 2188 QQ ++ + S+R A+ D PKTS GK LK+ ENG S +KD S+PTN S Sbjct: 342 VPQQPSALHIASQSVRSVNAKVDTPKTS--GKFTDLKSVVWENGASPTSKDVSNPTNYSN 399 Query: 2189 IASNPLAVVAPTVGLAPMKSQNSPKVAVAAERKATVLPANNTSSIERRPINSQAQSRNDF 2368 VA AP+++ N+ K ERK + + S++E++ SQ QSRNDF Sbjct: 400 SKPGNQHAVASGAASAPLRNPNNLK--SPTERKPSSMDLKLGSNLEKKHSISQVQSRNDF 457 Query: 2369 FNLMRKKTVTNNSSAAPDPGPVPSPNILEKSGELMTEVTNDVSIQGNDITASDHSSLNAD 2548 FNL++KKT+ N S+ PD GP+ S +EKSGE+ E+ N +AS S N Sbjct: 458 FNLIKKKTLMNCSAVLPDSGPMVSSPAMEKSGEVNREIVNP--------SASPQSLGN-- 507 Query: 2549 HGAVVASNG-DVYEESQSSINNGVKHSSSDAILYPDEEEAAFLRSLGWDEGAGDEEGLTE 2725 G + SNG +E +N K S+ +YP+EEEAAFLRSLGW+E + ++EGLTE Sbjct: 508 -GTELTSNGTHAHEVIHRISDNEEKESNPSVTIYPEEEEAAFLRSLGWEENSDEDEGLTE 566 Query: 2726 EEINAFYEEYMKLRPSA-KLCQGKMP 2800 EEINAFY+E KL P+A KL QG P Sbjct: 567 EEINAFYQECKKLDPTAFKLSQGMQP 592 >ref|XP_002513834.1| conserved hypothetical protein [Ricinus communis] gi|223546920|gb|EEF48417.1| conserved hypothetical protein [Ricinus communis] Length = 596 Score = 417 bits (1071), Expect = e-113 Identities = 285/619 (46%), Positives = 358/619 (57%), Gaps = 15/619 (2%) Frame = +2 Query: 980 MERSEPTLVPQWLKGTPNVTGAAXXXXXXXXXXXXXDEPGVTLPTRNRSSLGIADHDSSS 1159 MERSEPTLVP+WL+ + +V G D +R+R+S +D DS Sbjct: 1 MERSEPTLVPEWLRSSGSVPGGGSSAHHFASSSPHSDVSSSVHHSRSRNSKSTSDFDSPR 60 Query: 1160 YSFASLDRFRKS----SSSNGSIGHDKDNTSHLXXXXXXXXXXXXXXWDKDTLGFREKDR 1327 +F LDR S SSSNGS H + S DKD R+K+R Sbjct: 61 SAF--LDRTSSSNSRRSSSNGSAKHAYSSFSRSHR-------------DKDRE--RDKER 103 Query: 1328 SIPGDKKSRAYSDA-DVFTSRIEKDMFKRSQSMISGKRDEVRPRRVVADVXXXXXXXXXX 1504 G+ SD SR EKD +RS SM+S K EV PRR AD+ Sbjct: 104 LNFGNHWDNDASDPLGSILSRNEKDALRRSHSMVSRKLGEVLPRRFAADLRNGSNSNHVN 163 Query: 1505 XXXXXXKV---ISVHKATFERDFPSLGVEEKQV---VERVGSPGLNIAAQNLPMGTSAMI 1666 S+ KA FE+DFPSLG EE+Q + RV SPGL+ A Q+LP+ +SA+I Sbjct: 164 GNGLISGGGVGNSIPKAVFEKDFPSLGSEERQGAPDIGRVSSPGLSTAVQSLPVSSSALI 223 Query: 1667 MGNGWTSALAEVPVISGKNSLGLSSVAQTAASSSGSLAPSTTTGLNMAETLAQAPSRART 1846 G GWTSALAEVP I G NS G SS QT A+S+ APST GLNMAE L QAP+R RT Sbjct: 224 GGEGWTSALAEVPAIIGNNSSGSSSSVQTVATSASG-APSTVAGLNMAEALTQAPTRTRT 282 Query: 1847 TPQLSVETQRLEELAIKQSRQLIPMTPSMTKSSVLN-SEKQKPKASAR-SEMSLASKIAQ 2020 PQLSV+TQRLEELAIKQSRQLIP+TPSM KSSVLN S+K KPK R SEM++A K Q Sbjct: 283 APQLSVQTQRLEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSSEMNMAPKNLQ 342 Query: 2021 QQLTSTQPV-NSLRGGTARSDVPKTSHGGKLLVLKAGRENGTSHPTKDSSSPTNASRIAS 2197 QQ +S V SL GG +SD K SH GKL VLK G ENG S KD ++P NA R A+ Sbjct: 343 QQPSSLHAVTQSLAGGHVKSDASKASH-GKLFVLKPGWENGASPSPKDIANPNNAGRAAN 401 Query: 2198 NPLAVVAPTVGLAPMKSQNSPKVAVAAERKATVLPANNTSSIERRPINSQAQSRNDFFNL 2377 + LA AP+V AP++S N+PK++ A ERK+ L + ++E+RP+ SQ QSR+DFFNL Sbjct: 402 SQLA-AAPSVPSAPLRSPNNPKLS-AGERKSASLNLISGFNVEKRPLLSQTQSRHDFFNL 459 Query: 2378 MRKKTVTNNSSAAPDPGPVPSPNILEKSGELMTEVTNDVSIQGNDITASDHSSLNA-DHG 2554 ++KKT+ N+S+A D S ++ TN+ + + N AS S A +G Sbjct: 460 LKKKTLKNSSTALTD------------SASAISSPTNEKACEINKEAASAPSCPQAIKNG 507 Query: 2555 AVVASNGDVYEESQSSINNGVKHSSSDAILYPDEEEAAFLRSLGWDEGAGDEEGLTEEEI 2734 + + NG EE EEEAAFLRSLGW+E +G++EGLTEEEI Sbjct: 508 SELTGNGGTCEE-------------------VSEEEAAFLRSLGWEENSGEDEGLTEEEI 548 Query: 2735 NAFYEEYMKLRPSAKLCQG 2791 NAF +E MKL+PS K+C+G Sbjct: 549 NAFIQECMKLKPSLKVCRG 567 >ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago truncatula] gi|355516191|gb|AES97814.1| hypothetical protein MTR_5g060420 [Medicago truncatula] Length = 685 Score = 401 bits (1030), Expect = e-109 Identities = 286/638 (44%), Positives = 367/638 (57%), Gaps = 28/638 (4%) Frame = +2 Query: 980 MERSEPTLVPQWLKGTPNVTGAAXXXXXXXXXXXXXDE--PGVTLPTRNRSSLGIADHDS 1153 M+RSEP+LVP+WL+ +V GA D P RNRSS D DS Sbjct: 1 MDRSEPSLVPEWLRSAGSVVGAGNSAQHFASSSSHADSHSPSAANNNRNRSSKNTGDFDS 60 Query: 1154 SSYSFASLDRFRKSSSSNGSIG------HDKDNTSHLXXXXXXXXXXXXXXWDKDTLGFR 1315 S F LDR +SS GSI + N +H DKD R Sbjct: 61 SRSVF--LDRTSSASSRRGSINGSAKHAYSSFNRNHR---------------DKDR--DR 101 Query: 1316 EKDRSIPGDKKSRAYSD--ADVFTSRIEKDMFKRSQSMISGKRDEVRPRRVVADVXXXXX 1489 EKDRS GD R SD ++F+ RIE+D +RS SM+S K+ E PRRV AD Sbjct: 102 EKDRSNFGDHWDRDGSDPLVNLFSGRIERDTLRRSHSMVSRKQGETLPRRVAADTKSGGS 161 Query: 1490 XXXXXXXXXXXKVI---SVHKATFERDFPSLGVEEKQVVERVG---SPGLNI-AAQNLPM 1648 S+ KA F++DFPSLG +EKQ + +G SPGL A+Q+LP+ Sbjct: 162 SNHNNGNGALSVGSVGSSIQKAVFDKDFPSLGADEKQGIAEIGRVSSPGLGATASQSLPV 221 Query: 1649 GTSAMIMGNGWTSALAEVPVISGKNSLGLSSVAQTAASSSGSLAPSTTTGLNMAETLAQA 1828 G+SA+I G GWTSALAEVP + G +S G SS QT A++S S++ ST GLNMAE LAQA Sbjct: 222 GSSALIGGEGWTSALAEVPSVIGSSSAGSSSAQQTIAATSVSVSSSTAAGLNMAEALAQA 281 Query: 1829 PSRARTTPQLSVETQRLEELAIKQSRQLIPMTPSMTKSSVLN-SEKQKPKASAR-SEMSL 2002 PSRAR+TPQ+SV+TQRLEELAIKQSRQLIP+TPSM K+ LN SEK KPK + R +EM++ Sbjct: 282 PSRARSTPQVSVKTQRLEELAIKQSRQLIPVTPSMPKALALNSSEKSKPKTAVRNAEMNV 341 Query: 2003 ASKIAQQQLTSTQPVN-SLRGGTARSDVPKTSHGGKLLVLKA-GRENGTSHPTKDSSSPT 2176 A+K A QQ ++ + S+R A+ DVPKTS GK LK+ ENG S +KD S+PT Sbjct: 342 ATKSALQQPSALHIASQSVRIVNAKVDVPKTS--GKFTDLKSVVWENGASPTSKDVSNPT 399 Query: 2177 NASRIASNPLAVVAPTVGLAPMKSQ---NSPKVAVAAERKATVLPANNTSSIERRPINSQ 2347 N + S VA P+++ NSP+ ERK L S+++++ SQ Sbjct: 400 NYANSKSANQHCVASAAAPTPVRNPSNLNSPR-----ERKPASLDLKLGSALDKKQSISQ 454 Query: 2348 AQSRNDFFNLMRKKTVTNNSSAAPDPGPVPSPNILEKSGELMTEVTNDVSIQGNDITASD 2527 +SRNDFFNL++ KT TN+S+ PD G + S LEKSGE+ E +AS Sbjct: 455 VKSRNDFFNLLKNKTATNSSTVFPDSGQMVSSPTLEKSGEVNRESVMP--------SASP 506 Query: 2528 HSSLNADHGAVVASNGDVYEESQSSIN--NGVKHSSSDAILYPDEEEAAFLRSLGWDEGA 2701 S NA A SNG+ + + ++ + +S A +YPDEEEAAFLRSLGW+E + Sbjct: 507 QSVGNA---AEPTSNGNAHAHAHEVLSRISDDDEKNSRATVYPDEEEAAFLRSLGWEENS 563 Query: 2702 GDEEGLTEEEINAFYEEY-MKLRPSA-KLCQGKMPLQL 2809 ++EGLTEEEINAFY+E KL PSA KLC M QL Sbjct: 564 DEDEGLTEEEINAFYQEVCKKLDPSALKLCIEGMQPQL 601