BLASTX nr result
ID: Coptis21_contig00024415
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00024415
         (1830 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI29872.3| unnamed protein product [Vitis vinifera]              543   e-152
ref|XP_002277702.2| PREDICTED: uncharacterized protein LOC100241...   541   e-151
ref|XP_003553419.1| PREDICTED: uncharacterized protein LOC100800...   520   e-145
ref|XP_002526720.1| conserved hypothetical protein [Ricinus comm...   513   e-143
ref|NP_001190119.1| armadillo/beta-catenin-like repeat-containin...   455   e-125

>emb|CBI29872.3| unnamed protein product [Vitis vinifera]
          Length = 1112

 Score = 543 bits (1398), Expect = e-152
 Identities = 289/540 (53%), Positives = 380/540 (70%), Gaps = 2/540 (0%)
 Frame = +3

Query: 18   FCSIDRVRIAAGDTLVAVLKCHNQNPDAIIMLIDCLSKLCQGPENPNPAGREVKGSVSDS 197
            + S +RV+ +A D + A+LK HNQN + + ML+D LS L Q P +G +GS D+
Sbjct: 581  YSSNERVQSSASDAMTALLKNHNQNYEVLSMLLDSLSNLSQSLGLPKTSGDIEEGSKLDT 640

Query: 198  DRLLRLIPQWSKTVQNWNIYIEPLVDKMFAEPSNAVIVRFLSYISDNLAEAQDVVLHHVL 377
            +++L LIP+WS++VQ+WN+ I PL+DKMFAEPSNA +VRFLSYIS++LAEA D+V H +L
Sbjct: 641  EKVLGLIPEWSESVQDWNLLIGPLIDKMFAEPSNATLVRFLSYISEHLAEAADIVFHRIL 700

Query: 378  LYMQGQKQGNEMLLSGGASGTFTCVEFDKSKDCLFDRLCPLLIIKLLPFRVFNDLNSTVM 557
            L+M+GQK+ +E + S T+ + K + LFDRLCPLL+I+LLP RVFNDLNS+V+
Sbjct: 701  LHMKGQKELDESFFTKWESKTYAADDSMKLQHSLFDRLCPLLVIRLLPMRVFNDLNSSVI 760

Query: 558  YGQLIVQGSLHDEG--DNAGSNCISSSLVTRAFNKFEFEDVRKLAAELCGRIHPKVLLPI 731
            YGQL Q +H G D C++ L+ RA KFEFEDVRKLAAELCGRIHP+VLLPI
Sbjct: 761  YGQLPDQVVVHGYGSIDINDHECVAMLLLNRALGKFEFEDVRKLAAELCGRIHPQVLLPI 820

Query: 732  IRSQLEHAAYSRDMLKMKACLFSICTSLVSRGSYSACHPDMLEIRKIIEMVLLWPSLDSD 911
            + S LE AA S+D++K+KACLFS+CTSLV+RG S P ML+I+K I+ +LLWPSLD D
Sbjct: 821  LSSHLELAADSQDIVKIKACLFSVCTSLVARGRDSLSQPAMLKIQKTIKTILLWPSLDGD 880

Query: 912  EVSKAQHGCIDCLALMVCTELQAPDLSEDCSRNNFSISREKRANGCAFSRNSVLRYVIEN 1091
            EVSKAQHGCIDCLALM+CTELQAP + SI + G + +SV+ YVI
Sbjct: 881  EVSKAQHGCIDCLALMICTELQAPKSFIGSVSDKISIIGKNFHPGDSALGDSVVTYVIHQ 940

Query: 1092 LTRNRNESNSSANFAAQSHIMREFSDVGCESKGSIPYSFHLCMANVLISACQKISNPGKK 1271
            L+ + E+ S++ + + C S+ S+P SF LCMANVLISACQKIS+ GKK
Sbjct: 941  LSLDAVEAASTSMLCSDN----------CASEPSVPLSFRLCMANVLISACQKISDSGKK 990

Query: 1272 PLAERILPVLIHSIEGMKDSEIRAACLQVLFSAVFYLKSAILPHXXXXXXXXXXXXXXXX 1451
            A RILP LIH ++ +KDSEIR AC+QVLFSAV++LKS ILP+
Sbjct: 991  AFARRILPYLIHFVQVIKDSEIRVACVQVLFSAVYHLKSMILPYSSELLKLSLKSLEGNS 1050

Query: 1452 HKEKMASVKLMASIMASEDVVIGSISGGLLEARSVLSNIASTDPSMELREVCKKLLVCIT 1631
            KE+MA VKLMAS+MASED ++ +IS GLLEAR VL ++ DPS+E++++C+KLL C+T
Sbjct: 1051 EKERMAGVKLMASLMASEDAIVENISEGLLEARLVLLSMYMADPSLEVQQMCQKLLACLT 1110

>ref|XP_002277702.2| PREDICTED: uncharacterized protein LOC100241927 [Vitis vinifera]
          Length = 1106

 Score = 541 bits (1395), Expect = e-151
 Identities = 291/542 (53%), Positives = 382/542 (70%), Gaps = 4/542 (0%)
 Frame = +3

Query: 18   FCSIDRVRIAAGDTLVAVLKCHNQNPDAIIMLIDCLSKLCQGPENPNPAGREVKGSVSDS 197
            + S +RV+ +A D + A+LK HNQN + + ML+D LS L Q P +G +GS D+
Sbjct: 581  YSSNERVQSSASDAMTALLKNHNQNYEVLSMLLDSLSNLSQSLGLPKTSGDIEEGSKLDT 640

Query: 198  DRLLRLIPQWSKTVQNWNIYIEPLVDKMFAEPSNAVIVRFLSYISDNLAEAQDVVLHHVL 377
            +++L LIP+WS++VQ+WN+ I PL+DKMFAEPSNA +VRFLSYIS++LAEA D+V H +L
Sbjct: 641  EKVLGLIPEWSESVQDWNLLIGPLIDKMFAEPSNATLVRFLSYISEHLAEAADIVFHRIL 700

Query: 378  LYMQGQKQGNEMLLSGGASGTFTCVEFDKSKDCLFDRLCPLLIIKLLPFRVFNDLNSTVM 557
            L+M+GQK+ +E + S T+ + K + LFDRLCPLL+I+LLP RVFNDLNS+V+
Sbjct: 701  LHMKGQKELDESFFTKWESKTYAADDSMKLQHSLFDRLCPLLVIRLLPMRVFNDLNSSVI 760

Query: 558  YGQLIVQGSLHDEG--DNAGSNCISSSLVTRAFNKFEFEDVRKLAAELCGRIHPKVLLPI 731
            YGQL Q +H G D C++ L+ RA KFEFEDVRKLAAELCGRIHP+VLLPI
Sbjct: 761  YGQLPDQVVVHGYGSIDINDHECVAMLLLNRALGKFEFEDVRKLAAELCGRIHPQVLLPI 820

Query: 732  IRSQLEHAAYSRDMLKMKACLFSICTSLVSRGSYSACHPDMLEIRKIIEMVLLWPSLDSD 911
            + S LE AA S+D++K+KACLFS+CTSLV+RG S P ML+I+K I+ +LLWPSLD D
Sbjct: 821  LSSHLELAADSQDIVKIKACLFSVCTSLVARGRDSLSQPAMLKIQKTIKTILLWPSLDGD 880

Query: 912  EVSKAQHGCIDCLALMVCTELQAPDLSEDCSRNNF--SISREKRANGCAFSRNSVLRYVI 1085
            EVSKAQHGCIDCLALM+CTELQAP +F S+S + G F +SV+ YVI
Sbjct: 881  EVSKAQHGCIDCLALMICTELQAP--------KSFIGSVSDKISIIGKNFHPDSVVTYVI 932

Query: 1086 ENLTRNRNESNSSANFAAQSHIMREFSDVGCESKGSIPYSFHLCMANVLISACQKISNPG 1265
            L+ + E+ S++ + + C S+ S+P SF LCMANVLISACQKIS+ G
Sbjct: 933  HQLSLDAVEAASTSMLCSDN----------CASEPSVPLSFRLCMANVLISACQKISDSG 982

Query: 1266 KKPLAERILPVLIHSIEGMKDSEIRAACLQVLFSAVFYLKSAILPHXXXXXXXXXXXXXX 1445
            KK A RILP LIH ++ +KDSEIR AC+QVLFSAV++LKS ILP+
Sbjct: 983  KKAFARRILPYLIHFVQVIKDSEIRVACVQVLFSAVYHLKSMILPYSSELLKLSLKSLEG 1042

Query: 1446 XXHKEKMASVKLMASIMASEDVVIGSISGGLLEARSVLSNIASTDPSMELREVCKKLLVC 1625
            KE+MA VKLMAS+MASED ++ +IS GLLEAR VL ++ DPS+E++++C+KLL C
Sbjct: 1043 NSEKERMAGVKLMASLMASEDAIVENISEGLLEARLVLLSMYMADPSLEVQQMCQKLLAC 1102

Query: 1626 IT 1631
            +T
Sbjct: 1103 LT 1104

>ref|XP_003553419.1| PREDICTED: uncharacterized protein LOC100800773 [Glycine max]
          Length = 1097

 Score = 520 bits (1338), Expect = e-145
 Identities = 284/541 (52%), Positives = 370/541 (68%), Gaps = 3/541 (0%)
 Frame = +3

Query: 24   SIDRVRIAAGDTLVAVLKCHNQNPDAIIMLIDCLSKLCQGPENPNPAGREVKGSVSDSDR 203
            S D + +A D ++ VLK HNQ + I +L+DCLS + + + G KGS D+D+
Sbjct: 578  SPDESQSSASDAIIGVLKHHNQRIEIIFLLLDCLSNMSKSLDLTQSTGD--KGSKLDADQ 635

Query: 204  LLRLIPQWSKTVQNWNIYIEPLVDKMFAEPSNAVIVRFLSYISDNLAEAQDVVLHHVLLY 383
            +L+L+P WSK+VQ+WN+ I PLVDKMF +PSNA IV+FLSYIS+NLA D+VLHHVLL+
Sbjct: 636  VLKLVPVWSKSVQDWNLLIGPLVDKMFGDPSNATIVKFLSYISENLANVADLVLHHVLLH 695

Query: 384  MQGQKQGNEMLLSGGASGTFTCVEFDKSKDCLFDRLCPLLIIKLLPFRVFNDLNSTVMYG 563
            ++ QK+ +E LS T+TC EF++ + LF+ LCPLLIIK+LP + FNDLNS++MYG
Sbjct: 696  VKEQKKIDESFLSRWEQRTYTCDEFEEMQQSLFEHLCPLLIIKILPLKTFNDLNSSIMYG 755

Query: 564  QLIVQGSLHDEGD---NAGSNCISSSLVTRAFNKFEFEDVRKLAAELCGRIHPKVLLPII 734
            L Q + D G + +CI++ L+ RAF +FEFE+VRKL+AELCGRIHP+VLLP +
Sbjct: 756  HLS-QNIIQDAGSRDTDIDYDCIAAFLLNRAFCEFEFEEVRKLSAELCGRIHPQVLLPFV 814

Query: 735  RSQLEHAAYSRDMLKMKACLFSICTSLVSRGSYSACHPDMLEIRKIIEMVLLWPSLDSDE 914
            S LE A S+++LK+KACLFSICTSL+ RG S HP M IRK+IE VLLWP L++D
Sbjct: 815  CSLLERAVDSKNVLKIKACLFSICTSLMVRGWESLSHPSMYSIRKMIETVLLWPCLNADS 874

Query: 915  VSKAQHGCIDCLALMVCTELQAPDLSEDCSRNNFSISREKRANGCAFSRNSVLRYVIENL 1094
            VSKAQHGCIDCLALM+C ELQA + S NN SI RA G NSV+ YVI
Sbjct: 875  VSKAQHGCIDCLALMICAELQAKE-----SINN-SIPDTVRALG--KKGNSVVTYVINQF 926

Query: 1095 TRNRNESNSSANFAAQSHIMREFSDVGCESKGSIPYSFHLCMANVLISACQKISNPGKKP 1274
            N+NE S+ EF D E ++ SF LCM NVLIS CQKIS KKP
Sbjct: 927  FNNKNEQTSTP----------EFGDENSEFVAAVSLSFCLCMGNVLISTCQKISESCKKP 976

Query: 1275 LAERILPVLIHSIEGMKDSEIRAACLQVLFSAVFYLKSAILPHXXXXXXXXXXXXXXXXH 1454
            A +++P L+HS+E SEIRAAC QVLFSAV++L+SA+LP+
Sbjct: 977  FAAQVIPFLLHSLEFETKSEIRAACTQVLFSAVYHLRSAVLPYASDLLRMALKALRKESD 1036

Query: 1455 KEKMASVKLMASIMASEDVVIGSISGGLLEARSVLSNIASTDPSMELREVCKKLLVCITF 1634
            KE+MA KL+AS+MASED+++ +IS GLL+ARSVLS I+S+DPS EL+++C KLL CI+
Sbjct: 1037 KERMAGAKLIASLMASEDMILENISVGLLQARSVLSTISSSDPSPELQQLCCKLLACISS 1096

Query: 1635 P 1637
            P
Sbjct: 1097 P 1097

>ref|XP_002526720.1| conserved hypothetical protein [Ricinus communis]
 gi|223533909|gb|EEF35634.1| conserved hypothetical protein [Ricinus communis]
          Length = 1054

 Score = 513 bits (1322), Expect = e-143
 Identities = 290/536 (54%), Positives = 364/536 (67%), Gaps = 13/536 (2%)
 Frame = +3

Query: 63   VAVLKCHNQNPDAIIMLIDCLSKL-----------CQGPENPNPAGREVKGSVSDSDRLL 209
            + +LK HNQ P+ I +L+DCLS + C+ N AG +V D DR+L
Sbjct: 535  IGMLKHHNQQPEVICLLLDCLSDISVPLWKNVCFACELVLLFNIAGPKV-----DIDRVL 589

Query: 210  RLIPQWSKTVQNWNIYIEPLVDKMFAEPSNAVIVRFLSYISDNLAEAQDVVLHHVLLYMQ 389
            +L+P+W K VQNWN I L+DKMFAEP+NA+IV+FLSYIS+ LAEA DVVL++VL M+
Sbjct: 590  KLMPEWCKNVQNWNSMIILLLDKMFAEPANAIIVKFLSYISERLAEAADVVLYYVLSQMK 649

Query: 390  GQKQGNEMLLSGGASGTFTCVEFDKSKDCLFDRLCPLLIIKLLPFRVFNDLNSTVMYGQL 569
            QK NE LLS S + + K + LF+RLCPLLII+LLP RVFNDL S+ MYGQL
Sbjct: 650  PQKGINEGLLSTWKSRSCNNEDLMKMQQTLFERLCPLLIIRLLPLRVFNDLESSTMYGQL 709

Query: 570  IVQGSLHDEGD-NAGSNCISSSLVTRAFNKFEFEDVRKLAAELCGRIHPKVLLPIIRSQL 746
            Q + GD N +CI++ L+ RAFNK+EFEDVRKLAAELCGR+HP+VL P++ + L
Sbjct: 710  PSQVITQECGDVNIADDCIAAFLLQRAFNKYEFEDVRKLAAELCGRLHPQVLFPVVLTIL 769

Query: 747  EHAAYSRDMLKMKACLFSICTSLVSRGSYSACHPDMLEIRKIIEMVLLWPSLDSDEVSKA 926
            E+AA D+LK+KACLF+ICTSLV +G S HP + +IRK IE VLLWPSLD DEVSKA
Sbjct: 770  ENAANFHDILKIKACLFAICTSLVVKGKDSVYHPVIFQIRKTIEAVLLWPSLDGDEVSKA 829

Query: 927  QHGCIDCLALMVCTELQAPDLSEDCSRNNFSISREKRANGCAFSRNSVLRYVIENLTRNR 1106
            QHGCIDCLALM+C ELQA + +D S N F I+ + +G + + NS L YVI L ++
Sbjct: 830  QHGCIDCLALMICAELQATESLKD-SSNKFRIAGKIIDSGKSTAGNSALAYVIHQLANDK 888

Query: 1107 NE-SNSSANFAAQSHIMREFSDVGCESKGSIPYSFHLCMANVLISACQKISNPGKKPLAE 1283
            NE S SS N CE + +IP S LCMAN LISACQKIS+ GKK A
Sbjct: 889  NEVSVSSLNIE------------NCEFEATIPCSLRLCMANALISACQKISDSGKKSFAR 936

Query: 1284 RILPVLIHSIEGMKDSEIRAACLQVLFSAVFYLKSAILPHXXXXXXXXXXXXXXXXHKEK 1463
            R LP LIHS+E + EIRAAC+QV+FSAV++LKSA++P+ KE+
Sbjct: 937  RSLPNLIHSVEMISHPEIRAACIQVMFSAVYHLKSAVVPYSADLLKLSLKFLRKGSDKER 996

Query: 1464 MASVKLMASIMASEDVVIGSISGGLLEARSVLSNIASTDPSMELREVCKKLLVCIT 1631
            MA KLMAS+MASED ++ SIS GLLEAR VLS I+S+DPS +L+ VCK LL CIT
Sbjct: 997  MAGAKLMASLMASEDDILESISEGLLEARIVLSAISSSDPSPDLQVVCKNLLACIT 1052

>ref|NP_001190119.1| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana]
 gi|332646153|gb|AEE79674.1| armadillo/beta-catenin-like repeat-containing protein [Arabidopsis thaliana]
          Length = 1096

 Score = 455 bits (1170), Expect = e-125
 Identities = 256/535 (47%), Positives = 350/535 (65%), Gaps = 2/535 (0%)
 Frame = +3

Query: 33   RVRIAAGDTLVAVLKCHNQNPDAIIMLIDCLSKLCQGPENPNPAGREVKGSVSDSDRLLR 212
            +V+ +A +TL+ VLK H ++ D I ML+ LS + Q + G +G DSDR+L+
Sbjct: 580  KVQSSATETLLGVLKHHKEDFDVICMLLTSLSNI-QALDTAESNGHSTEGLTFDSDRVLK 638

Query: 213  LIPQWSKTVQNWNIYIEPLVDKMFAEPSNAVIVRFLSYISDNLAEAQDVVLHHVLLYMQG 392
            LIP+W+++VQNWN I PL+DKMF EPSNA++VRFLS IS++LA+ D+VL HVL +M+
Sbjct: 639  LIPEWARSVQNWNSLIGPLLDKMFLEPSNAIMVRFLSCISESLADTSDLVLPHVLSHMKK 698

Query: 393  QKQGNEMLLSGGASGTFTCVEFDKSKDCLFDRLCPLLIIKLLPFRVFNDLNSTVMYGQLI 572
            Q + + +S S T + V+ KS+ LFD LCPLLI++LLP RVF+D++S+ +YG+ +
Sbjct: 699  QNKVDASFIS--RSDTKSSVDKTKSEKSLFDHLCPLLILRLLPQRVFDDIDSSTIYGKFL 756

Query: 573  VQGSLHDEGDNAGSNC--ISSSLVTRAFNKFEFEDVRKLAAELCGRIHPKVLLPIIRSQL 746
            S++D D +C I++ ++ RAF+KFEFE+VRKL+AELCGR+HP+VL P + QL
Sbjct: 757  SGDSVNDYQDIKFEDCQCIATFILERAFSKFEFEEVRKLSAELCGRLHPQVLFPTVLLQL 816

Query: 747  EHAAYSRDMLKMKACLFSICTSLVSRGSYSACHPDMLEIRKIIEMVLLWPSLDSDEVSKA 926
            E A +D LK+KACLFSICTSL+ RG S H +IRK++E +LLWPS++ DE+SK
Sbjct: 817  EKATEIQDSLKIKACLFSICTSLMVRGWESLSHRVTPKIRKVLENILLWPSVE-DEISKV 875

Query: 927  QHGCIDCLALMVCTELQAPDLSEDCSRNNFSISREKRANGCAFSRNSVLRYVIENLTRNR 1106
            QHGCIDCLALM+C ELQ + S + R+ G S SVL Y I L +R
Sbjct: 876  QHGCIDCLALMICAELQ------HLKSSKTSGGEKIRSTGKDTSGYSVLDYTIHCLIEDR 929

Query: 1107 NESNSSANFAAQSHIMREFSDVGCESKGSIPYSFHLCMANVLISACQKISNPGKKPLAER 1286
            + +S + + CE+ +P F LCMANV+ISACQK KK A +
Sbjct: 930  SNCSSIPKLSTDI--------LTCEN--PLPIPFRLCMANVIISACQKNPESSKKTFARK 979

Query: 1287 ILPVLIHSIEGMKDSEIRAACLQVLFSAVFYLKSAILPHXXXXXXXXXXXXXXXXHKEKM 1466
            LP LIHS++ + E+RAAC+QVLFSA ++LKS +LP KEK+
Sbjct: 980  ALPPLIHSLKVISVPEVRAACIQVLFSATYHLKSTLLPVSSDLLKLSLRFLEQGSEKEKL 1039

Query: 1467 ASVKLMASIMASEDVVIGSISGGLLEARSVLSNIASTDPSMELREVCKKLLVCIT 1631
            A KLMAS+MASEDV++ +IS GLLEARSVLS + +DPS ++REVC KLL CIT
Sbjct: 1040 AGAKLMASLMASEDVILENISEGLLEARSVLSKASLSDPSRDVREVCAKLLACIT 1094