BLASTX nr result
ID: Coptis21_contig00014857
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00014857 (1799 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002275536.2| PREDICTED: uncharacterized protein LOC100245... 378 e-102 ref|XP_002518393.1| vacuolar protein sorting-associated protein,... 364 5e-98 ref|XP_003541522.1| PREDICTED: uncharacterized protein LOC100783... 299 2e-78 ref|XP_002874219.1| hypothetical protein ARALYDRAFT_910516 [Arab... 293 8e-77 ref|NP_568451.7| uncharacterized protein [Arabidopsis thaliana] ... 291 3e-76 >ref|XP_002275536.2| PREDICTED: uncharacterized protein LOC100245550 [Vitis vinifera] Length = 4054 Score = 378 bits (970), Expect = e-102 Identities = 236/640 (36%), Positives = 353/640 (55%), Gaps = 41/640 (6%) Frame = -2 Query: 1798 SHILNIRLRKKTGGALTRTFEISFSVQHVSCVLPSEFLAILIGYFCLPDWTPHGHDTCVT 1619 S ILNIR+ K ++ E+S S+QHV C+LP E+LAI+IGYF LPDW + + V Sbjct: 1769 SPILNIRMTKGNAESIGSHSELSISIQHVCCILPPEYLAIVIGYFSLPDWGLNANKQPVF 1828 Query: 1618 ENCE----ERDTDNYVILWKIEILESTLILPVECSRGQSLHLGLKQLYISFTTVKHREDA 1451 + E ++D L+K+EI++STLILPV+ + Q L+L ++QLY SF + Sbjct: 1829 GKHKHINREPESD---FLFKLEIVDSTLILPVKSNGSQFLNLDIQQLYCSFMDKSCSGEV 1885 Query: 1450 LKDIPIDCRVSEKMIVDNVHLLNIFGRDLCMSLVLLEDKENISAKLDKNTLNGNITLISP 1271 L+DIP +C V + D LN+FGRDL +SL+L +D + +++ GNIT I+P Sbjct: 1886 LRDIPPECLVQAHEVADKSCSLNVFGRDLSLSLLLFKDDAHDLLMFGQDSAPGNITFIAP 1945 Query: 1270 LDLDIWIRIPCENKAFVGLS-TPTCIMVEIKICQINAEDDFFLFGVQSVLNVIDELSAVG 1094 L +D+W+RIP E++ G S P C+MV + CQ+ AED + G +++++VI + S++ Sbjct: 1946 LSVDVWVRIPWESETLNGCSPAPMCVMVRVCNCQLIAEDGYIFSGFEALIDVIFQFSSID 2005 Query: 1093 
TLSEGFKSDIFQFLQFKKILKEGSFVLPDASCVSFREVRCRAKSLSIRLCRSRPGHFVPP 914 S+ F SD+ QFL K+ L+E V AS + F E RC SLSI+ C + + Sbjct: 2006 EESKCFTSDVLQFLHSKRSLRESRAVPSKASNMMFTEARCFVNSLSIKFCCLKDPS-ISF 2064 Query: 913 ELVAKVDMEVELSASFRNDVPLCLDIECSSLMLYSFHSRVILVQCKSDGSVSSGAGIHLE 734 E VAK DM+ SAS RN++PL DI SSL LYS + ++LV C S SS +H Sbjct: 2065 EPVAKADMQFVFSASLRNEIPLRWDICFSSLSLYSLPNCLMLVHCISASPNSSVLDMHFS 2124 Query: 733 KSNRVENKLLVNLPFVEIWLHLSDWSKVIELLVSYLEQLSQISFMSASSKNSDSSLMAAN 554 + ++ EN+L L + IWLHL W++VI+L Y QL++ S +SS S + Sbjct: 2125 RLDQGENELDFALASLNIWLHLFKWAEVIDLFNYYAGQLAEPSMQDSSSDVIASGPLDPL 2184 Query: 553 IEDCSS-----------------------------------LIVKSEIIGASVHYPLLVI 479 IED + L +KS+ I + H P+ V Sbjct: 2185 IEDKAPLDRRKNVAVSVSKYSVPSLSMSSYFVSQTMKQNAILNMKSDNIAITFHIPVWVS 2244 Query: 478 ADAFSESREVENKQEKPWDYSSNILGEEPVLKGRHHQYITITLQSREGELVISGEHAILN 299 ++FS+ RE ++E+P S +++G H ++I +TLQSR L+I+G + Sbjct: 2245 GESFSKIRESAIQEERPLSSLS------AIVEGEHSKFIEVTLQSRNNVLIINGSDIKVK 2298 Query: 298 CSVEEARGMLDIVHDQRVLSWSLFQLFQANVVADICFKEQKQ-HTSVNIQVDSLDMSISH 122 +E+ G L I D+ V SW F LFQ NV A+IC + H +Q D+LD+ +S Sbjct: 2299 SCLEQMSGSLQICEDKSVHSWPFFHLFQVNVEAEICNNPMEPVHVKTVVQCDNLDVWLSR 2358 Query: 121 QLFYIWNILGYQNPETGTFQYLASVMDLKVQLRKASLLLT 2 Q+F+ W+ G++ PE G+ Q+ S + +VQLRK SLLLT Sbjct: 2359 QVFHFWHGTGFKIPEAGSSQFTFSHVYFEVQLRKLSLLLT 2398 >ref|XP_002518393.1| vacuolar protein sorting-associated protein, putative [Ricinus communis] gi|223542238|gb|EEF43780.1| vacuolar protein sorting-associated protein, putative [Ricinus communis] Length = 3482 Score = 364 bits (934), Expect = 5e-98 Identities = 225/626 (35%), Positives = 348/626 (55%), Gaps = 29/626 (4%) Frame = -2 Query: 1792 ILNIRLRKKTGGALTRTFEISFSVQHVSCVLPSEFLAILIGYFCLPDWTPHGHDTCVTEN 1613 ILN+R++K G++T FE+S +QHV C LP E+LAI+IGYF DW+ + VTEN Sbjct: 1181 ILNLRVKKGLSGSVTSQFEVSIGIQHVYCFLPPEYLAIIIGYFSSSDWSTNLSMQLVTEN 1240 Query: 1612 CEERDTDN-YVILWKIEILESTLILPVECSRGQSLHLGLKQLYISFTTVKHREDALKDIP 1436 C+ T+ +++K EIL+S LILPVE Q L L+QLY S +D L+DIP Sbjct: 1241 
CDCIVTEKGNPVVYKFEILDSILILPVERDDHQFLKAELQQLYCSIILNCSPDDVLEDIP 1300 Query: 1435 IDCRVSEKMIVDNVHLLNIFGRDLCMSLVLLEDKENISAKLDKNTLNGNITLISPLDLDI 1256 +C V + LNI+GRDL +SL+L +D L+++ NITLI+PL D+ Sbjct: 1301 CECMVPTDKVAKANDCLNIYGRDLFLSLLLCKDDGYGCLILNEDNGFNNITLIAPLSADV 1360 Query: 1255 WIRIPCENKAFVGLST-PTCIMVEIKICQINAEDDFFLFGVQSVLNVIDELSAVGTLSEG 1079 W+R+PCE++ + S+ TC+M I CQ++A+D + L G +++++VI++ S++G S+ Sbjct: 1361 WVRLPCESEPCLNSSSASTCVMSRIANCQLHADDCYTLDGFEALVDVINQFSSIGNESKY 1420 Query: 1078 FKSDIFQFLQFKKILKEGSFVLPDASCVSFREVRCRAKSLSIRLCRSRPGHFVPPELVAK 899 F SDI QF Q K+ LKE V AS + F E RC A SLS+ L +S+ + + +AK Sbjct: 1421 FTSDILQFFQLKRSLKESGGVPTVASGMVFTEARCCANSLSVILYQSKRDS-IMEKPIAK 1479 Query: 898 VDMEVELSASFRNDVPLCLDIECSSLMLYSFHSRVILVQCKSDGSVSSGAGIHLEKSNRV 719 DM++ SAS N+ P+ LD+ SSL ++S V++ QC + S SS I S Sbjct: 1480 ADMQLICSASLINETPVELDLSFSSLAIHSLPDSVMIAQCANAHSASSALHIFFSNSIEA 1539 Query: 718 ENKLLVNLPFVEIWLHLSDWSKVIELLVSYLEQLSQISFMSASSKN-------------- 581 EN+ + LP + IWLH+ D S VI + Y +++S+ + +SSK+ Sbjct: 1540 ENEFHICLPSLNIWLHVLDSSAVIGIYNYYSKRMSETLVVESSSKSLSKDMADHTENATF 1599 Query: 580 --SDSSLMAANI----------EDCSSLIVKSEIIGASVHYPLLVIADAFSESREVENKQ 437 S SSL+ N +D L V+SE IG +VH+P+ A E E ++ Sbjct: 1600 SVSQSSLLKNNSPFDHPNEHTNQDSFVLSVRSECIGLTVHFPIWDSQSAVCEIETAEVQE 1659 Query: 436 EKPWDYSSNILGEEPVLKGRHHQYITITLQSREGELVISGEHAILNCSVEEARGMLDIVH 257 ++P SS+ +G+ +++ +T SR L + G++ L +E+ G + I Sbjct: 1660 QRPRFVSSH------ATEGKKCKFMAVTAHSRNSRLSMVGKNVRLKSILEKTSGTVGICE 1713 Query: 256 DQRVLSWSLFQLFQANVVADICFKEQK-QHTSVNIQVDSLDMSISHQLFYIWNILGYQNP 80 D+ + +W FQ+ + +V+ +IC + +QVD +DM +SHQ+ W + + P Sbjct: 1714 DKSITTWPFFQISEVDVMTEICNNHMNIAVIKLEVQVDRVDMWLSHQVLCFWYGVQFDIP 1773 Query: 79 ETGTFQYLASVMDLKVQLRKASLLLT 2 ETGT Q MDLK+Q RK SLL++ Sbjct: 1774 ETGTSQSSIESMDLKLQSRKVSLLIS 1799 >ref|XP_003541522.1| PREDICTED: uncharacterized protein LOC100783352 [Glycine max] Length = 3441 Score = 299 bits (765), Expect = 2e-78 Identities = 207/630 (32%), Positives = 323/630 (51%), Gaps = 31/630 (4%) Frame = -2 Query: 
1798 SHILNIRLRKKTGGALTRTFEISFSVQHVSCVLPSEFLAILIGYFCLPDWTPHGHDTCVT 1619 S ILN+R+RK + T EIS +QHV C+LPSE+L+I+IGYF L DW D C + Sbjct: 1193 SPILNVRVRKGQNISSTIDLEISIGIQHVYCMLPSEYLSIIIGYFSLSDWGGASGDQCFS 1252 Query: 1618 ENCEERDTDNYV-ILWKIEILESTLILPVECSRGQSLHLGLKQLYISFTTVKHREDALKD 1442 + + D N + I +K EIL+S LI PV + Q + + + QLY SF ++ LK+ Sbjct: 1253 DEQSDTDVKNEMKITYKFEILDSNLIFPVVSNDRQFIKIEMPQLYCSFIENSGVDEVLKN 1312 Query: 1441 IPIDCRVSEKMIVDNVHLLNIFGRDLCMSLVLLEDKENISAKLDKNTLNGNITLISPLDL 1262 IP +C V + LN+FGRDL +S +L ++ A +++NT LI+P++ Sbjct: 1313 IPPECLVPIHKLAKRNDCLNVFGRDLFVSFLLYKNDLLGLATVERNTEFLTSALIAPINA 1372 Query: 1261 DIWIRIPCENKAFVGLSTPTCIMVEIKICQINAEDDFFLFGVQSVLNVIDELSAVGTLSE 1082 D+W+RIP K+ ++ C M I C I AED F G ++ +VI+E S+V S+ Sbjct: 1373 DVWVRIPVGGKSNCKSTSSICFMTSISSCHIVAEDSHFFDGCMAIWDVIEEFSSVDDQSK 1432 Query: 1081 GFKSDIFQFLQFKKILKEGSFVLPD--ASCVSFREVRCRAKSLSIRLCRSRPGHFVPPEL 908 FKSD+ QFL K+ L+ + P AS + EV+C A+SL I R FV EL Sbjct: 1433 CFKSDVLQFLNSKRSLEATRTISPTLMASTIMSTEVKCCAQSLFISF-HHRKEDFV--EL 1489 Query: 907 VAKVDMEVELSASFRNDVPLCLDIECSSLMLYSFHSRVILVQCKSDGSVSSGAGIHLEKS 728 + K D+ SAS ND +CLD+ SS++ YS IL +C S I +S Sbjct: 1490 ITKGDLGFVCSASLINDSLVCLDLGFSSVVFYSPRDS-ILAKCTPTSFSMSVLSISFSQS 1548 Query: 727 NRVENKLLVNLPFVEIWLHLSDWSKVIELLVSY---LEQLSQISFMSASSKNSDSSLMAA 557 +NKL + L ++IWLHL++W++V++ L + LE+ + ++ S ++ +S+ + Sbjct: 1549 IGGKNKLDLCLSSIDIWLHLAEWTEVVKFLNHFRLHLERTPVNAITNSLSVDASNSVKKS 1608 Query: 556 NIEDCSS--------------------LIVKSEIIGASVHYPLLVIADAFSESREVENKQ 437 ++ SS I+KSE + H P+ V + E + + Sbjct: 1609 TVQHSSSFLDSESTSAPFTSQEIENDVFIIKSENFCITFHIPVWVGEEPHVEFQHSQGLN 1668 Query: 436 EKPWDYSSNILGEEPVLKGRHHQYITITLQSREGELVISGEHAILNCSVEEARGMLDIVH 257 P SS+I+ E+ +++T++ ELVI L +E+ ++ IV Sbjct: 1669 VTPLSVSSDIVEEKDA------KFLTVSFNMNGFELVIRSRDIQLTSKMEKLSSVIMIVE 1722 Query: 256 DQRVLSWSLFQLFQANVVADICFKEQKQHT-----SVNIQVDSLDMSISHQLFYIWNILG 92 + R S L + + V A +C K HT +V I D+ ++ ISH F++WN + Sbjct: 1723 NGRHTSCPLLDVIEVQVDAVLC----KNHTNTIELNVEIACDNSNVWISHPTFHLWNAVK 1778 Query: 91 
YQNPETGTFQYLASVMDLKVQLRKASLLLT 2 + PE+G QY S + K Q+RK S+LLT Sbjct: 1779 FDVPESGPSQYSTSGITFKFQMRKVSILLT 1808 >ref|XP_002874219.1| hypothetical protein ARALYDRAFT_910516 [Arabidopsis lyrata subsp. lyrata] gi|297320056|gb|EFH50478.1| hypothetical protein ARALYDRAFT_910516 [Arabidopsis lyrata subsp. lyrata] Length = 3344 Score = 293 bits (751), Expect = 8e-77 Identities = 199/628 (31%), Positives = 313/628 (49%), Gaps = 29/628 (4%) Frame = -2 Query: 1798 SHILNIRLRKKTGGALTRTFEISFSVQHVSCVLPSEFLAILIGYFCLPDWTPHGHDTCVT 1619 S +LN+R+RKK E+S +QH C+LP E+LAI+IGYF L DWT + Sbjct: 1032 SQVLNLRVRKKDLEPSGSELEVSIGIQHTCCILPPEYLAIIIGYFSLSDWTSKSGLQSLP 1091 Query: 1618 ENCE-ERDTDNYVILWKIEILESTLILPVECSRGQSLHLGLKQLYISFTTVKHREDALKD 1442 + E + + I +KIEIL+S+++LPVE + L + ++QLYISF + ++ Sbjct: 1092 QATELTKAPSEFAIAYKIEILDSSIVLPVEDDDRRQLKVDIQQLYISFVPECALSNVVQH 1151 Query: 1441 IPIDCRVSEKMIVDNVHLLNIFGRDLCMSLVLLEDKENISAKLDKNTLNGNITLISPLDL 1262 IP +C + + + LNIFGRDL +SL+L E IS + + + +ITL + + Sbjct: 1152 IPQECVIPLNQVAERADCLNIFGRDLSVSLLLSES--GIST-FENDAMCRSITLAASIIA 1208 Query: 1261 DIWIRIPCENKAFVGLSTPTCIMVEIKICQINAEDDFFLFGVQSVLNVIDELSAVGTLSE 1082 D WI PC+ L+ C+M + +C+I +D L G ++ L+V D+LS V S+ Sbjct: 1209 DAWISFPCDRNPLTDLA---CVMSRVDVCEIVVDDSDALDGFKAFLDVFDQLSLVDEESK 1265 Query: 1081 GFKSDIFQFLQFKKILKEGSFVLPDASCVSFREVRCRAKSLSIRLCRSR--PGHFVPPEL 908 F SD+ QFL+ K LK+ V P S SF + R L+ +L R R PG + E Sbjct: 1266 LFVSDVPQFLRTKMRLKQELSVAPLGSSTSFIKFRIFVNLLTAKLHRLRKDPGTLLS-EP 1324 Query: 907 VAKVDMEVELSASFRNDVPLCLDIECSSLMLYSFHSRVILVQCKSDGSVSSGAGIHLEKS 728 V + DM+ S F+N+ P+ LD++ + +YS S V+L +C + S + + Sbjct: 1325 VLQADMKFVCSGEFKNNFPMSLDVQFFEIGIYSLLSSVMLARCINAYGDPSALKVRFTEQ 1384 Query: 727 NRVENKLLVNLPFVEIWLHLSDWSKVIELLVSYLEQLSQISFMSASSKNSDSSL------ 566 E L +LP ++IWLH DW +VIELL SY + L S+ + D S+ Sbjct: 1385 AENEYDLCFSLPSLDIWLHSFDWIEVIELLKSYSQILEDPFLSKGSNLDMDESIGVVRTV 1444 Query: 565 --------------MAANIEDCSSLIVKSEIIGASVHYPLLVIADAF-----SESREVEN 443 ++ N + + +SE IG +H+PL F ++ E+ Sbjct: 1445 
CDNTDRVLNVLQTEVSENSSEVMAFSARSETIGVQIHFPLCTSHTEFPGFMATDVHEISE 1504 Query: 442 KQEKPWDYSSNILGEEPVLKGRHHQYITITLQSREGELVISGEHAILNCSVEEARGMLDI 263 ++ + + KG + +Y+++T +SR GEL I G L+ +E+ G+L I Sbjct: 1505 EEHRNF------------FKGNYCKYVSVTARSRSGELSILGRDVKLSYKIEKLNGILAI 1552 Query: 262 VHDQRVLSWSLFQLFQANVVADICFKEQK-QHTSVNIQVDSLDMSISHQLFYIWNILGYQ 86 V S SLF Q V I + K V I D+++M SHQ+ W+ + + Sbjct: 1553 SGVDTVRSCSLFGAAQLLVETSIQMDQNKIMSIDVGILSDNVEMHASHQVLSFWHGITFD 1612 Query: 85 NPETGTFQYLASVMDLKVQLRKASLLLT 2 PET + Q M +KVQ+R SLL++ Sbjct: 1613 APETPSSQNSQGNMSIKVQIRDVSLLIS 1640 >ref|NP_568451.7| uncharacterized protein [Arabidopsis thaliana] gi|332005969|gb|AED93352.1| uncharacterized protein [Arabidopsis thaliana] Length = 3464 Score = 291 bits (746), Expect = 3e-76 Identities = 206/627 (32%), Positives = 313/627 (49%), Gaps = 28/627 (4%) Frame = -2 Query: 1798 SHILNIRLRKKTGGALTRTFEISFSVQHVSCVLPSEFLAILIGYFCLPDWTPHGHDTCVT 1619 S +LN+R+RK+ E+S +QH C+LP E+LAI+IGYF L DWT + Sbjct: 1151 SQVLNLRVRKRGLEPSGSQLEVSIGIQHTYCILPPEYLAIIIGYFSLSDWTSKSGLQSLP 1210 Query: 1618 ENCE-ERDTDNYVILWKIEILESTLILPVECSRGQSLHLGLKQLYISFTTVKHREDALKD 1442 + E + + I +KIEIL+S+++LPVE + L + ++QLYISF + ++ Sbjct: 1211 QATELTKAHSEFAISYKIEILDSSIVLPVEGDDRRQLKVDIQQLYISFIPECALSNVVQH 1270 Query: 1441 IPIDCRVSEKMIVDNVHLLNIFGRDLCMSLVLLEDKENISAKLDKNTLNGNITLISPLDL 1262 IP +C + ++ LNIFGRDL +SL+L E +IS KN + +ITL + + Sbjct: 1271 IPQECVIPLNQVLGRADCLNIFGRDLSVSLLLSES--DIST-FKKNAVCRSITLAASIIA 1327 Query: 1261 DIWIRIPCENKAFVGLSTPTCIMVEIKICQINAEDDFFLFGVQSVLNVIDELSAVGTLSE 1082 D WIR PC++ L+ C+M + +C+I +D L G ++ L+V+D+LS V S+ Sbjct: 1328 DTWIRFPCDHNPLTELA---CVMSRVDVCEIVVDDSDALDGFKAFLDVVDQLSLVDEESK 1384 Query: 1081 GFKSDIFQFLQFKKILKEGSFVLPDASCVSFREVRCRAKSLSIRLCRSR--PGHFVPPEL 908 F SD+ QFL K LK+ V P SF + R L+ +L R R PG + E Sbjct: 1385 LFVSDVPQFLHTKMRLKQELSVAPLEPSTSFIKFRIFVNLLTSKLHRLRKAPGTLLS-EP 1443 Query: 907 VAKVDMEVELSASFRNDVPLCLDIECSSLMLYSFHSRVILVQCKSDGSVSSGAGIHLEKS 728 V + DM+ S +N+ P+ LD++ + LYS S V+L +C + S + + Sbjct: 1444 
VLQADMKFVCSGELKNNFPMSLDVQFFKIGLYSLLSSVMLARCINADGDPSALRVRFTEQ 1503 Query: 727 NRVENKLLVNLPFVEIWLHLSDWSKVIELLVSY---LEQLSQISFMSASSK--------- 584 E L +LP ++IWLH DW +VIELL SY LE S+ F S SK Sbjct: 1504 AENEYDLCFSLPSLDIWLHFFDWIEVIELLKSYSQKLEDSSEDRFFSKGSKLDMDESIGV 1563 Query: 583 ------NSDSSL------MAANIEDCSSLIVKSEIIGASVHYPLLVIADAFSESREVENK 440 N+D L ++ N + S +SE IG +H PL F + Sbjct: 1564 VRTICDNTDRVLNVLQTEVSENSSEVMSFAARSENIGVKIHIPLCTSHTEFPGFMATDVH 1623 Query: 439 QEKPWDYSSNILGEEPVLKGRHHQYITITLQSREGELVISGEHAILNCSVEEARGMLDIV 260 + ++++ KG + +Y+++T SR GEL I G L+ +E+ G+L I Sbjct: 1624 EISEEEHTN-------CFKGNYCKYVSVTACSRSGELSILGRDVKLSYKIEKLNGILAIS 1676 Query: 259 HDQRVLSWSLFQLFQANVVADICFKEQK-QHTSVNIQVDSLDMSISHQLFYIWNILGYQN 83 V S SLF Q V I + K V I D ++M SHQ+ W+ + + Sbjct: 1677 GVDTVRSCSLFGAAQLLVETSIQMDQNKIMSIDVGILSDKVEMHASHQVLSFWHGITFDA 1736 Query: 82 PETGTFQYLASVMDLKVQLRKASLLLT 2 PET + Q M +KVQ+R SLL++ Sbjct: 1737 PETPSSQNSEGNMSIKVQIRDVSLLIS 1763
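The per-hit statistics in the report above (subject length, bit score, E-value, identities, gaps, frame) can be pulled out of the whitespace-collapsed text with a regular expression. The following is a minimal sketch, not part of the original record: the names `HIT_RE`, `parse_evalue`, and `parse_hits` are hypothetical, and the pattern assumes the legacy BLAST 2.2.x text layout shown here, in which an E-value written as `e-102` is shorthand for `1e-102`. The sample string reuses the first two hit headers from the report.

```python
import re

# Hypothetical pattern for one hit header in a whitespace-collapsed
# legacy BLAST 2.2.x text report (assumption: fields appear in this order).
HIT_RE = re.compile(
    r">(?P<id>\S+)\s.*?Length\s*=\s*(?P<length>\d+)\s+"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln>\d+)\s+\(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)",
    re.DOTALL,
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-102' is read as 1e-102."""
    return float("1" + text if text.startswith("e") else text)


def parse_hits(report):
    """Return one dict per hit header found in the report text."""
    hits = []
    for m in HIT_RE.finditer(report):
        ident, aln = int(m["ident"]), int(m["aln"])
        hits.append({
            "id": m["id"],
            "subject_length": int(m["length"]),
            "bits": float(m["bits"]),
            "evalue": parse_evalue(m["evalue"]),
            "identity": ident / aln,          # fraction of identical columns
            "gaps": int(m["gaps"] or 0),      # the Gaps field may be absent
            "frame": int(m["frame"]),
        })
    return hits


# Two hit headers copied from the report above.
SAMPLE = (
    ">ref|XP_002275536.2| PREDICTED: uncharacterized protein LOC100245550 "
    "[Vitis vinifera] Length = 4054 Score = 378 bits (970), Expect = e-102 "
    "Identities = 236/640 (36%), Positives = 353/640 (55%), "
    "Gaps = 41/640 (6%) Frame = -2 Query: 1798 SHILNIRLRKK "
    ">ref|XP_002518393.1| vacuolar protein sorting-associated protein, "
    "putative [Ricinus communis] Length = 3482 Score = 364 bits (934), "
    "Expect = 5e-98 Identities = 225/626 (35%), Positives = 348/626 (55%), "
    "Gaps = 29/626 (4%) Frame = -2"
)

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["id"], hit["bits"], hit["evalue"], round(hit["identity"], 3))
```

A frame of -2 means the query was translated from its reverse strand, which is why the query coordinates in the alignments run downward from 1798 to 2.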