BLASTX nr result
ID: Coptis24_contig00021726
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00021726
         (1246 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|2...   214   3e-53
ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus c...   207   3e-51
ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818...   182   2e-43
ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arab...   180   6e-43
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   177   4e-42

>ref|XP_002310176.1| predicted protein [Populus trichocarpa]
 gi|222853079|gb|EEE90626.1| predicted protein [Populus trichocarpa]
          Length = 868

 Score = 214 bits (546), Expect = 3e-53
 Identities = 144/435 (33%), Positives = 225/435 (51%), Gaps = 23/435 (5%)
 Frame = -2

Query: 1245 NVSLLNN---SSCICFEQVSISDVKFRFSPWSFPAFTLEFSGVYVKLRANKVI------- 1096
            +VS LNN SS F++V++ + FRFS WS PA + GV + L A +V
Sbjct: 44   DVSALNNESESSRFQFKEVTVDHLSFRFSNWSSPACKIGIRGVNITLLAGEVKEEGSLRR 103

Query: 1095 --KKSEKRKEILSVLDPEGVLLHDAIEKIITNSITSARSWVMTSXXXXXXXXXXXLIHDV 922
            K SE++K+ ++ DPEG LH+ +E+I+ N +R+W TS I D
Sbjct: 104  ARKLSEEKKKAVAGFDPEGSALHNVLERILLNP--PSRNWFKTSLLNLLLKHCHLQISDT 161

Query: 921  NLELQLHD--DDVSSSLKIKELSLNAV-DECSCLLKGFVGAVLMPRRFCSLDFSVRGLEI 751
            NL++Q D D V L++K+ + + + CLL+G VGAV P + S RG
Sbjct: 162  NLQVQFPDLNDAVVFLLELKDFNGESEHSDPGCLLRGVVGAVFKPLKVVSFVMDFRGFGF 221

Query: 750  GLRKEEYANRVLYLEEISTXXXXXXXXXXXXXXXVPQFDIAFCPSDLQIVIAFDILIAKE 571
            + E+ N + ++ + VP+ + F P DL ++ AF L KE
Sbjct: 222  AYKMEDQINHISSFTDLLSCIKLNDLRVADFNIRVPKLSLLFSPLDLLVLSAFGKLSTKE 281

Query: 570  AKHVRNGRELWNIAANRVDSLTMAAKLSLRKLVGIARIWLRYVHTYESLLSLLGYPGETM 391
            KHVR+GR+LW +AANR+ + + +LSL KLV +WLRY + YE LLSLLGY + +
Sbjct: 282  RKHVRSGRQLWKLAANRLGYVPSSPRLSLHKLVDFICLWLRYQNAYEYLLSLLGYSADNL 341

Query: 390  FEKSSSRMSMNKKLSNDVRNHWKVVSEIEKDMPVEVLARGRRVARERASFQSSTPSSTQR 211
            +KS ++S +K N V+++W +S IEK++P E +A+ RR+AR RA ++ +
Sbjct: 342  LKKSVIKLSEDKMFLNSVKHNWGEISGIEKELPAEAIAQARRIARYRAVSNIQNGKNSFK 401

Query: 210  HVKFDK--FIFSKILSYIARTFCFIYHSVIQFLVVWASL---NRHEEVDGISRVVSEDY- 49
            DK +FSKILS + +Y ++ L + + + ++D SEDY
Sbjct: 402  ESSMDKQVNVFSKILSVFIVIWNVMYKILLSILHCFFFIILFFQRPKLDWNPGNNSEDYS 461

Query: 48   --FHCCVNFRKVFIT 10
            + +NF K+ +T
Sbjct: 462  SRYCFLLNFGKILVT 476

>ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus communis]
 gi|223538452|gb|EEF40058.1| hypothetical protein RCOM_0603630 [Ricinus communis]
          Length = 1720

 Score = 207 bits (528), Expect = 3e-51
 Identities = 141/435 (32%), Positives = 224/435 (51%), Gaps = 26/435 (5%)
 Frame = -2

Query: 1236 LLNNSSCICFEQVSISDVKFRFSPWSFPAFTLEFSGVYVKL------------RANKVIK 1093
            LL+++S F V+I ++ RFS WS PAF +E GV V L RA K +
Sbjct: 51   LLDDASLFSFGGVTIEELTLRFSNWSVPAFNIEVRGVNVILVAREEEEERSSVRARKSSE 110

Query: 1092 K-SEKRKEILSVLDPEGVLLHDAIEKIITNSITSARSWVMTSXXXXXXXXXXXLIHDVNL 916
            K +E++K+ ++ DPEG LHD +EKI+ + T +R TS + D L
Sbjct: 111  KVNEEKKKAVAGFDPEGGALHDVLEKILIS--TPSRKGFTTSLLNLILKHCHLQVFDTKL 168

Query: 915  ELQLH--DDDVSSSLKIKELSLNA-VDECSCLLKGFVGAVLMPRRFCSLDFSVRGLEIGL 745
            ++Q+ +DD+ L++KE + + E CLL+GF+G P + S+ + +GL IG
Sbjct: 169  QVQVPILNDDLVCLLELKEFNGESEYFEHGCLLRGFLGVAFNPPKETSIVMNFKGLGIGY 228

Query: 744  RKEEYANRVLYLEEISTXXXXXXXXXXXXXXXVPQFDIAFCPSDLQIVIAFDILIAKEAK 565
            + N V+ ++ + VP ++ P DL ++ L KE K
Sbjct: 229  WMNDKENSVVSSTDLFSCIRLNDLQLADISIRVPGLNLLLSPLDLLVLSVLGRLPLKEPK 288

Query: 564  HVRNGRELWNIAANRVDSLTMAAKLSLRKLVGIARIWLRYVHTYESLLSLLGYPGETMFE 385
            HVRNGR+LW +AANR+ +T +LSL L +WLRY++ YE LLS +GY + +
Sbjct: 289  HVRNGRQLWRLAANRLGYVTSFPRLSLHNLADFVCMWLRYLNAYEHLLSFIGYTQVNLLK 348

Query: 384  KSSSRMSMNKKLSNDVRNHWKVVSEIEKDMPVEVLARGRRVARERA--SFQSSTPSSTQR 211
            + S M +K + V+ HW+++S EK++P E +A+ RR+AR +A S S +
Sbjct: 349  RPSIGMLRDKMFHSSVKQHWELISRTEKELPPEAIAQARRIARYKATLSIPQGEDSYKEY 408

Query: 210  HVKFDKFIFSKILSYIARTFCFIYHSVIQFLVVWASL---NRHEEVDGISRVVSEDYFHC 40
            V+ +FSK+LS + T+ I+ V+ + + S+ + + DG ++SED HC
Sbjct: 409  SVRSQFQVFSKVLSLLVFTWNVIHRVVLSNIHAFLSIVFSRQEPKFDGHLGIISED--HC 466

Query: 39   -----CVNFRKVFIT 10
            +NF KV IT
Sbjct: 467  PQYCFLLNFGKVLIT 481

>ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818143 [Glycine max]
          Length = 3602

 Score = 182 bits (461), Expect = 2e-43
 Identities = 125/435 (28%), Positives = 221/435 (50%), Gaps = 24/435 (5%)
 Frame = -2

Query: 1236 LLNNSSCICFEQVSISDVKFRFSPWSFPAFTLEFSGVYV--------------KLRANKV 1099
            L ++ + + F+ +S+ + RFS W PAFT+E GV + +LR +K
Sbjct: 51   LFHSPAFLFFKDLSVERLTLRFSTWFPPAFTVELHGVRIVQSFEKPEAEECAARLRNSKY 110

Query: 1098 IKKSEKRKEILSVLDPEGVLLHDAIEKIITNSITSARSWVMTSXXXXXXXXXXXLIHDVN 919
            + RK LS LDPEG LHD +E+I+ + + TS + H ++
Sbjct: 111  DCEDYLRKN-LSALDPEGCSLHDILERILFAA--PEKKDFTTSFWNLILKNCHLVAHCIH 167

Query: 918  LELQLH--DDDVSSSLKIKELSLNA--VDECSCLLKGFVGAVLMPRRFCSLDFSVRGLEI 751
            +E+QL +D+ +IKELS+ + VD+ CLL+GF+ +V +P + +L G
Sbjct: 168  VEIQLPVLNDEFMCFGEIKELSVRSKYVDK-KCLLRGFLSSVFIPMKDSTLVLKGVGFRA 226

Query: 750  GLRKEEYANRVLYLEEISTXXXXXXXXXXXXXXXVPQFDIAFCPSDLQIVIAFDILIAKE 571
            L +++ VL ++ P+ +F P + + + F L++
Sbjct: 227  RLVGKDHTGNVLLSSDMQIDIKFRDLKLASCTLCFPELVFSFSPDGISVCLLFLKLVSNN 286

Query: 570  AKHVRNGRELWNIAANRVDSLTMAAKLSLRKLVGIARIWLRYVHTYESLLSLLGYPGETM 391
            R RELW IAA+R+ +T+ +LS +LVG+ W+ Y + YE++L L+GY
Sbjct: 287  YNQSRGARELWRIAASRIGHVTVTPRLSFHRLVGVIGQWIHYANAYENILLLIGYSTSHT 346

Query: 390  FEKSSSRMSMNKKLSNDVRNHWKVVSEIEKDMPVEVLARGRRVARERASFQSSTPSSTQR 211
            ++KS S+++ NK + + HWK++S+IEK +PVE ++ RR+AR RA+ + S + +
Sbjct: 347  WKKSISKLTRNKLILSSASRHWKLISDIEKKLPVEGISLARRIARHRAALKDSI-NCHED 405

Query: 210  HVKFDKFI--FSKILSYIARTFCFIYHSVIQFLVVWASLNRHEEVDG--ISRVVSEDYFH 43
            V +KF F +LS++ + I H ++ + + + ++DG + ++ +
Sbjct: 406  FVTTNKFFRPFIFLLSFMWKLISTIIHCLVN-IFSREKIVQDPDIDGCCLESLIEDPCQS 464

Query: 42   CC--VNFRKVFITVN 4
            CC +NF K+ ITV+
Sbjct: 465  CCFVLNFGKIIITVS 479

>ref|XP_002877744.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata]
 gi|297323582|gb|EFH54003.1| hypothetical protein ARALYDRAFT_485391 [Arabidopsis lyrata subsp. lyrata]
          Length = 3074

 Score = 180 bits (457), Expect = 6e-43
 Identities = 143/446 (32%), Positives = 220/446 (49%), Gaps = 31/446 (6%)
 Frame = -2

Query: 1245 NVSLLN---NSSCICFEQVSISDVKFRFSPWSFPAFTLEFSGVYVKLRANKVIKKSEKRK 1075
            +VS LN + S FE+ +I + R S WS PA +E GV VKL A + S +RK
Sbjct: 45   DVSQLNQLLDGSNFQFEKFTIDHLVVRLSVWSAPAIKIEIRGVNVKLSARGTEEGSSRRK 104

Query: 1074 ------------EILSVLDPEGVLLHDAIEKIITNSITSARSWVMTSXXXXXXXXXXXLI 931
            ++LS +DPEG +LHD +EK++ S TS S + TS I
Sbjct: 105  RASSDRVANEIKKVLSSIDPEGCVLHDILEKMLGRS-TSQISKLKTSFSNLILRHFRIRI 163

Query: 930  HDVNLELQLH-DDDVSSSLKIKELSLNAVDECSC-LLKGFVGAVLMPRRFCSLDFSVRGL 757
            H +N+++ L ++S ++I EL ++ + + L++ AVL P R SL S G
Sbjct: 164  HGINVQVCLPGSSNLSCVMEINELRSDSENFGNLGLVRSSAAAVLFPLRRSSLTLSCFGF 223

Query: 756  EIGLRKEEYANRVLYLEEISTXXXXXXXXXXXXXXXVPQFDIAFCPSDLQIVIAFDILIA 577
            IG +++ + + + +P+ + +F P+DL +++ L +
Sbjct: 224  NIGYKRDNEIADLCGFDSLVMLITLHNLQLVDLIVRIPELNFSFRPTDLPVLMGLANLSS 283

Query: 576  KEAKHVRNGRELWNIAANRVDSLTMAAKLSLRKLVGIARIWLRYVHTYESLLSLLGYPGE 397
            K++ +VRNGR LW +AA R + +S + LV +WLRYV+ YE LLSL GY
Sbjct: 284  KDSNYVRNGRYLWKVAARRTGLMISPHTVSFQNLVSAVILWLRYVNAYEYLLSLAGY-SR 342

Query: 396  TMFEKSSS-RMSMNKKLSNDVRNHWKVVSEIEKDMPVEVLARGRRVARERASFQS-STPS 223
            +M EKS + S NK+ R W+++ IEK++P E +AR RRVAR R QS ++
Sbjct: 343  SMPEKSLLWKFSENKRHFGTARRKWEMICNIEKELPAEAIARARRVARYRTCLQSQNSDE 402

Query: 222  STQRHVKFDKF--------IFSKILSYIARTF----CFIYHSVIQFLVVWASLNRHEEVD 79
            S + F + + I I+RTF CF++ + ++L R+ E D
Sbjct: 403  SYDESFVYGHFNCLSKTTGVLACIWRLISRTFWSIACFLWSN--KYLTQELQTGRNNEDD 460

Query: 78   GISRVVSEDYFHCCVNFRKVFITVNP 1
            S +VS + FH VN KV IT P
Sbjct: 461  --SELVSLE-FHAVVNLGKVSITFYP 483

>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332645140|gb|AEE78661.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 3072

 Score = 177 bits (450), Expect = 4e-42
 Identities = 138/445 (31%), Positives = 215/445 (48%), Gaps = 30/445 (6%)
 Frame = -2

Query: 1245 NVSLLN---NSSCICFEQVSISDVKFRFSPWSFPAFTLEFSGVYVKLRANKVIKKSEKRK 1075
            +VS LN + S FE+ ++ + FS WS PA E GV VKL A + S +RK
Sbjct: 45   DVSQLNQLFDESNFQFEKFTVDQLVVSFSVWSAPAIKFEIRGVNVKLSARGTDEGSSRRK 104

Query: 1074 ------------EILSVLDPEGVLLHDAIEKIITNSITSARSWVMTSXXXXXXXXXXXLI 931
            ++LS +DP+G +LHD +EK++ S TS S + TS I
Sbjct: 105  RASSDTVANEIKKVLSSIDPKGCVLHDILEKMLGRS-TSQISKLKTSFSNLILRHFRIQI 163

Query: 930  HDVNLELQLH-DDDVSSSLKIKELSLNAVDECSC-LLKGFVGAVLMPRRFCSLDFSVRGL 757
            H +N+++ L D+S ++I EL ++ + + L++ AVL P R S S G
Sbjct: 164  HGINVQVCLPGSSDLSCLMEINELRSDSENFGNLSLVRSSAAAVLFPLRRSSFTLSCFGF 223

Query: 756  EIGLRKEEYANRVLYLEEISTXXXXXXXXXXXXXXXVPQFDIAFCPSDLQIVIAFDILIA 577
            IG +++ + + + VP+ +F P+DL +++ L +
Sbjct: 224  NIGYKRDNEIVDLCGFDSLVMLITLHNLQLVDLVVRVPELSFSFRPTDLPVLMGLANLSS 283

Query: 576  KEAKHVRNGRELWNIAANRVDSLTMAAKLSLRKLVGIARIWLRYVHTYESLLSLLGYPGE 397
            K++ +VRNGR LW +AA R + +S + LV + +WLRYV+ YE LLSL GY +
Sbjct: 284  KDSNYVRNGRYLWKVAARRTGLMISPHSVSFQNLVSVVILWLRYVNAYEYLLSLAGYSRK 343

Query: 396  TMFEKSSSRMSMNKKLSNDVRNHWKVVSEIEKDMPVEVLARGRRVARERASFQSS----- 232
            + + S NK+ R W+++ IEK++P E +AR RRVAR RA S
Sbjct: 344  MPEKSLLWKFSENKRHFVTARRKWEMICNIEKELPAEAIARARRVARYRACLNSQDADDD 403

Query: 231  -TPSSTQRHVKF---DKFIFSKILSYIARTF----CFIYHSVIQFLVVWASLNRHEEVDG 76
            SS H K+ ++ + I I+RTF CF++ + + L +R+ E D
Sbjct: 404  YDESSLYGHFKYLSKTTWVLAYIWRLISRTFWSIACFLW--LNKLLTQELQTDRNNEDD- 460

Query: 75   ISRVVSEDYFHCCVNFRKVFITVNP 1
            S VS + FH VN K+ +T P
Sbjct: 461  --SECVSLE-FHAVVNLGKVSITFYP 483