BLASTX nr result
ID: Coptis23_contig00017004
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00017004
         (1561 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002264494.2| PREDICTED: uncharacterized protein LOC100267...   476   e-132
emb|CBI35883.3| unnamed protein product [Vitis vinifera]              471   e-130
ref|XP_002892952.1| hypothetical protein ARALYDRAFT_889150 [Arab...   379   e-103
ref|XP_002521337.1| conserved hypothetical protein [Ricinus comm...   379   e-102
gb|AAF97311.1|AC007843_14 Hypothetical protein [Arabidopsis thal...   379   e-102

>ref|XP_002264494.2| PREDICTED: uncharacterized protein LOC100267761 [Vitis vinifera]
          Length = 1884

 Score = 476 bits (1226), Expect = e-132
 Identities = 271/541 (50%), Positives = 350/541 (64%), Gaps = 22/541 (4%)
 Frame = +2

Query: 2    MDSIITAAVTEICSRNKNGLLLKDLWARLETKITTSGLNLCPGVKKSIWNGLLNIPNLQF 181
            MDSI+ AA+ EICS+ NGL L+ LW L ++++GL+L GVK +IW LL P L+F
Sbjct: 1    MDSIVFAALEEICSQGANGLALQSLWPNLHAALSSAGLDLSSGVKAAIWANLLKTPGLEF 60

Query: 182  TSKKSNFSPQDSAIKLVEDSEKLGLKIVAGESLRDCFLGIYDT-TSGDSGISPPQRAVLE 358
            S+ + + D AI+ V EKL LKIVA E LRD F+G+YD S +GIS QR VLE
Sbjct: 61   QSRNVSRNADDPAIQSVVQCEKLNLKIVAAEHLRDSFVGLYDAKASAVTGISAVQRRVLE 120

Query: 359  RLGGCRTNGITQKQLTDESGMKANNIFYVLRNLQVRGLIVRQSTIVRTKENVPGGENELE 538
            RL RTNGITQ QL E G+KANN+FYVLRNL+ RGLIVRQS+IVRTKE GE++
Sbjct: 121  RLAIARTNGITQSQLCKEFGIKANNMFYVLRNLECRGLIVRQSSIVRTKEACSEGESK-- 178

Query: 539  TTSIVNTNLIHLHRYAKPLSSQQRLEITKSDTLESL---GDVRGDT----------TMLL 679
            +SIV+TNLIHL+RY K L SQQ+LEITK D L GD RG ML+
Sbjct: 179  NSSIVSTNLIHLYRYGKHLGSQQKLEITKEDKLLDCLGNGDERGAAGDGGTRGCGEEMLI 238

Query: 680  KDYVPAMKAICDKLEEADGKVLVVSDIKQDLGYRDRQGHRAWRHICKRLKDAHLVEEFQA 859
            KDY+PAMKAICDKLEEA+GKVLVV DIKQDLGY+ GH++WR+IC RLKDA LVEEF A
Sbjct: 239  KDYLPAMKAICDKLEEANGKVLVVRDIKQDLGYQGYHGHKSWRNICSRLKDAGLVEEFDA 298

Query: 860  KVNKKVVSCLRLLKHIGLASDDFDTDQSLKCGKRGQMMNQYLELPIEHQIYDLVDSKGPK 1039
            +VNKK K G DD D +Q +K GKRGQ+ +Q +ELP+EHQIYD++D++GPK
Sbjct: 299  EVNKKP-------KTQGSGLDDPDAEQLVKSGKRGQITDQLVELPMEHQIYDMIDAEGPK 351

Query: 1040 GLTVTEVFKHLGLNNKRNYSRLLSMCSRFGMHLEAESHNRSKQYRIWTSGNFTDNSANVL 1219
            GLTV EV + LG+N+K NY+R L+M SRFGMHL+AESH R YR+WT+GNF S+N
Sbjct: 352  GLTVIEVCQRLGINSKANYNRFLNMFSRFGMHLQAESHKRGMAYRVWTAGNFNPASSNAF 411

Query: 1220 PGKSGDMLSMRSVN--------DIGEREDQTIHLLESPASKDEFRSSEETEYGQRGLELS 1375
            P KS ++ + V+ D+ ++ QTI L+ K + + +T+ + E S
Sbjct: 412  PDKSENIFNENGVSNPHVVGYMDLHQKSAQTIQELDPSTLKTDNTTHGKTKNREIEPEPS 471

Query: 1376 YTSPGDGASCQLLICGSNPHGLGCDATDGCEEHDLVNMATESISVPSEKPPTTPSNQLKR 1555
            PG G Q+L+C SNP + D + + ++ +++I P T+P K
Sbjct: 472  QIFPGGGECNQMLLCPSNPLEFNHEKKDPVPDAE-PDLESKAIEANDALPETSPLALSKS 530

Query: 1556 Q 1558
            Q
Sbjct: 531  Q 531

>emb|CBI35883.3| unnamed protein product [Vitis vinifera]
          Length = 1683

 Score = 471 bits (1211), Expect = e-130
 Identities = 254/445 (57%), Positives = 314/445 (70%), Gaps = 24/445 (5%)
 Frame = +2

Query: 2    MDSIITAAVTEICSRNKNGLLLKDLWARLETKITTSGLNLCPGVKKSIWNGLLNIPNLQF 181
            MDSI+ AA+ EICS+ NGL L+ LW L ++++GL+L GVK +IW LL P L+F
Sbjct: 1    MDSIVFAALEEICSQGANGLALQSLWPNLHAALSSAGLDLSSGVKAAIWANLLKTPGLEF 60

Query: 182  TSKKSNFSPQDSAIKLVEDSEKLGLKIVAGESLRDCFLGIYDT-TSGDSGISPPQRAVLE 358
            S+ + + D AI+ V EKL LKIVA E LRD F+G+YD S +GIS QR VLE
Sbjct: 61   QSRNVSRNADDPAIQSVVQCEKLNLKIVAAEHLRDSFVGLYDAKASAVTGISAVQRRVLE 120

Query: 359  RLGGCRTNGITQKQLTDESGMKANNIFYVLRNLQVRGLIVRQSTIVRTKENVPGGENELE 538
            RL RTNGITQ QL E G+KANN+FYVLRNL+ RGLIVRQS+IVRTKE GE++
Sbjct: 121  RLAIARTNGITQSQLCKEFGIKANNMFYVLRNLECRGLIVRQSSIVRTKEACSEGESK-- 178

Query: 539  TTSIVNTNLIHLHRYAKPLSSQQRLEITKSDTLESL---GDVRGDT----------TMLL 679
            +SIV+TNLIHL+RY K L SQQ+LEITK D L GD RG ML+
Sbjct: 179  NSSIVSTNLIHLYRYGKHLGSQQKLEITKEDKLLDCLGNGDERGAAGDGGTRGCGEEMLI 238

Query: 680  KDYVPAMKAICDKLEEADGKVLVVSDIKQDLGYRDRQGHRAWRHICKRLKDAHLVEEFQA 859
            KDY+PAMKAICDKLEEA+GKVLVV DIKQDLGY+ GH++WR+IC RLKDA LVEEF A
Sbjct: 239  KDYLPAMKAICDKLEEANGKVLVVRDIKQDLGYQGYHGHKSWRNICSRLKDAGLVEEFDA 298

Query: 860  KVNKKVVSCLRLLKHI----------GLASDDFDTDQSLKCGKRGQMMNQYLELPIEHQI 1009
            +VNKKVVSCLRLLK G DD D +Q +K GKRGQ+ +Q +ELP+EHQI
Sbjct: 299  EVNKKVVSCLRLLKKFSPKCFEPKTQGSGLDDPDAEQLVKSGKRGQITDQLVELPMEHQI 358

Query: 1010 YDLVDSKGPKGLTVTEVFKHLGLNNKRNYSRLLSMCSRFGMHLEAESHNRSKQYRIWTSG 1189
            YD++D++GPKGLTV EV + LG+N+K NY+R L+M SRFGMHL+AESH R YR+WT+G
Sbjct: 359  YDMIDAEGPKGLTVIEVCQRLGINSKANYNRFLNMFSRFGMHLQAESHKRGMAYRVWTAG 418

Query: 1190 NFTDNSANVLPGKSGDMLSMRSVND 1264
            NF S+N P KS ++ + V++
Sbjct: 419  NFNPASSNAFPDKSENIFNENGVSN 443

>ref|XP_002892952.1| hypothetical protein ARALYDRAFT_889150 [Arabidopsis lyrata subsp. lyrata]
 gi|297338794|gb|EFH69211.1| hypothetical protein ARALYDRAFT_889150 [Arabidopsis lyrata subsp. lyrata]
          Length = 1850

 Score = 379 bits (974), Expect = e-103
 Identities = 209/448 (46%), Positives = 285/448 (63%), Gaps = 21/448 (4%)
 Frame = +2

Query: 2    MDSIITAAVTEICSRNKNGLLLKDLWARLETKITTSGLNLCPGVKKSIWNGLLNIPNLQF 181
            MDSI+ + EIC + G+ L LW+RL S L P VK +W LL +P LQF
Sbjct: 1    MDSIVCTTLEEICCQGNTGIPLVSLWSRL------SPPPLSPSVKAHVWRNLLAVPQLQF 54

Query: 182  TSKKSNFSPQDSAIKLVEDSEKLGLKIVAGESLRDCFLGIYDTTSGDSGISPPQRAVLER 361
            +K + + P D++I+ +E++ +L L+IVA E LR F+G+YD S ++ IS QR VLER
Sbjct: 55   KAKNTVYEPSDASIQQLEEALRLDLRIVANEKLRGNFVGLYDAQSNNTTISAIQRRVLER 114

Query: 362  LGGCRTNGITQKQLTDESGMKANNIFYVLRNLQVRGLIVRQSTIVRTKENVPGGENELET 541
            L R NG+ Q L E G++ N FY++++L+ RGL+V+Q IVRTKE GE + +T
Sbjct: 115  LAVARANGVAQNLLAKEFGIEGRNFFYIVKHLESRGLVVKQPAIVRTKE--VDGEGDSKT 172

Query: 542  TSIVNTNLIHLHRYAKPLSSQQRLEITKSDTLESLGDVRGDTT--------------MLL 679
            TS ++TN+I+L RYAKPL SQQR EI K D+L + + T L+
Sbjct: 173  TSCISTNMIYLSRYAKPLGSQQRFEICKEDSLSETPMMEHEVTPAGDSLLSESTKEDTLI 232

Query: 680  KDYVPAMKAICDKLEEADGKVLVVSDIKQDLGY-RDRQGHRAWRHICKRLKDAHLVEEFQ 856
            KD++PAMKAICDKLEEA+ KVLVVSDIKQDLGY HRAWR +C+RL D+H+VEEF
Sbjct: 233  KDFLPAMKAICDKLEEANEKVLVVSDIKQDLGYLGSHSRHRAWRSVCRRLTDSHVVEEFD 292

Query: 857  AKVNKKVVSCLRLLKHIGLASDDFDT----DQSLKCGKRGQMMNQYLELPIEHQIYDLVD 1024
            A VN KV CLRLLK ++ DF+ LK G+ Q Q LELPI++QIYD+VD
Sbjct: 293  AVVNNKVERCLRLLKR--FSAKDFNNYSGKKHLLKFGRSIQRTEQTLELPIDNQIYDMVD 350

Query: 1025 SKGPKGLTVTEVFKHLGLNNKRNYSRLLSMCSRFGMHLEAESHNRSKQYRIWTSGNFTDN 1204
            ++G KGL V EV + LG++ K++YSRL S+C R GMH++AESH +++ +R+WTSGN
Sbjct: 351  AEGSKGLAVMEVCERLGIDKKKSYSRLYSICLRVGMHIQAESHKKTRVFRVWTSGNAGSE 410

Query: 1205 SANVLPGK--SGDMLSMRSVNDIGERED 1282
            +++ P K + + S ND G D
Sbjct: 411  CSDLFPEKVENRSWENNVSTNDFGTPHD 438

>ref|XP_002521337.1| conserved hypothetical protein [Ricinus communis]
 gi|223539415|gb|EEF41005.1| conserved hypothetical protein [Ricinus communis]
          Length = 1854

 Score = 379 bits (973), Expect = e-102
 Identities = 225/494 (45%), Positives = 296/494 (59%), Gaps = 17/494 (3%)
 Frame = +2

Query: 2    MDSIITAAVTEICSRNKNGLLLKDLWARLETKITTSGLNLCPGVKKSIWNGLLNIPNLQF 181
            MDS+I+ A+ EICSR GL + LW+ L T S +K +IW LL+IP+LQF
Sbjct: 1    MDSLISTALEEICSRGATGLSVSSLWSTLTPTPTNS-------LKIAIWKNLLSIPSLQF 53

Query: 182  TSKKSN-FSPQDSAIKLVEDSEKLGLKIVAGESLRDCFLGIYDTTSGDSGISPPQRAVLE 358
            SK F+ D I+ ED+EKL LKIVA LRDCF+G+YD S +GI P QR LE
Sbjct: 54   ISKNDTPFTSTDPKIQRFEDAEKLNLKIVANNHLRDCFVGLYDAPS--TGICPLQRRTLE 111

Query: 359  RLGGCRTNGITQKQLTDESGMKANNIFYVLRNLQVRGLIVRQSTIVRTKE---NVPGGEN 529
            RL RT G+TQ QL E G++ NN FY +RNL+ R LIVRQ +V+TKE + GGE+
Sbjct: 112  RLAISRTIGVTQNQLAKEFGIEGNNYFYRVRNLECRKLIVRQPAVVKTKEAAVDCEGGES 171

Query: 530  ELETTSIVNTNLIHLHRYAKPLSSQQRLEITKSDTLESLGDVRG-DTTMLLKDYVPAMKA 706
            + +SIV+TNLI+L RYAK L QQR EI K D + D G + + +KD++PAMKA
Sbjct: 172  K--NSSIVSTNLIYLSRYAKHLGVQQRFEINKGD----IDDTHGFEDDVAIKDFLPAMKA 225

Query: 707  ICDKLEEADGKVLVVSDIKQDLGYRDRQGHRAWRHICKRLKDAHLVEEFQAKVNKKVVSC 886
            I DKL+EA+ KVL+VSDIKQ LGY R GHRAWR+IC+RLKDA +VE F AKVN KV C
Sbjct: 226  ISDKLQEANDKVLIVSDIKQSLGYTGRSGHRAWRNICRRLKDAGIVESFDAKVNGKVEHC 285

Query: 887  LRLLKHIGL---------ASDDFDTDQSLKCGKRGQMMNQYLELPIEHQIYDLVDSKGPK 1039
            LRLLK L +D QS+K G+R Q Q +ELPI+ QIYD++D+K +
Sbjct: 286  LRLLKKFSLDNFEKKILGCRNDCPNKQSVKFGRRSQQTEQLVELPIDQQIYDMIDAKRTE 345

Query: 1040 GLTVTEVFKHLGLNNKRNYSRLLSMCSRFGMHLEAESHNRSKQYRIWTSGNFTDNSANVL 1219
            G T+ EV LGL+ KRN SRL ++ SRFGMH++AE+H ++ +R+WT N T +N
Sbjct: 346  GATMIEVCGRLGLDRKRNDSRLHNLFSRFGMHVQAENHKKTVAFRVWTPENSTPKESNAF 405

Query: 1220 PGKSGDMLSMRSVNDIGEREDQTIHLLESPASKDEFRSSEETEYGQRGLELSYTS---PG 1390
            KS +L D T L+ + + EY +E+ + + P
Sbjct: 406  LDKSKSVLG---------GNDHT--LIVGNCDVPDGSTEALVEYNHSAVEIDFATSKKPN 454

Query: 1391 DGASCQLLICGSNP 1432
            D + C +P
Sbjct: 455  DNKEIEAEPCNGSP 468

>gb|AAF97311.1|AC007843_14 Hypothetical protein [Arabidopsis thaliana]
          Length = 1808

 Score = 379 bits (972), Expect = e-102
 Identities = 209/442 (47%), Positives = 285/442 (64%), Gaps = 15/442 (3%)
 Frame = +2

Query: 2    MDSIITAAVTEICSRNKNGLLLKDLWARLETKITTSGLNLCPGVKKSIWNGLLNIPNLQF 181
            MDSI+ A+ EIC + G+ L LW+RL S L P VK +W LL +P LQF
Sbjct: 1    MDSIVCTALEEICCQGNTGIPLVSLWSRL------SPPPLSPSVKAHVWRNLLAVPQLQF 54

Query: 182  TSKKSNFSPQDSAIKLVEDSEKLGLKIVAGESLRDCFLGIYDTTSGDSGISPPQRAVLER 361
            +K + + P D++I+ +E++ +L L+I A E LR F+G+YD S ++ IS QR VLER
Sbjct: 55   KAKNTVYEPSDASIQQLEEALRLDLRIFANEKLRGNFVGLYDAQSNNTTISAIQRRVLER 114

Query: 362  LGGCRTNGITQKQLTDESGMKANNIFYVLRNLQVRGLIVRQSTIVRTKENVPGGENELET 541
            L R NG+ Q L E G++ N FY++++L+ RGL+V+Q IVRTKE GE + +T
Sbjct: 115  LAVARANGVAQNLLAKEFGIEGRNFFYIVKHLESRGLVVKQPAIVRTKE--VDGEGDSKT 172

Query: 542  TSIVNTNLIHLHRYAKPLSSQQRLEITKSDTL---------ESLGDVRGDTTMLLKDYVP 694
            TS ++TN+I+L RYAKPL SQQR EI K D+L +SL L+KD++P
Sbjct: 173  TSCISTNMIYLSRYAKPLGSQQRFEICKEDSLLEQEATPAGDSLQSESTKEDTLIKDFLP 232

Query: 695  AMKAICDKLEEADGKVLVVSDIKQDLGY-RDRQGHRAWRHICKRLKDAHLVEEFQAKVNK 871
            AM+AICDKLEE + KVLVVSDIKQDLGY HRAWR +C+RL D+H+VEEF A VN
Sbjct: 233  AMQAICDKLEETNEKVLVVSDIKQDLGYLGSHSRHRAWRSVCRRLTDSHVVEEFDAVVNN 292

Query: 872  KVVSCLRLLKHIGLASDDFD---TDQSLKCGKRGQMMNQYLELPIEHQIYDLVDSKGPKG 1042
            KV CLRLLK ++ DF+ Q LK G+ Q Q LELPI++QIYD+VD++G KG
Sbjct: 293  KVERCLRLLKR--FSAKDFNYSGKKQLLKFGRSIQKTEQTLELPIDNQIYDMVDAEGSKG 350

Query: 1043 LTVTEVFKHLGLNNKRNYSRLLSMCSRFGMHLEAESHNRSKQYRIWTSGNFTDNSANVLP 1222
            L V EV + LG++ K++YSRL S+C + GMHL+AESH +++ +R+WTSGN ++ P
Sbjct: 351  LAVMEVCERLGIDKKKSYSRLYSICLKVGMHLQAESHKKTRVFRVWTSGNAGSECSDRFP 410

Query: 1223 GKSGDMLSMRSV--NDIGERED 1282
            K+ + +V ND G D
Sbjct: 411  EKAENRSWENNVPINDFGTPHD 432