BLASTX nr result
ID: Coptis24_contig00008000
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00008000
         (1763 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   701   0.0
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   692   0.0
ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776...   674   0.0
ref|XP_002302270.1| predicted protein [Populus trichocarpa] gi|2...   672   0.0
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   668   0.0

>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
 gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
          Length = 478

 Score =  701 bits (1809), Expect = 0.0
 Identities = 348/478 (72%), Positives = 398/478 (83%), Gaps = 2/478 (0%)
 Frame = -3

Query: 1758 MKRKSAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1579
            M +KS+ C++CE SN ASIC VCV+ RL EY   LKSL++ RD  YSRL+  LVAK KAD
Sbjct: 1    MNKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKAD 60

Query: 1578 DQLSWRVHQNQKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1399
            DQL+WRVHQN+KL+ LREKL  +++QL Q KAK +KMS DL  +YGLLES+   L+KNRV
Sbjct: 61   DQLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRV 120

Query: 1398 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1219
            +QLEK++PNLICTQSLGHMAITSE LH  SV +KQICKLFPQRRV+V+G++KDGS+GQYD
Sbjct: 121  DQLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYD 180

Query: 1218 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVRALAAPVLHNSGFAGSCSRIWQRD 1039
            QICNARLPRGLDPHS+PSEELAASLGYMVQLLNLVV  LAAP LHNSGFAGSCSRIWQRD
Sbjct: 181  QICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240

Query: 1038 SYWDARPSSRSNEFPLFIPRQNSCTRGGENSWSDRSSSSFGVASMESEKKPTLDS--TGS 865
            SYW+ARPSSRSNE+PLFIPRQ  C+  GENSW+DRSSS+FGVASMESE++  LDS  + S
Sbjct: 241  SYWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSS 300

Query: 864  FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 685
            FN NSASPHSVETHKDLQKGISL+KKSVAC+TAY YN LCLD P+EASTFEAFAKLL TL
Sbjct: 301  FNYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATL 360

Query: 684  SSSKEVRSVFSLKIASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRQNAGDNNL 505
            SSSKEVRSVFSLK+A SRSCKQ QKLNK                      L +N  DNNL
Sbjct: 361  SSSKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNL 420

Query: 504  SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 331
             NSA SFL+ NE+SD GK+ESL++GWDLVEHPTFPPPPSQ+ED+EHWTRAMF+DATK+
Sbjct: 421  RNSATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478

>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
 gi|302141899|emb|CBI19102.3| unnamed protein product [Vitis vinifera]
          Length = 478

 Score =  692 bits (1785), Expect = 0.0
 Identities = 348/478 (72%), Positives = 396/478 (82%), Gaps = 2/478 (0%)
 Frame = -3

Query: 1758 MKRKSAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1579
            M RK++ CS+CEKSNLASICAVCV+ RL EY  SLKS +  RDS Y RL+  LVAK KAD
Sbjct: 1    MTRKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKAD 60

Query: 1578 DQLSWRVHQNQKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1399
            DQ++WRV QN+KL++LREKL   ++Q   GKAK++KMS DLK +YGLLESAM  L+KNRV
Sbjct: 61   DQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRV 120

Query: 1398 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1219
            EQLEKFYPNLICTQ+LG MAITSER HKQSVVIKQICKLFPQRRV +DG++KDGS+  YD
Sbjct: 121  EQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYD 180

Query: 1218 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVRALAAPVLHNSGFAGSCSRIWQRD 1039
            QICN RLPR LDPHSVPS+ELAASLGYMVQLLNLVV  LAAP LHNSGFAGSCSRIWQR+
Sbjct: 181  QICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRE 240

Query: 1038 SYWDARPSSRSNEFPLFIPRQNSCTRGGENSWSDRSSSSFGVASMESEKKPTLDSTG--S 865
            SYW+ RPSSRSNE+PLFIPRQN C+  GENSWS+RSSS+FG+ASMES++KP L+S+G  S
Sbjct: 241  SYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSS 300

Query: 864  FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 685
            FN +SAS HSVETHKDLQKGISLLKKSVAC+T YCY+SLCLD P+EASTFEAFAKLL  L
Sbjct: 301  FNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAIL 360

Query: 684  SSSKEVRSVFSLKIASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRQNAGDNNL 505
            SSSKEVRSVFSLK+A SRSCKQ Q+LNK                      + +N  DNNL
Sbjct: 361  SSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNL 420

Query: 504  SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 331
             NSA SFLYT EMSD+GK+ESL+E WDLVEH  FPPPPSQ+EDIEHWTRAM +DATK+
Sbjct: 421  PNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478

>ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776426 isoform 2 [Glycine max]
          Length = 477

 Score =  674 bits (1740), Expect = 0.0
 Identities = 333/475 (70%), Positives = 394/475 (82%), Gaps = 2/475 (0%)
 Frame = -3

Query: 1758 MKRKSAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1579
            M RK++ C++CE SN ASIC++CV+ RL EY  SLK L++ RDS Y +L+  LV K K D
Sbjct: 1    MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60

Query: 1578 DQLSWRVHQNQKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1399
            DQ +WRV Q++KL++L+EKL   ++Q++QG+AK++ MS DLK +YGLLESA+ TL+KNRV
Sbjct: 61   DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120

Query: 1398 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1219
            EQLEKFYPNLICTQSLGH+AITSE LHK+SVVIKQICKLFPQRRVV++G+R+DG +GQYD
Sbjct: 121  EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180

Query: 1218 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVRALAAPVLHNSGFAGSCSRIWQRD 1039
            QICNARLPR LDPHSVPSEEL+ SLGYMVQLLNLV+  LAAP LHNSGFAGSCSRIWQRD
Sbjct: 181  QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240

Query: 1038 SYWDARPSSRSNEFPLFIPRQNSCTRGGENSWSDRSSSSFGVASMESEKKPTLDSTG--S 865
            SYWDARPSSRSNE+PLFIPRQN C+  GENSWS+RSSS+FGVAS+ESE++  LDS+G  S
Sbjct: 241  SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300

Query: 864  FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 685
            FN + AS HSV+THKDLQKGISLLKKSV CITAYCYNSLCLD PSEASTFEAFAKLL TL
Sbjct: 301  FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360

Query: 684  SSSKEVRSVFSLKIASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRQNAGDNNL 505
            +SSKEVRSVFSLK+A SR+CKQ Q+LNK                        +N  +N L
Sbjct: 361  ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTKNLIENYL 420

Query: 504  SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDA 340
             +S  SFLY  ++SD GK+E L+EGWD+VEHPTFPPPPSQSED+EHWTRAMF+DA
Sbjct: 421  PSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDA 474

>ref|XP_002302270.1| predicted protein [Populus trichocarpa]
 gi|222843996|gb|EEE81543.1| predicted protein [Populus trichocarpa]
          Length = 475

 Score =  672 bits (1735), Expect = 0.0
 Identities = 340/478 (71%), Positives = 389/478 (81%), Gaps = 2/478 (0%)
 Frame = -3

Query: 1758 MKRKSAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1579
            M +KS+ C++CE SN ASIC +CV+ RL EY   LKSL + RDS YS+L+  L+AK KAD
Sbjct: 1    MNKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKAD 60

Query: 1578 DQLSWRVHQNQKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1399
            DQ +WRV QN+KL+  REKL   ++QL+QGKAK++K+S DLK + G+LESA   L+KNR+
Sbjct: 61   DQFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRM 120

Query: 1398 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1219
            EQLEKFYPNLICTQSLGHMAITSE LHKQSVVIKQICKLFPQRRV VDG+R    +GQYD
Sbjct: 121  EQLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYD 178

Query: 1218 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVRALAAPVLHNSGFAGSCSRIWQRD 1039
            QICNARLPRGLDPHSV SEELAASLGYMVQLLNLV   LAAP LHN+GFAGSCSRIWQRD
Sbjct: 179  QICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRD 238

Query: 1038 SYWDARPSSRSNEFPLFIPRQNSCTRGGENSWSDRSSSSFGVASMESEKKPTLDST--GS 865
            SYW+A PSSRSNE+PLFIPRQN C+   ENSW+D+SSS+FGVASMESE++P LDST   S
Sbjct: 239  SYWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNS 298

Query: 864  FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 685
            FN +S SPHSVETHKDLQKG+SLLKKSVAC+TAYCYN LCLD PS+ STFEAFAKLL TL
Sbjct: 299  FNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTL 358

Query: 684  SSSKEVRSVFSLKIASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRQNAGDNNL 505
            SSSKEVRSVF+LK+A SRSCKQ QKLNK                      L +N  DNNL
Sbjct: 359  SSSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNL 418

Query: 504  SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 331
             NSA SFL+   +SD GK+ES ++GWDLVEHPTFPPPPSQ EDIEHWTRAMF+DATK+
Sbjct: 419  PNSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475

>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoform 1 [Glycine max]
          Length = 475

 Score =  668 bits (1723), Expect = 0.0
 Identities = 332/475 (69%), Positives = 393/475 (82%), Gaps = 2/475 (0%)
 Frame = -3

Query: 1758 MKRKSAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1579
            M RK++ C++CE SN ASIC++CV+ RL EY  SLK L++ RDS Y +L+  LV K K D
Sbjct: 1    MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60

Query: 1578 DQLSWRVHQNQKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1399
            DQ +WRV Q++KL++L+EKL   ++Q++QG+AK++ MS DLK +YGLLESA+ TL+KNRV
Sbjct: 61   DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120

Query: 1398 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1219
            EQLEKFYPNLICTQSLGH+AITSE LHK+SVVIKQICKLFPQRRVV++G+R+DG +GQYD
Sbjct: 121  EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180

Query: 1218 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVRALAAPVLHNSGFAGSCSRIWQRD 1039
            QICNARLPR LDPHSVPSEEL+ SLGYMVQLLNLV+  LAAP LHNSGFAGSCSRIWQRD
Sbjct: 181  QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240

Query: 1038 SYWDARPSSRSNEFPLFIPRQNSCTRGGENSWSDRSSSSFGVASMESEKKPTLDSTG--S 865
            SYWDARPSSRSNE+PLFIPRQN C+  GENSWS+RSSS+FGVAS+ESE++  LDS+G  S
Sbjct: 241  SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300

Query: 864  FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 685
            FN + AS HSV+THKDLQKGISLLKKSV CITAYCYNSLCLD PSEASTFEAFAKLL TL
Sbjct: 301  FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360

Query: 684  SSSKEVRSVFSLKIASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRQNAGDNNL 505
            +SSKEVRSVFSLK+A SR+CKQ Q+LNK                        +   +N L
Sbjct: 361  ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTRI--ENYL 418

Query: 504  SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDA 340
             +S  SFLY  ++SD GK+E L+EGWD+VEHPTFPPPPSQSED+EHWTRAMF+DA
Sbjct: 419  PSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDA 472