BLASTX nr result
ID: Coptis21_contig00012554
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00012554 (1775 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri... 699 0.0 ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260... 692 0.0 ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776... 673 0.0 ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776... 671 0.0 ref|XP_002302270.1| predicted protein [Populus trichocarpa] gi|2... 671 0.0 >ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis] gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase II, putative [Ricinus communis] Length = 478 Score = 699 bits (1804), Expect = 0.0 Identities = 348/478 (72%), Positives = 398/478 (83%), Gaps = 3/478 (0%) Frame = -1 Query: 1751 MKRKNAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1572 M +K++ C++CE SN ASIC VCV+ RL EY LKSL++ RD YSRL+ LVAK KAD Sbjct: 1 MNKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKAD 60 Query: 1571 DQLSWRVHQNEKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1392 DQL+WRVHQNEKL+ LREKL +++QL Q KAK +KMS DL +YGLLES+ L+KNRV Sbjct: 61 DQLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRV 120 Query: 1391 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1212 +QLEK++PNLICTQSLGHMAITSE LH SV +KQICKLFPQRRV+V+G++KDGS+GQYD Sbjct: 121 DQLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYD 180 Query: 1211 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVKALAAPVLHNSGFAGSCSRIWQRD 1032 QICNARLPRGLDPHS+PSEELAASLGYMVQLLNLVV LAAP LHNSGFAGSCSRIWQRD Sbjct: 181 QICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 1031 SYWDARPSSRSNEFPLFIPRQNSCTSGGENSWSDRSSSSFGVASMESEKKPSLDS--TGS 858 SYW+ARPSSRSNE+PLFIPRQ C++ GENSW+DRSSS+FGVASMESE++ LDS + S Sbjct: 241 SYWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSS 300 Query: 857 FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 678 FN NSASPHSVETHKDLQKGISL+KKSVAC+TAY YN LCLD P+EASTFEAFAKLL TL Sbjct: 301 FNYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATL 360 Query: 677 SSSKEVRSVFSLKMASSRSCKQAQKLNK-XXXXXXXXXXXXXXXXXXXXXXLRNAGDNNL 501 SSSKEVRSVFSLKMA SRSCKQ QKLNK +N DNNL Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNL 420 Query: 500 SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 327 NSA SFL+ NE+SD GK+ESL++GWDLVEHPTFPPPPSQ+ED+EHWTRAMF+DATK+ Sbjct: 421 RNSATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478 >ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera] gi|302141899|emb|CBI19102.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 692 bits (1786), Expect = 0.0 Identities = 351/478 (73%), Positives = 395/478 (82%), Gaps = 3/478 (0%) Frame = -1 Query: 1751 MKRKNAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1572 M RK + CS+CEKSNLASICAVCV+ RL EY SLKS + RDS Y RL+ LVAK KAD Sbjct: 1 MTRKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKAD 60 Query: 1571 DQLSWRVHQNEKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1392 DQ++WRV QNEKL++LREKL ++Q GKAK++KMS DLK +YGLLESAM L+KNRV Sbjct: 61 DQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRV 120 Query: 1391 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1212 EQLEKFYPNLICTQ+LG MAITSER HKQSVVIKQICKLFPQRRV +DG++KDGS+ YD Sbjct: 121 EQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYD 180 Query: 1211 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVKALAAPVLHNSGFAGSCSRIWQRD 1032 QICN RLPR LDPHSVPS+ELAASLGYMVQLLNLVV LAAP LHNSGFAGSCSRIWQR+ Sbjct: 181 QICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRE 240 Query: 1031 SYWDARPSSRSNEFPLFIPRQNSCTSGGENSWSDRSSSSFGVASMESEKKPSLDSTG--S 858 SYW+ RPSSRSNE+PLFIPRQN C++ GENSWS+RSSS+FG+ASMES++KP L+S+G S Sbjct: 241 SYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSS 300 Query: 857 FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 678 FN +SAS HSVETHKDLQKGISLLKKSVAC+T YCY+SLCLD P+EASTFEAFAKLL L Sbjct: 301 FNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAIL 360 Query: 677 SSSKEVRSVFSLKMASSRSCKQAQKLNK-XXXXXXXXXXXXXXXXXXXXXXLRNAGDNNL 501 SSSKEVRSVFSLKMA SRSCKQ Q+LNK RN DNNL Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNL 420 Query: 500 SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 327 NSA SFLYT EMSD+GK+ESL+E WDLVEH FPPPPSQ+EDIEHWTRAM +DATK+ Sbjct: 421 PNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478 >ref|XP_003544778.1| PREDICTED: uncharacterized protein LOC100776426 isoform 2 [Glycine max] Length = 477 Score = 673 bits (1737), Expect = 0.0 Identities = 335/475 (70%), Positives = 394/475 (82%), Gaps = 3/475 (0%) Frame = -1 Query: 1751 MKRKNAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1572 M RK + C++CE SN ASIC++CV+ RL EY SLK L++ RDS Y +L+ LV K K D Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60 Query: 1571 DQLSWRVHQNEKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1392 DQ +WRV Q+EKL++L+EKL +++Q++QG+AK++ MS DLK +YGLLESA+ TL+KNRV Sbjct: 61 DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120 Query: 1391 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1212 EQLEKFYPNLICTQSLGH+AITSE LHK+SVVIKQICKLFPQRRVV++G+R+DG +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180 Query: 1211 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVKALAAPVLHNSGFAGSCSRIWQRD 1032 QICNARLPR LDPHSVPSEEL+ SLGYMVQLLNLV+ LAAP LHNSGFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 1031 SYWDARPSSRSNEFPLFIPRQNSCTSGGENSWSDRSSSSFGVASMESEKKPSLDSTG--S 858 SYWDARPSSRSNE+PLFIPRQN C++ GENSWS+RSSS+FGVAS+ESE++ LDS+G S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300 Query: 857 FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 678 FN + AS HSV+THKDLQKGISLLKKSV CITAYCYNSLCLD PSEASTFEAFAKLL TL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 677 SSSKEVRSVFSLKMASSRSCKQAQKLNK-XXXXXXXXXXXXXXXXXXXXXXLRNAGDNNL 501 +SSKEVRSVFSLKMA SR+CKQ Q+LNK +N +N L Sbjct: 361 ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTKNLIENYL 420 Query: 500 SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDA 336 +S SFLY ++SD GK+E L+EGWD+VEHPTFPPPPSQSED+EHWTRAMF+DA Sbjct: 421 PSSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDA 474 >ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoform 1 [Glycine max] Length = 475 Score = 671 bits (1732), Expect = 0.0 Identities = 334/474 (70%), Positives = 392/474 (82%), Gaps = 2/474 (0%) Frame = -1 Query: 1751 MKRKNAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1572 M RK + C++CE SN ASIC++CV+ RL EY SLK L++ RDS Y +L+ LV K K D Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60 Query: 1571 DQLSWRVHQNEKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1392 DQ +WRV Q+EKL++L+EKL +++Q++QG+AK++ MS DLK +YGLLESA+ TL+KNRV Sbjct: 61 DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120 Query: 1391 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1212 EQLEKFYPNLICTQSLGH+AITSE LHK+SVVIKQICKLFPQRRVV++G+R+DG +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180 Query: 1211 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVKALAAPVLHNSGFAGSCSRIWQRD 1032 QICNARLPR LDPHSVPSEEL+ SLGYMVQLLNLV+ LAAP LHNSGFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 1031 SYWDARPSSRSNEFPLFIPRQNSCTSGGENSWSDRSSSSFGVASMESEKKPSLDSTG--S 858 SYWDARPSSRSNE+PLFIPRQN C++ GENSWS+RSSS+FGVAS+ESE++ LDS+G S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300 Query: 857 FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 678 FN + AS HSV+THKDLQKGISLLKKSV CITAYCYNSLCLD PSEASTFEAFAKLL TL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 677 SSSKEVRSVFSLKMASSRSCKQAQKLNKXXXXXXXXXXXXXXXXXXXXXXLRNAGDNNLS 498 +SSKEVRSVFSLKMA SR+CKQ Q+LNK +N L Sbjct: 361 ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTRI-ENYLP 419 Query: 497 NSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDA 336 +S SFLY ++SD GK+E L+EGWD+VEHPTFPPPPSQSED+EHWTRAMF+DA Sbjct: 420 SSTGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDA 472 >ref|XP_002302270.1| predicted protein [Populus trichocarpa] gi|222843996|gb|EEE81543.1| predicted protein [Populus trichocarpa] Length = 475 Score = 671 bits (1731), Expect = 0.0 Identities = 340/478 (71%), Positives = 390/478 (81%), Gaps = 3/478 (0%) Frame = -1 Query: 1751 MKRKNAKCSLCEKSNLASICAVCVDSRLIEYKNSLKSLRNVRDSCYSRLTAQLVAKSKAD 1572 M +K++ C++CE SN ASIC +CV+ RL EY LKSL + RDS YS+L+ L+AK KAD Sbjct: 1 MNKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKAD 60 Query: 1571 DQLSWRVHQNEKLSKLREKLNLTEKQLSQGKAKLDKMSIDLKFRYGLLESAMLTLKKNRV 1392 DQ +WRV QNEKL+ REKL+ ++QL+QGKAK++K+S DLK + G+LESA L+KNR+ Sbjct: 61 DQFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRM 120 Query: 1391 EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVVVDGDRKDGSNGQYD 1212 EQLEKFYPNLICTQSLGHMAITSE LHKQSVVIKQICKLFPQRRV VDG+R +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYD 178 Query: 1211 QICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVVKALAAPVLHNSGFAGSCSRIWQRD 1032 QICNARLPRGLDPHSV SEELAASLGYMVQLLNLV LAAP LHN+GFAGSCSRIWQRD Sbjct: 179 QICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRD 238 Query: 1031 SYWDARPSSRSNEFPLFIPRQNSCTSGGENSWSDRSSSSFGVASMESEKKPSLDST--GS 858 SYW+A PSSRSNE+PLFIPRQN C++ ENSW+D+SSS+FGVASMESE++P LDST S Sbjct: 239 SYWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNS 298 Query: 857 FNCNSASPHSVETHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFEAFAKLLGTL 678 FN +S SPHSVETHKDLQKG+SLLKKSVAC+TAYCYN LCLD PS+ STFEAFAKLL TL Sbjct: 299 FNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTL 358 Query: 677 SSSKEVRSVFSLKMASSRSCKQAQKLNK-XXXXXXXXXXXXXXXXXXXXXXLRNAGDNNL 501 SSSKEVRSVF+LKMA SRSCKQ QKLNK ++N DNNL Sbjct: 359 SSSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNL 418 Query: 500 SNSAVSFLYTNEMSDVGKSESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFVDATKR 327 NSA SFL+ +SD GK+ES ++GWDLVEHPTFPPPPSQ EDIEHWTRAMF+DATK+ Sbjct: 419 PNSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475