BLASTX nr result
ID: Coptis24_contig00006607
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00006607 (1860 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAF71990.1|AC013453_15 Hypothetical protein [Arabidopsis thal...   167   7e-39
ref|NP_173018.2| centromere protein C [Arabidopsis thaliana] gi|...   167   7e-39
ref|XP_002890116.1| CENP-C [Arabidopsis lyrata subsp. lyrata] gi...   160   1e-36
ref|XP_002298134.1| predicted protein [Populus trichocarpa] gi|2...   159   2e-36
gb|AAU04611.1| CENP-C [Arabidopsis arenosa]                           157   7e-36

>gb|AAF71990.1|AC013453_15 Hypothetical protein [Arabidopsis thaliana] Length = 710 Score = 167 bits (424), Expect = 7e-39 Identities = 160/551 (29%), Positives = 251/551 (45%), Gaps = 44/551 (7%) Frame = -3 Query: 1843 VVNLETTDMH---GASQEKEAVVDGCIDAIQ----DSVAEEKNKLDGILDELL--STDCM 1691 V+NLE ++ + Q E+ + + DS + L+ +L +LL S + + Sbjct: 192 VINLEASEKEIPIASEQSLESATAAHVTTVDREVDDSTVDTDKDLNNVLKDLLACSREEL 251 Query: 1690 EGDGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTKKS 1511 EGDGA+KLL+ERL I+ N++ F +PE VRK +++ + P RKS S N+ K + Sbjct: 252 EGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKMNLKASGSNPPN-RKSLSDIQNILKGT 310 Query: 1510 KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFSE------SDRSP--VGD 1355 ++ +V ++ +H S + P D F F + D+ P V Sbjct: 311 -NRVAVRKNSHSPSPQTI---------KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNV 360 Query: 1354 STRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNM---------VL 1202 A I N +P+ +V + ++ N S + SG+ +S I S++ V+ Sbjct: 361 QPIAKDIPNTSPT-NVGTVDVASPFNDSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVM 419 Query: 1201 DKLATEDSVHALDQVEHNSSRINDNVNLTVNGCDR---DLEDEVDGMQQPEPIGKGKSPA 1031 D ++ S V+ + +V ++ +G +R D E++ + ++ +
+ + A Sbjct: 420 DSISNRSSAMLQKNVDMRTKGKEVDVPMSESGANRNTGDRENDAEINEETDNLERLAECA 479 Query: 1030 GSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISV 851 + EDS + G+SSK N N G LE Sbjct: 480 SKEVTRPFTVEEDS---IPYQQGASSKSPNRAPEQYNTMGGSLEHAEHNQ---------- 526 Query: 850 DGMQQQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQN 671 G+ ++E V + GL ++N PE T G S+ + Sbjct: 527 -GLHEEENVNT----GSASGLQVENA--PEVHKYSHKQTNKRRKRGSSDSNVKKRSKTVH 579 Query: 670 IESGGGQTHQQPPGESL---------NVRIKQKAPKRSGRESKIVS-RRSLAGAGTKWQG 521 E+GG + + P ES N R ++K K E K+ S R+SLA AGTK +G Sbjct: 580 GETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSCRKSLAAAGTKIEG 639 Query: 520 GVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKYASPAKGK-----LKVHSFVSDD 356 GVRR+TR RPLEYWRGER +YGR+HESL +VI KYASP +GK KV SFVSD+ Sbjct: 640 GVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRDSRASKVKSFVSDE 699 Query: 355 YKDLVEKQSLY 323 YK LV+ +L+ Sbjct: 700 YKKLVDFAALH 710 >ref|NP_173018.2| centromere protein C [Arabidopsis thaliana] gi|51477443|gb|AAU04629.1| CENP-C [Arabidopsis thaliana] gi|332191225|gb|AEE29346.1| centromere protein C [Arabidopsis thaliana] Length = 705 Score = 167 bits (424), Expect = 7e-39 Identities = 160/551 (29%), Positives = 251/551 (45%), Gaps = 44/551 (7%) Frame = -3 Query: 1843 VVNLETTDMH---GASQEKEAVVDGCIDAIQ----DSVAEEKNKLDGILDELL--STDCM 1691 V+NLE ++ + Q E+ + + DS + L+ +L +LL S + + Sbjct: 187 VINLEASEKEIPIASEQSLESATAAHVTTVDREVDDSTVDTDKDLNNVLKDLLACSREEL 246 Query: 1690 EGDGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTKKS 1511 EGDGA+KLL+ERL I+ N++ F +PE VRK +++ + P RKS S N+ K + Sbjct: 247 EGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKMNLKASGSNPPN-RKSLSDIQNILKGT 305 Query: 1510 KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFSE------SDRSP--VGD 1355 ++ +V ++ +H S + P D F F + D+ P V Sbjct: 306 -NRVAVRKNSHSPSPQTI---------KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNV 355 Query: 1354 STRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNM---------VL 1202 A I N +P+ +V + ++ N S + SG+ +S I S++ V+ Sbjct: 356 
QPIAKDIPNTSPT-NVGTVDVASPFNDSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVM 414 Query: 1201 DKLATEDSVHALDQVEHNSSRINDNVNLTVNGCDR---DLEDEVDGMQQPEPIGKGKSPA 1031 D ++ S V+ + +V ++ +G +R D E++ + ++ + + + A Sbjct: 415 DSISNRSSAMLQKNVDMRTKGKEVDVPMSESGANRNTGDRENDAEINEETDNLERLAECA 474 Query: 1030 GSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISV 851 + EDS + G+SSK N N G LE Sbjct: 475 SKEVTRPFTVEEDS---IPYQQGASSKSPNRAPEQYNTMGGSLEHAEHNQ---------- 521 Query: 850 DGMQQQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQN 671 G+ ++E V + GL ++N PE T G S+ + Sbjct: 522 -GLHEEENVNT----GSASGLQVENA--PEVHKYSHKQTNKRRKRGSSDSNVKKRSKTVH 574 Query: 670 IESGGGQTHQQPPGESL---------NVRIKQKAPKRSGRESKIVS-RRSLAGAGTKWQG 521 E+GG + + P ES N R ++K K E K+ S R+SLA AGTK +G Sbjct: 575 GETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSCRKSLAAAGTKIEG 634 Query: 520 GVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKYASPAKGK-----LKVHSFVSDD 356 GVRR+TR RPLEYWRGER +YGR+HESL +VI KYASP +GK KV SFVSD+ Sbjct: 635 GVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRDSRASKVKSFVSDE 694 Query: 355 YKDLVEKQSLY 323 YK LV+ +L+ Sbjct: 695 YKKLVDFAALH 705 >ref|XP_002890116.1| CENP-C [Arabidopsis lyrata subsp. lyrata] gi|297335958|gb|EFH66375.1| CENP-C [Arabidopsis lyrata subsp. 
lyrata] Length = 708 Score = 160 bits (404), Expect = 1e-36 Identities = 155/521 (29%), Positives = 233/521 (44%), Gaps = 45/521 (8%) Frame = -3 Query: 1765 IQDSVAEEKNKLDGILDELL--STDCMEGDGAVKLLQERLHIRPVNVDNFCLPELGSVRK 1592 + DS + L+ IL ELL S D +EGD AVK L++ L I+ +NV+ F +PE VRK Sbjct: 220 VDDSTVDTDKDLNNILKELLASSRDELEGDAAVKRLEDVLQIKSLNVEKFSIPEFQDVRK 279 Query: 1591 TDVRLPLEHVPRPRKSSSYADNVTKK-SKDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQ 1415 +++ + P RKS S N+ K + G +H+ +H S Sbjct: 280 MNMKASGSN-PSNRKSLSDIQNILKGIHRVAGRKNSHSPSP-----------QTRKHFSS 327 Query: 1414 RDRPSDPFLFSE------SDRSP--VGDSTRANGIENGAPSPHVDSTSKSAEVNRSALSF 1259 + P D F F + D+ P V A I N +PS +V + ++ N S Sbjct: 328 PNPPVDQFSFPDIHNLLPGDQQPSEVDVQPLAKDIANTSPS-NVGTVDVASPFNNSVEKR 386 Query: 1258 SGKFESMIKENPTDSNM---------VLDKLATEDSVHALDQVEHNSSRINDNVNLTVNG 1106 SG+ +S I S++ V+D ++ +S V+ ++ +V ++ +G Sbjct: 387 SGEDDSHIHSGIHRSHLRPDGNADICVMDSISNRNSAMLEVNVDMRTTGKEVDVPISESG 446 Query: 1105 CDR---------DLEDEVDGMQQPEPIGKGKSPAGSTIVLDKLRTEDSSHSLDPLDGSSS 953 +R D+ +E D ++ ++ T+ D + + G+SS Sbjct: 447 ANRNTGQRENDTDINEETDHLEMLAEYASKEATRPFTVEEDSIPYQQ---------GTSS 497 Query: 952 KHTNHMDYSLNGCDG---HLEDEVSTDTYFDTNSISVDGMQQQETVGSIVAEDGCEGLTM 782 N N DG H E + N+ S G+Q+ A + T Sbjct: 498 NSPNRAPEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQENALQE---AHNSSHKQTN 554 Query: 781 KNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQNI---ESGGGQTHQQPPGESLNVR 611 K S +V VH + G P+ ESG + ++ E Sbjct: 555 KRRKRGSSDSNVK------KRSKTVHGETG-GDPQMKTLPHESGAKKQTKRKSNER---- 603 Query: 610 IKQKAPKRSG----RESKIVSRR-SLAGAGTKWQGGVRRTTRNIMRPLEYWRGERVVYGR 446 ++K PK + RE K+ SRR SLA AGTK +GGVRR+TR RPLEYW+GER +YGR Sbjct: 604 -EEKKPKNTRKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGR 662 Query: 445 VHESLPSVIAYKYASPAKGK-----LKVHSFVSDDYKDLVE 338 +HESL +VI KYASP +GK KV SFVSD+YK+LV+ Sbjct: 663 IHESLTTVIGIKYASPGEGKSDLRACKVKSFVSDEYKELVD 703 >ref|XP_002298134.1| predicted protein [Populus trichocarpa] gi|222845392|gb|EEE82939.1| predicted protein [Populus trichocarpa] Length = 746 Score = 159 bits (402), Expect 
= 2e-36 Identities = 161/572 (28%), Positives = 266/572 (46%), Gaps = 60/572 (10%) Frame = -3 Query: 1858 SQSYSVVNLETTDMHGASQEKEAVVDGCIDAIQDSVAEEKNKLDGILDELLSTDC--MEG 1685 SQ+ ++ +E+T++ A +E E + + Q S+A+ + ++D +LDELL+ DC ++G Sbjct: 219 SQAVALQLMESTNV--ALEESELAGEWLLVQEQASMAKAEKRVDKLLDELLACDCEELDG 276 Query: 1684 DGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTK---- 1517 DGAV LLQ+RL ++ ++++ LPEL V++T++ ++P+PR S+ N+ + Sbjct: 277 DGAVTLLQDRLQVKSLDIEKLNLPELLYVQRTNLNALGGNLPKPRNVLSHIHNLPRRTLT 336 Query: 1516 --KSKDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFS---ESDRSPVGDS 1352 K + G+ + ++L +H+ Q + P++P L S E D + G+S Sbjct: 337 PMKQQIAGNSTSSFGSPAPPKSQLASLALLRKHILQSNPPTNPVLKSLIIEEDDTTAGNS 396 Query: 1351 TRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNMVLDKLATEDSVH 1172 + + A + ++ S ++V S S + +SN+ +D T + + Sbjct: 397 SPTE-VAVKALNDNLTSLGSGSDVRPSKSSAEVE----------NSNVGVDNGITYEYLS 445 Query: 1171 ALD---QVEHNSSRINDNVNLTVNGCDRDLEDEVDGMQQPEPIGKGKSPAGSTIVLDKLR 1001 L V+ N ++++LT G +ED + + + S GS ++ + Sbjct: 446 QLGGDADVQTNGPNELEDMDLTSRGSAMQVED-IQQKAVDKSLNGNLSSLGSGSIVCPSK 504 Query: 1000 T----EDSSHSLDP--LDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISVDGMQ 839 T E+S+ +D +D +SS +D N + LED DT ++ Sbjct: 505 TSAEVENSNIGVDDGVIDENSSLRGGDVDIQTNRRN-ELEDMPE-----DTAMEYLNPRD 558 Query: 838 QQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQNIESG 659 Q E + + ED +D +T D D E P+ N E Sbjct: 559 QFEQLSAAFVEDHA--------------MDSCPETQDRDLE-----QTKANTPKHNNERV 599 Query: 658 GGQTHQQPPGESLNVRIKQKAPKRSGRESKIVSRR-SLAGA------------------- 539 ++PP S N + K+K+ GR+ + +SRR SLAGA Sbjct: 600 -----EKPPVVSTNKQTKEKSCTAKGRKYRSLSRRQSLAGAHCCVKLSIIWLFFSLLFLS 654 Query: 538 ----------------GTKWQGGVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKY 407 GT W+ GVRR+TR RPLEYW+GER +YGR+H SL +VI KY Sbjct: 655 QSYCVDNLIYVVSIASGTSWETGVRRSTRIRSRPLEYWKGERFLYGRIHGSLATVIGIKY 714 Query: 406 ASPA--KGK--LKVHSFVSDDYKDLVEKQSLY 323 SP KGK LKV SFVSD+YK+LVE +L+ Sbjct: 715 ESPQNDKGKPALKVKSFVSDEYKNLVELAALH 746 >gb|AAU04611.1| CENP-C [Arabidopsis arenosa] 
Length = 710 Score = 157 bits (398), Expect = 7e-36 Identities = 150/512 (29%), Positives = 234/512 (45%), Gaps = 36/512 (7%) Frame = -3 Query: 1765 IQDSVAEEKNKLDGILDELL--STDCMEGDGAVKLLQERLHIRPVNVDNFCLPELGSVRK 1592 + DS + L+ IL +LL S D +EGD AVKLL++ L I +NV+ F +PE VRK Sbjct: 222 VDDSTVDTDKDLNNILKKLLASSRDELEGDAAVKLLEDHLQIESLNVEKFSIPEFQDVRK 281 Query: 1591 TDVRLPLEHVPRPRKSSSYADNVTKKS-KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQ 1415 +++ + P RKS S N+ K + + G +H+ +H S Sbjct: 282 MNLKASGSN-PSNRKSLSDIQNILKGTHRVAGRKNSHSPSP-----------QTRKHFSS 329 Query: 1414 RDRPSDPFLFSE------SDRSP--VGDSTRANGIENGAPS---------PHVDSTSKSA 1286 + P D F F + D+ P V A I N +PS P +S K + Sbjct: 330 PNPPVDQFSFPDIHNLLPGDQQPSEVDVQPLAKDIANTSPSNVGTVDVASPFNNSVEKRS 389 Query: 1285 EVNRSALSFSGKFESMIKENPTDSNMVLDKLATEDSVHALDQVEHNSSRINDNVNLTVNG 1106 + + S + SG S ++ + V+D ++ +S V+ ++ +V ++ +G Sbjct: 390 DEDDSHIH-SGIHRSHLRPDGNVDICVMDSISNRNSAMLEVNVDMRTTGKEVDVPMSESG 448 Query: 1105 CDRDL---EDEVDGMQQPEPIGKGKSPAGSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHM 935 +R+ E+++D ++ + A EDS + G+SS N Sbjct: 449 ANRNTGQRENDIDINEETGHLEMLAEYASKEATRPFTVEEDS---IPYQQGTSSNSPNRA 505 Query: 934 DYSLNGCDG---HLEDEVSTDTYFDTNSISVDGMQQQETVGSIVAEDGCEGLTMKNVHGP 764 N DG H E + N+ S G+Q+ + + + ++ Sbjct: 506 PEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQEN-ALQEVHNSSHKQTNKLRKRGSS 564 Query: 763 ESQLDVSGDTIDADNEGHVHSDNAVGIPEQNIESGGGQTHQQPPGESLNVRIKQKAPKRS 584 +S + T+ + G +P ESG + ++ E ++K PK + Sbjct: 565 DSNVKKRSKTVHGETGGDPQMKT---LPH---ESGVKKQTKRKSNER-----EEKKPKNT 613 Query: 583 G----RESKIVSRR-SLAGAGTKWQGGVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVI 419 RE K+ SRR SLA AGTK +GGVRR+TR RPLEYW+GER +YGR+HESL +VI Sbjct: 614 RKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGRIHESLTTVI 673 Query: 418 AYKYASPAKGK-----LKVHSFVSDDYKDLVE 338 KYASP +GK KV SFVSD+YK+LV+ Sbjct: 674 GIKYASPGEGKSDVRACKVKSFVSDEYKELVD 705