BLASTX nr result

ID: Coptis24_contig00006607

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00006607
         (1860 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAF71990.1|AC013453_15 Hypothetical protein [Arabidopsis thal...   167   7e-39
ref|NP_173018.2| centromere protein C [Arabidopsis thaliana] gi|...   167   7e-39
ref|XP_002890116.1| CENP-C [Arabidopsis lyrata subsp. lyrata] gi...   160   1e-36
ref|XP_002298134.1| predicted protein [Populus trichocarpa] gi|2...   159   2e-36
gb|AAU04611.1| CENP-C [Arabidopsis arenosa]                           157   7e-36

>gb|AAF71990.1|AC013453_15 Hypothetical protein [Arabidopsis thaliana]
          Length = 710

 Score =  167 bits (424), Expect = 7e-39
 Identities = 160/551 (29%), Positives = 251/551 (45%), Gaps = 44/551 (7%)
 Frame = -3

Query: 1843 VVNLETTDMH---GASQEKEAVVDGCIDAIQ----DSVAEEKNKLDGILDELL--STDCM 1691
            V+NLE ++      + Q  E+     +  +     DS  +    L+ +L +LL  S + +
Sbjct: 192  VINLEASEKEIPIASEQSLESATAAHVTTVDREVDDSTVDTDKDLNNVLKDLLACSREEL 251

Query: 1690 EGDGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTKKS 1511
            EGDGA+KLL+ERL I+  N++ F +PE   VRK +++    + P  RKS S   N+ K +
Sbjct: 252  EGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKMNLKASGSNPPN-RKSLSDIQNILKGT 310

Query: 1510 KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFSE------SDRSP--VGD 1355
             ++ +V  ++                 +H S  + P D F F +       D+ P  V  
Sbjct: 311  -NRVAVRKNSHSPSPQTI---------KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNV 360

Query: 1354 STRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNM---------VL 1202
               A  I N +P+ +V +   ++  N S +  SG+ +S I      S++         V+
Sbjct: 361  QPIAKDIPNTSPT-NVGTVDVASPFNDSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVM 419

Query: 1201 DKLATEDSVHALDQVEHNSSRINDNVNLTVNGCDR---DLEDEVDGMQQPEPIGKGKSPA 1031
            D ++   S      V+  +     +V ++ +G +R   D E++ +  ++ + + +    A
Sbjct: 420  DSISNRSSAMLQKNVDMRTKGKEVDVPMSESGANRNTGDRENDAEINEETDNLERLAECA 479

Query: 1030 GSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISV 851
               +       EDS   +    G+SSK  N      N   G LE                
Sbjct: 480  SKEVTRPFTVEEDS---IPYQQGASSKSPNRAPEQYNTMGGSLEHAEHNQ---------- 526

Query: 850  DGMQQQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQN 671
             G+ ++E V +        GL ++N   PE        T      G   S+        +
Sbjct: 527  -GLHEEENVNT----GSASGLQVENA--PEVHKYSHKQTNKRRKRGSSDSNVKKRSKTVH 579

Query: 670  IESGGGQTHQQPPGESL---------NVRIKQKAPKRSGRESKIVS-RRSLAGAGTKWQG 521
             E+GG +  +  P ES          N R ++K  K    E K+ S R+SLA AGTK +G
Sbjct: 580  GETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSCRKSLAAAGTKIEG 639

Query: 520  GVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKYASPAKGK-----LKVHSFVSDD 356
            GVRR+TR   RPLEYWRGER +YGR+HESL +VI  KYASP +GK      KV SFVSD+
Sbjct: 640  GVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRDSRASKVKSFVSDE 699

Query: 355  YKDLVEKQSLY 323
            YK LV+  +L+
Sbjct: 700  YKKLVDFAALH 710


>ref|NP_173018.2| centromere protein C [Arabidopsis thaliana]
            gi|51477443|gb|AAU04629.1| CENP-C [Arabidopsis thaliana]
            gi|332191225|gb|AEE29346.1| centromere protein C
            [Arabidopsis thaliana]
          Length = 705

 Score =  167 bits (424), Expect = 7e-39
 Identities = 160/551 (29%), Positives = 251/551 (45%), Gaps = 44/551 (7%)
 Frame = -3

Query: 1843 VVNLETTDMH---GASQEKEAVVDGCIDAIQ----DSVAEEKNKLDGILDELL--STDCM 1691
            V+NLE ++      + Q  E+     +  +     DS  +    L+ +L +LL  S + +
Sbjct: 187  VINLEASEKEIPIASEQSLESATAAHVTTVDREVDDSTVDTDKDLNNVLKDLLACSREEL 246

Query: 1690 EGDGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTKKS 1511
            EGDGA+KLL+ERL I+  N++ F +PE   VRK +++    + P  RKS S   N+ K +
Sbjct: 247  EGDGAIKLLEERLQIKSFNIEKFSIPEFQDVRKMNLKASGSNPPN-RKSLSDIQNILKGT 305

Query: 1510 KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFSE------SDRSP--VGD 1355
             ++ +V  ++                 +H S  + P D F F +       D+ P  V  
Sbjct: 306  -NRVAVRKNSHSPSPQTI---------KHFSSPNPPVDQFSFPDIHNLLPGDQQPSEVNV 355

Query: 1354 STRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNM---------VL 1202
               A  I N +P+ +V +   ++  N S +  SG+ +S I      S++         V+
Sbjct: 356  QPIAKDIPNTSPT-NVGTVDVASPFNDSVVKRSGEDDSHIHSGIHRSHLSRDGNPDICVM 414

Query: 1201 DKLATEDSVHALDQVEHNSSRINDNVNLTVNGCDR---DLEDEVDGMQQPEPIGKGKSPA 1031
            D ++   S      V+  +     +V ++ +G +R   D E++ +  ++ + + +    A
Sbjct: 415  DSISNRSSAMLQKNVDMRTKGKEVDVPMSESGANRNTGDRENDAEINEETDNLERLAECA 474

Query: 1030 GSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISV 851
               +       EDS   +    G+SSK  N      N   G LE                
Sbjct: 475  SKEVTRPFTVEEDS---IPYQQGASSKSPNRAPEQYNTMGGSLEHAEHNQ---------- 521

Query: 850  DGMQQQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQN 671
             G+ ++E V +        GL ++N   PE        T      G   S+        +
Sbjct: 522  -GLHEEENVNT----GSASGLQVENA--PEVHKYSHKQTNKRRKRGSSDSNVKKRSKTVH 574

Query: 670  IESGGGQTHQQPPGESL---------NVRIKQKAPKRSGRESKIVS-RRSLAGAGTKWQG 521
             E+GG +  +  P ES          N R ++K  K    E K+ S R+SLA AGTK +G
Sbjct: 575  GETGGDKQMKTLPHESRAKKQTKGKSNEREEKKPKKTLTHEGKLFSCRKSLAAAGTKIEG 634

Query: 520  GVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKYASPAKGK-----LKVHSFVSDD 356
            GVRR+TR   RPLEYWRGER +YGR+HESL +VI  KYASP +GK      KV SFVSD+
Sbjct: 635  GVRRSTRIKSRPLEYWRGERFLYGRIHESLTTVIGIKYASPGEGKRDSRASKVKSFVSDE 694

Query: 355  YKDLVEKQSLY 323
            YK LV+  +L+
Sbjct: 695  YKKLVDFAALH 705


>ref|XP_002890116.1| CENP-C [Arabidopsis lyrata subsp. lyrata] gi|297335958|gb|EFH66375.1|
            CENP-C [Arabidopsis lyrata subsp. lyrata]
          Length = 708

 Score =  160 bits (404), Expect = 1e-36
 Identities = 155/521 (29%), Positives = 233/521 (44%), Gaps = 45/521 (8%)
 Frame = -3

Query: 1765 IQDSVAEEKNKLDGILDELL--STDCMEGDGAVKLLQERLHIRPVNVDNFCLPELGSVRK 1592
            + DS  +    L+ IL ELL  S D +EGD AVK L++ L I+ +NV+ F +PE   VRK
Sbjct: 220  VDDSTVDTDKDLNNILKELLASSRDELEGDAAVKRLEDVLQIKSLNVEKFSIPEFQDVRK 279

Query: 1591 TDVRLPLEHVPRPRKSSSYADNVTKK-SKDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQ 1415
             +++    + P  RKS S   N+ K   +  G   +H+                 +H S 
Sbjct: 280  MNMKASGSN-PSNRKSLSDIQNILKGIHRVAGRKNSHSPSP-----------QTRKHFSS 327

Query: 1414 RDRPSDPFLFSE------SDRSP--VGDSTRANGIENGAPSPHVDSTSKSAEVNRSALSF 1259
             + P D F F +       D+ P  V     A  I N +PS +V +   ++  N S    
Sbjct: 328  PNPPVDQFSFPDIHNLLPGDQQPSEVDVQPLAKDIANTSPS-NVGTVDVASPFNNSVEKR 386

Query: 1258 SGKFESMIKENPTDSNM---------VLDKLATEDSVHALDQVEHNSSRINDNVNLTVNG 1106
            SG+ +S I      S++         V+D ++  +S      V+  ++    +V ++ +G
Sbjct: 387  SGEDDSHIHSGIHRSHLRPDGNADICVMDSISNRNSAMLEVNVDMRTTGKEVDVPISESG 446

Query: 1105 CDR---------DLEDEVDGMQQPEPIGKGKSPAGSTIVLDKLRTEDSSHSLDPLDGSSS 953
             +R         D+ +E D ++        ++    T+  D +  +          G+SS
Sbjct: 447  ANRNTGQRENDTDINEETDHLEMLAEYASKEATRPFTVEEDSIPYQQ---------GTSS 497

Query: 952  KHTNHMDYSLNGCDG---HLEDEVSTDTYFDTNSISVDGMQQQETVGSIVAEDGCEGLTM 782
               N      N  DG   H E         + N+ S  G+Q+        A +     T 
Sbjct: 498  NSPNRAPEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQENALQE---AHNSSHKQTN 554

Query: 781  KNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQNI---ESGGGQTHQQPPGESLNVR 611
            K      S  +V            VH +   G P+      ESG  +  ++   E     
Sbjct: 555  KRRKRGSSDSNVK------KRSKTVHGETG-GDPQMKTLPHESGAKKQTKRKSNER---- 603

Query: 610  IKQKAPKRSG----RESKIVSRR-SLAGAGTKWQGGVRRTTRNIMRPLEYWRGERVVYGR 446
             ++K PK +     RE K+ SRR SLA AGTK +GGVRR+TR   RPLEYW+GER +YGR
Sbjct: 604  -EEKKPKNTRKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGR 662

Query: 445  VHESLPSVIAYKYASPAKGK-----LKVHSFVSDDYKDLVE 338
            +HESL +VI  KYASP +GK      KV SFVSD+YK+LV+
Sbjct: 663  IHESLTTVIGIKYASPGEGKSDLRACKVKSFVSDEYKELVD 703


>ref|XP_002298134.1| predicted protein [Populus trichocarpa] gi|222845392|gb|EEE82939.1|
            predicted protein [Populus trichocarpa]
          Length = 746

 Score =  159 bits (402), Expect = 2e-36
 Identities = 161/572 (28%), Positives = 266/572 (46%), Gaps = 60/572 (10%)
 Frame = -3

Query: 1858 SQSYSVVNLETTDMHGASQEKEAVVDGCIDAIQDSVAEEKNKLDGILDELLSTDC--MEG 1685
            SQ+ ++  +E+T++  A +E E   +  +   Q S+A+ + ++D +LDELL+ DC  ++G
Sbjct: 219  SQAVALQLMESTNV--ALEESELAGEWLLVQEQASMAKAEKRVDKLLDELLACDCEELDG 276

Query: 1684 DGAVKLLQERLHIRPVNVDNFCLPELGSVRKTDVRLPLEHVPRPRKSSSYADNVTK---- 1517
            DGAV LLQ+RL ++ ++++   LPEL  V++T++     ++P+PR   S+  N+ +    
Sbjct: 277  DGAVTLLQDRLQVKSLDIEKLNLPELLYVQRTNLNALGGNLPKPRNVLSHIHNLPRRTLT 336

Query: 1516 --KSKDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQRDRPSDPFLFS---ESDRSPVGDS 1352
              K +  G+  +               ++L +H+ Q + P++P L S   E D +  G+S
Sbjct: 337  PMKQQIAGNSTSSFGSPAPPKSQLASLALLRKHILQSNPPTNPVLKSLIIEEDDTTAGNS 396

Query: 1351 TRANGIENGAPSPHVDSTSKSAEVNRSALSFSGKFESMIKENPTDSNMVLDKLATEDSVH 1172
            +    +   A + ++ S    ++V  S  S   +          +SN+ +D   T + + 
Sbjct: 397  SPTE-VAVKALNDNLTSLGSGSDVRPSKSSAEVE----------NSNVGVDNGITYEYLS 445

Query: 1171 ALD---QVEHNSSRINDNVNLTVNGCDRDLEDEVDGMQQPEPIGKGKSPAGSTIVLDKLR 1001
             L     V+ N     ++++LT  G    +ED +      + +    S  GS  ++   +
Sbjct: 446  QLGGDADVQTNGPNELEDMDLTSRGSAMQVED-IQQKAVDKSLNGNLSSLGSGSIVCPSK 504

Query: 1000 T----EDSSHSLDP--LDGSSSKHTNHMDYSLNGCDGHLEDEVSTDTYFDTNSISVDGMQ 839
            T    E+S+  +D   +D +SS     +D   N  +  LED        DT    ++   
Sbjct: 505  TSAEVENSNIGVDDGVIDENSSLRGGDVDIQTNRRN-ELEDMPE-----DTAMEYLNPRD 558

Query: 838  QQETVGSIVAEDGCEGLTMKNVHGPESQLDVSGDTIDADNEGHVHSDNAVGIPEQNIESG 659
            Q E + +   ED                +D   +T D D E           P+ N E  
Sbjct: 559  QFEQLSAAFVEDHA--------------MDSCPETQDRDLE-----QTKANTPKHNNERV 599

Query: 658  GGQTHQQPPGESLNVRIKQKAPKRSGRESKIVSRR-SLAGA------------------- 539
                 ++PP  S N + K+K+    GR+ + +SRR SLAGA                   
Sbjct: 600  -----EKPPVVSTNKQTKEKSCTAKGRKYRSLSRRQSLAGAHCCVKLSIIWLFFSLLFLS 654

Query: 538  ----------------GTKWQGGVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVIAYKY 407
                            GT W+ GVRR+TR   RPLEYW+GER +YGR+H SL +VI  KY
Sbjct: 655  QSYCVDNLIYVVSIASGTSWETGVRRSTRIRSRPLEYWKGERFLYGRIHGSLATVIGIKY 714

Query: 406  ASPA--KGK--LKVHSFVSDDYKDLVEKQSLY 323
             SP   KGK  LKV SFVSD+YK+LVE  +L+
Sbjct: 715  ESPQNDKGKPALKVKSFVSDEYKNLVELAALH 746


>gb|AAU04611.1| CENP-C [Arabidopsis arenosa]
          Length = 710

 Score =  157 bits (398), Expect = 7e-36
 Identities = 150/512 (29%), Positives = 234/512 (45%), Gaps = 36/512 (7%)
 Frame = -3

Query: 1765 IQDSVAEEKNKLDGILDELL--STDCMEGDGAVKLLQERLHIRPVNVDNFCLPELGSVRK 1592
            + DS  +    L+ IL +LL  S D +EGD AVKLL++ L I  +NV+ F +PE   VRK
Sbjct: 222  VDDSTVDTDKDLNNILKKLLASSRDELEGDAAVKLLEDHLQIESLNVEKFSIPEFQDVRK 281

Query: 1591 TDVRLPLEHVPRPRKSSSYADNVTKKS-KDKGSVETHNDXXXXXXXXXXXXSMLSRHVSQ 1415
             +++    + P  RKS S   N+ K + +  G   +H+                 +H S 
Sbjct: 282  MNLKASGSN-PSNRKSLSDIQNILKGTHRVAGRKNSHSPSP-----------QTRKHFSS 329

Query: 1414 RDRPSDPFLFSE------SDRSP--VGDSTRANGIENGAPS---------PHVDSTSKSA 1286
             + P D F F +       D+ P  V     A  I N +PS         P  +S  K +
Sbjct: 330  PNPPVDQFSFPDIHNLLPGDQQPSEVDVQPLAKDIANTSPSNVGTVDVASPFNNSVEKRS 389

Query: 1285 EVNRSALSFSGKFESMIKENPTDSNMVLDKLATEDSVHALDQVEHNSSRINDNVNLTVNG 1106
            + + S +  SG   S ++ +      V+D ++  +S      V+  ++    +V ++ +G
Sbjct: 390  DEDDSHIH-SGIHRSHLRPDGNVDICVMDSISNRNSAMLEVNVDMRTTGKEVDVPMSESG 448

Query: 1105 CDRDL---EDEVDGMQQPEPIGKGKSPAGSTIVLDKLRTEDSSHSLDPLDGSSSKHTNHM 935
             +R+    E+++D  ++   +      A           EDS   +    G+SS   N  
Sbjct: 449  ANRNTGQRENDIDINEETGHLEMLAEYASKEATRPFTVEEDS---IPYQQGTSSNSPNRA 505

Query: 934  DYSLNGCDG---HLEDEVSTDTYFDTNSISVDGMQQQETVGSIVAEDGCEGLTMKNVHGP 764
                N  DG   H E         + N+ S  G+Q+   +  +      +   ++     
Sbjct: 506  PEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQEN-ALQEVHNSSHKQTNKLRKRGSS 564

Query: 763  ESQLDVSGDTIDADNEGHVHSDNAVGIPEQNIESGGGQTHQQPPGESLNVRIKQKAPKRS 584
            +S +     T+  +  G         +P    ESG  +  ++   E      ++K PK +
Sbjct: 565  DSNVKKRSKTVHGETGGDPQMKT---LPH---ESGVKKQTKRKSNER-----EEKKPKNT 613

Query: 583  G----RESKIVSRR-SLAGAGTKWQGGVRRTTRNIMRPLEYWRGERVVYGRVHESLPSVI 419
                 RE K+ SRR SLA AGTK +GGVRR+TR   RPLEYW+GER +YGR+HESL +VI
Sbjct: 614  RKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGRIHESLTTVI 673

Query: 418  AYKYASPAKGK-----LKVHSFVSDDYKDLVE 338
              KYASP +GK      KV SFVSD+YK+LV+
Sbjct: 674  GIKYASPGEGKSDVRACKVKSFVSDEYKELVD 705

