BLASTX nr result

ID: Cephaelis21_contig00004456 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00004456
         (1698 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAI48081.1| centromere protein C homologue [Nicotiana tabacu...   169   2e-39
dbj|BAI48085.1| centromere protein C [Nicotiana tomentosiformis]      167   1e-38
dbj|BAI48084.1| centromere protein C homologue [Nicotiana tabacum]    164   9e-38
gb|AAU04611.1| CENP-C [Arabidopsis arenosa]                           128   4e-27
ref|XP_003619785.1| CENP-C [Medicago truncatula] gi|355494800|gb...   128   5e-27

>dbj|BAI48081.1| centromere protein C homologue [Nicotiana tabacum]
            gi|262263167|dbj|BAI48086.1| centromere protein C
            [Nicotiana sylvestris]
          Length = 715

 Score =  169 bits (429), Expect = 2e-39
 Identities = 161/515 (31%), Positives = 236/515 (45%), Gaps = 58/515 (11%)
 Frame = +1

Query: 4    KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183
            KTE  +NGIL+ELLS N EDL G+ ALS LQERL IKPI+ G L +P+F  T     G+ 
Sbjct: 241  KTE--INGILNELLSSNGEDLIGEMALSNLQERLGIKPIELGPLCIPEFPMT-----GKV 293

Query: 184  NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324
            +  A GE++  P K   +I +L    + G    +++  ESP     SPT           
Sbjct: 294  DGKAFGERIRKPWKFSQDIRDLVKSATEGTASTRRQHEESPTNNLASPTPPKSPHASLSL 353

Query: 325  XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYAN----FESH 492
                        DPFSPL++DL      Y  D+ +  PP  ++       +N     ESH
Sbjct: 354  LKQKIFRSNPLRDPFSPLNIDL------YNNDSQSDHPPGWSMKMNPQCISNNAGPTESH 407

Query: 493  VENELGRPTTSTMDLQTVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVTGD 672
             E E       + D   ++  +  D +        H ++ + D      +T      +G+
Sbjct: 408  GETE---NIAGSDDTNIMVPLSGSDFS--------HEQLMENDSGKDNVKTGSNGSQSGE 456

Query: 673  ALVENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLS-TRFNSHADDST--QIGGD 843
             L EN +          IN+ I  N N  N+    E+  L   +  S  +D +  Q G +
Sbjct: 457  EL-ENGY-------DIEINTDI--NLNMRNMDSHYESDALDKVKDVSVVNDVSKDQQGLE 506

Query: 844  TGGYVHQPQKVEGMPLETDVPASPQVMPQLQMVENL--------YG--------DQPPLD 975
            T  Y    QK++   +  +  +SPQ   +     N         +G        D  P  
Sbjct: 507  TESYF-SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAADFGSFEIDGQVDDMPPQ 565

Query: 976  QPNSM------------VTEDNPVITPSIAAEIVTKKESKKL--QTKEHR--------KV 1089
            + NS             VT D      S+A E+ + +   KL   + +H         K 
Sbjct: 566  RANSAEQDHHFEDSVKDVTSDQ---LSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKA 622

Query: 1090 KRPRGDXXXXXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKL 1269
            KRP G              A+ GTSF+ GVRRS+R+K+RPL+YWKGER L+G +N+ +KL
Sbjct: 623  KRPAGGRRESKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLYGWINDSLKL 682

Query: 1270 IGVKYISPGKGDGKLKVKPYISDDYKEMLDLAARH 1374
            +GVKY+SPGK  G +KV+ YISDDYK++++ AAR+
Sbjct: 683  VGVKYLSPGK--GSVKVESYISDDYKDLVESAARY 715


>dbj|BAI48085.1| centromere protein C [Nicotiana tomentosiformis]
          Length = 714

 Score =  167 bits (422), Expect = 1e-38
 Identities = 148/506 (29%), Positives = 230/506 (45%), Gaps = 49/506 (9%)
 Frame = +1

Query: 4    KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183
            KTE   NGIL+ELLS N  DL+G  ALS LQE L+IKPI+ G L  P+F  T     G+ 
Sbjct: 240  KTEN--NGILNELLSSNGGDLNGGMALSKLQEWLQIKPIELGPLCFPEFPMT-----GKV 292

Query: 184  NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324
            +  A GE++  PRK    I +L    + G T  +++  ESP     SPT           
Sbjct: 293  DGKAFGERIRKPRKFSLEIRDLVKSATEGTTSTRRQHEESPTNNLASPTPPKSPHASLSL 352

Query: 325  XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENE 504
                        DPFSPL++DL         D+ +  PP  ++       +N        
Sbjct: 353  LRQKISQSNPLRDPFSPLNIDLDNS------DSQSDHPPGWSMKMNPQCISNSAG----- 401

Query: 505  LGRPTTSTMDLQTVISTASDDLADHLLDKNL-HREITDTDYQPTGTQTDFTECVTGDALV 681
               PT S  + + +  + + ++   L   N  H ++   D      +T      +G+ L 
Sbjct: 402  ---PTESHGETENIAGSDNANIMLPLSGSNFSHEQLMINDSGKDNVKTGPNGSQSGEEL- 457

Query: 682  ENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLSTRFNSHADDSTQIGGDTGGYVH 861
            EN +  +          ++ ++  ++ +    + +V++           Q G +T  Y+ 
Sbjct: 458  ENGYDIDINTDINLTMRIMDSHYESDVLDKVKDVSVVNDVLKD------QQGLETESYI- 510

Query: 862  QPQKVEGMPLETDVPASPQVMPQLQMVENLYGDQPPLDQPNSMVTEDNPVITP------- 1020
              QK++   +  +  +SPQ   +     N   +   +D  +S +      + P       
Sbjct: 511  SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAVDFGSSEIDGQVDDMPPQRAHSAE 570

Query: 1021 ------------------SIAAEIVTKKESKKL--QTKEHR--------KVKRPRGDXXX 1116
                              S+A E+ + +   KL   + +H         K KRP G    
Sbjct: 571  QDHHFEDSVKGVTSDQLSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKAKRPAGGRRE 630

Query: 1117 XXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKLIGVKYISPG 1296
                      A+ GTSF+ GVRRS+R+K+RPL+YWKGER LFGRVN+ +KL+GVKYISPG
Sbjct: 631  SKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLFGRVNDSLKLVGVKYISPG 690

Query: 1297 KGDGKLKVKPYISDDYKEMLDLAARH 1374
            K  G +KV+ +ISDDYK++++LAAR+
Sbjct: 691  K--GSVKVESFISDDYKDLVELAARY 714


>dbj|BAI48084.1| centromere protein C homologue [Nicotiana tabacum]
          Length = 714

 Score =  164 bits (414), Expect = 9e-38
 Identities = 147/506 (29%), Positives = 228/506 (45%), Gaps = 49/506 (9%)
 Frame = +1

Query: 4    KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183
            KTE   NGIL+ELLS N  DL+G  ALS LQE L+IKPI+ G L  P+F        G+ 
Sbjct: 240  KTEN--NGILNELLSSNGGDLNGGMALSKLQEWLQIKPIELGPLCFPEFPMA-----GKV 292

Query: 184  NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324
            +  A GE++  PRK    I +L    + G T  +++  ESP     SPT           
Sbjct: 293  DGKAFGERIRKPRKFSLEIRDLVKSATEGTTSTRRQHEESPTNNLASPTPPKSPHASLSL 352

Query: 325  XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENE 504
                        DPFSPL++DL         D+ +  PP  ++       +N        
Sbjct: 353  LRQKISQSNPLRDPFSPLNIDLDNS------DSQSDHPPGWSMKMNPQCISNSAG----- 401

Query: 505  LGRPTTSTMDLQTVISTASDDLADHLLDKNL-HREITDTDYQPTGTQTDFTECVTGDALV 681
               PT S  + + +  + + ++   L   N  H ++   D      +T      +G+ L 
Sbjct: 402  ---PTESHGETENIAGSDNANIMLPLSGSNFSHEQLMINDSGKDNVKTGPNGSQSGEEL- 457

Query: 682  ENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLSTRFNSHADDSTQIGGDTGGYVH 861
            EN +  +          ++ ++  ++ +    + +V++           Q G +T  Y+ 
Sbjct: 458  ENGYDIDINTDINLTMRIMDSHYESDVLDKVKDVSVVNDVLKD------QQGLETESYI- 510

Query: 862  QPQKVEGMPLETDVPASPQVMPQLQMVENLYGDQPPLDQPNSMVTEDNPVITP------- 1020
              QK++   +  +  +SPQ   +     N   +   +D  +S +      + P       
Sbjct: 511  SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAVDFGSSEIDGQVDNMPPQRAHSAE 570

Query: 1021 ------------------SIAAEIVTKKESKKL---QTKEHRKVK-------RPRGDXXX 1116
                              S+A E+ + +   KL     + H K K       RP G    
Sbjct: 571  QDHHFEDSVKGVTSDQLSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKAERPAGGRRE 630

Query: 1117 XXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKLIGVKYISPG 1296
                      A+ GTSF+ GVRRS+R+K+RPL+YWKGER LFGRVN+ +KL+GVKYISPG
Sbjct: 631  SKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLFGRVNDSLKLVGVKYISPG 690

Query: 1297 KGDGKLKVKPYISDDYKEMLDLAARH 1374
            K  G +KV+ +ISDDYK++++LAAR+
Sbjct: 691  K--GSVKVESFISDDYKDLVELAARY 714


>gb|AAU04611.1| CENP-C [Arabidopsis arenosa]
          Length = 710

 Score =  128 bits (322), Expect = 4e-27
 Identities = 153/520 (29%), Positives = 226/520 (43%), Gaps = 64/520 (12%)
 Frame = +1

Query: 7    TEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRTN 186
            T+K LN IL +LL+ + ++L+GD A+ LL++ L+I+ ++  K S+P+F         + N
Sbjct: 229  TDKDLNNILKKLLASSRDELEGDAAVKLLEDHLQIESLNVEKFSIPEF-----QDVRKMN 283

Query: 187  FMALGEKLPMPRKTLSNISNLANRLSGETH--AKKKVGESPISPTAHXXXXXXXXXXXXX 360
              A G   P  RK+LS+I N+   L G TH  A +K   SP   T               
Sbjct: 284  LKASGSN-PSNRKSLSDIQNI---LKG-THRVAGRKNSHSPSPQTRKHFSSPNPPV---- 334

Query: 361  DPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENELGRPTTSTMDLQ 540
            D FS  D+    PG+        +QP EV V       AN               T+D+ 
Sbjct: 335  DQFSFPDIHNLLPGD--------QQPSEVDVQPLAKDIANTSPS--------NVGTVDVA 378

Query: 541  TVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVT------GDALVENSFCHN 702
            +  + + +  +D   D ++H  I  +  +P G   +   CV         A++E +    
Sbjct: 379  SPFNNSVEKRSDED-DSHIHSGIHRSHLRPDG---NVDICVMDSISNRNSAMLEVNVDMR 434

Query: 703  TTMGGESIN---SVIAANRNT---ENVADRCE-----------AAVLSTRFNSHADDSTQ 831
            TT  G+ ++   S   ANRNT   EN  D  E           A+  +TR  +  +DS  
Sbjct: 435  TT--GKEVDVPMSESGANRNTGQRENDIDINEETGHLEMLAEYASKEATRPFTVEEDS-- 490

Query: 832  IGGDTGGYVHQPQKVEGMPLETDVPASPQVMPQ-LQMVENLYGDQPPLDQPNSMVTEDNP 1008
            I    G   + P +        D P+      Q L   EN+  D     Q N++    N 
Sbjct: 491  IPYQQGTSSNSPNRAPEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQENALQEVHNS 550

Query: 1009 V------ITPSIAAEIVTKKESK---------------------KLQTKE---HRKVKRP 1098
                   +    +++   KK SK                     K QTK     R+ K+P
Sbjct: 551  SHKQTNKLRKRGSSDSNVKKRSKTVHGETGGDPQMKTLPHESGVKKQTKRKSNEREEKKP 610

Query: 1099 RGDXXXXXXXXXXXX----FAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGI- 1263
            +                   A  GT  E GVRRS RIKSRPL+YWKGERFL+GR++E + 
Sbjct: 611  KNTRKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGRIHESLT 670

Query: 1264 KLIGVKYISPGKGDGKL---KVKPYISDDYKEMLDLAARH 1374
             +IG+KY SPG+G   +   KVK ++SD+YKE++D AA H
Sbjct: 671  TVIGIKYASPGEGKSDVRACKVKSFVSDEYKELVDFAASH 710


>ref|XP_003619785.1| CENP-C [Medicago truncatula] gi|355494800|gb|AES76003.1| CENP-C
            [Medicago truncatula]
          Length = 641

 Score =  128 bits (321), Expect = 5e-27
 Identities = 133/481 (27%), Positives = 218/481 (45%), Gaps = 26/481 (5%)
 Frame = +1

Query: 10   EKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRTNF 189
            E ++N IL  LL  +SE+L+G+GA++LLQERL++K I   KLS+PDF   D       + 
Sbjct: 195  ENKMNDILKGLLDCDSEELEGEGAMNLLQERLQVKSIVFEKLSVPDF--LDIQPIDLKSL 252

Query: 190  MALGEKLPMPRKTLSNISNLANRLSGETHAKKKVGES---------PISPTA--HXXXXX 336
                 K P   K  S++ N    ++ +T  ++ VG +         P SP A        
Sbjct: 253  QGTLSK-PSKGKAFSDVDNWLKGMNIQTPLRRSVGYAEKQLASPTPPKSPFASLSSLQKH 311

Query: 337  XXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENELGRP 516
                    DPFS  ++DL  P  +Y   ++  Q  ++    +LS          +EL  P
Sbjct: 312  ISRSKLSTDPFSTHEIDL-VPTRSYSPIHMADQEVDIVGSSKLS----------DELTAP 360

Query: 517  TTSTM---DLQTVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVTGD-ALVE 684
            TT  +     +  I   S++  +H   +N   E+     +      D    +T   ++V+
Sbjct: 361  TTEDVIAAGEKNTIPETSENSKEH-NSRNPSDEVNAPIIEDIVDNPDRNCTITPQKSMVD 419

Query: 685  NS----FCHNTTMGGESINSVIAANRN--TENVADRCEAAV-LSTRFNSHADDSTQIGGD 843
            NS    F  N      +++  +   R+   + V D  E    +      H DD+T     
Sbjct: 420  NSTEPGFNANVDSNEPAVDMDVDIGRSGMGKRVMDDTEGRQNVEPNEPFHFDDNTLEENM 479

Query: 844  TGGYVHQPQKVEGMPLETDVPASPQVMPQLQMVENL-YGDQPPLDQPNSMVTEDNPVITP 1020
             G     P   +   L T++P + Q  P      ++  G +   D P   + E       
Sbjct: 480  QGFTSSIP--TDDANLNTELPLADQSNPVTYQANSMDKGSRRSDDGPEQCLQEKTIGSAA 537

Query: 1021 SIAAEIVTKKESKKLQTKEHRKVKRPRGDXXXXXXXXXXXXFAEGGTSFESGVRRSQRIK 1200
             +  + + K   +K  +K  R +++                 A+ GTS+ESGVRRS R +
Sbjct: 538  PVNGQTIVKSCMRK-GSKGKRLLRK---------------SLADAGTSWESGVRRSTRFR 581

Query: 1201 SRPLQYWKGERFLFGRVNEGIK-LIGVKYISPGKGDGK--LKVKPYISDDYKEMLDLAAR 1371
            ++PL+YWKGER ++GRV+E +  +IGVK +SPG  DGK  +KVK ++SD YKE+ ++A+ 
Sbjct: 582  TKPLEYWKGERMVYGRVHESLSTVIGVKCMSPG-SDGKPTMKVKSFVSDKYKELFEIASE 640

Query: 1372 H 1374
            +
Sbjct: 641  Y 641


Top