BLASTX nr result
ID: Cephaelis21_contig00004456
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00004456 (1698 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAI48081.1| centromere protein C homologue [Nicotiana tabacu... 169 2e-39 dbj|BAI48085.1| centromere protein C [Nicotiana tomentosiformis] 167 1e-38 dbj|BAI48084.1| centromere protein C homologue [Nicotiana tabacum] 164 9e-38 gb|AAU04611.1| CENP-C [Arabidopsis arenosa] 128 4e-27 ref|XP_003619785.1| CENP-C [Medicago truncatula] gi|355494800|gb... 128 5e-27 >dbj|BAI48081.1| centromere protein C homologue [Nicotiana tabacum] gi|262263167|dbj|BAI48086.1| centromere protein C [Nicotiana sylvestris] Length = 715 Score = 169 bits (429), Expect = 2e-39 Identities = 161/515 (31%), Positives = 236/515 (45%), Gaps = 58/515 (11%) Frame = +1 Query: 4 KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183 KTE +NGIL+ELLS N EDL G+ ALS LQERL IKPI+ G L +P+F T G+ Sbjct: 241 KTE--INGILNELLSSNGEDLIGEMALSNLQERLGIKPIELGPLCIPEFPMT-----GKV 293 Query: 184 NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324 + A GE++ P K +I +L + G +++ ESP SPT Sbjct: 294 DGKAFGERIRKPWKFSQDIRDLVKSATEGTASTRRQHEESPTNNLASPTPPKSPHASLSL 353 Query: 325 XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYAN----FESH 492 DPFSPL++DL Y D+ + PP ++ +N ESH Sbjct: 354 LKQKIFRSNPLRDPFSPLNIDL------YNNDSQSDHPPGWSMKMNPQCISNNAGPTESH 407 Query: 493 VENELGRPTTSTMDLQTVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVTGD 672 E E + D ++ + D + H ++ + D +T +G+ Sbjct: 408 GETE---NIAGSDDTNIMVPLSGSDFS--------HEQLMENDSGKDNVKTGSNGSQSGE 456 Query: 673 ALVENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLS-TRFNSHADDST--QIGGD 843 L EN + IN+ I N N N+ E+ L + S +D + Q G + Sbjct: 457 EL-ENGY-------DIEINTDI--NLNMRNMDSHYESDALDKVKDVSVVNDVSKDQQGLE 506 Query: 844 TGGYVHQPQKVEGMPLETDVPASPQVMPQLQMVENL--------YG--------DQPPLD 975 T Y QK++ + + +SPQ + N +G D P Sbjct: 507 TESYF-SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAADFGSFEIDGQVDDMPPQ 565 Query: 976 QPNSM------------VTEDNPVITPSIAAEIVTKKESKKL--QTKEHR--------KV 1089 + NS VT D S+A E+ + + KL + +H K Sbjct: 566 RANSAEQDHHFEDSVKDVTSDQ---LSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKA 622 Query: 1090 KRPRGDXXXXXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKL 1269 KRP G A+ GTSF+ GVRRS+R+K+RPL+YWKGER L+G +N+ +KL Sbjct: 623 KRPAGGRRESKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLYGWINDSLKL 682 Query: 1270 IGVKYISPGKGDGKLKVKPYISDDYKEMLDLAARH 1374 +GVKY+SPGK G +KV+ YISDDYK++++ AAR+ Sbjct: 683 VGVKYLSPGK--GSVKVESYISDDYKDLVESAARY 715 >dbj|BAI48085.1| centromere protein C [Nicotiana tomentosiformis] Length = 714 Score = 167 bits (422), Expect = 1e-38 Identities = 148/506 (29%), Positives = 230/506 (45%), Gaps = 49/506 (9%) Frame = +1 Query: 4 KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183 KTE NGIL+ELLS N DL+G ALS LQE L+IKPI+ G L P+F T G+ Sbjct: 240 KTEN--NGILNELLSSNGGDLNGGMALSKLQEWLQIKPIELGPLCFPEFPMT-----GKV 292 Query: 184 NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324 + A GE++ PRK I +L + G T +++ ESP SPT Sbjct: 293 DGKAFGERIRKPRKFSLEIRDLVKSATEGTTSTRRQHEESPTNNLASPTPPKSPHASLSL 352 Query: 325 XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENE 504 DPFSPL++DL D+ + PP ++ +N Sbjct: 353 LRQKISQSNPLRDPFSPLNIDLDNS------DSQSDHPPGWSMKMNPQCISNSAG----- 401 Query: 505 LGRPTTSTMDLQTVISTASDDLADHLLDKNL-HREITDTDYQPTGTQTDFTECVTGDALV 681 PT S + + + + + ++ L N H ++ D +T +G+ L Sbjct: 402 ---PTESHGETENIAGSDNANIMLPLSGSNFSHEQLMINDSGKDNVKTGPNGSQSGEEL- 457 Query: 682 ENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLSTRFNSHADDSTQIGGDTGGYVH 861 EN + + ++ ++ ++ + + +V++ Q G +T Y+ Sbjct: 458 ENGYDIDINTDINLTMRIMDSHYESDVLDKVKDVSVVNDVLKD------QQGLETESYI- 510 Query: 862 QPQKVEGMPLETDVPASPQVMPQLQMVENLYGDQPPLDQPNSMVTEDNPVITP------- 1020 QK++ + + +SPQ + N + +D +S + + P Sbjct: 511 SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAVDFGSSEIDGQVDDMPPQRAHSAE 570 Query: 1021 ------------------SIAAEIVTKKESKKL--QTKEHR--------KVKRPRGDXXX 1116 S+A E+ + + KL + +H K KRP G Sbjct: 571 QDHHFEDSVKGVTSDQLSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKAKRPAGGRRE 630 Query: 1117 XXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKLIGVKYISPG 1296 A+ GTSF+ GVRRS+R+K+RPL+YWKGER LFGRVN+ +KL+GVKYISPG Sbjct: 631 SKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLFGRVNDSLKLVGVKYISPG 690 Query: 1297 KGDGKLKVKPYISDDYKEMLDLAARH 1374 K G +KV+ +ISDDYK++++LAAR+ Sbjct: 691 K--GSVKVESFISDDYKDLVELAARY 714 >dbj|BAI48084.1| centromere protein C homologue [Nicotiana tabacum] Length = 714 Score = 164 bits (414), Expect = 9e-38 Identities = 147/506 (29%), Positives = 228/506 (45%), Gaps = 49/506 (9%) Frame = +1 Query: 4 KTEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRT 183 KTE NGIL+ELLS N DL+G ALS LQE L+IKPI+ G L P+F G+ Sbjct: 240 KTEN--NGILNELLSSNGGDLNGGMALSKLQEWLQIKPIELGPLCFPEFPMA-----GKV 292 Query: 184 NFMALGEKLPMPRKTLSNISNLANRLS-GETHAKKKVGESPI----SPTA--------HX 324 + A GE++ PRK I +L + G T +++ ESP SPT Sbjct: 293 DGKAFGERIRKPRKFSLEIRDLVKSATEGTTSTRRQHEESPTNNLASPTPPKSPHASLSL 352 Query: 325 XXXXXXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENE 504 DPFSPL++DL D+ + PP ++ +N Sbjct: 353 LRQKISQSNPLRDPFSPLNIDLDNS------DSQSDHPPGWSMKMNPQCISNSAG----- 401 Query: 505 LGRPTTSTMDLQTVISTASDDLADHLLDKNL-HREITDTDYQPTGTQTDFTECVTGDALV 681 PT S + + + + + ++ L N H ++ D +T +G+ L Sbjct: 402 ---PTESHGETENIAGSDNANIMLPLSGSNFSHEQLMINDSGKDNVKTGPNGSQSGEEL- 457 Query: 682 ENSFCHNTTMGGESINSVIAANRNTENVADRCEAAVLSTRFNSHADDSTQIGGDTGGYVH 861 EN + + ++ ++ ++ + + +V++ Q G +T Y+ Sbjct: 458 ENGYDIDINTDINLTMRIMDSHYESDVLDKVKDVSVVNDVLKD------QQGLETESYI- 510 Query: 862 QPQKVEGMPLETDVPASPQVMPQLQMVENLYGDQPPLDQPNSMVTEDNPVITP------- 1020 QK++ + + +SPQ + N + +D +S + + P Sbjct: 511 SCQKMQDGEVLAETLSSPQAQGEADDTHNCSVETVAVDFGSSEIDGQVDNMPPQRAHSAE 570 Query: 1021 ------------------SIAAEIVTKKESKKL---QTKEHRKVK-------RPRGDXXX 1116 S+A E+ + + KL + H K K RP G Sbjct: 571 QDHHFEDSVKGVTSDQLSSVAVEVHSTEVRSKLPDMSPQHHAKAKDKQPKAERPAGGRRE 630 Query: 1117 XXXXXXXXXFAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGIKLIGVKYISPG 1296 A+ GTSF+ GVRRS+R+K+RPL+YWKGER LFGRVN+ +KL+GVKYISPG Sbjct: 631 SKALRSRPSLADAGTSFQDGVRRSKRMKTRPLEYWKGERLLFGRVNDSLKLVGVKYISPG 690 Query: 1297 KGDGKLKVKPYISDDYKEMLDLAARH 1374 K G +KV+ +ISDDYK++++LAAR+ Sbjct: 691 K--GSVKVESFISDDYKDLVELAARY 714 >gb|AAU04611.1| CENP-C [Arabidopsis arenosa] Length = 710 Score = 128 bits (322), Expect = 4e-27 Identities = 153/520 (29%), Positives = 226/520 (43%), Gaps = 64/520 (12%) Frame = +1 Query: 7 TEKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRTN 186 T+K LN IL +LL+ + ++L+GD A+ LL++ L+I+ ++ K S+P+F + N Sbjct: 229 TDKDLNNILKKLLASSRDELEGDAAVKLLEDHLQIESLNVEKFSIPEF-----QDVRKMN 283 Query: 187 FMALGEKLPMPRKTLSNISNLANRLSGETH--AKKKVGESPISPTAHXXXXXXXXXXXXX 360 A G P RK+LS+I N+ L G TH A +K SP T Sbjct: 284 LKASGSN-PSNRKSLSDIQNI---LKG-THRVAGRKNSHSPSPQTRKHFSSPNPPV---- 334 Query: 361 DPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENELGRPTTSTMDLQ 540 D FS D+ PG+ +QP EV V AN T+D+ Sbjct: 335 DQFSFPDIHNLLPGD--------QQPSEVDVQPLAKDIANTSPS--------NVGTVDVA 378 Query: 541 TVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVT------GDALVENSFCHN 702 + + + + +D D ++H I + +P G + CV A++E + Sbjct: 379 SPFNNSVEKRSDED-DSHIHSGIHRSHLRPDG---NVDICVMDSISNRNSAMLEVNVDMR 434 Query: 703 TTMGGESIN---SVIAANRNT---ENVADRCE-----------AAVLSTRFNSHADDSTQ 831 TT G+ ++ S ANRNT EN D E A+ +TR + +DS Sbjct: 435 TT--GKEVDVPMSESGANRNTGQRENDIDINEETGHLEMLAEYASKEATRPFTVEEDS-- 490 Query: 832 IGGDTGGYVHQPQKVEGMPLETDVPASPQVMPQ-LQMVENLYGDQPPLDQPNSMVTEDNP 1008 I G + P + D P+ Q L EN+ D Q N++ N Sbjct: 491 IPYQQGTSSNSPNRAPEQYNTMDGPSEHAEHNQGLHEEENVNTDSASGLQENALQEVHNS 550 Query: 1009 V------ITPSIAAEIVTKKESK---------------------KLQTKE---HRKVKRP 1098 + +++ KK SK K QTK R+ K+P Sbjct: 551 SHKQTNKLRKRGSSDSNVKKRSKTVHGETGGDPQMKTLPHESGVKKQTKRKSNEREEKKP 610 Query: 1099 RGDXXXXXXXXXXXX----FAEGGTSFESGVRRSQRIKSRPLQYWKGERFLFGRVNEGI- 1263 + A GT E GVRRS RIKSRPL+YWKGERFL+GR++E + Sbjct: 611 KNTRKTLTREGKLFSRRKSLAAAGTKMEGGVRRSTRIKSRPLEYWKGERFLYGRIHESLT 670 Query: 1264 KLIGVKYISPGKGDGKL---KVKPYISDDYKEMLDLAARH 1374 +IG+KY SPG+G + KVK ++SD+YKE++D AA H Sbjct: 671 TVIGIKYASPGEGKSDVRACKVKSFVSDEYKELVDFAASH 710 >ref|XP_003619785.1| CENP-C [Medicago truncatula] gi|355494800|gb|AES76003.1| CENP-C [Medicago truncatula] Length = 641 Score = 128 bits (321), Expect = 5e-27 Identities = 133/481 (27%), Positives = 218/481 (45%), Gaps = 26/481 (5%) Frame = +1 Query: 10 EKRLNGILDELLSKNSEDLDGDGALSLLQERLKIKPIDPGKLSLPDFGRTDFHGFGRTNF 189 E ++N IL LL +SE+L+G+GA++LLQERL++K I KLS+PDF D + Sbjct: 195 ENKMNDILKGLLDCDSEELEGEGAMNLLQERLQVKSIVFEKLSVPDF--LDIQPIDLKSL 252 Query: 190 MALGEKLPMPRKTLSNISNLANRLSGETHAKKKVGES---------PISPTA--HXXXXX 336 K P K S++ N ++ +T ++ VG + P SP A Sbjct: 253 QGTLSK-PSKGKAFSDVDNWLKGMNIQTPLRRSVGYAEKQLASPTPPKSPFASLSSLQKH 311 Query: 337 XXXXXXXXDPFSPLDVDLSEPGNAYCVDNINRQPPEVTVLKELSIYANFESHVENELGRP 516 DPFS ++DL P +Y ++ Q ++ +LS +EL P Sbjct: 312 ISRSKLSTDPFSTHEIDL-VPTRSYSPIHMADQEVDIVGSSKLS----------DELTAP 360 Query: 517 TTSTM---DLQTVISTASDDLADHLLDKNLHREITDTDYQPTGTQTDFTECVTGD-ALVE 684 TT + + I S++ +H +N E+ + D +T ++V+ Sbjct: 361 TTEDVIAAGEKNTIPETSENSKEH-NSRNPSDEVNAPIIEDIVDNPDRNCTITPQKSMVD 419 Query: 685 NS----FCHNTTMGGESINSVIAANRN--TENVADRCEAAV-LSTRFNSHADDSTQIGGD 843 NS F N +++ + R+ + V D E + H DD+T Sbjct: 420 NSTEPGFNANVDSNEPAVDMDVDIGRSGMGKRVMDDTEGRQNVEPNEPFHFDDNTLEENM 479 Query: 844 TGGYVHQPQKVEGMPLETDVPASPQVMPQLQMVENL-YGDQPPLDQPNSMVTEDNPVITP 1020 G P + L T++P + Q P ++ G + D P + E Sbjct: 480 QGFTSSIP--TDDANLNTELPLADQSNPVTYQANSMDKGSRRSDDGPEQCLQEKTIGSAA 537 Query: 1021 SIAAEIVTKKESKKLQTKEHRKVKRPRGDXXXXXXXXXXXXFAEGGTSFESGVRRSQRIK 1200 + + + K +K +K R +++ A+ GTS+ESGVRRS R + Sbjct: 538 PVNGQTIVKSCMRK-GSKGKRLLRK---------------SLADAGTSWESGVRRSTRFR 581 Query: 1201 SRPLQYWKGERFLFGRVNEGIK-LIGVKYISPGKGDGK--LKVKPYISDDYKEMLDLAAR 1371 ++PL+YWKGER ++GRV+E + +IGVK +SPG DGK +KVK ++SD YKE+ ++A+ Sbjct: 582 TKPLEYWKGERMVYGRVHESLSTVIGVKCMSPG-SDGKPTMKVKSFVSDKYKELFEIASE 640 Query: 1372 H 1374 + Sbjct: 641 Y 641