BLASTX nr result
ID: Catharanthus22_contig00047366
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00047366 (337 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006476928.1| PREDICTED: uncharacterized protein LOC102609...   141   1e-31
ref|XP_006439980.1| hypothetical protein CICLE_v10022963mg [Citr...   141   1e-31
ref|XP_004236296.1| PREDICTED: centromere protein X-like [Solanu...   137   1e-30
ref|XP_006351435.1| PREDICTED: centromere protein X-like [Solanu...   136   3e-30
ref|XP_002318691.1| hypothetical protein POPTR_0012s09240g [Popu...   135   4e-30
ref|XP_002276097.2| PREDICTED: uncharacterized protein LOC100253...   135   6e-30
emb|CBI14906.3| unnamed protein product [Vitis vinifera]              135   6e-30
gb|EXB94443.1| hypothetical protein L484_018943 [Morus notabilis]     133   3e-29
ref|XP_004299168.1| PREDICTED: centromere protein X-like [Fragar...   133   3e-29
gb|EOY22302.1| Centromere protein X isoform 1 [Theobroma cacao]       132   4e-29
ref|XP_006303039.1| hypothetical protein CARUB_v10021194mg [Caps...   128   9e-28
ref|NP_178000.2| uncharacterized protein [Arabidopsis thaliana] ...   128   9e-28
ref|XP_002889209.1| hypothetical protein ARALYDRAFT_316775 [Arab...   127   1e-27
gb|EOY22303.1| Centromere protein X isoform 2, partial [Theobrom...   124   2e-26
ref|XP_004515852.1| PREDICTED: centromere protein X-like [Cicer ...   123   3e-26
gb|ESW27495.1| hypothetical protein PHAVU_003G206900g [Phaseolus...   120   1e-25
gb|EOY22304.1| Centromere protein X isoform 3 [Theobroma cacao]       120   1e-25
ref|XP_003525610.1| PREDICTED: centromere protein X-like [Glycin...   119   4e-25
ref|XP_004138074.1| PREDICTED: centromere protein X-like [Cucumi...
118 7e-25 ref|XP_003550858.2| PREDICTED: centromere protein X-like [Glycin... 116 3e-24 >ref|XP_006476928.1| PREDICTED: uncharacterized protein LOC102609861 isoform X1 [Citrus sinensis] gi|568846168|ref|XP_006476929.1| PREDICTED: uncharacterized protein LOC102609861 isoform X2 [Citrus sinensis] Length = 109 Score = 141 bits (355), Expect = 1e-31 Identities = 75/106 (70%), Positives = 83/106 (78%), Gaps = 2/106 (1%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENT--EVGGATPKKSRPTSANAKAL 152 E TFD DLI+ IFK +W RR+ ER+RN D M+ E + G T KK+RPTSANA AL Sbjct: 4 ETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANAL 63 Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 KLSCELLR+FVTEAVQRAA IAEAEG S IEATHLERILPQLLLDF Sbjct: 64 KLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109 >ref|XP_006439980.1| hypothetical protein CICLE_v10022963mg [Citrus clementina] gi|557542242|gb|ESR53220.1| hypothetical protein CICLE_v10022963mg [Citrus clementina] Length = 109 Score = 141 bits (355), Expect = 1e-31 Identities = 75/106 (70%), Positives = 83/106 (78%), Gaps = 2/106 (1%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENT--EVGGATPKKSRPTSANAKAL 152 E TFD DLI+ IFK +W RR+ ER+RN D M+ E + G T KK+RPTSANA AL Sbjct: 4 ETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLDQAGAGTSKKNRPTSANANAL 63 Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 KLSCELLR+FVTEAVQRAA IAEAEG S IEATHLERILPQLLLDF Sbjct: 64 KLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109 >ref|XP_004236296.1| PREDICTED: centromere protein X-like [Solanum lycopersicum] Length = 104 Score = 137 bits (346), Expect = 1e-30 Identities = 71/104 (68%), Positives = 86/104 (82%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 ENTFD DL++EIFK VW R+A+ER +NE+++N+D E VG ++ K+ RPT ANA ALKL Sbjct: 4 ENTFDPDLVHEIFKLVWKRKAAERGKNELSENIDNE---VGASSSKRIRPTFANANALKL 60 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 S ELLR+FV EA+QRAA IAEAEG+ IEATHLERILPQLLLDF Sbjct: 61 
SSELLRVFVAEAIQRAATIAEAEGSVKIEATHLERILPQLLLDF 104 >ref|XP_006351435.1| PREDICTED: centromere protein X-like [Solanum tuberosum] Length = 104 Score = 136 bits (342), Expect = 3e-30 Identities = 70/104 (67%), Positives = 86/104 (82%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 ENTFD DL++EIFK VW R+A+ER +NE+++N++ E VG ++ K+ RPT ANA ALKL Sbjct: 4 ENTFDPDLVHEIFKLVWKRKAAERGKNELSENIENE---VGASSSKRFRPTFANANALKL 60 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 S ELLR+FV EA+QRAA IAEAEG+ IEATHLERILPQLLLDF Sbjct: 61 SSELLRVFVAEAIQRAATIAEAEGSVKIEATHLERILPQLLLDF 104 >ref|XP_002318691.1| hypothetical protein POPTR_0012s09240g [Populus trichocarpa] gi|118488565|gb|ABK96095.1| unknown [Populus trichocarpa] gi|222859364|gb|EEE96911.1| hypothetical protein POPTR_0012s09240g [Populus trichocarpa] Length = 103 Score = 135 bits (341), Expect = 4e-30 Identities = 72/104 (69%), Positives = 81/104 (77%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 E TFD LI IFK +W RRA ER++NE DG + EVG T KK+R TSAN+ ALKL Sbjct: 3 EVTFDPGLIQAIFKHIWTRRALEREKNE---GNDGTDCEVGTGTLKKTRTTSANSNALKL 59 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 SCELLR+F+TEAVQR+AMIAEAEGA IE THLERILPQLLLDF Sbjct: 60 SCELLRIFITEAVQRSAMIAEAEGAGKIEGTHLERILPQLLLDF 103 >ref|XP_002276097.2| PREDICTED: uncharacterized protein LOC100253596 [Vitis vinifera] Length = 323 Score = 135 bits (340), Expect = 6e-30 Identities = 73/101 (72%), Positives = 82/101 (81%) Frame = -1 Query: 316 FDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCE 137 FD DLI+ IFK VW R A ER++NE AD ++ EVG AT KK+RPTSANA ALKLSCE Sbjct: 226 FDPDLIHAIFKLVWSRTALEREKNEGADPLE---CEVGAATSKKNRPTSANANALKLSCE 282 Query: 136 LLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 LLR+FV EAV+RAA IAEAEG + IEATHLERILPQLLLDF Sbjct: 283 LLRVFVIEAVERAATIAEAEGVNKIEATHLERILPQLLLDF 323 >emb|CBI14906.3| unnamed protein product [Vitis vinifera] Length = 179 Score = 135 bits (340), Expect = 6e-30 
Identities = 73/101 (72%), Positives = 82/101 (81%) Frame = -1 Query: 316 FDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCE 137 FD DLI+ IFK VW R A ER++NE AD ++ EVG AT KK+RPTSANA ALKLSCE Sbjct: 82 FDPDLIHAIFKLVWSRTALEREKNEGADPLE---CEVGAATSKKNRPTSANANALKLSCE 138 Query: 136 LLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 LLR+FV EAV+RAA IAEAEG + IEATHLERILPQLLLDF Sbjct: 139 LLRVFVIEAVERAATIAEAEGVNKIEATHLERILPQLLLDF 179 >gb|EXB94443.1| hypothetical protein L484_018943 [Morus notabilis] Length = 232 Score = 133 bits (334), Expect = 3e-29 Identities = 71/98 (72%), Positives = 77/98 (78%) Frame = -1 Query: 307 DLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCELLR 128 DLI+ IFK VW RRA ER++NE AD +D +E G KKSRPTSAN ALKLSCE LR Sbjct: 138 DLIHSIFKLVWTRRALEREKNESADALD---SEAGAGASKKSRPTSANGNALKLSCEFLR 194 Query: 127 LFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 +FVTEAVQRAA IAEAE S IEATHLERILPQLLLDF Sbjct: 195 IFVTEAVQRAAAIAEAEDVSKIEATHLERILPQLLLDF 232 >ref|XP_004299168.1| PREDICTED: centromere protein X-like [Fragaria vesca subsp. 
vesca] Length = 106 Score = 133 bits (334), Expect = 3e-29 Identities = 73/102 (71%), Positives = 79/102 (77%) Frame = -1 Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140 TF+ DLI+ IFK VW RRA ER E + +DGE +VG T KK+RP SANA ALKLSC Sbjct: 6 TFETDLIHAIFKLVWSRRALERQLVEGTEALDGE-VQVGAGTSKKNRPMSANANALKLSC 64 Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 ELLR FVTEAVQRAA IAEAEG IEATHLERILPQLLLDF Sbjct: 65 ELLRNFVTEAVQRAAAIAEAEGTDKIEATHLERILPQLLLDF 106 >gb|EOY22302.1| Centromere protein X isoform 1 [Theobroma cacao] Length = 108 Score = 132 bits (333), Expect = 4e-29 Identities = 69/102 (67%), Positives = 80/102 (78%) Frame = -1 Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140 T D DLI IFK +W R+A ER+RN I N D ++EVG T KK+RPTS NA +LKLS Sbjct: 8 TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTSTNADSLKLSS 66 Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILPQLLLDF Sbjct: 67 ELLRIFITEAVQRAATIAEAEGGTEIEATHVERILPQLLLDF 108 >ref|XP_006303039.1| hypothetical protein CARUB_v10021194mg [Capsella rubella] gi|565490801|ref|XP_006303040.1| hypothetical protein CARUB_v10021194mg [Capsella rubella] gi|482571749|gb|EOA35937.1| hypothetical protein CARUB_v10021194mg [Capsella rubella] gi|482571750|gb|EOA35938.1| hypothetical protein CARUB_v10021194mg [Capsella rubella] Length = 104 Score = 128 bits (321), Expect = 9e-28 Identities = 69/103 (66%), Positives = 78/103 (75%) Frame = -1 Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143 NTFD DLI+ IFK +W RR ER+R+ D +D EV T KK+R SANA ALKLS Sbjct: 5 NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEVALGTTKKNRLASANANALKLS 61 Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 CELL+ FV+EAVQRAA+IAEAEG IEATHLERILPQLLLDF Sbjct: 62 CELLKSFVSEAVQRAAIIAEAEGMDKIEATHLERILPQLLLDF 104 >ref|NP_178000.2| uncharacterized protein [Arabidopsis thaliana] gi|22135948|gb|AAM91556.1| unknown protein [Arabidopsis thaliana] gi|24899661|gb|AAN65045.1| 
unknown protein [Arabidopsis thaliana] gi|332198032|gb|AEE36153.1| uncharacterized protein AT1G78790 [Arabidopsis thaliana] Length = 104 Score = 128 bits (321), Expect = 9e-28 Identities = 69/103 (66%), Positives = 78/103 (75%) Frame = -1 Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143 NTFD DLI+ IFK +W RR ER+R+ D +D EV T KK+R SANA ALKLS Sbjct: 5 NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEVALGTTKKNRLASANANALKLS 61 Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 CELL+ FV+EAVQRAA+IAEAEG IEATHLERILPQLLLDF Sbjct: 62 CELLKSFVSEAVQRAAIIAEAEGMEKIEATHLERILPQLLLDF 104 >ref|XP_002889209.1| hypothetical protein ARALYDRAFT_316775 [Arabidopsis lyrata subsp. lyrata] gi|297335050|gb|EFH65468.1| hypothetical protein ARALYDRAFT_316775 [Arabidopsis lyrata subsp. lyrata] Length = 104 Score = 127 bits (320), Expect = 1e-27 Identities = 68/103 (66%), Positives = 78/103 (75%) Frame = -1 Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143 NTFD DLI+ IFK +W RR ER+R+ D +D E+ T KK+R SANA ALKLS Sbjct: 5 NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEIALGTTKKNRLASANANALKLS 61 Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 CELL+ FV+EAVQRAA+IAEAEG IEATHLERILPQLLLDF Sbjct: 62 CELLKSFVSEAVQRAAIIAEAEGMDKIEATHLERILPQLLLDF 104 >gb|EOY22303.1| Centromere protein X isoform 2, partial [Theobroma cacao] Length = 124 Score = 124 bits (310), Expect = 2e-26 Identities = 67/102 (65%), Positives = 78/102 (76%) Frame = -1 Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140 T D DLI IFK +W R+A ER+RN I N D ++EVG T KK+RPTS +LKLS Sbjct: 27 TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTS---NSLKLSS 82 Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILPQLLLDF Sbjct: 83 ELLRIFITEAVQRAATIAEAEGGTEIEATHVERILPQLLLDF 124 >ref|XP_004515852.1| PREDICTED: centromere protein X-like [Cicer arietinum] Length = 103 Score = 123 bits (308), Expect = 3e-26 Identities = 67/104 (64%), Positives = 78/104 (75%) 
Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 E TFD DLI+ I KR+W R ER+ +A N D +EVG + KK+R TSANA ALKL Sbjct: 3 ETTFDCDLIHSIMKRIWTLRTLEREN--VATN-DALESEVGAGSSKKNRTTSANASALKL 59 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 +CELLR+F+TEAVQRA IAEAEG + IEATHLE ILPQLLLDF Sbjct: 60 TCELLRVFITEAVQRAVAIAEAEGDTQIEATHLESILPQLLLDF 103 >gb|ESW27495.1| hypothetical protein PHAVU_003G206900g [Phaseolus vulgaris] Length = 103 Score = 120 bits (302), Expect = 1e-25 Identities = 67/104 (64%), Positives = 75/104 (72%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 E T D DLI+ I KR W RA E + E+ D D +EVG T KK+R TSAN ALKL Sbjct: 3 EVTLDTDLIHSILKRFWTLRALEPENFEVKDAPD---SEVGVGTSKKNRTTSANGNALKL 59 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 +CELLR+F+TEAVQRAA IAEAEGAS IE TH E ILPQLLLDF Sbjct: 60 TCELLRIFITEAVQRAAAIAEAEGASQIEPTHFEIILPQLLLDF 103 >gb|EOY22304.1| Centromere protein X isoform 3 [Theobroma cacao] Length = 132 Score = 120 bits (302), Expect = 1e-25 Identities = 70/126 (55%), Positives = 81/126 (64%), Gaps = 24/126 (19%) Frame = -1 Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTS---------- 170 T D DLI IFK +W R+A ER+RN I N D ++EVG T KK+RPTS Sbjct: 8 TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTSSNFSSHFFPW 66 Query: 169 --------------ANAKALKLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILP 32 ANA +LKLS ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILP Sbjct: 67 MNFINFTLLLYLIAANADSLKLSSELLRIFITEAVQRAATIAEAEGGTEIEATHVERILP 126 Query: 31 QLLLDF 14 QLLLDF Sbjct: 127 QLLLDF 132 >ref|XP_003525610.1| PREDICTED: centromere protein X-like [Glycine max] Length = 103 Score = 119 bits (298), Expect = 4e-25 Identities = 68/104 (65%), Positives = 77/104 (74%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 E TF+ DLI+ I KR W +A ER+ E D D +EVG T KK+R TSANA ALKL Sbjct: 3 EVTFECDLIHSILKRFWTLQALERENVEANDPPD---SEVGVGTSKKNRSTSANANALKL 59 Query: 145 
SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 + ELLR+F+TEAVQRAA IAEAEGAS IE THLE ILPQLLLDF Sbjct: 60 TSELLRIFITEAVQRAATIAEAEGASQIEPTHLEIILPQLLLDF 103 >ref|XP_004138074.1| PREDICTED: centromere protein X-like [Cucumis sativus] gi|449501349|ref|XP_004161344.1| PREDICTED: centromere protein X-like [Cucumis sativus] Length = 106 Score = 118 bits (296), Expect = 7e-25 Identities = 66/106 (62%), Positives = 79/106 (74%), Gaps = 2/106 (1%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEV--GGATPKKSRPTSANAKAL 152 E F DLI+ IFK W RR+ ER++NE +N D + EV G T KKSRP SANA AL Sbjct: 3 ETGFHPDLIHAIFKLEWSRRSLEREKNE--NNPDAMDCEVDAGAGTSKKSRPMSANANAL 60 Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 KLS +L+++F++EAVQRAA IAEAEG S IE THLER+LPQLLLDF Sbjct: 61 KLSSKLVQIFISEAVQRAATIAEAEGISRIEPTHLERVLPQLLLDF 106 >ref|XP_003550858.2| PREDICTED: centromere protein X-like [Glycine max] Length = 104 Score = 116 bits (291), Expect = 3e-24 Identities = 66/104 (63%), Positives = 75/104 (72%) Frame = -1 Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146 E TF+ DLI+ I KR W RA ER+ E D D +EVG T KK+R TSANA ALKL Sbjct: 4 EITFECDLIHSILKRFWTLRALERENVEANDAPD---SEVGVGTSKKNRSTSANANALKL 60 Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14 + ELLR+F+TEAVQRAA AE EGAS +E THLE ILPQLLLDF Sbjct: 61 TSELLRIFITEAVQRAAATAEVEGASQLEPTHLEIILPQLLLDF 104