BLASTX nr result

ID: Catharanthus22_contig00047366 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00047366
         (337 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006476928.1| PREDICTED: uncharacterized protein LOC102609...   141   1e-31
ref|XP_006439980.1| hypothetical protein CICLE_v10022963mg [Citr...   141   1e-31
ref|XP_004236296.1| PREDICTED: centromere protein X-like [Solanu...   137   1e-30
ref|XP_006351435.1| PREDICTED: centromere protein X-like [Solanu...   136   3e-30
ref|XP_002318691.1| hypothetical protein POPTR_0012s09240g [Popu...   135   4e-30
ref|XP_002276097.2| PREDICTED: uncharacterized protein LOC100253...   135   6e-30
emb|CBI14906.3| unnamed protein product [Vitis vinifera]              135   6e-30
gb|EXB94443.1| hypothetical protein L484_018943 [Morus notabilis]     133   3e-29
ref|XP_004299168.1| PREDICTED: centromere protein X-like [Fragar...   133   3e-29
gb|EOY22302.1| Centromere protein X isoform 1 [Theobroma cacao]       132   4e-29
ref|XP_006303039.1| hypothetical protein CARUB_v10021194mg [Caps...   128   9e-28
ref|NP_178000.2| uncharacterized protein [Arabidopsis thaliana] ...   128   9e-28
ref|XP_002889209.1| hypothetical protein ARALYDRAFT_316775 [Arab...   127   1e-27
gb|EOY22303.1| Centromere protein X isoform 2, partial [Theobrom...   124   2e-26
ref|XP_004515852.1| PREDICTED: centromere protein X-like [Cicer ...   123   3e-26
gb|ESW27495.1| hypothetical protein PHAVU_003G206900g [Phaseolus...   120   1e-25
gb|EOY22304.1| Centromere protein X isoform 3 [Theobroma cacao]       120   1e-25
ref|XP_003525610.1| PREDICTED: centromere protein X-like [Glycin...   119   4e-25
ref|XP_004138074.1| PREDICTED: centromere protein X-like [Cucumi...   118   7e-25
ref|XP_003550858.2| PREDICTED: centromere protein X-like [Glycin...   116   3e-24

>ref|XP_006476928.1| PREDICTED: uncharacterized protein LOC102609861 isoform X1 [Citrus
           sinensis] gi|568846168|ref|XP_006476929.1| PREDICTED:
           uncharacterized protein LOC102609861 isoform X2 [Citrus
           sinensis]
          Length = 109

 Score =  141 bits (355), Expect = 1e-31
 Identities = 75/106 (70%), Positives = 83/106 (78%), Gaps = 2/106 (1%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENT--EVGGATPKKSRPTSANAKAL 152
           E TFD DLI+ IFK +W RR+ ER+RN   D M+ E    + G  T KK+RPTSANA AL
Sbjct: 4   ETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLHQAGAGTSKKNRPTSANANAL 63

Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           KLSCELLR+FVTEAVQRAA IAEAEG S IEATHLERILPQLLLDF
Sbjct: 64  KLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109


>ref|XP_006439980.1| hypothetical protein CICLE_v10022963mg [Citrus clementina]
           gi|557542242|gb|ESR53220.1| hypothetical protein
           CICLE_v10022963mg [Citrus clementina]
          Length = 109

 Score =  141 bits (355), Expect = 1e-31
 Identities = 75/106 (70%), Positives = 83/106 (78%), Gaps = 2/106 (1%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENT--EVGGATPKKSRPTSANAKAL 152
           E TFD DLI+ IFK +W RR+ ER+RN   D M+ E    + G  T KK+RPTSANA AL
Sbjct: 4   ETTFDSDLIHAIFKHIWTRRSLERERNGGTDAMESEFLLDQAGAGTSKKNRPTSANANAL 63

Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           KLSCELLR+FVTEAVQRAA IAEAEG S IEATHLERILPQLLLDF
Sbjct: 64  KLSCELLRVFVTEAVQRAAAIAEAEGVSKIEATHLERILPQLLLDF 109


>ref|XP_004236296.1| PREDICTED: centromere protein X-like [Solanum lycopersicum]
          Length = 104

 Score =  137 bits (346), Expect = 1e-30
 Identities = 71/104 (68%), Positives = 86/104 (82%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           ENTFD DL++EIFK VW R+A+ER +NE+++N+D E   VG ++ K+ RPT ANA ALKL
Sbjct: 4   ENTFDPDLVHEIFKLVWKRKAAERGKNELSENIDNE---VGASSSKRIRPTFANANALKL 60

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           S ELLR+FV EA+QRAA IAEAEG+  IEATHLERILPQLLLDF
Sbjct: 61  SSELLRVFVAEAIQRAATIAEAEGSVKIEATHLERILPQLLLDF 104


>ref|XP_006351435.1| PREDICTED: centromere protein X-like [Solanum tuberosum]
          Length = 104

 Score =  136 bits (342), Expect = 3e-30
 Identities = 70/104 (67%), Positives = 86/104 (82%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           ENTFD DL++EIFK VW R+A+ER +NE+++N++ E   VG ++ K+ RPT ANA ALKL
Sbjct: 4   ENTFDPDLVHEIFKLVWKRKAAERGKNELSENIENE---VGASSSKRFRPTFANANALKL 60

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           S ELLR+FV EA+QRAA IAEAEG+  IEATHLERILPQLLLDF
Sbjct: 61  SSELLRVFVAEAIQRAATIAEAEGSVKIEATHLERILPQLLLDF 104


>ref|XP_002318691.1| hypothetical protein POPTR_0012s09240g [Populus trichocarpa]
           gi|118488565|gb|ABK96095.1| unknown [Populus
           trichocarpa] gi|222859364|gb|EEE96911.1| hypothetical
           protein POPTR_0012s09240g [Populus trichocarpa]
          Length = 103

 Score =  135 bits (341), Expect = 4e-30
 Identities = 72/104 (69%), Positives = 81/104 (77%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           E TFD  LI  IFK +W RRA ER++NE     DG + EVG  T KK+R TSAN+ ALKL
Sbjct: 3   EVTFDPGLIQAIFKHIWTRRALEREKNE---GNDGTDCEVGTGTLKKTRTTSANSNALKL 59

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           SCELLR+F+TEAVQR+AMIAEAEGA  IE THLERILPQLLLDF
Sbjct: 60  SCELLRIFITEAVQRSAMIAEAEGAGKIEGTHLERILPQLLLDF 103


>ref|XP_002276097.2| PREDICTED: uncharacterized protein LOC100253596 [Vitis vinifera]
          Length = 323

 Score =  135 bits (340), Expect = 6e-30
 Identities = 73/101 (72%), Positives = 82/101 (81%)
 Frame = -1

Query: 316 FDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCE 137
           FD DLI+ IFK VW R A ER++NE AD ++    EVG AT KK+RPTSANA ALKLSCE
Sbjct: 226 FDPDLIHAIFKLVWSRTALEREKNEGADPLE---CEVGAATSKKNRPTSANANALKLSCE 282

Query: 136 LLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           LLR+FV EAV+RAA IAEAEG + IEATHLERILPQLLLDF
Sbjct: 283 LLRVFVIEAVERAATIAEAEGVNKIEATHLERILPQLLLDF 323


>emb|CBI14906.3| unnamed protein product [Vitis vinifera]
          Length = 179

 Score =  135 bits (340), Expect = 6e-30
 Identities = 73/101 (72%), Positives = 82/101 (81%)
 Frame = -1

Query: 316 FDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCE 137
           FD DLI+ IFK VW R A ER++NE AD ++    EVG AT KK+RPTSANA ALKLSCE
Sbjct: 82  FDPDLIHAIFKLVWSRTALEREKNEGADPLE---CEVGAATSKKNRPTSANANALKLSCE 138

Query: 136 LLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           LLR+FV EAV+RAA IAEAEG + IEATHLERILPQLLLDF
Sbjct: 139 LLRVFVIEAVERAATIAEAEGVNKIEATHLERILPQLLLDF 179


>gb|EXB94443.1| hypothetical protein L484_018943 [Morus notabilis]
          Length = 232

 Score =  133 bits (334), Expect = 3e-29
 Identities = 71/98 (72%), Positives = 77/98 (78%)
 Frame = -1

Query: 307 DLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSCELLR 128
           DLI+ IFK VW RRA ER++NE AD +D   +E G    KKSRPTSAN  ALKLSCE LR
Sbjct: 138 DLIHSIFKLVWTRRALEREKNESADALD---SEAGAGASKKSRPTSANGNALKLSCEFLR 194

Query: 127 LFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           +FVTEAVQRAA IAEAE  S IEATHLERILPQLLLDF
Sbjct: 195 IFVTEAVQRAAAIAEAEDVSKIEATHLERILPQLLLDF 232


>ref|XP_004299168.1| PREDICTED: centromere protein X-like [Fragaria vesca subsp. vesca]
          Length = 106

 Score =  133 bits (334), Expect = 3e-29
 Identities = 73/102 (71%), Positives = 79/102 (77%)
 Frame = -1

Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140
           TF+ DLI+ IFK VW RRA ER   E  + +DGE  +VG  T KK+RP SANA ALKLSC
Sbjct: 6   TFETDLIHAIFKLVWSRRALERQLVEGTEALDGE-VQVGAGTSKKNRPMSANANALKLSC 64

Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           ELLR FVTEAVQRAA IAEAEG   IEATHLERILPQLLLDF
Sbjct: 65  ELLRNFVTEAVQRAAAIAEAEGTDKIEATHLERILPQLLLDF 106


>gb|EOY22302.1| Centromere protein X isoform 1 [Theobroma cacao]
          Length = 108

 Score =  132 bits (333), Expect = 4e-29
 Identities = 69/102 (67%), Positives = 80/102 (78%)
 Frame = -1

Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140
           T D DLI  IFK +W R+A ER+RN I  N D  ++EVG  T KK+RPTS NA +LKLS 
Sbjct: 8   TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTSTNADSLKLSS 66

Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILPQLLLDF
Sbjct: 67  ELLRIFITEAVQRAATIAEAEGGTEIEATHVERILPQLLLDF 108


>ref|XP_006303039.1| hypothetical protein CARUB_v10021194mg [Capsella rubella]
           gi|565490801|ref|XP_006303040.1| hypothetical protein
           CARUB_v10021194mg [Capsella rubella]
           gi|482571749|gb|EOA35937.1| hypothetical protein
           CARUB_v10021194mg [Capsella rubella]
           gi|482571750|gb|EOA35938.1| hypothetical protein
           CARUB_v10021194mg [Capsella rubella]
          Length = 104

 Score =  128 bits (321), Expect = 9e-28
 Identities = 69/103 (66%), Positives = 78/103 (75%)
 Frame = -1

Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143
           NTFD DLI+ IFK +W RR  ER+R+   D +D    EV   T KK+R  SANA ALKLS
Sbjct: 5   NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEVALGTTKKNRLASANANALKLS 61

Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           CELL+ FV+EAVQRAA+IAEAEG   IEATHLERILPQLLLDF
Sbjct: 62  CELLKSFVSEAVQRAAIIAEAEGMDKIEATHLERILPQLLLDF 104


>ref|NP_178000.2| uncharacterized protein [Arabidopsis thaliana]
           gi|22135948|gb|AAM91556.1| unknown protein [Arabidopsis
           thaliana] gi|24899661|gb|AAN65045.1| unknown protein
           [Arabidopsis thaliana] gi|332198032|gb|AEE36153.1|
           uncharacterized protein AT1G78790 [Arabidopsis thaliana]
          Length = 104

 Score =  128 bits (321), Expect = 9e-28
 Identities = 69/103 (66%), Positives = 78/103 (75%)
 Frame = -1

Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143
           NTFD DLI+ IFK +W RR  ER+R+   D +D    EV   T KK+R  SANA ALKLS
Sbjct: 5   NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEVALGTTKKNRLASANANALKLS 61

Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           CELL+ FV+EAVQRAA+IAEAEG   IEATHLERILPQLLLDF
Sbjct: 62  CELLKSFVSEAVQRAAIIAEAEGMEKIEATHLERILPQLLLDF 104


>ref|XP_002889209.1| hypothetical protein ARALYDRAFT_316775 [Arabidopsis lyrata subsp.
           lyrata] gi|297335050|gb|EFH65468.1| hypothetical protein
           ARALYDRAFT_316775 [Arabidopsis lyrata subsp. lyrata]
          Length = 104

 Score =  127 bits (320), Expect = 1e-27
 Identities = 68/103 (66%), Positives = 78/103 (75%)
 Frame = -1

Query: 322 NTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLS 143
           NTFD DLI+ IFK +W RR  ER+R+   D +D    E+   T KK+R  SANA ALKLS
Sbjct: 5   NTFDSDLIHAIFKHIWARRFRERERS---DAIDATEAEIALGTTKKNRLASANANALKLS 61

Query: 142 CELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           CELL+ FV+EAVQRAA+IAEAEG   IEATHLERILPQLLLDF
Sbjct: 62  CELLKSFVSEAVQRAAIIAEAEGMDKIEATHLERILPQLLLDF 104


>gb|EOY22303.1| Centromere protein X isoform 2, partial [Theobroma cacao]
          Length = 124

 Score =  124 bits (310), Expect = 2e-26
 Identities = 67/102 (65%), Positives = 78/102 (76%)
 Frame = -1

Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKLSC 140
           T D DLI  IFK +W R+A ER+RN I  N D  ++EVG  T KK+RPTS    +LKLS 
Sbjct: 27  TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTS---NSLKLSS 82

Query: 139 ELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILPQLLLDF
Sbjct: 83  ELLRIFITEAVQRAATIAEAEGGTEIEATHVERILPQLLLDF 124


>ref|XP_004515852.1| PREDICTED: centromere protein X-like [Cicer arietinum]
          Length = 103

 Score =  123 bits (308), Expect = 3e-26
 Identities = 67/104 (64%), Positives = 78/104 (75%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           E TFD DLI+ I KR+W  R  ER+   +A N D   +EVG  + KK+R TSANA ALKL
Sbjct: 3   ETTFDCDLIHSIMKRIWTLRTLEREN--VATN-DALESEVGAGSSKKNRTTSANASALKL 59

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           +CELLR+F+TEAVQRA  IAEAEG + IEATHLE ILPQLLLDF
Sbjct: 60  TCELLRVFITEAVQRAVAIAEAEGDTQIEATHLESILPQLLLDF 103


>gb|ESW27495.1| hypothetical protein PHAVU_003G206900g [Phaseolus vulgaris]
          Length = 103

 Score =  120 bits (302), Expect = 1e-25
 Identities = 67/104 (64%), Positives = 75/104 (72%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           E T D DLI+ I KR W  RA E +  E+ D  D   +EVG  T KK+R TSAN  ALKL
Sbjct: 3   EVTLDTDLIHSILKRFWTLRALEPENFEVKDAPD---SEVGVGTSKKNRTTSANGNALKL 59

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           +CELLR+F+TEAVQRAA IAEAEGAS IE TH E ILPQLLLDF
Sbjct: 60  TCELLRIFITEAVQRAAAIAEAEGASQIEPTHFEIILPQLLLDF 103


>gb|EOY22304.1| Centromere protein X isoform 3 [Theobroma cacao]
          Length = 132

 Score =  120 bits (302), Expect = 1e-25
 Identities = 70/126 (55%), Positives = 81/126 (64%), Gaps = 24/126 (19%)
 Frame = -1

Query: 319 TFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTS---------- 170
           T D DLI  IFK +W R+A ER+RN I  N D  ++EVG  T KK+RPTS          
Sbjct: 8   TLDPDLIGAIFKHIWARKAHERERNGI-QNTDALDSEVGAGTSKKNRPTSSNFSSHFFPW 66

Query: 169 --------------ANAKALKLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILP 32
                         ANA +LKLS ELLR+F+TEAVQRAA IAEAEG + IEATH+ERILP
Sbjct: 67  MNFINFTLLLYLIAANADSLKLSSELLRIFITEAVQRAATIAEAEGGTEIEATHVERILP 126

Query: 31  QLLLDF 14
           QLLLDF
Sbjct: 127 QLLLDF 132


>ref|XP_003525610.1| PREDICTED: centromere protein X-like [Glycine max]
          Length = 103

 Score =  119 bits (298), Expect = 4e-25
 Identities = 68/104 (65%), Positives = 77/104 (74%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           E TF+ DLI+ I KR W  +A ER+  E  D  D   +EVG  T KK+R TSANA ALKL
Sbjct: 3   EVTFECDLIHSILKRFWTLQALERENVEANDPPD---SEVGVGTSKKNRSTSANANALKL 59

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           + ELLR+F+TEAVQRAA IAEAEGAS IE THLE ILPQLLLDF
Sbjct: 60  TSELLRIFITEAVQRAATIAEAEGASQIEPTHLEIILPQLLLDF 103


>ref|XP_004138074.1| PREDICTED: centromere protein X-like [Cucumis sativus]
           gi|449501349|ref|XP_004161344.1| PREDICTED: centromere
           protein X-like [Cucumis sativus]
          Length = 106

 Score =  118 bits (296), Expect = 7e-25
 Identities = 66/106 (62%), Positives = 79/106 (74%), Gaps = 2/106 (1%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEV--GGATPKKSRPTSANAKAL 152
           E  F  DLI+ IFK  W RR+ ER++NE  +N D  + EV  G  T KKSRP SANA AL
Sbjct: 3   ETGFHPDLIHAIFKLEWSRRSLEREKNE--NNPDAMDCEVDAGAGTSKKSRPMSANANAL 60

Query: 151 KLSCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           KLS +L+++F++EAVQRAA IAEAEG S IE THLER+LPQLLLDF
Sbjct: 61  KLSSKLVQIFISEAVQRAATIAEAEGISRIEPTHLERVLPQLLLDF 106


>ref|XP_003550858.2| PREDICTED: centromere protein X-like [Glycine max]
          Length = 104

 Score =  116 bits (291), Expect = 3e-24
 Identities = 66/104 (63%), Positives = 75/104 (72%)
 Frame = -1

Query: 325 ENTFDRDLINEIFKRVWIRRASERDRNEIADNMDGENTEVGGATPKKSRPTSANAKALKL 146
           E TF+ DLI+ I KR W  RA ER+  E  D  D   +EVG  T KK+R TSANA ALKL
Sbjct: 4   EITFECDLIHSILKRFWTLRALERENVEANDAPD---SEVGVGTSKKNRSTSANANALKL 60

Query: 145 SCELLRLFVTEAVQRAAMIAEAEGASTIEATHLERILPQLLLDF 14
           + ELLR+F+TEAVQRAA  AE EGAS +E THLE ILPQLLLDF
Sbjct: 61  TSELLRIFITEAVQRAAATAEVEGASQLEPTHLEIILPQLLLDF 104

