BLASTX nr result
ID: Catharanthus22_contig00030443
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00030443
         (481 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EMJ22310.1| hypothetical protein PRUPE_ppa017892mg [Prunus pe...   86   5e-15
gb|EXC04711.1| Transcription factor [Morus notabilis]                83   4e-14
ref|XP_004148201.1| PREDICTED: uncharacterized protein LOC101207...   81   1e-13
ref|XP_002528315.1| r2r3-myb transcription factor, putative [Ric...   79   6e-13
ref|XP_006435772.1| hypothetical protein CICLE_v10032317mg [Citr...   76   5e-12
ref|XP_002272706.1| PREDICTED: uncharacterized protein LOC100260...   76   5e-12
ref|XP_002311241.1| hypothetical protein POPTR_0008s07100g [Popu...   74   2e-11
ref|XP_003518439.1| PREDICTED: transcription factor MYB44 [Glyci...   63   5e-08
gb|ABH02886.1| MYB transcription factor MYB166 [Glycine max]          63   5e-08
gb|EOY17728.1| Myb domain protein 73 [Theobroma cacao]                61   2e-07
ref|XP_006595859.1| PREDICTED: transcription factor MYB44-like [...   57   3e-06
gb|ESW13745.1| hypothetical protein PHAVU_008G222600g [Phaseolus...   56   4e-06

>gb|EMJ22310.1| hypothetical protein PRUPE_ppa017892mg [Prunus persica]
          Length = 239

 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 63/158 (39%), Positives = 82/158 (51%), Gaps = 13/158 (8%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVSSAA-----------AKRQCLXXXXXXXXXXXXXXXX 333
           NAIKNHWNSTL+R+RQ+ SS + KRQCL
Sbjct: 99  NAIKNHWNSTLRRRRQLAELSSASSDSNSVAQIRYEPGMKRQCLRASPEP---------- 148

Query: 332 GDDFEEM--GLGLIDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEKEEQRSTAEIEEN 159
           D F+ G G++ ET L+LLPPG E +L S+D +E + E+EE
Sbjct: 149 -DSFKADVGGDGVV---ETSLTLLPPGEKEENEVILKSEDH------DDEGKCAVEMEET 198

Query: 158 CLVTIMQRMIAHEVRSYIDKIRSQGGLQIGPGFQSEGL 45
           CL+TIMQRMIA EVR+Y+D +R++ G G QS GL
Sbjct: 199 CLLTIMQRMIAQEVRNYVDGLRAEAG-PTSFGLQSAGL 235

>gb|EXC04711.1| Transcription factor [Morus notabilis]
          Length = 247

 Score = 82.8 bits (203), Expect = 4e-14
 Identities = 58/153 (37%), Positives = 80/153 (52%), Gaps = 5/153 (3%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVS-SAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLG 303
           N IKNHWNSTL+R+R +E P S S+ + + E
Sbjct: 100 NGIKNHWNSTLRRRRSGATENPSSPSSPSSSSDVAAAEEENSEPVLKRQRIGASPERNR- 158

Query: 302 LIDGPETCLSLLPPGGGTL----VESLLMSDDQRKGGSEKEEQRSTAEIEENCLVTIMQR 135
           +DG ET L+L PG T+ E + +SD G ++E +R E+EE CL+ +MQR
Sbjct: 159 -LDGIETTLTLSLPGDKTVPVPAAEEMPVSD---VGLKKEEGERGAVEMEEKCLIALMQR 214

Query: 134 MIAHEVRSYIDKIRSQGGLQIGPGFQSEGLNGP 36
           M+A EVR+YID +R++G L G G QS NGP
Sbjct: 215 MVAQEVRNYIDGLRAKGAL--GFGLQSAAQNGP 245

>ref|XP_004148201.1| PREDICTED: uncharacterized protein LOC101207929 [Cucumis sativus]
 gi|449507813|ref|XP_004163135.1| PREDICTED: uncharacterized LOC101207929 [Cucumis sativus]
          Length = 284

 Score = 81.3 bits (199), Expect = 1e-13
 Identities = 60/162 (37%), Positives = 79/162 (48%), Gaps = 23/162 (14%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGL-- 306
           NAIKNHWNSTL+R+R D S+A KR DD E L
Sbjct: 115 NAIKNHWNSTLRRRRDADLSSD-STAFLKR----PSYEVSRSASDDDDNDDDDSEASLKR 169

Query: 305 -----GLIDG--PETCLSLLPPGGGTLV--------ESLLMSDDQRKG------GSEKEE 189
           + G PET L L PG + E + + ++ G +E+E+
Sbjct: 170 TCFDRNSVGGGEPETSLRLSLPGEVVVAAEMDVKVKEEVTVESEKNDGRRRVVAAAEEEK 229

Query: 188 QRSTAEIEENCLVTIMQRMIAHEVRSYIDKIRSQGGLQIGPG 63
           E++E+CL TIMQRMIA EVR+YID +R++GGL IGPG
Sbjct: 230 GNRKKEVDESCLATIMQRMIAQEVRNYIDSLRARGGLGIGPG 271

>ref|XP_002528315.1| r2r3-myb transcription factor, putative [Ricinus communis]
 gi|223532270|gb|EEF34073.1| r2r3-myb transcription factor, putative [Ricinus communis]
          Length = 264

 Score = 79.0 bits (193), Expect = 6e-13
 Identities = 59/165 (35%), Positives = 78/165 (47%), Gaps = 32/165 (19%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKR------------------------QIDSEFPV-SSAAAKRQCLXX 375
           NAIKNHWNSTL+RKR + DSE S+AAAKRQCL
Sbjct: 103 NAIKNHWNSTLRRKRIAEFSSASSESNSAIKRLSLDGDSESDSESGSDSAAAAKRQCLGG 162

Query: 374 XXXXXXXXXXXXXXGDDFEEMGLGLIDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEK 195
           D G + PET L+L PPG G + + + + + E
Sbjct: 163 SGCTE----------DSSFNGGDAKVVEPETLLTLSPPGDGFVTAAAAIGEKTEEEEEEV 212

Query: 194 E-------EQRSTAEIEENCLVTIMQRMIAHEVRSYIDKIRSQGG 81
           E R E EE+CL+TIM++MIA EVR+Y+D++R+Q G
Sbjct: 213 EVVKGGGDNGRLVREEEESCLLTIMRKMIAVEVRNYVDRLRAQDG 257

>ref|XP_006435772.1| hypothetical protein CICLE_v10032317mg [Citrus clementina]
 gi|568865881|ref|XP_006486296.1| PREDICTED: myb protein-like [Citrus sinensis]
 gi|557537968|gb|ESR49012.1| hypothetical protein CICLE_v10032317mg [Citrus clementina]
          Length = 285

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 65/178 (36%), Positives = 81/178 (45%), Gaps = 43/178 (24%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQID------------------------SEFPVSSAAAKRQCLXXX 372
           NAIKNHWNSTL+RKR + +E S KRQCL
Sbjct: 114 NAIKNHWNSTLRRKRMAELSSASSESNSVMMKRPGFENNISSAESDSDSGGIKRQCLRVS 173

Query: 371 XXXXXXXXXXXXXGDDFEEMGLGLIDGPETCLSLLPPG--------GGTLVES---LLMS 225
           D + G++ GPET L+L PPG GG VE L ++
Sbjct: 174 QEH------------DSYNVEAGIV-GPETLLTLSPPGESVVLGGSGGEKVEDEEKLKIN 220

Query: 224 ---DDQRKGGS---EKEEQRSTA--EIEENCLVTIMQRMIAHEVRSYIDKIRSQGGLQ 75
           D GG EK E R T E+EE+CL TIM+RMIA EVR+ D +R+Q GL+
Sbjct: 221 KECDGDGDGGKIEKEKGEDRCTVDMEMEESCLFTIMRRMIAEEVRNQFDGLRAQAGLR 278

>ref|XP_002272706.1| PREDICTED: uncharacterized protein LOC100260493 [Vitis vinifera]
          Length = 253

 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 54/156 (34%), Positives = 76/156 (48%), Gaps = 8/156 (5%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKR--QIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGL 306
           NAIKNHWNSTL+R+R ++ S S++A KR E
Sbjct: 98  NAIKNHWNSTLRRRRVGELSSASSESNSAMKRPSFDATVSSESDSGLKRQLVGASPEHNS 157

Query: 305 --GLIDGPETCLSLLPPGGG--TLVESLLMSDDQRKGG--SEKEEQRSTAEIEENCLVTI 144
           + PET L+L PPG TL + D + +G + KE +R E+ + CLV I
Sbjct: 158 CDRVKAEPETSLTLSPPGDSVVTLPVGDRVEDARDEGARITRKEGERCAVEMADTCLVKI 217

Query: 143 MQRMIAHEVRSYIDKIRSQGGLQIGPGFQSEGLNGP 36
           +QRMIA EVR+Y +R++ GL++ P N P
Sbjct: 218 IQRMIAEEVRNYFIALRAEDGLRVRPELDPAAQNNP 253

>ref|XP_002311241.1| hypothetical protein POPTR_0008s07100g [Populus trichocarpa]
 gi|222851061|gb|EEE88608.1| hypothetical protein POPTR_0008s07100g [Populus trichocarpa]
          Length = 259

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 52/158 (32%), Positives = 80/158 (50%), Gaps = 21/158 (13%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKR-QIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLG 303
           NAIKNHWNSTL+RKR + S S++ KR L G
Sbjct: 99  NAIKNHWNSTLRRKRGSVSSASSESNSVFKRSTLEVSVVSESESDSGSKRQCLHASPGHN 158

Query: 302 LI------DGPETCLSLLPPGGGTLVESLLMSDDQRKG----GSEK----------EEQR 183
           + DGPET L+L PPG G + S+ +++ ++G G EK E+ R
Sbjct: 159 SVNGDVGVDGPETSLTLSPPGDGFV--SMAVAEKLKEGVAVNGREKDLGESIMKDAEKIR 216

Query: 182 STAEIEENCLVTIMQRMIAHEVRSYIDKIRSQGGLQIG 69
           T E++E+C+ ++Q++I EVR Y D+++++ G+ IG
Sbjct: 217 CTEEMDEDCVRALIQKIIQEEVRIYFDRLKTRNGVTIG 254

>ref|XP_003518439.1| PREDICTED: transcription factor MYB44 [Glycine max]
          Length = 247

 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 49/148 (33%), Positives = 73/148 (49%), Gaps = 9/148 (6%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLGL 300
           NAIKNHWNSTL+R+R ++S SSA+ + ++ EE L
Sbjct: 96  NAIKNHWNSTLRRRRAVESS-SSSSASPPAKRHSSLFDTLHPFKKQCIEKENEEERVLSP 154

Query: 299 IDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEKEE------QRSTAEIEENCLVTIMQ 138
           + T LSL PPG + E ++Q + EKEE + ++N + +MQ
Sbjct: 155 V---TTSLSLFPPGEKSEEEE--EEEEQEEEEKEKEELFQVNVNVNQKVTDQNYFMQMMQ 209

Query: 137 RMIAHEVRSYIDKIRS---QGGLQIGPG 63
           RMIA EVR+Y++ +R+ GL + PG
Sbjct: 210 RMIAEEVRNYMETLRNCQRNNGLSLEPG 237

>gb|ABH02886.1| MYB transcription factor MYB166 [Glycine max]
          Length = 179

 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 49/148 (33%), Positives = 73/148 (49%), Gaps = 9/148 (6%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLGL 300
           NAIKNHWNSTL+R+R ++S SSA+ + ++ EE L
Sbjct: 28  NAIKNHWNSTLRRRRAVESS-SSSSASPPAKRHSSLFDTLHPFKKQCIEKENEEERVLSP 86

Query: 299 IDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEKEE------QRSTAEIEENCLVTIMQ 138
           + T LSL PPG + E ++Q + EKEE + ++N + +MQ
Sbjct: 87  V---TTSLSLFPPGEKSEEEE--EEEEQEEEEKEKEELFQVNVNVNQKVTDQNYFMQMMQ 141

Query: 137 RMIAHEVRSYIDKIRS---QGGLQIGPG 63
           RMIA EVR+Y++ +R+ GL + PG
Sbjct: 142 RMIAEEVRNYMETLRNCQRNNGLSLEPG 169

>gb|EOY17728.1| Myb domain protein 73 [Theobroma cacao]
          Length = 236

 Score = 60.8 bits (146), Expect = 2e-07
 Identities = 45/138 (32%), Positives = 64/138 (46%), Gaps = 18/138 (13%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVS---SAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMG 309
           NA+KNHWNSTL+RKR + S ++A KR + E +
Sbjct: 96  NAVKNHWNSTLRRKRAAELSSGSSESNNSAVKRWSSQDASESDSGNKRQCLRVEVHENVE 155

Query: 308 LGLIDGPETCLSLLPPGGGTLV------------ESLLMSDDQ---RKGGSEKEEQRSTA 174
           GP+T L+L PPG + E ++ D++ R GG EE++
Sbjct: 156 FV---GPKTLLTLSPPGESVVSGHMEEKVEDEEEEEVVKRDEEGGGRGGGGGGEEEKRRV 212

Query: 173 EIEENCLVTIMQRMIAHE 120
           E++E CL+TIMQRMI E
Sbjct: 213 EMKETCLLTIMQRMIKEE 230

>ref|XP_006595859.1| PREDICTED: transcription factor MYB44-like [Glycine max]
          Length = 228

 Score = 57.0 bits (136), Expect = 3e-06
 Identities = 41/131 (31%), Positives = 61/131 (46%), Gaps = 1/131 (0%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQID-SEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLG 303
           NAIKNHWNSTL+R+R + S +S AKR ++ +
Sbjct: 99  NAIKNHWNSTLRRRRTAEQSSSSSASPPAKRPSSLFDTLHPLKKQCIEKENEEESPV--- 155

Query: 302 LIDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEKEEQRSTAEIEENCLVTIMQRMIAH 123
           T LSL PPG ++ + EKEE + ++N + +MQRMIA
Sbjct: 156 -----TTSLSLFPPGE---------KSEEEEEEEEKEEFANQKVSDQNYFMQMMQRMIAE 201

Query: 122 EVRSYIDKIRS 90
           EVR+Y++ +R+
Sbjct: 202 EVRNYMETLRN 212

>gb|ESW13745.1| hypothetical protein PHAVU_008G222600g [Phaseolus vulgaris]
          Length = 239

 Score = 56.2 bits (134), Expect = 4e-06
 Identities = 45/140 (32%), Positives = 66/140 (47%), Gaps = 9/140 (6%)
 Frame = -3

Query: 479 NAIKNHWNSTLKRKRQIDSEFPVSSAAAKRQCLXXXXXXXXXXXXXXXXGDDFEEMGLGL 300
           NAIKNHWNSTL+R+ + S+A+ + ++ E M + L
Sbjct: 97  NAIKNHWNSTLRRRGTTEHSSSSSAASPSTKRPSPLELCYPLKKQRVEKENEEECMAIPL 156

Query: 299 IDGPETCLSLLPPGGGTLVESLLMSDDQRKGGSEKEEQRSTAEIEENCLVT--------- 147
           + LSL P G + E +M ++ EKEE+R E E N + T
Sbjct: 157 MKP----LSLFPSGEKS--EGEMMEEE------EKEEEREVEEFEVNNINTASDQDYFLQ 204

Query: 146 IMQRMIAHEVRSYIDKIRSQ 87
           +MQ+MIA EVR+Y+D +R Q
Sbjct: 205 MMQQMIADEVRNYVDSLRHQ 224