BLASTX nr result
ID: Cephaelis21_contig00009741
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00009741 (1456 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002278063.1| PREDICTED: transcription factor MYB39 [Vitis... 395 e-107 ref|XP_002321627.1| predicted protein [Populus trichocarpa] gi|2... 384 e-104 ref|XP_002318064.1| predicted protein [Populus trichocarpa] gi|2... 382 e-103 ref|XP_002511336.1| r2r3-myb transcription factor, putative [Ric... 374 e-101 ref|XP_003525561.1| PREDICTED: uncharacterized protein LOC100811... 343 8e-92 >ref|XP_002278063.1| PREDICTED: transcription factor MYB39 [Vitis vinifera] Length = 367 Score = 395 bits (1015), Expect = e-107 Identities = 220/389 (56%), Positives = 264/389 (67%), Gaps = 6/389 (1%) Frame = -1 Query: 1321 MGRHSCCVKQKLRKGLWSPEEDEKLLNYITRYGVGCWSSVPKLAGLQRCGKSCRLRWINY 1142 MGRHSCC+KQKLRKGLWSPEEDEKL NYITR+GVGCWSSVPKLAGLQRCGKSCRLRWINY Sbjct: 1 MGRHSCCLKQKLRKGLWSPEEDEKLYNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINY 60 Query: 1141 LRPDLKRGMFSQQEEDLILRLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSSLKKRLLKQG 962 LRPDLKRGMFSQQEED+I+ LH+VLGNRWAQIAAQLPGRTDNEIKNFWNS LKK+LLKQG Sbjct: 61 LRPDLKRGMFSQQEEDIIISLHQVLGNRWAQIAAQLPGRTDNEIKNFWNSCLKKKLLKQG 120 Query: 961 IDPNTHQPLSEVQVRDEENRTTDHKGSFQIPFSITDQLPHLXXXXXXXSGMDREFQMRSA 782 +DPNTH+PL+E +V D +N T K S Q+ Q L + ++ F + ++ Sbjct: 121 MDPNTHKPLNETEVGDGKNCT--EKASLQVL-----QPKGLPAVPSSAAEFEQPFMVNNS 173 Query: 781 IYNVGGEMDTSIDQPMVSKQLFDPLLMLEFQPNIHPSGYHPNFLPQCEQIVRPNNDHHDQ 602 GG + S Q ++K FDP+ EFQ + P GY N L Q Q +RP DQ Sbjct: 174 SCYDGGLTEGSRVQ-FMNKPGFDPMSFFEFQAGVDPMGYSSNLLSQYHQTIRP----FDQ 228 Query: 601 NEFEEGISGGYDAFSSMPTLTNFEQTSMAETDFSDSSTSRMTSLMLNEAKEXXXXXXXXX 422 N+ E + G F+S+P LTNF+Q ++ ETDFSD+S SRM S NEAKE Sbjct: 229 NQLEANSNVG---FASLPGLTNFDQGNLTETDFSDNSASRMGSFFFNEAKE-----SSSN 280 Query: 421 XXXXXXXXXXXQINKMAANVDAFSWNAVNKFDSMFEYHVNGIKSEEGQL------QIQTH 260 QIN M NV AFSW+A NK +++F+Y ++GIKSEE + Q+ + Sbjct: 281 SSNITSHPAGFQINNMGENV-AFSWDAENKLEALFQYQISGIKSEELKPSSWYGDQVHSQ 339 Query: 259 ISGDFSNYPLTSLSDEDLSGASLGVLHQI 173 S DFSNYPLTSLS EDL+GAS V Q+ Sbjct: 340 NSEDFSNYPLTSLS-EDLNGASFDVFQQM 367 >ref|XP_002321627.1| predicted protein [Populus trichocarpa] gi|222868623|gb|EEF05754.1| predicted protein [Populus trichocarpa] Length = 371 Score = 384 bits (985), Expect = e-104 Identities = 220/395 (55%), Positives = 269/395 (68%), Gaps = 12/395 (3%) Frame = -1 Query: 1321 MGRHSCCVKQKLRKGLWSPEEDEKLLNYITRYGVGCWSSVPKLAGLQRCGKSCRLRWINY 1142 MGRHSCC+KQKLRKGLWSPEEDEKLLNYITR+GVGCWSSVPKLAGLQRCGKSCRLRWINY Sbjct: 1 MGRHSCCLKQKLRKGLWSPEEDEKLLNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINY 60 Query: 1141 LRPDLKRGMFSQQEEDLILRLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSSLKKRLLKQG 962 LRPDLKRGMFSQQEEDLI+ LHEVLGNRWAQIAAQLPGRTDNEIKN WNS LKK+L+KQG Sbjct: 61 LRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNLWNSYLKKKLMKQG 120 Query: 961 IDPNTHQPLSEVQVRDEENRTTDHKGSFQIPFSITDQLPHLXXXXXXXSGMDREFQMRSA 782 IDP TH+PL +V V++E++ T K SFQIP S LP + + F + Sbjct: 121 IDPTTHKPLCQVGVKEEKDCT--EKASFQIPQS--KGLP----IVSNFTAQEPAFLINDT 172 Query: 781 IYNVGGEMDTSIDQPMVSKQLFDPLLMLEFQPNIHPSGYHPNFLPQCEQIVRPNNDHHDQ 602 YN G + S +Q ++KQ +DPL EF I +GY+P+ + P DQ Sbjct: 173 TYNSSGLPEVSREQ-FLNKQAYDPLSYFEFPAGIDLTGYNPSL----SSVYHPTVRSLDQ 227 Query: 601 NEFEEGISGGYDAFSSMPTLTNFEQTSMAETDFSDSSTSRMTSLMLNEAKEXXXXXXXXX 422 N+FE + G F+SMP+LT+F+ SM+ TDFSD+S SRM+S+ LNEAKE Sbjct: 228 NQFETSSNFG---FTSMPSLTSFDHGSMSGTDFSDNSASRMSSMFLNEAKE------SSS 278 Query: 421 XXXXXXXXXXXQINKMAANVDAF-SWNA-VNKFDSMFEYH-VNGIKSEE---------GQ 278 Q+N M N AF SW++ +K +S+F+YH VNG+K+EE G+ Sbjct: 279 NSSNISNYAGYQMNNMVENAAAFSSWDSDDHKLESVFQYHQVNGVKTEELKPSPWHEAGR 338 Query: 277 LQIQTHISGDFSNYPLTSLSDEDLSGASLGVLHQI 173 L + S DF++YPLTSLS ED++GA+ V HQI Sbjct: 339 LHTHQN-SVDFNSYPLTSLS-EDITGANFDVFHQI 371 >ref|XP_002318064.1| predicted protein [Populus trichocarpa] gi|222858737|gb|EEE96284.1| predicted protein [Populus trichocarpa] Length = 370 Score = 382 bits (982), Expect = e-103 Identities = 221/394 (56%), Positives = 266/394 (67%), Gaps = 11/394 (2%) Frame = -1 Query: 1321 MGRHSCCVKQKLRKGLWSPEEDEKLLNYITRYGVGCWSSVPKLAGLQRCGKSCRLRWINY 1142 MGRHSCC+KQKLRKGLWSPEEDE+L NYITR+GVGCWSSVPKLAGLQRCGKSCRLRWINY Sbjct: 1 MGRHSCCLKQKLRKGLWSPEEDERLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINY 60 Query: 1141 LRPDLKRGMFSQQEEDLILRLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSSLKKRLLKQG 962 LRPDLKRGMFSQQEEDLI+ HEVLGNRWAQIAAQLPGRTDNEIKNFWNS LKK+L+KQG Sbjct: 61 LRPDLKRGMFSQQEEDLIISFHEVLGNRWAQIAAQLPGRTDNEIKNFWNSCLKKKLMKQG 120 Query: 961 IDPNTHQPLSEVQVRDEENRTTDHKGSFQIPFSITDQLPHLXXXXXXXSGMDREFQMRSA 782 IDP TH+PLS+V+V++E+ T K SFQIP S LP L S + F + Sbjct: 121 IDPATHKPLSQVEVKEEKICT--EKASFQIPQS--KGLPIL----SNFSAPEPAFIINDT 172 Query: 781 IYNVGGEMDTSIDQPMVSKQLFDPLLMLEFQPNIHPSGYHPNFLPQCEQIVRPNNDHHDQ 602 YN G + S +Q ++KQ +DP+ EF P+I P+GY+ N VRP DQ Sbjct: 173 AYNSSGLTEASREQ-FINKQAYDPIAYFEFPPSIVPTGYNSNLSSVYHPTVRP----LDQ 227 Query: 601 NEFEEGISGGYDAFSSMPTLTNFEQTSMAETDFSDSSTSRMTSLMLNEAKEXXXXXXXXX 422 N+FE + F+SMP+LT+F+ SM+ TDFSD+S SRM+S+ LNEAKE Sbjct: 228 NQFE---TSSNFVFTSMPSLTSFDHGSMSGTDFSDNSASRMSSMFLNEAKE------SSS 278 Query: 421 XXXXXXXXXXXQINKMAANVDAF-SWNAVNKFDSMFEYH-VNGIKSEE---------GQL 275 Q++ M N F SW++ +K +S+F+YH VNGIK+ E GQL Sbjct: 279 NSSNISNYAGYQMSNMVENAAGFSSWDSDDKLESVFQYHQVNGIKTGELKPSPWHDAGQL 338 Query: 274 QIQTHISGDFSNYPLTSLSDEDLSGASLGVLHQI 173 + S DFS+ PL SLS EDL GA+ HQI Sbjct: 339 HTHQN-SVDFSSCPLKSLS-EDLKGANFDGFHQI 370 >ref|XP_002511336.1| r2r3-myb transcription factor, putative [Ricinus communis] gi|223550451|gb|EEF51938.1| r2r3-myb transcription factor, putative [Ricinus communis] Length = 378 Score = 374 bits (959), Expect = e-101 Identities = 215/399 (53%), Positives = 262/399 (65%), Gaps = 16/399 (4%) Frame = -1 Query: 1321 MGRHSCCVKQKLRKGLWSPEEDEKLLNYITRYGVGCWSSVPKLAGLQRCGKSCRLRWINY 1142 M RHSCC+KQKLRKGLWSPEEDEKL NYITR+GVGCWSSVPKLAGLQRCGKSCRLRWINY Sbjct: 1 MRRHSCCLKQKLRKGLWSPEEDEKLYNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINY 60 Query: 1141 LRPDLKRGMFSQQEEDLILRLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSSLKKRLLKQG 962 LRPDLKRGMFSQQEEDLI+ LHEVLGNRWAQIAAQLPGRTDNEIKNFWNS LKK+L+KQG Sbjct: 61 LRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSCLKKKLMKQG 120 Query: 961 IDPNTHQPL---SEVQVRDEENRTTDHKGSFQIPFSITDQLPHLXXXXXXXSGMDREFQM 791 IDP TH+P+ E +V+DE + DHK S + + L + + F + Sbjct: 121 IDPTTHKPIITGHETEVKDERD-CMDHKESISLQIPQSKGLLSSSSSSIISNVQEPTFLI 179 Query: 790 R-SAIYNVGGEMDTSIDQPMVS------KQLFDPLLMLEFQPNIHPSGYHPNFLPQCEQI 632 + Y G +TS +Q +++ KQ +DPL EF ++ P GY+ N Sbjct: 180 NDTTTYYSNGLTETSREQFIMANSNNNKKQAYDPLSYFEFPASVEPGGYNYN-------- 231 Query: 631 VRPNND---HHDQNEFEEGISGGYDAFSSMPTLTNFEQTSMAETDFSDSSTSRMTSLMLN 461 NN DQN+FE + AF+SMP+L +F+ S++ TDFSDSS SR++S+ LN Sbjct: 232 ---NNSTIRMFDQNQFE---TSSNFAFTSMPSLASFDHGSISATDFSDSSASRLSSMFLN 285 Query: 460 -EAKEXXXXXXXXXXXXXXXXXXXXQINKMAANVDAFSWNAVNKFDSMFEYHVNGIKSEE 284 +AKE INKM + AFSW+ NKFD+MF++ VNGIK+EE Sbjct: 286 DQAKE------SSSNSSNISNYTGYHINKMVEDNAAFSWDTDNKFDAMFQFPVNGIKTEE 339 Query: 283 --GQLQIQTHISGDFSNYPLTSLSDEDLSGASLGVLHQI 173 + T S DFS+YPLTSLS EDL+GA+ V HQI Sbjct: 340 LRQSSRQDTQHSVDFSSYPLTSLS-EDLTGANFDVFHQI 377 >ref|XP_003525561.1| PREDICTED: uncharacterized protein LOC100811140 [Glycine max] Length = 370 Score = 343 bits (879), Expect = 8e-92 Identities = 202/392 (51%), Positives = 249/392 (63%), Gaps = 9/392 (2%) Frame = -1 Query: 1321 MGRHSCCVKQKLRKGLWSPEEDEKLLNYITRYGVGCWSSVPKLAGLQRCGKSCRLRWINY 1142 MGRHSCCVKQKLRKGLWSPEEDEKL NYITR+GVGCWSSVPKLAGLQRCGKSCRLRWINY Sbjct: 1 MGRHSCCVKQKLRKGLWSPEEDEKLFNYITRFGVGCWSSVPKLAGLQRCGKSCRLRWINY 60 Query: 1141 LRPDLKRGMFSQQEEDLILRLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSSLKKRLLKQG 962 LRPDLKRGMFSQQEEDLI+ LHEVLGNRWAQIAAQLPGRTDNEIKNFWNS LKK+LLKQG Sbjct: 61 LRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNSCLKKKLLKQG 120 Query: 961 IDPNTHQPLSEVQVRDEENRTTDHKGSFQIPFSITDQLPHLXXXXXXXSGMDREFQMRSA 782 IDP+TH+PL+E V++E + Q P S +P + S + + Sbjct: 121 IDPSTHKPLTEAHVKEE--KKIIETSPMQTPLSQGPSVPLI-----FPSSQGSSLLISDS 173 Query: 781 IYNVGGEMDTSIDQPMVSKQLFDPLLMLEFQPNIHPSGYHPNFLPQCEQIVRPNNDHHDQ 602 Y GG + S + ++K DPL +F + SG++ LP + + + DQ Sbjct: 174 SYYDGGLTEAS-REIFMTKPALDPLSYYDFPMGVAQSGFN---LPVSQ--YQTSLKASDQ 227 Query: 601 NEFEEGISGGYDAFSSMPTLTNFEQTSMAETDFSD-SSTSRMTSLMLNEAKEXXXXXXXX 425 N F G + Y FSSMP+LTN + +++ T+FSD +S S+++SL +N+ + Sbjct: 228 NPF--GPNSSY-VFSSMPSLTNSDHGNVSVTEFSDNNSASKISSLFMND--QVKESSSNS 282 Query: 424 XXXXXXXXXXXXQINKMAANVDAFSWNAVNKFDSMFEYHVNGIKS--------EEGQLQI 269 I+ M N FSW NKFD +F++ VN KS EEGQL Sbjct: 283 SNLSTIYHGGGCHISSMMENA-GFSWEGDNKFDPLFQFQVNATKSEDFKTSSWEEGQLHT 341 Query: 268 QTHISGDFSNYPLTSLSDEDLSGASLGVLHQI 173 Q I DF+++PLTSLS EDL+GA+ V I Sbjct: 342 QNSI--DFTSFPLTSLS-EDLTGANFDVFQHI 370