BLASTX nr result
ID: Catharanthus23_contig00003004
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00003004 (1464 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_004245311.1| PREDICTED: uncharacterized protein LOC101249...  218  5e-54
ref|XP_006355199.1| PREDICTED: uncharacterized protein LOC102580...  213  2e-52
emb|CBI20803.3| unnamed protein product [Vitis vinifera]  185  4e-44
ref|XP_002283217.1| PREDICTED: uncharacterized protein LOC100255...  185  4e-44
ref|XP_002330777.1| predicted protein [Populus trichocarpa]  172  3e-40
ref|XP_002332103.1| predicted protein [Populus trichocarpa]  162  3e-37
ref|XP_006371828.1| hypothetical protein POPTR_0018s04010g [Popu...  161  6e-37
gb|EMJ28290.1| hypothetical protein PRUPE_ppa025574mg [Prunus pe...  160  2e-36
ref|XP_004136441.1| PREDICTED: uncharacterized protein LOC101210...  158  5e-36
ref|XP_006412613.1| hypothetical protein EUTSA_v10025787mg [Eutr...  157  8e-36
ref|NP_194855.2| sequence-specific DNA binding transcription fac...  156  2e-35
ref|XP_002867306.1| predicted protein [Arabidopsis lyrata subsp....  156  2e-35
emb|CAA16530.1| hypothetical protein [Arabidopsis thaliana] gi|7...  154  7e-35
ref|XP_006284107.1| hypothetical protein CARUB_v10005240mg [Caps...  154  1e-34
ref|XP_006284106.1| hypothetical protein CARUB_v10005240mg [Caps...  153  2e-34
ref|XP_006483638.1| PREDICTED: uncharacterized protein LOC102622...  151  6e-34
ref|XP_006450086.1| hypothetical protein CICLE_v10009072mg [Citr...  150  1e-33
ref|XP_002527997.1| transcription factor, putative [Ricinus comm...  144  9e-32
gb|EXC32757.1| hypothetical protein L484_019870 [Morus notabilis]  132  3e-28
ref|XP_006382204.1| hypothetical protein POPTR_0006s29340g [Popu...  132  3e-28

>ref|XP_004245311.1| PREDICTED: uncharacterized protein LOC101249843 [Solanum lycopersicum]
Length = 316

Score = 218 bits (555), Expect = 5e-54
Identities = 128/318 (40%), Positives = 178/318 (55%), Gaps = 37/318 (11%)
Frame = +2

Query: 164 LEMERGSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMN 343
+E GS TRSQAAPDWT+ E VTLVNE+ A + E S+ASFQKWQ V NCN+L +N
Sbjct: 1 MERSGGSLRTRSQAAPDWTLHESVTLVNEMKATQIECGNSLASFQKWQSTVHNCNSLGVN 60

Query: 344 RSLNQCKKKWAELLAEYKKVKPWEEGYW-SCDSNEREELGLPEGFDRELFKAIDRYVKKK 520
RSLNQCK++W +L +Y KVKPWE YW S D + EL LPE FD ELF AI RY+ +
Sbjct: 61 RSLNQCKRRWESMLEQYNKVKPWESAYWDSFDEERKRELELPEQFDFELFNAIARYLSLE 120

Query: 521 GGDDNAEGPETDPESD--SQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWR---- 682
G D G ETDP++D +Q N F+ + GP I+E +PWR
Sbjct: 121 GEDGG--GAETDPDTDPEAQQVQGNNAFL-EIGPKRQRRRTKTKRYKIEERLNPWRRILN 177

Query: 683 ---------------------------TSVSINTKQESSSLNQMPELPRNDIVSEMKLQP 781
+ S+ K+E+SS +M ELP +V+++K +
Sbjct: 178 ENRKYEQSKMGIKHEASIDAGLEAPRHENSSLEIKRETSSPEEMTELPNLSMVNKVKAEQ 237

Query: 782 DGEDERQKIMCAELLKSTELINATLQGNLAENVE---ADSKNAEAVQIDFNRLQGDRLID 952
D +++M A L ++ E+I A +GN ++ + A N +A ++ R QG++LID
Sbjct: 238 FHVDNPEELMAATLRENAEMITAITEGNTMDDRDCSLAGLNNFDAGRLHLIRSQGNQLID 297

Query: 953 CLGTIANTLAQLCDLVHE 1006
CLG I++TL QLCD +H+
Sbjct: 298 CLGKISDTLIQLCDAIHK 315

>ref|XP_006355199.1| PREDICTED: uncharacterized protein LOC102580095 [Solanum tuberosum]
Length = 315

Score = 213 bits (542), Expect = 2e-52
Identities = 126/318 (39%), Positives = 178/318 (55%), Gaps = 37/318 (11%)
Frame = +2

Query: 164 LEMERGSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMN 343
+E GS TRSQAAPDWT+ E VTLVNE+ A + E ++ASFQKWQ V NCN+L +N
Sbjct: 1 MERSGGSLRTRSQAAPDWTLHESVTLVNEMKATQIECGNTLASFQKWQSTVHNCNSLGVN 60

Query: 344 RSLNQCKKKWAELLAEYKKVKPWEEGYW-SCDSNEREELGLPEGFDRELFKAIDRYVKKK 520
RSLNQCK++W +L +Y KVKPWE YW S D + EL LPE FD ELF AI RY+ +
Sbjct: 61 RSLNQCKRRWESMLEQYNKVKPWESAYWDSFDEERKRELDLPEQFDFELFNAIARYLSLE 120

Query: 521 GGDDNAEGPETDPESD--SQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWR---- 682
G D G ETDP++D +Q N F+ + GP ++E +PWR
Sbjct: 121 GEDGG--GAETDPDTDPEAQQVQGNNAFL-EIGPKRQRRRTKTKRYKMEERLNPWRRILN 177

Query: 683 ---------------------------TSVSINTKQESSSLNQMPELPRNDIVSEMKLQP 781
+ S+ K+E+SS +M ELP +VS++K +
Sbjct: 178 ENRKYEQSKMGIKHEASIDANLEAPRYDNSSLEIKRETSSPEEMTELPNLSMVSKVKAEQ 237

Query: 782 DGEDERQKIMCAELLKSTELINATLQGNLAENVE---ADSKNAEAVQIDFNRLQGDRLID 952
+ +++M A L ++ E+I A +GN ++ + AD N + ++ R QG++LID
Sbjct: 238 FNVNP-EEMMAATLRENAEMITAITEGNTMDDRDCSLADLNNFDVGRVHLIRSQGNQLID 296

Query: 953 CLGTIANTLAQLCDLVHE 1006
CLG I++TL QLCD +H+
Sbjct: 297 CLGKISDTLIQLCDAIHK 314

>emb|CBI20803.3| unnamed protein product [Vitis vinifera]
Length = 250

Score = 185 bits (470), Expect = 4e-44
Identities = 106/285 (37%), Positives = 158/285 (55%), Gaps = 9/285 (3%)
Frame = +2

Query: 182 SRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQC 361
SR TRSQ APDWT+++ + LVNEI+A+EGE L +++++QKW+++ ENC AL+++R+ NQC
Sbjct: 14 SRRTRSQLAPDWTINDSLILVNEIAAVEGECLNALSTYQKWKIIAENCTALDVSRTFNQC 73

Query: 362 KKKWAELLAEYKKVKPWEE-----GYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGG 526
++KW LL EY K+K WE +W+ +S R ELGLP F+RELFKAID V +
Sbjct: 74 RRKWDSLLFEYNKIKKWESRSRNVSFWTLESERRRELGLPVDFERELFKAIDDLVSSQEV 133

Query: 527 DDNAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSINTK 706
+ + P TDPE+
Sbjct: 134 RSDTD-PGTDPEA----------------------------------------------- 145

Query: 707 QESSSLNQMPEL-PRNDIVSEMKLQPDGEDERQKIMCAELLKSTELINATLQGNLAENVE 883
E L + E P+ EM + +E++++M +L ++ +LI+A ++GNL ++V+
Sbjct: 146 -EDDRLEVIAEYGPKKQKRREMPQKTTSLEEKEQMMVMKLRENADLIDAIVKGNLVDSVD 204

Query: 884 ---ADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHEC 1009
SKN E +Q DF R QGD+LI CL IA+TL QL D+V +C
Sbjct: 205 FGLGGSKNRETLQADFKRRQGDKLIACLRDIADTLDQLRDIVQKC 249

>ref|XP_002283217.1| PREDICTED: uncharacterized protein LOC100255883 isoform 1 [Vitis vinifera]
gi|359476329|ref|XP_003631820.1| PREDICTED: uncharacterized protein LOC100255883 isoform 2 [Vitis vinifera]
Length = 274

Score = 185 bits (470), Expect = 4e-44
Identities = 106/285 (37%), Positives = 158/285 (55%), Gaps = 9/285 (3%)
Frame = +2

Query: 182 SRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQC 361
SR TRSQ APDWT+++ + LVNEI+A+EGE L +++++QKW+++ ENC AL+++R+ NQC
Sbjct: 38 SRRTRSQLAPDWTINDSLILVNEIAAVEGECLNALSTYQKWKIIAENCTALDVSRTFNQC 97

Query: 362 KKKWAELLAEYKKVKPWEE-----GYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGG 526
++KW LL EY K+K WE +W+ +S R ELGLP F+RELFKAID V +
Sbjct: 98 RRKWDSLLFEYNKIKKWESRSRNVSFWTLESERRRELGLPVDFERELFKAIDDLVSSQEV 157

Query: 527 DDNAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSINTK 706
+ + P TDPE+
Sbjct: 158 RSDTD-PGTDPEA----------------------------------------------- 169

Query: 707 QESSSLNQMPEL-PRNDIVSEMKLQPDGEDERQKIMCAELLKSTELINATLQGNLAENVE 883
E L + E P+ EM + +E++++M +L ++ +LI+A ++GNL ++V+
Sbjct: 170 -EDDRLEVIAEYGPKKQKRREMPQKTTSLEEKEQMMVMKLRENADLIDAIVKGNLVDSVD 228

Query: 884 ---ADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHEC 1009
SKN E +Q DF R QGD+LI CL IA+TL QL D+V +C
Sbjct: 229 FGLGGSKNRETLQADFKRRQGDKLIACLRDIADTLDQLRDIVQKC 273

>ref|XP_002330777.1| predicted protein [Populus trichocarpa]
Length = 291

Score = 172 bits (436), Expect = 3e-40
Identities = 105/291 (36%), Positives = 160/291 (54%), Gaps = 17/291 (5%)
Frame = +2

Query: 191 TRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCKKK 370
TRSQ +P+WT E + LVNEI+A+E + L++++++QKW+++V+NC L++ R+LNQC+ K
Sbjct: 7 TRSQVSPEWTTKEALILVNEIAAVEKDCLKALSTYQKWKIIVDNCVVLDVARNLNQCRTK 66

Query: 371 WAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYV-KKKGG 526
W L+ EY +K W+ + YWS +S R E GLPE F+ ELF+AID Y+ K
Sbjct: 67 WNSLVNEYNLIKNWDKESESRSDFYWSLESERRREFGLPENFNDELFRAIDDYMWCHKEH 126

Query: 527 DDNAEGPETDPESDSQTP----ANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVS 694
D P+ DP++DS+ P A TN +T + ES H
Sbjct: 127 PDTDPDPDPDPDTDSEKPDLLHAITNPENHQTCCTNEKPQSILAETQLQES-HEEEKPQK 185

Query: 695 INTKQESSSL--NQMPELPRNDIVSEMKLQPDGEDERQKIMCAELLKSTELINATLQGNL 868
K+ S + ++ P++ R K P E+ +Q +M +L ++ E+I A + GN
Sbjct: 186 CRRKENSQNAHGDEKPKIHR----GRKKKMPSTEEMKQ-MMVEKLHENAEMIQAVVNGNF 240

Query: 869 AENVE---ADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHECN 1012
E + ADSKN E + D R QGD+LI CL I N++ Q L+ EC+
Sbjct: 241 PEMADLEAADSKNIEGFKTDLIRRQGDKLIACLQNIVNSINQFPCLLQECD 291

>ref|XP_002332103.1| predicted protein [Populus trichocarpa]
Length = 319

Score = 162 bits (410), Expect = 3e-37
Identities = 100/285 (35%), Positives = 149/285 (52%), Gaps = 20/285 (7%)
Frame = +2

Query: 191 TRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCKKK 370
TRSQ +P+WT + + LVNEI+A+E + ++V++ QKW+++V NC AL + +L+QC+ K
Sbjct: 36 TRSQVSPEWTAKQALILVNEIAAVEKDCSKAVSTNQKWKIIVGNCVALGVTHTLSQCRSK 95

Query: 371 WAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGGD 529
W L+ EY ++K W+ + YWS R+E GLPE FD ELFKAID Y+ +
Sbjct: 96 WNSLVIEYNQIKKWDKESESRSDFYWSLGCERRKEFGLPENFDDELFKAIDDYMWSQ--- 152

Query: 530 DNAEGPETDPESDSQTP------ANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSV 691
E +TDP++D Q AN ++V + +E H +
Sbjct: 153 --KEQLDTDPDTDLQKADLLDVIANLERYV-EENHQTCCTKEKPQTIPAEEELH----EI 205

Query: 692 SINTKQESSSLNQMPELPRND----IVSEMKLQPDGEDERQKIMCAELLKSTELINATLQ 859
+ K + + P++ D I S K P ED Q +M +L ++ E+I A +
Sbjct: 206 QVKEKPQKRLRKEKPQIGNGDEKPKIYSGRKKMPSTEDMEQ-MMVEKLSENAEMIQAVVN 264

Query: 860 GNLAENVE---ADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQ 985
GNL E + ADS N E + D R QGD+LI CL I NT+ Q
Sbjct: 265 GNLPEMADLEAADSNNIEGFKTDLIRSQGDKLIACLENIVNTMRQ 309

>ref|XP_006371828.1| hypothetical protein POPTR_0018s04010g [Populus trichocarpa]
gi|550318001|gb|ERP49625.1| hypothetical protein POPTR_0018s04010g [Populus trichocarpa]
Length = 378

Score = 161 bits (408), Expect = 6e-37
Identities = 100/285 (35%), Positives = 148/285 (51%), Gaps = 20/285 (7%)
Frame = +2

Query: 191 TRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCKKK 370
TRSQ +P+WT + + LVNEI+A+E + ++V++ QKW+++V NC AL + L+QC+ K
Sbjct: 41 TRSQVSPEWTAKQALILVNEIAAVEKDCSKAVSTNQKWKIIVGNCVALGVTHPLSQCRSK 100

Query: 371 WAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGGD 529
W L+ EY ++K W+ + YWS R+E GLPE FD ELFKAID Y+ +
Sbjct: 101 WNSLVIEYNQIKKWDKESESRSDFYWSLGCERRKEFGLPENFDDELFKAIDDYMWSQ--- 157

Query: 530 DNAEGPETDPESDSQTP------ANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSV 691
E +TDP++D Q AN ++V + +E H +
Sbjct: 158 --KEQLDTDPDTDLQKADLLDVIANLERYV-EENHQTCCTKEKPQTIPAEEELH----EI 210

Query: 692 SINTKQESSSLNQMPELPRND----IVSEMKLQPDGEDERQKIMCAELLKSTELINATLQ 859
+ K + + P++ D I S K P ED Q +M +L ++ E+I A +
Sbjct: 211 QVKEKPQKRLRKEKPQIGNGDEKPKIYSGRKKMPSTEDMEQ-MMVEKLSENAEMIQAVVN 269

Query: 860 GNLAENVE---ADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQ 985
GNL E + ADS N E + D R QGD+LI CL I NT+ Q
Sbjct: 270 GNLPEMADLEAADSNNIEGFKTDLIRSQGDKLIACLENIVNTMRQ 314

>gb|EMJ28290.1| hypothetical protein PRUPE_ppa025574mg [Prunus persica]
Length = 335

Score = 160 bits (404), Expect = 2e-36
Identities = 99/315 (31%), Positives = 159/315 (50%), Gaps = 40/315 (12%)
Frame = +2

Query: 185 RCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCK 364
R TRSQ APDW ++ + LVNEI+A+E + L++++SFQKW+++ +NC+AL + R+L+Q +
Sbjct: 21 RSTRSQVAPDWNSTDELLLVNEIAAVEADCLKALSSFQKWKIISQNCSALGVPRTLDQYR 80

Query: 365 KKWAELLAEYKKVKPWEE------GYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGG 526
+KW L +YK +K WE YW + R++ GLPE FD ELF+AID V+ +G
Sbjct: 81 RKWDALFLQYKSIKQWESASRGGASYWVLEIGRRKQKGLPENFDNELFRAIDNLVRVRGN 140

Query: 527 DDNAEGPETDPESDSQTPANTNKFVWK-TGPXXXXXXXXXXXXXIDESFHP--WRT--SV 691
+ + P++DPE++ A+ V + I+ S W++
Sbjct: 141 QSDTD-PDSDPEAEIDAEADVPDVVAEPESKRRRRRSTHQKSCSIENSLEDVRWKSLKKP 199

Query: 692 SINTKQESSSLNQMPE---------------LPRNDIV--------------SEMKLQPD 784
+ K E + + P+ +P+ + S++K +
Sbjct: 200 RVEEKPEETHAEEKPQETHAEEKPVGSCLEVIPQKSLAEQKSQKSCAKKHKNSQIKEKAI 259

Query: 785 GEDERQKIMCAELLKSTELINATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGT 964
+E+++I +L ++ ELI A + N AD K+ Q D R QGD++I CLG
Sbjct: 260 SIEEQEQIAVMQLHENVELIQAIVNENADHEAAADVKSTGDPQTDLVRRQGDQVIACLGD 319

Query: 965 IANTLAQLCDLVHEC 1009
I TL QL LV EC
Sbjct: 320 IVKTLDQLRQLVQEC 334

>ref|XP_004136441.1| PREDICTED: uncharacterized protein LOC101210084 [Cucumis sativus]
gi|449511974|ref|XP_004164105.1| PREDICTED: uncharacterized LOC101210084 [Cucumis sativus]
Length = 311

Score = 158 bits (400), Expect = 5e-36
Identities = 99/303 (32%), Positives = 155/303 (51%), Gaps = 26/303 (8%)
Frame = +2

Query: 179 GSRCTRSQ--AAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSL 352
GSR TRSQ AP WT ++C+ LVN I+A+E + L++++S+QKW++V ENC +L++ R+
Sbjct: 15 GSRRTRSQIAVAPGWTAADCLVLVNVIAAVEADCLKALSSYQKWKIVAENCTSLDVVRTS 74

Query: 353 NQCKKKWAELLAEYKKVKPWE------EGYWSCDSNEREELGLPEGFDRELFKAIDRYVK 514
NQC++KW LL E+ +K WE + YW S R+ELGLPE FD ELFKAID
Sbjct: 75 NQCRRKWDCLLIEHDVIKQWELKMPDDDSYWCLASGRRKELGLPENFDEELFKAIDNVAS 134

Query: 515 KKGGDDNAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVS 694
+ A +T+P+SD + + + GP + E ++
Sbjct: 135 MR-----ANQSDTEPDSDPEAAIGNADEIAEPGPKRQRRRSMSKSNQVLEKSLECERNLG 189

Query: 695 INTKQESSSLNQM-----PELPRNDIVSEMKLQP-------------DGEDERQKIMCAE 820
+ E + E+ ++S +L+P D + ++++M
Sbjct: 190 LEISLECKEVEDRGERGGEEVEEKPLLSSPELEPRECYIKSNESKVTDNIEPKEQMMAKF 249

Query: 821 LLKSTELINATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLV 1000
LL++ E + A + N AE +D K A+ Q + R QG +LI CLG I NT+ L L+
Sbjct: 250 LLENAEKVQAIVSEN-AEYTTSDEKCAKD-QTNLVRHQGSKLIRCLGDILNTINDLRGLL 307

Query: 1001 HEC 1009
+C
Sbjct: 308 EDC 310

>ref|XP_006412613.1| hypothetical protein EUTSA_v10025787mg [Eutrema salsugineum]
gi|557113783|gb|ESQ54066.1| hypothetical protein EUTSA_v10025787mg [Eutrema salsugineum]
Length = 310

Score = 157 bits (398), Expect = 8e-36
Identities = 104/314 (33%), Positives = 157/314 (50%), Gaps = 38/314 (12%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
GSR TRSQ APDWTV +C+ LVNEI+A+E + +++SFQKW ++ ENCNAL+++R+LNQ
Sbjct: 7 GSRRTRSQVAPDWTVKDCLILVNEIAAVEADCSNALSSFQKWTIISENCNALDVHRTLNQ 66

Query: 359 CKKKWAELLAEYKKVKPWEE-------GYWSCDSNEREELGLPEGFDRELFKAIDRYVKK 517
C++KW L+++Y ++K WE YWS + +R++L LP D ELF+AI+ V
Sbjct: 67 CRRKWDSLVSDYNQIKKWESQGRGGGHSYWSLSTEKRKKLNLPGNIDNELFEAINAVVML 126

Query: 518 KGGDDNAEGPETDPESD----------------SQTPANTNKFVWKTGPXXXXXXXXXXX 649
+ E P++DPE+ S+ V K P
Sbjct: 127 QEDKAGTE-PDSDPEAQEGYDVLDVSAELAFVGSKRSRQRTLLVMKENPPHKTKTDA--- 182

Query: 650 XXIDESFHPWRTSVSINTK-QESSSLNQMPELPRNDIVSEMKLQPDGEDERQKI------ 808
P R V TK Q + + NQ + V E+ +GE++ I
Sbjct: 183 -------EPRRNRVLDKTKEQRAKATNQKKPMEEKKPVEEISTG-EGEEDTMSIEEEETM 234

Query: 809 --------MCAELLKSTELINATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGT 964
M A+L + +LI+A + NLA+ E + + ++ F R QG+ LI CL
Sbjct: 235 NIEKEVEAMEAKLGEKADLIHAIVGRNLAKGSETGDDISISDKMKFVRQQGEELIVCLSE 294

Query: 965 IANTLAQLCDLVHE 1006
I NTL +L ++ E
Sbjct: 295 IVNTLNKLREVPQE 308

>ref|NP_194855.2| sequence-specific DNA binding transcription factor [Arabidopsis thaliana]
gi|26452367|dbj|BAC43269.1| unknown protein [Arabidopsis thaliana]
gi|28950855|gb|AAO63351.1| At4g31270 [Arabidopsis thaliana]
gi|332660484|gb|AEE85884.1| sequence-specific DNA binding transcription factor [Arabidopsis thaliana]
Length = 294

Score = 156 bits (395), Expect = 2e-35
Identities = 95/291 (32%), Positives = 158/291 (54%), Gaps = 15/291 (5%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
GSR TRSQ AP+W V +C+ LVNEI+A+E + +++SFQKW ++ ENCNAL+++R+LNQ
Sbjct: 7 GSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLNQ 66

Query: 359 CKKKWAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYVKK 517
C++KW L+++Y ++K WE YWS S++R+ L LP D ELF+AI+ V
Sbjct: 67 CRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAVVMI 126

Query: 518 KGGDDNAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSI 697
+ D G E+D + ++Q + + + G E P + V +
Sbjct: 127 Q---DEKAGTESDSDPEAQDVVDLSAELAFVGSKRSRQRTMVMKETKKE--EPRTSRVQV 181

Query: 698 NTKQE---SSSLNQMPELPRNDIVSEMKLQPDGE-----DERQKIMCAELLKSTELINAT 853
NT+++ + + +Q + V +M + + +E ++M A+L +LI+A
Sbjct: 182 NTREKPITTKATHQNKTMGEKKPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKIDLIHAI 241

Query: 854 LQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHE 1006
+ NLA++ E + ++ R QGD LI CL I +TL +L ++ E
Sbjct: 242 VGRNLAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQE 292

>ref|XP_002867306.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297313142|gb|EFH43565.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 297

Score = 156 bits (395), Expect = 2e-35
Identities = 95/289 (32%), Positives = 153/289 (52%), Gaps = 19/289 (6%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
GSR TRSQ AP+W V +C+ LVNEI+A+E + +++SFQKW +++ENCNAL++ R+LNQ
Sbjct: 7 GSRRTRSQVAPEWAVKDCLILVNEIAAVEADCSNALSSFQKWTMILENCNALDVRRNLNQ 66

Query: 359 CKKKWAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYVKK 517
C++KW L+++Y ++K WE YWS S++R+ L LP D ELF+AI V
Sbjct: 67 CRRKWDSLMSDYNQIKQWESQYRGTGRSYWSLSSDKRKLLNLPGNIDIELFEAISAVVMI 126

Query: 518 KGGDDNAEGPETDPESDSQTPANTN---KFVWKTGPXXXXXXXXXXXXXIDESFHPWRTS 688
+ D G E+D + ++Q + FV + P +
Sbjct: 127 Q---DEKAGTESDSDPEAQDVVDITAELAFVGSKRSRQRTIVMKENPPQKTKKEEPQISR 183

Query: 689 VSINTKQE---SSSLNQMPELPRNDIVSEMKLQPDGEDERQ------KIMCAELLKSTEL 841
V +NT+++ + + +Q + + E+ + E+E ++M A+L +L
Sbjct: 184 VQVNTREKPITAKATHQKKTMEEKRPMEEISTDEEEEEETMNIEEEVEVMEAKLSYKIDL 243

Query: 842 INATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQL 988
I+A + NLA++ E ++ F R QGD LI CL I +TL +L
Sbjct: 244 IHAIVGRNLAKDNETRDGINTDDKLKFVRQQGDELIGCLSEIVSTLNRL 292

>emb|CAA16530.1| hypothetical protein [Arabidopsis thaliana]
gi|7270029|emb|CAB79845.1| hypothetical protein [Arabidopsis thaliana]
Length = 291

Score = 154 bits (390), Expect = 7e-35
Identities = 94/294 (31%), Positives = 156/294 (53%), Gaps = 18/294 (6%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
GSR TRSQ AP+W V +C+ LVNEI+A+E + +++SFQKW ++ ENCNAL+++R+LNQ
Sbjct: 7 GSRRTRSQVAPEWAVKDCLVLVNEIAAVEADCSNALSSFQKWTMITENCNALDVSRNLNQ 66

Query: 359 CKKKWAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYV-- 511
C++KW L+++Y ++K WE YWS S++R+ L LP D ELF+AI+ V
Sbjct: 67 CRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAINAVVMI 126

Query: 512 -KKKGGDDNAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTS 688
+K G ++ PE D + + +T + P +
Sbjct: 127 QDEKAGTESDSDPEAQDVVDLSAELGSKRSRQRT-----------MVMKETKKEEPRTSR 175

Query: 689 VSINTKQE---SSSLNQMPELPRNDIVSEMKLQPDGE-----DERQKIMCAELLKSTELI 844
V +NT+++ + + +Q + V +M + + +E ++M A+L +LI
Sbjct: 176 VQVNTREKPITTKATHQNKTMGEKKPVEDMSTDEEEDETMNIEEDVEVMEAKLSYKIDLI 235

Query: 845 NATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHE 1006
+A + NLA++ E + ++ R QGD LI CL I +TL +L ++ E
Sbjct: 236 HAIVGRNLAKDNETKDGVSMDDKLKSVRQQGDELIGCLSEIVSTLNRLHEVPQE 289

>ref|XP_006284107.1| hypothetical protein CARUB_v10005240mg [Capsella rubella]
gi|482552812|gb|EOA17005.1| hypothetical protein CARUB_v10005240mg [Capsella rubella]
Length = 303

Score = 154 bits (388), Expect = 1e-34
Identities = 98/297 (32%), Positives = 153/297 (51%), Gaps = 19/297 (6%)
Frame = +2

Query: 155 FQTLEMERGSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNAL 334
F+ E GSR RSQ APDWTV +C+ LVNEI+A+E + +++SFQKW ++ ENCN L
Sbjct: 9 FRMDEGSSGSRRLRSQVAPDWTVKDCLILVNEIAAVEADCSNALSSFQKWTMISENCNIL 68

Query: 335 EMNRSLNQCKKKWAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFK 493
++ R+LNQC++KW LL++Y ++K WE YWS + +R+ L LP D ELF+
Sbjct: 69 DVRRTLNQCRRKWDSLLSDYNQIKKWESRYAGSARSYWSLSTEKRKLLNLPGNVDNELFE 128

Query: 494 AIDRYVKKKGGDDNAEGPETDPESDSQTPANTN---KFVWKTGPXXXXXXXXXXXXXIDE 664
+I+ V + D+ G E+D + ++Q + FV +
Sbjct: 129 SINAVVMIQ---DDKAGTESDSDPEAQDLVDVTAELDFVGSKRSRHRTTVTKEIPQQKTK 185

Query: 665 SFHPWRTSVSINTKQESSSLN----QMPELPRNDIVSEMKLQPDGE-----DERQKIMCA 817
P V NT+Q+ + M E + V E+ + E +E ++M A
Sbjct: 186 RKEPQTYRVQENTQQKPTKATHQNINMEE--KKKAVEEISTDEEEEETMNIEEDVEVMEA 243

Query: 818 ELLKSTELINATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQL 988
+L +LI+A NLA++ + + ++ + R QGD LI CL I NTL++L
Sbjct: 244 KLSYKIDLIHAIAGRNLAKDNDTGDDISINDKLKYGRQQGDELISCLSEIVNTLSRL 300

>ref|XP_006284106.1| hypothetical protein CARUB_v10005240mg [Capsella rubella]
gi|482552811|gb|EOA17004.1| hypothetical protein CARUB_v10005240mg [Capsella rubella]
Length = 324

Score = 153 bits (386), Expect = 2e-34
Identities = 97/293 (33%), Positives = 151/293 (51%), Gaps = 19/293 (6%)
Frame = +2

Query: 167 EMERGSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNR 346
E GSR RSQ APDWTV +C+ LVNEI+A+E + +++SFQKW ++ ENCN L++ R
Sbjct: 34 EGSSGSRRLRSQVAPDWTVKDCLILVNEIAAVEADCSNALSSFQKWTMISENCNILDVRR 93

Query: 347 SLNQCKKKWAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDR 505
+LNQC++KW LL++Y ++K WE YWS + +R+ L LP D ELF++I+
Sbjct: 94 TLNQCRRKWDSLLSDYNQIKKWESRYAGSARSYWSLSTEKRKLLNLPGNVDNELFESINA 153

Query: 506 YVKKKGGDDNAEGPETDPESDSQTPANTN---KFVWKTGPXXXXXXXXXXXXXIDESFHP 676
V + D+ G E+D + ++Q + FV + P
Sbjct: 154 VVMIQ---DDKAGTESDSDPEAQDLVDVTAELDFVGSKRSRHRTTVTKEIPQQKTKRKEP 210

Query: 677 WRTSVSINTKQESSSLN----QMPELPRNDIVSEMKLQPDGE-----DERQKIMCAELLK 829
V NT+Q+ + M E + V E+ + E +E ++M A+L
Sbjct: 211 QTYRVQENTQQKPTKATHQNINMEE--KKKAVEEISTDEEEEETMNIEEDVEVMEAKLSY 268

Query: 830 STELINATLQGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQL 988
+LI+A NLA++ + + ++ + R QGD LI CL I NTL++L
Sbjct: 269 KIDLIHAIAGRNLAKDNDTGDDISINDKLKYGRQQGDELISCLSEIVNTLSRL 321

>ref|XP_006483638.1| PREDICTED: uncharacterized protein LOC102622170 isoform X1 [Citrus sinensis]
gi|568860253|ref|XP_006483639.1| PREDICTED: uncharacterized protein LOC102622170 isoform X2 [Citrus sinensis]
Length = 292

Score = 151 bits (382), Expect = 6e-34
Identities = 97/291 (33%), Positives = 154/291 (52%), Gaps = 14/291 (4%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
G+R TRSQ PDW+ E + LVNEI+A+E + L++++S+QKW+++ E C AL++ R+ NQ
Sbjct: 7 GTRRTRSQVGPDWSSKEALILVNEIAAVEADCLKALSSYQKWKIISETCTALDVPRTANQ 66

Query: 359 CKKKWAELLAEYKKVKPWEEGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGGDDNA 538
C++KW LL EYKK+ + + + + P FD ELFKAI +V K DN
Sbjct: 67 CRRKWDSLLDEYKKMIVRSRTFPNSQTQTHTDC-FPPNFDSELFKAIHDFVMSK---DN- 121

Query: 539 EGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSINTK---- 706
+TDP+SD+ A+ ++ + + P ++ + N +
Sbjct: 122 RSDDTDPDSDTDPEADFSEAISQAQLGSKRQRRQSMRVKHCAEQKPLKSCLHENHQKSGC 181

Query: 707 -QESSSLNQMPELPR---------NDIVSEMKLQPDGEDERQKIMCAELLKSTELINATL 856
+E + + E PR N + E K +E +++M A+L ++ ELI+A +
Sbjct: 182 TEEKLCNSHVEEEPRIRLVEKKCQNSRIKEKKSLKSCVEENEQMMVAKLQENAELIHA-I 240

Query: 857 QGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHEC 1009
A+ +AD N + ++ +F R QGD+LI CLG I NTL Q D V EC
Sbjct: 241 VAESADYSDADLNNVQDLESEFVRRQGDKLIACLGEIVNTLNQFTDHVQEC 291

>ref|XP_006450086.1| hypothetical protein CICLE_v10009072mg [Citrus clementina]
gi|567916162|ref|XP_006450087.1| hypothetical protein CICLE_v10009072mg [Citrus clementina]
gi|557553312|gb|ESR63326.1| hypothetical protein CICLE_v10009072mg [Citrus clementina]
gi|557553313|gb|ESR63327.1| hypothetical protein CICLE_v10009072mg [Citrus clementina]
Length = 292

Score = 150 bits (379), Expect = 1e-33
Identities = 97/291 (33%), Positives = 153/291 (52%), Gaps = 14/291 (4%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
G+R TRSQ PDW+ E + LVNEI+A+E + L++++S+QKW+++ E C AL++ R+ NQ
Sbjct: 7 GTRRTRSQVGPDWSSKEALILVNEIAAVEADCLKALSSYQKWKIISETCTALDVPRTANQ 66

Query: 359 CKKKWAELLAEYKKVKPWEEGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGGDDNA 538
C++KW LL EYKK+ + + + + P FD ELFKAI +V K DN
Sbjct: 67 CRRKWDSLLDEYKKMIVRSRTFPNSQTQTHTDC-FPPNFDSELFKAIHDFVMSK---DN- 121

Query: 539 EGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSINTK---- 706
+TDP+SD+ A ++ + + P ++ + N +
Sbjct: 122 RSDDTDPDSDTDPEAYFSEAISQAQLGSKRQRRQSMRVKHCAEQKPLKSCLHENHQKSGC 181

Query: 707 -QESSSLNQMPELPR---------NDIVSEMKLQPDGEDERQKIMCAELLKSTELINATL 856
+E + + E PR N + E K +E +++M A+L ++ ELI+A +
Sbjct: 182 TEEKLCNSHVEEEPRIRLVEKKCQNSHIKEKKSLKSCVEENEQMMVAKLQENAELIHA-I 240

Query: 857 QGNLAENVEADSKNAEAVQIDFNRLQGDRLIDCLGTIANTLAQLCDLVHEC 1009
A+ +AD N + ++ +F R QGD+LI CLG I NTL Q D V EC
Sbjct: 241 VAESADYSDADLNNVQDLESEFVRRQGDKLIACLGEIVNTLNQFTDHVQEC 291

>ref|XP_002527997.1| transcription factor, putative [Ricinus communis]
gi|223532623|gb|EEF34409.1| transcription factor, putative [Ricinus communis]
Length = 419

Score = 144 bits (363), Expect = 9e-32
Identities = 75/212 (35%), Positives = 123/212 (58%), Gaps = 6/212 (2%)
Frame = +2

Query: 191 TRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCKKK 370
TRSQ APDWT E + LVNEI+A+EG+ L+++++ QKW ++V+NC+ L+++R+LNQC+ K
Sbjct: 40 TRSQVAPDWTTKESLILVNEIAAVEGDCLKALSTHQKWNIIVQNCSVLDVSRTLNQCRSK 99

Query: 371 WAELLAEYKKVKPW------EEGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKGGDD 532
W+ LLA+Y ++K W E YW D R+ GLP FD ELF+AID YV+ +
Sbjct: 100 WSSLLADYNRIKQWDSKSSSESSYWLLDPPTRDRCGLPHNFDYELFRAIDHYVRAQ---- 155

Query: 533 NAEGPETDPESDSQTPANTNKFVWKTGPXXXXXXXXXXXXXIDESFHPWRTSVSINTKQE 712
+ P+TDP++D + A+ + K G + P TS + TK++
Sbjct: 156 -KDHPDTDPDTDPEADADLLDVIAKLG------SKRHRRRSMSLKIQPEETSQNCCTKEQ 208

Query: 713 SSSLNQMPELPRNDIVSEMKLQPDGEDERQKI 808
+ L+ E ++ ++++ D +D+ Q +
Sbjct: 209 AQILHAEEEPQQSCKEENLQMRYD-KDQPQTV 239

>gb|EXC32757.1| hypothetical protein L484_019870 [Morus notabilis]
Length = 487

Score = 132 bits (333), Expect = 3e-28
Identities = 63/138 (45%), Positives = 94/138 (68%), Gaps = 5/138 (3%)
Frame = +2

Query: 179 GSRCTRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQ 358
GSR TRSQAAPDW+ + + LVNEI+A+E + L++++S+QKW+++ ENC A +++RSLNQ
Sbjct: 14 GSRRTRSQAAPDWSAMDELILVNEIAAVEADCLKALSSYQKWKIIAENCAAQDVSRSLNQ 73

Query: 359 CKKKWAELLAEYKKVKPWE-----EGYWSCDSNEREELGLPEGFDRELFKAIDRYVKKKG 523
++KW LL +Y +K WE + YW+ ++ REELGLP FD ELF AI V+ +
Sbjct: 74 YRRKWDSLLQDYNSIKRWELKSRRDSYWAMKTDRREELGLPRSFDEELFAAIGNLVRARE 133

Query: 524 GDDNAEGPETDPESDSQT 577
+ E E+D E+ +T
Sbjct: 134 NHSDTE-QESDGEAKEET 150

>ref|XP_006382204.1| hypothetical protein POPTR_0006s29340g [Populus trichocarpa]
gi|118487302|gb|ABK95479.1| unknown [Populus trichocarpa]
gi|550337359|gb|ERP60001.1| hypothetical protein POPTR_0006s29340g [Populus trichocarpa]
Length = 459

Score = 132 bits (333), Expect = 3e-28
Identities = 61/138 (44%), Positives = 93/138 (67%), Gaps = 8/138 (5%)
Frame = +2

Query: 191 TRSQAAPDWTVSECVTLVNEISAIEGEWLQSVASFQKWQLVVENCNALEMNRSLNQCKKK 370
TRSQ +P+WT E + LVNEI+A+E + L++++++QKW+++V+NC L++ R+LNQC+ K
Sbjct: 32 TRSQVSPEWTTKEALILVNEIAAVEKDCLKALSTYQKWKIIVDNCVVLDVARNLNQCRTK 91

Query: 371 WAELLAEYKKVKPWE-------EGYWSCDSNEREELGLPEGFDRELFKAIDRYV-KKKGG 526
W L+ EY +K W+ + YWS +S R E GLPE F+ ELF+AID Y+ K
Sbjct: 92 WNSLVNEYNLIKNWDKESESRSDFYWSLESERRREFGLPENFNDELFRAIDDYMWCHKEH 151

Query: 527 DDNAEGPETDPESDSQTP 580
D P+ DP++DS+ P
Sbjct: 152 PDTDPDPDPDPDTDSEKP 169