BLASTX nr result
ID: Cocculus23_contig00019097
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00019097 (1776 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533547.1| DNA binding protein, putative [Ricinus commu... 245 5e-62
ref|XP_006481816.1| PREDICTED: uncharacterized protein LOC102610... 238 7e-60
ref|XP_006481817.1| PREDICTED: uncharacterized protein LOC102610... 236 3e-59
ref|XP_006430258.1| hypothetical protein CICLE_v10013541mg, part... 233 2e-58
ref|XP_006381298.1| bZIP transcription factor family protein [Po... 224 1e-55
ref|XP_002277087.1| PREDICTED: uncharacterized protein LOC100257... 222 5e-55
ref|XP_007203618.1| hypothetical protein PRUPE_ppa003901mg [Prun... 219 4e-54
ref|XP_004303007.1| PREDICTED: uncharacterized protein LOC101299... 212 4e-52
ref|XP_007027678.1| Basic-leucine zipper transcription factor fa... 197 1e-47
ref|XP_007027677.1| Basic-leucine zipper transcription factor fa... 197 1e-47
ref|XP_007027676.1| Basic-leucine zipper transcription factor fa... 197 1e-47
gb|AAF79444.1|AC025808_26 F18O14.26 [Arabidopsis thaliana]          184 1e-43
ref|NP_173381.1| basic-leucine zipper transcription factor famil... 184 1e-43
ref|XP_004161242.1| PREDICTED: uncharacterized protein LOC101224... 177 2e-41
ref|XP_004149227.1| PREDICTED: uncharacterized protein LOC101210... 177 2e-41
ref|XP_006416500.1| hypothetical protein EUTSA_v10009681mg, part... 174 2e-40
ref|XP_006303620.1| hypothetical protein CARUB_v10011417mg [Caps... 173 2e-40
ref|XP_002893053.1| hypothetical protein ARALYDRAFT_312884 [Arab... 
171 1e-39 gb|EXC26927.1| Transcription factor HBP-1a [Morus notabilis] 160 2e-36 ref|XP_006604635.1| PREDICTED: uncharacterized protein LOC100788... 152 5e-34 >ref|XP_002533547.1| DNA binding protein, putative [Ricinus communis] gi|223526583|gb|EEF28837.1| DNA binding protein, putative [Ricinus communis] Length = 515 Score = 245 bits (625), Expect = 5e-62 Identities = 158/419 (37%), Positives = 224/419 (53%), Gaps = 26/419 (6%) Frame = +1 Query: 346 KWGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLKA 522 +WG KG+R KRVK+ESP + + K + S A V+ Q + +KA Sbjct: 65 RWGSKGKRGKKRVKSESPPLDPFTKPVLDSLTNCLDPAPDPAPVDQQHDEPLCSDTVIKA 124 Query: 523 VKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQ 699 K E+D++ +P+ + +++ S+GG +S+QNLTEAEKE LANRESARQTIRRRQ Sbjct: 125 AKVEQDADIPKPSLVSVKNHPSYGGGRSRQNLTEAEKEERRLRRILANRESARQTIRRRQ 184 Query: 700 AMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEI 879 A+CEELTRKAADL W+NENLK+EKE V+K++QSL+ +N YLK QMA +K+E+E++P ++ Sbjct: 185 ALCEELTRKAADLAWENENLKREKESVLKEFQSLESRNKYLKAQMAKLIKTEVEDSPADL 244 Query: 880 PSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIH-------HLDPVHTKSI 1038 S H+ +P +L Y + FS WPSII HL P T I Sbjct: 245 KSAHVDNSLAPATNCSLL--------LYNQHPFSSLCWPSIIQSSNSVQSHLGPQSTIMI 296 Query: 1039 PTNASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD 1218 P++ S + + SS ENP+ TN P+ P YI+ CPWFFP+ + N HP + Sbjct: 297 PSSISMPPN----GKLDSSQQPQENPMITNGPRTPLYIVSCPWFFPVPEHANGLHP--LP 350 Query: 1219 SFSSKHKHSDNSTSKQ-PEQSSIKADNLPES---------NSTDNLPENSSKYST----- 1353 SF +HK S + Q SS KA L ++ NS D P + T Sbjct: 351 SFGLQHKQDGTSVNNQCSRTSSAKATALMQNQFSSASEKVNSEDGNPAINDLNETPVGVP 410 Query: 1354 SEDSNFSNYPN--RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSAL 1524 E + S PN + P L S+ P++++K E + + I T+ + SAL Sbjct: 411 PEGGSHSAAPNHKETVVAPVMLSSITPTVAVKNETGTRSESVPHTDGICTTSKQLISAL 469 >ref|XP_006481816.1| PREDICTED: uncharacterized protein LOC102610701 isoform X1 [Citrus sinensis] Length = 516 Score = 238 bits (607), Expect = 7e-60 Identities = 155/416 (37%), 
Positives = 219/416 (52%), Gaps = 26/416 (6%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLK 519 WG KG+R KRVK ESP G+ + ++ S + Q + QR N+ +K Sbjct: 63 WGCKGKRVRKRVKTESPPGQAESAMNPVDPEPPCSDPIDQDQVISDQQRDRTACGNILIK 122 Query: 520 AVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRR 696 VK+++D+ES + + + Y+S GG+S+QNLTEAEKE LANRESARQTIRRR Sbjct: 123 PVKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRRR 182 Query: 697 QAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGE 876 QA+CEELTRKAADL +NE+LK+EKEL +K+YQSL+ N +LK Q+A +K+E+ E GE Sbjct: 183 QALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKAEVGETQGE 242 Query: 877 IPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASA 1056 + H SP + P Y +P WPSII PV ++ NA Sbjct: 243 VKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHGMQNAVT 293 Query: 1057 LQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV------ 1212 I +I L+S E ENP +NV + P Y++PCPWFFPL D + H + Sbjct: 294 FPSNISTSITGELASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLKI 353 Query: 1213 -VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYSTS 1356 D S+ + + S+SK P + +A LPE+ S ++L + S Sbjct: 354 LQDETSAHNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESPQ 413 Query: 1357 ED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518 + Y A P PL SV S +K++N LQ + T +++S A+H+ S Sbjct: 414 DGGCQQIGRYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 469 >ref|XP_006481817.1| PREDICTED: uncharacterized protein LOC102610701 isoform X2 [Citrus sinensis] Length = 515 Score = 236 bits (601), Expect = 3e-59 Identities = 157/417 (37%), Positives = 223/417 (53%), Gaps = 27/417 (6%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTI-KNMTL 516 WG KG+R KRVK ESP G+ + ++ S + D + DQ++ RT N+ + Sbjct: 63 WGCKGKRVRKRVKTESPPGQAESAMNPVDPEPPCSDPID--DQVISDQQRDRTACGNILI 120 Query: 517 KAVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRR 693 K VK+++D+ES + + + Y+S GG+S+QNLTEAEKE LANRESARQTIRR Sbjct: 121 
KPVKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRR 180 Query: 694 RQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPG 873 RQA+CEELTRKAADL +NE+LK+EKEL +K+YQSL+ N +LK Q+A +K+E+ E G Sbjct: 181 RQALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKAEVGETQG 240 Query: 874 EIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNAS 1053 E+ H SP + P Y +P WPSII PV ++ NA Sbjct: 241 EVKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHGMQNAV 291 Query: 1054 ALQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV----- 1212 I +I L+S E ENP +NV + P Y++PCPWFFPL D + H + Sbjct: 292 TFPSNISTSITGELASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLK 351 Query: 1213 --VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYST 1353 D S+ + + S+SK P + +A LPE+ S ++L + S Sbjct: 352 ILQDETSAHNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESP 411 Query: 1354 SED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518 + Y A P PL SV S +K++N LQ + T +++S A+H+ S Sbjct: 412 QDGGCQQIGRYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 468 >ref|XP_006430258.1| hypothetical protein CICLE_v10013541mg, partial [Citrus clementina] gi|557532315|gb|ESR43498.1| hypothetical protein CICLE_v10013541mg, partial [Citrus clementina] Length = 511 Score = 233 bits (595), Expect = 2e-58 Identities = 154/416 (37%), Positives = 220/416 (52%), Gaps = 26/416 (6%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLK 519 WG K +R KRVK ESP G+ + ++ S + + S + QR N+ +K Sbjct: 59 WGCKVKRVRKRVKTESPPGQAGSAMNPVDPEPPCSDPIDDQVIS-DQQRDQTACGNILIK 117 Query: 520 AVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRR 696 K+++D+ES + + + Y+S GG+S+QNLTEAEKE LANRESARQTIRRR Sbjct: 118 PAKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRRR 177 Query: 697 QAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGE 876 QA+CEELTRKAADL +NE+LK+EKEL +K+YQSL+ N +LK Q+A +KSE+ E GE Sbjct: 178 QALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKSEVGETQGE 237 Query: 877 
IPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASA 1056 + H SP + P Y +P WPSII PV ++ NA Sbjct: 238 VKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHEMQNAVT 288 Query: 1057 LQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV------ 1212 I +I L+S E ENP +NV + P Y++PCPWFFPL D + H + Sbjct: 289 FPSNISTSITGKLASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLKV 348 Query: 1213 -VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYSTS 1356 D S+++ + S+SK P + +A LPE+ S ++L + S Sbjct: 349 LQDETSARNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESPQ 408 Query: 1357 ED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518 + +Y A P PL SV S +K++N LQ + T +++S A+H+ S Sbjct: 409 DGGCQQIGHYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 464 >ref|XP_006381298.1| bZIP transcription factor family protein [Populus trichocarpa] gi|550336000|gb|ERP59095.1| bZIP transcription factor family protein [Populus trichocarpa] Length = 485 Score = 224 bits (570), Expect = 1e-55 Identities = 155/426 (36%), Positives = 213/426 (50%), Gaps = 25/426 (5%) Frame = +1 Query: 328 REKPSRKWGRKGRRSMKRVKNESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKN 507 RE +WG KG+R+ KRV+ ES D L ++D +V DQ QP I + Sbjct: 39 RESSGSEWGSKGKRARKRVRAESDSVSTYSD----------LPRQDRAVVDQ-QP--IHS 85 Query: 508 MTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQT 684 +K + E D++ + + SY S+G G+S+ NLTEAEKE LANRESARQT Sbjct: 86 NVVKPARQELDADVPKSSPSCATSYPSYGTGRSRLNLTEAEKEERRLRRILANRESARQT 145 Query: 685 IRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEE 864 IRRRQA+CEELTRKAADL W+NENLK+EKEL +K YQSL+ N +LK QMA ++K+EME Sbjct: 146 IRRRQALCEELTRKAADLSWENENLKKEKELALKNYQSLETTNKHLKAQMAKQIKAEMEV 205 Query: 865 APGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPT 1044 +PG++ S + P ++ P Y + FSPH WPSII +P+ + Sbjct: 206 SPGDLKSALVDIP--------TTAPTNCPLLVYNQHAFSPHCWPSIIQSSNPIQSHYTTE 257 Query: 1045 NASALQDPI---TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVV 1215 NA + + T SS + EN + + P+ P Y++ CPWFFP D N H Sbjct: 258 
NAIVIPSNMPMPTNGTHDSSQLQQENTVIVSGPRTPLYVVSCPWFFPGPDHGNGLHAQ-- 315 Query: 1216 DSFSSKHKHSDNSTSKQPEQSSIKADNLPESNSTDNLP-ENSSKYSTSEDSNFSN----- 1377 SFS KH+ S + SS P N +L S+ ++SE+ N Sbjct: 316 PSFSFKHRQDGISLNNLCCGSSSPKAAAPMENRHSSLSIIVKSETTSSEEVRVINDLNET 375 Query: 1378 ---------------YPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHV 1512 +P I P P SV P++++K E + F I AS + Sbjct: 376 PVGFTLYGGGQCEGTHPKEMILTPVPPTSVTPAVAVKNEAGQKSEHAFGANGICTKASQL 435 Query: 1513 GSALSE 1530 L E Sbjct: 436 RCVLPE 441 >ref|XP_002277087.1| PREDICTED: uncharacterized protein LOC100257875 [Vitis vinifera] gi|297740087|emb|CBI30269.3| unnamed protein product [Vitis vinifera] Length = 496 Score = 222 bits (565), Expect = 5e-55 Identities = 171/470 (36%), Positives = 232/470 (49%), Gaps = 33/470 (7%) Frame = +1 Query: 250 GADLMVKIEXXXXXXXXXXXXXXXXERE----KPSRKWGRKGRRSMKRVKNESPGGEWVK 417 GAD +VKIE E E + KWG KG+R KRVK+ESP + K Sbjct: 33 GADRLVKIELEAAEVLADLAQSLMRESESNGAESGGKWGSKGKRGRKRVKSESPPSDEFK 92 Query: 418 DIESSKLRSFSLAQRDCSVEDQRQPRTI-KNMTLKAVKSEEDSESSRPTHMYCRSYMSH- 591 + ++ S L ++D Q++ R I +N+ L K+E D E ++P+ M +Y H Sbjct: 93 NPDNLFPGSSDLTEQDKQSVVQQECRKIDRNVFL--TKTETDDEFAKPSPMCTTTYAPHH 150 Query: 592 GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEK 771 GK +QNLTEAEKEA LANRESARQTIRRRQA+C EL+RKAADL +NE LK+EK Sbjct: 151 SGKLRQNLTEAEKEARRLRRVLANRESARQTIRRRQALCGELSRKAADLSLENETLKREK 210 Query: 772 ELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHIT--PPKSPCDQQALSSEKP 945 EL MK++QSL++KN +LK Q+A +K E E+ P I S +T PP S C Sbjct: 211 ELAMKEFQSLENKNKHLKAQVAKIIKPEEEKTPESISSHEMTSIPPSSNC---------- 260 Query: 946 QPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQDPITIACMLSSLDESENPLST 1125 P Y +P+F+P +W S NA A + DE ENP + Sbjct: 261 -PLLLYNQPSFTPFLWSSPERRFQ---------NAFASH---------AVPDERENP-NI 300 Query: 1126 NVPKIPYYILPCPWFFPLSDCRNVSHPHVVDSFSSKHKH-------SDNSTSKQPEQSSI 1284 + + P YILPCPWFFPL + N H+ S + K K S +S K Sbjct: 301 DAYRTPLYILPCPWFFPLPNHGN--GLHLPPSLNLKDKQDAVNSQCSASSLIKNKSGIET 358 Query: 1285 
KADNLPESNSTDNLPE-----------------NSSKYSTSEDS-NFSNYPNRAIFMPTP 1410 K N + S + LP+ + Y S D+ + S++ N I P+P Sbjct: 359 KPANKFQEASFEFLPDGHLITPHHRRMIPANNVHDLSYGFSPDAHHISSHSNAMILSPSP 418 Query: 1411 LRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGSSICSS 1560 L S+K +I+ K+E +LQ + E H+ S SE ICSS Sbjct: 419 LMSLKSAITFKHEGELQSSYVDNGE-----GGHIVSVFSEKNQEPVICSS 463 >ref|XP_007203618.1| hypothetical protein PRUPE_ppa003901mg [Prunus persica] gi|462399149|gb|EMJ04817.1| hypothetical protein PRUPE_ppa003901mg [Prunus persica] Length = 541 Score = 219 bits (557), Expect = 4e-54 Identities = 161/472 (34%), Positives = 234/472 (49%), Gaps = 44/472 (9%) Frame = +1 Query: 247 GGADLMVKIEXXXXXXXXXXXXXXXXERE--KPSRKWGRKGRRSMKRVKNESPGGEWVKD 420 G AD MVK E E + + WG KG+R+ KRVK+ESP G + Sbjct: 39 GAADRMVKEELEAAEALADLAHLAMRESSGAESAGNWGLKGKRAKKRVKSESPPGHLGLN 98 Query: 421 IESSKLRSFSLAQRDCSVEDQRQPRTI------------KNMTLKAVKSEEDSESSRPTH 564 L+Q+D +V RQ T+ + ++ + VK+E D+E ++ + Sbjct: 99 PVDPVPTCPDLSQQDQAVTGLRQCETVCTNVVTELLKTEQVLSNEIVKAEHDAEVTKLSP 158 Query: 565 MYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLG 741 + SY S KS++NLTE EKE LANRESARQTIRRRQA+CEELTRKAADL Sbjct: 159 ICTTSYPSFSCSKSRRNLTEEEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLA 218 Query: 742 WDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHI---TPPKSP 912 +NENLK++KEL +K+YQSL+ N +LKVQMA +K+E+EE P E S ++ PP SP Sbjct: 219 LENENLKKKKELALKEYQSLEKTNKHLKVQMAKVIKAEVEETPSENMSAYVQMQIPPSSP 278 Query: 913 CDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQD--PITIACM 1086 + P + RP F+P WPSII + V + + N A+ P+ Sbjct: 279 SN---------SPLFLFNRPPFTPVFWPSIIQSSNSVQLQHVSQNPMAIPSNIPLPANGT 329 Query: 1087 LSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRN---------VSHPHVVDSFSSKHK 1239 S E ENPL+ N + P Y+ PCPWF P D N +++ SF++++ Sbjct: 330 ADSSHEQENPLTNNGTRTPLYVFPCPWFIPHFDNGNGLQPQSSLCLNNKQEETSFNNQYS 389 Query: 1240 HS---------DNSTSKQPEQSSIKADNLPESNSTDNLPENSSKYS-TSEDSNFSNYPN- 1386 S DN P + +A E+ +++L E +++ D + YP Sbjct: 390 ASSSSRTVAQLDNHHCSFPIRLKAEASGSMEARLSNDLNETPAQFPLDGADQHTGPYPKE 449 
Query: 1387 ---RAIFM-PTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSE 1530 + IF+ P + + S+K+EN + + T E + H+ SAL E Sbjct: 450 NGPKEIFLTPASANHERVASSIKHENGFESDYTATAEK----SFHMFSALPE 497 >ref|XP_004303007.1| PREDICTED: uncharacterized protein LOC101299496 [Fragaria vesca subsp. vesca] Length = 531 Score = 212 bits (540), Expect = 4e-52 Identities = 149/428 (34%), Positives = 214/428 (50%), Gaps = 34/428 (7%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESP----GGEWVKDI----ESSKLRSFSLAQRDCSVEDQRQPRTIK 504 WG KG+R+ KRVK+ESP G V + + + +R C +T Sbjct: 69 WGLKGKRAKKRVKSESPPTLSGSNPVPACPDLPQDEAVIGPAQCERVCINVVAEPVKTET 128 Query: 505 NMTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQ 681 M+ + KSE+D+E + T + SY S KS++NLTE EKE LANRESARQ Sbjct: 129 VMSKRIAKSEQDAELTNSTPICNTSYPSFNCTKSRRNLTEEEKEERRIRRILANRESARQ 188 Query: 682 TIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEME 861 TIRRRQA+CE+LT+KAADL +NE+LK +KEL +K+YQSL++ N LKVQM+ K+E+E Sbjct: 189 TIRRRQALCEDLTKKAADLTLENESLKMKKELALKQYQSLEETNRLLKVQMSKARKAEVE 248 Query: 862 EAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIP 1041 E E S ++ P SS PF + RP F+P WPS+I + + + +P Sbjct: 249 ETLDENMSAYVQIPS--------SSPTNSPFVLFNRPPFTPVFWPSVIQSSNSIQLQQVP 300 Query: 1042 TNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPH-- 1209 N A+ I++ C S E NP+S N + P Y++PCPWFFP + N + P Sbjct: 301 QNPMAIPSNISLPCNGTADSSHELGNPISINGSRTPLYVIPCPWFFPQFEIGNGAQPQSS 360 Query: 1210 ---------------VVDSFSSKHKHSDNSTSKQPEQSSIKADNLPESNSTDNLPENSSK 1344 S S DN+ S P + ++A E+ +L EN ++ Sbjct: 361 CPENKQEGAFFNNQGSASSLSRTAAQLDNNQSAFPVRLDVEASGSVEARPRTDLNENPAQ 420 Query: 1345 YSTSEDSNFSN--YPN----RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTAS 1506 + + +P R IF+ L + ++K EN L+ + + E S TA Sbjct: 421 FPLDGGDQHTGGFHPKENGPREIFLSPLLNHGGIASTIKNENGLESDFSANAEK-SMTAC 479 Query: 1507 HVGSALSE 1530 H SAL E Sbjct: 480 HPFSALPE 487 >ref|XP_007027678.1| Basic-leucine zipper transcription factor family protein, putative isoform 3 [Theobroma cacao] gi|508716283|gb|EOY08180.1| Basic-leucine zipper 
transcription factor family protein, putative isoform 3 [Theobroma cacao] Length = 594 Score = 197 bits (501), Expect = 1e-47 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%) Frame = +1 Query: 451 LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627 + ++D + Q T ++ +K+VK+E+++ES + + YMS GG +S+QNLTEAE Sbjct: 184 MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 243 Query: 628 KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807 KEA LANRESARQTIRRRQA+CE+LT K ADL +NENLK+ KEL +K+Y+S + Sbjct: 244 KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 303 Query: 808 KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987 N +LK QM +K+E EAP E+ H Q + PF FY + F P Sbjct: 304 TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 353 Query: 988 VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161 WPSI+ +PV T+ NA + I+ L S + ENP++ N PK P Y++P Sbjct: 354 CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 413 Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269 PWFF L D N H + D S+ ++ S + K + Sbjct: 414 PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 473 Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437 SI+A + ++ ++ LP + S +Y E+ + +PTPL S P+ Sbjct: 474 AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 523 Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530 ++ EN N E+ A H AL E Sbjct: 524 VEQENTPDVN----TEAARVRACHFVGALPE 550 Score = 60.1 bits (144), Expect = 3e-06 Identities = 46/143 (32%), Positives = 71/143 (49%), Gaps = 2/143 (1%) Frame = +1 Query: 334 KPSRKWGRKGRRSMKRV-KNESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNM 510 K S KWG KG+R +RV +ESP E + S LA+ +V+ Q+ T + Sbjct: 56 KFSAKWGCKGKRVSRRVSSSESPPSEIGLNQVDPVQSSSDLAEDRAAVDQQQSQVTSTPV 115 Query: 511 TLKAVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTI 687 ++++++E++SE +H Y S GKS+QN AEKE L N+ES Q I Sbjct: 116 
VIESIEAEQNSELLNGSHTCAARYTSKCVGKSRQN---AEKETLRLHRMLTNKESDWQMI 172 Query: 688 RRRQAMCEELTRKAADLGWDNEN 756 R RQ + + D+ D ++ Sbjct: 173 RERQILYSIMGMPEKDVWEDRQH 195 >ref|XP_007027677.1| Basic-leucine zipper transcription factor family protein, putative isoform 2 [Theobroma cacao] gi|508716282|gb|EOY08179.1| Basic-leucine zipper transcription factor family protein, putative isoform 2 [Theobroma cacao] Length = 434 Score = 197 bits (501), Expect = 1e-47 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%) Frame = +1 Query: 451 LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627 + ++D + Q T ++ +K+VK+E+++ES + + YMS GG +S+QNLTEAE Sbjct: 24 MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 83 Query: 628 KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807 KEA LANRESARQTIRRRQA+CE+LT K ADL +NENLK+ KEL +K+Y+S + Sbjct: 84 KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 143 Query: 808 KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987 N +LK QM +K+E EAP E+ H Q + PF FY + F P Sbjct: 144 TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 193 Query: 988 VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161 WPSI+ +PV T+ NA + I+ L S + ENP++ N PK P Y++P Sbjct: 194 CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 253 Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269 PWFF L D N H + D S+ ++ S + K + Sbjct: 254 PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 313 Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437 SI+A + ++ ++ LP + S +Y E+ + +PTPL S P+ Sbjct: 314 AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 363 Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530 ++ EN N E+ A H AL E Sbjct: 364 VEQENTPDVN----TEAARVRACHFVGALPE 390 >ref|XP_007027676.1| Basic-leucine zipper transcription factor family protein, putative isoform 1 [Theobroma cacao] gi|508716281|gb|EOY08178.1| Basic-leucine zipper 
transcription factor family protein, putative isoform 1 [Theobroma cacao] Length = 595 Score = 197 bits (501), Expect = 1e-47 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%) Frame = +1 Query: 451 LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627 + ++D + Q T ++ +K+VK+E+++ES + + YMS GG +S+QNLTEAE Sbjct: 185 MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 244 Query: 628 KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807 KEA LANRESARQTIRRRQA+CE+LT K ADL +NENLK+ KEL +K+Y+S + Sbjct: 245 KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 304 Query: 808 KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987 N +LK QM +K+E EAP E+ H Q + PF FY + F P Sbjct: 305 TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 354 Query: 988 VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161 WPSI+ +PV T+ NA + I+ L S + ENP++ N PK P Y++P Sbjct: 355 CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 414 Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269 PWFF L D N H + D S+ ++ S + K + Sbjct: 415 PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 474 Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437 SI+A + ++ ++ LP + S +Y E+ + +PTPL S P+ Sbjct: 475 AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 524 Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530 ++ EN N E+ A H AL E Sbjct: 525 VEQENTPDVN----TEAARVRACHFVGALPE 551 Score = 61.2 bits (147), Expect = 1e-06 Identities = 48/144 (33%), Positives = 74/144 (51%), Gaps = 3/144 (2%) Frame = +1 Query: 334 KPSRKWGRKGRRSMKRVKN-ESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPR-TIKN 507 K S KWG KG+R +RV + ESP E + S LA++D + DQ+Q + T Sbjct: 56 KFSAKWGCKGKRVSRRVSSSESPPSEIGLNQVDPVQSSSDLAEQDRAAVDQQQSQVTSTP 115 Query: 508 MTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQT 684 + ++++++E++SE +H Y S GKS+QN AEKE L N+ES Q Sbjct: 116 
VVIESIEAEQNSELLNGSHTCAARYTSKCVGKSRQN---AEKETLRLHRMLTNKESDWQM 172 Query: 685 IRRRQAMCEELTRKAADLGWDNEN 756 IR RQ + + D+ D ++ Sbjct: 173 IRERQILYSIMGMPEKDVWEDRQH 196 >gb|AAF79444.1|AC025808_26 F18O14.26 [Arabidopsis thaliana] Length = 639 Score = 184 bits (467), Expect = 1e-43 Identities = 139/413 (33%), Positives = 203/413 (49%), Gaps = 14/413 (3%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRT---IKNMTL 516 WG KG+R KRVK ESP + +K +S L + LA+ E++ + K +T Sbjct: 225 WGSKGKRVRKRVKTESPPSDSLLKPPDSDTLPTPDLAEERLVKEEEEEEEVEPITKELTK 284 Query: 517 KAVKSEEDSESSRP--THMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTI 687 VKSE + E+ +P R S+G G+S+QNL+EAE+E LANRESARQTI Sbjct: 285 APVKSEINGETPKPILASTLIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTI 344 Query: 688 RRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEA 867 RRRQAMCEEL++KAADL ++NENL++EK+ +K++QSL+ N +LK Q+ VK + +E Sbjct: 345 RRRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVLKSVKPDTKE- 403 Query: 868 PGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDP-VHTKSIPT 1044 P +SP Q S PF FY + + WP + +P + PT Sbjct: 404 ----------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFPT 453 Query: 1045 NASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-- 1218 + A IT E EN N K +Y++PCPWF P D N + D Sbjct: 454 SGGASAKTIT-------TQEHENAADDNGQKTHFYVVPCPWFLPPPDHSNGVPFGLQDTQ 506 Query: 1219 --SFSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDSNFSNYPN 1386 +FS+ H H D+S+++ + + +LP PE Y +E + Sbjct: 507 RGTFSNGH-HIDDSSARPMDVTETPRSHLPTRIKEEDSGSPETRPLYDLNESATEVLSEG 565 Query: 1387 RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGS 1545 F T + + SLK+E+ ++T ++ P HV +L E GS Sbjct: 566 GDGFPVT-----QQAYSLKHED---VSETTNGVTLMPPGHHVLISLPEKKHGS 610 >ref|NP_173381.1| basic-leucine zipper transcription factor family protein [Arabidopsis thaliana] gi|20466818|gb|AAM20726.1| unknown protein [Arabidopsis thaliana] gi|23198222|gb|AAN15638.1| unknown protein [Arabidopsis thaliana] gi|332191739|gb|AEE29860.1| bZIP transcription factor-like protein 
[Arabidopsis thaliana] Length = 471 Score = 184 bits (467), Expect = 1e-43 Identities = 139/413 (33%), Positives = 203/413 (49%), Gaps = 14/413 (3%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRT---IKNMTL 516 WG KG+R KRVK ESP + +K +S L + LA+ E++ + K +T Sbjct: 57 WGSKGKRVRKRVKTESPPSDSLLKPPDSDTLPTPDLAEERLVKEEEEEEEVEPITKELTK 116 Query: 517 KAVKSEEDSESSRP--THMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTI 687 VKSE + E+ +P R S+G G+S+QNL+EAE+E LANRESARQTI Sbjct: 117 APVKSEINGETPKPILASTLIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTI 176 Query: 688 RRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEA 867 RRRQAMCEEL++KAADL ++NENL++EK+ +K++QSL+ N +LK Q+ VK + +E Sbjct: 177 RRRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVLKSVKPDTKE- 235 Query: 868 PGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDP-VHTKSIPT 1044 P +SP Q S PF FY + + WP + +P + PT Sbjct: 236 ----------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFPT 285 Query: 1045 NASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-- 1218 + A IT E EN N K +Y++PCPWF P D N + D Sbjct: 286 SGGASAKTIT-------TQEHENAADDNGQKTHFYVVPCPWFLPPPDHSNGVPFGLQDTQ 338 Query: 1219 --SFSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDSNFSNYPN 1386 +FS+ H H D+S+++ + + +LP PE Y +E + Sbjct: 339 RGTFSNGH-HIDDSSARPMDVTETPRSHLPTRIKEEDSGSPETRPLYDLNESATEVLSEG 397 Query: 1387 RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGS 1545 F T + + SLK+E+ ++T ++ P HV +L E GS Sbjct: 398 GDGFPVT-----QQAYSLKHED---VSETTNGVTLMPPGHHVLISLPEKKHGS 442 >ref|XP_004161242.1| PREDICTED: uncharacterized protein LOC101224097 [Cucumis sativus] Length = 576 Score = 177 bits (448), Expect = 2e-41 Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 42/442 (9%) Frame = +1 Query: 253 ADLMVKIEXXXXXXXXXXXXXXXXER--EKPSRKWG--RKGRRSMKRVKNESPGGEWVKD 420 AD MVK+E E + KWG KG+R+ K VK ESP + Sbjct: 49 ADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADS 108 Query: 421 IESSKLRSFSLAQ-----------RDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHM 567 + + + Q ++C+++ 
Q +P T +T K ++++ESS+ + Sbjct: 109 LPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVT----KMDKEAESSKVSPA 164 Query: 568 YCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGW 744 SY G +S++ LTEAEKE LANRESARQTIRRRQA+CEELTRKAADL W Sbjct: 165 CTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAW 224 Query: 745 DNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQ 924 +NENLK+EKE+ +K+YQSL+ N LK Q+A VK ++EE PG S H+ P P + Sbjct: 225 ENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTN-- 282 Query: 925 ALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH-TKSIPTNASALQDPI-TIACMLSSL 1098 P + R P+ WPS++ H ++ S++ P A + S Sbjct: 283 -------CPLFLFSR---LPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSS 332 Query: 1099 DESENPLSTNVPKIPYYIL-PCPWFFPLSDCRNVSHPHV-----VDSFSSKHKHSDNSTS 1260 EN + + P IL P W P D RN P + D K +++ + Sbjct: 333 QTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAIT 392 Query: 1261 KQPEQSSIKADNLPESNSTDNLPENSSKYSTSEDSN------------------FSNYPN 1386 + ++ + +LP + + P+ + S E SN + P Sbjct: 393 SKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPV 452 Query: 1387 RAIFMPTPLRSVKPSISLKYEN 1452 R + P L ++PS + +N Sbjct: 453 RKVLSPVRLECIEPSSAATLDN 474 >ref|XP_004149227.1| PREDICTED: uncharacterized protein LOC101210630 [Cucumis sativus] Length = 536 Score = 177 bits (448), Expect = 2e-41 Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 42/442 (9%) Frame = +1 Query: 253 ADLMVKIEXXXXXXXXXXXXXXXXER--EKPSRKWG--RKGRRSMKRVKNESPGGEWVKD 420 AD MVK+E E + KWG KG+R+ K VK ESP + Sbjct: 9 ADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADS 68 Query: 421 IESSKLRSFSLAQ-----------RDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHM 567 + + + Q ++C+++ Q +P T +T K ++++ESS+ + Sbjct: 69 LPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVT----KMDKEAESSKVSPA 124 Query: 568 YCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGW 744 SY G +S++ LTEAEKE LANRESARQTIRRRQA+CEELTRKAADL W Sbjct: 125 CTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAW 184 Query: 745 
DNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQ 924 +NENLK+EKE+ +K+YQSL+ N LK Q+A VK ++EE PG S H+ P P + Sbjct: 185 ENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTN-- 242 Query: 925 ALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH-TKSIPTNASALQDPI-TIACMLSSL 1098 P + R P+ WPS++ H ++ S++ P A + S Sbjct: 243 -------CPLFLFSR---LPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSS 292 Query: 1099 DESENPLSTNVPKIPYYIL-PCPWFFPLSDCRNVSHPHV-----VDSFSSKHKHSDNSTS 1260 EN + + P IL P W P D RN P + D K +++ + Sbjct: 293 QTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAIT 352 Query: 1261 KQPEQSSIKADNLPESNSTDNLPENSSKYSTSEDSN------------------FSNYPN 1386 + ++ + +LP + + P+ + S E SN + P Sbjct: 353 SKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPV 412 Query: 1387 RAIFMPTPLRSVKPSISLKYEN 1452 R + P L ++PS + +N Sbjct: 413 RKVLSPVRLECIEPSSAATLDN 434 >ref|XP_006416500.1| hypothetical protein EUTSA_v10009681mg, partial [Eutrema salsugineum] gi|557094271|gb|ESQ34853.1| hypothetical protein EUTSA_v10009681mg, partial [Eutrema salsugineum] Length = 475 Score = 174 bits (440), Expect = 2e-40 Identities = 129/367 (35%), Positives = 186/367 (50%), Gaps = 26/367 (7%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGEW-VKDIESSKLRSFSLAQRDC--SVEDQRQPRTIKNMTLK 519 WG KG+R KRVK ESP + +K +S L + LA+ ED+ QP T + +T Sbjct: 59 WGSKGKRVRKRVKTESPPCDSRLKPADSETLPTLDLAEGRAVKDEEDEVQPIT-REVTKV 117 Query: 520 AVKSEEDSESSRP---THMYCRSYMSHGGKSKQNLTEAEKEAXXXXXXLANRESARQTIR 690 VK+E E +P + + CRS S G+S+QNL+EAE+E LANRESARQTIR Sbjct: 118 PVKTEVTDEIPKPNIASTLRCRS--SGCGRSRQNLSEAEREERRIRRILANRESARQTIR 175 Query: 691 RRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAP 870 RRQAMCEEL++KAADL ++NENL++EK+ +K++QSL+ N +LK Q++ K + +E Sbjct: 176 RRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVSKSAKLDTKE-- 233 Query: 871 GEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNA 1050 P +SP Q S PF FY + + WP + +PV + N Sbjct: 234 ---------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPVISPLETQNG 284 Query: 1051 
SALQDPIT----IACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRN-----VSH 1203 A P T + + E NP N K +Y++PCPWF P D N Sbjct: 285 FAA--PFTTVGGASAKTMTSQEHGNPADDNGQKTHFYVVPCPWFLPAPDQSNGVPFAFQD 342 Query: 1204 PHVVDSFSSKHKHSDNSTSKQPE---------QSSIKADN--LPESNSTDNLPENSSKYS 1350 P V S H D+S++ E Q+ IK ++ PE+ +L E++++ Sbjct: 343 PQRV--IPSNGHHIDDSSANSVEVKKSLPSHLQTRIKEEDSGSPEARPLYDLNESATEVL 400 Query: 1351 TSEDSNF 1371 + F Sbjct: 401 SEGGDGF 407 >ref|XP_006303620.1| hypothetical protein CARUB_v10011417mg [Capsella rubella] gi|482572331|gb|EOA36518.1| hypothetical protein CARUB_v10011417mg [Capsella rubella] Length = 465 Score = 173 bits (439), Expect = 2e-40 Identities = 123/350 (35%), Positives = 179/350 (51%), Gaps = 11/350 (3%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQP--RTIKNMTLK 519 WG KG+R KRVK ESP + +K +S L + LA+ E++ + IK +T Sbjct: 54 WGSKGKRVRKRVKTESPPSDSLLKPPDSETLPTPDLAEERLMKEEEEEDVQPVIKEVTKA 113 Query: 520 AVKSEEDSESSRPT-HMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRR 693 VK+E + E+ +P R S+G G+S+QNL+EAE+E LANRESARQTIRR Sbjct: 114 PVKTEMNGETLKPNLASTIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTIRR 173 Query: 694 RQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPG 873 RQAMCEEL++KAADL ++NENL++EK+ +K++QSL+ N +LK Q++ VK + +E Sbjct: 174 RQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETMNKHLKEQVSKSVKPDTKE--- 230 Query: 874 EIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNAS 1053 H PPK Q S PF FY + + WP + +P+ + + Sbjct: 231 -----HEEPPK---PSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFATSG 282 Query: 1054 ALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD----S 1221 IT E E+P N K +Y++PCPWF D N D + Sbjct: 283 GAAKTIT-------PQEHEDPADDNGQKTHFYVVPCPWFLSPPDQSNGVSLGDQDTQRGT 335 Query: 1222 FSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDS 1365 FS+ H H D+S+++ E + +LP PE Y +E + Sbjct: 336 FSNGH-HVDDSSARPLEVTKTLWSHLPTRIKEEDSGSPETRPLYDLNESA 384 >ref|XP_002893053.1| hypothetical protein ARALYDRAFT_312884 [Arabidopsis lyrata subsp. 
lyrata] gi|297338895|gb|EFH69312.1| hypothetical protein ARALYDRAFT_312884 [Arabidopsis lyrata subsp. lyrata] Length = 647 Score = 171 bits (433), Expect = 1e-39 Identities = 121/349 (34%), Positives = 181/349 (51%), Gaps = 10/349 (2%) Frame = +1 Query: 349 WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLKAV 525 WG KG+R KRVK ESP + +K +S L + LA+ E++ + ++ +T V Sbjct: 239 WGSKGKRVRKRVKTESPPSDSLLKPPDSETLPTPDLAEERLVKEEEEEE--VQPITKAPV 296 Query: 526 KSEEDSESSRPT-HMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQ 699 K+E + E+ + R S+G G+S+QNL+EAE+E LANRESARQTIRRRQ Sbjct: 297 KTEMNGETPKLNLASTLRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTIRRRQ 356 Query: 700 AMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEI 879 AMCEEL++KAADL ++NENL++EK+ +K++QSL+ N +LK Q++ VK + +E Sbjct: 357 AMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVSKSVKPDTKE----- 411 Query: 880 PSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASAL 1059 P +S Q S PF FY + + WP + +P + P + Sbjct: 412 ------PEESTKPSQVDMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPTIS---PLEFATS 462 Query: 1060 QDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-----SF 1224 P + + E ENP N K +Y++PCPWF P D N S P + +F Sbjct: 463 GGP---SAKSMTSQEHENPADDNGQKTHFYVVPCPWFLPPPDQSN-SVPFGLQNTQRGTF 518 Query: 1225 SSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDS 1365 S+ H H D+S+++ E + +LP PE Y +E + Sbjct: 519 SNGH-HIDDSSARPIEVTETPRSHLPTRIKEEDSGSPETRPLYDLNESA 566 >gb|EXC26927.1| Transcription factor HBP-1a [Morus notabilis] Length = 509 Score = 160 bits (404), Expect = 2e-36 Identities = 117/326 (35%), Positives = 166/326 (50%), Gaps = 8/326 (2%) Frame = +1 Query: 328 REKPSRKWGRKGRRSMKRVKNES-PGGEWV---KDIESSKLRSFSLAQRDCSVEDQRQPR 495 RE + G R KRVK++S P E V D+ +++S + C +P Sbjct: 47 REGSAADSGGDWTRRRKRVKSQSTPPAESVTLCSDLPQDRIKSPEQSAEACR-NVIAEPS 105 Query: 496 TIKNMTLKAVKSEEDSESSRPTHMYCRS--YMSHG-GKSKQNLTEAEKEAXXXXXXLANR 666 + + K +K ++++E +P+ + Y G GKS+++LTEAEKEA LANR Sbjct: 106 KAHDRSEKNLKVKKETELPKPSLIGSTEPGYSLLGIGKSRRSLTEAEKEARRIRRILANR 165 Query: 667 
ESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRV 846 ESARQTIRRRQA+CEEL +KAADL +NE+LK E E+ +K+Y+ L+ N LK +MA V Sbjct: 166 ESARQTIRRRQALCEELIKKAADLASENESLKTEMEMALKEYRMLETTNKQLKDRMAKVV 225 Query: 847 KSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH 1026 K+++EE G + ITP + P Y P F+P W + + V Sbjct: 226 KADVEEILGS-QCVQITPTAA------------SPLFLYNHPPFTPLFWSPVAQSPNSVQ 272 Query: 1027 TKSIPTNASALQDPITI-ACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSH 1203 T I NA + I + A E EN +TN P+ P YI PCPWFFP D + Sbjct: 273 TSHIAQNAIVMPSNIPLPAEGRHDSCEQENLRNTNGPETPLYIFPCPWFFPHLDPGTLLQ 332 Query: 1204 PHVVDSFSSKHKHSDNSTSKQPEQSS 1281 S K+K + ST+ Q +S Sbjct: 333 SQ--SSIFQKNKQDETSTNNQQSPTS 356 >ref|XP_006604635.1| PREDICTED: uncharacterized protein LOC100788624 isoform X3 [Glycine max] Length = 436 Score = 152 bits (384), Expect = 5e-34 Identities = 100/249 (40%), Positives = 133/249 (53%), Gaps = 5/249 (2%) Frame = +1 Query: 598 KSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKEL 777 KS++NLTE EKEA LANRESARQTIRRRQA+CEELTRKAA L +NENLK+EKEL Sbjct: 75 KSRRNLTEEEKEARRIRRILANRESARQTIRRRQALCEELTRKAATLVAENENLKREKEL 134 Query: 778 VMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPS--IHITPPKSPCDQQALSSEKPQP 951 +K+Y+SL+ N LK Q+A + +E+E+ P E S ITP S P Sbjct: 135 ALKEYESLETTNKNLKTQIAKSINTEVEKTPVEPVSSVAEITP-----------SSGNGP 183 Query: 952 FPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQDPITIACM--LSSLDESENPLST 1125 + Y S WPSI+ +PVH ++ N+ A+ + C S + N ++ Sbjct: 184 WFLYNHFPVSQIFWPSILQSSNPVHLQNTSFNSIAIPPNANVPCSSESESRHKQNNLIND 243 Query: 1126 NVPKIPYYILPCPWFFPLSDCRN-VSHPHVVDSFSSKHKHSDNSTSKQPEQSSIKADNLP 1302 N + P+Y+ PCPW FPL N S P +S DN + +P SS + L Sbjct: 244 NRTQNPFYMFPCPWLFPLPQFGNGQSSPS-----NSLKDEQDNLSLCKPCSSSSSLNTLA 298 Query: 1303 ESNSTDNLP 1329 + LP Sbjct: 299 NVDYQAALP 307