BLASTX nr result

ID: Cocculus23_contig00019097 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00019097
         (1776 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533547.1| DNA binding protein, putative [Ricinus commu...   245   5e-62
ref|XP_006481816.1| PREDICTED: uncharacterized protein LOC102610...   238   7e-60
ref|XP_006481817.1| PREDICTED: uncharacterized protein LOC102610...   236   3e-59
ref|XP_006430258.1| hypothetical protein CICLE_v10013541mg, part...   233   2e-58
ref|XP_006381298.1| bZIP transcription factor family protein [Po...   224   1e-55
ref|XP_002277087.1| PREDICTED: uncharacterized protein LOC100257...   222   5e-55
ref|XP_007203618.1| hypothetical protein PRUPE_ppa003901mg [Prun...   219   4e-54
ref|XP_004303007.1| PREDICTED: uncharacterized protein LOC101299...   212   4e-52
ref|XP_007027678.1| Basic-leucine zipper transcription factor fa...   197   1e-47
ref|XP_007027677.1| Basic-leucine zipper transcription factor fa...   197   1e-47
ref|XP_007027676.1| Basic-leucine zipper transcription factor fa...   197   1e-47
gb|AAF79444.1|AC025808_26 F18O14.26 [Arabidopsis thaliana]            184   1e-43
ref|NP_173381.1| basic-leucine zipper transcription factor famil...   184   1e-43
ref|XP_004161242.1| PREDICTED: uncharacterized protein LOC101224...   177   2e-41
ref|XP_004149227.1| PREDICTED: uncharacterized protein LOC101210...   177   2e-41
ref|XP_006416500.1| hypothetical protein EUTSA_v10009681mg, part...   174   2e-40
ref|XP_006303620.1| hypothetical protein CARUB_v10011417mg [Caps...   173   2e-40
ref|XP_002893053.1| hypothetical protein ARALYDRAFT_312884 [Arab...   171   1e-39
gb|EXC26927.1| Transcription factor HBP-1a [Morus notabilis]          160   2e-36
ref|XP_006604635.1| PREDICTED: uncharacterized protein LOC100788...   152   5e-34

>ref|XP_002533547.1| DNA binding protein, putative [Ricinus communis]
            gi|223526583|gb|EEF28837.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 515

 Score =  245 bits (625), Expect = 5e-62
 Identities = 158/419 (37%), Positives = 224/419 (53%), Gaps = 26/419 (6%)
 Frame = +1

Query: 346  KWGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLKA 522
            +WG KG+R  KRVK+ESP  + + K +  S       A     V+ Q       +  +KA
Sbjct: 65   RWGSKGKRGKKRVKSESPPLDPFTKPVLDSLTNCLDPAPDPAPVDQQHDEPLCSDTVIKA 124

Query: 523  VKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQ 699
             K E+D++  +P+ +  +++ S+GG +S+QNLTEAEKE       LANRESARQTIRRRQ
Sbjct: 125  AKVEQDADIPKPSLVSVKNHPSYGGGRSRQNLTEAEKEERRLRRILANRESARQTIRRRQ 184

Query: 700  AMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEI 879
            A+CEELTRKAADL W+NENLK+EKE V+K++QSL+ +N YLK QMA  +K+E+E++P ++
Sbjct: 185  ALCEELTRKAADLAWENENLKREKESVLKEFQSLESRNKYLKAQMAKLIKTEVEDSPADL 244

Query: 880  PSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIH-------HLDPVHTKSI 1038
             S H+    +P    +L          Y +  FS   WPSII        HL P  T  I
Sbjct: 245  KSAHVDNSLAPATNCSLL--------LYNQHPFSSLCWPSIIQSSNSVQSHLGPQSTIMI 296

Query: 1039 PTNASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD 1218
            P++ S   +      + SS    ENP+ TN P+ P YI+ CPWFFP+ +  N  HP  + 
Sbjct: 297  PSSISMPPN----GKLDSSQQPQENPMITNGPRTPLYIVSCPWFFPVPEHANGLHP--LP 350

Query: 1219 SFSSKHKHSDNSTSKQ-PEQSSIKADNLPES---------NSTDNLPENSSKYST----- 1353
            SF  +HK    S + Q    SS KA  L ++         NS D  P  +    T     
Sbjct: 351  SFGLQHKQDGTSVNNQCSRTSSAKATALMQNQFSSASEKVNSEDGNPAINDLNETPVGVP 410

Query: 1354 SEDSNFSNYPN--RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSAL 1524
             E  + S  PN    +  P  L S+ P++++K E   +       + I  T+  + SAL
Sbjct: 411  PEGGSHSAAPNHKETVVAPVMLSSITPTVAVKNETGTRSESVPHTDGICTTSKQLISAL 469


>ref|XP_006481816.1| PREDICTED: uncharacterized protein LOC102610701 isoform X1 [Citrus
            sinensis]
          Length = 516

 Score =  238 bits (607), Expect = 7e-60
 Identities = 155/416 (37%), Positives = 219/416 (52%), Gaps = 26/416 (6%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLK 519
            WG KG+R  KRVK ESP G+    +  ++     S  + Q     + QR      N+ +K
Sbjct: 63   WGCKGKRVRKRVKTESPPGQAESAMNPVDPEPPCSDPIDQDQVISDQQRDRTACGNILIK 122

Query: 520  AVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRR 696
             VK+++D+ES + + +    Y+S  GG+S+QNLTEAEKE       LANRESARQTIRRR
Sbjct: 123  PVKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRRR 182

Query: 697  QAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGE 876
            QA+CEELTRKAADL  +NE+LK+EKEL +K+YQSL+  N +LK Q+A  +K+E+ E  GE
Sbjct: 183  QALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKAEVGETQGE 242

Query: 877  IPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASA 1056
            +   H     SP +          P   Y     +P  WPSII    PV ++    NA  
Sbjct: 243  VKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHGMQNAVT 293

Query: 1057 LQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV------ 1212
                I  +I   L+S  E ENP  +NV + P Y++PCPWFFPL D  +  H  +      
Sbjct: 294  FPSNISTSITGELASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLKI 353

Query: 1213 -VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYSTS 1356
              D  S+ + +   S+SK            P +   +A  LPE+ S ++L +     S  
Sbjct: 354  LQDETSAHNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESPQ 413

Query: 1357 ED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518
            +        Y   A   P PL SV  S  +K++N LQ + T   +++S  A+H+ S
Sbjct: 414  DGGCQQIGRYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 469


>ref|XP_006481817.1| PREDICTED: uncharacterized protein LOC102610701 isoform X2 [Citrus
            sinensis]
          Length = 515

 Score =  236 bits (601), Expect = 3e-59
 Identities = 157/417 (37%), Positives = 223/417 (53%), Gaps = 27/417 (6%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTI-KNMTL 516
            WG KG+R  KRVK ESP G+    +  ++     S  +   D  + DQ++ RT   N+ +
Sbjct: 63   WGCKGKRVRKRVKTESPPGQAESAMNPVDPEPPCSDPID--DQVISDQQRDRTACGNILI 120

Query: 517  KAVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRR 693
            K VK+++D+ES + + +    Y+S  GG+S+QNLTEAEKE       LANRESARQTIRR
Sbjct: 121  KPVKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRR 180

Query: 694  RQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPG 873
            RQA+CEELTRKAADL  +NE+LK+EKEL +K+YQSL+  N +LK Q+A  +K+E+ E  G
Sbjct: 181  RQALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKAEVGETQG 240

Query: 874  EIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNAS 1053
            E+   H     SP +          P   Y     +P  WPSII    PV ++    NA 
Sbjct: 241  EVKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHGMQNAV 291

Query: 1054 ALQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV----- 1212
                 I  +I   L+S  E ENP  +NV + P Y++PCPWFFPL D  +  H  +     
Sbjct: 292  TFPSNISTSITGELASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLK 351

Query: 1213 --VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYST 1353
               D  S+ + +   S+SK            P +   +A  LPE+ S ++L +     S 
Sbjct: 352  ILQDETSAHNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESP 411

Query: 1354 SED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518
             +        Y   A   P PL SV  S  +K++N LQ + T   +++S  A+H+ S
Sbjct: 412  QDGGCQQIGRYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 468


>ref|XP_006430258.1| hypothetical protein CICLE_v10013541mg, partial [Citrus clementina]
            gi|557532315|gb|ESR43498.1| hypothetical protein
            CICLE_v10013541mg, partial [Citrus clementina]
          Length = 511

 Score =  233 bits (595), Expect = 2e-58
 Identities = 154/416 (37%), Positives = 220/416 (52%), Gaps = 26/416 (6%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGEW---VKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLK 519
            WG K +R  KRVK ESP G+    +  ++     S  +  +  S + QR      N+ +K
Sbjct: 59   WGCKVKRVRKRVKTESPPGQAGSAMNPVDPEPPCSDPIDDQVIS-DQQRDQTACGNILIK 117

Query: 520  AVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRR 696
              K+++D+ES + + +    Y+S  GG+S+QNLTEAEKE       LANRESARQTIRRR
Sbjct: 118  PAKADQDAESLKRSSLCATRYISMAGGRSRQNLTEAEKEERRVRRILANRESARQTIRRR 177

Query: 697  QAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGE 876
            QA+CEELTRKAADL  +NE+LK+EKEL +K+YQSL+  N +LK Q+A  +KSE+ E  GE
Sbjct: 178  QALCEELTRKAADLSQENESLKREKELAVKEYQSLETINKHLKAQVAKVMKSEVGETQGE 237

Query: 877  IPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASA 1056
            +   H     SP +          P   Y     +P  WPSII    PV ++    NA  
Sbjct: 238  VKLAHAEMSSSPTN---------CPLLLYNHHALTPLGWPSIIQSSQPVPSRHEMQNAVT 288

Query: 1057 LQDPI--TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHV------ 1212
                I  +I   L+S  E ENP  +NV + P Y++PCPWFFPL D  +  H  +      
Sbjct: 289  FPSNISTSITGKLASSQEQENPTDSNVARTPLYVVPCPWFFPLHDSGSGFHAPISNGLKV 348

Query: 1213 -VDSFSSKHKHSDNSTSKQ-----------PEQSSIKADNLPESNSTDNLPENSSKYSTS 1356
              D  S+++ +   S+SK            P +   +A  LPE+ S ++L +     S  
Sbjct: 349  LQDETSARNGYGSGSSSKMTADKENHHFLLPVKIKNEAYGLPEAQSYNDLNDIPVTESPQ 408

Query: 1357 ED--SNFSNYPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGS 1518
            +       +Y   A   P PL SV  S  +K++N LQ + T   +++S  A+H+ S
Sbjct: 409  DGGCQQIGHYTREATLTPPPLSSVGGSFIVKHDNVLQSDYTGHTKAVSKIANHLVS 464


>ref|XP_006381298.1| bZIP transcription factor family protein [Populus trichocarpa]
            gi|550336000|gb|ERP59095.1| bZIP transcription factor
            family protein [Populus trichocarpa]
          Length = 485

 Score =  224 bits (570), Expect = 1e-55
 Identities = 155/426 (36%), Positives = 213/426 (50%), Gaps = 25/426 (5%)
 Frame = +1

Query: 328  REKPSRKWGRKGRRSMKRVKNESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKN 507
            RE    +WG KG+R+ KRV+ ES       D          L ++D +V DQ QP  I +
Sbjct: 39   RESSGSEWGSKGKRARKRVRAESDSVSTYSD----------LPRQDRAVVDQ-QP--IHS 85

Query: 508  MTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQT 684
              +K  + E D++  + +     SY S+G G+S+ NLTEAEKE       LANRESARQT
Sbjct: 86   NVVKPARQELDADVPKSSPSCATSYPSYGTGRSRLNLTEAEKEERRLRRILANRESARQT 145

Query: 685  IRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEE 864
            IRRRQA+CEELTRKAADL W+NENLK+EKEL +K YQSL+  N +LK QMA ++K+EME 
Sbjct: 146  IRRRQALCEELTRKAADLSWENENLKKEKELALKNYQSLETTNKHLKAQMAKQIKAEMEV 205

Query: 865  APGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPT 1044
            +PG++ S  +  P         ++    P   Y +  FSPH WPSII   +P+ +     
Sbjct: 206  SPGDLKSALVDIP--------TTAPTNCPLLVYNQHAFSPHCWPSIIQSSNPIQSHYTTE 257

Query: 1045 NASALQDPI---TIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVV 1215
            NA  +   +   T     SS  + EN +  + P+ P Y++ CPWFFP  D  N  H    
Sbjct: 258  NAIVIPSNMPMPTNGTHDSSQLQQENTVIVSGPRTPLYVVSCPWFFPGPDHGNGLHAQ-- 315

Query: 1216 DSFSSKHKHSDNSTSKQPEQSSIKADNLPESNSTDNLP-ENSSKYSTSEDSNFSN----- 1377
             SFS KH+    S +     SS      P  N   +L     S+ ++SE+    N     
Sbjct: 316  PSFSFKHRQDGISLNNLCCGSSSPKAAAPMENRHSSLSIIVKSETTSSEEVRVINDLNET 375

Query: 1378 ---------------YPNRAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHV 1512
                           +P   I  P P  SV P++++K E   +    F    I   AS +
Sbjct: 376  PVGFTLYGGGQCEGTHPKEMILTPVPPTSVTPAVAVKNEAGQKSEHAFGANGICTKASQL 435

Query: 1513 GSALSE 1530
               L E
Sbjct: 436  RCVLPE 441


>ref|XP_002277087.1| PREDICTED: uncharacterized protein LOC100257875 [Vitis vinifera]
            gi|297740087|emb|CBI30269.3| unnamed protein product
            [Vitis vinifera]
          Length = 496

 Score =  222 bits (565), Expect = 5e-55
 Identities = 171/470 (36%), Positives = 232/470 (49%), Gaps = 33/470 (7%)
 Frame = +1

Query: 250  GADLMVKIEXXXXXXXXXXXXXXXXERE----KPSRKWGRKGRRSMKRVKNESPGGEWVK 417
            GAD +VKIE                E E    +   KWG KG+R  KRVK+ESP  +  K
Sbjct: 33   GADRLVKIELEAAEVLADLAQSLMRESESNGAESGGKWGSKGKRGRKRVKSESPPSDEFK 92

Query: 418  DIESSKLRSFSLAQRDCSVEDQRQPRTI-KNMTLKAVKSEEDSESSRPTHMYCRSYMSH- 591
            + ++    S  L ++D     Q++ R I +N+ L   K+E D E ++P+ M   +Y  H 
Sbjct: 93   NPDNLFPGSSDLTEQDKQSVVQQECRKIDRNVFL--TKTETDDEFAKPSPMCTTTYAPHH 150

Query: 592  GGKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEK 771
             GK +QNLTEAEKEA      LANRESARQTIRRRQA+C EL+RKAADL  +NE LK+EK
Sbjct: 151  SGKLRQNLTEAEKEARRLRRVLANRESARQTIRRRQALCGELSRKAADLSLENETLKREK 210

Query: 772  ELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHIT--PPKSPCDQQALSSEKP 945
            EL MK++QSL++KN +LK Q+A  +K E E+ P  I S  +T  PP S C          
Sbjct: 211  ELAMKEFQSLENKNKHLKAQVAKIIKPEEEKTPESISSHEMTSIPPSSNC---------- 260

Query: 946  QPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQDPITIACMLSSLDESENPLST 1125
             P   Y +P+F+P +W S               NA A           +  DE ENP + 
Sbjct: 261  -PLLLYNQPSFTPFLWSSPERRFQ---------NAFASH---------AVPDERENP-NI 300

Query: 1126 NVPKIPYYILPCPWFFPLSDCRNVSHPHVVDSFSSKHKH-------SDNSTSKQPEQSSI 1284
            +  + P YILPCPWFFPL +  N    H+  S + K K        S +S  K       
Sbjct: 301  DAYRTPLYILPCPWFFPLPNHGN--GLHLPPSLNLKDKQDAVNSQCSASSLIKNKSGIET 358

Query: 1285 KADNLPESNSTDNLPE-----------------NSSKYSTSEDS-NFSNYPNRAIFMPTP 1410
            K  N  +  S + LP+                 +   Y  S D+ + S++ N  I  P+P
Sbjct: 359  KPANKFQEASFEFLPDGHLITPHHRRMIPANNVHDLSYGFSPDAHHISSHSNAMILSPSP 418

Query: 1411 LRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGSSICSS 1560
            L S+K +I+ K+E +LQ +     E       H+ S  SE      ICSS
Sbjct: 419  LMSLKSAITFKHEGELQSSYVDNGE-----GGHIVSVFSEKNQEPVICSS 463


>ref|XP_007203618.1| hypothetical protein PRUPE_ppa003901mg [Prunus persica]
            gi|462399149|gb|EMJ04817.1| hypothetical protein
            PRUPE_ppa003901mg [Prunus persica]
          Length = 541

 Score =  219 bits (557), Expect = 4e-54
 Identities = 161/472 (34%), Positives = 234/472 (49%), Gaps = 44/472 (9%)
 Frame = +1

Query: 247  GGADLMVKIEXXXXXXXXXXXXXXXXERE--KPSRKWGRKGRRSMKRVKNESPGGEWVKD 420
            G AD MVK E                E    + +  WG KG+R+ KRVK+ESP G    +
Sbjct: 39   GAADRMVKEELEAAEALADLAHLAMRESSGAESAGNWGLKGKRAKKRVKSESPPGHLGLN 98

Query: 421  IESSKLRSFSLAQRDCSVEDQRQPRTI------------KNMTLKAVKSEEDSESSRPTH 564
                      L+Q+D +V   RQ  T+            + ++ + VK+E D+E ++ + 
Sbjct: 99   PVDPVPTCPDLSQQDQAVTGLRQCETVCTNVVTELLKTEQVLSNEIVKAEHDAEVTKLSP 158

Query: 565  MYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLG 741
            +   SY S    KS++NLTE EKE       LANRESARQTIRRRQA+CEELTRKAADL 
Sbjct: 159  ICTTSYPSFSCSKSRRNLTEEEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLA 218

Query: 742  WDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHI---TPPKSP 912
             +NENLK++KEL +K+YQSL+  N +LKVQMA  +K+E+EE P E  S ++    PP SP
Sbjct: 219  LENENLKKKKELALKEYQSLEKTNKHLKVQMAKVIKAEVEETPSENMSAYVQMQIPPSSP 278

Query: 913  CDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQD--PITIACM 1086
             +          P   + RP F+P  WPSII   + V  + +  N  A+    P+     
Sbjct: 279  SN---------SPLFLFNRPPFTPVFWPSIIQSSNSVQLQHVSQNPMAIPSNIPLPANGT 329

Query: 1087 LSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRN---------VSHPHVVDSFSSKHK 1239
              S  E ENPL+ N  + P Y+ PCPWF P  D  N         +++     SF++++ 
Sbjct: 330  ADSSHEQENPLTNNGTRTPLYVFPCPWFIPHFDNGNGLQPQSSLCLNNKQEETSFNNQYS 389

Query: 1240 HS---------DNSTSKQPEQSSIKADNLPESNSTDNLPENSSKYS-TSEDSNFSNYPN- 1386
             S         DN     P +   +A    E+  +++L E  +++     D +   YP  
Sbjct: 390  ASSSSRTVAQLDNHHCSFPIRLKAEASGSMEARLSNDLNETPAQFPLDGADQHTGPYPKE 449

Query: 1387 ---RAIFM-PTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSE 1530
               + IF+ P      + + S+K+EN  + + T   E     + H+ SAL E
Sbjct: 450  NGPKEIFLTPASANHERVASSIKHENGFESDYTATAEK----SFHMFSALPE 497


>ref|XP_004303007.1| PREDICTED: uncharacterized protein LOC101299496 [Fragaria vesca
            subsp. vesca]
          Length = 531

 Score =  212 bits (540), Expect = 4e-52
 Identities = 149/428 (34%), Positives = 214/428 (50%), Gaps = 34/428 (7%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESP----GGEWVKDI----ESSKLRSFSLAQRDCSVEDQRQPRTIK 504
            WG KG+R+ KRVK+ESP    G   V       +   +   +  +R C        +T  
Sbjct: 69   WGLKGKRAKKRVKSESPPTLSGSNPVPACPDLPQDEAVIGPAQCERVCINVVAEPVKTET 128

Query: 505  NMTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQ 681
             M+ +  KSE+D+E +  T +   SY S    KS++NLTE EKE       LANRESARQ
Sbjct: 129  VMSKRIAKSEQDAELTNSTPICNTSYPSFNCTKSRRNLTEEEKEERRIRRILANRESARQ 188

Query: 682  TIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEME 861
            TIRRRQA+CE+LT+KAADL  +NE+LK +KEL +K+YQSL++ N  LKVQM+   K+E+E
Sbjct: 189  TIRRRQALCEDLTKKAADLTLENESLKMKKELALKQYQSLEETNRLLKVQMSKARKAEVE 248

Query: 862  EAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIP 1041
            E   E  S ++  P         SS    PF  + RP F+P  WPS+I   + +  + +P
Sbjct: 249  ETLDENMSAYVQIPS--------SSPTNSPFVLFNRPPFTPVFWPSVIQSSNSIQLQQVP 300

Query: 1042 TNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPH-- 1209
             N  A+   I++ C     S  E  NP+S N  + P Y++PCPWFFP  +  N + P   
Sbjct: 301  QNPMAIPSNISLPCNGTADSSHELGNPISINGSRTPLYVIPCPWFFPQFEIGNGAQPQSS 360

Query: 1210 ---------------VVDSFSSKHKHSDNSTSKQPEQSSIKADNLPESNSTDNLPENSSK 1344
                              S S      DN+ S  P +  ++A    E+    +L EN ++
Sbjct: 361  CPENKQEGAFFNNQGSASSLSRTAAQLDNNQSAFPVRLDVEASGSVEARPRTDLNENPAQ 420

Query: 1345 YSTSEDSNFSN--YPN----RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTAS 1506
            +        +   +P     R IF+   L     + ++K EN L+ + +   E  S TA 
Sbjct: 421  FPLDGGDQHTGGFHPKENGPREIFLSPLLNHGGIASTIKNENGLESDFSANAEK-SMTAC 479

Query: 1507 HVGSALSE 1530
            H  SAL E
Sbjct: 480  HPFSALPE 487


>ref|XP_007027678.1| Basic-leucine zipper transcription factor family protein, putative
            isoform 3 [Theobroma cacao] gi|508716283|gb|EOY08180.1|
            Basic-leucine zipper transcription factor family protein,
            putative isoform 3 [Theobroma cacao]
          Length = 594

 Score =  197 bits (501), Expect = 1e-47
 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%)
 Frame = +1

Query: 451  LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627
            + ++D   + Q    T  ++ +K+VK+E+++ES + +      YMS GG +S+QNLTEAE
Sbjct: 184  MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 243

Query: 628  KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807
            KEA      LANRESARQTIRRRQA+CE+LT K ADL  +NENLK+ KEL +K+Y+S + 
Sbjct: 244  KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 303

Query: 808  KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987
             N +LK QM   +K+E  EAP E+   H          Q     +  PF FY +  F P 
Sbjct: 304  TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 353

Query: 988  VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161
             WPSI+   +PV T+    NA  +   I+      L S  + ENP++ N PK P Y++P 
Sbjct: 354  CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 413

Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269
            PWFF L D  N  H        +  D  S+ ++ S   + K                 + 
Sbjct: 414  PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 473

Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437
               SI+A +  ++ ++  LP + S    +Y   E+          + +PTPL S  P+  
Sbjct: 474  AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 523

Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530
            ++ EN    N     E+    A H   AL E
Sbjct: 524  VEQENTPDVN----TEAARVRACHFVGALPE 550



 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 46/143 (32%), Positives = 71/143 (49%), Gaps = 2/143 (1%)
 Frame = +1

Query: 334 KPSRKWGRKGRRSMKRV-KNESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNM 510
           K S KWG KG+R  +RV  +ESP  E   +       S  LA+   +V+ Q+   T   +
Sbjct: 56  KFSAKWGCKGKRVSRRVSSSESPPSEIGLNQVDPVQSSSDLAEDRAAVDQQQSQVTSTPV 115

Query: 511 TLKAVKSEEDSESSRPTHMYCRSYMSH-GGKSKQNLTEAEKEAXXXXXXLANRESARQTI 687
            ++++++E++SE    +H     Y S   GKS+QN   AEKE       L N+ES  Q I
Sbjct: 116 VIESIEAEQNSELLNGSHTCAARYTSKCVGKSRQN---AEKETLRLHRMLTNKESDWQMI 172

Query: 688 RRRQAMCEELTRKAADLGWDNEN 756
           R RQ +   +     D+  D ++
Sbjct: 173 RERQILYSIMGMPEKDVWEDRQH 195


>ref|XP_007027677.1| Basic-leucine zipper transcription factor family protein, putative
            isoform 2 [Theobroma cacao] gi|508716282|gb|EOY08179.1|
            Basic-leucine zipper transcription factor family protein,
            putative isoform 2 [Theobroma cacao]
          Length = 434

 Score =  197 bits (501), Expect = 1e-47
 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%)
 Frame = +1

Query: 451  LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627
            + ++D   + Q    T  ++ +K+VK+E+++ES + +      YMS GG +S+QNLTEAE
Sbjct: 24   MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 83

Query: 628  KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807
            KEA      LANRESARQTIRRRQA+CE+LT K ADL  +NENLK+ KEL +K+Y+S + 
Sbjct: 84   KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 143

Query: 808  KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987
             N +LK QM   +K+E  EAP E+   H          Q     +  PF FY +  F P 
Sbjct: 144  TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 193

Query: 988  VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161
             WPSI+   +PV T+    NA  +   I+      L S  + ENP++ N PK P Y++P 
Sbjct: 194  CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 253

Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269
            PWFF L D  N  H        +  D  S+ ++ S   + K                 + 
Sbjct: 254  PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 313

Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437
               SI+A +  ++ ++  LP + S    +Y   E+          + +PTPL S  P+  
Sbjct: 314  AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 363

Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530
            ++ EN    N     E+    A H   AL E
Sbjct: 364  VEQENTPDVN----TEAARVRACHFVGALPE 390


>ref|XP_007027676.1| Basic-leucine zipper transcription factor family protein, putative
            isoform 1 [Theobroma cacao] gi|508716281|gb|EOY08178.1|
            Basic-leucine zipper transcription factor family protein,
            putative isoform 1 [Theobroma cacao]
          Length = 595

 Score =  197 bits (501), Expect = 1e-47
 Identities = 135/391 (34%), Positives = 196/391 (50%), Gaps = 31/391 (7%)
 Frame = +1

Query: 451  LAQRDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHMYCRSYMSHGG-KSKQNLTEAE 627
            + ++D   + Q    T  ++ +K+VK+E+++ES + +      YMS GG +S+QNLTEAE
Sbjct: 185  MPEKDVWEDRQHDQMTGNDVLIKSVKAEQNAESVKSSPTCATKYMSGGGGRSRQNLTEAE 244

Query: 628  KEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKD 807
            KEA      LANRESARQTIRRRQA+CE+LT K ADL  +NENLK+ KEL +K+Y+S + 
Sbjct: 245  KEARRLRRILANRESARQTIRRRQALCEKLTLKVADLTRENENLKRAKELALKEYKSQES 304

Query: 808  KNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPH 987
             N +LK QM   +K+E  EAP E+   H          Q     +  PF FY +  F P 
Sbjct: 305  TNKHLKAQMVKAIKAEEGEAPRELKLAH----------QISGPSRNYPFYFYNQHPFPPF 354

Query: 988  VWPSIIHHLDPVHTKSIPTNASALQDPITIAC--MLSSLDESENPLSTNVPKIPYYILPC 1161
             WPSI+   +PV T+    NA  +   I+      L S  + ENP++ N PK P Y++P 
Sbjct: 355  CWPSIVQSSNPVQTQCEHQNAIVVSSSISAPTNGRLDSSHDQENPINVNGPKTPLYVVPY 414

Query: 1162 PWFFPLSDCRNVSH-------PHVVDSFSSKHKHSDNSTSK-----------------QP 1269
            PWFF L D  N  H        +  D  S+ ++ S   + K                 + 
Sbjct: 415  PWFFSLPDHGNELHLRPCCGPKNNKDETSANNRFSAGCSLKSVVHEEKYNFSLPTEVEKE 474

Query: 1270 EQSSIKADNLPESNSTDNLPENSS----KYSTSEDSNFSNYPNRAIFMPTPLRSVKPSIS 1437
               SI+A +  ++ ++  LP + S    +Y   E+          + +PTPL S  P+  
Sbjct: 475  AYGSIEASSNNQNCTSVRLPSDGSVQCIRYQIKEE----------VILPTPLCSAGPTFV 524

Query: 1438 LKYENKLQQNQTFKEESISPTASHVGSALSE 1530
            ++ EN    N     E+    A H   AL E
Sbjct: 525  VEQENTPDVN----TEAARVRACHFVGALPE 551



 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 48/144 (33%), Positives = 74/144 (51%), Gaps = 3/144 (2%)
 Frame = +1

Query: 334 KPSRKWGRKGRRSMKRVKN-ESPGGEWVKDIESSKLRSFSLAQRDCSVEDQRQPR-TIKN 507
           K S KWG KG+R  +RV + ESP  E   +       S  LA++D +  DQ+Q + T   
Sbjct: 56  KFSAKWGCKGKRVSRRVSSSESPPSEIGLNQVDPVQSSSDLAEQDRAAVDQQQSQVTSTP 115

Query: 508 MTLKAVKSEEDSESSRPTHMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQT 684
           + ++++++E++SE    +H     Y S   GKS+QN   AEKE       L N+ES  Q 
Sbjct: 116 VVIESIEAEQNSELLNGSHTCAARYTSKCVGKSRQN---AEKETLRLHRMLTNKESDWQM 172

Query: 685 IRRRQAMCEELTRKAADLGWDNEN 756
           IR RQ +   +     D+  D ++
Sbjct: 173 IRERQILYSIMGMPEKDVWEDRQH 196


>gb|AAF79444.1|AC025808_26 F18O14.26 [Arabidopsis thaliana]
          Length = 639

 Score =  184 bits (467), Expect = 1e-43
 Identities = 139/413 (33%), Positives = 203/413 (49%), Gaps = 14/413 (3%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRT---IKNMTL 516
            WG KG+R  KRVK ESP  +  +K  +S  L +  LA+     E++ +       K +T 
Sbjct: 225  WGSKGKRVRKRVKTESPPSDSLLKPPDSDTLPTPDLAEERLVKEEEEEEEVEPITKELTK 284

Query: 517  KAVKSEEDSESSRP--THMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTI 687
              VKSE + E+ +P       R   S+G G+S+QNL+EAE+E       LANRESARQTI
Sbjct: 285  APVKSEINGETPKPILASTLIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTI 344

Query: 688  RRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEA 867
            RRRQAMCEEL++KAADL ++NENL++EK+  +K++QSL+  N +LK Q+   VK + +E 
Sbjct: 345  RRRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVLKSVKPDTKE- 403

Query: 868  PGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDP-VHTKSIPT 1044
                      P +SP   Q   S    PF FY +  +    WP +    +P +     PT
Sbjct: 404  ----------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFPT 453

Query: 1045 NASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-- 1218
            +  A    IT         E EN    N  K  +Y++PCPWF P  D  N     + D  
Sbjct: 454  SGGASAKTIT-------TQEHENAADDNGQKTHFYVVPCPWFLPPPDHSNGVPFGLQDTQ 506

Query: 1219 --SFSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDSNFSNYPN 1386
              +FS+ H H D+S+++  + +     +LP          PE    Y  +E +       
Sbjct: 507  RGTFSNGH-HIDDSSARPMDVTETPRSHLPTRIKEEDSGSPETRPLYDLNESATEVLSEG 565

Query: 1387 RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGS 1545
               F  T     + + SLK+E+    ++T    ++ P   HV  +L E   GS
Sbjct: 566  GDGFPVT-----QQAYSLKHED---VSETTNGVTLMPPGHHVLISLPEKKHGS 610


>ref|NP_173381.1| basic-leucine zipper transcription factor family protein [Arabidopsis
            thaliana] gi|20466818|gb|AAM20726.1| unknown protein
            [Arabidopsis thaliana] gi|23198222|gb|AAN15638.1| unknown
            protein [Arabidopsis thaliana]
            gi|332191739|gb|AEE29860.1| bZIP transcription
            factor-like protein [Arabidopsis thaliana]
          Length = 471

 Score =  184 bits (467), Expect = 1e-43
 Identities = 139/413 (33%), Positives = 203/413 (49%), Gaps = 14/413 (3%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRT---IKNMTL 516
            WG KG+R  KRVK ESP  +  +K  +S  L +  LA+     E++ +       K +T 
Sbjct: 57   WGSKGKRVRKRVKTESPPSDSLLKPPDSDTLPTPDLAEERLVKEEEEEEEVEPITKELTK 116

Query: 517  KAVKSEEDSESSRP--THMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTI 687
              VKSE + E+ +P       R   S+G G+S+QNL+EAE+E       LANRESARQTI
Sbjct: 117  APVKSEINGETPKPILASTLIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTI 176

Query: 688  RRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEA 867
            RRRQAMCEEL++KAADL ++NENL++EK+  +K++QSL+  N +LK Q+   VK + +E 
Sbjct: 177  RRRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVLKSVKPDTKE- 235

Query: 868  PGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDP-VHTKSIPT 1044
                      P +SP   Q   S    PF FY +  +    WP +    +P +     PT
Sbjct: 236  ----------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFPT 285

Query: 1045 NASALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-- 1218
            +  A    IT         E EN    N  K  +Y++PCPWF P  D  N     + D  
Sbjct: 286  SGGASAKTIT-------TQEHENAADDNGQKTHFYVVPCPWFLPPPDHSNGVPFGLQDTQ 338

Query: 1219 --SFSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDSNFSNYPN 1386
              +FS+ H H D+S+++  + +     +LP          PE    Y  +E +       
Sbjct: 339  RGTFSNGH-HIDDSSARPMDVTETPRSHLPTRIKEEDSGSPETRPLYDLNESATEVLSEG 397

Query: 1387 RAIFMPTPLRSVKPSISLKYENKLQQNQTFKEESISPTASHVGSALSENIGGS 1545
               F  T     + + SLK+E+    ++T    ++ P   HV  +L E   GS
Sbjct: 398  GDGFPVT-----QQAYSLKHED---VSETTNGVTLMPPGHHVLISLPEKKHGS 442


>ref|XP_004161242.1| PREDICTED: uncharacterized protein LOC101224097 [Cucumis sativus]
          Length = 576

 Score =  177 bits (448), Expect = 2e-41
 Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 42/442 (9%)
 Frame = +1

Query: 253  ADLMVKIEXXXXXXXXXXXXXXXXER--EKPSRKWG--RKGRRSMKRVKNESPGGEWVKD 420
            AD MVK+E                E   +    KWG   KG+R+ K VK ESP   +   
Sbjct: 49   ADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADS 108

Query: 421  IESSKLRSFSLAQ-----------RDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHM 567
            + +       + Q           ++C+++ Q +P T   +T    K ++++ESS+ +  
Sbjct: 109  LPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVT----KMDKEAESSKVSPA 164

Query: 568  YCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGW 744
               SY   G  +S++ LTEAEKE       LANRESARQTIRRRQA+CEELTRKAADL W
Sbjct: 165  CTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAW 224

Query: 745  DNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQ 924
            +NENLK+EKE+ +K+YQSL+  N  LK Q+A  VK ++EE PG   S H+  P  P +  
Sbjct: 225  ENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTN-- 282

Query: 925  ALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH-TKSIPTNASALQDPI-TIACMLSSL 1098
                    P   + R    P+ WPS++      H   ++    S++  P    A +  S 
Sbjct: 283  -------CPLFLFSR---LPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSS 332

Query: 1099 DESENPLSTNVPKIPYYIL-PCPWFFPLSDCRNVSHPHV-----VDSFSSKHKHSDNSTS 1260
               EN  +    + P  IL P  W  P  D RN   P +      D      K  +++ +
Sbjct: 333  QTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAIT 392

Query: 1261 KQPEQSSIKADNLPESNSTDNLPENSSKYSTSEDSN------------------FSNYPN 1386
             +  ++  +  +LP +   +  P+ +   S  E SN                   +  P 
Sbjct: 393  SKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPV 452

Query: 1387 RAIFMPTPLRSVKPSISLKYEN 1452
            R +  P  L  ++PS +   +N
Sbjct: 453  RKVLSPVRLECIEPSSAATLDN 474


>ref|XP_004149227.1| PREDICTED: uncharacterized protein LOC101210630 [Cucumis sativus]
          Length = 536

 Score =  177 bits (448), Expect = 2e-41
 Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 42/442 (9%)
 Frame = +1

Query: 253  ADLMVKIEXXXXXXXXXXXXXXXXER--EKPSRKWG--RKGRRSMKRVKNESPGGEWVKD 420
            AD MVK+E                E   +    KWG   KG+R+ K VK ESP   +   
Sbjct: 9    ADQMVKVEIEAAEALAGLAVLAVRETGTQPFQTKWGIKGKGKRARKEVKTESPTSGFADS 68

Query: 421  IESSKLRSFSLAQ-----------RDCSVEDQRQPRTIKNMTLKAVKSEEDSESSRPTHM 567
            + +       + Q           ++C+++ Q +P T   +T    K ++++ESS+ +  
Sbjct: 69   LPARADLDLRIEQDRGVVKHQPSEKECTIQSQPEPETTGEVT----KMDKEAESSKVSPA 124

Query: 568  YCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGW 744
               SY   G  +S++ LTEAEKE       LANRESARQTIRRRQA+CEELTRKAADL W
Sbjct: 125  CTTSYQFFGCRRSRRTLTEAEKEERRIRRILANRESARQTIRRRQALCEELTRKAADLAW 184

Query: 745  DNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPSIHITPPKSPCDQQ 924
            +NENLK+EKE+ +K+YQSL+  N  LK Q+A  VK ++EE PG   S H+  P  P +  
Sbjct: 185  ENENLKREKEVALKEYQSLETTNKELKEQLAEAVKPKVEEIPGNHRSSHVQMPPLPTN-- 242

Query: 925  ALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH-TKSIPTNASALQDPI-TIACMLSSL 1098
                    P   + R    P+ WPS++      H   ++    S++  P    A +  S 
Sbjct: 243  -------CPLFLFSR---LPYFWPSVVQSTSSYHELPNVVVVPSSINPPANNNASVSGSS 292

Query: 1099 DESENPLSTNVPKIPYYIL-PCPWFFPLSDCRNVSHPHV-----VDSFSSKHKHSDNSTS 1260
               EN  +    + P  IL P  W  P  D RN   P +      D      K  +++ +
Sbjct: 293  QTQENFTNGTGSRAPLCILPPYSWLLPHHDFRNQQSPQIWFPAGNDQEGVYSKSQNSAIT 352

Query: 1261 KQPEQSSIKADNLPESNSTDNLPENSSKYSTSEDSN------------------FSNYPN 1386
             +  ++  +  +LP +   +  P+ +   S  E SN                   +  P 
Sbjct: 353  SKDVRAESRHSSLPSAEEENEAPDLNEAPSLDESSNPKDDTQNTVGVAVEGFDTNARAPV 412

Query: 1387 RAIFMPTPLRSVKPSISLKYEN 1452
            R +  P  L  ++PS +   +N
Sbjct: 413  RKVLSPVRLECIEPSSAATLDN 434


>ref|XP_006416500.1| hypothetical protein EUTSA_v10009681mg, partial [Eutrema salsugineum]
            gi|557094271|gb|ESQ34853.1| hypothetical protein
            EUTSA_v10009681mg, partial [Eutrema salsugineum]
          Length = 475

 Score =  174 bits (440), Expect = 2e-40
 Identities = 129/367 (35%), Positives = 186/367 (50%), Gaps = 26/367 (7%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGEW-VKDIESSKLRSFSLAQRDC--SVEDQRQPRTIKNMTLK 519
            WG KG+R  KRVK ESP  +  +K  +S  L +  LA+       ED+ QP T + +T  
Sbjct: 59   WGSKGKRVRKRVKTESPPCDSRLKPADSETLPTLDLAEGRAVKDEEDEVQPIT-REVTKV 117

Query: 520  AVKSEEDSESSRP---THMYCRSYMSHGGKSKQNLTEAEKEAXXXXXXLANRESARQTIR 690
             VK+E   E  +P   + + CRS  S  G+S+QNL+EAE+E       LANRESARQTIR
Sbjct: 118  PVKTEVTDEIPKPNIASTLRCRS--SGCGRSRQNLSEAEREERRIRRILANRESARQTIR 175

Query: 691  RRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAP 870
            RRQAMCEEL++KAADL ++NENL++EK+  +K++QSL+  N +LK Q++   K + +E  
Sbjct: 176  RRQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVSKSAKLDTKE-- 233

Query: 871  GEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNA 1050
                     P +SP   Q   S    PF FY +  +    WP +    +PV +     N 
Sbjct: 234  ---------PEESPKPSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPVISPLETQNG 284

Query: 1051 SALQDPIT----IACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRN-----VSH 1203
             A   P T     +    +  E  NP   N  K  +Y++PCPWF P  D  N        
Sbjct: 285  FAA--PFTTVGGASAKTMTSQEHGNPADDNGQKTHFYVVPCPWFLPAPDQSNGVPFAFQD 342

Query: 1204 PHVVDSFSSKHKHSDNSTSKQPE---------QSSIKADN--LPESNSTDNLPENSSKYS 1350
            P  V    S   H D+S++   E         Q+ IK ++   PE+    +L E++++  
Sbjct: 343  PQRV--IPSNGHHIDDSSANSVEVKKSLPSHLQTRIKEEDSGSPEARPLYDLNESATEVL 400

Query: 1351 TSEDSNF 1371
            +     F
Sbjct: 401  SEGGDGF 407


>ref|XP_006303620.1| hypothetical protein CARUB_v10011417mg [Capsella rubella]
            gi|482572331|gb|EOA36518.1| hypothetical protein
            CARUB_v10011417mg [Capsella rubella]
          Length = 465

 Score =  173 bits (439), Expect = 2e-40
 Identities = 123/350 (35%), Positives = 179/350 (51%), Gaps = 11/350 (3%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQP--RTIKNMTLK 519
            WG KG+R  KRVK ESP  +  +K  +S  L +  LA+     E++ +     IK +T  
Sbjct: 54   WGSKGKRVRKRVKTESPPSDSLLKPPDSETLPTPDLAEERLMKEEEEEDVQPVIKEVTKA 113

Query: 520  AVKSEEDSESSRPT-HMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRR 693
             VK+E + E+ +P      R   S+G G+S+QNL+EAE+E       LANRESARQTIRR
Sbjct: 114  PVKTEMNGETLKPNLASTIRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTIRR 173

Query: 694  RQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPG 873
            RQAMCEEL++KAADL ++NENL++EK+  +K++QSL+  N +LK Q++  VK + +E   
Sbjct: 174  RQAMCEELSKKAADLTYENENLRREKDWALKEFQSLETMNKHLKEQVSKSVKPDTKE--- 230

Query: 874  EIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNAS 1053
                 H  PPK     Q   S    PF FY +  +    WP +    +P+ +      + 
Sbjct: 231  -----HEEPPK---PSQVEMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPMISPLEFATSG 282

Query: 1054 ALQDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD----S 1221
                 IT         E E+P   N  K  +Y++PCPWF    D  N       D    +
Sbjct: 283  GAAKTIT-------PQEHEDPADDNGQKTHFYVVPCPWFLSPPDQSNGVSLGDQDTQRGT 335

Query: 1222 FSSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDS 1365
            FS+ H H D+S+++  E +     +LP          PE    Y  +E +
Sbjct: 336  FSNGH-HVDDSSARPLEVTKTLWSHLPTRIKEEDSGSPETRPLYDLNESA 384


>ref|XP_002893053.1| hypothetical protein ARALYDRAFT_312884 [Arabidopsis lyrata subsp.
            lyrata] gi|297338895|gb|EFH69312.1| hypothetical protein
            ARALYDRAFT_312884 [Arabidopsis lyrata subsp. lyrata]
          Length = 647

 Score =  171 bits (433), Expect = 1e-39
 Identities = 121/349 (34%), Positives = 181/349 (51%), Gaps = 10/349 (2%)
 Frame = +1

Query: 349  WGRKGRRSMKRVKNESPGGE-WVKDIESSKLRSFSLAQRDCSVEDQRQPRTIKNMTLKAV 525
            WG KG+R  KRVK ESP  +  +K  +S  L +  LA+     E++ +   ++ +T   V
Sbjct: 239  WGSKGKRVRKRVKTESPPSDSLLKPPDSETLPTPDLAEERLVKEEEEEE--VQPITKAPV 296

Query: 526  KSEEDSESSRPT-HMYCRSYMSHG-GKSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQ 699
            K+E + E+ +       R   S+G G+S+QNL+EAE+E       LANRESARQTIRRRQ
Sbjct: 297  KTEMNGETPKLNLASTLRCSRSNGCGRSRQNLSEAEREERRIRRILANRESARQTIRRRQ 356

Query: 700  AMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEI 879
            AMCEEL++KAADL ++NENL++EK+  +K++QSL+  N +LK Q++  VK + +E     
Sbjct: 357  AMCEELSKKAADLTYENENLRREKDWALKEFQSLETINKHLKEQVSKSVKPDTKE----- 411

Query: 880  PSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASAL 1059
                  P +S    Q   S    PF FY +  +    WP +    +P  +   P   +  
Sbjct: 412  ------PEESTKPSQVDMSTSSTPFYFYNQNPYQLFCWPHVTQSSNPTIS---PLEFATS 462

Query: 1060 QDPITIACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSHPHVVD-----SF 1224
              P   +    +  E ENP   N  K  +Y++PCPWF P  D  N S P  +      +F
Sbjct: 463  GGP---SAKSMTSQEHENPADDNGQKTHFYVVPCPWFLPPPDQSN-SVPFGLQNTQRGTF 518

Query: 1225 SSKHKHSDNSTSKQPEQSSIKADNLPE--SNSTDNLPENSSKYSTSEDS 1365
            S+ H H D+S+++  E +     +LP          PE    Y  +E +
Sbjct: 519  SNGH-HIDDSSARPIEVTETPRSHLPTRIKEEDSGSPETRPLYDLNESA 566


>gb|EXC26927.1| Transcription factor HBP-1a [Morus notabilis]
          Length = 509

 Score =  160 bits (404), Expect = 2e-36
 Identities = 117/326 (35%), Positives = 166/326 (50%), Gaps = 8/326 (2%)
 Frame = +1

Query: 328  REKPSRKWGRKGRRSMKRVKNES-PGGEWV---KDIESSKLRSFSLAQRDCSVEDQRQPR 495
            RE  +   G    R  KRVK++S P  E V    D+   +++S   +   C      +P 
Sbjct: 47   REGSAADSGGDWTRRRKRVKSQSTPPAESVTLCSDLPQDRIKSPEQSAEACR-NVIAEPS 105

Query: 496  TIKNMTLKAVKSEEDSESSRPTHMYCRS--YMSHG-GKSKQNLTEAEKEAXXXXXXLANR 666
               + + K +K ++++E  +P+ +      Y   G GKS+++LTEAEKEA      LANR
Sbjct: 106  KAHDRSEKNLKVKKETELPKPSLIGSTEPGYSLLGIGKSRRSLTEAEKEARRIRRILANR 165

Query: 667  ESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKELVMKKYQSLKDKNNYLKVQMASRV 846
            ESARQTIRRRQA+CEEL +KAADL  +NE+LK E E+ +K+Y+ L+  N  LK +MA  V
Sbjct: 166  ESARQTIRRRQALCEELIKKAADLASENESLKTEMEMALKEYRMLETTNKQLKDRMAKVV 225

Query: 847  KSEMEEAPGEIPSIHITPPKSPCDQQALSSEKPQPFPFYPRPTFSPHVWPSIIHHLDPVH 1026
            K+++EE  G    + ITP  +             P   Y  P F+P  W  +    + V 
Sbjct: 226  KADVEEILGS-QCVQITPTAA------------SPLFLYNHPPFTPLFWSPVAQSPNSVQ 272

Query: 1027 TKSIPTNASALQDPITI-ACMLSSLDESENPLSTNVPKIPYYILPCPWFFPLSDCRNVSH 1203
            T  I  NA  +   I + A       E EN  +TN P+ P YI PCPWFFP  D   +  
Sbjct: 273  TSHIAQNAIVMPSNIPLPAEGRHDSCEQENLRNTNGPETPLYIFPCPWFFPHLDPGTLLQ 332

Query: 1204 PHVVDSFSSKHKHSDNSTSKQPEQSS 1281
                 S   K+K  + ST+ Q   +S
Sbjct: 333  SQ--SSIFQKNKQDETSTNNQQSPTS 356


>ref|XP_006604635.1| PREDICTED: uncharacterized protein LOC100788624 isoform X3 [Glycine
            max]
          Length = 436

 Score =  152 bits (384), Expect = 5e-34
 Identities = 100/249 (40%), Positives = 133/249 (53%), Gaps = 5/249 (2%)
 Frame = +1

Query: 598  KSKQNLTEAEKEAXXXXXXLANRESARQTIRRRQAMCEELTRKAADLGWDNENLKQEKEL 777
            KS++NLTE EKEA      LANRESARQTIRRRQA+CEELTRKAA L  +NENLK+EKEL
Sbjct: 75   KSRRNLTEEEKEARRIRRILANRESARQTIRRRQALCEELTRKAATLVAENENLKREKEL 134

Query: 778  VMKKYQSLKDKNNYLKVQMASRVKSEMEEAPGEIPS--IHITPPKSPCDQQALSSEKPQP 951
             +K+Y+SL+  N  LK Q+A  + +E+E+ P E  S    ITP           S    P
Sbjct: 135  ALKEYESLETTNKNLKTQIAKSINTEVEKTPVEPVSSVAEITP-----------SSGNGP 183

Query: 952  FPFYPRPTFSPHVWPSIIHHLDPVHTKSIPTNASALQDPITIACM--LSSLDESENPLST 1125
            +  Y     S   WPSI+   +PVH ++   N+ A+     + C     S  +  N ++ 
Sbjct: 184  WFLYNHFPVSQIFWPSILQSSNPVHLQNTSFNSIAIPPNANVPCSSESESRHKQNNLIND 243

Query: 1126 NVPKIPYYILPCPWFFPLSDCRN-VSHPHVVDSFSSKHKHSDNSTSKQPEQSSIKADNLP 1302
            N  + P+Y+ PCPW FPL    N  S P      +S     DN +  +P  SS   + L 
Sbjct: 244  NRTQNPFYMFPCPWLFPLPQFGNGQSSPS-----NSLKDEQDNLSLCKPCSSSSSLNTLA 298

Query: 1303 ESNSTDNLP 1329
              +    LP
Sbjct: 299  NVDYQAALP 307

