BLASTX nr result

ID: Mentha23_contig00014707 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00014707
         (1543 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22066.1| hypothetical protein MIMGU_mgv1a004475mg [Mimulus...   397   e-108
gb|EYU41250.1| hypothetical protein MIMGU_mgv1a003972mg [Mimulus...   340   1e-90
gb|EPS62872.1| hypothetical protein M569_11916, partial [Genlise...   320   1e-84
gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis]     261   6e-67
ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB1...   260   1e-66
ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citr...   258   5e-66
ref|XP_007222017.1| hypothetical protein PRUPE_ppa002943mg [Prun...   257   9e-66
ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582...   254   7e-65
emb|CBI19274.3| unnamed protein product [Vitis vinifera]              253   1e-64
ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249...   253   2e-64
ref|XP_002514048.1| DNA binding protein, putative [Ricinus commu...   252   4e-64
ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245...   251   5e-64
ref|XP_002302346.1| myb family transcription factor family prote...   246   3e-62
ref|XP_007018233.1| Homeodomain-like superfamily protein isoform...   243   2e-61
ref|XP_007018232.1| Homeodomain-like superfamily protein isoform...   241   8e-61
ref|XP_007224591.1| hypothetical protein PRUPE_ppa1027142mg [Pru...   239   2e-60
ref|XP_006844749.1| hypothetical protein AMTR_s00016p00255950 [A...   234   6e-59
ref|XP_004136421.1| PREDICTED: uncharacterized protein LOC101205...   228   7e-57
ref|XP_004171594.1| PREDICTED: uncharacterized protein LOC101223...   225   4e-56
ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206...   225   4e-56

>gb|EYU22066.1| hypothetical protein MIMGU_mgv1a004475mg [Mimulus guttatus]
          Length = 525

 Score =  397 bits (1019), Expect = e-108
 Identities = 242/418 (57%), Positives = 281/418 (67%), Gaps = 18/418 (4%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHEVAKNSKE-ISGARECQVLWRHLA 1277
            I EDD+S LL+RYS++ VL LL+ V    G KIDW E+ KN+   ISGARE Q+LWRHLA
Sbjct: 10   IGEDDVSTLLQRYSVNTVLALLREVALVDGKKIDWREMVKNTATGISGAREYQMLWRHLA 69

Query: 1276 YGETLIDQLDN-DPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLP 1103
            YGETL DQ DN +  P+DD+SDLE EVE  P +GREAS EA ACVKVLIA GY  +++LP
Sbjct: 70   YGETLADQFDNHEAIPMDDDSDLECEVEAFPNVGREASTEATACVKVLIASGY--VSRLP 127

Query: 1102 SNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPN 923
            SN +IE PLTIN PN++AV A SD S   Y     N+ IPV+V K  L S G  GEKRPN
Sbjct: 128  SNLTIEGPLTINIPNSRAVPAPSDTSVLAYA-HGKNINIPVTVPKQSLPSSGC-GEKRPN 185

Query: 922  NEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWST 743
            +     N P      +WS +ED KLTA+VQK+GE NWANIA+ DF+N+R  SELSQRWST
Sbjct: 186  DG---ANLPPRRRKKAWSTQEDMKLTAAVQKYGEPNWANIAKADFDNERTPSELSQRWST 242

Query: 742  LRKKQ-GNPKAG-TSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQHQP 569
            L+KKQ GN K G TSS+P ETQLAAAHRAMSLAL+ PMGD  K+        GIK Q Q 
Sbjct: 243  LKKKQGGNLKVGTTSSKPSETQLAAAHRAMSLALDRPMGDTLKA--PRQLTTGIKPQQQS 300

Query: 568  PKTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSLI 389
             K +  P  Q+  R GPTKPQM    PS NP    DSMVK             ADASSL+
Sbjct: 301  QKPSGTPPVQQPGRAGPTKPQMPTKWPSTNPAPTPDSMVKAAAVAAGARIATSADASSLM 360

Query: 388  EAAKSQNVVHI-------------TTSIAHQLPSNVHFIRNGLAKAPISTYSAPKPSV 254
            EAA+SQ VVHI             TTSI +QLPSNVHFIRNGLAKAPIS YSA KP++
Sbjct: 361  EAARSQKVVHIKTASGSTPVVKSSTTSIVNQLPSNVHFIRNGLAKAPISNYSAAKPNI 418


>gb|EYU41250.1| hypothetical protein MIMGU_mgv1a003972mg [Mimulus guttatus]
          Length = 552

 Score =  340 bits (871), Expect = 1e-90
 Identities = 234/544 (43%), Positives = 283/544 (52%), Gaps = 51/544 (9%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHEVAKNSKE-ISG 1313
            MV+        SI E+DMS LL RYS+  VL LLQ VE+ AG KIDW+ + KN+   IS 
Sbjct: 1    MVERSRKPKKGSIDEEDMSILLERYSVKTVLTLLQEVEKVAGEKIDWNAIVKNTTTGISS 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARECQ+LWRHLAYG+ L DQ DN   P+DD+SDLE+EVE  PA+ RE S+EA ACVKVLI
Sbjct: 61   ARECQMLWRHLAYGQNLTDQFDNATNPMDDDSDLEYEVEAFPAVNRETSMEAVACVKVLI 120

Query: 1132 AG-YPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLA 956
            A  YP  +  P+N +IEAP+TIN P  KA  ++SD S      + TN+ IPVSVQK P++
Sbjct: 121  ASDYPIDSHPPNNLTIEAPMTINVPKLKAFTSASDNSVIARAIQGTNISIPVSVQKQPVS 180

Query: 955  SGGVTGEKRP-NNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNND 779
            SG   GEKRP NN  + + +P       WS E+D KLTA+V+K+GERNWANIAR DF ND
Sbjct: 181  SG-TCGEKRPPNNATSGITFPPRRRRRGWSTEDDMKLTAAVKKYGERNWANIARGDFKND 239

Query: 778  RRASELS---------------------------QRWSTLRKKQGNPKAGTSSQPPETQL 680
            R+ASELS                           QRW TLRKKQ +   GTSS+  E+QL
Sbjct: 240  RKASELSQVSLVRYSHLYSYEEFLFNLTECNQHAQRWGTLRKKQSDSNVGTSSKHSESQL 299

Query: 679  AAAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQHQPPKTATPPADQKLQRIGPTKPQML 500
            AAAHRA++LALN PMGDN            + T   PPKT  P           TKP  +
Sbjct: 300  AAAHRAITLALNTPMGDN------FHANRNMSTVAGPPKTQVPTI---------TKPNTI 344

Query: 499  ANRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSLIE-AAKSQNVVHITT--------- 350
                        DS +K             ADASSLIE AA+SQNVVHITT         
Sbjct: 345  I----------PDSKIKAAAVAAGARIATSADASSLIEAAARSQNVVHITTGGGGTSMMK 394

Query: 349  --------SIAHQLPSNVHFIRNGLAKA---PISTYSAPKPSVPETTEXXXXXXXXXXXX 203
                    +   QLPSNVHF+R    KA   PI  +SA  P                   
Sbjct: 395  SSSTTSMLTTTSQLPSNVHFMRTAQKKAAPIPIPPHSATLPPNRRPGVEAQPPQGNSAKP 454

Query: 202  XXXAVATNPTGSVQVSNTIKESAAAPTPSTKLPETKDEAVVTTSDDEKKEVGKSNEGTDV 23
               A    P   +  +     S      S  + ETK E V    D     V +S EG D 
Sbjct: 455  AELATVVEPPSGLPNAAATPPSVEVAVVSKSVNETK-EIVQKAVDLSAPLVDQSKEGVDK 513

Query: 22   AEVS 11
             + S
Sbjct: 514  HQTS 517


>gb|EPS62872.1| hypothetical protein M569_11916, partial [Genlisea aurea]
          Length = 438

 Score =  320 bits (819), Expect = 1e-84
 Identities = 200/428 (46%), Positives = 259/428 (60%), Gaps = 29/428 (6%)
 Frame = -1

Query: 1453 ISEDDMSALLRR-YSMDAVLGLLQMVEQSAGAKIDWHEVAKNSKE-ISGARECQVLWRHL 1280
            I ED  +ALLRR YS + VL LL+ + + A  KIDWHE+ KN+   I+ ARECQ+LWR++
Sbjct: 13   IGEDVAAALLRRLYSANTVLALLREISEVAAEKIDWHELVKNTATGITSARECQILWRYM 72

Query: 1279 AYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI-AGYPKINQLP 1103
            AYGETLI+   +D    DD+SD E E+E SP  GREAS EA+A VKVL+ +G+    ++P
Sbjct: 73   AYGETLIEPPGDDSRLADDDSDTEFEMEASPTPGREASFEASAYVKVLMTSGHSNDAEVP 132

Query: 1102 SNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPN 923
            +NS+IEAPL IN PN+                   +  IPV++QK  + SG   G KRP+
Sbjct: 133  NNSTIEAPLFINTPNSHGA----------------SFIIPVTLQKQSVPSG-TQGGKRPS 175

Query: 922  NEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWST 743
            N + E +        +WS EED+KLTA+VQ HGERNW++I + +F NDR  SELS RW++
Sbjct: 176  NGVPEGDLHLRRKRRNWSTEEDAKLTAAVQAHGERNWSHIVKEEFINDRSPSELSHRWAS 235

Query: 742  LRKKQGNPKAGTSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQHQPPK 563
            L++KQG+ KAG SSQ PE QLAA +RAMSLALNMPMG+  K          +        
Sbjct: 236  LKRKQGDSKAGNSSQTPEMQLAATNRAMSLALNMPMGEILKVAGQTNTGNNLSALFIYSA 295

Query: 562  ----TATPPADQKLQR-IGPTKPQ-MLANRPSVNPISDRDSMVKXXXXXXXXXXXXXADA 401
                 + P A+Q+L +   P KPQ   A  P  NP +  DSMVK             +DA
Sbjct: 296  CSWLVSAPQANQQLGKPAAPPKPQPSNAKLPVNNPAATPDSMVKAAAVAAGARIATLSDA 355

Query: 400  SSLIEAAKSQNVVHITTS------------------IAHQLPSNVHFIRNGLAKAPISTY 275
            SS +EA +SQNVVHI++S                   A QLPSNVHFIRNGLAKAPI++Y
Sbjct: 356  SSFMEATRSQNVVHISSSATGAEGSATVKKPSGASIAAGQLPSNVHFIRNGLAKAPIASY 415

Query: 274  SAP--KPS 257
            S+P  KPS
Sbjct: 416  SSPASKPS 423


>gb|EXB22546.1| hypothetical protein L484_002900 [Morus notabilis]
          Length = 854

 Score =  261 bits (667), Expect = 6e-67
 Identities = 181/445 (40%), Positives = 242/445 (54%), Gaps = 44/445 (9%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            +SE+D+ +LL+RY+   VL LL  V      KIDW+  V K+S  IS A E Q+LWRHLA
Sbjct: 14   VSEEDVVSLLQRYTATTVLTLLNEVANCTDVKIDWNVLVEKSSTGISNASEYQMLWRHLA 73

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIAGYPKINQLPSN 1097
            Y  + +++ ++  +P+DD+SDLE+E+E SP +  E S EAAACVKVLIA     +  PS 
Sbjct: 74   YRHSFLEKFEDGAQPLDDDSDLEYELEASPVVNNETSNEAAACVKVLIASGLPSDTNPSG 133

Query: 1096 SSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNNE 917
            S+IEAPLTIN PN +    S     P  + + TN+ +PVSVQK P  +  V  E    N 
Sbjct: 134  STIEAPLTINIPNGQP---SGALEQPSCSTQGTNIIVPVSVQKQPAPAVTVV-EPLDTNG 189

Query: 916  IAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTLR 737
             A  N         WS  ED +L A+VQK GE NWANI R DF  DR AS+LSQRW+ +R
Sbjct: 190  SASGNL-LKRKRKPWSEAEDLELIAAVQKCGEGNWANILRGDFKGDRTASQLSQRWAIIR 248

Query: 736  KKQGNPKAGTSS---QPPETQLAAAHRAMSLALNMP--------------------MGDN 626
            K+ GN   G+SS   Q  E QLAA H AMSLALNMP                    MG N
Sbjct: 249  KRHGNLNLGSSSNGTQLSEAQLAARH-AMSLALNMPVKNLTANTISHAGTTALNNSMGTN 307

Query: 625  K--KSXXXXXXXAG---IKTQHQPPKTATPPADQKLQRIGP-TKPQMLANRPSVNPISDR 464
               KS        G   ++ Q+Q  +      +  +  +GP TK ++   +P V      
Sbjct: 308  STNKSAGTNAAAGGNSSLQLQNQSQENLASK-ESPVGSLGPITKARIPMKKPLVKSTPSS 366

Query: 463  DSMVKXXXXXXXXXXXXXADASSLIEAAKSQNVVHI----TTSIAHQLPS---------- 326
            D+MV+             +DA+SL++AA+++N +HI    + SI   +P           
Sbjct: 367  DAMVRATAVAAGARIASPSDAASLLKAAQAKNAIHIRPTGSGSIKSSMPGGLPAPSEAHP 426

Query: 325  NVHFIRNGLAKAPISTYSAPKPSVP 251
            NVH+IR GLA AP+S Y+A  PSVP
Sbjct: 427  NVHYIRTGLASAPVSNYAAATPSVP 451


>ref|XP_006472453.1| PREDICTED: putative GPI-anchored protein PB15E9.01c-like [Citrus
            sinensis]
          Length = 603

 Score =  260 bits (664), Expect = 1e-66
 Identities = 174/429 (40%), Positives = 244/429 (56%), Gaps = 25/429 (5%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE D+S+LL+RY+ + VL LLQ V Q    K+DW+  V K S  IS ARE Q+LWRHLA
Sbjct: 14   ISEGDVSSLLQRYTANTVLALLQEVAQFPDVKLDWNALVKKTSTGISNAREYQMLWRHLA 73

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y  TL+D+L+++ +P+DD+SDLE+E+E  P +  EAS EAAACVKVLIA G P  + LP+
Sbjct: 74   YRNTLLDKLEDNAQPLDDDSDLEYELEAFPEVSSEASTEAAACVKVLIASGLPSDSSLPN 133

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
            +S +EAPLTIN PN +++ AS++ S P    +  N+ +PV+VQK+PL +   T E    N
Sbjct: 134  SSMVEAPLTINIPNGQSLRASTENSQPSSLMQGMNITVPVAVQKVPLPA--PTPEVLDAN 191

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
             +   + P       W+ EED +L ++VQK GE NWANI R DF  DR AS+LSQRW+ L
Sbjct: 192  GLIGGSMPPRKKRKPWTAEEDLELISAVQKCGEGNWANILRGDFKWDRTASQLSQRWNIL 251

Query: 739  RKKQGNPKAGTS---SQPPETQLAAAHRAMSLALNMPMGDNKKS-XXXXXXXAGIKTQHQ 572
            RKK GN   G++   SQ  E QLAA H AMSLAL+MP+ +   S            T + 
Sbjct: 252  RKKHGNVILGSNSSGSQLSEAQLAARH-AMSLALDMPVKNITASCTNTTAGTTSSATMNN 310

Query: 571  P-PKTATPPA-----DQKLQRIG----PTKPQMLANRPSVNPISDRDSMVKXXXXXXXXX 422
            P P TA   A       KL  +G      K ++   +         DS ++         
Sbjct: 311  PVPSTANAEASSVANQSKLSPVGSPGSAAKSRVPLKKMPAKSNFGADSSIRAAAVAAGAR 370

Query: 421  XXXXADASSLIEAAKSQNVVHITTSIAHQLPSNVHFIRN--------GLAKAPISTYSAP 266
                +DA+SL++ A+++  +HI       +PS V  I++         L  +P + Y  P
Sbjct: 371  IVTPSDAASLLKVAQAKKAIHI-------MPSGVSSIKSPSAGSASAHLEASPTTRYVRP 423

Query: 265  K-PSVPETT 242
              P+VP ++
Sbjct: 424  SLPAVPSSS 432


>ref|XP_006433817.1| hypothetical protein CICLE_v10000622mg [Citrus clementina]
            gi|557535939|gb|ESR47057.1| hypothetical protein
            CICLE_v10000622mg [Citrus clementina]
          Length = 612

 Score =  258 bits (659), Expect = 5e-66
 Identities = 174/429 (40%), Positives = 242/429 (56%), Gaps = 25/429 (5%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE D+S+LL+RY+ + VL LLQ V Q    K+DW+  V K S  IS ARE Q+LWRHLA
Sbjct: 14   ISEGDVSSLLQRYTANTVLALLQEVAQFPDVKLDWNALVKKTSTGISNAREYQMLWRHLA 73

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y  TL D+L+++ +P+DD+SDLE+E+E  P +  EAS EAAACVKVLIA G P  + LP+
Sbjct: 74   YRNTLFDKLEDNAQPLDDDSDLEYELEAFPEVSSEASTEAAACVKVLIASGLPSDSSLPN 133

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
            +S +EAPLTIN PN +++ AS++ S P    +  N+ +PV+VQK+PL +   T E    N
Sbjct: 134  SSMVEAPLTINIPNGQSLRASTENSQPSSLMQGMNITVPVAVQKVPLPA--PTPEVLDAN 191

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
             +   + P       W+ EED +L ++VQK GE NWANI R DF  DR AS+LSQRW+ L
Sbjct: 192  GLIGGSMPPRKKRKPWTAEEDLELISAVQKCGEGNWANILRGDFKWDRTASQLSQRWNIL 251

Query: 739  RKKQGNPKAGTS---SQPPETQLAAAHRAMSLALNMPMGDNKKS-XXXXXXXAGIKTQHQ 572
            RKK GN   G++   SQ  E QLAA H AMSLAL+MP+ +   S            T + 
Sbjct: 252  RKKHGNVILGSNSSGSQLSEAQLAARH-AMSLALDMPVKNITASCTNTTAGTTSSATMNN 310

Query: 571  P-PKTATPPA-----DQKLQRIG----PTKPQMLANRPSVNPISDRDSMVKXXXXXXXXX 422
            P P TA   A       KL  +G      K ++   +         DS ++         
Sbjct: 311  PVPSTANAEASSVANQSKLSPVGSPGSAVKSRVPLKKMPAKSNFGADSSIRAAAVAAGAR 370

Query: 421  XXXXADASSLIEAAKSQNVVHITTSIAHQLPSNVHFIRN--------GLAKAPISTYSAP 266
                +DA+SL++ A+++  +HI       +PS V  I++         L  +P + Y  P
Sbjct: 371  IVTPSDAASLLKVAQAKKAIHI-------MPSGVSSIKSPSAGSASVHLEASPTTRYVRP 423

Query: 265  K-PSVPETT 242
              P VP ++
Sbjct: 424  SLPVVPSSS 432


>ref|XP_007222017.1| hypothetical protein PRUPE_ppa002943mg [Prunus persica]
            gi|462418953|gb|EMJ23216.1| hypothetical protein
            PRUPE_ppa002943mg [Prunus persica]
          Length = 619

 Score =  257 bits (657), Expect = 9e-66
 Identities = 195/543 (35%), Positives = 268/543 (49%), Gaps = 58/543 (10%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            MV+         I+E+D + LL+RY    VL LLQ V  S   KIDW+  V K S  IS 
Sbjct: 1    MVEKTKDPEKSYITEEDTANLLQRYQAANVLHLLQEVAHSQDVKIDWNRLVEKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAY E  +D  DN  +P+DD+SDLEHE+E  PA+  E S EAAACVKVL+
Sbjct: 61   AREYQMLWRHLAYSEAFVDNFDNGAQPVDDDSDLEHELEAFPAVIGEDSTEAAACVKVLM 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPL- 959
            A G P  +   S +++EAPLTIN PN +    +   S PP + +  N+ +PVSVQK PL 
Sbjct: 121  ASGLPSDSTHRSGATVEAPLTINIPNGQP-SRTHQNSQPPCSMQGMNITVPVSVQKQPLL 179

Query: 958  ---ASGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDF 788
                S G T E    N  A  N         WS  ED +L A V+++GE NWANI R DF
Sbjct: 180  AMTTSTGATAEGGDANGSASNNMAPRKKRKKWSEAEDLELIAGVRRYGEGNWANILRGDF 239

Query: 787  NNDRRASELSQRWSTLRK---KQGNPKAGTSSQPPETQLAAAHRAMSLALNMP------- 638
              +R A++LSQRW  +RK   +  N    +S++  E QLA  H AMSLALNMP       
Sbjct: 240  KGERTANQLSQRWKYIRKHHHQDLNVGGNSSNKLSEAQLATRH-AMSLALNMPSITANTI 298

Query: 637  --MGDNKKSXXXXXXXAGIKTQHQPPKTATPPADQKLQRIGPTKP------------QML 500
               G N  S           T +  P TA     Q  Q + P KP            Q+ 
Sbjct: 299  GTAGTNTHSKFGGTN----ATTNSLPSTAAEEELQSQQGLKPAKPYQMGLLGSTSKSQLT 354

Query: 499  ANRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSLIEAAKSQNVVHIT----TSIAHQL 332
            + +    P S+ D MV+             +DA+SL++AA+++N VH+     +SI   L
Sbjct: 355  SKKTLTKPNSNTDGMVRATAVAAGARIASPSDAASLLKAAQAKNAVHVLPTGGSSIQSSL 414

Query: 331  PS----------NVHFIRNGLAKAPIS-------TYSAPKP----SVPETTEXXXXXXXX 215
            P           N+H++  GLA  P+S       T SA  P    ++P+T++        
Sbjct: 415  PGSMRTHPEPHPNLHYMHTGLAATPVSTPLSTAVTPSATHPGSLKALPQTSQHA------ 468

Query: 214  XXXXXXXAVATNPTGSVQVSNTIKE---SAAAPTPSTKLPETKDEAVVTTSDDEKKEVGK 44
                        PT S  +S  IK+   S  +    T   + +D AV+  S++ + E G+
Sbjct: 469  ------------PTNSTLLSKQIKDVSCSLDSELGCTPTEQVQDGAVI--SENGQNEEGQ 514

Query: 43   SNE 35
             ++
Sbjct: 515  KDK 517


>ref|XP_006366421.1| PREDICTED: uncharacterized protein LOC102582625 [Solanum tuberosum]
          Length = 574

 Score =  254 bits (649), Expect = 7e-65
 Identities = 172/424 (40%), Positives = 233/424 (54%), Gaps = 24/424 (5%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE+D++ LL+RYS+  VL +LQ V Q A  KIDW+  V K++  I+ ARE Q+LWRHLA
Sbjct: 12   ISEEDIAILLQRYSVSTVLAILQEVGQVADEKIDWNAMVRKSATGITNAREYQMLWRHLA 71

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y   L+D+ D++ +P+DD+SDLE+E+E  PA+  EAS EAAA  K+LIA G P    + +
Sbjct: 72   YRHGLVDKFDDEAQPLDDDSDLEYELEAFPAVSSEASAEAAASAKMLIAYGAPNDANMLN 131

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
             S+IEAPLTIN PN +      D S    +   TN+ +PV+VQK PL S  V  E    +
Sbjct: 132  GSTIEAPLTINIPNGQTSRTGMDNSFQGTSMHGTNITVPVAVQKQPL-STVVAAEGLDTH 190

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
                 N P       WS  ED +L A+VQK GE NWANI + DF  DR AS+LSQRW+ +
Sbjct: 191  GPGCTNLPPRRKRKPWSEAEDVELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWAII 250

Query: 739  RKKQGNPKAGTSSQPPETQLAAAHRAMSLALNMPMG------------DNKKSXXXXXXX 596
            RK+QG    G  SQ  E QLAA H AMS ALNMP+G            ++          
Sbjct: 251  RKRQGT-MVGNGSQLSEAQLAARH-AMSHALNMPIGAGVGPNSGSGPSNSSHPVTADLAS 308

Query: 595  AGIKTQHQPPKTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXX 416
             G ++QHQ    ++ P      RI P KP   A +P+ +P    DSM+K           
Sbjct: 309  GGAQSQHQQDPLSSKP------RIVPQKP---APKPTTSP----DSMIKVAAVAAGARIA 355

Query: 415  XXADASSLIEAAKSQNVVHI----------TTSIAHQLPSNVHFIRNGLAKAPISTYSAP 266
              ++++S ++ A+ +  + I               + LPSNVHFIR GL      ++SA 
Sbjct: 356  TSSNSASQVKLAQPKTPLQIPGGGPAVKSSVLGSTNGLPSNVHFIRTGLV-----SHSAG 410

Query: 265  KPSV 254
             P V
Sbjct: 411  PPKV 414


>emb|CBI19274.3| unnamed protein product [Vitis vinifera]
          Length = 641

 Score =  253 bits (647), Expect = 1e-64
 Identities = 194/525 (36%), Positives = 261/525 (49%), Gaps = 47/525 (8%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            MV+        +ISE+D+SALL+RY+  AVL LLQ V Q    KIDW+  V K S  IS 
Sbjct: 1    MVEMPKMRKKGTISEEDVSALLQRYTPTAVLALLQEVAQLPDVKIDWNALVNKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAYG  L+++L++  +P+DD+SDLE+++E  P+I  EAS EA ACVKVLI
Sbjct: 61   AREYQMLWRHLAYGHALLEKLEDGAQPLDDDSDLEYDLEAFPSISTEASAEATACVKVLI 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLA 956
            A   P  + LP++S +EAPLTIN P  ++  A S+ S    + + TN+ IPVSVQK    
Sbjct: 121  ASSLPSDSSLPNSSMVEAPLTINIPCGQSSRAPSEYSRLSGSMQGTNITIPVSVQK---- 176

Query: 955  SGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDR 776
                  E    N     + P       WS +ED +L A+VQK GE NWANI + DF  DR
Sbjct: 177  -----SEGFDANGSTSGSLPARKKRKPWSSDEDKELIAAVQKCGEGNWANILKGDFKGDR 231

Query: 775  RASELSQRWSTLRKKQGNPKAG----TSSQPPETQLAAAHRAMSLALNMPM--------- 635
             AS+LSQRW+ +RKK  N   G      SQ  E QLAA H AMSLAL+MP+         
Sbjct: 232  SASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARH-AMSLALDMPVKNLTTSSSI 290

Query: 634  -GDNKKSXXXXXXXAGIKTQHQPPKTATPPADQKLQRIGPT-------------KPQMLA 497
             G N  +            +  P  T    A Q+L + GP              K +  +
Sbjct: 291  AGTNPNATSSNSAFPATPAEALPASTNISQA-QQLSQQGPVSTLSQMGSLGSAPKSRATS 349

Query: 496  NRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSLIEAAKSQNVVHI----TTSI----- 344
             + S        SM+K             + A+SL++ A+S+N VHI    +T I     
Sbjct: 350  KKTSAKSTFSSQSMLKATAVAAGARIATPSAAASLLKDAQSRNAVHIMPGGSTLIKSSVA 409

Query: 343  --AHQLPS-------NVHFIRNGLAKAPISTYSAPKPSVPETTEXXXXXXXXXXXXXXXA 191
              A+ LP+       NVH+   G     +STYSA  PSV  T                  
Sbjct: 410  GGANPLPANHLGAHPNVHYKCAGPPTTSLSTYSAVAPSVSRTGSAKPAAPGGQLAPSPS- 468

Query: 190  VATNPTGSVQVSNTIKESAAAPTPSTKLPETKDEAVVTTSDDEKK 56
             AT+   S + +N    S A   P+ +  +T +E  V  S +  K
Sbjct: 469  -ATSVNISSEQTNAATTSLAVEYPAKQETKTSEETKVPISGNVPK 512


>ref|XP_004252718.1| PREDICTED: uncharacterized protein LOC101249442 [Solanum
            lycopersicum]
          Length = 569

 Score =  253 bits (645), Expect = 2e-64
 Identities = 182/485 (37%), Positives = 253/485 (52%), Gaps = 18/485 (3%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE+D++ LL+RYS+  VL +L+ V Q A  KIDW+  V K++  I+ ARE Q+LWRHLA
Sbjct: 12   ISEEDIAILLQRYSVSTVLAILREVGQVADEKIDWNVMVRKSTTGITNAREYQMLWRHLA 71

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y   LID+ D++ +P+DD+SDLE E+E  PA+  EAS EAAA  K+LIA G P    + +
Sbjct: 72   YRHDLIDKFDDEAQPLDDDSDLEFELEAFPAVSSEASAEAAASAKMLIASGAPNDANMLN 131

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
             S+IEAPLTIN PN +      D S    +   TN+ +PV+VQK PL S  V  E    +
Sbjct: 132  GSTIEAPLTINIPNGQTSRTGMDNSFQGTSMHGTNITVPVAVQKQPL-STVVAAEGLDTH 190

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
                 N P       WS  ED +L A+VQK GE NWANI + DF  DR AS+LSQRW+ +
Sbjct: 191  GPGCTNLPPRRKRKPWSEAEDVELIAAVQKCGEGNWANILKGDFKGDRTASQLSQRWAII 250

Query: 739  RKKQGNPKAGTSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQHQPPKT 560
            RK+QG    G  SQ  E QLAA H AMS ALNMP+G +           G  +    P T
Sbjct: 251  RKRQGT-MVGNGSQLSEAQLAARH-AMSHALNMPIGAS-----VGPNSGGGSSNSSLPVT 303

Query: 559  ATPPA----DQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSL 392
            A   +     Q  Q    +KP+++  +P+  P +  DSMVK             ++++S 
Sbjct: 304  ADLASGGAQSQHQQDPLSSKPRIVPQKPAPKPTTSSDSMVKVTAVAAGARIATSSNSASQ 363

Query: 391  IEAAKSQNVVHI----------TTSIAHQLPSNVHFIRNGLAKAPISTYSAPKPSVPETT 242
            ++ A+ +  + I               + LPSNVHFIR GL    +S  + P  +V    
Sbjct: 364  VKLAQPKTPLQIPGGGSAVKSSVLGSTNGLPSNVHFIRTGL----VSHSAGPPKAVHSAG 419

Query: 241  EXXXXXXXXXXXXXXXAVATNPTGSVQ-VSNTIKESA-AAPTPSTKLPETKDEAVVTTSD 68
                                +PT   + + N+ K +A A PT  T  P    E  V T+ 
Sbjct: 420  PSHASRPGTQQGLSHSLKPASPTVQPKPIGNSSKPNALAVPTAPTSTPVA--ELKVNTNQ 477

Query: 67   DEKKE 53
            + +++
Sbjct: 478  EVQQD 482


>ref|XP_002514048.1| DNA binding protein, putative [Ricinus communis]
            gi|223547134|gb|EEF48631.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 608

 Score =  252 bits (643), Expect = 4e-64
 Identities = 169/417 (40%), Positives = 227/417 (54%), Gaps = 18/417 (4%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE+D+S+LL+RY+ + VL LLQ V Q  G KIDW+  V K +  I   RE Q+LWRHLA
Sbjct: 15   ISEEDISSLLQRYTANTVLALLQEVAQFEGVKIDWNALVKKTTTGIKNVREYQMLWRHLA 74

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y  TLID LD+  +P+DD+SDLE+E+E  P +  EAS EAAACVKVLIA G    +  P+
Sbjct: 75   YKHTLIDNLDDGAQPLDDDSDLEYELEAFPDVSSEASAEAAACVKVLIASGATSDSTHPN 134

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
            ++++EAPLTIN PN ++  A S+ S P  T R  N+ +PVS+QK PL +   T E    N
Sbjct: 135  SATVEAPLTINIPNGQSARAISENSQPA-TMRGMNITVPVSIQKQPLPTVAST-EVFDGN 192

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
             +   N P       WS  ED +L A+VQK+GE NWANI R +F  DR AS+LSQRW+ +
Sbjct: 193  GLGNGNIPPRRKRKPWSEAEDLELIAAVQKYGEGNWANILRSEFTWDRTASQLSQRWAII 252

Query: 739  RKKQG--NPKAGTSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQHQPP 566
            RK+ G  NP   TS      +  AA  AM+LAL+ P+   K              QHQ  
Sbjct: 253  RKRHGNWNPVGNTSGVQLSEEWRAARHAMNLALDPPV---KNKFTNNISGEATPAQHQSQ 309

Query: 565  KTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXXXXADASSLIE 386
            +     +   +      K Q+   RP+   +S     V+             +DA+SL++
Sbjct: 310  RPFAAKSSPMVPLGSAPKSQIAVKRPAKPDLS--SDPVRATAVAAGARIATQSDAASLLK 367

Query: 385  AAKSQNVVHIT----TSIAHQLPS----------NVHFIRNGLAKAPISTYSAPKPS 257
            AA+++N VHI     +S+   LP           NVH   N LA    ST     PS
Sbjct: 368  AAQAKNAVHIMPTGGSSMKSALPGGASNHSEAHPNVH--TNDLAAGSRSTLPVVSPS 422


>ref|XP_002283801.2| PREDICTED: uncharacterized protein LOC100245507 [Vitis vinifera]
          Length = 606

 Score =  251 bits (642), Expect = 5e-64
 Identities = 189/503 (37%), Positives = 258/503 (51%), Gaps = 25/503 (4%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            MV+        +ISE+D+SALL+RY+  AVL LLQ V Q    KIDW+  V K S  IS 
Sbjct: 1    MVEMPKMRKKGTISEEDVSALLQRYTPTAVLALLQEVAQLPDVKIDWNALVNKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAYG  L+++L++  +P+DD+SDLE+++E  P+I  EAS EA ACVKVLI
Sbjct: 61   AREYQMLWRHLAYGHALLEKLEDGAQPLDDDSDLEYDLEAFPSISTEASAEATACVKVLI 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLA 956
            A   P  + LP++S +EAPLTIN P  ++  A S+ S    + + TN+ IPVSVQK    
Sbjct: 121  ASSLPSDSSLPNSSMVEAPLTINIPCGQSSRAPSEYSRLSGSMQGTNITIPVSVQK---- 176

Query: 955  SGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDR 776
                  E    N     + P       WS +ED +L A+VQK GE NWANI + DF  DR
Sbjct: 177  -----SEGFDANGSTSGSLPARKKRKPWSSDEDKELIAAVQKCGEGNWANILKGDFKGDR 231

Query: 775  RASELSQRWSTLRKKQGNPKAG----TSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXX 608
             AS+LSQRW+ +RKK  N   G      SQ  E QLAA H AMSLAL+MP+    K+   
Sbjct: 232  SASQLSQRWTIIRKKHKNLNVGGANSNGSQLSEAQLAARH-AMSLALDMPV----KNLTT 286

Query: 607  XXXXAGIKTQHQPPKTATPPADQKLQRIGPT-KPQMLANRPSVNPISDRDSMVKXXXXXX 431
                   +   Q P +       ++  +G   K +  + + S        SM+K      
Sbjct: 287  TNISQAQQLSQQGPVSTL----SQMGSLGSAPKSRATSKKTSAKSTFSSQSMLKATAVAA 342

Query: 430  XXXXXXXADASSLIEAAKSQNVVHI----TTSI-------AHQLPS-------NVHFIRN 305
                   + A+SL++ A+S+N VHI    +T I       A+ LP+       NVH+   
Sbjct: 343  GARIATPSAAASLLKDAQSRNAVHIMPGGSTLIKSSVAGGANPLPANHLGAHPNVHYKCA 402

Query: 304  GLAKAPISTYSAPKPSVPETTEXXXXXXXXXXXXXXXAVATNPTGSVQVSNTIKESAAAP 125
            G     +STYSA  PSV  T                   AT+   S + +N    S A  
Sbjct: 403  GPPTTSLSTYSAVAPSVSRTGSAKPAAPGGQLAPSPS--ATSVNISSEQTNAATTSLAVE 460

Query: 124  TPSTKLPETKDEAVVTTSDDEKK 56
             P+ +  +T +E  V  S +  K
Sbjct: 461  YPAKQETKTSEETKVPISGNVPK 483


>ref|XP_002302346.1| myb family transcription factor family protein [Populus trichocarpa]
            gi|222844072|gb|EEE81619.1| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 677

 Score =  246 bits (627), Expect = 3e-62
 Identities = 172/458 (37%), Positives = 240/458 (52%), Gaps = 42/458 (9%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            M++         ISE+D+S LL+RY+   +L LLQ V Q  GAKIDW+  V K S  IS 
Sbjct: 1    MIEKSKKNKKGVISEEDVSTLLQRYTATTLLALLQEVAQFDGAKIDWNALVKKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDN-SDLEHEVEPSPAIGREASVEAAACVKVL 1136
            ARE Q+LWRHLAY   L ++ D+   P+DD+ SDLE E+E  P++  EAS EAAACVKVL
Sbjct: 61   AREYQMLWRHLAYRHVLPEKFDDGAHPLDDDDSDLESELEAFPSVTSEASTEAAACVKVL 120

Query: 1135 IA-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKI-- 965
            IA G P  +  P+N+++EAPLTIN PN +++ A+S+ S      R  N+ +PVSVQK+  
Sbjct: 121  IASGLPSDSTHPNNTTVEAPLTINIPNGRSLRATSENSQSDVM-RGVNIRVPVSVQKLSL 179

Query: 964  PLASGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFN 785
            P        E    N      +P       WS  ED +L A+VQK GE NWA+I R +F 
Sbjct: 180  PAVMSCPASEVYDANGSGSGTFPPRRKRKPWSEAEDMELIAAVQKLGEGNWASIVRGEFK 239

Query: 784  NDRRASELSQRWSTLRKKQGNPKAGTSSQPPETQLAAAHRAMSLALNMPMGDNKKSXXXX 605
             DR AS+LSQRW+ +RK+ GN   GT S  P  QL+   RA   A+ M +  +  +    
Sbjct: 240  GDRTASQLSQRWAIIRKRHGNLNVGTVSSAP--QLSETQRAARDAVKMALDPHPAAKSLI 297

Query: 604  XXXAGIKTQHQPPKTATP-------PADQKLQR------------IGP-TKPQMLANRPS 485
               AG  +   P   A+P       PA  + Q+            +GP  K Q++  + S
Sbjct: 298  ASSAGTTSTKTPNNCASPTITAEASPAQHQSQQRTMMTKSSSIWPVGPAAKSQVMLAKAS 357

Query: 484  VNPISDRDSMVKXXXXXXXXXXXXXADASSLIEAAKSQNVVHITTSIAHQLPS------- 326
               I   D  V+             +DA+SL++AA+++N VHI  + +  + S       
Sbjct: 358  EKSILSSDP-VRAAAVAAGARIATQSDAASLLKAAQAKNAVHIMPTGSSSIKSSMTGGIS 416

Query: 325  -------NVHFIRNGLAKAPIST---YSAPKPSVPETT 242
                   N  FI +G+A AP +T    S P P +P+ T
Sbjct: 417  THLDVNPNTRFISSGMATAPTTTRPPASGPCPGLPKAT 454


>ref|XP_007018233.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao]
            gi|508723561|gb|EOY15458.1| Homeodomain-like superfamily
            protein isoform 2 [Theobroma cacao]
          Length = 606

 Score =  243 bits (620), Expect = 2e-61
 Identities = 168/477 (35%), Positives = 241/477 (50%), Gaps = 4/477 (0%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            M++        S+SE+D+S+LL+RY+   VL LLQ V Q  G K++W+  V K S  IS 
Sbjct: 1    MIEKTKKQKKGSVSEEDISSLLQRYTATTVLALLQEVAQFPGVKLNWNALVKKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAY + L+++L++  EP+DD SDLE+E+EP P++  EAS EAAACVKVLI
Sbjct: 61   AREYQMLWRHLAYRDVLLEKLEDGAEPLDDESDLEYELEPCPSVSSEASAEAAACVKVLI 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLA 956
            A G P  + LP++S++EAPLTIN PN ++  ASS+ S P  + R  N+ +PVSVQK  L 
Sbjct: 121  ASGLPSDSSLPNSSTVEAPLTINIPNGQSFRASSENSQPTCSMRGMNITVPVSVQKQILP 180

Query: 955  SGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDR 776
            +          N ++  N P       WS  ED +L A+VQK G  NWANI R DF  DR
Sbjct: 181  AVTSAETSLEGNGLSGANLPARRKRKPWSEAEDRELIAAVQKCGVGNWANILRGDFKGDR 240

Query: 775  RASELSQRWSTLRKKQGNPKAGTSSQPPETQLA--AAHRAMSLALNMPMGDNKKSXXXXX 602
             AS+L+QRW+ ++K+ GN     +S  P+   A  A   A+SLAL+MP   +K       
Sbjct: 241  SASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMP---DKNLTSACP 297

Query: 601  XXAGIKTQHQPPKTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXX 422
                +KT      +A P    +      ++ Q   N+P   PI+   +            
Sbjct: 298  SNPALKTTSS--NSALPSTSGEASVPAQSQFQQAHNQPQKGPITSVPAQ----------N 345

Query: 421  XXXXADASSLIEAAKSQNVVHITTSIAHQLPSNVHFIRNGLAKAPISTYSAPKPSVPETT 242
                   +SL  + +SQ    IT +      S +   R GL K P  ++S+    +  T 
Sbjct: 346  LSQQGPVASLQVSNQSQQGPMITKTSPGSSGSTLK-SRVGLKKPPAKSFSSTGSILDATA 404

Query: 241  EXXXXXXXXXXXXXXXAVATNPTGSVQVSNTIKESAAAPTPSTKLPETKDEAVVTTS 71
                              A     ++ +  +   SA    PS K P  + E   + S
Sbjct: 405  VAAGARIGGPKAAASLLKAAQSKNAIHIMTSSGSSAKPLMPSVKSPIQRVEHTPSAS 461


>ref|XP_007018232.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
            gi|508723560|gb|EOY15457.1| Homeodomain-like superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 674

 Score =  241 bits (614), Expect = 8e-61
 Identities = 135/288 (46%), Positives = 184/288 (63%), Gaps = 4/288 (1%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            M++        S+SE+D+S+LL+RY+   VL LLQ V Q  G K++W+  V K S  IS 
Sbjct: 1    MIEKTKKQKKGSVSEEDISSLLQRYTATTVLALLQEVAQFPGVKLNWNALVKKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAY + L+++L++  EP+DD SDLE+E+EP P++  EAS EAAACVKVLI
Sbjct: 61   AREYQMLWRHLAYRDVLLEKLEDGAEPLDDESDLEYELEPCPSVSSEASAEAAACVKVLI 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLA 956
            A G P  + LP++S++EAPLTIN PN ++  ASS+ S P  + R  N+ +PVSVQK  L 
Sbjct: 121  ASGLPSDSSLPNSSTVEAPLTINIPNGQSFRASSENSQPTCSMRGMNITVPVSVQKQILP 180

Query: 955  SGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDR 776
            +          N ++  N P       WS  ED +L A+VQK G  NWANI R DF  DR
Sbjct: 181  AVTSAETSLEGNGLSGANLPARRKRKPWSEAEDRELIAAVQKCGVGNWANILRGDFKGDR 240

Query: 775  RASELSQRWSTLRKKQGNPKAGTSSQPPETQLA--AAHRAMSLALNMP 638
             AS+L+QRW+ ++K+ GN     +S  P+   A  A   A+SLAL+MP
Sbjct: 241  SASQLAQRWTIIKKRLGNLNVEGNSTIPQLSEAQLATRSALSLALDMP 288


>ref|XP_007224591.1| hypothetical protein PRUPE_ppa1027142mg [Prunus persica]
            gi|462421527|gb|EMJ25790.1| hypothetical protein
            PRUPE_ppa1027142mg [Prunus persica]
          Length = 639

 Score =  239 bits (610), Expect = 2e-60
 Identities = 186/527 (35%), Positives = 265/527 (50%), Gaps = 32/527 (6%)
 Frame = -1

Query: 1489 MVDXXXXXXXXSISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISG 1313
            MV+        SI+E+D + LL+RY+   VL LLQ V     AKIDW   VAK S  IS 
Sbjct: 1    MVEKTKDPKKCSITEEDTATLLQRYTATTVLALLQEVAHWPEAKIDWIRLVAKTSTGISN 60

Query: 1312 ARECQVLWRHLAYGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI 1133
            ARE Q+LWRHLAY E L+D+ DN  +P+DD+SDLE+E+E  PA+  EAS EAAACVKVLI
Sbjct: 61   AREYQMLWRHLAYREALVDKFDNGSQPLDDDSDLEYELEAFPAVCGEASTEAAACVKVLI 120

Query: 1132 A-GYPKINQLPSNSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPL- 959
            A G P  +   + +++EAPLTIN PN +    + + S P  + +  N+ +PVSV+K PL 
Sbjct: 121  ASGLPSDSSHRNGTTVEAPLTINIPNGQP-SRTHENSEPTCSMQGKNITVPVSVKKQPLP 179

Query: 958  ---ASGGVTGEKRPNNEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDF 788
                S   T +    N  A  +         WS  ED +L A+VQK GE NWANI R DF
Sbjct: 180  SATTSSVATADGGDANGSASNSMAPRKKRKKWSEAEDFELIAAVQKCGEGNWANILRADF 239

Query: 787  NNDRRASELSQRWSTLRKKQGNPKAG--TSSQPPETQLAAAHRAMSLALNMP-------- 638
              DR A +LSQRW+ ++K+      G  +S +  E QLAA H ++S+ALNMP        
Sbjct: 240  KGDRTAGQLSQRWAIIKKRNQELNLGGNSSGKLSEAQLAARH-SLSVALNMPNLTAKTIG 298

Query: 637  -MGDN-------KKSXXXXXXXAGIKTQHQPPKTATP-PADQKLQRIG-PTKPQMLANRP 488
              G N       K +        G K + Q  +   P     +++ +G  TK Q+ +   
Sbjct: 299  TAGTNAHNKFARKVATSNPVLTTGAKAEPQSQQDLKPTKKPYQMELLGSTTKSQVTSKNT 358

Query: 487  SVNPISDRDSMVKXXXXXXXXXXXXXADASSLIEAAKSQNVVHI---TTSIAHQLPSNVH 317
               P  + D +V+             +DA+SL++AA+++N VHI   + SI   LP    
Sbjct: 359  LTKPNCNDDDIVRAIAVAAGARIASPSDAASLLKAAQAKNAVHIMPTSGSIQSSLPGG-- 416

Query: 316  FIRNGLAKAPISTYSAPKPSVPETTEXXXXXXXXXXXXXXXAVATNPTGSVQVSNTIKES 137
                      +ST+S P P++   T                  A +P  S  +       
Sbjct: 417  ----------MSTHSEPHPNLHMRTGLAGITLSTPPPTDVTPSAVHPGSSKAL-----PP 461

Query: 136  AAAPTPSTKLPETKDEAVVTTSDDEK---KEVGKSNEGTDVAEVSGC 5
             + PTP+     ++    V+ S D K   K+  ++ EG+ +AE+ GC
Sbjct: 462  MSQPTPTNGTLLSRQIKGVSCSLDAKLPSKQEVRTEEGSVIAEL-GC 507


>ref|XP_006844749.1| hypothetical protein AMTR_s00016p00255950 [Amborella trichopoda]
            gi|548847220|gb|ERN06424.1| hypothetical protein
            AMTR_s00016p00255950 [Amborella trichopoda]
          Length = 661

 Score =  234 bits (598), Expect = 6e-59
 Identities = 158/379 (41%), Positives = 213/379 (56%), Gaps = 14/379 (3%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHE-VAKNSKEISGARECQVLWRHLA 1277
            ISE+D S LL+RY+   +L LLQ V Q AG K+DW+  V K S  IS ARE Q+LWRHLA
Sbjct: 41   ISEEDASLLLQRYTATTILALLQEVAQFAGPKVDWNVLVKKTSTGISNAREYQMLWRHLA 100

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIAGYPKINQLPSN 1097
            Y   L ++L++D EP+DD+SDLE EVE SP    EA  EA ACVKVLIA     +  PSN
Sbjct: 101  YRTALAEKLEDDAEPMDDDSDLEFEVEASPTPSNEALAEATACVKVLIA---SSDPGPSN 157

Query: 1096 SS-IEAPLTINKP-NTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPN 923
             + IEAPLTIN P N + + A S+  +   T + TN+ +PVSVQK PL +   + E   +
Sbjct: 158  RTIIEAPLTINVPNNAQTLPAQSENRNSSCTGQGTNITVPVSVQKQPLPT-VTSAEGLNS 216

Query: 922  NEIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWST 743
            N +A +          W+ EED +L A+VQK GE NWANI + DF +DR AS+LSQRWS 
Sbjct: 217  NGVAGL---PRRKRKPWTSEEDKELIAAVQKCGEGNWANILKGDFKHDRTASQLSQRWSI 273

Query: 742  LRKKQGN--PKAGTSSQPPETQLA--AAHRAMSLALNMPMGDNKKSXXXXXXXAGIKTQH 575
            ++KKQ N   K G SS       A  A  +A+S+ALNMP+  N  S       + I    
Sbjct: 274  IKKKQANSDSKVGGSSNSSALTEAQQATRQAVSIALNMPISSNTLSSGGSGTFSSIVRPP 333

Query: 574  QPPKTATPPADQKLQRIGPTKPQMLANRPS-------VNPISDRDSMVKXXXXXXXXXXX 416
             P  +  P         GP+K +  A + +       + P +  + +V+           
Sbjct: 334  APLFSQVPQQGPDQAHRGPSKARPPAKKATPTQGQAQMKPTNGPNPLVQAAAVAAGARIA 393

Query: 415  XXADASSLIEAAKSQNVVH 359
              +  +SL++AA+S NVVH
Sbjct: 394  PASTVASLLKAAQSGNVVH 412


>ref|XP_004136421.1| PREDICTED: uncharacterized protein LOC101205013 [Cucumis sativus]
          Length = 385

 Score =  228 bits (580), Expect = 7e-57
 Identities = 144/380 (37%), Positives = 219/380 (57%), Gaps = 14/380 (3%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHEVAKN-SKEISGARECQVLWRHLA 1277
            IS +D S LL RYS+  +L LL+ V Q +G +IDW ++ +N S  IS ARE Q+LWRHLA
Sbjct: 13   ISMEDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQLLWRHLA 72

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPS 1100
            Y +TL++ + +  + +D +SDL+ EVEP P++  E+S EA+ACVKVLIA   P  + +P+
Sbjct: 73   YRQTLLEDMHSVTDSLDYDSDLDFEVEPFPSVSSESSNEASACVKVLIANSIPNESDVPN 132

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
            +S++EAPLTI   N +    + D     Y  R+ +V IP+S+Q+ P+     T       
Sbjct: 133  SSAVEAPLTIGISNCQPSTDNLDHHQSTYLQRM-SVTIPLSIQRQPIPMPSAT------- 184

Query: 919  EIAEVN-YPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWST 743
            E+ +VN          WS  ED +L A+V+K GE NWANI + DF  DR AS+LSQRWS 
Sbjct: 185  EVIDVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTASQLSQRWSV 244

Query: 742  LRKKQGNPKAG--TSSQPPETQLAAAHRAMSLALNMPMGDNKKS---------XXXXXXX 596
            +RK++ N   G  TSS   + Q+ AAHRA+S AL++P+ ++K +                
Sbjct: 245  IRKRRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGSE 304

Query: 595  AGIKTQHQPPKTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXX 416
            + I+ Q+Q P+ + P      +RI   K  ++     +    D DS+V+           
Sbjct: 305  SSIQMQNQSPQISMPS-----RRINTPKNSLM-----IKSTHDSDSIVRATAVAAGARIV 354

Query: 415  XXADASSLIEAAKSQNVVHI 356
              +DA+SL++A +++N +HI
Sbjct: 355  SPSDAASLLKATQTKNAIHI 374


>ref|XP_004171594.1| PREDICTED: uncharacterized protein LOC101223915 [Cucumis sativus]
          Length = 371

 Score =  225 bits (574), Expect = 4e-56
 Identities = 142/377 (37%), Positives = 217/377 (57%), Gaps = 14/377 (3%)
 Frame = -1

Query: 1444 DDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHEVAKN-SKEISGARECQVLWRHLAYGE 1268
            +D S LL RYS+  +L LL+ V Q +G +IDW ++ +N S  IS ARE Q+LWRHLAY +
Sbjct: 2    EDCSPLLERYSVRTILTLLREVAQVSGVRIDWDKLVENTSTGISDAREYQLLWRHLAYRQ 61

Query: 1267 TLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLIA-GYPKINQLPSNSS 1091
            TL++ + +  + +D +SDL+ EVEP P++  E+S EA+ACVKVLIA   P  + +P++S+
Sbjct: 62   TLLEDMHSVTDSLDYDSDLDFEVEPFPSVSSESSNEASACVKVLIANSIPNESDVPNSSA 121

Query: 1090 IEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNNEIA 911
            +EAPLTI   N +    + D     Y  R+ +V IP+S+Q+ P+     T       E+ 
Sbjct: 122  VEAPLTIGISNCQPSTDNLDHHQSTYLQRM-SVTIPLSIQRQPIPMPSAT-------EVI 173

Query: 910  EVN-YPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTLRK 734
            +VN          WS  ED +L A+V+K GE NWANI + DF  DR AS+LSQRWS +RK
Sbjct: 174  DVNGATSRKRRKPWSKAEDLELIAAVEKCGEGNWANILKGDFKGDRTASQLSQRWSVIRK 233

Query: 733  KQGNPKAG--TSSQPPETQLAAAHRAMSLALNMPMGDNKKS---------XXXXXXXAGI 587
            ++ N   G  TSS   + Q+ AAHRA+S AL++P+ ++K +                + I
Sbjct: 234  RRCNLNIGASTSSTAHKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGSESSI 293

Query: 586  KTQHQPPKTATPPADQKLQRIGPTKPQMLANRPSVNPISDRDSMVKXXXXXXXXXXXXXA 407
            + Q+Q P+ + P      +RI   K  ++     +    D DS+V+             +
Sbjct: 294  QMQNQSPQISMPS-----RRINTPKNSLM-----IKSTHDSDSIVRATAVAAGARIVSPS 343

Query: 406  DASSLIEAAKSQNVVHI 356
            DA+SL++A +++N +HI
Sbjct: 344  DAASLLKATQTKNAIHI 360


>ref|XP_004152740.1| PREDICTED: uncharacterized protein LOC101206820 [Cucumis sativus]
          Length = 659

 Score =  225 bits (574), Expect = 4e-56
 Identities = 169/494 (34%), Positives = 244/494 (49%), Gaps = 21/494 (4%)
 Frame = -1

Query: 1453 ISEDDMSALLRRYSMDAVLGLLQMVEQSAGAKIDWHEVAKN-SKEISGARECQVLWRHLA 1277
            ++E D S+LLRRYS   VL LLQ V Q+  AKIDW+++ KN S  IS  RE Q+LWRHLA
Sbjct: 8    VTEKDFSSLLRRYSPTTVLALLQEVAQAPDAKIDWNDLVKNTSTGISNPREYQMLWRHLA 67

Query: 1276 YGETLIDQLDNDPEPIDDNSDLEHEVEPSPAIGREASVEAAACVKVLI-AGYPKINQLPS 1100
            Y   L+D L+++  P++D+SDLE ++EP P++  E   EAAAC KV I +G P    +P+
Sbjct: 68   YRHALLDDLEDEKAPLEDDSDLECDLEPFPSVSCETLTEAAACAKVFISSGSPSDLNVPN 127

Query: 1099 NSSIEAPLTINKPNTKAVCASSDGSHPPYTPRVTNVCIPVSVQKIPLASGGVTGEKRPNN 920
            +S IEAPLTI+ P +       +   P  + +   + +PVSVQ+ P+ +   + E    N
Sbjct: 128  SSIIEAPLTISLPRSYTDGVQFENVDPACSVKGAIITVPVSVQRQPVLA-PPSAEGLNTN 186

Query: 919  EIAEVNYPXXXXXXSWSLEEDSKLTASVQKHGERNWANIARWDFNNDRRASELSQRWSTL 740
                 N         WS  ED +L A+V+K GE NWANI R DF +DR AS+LSQRW+ +
Sbjct: 187  GPTYGNNASRRKRKPWSEAEDLELMAAVKKCGEGNWANIIRGDFLSDRTASQLSQRWAII 246

Query: 739  RKKQGNPKAGTS---SQPPETQLAAAHRAMSLALNMPMGDNKKSXXXXXXXAGI------ 587
            +KK GN   G +   +Q  E QLAA H AMS+AL   +G  K         + I      
Sbjct: 247  KKKHGNLNVGVNTAGTQLSEVQLAARH-AMSVALGRHVGSLKARINGSASTSTIGNGSSL 305

Query: 586  ----KTQHQPPKTATPPADQKLQRIGPT----KPQMLANRPSVNPIS-DRDSMVKXXXXX 434
                 ++    K    P   K   IG +    K Q+  ++  V   S D D +V+     
Sbjct: 306  TTVATSEQVQDKLHQSPTHAKPSSIGSSSLTAKTQVTTSKKMVPKSSFDSDCIVRAAAVA 365

Query: 433  XXXXXXXXADASSLIEAAKSQNVVHITTSIAHQLPSNVHFIRNGLAKAPISTYSAPKPSV 254
                    ADA+SL++AA+S+N +HI   +    P++   +  G  + P    + P   +
Sbjct: 366  AGARIASPADAASLLKAAQSKNAIHIMAKV----PASTKTLTPG--RGPSHLEAHPSIKL 419

Query: 253  PETTEXXXXXXXXXXXXXXXAVATNPTGSVQV-SNTIKESAAAPTPSTKLPETKDEAVVT 77
            P  +                +  T    SVQ   NT   SA A T S     T   +  +
Sbjct: 420  PTLSTTPTVVPSRGGPLKITSPTTAKLSSVQTDQNTAVASATASTASATDQNTAVASTAS 479

Query: 76   TSDDEKKEVGKSNE 35
                 +KE+  + E
Sbjct: 480  ADSLSEKEIKIAEE 493

