BLASTX nr result

ID: Mentha23_contig00001082 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00001082
         (707 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU35496.1| hypothetical protein MIMGU_mgv1a025925mg [Mimulus...   198   1e-48
ref|XP_002325408.2| myb family transcription factor family prote...   194   2e-47
ref|XP_002525443.1| transcription factor, putative [Ricinus comm...   193   4e-47
ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602...   190   3e-46
ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citr...   190   5e-46
ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602...   187   2e-45
ref|XP_002319702.2| myb family transcription factor family prote...   184   2e-44
ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prun...   176   5e-42
ref|XP_004249439.1| PREDICTED: uncharacterized protein LOC101257...   172   1e-40
ref|XP_007030697.1| Homeodomain-like superfamily protein isoform...   172   1e-40
ref|XP_007030696.1| Homeodomain-like superfamily protein isoform...   172   1e-40
ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304...   172   1e-40
ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citr...   171   3e-40
ref|XP_006443378.1| hypothetical protein CICLE_v10020171mg [Citr...   171   3e-40
gb|EXC02650.1| Myb family transcription factor APL [Morus notabi...   170   4e-40
ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592...   170   4e-40
ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602...   169   8e-40
ref|XP_003558005.1| PREDICTED: uncharacterized protein LOC100837...   168   1e-39
ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248...   168   2e-39
ref|XP_006650017.1| PREDICTED: myb family transcription factor A...   167   4e-39

>gb|EYU35496.1| hypothetical protein MIMGU_mgv1a025925mg [Mimulus guttatus]
          Length = 402

 Score =  198 bits (504), Expect = 1e-48
 Identities = 145/304 (47%), Positives = 168/304 (55%), Gaps = 69/304 (22%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAYKNGLIAAMPVEXXXXXXXXXXXXXXXXXMHLGEAIQMQIQVQR 180
           SHLQKYRLSK+++GQA       +                     MH+GEA+QMQI+VQR
Sbjct: 101 SHLQKYRLSKSLNGQANT----VSNKSGNKTLFNNRLSYLLQRPNMHIGEALQMQIEVQR 156

Query: 181 RLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSDLASKVS 360
           RLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGG Q+MGT+GLEAAKVQLSDL SKVS
Sbjct: 157 RLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGG-QNMGTVGLEAAKVQLSDLVSKVS 215

Query: 361 TQCLNSAF---PQLSDLCLHPKTTA----DCSIDSCLTSSCEEQ-------LYNNL---- 486
           TQC +SAF    +LSDLC+  +  A    DCSIDSCLTSS E         LYNNL    
Sbjct: 216 TQCFSSAFSDMKELSDLCMSQQKQANKPTDCSIDSCLTSSFEGSIRDQEVVLYNNLMGLN 275

Query: 487 ----VGFRQPTELEETRDD--RIFEEKPYNNNLLSXXXXXXXENKTT------------- 609
               +GFR  TE  + +DD  R  EE   + + +         N +              
Sbjct: 276 DKHALGFRTNTEERDRKDDGARWREETKESGDDVDRGFINSGNNLSMSIGIEGGDQWNDS 335

Query: 610 -----TSDM----------------------KLPFFSTKLDLNT---DEAASGCK--QLD 693
                T DM                      K+P FST+LDLNT   + AAS  K  QLD
Sbjct: 336 GKYYGTEDMFNKGDDGSDGKLLMKTERSEKTKMPLFSTRLDLNTGSENNAASSYKHHQLD 395

Query: 694 LNGF 705
           LNGF
Sbjct: 396 LNGF 399


>ref|XP_002325408.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550316805|gb|EEE99789.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 420

 Score =  194 bits (494), Expect = 2e-47
 Identities = 135/326 (41%), Positives = 171/326 (52%), Gaps = 91/326 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAY----KNGLIAA----MPVEXXXXXXXXXXXXXXXXXMHLGEAI 156
            SHLQKYRLSKN+HGQA     K+G +A     MP                   +H  EA+
Sbjct: 93   SHLQKYRLSKNLHGQANSGSNKSGTVAVVGDRMPEVNATHINNLSIGSQTNKSLHFSEAL 152

Query: 157  QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
            Q+QI+VQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL G Q++GT+GLEAAKVQL
Sbjct: 153  QVQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-GRQNLGTVGLEAAKVQL 211

Query: 337  SDLASKVSTQCLNSAFPQLSDL------CLHPKTTADCSIDSCLT----SSCEEQLYNNL 486
            S+L SKVS++CLNSAF +L DL         P    DCS+DSCLT    S  E++++N  
Sbjct: 212  SELVSKVSSKCLNSAFSELKDLQGLCPPLTQPTHPNDCSMDSCLTSIEGSQKEQEIHNTG 271

Query: 487  VGFR-------------------QPTEL---EETRDDRIF-------------------- 540
            +G R                   Q TEL   E+ RD+++F                    
Sbjct: 272  MGLRPYNGNALLEPKVIAGEHALQQTELKWGEDQRDNKMFLSSMRNDTDRRTFSAERSCS 331

Query: 541  ----------------------------EEKPYNNNLLSXXXXXXXENKTTTSDMKLPFF 636
                                        E+  + +           EN+  +   +L ++
Sbjct: 332  NLSIGVGLQGERGNVSSSFAEARFKGRSEDDSFQDKTNRRIDAIKLENEKLSPGYRLSYY 391

Query: 637  STKLDLNTD---EAASGCKQLDLNGF 705
            +TKLDLN+    +AASGC+QLDLNGF
Sbjct: 392  ATKLDLNSHGEIDAASGCRQLDLNGF 417


>ref|XP_002525443.1| transcription factor, putative [Ricinus communis]
            gi|223535256|gb|EEF36933.1| transcription factor,
            putative [Ricinus communis]
          Length = 419

 Score =  193 bits (491), Expect = 4e-47
 Identities = 141/326 (43%), Positives = 171/326 (52%), Gaps = 91/326 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKN------GLIAAMPVEXXXXXXXXXXXXXXXXX--MHLGEAI 156
            SHLQKYRLSKN+HGQA         G +    +                    +H+GEA+
Sbjct: 93   SHLQKYRLSKNLHGQANSGSNKIGTGAVVGDRISETNVTHINNLSMGTQTNKGLHIGEAL 152

Query: 157  QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
            QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL G Q++G++GLEAAKVQL
Sbjct: 153  QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-GRQNLGSIGLEAAKVQL 211

Query: 337  SDLASKVSTQCLNSAFPQLSD---LCLHPKTTA---DCSIDSCLT----SSCEEQLYNNL 486
            S+L SKVSTQCLNSAF +L +   LC     TA   DCS+DSCLT    S  E++++N  
Sbjct: 212  SELVSKVSTQCLNSAFSELKELQGLCHQQTQTAPPTDCSMDSCLTSCEGSQKEQEIHNTG 271

Query: 487  VGFR-------------------QPTEL---EETRDDRIFEEKPYNNNL----------- 567
            +G R                     TEL   E+ +D+++F   P  NN            
Sbjct: 272  MGLRPYNGNALLESKDITEGHVLHQTELKWSEDLKDNKMF-LSPLGNNAARRNFAAERST 330

Query: 568  --LSXXXXXXXENKTTTS-----------------------------------DMKLPFF 636
              LS       EN   +S                                     +LP+F
Sbjct: 331  SDLSMTVGLQGENGNASSFSEGRYKDRNDGDSFPDQTNKSLDSVKLPKGDVSQGYRLPYF 390

Query: 637  STKLDLNTDE---AASGCKQLDLNGF 705
            +TKLDLN+ E   AAS CKQLDLNGF
Sbjct: 391  ATKLDLNSHEEIDAASSCKQLDLNGF 416


>ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602766 isoform X2 [Solanum
            tuberosum]
          Length = 414

 Score =  190 bits (483), Expect = 3e-46
 Identities = 140/322 (43%), Positives = 173/322 (53%), Gaps = 87/322 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNGLIAAMPVEXXXXXXXXXXXXXXXXX-----MHLGEAIQMQ 165
            SHLQKYRLSKN HGQA  +G+  A  +E                      + + EAIQMQ
Sbjct: 91   SHLQKYRLSKNHHGQANLSGVNKAASMEKICESTGSPTSNPSIGPQPNNNIPISEAIQMQ 150

Query: 166  IQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSDL 345
            I VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT+G EAAKVQLSDL
Sbjct: 151  IDVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GTQNLGTIGFEAAKVQLSDL 209

Query: 346  ASKVSTQCLNSAFPQLSDLC-LHPKTT------ADCSIDSCLTSS-----CEEQLYNNLV 489
             SKVS QCLNSAF ++ +L   H   T      ADCS+DSCLTSS       ++++NN +
Sbjct: 210  VSKVSNQCLNSAFSEIQELSGFHTPQTQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQL 269

Query: 490  GFR------------QPTELEET----RDD----RIF------------EEKPYNNNLLS 573
            G R              T L++T    RDD    R+F            +E  ++N  ++
Sbjct: 270  GLRTLNFGPCTEEIENQTRLQQTALRWRDDLKENRLFPKMDEDTEKEFAKETNWSNLSMN 329

Query: 574  XXXXXXXENKTTT----------SDMK-------------------------LPFFSTKL 648
                    N  ++          +D+K                         LP+F+ KL
Sbjct: 330  VGIQGGKRNVNSSYVDGRLNGIDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKL 389

Query: 649  DLNTD---EAASGCKQLDLNGF 705
            DLNTD   +AAS CKQLDLNGF
Sbjct: 390  DLNTDDQTDAASNCKQLDLNGF 411


>ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
            gi|568850794|ref|XP_006479082.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X1 [Citrus
            sinensis] gi|568850796|ref|XP_006479083.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X2 [Citrus
            sinensis] gi|557545642|gb|ESR56620.1| hypothetical
            protein CICLE_v10020171mg [Citrus clementina]
          Length = 401

 Score =  190 bits (482), Expect = 5e-46
 Identities = 137/308 (44%), Positives = 164/308 (53%), Gaps = 73/308 (23%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAY----KNGLIAA----MPVEXXXXXXXXXXXXXXXXXMHLGEAI 156
            SHLQKYRLSKN+HGQA     K G +      MP                   +H+ E I
Sbjct: 93   SHLQKYRLSKNLHGQANIGNNKIGPVTVPGERMPEANATHMNNLSIGPQPNKSLHISETI 152

Query: 157  QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
            QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT GLEAAKVQL
Sbjct: 153  QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGTAGLEAAKVQL 211

Query: 337  SDLASKVSTQCLNSAFPQLSDL---C-LHPKTT--ADCSIDSCLTSSCE-----EQLYNN 483
            S+L SKVSTQCLNS F  L +L   C   P+     DCS+DSCLT SCE     ++++N 
Sbjct: 212  SELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLT-SCEGSQKDQEIHNG 270

Query: 484  LVGFR-------------------QPTELEETRD------------DR------------ 534
             V  R                   Q TEL+  +D            DR            
Sbjct: 271  GVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGELSIGSGSF 330

Query: 535  -------IFEEKPYNNNLLSXXXXXXXENKTTTSDMKLPFFSTKLDLNT----DEAASGC 681
                     E++ + +           EN+    + +LP FSTKLDLN     ++ ASGC
Sbjct: 331  PAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGC 390

Query: 682  KQLDLNGF 705
            KQ DLNGF
Sbjct: 391  KQFDLNGF 398


>ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602766 isoform X1 [Solanum
            tuberosum]
          Length = 415

 Score =  187 bits (476), Expect = 2e-45
 Identities = 141/323 (43%), Positives = 174/323 (53%), Gaps = 88/323 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNGLI-AAMPVEXXXXXXXXXXXXXXXXX-----MHLGEAIQM 162
            SHLQKYRLSKN HGQA  +G+  AA  +E                      + + EAIQM
Sbjct: 91   SHLQKYRLSKNHHGQANLSGVNKAAASMEKICESTGSPTSNPSIGPQPNNNIPISEAIQM 150

Query: 163  QIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSD 342
            QI VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT+G EAAKVQLSD
Sbjct: 151  QIDVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GTQNLGTIGFEAAKVQLSD 209

Query: 343  LASKVSTQCLNSAFPQLSDLC-LHPKTT------ADCSIDSCLTSS-----CEEQLYNNL 486
            L SKVS QCLNSAF ++ +L   H   T      ADCS+DSCLTSS       ++++NN 
Sbjct: 210  LVSKVSNQCLNSAFSEIQELSGFHTPQTQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQ 269

Query: 487  VGFR------------QPTELEET----RDD----RIF------------EEKPYNNNLL 570
            +G R              T L++T    RDD    R+F            +E  ++N  +
Sbjct: 270  LGLRTLNFGPCTEEIENQTRLQQTALRWRDDLKENRLFPKMDEDTEKEFAKETNWSNLSM 329

Query: 571  SXXXXXXXENKTTT----------SDMK-------------------------LPFFSTK 645
            +        N  ++          +D+K                         LP+F+ K
Sbjct: 330  NVGIQGGKRNVNSSYVDGRLNGIDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPK 389

Query: 646  LDLNTD---EAASGCKQLDLNGF 705
            LDLNTD   +AAS CKQLDLNGF
Sbjct: 390  LDLNTDDQTDAASNCKQLDLNGF 412


>ref|XP_002319702.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550325041|gb|EEE95625.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 427

 Score =  184 bits (467), Expect = 2e-44
 Identities = 138/333 (41%), Positives = 171/333 (51%), Gaps = 98/333 (29%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAY----KNGLIAA----MPVEXXXXXXXXXXXXXXXXX------- 135
            SHLQKYRLSKN+HGQA     K G +A     MP                          
Sbjct: 93   SHLQKYRLSKNLHGQANIGSSKIGTVAVVGDRMPEANATHININNLSIGSQPNKILKSRS 152

Query: 136  MHLGEAIQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGL 315
            +H  EA+QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT+GL
Sbjct: 153  LHFSEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGTVGL 211

Query: 316  EAAKVQLSDLASKVSTQCLNSAFPQLSDL-CLHPKTTA-----DCSIDSCLT----SSCE 465
            EAAKVQLS+L SKVSTQCLNS F +L+DL  L P+ T      DCS+DSCLT    S  E
Sbjct: 212  EAAKVQLSELVSKVSTQCLNSTFSELNDLQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKE 271

Query: 466  EQLYNNLVGFR-------------------QPTEL---EETRDDRIF------------- 540
            ++++N  +G R                   Q TEL   E  RD+++F             
Sbjct: 272  QEIHNIGMGLRPCNSNALLEPKEIAEEHALQQTELKWGEYLRDNKMFLTSIGHETERRTF 331

Query: 541  -----------------------------------EEKPYNNNLLSXXXXXXXENKTTTS 615
                                               E+  + +           E++  + 
Sbjct: 332  SAERSCSDLSIGVGLQGEKGNINSSFAEGRFKGMSEDDSFQDQTNKRAESVKFEDEKMSP 391

Query: 616  DMKLPFFSTKLDLNTD---EAASGCKQLDLNGF 705
              +L +F+TKLDLN+    +AAS CKQLDLNGF
Sbjct: 392  GYRLSYFTTKLDLNSHDEIDAASSCKQLDLNGF 424


>ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prunus persica]
           gi|462398490|gb|EMJ04158.1| hypothetical protein
           PRUPE_ppa015076mg [Prunus persica]
          Length = 421

 Score =  176 bits (447), Expect = 5e-42
 Identities = 115/213 (53%), Positives = 137/213 (64%), Gaps = 22/213 (10%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAYKN-GLIAAMPVEXXXXXXXXXXXXXXXXXMHLGEAIQMQIQVQ 177
           SHLQKYRLSKN+HG A      IA  P E                 +H+ E +QMQI+VQ
Sbjct: 109 SHLQKYRLSKNLHGHATSGTSKIALDPNETYNNNGILNCRG-----LHISETLQMQIEVQ 163

Query: 178 RRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSDLASKV 357
           RRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL G Q++GT+GLEAAKVQLS+L SKV
Sbjct: 164 RRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-GRQNLGTVGLEAAKVQLSELVSKV 222

Query: 358 STQCLNSAFPQLSDL-CLHPKTT-----ADCSIDSCLTSSCE-----EQLYNNLVGFRQP 504
           STQCLNSAF +L +L  L P+ T      DCS++SCLT SCE     ++++N+ +G R  
Sbjct: 223 STQCLNSAFTELKELQGLCPQQTQTTQPTDCSMESCLT-SCEGSKKDQEIHNSAMGLRAN 281

Query: 505 TELEETRDD----------RIFEEKPYNNNLLS 573
               E  D+          +  EE   NN LLS
Sbjct: 282 YNGRELLDEKEPMLQKTELKWCEELKENNMLLS 314


>ref|XP_004249439.1| PREDICTED: uncharacterized protein LOC101257914 isoform 1 [Solanum
            lycopersicum]
          Length = 409

 Score =  172 bits (436), Expect = 1e-40
 Identities = 138/323 (42%), Positives = 167/323 (51%), Gaps = 88/323 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNGLI-AAMPVEXXXXXXXXXXXXXXXXX-----MHLGEAIQM 162
            SHLQKYRLSKN HGQA  +G+  AA  +E                      + + EAIQM
Sbjct: 91   SHLQKYRLSKNHHGQANISGVNKAAASMEKICESTGSPKSNPSIGHQPNNNIPISEAIQM 150

Query: 163  QIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSD 342
            QI VQRRLHEQLE      LRIEAQGKYLQ+VLEKAQETLG  Q++GT+GLEAAKVQLSD
Sbjct: 151  QIDVQRRLHEQLE------LRIEAQGKYLQAVLEKAQETLG-TQNLGTIGLEAAKVQLSD 203

Query: 343  LASKVSTQCLNSAFPQLSDLC-LHPKTT------ADCSIDSCLTSS-----CEEQLYNNL 486
            L SKVS QCLNSAF ++ +L   H   T      ADCS+DSCLTSS       ++++NN 
Sbjct: 204  LVSKVSNQCLNSAFSEIKELSGFHTPQTQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQ 263

Query: 487  VGFR------------QPTELEET----RDD----RIF------------EEKPYNN--- 561
            +G R              T L++T    RDD    R+F            +E  ++N   
Sbjct: 264  LGLRNLNFRPCTEEIENQTRLQQTALRWRDDLKENRLFPKIDEDTEKEFAKETNWSNLSM 323

Query: 562  --------------------NLLSXXXXXXXENKTTTSD------------MKLPFFSTK 645
                                N +        +  T  SD             KLP+F+ K
Sbjct: 324  NVGIQGGKRNVNSSYVDERLNGIDADIKLFHQTATDRSDSTKPEKQVSPQEYKLPYFAPK 383

Query: 646  LDLNTD---EAASGCKQLDLNGF 705
            LDLNTD   +AAS CKQLDLNGF
Sbjct: 384  LDLNTDDQTDAASNCKQLDLNGF 406


>ref|XP_007030697.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao]
           gi|590643063|ref|XP_007030698.1| Homeodomain-like
           superfamily protein isoform 2 [Theobroma cacao]
           gi|508719302|gb|EOY11199.1| Homeodomain-like superfamily
           protein isoform 2 [Theobroma cacao]
           gi|508719303|gb|EOY11200.1| Homeodomain-like superfamily
           protein isoform 2 [Theobroma cacao]
          Length = 414

 Score =  172 bits (435), Expect = 1e-40
 Identities = 105/179 (58%), Positives = 123/179 (68%), Gaps = 18/179 (10%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAY----KNGLIAA----MPVEXXXXXXXXXXXXXXXXXMHLGEAI 156
           SHLQKYRLSKN+HGQA     K G +A     M                    + +GEA+
Sbjct: 94  SHLQKYRLSKNLHGQANNGSNKIGAVAMAGDRMSEANGTHVNNLSIGPQANNGLQIGEAL 153

Query: 157 QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
           QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++G++GLEAAKVQL
Sbjct: 154 QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGSVGLEAAKVQL 212

Query: 337 SDLASKVSTQCLNSAFPQLSDL-CLHPKTT-----ADCSIDSCLT----SSCEEQLYNN 483
           S+L SKVS QCLNSAF  L DL  L P+ T      DCS+DSCLT    S  E++++NN
Sbjct: 213 SELVSKVSNQCLNSAFSDLKDLQGLCPQQTQATPPTDCSMDSCLTSCEGSQKEQEIHNN 271


>ref|XP_007030696.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
           gi|508719301|gb|EOY11198.1| Homeodomain-like superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 478

 Score =  172 bits (435), Expect = 1e-40
 Identities = 105/179 (58%), Positives = 123/179 (68%), Gaps = 18/179 (10%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAY----KNGLIAA----MPVEXXXXXXXXXXXXXXXXXMHLGEAI 156
           SHLQKYRLSKN+HGQA     K G +A     M                    + +GEA+
Sbjct: 158 SHLQKYRLSKNLHGQANNGSNKIGAVAMAGDRMSEANGTHVNNLSIGPQANNGLQIGEAL 217

Query: 157 QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
           QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++G++GLEAAKVQL
Sbjct: 218 QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGSVGLEAAKVQL 276

Query: 337 SDLASKVSTQCLNSAFPQLSDL-CLHPKTT-----ADCSIDSCLT----SSCEEQLYNN 483
           S+L SKVS QCLNSAF  L DL  L P+ T      DCS+DSCLT    S  E++++NN
Sbjct: 277 SELVSKVSNQCLNSAFSDLKDLQGLCPQQTQATPPTDCSMDSCLTSCEGSQKEQEIHNN 335


>ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304811 [Fragaria vesca
            subsp. vesca]
          Length = 418

 Score =  172 bits (435), Expect = 1e-40
 Identities = 128/314 (40%), Positives = 167/314 (53%), Gaps = 79/314 (25%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNG----LIAAMPVEXXXXXXXXXXXXXXXXX-----MHLGEA 153
            SHLQKYRLSKN+H  A   G    +  A+P E                      +H+ E 
Sbjct: 104  SHLQKYRLSKNLHVHANSGGTTKIVAVAVPGERISEVNGTHMNNMSIGPQSNKGIHINET 163

Query: 154  IQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQ 333
            +QMQI+VQRRLH+QLEVQRHLQLRIEAQGKYLQSVLEKAQETL G Q++GT+GLEAAKVQ
Sbjct: 164  LQMQIEVQRRLHQQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-GRQNLGTVGLEAAKVQ 222

Query: 334  LSDLASKVSTQCLNSAFPQLSDL---CLHPKTTADCSIDSCLTSS----CEEQLYNN-LV 489
            LS+L SKVSTQCLNSAF ++ ++   C     T DCS++SCLTSS     ++++ NN  +
Sbjct: 223  LSELVSKVSTQCLNSAFTEMKEVQGSCPQNPPT-DCSMESCLTSSEGSKKDQEIQNNSRM 281

Query: 490  GFRQ-----------------------PTELEETRDDRIFEEKPYNNN----------LL 570
            G R                         + L +  D R+F  +P + +          +L
Sbjct: 282  GLRAYNSSRVLLESEKTMLHLKENSMFVSTLTKNADQRMFPSEPRSGDFSMSIGLEREIL 341

Query: 571  SXXXXXXXE-------------NKTTTSD-MKL------------PFFSTKLDLNT---D 663
            +       E             NK   +D +K+            P+F+ KLDLN+    
Sbjct: 342  NGSHCNSEERFKARNTIDSFLDNKNNRADSVKVDQSRKVSQGYSGPYFAAKLDLNSHDDT 401

Query: 664  EAASGCKQLDLNGF 705
            +A+S CKQ DLN F
Sbjct: 402  DASSSCKQFDLNDF 415


>ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
           gi|557545641|gb|ESR56619.1| hypothetical protein
           CICLE_v10020171mg [Citrus clementina]
          Length = 441

 Score =  171 bits (432), Expect = 3e-40
 Identities = 118/255 (46%), Positives = 143/255 (56%), Gaps = 65/255 (25%)
 Frame = +1

Query: 136 MHLGEAIQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGL 315
           +H+ E IQMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT GL
Sbjct: 186 LHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGTAGL 244

Query: 316 EAAKVQLSDLASKVSTQCLNSAFPQLSDL---C-LHPKTT--ADCSIDSCLTSSCE---- 465
           EAAKVQLS+L SKVSTQCLNS F  L +L   C   P+     DCS+DSCLT SCE    
Sbjct: 245 EAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLT-SCEGSQK 303

Query: 466 -EQLYNNLVGFR-------------------QPTELEETRD------------DR----- 534
            ++++N  V  R                   Q TEL+  +D            DR     
Sbjct: 304 DQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGEL 363

Query: 535 --------------IFEEKPYNNNLLSXXXXXXXENKTTTSDMKLPFFSTKLDLNT---- 660
                           E++ + +           EN+    + +LP FSTKLDLN     
Sbjct: 364 SIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHE 423

Query: 661 DEAASGCKQLDLNGF 705
           ++ ASGCKQ DLNGF
Sbjct: 424 NDVASGCKQFDLNGF 438


>ref|XP_006443378.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
           gi|557545640|gb|ESR56618.1| hypothetical protein
           CICLE_v10020171mg [Citrus clementina]
          Length = 294

 Score =  171 bits (432), Expect = 3e-40
 Identities = 118/255 (46%), Positives = 143/255 (56%), Gaps = 65/255 (25%)
 Frame = +1

Query: 136 MHLGEAIQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGL 315
           +H+ E IQMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++GT GL
Sbjct: 39  LHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGTAGL 97

Query: 316 EAAKVQLSDLASKVSTQCLNSAFPQLSDL---C-LHPKTT--ADCSIDSCLTSSCE---- 465
           EAAKVQLS+L SKVSTQCLNS F  L +L   C   P+     DCS+DSCLT SCE    
Sbjct: 98  EAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLT-SCEGSQK 156

Query: 466 -EQLYNNLVGFR-------------------QPTELEETRD------------DR----- 534
            ++++N  V  R                   Q TEL+  +D            DR     
Sbjct: 157 DQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESKFLSSIGKDRGPGEL 216

Query: 535 --------------IFEEKPYNNNLLSXXXXXXXENKTTTSDMKLPFFSTKLDLNT---- 660
                           E++ + +           EN+    + +LP FSTKLDLN     
Sbjct: 217 SIGSGSFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHE 276

Query: 661 DEAASGCKQLDLNGF 705
           ++ ASGCKQ DLNGF
Sbjct: 277 NDVASGCKQFDLNGF 291


>gb|EXC02650.1| Myb family transcription factor APL [Morus notabilis]
          Length = 444

 Score =  170 bits (431), Expect = 4e-40
 Identities = 106/215 (49%), Positives = 129/215 (60%), Gaps = 28/215 (13%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAY----KNGLIAAMPVEXXXXXXXXXXXXXXXXXM-----HLGEA 153
           SHLQKYRLSKN+HGQA     K G +  +  +                       H+ E 
Sbjct: 85  SHLQKYRLSKNLHGQANSGTTKIGTVGGVAADRISEASANQLNNLSIGPQANKGFHISET 144

Query: 154 IQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQ 333
           +QMQI+VQRRLHEQLEVQRHLQLR+EAQGKYLQ+VLEKAQETL G Q++G +GLEAAKVQ
Sbjct: 145 LQMQIEVQRRLHEQLEVQRHLQLRMEAQGKYLQAVLEKAQETL-GRQNLGAVGLEAAKVQ 203

Query: 334 LSDLASKVSTQCLNSAFPQLSD----LCLHPKTT---------------ADCSIDSCLTS 456
           LS+L SKVSTQCLNSAF +L +    LC  P ++                DCS+DSCLT 
Sbjct: 204 LSELVSKVSTQCLNSAFAELKEVQGGLCRQPNSSNQTQGTVLIRQQQPNNDCSMDSCLT- 262

Query: 457 SCEEQLYNNLVGFRQPTELEETRDDRIFEEKPYNN 561
           SCE        G ++  E+       I + +PYNN
Sbjct: 263 SCE--------GSQKDQEIAHNTSSGIVQLRPYNN 289


>ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592272 isoform X1 [Solanum
            tuberosum] gi|565343634|ref|XP_006338934.1| PREDICTED:
            uncharacterized protein LOC102592272 isoform X2 [Solanum
            tuberosum]
          Length = 416

 Score =  170 bits (431), Expect = 4e-40
 Identities = 133/324 (41%), Positives = 155/324 (47%), Gaps = 89/324 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNGLIAAMPVEXXXXXXXXXXXXXXXXXM--------HLGEAI 156
            SHLQKYRLSKN+HGQA  +G   A+ V                  M         + EAI
Sbjct: 91   SHLQKYRLSKNLHGQANASGTNKAVAVAGVERISENSATCMSNPSMVPQPNKNIQISEAI 150

Query: 157  QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
            QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL G Q+M T+GLEA KVQL
Sbjct: 151  QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-GRQNMETVGLEAVKVQL 209

Query: 337  SDLASKVSTQCLNSAFPQLSDLC-LHPKTT-----ADCSIDSCLTSS----CEEQLYNNL 486
            S+  SK S QCLNS FP + +L   H + T      D SIDSCLTS      +  +++N 
Sbjct: 210  SEFVSKASNQCLNSPFPDIKELSGFHSQHTQATQPTDRSIDSCLTSRDGSLRDNTMHDNQ 269

Query: 487  VGFR-------------------QPTEL----------------EETRDDRIFEEKPYNN 561
            +G R                   Q TEL                 E R+     E   NN
Sbjct: 270  IGLRPFDFTPSIECKDIENDARLQQTELRWCDNLKENRRLFSPMNEGREKTFTRETNCNN 329

Query: 562  NLLSXXXXXXXENKT----------TTSDMKL-----------------------PFFST 642
              +S        N +          T  D+KL                        +F  
Sbjct: 330  LSMSIGLQDEKLNGSMNHSDGSFNGTERDVKLFHQVTNRSESVPQRHKSSQEYKLSYFQP 389

Query: 643  KLDLN---TDEAASGCKQLDLNGF 705
            KLDLN     +AAS CKQ DLNGF
Sbjct: 390  KLDLNMHDETDAASSCKQFDLNGF 413


>ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602766 isoform X3 [Solanum
            tuberosum]
          Length = 409

 Score =  169 bits (428), Expect = 8e-40
 Identities = 135/323 (41%), Positives = 168/323 (52%), Gaps = 88/323 (27%)
 Frame = +1

Query: 1    SHLQKYRLSKNIHGQAYKNGLI-AAMPVEXXXXXXXXXXXXXXXXX-----MHLGEAIQM 162
            SHLQKYRLSKN HGQA  +G+  AA  +E                      + + EAIQM
Sbjct: 91   SHLQKYRLSKNHHGQANLSGVNKAAASMEKICESTGSPTSNPSIGPQPNNNIPISEAIQM 150

Query: 163  QIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSD 342
            QI VQRRLHEQLE      LRIEAQGKYLQ+VLEKAQETLG  Q++GT+G EAAKVQLSD
Sbjct: 151  QIDVQRRLHEQLE------LRIEAQGKYLQAVLEKAQETLG-TQNLGTIGFEAAKVQLSD 203

Query: 343  LASKVSTQCLNSAFPQLSDLC-LHPKTT------ADCSIDSCLTSS-----CEEQLYNNL 486
            L SKVS QCLNSAF ++ +L   H   T      ADCS+DSCLTSS       ++++NN 
Sbjct: 204  LVSKVSNQCLNSAFSEIQELSGFHTPQTQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQ 263

Query: 487  VGFR------------QPTELEET----RDD----RIF------------EEKPYNNNLL 570
            +G R              T L++T    RDD    R+F            +E  ++N  +
Sbjct: 264  LGLRTLNFGPCTEEIENQTRLQQTALRWRDDLKENRLFPKMDEDTEKEFAKETNWSNLSM 323

Query: 571  SXXXXXXXENKTTT----------SDMK-------------------------LPFFSTK 645
            +        N  ++          +D+K                         LP+F+ K
Sbjct: 324  NVGIQGGKRNVNSSYVDGRLNGIDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPK 383

Query: 646  LDLNTD---EAASGCKQLDLNGF 705
            LDLNTD   +AAS CKQLDLNGF
Sbjct: 384  LDLNTDDQTDAASNCKQLDLNGF 406


>ref|XP_003558005.1| PREDICTED: uncharacterized protein LOC100837299 [Brachypodium
           distachyon]
          Length = 350

 Score =  168 bits (426), Expect = 1e-39
 Identities = 115/256 (44%), Positives = 143/256 (55%), Gaps = 21/256 (8%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAY----KNGLIAAMPVEXXXXXXXXXXXXXXXXX----MHLGEAI 156
           SHLQKYRLSKN+H QA     +N +   M  E                     MH+GEA+
Sbjct: 94  SHLQKYRLSKNLHAQANVGNSRNVVGCTMATEKHSEGNGSPVSHHLGAQTNKSMHIGEAL 153

Query: 157 QMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQL 336
           QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKA ETL   Q+ G+  LE AK+QL
Sbjct: 154 QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAHETL-AKQNTGSASLENAKMQL 212

Query: 337 SDLASKVSTQCLNSAFPQLSDL----CLHPKTTADCSIDSCLTSSCEEQLYNNLVGFRQP 504
           S+L SKVST+CL++AF    ++     L      D S+DSCLT +CE Q   +++     
Sbjct: 213 SELVSKVSTECLHNAFTGFEEIQGSQMLQTMQLGDGSVDSCLT-ACESQRDQDILSISLS 271

Query: 505 TELEETRDDRIFE--EKPYNNNL----LSXXXXXXXENKTTTSDMKLPFFSTKLDLNTDE 666
            +  +      F+   K  + NL    LS       E    T    +   +TKLDLN +E
Sbjct: 272 AKKGKEIGAMAFDLHMKEGHGNLFLEKLSRRPPNHQEGHERTDGFSISCQTTKLDLNINE 331

Query: 667 AASG---CKQLDLNGF 705
              G   CK+ DLNGF
Sbjct: 332 TNDGPQNCKKFDLNGF 347


>ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis
           vinifera]
          Length = 418

 Score =  168 bits (425), Expect = 2e-39
 Identities = 109/221 (49%), Positives = 135/221 (61%), Gaps = 13/221 (5%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQAY----KNGLIAAMPVEXXXXXXXXXXXXXXXXXMHLGEAIQMQI 168
           SHLQKYRLSKN+HGQA     K  +   MP                   +HL E +QM I
Sbjct: 93  SHLQKYRLSKNLHGQANSATSKTVVGERMPEANGALMSSPNIGNQTNKSLHLSETLQM-I 151

Query: 169 QVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQLSDLA 348
           + QRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETL G Q++G +GLEAAKVQLS+L 
Sbjct: 152 EAQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETL-GRQNLGAVGLEAAKVQLSELV 210

Query: 349 SKVSTQCLNSAFPQLSDL-CLHPKTT----ADCSIDSCLT----SSCEEQLYNNLVGFRQ 501
           SKVSTQCL+SAF +L +L  L P+ T     DCS+DSCLT    S  E++++N  +G R 
Sbjct: 211 SKVSTQCLHSAFSELKELQSLCPQQTQTQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRP 270

Query: 502 PTELEETRDDRIFEEKPYNNNLLSXXXXXXXENKTTTSDMK 624
            T      + +   E P   + +        EN+   S M+
Sbjct: 271 YTNGSTPLEAKDTAEPPGLQHTVLKWCEDTKENRQFISSMQ 311


>ref|XP_006650017.1| PREDICTED: myb family transcription factor APL-like [Oryza
           brachyantha]
          Length = 355

 Score =  167 bits (422), Expect = 4e-39
 Identities = 113/260 (43%), Positives = 144/260 (55%), Gaps = 25/260 (9%)
 Frame = +1

Query: 1   SHLQKYRLSKNIHGQA----YKNGLIAAMPVEXXXXXXXXXXXXXXXXX-----MHLGEA 153
           SHLQKYRLSKN+H QA     KN L+     E                      +H+GEA
Sbjct: 94  SHLQKYRLSKNLHAQANVGNVKNALVCTTATEKPSEGNRSPVSHLTLGTQTNKSVHIGEA 153

Query: 154 IQMQIQVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGGAQSMGTLGLEAAKVQ 333
           +QMQI+VQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL   Q+ G++GLE AK++
Sbjct: 154 LQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETL-AKQNAGSVGLETAKME 212

Query: 334 LSDLASKVSTQCLNSAFPQLSD------LCLHPKTTADCSIDSCLT----SSCEEQLYNN 483
           LS+L SKVST+CL  AF    +      L  H     D S+DSCLT    S  ++ + + 
Sbjct: 213 LSELVSKVSTECLQHAFSGFEEIESSQMLQGHTMHLGDGSVDSCLTVCDGSQKDQDILSI 272

Query: 484 LVGFRQPTELEETRDDRIFEEKPYNN---NLLSXXXXXXXENKTTTSDMKLPFFSTKLDL 654
            +  ++  E+     D   +E+   +   N LS       E         +   +TKLDL
Sbjct: 273 SLSAQKGKEIGCMSFDMHVKERGSEDLFLNKLSRRPSNHQERCERRDGFSMSCQATKLDL 332

Query: 655 NTDEAASG---CKQLDLNGF 705
           N ++   G   CK+ DLNGF
Sbjct: 333 NMNDTYGGPKHCKKFDLNGF 352

