BLASTX nr result

ID: Mentha26_contig00041770 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00041770
         (911 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21904.1| hypothetical protein MIMGU_mgv1a000138mg [Mimulus...   290   5e-76
gb|EPS72750.1| hypothetical protein M569_02005 [Genlisea aurea]       255   2e-65
emb|CBI20940.3| unnamed protein product [Vitis vinifera]              245   2e-62
ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264...   245   2e-62
ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus c...   238   3e-60
ref|XP_007225478.1| hypothetical protein PRUPE_ppa000151mg [Prun...   233   6e-59
gb|EYU29261.1| hypothetical protein MIMGU_mgv1a000290mg [Mimulus...   230   5e-58
ref|XP_007013731.1| Enhancer of polycomb-like transcription fact...   226   1e-56
ref|XP_007013730.1| Enhancer of polycomb-like transcription fact...   226   1e-56
ref|XP_007013729.1| Enhancer of polycomb-like transcription fact...   226   1e-56
ref|XP_007013727.1| Enhancer of polycomb-like transcription fact...   226   1e-56
ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499...   225   2e-56
ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Popu...   223   1e-55
ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Popu...   222   1e-55
gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis]     221   4e-55
ref|XP_006596126.1| PREDICTED: uncharacterized protein LOC100781...   219   1e-54
ref|XP_003545513.1| PREDICTED: uncharacterized protein LOC100781...   219   1e-54
ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   218   3e-54
ref|XP_004136466.1| PREDICTED: uncharacterized protein LOC101216...   218   3e-54
ref|XP_004245412.1| PREDICTED: uncharacterized protein LOC101258...   217   4e-54

>gb|EYU21904.1| hypothetical protein MIMGU_mgv1a000138mg [Mimulus guttatus]
          Length = 1648

 Score =  290 bits (742), Expect = 5e-76
 Identities = 154/280 (55%), Positives = 184/280 (65%), Gaps = 6/280 (2%)
 Frame = +3

Query: 90   DGSGSSKKIQKGNPEGDENAS--KVISQPCLPEPRKEAHQLIDAPIL----PSMPTSITS 251
            + + S++K+QKGNP  D  A      ++   PE   ++HQ +   I+     S+P S TS
Sbjct: 958  ENTESTQKLQKGNPGDDGTAGCFTEFTEISAPEVIAQSHQEVQEQIVVSASTSLPPSTTS 1017

Query: 252  QTPVPRSDSTLGGMTIVIPSSESRQASDVDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXX 431
            + P P+S+S             SR  S V WNV D F+  P+  G               
Sbjct: 1018 RPPYPKSNSASVDTPFAGNGCISRHTSVVGWNVHDGFVPSPSPTG--------------- 1062

Query: 432  XXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRI 611
                       G PNF PNGFSNGPKKPRTQVQYTLPFV  D S K+K PS++SLPCKRI
Sbjct: 1063 -----------GKPNFMPNGFSNGPKKPRTQVQYTLPFVDYDSSAKRKMPSSRSLPCKRI 1111

Query: 612  RKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEVDDHNEWRLAVKL 791
            R+ASLK+ SDGS N++KN+E ++ +ANVLVT+GDKGWRE GAHIVLEV D NEWRLAVKL
Sbjct: 1112 RRASLKKTSDGSENNQKNLESVTSIANVLVTYGDKGWRECGAHIVLEVADQNEWRLAVKL 1171

Query: 792  SGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            SGV KYS KVKHILQPGSTNRYSHAMMW+GGKDWVLEFPD
Sbjct: 1172 SGVIKYSCKVKHILQPGSTNRYSHAMMWRGGKDWVLEFPD 1211


>gb|EPS72750.1| hypothetical protein M569_02005 [Genlisea aurea]
          Length = 772

 Score =  255 bits (651), Expect = 2e-65
 Identities = 126/231 (54%), Positives = 160/231 (69%), Gaps = 3/231 (1%)
 Frame = +3

Query: 228 SMPTSITSQTPVPRSDST---LGGMTIVIPSSESRQASDVDWNVRDSFIHKPNTIGFRNS 398
           S+P ++ S +   R DS    + G   +   S S+Q  D +  ++  F+H+ N  G   S
Sbjct: 193 SLPGAVDSASSNSRKDSESTCMIGTPSIEKVSNSKQTGDAELKLQKGFVHEANPAGSIKS 252

Query: 399 WQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKS 578
            +            +   +W D +    P  FSNGPKKPRTQVQY+LP  G DLS KQK 
Sbjct: 253 VKRGRTSSILTPLEYHSPLWLDDTTTSAPRAFSNGPKKPRTQVQYSLPVDGYDLSSKQKM 312

Query: 579 PSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEVD 758
           P++++LP KRIR+ASLKRIS+   N +KN++LL+CV N+LV+H DKGWRE+GA++VLEV 
Sbjct: 313 PNSRALPYKRIRRASLKRISESHENGQKNLDLLTCVGNILVSHVDKGWREHGANVVLEVA 372

Query: 759 DHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
           D NEWRLAVK+SGVTKYS+KVKHILQPGSTNRYSHAMMWKGGKDW LEFPD
Sbjct: 373 DRNEWRLAVKVSGVTKYSHKVKHILQPGSTNRYSHAMMWKGGKDWALEFPD 423


>emb|CBI20940.3| unnamed protein product [Vitis vinifera]
          Length = 1634

 Score =  245 bits (626), Expect = 2e-62
 Identities = 125/256 (48%), Positives = 159/256 (62%), Gaps = 18/256 (7%)
 Frame = +3

Query: 198  HQLIDAPILPSMPTSITSQTPVPRSD----STLGGMTIVIPS--------------SESR 323
            H   +  IL   P  +   +   +S+    S L G+ + IP+              S S+
Sbjct: 960  HSEAEQCILSPQPLLLNGHSSTGKSNVGCYSRLNGINVQIPTFDQVEKSFDRGADISISQ 1019

Query: 324  QASDVDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNG 503
            Q+ D+ WNV D  I  PN    R+ WQ            +   +W DG  +F  NGF NG
Sbjct: 1020 QSVDLSWNVNDGVIRSPNPTAPRSMWQRNKNSFSSSFG-YPSHMWSDGKGDFFGNGFGNG 1078

Query: 504  PKKPRTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSC 683
            PKKPRTQV YTLP  G D S KQ+S   K LP KRIR+A+ KR+SDGS +S++N+E LSC
Sbjct: 1079 PKKPRTQVSYTLPVGGFDFSSKQRSHHQKGLPNKRIRRANEKRLSDGSRSSQRNLESLSC 1138

Query: 684  VANVLVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSH 863
             ANVL+T GD+GWRE GA ++LE+ DHNEW+LAVK+SG TKYSYK    LQPG+ NR++H
Sbjct: 1139 EANVLITFGDRGWRESGAQVILELGDHNEWKLAVKVSGATKYSYKAHQFLQPGTANRFTH 1198

Query: 864  AMMWKGGKDWVLEFPD 911
            AMMWKGGKDW+LEFPD
Sbjct: 1199 AMMWKGGKDWILEFPD 1214


>ref|XP_002281922.1| PREDICTED: uncharacterized protein LOC100264575 [Vitis vinifera]
          Length = 1679

 Score =  245 bits (626), Expect = 2e-62
 Identities = 125/256 (48%), Positives = 159/256 (62%), Gaps = 18/256 (7%)
 Frame = +3

Query: 198  HQLIDAPILPSMPTSITSQTPVPRSD----STLGGMTIVIPS--------------SESR 323
            H   +  IL   P  +   +   +S+    S L G+ + IP+              S S+
Sbjct: 983  HSEAEQCILSPQPLLLNGHSSTGKSNVGCYSRLNGINVQIPTFDQVEKSFDRGADISISQ 1042

Query: 324  QASDVDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNG 503
            Q+ D+ WNV D  I  PN    R+ WQ            +   +W DG  +F  NGF NG
Sbjct: 1043 QSVDLSWNVNDGVIRSPNPTAPRSMWQRNKNSFSSSFG-YPSHMWSDGKGDFFGNGFGNG 1101

Query: 504  PKKPRTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSC 683
            PKKPRTQV YTLP  G D S KQ+S   K LP KRIR+A+ KR+SDGS +S++N+E LSC
Sbjct: 1102 PKKPRTQVSYTLPVGGFDFSSKQRSHHQKGLPNKRIRRANEKRLSDGSRSSQRNLESLSC 1161

Query: 684  VANVLVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSH 863
             ANVL+T GD+GWRE GA ++LE+ DHNEW+LAVK+SG TKYSYK    LQPG+ NR++H
Sbjct: 1162 EANVLITFGDRGWRESGAQVILELGDHNEWKLAVKVSGATKYSYKAHQFLQPGTANRFTH 1221

Query: 864  AMMWKGGKDWVLEFPD 911
            AMMWKGGKDW+LEFPD
Sbjct: 1222 AMMWKGGKDWILEFPD 1237


>ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus communis]
            gi|223544424|gb|EEF45945.1| hypothetical protein
            RCOM_0804080 [Ricinus communis]
          Length = 1705

 Score =  238 bits (606), Expect = 3e-60
 Identities = 122/231 (52%), Positives = 145/231 (62%), Gaps = 13/231 (5%)
 Frame = +3

Query: 258  PVPRSD-STLGGMTIVIPSSE------------SRQASDVDWNVRDSFIHKPNTIGFRNS 398
            P P  D + L G+ + IPSS             ++Q++D+ WN+    I  PN    R++
Sbjct: 1035 PKPSVDRALLNGIRVEIPSSNQFDKQVDKDLDGAQQSTDLSWNMNGGIIPSPNPTARRST 1094

Query: 399  WQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKS 578
            W             +    W DG  +F  N F NGPKKPRTQV Y LPF   D S K K 
Sbjct: 1095 WHRNRSNLASVG--YNAHGWSDGRGDFLQNNFRNGPKKPRTQVSYALPFGAFDYSSKSKG 1152

Query: 579  PSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEVD 758
             S K +P KRIR A+ KR SD S  S +N+ELLSC ANVL+T GDKGWREYGA +VLE+ 
Sbjct: 1153 HSQKGIPHKRIRTANEKRSSDVSRGSERNLELLSCEANVLITLGDKGWREYGAQVVLELS 1212

Query: 759  DHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            DHNEW+LAVKLSG TKYSYK    LQPGSTNRY+HAMMWKGGKDW+LEF D
Sbjct: 1213 DHNEWKLAVKLSGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFSD 1263


>ref|XP_007225478.1| hypothetical protein PRUPE_ppa000151mg [Prunus persica]
            gi|462422414|gb|EMJ26677.1| hypothetical protein
            PRUPE_ppa000151mg [Prunus persica]
          Length = 1617

 Score =  233 bits (595), Expect = 6e-59
 Identities = 119/226 (52%), Positives = 146/226 (64%), Gaps = 12/226 (5%)
 Frame = +3

Query: 270  SDSTLGGMTIVIPSSE------------SRQASDVDWNVRDSFIHKPNTIGFRNSWQXXX 413
            S S L G+T+ IPS +            ++Q +D  WN+  S I  PN    R++W    
Sbjct: 956  SQSFLNGLTVEIPSFDRFEKPVDGEVQSAQQPTDCSWNMSGSIIPSPNPTAPRSTWHRSR 1015

Query: 414  XXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKSPSAKS 593
                          W DG  +   NGF NGPKKPRTQV YTLP+ G D S KQ++   K 
Sbjct: 1016 NSSSSFGSLSHG--WSDGKADLFHNGFGNGPKKPRTQVSYTLPYGGFDFSSKQRNLQ-KG 1072

Query: 594  LPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEVDDHNEW 773
            +P KRIR+A+ KR+SD S  S++N+E LSC ANVL+   D+GWRE GAHIVLE+ DHNEW
Sbjct: 1073 IPPKRIRRANEKRLSDVSRGSQRNLEQLSCEANVLINGSDRGWRECGAHIVLELFDHNEW 1132

Query: 774  RLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            +LAVK+SG TKYSYK    LQPGSTNRY+HAMMWKGGKDW+LEFPD
Sbjct: 1133 KLAVKISGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPD 1178


>gb|EYU29261.1| hypothetical protein MIMGU_mgv1a000290mg [Mimulus guttatus]
          Length = 1291

 Score =  230 bits (587), Expect = 5e-58
 Identities = 114/142 (80%), Positives = 122/142 (85%), Gaps = 1/142 (0%)
 Frame = +3

Query: 489  GFSNGPKKPRTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSR-KN 665
            GFSNGPK+PRTQVQYTLPF   D + KQK  S +  PCKRIR+ASLKRISDGS  S  KN
Sbjct: 765  GFSNGPKRPRTQVQYTLPFA--DFNTKQKRHSQRDPPCKRIRRASLKRISDGSSRSNEKN 822

Query: 666  VELLSCVANVLVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGS 845
             ELLSC ANVLVTH DKGWRE GA I+LEV DHNEWRLA+KLSGVTKYSYKVK+ILQPGS
Sbjct: 823  FELLSCGANVLVTHEDKGWRECGAVIILEVADHNEWRLAIKLSGVTKYSYKVKNILQPGS 882

Query: 846  TNRYSHAMMWKGGKDWVLEFPD 911
            TNRYSHAM+WKGGKDWVLEFPD
Sbjct: 883  TNRYSHAMLWKGGKDWVLEFPD 904


>ref|XP_007013731.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 5 [Theobroma cacao] gi|508784094|gb|EOY31350.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 5 [Theobroma cacao]
          Length = 1522

 Score =  226 bits (576), Expect = 1e-56
 Identities = 124/312 (39%), Positives = 173/312 (55%), Gaps = 19/312 (6%)
 Frame = +3

Query: 33   RDGTSDTNIRKIDAEAPGFDG-SGSSKKIQKGNPE------GDENASKVISQPCLPEPRK 191
            +D  SDT +  +D    G +    SS+K + G+              +V +   +P  ++
Sbjct: 942  KDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQ 1001

Query: 192  EAHQLIDAPILPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE------------SRQASD 335
            +        ++ S  + +        S+S L  + + IPS +            ++Q+SD
Sbjct: 1002 QCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSD 1061

Query: 336  VDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKP 515
            + WN+    I  PN    R++W             +    W +G  +F  N F NGPKKP
Sbjct: 1062 LTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIG--YNAHGWSEGKADFFHNNFGNGPKKP 1119

Query: 516  RTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANV 695
            RTQV Y++PF G D S K K    +  P KRIR+A+ KR SD S  S+KN+ELLSC AN+
Sbjct: 1120 RTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANL 1179

Query: 696  LVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMW 875
            L+T GD+GWRE GA + LE+ DHNEW+LAVK+SG T+YS+K    LQPGSTNRY+HAMMW
Sbjct: 1180 LITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMW 1239

Query: 876  KGGKDWVLEFPD 911
            KGGKDW+LEF D
Sbjct: 1240 KGGKDWILEFTD 1251


>ref|XP_007013730.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 4 [Theobroma cacao] gi|508784093|gb|EOY31349.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 4 [Theobroma cacao]
          Length = 1721

 Score =  226 bits (576), Expect = 1e-56
 Identities = 124/312 (39%), Positives = 173/312 (55%), Gaps = 19/312 (6%)
 Frame = +3

Query: 33   RDGTSDTNIRKIDAEAPGFDG-SGSSKKIQKGNPE------GDENASKVISQPCLPEPRK 191
            +D  SDT +  +D    G +    SS+K + G+              +V +   +P  ++
Sbjct: 942  KDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQ 1001

Query: 192  EAHQLIDAPILPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE------------SRQASD 335
            +        ++ S  + +        S+S L  + + IPS +            ++Q+SD
Sbjct: 1002 QCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSD 1061

Query: 336  VDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKP 515
            + WN+    I  PN    R++W             +    W +G  +F  N F NGPKKP
Sbjct: 1062 LTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIG--YNAHGWSEGKADFFHNNFGNGPKKP 1119

Query: 516  RTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANV 695
            RTQV Y++PF G D S K K    +  P KRIR+A+ KR SD S  S+KN+ELLSC AN+
Sbjct: 1120 RTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANL 1179

Query: 696  LVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMW 875
            L+T GD+GWRE GA + LE+ DHNEW+LAVK+SG T+YS+K    LQPGSTNRY+HAMMW
Sbjct: 1180 LITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMW 1239

Query: 876  KGGKDWVLEFPD 911
            KGGKDW+LEF D
Sbjct: 1240 KGGKDWILEFTD 1251


>ref|XP_007013729.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 3 [Theobroma cacao] gi|508784092|gb|EOY31348.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 3 [Theobroma cacao]
          Length = 1674

 Score =  226 bits (576), Expect = 1e-56
 Identities = 124/312 (39%), Positives = 173/312 (55%), Gaps = 19/312 (6%)
 Frame = +3

Query: 33   RDGTSDTNIRKIDAEAPGFDG-SGSSKKIQKGNPE------GDENASKVISQPCLPEPRK 191
            +D  SDT +  +D    G +    SS+K + G+              +V +   +P  ++
Sbjct: 923  KDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQ 982

Query: 192  EAHQLIDAPILPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE------------SRQASD 335
            +        ++ S  + +        S+S L  + + IPS +            ++Q+SD
Sbjct: 983  QCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSD 1042

Query: 336  VDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKP 515
            + WN+    I  PN    R++W             +    W +G  +F  N F NGPKKP
Sbjct: 1043 LTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIG--YNAHGWSEGKADFFHNNFGNGPKKP 1100

Query: 516  RTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANV 695
            RTQV Y++PF G D S K K    +  P KRIR+A+ KR SD S  S+KN+ELLSC AN+
Sbjct: 1101 RTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANL 1160

Query: 696  LVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMW 875
            L+T GD+GWRE GA + LE+ DHNEW+LAVK+SG T+YS+K    LQPGSTNRY+HAMMW
Sbjct: 1161 LITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMW 1220

Query: 876  KGGKDWVLEFPD 911
            KGGKDW+LEF D
Sbjct: 1221 KGGKDWILEFTD 1232


>ref|XP_007013727.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao]
            gi|590579224|ref|XP_007013728.1| Enhancer of
            polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508784090|gb|EOY31346.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
            gi|508784091|gb|EOY31347.1| Enhancer of polycomb-like
            transcription factor protein, putative isoform 1
            [Theobroma cacao]
          Length = 1693

 Score =  226 bits (576), Expect = 1e-56
 Identities = 124/312 (39%), Positives = 173/312 (55%), Gaps = 19/312 (6%)
 Frame = +3

Query: 33   RDGTSDTNIRKIDAEAPGFDG-SGSSKKIQKGNPE------GDENASKVISQPCLPEPRK 191
            +D  SDT +  +D    G +    SS+K + G+              +V +   +P  ++
Sbjct: 942  KDAASDTELTTLDLSVCGDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQ 1001

Query: 192  EAHQLIDAPILPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE------------SRQASD 335
            +        ++ S  + +        S+S L  + + IPS +            ++Q+SD
Sbjct: 1002 QCAHSESEQLVSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSD 1061

Query: 336  VDWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKP 515
            + WN+    I  PN    R++W             +    W +G  +F  N F NGPKKP
Sbjct: 1062 LTWNMNGGIIPSPNPTAPRSTWHRNRSSSSSIG--YNAHGWSEGKADFFHNNFGNGPKKP 1119

Query: 516  RTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANV 695
            RTQV Y++PF G D S K K    +  P KRIR+A+ KR SD S  S+KN+ELLSC AN+
Sbjct: 1120 RTQVSYSMPFGGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANL 1179

Query: 696  LVTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMW 875
            L+T GD+GWRE GA + LE+ DHNEW+LAVK+SG T+YS+K    LQPGSTNRY+HAMMW
Sbjct: 1180 LITLGDRGWRECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMW 1239

Query: 876  KGGKDWVLEFPD 911
            KGGKDW+LEF D
Sbjct: 1240 KGGKDWILEFTD 1251


>ref|XP_004498624.1| PREDICTED: uncharacterized protein LOC101499788 [Cicer arietinum]
          Length = 1658

 Score =  225 bits (573), Expect = 2e-56
 Identities = 123/251 (49%), Positives = 153/251 (60%), Gaps = 8/251 (3%)
 Frame = +3

Query: 183  PRKEAHQLIDAPILPSMPTS--ITSQTPVPRSDSTLGGMTIVIPSSE------SRQASDV 338
            P  ++H+   A  L S+P+S  I        S S  G + + IPS +      ++Q+ D+
Sbjct: 967  PELQSHR--SAQKLGSLPSSSLIHQDKADDSSHSLNGDLHLQIPSVDDFEKPNAQQSPDL 1024

Query: 339  DWNVRDSFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPR 518
             WNV  S I   N    R+SW              +   W DG  +   N FSNGPKKPR
Sbjct: 1025 SWNVHGSVIPSSNRTAPRSSWHRTRNSSLSLGF--QSHAWADGKADSLYNDFSNGPKKPR 1082

Query: 519  TQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVL 698
            TQV Y++P  G +LS K KS   K LP KRIRKAS K+ +D +    KN E LSC ANVL
Sbjct: 1083 TQVSYSVPLAGYELSSKHKSHHQKGLPNKRIRKASEKKSADVARAPEKNFECLSCDANVL 1142

Query: 699  VTHGDKGWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWK 878
            +T GDKGWREYGAH+VLE+ DHNEW+L+VKL GVT+YSYK    +Q GSTNRY+H+MMWK
Sbjct: 1143 ITVGDKGWREYGAHVVLELFDHNEWKLSVKLLGVTRYSYKAHQFMQLGSTNRYTHSMMWK 1202

Query: 879  GGKDWVLEFPD 911
            GGKDW LEF D
Sbjct: 1203 GGKDWTLEFTD 1213


>ref|XP_002309585.2| hypothetical protein POPTR_0006s26240g [Populus trichocarpa]
            gi|550337121|gb|EEE93108.2| hypothetical protein
            POPTR_0006s26240g [Populus trichocarpa]
          Length = 1685

 Score =  223 bits (567), Expect = 1e-55
 Identities = 118/245 (48%), Positives = 141/245 (57%), Gaps = 11/245 (4%)
 Frame = +3

Query: 210  DAPILPSMPTSITSQTPVPRSDSTLGGMTIVIPSSESRQ-----------ASDVDWNVRD 356
            D  I  + P S T     P S + L G+T+ IPS    Q           +SD+ WN+  
Sbjct: 1008 DGCISRAKPESQTVDGTDPGSRTLLKGITVEIPSVNLNQHVNKELHSVQRSSDLSWNMNG 1067

Query: 357  SFIHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYT 536
              I  PN    R++W                  W DG  +F  N F NGPKKPRT V YT
Sbjct: 1068 GIIPSPNPTARRSTWYRNRSSSASFG-------WSDGRTDFLQNNFGNGPKKPRTHVSYT 1120

Query: 537  LPFVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDK 716
            LP  G D S + +    K    KRIR A+ KR SD S  S +N+ELLSC ANVL+T+GDK
Sbjct: 1121 LPLGGFDYSPRNRGQQQKGFSHKRIRTATEKRTSDISRGSERNLELLSCDANVLITNGDK 1180

Query: 717  GWREYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWV 896
            GWRE G  +VLE+ DHNEWRL +KLSG TKYSYK    LQ GSTNR++HAMMWKGGK+W 
Sbjct: 1181 GWRECGVQVVLELFDHNEWRLGIKLSGTTKYSYKAHQFLQTGSTNRFTHAMMWKGGKEWT 1240

Query: 897  LEFPD 911
            LEFPD
Sbjct: 1241 LEFPD 1245


>ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Populus trichocarpa]
            gi|550317762|gb|EEF03395.2| hypothetical protein
            POPTR_0018s01030g [Populus trichocarpa]
          Length = 1722

 Score =  222 bits (566), Expect = 1e-55
 Identities = 128/293 (43%), Positives = 156/293 (53%), Gaps = 21/293 (7%)
 Frame = +3

Query: 96   SGSSKKIQKGNPEGDENASKVISQPCLPEPRKEAH---QLIDAPILPSMPTSITSQTPVP 266
            SG   K    N  GD N     S   L E    A    Q ++     S P  + S+  + 
Sbjct: 997  SGGDWKKSLSNQSGDVNVEISASYRDLGESGSGAIVPLQNLECNHSESQPCDLLSRLSIN 1056

Query: 267  RSDSTLG------GMTIVIPSSES------------RQASDVDWNVRDSFIHKPNTIGFR 392
            + ++  G      G+T+ IPS               +Q+SD+ WN+    I  PN    R
Sbjct: 1057 KDETGAGSHALSNGITVDIPSVNQFDQHVNKELQGVQQSSDLSWNMNGGVIPSPNPTARR 1116

Query: 393  NSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQ 572
            ++W                  W +G  +F  N F NGPKKPRTQV Y LPF G D S + 
Sbjct: 1117 STWHRNRSSFASFG-------WSEGRADFLQNNFGNGPKKPRTQVSYALPFGGFDYSPRN 1169

Query: 573  KSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLE 752
            K    K  P KRIR A+ KR S  S  S + +ELLSC ANVL+T+GDKGWRE G  +VLE
Sbjct: 1170 KGYQQKGFPHKRIRTATEKRTSFISRGSERKLELLSCDANVLITNGDKGWRECGVQVVLE 1229

Query: 753  VDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            + DHNEWRL VKLSG TKYSYK    LQ GSTNR++HAMMWKGGKDW LEFPD
Sbjct: 1230 LFDHNEWRLGVKLSGTTKYSYKAHQFLQTGSTNRFTHAMMWKGGKDWTLEFPD 1282


>gb|EXC20799.1| hypothetical protein L484_007381 [Morus notabilis]
          Length = 1690

 Score =  221 bits (562), Expect = 4e-55
 Identities = 116/226 (51%), Positives = 141/226 (62%), Gaps = 12/226 (5%)
 Frame = +3

Query: 270  SDSTLGGMTIVIPSSE------------SRQASDVDWNVRDSFIHKPNTIGFRNSWQXXX 413
            S S + G+++ IP               ++QA+D+ WN   +    PN    R++W    
Sbjct: 1025 SQSFVNGLSVEIPPFNQFEKSVDGELHGAQQATDLSWNTNGAIFSSPNPTAPRSTWHRNK 1084

Query: 414  XXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKSPSAKS 593
                     H    W DG  +   NGF NGPKKPRTQV Y LPF G D S KQKS   K 
Sbjct: 1085 QNSSFGHLSHG---WSDGKADPVYNGFGNGPKKPRTQVSYLLPFGGFDCSPKQKSIQ-KG 1140

Query: 594  LPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEVDDHNEW 773
            LP KR+RKAS KR SD S  S++N+ELLSC  N+L+T  D+GWRE GA +VLE+ D +EW
Sbjct: 1141 LPSKRLRKASEKRSSDVSRGSQRNLELLSCDVNILITATDRGWRECGAQVVLELFDDHEW 1200

Query: 774  RLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            +LAVKLSGVTKYSYK    LQPGSTNR++HAMMWKGGKDW LEF D
Sbjct: 1201 KLAVKLSGVTKYSYKAHQFLQPGSTNRFTHAMMWKGGKDWTLEFMD 1246


>ref|XP_006596126.1| PREDICTED: uncharacterized protein LOC100781778 isoform X2 [Glycine
            max]
          Length = 1473

 Score =  219 bits (558), Expect = 1e-54
 Identities = 115/243 (47%), Positives = 146/243 (60%), Gaps = 13/243 (5%)
 Frame = +3

Query: 222  LPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE-------------SRQASDVDWNVRDSF 362
            LPS P  I        S S++G ++I IP+ +             +  + D  WN+    
Sbjct: 962  LPSSPL-IRQDKADDGSHSSIGDLSIQIPAVDQFEKPGDDGDLRNAEHSPDFSWNINGGG 1020

Query: 363  IHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLP 542
            +   N    R+SW              +  VW DG  +   N F NGPKKPRTQV Y++P
Sbjct: 1021 LPNSNPTARRSSWYRNRNSSLSLGF--QSHVWSDGKADSLCNDFINGPKKPRTQVSYSVP 1078

Query: 543  FVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGW 722
              G + S K+++   K  P KRIRKAS K+ SD +    KNVE LSC ANVL+T G+KGW
Sbjct: 1079 SAGYEFSSKRRNHHQKGFPHKRIRKASEKKSSDVARRLEKNVECLSCGANVLITLGNKGW 1138

Query: 723  REYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLE 902
            R+ GAH+VLE+ DHNEWRL+VKL G+T+YSYK    LQPGSTNRY+HAMMWKGGKDW+LE
Sbjct: 1139 RDSGAHVVLELFDHNEWRLSVKLLGITRYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILE 1198

Query: 903  FPD 911
            FPD
Sbjct: 1199 FPD 1201


>ref|XP_003545513.1| PREDICTED: uncharacterized protein LOC100781778 isoform X1 [Glycine
            max]
          Length = 1603

 Score =  219 bits (558), Expect = 1e-54
 Identities = 115/243 (47%), Positives = 146/243 (60%), Gaps = 13/243 (5%)
 Frame = +3

Query: 222  LPSMPTSITSQTPVPRSDSTLGGMTIVIPSSE-------------SRQASDVDWNVRDSF 362
            LPS P  I        S S++G ++I IP+ +             +  + D  WN+    
Sbjct: 962  LPSSPL-IRQDKADDGSHSSIGDLSIQIPAVDQFEKPGDDGDLRNAEHSPDFSWNINGGG 1020

Query: 363  IHKPNTIGFRNSWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLP 542
            +   N    R+SW              +  VW DG  +   N F NGPKKPRTQV Y++P
Sbjct: 1021 LPNSNPTARRSSWYRNRNSSLSLGF--QSHVWSDGKADSLCNDFINGPKKPRTQVSYSVP 1078

Query: 543  FVGCDLSEKQKSPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGW 722
              G + S K+++   K  P KRIRKAS K+ SD +    KNVE LSC ANVL+T G+KGW
Sbjct: 1079 SAGYEFSSKRRNHHQKGFPHKRIRKASEKKSSDVARRLEKNVECLSCGANVLITLGNKGW 1138

Query: 723  REYGAHIVLEVDDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLE 902
            R+ GAH+VLE+ DHNEWRL+VKL G+T+YSYK    LQPGSTNRY+HAMMWKGGKDW+LE
Sbjct: 1139 RDSGAHVVLELFDHNEWRLSVKLLGITRYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILE 1198

Query: 903  FPD 911
            FPD
Sbjct: 1199 FPD 1201


>ref|XP_004162065.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101228859
            [Cucumis sativus]
          Length = 1466

 Score =  218 bits (554), Expect = 3e-54
 Identities = 120/232 (51%), Positives = 147/232 (63%), Gaps = 10/232 (4%)
 Frame = +3

Query: 246  TSQTPVPRSD--STLGGMTIVIPSSES--------RQASDVDWNVRDSFIHKPNTIGFRN 395
            T+   V RSD  S L  +++ IPS +         +Q+ DV WN     I  PN    R+
Sbjct: 977  TALPNVARSDNNSFLNDLSVEIPSFQPVDGELHGPQQSMDVGWNASAVVIPSPNPTAPRS 1036

Query: 396  SWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQK 575
            +W                  W DG+ +   NG  N  KKPRTQV Y+LPF G D S K +
Sbjct: 1037 TWHRNKNNSTSLGLASHG--WSDGN-SLLINGLGNRTKKPRTQVSYSLPFGGFDYSSKSR 1093

Query: 576  SPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEV 755
            +   K+ P KRIR+AS KR SD +  S++N+ELLSC ANVL+T GD+GWRE GA +VLEV
Sbjct: 1094 NSHPKASPYKRIRRASEKR-SDVARGSKRNLELLSCDANVLITLGDRGWRECGAKVVLEV 1152

Query: 756  DDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
             DHNEW+LAVKLSG+TKYSYK    LQPGSTNRY+HAMMWKGGKDW+LEFPD
Sbjct: 1153 FDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPD 1204


>ref|XP_004136466.1| PREDICTED: uncharacterized protein LOC101216141 [Cucumis sativus]
          Length = 1476

 Score =  218 bits (554), Expect = 3e-54
 Identities = 120/232 (51%), Positives = 147/232 (63%), Gaps = 10/232 (4%)
 Frame = +3

Query: 246  TSQTPVPRSD--STLGGMTIVIPSSES--------RQASDVDWNVRDSFIHKPNTIGFRN 395
            T+   V RSD  S L  +++ IPS +         +Q+ DV WN     I  PN    R+
Sbjct: 810  TALPNVARSDNNSFLNDLSVEIPSFQPVDGELHGPQQSMDVGWNASAVVIPSPNPTAPRS 869

Query: 396  SWQXXXXXXXXXXXXHKPQVWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQK 575
            +W                  W DG+ +   NG  N  KKPRTQV Y+LPF G D S K +
Sbjct: 870  TWHRNKNNSTSLGLASHG--WSDGN-SLLINGLGNRTKKPRTQVSYSLPFGGFDYSSKSR 926

Query: 576  SPSAKSLPCKRIRKASLKRISDGSGNSRKNVELLSCVANVLVTHGDKGWREYGAHIVLEV 755
            +   K+ P KRIR+AS KR SD +  S++N+ELLSC ANVL+T GD+GWRE GA +VLEV
Sbjct: 927  NSHPKASPYKRIRRASEKR-SDVARGSKRNLELLSCDANVLITLGDRGWRECGAKVVLEV 985

Query: 756  DDHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
             DHNEW+LAVKLSG+TKYSYK    LQPGSTNRY+HAMMWKGGKDW+LEFPD
Sbjct: 986  FDHNEWKLAVKLSGITKYSYKAHQFLQPGSTNRYTHAMMWKGGKDWILEFPD 1037


>ref|XP_004245412.1| PREDICTED: uncharacterized protein LOC101258290 [Solanum
            lycopersicum]
          Length = 1659

 Score =  217 bits (553), Expect = 4e-54
 Identities = 102/154 (66%), Positives = 122/154 (79%), Gaps = 1/154 (0%)
 Frame = +3

Query: 453  VWPDGSPNFKPNGFSNGPKKPRTQVQYTLPFVGCDLSEKQKSPSAKSLPCKRIRKASLKR 632
            VW DG  NF   GF NGPK+PRTQVQYTL + G D S   K+ S ++LP KRIR+AS K+
Sbjct: 1068 VWVDGKANFTGGGFGNGPKRPRTQVQYTLSYGGYDFSSMHKNHSPRTLPYKRIRRASEKK 1127

Query: 633  ISDGSGNSRKNVELLSCVANVLVTHGD-KGWREYGAHIVLEVDDHNEWRLAVKLSGVTKY 809
             +D  G S++N+ELL+C ANVLVT G  KGWRE+GA IVLE+  HNEW++AVK SG TKY
Sbjct: 1128 NADSCGGSQRNIELLACNANVLVTLGGVKGWREFGARIVLEIAGHNEWKIAVKFSGATKY 1187

Query: 810  SYKVKHILQPGSTNRYSHAMMWKGGKDWVLEFPD 911
            SYKV ++LQPGSTNR++HAMMWKGGKDWVLEFPD
Sbjct: 1188 SYKVHNVLQPGSTNRFTHAMMWKGGKDWVLEFPD 1221

