BLASTX nr result

ID: Forsythia23_contig00008415 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00008415
         (1211 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011081867.1| PREDICTED: uncharacterized protein LOC105164...   338   4e-90
ref|XP_009627319.1| PREDICTED: uncharacterized protein LOC104117...   310   2e-81
emb|CDP18428.1| unnamed protein product [Coffea canephora]            301   5e-79
ref|XP_009778721.1| PREDICTED: uncharacterized protein LOC104228...   301   9e-79
ref|XP_010648566.1| PREDICTED: uncharacterized protein LOC100264...   282   3e-73
ref|XP_007013731.1| Enhancer of polycomb-like transcription fact...   280   2e-72
ref|XP_007013730.1| Enhancer of polycomb-like transcription fact...   280   2e-72
ref|XP_007013729.1| Enhancer of polycomb-like transcription fact...   280   2e-72
ref|XP_007013727.1| Enhancer of polycomb-like transcription fact...   280   2e-72
ref|XP_010109047.1| hypothetical protein L484_007381 [Morus nota...   274   9e-71
ref|XP_012462722.1| PREDICTED: uncharacterized protein LOC105782...   270   1e-69
ref|XP_008219843.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   268   5e-69
emb|CBI20940.3| unnamed protein product [Vitis vinifera]              268   5e-69
ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus c...   268   6e-69
ref|XP_012078606.1| PREDICTED: uncharacterized protein LOC105639...   266   2e-68
ref|XP_008378284.1| PREDICTED: uncharacterized protein LOC103441...   263   3e-67
gb|KHG16466.1| DNA mismatch repair Msh6-1 -like protein [Gossypi...   262   4e-67
ref|XP_010325156.1| PREDICTED: uncharacterized protein LOC101258...   262   4e-67
ref|XP_012855912.1| PREDICTED: uncharacterized protein LOC105975...   260   2e-66
ref|XP_008394009.1| PREDICTED: uncharacterized protein LOC103456...   259   3e-66

>ref|XP_011081867.1| PREDICTED: uncharacterized protein LOC105164793 [Sesamum indicum]
          Length = 1713

 Score =  338 bits (868), Expect = 4e-90
 Identities = 201/419 (47%), Positives = 261/419 (62%), Gaps = 19/419 (4%)
 Frame = -3

Query: 1200 ESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDTLC 1021
            E+CN RM QS F  AAK G++  FALSF AAPTFFL+LHL++L+E +FA  NL+  D LC
Sbjct: 862  EACNTRMSQSAFTLAAKPGKVPQFALSFCAAPTFFLTLHLQLLMEHSFAWFNLQHEDALC 921

Query: 1020 SLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYE-------------L 880
            SLEN E+   Q+V +  Q+E+ SV ++++ AE     +  EA +++             +
Sbjct: 922  SLENSENG-DQLVAECSQLEASSVAVQDVPAEPEIRKMDAEALTFQGLKSCQQDLGMDII 980

Query: 879  LSSYTVSDSNVAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCF 700
            L+S TV ++N +   ++LQ  K  +     C K+  +    V +    +E  ++V EQ  
Sbjct: 981  LASNTVENTNSS---EELQKGKSDNDGTACCLKEFTEITPEVIAQPHQYEPMKEVDEQ-I 1036

Query: 699  VLPLPCMSNSITCTSSNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSA 526
            VL  P    S TC   N   DS+ G  +VEIPS E V++  +G+  ISR+TS   W++  
Sbjct: 1037 VLSAPVSVTSATC---NPRSDSTSGGMTVEIPSLEHVNVHFDGKSCISRQTSCGVWNIHD 1093

Query: 525  GFVHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYA 346
            GFVH+PNPTG RS   RG +SS  SP G+  PV  DG    + +  S G KKPR QVQY 
Sbjct: 1094 GFVHNPNPTGSRSSLQRGRSSSIYSPLGHHSPVWPDGNPNLVSSGLSNGPKKPRTQVQYT 1153

Query: 345  RPFGGY--GAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDK 178
             PF GY   AK K  N R+LP +RI   S KR SD    +Q+NLEL  C AN+LVT  DK
Sbjct: 1154 LPFVGYDFSAKQKMQNLRSLPCKRIRRASLKRTSDGSVNNQKNLELLTCVANILVTHGDK 1213

Query: 177  GWRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            GWRE GA IVLE AD NEWRLAVKLSG+T+Y YKV+H LQPGSTNR++HAMMWKGGKDW
Sbjct: 1214 GWRECGANIVLEHADHNEWRLAVKLSGVTKYSYKVKHILQPGSTNRYSHAMMWKGGKDW 1272


>ref|XP_009627319.1| PREDICTED: uncharacterized protein LOC104117893 [Nicotiana
            tomentosiformis]
          Length = 1682

 Score =  310 bits (793), Expect = 2e-81
 Identities = 191/433 (44%), Positives = 250/433 (57%), Gaps = 31/433 (7%)
 Frame = -3

Query: 1206 SRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDT 1027
            S E  + R+  S F  A K GR+ PFALSF AAPTFF+ LHL++L+ERNFAC++L+DYD+
Sbjct: 845  STECSSARVTSSTFSSAMKLGRIPPFALSFTAAPTFFICLHLRLLMERNFACVSLQDYDS 904

Query: 1026 LCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSNV 847
            +       +A Q + +D  +VE   +  ENI A     S   E               ++
Sbjct: 905  I-------NACQPVKDDGSRVECSDI-AENIVASSTGGSSFAERKL-----------GSL 945

Query: 846  AYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPLPCMSNSI 667
            A KL+  QN +   ++++  +K    +   V  +S   ES  Q L+Q    P    SN+ 
Sbjct: 946  ACKLKSSQNCQLDITQSSFIAKYSELDTPDVIVVSNKSESVGQGLDQFVASPGRRQSNNT 1005

Query: 666  TCTSSNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGF--------- 520
            + + S+  C S     SV IPSF+QV+    G+G I  ETS L  + S G          
Sbjct: 1006 SHSLSSARCHSGLVGMSVVIPSFDQVEGLSEGKGIILGETSHLTLNKSDGMISSPKLTVT 1065

Query: 519  ----------------VHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDF 388
                            V SPNP+GPR   +R  NSSSSSPFG + PV  DGKT      F
Sbjct: 1066 SNVVKCPIIAGTSDRMVQSPNPSGPRGLLYRNRNSSSSSPFGEISPVLVDGKTNFTRGGF 1125

Query: 387  SYGLKKPRNQVQYARPFGGY--GAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLEL 220
              G KKPR QVQY  P+GGY  G+ H+ ++ RTLP +RI   SEK+ +D    SQRN+EL
Sbjct: 1126 GNGPKKPRTQVQYTLPYGGYDLGSMHRNHSPRTLPYKRIRRASEKKNADNCSGSQRNIEL 1185

Query: 219  RACGANLLVTVEDKGWRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNR 40
             +C AN+LVTV DKGWRE GA++VLE+A  NEWR+AVK +G+T+Y YKV + LQPGSTNR
Sbjct: 1186 LSCDANVLVTVPDKGWREFGARVVLEIAGHNEWRIAVKFAGVTKYSYKVHNILQPGSTNR 1245

Query: 39   FTHAMMWKGGKDW 1
            FTHAMMWKGGKDW
Sbjct: 1246 FTHAMMWKGGKDW 1258


>emb|CDP18428.1| unnamed protein product [Coffea canephora]
          Length = 1698

 Score =  301 bits (772), Expect = 5e-79
 Identities = 184/418 (44%), Positives = 254/418 (60%), Gaps = 15/418 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES +       F  A K G++  FALSF AAPTFFLSLHLK+L+E+NF+ IN +D  
Sbjct: 838  VSRESSSKTTSSFAFNSAIKLGKIPAFALSFTAAPTFFLSLHLKLLLEQNFSSINFQDNA 897

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
            +L ++ + E   Q        ++    N+       + ++ + +A S  L S+   S  +
Sbjct: 898  SLSAIGDSEVDVQSTAILHPDIDPCPENVIGKIPGCDKQTSLADAGSQFLSSAEPCSGKD 957

Query: 849  VAYKLQDLQNDKPTSS---EATVCSK-----DLVKNKTVVNSLSPNFESNEQVLEQCFVL 694
            V+ ++ D+   K  S+   + T+        D+++   VVN    N ES+ Q LEQ    
Sbjct: 958  VSSEVSDVDRGKSASNGKQDMTLSPSISKDFDMLETDRVVNP--SNHESHNQELEQNVAS 1015

Query: 693  PLPCMSNSITCTS-SNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAG 523
                +S ++  T  SN +  SS G  S+E+PS +Q D P +   +IS + SDL  +MS G
Sbjct: 1016 SDLSVSRTVAPTGLSNTTGFSSLGGLSIELPSSDQNDKPLDQGVNISGQVSDLAGNMSDG 1075

Query: 522  FVHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYAR 343
             + SP  +G RS   R  N S++SPFG+  PV   GK+  + N F  G KKPR QVQY  
Sbjct: 1076 VLQSPCTSGLRSSLRRDRNCSNNSPFGDHSPVWPHGKSNFISNGFGNGPKKPRTQVQYTL 1135

Query: 342  PFGGY--GAKHKTNNQRTLPSRRI--DSEKRVSDRPRRSQRNLELRACGANLLVTVEDKG 175
            P G Y   +++++ +Q++ P +RI   +EKRVSD  R SQ+NLEL +C AN+LVTV DKG
Sbjct: 1136 PPGVYDSSSRYQSQSQKSFPYKRIRRSNEKRVSDGSRSSQKNLELLSCDANILVTVRDKG 1195

Query: 174  WRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            WRE GA+I+LEL D+NEW+LAVK+SG+TRY YKV H LQPGSTNRFTHAMMWKGGKDW
Sbjct: 1196 WRECGARIILELTDQNEWKLAVKVSGVTRYSYKVNHILQPGSTNRFTHAMMWKGGKDW 1253


>ref|XP_009778721.1| PREDICTED: uncharacterized protein LOC104228007 [Nicotiana
            sylvestris]
          Length = 1711

 Score =  301 bits (770), Expect = 9e-79
 Identities = 187/433 (43%), Positives = 246/433 (56%), Gaps = 31/433 (7%)
 Frame = -3

Query: 1206 SRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDT 1027
            S E  + R+  S F  A K GR+ PFALSF AAPTFF+ LHL++L+ERNFAC++L+DYD+
Sbjct: 845  STECSSARLTSSTFSSAMKLGRIPPFALSFTAAPTFFICLHLRLLMERNFACVSLQDYDS 904

Query: 1026 LCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSNV 847
            +       +A Q + +D  +VE  S   ENI A     +  +     +L +       + 
Sbjct: 905  I-------NACQPVKDDGSRVEC-SDTAENIVASSTGVTGGSSLAERKLGNLACKQQLSE 956

Query: 846  AYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPLPCMSNSI 667
               L+  QN +   + ++  +K      + V  +S   ES  Q L+Q    P    SN+I
Sbjct: 957  RVSLKSSQNCQLDITPSSFIAKHSELGTSDVIVVSHKSESVGQGLDQFVASPGRRQSNNI 1016

Query: 666  TCTSSNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGF--------- 520
            + +  +  C S     SV IPSF+QV+    G+G I  E S L  + S G          
Sbjct: 1017 SHSLPSARCHSGLVGMSVVIPSFDQVEGLSEGKGIILGEASHLTLNKSDGMISSPNLTVT 1076

Query: 519  ----------------VHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDF 388
                            V SPNP+GPR    R  NSSSSSPFG + PV  DGKT      F
Sbjct: 1077 SNVVQCPIIAGMSDRMVQSPNPSGPRGLLCRNRNSSSSSPFGEISPVLVDGKTNFTRGGF 1136

Query: 387  SYGLKKPRNQVQYARPFGGY--GAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLEL 220
              G KKPR QVQY  P+G Y  G+ H+ ++ RTLP +RI   S+K+ +D    SQRN+EL
Sbjct: 1137 GNGPKKPRTQVQYTLPYGSYALGSMHRNHSPRTLPYKRIRRASDKKNADNCSGSQRNIEL 1196

Query: 219  RACGANLLVTVEDKGWRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNR 40
             +C AN+LVTV DKGWRE GA++VLE+A  NEWR+AVK SG+T+Y YKV + LQPGSTNR
Sbjct: 1197 LSCDANVLVTVPDKGWREFGARVVLEIAGHNEWRIAVKFSGVTKYSYKVHNILQPGSTNR 1256

Query: 39   FTHAMMWKGGKDW 1
            FTHAMMWKGGKDW
Sbjct: 1257 FTHAMMWKGGKDW 1269


>ref|XP_010648566.1| PREDICTED: uncharacterized protein LOC100264575 [Vitis vinifera]
          Length = 1679

 Score =  282 bits (722), Expect = 3e-73
 Identities = 174/415 (41%), Positives = 234/415 (56%), Gaps = 12/415 (2%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  + M QS        G+L PFALSF AAPTFFL LHLK+L+E       L D++
Sbjct: 841  VSRESTFVNMSQSSSSLDVNQGKLPPFALSFNAAPTFFLGLHLKLLMEHRVDSTCLHDHN 900

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
                 +N E  T+ +        S   +  N    +  +S   +         Y  S+ N
Sbjct: 901  PTSPKQNLESLTEDVTW------SGQFSGANPQIAKQAQSACNDDDRINSFQKYENSNLN 954

Query: 849  VAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESN--EQVLEQCFVLPLPCMS 676
            VA                + CS+D    +T ++++    E        EQC + P P + 
Sbjct: 955  VA--------------GTSACSEDT--GETGIDAIVQLQEQQGYHSEAEQCILSPQPLLL 998

Query: 675  NSITCTS-SNLSCDSSFG--SVEIPSFEQVDMPCNGRG---HISRETSDLGWSMSAGFVH 514
            N  + T  SN+ C S     +V+IP+F+QV+   + RG    IS+++ DL W+++ G + 
Sbjct: 999  NGHSSTGKSNVGCYSRLNGINVQIPTFDQVEKSFD-RGADISISQQSVDLSWNVNDGVIR 1057

Query: 513  SPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPFG 334
            SPNPT PRS W R  NS SSS FG    + SDGK     N F  G KKPR QV Y  P G
Sbjct: 1058 SPNPTAPRSMWQRNKNSFSSS-FGYPSHMWSDGKGDFFGNGFGNGPKKPRTQVSYTLPVG 1116

Query: 333  GY--GAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRE 166
            G+   +K ++++Q+ LP++RI   +EKR+SD  R SQRNLE  +C AN+L+T  D+GWRE
Sbjct: 1117 GFDFSSKQRSHHQKGLPNKRIRRANEKRLSDGSRSSQRNLESLSCEANVLITFGDRGWRE 1176

Query: 165  SGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            SGAQ++LEL D NEW+LAVK+SG T+Y YK    LQPG+ NRFTHAMMWKGGKDW
Sbjct: 1177 SGAQVILELGDHNEWKLAVKVSGATKYSYKAHQFLQPGTANRFTHAMMWKGGKDW 1231


>ref|XP_007013731.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 5 [Theobroma cacao] gi|508784094|gb|EOY31350.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 5 [Theobroma cacao]
          Length = 1522

 Score =  280 bits (716), Expect = 2e-72
 Identities = 181/416 (43%), Positives = 242/416 (58%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  L++GQ       KH  L  FALSFGAAPTFFLSLHLK+L+E + A I+ +D+D
Sbjct: 842  VSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPTFFLSLHLKLLMEHSVARISFQDHD 901

Query: 1029 TLCSLENPEDATQQIVEDRMQVES-PSVNIENIHAERNFESVVTEAPSYELLSSYTVS-- 859
            +   L +  D    +V+D    E       ++   E+N ++   +A S   L++  +S  
Sbjct: 902  SNEQLGSSGDL---MVDDSSNREDCVDKRFDSSSVEKNLKASSKDAASDTELTTLDLSVC 958

Query: 858  -DSNVAYKLQDLQNDKPT--SSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPL 688
             D +     Q  +N   T   + A+    + V    +V         +E   EQ     L
Sbjct: 959  GDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSES--EQ-----L 1011

Query: 687  PCMSNSITCTSSNLSCDSSFGS---VEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
               S S+     N +  +S  +   VEIPSF+Q +   +G    ++++SDL W+M+ G +
Sbjct: 1012 VSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGII 1071

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT PRS WHR  N SSSS  G      S+GK    HN+F  G KKPR QV Y+ PF
Sbjct: 1072 PSPNPTAPRSTWHR--NRSSSSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPF 1129

Query: 336  GG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            GG  Y +K+K ++QR  P +RI   +EKR SD  R SQ+NLEL +C ANLL+T+ D+GWR
Sbjct: 1130 GGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWR 1189

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+ LEL D NEW+LAVK+SG TRY +K    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1190 ECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMWKGGKDW 1245


>ref|XP_007013730.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 4 [Theobroma cacao] gi|508784093|gb|EOY31349.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 4 [Theobroma cacao]
          Length = 1721

 Score =  280 bits (716), Expect = 2e-72
 Identities = 181/416 (43%), Positives = 242/416 (58%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  L++GQ       KH  L  FALSFGAAPTFFLSLHLK+L+E + A I+ +D+D
Sbjct: 842  VSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPTFFLSLHLKLLMEHSVARISFQDHD 901

Query: 1029 TLCSLENPEDATQQIVEDRMQVES-PSVNIENIHAERNFESVVTEAPSYELLSSYTVS-- 859
            +   L +  D    +V+D    E       ++   E+N ++   +A S   L++  +S  
Sbjct: 902  SNEQLGSSGDL---MVDDSSNREDCVDKRFDSSSVEKNLKASSKDAASDTELTTLDLSVC 958

Query: 858  -DSNVAYKLQDLQNDKPT--SSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPL 688
             D +     Q  +N   T   + A+    + V    +V         +E   EQ     L
Sbjct: 959  GDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSES--EQ-----L 1011

Query: 687  PCMSNSITCTSSNLSCDSSFGS---VEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
               S S+     N +  +S  +   VEIPSF+Q +   +G    ++++SDL W+M+ G +
Sbjct: 1012 VSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGII 1071

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT PRS WHR  N SSSS  G      S+GK    HN+F  G KKPR QV Y+ PF
Sbjct: 1072 PSPNPTAPRSTWHR--NRSSSSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPF 1129

Query: 336  GG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            GG  Y +K+K ++QR  P +RI   +EKR SD  R SQ+NLEL +C ANLL+T+ D+GWR
Sbjct: 1130 GGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWR 1189

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+ LEL D NEW+LAVK+SG TRY +K    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1190 ECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMWKGGKDW 1245


>ref|XP_007013729.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 3 [Theobroma cacao] gi|508784092|gb|EOY31348.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 3 [Theobroma cacao]
          Length = 1674

 Score =  280 bits (716), Expect = 2e-72
 Identities = 181/416 (43%), Positives = 242/416 (58%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  L++GQ       KH  L  FALSFGAAPTFFLSLHLK+L+E + A I+ +D+D
Sbjct: 823  VSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPTFFLSLHLKLLMEHSVARISFQDHD 882

Query: 1029 TLCSLENPEDATQQIVEDRMQVES-PSVNIENIHAERNFESVVTEAPSYELLSSYTVS-- 859
            +   L +  D    +V+D    E       ++   E+N ++   +A S   L++  +S  
Sbjct: 883  SNEQLGSSGDL---MVDDSSNREDCVDKRFDSSSVEKNLKASSKDAASDTELTTLDLSVC 939

Query: 858  -DSNVAYKLQDLQNDKPT--SSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPL 688
             D +     Q  +N   T   + A+    + V    +V         +E   EQ     L
Sbjct: 940  GDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSES--EQ-----L 992

Query: 687  PCMSNSITCTSSNLSCDSSFGS---VEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
               S S+     N +  +S  +   VEIPSF+Q +   +G    ++++SDL W+M+ G +
Sbjct: 993  VSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGII 1052

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT PRS WHR  N SSSS  G      S+GK    HN+F  G KKPR QV Y+ PF
Sbjct: 1053 PSPNPTAPRSTWHR--NRSSSSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPF 1110

Query: 336  GG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            GG  Y +K+K ++QR  P +RI   +EKR SD  R SQ+NLEL +C ANLL+T+ D+GWR
Sbjct: 1111 GGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWR 1170

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+ LEL D NEW+LAVK+SG TRY +K    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1171 ECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMWKGGKDW 1226


>ref|XP_007013727.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao]
            gi|590579224|ref|XP_007013728.1| Enhancer of
            polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508784090|gb|EOY31346.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
            gi|508784091|gb|EOY31347.1| Enhancer of polycomb-like
            transcription factor protein, putative isoform 1
            [Theobroma cacao]
          Length = 1693

 Score =  280 bits (716), Expect = 2e-72
 Identities = 181/416 (43%), Positives = 242/416 (58%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  L++GQ       KH  L  FALSFGAAPTFFLSLHLK+L+E + A I+ +D+D
Sbjct: 842  VSRESSFLKVGQFTSSSEKKHRNLPLFALSFGAAPTFFLSLHLKLLMEHSVARISFQDHD 901

Query: 1029 TLCSLENPEDATQQIVEDRMQVES-PSVNIENIHAERNFESVVTEAPSYELLSSYTVS-- 859
            +   L +  D    +V+D    E       ++   E+N ++   +A S   L++  +S  
Sbjct: 902  SNEQLGSSGDL---MVDDSSNREDCVDKRFDSSSVEKNLKASSKDAASDTELTTLDLSVC 958

Query: 858  -DSNVAYKLQDLQNDKPT--SSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPL 688
             D +     Q  +N   T   + A+    + V    +V         +E   EQ     L
Sbjct: 959  GDEHWKKSSQKYENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSES--EQ-----L 1011

Query: 687  PCMSNSITCTSSNLSCDSSFGS---VEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
               S S+     N +  +S  +   VEIPSF+Q +   +G    ++++SDL W+M+ G +
Sbjct: 1012 VSSSKSLVDGDRNNAGSNSVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGII 1071

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT PRS WHR  N SSSS  G      S+GK    HN+F  G KKPR QV Y+ PF
Sbjct: 1072 PSPNPTAPRSTWHR--NRSSSSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPF 1129

Query: 336  GG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            GG  Y +K+K ++QR  P +RI   +EKR SD  R SQ+NLEL +C ANLL+T+ D+GWR
Sbjct: 1130 GGLDYSSKNKGHHQRGPPHKRIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWR 1189

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+ LEL D NEW+LAVK+SG TRY +K    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1190 ECGAQVALELFDHNEWKLAVKVSGSTRYSHKAHQFLQPGSTNRYTHAMMWKGGKDW 1245


>ref|XP_010109047.1| hypothetical protein L484_007381 [Morus notabilis]
            gi|587933845|gb|EXC20799.1| hypothetical protein
            L484_007381 [Morus notabilis]
          Length = 1690

 Score =  274 bits (701), Expect = 9e-71
 Identities = 174/422 (41%), Positives = 234/422 (55%), Gaps = 19/422 (4%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            +SRES  + +G+S       + +L P ALSF AAPTFFLSLHLKML+E + A I+LR++D
Sbjct: 835  ISRESAFMDIGRSSHFDKM-YKKLPPLALSFTAAPTFFLSLHLKMLMEHSLAHISLREHD 893

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
               S E+ E++     +D   +E  S     +  E N +++  E  S    SS     SN
Sbjct: 894  ---SEEHLENSCSMTADDSSSMEEYSNKGSEMSLEENTKALSGEVASDGCFSSGRPELSN 950

Query: 849  ---VAYKLQDLQNDKP----------TSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLE 709
               V      ++  +P          TS+++ V  K        + +   +   ++Q   
Sbjct: 951  GLSVCCDRDQIKASQPCHNGDAIAAGTSADSPVHKKIRTDATVQLQAWKGHHSESDQSA- 1009

Query: 708  QCFVLPLPCMSNSITCTSSNLSCDSSFG---SVEIPSFEQVDMPCNGRGHISRETSDLGW 538
                     +S S+     +     SF    SVEIP F Q +   +G  H +++ +DL W
Sbjct: 1010 --------LLSRSLDDRDKSEKGSQSFVNGLSVEIPPFNQFEKSVDGELHGAQQATDLSW 1061

Query: 537  SMSAGFVHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQ 358
            + +     SPNPT PRS WHR   +SS   FG+L    SDGK   ++N F  G KKPR Q
Sbjct: 1062 NTNGAIFSSPNPTAPRSTWHRNKQNSS---FGHLSHGWSDGKADPVYNGFGNGPKKPRTQ 1118

Query: 357  VQYARPFGGYGAKHKTNN-QRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTV 187
            V Y  PFGG+    K  + Q+ LPS+R+   SEKR SD  R SQRNLEL +C  N+L+T 
Sbjct: 1119 VSYLLPFGGFDCSPKQKSIQKGLPSKRLRKASEKRSSDVSRGSQRNLELLSCDVNILITA 1178

Query: 186  EDKGWRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGK 7
             D+GWRE GAQ+VLEL D +EW+LAVKLSG+T+Y YK    LQPGSTNRFTHAMMWKGGK
Sbjct: 1179 TDRGWRECGAQVVLELFDDHEWKLAVKLSGVTKYSYKAHQFLQPGSTNRFTHAMMWKGGK 1238

Query: 6    DW 1
            DW
Sbjct: 1239 DW 1240


>ref|XP_012462722.1| PREDICTED: uncharacterized protein LOC105782472 [Gossypium raimondii]
            gi|763740311|gb|KJB07810.1| hypothetical protein
            B456_001G045600 [Gossypium raimondii]
          Length = 1686

 Score =  270 bits (691), Expect = 1e-69
 Identities = 178/410 (43%), Positives = 230/410 (56%), Gaps = 7/410 (1%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  L++GQ     + K   L  FALSFGAAPTFFLSLHLK+L+ER+ A I+  D+D
Sbjct: 853  VSRESSFLKLGQ-FSCNSEKLRNLPRFALSFGAAPTFFLSLHLKLLMERSLARISFGDHD 911

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPS-YELLSSYTVSDS 853
               S+E P  +   +++D    E    N      E+N ++   E  S  EL S  +V  +
Sbjct: 912  ---SIEQPGSSGNLLLDDSSSREDSMNNNSESSVEKNLKASSKEVASDAELTSDLSVCGN 968

Query: 852  NVAYKL--QDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPLPCM 679
                K   +   ND+          +  V     V       +++E    Q FVL     
Sbjct: 969  GCLKKSSREYKNNDQIVDGTFAGSHESEVGAIAFVPLQKQQCDNSET---QQFVLSSKSP 1025

Query: 678  SNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFVHSPNPT 499
             ++   T+S+ S  S    VEIP F+Q     +     +R+++DL  +M+ G + SPNPT
Sbjct: 1026 FDADKETASSGSILSGI-RVEIPPFDQYGKHVDSELPSTRQSTDLTLNMNGGIIPSPNPT 1084

Query: 498  GPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPFGG--YG 325
             PRS WHR   + SSS  G      SDGK    H++F  G KKPR QV Y+ P G   Y 
Sbjct: 1085 APRSTWHR---NRSSSSIGFHARGWSDGKADFFHSNFGNGPKKPRTQVSYSMPLGSLDYS 1141

Query: 324  AKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRESGAQI 151
            +K K   QR LP +RI   +EKR SD  R SQRNL+L +C AN+L+T+ D+GWRE G Q 
Sbjct: 1142 SKSKGLQQRVLPHKRIRRANEKRSSDVSRGSQRNLDLLSCDANVLITIGDRGWRECGVQA 1201

Query: 150  VLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            VLEL D NEW+LAVK+SG TRY YK    LQPGSTNRFTHAMMWKGGKDW
Sbjct: 1202 VLELFDHNEWKLAVKVSGSTRYSYKAHQFLQPGSTNRFTHAMMWKGGKDW 1251


>ref|XP_008219843.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103320015
            [Prunus mume]
          Length = 1780

 Score =  268 bits (686), Expect = 5e-69
 Identities = 175/415 (42%), Positives = 224/415 (53%), Gaps = 13/415 (3%)
 Frame = -3

Query: 1206 SRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDT 1027
            SRES  + +  S         +L P ALSF AAPTFFLSLHLK+L+E   A I  RD D+
Sbjct: 939  SRESAFVNISHSTSHSDEHPRKLPPLALSFTAAPTFFLSLHLKLLMEHCVANICFRDPDS 998

Query: 1026 LCSLENPEDATQ---QIVEDRMQVESPSVNIENIHA-------ERNFESVVTEAPSYELL 877
            +  L N           +ED     S   +  N+ A       + +F    TE       
Sbjct: 999  VELLGNSGSMLAVDCSSLEDFFNRGSKITHENNLKAPPGNATSDHSFSKPETETALAVCN 1058

Query: 876  SSYTVSDSNVAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFV 697
              +T S  +     QD       SS  TV     V  KT  +++  + ES     +QC +
Sbjct: 1059 GGWTKSSQHY----QDGVLSVAGSSTVTV-----VPEKTGTDAVVHHPES-----DQCSL 1104

Query: 696  LPLPCMSNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
             P   +    + T S    +    +VEIPSF++ + P +G    +++ +D  W+MS   +
Sbjct: 1105 SPKHLVGKEKSDTDSQSFLNGL--TVEIPSFDRFEKPVDGEVQSAQQPTDCSWNMSGSII 1162

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT PRS WHR  NSSSS  FG L    SDGK    HN F  G KKPR QV Y  P+
Sbjct: 1163 PSPNPTAPRSTWHRSRNSSSS--FGYLSHGWSDGKADLFHNGFGNGPKKPRTQVSYTLPY 1220

Query: 336  GGYGAKHKTNN-QRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRE 166
            GG+    K  N Q+ +P +RI   +EKR+SD  R SQRNLE  +C AN+L+   D+GWRE
Sbjct: 1221 GGFDFSSKQRNLQKGIPPKRIRRANEKRLSDVSRGSQRNLEQLSCEANVLINGSDRGWRE 1280

Query: 165  SGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
             GA IVLEL D NEW+LAVK+SG T+Y YK    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1281 CGAHIVLELFDHNEWKLAVKISGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDW 1335


>emb|CBI20940.3| unnamed protein product [Vitis vinifera]
          Length = 1634

 Score =  268 bits (686), Expect = 5e-69
 Identities = 173/415 (41%), Positives = 234/415 (56%), Gaps = 12/415 (2%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRES  + M QS        G+L PFALSF AAPTFFL LHLK+L+E          + 
Sbjct: 841  VSRESTFVNMSQSSSSLDVNQGKLPPFALSFNAAPTFFLGLHLKLLMEHRDVT-----WS 895

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
               S  NP+ A Q         +S   + + I++ + +E+                S+ N
Sbjct: 896  GQFSGANPQIAKQ--------AQSACNDDDRINSFQKYEN----------------SNLN 931

Query: 849  VAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESN--EQVLEQCFVLPLPCMS 676
            VA                + CS+D    +T ++++    E        EQC + P P + 
Sbjct: 932  VA--------------GTSACSEDT--GETGIDAIVQLQEQQGYHSEAEQCILSPQPLLL 975

Query: 675  NSITCTS-SNLSCDSSFG--SVEIPSFEQVDMPCNGRG---HISRETSDLGWSMSAGFVH 514
            N  + T  SN+ C S     +V+IP+F+QV+   + RG    IS+++ DL W+++ G + 
Sbjct: 976  NGHSSTGKSNVGCYSRLNGINVQIPTFDQVEKSFD-RGADISISQQSVDLSWNVNDGVIR 1034

Query: 513  SPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPFG 334
            SPNPT PRS W R  NS SSS FG    + SDGK     N F  G KKPR QV Y  P G
Sbjct: 1035 SPNPTAPRSMWQRNKNSFSSS-FGYPSHMWSDGKGDFFGNGFGNGPKKPRTQVSYTLPVG 1093

Query: 333  GY--GAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRE 166
            G+   +K ++++Q+ LP++RI   +EKR+SD  R SQRNLE  +C AN+L+T  D+GWRE
Sbjct: 1094 GFDFSSKQRSHHQKGLPNKRIRRANEKRLSDGSRSSQRNLESLSCEANVLITFGDRGWRE 1153

Query: 165  SGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            SGAQ++LEL D NEW+LAVK+SG T+Y YK    LQPG+ NRFTHAMMWKGGKDW
Sbjct: 1154 SGAQVILELGDHNEWKLAVKVSGATKYSYKAHQFLQPGTANRFTHAMMWKGGKDW 1208


>ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus communis]
            gi|223544424|gb|EEF45945.1| hypothetical protein
            RCOM_0804080 [Ricinus communis]
          Length = 1705

 Score =  268 bits (685), Expect = 6e-69
 Identities = 169/416 (40%), Positives = 233/416 (56%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSR+S  +    S  R    HG   PFALSF AAPTFFLSLHLK+L+E +   I+ +D+D
Sbjct: 858  VSRDSNYVNSPSSSSRFDKSHGWFPPFALSFTAAPTFFLSLHLKLLMEHSVTHISFQDHD 917

Query: 1029 TLCSLENPEDATQQIVEDRMQVE---------SPSVNIENIHAERNFESVVTEAPSYELL 877
               S+E+PE++     +D   V+         +P  N +    + + E  +  A +  L 
Sbjct: 918  ---SVEHPENSGSLQADDCYSVDDSLNKHAETTPDNNSKGSSRDVDCEECLFCANTEPLA 974

Query: 876  SSYTVSDSNVAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFV 697
               +V+      K      +    +E +  SKD  +    + SL   +  +    EQ   
Sbjct: 975  VGVSVNTVGDWMKPSPKHQNSDVHAETSAFSKDSGELGRDIASLQ-KWRCHHSEAEQNDA 1033

Query: 696  LPLPCMSNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
            LP P +  ++           +   VEIPS  Q D   +     +++++DL W+M+ G +
Sbjct: 1034 LPKPSVDRALL----------NGIRVEIPSSNQFDKQVDKDLDGAQQSTDLSWNMNGGII 1083

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT  RS WHR  N S+ +  G      SDG+   + N+F  G KKPR QV YA PF
Sbjct: 1084 PSPNPTARRSTWHR--NRSNLASVGYNAHGWSDGRGDFLQNNFRNGPKKPRTQVSYALPF 1141

Query: 336  GG--YGAKHKTNNQRTLPSRRIDS--EKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            G   Y +K K ++Q+ +P +RI +  EKR SD  R S+RNLEL +C AN+L+T+ DKGWR
Sbjct: 1142 GAFDYSSKSKGHSQKGIPHKRIRTANEKRSSDVSRGSERNLELLSCEANVLITLGDKGWR 1201

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+VLEL+D NEW+LAVKLSG T+Y YK    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1202 EYGAQVVLELSDHNEWKLAVKLSGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDW 1257


>ref|XP_012078606.1| PREDICTED: uncharacterized protein LOC105639237 [Jatropha curcas]
            gi|643722525|gb|KDP32275.1| hypothetical protein
            JCGZ_13200 [Jatropha curcas]
          Length = 1714

 Score =  266 bits (681), Expect = 2e-68
 Identities = 172/416 (41%), Positives = 232/416 (55%), Gaps = 13/416 (3%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSR+S  +    S        G   PFALSF AAPTFFL LHLK+L+E +   I+ +D+ 
Sbjct: 862  VSRDSTYVNANSSSAYFDKSDGWFPPFALSFSAAPTFFLGLHLKLLMEHSVTHISFQDH- 920

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
               S+E+P D +  ++++   VE  S     I +  NF+    +A   E LS        
Sbjct: 921  --VSIEHP-DNSDSLLDECSSVEDYSNKDSEITSCNNFKVSSRDANCDECLSCGKAEPQA 977

Query: 849  V---AYKLQDLQNDKPTSSE------ATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFV 697
            +   A  + D     P +        A   SKD  K  +    +     S+    EQ  +
Sbjct: 978  IGISANSVGDWMTSSPNNFNNVANVGAAASSKDPGKFASDAIDVPQKQSSHHSGSEQQGL 1037

Query: 696  LPLPCMSNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFV 517
               P      T + S L+  +    VEIP   Q D   +   H +++++DL W+M+ G +
Sbjct: 1038 SVKPAADKCSTGSHSLLNGIT----VEIPPVNQFDKHVDKELHGAQQSTDLSWNMNGGII 1093

Query: 516  HSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPF 337
             SPNPT  RS WHR  + SSS+ FG L    SDG+   +HN+F  G KKPR QV YA PF
Sbjct: 1094 PSPNPTARRSTWHR--SRSSSTSFGYLAHGWSDGRGDFVHNNFGNGPKKPRTQVSYALPF 1151

Query: 336  GG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWR 169
            GG  Y  K+K+++Q+ +P +RI   SEKR  D  R S+RNLEL +C AN+L+T  D+GWR
Sbjct: 1152 GGFDYCPKNKSHSQKAVPHKRIRTASEKRSLDVSRGSERNLEL-SCEANVLITHGDRGWR 1210

Query: 168  ESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            E GAQ+V+EL D NEW+LAVK+SG T+Y YK    LQPGSTNR+THAMMWKGGKDW
Sbjct: 1211 EGGAQVVVELFDHNEWKLAVKISGTTKYSYKAHQFLQPGSTNRYTHAMMWKGGKDW 1266


>ref|XP_008378284.1| PREDICTED: uncharacterized protein LOC103441387 [Malus domestica]
          Length = 1666

 Score =  263 bits (671), Expect = 3e-67
 Identities = 170/417 (40%), Positives = 224/417 (53%), Gaps = 16/417 (3%)
 Frame = -3

Query: 1203 RESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDTL 1024
            RES ++    S  R      +L P ALSF AAPTFF+SLHLK+L+E   A I  RD D  
Sbjct: 815  RESISVNSSDSTSRDDELCRKLPPLALSFAAAPTFFISLHLKLLMENCVANICFRDRD-- 872

Query: 1023 CSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSNVA 844
             S+E+ E+    +  D   VE        I  E+N ++  + A S     S    D++ A
Sbjct: 873  -SVEHVENCDNMLAVDWSVVEDFINGGSKITPEKNLKAXPSNATSD---GSCAKXDADNA 928

Query: 843  YKL---------QDLQN---DKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCF 700
              L         Q  QN   D   SS+ T   +    +K V        +S+    +QC 
Sbjct: 929  ISLCHGARTKSSQHFQNGSLDVSVSSDGTGVLEKTGTDKVVQLKA---LQSHHPESDQCS 985

Query: 699  VLPLPCMSNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGF 520
            + P P +    + T S    +    +VEIPSF++ + P +      ++ ++  W+MS   
Sbjct: 986  LSPRPLVGRDKSDTDSQSFPNGL--TVEIPSFDRYEKPVDREVQSXQQPTEFSWNMSGSI 1043

Query: 519  VHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARP 340
            + SPNPT PRS  HR  NSSS    G+L    +DGK    HN F  G KKPR QV Y  P
Sbjct: 1044 IPSPNPTAPRSTGHRNRNSSS---LGHLSNSWTDGKADLFHNGFGSGPKKPRTQVSYTLP 1100

Query: 339  FGGYGAKHKTNN-QRTLPSRRI---DSEKRVSDRPRRSQRNLELRACGANLLVTVEDKGW 172
            +GG+    K  N Q+ L  +RI   ++EKR SD  R SQRNLEL +C  N+LV   D+GW
Sbjct: 1101 YGGFDFSSKQRNLQKGLSHKRIRRANNEKRSSDASRGSQRNLELLSCETNVLVNGSDRGW 1160

Query: 171  RESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            RE GA +VLEL D NEW+LAVK+SG T+Y YK    LQPG+TNR+THAMMWKGGKDW
Sbjct: 1161 RECGAHVVLELFDHNEWKLAVKISGTTKYSYKAHQFLQPGTTNRYTHAMMWKGGKDW 1217


>gb|KHG16466.1| DNA mismatch repair Msh6-1 -like protein [Gossypium arboreum]
          Length = 1632

 Score =  262 bits (669), Expect = 4e-67
 Identities = 170/424 (40%), Positives = 235/424 (55%), Gaps = 21/424 (4%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSR    L +GQ       ++  L  F LSFGAAPTFF SLHLK+L++   A I+ +D+D
Sbjct: 794  VSRGYSCLEVGQLSSSSEKQNKNLPLFTLSFGAAPTFFFSLHLKLLMDYCVARISFQDHD 853

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
               S+ENPE +   ++++    E           +++FES          L ++  + S 
Sbjct: 854  ---SIENPESSGNLLLDENSNREDC--------VKKSFESC---------LGNFLKASSK 893

Query: 849  VAYKLQDLQNDKPTSSEATVCSKDLVKNKT---VVNSLSPNFESNEQV----LEQCFVLP 691
            VA   + +  D   SS+     K L K+     +VN     +   E+V    ++Q     
Sbjct: 894  VASVTELMTLDLSVSSDGR-WRKSLQKHANSDQIVNGSPAIYHKPEEVGASAIDQLEKQK 952

Query: 690  LPCMSNSITCTSSNL--SCDSSFGS--------VEIPSFEQVDMPCNGRGHISRETSDLG 541
                 +     SS +   C    GS        VE+P F+Q  +  + +   ++ ++DL 
Sbjct: 953  CDYSESRQPFLSSKVVDGCKKGSGSSSVLNGIRVELPPFDQYKVHVDSKLPSTQRSTDLT 1012

Query: 540  WSMSAGFVHSPNPTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRN 361
            W+M+ G + +PNPT PRS+WHR   + SSS  G      SDGK    HN+F  G KKPR 
Sbjct: 1013 WNMNGGVIPTPNPTAPRSYWHR---NRSSSSIGYHAHRWSDGKADFFHNNFGNGPKKPRT 1069

Query: 360  QVQYARPFGG--YGAKHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLV 193
            QV Y+ PFGG  Y +K+  ++QR LP +RI   +EKR SD  R SQ+N+EL +C ANLL+
Sbjct: 1070 QVSYSMPFGGLDYSSKNIGDHQRGLPHKRIRRANEKRSSDVSRGSQKNMELVSCHANLLL 1129

Query: 192  TVEDKGWRESGAQIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKG 13
            T+ D+GWRE GAQ+ LE  DRNEW+LAVK+SG TR  YK    LQPGSTNR+THAMMWKG
Sbjct: 1130 TLGDRGWRECGAQVALERIDRNEWKLAVKMSGSTRCSYKAHQFLQPGSTNRYTHAMMWKG 1189

Query: 12   GKDW 1
            GKDW
Sbjct: 1190 GKDW 1193


>ref|XP_010325156.1| PREDICTED: uncharacterized protein LOC101258290 [Solanum
            lycopersicum]
          Length = 1719

 Score =  262 bits (669), Expect = 4e-67
 Identities = 178/447 (39%), Positives = 238/447 (53%), Gaps = 45/447 (10%)
 Frame = -3

Query: 1206 SRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIER-NFACINLRDYD 1030
            S E C+ R   S    A K GR+ PFALSF AAPTFF+ LHL++L+E+ NFAC++L+   
Sbjct: 839  STECCSARFTSSTLSSATKLGRVPPFALSFAAAPTFFICLHLRLLMEQHNFACVSLQ--- 895

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSD-- 856
                 E+  +A Q +  D  +V+   +    I    +         S    SS+      
Sbjct: 896  -----ESSINACQPVKSDGSRVKCSEIAGSEIAGSEDISETSFTGASSAGGSSFAERQLG 950

Query: 855  --------SNVAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCF 700
                     ++   L+  QN +   S ++  +K    + + V  +S N ES++QVL+Q  
Sbjct: 951  SLACKQQLGSMRVPLKSSQNCQLDVSGSSFTAKLSELDTSDVTVVSNNLESDDQVLDQFV 1010

Query: 699  VLPLPCMSNSITCTSSNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSA 526
              P    S +++   SN    S     SV IPS +QV+   +G+  I  E S L  S++ 
Sbjct: 1011 GSPGRRHSKNLSHRLSNARRHSGLVGMSVVIPSSDQVEGLSDGKEIIVGEESHL--SLNT 1068

Query: 525  G---------------------------FVHSPNPTGPRSFWHRGINSSSSSPFGNLLPV 427
            G                            V SPNP+GP    HR  N+SSSSPFG + PV
Sbjct: 1069 GNDLISSPNHTVTSDVVRSSNITGTGDRMVQSPNPSGPGGLPHRNRNNSSSSPFGKISPV 1128

Query: 426  CSDGKTKSMHNDFSYGLKKPRNQVQYARPFGGY--GAKHKTNNQRTLPSRRID--SEKRV 259
              DGK       F  G K+PR QVQY   +GGY   + HK ++ RTLP +RI   SEK+ 
Sbjct: 1129 WVDGKANFTGGGFGNGPKRPRTQVQYTLSYGGYDFSSMHKNHSPRTLPYKRIRRASEKKN 1188

Query: 258  SDRPRRSQRNLELRACGANLLVTVED-KGWRESGAQIVLELADRNEWRLAVKLSGITRYL 82
            +D    SQRN+EL AC AN+LVT+   KGWRE GA+IVLE+A  NEW++AVK SG T+Y 
Sbjct: 1189 ADSCGGSQRNIELLACNANVLVTLGGVKGWREFGARIVLEIAGHNEWKIAVKFSGATKYS 1248

Query: 81   YKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            YKV + LQPGSTNRFTHAMMWKGGKDW
Sbjct: 1249 YKVHNVLQPGSTNRFTHAMMWKGGKDW 1275


>ref|XP_012855912.1| PREDICTED: uncharacterized protein LOC105975278 [Erythranthe
            guttatus]
          Length = 1660

 Score =  260 bits (664), Expect = 2e-66
 Identities = 164/409 (40%), Positives = 224/409 (54%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1209 VSRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYD 1030
            VSRE     M QS +  A K G++  FALSF AAP+FFL+LHL++ ++ + A +NL+  +
Sbjct: 850  VSREPSKTAMNQSAYSVALKPGKVPQFALSFSAAPSFFLTLHLQLFMDHSLALVNLQHQN 909

Query: 1029 TLCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSYELLSSYTVSDSN 850
            +LCS ++ E+  + + E   + E  S+ ++++  E             ++L      ++ 
Sbjct: 910  SLCSAKSSENRGEPVAESS-EYELNSIAVQDVTVEHALGVA-------DVLVGNAAENTE 961

Query: 849  VAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPLPCMSNS 670
                 Q LQ   P       C  +     T +++     +S+++V EQ  V     +  S
Sbjct: 962  ---STQKLQKGNPGDDGTAGCFTEF----TEISAPEVIAQSHQEVQEQIVVSASTSLPPS 1014

Query: 669  ITCTSSNLSCDSSFG--SVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFVHSPNPTG 496
             T        +S+ G  SV+IPS EQVD P  G G ISR TS +GW++  GFV SP+PTG
Sbjct: 1015 TTSRPPYPKSNSASGALSVDIPSSEQVDTPFAGNGCISRHTSVVGWNVHDGFVPSPSPTG 1074

Query: 495  PRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPFGGY--GA 322
                                      GK   M N FS G KKPR QVQY  PF  Y   A
Sbjct: 1075 --------------------------GKPNFMPNGFSNGPKKPRTQVQYTLPFVDYDSSA 1108

Query: 321  KHKTNNQRTLPSRRID--SEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRESGAQIV 148
            K K  + R+LP +RI   S K+ SD    +Q+NLE     AN+LVT  DKGWRE GA IV
Sbjct: 1109 KRKMPSSRSLPCKRIRRASLKKTSDGSENNQKNLESVTSIANVLVTYGDKGWRECGAHIV 1168

Query: 147  LELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
            LE+AD+NEWRLAVKLSG+ +Y  KV+H LQPGSTNR++HAMMW+GGKDW
Sbjct: 1169 LEVADQNEWRLAVKLSGVIKYSCKVKHILQPGSTNRYSHAMMWRGGKDW 1217


>ref|XP_008394009.1| PREDICTED: uncharacterized protein LOC103456143 [Malus domestica]
          Length = 1662

 Score =  259 bits (662), Expect = 3e-66
 Identities = 167/412 (40%), Positives = 226/412 (54%), Gaps = 10/412 (2%)
 Frame = -3

Query: 1206 SRESCNLRMGQSIFRQAAKHGRLHPFALSFGAAPTFFLSLHLKMLIERNFACINLRDYDT 1027
            SRES ++ +     R  A   +L P ALSF AAPTFF+SLHLK+L+E   A I   D D 
Sbjct: 816  SRESTSVNISHPTSRNDALCRKLPPLALSFAAAPTFFISLHLKLLMENCVANICFGDRD- 874

Query: 1026 LCSLENPEDATQQIVEDRMQVESPSVNIENIHAERNFESVVTEAPSY------ELLSSYT 865
              S+E+ E++   +  D   VE        I  ++N ++  ++A S       +  +  +
Sbjct: 875  --SVEHVENSGSMLAVDWSIVEDFISEGSKITPQKNLKAPPSDATSDGSCAKPDAENXIS 932

Query: 864  VSDSNVAYKLQDLQNDKPTSSEATVCSKDLVKNKTVVNSLSPNFESNEQVLEQCFVLPLP 685
            V         Q  QN     S ++  +  L K  T     S   +S+    +QC + P P
Sbjct: 933  VCHGARTNSSQHFQNGGLYVSVSSGGTGVLEKTGTDEVVQSKVLQSHXPESDQCSLSPRP 992

Query: 684  CMSNSITCTSSNLSCDSSFGSVEIPSFEQVDMPCNGRGHISRETSDLGWSMSAGFVHSPN 505
             +    + T S    +    +VEIPSF+  + P +     +++ +D  W+M+   + SPN
Sbjct: 993  LVGRDKSDTDSQSFPNGL--TVEIPSFDXFEKPVDKEVQSAQQPTDFXWNMNGSIIPSPN 1050

Query: 504  PTGPRSFWHRGINSSSSSPFGNLLPVCSDGKTKSMHNDFSYGLKKPRNQVQYARPFGGYG 325
            PT PRS  HR  N+SS    G+L    SDG T   HN F  G KKPR QV Y  P+GG+ 
Sbjct: 1051 PTAPRSTGHRNRNNSS---LGHLSHNWSDG-TDLFHNGFGSGPKKPRTQVSYTLPYGGFD 1106

Query: 324  AKHKTNN-QRTLPSRRI---DSEKRVSDRPRRSQRNLELRACGANLLVTVEDKGWRESGA 157
               K  N Q+ LP +RI   ++EKR SD  R SQRNLEL +C AN+LV   D+GWRE GA
Sbjct: 1107 FSSKQRNLQKGLPHKRIRRANNEKRSSDASRGSQRNLELLSCEANVLVNGSDRGWRECGA 1166

Query: 156  QIVLELADRNEWRLAVKLSGITRYLYKVQHNLQPGSTNRFTHAMMWKGGKDW 1
             +VLEL D NEW+LAVK+SG T+Y YK    LQPG+TNR+THAMMWKGGKDW
Sbjct: 1167 HVVLELFDHNEWKLAVKISGTTKYSYKAHQFLQPGTTNRYTHAMMWKGGKDW 1218

