BLASTX nr result

ID: Mentha25_contig00009132 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00009132
         (2650 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU19798.1| hypothetical protein MIMGU_mgv1a000238mg [Mimulus...   788   0.0  
ref|XP_004250000.1| PREDICTED: transcriptional activator DEMETER...   564   e-158
ref|XP_006360485.1| PREDICTED: transcriptional activator DEMETER...   556   e-155
ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER...   535   e-149
ref|XP_002530889.1| conserved hypothetical protein [Ricinus comm...   533   e-148
ref|XP_007010232.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidi...   532   e-148
ref|XP_007010230.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidi...   532   e-148
ref|XP_007010229.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidi...   532   e-148
ref|XP_007010228.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidi...   532   e-148
ref|XP_002316518.2| hypothetical protein POPTR_0010s24060g [Popu...   518   e-144
ref|XP_002277401.1| PREDICTED: transcriptional activator DEMETER...   501   e-139
ref|XP_004164145.1| PREDICTED: transcriptional activator DEMETER...   498   e-138
ref|XP_004150492.1| PREDICTED: transcriptional activator DEMETER...   498   e-138
ref|XP_002881449.1| hypothetical protein ARALYDRAFT_902767 [Arab...   463   e-127
ref|XP_004497617.1| PREDICTED: protein ROS1-like [Cicer arietinum]    404   e-109
gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic s...   338   6e-90
gb|EPS65696.1| hypothetical protein M569_09081, partial [Genlise...   333   2e-88
gb|AGU16984.1| DEMETER [Citrus sinensis]                              333   3e-88
ref|XP_006492175.1| PREDICTED: transcriptional activator DEMETER...   332   4e-88
ref|XP_006492173.1| PREDICTED: transcriptional activator DEMETER...   332   4e-88

>gb|EYU19798.1| hypothetical protein MIMGU_mgv1a000238mg [Mimulus guttatus]
          Length = 1381

 Score =  788 bits (2034), Expect = 0.0
 Identities = 457/858 (53%), Positives = 554/858 (64%), Gaps = 60/858 (6%)
 Frame = +2

Query: 251  QQQHTLSQGHLCSESMLPVTPQKFADTKITKSTTAVVN-RKESSSTRDPQPIPINGNMNS 427
            QQQH LSQ  L SE + P T    A+ +IT S TA+ N   +S+S RDP+PIP+NG + +
Sbjct: 221  QQQHALSQERLRSEQIAPQTSHYSANKQITNSVTAMTNWNPKSTSERDPKPIPMNGTITT 280

Query: 428  PLHLVVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKKFAGN---------SVDDIINGM 580
            P  +  K+  P   +         E +  ++ G SSKKFAG          S++DI + M
Sbjct: 281  PDRVAAKR--PPAGQTSSKKILQQESKKSRSKGYSSKKFAGPVQEKERRVFSINDITDLM 338

Query: 581  KHLRITSSGK------ESALVPYKGDGAVVPY---NLVKKRKPRPRVDLDPETNRLWNLL 733
            + L I ++GK      ++ALVPY+G G VVPY   ++VK+RKPRPRVDLDPETNRLWNLL
Sbjct: 339  QDLSINNNGKKIVRKEQNALVPYRGSGTVVPYVEFDVVKRRKPRPRVDLDPETNRLWNLL 398

Query: 734  MG---GQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIG 904
            MG    ++AET+D NKEKWWEEERK+FRGRVDSFIA+MHLVQGDRRFSKWKGSVVDSVIG
Sbjct: 399  MGKEGDETAETVDNNKEKWWEEERKMFRGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIG 458

Query: 905  VFLTQNVSDHLSSSAFMSLAARFPSKSATAE-----NGASPTKVGNHEVRITYPDGTTFH 1069
            VFLTQNVSDHLSSSAFMSLAA+FP KS +       NG  P K  +HEVR+T+PD TT  
Sbjct: 459  VFLTQNVSDHLSSSAFMSLAAKFPLKSTSTGQTFCGNGERPVK--HHEVRVTHPDETTCD 516

Query: 1070 QKMAMEPVTGQSQVIATETSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVL 1249
              +  EPV   S V + E+S  R +N M  K  F +ND  TRRTEEDII        FV 
Sbjct: 517  NNIVREPVCNSS-VTSIESSEYRAENDMKGKGAFSMNDQ-TRRTEEDIISSQSSSESFVF 574

Query: 1250 QASEDVRSSSGSNSDAECGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSI 1429
            QA ED RSSSGSNS+AE G N +KNL H SV++QAERI+               NK P I
Sbjct: 575  QACEDFRSSSGSNSEAEEGLNFNKNLSHVSVTEQAERISALQQDQFQIMGSLFPNKRPFI 634

Query: 1430 KHQQFEKPAYRHIP-ECAGISKVQHHQNSDLPFPSSWTNMLMGKGDWEAEDLSCLGRGSI 1606
             ++  E   Y   P    G +   +   S +P  +S  N  MG   WEA+ L   G+ ++
Sbjct: 635  GNRPLENTTYSQNPGPVRGKNAYYNPLTSTVPSNNSGPNRSMGLEKWEADVLGLSGKETM 694

Query: 1607 STLTSKGTDAPHVD------DYRGQSAESAFMVSKDGISKFQTPSTEHAVLNKGLELRND 1768
            S+L S   + P+        +Y GQSA ++    ++G  +FQ P+  H++ NK  E R D
Sbjct: 695  SSLASTDFEIPNRTGVECGHNYIGQSATNSLTSIQNGRPEFQ-PAVNHSIPNKHFEFRTD 753

Query: 1769 SVDESVNRNCQHSIKHM----------------------SEKPTDNSKCTEVQT----EM 1870
              + S N   Q  IK+M                      S++P DN K  +  T    E+
Sbjct: 754  FSNGSQNGYGQQPIKNMRGKQDSFQQESTSQTNPTRPAESKQPNDNWKHGDHTTLEPNEI 813

Query: 1871 GHGQSPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDSLD 2050
               +S D+ SSKI  TT  A+KRK+EKE  EPFNWD+LRK V  K GT E+SR+AMDSLD
Sbjct: 814  RQVRSSDEPSSKISTTTPNAKKRKSEKEKPEPFNWDSLRKGVLLKNGTREKSRDAMDSLD 873

Query: 2051 YEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYL 2230
            YEA+RTADV +ISDAIKERGMN++LAERMK FLNRLVEDHER+DLEWLRDV+PD+AKDYL
Sbjct: 874  YEALRTADVKQISDAIKERGMNNMLAERMKAFLNRLVEDHERVDLEWLRDVQPDKAKDYL 933

Query: 2231 LSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXIL 2410
            LS+RGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV                  +L
Sbjct: 934  LSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVL 993

Query: 2411 ESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXXX 2590
            ESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKR+PNCNACP+RAEC          
Sbjct: 994  ESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKREPNCNACPMRAECRHFASAFASA 1053

Query: 2591 XXXXPGPQERHIVSSAAP 2644
                PG +E+ IVSSA P
Sbjct: 1054 RLALPGLEEKQIVSSATP 1071


>ref|XP_004250000.1| PREDICTED: transcriptional activator DEMETER-like [Solanum
            lycopersicum]
          Length = 1596

 Score =  564 bits (1453), Expect = e-158
 Identities = 347/739 (46%), Positives = 443/739 (59%), Gaps = 42/739 (5%)
 Frame = +2

Query: 554  SVDDIINGMKHLRITSSGK------ESALVPYKGDGAVVPY---NLVKKRKPRPRVDLDP 706
            SVD I   ++ L I++S K      + ALVPYKG G ++PY   + +K+RK RPRVDLDP
Sbjct: 532  SVDVITQQLERLFISNSKKNAAQVEQKALVPYKGSGTIIPYEGFDPIKRRKARPRVDLDP 591

Query: 707  ETNRLWNLLMGGQ-SAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKGS 883
            ETNRLWN+LMG + SAETMD + EKWWE+ERKV RGRVDSF+A+M LVQGDRRFS WKGS
Sbjct: 592  ETNRLWNVLMGKEESAETMDKDNEKWWEDERKVVRGRVDSFVARMRLVQGDRRFSPWKGS 651

Query: 884  VVDSVIGVFLTQNVSDHLSSSAFMSLAARFP----SKSATAENGASPTKVGNHEVRITYP 1051
            VVDSVIGVFLTQNVSDHLSSSAFM LAA+FP    +K+  +++G +   V   EV I  P
Sbjct: 652  VVDSVIGVFLTQNVSDHLSSSAFMCLAAKFPLPTSTKNTLSQDGCNIV-VEEPEVEIIDP 710

Query: 1052 DGTTFHQKMAMEPVTGQSQVIATETSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
            DGTT + K  ++                R++N     + +LV++   +R +E++I     
Sbjct: 711  DGTTIYHKARLQR---------------RMENHTHTSRAYLVSE-HDKRVDEEVISLQNS 754

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE---CGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXX 1402
                +LQA+E++RSSSGS+ ++E      N++K+    S S   +  A            
Sbjct: 755  PDSLILQANEELRSSSGSDLESEDRPSSPNLNKDRTQASHSPPTKWTAAFQEYQSHFMRN 814

Query: 1403 XCMNKMPSIKHQQFEKPAY--RHIPECAGISKVQ------HHQNSDLPFPS---SWTNML 1549
                K+P   +Q+ E  A   RH       + +       H Q  ++P  S   SW NM 
Sbjct: 815  GISEKLPVFGNQKIETVADMGRHNENLDAETYLHGYPINPHIQVQEIPIRSASNSWLNMT 874

Query: 1550 MGKGDWEAE------DLSCLGR---GSISTLTSKGTD-----APHVDDYRGQSAESAFMV 1687
               G  E        D+S   +   GS S L ++ T      AP + +  G   +   + 
Sbjct: 875  PEFGKHETACHEKEIDMSKSMKQIAGSSSPLIAQRTTHPFIHAPRMGEIGGVEMQPGKVD 934

Query: 1688 SKDGISKFQTPSTEHAVLNKGLELRNDSVDESVNRNCQHSIKHMSEKPTDNSKCTEVQTE 1867
            ++  +S  Q    E A+ +   +L +  + +SVN                +S+      E
Sbjct: 935  NQHSVSSHQN---EMAMAS---QLESSCIRQSVN----------------HSEAVAKGQE 972

Query: 1868 MGHGQSPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDSL 2047
             G      K  S  G + S  RKRK E+   + F+WD+LRK+VQSK+G  ERS++AMDSL
Sbjct: 973  EGQAYPSSKQPSITGTSISKTRKRKVEEGDKKAFDWDSLRKEVQSKSGKKERSKDAMDSL 1032

Query: 2048 DYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDY 2227
            +YEA+R+A V EISDAIKERGMN++LAER+K+FL+RLV DH  IDLEWLRDV PD+AK+Y
Sbjct: 1033 NYEAVRSAAVKEISDAIKERGMNNMLAERIKDFLDRLVRDHGSIDLEWLRDVAPDKAKEY 1092

Query: 2228 LLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXI 2407
            LLSIRGLGLKSVEC+RLLTLH+LAFPVDTNVGRIAVRLGWV                  I
Sbjct: 1093 LLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPI 1152

Query: 2408 LESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXX 2587
            LESIQKYLWPRLCKLDQ TLYELHY MITFGKVFCTK  PNCNACPLRAEC         
Sbjct: 1153 LESIQKYLWPRLCKLDQRTLYELHYHMITFGKVFCTKSKPNCNACPLRAECRHFASAYAS 1212

Query: 2588 XXXXXPGPQERHIVSSAAP 2644
                 PGP+E+ IVSSA P
Sbjct: 1213 ARLALPGPEEKSIVSSAVP 1231


>ref|XP_006360485.1| PREDICTED: transcriptional activator DEMETER-like [Solanum tuberosum]
          Length = 1851

 Score =  556 bits (1433), Expect = e-155
 Identities = 345/740 (46%), Positives = 438/740 (59%), Gaps = 43/740 (5%)
 Frame = +2

Query: 554  SVDDIINGMKHLRITSSGK------ESALVPYKGDGAVVP---YNLVKKRKPRPRVDLDP 706
            SVD I   ++ L I++S K      + ALVPYKG G ++P   ++ +K+RK RPRVDLDP
Sbjct: 788  SVDVITQQLERLVISNSKKNAAQVEQKALVPYKGSGTIIPCEGFDPIKRRKARPRVDLDP 847

Query: 707  ETNRLWNLLMGGQ-SAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKGS 883
            ETNRLWN+LMG + SAETMD + EKWWE+ERKV RGRVDSF+A+M LVQGDRRFS WKGS
Sbjct: 848  ETNRLWNVLMGKEESAETMDKDNEKWWEDERKVVRGRVDSFVARMRLVQGDRRFSPWKGS 907

Query: 884  VVDSVIGVFLTQNVSDHLSSSAFMSLAARFP----SKSATAENGASPTKVGNHEVRITYP 1051
            VVDSVIGVFLTQNVSDHLSSSAFM LAA+FP    +K+  +++G +   V   EV I  P
Sbjct: 908  VVDSVIGVFLTQNVSDHLSSSAFMCLAAKFPLPTRTKNTLSQDGCNIV-VEEPEVEIIDP 966

Query: 1052 DGTTFHQKMAMEPVTGQSQVIATETSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
            DGTT + K  ++                R++N     + +LV++   +R +E++I     
Sbjct: 967  DGTTIYHKARLQ---------------HRMENHTHTSRAYLVSE-HDKRVDEEVISLQNS 1010

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE---CGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXX 1402
                +LQA+E++RSSSGS+ ++E      N++K+    S S   +  A            
Sbjct: 1011 PDSLILQANEELRSSSGSDLESEDRPSSPNLNKDRTQASHSPPTKWAAAFQEYQSHFMRN 1070

Query: 1403 XCMNKMPSIKHQQFEKPA------------YRHIPECAGISKVQHHQNSDLPFPS---SW 1537
                K+P   +Q+ E  A            Y H     G     H Q  ++P  S   SW
Sbjct: 1071 RLSEKLPVCGNQKIETVADIGHNENLDAETYLH-----GYPINPHVQVQEIPIRSASNSW 1125

Query: 1538 TNMLMGKGDWEAE------DLSCLGR---GSISTLTSKGTDAPHVDDYRGQSAESAFMVS 1690
             NM    G  E+       D+S   +   GS   L ++ T  P +   R         + 
Sbjct: 1126 LNMTPEFGKHESACHEKEIDMSKSMKQIAGSSGPLIAQQTTLPFIHAPR---------MG 1176

Query: 1691 KDGISKFQTPSTE--HAVLNKGLELRNDSVDESVNRNCQHSIKHMSEKPTDNSKCTEVQT 1864
            + G  K Q    +  H+V +   E+   S  ES     + S+ H        S+      
Sbjct: 1177 EMGGVKMQPHKVDNQHSVRSHQNEMAMASQLESAC--IRQSVNH--------SEAVAKGQ 1226

Query: 1865 EMGHGQSPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDS 2044
            E G      K  S  G + S  RKR+ E+   + F+WD+LRK+VQSK+G  ERS++AMDS
Sbjct: 1227 EEGQAYPSSKQPSITGTSISKTRKRRVEEGDKKAFDWDSLRKEVQSKSGKKERSKDAMDS 1286

Query: 2045 LDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKD 2224
            L+YEA+R+A V EISDAIKERGMN++LAER+K+FL+RLV DH  +DLEWLRDV PD+ K+
Sbjct: 1287 LNYEAVRSAPVKEISDAIKERGMNNMLAERIKDFLDRLVRDHGSMDLEWLRDVAPDKVKE 1346

Query: 2225 YLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXX 2404
            YLLSIRGLGLKSVEC+RLLTLH+LAFPVDTNVGRIAVRLGWV                  
Sbjct: 1347 YLLSIRGLGLKSVECVRLLTLHNLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYP 1406

Query: 2405 ILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXX 2584
            ILESIQKYLWPRLCKLDQ TLYELHY MITFGKVFCTK  PNCNACPLRAEC        
Sbjct: 1407 ILESIQKYLWPRLCKLDQRTLYELHYHMITFGKVFCTKSKPNCNACPLRAECRHFASAYA 1466

Query: 2585 XXXXXXPGPQERHIVSSAAP 2644
                  PGP+E+ IVSSA P
Sbjct: 1467 SARLALPGPEEKSIVSSAVP 1486


>ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera]
          Length = 2198

 Score =  535 bits (1378), Expect = e-149
 Identities = 378/993 (38%), Positives = 513/993 (51%), Gaps = 121/993 (12%)
 Frame = +2

Query: 32   SMCSGSTTAEEILQQFESRRNKSLLAQISTGTPNTELRNSDYGRKVMN-VNPNDNSTNFI 208
            +M S +T  E+ L Q E++    L +QI+ G  N     ++  + + N VN     ++  
Sbjct: 918  TMASYTTAGEDELHQAEAKSVNQLTSQINHGILNICFEGNNDSQNLANGVNKTTRDSSMH 977

Query: 209  RDRYMN----------YP----DMRQK----FQQQHTLSQGHLCSESML----PVTPQKF 322
            +    N          +P    DMR+K      Q H L+     ++  L    P+  + +
Sbjct: 978  QTTAGNSMWKHHISNEWPSQTEDMREKQVNGCTQLHRLTVLTAAAKDKLQPPAPIKARSY 1037

Query: 323  ADTKITKSTTAVVNRKESSSTRDPQPIPINGNMNSPLHLVVKKRTPYKKEPVXXXXXXXE 502
            +  + +  +  V+   E    +  +P+  N + +S          P+ +EP        +
Sbjct: 1038 SSGQHSIESCRVITLAE----KQKEPLFSNSHSSSTYK-------PFLQEPKDKLYDYHQ 1086

Query: 503  PENKKTYGKSSKKFAGNSVDDIINGMKHLRI------TSSGKESALVPYKGDGAVVPYNL 664
            P  KK  G+ +KK   + +D II  +K L +      T S +E+A++ YKGDGA++PY  
Sbjct: 1087 PSIKKR-GRPAKKKQPDPIDAIIERLKSLELNDTSNETVSQEENAIILYKGDGAIIPYE- 1144

Query: 665  VKKRKPRPRVDLDPETNRLWNLLMGG-QSAETMDTNKEKWWEEERKVFRGRVDSFIAKMH 841
            +KKRKPRP+VDLD ET R+W LLMG  Q     D  K KWWEEER+VFRGR DSFIA+MH
Sbjct: 1145 IKKRKPRPKVDLDLETERVWKLLMGAEQDVGDSDERKAKWWEEEREVFRGRADSFIARMH 1204

Query: 842  LVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF---PSKSATAENGASP 1012
            LVQGDRRFS WKGSVVDSVIGVFLTQNVSDHLSSSAFMSL +RF   P  + T+ +  + 
Sbjct: 1205 LVQGDRRFSPWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRFPLHPESNKTSYSNEAS 1264

Query: 1013 TKVGNHEVRITYPDGT-TFHQKMAMEPVTGQSQVIATETSTDRLDNV-MPEKKTFLVNDP 1186
              V   EV I  PD T  +H+K++ + V  Q+ V  +E+S  R D+      +T LV  P
Sbjct: 1265 ILVEEPEVCIMNPDDTIKWHEKVSHQQVYNQAFVAYSESSEHRRDSPDSGTSETSLVGAP 1324

Query: 1187 FTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSKNLGHQSVSQQAERIA 1366
              +R EE+++         V+Q +  +RS SGSNS+AE         GH++   QA    
Sbjct: 1325 -NQRAEEEVMSSQDSVNSSVVQTTV-LRSCSGSNSEAE-----DPTTGHKTNKVQASAST 1377

Query: 1367 XXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQHHQNSD---------- 1516
                         C  +  + K   F++   R+  +   + +V++H  S           
Sbjct: 1378 NILYMEKTFMSQEC--QYHANKSSNFDENTMRYRKQNPRLDRVENHTESSSLTYLINSGN 1435

Query: 1517 -------LPFPSSWTNMLMGKGDWEAEDLSCLGRGSISTLTSKGTDAPHVDDYRGQSAES 1675
                   +P  +   +M    G  E E L  LG  SIS+  S  +   +  D    S  +
Sbjct: 1436 SNKQAPAVPSSNYRLHMTPDSGILEVECLQVLGEESISSWPSAASGIANPKDVNWTSKGT 1495

Query: 1676 AFM--------VSKDGISKFQTPSTEHAVLNKGLELRNDSVDESV----------NRNCQ 1801
              M          ++G+   Q    E  V N    LRN  + +S            ++C+
Sbjct: 1496 QQMTESIRKTTAQQNGLMNLQ----EATVGNPNALLRNYPMQQSSMQPGCTTENDKQSCK 1551

Query: 1802 -----------------------------------HSIKHMSEKPTDNSKCTE------- 1855
                                               H I ++ E   + S   E       
Sbjct: 1552 NHDLERTKTFQMQSMPSREPLKPAEALDTRRDTTMHQIPNVPELTEEASNVRERDSAVDK 1611

Query: 1856 ---VQTEMGHGQSPDKLSS---KIGVTTSPARKRKAEK---ETAEPFNWDTLRKQVQSKA 2008
               ++ E+    S +++ S   + G TT+   K K EK      + F+WD+LRKQVQ+  
Sbjct: 1612 QICLENEVLEPLSREQVHSSNKESGGTTTNILKPKKEKVEGTKKKAFDWDSLRKQVQANG 1671

Query: 2009 GTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLE 2188
               ERS++ MDSLDYEA+R A V+ IS+AIKERGMN++LAER+K+FLNRLV +H  IDLE
Sbjct: 1672 RKRERSKDTMDSLDYEAIRCAHVNVISEAIKERGMNNMLAERIKDFLNRLVREHGSIDLE 1731

Query: 2189 WLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXX 2368
            WLRD  PD+AKDYLLSIRGLGLKSVEC+RLLTLH LAFPVDTNVGRIAVRLGWV      
Sbjct: 1732 WLRDSPPDKAKDYLLSIRGLGLKSVECVRLLTLHQLAFPVDTNVGRIAVRLGWVPLQPLP 1791

Query: 2369 XXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPL 2548
                        +LESIQKYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+
Sbjct: 1792 ESLQLHLLELYPMLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKHKPNCNACPM 1851

Query: 2549 RAECXXXXXXXXXXXXXXPGPQERHIVSSAAPT 2647
            R EC              P P+E+ IVSS AP+
Sbjct: 1852 RGECRHFASAFASARLALPAPEEKSIVSSTAPS 1884


>ref|XP_002530889.1| conserved hypothetical protein [Ricinus communis]
            gi|223529542|gb|EEF31495.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1876

 Score =  533 bits (1374), Expect = e-148
 Identities = 348/876 (39%), Positives = 464/876 (52%), Gaps = 75/876 (8%)
 Frame = +2

Query: 242  QKFQQQHTLSQGHLCSE------SMLPVTPQKFADTKITKSTTAVVNRKESSSTRDPQPI 403
            Q    QH  +  + C E      +++P TP K A               +S     PQ  
Sbjct: 710  QDLSLQHKWAGQNSCIERTGENCNIVPPTPPKMAP--------------QSRDQLQPQIC 755

Query: 404  PINGNMNSPLHLVVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKKFAGN---SVDDIIN 574
             I+ +    +        P +K  +         + K T  + + + A     ++++II 
Sbjct: 756  HIDASTKQTMASTQSLSVPSRKGNMLQTQKNILKDQKSTAKRKAGQPAKQKPITIEEIIY 815

Query: 575  GMKHLRITS-SGKESALVPYKGDGAVVPYN---LVKKRKPRPRVDLDPETNRLWNLLM-- 736
             M+HL +    G+++A+VPYKGDGA++PY+   ++KKRKPRP+VDLDPET R+W LLM  
Sbjct: 816  RMEHLNLNEVKGEQTAIVPYKGDGALIPYDGFEIIKKRKPRPKVDLDPETERVWKLLMWK 875

Query: 737  -GGQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIGVFL 913
             GG+  E  D  K++WWEEER+VF GR DSFIA+MHLVQGDRRFSKWKGSVVDSVIGVFL
Sbjct: 876  EGGEGLEGTDQEKKQWWEEERRVFGGRADSFIARMHLVQGDRRFSKWKGSVVDSVIGVFL 935

Query: 914  TQNVSDHLSSSAFMSLAARFPSKSA---TAENGASPTKVGNHEVRITYPDGTTFHQKMAM 1084
            TQNVSDHLSSSAFM+LAA+FP KS    T E       +   ++ +  P+ T    +  +
Sbjct: 936  TQNVSDHLSSSAFMNLAAKFPLKSMRNRTCERDEPRRLIQEPDIYMLNPNPTIKWHEKLL 995

Query: 1085 EPVTGQSQVIATETSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASED 1264
             P   QS +   E+   R D      +   + +  +   EE+++         ++Q++  
Sbjct: 996  TPFYNQSSMTPHESIEHRRDQETSCTERTSIVEAHSYSPEEEVLSSQDSFDSSIVQSNGV 1055

Query: 1265 VRSSSGSNSDAE-----CGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXXXCMNK-MPS 1426
            +RS SGSN +AE     C  N + N  +    +  E  +               ++ +  
Sbjct: 1056 IRSYSGSNLEAEDPAKGCKHNENHNTSNAQKLEFEEFFSHVSGRSLFHEGSRHRHRELED 1115

Query: 1427 IKHQQFEKPAYRHIPECAGISKVQHHQNSDLP---------------FPSSWTNMLMGKG 1561
            ++  Q      R      G S    H NS+                   SSW +     G
Sbjct: 1116 LEDGQQWTRLDRLDNSLKGSSTFNQHDNSNNSQLQTRVESSQLYREDSISSWPSSTSKVG 1175

Query: 1562 DWEAEDLSCLGRGSISTLT-SKGTDAPHVDDYRGQ------SAESAFMVSKDGISKFQTP 1720
              + +D SC    SI  L  ++    P    Y  +      +AES   + K  + +   P
Sbjct: 1176 --KEKDASCT---SIRVLQGAENVAKPTTQQYGSEKYPETSTAESHAFLCKQLMHEQSNP 1230

Query: 1721 STEHAV----LNKGLELRNDSVDESVNRNCQHSIK------HMSEKPTDNSKCTEVQ--- 1861
               H      +NK  +L + S+ E VN +     +      H+S  P   +K  +V+   
Sbjct: 1231 QLYHGSQSHEMNKTFQLGSKSIAEPVNLSDAQDYRQSSYGQHVSNIPQLAAKVFDVEERI 1290

Query: 1862 TEMGHGQSPDKLS---------------SKIGVTTSPARKRKAEKETAEPFNWDTLRKQV 1996
            T M + Q+  + +               + +    S ARK KAE    +  +WD+LRKQV
Sbjct: 1291 TLMDNKQTDSENNFIGSNSKENTHFTNKANLNRNASKARKAKAESGQKDAVDWDSLRKQV 1350

Query: 1997 QSKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHER 2176
                   ERS  AMDSLDYEAMR+A V+EISD IKERGMN++LAER+K+FLNRLV +H  
Sbjct: 1351 LVNGRKKERSESAMDSLDYEAMRSAHVNEISDTIKERGMNNMLAERIKDFLNRLVREHGS 1410

Query: 2177 IDLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXX 2356
            IDLEWLRDV PD+AK+YLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV  
Sbjct: 1411 IDLEWLRDVPPDKAKEYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPL 1470

Query: 2357 XXXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCN 2536
                            ILESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCN
Sbjct: 1471 QPLPESLQLHLLELYPILESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSRPNCN 1530

Query: 2537 ACPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            ACP+RAEC              PGP+++ IV++  P
Sbjct: 1531 ACPMRAECRHFASAFASARLALPGPEDKSIVTATVP 1566


>ref|XP_007010232.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase, putative
            isoform 5 [Theobroma cacao] gi|508727145|gb|EOY19042.1|
            DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site)
            lyase, putative isoform 5 [Theobroma cacao]
          Length = 1978

 Score =  532 bits (1371), Expect = e-148
 Identities = 348/875 (39%), Positives = 466/875 (53%), Gaps = 90/875 (10%)
 Frame = +2

Query: 290  ESMLPVTPQKFADTKITKSTTAVVNRKESSSTRDPQPIPINGNMNSPLHLVVKKRTPYKK 469
            +++LP TP+     ++   T A          R+P                + +R P  +
Sbjct: 814  DNLLPTTPKNAPTLQLGSVTKASHTNVSEKKKREPD---------------LSRRAPSGR 858

Query: 470  -EPVXXXXXXXEPENKKTYGKSSKKFAGNSVDDIINGMKHLRITSSGKES------ALVP 628
             + +       E +     G S+K+     +++IIN    L +     E+      ALV 
Sbjct: 859  GKKLQEQKELYEYQQSSKAGPSAKQIYPIPIEEIINKFMGLTLDERNNEAKSEVQNALVI 918

Query: 629  YKGDGAVVPYN---LVKKRKPRPRVDLDPETNRLWNLLMG--GQSAETMDTNKEKWWEEE 793
            YKG G VVPY     +KKRKPRP+VDLDPETNR+WNLLMG  G+  E  D  KEKWWEEE
Sbjct: 919  YKGAGTVVPYEGFEFIKKRKPRPKVDLDPETNRVWNLLMGKEGEDIEGTDKEKEKWWEEE 978

Query: 794  RKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 973
            R+VF GRVDSFIA+MHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF
Sbjct: 979  RRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 1038

Query: 974  PSKSATA-ENGASPTKVGNHEVRITYPDGTT---FHQKMAMEPVTGQSQVIATETSTDRL 1141
            P KS+   E      K+   E     P+      +H+K+   P+  QS + +  ++  R 
Sbjct: 1039 PFKSSCKRECDGDGVKILIEEPEFCEPNPNETIKWHEKLFSHPLDRQSPMTSIMSTDYRR 1098

Query: 1142 DNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSK 1321
            +   P  +     +  ++  EE+++         V+QA+  +RS SGSNS+ E      K
Sbjct: 1099 NGENPGIERTSFTETHSQSLEEEVLSSQGSFDSSVIQANGVIRSYSGSNSETEDPTTCCK 1158

Query: 1322 -NLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQ 1498
             N  H S   Q E  A               ++   +K++Q E      + E A  S+++
Sbjct: 1159 FNNFHGSSVDQMENSASFEEFCNSVNGSSPFHE--GLKYKQSE------VTENAQKSRLE 1210

Query: 1499 HHQNSDLPFPSSWT----------------------NMLMGKGDWEAEDLSCLGRGSIST 1612
              +N  L  PSS+                       +M +     E E L   G   +S+
Sbjct: 1211 RKEN--LRGPSSFIQASHFRNQQVQVQAVGVSNHPLHMTLEFEAREREGLEPCGEECMSS 1268

Query: 1613 LTSK----------GTDAPHVDDYRGQSAESAFMVS-------------KDGISKFQTPS 1723
              S           G     +  ++ + A S  M +             +D +S+    +
Sbjct: 1269 WASTASGLNKLKQLGQSEDKITVHQNEQAISQDMATTTLNTLSRKHITHQDTVSQPGAHT 1328

Query: 1724 TEHAVLNKGLELRNDSVD------------ESVNRNCQHSIKH------MSEKPTDNSKC 1849
              + + N   E+RN +              ++VN+  + ++ +      ++E+P+D  K 
Sbjct: 1329 KSNQLCNNHQEMRNKAFQSESASVTMPLTTDAVNKMHKSTLLYAANALKLTERPSDVEKM 1388

Query: 1850 TEVQTEMGHGQSPDKLSSKIGVTTSP----------ARKRKAEKETAEPFNWDTLRKQVQ 1999
            + +  +        + ++K  + +S           +++RKAE E     +WD LRK VQ
Sbjct: 1389 SALNRDKDIENREVQSNTKEQIHSSEKENGAYSFLKSKRRKAEGEKNNATDWDALRKLVQ 1448

Query: 2000 SKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERI 2179
            +     ERS++ MDSLDY+AMR A+V+EIS+AIKERGMN++LAER+K FLNRLV +HE I
Sbjct: 1449 ANGWKKERSKDTMDSLDYKAMRHANVNEISNAIKERGMNNMLAERIKEFLNRLVREHESI 1508

Query: 2180 DLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXX 2359
            DLEWLR+V PD+AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV   
Sbjct: 1509 DLEWLREVPPDKAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQ 1568

Query: 2360 XXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNA 2539
                           +LESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNA
Sbjct: 1569 PLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNA 1628

Query: 2540 CPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            CP+R EC              PGP+E+ I SS  P
Sbjct: 1629 CPMRGECRHFASAFASARLALPGPEEKSITSSTVP 1663


>ref|XP_007010230.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase, putative
            isoform 3 [Theobroma cacao]
            gi|590566430|ref|XP_007010231.1| DNA
            N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase,
            putative isoform 3 [Theobroma cacao]
            gi|508727143|gb|EOY19040.1| DNA
            N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase,
            putative isoform 3 [Theobroma cacao]
            gi|508727144|gb|EOY19041.1| DNA
            N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase,
            putative isoform 3 [Theobroma cacao]
          Length = 1979

 Score =  532 bits (1371), Expect = e-148
 Identities = 348/875 (39%), Positives = 466/875 (53%), Gaps = 90/875 (10%)
 Frame = +2

Query: 290  ESMLPVTPQKFADTKITKSTTAVVNRKESSSTRDPQPIPINGNMNSPLHLVVKKRTPYKK 469
            +++LP TP+     ++   T A          R+P                + +R P  +
Sbjct: 815  DNLLPTTPKNAPTLQLGSVTKASHTNVSEKKKREPD---------------LSRRAPSGR 859

Query: 470  -EPVXXXXXXXEPENKKTYGKSSKKFAGNSVDDIINGMKHLRITSSGKES------ALVP 628
             + +       E +     G S+K+     +++IIN    L +     E+      ALV 
Sbjct: 860  GKKLQEQKELYEYQQSSKAGPSAKQIYPIPIEEIINKFMGLTLDERNNEAKSEVQNALVI 919

Query: 629  YKGDGAVVPYN---LVKKRKPRPRVDLDPETNRLWNLLMG--GQSAETMDTNKEKWWEEE 793
            YKG G VVPY     +KKRKPRP+VDLDPETNR+WNLLMG  G+  E  D  KEKWWEEE
Sbjct: 920  YKGAGTVVPYEGFEFIKKRKPRPKVDLDPETNRVWNLLMGKEGEDIEGTDKEKEKWWEEE 979

Query: 794  RKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 973
            R+VF GRVDSFIA+MHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF
Sbjct: 980  RRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 1039

Query: 974  PSKSATA-ENGASPTKVGNHEVRITYPDGTT---FHQKMAMEPVTGQSQVIATETSTDRL 1141
            P KS+   E      K+   E     P+      +H+K+   P+  QS + +  ++  R 
Sbjct: 1040 PFKSSCKRECDGDGVKILIEEPEFCEPNPNETIKWHEKLFSHPLDRQSPMTSIMSTDYRR 1099

Query: 1142 DNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSK 1321
            +   P  +     +  ++  EE+++         V+QA+  +RS SGSNS+ E      K
Sbjct: 1100 NGENPGIERTSFTETHSQSLEEEVLSSQGSFDSSVIQANGVIRSYSGSNSETEDPTTCCK 1159

Query: 1322 -NLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQ 1498
             N  H S   Q E  A               ++   +K++Q E      + E A  S+++
Sbjct: 1160 FNNFHGSSVDQMENSASFEEFCNSVNGSSPFHE--GLKYKQSE------VTENAQKSRLE 1211

Query: 1499 HHQNSDLPFPSSWT----------------------NMLMGKGDWEAEDLSCLGRGSIST 1612
              +N  L  PSS+                       +M +     E E L   G   +S+
Sbjct: 1212 RKEN--LRGPSSFIQASHFRNQQVQVQAVGVSNHPLHMTLEFEAREREGLEPCGEECMSS 1269

Query: 1613 LTSK----------GTDAPHVDDYRGQSAESAFMVS-------------KDGISKFQTPS 1723
              S           G     +  ++ + A S  M +             +D +S+    +
Sbjct: 1270 WASTASGLNKLKQLGQSEDKITVHQNEQAISQDMATTTLNTLSRKHITHQDTVSQPGAHT 1329

Query: 1724 TEHAVLNKGLELRNDSVD------------ESVNRNCQHSIKH------MSEKPTDNSKC 1849
              + + N   E+RN +              ++VN+  + ++ +      ++E+P+D  K 
Sbjct: 1330 KSNQLCNNHQEMRNKAFQSESASVTMPLTTDAVNKMHKSTLLYAANALKLTERPSDVEKM 1389

Query: 1850 TEVQTEMGHGQSPDKLSSKIGVTTSP----------ARKRKAEKETAEPFNWDTLRKQVQ 1999
            + +  +        + ++K  + +S           +++RKAE E     +WD LRK VQ
Sbjct: 1390 SALNRDKDIENREVQSNTKEQIHSSEKENGAYSFLKSKRRKAEGEKNNATDWDALRKLVQ 1449

Query: 2000 SKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERI 2179
            +     ERS++ MDSLDY+AMR A+V+EIS+AIKERGMN++LAER+K FLNRLV +HE I
Sbjct: 1450 ANGWKKERSKDTMDSLDYKAMRHANVNEISNAIKERGMNNMLAERIKEFLNRLVREHESI 1509

Query: 2180 DLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXX 2359
            DLEWLR+V PD+AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV   
Sbjct: 1510 DLEWLREVPPDKAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQ 1569

Query: 2360 XXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNA 2539
                           +LESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNA
Sbjct: 1570 PLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNA 1629

Query: 2540 CPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            CP+R EC              PGP+E+ I SS  P
Sbjct: 1630 CPMRGECRHFASAFASARLALPGPEEKSITSSTVP 1664


>ref|XP_007010229.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase, putative
            isoform 2 [Theobroma cacao] gi|508727142|gb|EOY19039.1|
            DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site)
            lyase, putative isoform 2 [Theobroma cacao]
          Length = 1999

 Score =  532 bits (1371), Expect = e-148
 Identities = 348/875 (39%), Positives = 466/875 (53%), Gaps = 90/875 (10%)
 Frame = +2

Query: 290  ESMLPVTPQKFADTKITKSTTAVVNRKESSSTRDPQPIPINGNMNSPLHLVVKKRTPYKK 469
            +++LP TP+     ++   T A          R+P                + +R P  +
Sbjct: 834  DNLLPTTPKNAPTLQLGSVTKASHTNVSEKKKREPD---------------LSRRAPSGR 878

Query: 470  -EPVXXXXXXXEPENKKTYGKSSKKFAGNSVDDIINGMKHLRITSSGKES------ALVP 628
             + +       E +     G S+K+     +++IIN    L +     E+      ALV 
Sbjct: 879  GKKLQEQKELYEYQQSSKAGPSAKQIYPIPIEEIINKFMGLTLDERNNEAKSEVQNALVI 938

Query: 629  YKGDGAVVPYN---LVKKRKPRPRVDLDPETNRLWNLLMG--GQSAETMDTNKEKWWEEE 793
            YKG G VVPY     +KKRKPRP+VDLDPETNR+WNLLMG  G+  E  D  KEKWWEEE
Sbjct: 939  YKGAGTVVPYEGFEFIKKRKPRPKVDLDPETNRVWNLLMGKEGEDIEGTDKEKEKWWEEE 998

Query: 794  RKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 973
            R+VF GRVDSFIA+MHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF
Sbjct: 999  RRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 1058

Query: 974  PSKSATA-ENGASPTKVGNHEVRITYPDGTT---FHQKMAMEPVTGQSQVIATETSTDRL 1141
            P KS+   E      K+   E     P+      +H+K+   P+  QS + +  ++  R 
Sbjct: 1059 PFKSSCKRECDGDGVKILIEEPEFCEPNPNETIKWHEKLFSHPLDRQSPMTSIMSTDYRR 1118

Query: 1142 DNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSK 1321
            +   P  +     +  ++  EE+++         V+QA+  +RS SGSNS+ E      K
Sbjct: 1119 NGENPGIERTSFTETHSQSLEEEVLSSQGSFDSSVIQANGVIRSYSGSNSETEDPTTCCK 1178

Query: 1322 -NLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQ 1498
             N  H S   Q E  A               ++   +K++Q E      + E A  S+++
Sbjct: 1179 FNNFHGSSVDQMENSASFEEFCNSVNGSSPFHE--GLKYKQSE------VTENAQKSRLE 1230

Query: 1499 HHQNSDLPFPSSWT----------------------NMLMGKGDWEAEDLSCLGRGSIST 1612
              +N  L  PSS+                       +M +     E E L   G   +S+
Sbjct: 1231 RKEN--LRGPSSFIQASHFRNQQVQVQAVGVSNHPLHMTLEFEAREREGLEPCGEECMSS 1288

Query: 1613 LTSK----------GTDAPHVDDYRGQSAESAFMVS-------------KDGISKFQTPS 1723
              S           G     +  ++ + A S  M +             +D +S+    +
Sbjct: 1289 WASTASGLNKLKQLGQSEDKITVHQNEQAISQDMATTTLNTLSRKHITHQDTVSQPGAHT 1348

Query: 1724 TEHAVLNKGLELRNDSVD------------ESVNRNCQHSIKH------MSEKPTDNSKC 1849
              + + N   E+RN +              ++VN+  + ++ +      ++E+P+D  K 
Sbjct: 1349 KSNQLCNNHQEMRNKAFQSESASVTMPLTTDAVNKMHKSTLLYAANALKLTERPSDVEKM 1408

Query: 1850 TEVQTEMGHGQSPDKLSSKIGVTTSP----------ARKRKAEKETAEPFNWDTLRKQVQ 1999
            + +  +        + ++K  + +S           +++RKAE E     +WD LRK VQ
Sbjct: 1409 SALNRDKDIENREVQSNTKEQIHSSEKENGAYSFLKSKRRKAEGEKNNATDWDALRKLVQ 1468

Query: 2000 SKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERI 2179
            +     ERS++ MDSLDY+AMR A+V+EIS+AIKERGMN++LAER+K FLNRLV +HE I
Sbjct: 1469 ANGWKKERSKDTMDSLDYKAMRHANVNEISNAIKERGMNNMLAERIKEFLNRLVREHESI 1528

Query: 2180 DLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXX 2359
            DLEWLR+V PD+AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV   
Sbjct: 1529 DLEWLREVPPDKAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQ 1588

Query: 2360 XXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNA 2539
                           +LESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNA
Sbjct: 1589 PLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNA 1648

Query: 2540 CPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            CP+R EC              PGP+E+ I SS  P
Sbjct: 1649 CPMRGECRHFASAFASARLALPGPEEKSITSSTVP 1683


>ref|XP_007010228.1| DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site) lyase, putative
            isoform 1 [Theobroma cacao] gi|508727141|gb|EOY19038.1|
            DNA N-glycosylase/DNA-(Apurinic or apyrimidinic site)
            lyase, putative isoform 1 [Theobroma cacao]
          Length = 1966

 Score =  532 bits (1371), Expect = e-148
 Identities = 348/875 (39%), Positives = 466/875 (53%), Gaps = 90/875 (10%)
 Frame = +2

Query: 290  ESMLPVTPQKFADTKITKSTTAVVNRKESSSTRDPQPIPINGNMNSPLHLVVKKRTPYKK 469
            +++LP TP+     ++   T A          R+P                + +R P  +
Sbjct: 834  DNLLPTTPKNAPTLQLGSVTKASHTNVSEKKKREPD---------------LSRRAPSGR 878

Query: 470  -EPVXXXXXXXEPENKKTYGKSSKKFAGNSVDDIINGMKHLRITSSGKES------ALVP 628
             + +       E +     G S+K+     +++IIN    L +     E+      ALV 
Sbjct: 879  GKKLQEQKELYEYQQSSKAGPSAKQIYPIPIEEIINKFMGLTLDERNNEAKSEVQNALVI 938

Query: 629  YKGDGAVVPYN---LVKKRKPRPRVDLDPETNRLWNLLMG--GQSAETMDTNKEKWWEEE 793
            YKG G VVPY     +KKRKPRP+VDLDPETNR+WNLLMG  G+  E  D  KEKWWEEE
Sbjct: 939  YKGAGTVVPYEGFEFIKKRKPRPKVDLDPETNRVWNLLMGKEGEDIEGTDKEKEKWWEEE 998

Query: 794  RKVFRGRVDSFIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 973
            R+VF GRVDSFIA+MHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF
Sbjct: 999  RRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARF 1058

Query: 974  PSKSATA-ENGASPTKVGNHEVRITYPDGTT---FHQKMAMEPVTGQSQVIATETSTDRL 1141
            P KS+   E      K+   E     P+      +H+K+   P+  QS + +  ++  R 
Sbjct: 1059 PFKSSCKRECDGDGVKILIEEPEFCEPNPNETIKWHEKLFSHPLDRQSPMTSIMSTDYRR 1118

Query: 1142 DNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSK 1321
            +   P  +     +  ++  EE+++         V+QA+  +RS SGSNS+ E      K
Sbjct: 1119 NGENPGIERTSFTETHSQSLEEEVLSSQGSFDSSVIQANGVIRSYSGSNSETEDPTTCCK 1178

Query: 1322 -NLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQ 1498
             N  H S   Q E  A               ++   +K++Q E      + E A  S+++
Sbjct: 1179 FNNFHGSSVDQMENSASFEEFCNSVNGSSPFHE--GLKYKQSE------VTENAQKSRLE 1230

Query: 1499 HHQNSDLPFPSSWT----------------------NMLMGKGDWEAEDLSCLGRGSIST 1612
              +N  L  PSS+                       +M +     E E L   G   +S+
Sbjct: 1231 RKEN--LRGPSSFIQASHFRNQQVQVQAVGVSNHPLHMTLEFEAREREGLEPCGEECMSS 1288

Query: 1613 LTSK----------GTDAPHVDDYRGQSAESAFMVS-------------KDGISKFQTPS 1723
              S           G     +  ++ + A S  M +             +D +S+    +
Sbjct: 1289 WASTASGLNKLKQLGQSEDKITVHQNEQAISQDMATTTLNTLSRKHITHQDTVSQPGAHT 1348

Query: 1724 TEHAVLNKGLELRNDSVD------------ESVNRNCQHSIKH------MSEKPTDNSKC 1849
              + + N   E+RN +              ++VN+  + ++ +      ++E+P+D  K 
Sbjct: 1349 KSNQLCNNHQEMRNKAFQSESASVTMPLTTDAVNKMHKSTLLYAANALKLTERPSDVEKM 1408

Query: 1850 TEVQTEMGHGQSPDKLSSKIGVTTSP----------ARKRKAEKETAEPFNWDTLRKQVQ 1999
            + +  +        + ++K  + +S           +++RKAE E     +WD LRK VQ
Sbjct: 1409 SALNRDKDIENREVQSNTKEQIHSSEKENGAYSFLKSKRRKAEGEKNNATDWDALRKLVQ 1468

Query: 2000 SKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERI 2179
            +     ERS++ MDSLDY+AMR A+V+EIS+AIKERGMN++LAER+K FLNRLV +HE I
Sbjct: 1469 ANGWKKERSKDTMDSLDYKAMRHANVNEISNAIKERGMNNMLAERIKEFLNRLVREHESI 1528

Query: 2180 DLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXX 2359
            DLEWLR+V PD+AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV   
Sbjct: 1529 DLEWLREVPPDKAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQ 1588

Query: 2360 XXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNA 2539
                           +LESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNA
Sbjct: 1589 PLPESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNA 1648

Query: 2540 CPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            CP+R EC              PGP+E+ I SS  P
Sbjct: 1649 CPMRGECRHFASAFASARLALPGPEEKSITSSTVP 1683


>ref|XP_002316518.2| hypothetical protein POPTR_0010s24060g [Populus trichocarpa]
            gi|550330487|gb|EEF02689.2| hypothetical protein
            POPTR_0010s24060g [Populus trichocarpa]
          Length = 1867

 Score =  518 bits (1334), Expect = e-144
 Identities = 356/891 (39%), Positives = 471/891 (52%), Gaps = 90/891 (10%)
 Frame = +2

Query: 242  QKFQQQHTLSQGHLCSESMLPVTPQKFADTKITKSTTAVVNRKESSSTRDP-------QP 400
            Q   +Q    Q HLC E +            +  +T    +R   +S +         QP
Sbjct: 676  QNLPKQCISPQPHLCLEMLGETNGSTQVQNSLCPTTIETSHRLSQTSLKTSRASDNQLQP 735

Query: 401  IPINGNMNSPLHL--------VVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKKFAGNS 556
               N  M+    +        +  ++    +EP        +P  K+  G+ +K+   ++
Sbjct: 736  KTCNAEMSRIQQMSEATVPISIPSEKGKIPQEPKDDLKVHQQPYAKRR-GRPAKQTFSST 794

Query: 557  VDDIINGMKHLRITSSGK------ESALVPYKGDGAVVPYN---LVKKRKPRPRVDLDPE 709
            ++ II  M+ LR+ +  K      ++ALVPYKGDG +VPY+   +VKK KPRP+VDLDPE
Sbjct: 795  IEQIIYQMEGLRLNAGSKKIENKEQNALVPYKGDGKLVPYDGFEVVKKHKPRPKVDLDPE 854

Query: 710  TNRLWNLLMG---GQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKG 880
            ++R+W LLMG    Q  E  D  KE+WW EERKVF GRVDSFIA+MHLVQGDRRFSKWKG
Sbjct: 855  SDRVWKLLMGKEGSQGLEGTDKGKEQWWGEERKVFHGRVDSFIARMHLVQGDRRFSKWKG 914

Query: 881  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSK---SATAENGASPTKVGNHEVRITYP 1051
            SVVDSVIGVFLTQNVSDHLSSSAFMSLA+ FP K   S   +   +   +   +  I  P
Sbjct: 915  SVVDSVIGVFLTQNVSDHLSSSAFMSLASLFPLKLRSSGACDRERTSIVIEEPDTCILNP 974

Query: 1052 DGTTFHQKMAMEPVTGQSQVIATETSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
            +      K    P+  QS V    ++    D+     +   + +  +   EE+ +     
Sbjct: 975  NDI----KWNSNPLYNQSSVTHHGSAEPHKDSETLFIERASMVETQSHSLEEEFVLSQDS 1030

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE-----CGWNVSKNLGHQSVSQQAERIAXXXXXXXXXX 1396
                 +QA+  VRS SGSNS+AE     C  +++ +L    + Q  E             
Sbjct: 1031 FDSSTVQAN-GVRSYSGSNSEAEDPATGCKPSMNDDLSFMDLLQM-ESPTLLGEFYGCEG 1088

Query: 1397 XXXCMNKMPSIKHQQFEKPAYR--------------------HIPEC--AGISKVQHHQN 1510
                 +K    + +Q E    R                    H   C    + KV    +
Sbjct: 1089 GSSLFHKESRHEKEQAEDLQNRQPGPGLERLGNLNCFSTYNQHFDYCNPQMLGKVVPCSD 1148

Query: 1511 SDLPFPSSWTNMLMGKGD--WEAEDLSC-------LGRGSISTLTSK--GTDAPHVDDYR 1657
              L   +S +N+   +G   +  E++S          +   +T TSK  G +A  V    
Sbjct: 1149 YGLLHMTSQSNVQQAEGFELYSEENISSWLSYSSRFDKEKAATCTSKAVGQEAESV---- 1204

Query: 1658 GQSAESAFMVSKDGISKFQTPSTEHA-VLNKGLELRNDSVDESVN-----RNCQHSIKHM 1819
            G++A   + + + G S  Q+         NK L+ ++ SV   VN        Q+S +  
Sbjct: 1205 GKTAAKQYELPRYGQSSSQSCHERQVDERNKTLQWQSMSVGGPVNLAEELPKKQNSYRQQ 1264

Query: 1820 SEKPTDN----------SKCTEVQTEMGHGQSPDKL------SSKIGVTTSPARKRKAEK 1951
                T N          +K T ++  +    + +K+      + K   +TS ARK K E 
Sbjct: 1265 VSSLTGNIFDVERITSVNKQTPLENNVVDPNTKEKVHHNNRENLKENASTSKARKGKVEG 1324

Query: 1952 ETAEPFNWDTLRKQVQSKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAE 2131
            E  + F+WD+LRKQVQ+  G  ER+++ MDSLDYEA+R+A V EISDAIKERGMN++LAE
Sbjct: 1325 EKKDAFDWDSLRKQVQAN-GRKERAKDTMDSLDYEAVRSARVKEISDAIKERGMNNMLAE 1383

Query: 2132 RMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVD 2311
            R++ FLNRLV +H  IDLEWLRDV PD+AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVD
Sbjct: 1384 RIQEFLNRLVREHGSIDLEWLRDVPPDKAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVD 1443

Query: 2312 TNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMI 2491
            TNVGRIAVRLGWV                  ILESIQKYLWPRLCKLDQ TLYELHYQMI
Sbjct: 1444 TNVGRIAVRLGWVPLQPLPESLQLHLLELYPILESIQKYLWPRLCKLDQRTLYELHYQMI 1503

Query: 2492 TFGKVFCTKRDPNCNACPLRAECXXXXXXXXXXXXXXPGPQERHIVSSAAP 2644
            TFGKVFCTK  PNCNACP+RAEC              PGP+E+ I +S  P
Sbjct: 1504 TFGKVFCTKSRPNCNACPMRAECRHFASAFASARLALPGPEEKGITTSTVP 1554


>ref|XP_002277401.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera]
          Length = 1942

 Score =  501 bits (1290), Expect = e-139
 Identities = 334/814 (41%), Positives = 422/814 (51%), Gaps = 115/814 (14%)
 Frame = +2

Query: 554  SVDDIINGMKHLRITSSGK------------------ESALVPYKGDGAVVPYN----LV 667
            S+D II  +KHL I    K                  ++ALV YK DG +VP+     LV
Sbjct: 818  SIDTIIEQLKHLDINRESKISYQEQNALVPYNMNKEEKNALVLYKRDGTIVPFEDSFGLV 877

Query: 668  KKRKPRPRVDLDPETNRLWNLLMGGQSAETMD---TNKEKWWEEERKVFRGRVDSFIAKM 838
            KKR+PRPRVDLD ET+R+W LLMG  ++E +D     K KWWEEER VFRGR DSFIA+M
Sbjct: 878  KKRRPRPRVDLDEETSRVWKLLMGNINSEGIDGTDEEKAKWWEEERNVFRGRADSFIARM 937

Query: 839  HLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSKSATAENGASPTK 1018
            HLVQGDRRFSKWKGSVVDSV+GVFLTQNVSDHLSSSAFMSLAA FP K     +    T+
Sbjct: 938  HLVQGDRRFSKWKGSVVDSVVGVFLTQNVSDHLSSSAFMSLAAHFPCKCNHRPSTELETR 997

Query: 1019 --VGNHEVRITYPDGT-TFHQKMAMEPVTGQSQV-------------------------- 1111
              V   EV    P+ T T+++KM+ + V  QS +                          
Sbjct: 998  ILVEEPEVCTLNPEDTVTWNEKMSNQAVCDQSSMTLHHTEEAVNSNGSYGNSRGTVGTVD 1057

Query: 1112 IATETSTDRLDNVMPEKKTFLVNDPFT--------------RRTEEDIIXXXXXXXXFVL 1249
            I+ +   D     M  K +  VN   T              R   +D           + 
Sbjct: 1058 ISKDKMLDSTGKKMSNKSS--VNGTTTQMIGTELACFIGGDRTAADDAASSQNSLDFSIA 1115

Query: 1250 QASEDVRSSSGSNSDAE----CGWNVSKNLGHQS---VSQQAERIAXXXXXXXXXXXXXC 1408
            Q +E + S S SNS+ E     G+ ++   G  S   + Q AE                C
Sbjct: 1116 QTAEKIGSCSESNSEVEDIMPTGYGLNNFDGSTSFVGLLQMAESTRLHEVFCRSNINATC 1175

Query: 1409 MNKMPSIKHQQFEKPAY-RHIPECAGISKVQHHQN-SDLPFPSSWTNMLMGKGDWEAEDL 1582
                  + +       Y +      G++  +     + +P  +   ++    G  E E  
Sbjct: 1176 GANPKDVNYHSESMSGYNKRSQNMDGLADCRSSLGVTIIPSSNYHLHLNPNSGVLEVEGF 1235

Query: 1583 SCLGRGSISTLTSKGTDAPHVDDYRGQSAESAFMVSKD-----GISKFQTPSTEHAVLNK 1747
               G    S ++    D   V +  G +AES      +      I    T S E+   + 
Sbjct: 1236 EMSGETRSSEISK---DQKCVSEQSGLTAESDNQAKDEKKLTESIQAGPTSSCENTFSDN 1292

Query: 1748 GLELRNDSVDES-------------------VNRNCQ-HSIKHMSEKPTDNSKCTEV--- 1858
             L+  N+ + ES                   ++R  Q  ++ ++S K  D   C      
Sbjct: 1293 NLQGENNKIIESQSSPVGDPKNVVESVGQEQISRMQQSQNLMNISGKALDVIDCPSAFSN 1352

Query: 1859 -------QTEMG---HGQSPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKA 2008
                   ++E G   HG S  K S++IGV TS A+K KA +E     +WD LRK+ Q   
Sbjct: 1353 QTHIEDRKSETGVKEHGLSSSKASNEIGVDTSKAKKGKARREEKNTLHWDNLRKEAQVNG 1412

Query: 2009 GTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLE 2188
               ER+   MDSLD+EA+R +DV+EI++ IKERGMN++LAER+K+FLNRLV DH  IDLE
Sbjct: 1413 RKRERTVNTMDSLDWEAVRCSDVNEIANTIKERGMNNMLAERIKDFLNRLVRDHGSIDLE 1472

Query: 2189 WLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXX 2368
            WLRDV PD+AK+YLLS RGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV      
Sbjct: 1473 WLRDVPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLP 1532

Query: 2369 XXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPL 2548
                        +LESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNACP+
Sbjct: 1533 ESLQLHLLELYPVLESIQKYLWPRLCKLDQRTLYELHYQMITFGKVFCTKSKPNCNACPM 1592

Query: 2549 RAECXXXXXXXXXXXXXXPGPQERHIVSSAAPTS 2650
            R EC               GP+ER IVS+ A  S
Sbjct: 1593 RGECRHFASAFASARLALTGPEERSIVSTNANES 1626


>ref|XP_004164145.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus]
          Length = 1736

 Score =  498 bits (1282), Expect = e-138
 Identities = 320/807 (39%), Positives = 432/807 (53%), Gaps = 93/807 (11%)
 Frame = +2

Query: 509  NKKTYGKSSKKFAGNSV----DDIINGMKHL-----RITSSGKESALVPYKGDGAVVPY- 658
            +K+ Y    +KF   +     ++I++ MK L      ++   +++A+VPYKG+GAVVPY 
Sbjct: 632  HKQGYSFGFQKFPAKTTSLLENEILHKMKRLSLNDHEVSIRSEQNAIVPYKGNGAVVPYV 691

Query: 659  --NLVKKRKPRPRVDLDPETNRLWNLLMGGQSAETMDTN---KEKWWEEERKVFRGRVDS 823
                ++KRK RPRVD+DPET R+WNLLMG + +E ++++   KEKWWEEERKVFRGR DS
Sbjct: 692  ESEYLRKRKARPRVDIDPETERIWNLLMGKEGSEGIESHEKDKEKWWEEERKVFRGRADS 751

Query: 824  FIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP--SKSATAE 997
            FIA+MHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP  S S    
Sbjct: 752  FIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPVKSASNLRT 811

Query: 998  NGASPTKVGNHEVR--ITYP-DGTTFHQK--------MAMEPVTGQSQVI--ATETSTDR 1138
             G   T +  +E    + YP +   +H +        M    +  Q+Q+    TE     
Sbjct: 812  QGEVETSIVANESAACVLYPAESIRWHVQELSVPRFEMPQTSINHQNQIANSGTEKIFTE 871

Query: 1139 LDNVMPEKKTFLVNDPF-----------------TRRTEEDIIXXXXXXXXFV------- 1246
            L   + E++     D F                     EE I+        +        
Sbjct: 872  LGGQIVEEEVISSQDSFDSTITQGTAGARSCSGSNSEAEEPIVSYNSSSTHYSNFTDIKQ 931

Query: 1247 LQASEDVRSSSGSNSDAECGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPS 1426
            ++ +  ++ S    + +     VS++   Q    +   +               +N + +
Sbjct: 932  METTATIQKSFSDLNRSSVSDEVSEHKHWQLPDGKQGSLTSEWNEIDNLSGHSLINFLVN 991

Query: 1427 IKHQQFEKPA-----YRHIPECAGISKVQHHQNSDLPFPSSWTNMLMGKGDWEAEDLSCL 1591
            I++Q  + P        HI    G+ +V+  +       SS  +++ G      E     
Sbjct: 992  IENQPKQVPDAPSNNQLHITPDCGVLEVEGREAFSEESTSSGPSIVSG---CSTEKNMTF 1048

Query: 1592 GRGSISTLTSKGTDAPHVDDYRGQSAESAFMVSKDGISKFQTPSTEHAV--LNKGLELRN 1765
             R +I  L  +       D+ + +S E+  M   + +S       EH+V     G++ R+
Sbjct: 1049 HRLNIGALEQRLDKTSAEDNVQARSHETTRMEHSESVS-------EHSVHLQGNGIQFRS 1101

Query: 1766 D------SVDESVNRNCQHSIKHMS--------EKPTDNSKCTEVQTEMGHGQ------- 1882
                      E   RN    ++ +S        + P + S  + V     H +       
Sbjct: 1102 HCEYNLHGKYEPCERNNTSPVESVSVTNPPPELDTPAEKSAVSNVVHVHAHTEKLLPGKG 1161

Query: 1883 -----------SPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSR 2029
                       S  +  ++  ++ S A++RK   E     +WD+LRKQV++     E+ +
Sbjct: 1162 NLINFSNNEAHSLSQAHNEGNISPSKAKRRKVNSEKKGGMDWDSLRKQVEANGQIKEKGK 1221

Query: 2030 EAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEP 2209
            +AMDS+DYEA+R ADV EIS+AIKERGMN++LAER+K FLNRLV DH  IDLEWLRDV P
Sbjct: 1222 DAMDSIDYEAIRLADVREISNAIKERGMNNMLAERIKEFLNRLVTDHGSIDLEWLRDVPP 1281

Query: 2210 DRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXX 2389
            D+AKDYLLS+RGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV             
Sbjct: 1282 DKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHL 1341

Query: 2390 XXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXX 2569
                 +LESIQKYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+R EC   
Sbjct: 1342 LELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHF 1401

Query: 2570 XXXXXXXXXXXPGPQERHIVSSAAPTS 2650
                       P P E+ IV+S  P S
Sbjct: 1402 ASAFASARLALPAPDEKGIVASTNPMS 1428


>ref|XP_004150492.1| PREDICTED: transcriptional activator DEMETER-like [Cucumis sativus]
          Length = 1679

 Score =  498 bits (1282), Expect = e-138
 Identities = 320/807 (39%), Positives = 432/807 (53%), Gaps = 93/807 (11%)
 Frame = +2

Query: 509  NKKTYGKSSKKFAGNSV----DDIINGMKHL-----RITSSGKESALVPYKGDGAVVPY- 658
            +K+ Y    +KF   +     ++I++ MK L      ++   +++A+VPYKG+GAVVPY 
Sbjct: 575  HKQGYSFGFQKFPAKTTSLLENEILHKMKRLSLNDHEVSIRSEQNAIVPYKGNGAVVPYV 634

Query: 659  --NLVKKRKPRPRVDLDPETNRLWNLLMGGQSAETMDTN---KEKWWEEERKVFRGRVDS 823
                ++KRK RPRVD+DPET R+WNLLMG + +E ++++   KEKWWEEERKVFRGR DS
Sbjct: 635  ESEYLRKRKARPRVDIDPETERIWNLLMGKEGSEGIESHEKDKEKWWEEERKVFRGRADS 694

Query: 824  FIAKMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP--SKSATAE 997
            FIA+MHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP  S S    
Sbjct: 695  FIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPVKSASNLRT 754

Query: 998  NGASPTKVGNHEVR--ITYP-DGTTFHQK--------MAMEPVTGQSQVI--ATETSTDR 1138
             G   T +  +E    + YP +   +H +        M    +  Q+Q+    TE     
Sbjct: 755  QGEVETSIVANESAACVLYPAESIRWHVQELSVPRFEMPQTSINHQNQIANSGTEKIFTE 814

Query: 1139 LDNVMPEKKTFLVNDPF-----------------TRRTEEDIIXXXXXXXXFV------- 1246
            L   + E++     D F                     EE I+        +        
Sbjct: 815  LGGQIVEEEVISSQDSFDSTITQGTAGARSCSGSNSEAEEPIVSYNSSSTHYSNFTDIKQ 874

Query: 1247 LQASEDVRSSSGSNSDAECGWNVSKNLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPS 1426
            ++ +  ++ S    + +     VS++   Q    +   +               +N + +
Sbjct: 875  METTATIQKSFSDLNRSSVSDEVSEHKHWQLPDGKQGSLTSEWNEIDNLSGHSLINFLVN 934

Query: 1427 IKHQQFEKPA-----YRHIPECAGISKVQHHQNSDLPFPSSWTNMLMGKGDWEAEDLSCL 1591
            I++Q  + P        HI    G+ +V+  +       SS  +++ G      E     
Sbjct: 935  IENQPKQVPDAPSNNQLHITPDCGVLEVEGREAFSEESTSSGPSIVSG---CSTEKNMTF 991

Query: 1592 GRGSISTLTSKGTDAPHVDDYRGQSAESAFMVSKDGISKFQTPSTEHAV--LNKGLELRN 1765
             R +I  L  +       D+ + +S E+  M   + +S       EH+V     G++ R+
Sbjct: 992  HRLNIGALEQRLDKTSAEDNVQARSHETTRMEHSESVS-------EHSVHLQGNGIQFRS 1044

Query: 1766 D------SVDESVNRNCQHSIKHMS--------EKPTDNSKCTEVQTEMGHGQ------- 1882
                      E   RN    ++ +S        + P + S  + V     H +       
Sbjct: 1045 HCEYNLHGKYEPCERNNTSPVESVSVTNPPPELDTPAEKSAVSNVVHVHAHTEKLLPGKG 1104

Query: 1883 -----------SPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSR 2029
                       S  +  ++  ++ S A++RK   E     +WD+LRKQV++     E+ +
Sbjct: 1105 NLINFSNNEAHSLSQAHNEGNISPSKAKRRKVNSEKKGGMDWDSLRKQVEANGQIKEKGK 1164

Query: 2030 EAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEP 2209
            +AMDS+DYEA+R ADV EIS+AIKERGMN++LAER+K FLNRLV DH  IDLEWLRDV P
Sbjct: 1165 DAMDSIDYEAIRLADVREISNAIKERGMNNMLAERIKEFLNRLVTDHGSIDLEWLRDVPP 1224

Query: 2210 DRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXX 2389
            D+AKDYLLS+RGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV             
Sbjct: 1225 DKAKDYLLSVRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHL 1284

Query: 2390 XXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXX 2569
                 +LESIQKYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+R EC   
Sbjct: 1285 LELYPVLESIQKYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECKHF 1344

Query: 2570 XXXXXXXXXXXPGPQERHIVSSAAPTS 2650
                       P P E+ IV+S  P S
Sbjct: 1345 ASAFASARLALPAPDEKGIVASTNPMS 1371


>ref|XP_002881449.1| hypothetical protein ARALYDRAFT_902767 [Arabidopsis lyrata subsp.
            lyrata] gi|297327288|gb|EFH57708.1| hypothetical protein
            ARALYDRAFT_902767 [Arabidopsis lyrata subsp. lyrata]
          Length = 1619

 Score =  463 bits (1191), Expect = e-127
 Identities = 297/731 (40%), Positives = 385/731 (52%), Gaps = 61/731 (8%)
 Frame = +2

Query: 551  NSVDDIINGMKHLRITSSGKESALVPYK-------------GDGAVVPYNLVKKRKPRPR 691
            N V++I   ++ L I     E+ALVPY              G GA+VP   VKKR+PRP+
Sbjct: 582  NLVEEISEQLRLLDINRENSETALVPYSMKTQGNQIVLFGGGAGAIVPVTPVKKRRPRPK 641

Query: 692  VDLDPETNRLWNLLMGGQSAETMDTN---KEKWWEEERKVFRGRVDSFIAKMHLVQGDRR 862
            VDLD ET ++W LL+   ++E +D +   K KWWEEER VFRGR DSFIA+MHLVQGDRR
Sbjct: 642  VDLDDETEKVWKLLLENINSEGIDGSDDQKAKWWEEERNVFRGRADSFIARMHLVQGDRR 701

Query: 863  FSKWKGSVVDSVIGVFLTQNVSDHLS-------------------------------SSA 949
            F+ WKGSVVDSV+GVFLTQNVSDHLS                               SSA
Sbjct: 702  FTPWKGSVVDSVVGVFLTQNVSDHLSRYKHLRNSTSKPNPTQQQVTKLTCFVFFCPCSSA 761

Query: 950  FMSLAARFPSKSATAENGASPTKVGNHEVRITYPDGTTFHQKMAMEPVTGQSQVIATETS 1129
            FMSL + FP  S  + N  + T      ++ITY D     + M+  P   QS VI     
Sbjct: 762  FMSLVSEFPVTSVPSSNFEAGTS-SMPSIQITYLDS---EESMSSPPNHNQSSVI----- 812

Query: 1130 TDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGW 1309
               L N  P+++   V+   T R+  +I              S     S    +D++   
Sbjct: 813  ---LKNTQPDEEKDYVHTSETSRSSSEI--------------SSSAHQSVDKTTDSKTFV 855

Query: 1310 NVSKNLGHQSVSQQAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGIS 1489
               +      V +  +                C + M S   Q  E+         AG S
Sbjct: 856  EPDRKGSSVEVDKTGQNCLVLNLFTSEDSALTCQHSMVSDAPQNTER---------AGSS 906

Query: 1490 KVQHHQNSDLPFPSSWTNMLMG-------KGDWEAEDLSCLGRGSISTLTSKGTDAPHVD 1648
                  N +  + +S+  +L G       K  +E   LS  G   +S   S G  +  + 
Sbjct: 907  T---EINLEGEYRTSYMKLLQGVLEESNQKNQYEVGVLSNPGSLQVSPNMSPGDCSSEIT 963

Query: 1649 DYRGQSAESAFMVSKDGISKFQTPSTEHAVLN--KGLELRNDSVDESVNRNCQHSIKHMS 1822
            D+   S +     S D    +     +  VL+  K     + S   S  R     I  ++
Sbjct: 964  DFH--SLKRPTKSSDDSYEPYCCYQQDGDVLSCQKPEMPESSSSFRSTKRKRSFQIPDLN 1021

Query: 1823 EKPT--DNSKCTEVQTEMGHGQSPDKLSSKIGVT---TSPARKRKAEKETAEPFNWDTLR 1987
            E  +  D  + TE   +    Q PD    ++  T   T  A+ +K  KE  E F+WD+LR
Sbjct: 1022 ESTSCLDVIEDTENPPDPYSRQLPDSSCKELNPTDAATLNAKGKKVLKEKKEAFDWDSLR 1081

Query: 1988 KQVQSKAGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVED 2167
            ++ + + G  E++   MDS+D+EA+RTADV E+++ IK+RGMNH+LAER++ FLNRLV +
Sbjct: 1082 REAEGREGKREKTTRTMDSVDWEAIRTADVSEVAETIKKRGMNHMLAERIQGFLNRLVNE 1141

Query: 2168 HERIDLEWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGW 2347
            H  IDLEWLRD+ PD+AK+YLLS RGLGLKSVEC+RLLTLHHLAFPVDTNV RIAVRLGW
Sbjct: 1142 HGSIDLEWLRDIPPDKAKEYLLSFRGLGLKSVECVRLLTLHHLAFPVDTNVARIAVRLGW 1201

Query: 2348 VXXXXXXXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDP 2527
            V                  ILESIQKYLWPRLCKLDQ+TLYELHYQMITFGKVFCTK  P
Sbjct: 1202 VPLQPLPESLQLHLLEMYPILESIQKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKP 1261

Query: 2528 NCNACPLRAEC 2560
            NCNACP+R EC
Sbjct: 1262 NCNACPMRGEC 1272


>ref|XP_004497617.1| PREDICTED: protein ROS1-like [Cicer arietinum]
          Length = 2200

 Score =  404 bits (1037), Expect = e-109
 Identities = 271/732 (37%), Positives = 379/732 (51%), Gaps = 56/732 (7%)
 Frame = +2

Query: 608  KESALVPYKGDGAVVPYNLVKKRKPRPRVDLDPETNRLWNLLM---GGQSAETMDTNKEK 778
            +++ LVP++G      ++ +KKR+PRP+VDLD ET+R+W LL+        +  D +K K
Sbjct: 1119 EQNTLVPFQGS-----FDPIKKRRPRPKVDLDEETDRVWKLLLLDINHDGVDGTDEDKAK 1173

Query: 779  WWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKG----SVVDSVIGVFLTQNVSDHLSSS 946
            WWEEER VF GR DSFIA+MHLVQGDRRFS+WKG    SVV   +   ++ ++S +  S 
Sbjct: 1174 WWEEERNVFHGRADSFIARMHLVQGDRRFSRWKGSVVDSVVGVFLTQNVSDHLSRYRLSF 1233

Query: 947  AFMSLA------------ARFPSKSATA----ENGASPTKVGNHEVRITYP-DGTTFHQK 1075
             F   A            + FP K  +     +   +  +V   EV I  P + T     
Sbjct: 1234 CFCFFANFEFNMPQQQKISLFPKKCGSMYKAYDGEGTSLEVNKQEVNIVEPEENTECGVN 1293

Query: 1076 MAMEPVTGQSQ--VIATETSTDRLDNVMPEKKTF-----LVNDPFTRRTEEDIIXXXXXX 1234
            +  + V  QS   V   E S ++  N     +T      L ++   ++TE          
Sbjct: 1294 LLNQSVCNQSSMTVDIVEHSGEKAVNSNGSCRTASSLIGLTDESNCKQTESPQTNTTECH 1353

Query: 1235 XXFVL-QASEDVRSSSGSNSD------AECGWNVSKNLGHQSVSQQAERIAXXXXXXXXX 1393
               V+ +  E+     G++ +      ++C    S+  G  S  Q  E+I          
Sbjct: 1354 SPMVMIEEGEEKSCYHGASQELNDIVSSQCSVISSQISGDFSNDQNPEKIGSCSDSNSEV 1413

Query: 1394 XXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQHHQNS----------DLPFPSSWTN 1543
                   K  S         ++  + E    +K  H  NS          D     SW  
Sbjct: 1414 EDLSSTAKYNSCG-------SFCKLLEMVSSTKF-HEVNSQRSKSIEIMRDDNAKESWKK 1465

Query: 1544 MLMGKGDWEAEDLSCLGRGSISTLTSKGTDAPHVDDYRGQSAESAFMVSKD----GISKF 1711
              + +   E   +         T  S   +    D  + +++ S F+ +KD     +  F
Sbjct: 1466 SNITQNPLEESIIPSHEYNLKLTHNSGALEVNCSDPSKTEASSSLFLKNKDENEMNMPSF 1525

Query: 1712 QTPSTE-HAVLNKGLELRNDSVDESVNRNCQHSIKHMSEKPTDNSKCTEVQTEMGHGQSP 1888
            QT  +E H  +     + +    +  + + Q S  ++S +  D     + + ++  G   
Sbjct: 1526 QTAESEGHVAVTHSQTILSQVHPQEQSSDMQQSFFNISGQTND---LIQKERDLNLGDHK 1582

Query: 1889 DKLSSKIGVTTSPARKRKAE---KETAEPFNWDTLRKQVQSKAGTTERSREAMDSLDYEA 2059
            D + S+    +S   + K++   KE  E F+WD+LR   Q+KAG  E++   MDSLD++A
Sbjct: 1583 DAVRSETNEISSVPIELKSKSQVKEEKEQFDWDSLRINAQAKAGKREKTESTMDSLDWDA 1642

Query: 2060 MRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYLLSI 2239
            +R ADV EI++ IKERGMN+ LAER++ FLNRLVEDH  IDLEWLRDV PD+AK+YLLS+
Sbjct: 1643 VRCADVGEIANTIKERGMNNRLAERIQKFLNRLVEDHGSIDLEWLRDVPPDQAKEYLLSV 1702

Query: 2240 RGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXILESI 2419
            RGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV                  +LESI
Sbjct: 1703 RGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLEMYPVLESI 1762

Query: 2420 QKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXXXXXX 2599
            QKYLWPRLCKLDQ+TLYELHYQMITFGKVFCTK  PNCNACP+RAEC             
Sbjct: 1763 QKYLWPRLCKLDQKTLYELHYQMITFGKVFCTKSKPNCNACPMRAECRHFASAFASARLA 1822

Query: 2600 XPGPQERHIVSS 2635
             PGP+++ IV++
Sbjct: 1823 LPGPEQKSIVTA 1834


>gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase, partial
            [Gossypium hirsutum]
          Length = 2055

 Score =  338 bits (868), Expect = 6e-90
 Identities = 190/384 (49%), Positives = 232/384 (60%), Gaps = 2/384 (0%)
 Frame = +2

Query: 1499 HHQNSDLPFPSSWTNMLMGKGDWEAEDLSCLGRGSISTLTSKGTDAPHVDDYRGQSAESA 1678
            +HQ     F S  T++ M            + +   ST  S  T      D++ +SA   
Sbjct: 1406 NHQEKRKDFQSESTSVTM------PPTTDAVAKMQKSTSLSVTTHQEKRKDFQSESASVT 1459

Query: 1679 FMVSKDGISKFQTPSTEHAVLNKGLELRNDSVDESVNRNCQHSIKHMSEKPTDNSKCTEV 1858
               S D ++K Q  ++  A     L  R   ++                  +D  K TE 
Sbjct: 1460 MPPSTDAVTKMQKSTSLSAANTHKLTERPSDIERMT--------------ASDKDKATEN 1505

Query: 1859 QTEMGHGQSPDKLS-SKIGVTTS-PARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSRE 2032
            +    + + P   S +++G ++S   ++RKA++      +WD LRKQVQ+     ERS++
Sbjct: 1506 REVQSNAKEPMHSSENQLGESSSLKPKRRKAQEGKNNATDWDQLRKQVQANGLKKERSKD 1565

Query: 2033 AMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPD 2212
             MDSLDYEAMR A+V+EIS+ IKERGMN++LAER+K+FLNRLV DHE IDLEWLRDV PD
Sbjct: 1566 TMDSLDYEAMRNANVNEISNTIKERGMNNMLAERIKDFLNRLVRDHESIDLEWLRDVPPD 1625

Query: 2213 RAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXX 2392
            +AKDYLLSIRGLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV              
Sbjct: 1626 KAKDYLLSIRGLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPPPESLQLHLL 1685

Query: 2393 XXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXX 2572
                ILESIQKYLWPRLCKLDQ TLYELHYQMITFGKVFCTK  PNCNACP+R EC    
Sbjct: 1686 ELYPILESIQKYLWPRLCKLDQYTLYELHYQMITFGKVFCTKSKPNCNACPMRGECRHFA 1745

Query: 2573 XXXXXXXXXXPGPQERHIVSSAAP 2644
                      PGP+ER I SS AP
Sbjct: 1746 GAFASARFALPGPEERSITSSTAP 1769



 Score =  233 bits (594), Expect = 3e-58
 Identities = 230/780 (29%), Positives = 349/780 (44%), Gaps = 59/780 (7%)
 Frame = +2

Query: 2    IEDKFNGTSFSM-CSGSTTAEEILQQFESRRNKSLLAQI-STGTPNTELRNSDYGRKVMN 175
            IED+F+   + M CS S  A  +  +  +  N      +   GT +   R+ +  R+   
Sbjct: 777  IEDEFHAYKYGMKCSVSHAAGLLQTKGTNDVNAGQFTSLRDCGTSDPHFRSDNIDRRKGG 836

Query: 176  VNPNDNSTNFIRDRYMNYPDMRQKFQQQHTLSQGHLCSESMLPVTPQKFADTKITKSTTA 355
            V      +    +RY+N         +Q+ LSQ H   E +  +         +  +   
Sbjct: 837  V-----FSQLTGNRYVNSTAGDLTSSKQNILSQLHSGIEKVGNIN-----GLALVHNLAT 886

Query: 356  VVNRKES-SSTRDPQPIPINGNMNSPLHLVV---KKRTPYKKEPVXXXXXXXEPENK--- 514
            + NR     +T +    P  G +    H  V   KKR P     V         E K   
Sbjct: 887  IENRNLLLPTTPEKVSTPRTGLVGQTFHTNVSENKKREPGLPRNVPFTVGKMVQEKKRVS 946

Query: 515  ------KTYGKSSKKFAGNSVDDIINGMKHLRITSSGK------ESALVPYKGDGAVVPY 658
                  K  G S+K  + N V++IIN  K L +           ++ALV Y G G VVP+
Sbjct: 947  ENQQSTKARGPSAKHVSLNPVEEIINRFKGLTLEEKNNKPKAELQNALVLYNGAGTVVPF 1006

Query: 659  NLVK--KRKPRPRVDLDPETNRLWNLLMGGQSAETMDTNKEKWWEEERKVFRGRVDSFIA 832
               +  K+K RPRVDLDPETNR+WNLLMG +  +T  T+KEKWWEEER+VF GRVDSFIA
Sbjct: 1007 EGFESIKKKVRPRVDLDPETNRVWNLLMGKEGEDTEGTDKEKWWEEERRVFHGRVDSFIA 1066

Query: 833  KMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSKSA-TAENGAS 1009
            +MHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAA+FP KS+   +  A 
Sbjct: 1067 RMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFPLKSSCKGDCNAE 1126

Query: 1010 PTKVGNHE---VRITYPDGTTFHQKMAMEPVTGQSQVIATETSTDRLDNVM---PEKKTF 1171
             T +   E     +   +   +H+K     +  QS  +    STD   N      E+ +F
Sbjct: 1127 RTTILIEEPEVCELNSEETIKWHEKPFRHQLDSQSS-MTPNRSTDYQRNSEYSGIERTSF 1185

Query: 1172 LVNDPFTRRTEEDIIXXXXXXXXFVLQASEDVRSSSGSNSDAECGWNVSKNLG-HQSVSQ 1348
            +    +++  EE+++         V+QA+  +R+ SGS S+ E      K L  H S   
Sbjct: 1186 M--GTYSQSLEEEVLSSQGSFDSSVIQANGGIRTYSGSYSETEDPTMSCKFLSIHGSTLD 1243

Query: 1349 QAERIAXXXXXXXXXXXXXCMNKMPSIKHQQFEKPAYRHIPECAGISKVQHHQNSDLPFP 1528
            Q E  A              +++   IK++Q E      + E    S+++  +N      
Sbjct: 1244 QIENSASVEEFYHCASGSSQLHE--GIKYKQSE------VTEEGQTSRLERTEN------ 1289

Query: 1529 SSWTNMLMGKGDWEAEDLSCLGRGSIS-----TLTSKGTDAPHVDDYRGQSAESAFMVSK 1693
              W++      ++  +       G+ S     TL S+  +   ++ +R +   S++  + 
Sbjct: 1290 LKWSSSFNQGNNFRNQQFRVQAFGASSHPLHMTLESEPWEGEGLEPFR-EECMSSWASTA 1348

Query: 1694 DGISKFQTPSTEHA---VLNKGLELRNDSVDESVNRNCQHSIKHMSEKPT-DNSKCTEVQ 1861
             G++K + P        V + G  +  D    ++N      I H  E  T  N  C   Q
Sbjct: 1349 SGLNKPKQPGQNGGKIMVQHNGQPISQDMATTTLNTLSGEHIMHQKEVHTRSNQLCNNHQ 1408

Query: 1862 TEMGHGQSPD---------------KLSSKIGVTTSPARKRKAEKETAE---PFNWDTLR 1987
             +    QS                 + S+ + VTT   +++  + E+A    P + D + 
Sbjct: 1409 EKRKDFQSESTSVTMPPTTDAVAKMQKSTSLSVTTHQEKRKDFQSESASVTMPPSTDAVT 1468

Query: 1988 KQVQSKAGTTERSREAMD-SLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVE 2164
            K  +S + +   + +  +   D E M  +D  +   A + R +     E M +  N+L E
Sbjct: 1469 KMQKSTSLSAANTHKLTERPSDIERMTASDKDK---ATENREVQSNAKEPMHSSENQLGE 1525


>gb|EPS65696.1| hypothetical protein M569_09081, partial [Genlisea aurea]
          Length = 591

 Score =  333 bits (855), Expect = 2e-88
 Identities = 170/275 (61%), Positives = 199/275 (72%)
 Frame = +2

Query: 1826 KPTDNSKCTEVQTEMGHGQSPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSK 2005
            KPT +  CT     MGH  +  +LS K  ++ +   K+  EKE  +   WD LRK+ +S+
Sbjct: 13   KPTGD--CTHPHATMGH-PTESQLSDKSIISNTS--KQMTEKEKVDTSKWDDLRKEAESR 67

Query: 2006 AGTTERSREAMDSLDYEAMRTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDL 2185
             G  ERS E+ DSLDYEA+R A V EIS+ IKERGMN+ LAER+K FL+R+V+DHER+DL
Sbjct: 68   IGIKERSLESADSLDYEALRNAPVSEISETIKERGMNNRLAERIKEFLDRVVQDHERVDL 127

Query: 2186 EWLRDVEPDRAKDYLLSIRGLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXX 2365
            EWLR+V+PD+AKDYLLSIRGLGLKSVEC+RLLTL +LAFPVDTNVGRIAVRLGWV     
Sbjct: 128  EWLREVQPDKAKDYLLSIRGLGLKSVECVRLLTLRNLAFPVDTNVGRIAVRLGWVPLQPL 187

Query: 2366 XXXXXXXXXXXXXILESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACP 2545
                         +LESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACP
Sbjct: 188  PESLQLHLLELYPVLESIQKYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACP 247

Query: 2546 LRAECXXXXXXXXXXXXXXPGPQERHIVSSAAPTS 2650
            +RAEC              PGP+E+ IVSS  P S
Sbjct: 248  MRAECRHFASAFASARLALPGPEEKRIVSSVHPIS 282


>gb|AGU16984.1| DEMETER [Citrus sinensis]
          Length = 1573

 Score =  333 bits (853), Expect = 3e-88
 Identities = 164/255 (64%), Positives = 189/255 (74%)
 Frame = +2

Query: 1883 SPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDSLDYEAM 2062
            S  K+  +     S ++KRKA+ E     +W++LRK+VQ  +G  ERSR+ MDSLDYEA+
Sbjct: 1004 SAHKVYDETNPNISKSKKRKADGEKKNAIDWESLRKEVQRNSGKQERSRDRMDSLDYEAL 1063

Query: 2063 RTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYLLSIR 2242
            R A+V EIS+AIKERGMN++LAERMK+FLNRLV +H  IDLEWLRDV PD+AKDYLLSIR
Sbjct: 1064 RCANVKEISEAIKERGMNNMLAERMKDFLNRLVREHGSIDLEWLRDVPPDKAKDYLLSIR 1123

Query: 2243 GLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXILESIQ 2422
            GLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV                  +LESIQ
Sbjct: 1124 GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQ 1183

Query: 2423 KYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXXXXXXX 2602
            KYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+R EC              
Sbjct: 1184 KYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLAL 1243

Query: 2603 PGPQERHIVSSAAPT 2647
            PGP+E+ IVSS  PT
Sbjct: 1244 PGPEEKSIVSSTMPT 1258



 Score =  238 bits (607), Expect = 1e-59
 Identities = 163/383 (42%), Positives = 209/383 (54%), Gaps = 24/383 (6%)
 Frame = +2

Query: 224  NYPDMRQKFQQQHTLSQGHLCSESMLPV----TPQKFADTKITKSTTA--VVNRKESSST 385
            N   M     +QH  S+ H  +E M       +P  FA +  +K+     +    ++ + 
Sbjct: 363  NTQSMASNMPKQHNSSEKHPSTEKMGETNRLTSPDAFASSIPSKNCDLFPLTPPGKAPAP 422

Query: 386  RDPQP------IPINGNMNSPLHLVVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKK-F 544
             D QP      I +  N+ S     V       K          +  + K  G   K+ +
Sbjct: 423  VDRQPKTCHTNISVKKNLESAFGKSVSSEMDQAKLVQREAFLDNQQYSAKRGGPEIKQIY 482

Query: 545  AGNSVDDIINGMKHLRITS--SGKESALVPYKGDGAVVPYN---LVKKRKPRPRVDLDPE 709
               SVD+I +  K L I      ++ A+VPYK  G VVPY    L+KKRKPRP+VDLDPE
Sbjct: 483  PIPSVDEITHRFKDLNINQVQDQEQYAIVPYKQGGTVVPYEGFELIKKRKPRPKVDLDPE 542

Query: 710  TNRLWNLLMG---GQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKG 880
            TNR+WNLLMG   G+  E  D  KEKWWEEER++F+GR DSFIA+MHLVQGDRRFSKWKG
Sbjct: 543  TNRIWNLLMGKEAGEGLEETDKGKEKWWEEERRIFKGRADSFIARMHLVQGDRRFSKWKG 602

Query: 881  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSKS--ATAENGASPTKVGNHEVRITYPD 1054
            SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP KS   T     +   V   EV I   +
Sbjct: 603  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSNKRTCNIDGTNILVEEPEVCICANE 662

Query: 1055 GTTFHQKMAMEPVTGQSQVIATE-TSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
               +H+ +   P + QS +   E T   R+  +    KT L  +P     EE+II     
Sbjct: 663  SIQWHE-LLRHPGSSQSSITPHEPTEHQRVREMSGVGKTSL-PEPHGIGLEEEIISSQDS 720

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE 1300
                +LQ++  +RS SGSNS+AE
Sbjct: 721  LSSTILQSNGGIRSCSGSNSEAE 743


>ref|XP_006492175.1| PREDICTED: transcriptional activator DEMETER-like isoform X3 [Citrus
            sinensis]
          Length = 1958

 Score =  332 bits (852), Expect = 4e-88
 Identities = 164/255 (64%), Positives = 188/255 (73%)
 Frame = +2

Query: 1883 SPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDSLDYEAM 2062
            S  K+  +     S ++KRKA+ E     +W++LRK+VQ  +G  ERSR+ MDSLDYEA+
Sbjct: 1389 SAHKVYDETNPNISKSKKRKADGEKKNAIDWESLRKEVQRNSGKQERSRDRMDSLDYEAL 1448

Query: 2063 RTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYLLSIR 2242
            R A+V EIS+AIKERGMN++LAERMK FLNRLV +H  IDLEWLRDV PD+AKDYLLSIR
Sbjct: 1449 RCANVKEISEAIKERGMNNMLAERMKEFLNRLVREHGSIDLEWLRDVPPDKAKDYLLSIR 1508

Query: 2243 GLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXILESIQ 2422
            GLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV                  +LESIQ
Sbjct: 1509 GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQ 1568

Query: 2423 KYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXXXXXXX 2602
            KYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+R EC              
Sbjct: 1569 KYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLAL 1628

Query: 2603 PGPQERHIVSSAAPT 2647
            PGP+E+ IVSS  PT
Sbjct: 1629 PGPEEKSIVSSTMPT 1643



 Score =  235 bits (599), Expect = 9e-59
 Identities = 162/383 (42%), Positives = 206/383 (53%), Gaps = 24/383 (6%)
 Frame = +2

Query: 224  NYPDMRQKFQQQHTLSQGHLCSESMLPV----TPQKFADTKITKSTTAVV----NRKESS 379
            N   M     +QH  S+ H  +E M       +P  FA +  +K+          R  + 
Sbjct: 748  NTQSMASNMPKQHNSSEKHPSTEKMGETNRLTSPHAFASSIPSKNCDLFPLTPPGRAPAP 807

Query: 380  STRDPQP----IPINGNMNSPLHLVVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKK-F 544
              R P+     I +  N+ S     V       K          +  + K  G   K+ +
Sbjct: 808  VDRQPKTCHTNISVKKNLESAFGKSVSSEMDQAKLVQREAFLDNQQYSAKRGGPEIKQIY 867

Query: 545  AGNSVDDIINGMKHLRITS--SGKESALVPYKGDGAVVPYN---LVKKRKPRPRVDLDPE 709
               SVD+I +  K L I      ++ A+VPYK  G VVPY    L+KKRKPRP+VDLDPE
Sbjct: 868  PIPSVDEITHRFKDLNINQVQDQEQYAIVPYKQGGTVVPYEGFELIKKRKPRPKVDLDPE 927

Query: 710  TNRLWNLLMG---GQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKG 880
            TNR+WNLLMG   G+  E  D  KEKWWEEER++F+GR DSFIA+MHLVQGDR FSKWKG
Sbjct: 928  TNRIWNLLMGKEAGEGLEETDKGKEKWWEEERRIFKGRADSFIARMHLVQGDRCFSKWKG 987

Query: 881  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSKS--ATAENGASPTKVGNHEVRITYPD 1054
            SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP KS   T     +   V   EV I   +
Sbjct: 988  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSNKRTCNIDGTNILVEEPEVCIRANE 1047

Query: 1055 GTTFHQKMAMEPVTGQSQVIATE-TSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
               +H+ +   P + QS +   E T   R+  +    KT L  +P     EE+II     
Sbjct: 1048 SIQWHE-LLRHPGSSQSSITPHEPTEHQRVREMSGVGKTSL-PEPHGIGLEEEIISSQDS 1105

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE 1300
                +LQ++  +RS SGSNS+AE
Sbjct: 1106 LSSTILQSNVGIRSCSGSNSEAE 1128


>ref|XP_006492173.1| PREDICTED: transcriptional activator DEMETER-like isoform X1 [Citrus
            sinensis] gi|568878380|ref|XP_006492174.1| PREDICTED:
            transcriptional activator DEMETER-like isoform X2 [Citrus
            sinensis]
          Length = 2029

 Score =  332 bits (852), Expect = 4e-88
 Identities = 164/255 (64%), Positives = 188/255 (73%)
 Frame = +2

Query: 1883 SPDKLSSKIGVTTSPARKRKAEKETAEPFNWDTLRKQVQSKAGTTERSREAMDSLDYEAM 2062
            S  K+  +     S ++KRKA+ E     +W++LRK+VQ  +G  ERSR+ MDSLDYEA+
Sbjct: 1460 SAHKVYDETNPNISKSKKRKADGEKKNAIDWESLRKEVQRNSGKQERSRDRMDSLDYEAL 1519

Query: 2063 RTADVHEISDAIKERGMNHILAERMKNFLNRLVEDHERIDLEWLRDVEPDRAKDYLLSIR 2242
            R A+V EIS+AIKERGMN++LAERMK FLNRLV +H  IDLEWLRDV PD+AKDYLLSIR
Sbjct: 1520 RCANVKEISEAIKERGMNNMLAERMKEFLNRLVREHGSIDLEWLRDVPPDKAKDYLLSIR 1579

Query: 2243 GLGLKSVECIRLLTLHHLAFPVDTNVGRIAVRLGWVXXXXXXXXXXXXXXXXXXILESIQ 2422
            GLGLKSVEC+RLLTLHHLAFPVDTNVGRIAVRLGWV                  +LESIQ
Sbjct: 1580 GLGLKSVECVRLLTLHHLAFPVDTNVGRIAVRLGWVPLQPLPESLQLHLLELYPVLESIQ 1639

Query: 2423 KYLWPRLCKLDQETLYELHYQMITFGKVFCTKRDPNCNACPLRAECXXXXXXXXXXXXXX 2602
            KYLWPRLCKLDQ TLYELHYQ+ITFGKVFCTK  PNCNACP+R EC              
Sbjct: 1640 KYLWPRLCKLDQRTLYELHYQLITFGKVFCTKSKPNCNACPMRGECRHFASAFASARLAL 1699

Query: 2603 PGPQERHIVSSAAPT 2647
            PGP+E+ IVSS  PT
Sbjct: 1700 PGPEEKSIVSSTMPT 1714



 Score =  235 bits (599), Expect = 9e-59
 Identities = 162/383 (42%), Positives = 206/383 (53%), Gaps = 24/383 (6%)
 Frame = +2

Query: 224  NYPDMRQKFQQQHTLSQGHLCSESMLPV----TPQKFADTKITKSTTAVV----NRKESS 379
            N   M     +QH  S+ H  +E M       +P  FA +  +K+          R  + 
Sbjct: 819  NTQSMASNMPKQHNSSEKHPSTEKMGETNRLTSPHAFASSIPSKNCDLFPLTPPGRAPAP 878

Query: 380  STRDPQP----IPINGNMNSPLHLVVKKRTPYKKEPVXXXXXXXEPENKKTYGKSSKK-F 544
              R P+     I +  N+ S     V       K          +  + K  G   K+ +
Sbjct: 879  VDRQPKTCHTNISVKKNLESAFGKSVSSEMDQAKLVQREAFLDNQQYSAKRGGPEIKQIY 938

Query: 545  AGNSVDDIINGMKHLRITS--SGKESALVPYKGDGAVVPYN---LVKKRKPRPRVDLDPE 709
               SVD+I +  K L I      ++ A+VPYK  G VVPY    L+KKRKPRP+VDLDPE
Sbjct: 939  PIPSVDEITHRFKDLNINQVQDQEQYAIVPYKQGGTVVPYEGFELIKKRKPRPKVDLDPE 998

Query: 710  TNRLWNLLMG---GQSAETMDTNKEKWWEEERKVFRGRVDSFIAKMHLVQGDRRFSKWKG 880
            TNR+WNLLMG   G+  E  D  KEKWWEEER++F+GR DSFIA+MHLVQGDR FSKWKG
Sbjct: 999  TNRIWNLLMGKEAGEGLEETDKGKEKWWEEERRIFKGRADSFIARMHLVQGDRCFSKWKG 1058

Query: 881  SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPSKS--ATAENGASPTKVGNHEVRITYPD 1054
            SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFP KS   T     +   V   EV I   +
Sbjct: 1059 SVVDSVIGVFLTQNVSDHLSSSAFMSLAARFPLKSNKRTCNIDGTNILVEEPEVCIRANE 1118

Query: 1055 GTTFHQKMAMEPVTGQSQVIATE-TSTDRLDNVMPEKKTFLVNDPFTRRTEEDIIXXXXX 1231
               +H+ +   P + QS +   E T   R+  +    KT L  +P     EE+II     
Sbjct: 1119 SIQWHE-LLRHPGSSQSSITPHEPTEHQRVREMSGVGKTSL-PEPHGIGLEEEIISSQDS 1176

Query: 1232 XXXFVLQASEDVRSSSGSNSDAE 1300
                +LQ++  +RS SGSNS+AE
Sbjct: 1177 LSSTILQSNVGIRSCSGSNSEAE 1199

