BLASTX nr result

ID: Achyranthes22_contig00015634 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00015634
         (3566 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006472857.1| PREDICTED: histone-lysine N-methyltransferas...   832   0.0  
ref|XP_006472855.1| PREDICTED: histone-lysine N-methyltransferas...   832   0.0  
ref|XP_006434292.1| hypothetical protein CICLE_v10000005mg [Citr...   828   0.0  
gb|EOY16446.1| Histone methyltransferases(H3-K4 specific),histon...   803   0.0  
ref|XP_004292727.1| PREDICTED: histone-lysine N-methyltransferas...   795   0.0  
ref|XP_006581600.1| PREDICTED: histone-lysine N-methyltransferas...   794   0.0  
ref|XP_006578956.1| PREDICTED: histone-lysine N-methyltransferas...   793   0.0  
ref|XP_006578954.1| PREDICTED: histone-lysine N-methyltransferas...   793   0.0  
ref|XP_004502541.1| PREDICTED: histone-lysine N-methyltransferas...   793   0.0  
ref|XP_004502539.1| PREDICTED: histone-lysine N-methyltransferas...   793   0.0  
gb|EMJ23127.1| hypothetical protein PRUPE_ppa000056mg [Prunus pe...   792   0.0  
gb|ESW09471.1| hypothetical protein PHAVU_009G130100g [Phaseolus...   789   0.0  
ref|XP_002300965.2| hypothetical protein POPTR_0002s07930g [Popu...   788   0.0  
ref|XP_006365937.1| PREDICTED: histone-lysine N-methyltransferas...   783   0.0  
ref|XP_006357338.1| PREDICTED: histone-lysine N-methyltransferas...   776   0.0  
ref|XP_002520307.1| huntingtin interacting protein, putative [Ri...   734   0.0  
gb|EPS67389.1| hypothetical protein M569_07380, partial [Genlise...   718   0.0  
ref|XP_006390102.1| hypothetical protein EUTSA_v10017998mg [Eutr...   701   0.0  
ref|XP_006300643.1| hypothetical protein CARUB_v10019650mg [Caps...   700   0.0  
gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]            677   0.0  

>ref|XP_006472857.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X3
            [Citrus sinensis]
          Length = 2478

 Score =  832 bits (2150), Expect = 0.0
 Identities = 505/1073 (47%), Positives = 613/1073 (57%), Gaps = 59/1073 (5%)
 Frame = +3

Query: 21   DVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEAT 200
            D+ K++        + S  ++   G  E    P  AWV CDDC KWR IP  +AD+I+  
Sbjct: 1391 DIGKTDSGNNSMSVDVSNAEITSGGEPEHYCPPESAWVRCDDCYKWRRIPVSVADLIDE- 1449

Query: 201  NCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISDEEDARDGHLGFKGSGLKPLLASQP 380
            NCRW CKDN D  F DCSI QEK+NA+INAEL +SD E+  DG + +  SG      S P
Sbjct: 1450 NCRWVCKDNMDTTFADCSIPQEKTNADINAELGLSDYEE-EDGLINYNTSGKGLDFQSTP 1508

Query: 381  -STLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPC 557
             S+   + SN+FLHRSRK QT+DE+MVCHCKPP+DG LGC   CLNRMLNIECVQGTCPC
Sbjct: 1509 GSSFRRIDSNVFLHRSRKTQTIDEVMVCHCKPPLDGRLGCRDECLNRMLNIECVQGTCPC 1568

Query: 558  GDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKE 737
            GDLCSNQQFQ+++YAK+ W  CGKKGYGL+ LEDI  G F+IEY+GEVLD+Q YEARQKE
Sbjct: 1569 GDLCSNQQFQKRKYAKMQWRPCGKKGYGLESLEDILTGKFIIEYIGEVLDMQAYEARQKE 1628

Query: 738  YASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRS 917
            YA+ GHKHFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFA+R 
Sbjct: 1629 YAANGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWLVNGEICIGLFAMRD 1688

Query: 918  IKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXX 1097
            IK+GEELTFDYNYVRVFGAAAKKCHCGS +CRGYIGGDP N EII QGDSD         
Sbjct: 1689 IKEGEELTFDYNYVRVFGAAAKKCHCGSPQCRGYIGGDPLNTEIIYQGDSDEEYPEPLML 1748

Query: 1098 XXXGDIDHSLGD---MMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECG 1268
                  D   GD    M++ S    +R                             V   
Sbjct: 1749 E-----DGETGDGFKTMSRTSPFYGDRTQISEAIAEDTNKMDDSATAVGQLEISGNVNDS 1803

Query: 1269 PIQSIPGKDSVDEVRESTDS----------SSGFQHDHWRSEPASSVKIHTSSTEHIIGT 1418
              QSIP    +    E  DS           +    ++  S P SSV+   +  +     
Sbjct: 1804 KSQSIPVIPQLLHSLEREDSKGKCPLLQSLETSLVVENESSIPVSSVQQKETMNKTSSVI 1863

Query: 1419 PTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPK-SVKNRKSS 1595
            P        L+  N           S S + E D +  P S  R+K SR   S+K  K  
Sbjct: 1864 PQVETSLPALISGNLFTDGSDAGRKSKSDIVE-DNQSLPKSHPRIKTSRKSGSIKKGKVD 1922

Query: 1596 GISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASK 1775
            G   +  +V S   + +VF  KPK+++EGS+NG  + V+E LNELLDA+GGI K+KDA K
Sbjct: 1923 GSPLSGNKVKSVASKSQVFFIKPKKIMEGSSNGRFEAVQEKLNELLDAEGGISKRKDAPK 1982

Query: 1776 GYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMK 1955
            GYLKLLLLTAASG SGNG +IQSNR LSMILDA+LKTKSR VL+DIINKNGLQMLHNM+K
Sbjct: 1983 GYLKLLLLTAASGGSGNGESIQSNRDLSMILDALLKTKSRVVLMDIINKNGLQMLHNMIK 2042

Query: 1956 LYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQV 2135
             YRRDFKKIPILRKLLKVLE+LA +EILT   I      PG+ESFR SILS TEHDD QV
Sbjct: 2043 QYRRDFKKIPILRKLLKVLEYLAVREILTRNHITAGPPCPGMESFRGSILSLTEHDDKQV 2102

Query: 2136 HQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGSC-------HHWREQGAIAV 2294
            HQIARSFRDRWIP   R H                     +C       +H R++     
Sbjct: 2103 HQIARSFRDRWIPKPFRKH-----SYKDRDDSGMDIHRVANCNRLPMLHNHRRDESLRPS 2157

Query: 2295 ERLTCSDQSVSMSNLVDARSEEA-SSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRSPK 2471
            E + C  QS+     VD+ + EA SSP     QT   + RKRKSRWDQPA         K
Sbjct: 2158 EAIDCVMQSLVAKTSVDSAANEAGSSPGAGGCQTNGPKVRKRKSRWDQPAETNLDSIKHK 2217

Query: 2472 VDYSEGGQRTLTYEAKKEESNCSGDQNILRNRD-----EDMMHNLDDEVPPGF------- 2615
                E   R L     +E+ NC    +   N+D     ED      ++VPPGF       
Sbjct: 2218 KLMLE--SRVL---PSREDINCPDHIHNHCNKDEAVSSEDGGQITQEDVPPGFSSPFNPP 2272

Query: 2616 ---SHLHDSTDDTPPGFSSVLC--PVPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGT 2780
               S    +TD +    S + C   V + H Q +FN  LPV+YGIP  ++Q+ G+ Q  T
Sbjct: 2273 LVSSDSSSTTDLSQQNVSQLRCAFDVAIAHPQGKFNSRLPVSYGIPLHILQQFGSSQAET 2332

Query: 2781 ADTWEIAXXXXXXXXXXXXXXXXXXXXT------------GDQEEARQGYQ---THAGDQ 2915
             D+W IA                    T            G  EE +Q      +   D+
Sbjct: 2333 VDSWVIAPSMPFHPFPPLPPFPRDKKDTPPASAVSCKTIDGPAEEWQQDSNHGPSCCPDE 2392

Query: 2916 GLPCTSGASKNTV---GTN-QNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPW 3062
              P  +GA+++     GT+ Q+     RG  N LGKR + Q+   G    PPW
Sbjct: 2393 DNPSMTGANQSDADIPGTDGQHTFKRMRGSSNDLGKRYFRQQKRKG----PPW 2441


>ref|XP_006472855.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X1
            [Citrus sinensis] gi|568837690|ref|XP_006472856.1|
            PREDICTED: histone-lysine N-methyltransferase ASHH2-like
            isoform X2 [Citrus sinensis]
          Length = 2483

 Score =  832 bits (2150), Expect = 0.0
 Identities = 505/1073 (47%), Positives = 613/1073 (57%), Gaps = 59/1073 (5%)
 Frame = +3

Query: 21   DVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEAT 200
            D+ K++        + S  ++   G  E    P  AWV CDDC KWR IP  +AD+I+  
Sbjct: 1396 DIGKTDSGNNSMSVDVSNAEITSGGEPEHYCPPESAWVRCDDCYKWRRIPVSVADLIDE- 1454

Query: 201  NCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISDEEDARDGHLGFKGSGLKPLLASQP 380
            NCRW CKDN D  F DCSI QEK+NA+INAEL +SD E+  DG + +  SG      S P
Sbjct: 1455 NCRWVCKDNMDTTFADCSIPQEKTNADINAELGLSDYEE-EDGLINYNTSGKGLDFQSTP 1513

Query: 381  -STLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPC 557
             S+   + SN+FLHRSRK QT+DE+MVCHCKPP+DG LGC   CLNRMLNIECVQGTCPC
Sbjct: 1514 GSSFRRIDSNVFLHRSRKTQTIDEVMVCHCKPPLDGRLGCRDECLNRMLNIECVQGTCPC 1573

Query: 558  GDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKE 737
            GDLCSNQQFQ+++YAK+ W  CGKKGYGL+ LEDI  G F+IEY+GEVLD+Q YEARQKE
Sbjct: 1574 GDLCSNQQFQKRKYAKMQWRPCGKKGYGLESLEDILTGKFIIEYIGEVLDMQAYEARQKE 1633

Query: 738  YASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRS 917
            YA+ GHKHFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFA+R 
Sbjct: 1634 YAANGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWLVNGEICIGLFAMRD 1693

Query: 918  IKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXX 1097
            IK+GEELTFDYNYVRVFGAAAKKCHCGS +CRGYIGGDP N EII QGDSD         
Sbjct: 1694 IKEGEELTFDYNYVRVFGAAAKKCHCGSPQCRGYIGGDPLNTEIIYQGDSDEEYPEPLML 1753

Query: 1098 XXXGDIDHSLGD---MMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECG 1268
                  D   GD    M++ S    +R                             V   
Sbjct: 1754 E-----DGETGDGFKTMSRTSPFYGDRTQISEAIAEDTNKMDDSATAVGQLEISGNVNDS 1808

Query: 1269 PIQSIPGKDSVDEVRESTDS----------SSGFQHDHWRSEPASSVKIHTSSTEHIIGT 1418
              QSIP    +    E  DS           +    ++  S P SSV+   +  +     
Sbjct: 1809 KSQSIPVIPQLLHSLEREDSKGKCPLLQSLETSLVVENESSIPVSSVQQKETMNKTSSVI 1868

Query: 1419 PTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPK-SVKNRKSS 1595
            P        L+  N           S S + E D +  P S  R+K SR   S+K  K  
Sbjct: 1869 PQVETSLPALISGNLFTDGSDAGRKSKSDIVE-DNQSLPKSHPRIKTSRKSGSIKKGKVD 1927

Query: 1596 GISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASK 1775
            G   +  +V S   + +VF  KPK+++EGS+NG  + V+E LNELLDA+GGI K+KDA K
Sbjct: 1928 GSPLSGNKVKSVASKSQVFFIKPKKIMEGSSNGRFEAVQEKLNELLDAEGGISKRKDAPK 1987

Query: 1776 GYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMK 1955
            GYLKLLLLTAASG SGNG +IQSNR LSMILDA+LKTKSR VL+DIINKNGLQMLHNM+K
Sbjct: 1988 GYLKLLLLTAASGGSGNGESIQSNRDLSMILDALLKTKSRVVLMDIINKNGLQMLHNMIK 2047

Query: 1956 LYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQV 2135
             YRRDFKKIPILRKLLKVLE+LA +EILT   I      PG+ESFR SILS TEHDD QV
Sbjct: 2048 QYRRDFKKIPILRKLLKVLEYLAVREILTRNHITAGPPCPGMESFRGSILSLTEHDDKQV 2107

Query: 2136 HQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGSC-------HHWREQGAIAV 2294
            HQIARSFRDRWIP   R H                     +C       +H R++     
Sbjct: 2108 HQIARSFRDRWIPKPFRKH-----SYKDRDDSGMDIHRVANCNRLPMLHNHRRDESLRPS 2162

Query: 2295 ERLTCSDQSVSMSNLVDARSEEA-SSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRSPK 2471
            E + C  QS+     VD+ + EA SSP     QT   + RKRKSRWDQPA         K
Sbjct: 2163 EAIDCVMQSLVAKTSVDSAANEAGSSPGAGGCQTNGPKVRKRKSRWDQPAETNLDSIKHK 2222

Query: 2472 VDYSEGGQRTLTYEAKKEESNCSGDQNILRNRD-----EDMMHNLDDEVPPGF------- 2615
                E   R L     +E+ NC    +   N+D     ED      ++VPPGF       
Sbjct: 2223 KLMLE--SRVL---PSREDINCPDHIHNHCNKDEAVSSEDGGQITQEDVPPGFSSPFNPP 2277

Query: 2616 ---SHLHDSTDDTPPGFSSVLC--PVPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGT 2780
               S    +TD +    S + C   V + H Q +FN  LPV+YGIP  ++Q+ G+ Q  T
Sbjct: 2278 LVSSDSSSTTDLSQQNVSQLRCAFDVAIAHPQGKFNSRLPVSYGIPLHILQQFGSSQAET 2337

Query: 2781 ADTWEIAXXXXXXXXXXXXXXXXXXXXT------------GDQEEARQGYQ---THAGDQ 2915
             D+W IA                    T            G  EE +Q      +   D+
Sbjct: 2338 VDSWVIAPSMPFHPFPPLPPFPRDKKDTPPASAVSCKTIDGPAEEWQQDSNHGPSCCPDE 2397

Query: 2916 GLPCTSGASKNTV---GTN-QNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPW 3062
              P  +GA+++     GT+ Q+     RG  N LGKR + Q+   G    PPW
Sbjct: 2398 DNPSMTGANQSDADIPGTDGQHTFKRMRGSSNDLGKRYFRQQKRKG----PPW 2446


>ref|XP_006434292.1| hypothetical protein CICLE_v10000005mg [Citrus clementina]
            gi|557536414|gb|ESR47532.1| hypothetical protein
            CICLE_v10000005mg [Citrus clementina]
          Length = 2461

 Score =  828 bits (2138), Expect = 0.0
 Identities = 508/1077 (47%), Positives = 617/1077 (57%), Gaps = 63/1077 (5%)
 Frame = +3

Query: 21   DVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEAT 200
            D+ K++        + S  ++   G  E    P  AWV CDDC KWR IP  +AD+I+  
Sbjct: 1374 DIGKTDSGNNSMSVDVSNAEITSAGEPEHYCPPESAWVRCDDCYKWRRIPVSVADLIDE- 1432

Query: 201  NCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISDEEDARDGHLGFKGSGLKPLLASQP 380
            NCRW CKDN D  F DCSI QEK+NA+INAEL +SD E+  DG + +  SG      S P
Sbjct: 1433 NCRWVCKDNMDTTFADCSIPQEKTNADINAELGLSDYEE-EDGLINYNTSGKGLDFQSTP 1491

Query: 381  -STLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPC 557
             S+   + SN+FLHRSRK QT+DE+MVCHCKPP+D  LGC   CLNRMLNIECVQGTCPC
Sbjct: 1492 GSSFRRIDSNVFLHRSRKTQTIDEVMVCHCKPPLDVRLGCRDECLNRMLNIECVQGTCPC 1551

Query: 558  GDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKE 737
            GDLCSNQQFQ+++YAK+ W  CGKKGYGL+ LEDI  G F+IEYVGEVLD+Q YEARQKE
Sbjct: 1552 GDLCSNQQFQKRKYAKMQWRPCGKKGYGLESLEDIPIGKFIIEYVGEVLDMQAYEARQKE 1611

Query: 738  YASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRS 917
            YA+ GHKHFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFA+R 
Sbjct: 1612 YAANGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFAMRD 1671

Query: 918  IKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXX 1097
            IK+GEELTFDYNYVRVFGAAAKKCHCGS +CRGYIGGDP N EII QGDSD         
Sbjct: 1672 IKEGEELTFDYNYVRVFGAAAKKCHCGSPQCRGYIGGDPLNTEIIYQGDSDEEYPEPLML 1731

Query: 1098 XXXGDIDHSLGD---MMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECG 1268
                  D   GD    M++ S    +R                             V   
Sbjct: 1732 E-----DAETGDGFKTMSRTSPFYGDRTQISEAMAEDTNKMDDSATAVGQLEISGNVNDS 1786

Query: 1269 PIQSIPGKDSVDEVRESTDS----------SSGFQHDHWRSEPASSVKIHTSSTEHIIGT 1418
              QSIP    +    E  DS           +    ++  S P SSV+      E +  T
Sbjct: 1787 KSQSIPVIPQLHHSLEREDSKGKCPPLQSLETSLVVENESSIPVSSVQ----QKETMNKT 1842

Query: 1419 PTSSPKSDVLLVENASQKSLCGSIDSASR-VFEV--DTECEPHSLTRMKISRPK-SVKNR 1586
             +  P+ +  L    S        D+  +  F++  D +  P S  R+K SR   S+K  
Sbjct: 1843 SSVIPQVETSLPALISGNLFTDGSDAGRKSKFDIVEDNQSLPKSHPRIKTSRKSGSIKKG 1902

Query: 1587 KSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKD 1766
            K  G   +  +V S   + +VF  KPK+++EGS+NG  + V+E LNELLDA+GGI K+KD
Sbjct: 1903 KVDGSPLSGNKVKSIASKSQVFFIKPKKIMEGSSNGRFEAVQEKLNELLDAEGGISKRKD 1962

Query: 1767 ASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHN 1946
            A KGYLKLLLLTAASG SGNG +IQSNR LSMILDA+LKTKSR VL+DIINKNGLQMLHN
Sbjct: 1963 APKGYLKLLLLTAASGGSGNGESIQSNRDLSMILDALLKTKSRVVLMDIINKNGLQMLHN 2022

Query: 1947 MMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDD 2126
            M+K YRRDFKKIPILRKLLKVLE+LA +EILT   I      PG+ESFR SILS TEHDD
Sbjct: 2023 MIKQYRRDFKKIPILRKLLKVLEYLAVREILTRNHITAGPPCPGMESFRGSILSLTEHDD 2082

Query: 2127 NQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGSC-------HHWREQGA 2285
             QVHQIARSFRDRWIP   R H                     +C       +H R++  
Sbjct: 2083 KQVHQIARSFRDRWIPKPFRKH-----SYKDRDDSGMDIHRVANCNRLPMLHNHRRDESL 2137

Query: 2286 IAVERLTCSDQSVSMSNLVD-ARSEEASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGR 2462
               E + C  QS+     VD A +E  SSP     QT   + RKRKSRWDQPA       
Sbjct: 2138 RPSEAIDCVMQSLVAKTSVDTAANEVGSSPGAGGCQTNGPKVRKRKSRWDQPAETNLDPI 2197

Query: 2463 SPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRD-----EDMMHNLDDEVPPGF---- 2615
              K    E   R L     +E+ NC    +   N+D     ED      ++VPPGF    
Sbjct: 2198 KHKKLMLE--SRVL---PSREDINCPDHIHNHCNKDEAVSSEDGGQITQEDVPPGFSSPF 2252

Query: 2616 ------SHLHDSTDDTPPGFSSVLC--PVPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQ 2771
                  S    +TD +    S + C   V + H Q +FN  LPV+YGIP  ++Q+ G+ Q
Sbjct: 2253 NPPLVSSDSSSTTDLSQQNVSQLRCAFDVAIAHPQGKFNSRLPVSYGIPLHILQQFGSSQ 2312

Query: 2772 CGTADTWEIAXXXXXXXXXXXXXXXXXXXXT------------GDQEEARQGYQTHA--- 2906
              T D+W IA                    T            G  EE +Q    HA   
Sbjct: 2313 AETVDSWVIAPSMPFHPFPPLPPFPRDKKDTPPASAVSCKTIDGPAEEWQQD-SNHAPPC 2371

Query: 2907 -GDQGLPCTSGASKNTV---GTN-QNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPW 3062
              D+  P  +GA+++     GT+ Q+     RG  N LGKR + Q+   G    PPW
Sbjct: 2372 CPDEDNPSMTGANQSDADIPGTDGQHTFKRMRGSSNDLGKRYFRQQKRKG----PPW 2424


>gb|EOY16446.1| Histone methyltransferases(H3-K4 specific),histone
            methyltransferases(H3-K36 specific), putative isoform 1
            [Theobroma cacao] gi|508724550|gb|EOY16447.1| Histone
            methyltransferases(H3-K4 specific),histone
            methyltransferases(H3-K36 specific), putative isoform 1
            [Theobroma cacao] gi|508724551|gb|EOY16448.1| Histone
            methyltransferases(H3-K4 specific),histone
            methyltransferases(H3-K36 specific), putative isoform 1
            [Theobroma cacao]
          Length = 2265

 Score =  803 bits (2075), Expect = 0.0
 Identities = 505/1083 (46%), Positives = 620/1083 (57%), Gaps = 60/1083 (5%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLA 182
            GN    ++  S  S  +AL +   +DL+  G  EQ   P  AWV CDDC KWR IP  L 
Sbjct: 1155 GNHISDNIEISNTSNSIALADMINVDLVSDGTMEQCTQPDNAWVRCDDCHKWRRIPVALV 1214

Query: 183  DVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD-EEDARDGHLGFK----G 347
              I+   CRW C DN DKAF DCSI QEKSNA+INA+L ISD EED  DG L +K    G
Sbjct: 1215 KSIDEA-CRWVCGDNVDKAFADCSIPQEKSNADINADLGISDAEEDGCDG-LNYKELEKG 1272

Query: 348  SGLKPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLN 527
               K +     S    + SN FLHR RK QT+DEIMVCHCK P DG LGCG  CLNRMLN
Sbjct: 1273 FESKHMTVPPTSHFWRIDSNWFLHRGRKTQTIDEIMVCHCKRPPDGKLGCGDECLNRMLN 1332

Query: 528  IECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLD 707
            IECVQGTCPCGDLCSNQQFQ+++YAK+ W R G+KG+GL++LEDIS   FLIEYVGEVLD
Sbjct: 1333 IECVQGTCPCGDLCSNQQFQKRKYAKMKWDRFGRKGFGLRMLEDISASQFLIEYVGEVLD 1392

Query: 708  LQTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGE 887
            +Q YEARQKEYAS+G +HFYFMTLNG+EVIDA  KGNLGRF+NHSCDPNCRTEKW+VNGE
Sbjct: 1393 MQAYEARQKEYASRGQRHFYFMTLNGSEVIDAYVKGNLGRFINHSCDPNCRTEKWMVNGE 1452

Query: 888  ICVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDS 1067
            IC+GLFALR IK+GEE+TFDYNYVRVFGAAAKKCHCGS  CRGYIGGD  +AE IV  DS
Sbjct: 1453 ICIGLFALRDIKQGEEVTFDYNYVRVFGAAAKKCHCGSPHCRGYIGGDLLSAEEIVHDDS 1512

Query: 1068 DXXXXXXXXXXXXGDIDHSLGDMMAKASS------QDVERVVNXXXXXXXXXXXXXXXXX 1229
            D            G+  +   ++++++SS      Q VE VV                  
Sbjct: 1513 D-EESPEPMMLEDGETWNGSDNIISRSSSFDGAEMQSVESVVT-----------DGVIKL 1560

Query: 1230 XXXXXXXXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHI 1409
                     V      +   K SV+    + +     + +      A+     T+  + +
Sbjct: 1561 ENRPEAEDSVNRSASVTSQLKSSVETEYLNGNFQLSIKPEEVLPAMAAVQPDSTTGKKAL 1620

Query: 1410 IGTPTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDT----ECEPHSLTRMKISR-PKS 1574
              T  S  K D  L  N     L   +  A++  + DT    +  P S   MK SR   S
Sbjct: 1621 NRTSCSIQKLDTSL--NILDNKLPTDVVDANKKSKFDTAEDKQVPPKSRPLMKTSRSSSS 1678

Query: 1575 VKNRKSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGIC 1754
            +K  K S  S    +V   + + +V S KPKRL E S+N   + VEE LNELLD DGGI 
Sbjct: 1679 IKKGKISSNSLNGHKVQITSTKSQVPSVKPKRLSENSSNCRFEAVEEKLNELLDCDGGIT 1738

Query: 1755 KKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQ 1934
            K+KDASKGYLKLLLLTA SGDSGNG  IQSNR LSMILDA+LKTKSR VL DIINKNGLQ
Sbjct: 1739 KRKDASKGYLKLLLLTATSGDSGNGETIQSNRDLSMILDALLKTKSRLVLTDIINKNGLQ 1798

Query: 1935 MLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFT 2114
            MLHN+MK YR DFKKIPILRKLLKVLE+LA +EILT++ I       G +SFRESILS T
Sbjct: 1799 MLHNIMKKYRSDFKKIPILRKLLKVLEYLAMREILTLDHIIGGPSCAGRQSFRESILSLT 1858

Query: 2115 EHDDNQVHQIARSFRDRWIPHARR--FHXXXXXXXXXXXXXXXXXXHFGSCHHWREQGAI 2288
            EHDD QVHQIAR+FRDRWIP   R   +                     S +HWREQ   
Sbjct: 1859 EHDDKQVHQIARNFRDRWIPKPVRKLSYRDKDEGKMEFHRGLDCNRVPASNNHWREQAIR 1918

Query: 2289 AVERLTCSDQSVSMSNLVDARSEE-ASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTG-R 2462
              E ++C  QSV  +  VD  S E  SS      QT  T+ RKRKSRWDQPA  +K G R
Sbjct: 1919 PTEAISCVMQSVVATTSVDTASREGCSSSSTGVCQTNSTKIRKRKSRWDQPAETEKIGSR 1978

Query: 2463 SP-KVDYS----------EGGQRTLTYEAKKEESNCSGDQ-NILRNRDEDMMHNLDDEVP 2606
            SP K+ YS          +   +    + +  +  C G+  N+   R     H+  ++VP
Sbjct: 1979 SPKKLQYSPLPVLVESTPDHIDKMSQGDKECRDCVCKGEAINVDNGR-----HSFQEDVP 2033

Query: 2607 PGFSHLHDST--DDTPPGFS-------SVLCP-VPLGHLQTRFNPHLPVAYGIPFSVVQK 2756
            PGFS   +++    T P  +        + CP V +   Q RF   LPV+YGIP  ++Q+
Sbjct: 2034 PGFSSPPNASLVSSTAPSTAIEFPKPYQLKCPDVIIALPQKRFISRLPVSYGIPLPILQQ 2093

Query: 2757 SGTPQCGTADTWEIAXXXXXXXXXXXXXXXXXXXXT---------GDQEEARQGYQ---- 2897
             G+PQ    ++W IA                    T         G  E+A +G +    
Sbjct: 2094 FGSPQGECVESWIIAPGMPFHPFPPLPPCPRDKKDTRPACTANSIGIDEDAEEGQRDSNR 2153

Query: 2898 --THAGDQGLPCTSGASK---NTVGTNQNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPW 3062
              T   D+ +PC +G ++   +  GTN             LGK+ + Q+   G    PPW
Sbjct: 2154 PATSYPDENIPCMAGGNQPDPDIPGTNIQQTFKRMRESYDLGKKYFRQQKRKG----PPW 2209

Query: 3063 ARN 3071
             ++
Sbjct: 2210 HKS 2212


>ref|XP_004292727.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Fragaria
            vesca subsp. vesca]
          Length = 2112

 Score =  795 bits (2054), Expect = 0.0
 Identities = 492/1103 (44%), Positives = 620/1103 (56%), Gaps = 77/1103 (6%)
 Frame = +3

Query: 21   DVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEAT 200
            D + S  ++ + + N   LD +  G+ +Q + PR AWV CD C KWR IPA LAD I+ T
Sbjct: 1038 DANSSHVAECIGVPN---LDAVPVGLDKQYIPPRNAWVLCDACNKWRRIPAELADFIDET 1094

Query: 201  NCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLA 371
             C WTC++NQD+ F DCSI QEKSNAEINAEL+ISD   EEDA    L +K    +    
Sbjct: 1095 KCTWTCRENQDRDFADCSIPQEKSNAEINAELEISDASGEEDASGTRLHYKTLECRRPSV 1154

Query: 372  SQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTC 551
            SQ +    +K+N FLHR+RK Q++DEIMVCHCKPP +G LGCG  CLNRMLNIECV+GTC
Sbjct: 1155 SQQNVAS-IKTNQFLHRNRKNQSIDEIMVCHCKPPKEGQLGCGEDCLNRMLNIECVRGTC 1213

Query: 552  PCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQ 731
            PC DLCSNQQFQ+++Y+KL   RCGKKG+GL+ LE I +G FLIEYVGEVLD   YEARQ
Sbjct: 1214 PCRDLCSNQQFQKRRYSKLEKFRCGKKGFGLRSLEYIRKGQFLIEYVGEVLDTHAYEARQ 1273

Query: 732  KEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFAL 911
            KEYA KGH+HFYFMTLN +EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGE+C+GLFAL
Sbjct: 1274 KEYAVKGHRHFYFMTLNTSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEVCIGLFAL 1333

Query: 912  RSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXX 1091
            R IKKGEE+TFDYN+VRV GAAAKKCHCGS +C+GYIGGDP N EIIVQ DSD       
Sbjct: 1334 RDIKKGEEVTFDYNFVRVIGAAAKKCHCGSPQCQGYIGGDPLNTEIIVQDDSDEEYVEPV 1393

Query: 1092 XXXXXG--------------DIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXX 1229
                 G               +DH  G ++    S    + ++                 
Sbjct: 1394 MIPEDGVAEDSRGSAEARLDSLDHQYGAIIQHEESASTNKEIDRSTISVCKLDITMQRKE 1453

Query: 1230 XXXXXXXXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHI 1409
                         P+ S      V +  E   S         RS P    ++        
Sbjct: 1454 SENQYSLELQH--PLPSFVQPVEVFQPTEDVTS---------RSTPVIQQQVFRE----- 1497

Query: 1410 IGTPTSSPKS----DVLLVENASQKSLCGSIDS----ASRVFEVDTECEPHSLTRM---- 1553
            IGT   S  S    ++        K L   ID+    +++  +V+T  +   L+++    
Sbjct: 1498 IGTAEKSSNSCERPEITSPIKVISKPLSDDIDAPASDSNKNSKVNTFEDEQLLSKVHRNV 1557

Query: 1554 KISRPKS-VKNRKSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNEL 1730
            K S   S VK  K         ++     +  V   KPKR +EGS       VEE LNEL
Sbjct: 1558 KTSHSSSFVKKGKVRSTPLNTNKIQVVANKSHVLPFKPKRSIEGS-------VEEKLNEL 1610

Query: 1731 LDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLD 1910
            LD DGGI K+KD++KGYLKLL LTA SGDSG+G AI+SNR LS+ILDA+LKTKSR VL+D
Sbjct: 1611 LDTDGGISKRKDSAKGYLKLLFLTAQSGDSGSGEAIKSNRDLSIILDALLKTKSRTVLID 1670

Query: 1911 IINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEK-EILTVERINVDSLHPGVES 2087
            IINKNGL+MLHN+MK+ RRDF KIPILRKLLKVLE+LAEK +ILT E I      PG+ES
Sbjct: 1671 IINKNGLRMLHNIMKMCRRDFNKIPILRKLLKVLEYLAEKPQILTQEHITGGPPCPGMES 1730

Query: 2088 FRESILSFTEHDDNQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXH-FGSCH 2264
            F ESILS TEH D +VH IAR+FR+RWIP A R H                  + F + H
Sbjct: 1731 FTESILSLTEHGDKRVHDIARNFRNRWIPKALRRHCFVDRDDGKMEFNRSSNYNRFPTSH 1790

Query: 2265 -HWREQGAIAVERLTCSDQSV--SMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQ 2435
             +WR+Q   + E    + QSV  +  +      + AS+P      T  T+ RKRKSRWDQ
Sbjct: 1791 DNWRDQTGRSTEVADSAKQSVVKTPPSASTVTQDGASTPCTGGCTTTETKVRKRKSRWDQ 1850

Query: 2436 PASPKKTGRSPKVDYSEGGQRTLTYEAKKEESNC---SGDQNIL--------RNRDEDMM 2582
            PA      +S     +     +  +  K+++ NC    GD  +L         N    ++
Sbjct: 1851 PAVTVPDSKSRWDQPAVTCPDSSLHPNKEQKINCKQLEGDATLLPENQSREGGNCSSTVL 1910

Query: 2583 HNLD----DEVPPGFSHLHDSTDDTPPGFSSVL-CPV--------PLGHLQTRFNPHLPV 2723
            H  +    D V  G  ++    DD PPGFSS L  PV         +GH Q +F   LPV
Sbjct: 1911 HICEQVGADVVYAGKQNI---LDDAPPGFSSCLNTPVVSYLSTSSVIGHPQAKFVSRLPV 1967

Query: 2724 AYGIPFSVVQKSGTPQCGTADTWEIAXXXXXXXXXXXXXXXXXXXXTGDQ------EEAR 2885
            +YGIP S++Q+ GTP   TADTW +A                               +A 
Sbjct: 1968 SYGIPLSIMQQYGTPHAETADTWVVAPGMPFHPFPPLPPCPRHKKDPSHDVRHASVNQAS 2027

Query: 2886 QGYQTHA------GDQGLPCTSGASKNTVGT----NQNMVHHER--GPRNFLGKRCYWQK 3029
            +G Q          ++  P T+G ++   GT    NQ+ +  ER       LG+R + Q+
Sbjct: 2028 EGQQASCDTTNCHSEESTPSTTGVTQADSGTPCANNQSGIKRERESSYEAPLGRRYFKQQ 2087

Query: 3030 NWNGSKRRPPWARNIGRWGFRGN 3098
             WN  K RPPW R+   WG  GN
Sbjct: 2088 KWNHPKLRPPWMRDRTGWGCNGN 2110


>ref|XP_006581600.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X2
            [Glycine max] gi|571460083|ref|XP_003527954.2| PREDICTED:
            histone-lysine N-methyltransferase ASHH2-like isoform X1
            [Glycine max]
          Length = 2040

 Score =  794 bits (2050), Expect = 0.0
 Identities = 469/955 (49%), Positives = 583/955 (61%), Gaps = 42/955 (4%)
 Frame = +3

Query: 63   NTSCLDLIR-TGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKA 239
            N S LD++   G  EQ L PR AWV CDDC KWR IPAVLAD I+ TNC WTCKD+ DKA
Sbjct: 1001 NLSNLDMLSGVGYGEQLLSPRNAWVRCDDCHKWRRIPAVLADRIDETNCTWTCKDSSDKA 1060

Query: 240  FGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLASQPSTLMLVKSNL 410
            F DC+I QEKSNAEINAEL +SD   EEDA +G   FK    +P L SQ ST   + +N 
Sbjct: 1061 FADCAIPQEKSNAEINAELGLSDASGEEDAYEGSKNFKELEYRPPLVSQESTFTHILTNE 1120

Query: 411  FLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQR 590
            FLHRS K QT+DEIMVCHCKP  +G LGCG  CLNR+LNIECVQGTCPCGD CSNQQFQ+
Sbjct: 1121 FLHRSHKTQTIDEIMVCHCKPSQEGKLGCGDECLNRILNIECVQGTCPCGDRCSNQQFQK 1180

Query: 591  KQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYF 770
             +YA L W +CGKKGYGL+ +E++++G FLIEYVGEVLD+Q YEARQ+EYA KGH+HFYF
Sbjct: 1181 HKYASLKWFKCGKKGYGLKAIENVAQGQFLIEYVGEVLDMQAYEARQREYALKGHRHFYF 1240

Query: 771  MTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDY 950
            MTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFALR IKK EELTFDY
Sbjct: 1241 MTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRDIKKDEELTFDY 1300

Query: 951  NYVRVFGAAAKKCHCGSRKCRGYI-GGDPHNAEIIVQGDSDXXXXXXXXXXXXGDIDHSL 1127
            NYVRVFGAAAKKC+CGS  CRGYI GGDP NAE+IVQ DS+            G+I+ S+
Sbjct: 1301 NYVRVFGAAAKKCYCGSPNCRGYIGGGDPLNAELIVQSDSEEEFPEPVMLTKDGEIEDSV 1360

Query: 1128 --GDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPIQSIPGKDSV 1301
               +      +Q  + ++                           +      +I    S+
Sbjct: 1361 PTPEYFNNVDTQSAKHMLK-----------------------DRDILDNSTTAIDSDGSL 1397

Query: 1302 DEVRESTDSS------SGFQHDHWRSEPASSVKIHTSSTEHIIGTPTSSPKSDV-----L 1448
            ++ R    +S      S  + +  + +  SSV++   S +  +   TS P   V      
Sbjct: 1398 EKERSMNPASAVSLLHSSAEMEDSKGKLQSSVQVEEISQQ--MEDVTSKPMPAVHQGYEK 1455

Query: 1449 LVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPKSVKNRKSSGISATV--GRV 1622
              E A + S    +D+ S +  V ++  P+S    + S+ + +  RK+  +  +V  G+V
Sbjct: 1456 ESEFADKTSSIQRLDTTSPLTTV-SKMLPNSAGSNRESKSEIIGGRKTPKLKGSVKKGKV 1514

Query: 1623 LS------KTQ----RPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDAS 1772
             +      KT+    R +V S K K+ +EGS+NG  + V+E LNELLD DGGI K+KDA+
Sbjct: 1515 HANPPNGLKTEVTANRLQVPSIKHKK-VEGSSNGRFEAVQEKLNELLDGDGGISKRKDAT 1573

Query: 1773 KGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMM 1952
            KGYLKLL LT ASGD  NG AIQSNR LSMILDA+LKTKSR VL DIINKNGLQMLHN+M
Sbjct: 1574 KGYLKLLFLTVASGDRINGEAIQSNRDLSMILDALLKTKSRAVLNDIINKNGLQMLHNIM 1633

Query: 1953 KLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQ 2132
            K YR DFKKIPILRKLLKVLEFL   +ILT E IN      G+ESFRES+LS TEH+D Q
Sbjct: 1634 KQYRHDFKKIPILRKLLKVLEFLEAGKILTYEHINGGPPCRGMESFRESMLSLTEHEDKQ 1693

Query: 2133 VHQIARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCHHWR-EQGAIAVERLT 2306
            VHQIAR+FRDRW P HAR+                     F +   +R EQ     E   
Sbjct: 1694 VHQIARNFRDRWFPRHARKHGYMDRDDNRVESHRSFKCNRFSASQSYRHEQDLKTTEASD 1753

Query: 2307 CSDQSVSMSNLVDARSEEASSPKLVDD-QTMVTRPRKRKSRWDQPASPKKTGRSPKVDYS 2483
            CS QS+ ++  VDA + E    + +D  +T     RKRKSRWDQPA   +T     V  S
Sbjct: 1754 CSQQSMLVTTPVDAEAREGFPVQSLDGVETKTAEKRKRKSRWDQPA---ETNSHSDVVMS 1810

Query: 2484 EGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDSTDDTPPGFSS 2663
              G+                              N+ ++VPPGFS    S + +    + 
Sbjct: 1811 SIGE----------------------------SQNIHEDVPPGFSCPVGSLNASLNSGNL 1842

Query: 2664 VL-------CP--VPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEIA 2801
             L       CP  + +GH + +FN  L V++G+P+SV Q+ GTP     + W  A
Sbjct: 1843 ALQNASRSGCPSDIIIGHPKEKFNSCLAVSFGMPWSVAQQYGTPHAEFPECWVTA 1897


>ref|XP_006578956.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X3
            [Glycine max]
          Length = 2047

 Score =  793 bits (2048), Expect = 0.0
 Identities = 465/951 (48%), Positives = 579/951 (60%), Gaps = 38/951 (3%)
 Frame = +3

Query: 63   NTSCLDLIR-TGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKA 239
            N S LD++   G  EQ L PR AWV CDDC KWR IPAVLAD I+ TNC WTCKD+ DKA
Sbjct: 1008 NVSNLDMLSGVGFGEQILSPRNAWVRCDDCHKWRRIPAVLADRIDETNCTWTCKDSSDKA 1067

Query: 240  FGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLASQPSTLMLVKSNL 410
            F DC+I QEKSNAEINAEL +SD   EEDA +G   FK     P + SQ ST   + +N 
Sbjct: 1068 FADCAIPQEKSNAEINAELGLSDASGEEDAYEGSKNFKELEYWPPIVSQESTFTNILTNE 1127

Query: 411  FLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQR 590
            FLHRS K QT+DEIMVCHCKP   G LGCG  CLNR+LNIECVQGTCPCGD CSNQQFQ+
Sbjct: 1128 FLHRSHKTQTIDEIMVCHCKPSQGGKLGCGDECLNRILNIECVQGTCPCGDRCSNQQFQK 1187

Query: 591  KQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYF 770
             +YA L W +CGKKGYGL+ +ED+++G FLIEYVGEVLD+QTYEARQ+EYA KGH+HFYF
Sbjct: 1188 HKYASLKWFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEARQREYALKGHRHFYF 1247

Query: 771  MTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDY 950
            MTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFALR++KK EELTFDY
Sbjct: 1248 MTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVKKDEELTFDY 1307

Query: 951  NYVRVFGAAAKKCHCGSRKCRGYI-GGDPHNAEIIVQGDSDXXXXXXXXXXXXGDIDHSL 1127
            NYVRVFGAAAKKC+CGS  CRGYI GGDP NAE+IVQ DS+            G+I+ ++
Sbjct: 1308 NYVRVFGAAAKKCYCGSSNCRGYIGGGDPLNAELIVQSDSEEEFPEPVMLTKDGEIEDAV 1367

Query: 1128 GDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPIQSIPGKDSVDE 1307
                   ++ D E   +                              P  +I   D   E
Sbjct: 1368 -PTPKYFNNVDTESAKHMLKDRDILE--------------------NPTTAI-DSDGSPE 1405

Query: 1308 VRESTDSSSGFQHDHWRSEPASSV-KIHTSSTEHIIGTPTSSPKSDVLLVENASQKSLCG 1484
               S + +S     H  +E   S  K+ +S  +  I        S  +   +   +    
Sbjct: 1406 KESSMNPASAVSLLHSSAEMEDSKGKLPSSVRDEEISQQMEDVTSKPMPSVHQGYEKESE 1465

Query: 1485 SIDSASRVFEVDTECEPHSLTRM--------KISRPKSVKNRKSSGISATV--GRVLS-- 1628
              D  S +  ++T   P ++++M        + S+ + +  +K+  ++ +V  G+V +  
Sbjct: 1466 FADKTSSIQRLETTSPPTTVSKMLPNSAGSNRESKSEIIGGKKTPKLNGSVKKGKVHANP 1525

Query: 1629 ----KTQ----RPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASKGYL 1784
                KT+    R +V S K K+ +EGS+NG  + V+E LNELLD DGGI K+KDA+KGYL
Sbjct: 1526 PNGLKTEVTANRLQVSSIKHKK-VEGSSNGRFEAVQEKLNELLDGDGGISKRKDATKGYL 1584

Query: 1785 KLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMKLYR 1964
            KLL LT ASGD  NG AIQSNR LSMILDA+LKTKSR VL DIINKNGLQMLHN+MK YR
Sbjct: 1585 KLLFLTVASGDRINGEAIQSNRDLSMILDALLKTKSRAVLNDIINKNGLQMLHNIMKQYR 1644

Query: 1965 RDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQVHQI 2144
             DFKKIPILRKLLKVLEFL   +ILT E IN      G+ESFRES+LS TEH+D QVHQI
Sbjct: 1645 HDFKKIPILRKLLKVLEFLEASKILTSEHINGGPPCHGMESFRESMLSLTEHEDKQVHQI 1704

Query: 2145 ARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCHHWR-EQGAIAVERLTCSDQ 2318
            AR+FRDRW P HAR+                     F + H  R EQ     E + CS Q
Sbjct: 1705 ARNFRDRWFPRHARKHGYMDRDDNRVESHRSFKCNRFSASHSQRHEQDLRTTEAIDCSQQ 1764

Query: 2319 SVSMSNLVDARSEEASSPKLVDD-QTMVTRPRKRKSRWDQPASPKKTGRSPKVDYSEGGQ 2495
            ++ M+  VDA + E    + +D  +    + RKRKSRWDQPA                  
Sbjct: 1765 AMLMTTPVDAETWEGCPVQSLDGVEIKRAKKRKRKSRWDQPA------------------ 1806

Query: 2496 RTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDSTDDTPPGFSSVL-- 2669
                      ++N   D  ++ +  E    N+ ++ PPGFS    S + +    +  L  
Sbjct: 1807 ----------DTNSHSDA-VMSSIGES--QNIPEDGPPGFSCPVGSLNASLNSGNLALQN 1853

Query: 2670 -----CP--VPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEIA 2801
                 CP  + +GH + +FN HLPV+YG+P+S  Q+ GTP     + W  A
Sbjct: 1854 ASRSGCPSDIVIGHPKEKFNSHLPVSYGMPWS-AQQYGTPHAEFPECWVTA 1903


>ref|XP_006578954.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X1
            [Glycine max] gi|571452142|ref|XP_006578955.1| PREDICTED:
            histone-lysine N-methyltransferase ASHH2-like isoform X2
            [Glycine max]
          Length = 2084

 Score =  793 bits (2048), Expect = 0.0
 Identities = 465/951 (48%), Positives = 579/951 (60%), Gaps = 38/951 (3%)
 Frame = +3

Query: 63   NTSCLDLIR-TGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKA 239
            N S LD++   G  EQ L PR AWV CDDC KWR IPAVLAD I+ TNC WTCKD+ DKA
Sbjct: 1045 NVSNLDMLSGVGFGEQILSPRNAWVRCDDCHKWRRIPAVLADRIDETNCTWTCKDSSDKA 1104

Query: 240  FGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLASQPSTLMLVKSNL 410
            F DC+I QEKSNAEINAEL +SD   EEDA +G   FK     P + SQ ST   + +N 
Sbjct: 1105 FADCAIPQEKSNAEINAELGLSDASGEEDAYEGSKNFKELEYWPPIVSQESTFTNILTNE 1164

Query: 411  FLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQR 590
            FLHRS K QT+DEIMVCHCKP   G LGCG  CLNR+LNIECVQGTCPCGD CSNQQFQ+
Sbjct: 1165 FLHRSHKTQTIDEIMVCHCKPSQGGKLGCGDECLNRILNIECVQGTCPCGDRCSNQQFQK 1224

Query: 591  KQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYF 770
             +YA L W +CGKKGYGL+ +ED+++G FLIEYVGEVLD+QTYEARQ+EYA KGH+HFYF
Sbjct: 1225 HKYASLKWFKCGKKGYGLKAIEDVAQGQFLIEYVGEVLDMQTYEARQREYALKGHRHFYF 1284

Query: 771  MTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDY 950
            MTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFALR++KK EELTFDY
Sbjct: 1285 MTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALRNVKKDEELTFDY 1344

Query: 951  NYVRVFGAAAKKCHCGSRKCRGYI-GGDPHNAEIIVQGDSDXXXXXXXXXXXXGDIDHSL 1127
            NYVRVFGAAAKKC+CGS  CRGYI GGDP NAE+IVQ DS+            G+I+ ++
Sbjct: 1345 NYVRVFGAAAKKCYCGSSNCRGYIGGGDPLNAELIVQSDSEEEFPEPVMLTKDGEIEDAV 1404

Query: 1128 GDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPIQSIPGKDSVDE 1307
                   ++ D E   +                              P  +I   D   E
Sbjct: 1405 -PTPKYFNNVDTESAKHMLKDRDILE--------------------NPTTAI-DSDGSPE 1442

Query: 1308 VRESTDSSSGFQHDHWRSEPASSV-KIHTSSTEHIIGTPTSSPKSDVLLVENASQKSLCG 1484
               S + +S     H  +E   S  K+ +S  +  I        S  +   +   +    
Sbjct: 1443 KESSMNPASAVSLLHSSAEMEDSKGKLPSSVRDEEISQQMEDVTSKPMPSVHQGYEKESE 1502

Query: 1485 SIDSASRVFEVDTECEPHSLTRM--------KISRPKSVKNRKSSGISATV--GRVLS-- 1628
              D  S +  ++T   P ++++M        + S+ + +  +K+  ++ +V  G+V +  
Sbjct: 1503 FADKTSSIQRLETTSPPTTVSKMLPNSAGSNRESKSEIIGGKKTPKLNGSVKKGKVHANP 1562

Query: 1629 ----KTQ----RPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASKGYL 1784
                KT+    R +V S K K+ +EGS+NG  + V+E LNELLD DGGI K+KDA+KGYL
Sbjct: 1563 PNGLKTEVTANRLQVSSIKHKK-VEGSSNGRFEAVQEKLNELLDGDGGISKRKDATKGYL 1621

Query: 1785 KLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMKLYR 1964
            KLL LT ASGD  NG AIQSNR LSMILDA+LKTKSR VL DIINKNGLQMLHN+MK YR
Sbjct: 1622 KLLFLTVASGDRINGEAIQSNRDLSMILDALLKTKSRAVLNDIINKNGLQMLHNIMKQYR 1681

Query: 1965 RDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQVHQI 2144
             DFKKIPILRKLLKVLEFL   +ILT E IN      G+ESFRES+LS TEH+D QVHQI
Sbjct: 1682 HDFKKIPILRKLLKVLEFLEASKILTSEHINGGPPCHGMESFRESMLSLTEHEDKQVHQI 1741

Query: 2145 ARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCHHWR-EQGAIAVERLTCSDQ 2318
            AR+FRDRW P HAR+                     F + H  R EQ     E + CS Q
Sbjct: 1742 ARNFRDRWFPRHARKHGYMDRDDNRVESHRSFKCNRFSASHSQRHEQDLRTTEAIDCSQQ 1801

Query: 2319 SVSMSNLVDARSEEASSPKLVDD-QTMVTRPRKRKSRWDQPASPKKTGRSPKVDYSEGGQ 2495
            ++ M+  VDA + E    + +D  +    + RKRKSRWDQPA                  
Sbjct: 1802 AMLMTTPVDAETWEGCPVQSLDGVEIKRAKKRKRKSRWDQPA------------------ 1843

Query: 2496 RTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDSTDDTPPGFSSVL-- 2669
                      ++N   D  ++ +  E    N+ ++ PPGFS    S + +    +  L  
Sbjct: 1844 ----------DTNSHSDA-VMSSIGES--QNIPEDGPPGFSCPVGSLNASLNSGNLALQN 1890

Query: 2670 -----CP--VPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEIA 2801
                 CP  + +GH + +FN HLPV+YG+P+S  Q+ GTP     + W  A
Sbjct: 1891 ASRSGCPSDIVIGHPKEKFNSHLPVSYGMPWS-AQQYGTPHAEFPECWVTA 1940


>ref|XP_004502541.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X3
            [Cicer arietinum]
          Length = 1978

 Score =  793 bits (2047), Expect = 0.0
 Identities = 489/1087 (44%), Positives = 614/1087 (56%), Gaps = 52/1087 (4%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRT-GIAEQNLVPRVAWVCCDDCLKWRCIPAVL 179
            GN +   V K          + S LD++   G+ EQ   PR AWV CDDC KWR IPA+L
Sbjct: 931  GNHKLAGVGKINTGDNRVPVSVSNLDVMPGFGLEEQQQSPRNAWVSCDDCHKWRRIPALL 990

Query: 180  ADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGS 350
            AD I+ TNC WTCKD+ DKA+ DC+I QEKSNAEINAEL +SD   EEDA       K  
Sbjct: 991  ADQIDETNCTWTCKDSSDKAYADCAIPQEKSNAEINAELGLSDASGEEDAYGNSKTHKEL 1050

Query: 351  GLKPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNI 530
              +  L SQ ST   + +N FLHR+ + QT+DE+MVCHCKPP +G +GCG  CLNRMLNI
Sbjct: 1051 EYQLPLVSQESTFTRIFTNEFLHRNPRTQTIDEVMVCHCKPPREGKMGCGDECLNRMLNI 1110

Query: 531  ECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDL 710
            ECVQGTCPCGD CSNQQFQ++ Y++L W +CGKKGYGL+ LE ++EG F+IEYVGEVLD+
Sbjct: 1111 ECVQGTCPCGDRCSNQQFQKRNYSRLKWFKCGKKGYGLKALERVAEGQFIIEYVGEVLDV 1170

Query: 711  QTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEI 890
              YEARQ+EYA KGH+HFYFMTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEI
Sbjct: 1171 HAYEARQREYALKGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEI 1230

Query: 891  CVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGG-DPHNAEIIVQGDS 1067
            C+GLFALR+IK+ EELTFDYNYVRVFGAAAKKC+CGS  C+GYIGG DP+N E+IVQG+S
Sbjct: 1231 CIGLFALRNIKQDEELTFDYNYVRVFGAAAKKCYCGSLHCQGYIGGADPNNGELIVQGES 1290

Query: 1068 DXXXXXXXXXXXXGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXX 1247
            D            G+ID S+  +M K         VN                       
Sbjct: 1291 DDEFPEPMMLSENGEIDDSV--LMPKCIDS-----VNTKSSR------------------ 1325

Query: 1248 XXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHIIGTPTS 1427
                       I  +D +D+   +   + G   +   + PAS+V +  SS E +  + ++
Sbjct: 1326 ---------HLITDRDVLDKCTTAI-CADGSPEEDSSTNPASAVSLLHSSVE-VEDSKSN 1374

Query: 1428 SPKSDVL-----LVEN-------ASQKSLCGSID----SASRVFEVDTECEPHSLTRMKI 1559
             P SD +      +E+       A  K L  S D    S S + EV  +    S + + +
Sbjct: 1375 LPSSDRIEEISQQIEDTTSKPMPADSKELPNSTDSNRESKSEMVEVGND---FSQSHLLV 1431

Query: 1560 SRPKSVKNRKSSGISATVGRVLS---KTQRPKVFSCKPKRLLEGSANGHLKDVEENLNEL 1730
              P+   + K   + A     L+      R  V S K K+ +EGS+NG  + V+  LNEL
Sbjct: 1432 KTPRLNASVKKGKVRANAANALTAEVAAPRLPVSSIKNKK-VEGSSNGRFEAVQGKLNEL 1490

Query: 1731 LDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLD 1910
            LD +GGI K+KDA+KGYLKLLLLT ASGD  NG AIQSNR LSMILDA+LKTKSR VL D
Sbjct: 1491 LDGNGGISKRKDATKGYLKLLLLTVASGDRSNGEAIQSNRDLSMILDALLKTKSRAVLND 1550

Query: 1911 IINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESF 2090
            II+KNGLQMLH +MK YR+DFKKIPILRKLLKVLE+LA  +ILT E IN      G+E F
Sbjct: 1551 IISKNGLQMLHKIMKQYRQDFKKIPILRKLLKVLEYLAAGKILTPEHINGGPPCHGMERF 1610

Query: 2091 RESILSFTEHDDNQVHQIARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCH- 2264
            R+S+LS TEHDD QVHQIARSFRDRWIP H R+                     F   H 
Sbjct: 1611 RDSMLSLTEHDDKQVHQIARSFRDRWIPRHGRKHGYMDRDDNRMESHRGFNSNRFSVSHS 1670

Query: 2265 HWREQGAIAVERLTCSDQSVSMSNLVDARSEE-ASSPKLVDDQTMVTRPRKRKSRWDQPA 2441
            H  EQG    E   C  Q + ++  VDAR++E  S+P L   +    + RKRKSRWDQPA
Sbjct: 1671 HRHEQGLRPKEATDCGQQPMLVAT-VDARAQEGCSTPSLDGVEINGAKKRKRKSRWDQPA 1729

Query: 2442 SPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFS- 2618
                                        E+N   D  I+ + +E    N+ +EVPPGFS 
Sbjct: 1730 ----------------------------ETNSYSDA-IISSINES--QNVHEEVPPGFSC 1758

Query: 2619 ---HLHDSTDDTPPGFSSVL---CP--VPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQC 2774
                L+ + +   P   +     CP  + +G  + +FN  LPV+YG+P+SV Q+ GTP  
Sbjct: 1759 PIRSLNSALNSGTPALQNASHSGCPPSLVIGQPKEKFNSRLPVSYGLPWSVAQQYGTPHA 1818

Query: 2775 GTADTWEIA-------XXXXXXXXXXXXXXXXXXXXTGDQEEARQGYQTH----AGDQGL 2921
                 W  A                                E +Q   T       D  +
Sbjct: 1819 EITGCWITAPGMPFNPFPPLPPYPRDNKDCQPSSMEIDQPAEVKQSDATGPVNCCSDDMI 1878

Query: 2922 PCTSGASKNTVGTNQNMVHHER-----GPRNFLGKRCYWQKNWNGSKRRPPWARNIGRWG 3086
            P T+GA+            H+         + LGK+ + Q+ WN SK    W +    W 
Sbjct: 1879 PSTTGANSEDTNLQCEDAKHDAKRLKGDDSDDLGKKYFRQQKWNNSKIHRTWFKR-DAWK 1937

Query: 3087 FRGNYPS 3107
              GN  S
Sbjct: 1938 CNGNSSS 1944


>ref|XP_004502539.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like isoform X1
            [Cicer arietinum] gi|502136041|ref|XP_004502540.1|
            PREDICTED: histone-lysine N-methyltransferase ASHH2-like
            isoform X2 [Cicer arietinum]
          Length = 1979

 Score =  793 bits (2047), Expect = 0.0
 Identities = 489/1087 (44%), Positives = 614/1087 (56%), Gaps = 52/1087 (4%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRT-GIAEQNLVPRVAWVCCDDCLKWRCIPAVL 179
            GN +   V K          + S LD++   G+ EQ   PR AWV CDDC KWR IPA+L
Sbjct: 932  GNHKLAGVGKINTGDNRVPVSVSNLDVMPGFGLEEQQQSPRNAWVSCDDCHKWRRIPALL 991

Query: 180  ADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGS 350
            AD I+ TNC WTCKD+ DKA+ DC+I QEKSNAEINAEL +SD   EEDA       K  
Sbjct: 992  ADQIDETNCTWTCKDSSDKAYADCAIPQEKSNAEINAELGLSDASGEEDAYGNSKTHKEL 1051

Query: 351  GLKPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNI 530
              +  L SQ ST   + +N FLHR+ + QT+DE+MVCHCKPP +G +GCG  CLNRMLNI
Sbjct: 1052 EYQLPLVSQESTFTRIFTNEFLHRNPRTQTIDEVMVCHCKPPREGKMGCGDECLNRMLNI 1111

Query: 531  ECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDL 710
            ECVQGTCPCGD CSNQQFQ++ Y++L W +CGKKGYGL+ LE ++EG F+IEYVGEVLD+
Sbjct: 1112 ECVQGTCPCGDRCSNQQFQKRNYSRLKWFKCGKKGYGLKALERVAEGQFIIEYVGEVLDV 1171

Query: 711  QTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEI 890
              YEARQ+EYA KGH+HFYFMTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEI
Sbjct: 1172 HAYEARQREYALKGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEI 1231

Query: 891  CVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGG-DPHNAEIIVQGDS 1067
            C+GLFALR+IK+ EELTFDYNYVRVFGAAAKKC+CGS  C+GYIGG DP+N E+IVQG+S
Sbjct: 1232 CIGLFALRNIKQDEELTFDYNYVRVFGAAAKKCYCGSLHCQGYIGGADPNNGELIVQGES 1291

Query: 1068 DXXXXXXXXXXXXGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXX 1247
            D            G+ID S+  +M K         VN                       
Sbjct: 1292 DDEFPEPMMLSENGEIDDSV--LMPKCIDS-----VNTKSSR------------------ 1326

Query: 1248 XXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHIIGTPTS 1427
                       I  +D +D+   +   + G   +   + PAS+V +  SS E +  + ++
Sbjct: 1327 ---------HLITDRDVLDKCTTAI-CADGSPEEDSSTNPASAVSLLHSSVE-VEDSKSN 1375

Query: 1428 SPKSDVL-----LVEN-------ASQKSLCGSID----SASRVFEVDTECEPHSLTRMKI 1559
             P SD +      +E+       A  K L  S D    S S + EV  +    S + + +
Sbjct: 1376 LPSSDRIEEISQQIEDTTSKPMPADSKELPNSTDSNRESKSEMVEVGND---FSQSHLLV 1432

Query: 1560 SRPKSVKNRKSSGISATVGRVLS---KTQRPKVFSCKPKRLLEGSANGHLKDVEENLNEL 1730
              P+   + K   + A     L+      R  V S K K+ +EGS+NG  + V+  LNEL
Sbjct: 1433 KTPRLNASVKKGKVRANAANALTAEVAAPRLPVSSIKNKK-VEGSSNGRFEAVQGKLNEL 1491

Query: 1731 LDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLD 1910
            LD +GGI K+KDA+KGYLKLLLLT ASGD  NG AIQSNR LSMILDA+LKTKSR VL D
Sbjct: 1492 LDGNGGISKRKDATKGYLKLLLLTVASGDRSNGEAIQSNRDLSMILDALLKTKSRAVLND 1551

Query: 1911 IINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESF 2090
            II+KNGLQMLH +MK YR+DFKKIPILRKLLKVLE+LA  +ILT E IN      G+E F
Sbjct: 1552 IISKNGLQMLHKIMKQYRQDFKKIPILRKLLKVLEYLAAGKILTPEHINGGPPCHGMERF 1611

Query: 2091 RESILSFTEHDDNQVHQIARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCH- 2264
            R+S+LS TEHDD QVHQIARSFRDRWIP H R+                     F   H 
Sbjct: 1612 RDSMLSLTEHDDKQVHQIARSFRDRWIPRHGRKHGYMDRDDNRMESHRGFNSNRFSVSHS 1671

Query: 2265 HWREQGAIAVERLTCSDQSVSMSNLVDARSEE-ASSPKLVDDQTMVTRPRKRKSRWDQPA 2441
            H  EQG    E   C  Q + ++  VDAR++E  S+P L   +    + RKRKSRWDQPA
Sbjct: 1672 HRHEQGLRPKEATDCGQQPMLVAT-VDARAQEGCSTPSLDGVEINGAKKRKRKSRWDQPA 1730

Query: 2442 SPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFS- 2618
                                        E+N   D  I+ + +E    N+ +EVPPGFS 
Sbjct: 1731 ----------------------------ETNSYSDA-IISSINES--QNVHEEVPPGFSC 1759

Query: 2619 ---HLHDSTDDTPPGFSSVL---CP--VPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQC 2774
                L+ + +   P   +     CP  + +G  + +FN  LPV+YG+P+SV Q+ GTP  
Sbjct: 1760 PIRSLNSALNSGTPALQNASHSGCPPSLVIGQPKEKFNSRLPVSYGLPWSVAQQYGTPHA 1819

Query: 2775 GTADTWEIA-------XXXXXXXXXXXXXXXXXXXXTGDQEEARQGYQTH----AGDQGL 2921
                 W  A                                E +Q   T       D  +
Sbjct: 1820 EITGCWITAPGMPFNPFPPLPPYPRDNKDCQPSSMEIDQPAEVKQSDATGPVNCCSDDMI 1879

Query: 2922 PCTSGASKNTVGTNQNMVHHER-----GPRNFLGKRCYWQKNWNGSKRRPPWARNIGRWG 3086
            P T+GA+            H+         + LGK+ + Q+ WN SK    W +    W 
Sbjct: 1880 PSTTGANSEDTNLQCEDAKHDAKRLKGDDSDDLGKKYFRQQKWNNSKIHRTWFKR-DAWK 1938

Query: 3087 FRGNYPS 3107
              GN  S
Sbjct: 1939 CNGNSSS 1945


>gb|EMJ23127.1| hypothetical protein PRUPE_ppa000056mg [Prunus persica]
          Length = 2066

 Score =  792 bits (2045), Expect = 0.0
 Identities = 494/1135 (43%), Positives = 625/1135 (55%), Gaps = 71/1135 (6%)
 Frame = +3

Query: 24   VSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATN 203
            + K+   K+        LD +   + +Q + PR AWV CDDC KWR IPA LADVI+   
Sbjct: 955  IRKANSVKDAVCIGVPNLDTVPVDLDKQYVPPRNAWVLCDDCHKWRRIPAELADVIDEIK 1014

Query: 204  CRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLAS 374
            C WTC+DN+DKAF DCSI QEKSN+EINAEL ISD   +EDA    L +K    +    S
Sbjct: 1015 CTWTCRDNKDKAFADCSIPQEKSNSEINAELDISDASGDEDASVTRLNYKELERRRPTVS 1074

Query: 375  QPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCP 554
            Q +    +K+N FLHR+RK QT+DEIMVCHCKPP DG LGCG  CLNRMLNIEC++G CP
Sbjct: 1075 QQNVAS-IKTNQFLHRNRKTQTIDEIMVCHCKPPSDGQLGCGDDCLNRMLNIECIRGACP 1133

Query: 555  CGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQK 734
            C DLCSNQQFQ+++YAKL   RCGKKGYGL++L+DI +G FLIEYVGEVLD   YEARQK
Sbjct: 1134 CRDLCSNQQFQKRRYAKLEKFRCGKKGYGLRLLDDIFKGQFLIEYVGEVLDTHAYEARQK 1193

Query: 735  EYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALR 914
            EYA K H+HFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGEIC+GLFALR
Sbjct: 1194 EYALKAHRHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWMVNGEICIGLFALR 1253

Query: 915  SIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXX 1094
             IKKGEE+TFDYNYVRVFGAAAKKC+CGS +CRGYIGGDP ++E+I+Q DSD        
Sbjct: 1254 DIKKGEEVTFDYNYVRVFGAAAKKCYCGSAQCRGYIGGDPLDSEVIIQDDSDEEYIEPVM 1313

Query: 1095 XXXXGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPI 1274
                G        +  K  S    +  +                          +     
Sbjct: 1314 IPEDG--------ISEKVESASTNKETDKSTIAVGELEFTTQREESVNPSESVVLHIH-- 1363

Query: 1275 QSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVK---IHTSSTEHIIGTPTSSPKSDV 1445
             S+  + S  ++  S       +H    S P S V+   +  + T+    + TS  + ++
Sbjct: 1364 DSLELEHSRQKLPSSVQPVEASEHKEETSRPMSVVQQEILRENETKE--KSSTSFERLEI 1421

Query: 1446 LLVENASQKSLCGSIDSASRVFEVDT----ECEPHSLTRMKISRPKSVKNRKSSGISATV 1613
                    KSL   ID A+R  + DT    +        +K SR  S   +    I  + 
Sbjct: 1422 ASPIKVLSKSLSDGID-ANRKSKSDTTEDRQVSSQVRPNVKTSRSSSFVKKGKVRIIPSG 1480

Query: 1614 GRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASKGYLKLL 1793
             ++     +  V S KPKRL EGS  G      E LNELLD DGGI K+KD++KGYLKLL
Sbjct: 1481 NKIQVAANKSHVLSIKPKRLTEGSGKGFF----EKLNELLDVDGGINKRKDSTKGYLKLL 1536

Query: 1794 LLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMKLYRRDF 1973
             LTA SGDSGNG AIQSNR LSMILDA+LKT+SR VL+D+INKNGL+MLHN+MK YR DF
Sbjct: 1537 FLTAVSGDSGNGEAIQSNRDLSMILDALLKTRSRVVLIDVINKNGLRMLHNIMKKYREDF 1596

Query: 1974 KKIPILRKLLK-------------------VLEFLAEKEILTVERINVDSLHPGVESF-R 2093
            KKIPILRKLLK                   VLE+LA K+ILT+E I      PG+ES  R
Sbjct: 1597 KKIPILRKLLKDLSLSLSLSLSLSLSLSCGVLEYLAVKQILTLEHITGGPPCPGMESLNR 1656

Query: 2094 ESILSFTEHDDNQVHQIARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHFGSCH-H 2267
             SIL        QVHQIAR+FRDRWIP H RR                       + H +
Sbjct: 1657 LSIL--------QVHQIARNFRDRWIPRHLRRHGFVDRDDSKMEFNRGSNCNRLSTSHDN 1708

Query: 2268 WREQGAIAVERLTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQPAS- 2444
            WR+Q   + + +    QSV  +  V    ++ S+P      T VT+ RKRKSRWDQPA  
Sbjct: 1709 WRDQSGRSTDTIDSIKQSVLSTTSVSTGVQDCSAPCTGGCPTSVTKVRKRKSRWDQPAET 1768

Query: 2445 -PKKTGRSPKVDYSEG-----------GQRTLTYE-AKKEESNCSGDQNILRNRDEDMMH 2585
             P  +    K   +E            G+  L  E    ++ NCS   +    +++    
Sbjct: 1769 IPDSSSLQNKEQKTESGLHRPSPLSGTGEVALHLERVSGDDGNCSSSVHDNSQQNDGAQI 1828

Query: 2586 NLDDEVPPGFSHLHDSTDDTPPGFSSVLCPVP-----LGHLQTRFNPHLPVAYGIPFSVV 2750
            NL+D VPPGFS  +  T       SS  CP+      +GH Q +F   L V+YG P S++
Sbjct: 1829 NLED-VPPGFSS-YIRTPTVSSIASSSFCPLKCPAAVIGHPQEKFVSRLSVSYGFPLSMM 1886

Query: 2751 QKSGTPQCGTADTWEIA----------XXXXXXXXXXXXXXXXXXXXTGDQEEARQG--- 2891
            Q+ GTP      TW +A                              +G+Q    Q    
Sbjct: 1887 QQYGTPHAEIVGTWAVAPGIPFQPFPPLPPFPRHKKDPSPYPTVNHVSGNQPAGGQPDWC 1946

Query: 2892 -YQTHAGDQGLPCTSGASKNTVGT----NQNMVHHERGPRNFLGKRCY-WQKNWNGSKRR 3053
               T   ++  P T+G+++   G+    NQ      R   N LG+R +  QK WN +K R
Sbjct: 1947 VPATSQSEESTPSTTGSNQADFGSPCANNQYSSKRVRESSNDLGRRYFKQQKYWNNTKLR 2006

Query: 3054 PPWARNIGRWGFRGNYPSHPRNGSSNINMEMVPNE-SSGEDVTDVNHSVEYAGNS 3215
            PP   +   WG  GN   +   G+  I +  V NE S+     D+++ VE AGN+
Sbjct: 2007 PPSFSDRNGWGCTGN---NSGGGTDGIGVGHVANELSTSYCSEDLSYRVEKAGNN 2058


>gb|ESW09471.1| hypothetical protein PHAVU_009G130100g [Phaseolus vulgaris]
          Length = 2017

 Score =  789 bits (2037), Expect = 0.0
 Identities = 486/1135 (42%), Positives = 617/1135 (54%), Gaps = 64/1135 (5%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIR-TGIAEQNLVPRVAWVCCDDCLKWRCIPAVL 179
            GN +   V K          N S LD +    +  Q   PR AWV CDDC KWR IPAVL
Sbjct: 938  GNYKLDAVGKINAEDNKVSVNISKLDTLSGVELGGQLPSPRNAWVRCDDCYKWRRIPAVL 997

Query: 180  ADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGS 350
            AD+I+ TN  WTCKD+ D AF DC++ QEKSNAEINAEL +SD   EEDA +G   FK  
Sbjct: 998  ADLIDETNRTWTCKDSSDSAFADCAVPQEKSNAEINAELGLSDASGEEDAYEGSKNFKEL 1057

Query: 351  GLKPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNI 530
              +P   SQ ST   + +N FLHRS K QT+DEIMVCHCK   +G LGCG  CLNRMLNI
Sbjct: 1058 EYRPPFVSQGSTFTHIFTNEFLHRSHKTQTIDEIMVCHCKASQEGKLGCGDECLNRMLNI 1117

Query: 531  ECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDL 710
            ECVQGTCPCGD CSNQQFQ+++YA L W +CGKKGYGL+ L ++++G FLIEYVGEVLD+
Sbjct: 1118 ECVQGTCPCGDRCSNQQFQKRKYANLRWFKCGKKGYGLKALGNVAQGQFLIEYVGEVLDM 1177

Query: 711  QTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEI 890
             TYEARQ+EYA KGH+HFYFMTLNG+EVIDASAKGNLGRF+NHSCDPNCRTEKW+VNGEI
Sbjct: 1178 HTYEARQREYALKGHRHFYFMTLNGSEVIDASAKGNLGRFINHSCDPNCRTEKWMVNGEI 1237

Query: 891  CVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYI-GGDPHNAEIIVQGDS 1067
            C+GLFALR IK+ EELTFDYNYVRVFGAAAKKC+C S  CRGYI GGDP NA++IVQ DS
Sbjct: 1238 CIGLFALRDIKQDEELTFDYNYVRVFGAAAKKCYCSSPSCRGYIGGGDPLNADLIVQSDS 1297

Query: 1068 DXXXXXXXXXXXXGDIDHS--LGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXX 1241
            +            G I+ +  +    +   +Q    ++                      
Sbjct: 1298 EEEFPEPVMLSKDGKIEDAVPIPKYFSNVDTQSARNMLKGRDILEKSTTAIDSDGSPEKE 1357

Query: 1242 XXXXXVECGPIQSIPGK--------------DSVDEVRESTDSSSGFQHDHWRSEPASSV 1379
                      +   P +              + + +  E   S        +  E  S  
Sbjct: 1358 SSVNPASAVSLLHSPAEMEDSKGKLPFSVEVEEISQQMEDVTSKPMSTEQGYEKEKESEF 1417

Query: 1380 KIHTSSTEHIIGTPTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKI 1559
               TSST+ +  T   +  S +L    ++++S    I+                 +++K 
Sbjct: 1418 ADKTSSTQRLETTSPLTTASKMLSNSGSNKESKSEIIEGRKN-------------SKLKS 1464

Query: 1560 SRPK-SVKNRKSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLD 1736
            S  K  V     +G+ A V        R ++ S K K+ LEGS+NG  + V+E LNELLD
Sbjct: 1465 SVKKGKVHANLPNGLKAEV-----SANRLQLSSVKHKK-LEGSSNGRFEAVQEKLNELLD 1518

Query: 1737 ADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDII 1916
             DGGI K+KDA+KGYLKLL LT ASGD  NG AIQSNR LSMILDA+LKTKSR VL DII
Sbjct: 1519 GDGGISKRKDATKGYLKLLFLTVASGDRSNGEAIQSNRDLSMILDALLKTKSRAVLNDII 1578

Query: 1917 NKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRE 2096
            NKNGLQMLHN+MK YR+DFKKIPILRKLLKVLE+LA  +ILT E+IN      G+ESFRE
Sbjct: 1579 NKNGLQMLHNIMKQYRQDFKKIPILRKLLKVLEYLAASKILTPEQINGGPPCHGMESFRE 1638

Query: 2097 SILSFTEHDDNQVHQIARSFRDRWIPHARRFH-XXXXXXXXXXXXXXXXXXHFGSCH-HW 2270
            S+LS TEHDD QVHQIARSFRDRW P   R H                    F + H H 
Sbjct: 1639 SMLSLTEHDDKQVHQIARSFRDRWFPRPNRKHGYLDRDDNRMESNRSFSGSRFSASHSHR 1698

Query: 2271 REQGAIAVERLTCSDQSVSMSNLVDARSEEASSPKLVDD-QTMVTRPRKRKSRWDQPASP 2447
             EQ   A E + CS QS+  +  VDA ++E+     +D  +    + RKRKSRWDQPA  
Sbjct: 1699 PEQDLRAAEVIDCSQQSMLGTTPVDADTQESCPAHSLDGVEIKGAKKRKRKSRWDQPA-- 1756

Query: 2448 KKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFS--- 2618
                                      E+N   D  ++ +  E    N+ ++VPPGFS   
Sbjct: 1757 --------------------------ETNSLSDA-VMSSIGES--QNIHEDVPPGFSCPI 1787

Query: 2619 -----HLHDSTDDTPPGFSSVLCPVP--LGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCG 2777
                    +S +      S   CP    +GH + +FN  LPVAYG+P+SV  + GTP   
Sbjct: 1788 GPLNASALNSGNLVLQNASRSGCPSDSVVGHSKRKFNSRLPVAYGMPWSVAHQYGTPHTE 1847

Query: 2778 TADTWEIA--------------------XXXXXXXXXXXXXXXXXXXXTGDQEEARQGYQ 2897
              + W  A                                        +    E ++G+ 
Sbjct: 1848 FPERWVTAPGIPFIPFPPLPPYPRDNKDCQPSNNNSAMIIDLPAEAMISDQSAEVKEGHN 1907

Query: 2898 TH----AGDQGLPCTSGAS--KNTVGTNQNMVHHERGPRNFLGKRCYWQKNWNGSKRRPP 3059
            +       D  +P T+GA+  ++ +   +N     +G  + L ++ Y Q+ WN SK   P
Sbjct: 1908 SSMVSCCADDMIPSTTGANPEESNLLFEENEAKRMKGDSHDLVRKYYKQQKWNNSKIHRP 1967

Query: 3060 WARNIGRWGFRGNYPSHPRNGSSNINMEMVPNESSGEDVTDVNHSV---EYAGNS 3215
            W +       R  +  +  N S ++    V      ED  D  +++   E  GN+
Sbjct: 1968 WFQ-------RNAWKCNENNSSGDMCSIDVDLPKESEDTCDAENAICREEKGGNN 2015


>ref|XP_002300965.2| hypothetical protein POPTR_0002s07930g [Populus trichocarpa]
            gi|550344516|gb|EEE80238.2| hypothetical protein
            POPTR_0002s07930g [Populus trichocarpa]
          Length = 2245

 Score =  788 bits (2036), Expect = 0.0
 Identities = 492/1118 (44%), Positives = 625/1118 (55%), Gaps = 75/1118 (6%)
 Frame = +3

Query: 30   KSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCR 209
            KS     V  E  + LD+  +G+ EQNL P  AWV CDDCLKWR IP  L + I  T+ +
Sbjct: 1149 KSVCDHVVYKEEVTNLDMPSSGVMEQNLFPDNAWVRCDDCLKWRRIPVRLVESISQTHRQ 1208

Query: 210  WTCKDNQDKAFGDCSISQEKSNAEINAELQISD-EEDARDGHLGFKGSGLKPLLASQPST 386
            W C+DN DKAF DCS  QEKS+AEINAEL ISD +ED  D    +      P   S+   
Sbjct: 1209 WICEDNMDKAFADCSFPQEKSDAEINAELGISDADEDVCDAPSNYMELECGPTSVSKEYE 1268

Query: 387  LMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLG-CGSHCLNRMLNIECVQGTCPCGD 563
               + +N FLHR+RK QT+DEIMVC+CK PV G LG CG  CLNRMLNIECVQGTCPCGD
Sbjct: 1269 FTRITTNQFLHRTRKTQTIDEIMVCYCKAPVGGRLGGCGDECLNRMLNIECVQGTCPCGD 1328

Query: 564  LCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYA 743
            LCSNQQFQ+  YAK++W RCGKKG+GL++ EDI+ G FLIEYVGEVLD+  YEARQKEYA
Sbjct: 1329 LCSNQQFQKHNYAKMTWDRCGKKGFGLRLEEDITRGQFLIEYVGEVLDVHAYEARQKEYA 1388

Query: 744  SKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIK 923
            SKGHKHFYFMTL+G+EVIDA  KGNLGRF+NHSCDPNCRTEKWVVNGEIC+GLFALR IK
Sbjct: 1389 SKGHKHFYFMTLDGSEVIDACVKGNLGRFINHSCDPNCRTEKWVVNGEICIGLFALRDIK 1448

Query: 924  KGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXXXX 1103
            KGEE+TFDYNYVRV GAAAK+C+CGS +C+GYIGGDP ++E+  Q DSD           
Sbjct: 1449 KGEEVTFDYNYVRVVGAAAKRCYCGSPQCQGYIGGDPTSSEVTDQVDSD-EEFPEPVMLE 1507

Query: 1104 XGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPIQS- 1280
             G++   L + ++K S   + +                             +   P +S 
Sbjct: 1508 DGEVGDGLKNKISKTSFFGLSKGREMESKTAVGNLEVATEIKDSMNQSTPAISQSPSESE 1567

Query: 1281 ---IPGKDSVDEVR---------ESTDSSSGFQHDHWRSEPA-----SSVKIHTSSTEHI 1409
               +PG  S    R          +T  +   Q +    E       SS K+ TS T  +
Sbjct: 1568 MNGLPGDFSSSSKRVEISPQTEDMTTQPTPAVQQEISMEEMMDKSLYSSQKLKTSLTSVL 1627

Query: 1410 IGTPTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPKS---VK 1580
                 + P  D +++   S+ +   +     RVF           +R  I  P     +K
Sbjct: 1628 -----TKPLPDDIMINRKSKSTTAEN----KRVF---------VKSRFIIKTPPQSGLIK 1669

Query: 1581 NRKSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKK 1760
              KS+     + +V + T +P +   KPK+L E +++GH + V+E LNELLD++GGI K+
Sbjct: 1670 KGKSASNFININKVQTITNKPHMPPIKPKKLSESTSDGHFEAVQEKLNELLDSEGGISKR 1729

Query: 1761 KDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQML 1940
            KDA KGYLKLLLLTAASG   NG AIQSNR LSMILDA+LKT+SR VL+DII KNGL+ML
Sbjct: 1730 KDAPKGYLKLLLLTAASGAIRNGEAIQSNRELSMILDALLKTRSRMVLMDIIEKNGLRML 1789

Query: 1941 HNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEH 2120
            HN+MK YRRDFKKIPILRKLLKVLE+LA +EILT+E IN     PG+ESFRES+LS TEH
Sbjct: 1790 HNIMKQYRRDFKKIPILRKLLKVLEYLAVREILTLEHINGGPPCPGMESFRESMLSLTEH 1849

Query: 2121 DDNQVHQIARSFRDRWIPHARR--FHXXXXXXXXXXXXXXXXXXHFGSCHHWREQGAIAV 2294
            +D QVHQIARSFRDRWIP   R   +                     S   W +QG   +
Sbjct: 1850 NDKQVHQIARSFRDRWIPRQVRKLGYMDRDGGRMEIQRGSNCNKVLASHSQWHDQGVRHL 1909

Query: 2295 ERLTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRS-PK 2471
            E L  + +S   +  V     E SS   V      TR RKRKSRWDQPA      RS   
Sbjct: 1910 EALNGTVESNLATTSVGTAVHEDSSANRVGSG---TRTRKRKSRWDQPAEENIASRSLQH 1966

Query: 2472 VDYSEGG--QRT-------LTYEAKKEESNCSGDQNILRN-------RDE-----DMMHN 2588
            V+ +E G  Q++       L+ E         G+ +   +       +DE     +   N
Sbjct: 1967 VEQNESGLLQQSESNSLPELSKEVPDHVDKAGGEYSYCPHCVHSYCWQDEASGADNGRQN 2026

Query: 2589 LDDEVPPGFSHLHD--------STDDTPPGFSSVLCPVPLGHL----QTRFNPHLPVAYG 2732
            + ++VPPGFS   D        ST D  P  +      P+G +    Q +FN   PV+YG
Sbjct: 2027 IHEDVPPGFSSPIDPALVSNASSTVDDLPHQNVFHLKFPVGVVVGLPQRKFNSRFPVSYG 2086

Query: 2733 IPFSVVQKSGTPQCGTADTWEIAXXXXXXXXXXXXXXXXXXXXT--------GDQEEARQ 2888
            IP  VVQ+ G+P   T + W +A                    T           + A +
Sbjct: 2087 IPLPVVQQLGSPLAETVEGWIVAPGMPFHPFPPLPPLPSCKKGTLPSAMNSMEIDDTADR 2146

Query: 2889 GYQ-----THAGDQGLPCTSGASK---NTVGTNQNMVHHERGPRNFLGKRCYWQKNWNGS 3044
            G Q     T   D+  P T+GA++   N+ G   +           LG+R + Q+ W  +
Sbjct: 2147 GKQDCYDRTTCLDENSPSTTGANQPDLNSPGPKDHQTFKRARGSYDLGRRYFRQQKW--T 2204

Query: 3045 KRRPPWARNIGRWGFRGNYPSHPRNGSSNINMEMVPNE 3158
            K  PPW R+   WG  G    + R G  + ++  + NE
Sbjct: 2205 KMLPPWVRSRNGWGCIG---GNSRGGMCSTDLGSLTNE 2239


>ref|XP_006365937.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Solanum
            tuberosum]
          Length = 1664

 Score =  783 bits (2023), Expect = 0.0
 Identities = 485/1108 (43%), Positives = 605/1108 (54%), Gaps = 88/1108 (7%)
 Frame = +3

Query: 39   FSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCRWTC 218
            F K       S ++++++ I E+ L PR AWV CDDCLKWR IP++LAD IE TNCRWTC
Sbjct: 576  FEKRSLDGGISNMEILQSEIGERLLSPRNAWVQCDDCLKWRRIPSLLADQIEETNCRWTC 635

Query: 219  KDNQDKAFGDCSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLASQPSTL 389
            KDN D+AF DCS  QEKSN+EINAEL+ISD   EED    HL   GSG K LL +  S+ 
Sbjct: 636  KDNLDRAFADCSFPQEKSNSEINAELEISDGSGEEDVSRAHLSSNGSGQKNLLVAHQSSW 695

Query: 390  MLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLC 569
              +KSNLFLHR RK Q +DEIMVC CKPP DG +GCG  CLNR+LNIEC +GTCPCG+ C
Sbjct: 696  NRIKSNLFLHRHRKNQPIDEIMVCLCKPPSDGRMGCGDGCLNRILNIECAKGTCPCGEFC 755

Query: 570  SNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYASK 749
            SNQQFQ++ YAKL   + GKKGYGLQ+LE++SEG FLIEYVGEVLD+  YEARQKEYA K
Sbjct: 756  SNQQFQKRNYAKLKCFKYGKKGYGLQLLENVSEGQFLIEYVGEVLDMHVYEARQKEYALK 815

Query: 750  GHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIKKG 929
             HKHFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKW+VNGE+C+GLFA+R IKKG
Sbjct: 816  CHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWIVNGEVCIGLFAIRDIKKG 875

Query: 930  EELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXXXXXG 1109
            EE+TFDYN+VR+FGAA KKC CGS  CRGYIGGDP +AE+IVQ DSD             
Sbjct: 876  EEVTFDYNFVRIFGAAVKKCVCGSPNCRGYIGGDPLDAEVIVQEDSDDEYPEPVLLPKYA 935

Query: 1110 DIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVEC-GPIQSIP 1286
             +DH   ++    S+      +N                            C   I S  
Sbjct: 936  KMDHKEDNITCATST------INCAKINIQRKRPKKKNTLDGLIAENQETSCQTDINSFV 989

Query: 1287 GKDSVD----------EVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHIIGTPTSSPK 1436
            G++ V+           VRE +++            PAS++   T +   +  +   S  
Sbjct: 990  GQEKVNLGNSIAVVSLNVREESENFPDV-------SPASALMAETCAA--LKASECLSHS 1040

Query: 1437 SDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLT---RMKISRPKSV------KNRK 1589
            S   +  + S K  C ++    + F V  +   +S++    ++I+ P +V      K++ 
Sbjct: 1041 STEPVETSLSLKDTCETVSGVRKGFTVAGKVAKYSISSAQALEITSPDAVVSKSLKKSKS 1100

Query: 1590 SSGISATVGRVLSKTQRPKVFSCKP--------------------------KRLLEGSAN 1691
            S+G       +  KT R      K                           K+  +GS +
Sbjct: 1101 SNGKQTHESFLFVKTSRESSLVKKGKQRNYAVNSRSSPDVDNKLQVPQPNLKKPPDGSIH 1160

Query: 1692 GHLKDVEENLNELLDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILD 1871
            GH + VEE LNELLD DGGI K+KDAS+ YLKLLLLTAASGD  NG AIQSNR LSMILD
Sbjct: 1161 GHFEAVEEKLNELLDHDGGISKRKDASRCYLKLLLLTAASGDDCNGEAIQSNRDLSMILD 1220

Query: 1872 AMLKTKSRGVLLDIINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVER 2051
            A+LKTKSR VL+DIINKNGLQMLHN+MK YRR+F KIPILRKLLKVLE LA ++IL+ E 
Sbjct: 1221 AILKTKSRTVLMDIINKNGLQMLHNIMKRYRREFNKIPILRKLLKVLEHLAVRDILSPEH 1280

Query: 2052 INVDSLHPGVESFRESILSFTEHDDNQVHQIARSFRDRWI-PHARRFHXXXXXXXXXXXX 2228
            IN  +   GV+S R SIL  TEH+D QVHQIAR+FRDR + P  +R              
Sbjct: 1281 INGGTSRAGVQSLRSSILGLTEHEDKQVHQIARNFRDRILRPLRKRICIDRDDCRINTHS 1340

Query: 2229 XXXXXXHFGSCHHWREQGAIAVERLTCSDQSVSMSNLVDARSEEASSPKLVD-DQTMVTR 2405
                     S + W + G    E    +  S   S   D    + SS    D  +  + +
Sbjct: 1341 GSQYNRCLASQNQWCDLGCKPSEGAEYTCHSTVASVQADGGVLDGSSASCSDIGEACMAK 1400

Query: 2406 PRKRKSRWDQPASPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMH 2585
             RK KSRWDQ A  K   R+                    ES+ +          ED   
Sbjct: 1401 KRKCKSRWDQGAEAKSDPRN--------------------ESDVA----------EDQKQ 1430

Query: 2586 NLDDEVPPGFSHLHDSTDDTPPGFSSVL--C------------------------PVPLG 2687
             LDD+VPPG+        + PPGFS  +  C                        PV +G
Sbjct: 1431 VLDDDVPPGY--------EFPPGFSVPIKACRVLSDDSSTAIYSTEERNCGEHPQPVVMG 1482

Query: 2688 HLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEIAXXXXXXXXXXXXXXXXXXXXTG 2867
            HLQ RF   LPV+YGIPFS VQ+ G+ Q G  D W +A                     G
Sbjct: 1483 HLQQRFVSRLPVSYGIPFSEVQQFGSHQKGRFDAWTVA--PGIPFHPFPPLPPYPCDRRG 1540

Query: 2868 DQEEARQGYQTHAGDQGL----------PCTSGASKNTVGTNQNMVHHERGPRNF-LGKR 3014
                A +  Q    D G           P  SGA +   G N N +  ER   +  LG++
Sbjct: 1541 FVPTASELPQNGGEDWGTCSPSHLAQNPPSVSGADQPQDG-NGNQLDCERASESHNLGRK 1599

Query: 3015 CYWQKNWNGSKRRPPWARNIGRWGFRGN 3098
             + ++ +N SK  PPW R    W +  N
Sbjct: 1600 NFRKQKFNNSKLVPPWLRIRSGWEYTEN 1627


>ref|XP_006357338.1| PREDICTED: histone-lysine N-methyltransferase ASHH2-like [Solanum
            tuberosum]
          Length = 1398

 Score =  776 bits (2003), Expect = 0.0
 Identities = 458/961 (47%), Positives = 562/961 (58%), Gaps = 50/961 (5%)
 Frame = +3

Query: 69   SCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKAFGD 248
            S LD++R+ +++  L PR AWV CDDC KWR I +VLAD IE TNC+WTCKDN D+   D
Sbjct: 425  SDLDIMRSEVSQPYLQPRNAWVQCDDCQKWRRIASVLADKIEETNCKWTCKDNLDRDLAD 484

Query: 249  CSISQEKSNAEINAELQISD---EEDARDGHLGFKGSGLKPLLASQPSTLMLVKSNLFLH 419
            CSI+QEKSN+EINAEL+ISD   EED     L    SG K    S  S+  L+K N FLH
Sbjct: 485  CSIAQEKSNSEINAELEISDASGEEDVLRTRLNSNRSGQKKAPVSLQSSWTLIKRNSFLH 544

Query: 420  RSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQRKQY 599
            RSRK+QT+DEIMVCHCKP  D  +GCG  CLNRMLN+ECV+GTCPCG+ CSNQQFQ++ Y
Sbjct: 545  RSRKSQTIDEIMVCHCKPS-DRRMGCGDGCLNRMLNVECVRGTCPCGERCSNQQFQKRNY 603

Query: 600  AKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYFMTL 779
            AKL   +CGKKGYGLQ+LED+S+G FLIEYVGEVLDL  Y+ARQKEYA KGHKHFYFMTL
Sbjct: 604  AKLKCFKCGKKGYGLQLLEDVSKGQFLIEYVGEVLDLHAYDARQKEYALKGHKHFYFMTL 663

Query: 780  NGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDYNYV 959
            NG+EVIDA AKGNLGRF+NHSCDPNC TEKW+VNGE+C+GLFALR IKKGEE+TFDYNYV
Sbjct: 664  NGSEVIDACAKGNLGRFINHSCDPNCCTEKWMVNGEVCIGLFALRDIKKGEEVTFDYNYV 723

Query: 960  RVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXXXXXXXXXXXGDIDHSLGDMM 1139
            RVFGAAAKKC CGS +C GYIGGD  NAE+IVQ DSD            GD+   L  ++
Sbjct: 724  RVFGAAAKKCVCGSPRCLGYIGGDLQNAEVIVQADSDDDYPEPVVFCEDGDVGDELNKIL 783

Query: 1140 AKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXXXXXVECGPIQSIPGKDSVDEVRES 1319
            +  SS DV  +                               G +++     + + +++ 
Sbjct: 784  SARSSFDVTEIRTPGETPKNKYKLDEPF-------------TGNLENTTQTHTQNIMKQE 830

Query: 1320 TDSSSGFQHDHWRSEPASSVKIHTSSTEHIIGTPTSSPKSDVL-LVENASQKSLCGSIDS 1496
              +      D        S K H  S    +    SS   + L  + ++S + +  S+ S
Sbjct: 831  NSNMDNSVADFGLKIKEQSNKFHNESPSLSLKKKESSEAMEGLESLLHSSVRPVGNSLQS 890

Query: 1497 ----ASRVFEVDTEC-----------EPHSLTRMKISRPK------SVKNRKSSGISATV 1613
                A  + E+  EC            P+++      R K      S ++ KSS  S++V
Sbjct: 891  ENITAKTISEIKRECLDADKISSALPSPNAMLSKSSLRKKSGNGEASDESLKSSRRSSSV 950

Query: 1614 GRVLSKTQRPKVFSC------------KPKRLLEGSANGHLKDVEENLNELLDADGGICK 1757
             +  SK     + S             K K+    SANG  + VEE LNELLD DGGI K
Sbjct: 951  KKGKSKNSALNMTSAPDVNNKLQIPQPKFKKPTHDSANGRFEAVEEKLNELLDHDGGISK 1010

Query: 1758 KKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQM 1937
            ++DAS+ YLKLLLLTAASGD+ NG AIQSNR LSMILDA+LKTKSR VL+DII+KNGLQM
Sbjct: 1011 RRDASRCYLKLLLLTAASGDNCNGEAIQSNRDLSMILDALLKTKSRTVLVDIIDKNGLQM 1070

Query: 1938 LHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTE 2117
            LHN+MK  +R+F KIPILRKLLKVLE+LA +EIL+ E IN     PGVESFR SIL  TE
Sbjct: 1071 LHNIMKRSQREFNKIPILRKLLKVLEYLAAREILSHEHINGGPSRPGVESFRVSILGLTE 1130

Query: 2118 HDDNQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGSCHHWREQ-GAIAV 2294
            H D QVHQIAR+FRDRWI   R                      +  C   ++  G    
Sbjct: 1131 HIDKQVHQIARNFRDRWI--RRPLRKSSCIDRDDSQIDLRPSPRYNRCSPLQDHCGVKPS 1188

Query: 2295 ERLTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRSPKV 2474
            E   C+   +  S  +DA   + SS   VD      R RKRKSRWDQ             
Sbjct: 1189 ETEECTSYLMVESTTIDAGVLDGSSTSCVDGAPNGARKRKRKSRWDQ------------- 1235

Query: 2475 DYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDSTDDTPPG 2654
                             E+  + DQ I  N   D   ++DD  PPGFS    ++  +   
Sbjct: 1236 -----------------EAELNVDQRIETNAAADRTQDIDD-APPGFSIPRKASRISCGA 1277

Query: 2655 FSSVLC------------PVPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEI 2798
             SS  C            P+  GHLQ RF   LPV+YGIP S VQ+ G+PQ  + D W +
Sbjct: 1278 SSSADCSLQEPSCKKHPHPMVTGHLQQRFISRLPVSYGIPLSKVQQFGSPQKESCDAWGV 1337

Query: 2799 A 2801
            A
Sbjct: 1338 A 1338


>ref|XP_002520307.1| huntingtin interacting protein, putative [Ricinus communis]
            gi|223540526|gb|EEF42093.1| huntingtin interacting
            protein, putative [Ricinus communis]
          Length = 1746

 Score =  734 bits (1895), Expect = 0.0
 Identities = 483/1132 (42%), Positives = 591/1132 (52%), Gaps = 62/1132 (5%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLA 182
            GN    D  K      +A    + LD+  +   +Q+L    AWV CD+CLKWR IP  L 
Sbjct: 672  GNCIADDTQKFNPHDTIASVAVANLDMASSDAVDQHLPMDNAWVRCDECLKWRRIPVALV 731

Query: 183  DVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQISD-EEDARDGHLGFKGSGLK 359
            D I  TNC W CKDN DKAF DCSISQEKSNAEINAEL +SD +EDA D  L  +G   K
Sbjct: 732  DSIGQTNCHWICKDNMDKAFADCSISQEKSNAEINAELGLSDADEDACDVPLKNRGLEYK 791

Query: 360  PLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLNIECV 539
               AS+      + +N FLHRSRK QT+DEIMVCHCK P+DG LGC   CLNRMLNIECV
Sbjct: 792  RTAASKEHEFTRISTNQFLHRSRKTQTIDEIMVCHCKLPLDGRLGCRDECLNRMLNIECV 851

Query: 540  QGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLDLQTY 719
            +GTCPCGDLCSNQQ                                       VLD+ TY
Sbjct: 852  RGTCPCGDLCSNQQ---------------------------------------VLDMHTY 872

Query: 720  EARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGEICVG 899
            EARQ+EYA +GHKHFYFMTLNG+EVIDA AKGNLGRF+NHSCDPNCRTEKWVVNGEIC+G
Sbjct: 873  EARQREYAFQGHKHFYFMTLNGSEVIDACAKGNLGRFINHSCDPNCRTEKWVVNGEICIG 932

Query: 900  LFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDSDXXX 1079
            LFALR IKKGEELTFDYNYVRV GAAAK+C+CGS +CRGYIGGDP N E+I Q DSD   
Sbjct: 933  LFALRDIKKGEELTFDYNYVRVCGAAAKRCYCGSPQCRGYIGGDPTNTEVIDQVDSDEEF 992

Query: 1080 XXXXXXXXXGDIDHSLGDMMAKASSQD------VERVVNXXXXXXXXXXXXXXXXXXXXX 1241
                     G+  + + + ++++SS D       E + N                     
Sbjct: 993  LEPVMLEV-GEAGYRIRNRISRSSSCDDVELQVTESISNNRDKMDSSTTAAQKMEAATEI 1051

Query: 1242 XXXXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHIIGTP 1421
                      I  +     VD+++ES  SS   Q D    E   +VK   +S E I G  
Sbjct: 1052 KDSMNPSIPAISRLDSSLEVDDLKESFPSSRQ-QADDATIEFFPAVK-QENSIEQIQGLD 1109

Query: 1422 TSSPK-----SDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPKSVKNR 1586
            TSS       S   +V N   K+    +   SR F + T CE               K  
Sbjct: 1110 TSSATVLSKLSSDDMVANRKPKTDEKRVFVKSR-FLIKTSCESGL-----------AKKG 1157

Query: 1587 KSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKD 1766
            K   I + V +V     + +V S KPK+  +G+ +G  + VE  LNELLD DGGI K+KD
Sbjct: 1158 KFGSIHSNVNKVQMMACKSQVLSLKPKKFTDGTTSGRFEAVEGKLNELLDNDGGISKRKD 1217

Query: 1767 ASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHN 1946
            A+KGYLK LLLTAASG SGNG AIQSNR LSMILDA+LKTKSR VL+DIINKNGL+MLHN
Sbjct: 1218 AAKGYLKFLLLTAASGASGNGEAIQSNRDLSMILDALLKTKSRAVLIDIINKNGLRMLHN 1277

Query: 1947 MMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDD 2126
            M+K YR DFKK PILRKLLKVLE+LA +EILT E I      PG+ESFR+S+LS TEH+D
Sbjct: 1278 MLKQYRSDFKKTPILRKLLKVLEYLAVREILTPEHIYGGPPCPGMESFRKSMLSLTEHND 1337

Query: 2127 NQVHQIARSFRDRWIP-HARRFHXXXXXXXXXXXXXXXXXXHF-GSCHHWREQGAIAVER 2300
             QVHQIARSFRDRW P H R++                       S  H R+      E 
Sbjct: 1338 KQVHQIARSFRDRWFPRHGRKYSYMDRDDGKMECHRGSISNRVSASQDHLRDLTIRPTEV 1397

Query: 2301 LTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRSPKVDY 2480
            +  + Q    +  V+    E  S   V D    T+ RKRKSRWDQPA  K   RS + D 
Sbjct: 1398 IDGAMQPKVTTASVETAVNEGCSLHCVGDD---TKTRKRKSRWDQPAEEKPFRRSHQHDE 1454

Query: 2481 S-------EGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDST- 2636
                    E  +     +  KE S  +  ++   +     + N   +V    + L   T 
Sbjct: 1455 QRIQSGLLEQSRFNPPTDMGKEVSEHADKRSGENSCCPHCVRNYCRQVEADCADLGRQTI 1514

Query: 2637 -DDTPPGFSSVLCP--------------------VPLGHLQTRFNPHLPVAYGIPFSVVQ 2753
              D PPGFSS L P                    + +GH Q +FN  L V+YGIP  +VQ
Sbjct: 1515 QSDAPPGFSSPLNPPLVLPNASSTIIDGLTFPVDMVVGHPQRKFNSRLSVSYGIPLPIVQ 1574

Query: 2754 KSGTPQCGTADTWEIAXXXXXXXXXXXXXXXXXXXXT----------GDQEEARQGYQTH 2903
            + G PQ GT  +W IA                    T          G  EE +Q  Q  
Sbjct: 1575 QFGLPQHGTVGSWTIAPGMPFHPFPPLPPFPHHKNETPAAAISMAIDGTAEEGQQLRQD- 1633

Query: 2904 AGDQGLPCTSGASKNTV--------GTNQNMVHHERGPRNFLGKRCYWQKNWNGSKRRPP 3059
                  P  +  S N +        G N       R     LG+R + Q+ WN   + PP
Sbjct: 1634 -PPTCYPNENNLSTNAINQPDIVFPGENSQTFKRVRASSQDLGRRYFRQQKWN---KGPP 1689

Query: 3060 WARNIGRWGFRGNYPSHPRNGSSNINMEMVPNESSGEDVT-DVNHSVEYAGN 3212
            W   +  WG  G   S+ +    + ++  V NE      + DV+  +E AG+
Sbjct: 1690 WMHQVNGWGHLG---SNSKGVICSTDVVSVTNEPRNSYCSQDVSCRMEKAGD 1738


>gb|EPS67389.1| hypothetical protein M569_07380, partial [Genlisea aurea]
          Length = 872

 Score =  718 bits (1854), Expect = 0.0
 Identities = 416/909 (45%), Positives = 518/909 (56%), Gaps = 28/909 (3%)
 Frame = +3

Query: 120  RVAWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQ 299
            R AWV CDDC KWR IPA LAD IE T+C WTCK+N D+ F +CS+ QEKSN+EIN EL+
Sbjct: 1    RNAWVLCDDCQKWRRIPATLADQIEKTDCGWTCKENMDRDFAECSVPQEKSNSEINDELE 60

Query: 300  ISDEEDARDGHLGFKGSGLKPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPV 479
            + DE    D    F  S          S+  +++SN+FLHR RK QT+DE+MVCHCKP  
Sbjct: 61   LFDESAEEDTQETFVNSSNYQSKVPAQSSWSVIRSNIFLHRKRKTQTIDEVMVCHCKPSS 120

Query: 480  DGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLED 659
            +G  GCG++CLNRMLNIECV+GTCPCGDLCSNQQFQ+++YAKL  ++CGKKGYGLQ +ED
Sbjct: 121  EGRKGCGANCLNRMLNIECVRGTCPCGDLCSNQQFQKRKYAKLKRIKCGKKGYGLQAVED 180

Query: 660  ISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNH 839
            ISEG FLIEYVGEVLD+ TYEARQ+EYA  GH HFYFMTLNG+EVIDA AKGNLGR +NH
Sbjct: 181  ISEGRFLIEYVGEVLDMHTYEARQREYAMNGHVHFYFMTLNGSEVIDACAKGNLGRLINH 240

Query: 840  SCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGY 1019
            SCDPNCRTEKW+VNGE+CVGLFALR IKKGEE+TFDYNYVRVFGAAAKKC CGS  CRGY
Sbjct: 241  SCDPNCRTEKWMVNGEVCVGLFALRDIKKGEEVTFDYNYVRVFGAAAKKCVCGSANCRGY 300

Query: 1020 I-GGDPHNAEIIVQGDSDXXXXXXXXXXXXGDID----HSLGDMMAKASSQDVERVVNXX 1184
            I G   ++ +II     +              +D           AK  +    R V+  
Sbjct: 301  IGGDPTNSDQIIEDDSDEEFKEPISDLSKTEALDVLKVQPAKKCTAKKMTSAASRKVHTK 360

Query: 1185 XXXXXXXXXXXXXXXXXXXXXXXXVECGPIQSIP------GKDSVDEVRESTDSSSGFQH 1346
                                     E    +++P       +D V +V  S         
Sbjct: 361  KQELQDSIEEDIAVKVEQSDRSRISEDSLDETVPVTLDVESQDLVTQVHPSDLPLEFLSS 420

Query: 1347 DHWRSEPASSVKIHTSSTEHIIGTPTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTE 1526
            +   S+  SS  + T +       P   P  + L  +     +  G ++   +       
Sbjct: 421  EDISSQNTSSANVPTVTAS----APCEEPSPETLESKQMLDHAHIGGVEIPEKPG----- 471

Query: 1527 CEPHSLTRMKISRPKSVKNRKSSGISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKD 1706
                   R+K         R S  +   + +  S+  +      K K ++E S NGH + 
Sbjct: 472  ------LRVKSRFSSLPIKRGSRKMKVGIEKGTSEVNKLNASLDKSKNMVECSLNGHFEA 525

Query: 1707 VEENLNELLDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKT 1886
            VE+ LNELLD +GGI K+KDAS+GYLKLL LT ASG+SG+G AIQSNR LSMILDA+LKT
Sbjct: 526  VEKKLNELLDTEGGISKRKDASRGYLKLLFLTVASGNSGDGEAIQSNRDLSMILDALLKT 585

Query: 1887 KSRGVLLDIINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDS 2066
            +SR VL+DIINKNGLQMLHN+MK YR++F K PILRKLLKVLE+LA +EILT+E I+   
Sbjct: 586  RSRSVLVDIINKNGLQMLHNIMKRYRKEFIKTPILRKLLKVLEYLAMREILTLEHISGGP 645

Query: 2067 LHPGVESFRESILSFTEHDDNQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXX 2246
              PGVESF++SIL+ TEH D QVHQIARSFRDRWIP   R +                  
Sbjct: 646  ACPGVESFKDSILTLTEHSDKQVHQIARSFRDRWIPKPIRRNDFQQRLMHSSVLG----- 700

Query: 2247 HFGSCHHWREQGAIAVERLTCSDQ-----SVSMSNLVDARSEEASSPKLVDDQTMVTRPR 2411
               S H + ++      + +C D      S S S  V       SS       T  TR R
Sbjct: 701  --SSSHCFADRSG----KSSCGDSQPIAPSASASTAVPV---GLSSTLPCSPATSGTRIR 751

Query: 2412 KRKSRWDQPASPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNL 2591
            KRKSRWD PA                      Y   +  SN  GD+ +          N+
Sbjct: 752  KRKSRWDCPAE--------------------DYPNSRVRSNFMGDEKM----------NI 781

Query: 2592 DDEVPPGFS------------HLHDSTDDTPPGFSSVLCPVPLGHLQTRFNPHLPVAYGI 2735
            DD+VPPGFS               ++  D        L     G  Q++FN  +P++YGI
Sbjct: 782  DDDVPPGFSFNNCAPLNSCCNQERETKIDEEMHMKQNLWDTVCGEPQSKFNARMPLSYGI 841

Query: 2736 PFSVVQKSG 2762
            P+S VQ+ G
Sbjct: 842  PYSAVQQVG 850


>ref|XP_006390102.1| hypothetical protein EUTSA_v10017998mg [Eutrema salsugineum]
            gi|557086536|gb|ESQ27388.1| hypothetical protein
            EUTSA_v10017998mg [Eutrema salsugineum]
          Length = 1817

 Score =  701 bits (1809), Expect = 0.0
 Identities = 448/1082 (41%), Positives = 572/1082 (52%), Gaps = 48/1082 (4%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRTGIA-----EQNLVPRVAWVCCDDCLKWRCI 167
            G+L  VD+ ++      A+  T   D+I          E +     AWV CDDC KWR I
Sbjct: 821  GSLRDVDIGQT-----CAINGTKSSDVIHGEAVLDVAIEDSSSTESAWVRCDDCFKWRRI 875

Query: 168  PAVLADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQIS-DEEDARDGHLGFK 344
            PA + + I+ ++ RW C +N DK F  CSISQE SN +IN EL I  DE DA D     +
Sbjct: 876  PASVVESIDESS-RWICGNNSDKDFAHCSISQEMSNEDINEELGIGQDEADAYDCEAAKR 934

Query: 345  GSGL----KPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCL 512
            G       K    ++ +    +K+N FLHR+RK+QT+DEIMVCHCKPP DG LGCG  CL
Sbjct: 935  GKDKEQKSKRSSGNRKACFKAIKTNQFLHRNRKSQTIDEIMVCHCKPPPDGRLGCGEECL 994

Query: 513  NRMLNIECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYV 692
            NRMLNIEC+ GTCP GDLCSNQQFQ+++Y K    + GKKGYGL++LED+ EG FLIEYV
Sbjct: 995  NRMLNIECLHGTCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYV 1054

Query: 693  GEVLDLQTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKW 872
            GEVLD+Q+YE RQK+YAS G KHFYFMTLNGNEVIDA AKGNLGRF+NHSC+PNCRTEKW
Sbjct: 1055 GEVLDMQSYETRQKDYASMGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKW 1114

Query: 873  VVNGEICVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEII 1052
            +VNGEICVG+F+++ +KKG+ELTFDYNYVRVFGAAAKKC+CGS  CRGYIGGDP N ++I
Sbjct: 1115 MVNGEICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIGGDPLNGDVI 1174

Query: 1053 VQGDSDXXXXXXXXXXXXGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXX 1232
            VQ DSD               D   G+ +   +S+                         
Sbjct: 1175 VQSDSDEEYPELVILD-----DDESGEGIVDVTSR------------------------- 1204

Query: 1233 XXXXXXXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPASSVKIHTSSTEHII 1412
                    ++   +Q       V++ +E    +S  Q   +   P   V      TE   
Sbjct: 1205 ------IFIDGADVQLPQNSTKVNDFKELASDNSQSQSSVYVKLPEREVLPSLQLTEVSK 1258

Query: 1413 GTPTSSP----KSDVLLVENASQKSLCGS------IDSASRVFEV------DTECEPHSL 1544
             T T  P    + +V + +     SL  S       D A+    V      D +  P   
Sbjct: 1259 ETSTDMPVIAVRQEVHVEKKTKSTSLTSSSLSRLPSDGANADKTVKDGSGEDKKILPRPR 1318

Query: 1545 TRMKISRPKSVKNRKSSGISATVGRV----LSKTQRPKVFSCKPKRLLEGSANGHLKDVE 1712
             RMK SR      R   G+   V +     +SK Q+  V   K K   E S +G ++  E
Sbjct: 1319 PRMKTSRSSGSSKRDKGGLPTGVSKAQSIPVSKLQQQPV---KSKASEEVSPSGRIETFE 1375

Query: 1713 ENLNELLDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKS 1892
              LNELLDA GGI K++D++KGYLKLLLLTAAS  + N   IQSNR LSMILDA+LKTKS
Sbjct: 1376 GKLNELLDAVGGISKRRDSAKGYLKLLLLTAASRGNANNEGIQSNRDLSMILDALLKTKS 1435

Query: 1893 RGVLLDIINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLH 2072
            R VL+D+INKNGLQMLHN+MK YRRDFKK PILRKLLKVLE+LA +EIL +E I      
Sbjct: 1436 RTVLVDVINKNGLQMLHNIMKQYRRDFKKTPILRKLLKVLEYLATREILALEHIIRRPPC 1495

Query: 2073 PGVESFRESILSFTEHDDNQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHF 2252
             G+ESF++SIL+ TEHDD QVHQIAR+FRDRWIP   R                     F
Sbjct: 1496 AGMESFKDSILTLTEHDDKQVHQIARNFRDRWIPKPFRKPWRIDREERSESIRSPINSRF 1555

Query: 2253 GSC------HHWREQGAIAVERLTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRK 2414
             +       HH           ++    +   + + +A SE  SS          T  RK
Sbjct: 1556 RASQEPRYDHHSPRPAEPYASVISSRAATPETTPVSEASSEPNSS-------NPETNGRK 1608

Query: 2415 RKSRWDQPASPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLD 2594
            RKSRWDQP+  K+              RT+T          S  Q  + N ++D+     
Sbjct: 1609 RKSRWDQPSMSKE-------------HRTMT---------VSSQQTDVTNGNQDVQ---- 1642

Query: 2595 DEVPPGFSHLHDSTDDTPPGFSSVLCPVP---LGHLQTRFNPHLPVAYGIPFSVVQKSGT 2765
                          DD PPGFSS     P       Q +F   LPV+YGIP S+V + G+
Sbjct: 1643 --------------DDLPPGFSSPCMDAPDAVTAQPQQKFLSRLPVSYGIPLSIVHQFGS 1688

Query: 2766 PQCGTADTWEIA---XXXXXXXXXXXXXXXXXXXXTGDQEEARQGYQTHAGDQGLPCTSG 2936
            P      +W +A                        G    +  G  T   ++ LP T+ 
Sbjct: 1689 PGKEDPTSWSVAPGMPFYPFPPLPPVSHGEFFAKRNGAVCSSSMGNPT-CSNEILPATTV 1747

Query: 2937 ASKNT------VGTNQNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPWARNIGRWGFRGN 3098
             ++++       GT+    + +R   + +G   + Q+  N     PPW RN G W    N
Sbjct: 1748 PNQSSWNIPSVAGTDSTAPNRKREFSSDIGTSYFRQQKQN----VPPWMRNNG-WEKTVN 1802

Query: 3099 YP 3104
             P
Sbjct: 1803 SP 1804


>ref|XP_006300643.1| hypothetical protein CARUB_v10019650mg [Capsella rubella]
            gi|482569353|gb|EOA33541.1| hypothetical protein
            CARUB_v10019650mg [Capsella rubella]
          Length = 1811

 Score =  700 bits (1806), Expect = 0.0
 Identities = 433/1008 (42%), Positives = 551/1008 (54%), Gaps = 15/1008 (1%)
 Frame = +3

Query: 126  AWVCCDDCLKWRCIPAVLADVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQIS 305
            AWV CDDC KWR IP+ +   I+ ++ RW CK+N DK F DCS SQE SN +IN EL I 
Sbjct: 866  AWVRCDDCFKWRRIPSSVVGSIDESS-RWICKNNSDKKFADCSKSQEMSNEDINEELGIG 924

Query: 306  -DEEDARDGHLGFKGSGL----KPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCK 470
             DE DA D     +G       K L   Q +    +K+N FLHR+RK QT+DEIMVCHCK
Sbjct: 925  QDEADAYDCDAAKRGKEKEQKSKRLTGKQKACFKAIKTNQFLHRNRKNQTIDEIMVCHCK 984

Query: 471  PPVDGSLGCGSHCLNRMLNIECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQV 650
            PP DG LGCG  CLNRMLNIEC+QGTCP G+LCSNQQFQ+++Y K    + GKKGYGL++
Sbjct: 985  PPPDGRLGCGEECLNRMLNIECLQGTCPAGNLCSNQQFQKRKYVKFERFQSGKKGYGLRL 1044

Query: 651  LEDISEGNFLIEYVGEVLDLQTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRF 830
            LED+ EG FLIEYVGEVLD+Q+YE RQKEYA KG KHFYFMTLNGNEVIDA AKGNLGRF
Sbjct: 1045 LEDVREGQFLIEYVGEVLDMQSYETRQKEYACKGQKHFYFMTLNGNEVIDAGAKGNLGRF 1104

Query: 831  VNHSCDPNCRTEKWVVNGEICVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKC 1010
            +NHSC+PNCRTEKW+VNGEICVG+F+++ +KKG+ELTFDYNYVRVFGAAAKKC+CGS  C
Sbjct: 1105 INHSCEPNCRTEKWMVNGEICVGIFSMKDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHC 1164

Query: 1011 RGYIGGDPHNAEIIVQGDSDXXXXXXXXXXXXGDIDHSLGDMMAKASSQDVERVVNXXXX 1190
            RGYIGGDP N ++I+Q                 D D    +++     +  E +++    
Sbjct: 1165 RGYIGGDPLNGDVIIQS----------------DSDEEYPELVILDDDESGEGILDATSR 1208

Query: 1191 XXXXXXXXXXXXXXXXXXXXXXVECGPIQSIPGKDSVDEVRESTDSSSGFQHDHWRSEPA 1370
                                  +  G  Q+   + SV       +     Q      E +
Sbjct: 1209 TFIDDVDEQKPQNSEMVNGSKDLSPGNSQT---QSSVSVKLPEREILPPLQPTEVLKELS 1265

Query: 1371 SSVKIHTSSTEHIIGTPTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTR 1550
            S + I     E  +   T S  +    +   S         +   + E D +  P    R
Sbjct: 1266 SGMPISPLEQEVPVEKKTKSTSATSSSLSKLSADGTNADKTTKDGLSE-DKKILPRPRPR 1324

Query: 1551 MKISRPKSVKNRKSSGISATVGRV----LSKTQRPKVFSCKPKRLLEGSANGHLKDVEEN 1718
            MK SR      R+  G    V +     +SK Q+  +   K K   + S +G ++  E  
Sbjct: 1325 MKTSRSSGSSKREKGGSLPGVNKAQIIPVSKLQQQPI---KSKGSEDVSPSGRIETFEGK 1381

Query: 1719 LNELLDADGGICKKKDASKGYLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRG 1898
            LNELLDA GGI K++D++KGYLKLLLLTAAS  + N G IQSNR LSMILDA+LKTKS+ 
Sbjct: 1382 LNELLDAAGGISKRRDSAKGYLKLLLLTAASRGTNNEG-IQSNRDLSMILDALLKTKSKS 1440

Query: 1899 VLLDIINKNGLQMLHNMMKLYRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPG 2078
            VL+D+INKNGLQMLHN+MK YR DFKK PILRKLLKVLE+LA +EIL +E I        
Sbjct: 1441 VLVDVINKNGLQMLHNIMKQYRSDFKKTPILRKLLKVLEYLATREILALEHIIRRPPCAA 1500

Query: 2079 VESFRESILSFTEHDDNQVHQIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGS 2258
            +ESF++SILSFTEHDD QVHQIAR+FRDRWIP A R                     F +
Sbjct: 1501 MESFKDSILSFTEHDDKQVHQIARNFRDRWIPKAYRKPRRIDREERSESMRSPINSRFRA 1560

Query: 2259 CHHWR--EQGAIAVERLTCSDQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWD 2432
                R   Q     E +     S + +    + SE  S P   +        RKRKSRWD
Sbjct: 1561 SQEPRYDHQSPRPAEPVASVISSRAATPETASLSERYSEP---NSSLPEKNGRKRKSRWD 1617

Query: 2433 QPASPKKTGRSPKVDYSEGGQRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLD--DEVP 2606
            QP+  K+             QRT+T               IL ++ ++   N D  D++P
Sbjct: 1618 QPSMTKE-------------QRTMT---------------ILSHQTDETKGNQDTQDDLP 1649

Query: 2607 PGFSHLHDSTDDTPPGFSSVLCPVPLGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTAD 2786
            PGFS       D P  F+    P      Q +F   LPV+YGIP S++ + G+P      
Sbjct: 1650 PGFSL---PCTDVPDAFAITAQP------QQKFLSRLPVSYGIPLSIIHQFGSPDKEDPT 1700

Query: 2787 TWEIAXXXXXXXXXXXXXXXXXXXXTGDQEEARQGYQTHAGDQGLPCTSGASKNTVG--T 2960
            TW +A                     G+    R G    +  +   C++  S  T G  T
Sbjct: 1701 TWSVAPGMPFYPFPPLPPVSH-----GEFYAKRNGTACSSSMRIPTCSNEISPATTGIVT 1755

Query: 2961 NQNMVHHERGPRNFLGKRCYWQKNWNGSKRRPPWARNIGRWGFRGNYP 3104
            +    + +R   + +G   + Q+  N     PPW RN G WG   + P
Sbjct: 1756 DSTPPNRKREFSSDIGTSYFRQQKQN----IPPWMRNNG-WGKTASSP 1798


>gb|AAC34358.1| Hypothetical protein [Arabidopsis thaliana]
          Length = 1767

 Score =  677 bits (1746), Expect = 0.0
 Identities = 412/946 (43%), Positives = 529/946 (55%), Gaps = 13/946 (1%)
 Frame = +3

Query: 3    GNLEKVDVSKSEFSKEVALENTSCLDLIRTGIAEQNLVPRVAWVCCDDCLKWRCIPAVLA 182
            G L   D+ K+  +      + +  +++     E +     AWV CDDC KWR IPA + 
Sbjct: 823  GALLDADIGKTSATYGTISSDVTHGEMVVDVTIEDSYSTESAWVRCDDCFKWRRIPASVV 882

Query: 183  DVIEATNCRWTCKDNQDKAFGDCSISQEKSNAEINAELQIS-DEEDARDGHLGFKGSGL- 356
              I+ ++ RW C +N DK F DCS SQE SN EIN EL I  DE DA D     +G    
Sbjct: 883  GSIDESS-RWICMNNSDKRFADCSKSQEMSNEEINEELGIGQDEADAYDCDAAKRGKEKE 941

Query: 357  ---KPLLASQPSTLMLVKSNLFLHRSRKAQTMDEIMVCHCKPPVDGSLGCGSHCLNRMLN 527
               K L   Q +    +K+N FLHR+RK+QT+DEIMVCHCKP  DG LGCG  CLNRMLN
Sbjct: 942  QKSKRLTGKQKACFKAIKTNQFLHRNRKSQTIDEIMVCHCKPSPDGRLGCGEECLNRMLN 1001

Query: 528  IECVQGTCPCGDLCSNQQFQRKQYAKLSWLRCGKKGYGLQVLEDISEGNFLIEYVGEVLD 707
            IEC+QGTCP GDLCSNQQFQ+++Y K    + GKKGYGL++LED+ EG FLIEYVGEVLD
Sbjct: 1002 IECLQGTCPAGDLCSNQQFQKRKYVKFERFQSGKKGYGLRLLEDVREGQFLIEYVGEVLD 1061

Query: 708  LQTYEARQKEYASKGHKHFYFMTLNGNEVIDASAKGNLGRFVNHSCDPNCRTEKWVVNGE 887
            +Q+YE RQKEYA KG KHFYFMTLNGNEVIDA AKGNLGRF+NHSC+PNCRTEKW+VNGE
Sbjct: 1062 MQSYETRQKEYAFKGQKHFYFMTLNGNEVIDAGAKGNLGRFINHSCEPNCRTEKWMVNGE 1121

Query: 888  ICVGLFALRSIKKGEELTFDYNYVRVFGAAAKKCHCGSRKCRGYIGGDPHNAEIIVQGDS 1067
            ICVG+F+++ +KKG+ELTFDYNYVRVFGAAAKKC+CGS  CRGYIGGDP N ++I+Q DS
Sbjct: 1122 ICVGIFSMQDLKKGQELTFDYNYVRVFGAAAKKCYCGSSHCRGYIGGDPLNGDVIIQSDS 1181

Query: 1068 DXXXXXXXXXXXXGDIDHSLGDMMAKASSQDVERVVNXXXXXXXXXXXXXXXXXXXXXXX 1247
                            D    +++     +  E ++                        
Sbjct: 1182 ----------------DEEYPELVILDDDESGEGIL------------------------ 1201

Query: 1248 XXXVECGPIQSIPGKDSVDEVRESTDSSSGFQH---DHWRSEPASSVKIHTSSTEHIIGT 1418
                  G        D+ +++ +S +  +G++    D+ +++ + SVK+     E  I  
Sbjct: 1202 ------GATSRTFTDDADEQMPQSFEKVNGYKDLAPDNTQTQSSVSVKL----PEREIPP 1251

Query: 1419 PTSSPKSDVLLVENASQKSLCGSIDSASRVFEVDTECEPHSLTRMKISRPKSVKNRKSSG 1598
            P   P ++VL       K L   I   +   EV  E +  S +    S  +      +S 
Sbjct: 1252 PLLQP-TEVL-------KELSSGISITAVQQEVPAEKKTKSTSPTSSSLSRMSPGGTNSD 1303

Query: 1599 ISATVGRVLSKTQRPKVFSCKPKRLLEGSANGHLKDVEENLNELLDADGGICKKKDASKG 1778
             +   G    K   P+    +P+     S+    +D    LNELLDA GGI K++D++KG
Sbjct: 1304 KTTKHGSGEDKKILPRP---RPRMKTSRSSESSKRDKGGKLNELLDAVGGISKRRDSAKG 1360

Query: 1779 YLKLLLLTAASGDSGNGGAIQSNRILSMILDAMLKTKSRGVLLDIINKNGLQMLHNMMKL 1958
            YLKLLLLTAAS  +   G I SNR LSMILDA+LKTKS+ VL+DIINKNGLQMLHN+MK 
Sbjct: 1361 YLKLLLLTAASRGTDEEG-IYSNRDLSMILDALLKTKSKSVLVDIINKNGLQMLHNIMKQ 1419

Query: 1959 YRRDFKKIPILRKLLKVLEFLAEKEILTVERINVDSLHPGVESFRESILSFTEHDDNQVH 2138
            YR DFK+IPI+RKLLKVLE+LA ++IL +E I       G+ESF++S+LSFTEHDD  VH
Sbjct: 1420 YRGDFKRIPIIRKLLKVLEYLATRKILALEHIIRRPPFAGMESFKDSVLSFTEHDDYTVH 1479

Query: 2139 QIARSFRDRWIPHARRFHXXXXXXXXXXXXXXXXXXHFGSCHHWR--EQGAIAVERLTCS 2312
             IARSFRDRWIP   R                     F +    R   Q     E     
Sbjct: 1480 NIARSFRDRWIPKHFRKPWRINREERSESMRSPINRRFRASQEPRYDHQSPRPAEPAASV 1539

Query: 2313 DQSVSMSNLVDARSEEASSPKLVDDQTMVTRPRKRKSRWDQPASPKKTGRSPKVDYSEGG 2492
              S + +    + SE  S P   +     T  RKRKSRWDQP+  K+             
Sbjct: 1540 TSSKAATPETASVSEGYSEP---NSGLPETNGRKRKSRWDQPSKTKE------------- 1583

Query: 2493 QRTLTYEAKKEESNCSGDQNILRNRDEDMMHNLDDEVPPGFSHLHDSTDDTPPGFSSVLC 2672
            QR +T  +++ +   +G+Q                          D  DD PPGFSS   
Sbjct: 1584 QRIMTILSQQTDET-NGNQ--------------------------DVQDDLPPGFSSPCT 1616

Query: 2673 PVP---LGHLQTRFNPHLPVAYGIPFSVVQKSGTPQCGTADTWEIA 2801
             VP       Q +F   LPV+YGIP S+V + G+P      TW +A
Sbjct: 1617 DVPDAITAQPQQKFLSRLPVSYGIPLSIVHQFGSPGKEDPTTWSVA 1662

