BLASTX nr result

ID: Ephedra27_contig00006545 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00006545
         (4593 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21104.3| unnamed protein product [Vitis vinifera]              540   e-150
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   526   e-146
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   526   e-146
ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citr...   526   e-146
gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]    520   e-144
gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma caca...   520   e-144
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   514   e-142
ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812...   499   e-138
ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812...   499   e-138
ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812...   499   e-138
ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812...   499   e-138
ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812...   499   e-138
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...   493   e-136
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   491   e-135
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   487   e-134
gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus...   484   e-133
gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus...   484   e-133
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   483   e-133

>emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  540 bits (1392), Expect = e-150
 Identities = 282/569 (49%), Positives = 344/569 (60%), Gaps = 1/569 (0%)
 Frame = +3

Query: 2574 EDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMH 2753
            ED+      SY  N     K       +S  +  CCVCG SN++ +N L++C  CLI++H
Sbjct: 603  EDSKHSMSESYKVNSKKSIKEHRFESFISDTDAFCCVCGSSNKDEINCLLECSRCLIRVH 662

Query: 2754 QACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKE 2933
            QACYG+S++PK  W CR C+++  NIVCVLCGYGGGA+T A RT N++KSLL  W ++ E
Sbjct: 663  QACYGVSRVPKGRWYCRPCRTSSKNIVCVLCGYGGGAMTRALRTRNIVKSLLKVWNIETE 722

Query: 2934 DNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKN 3113
               K+       ++P   + D L             +S  +GL                 
Sbjct: 723  SWPKS-------SVPPEALQDKL----------GTLDSSRSGLE---------------- 749

Query: 3114 ANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCV 3293
                                   N  F + NTITAG+ + +V QWVHMVCGLWTPGT+C 
Sbjct: 750  -----------------------NESFPIHNTITAGILDSTVKQWVHMVCGLWTPGTRCP 786

Query: 3294 NVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXX 3473
            NV TM  FDV G   PR   +CS+CNRPGG CI+CRV  C + FHPWCAHRKGLLQS   
Sbjct: 787  NVDTMSAFDVSGASRPRANVICSICNRPGGSCIKCRVLNCLVPFHPWCAHRKGLLQSEVE 846

Query: 3474 XXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEE 3653
                    FYGRC+ HA  A   C++     N+      +     CARTEGYKG K  E 
Sbjct: 847  GVDNENVGFYGRCMLHA--AHPSCELDSDPINIETDSTGEKEL-TCARTEGYKGRKQ-EG 902

Query: 3654 RKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQE 3830
             +  L  Q+  +    V QEQ+NAWLHING+KS ++ + K P  +V+ D R+E+ RY+Q 
Sbjct: 903  FRHNLNFQSNGNGGCLVPQEQLNAWLHINGQKSCTKGLPKTPISDVEYDCRKEFARYKQA 962

Query: 3831 KRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEG 4010
            K WK LVVYKSGIHALGLYT+ FIS+G MVVEYVGEIVGLRVADKRE+ Y S  K++ + 
Sbjct: 963  KGWKHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKT 1022

Query: 4011 ACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEI 4190
            ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC                 ERDIN GEEI
Sbjct: 1023 ACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEI 1082

Query: 4191 TYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            TYDY+FNHEDEGKKIPCFC SR CRRYLN
Sbjct: 1083 TYDYHFNHEDEGKKIPCFCNSRNCRRYLN 1111


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  526 bits (1356), Expect = e-146
 Identities = 272/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2855
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 1607 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1666

Query: 2856 GGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 3026
            GGA+T A R+  ++K LL  W ++ +   KN             ++ A ++         
Sbjct: 1667 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1714

Query: 3027 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3206
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 1715 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1765

Query: 3207 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3386
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 1766 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1825

Query: 3387 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3566
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 1826 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1885

Query: 3567 NLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3746
             +    + +     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 1886 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1941

Query: 3747 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3923
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 1942 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2001

Query: 3924 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4103
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 2002 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2061

Query: 4104 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2062 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2119


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  526 bits (1356), Expect = e-146
 Identities = 272/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2855
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 1608 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1667

Query: 2856 GGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 3026
            GGA+T A R+  ++K LL  W ++ +   KN             ++ A ++         
Sbjct: 1668 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1715

Query: 3027 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3206
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 1716 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1766

Query: 3207 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3386
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 1767 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1826

Query: 3387 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3566
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 1827 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1886

Query: 3567 NLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3746
             +    + +     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 1887 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1942

Query: 3747 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3923
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 1943 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2002

Query: 3924 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4103
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 2003 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2062

Query: 4104 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2063 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2120


>ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citrus clementina]
            gi|557553575|gb|ESR63589.1| hypothetical protein
            CICLE_v10010421mg [Citrus clementina]
          Length = 765

 Score =  526 bits (1356), Expect = e-146
 Identities = 272/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2855
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 253  CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 312

Query: 2856 GGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 3026
            GGA+T A R+  ++K LL  W ++ +   KN             ++ A ++         
Sbjct: 313  GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 360

Query: 3027 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3206
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 361  SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 411

Query: 3207 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3386
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 412  SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 471

Query: 3387 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3566
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 472  CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 531

Query: 3567 NLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3746
             +    + +     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 532  EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 587

Query: 3747 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3923
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 588  KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 647

Query: 3924 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4103
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 648  EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 707

Query: 4104 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 708  NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 765


>gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  520 bits (1340), Expect = e-144
 Identities = 287/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%)
 Frame = +3

Query: 2328 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2498
            C + I +F+  S +  +  D  +E    I   I    +N  C   +KRSL + T +G  +
Sbjct: 1480 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1539

Query: 2499 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2672
              D  S  +        K+K   +L++ G +    +  + +   K   +    ++   + 
Sbjct: 1540 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1597

Query: 2673 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2852
             CCVCG SN++  N L++C  C I++HQACYGI K+P+  W CR C+++  + VCVLCGY
Sbjct: 1598 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1657

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A R+   +K LL  W ++ E   K+       N     + D   +     F  
Sbjct: 1658 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1711

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
                              ++ +D   +  A+ ++ D  + +         +++  + N++
Sbjct: 1712 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1752

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            TAGV + +V QWVHMVCGLWTPGT+C NV TM  FDV GV   R   VCS+CNRPGG CI
Sbjct: 1753 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1812

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            QCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA      C+      + 
Sbjct: 1813 QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1870

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3743
             L P  + R   CARTEG+KG K    W     + +++T       V QEQ+NAW+HING
Sbjct: 1871 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1925

Query: 3744 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3920
            +KS  + + K P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV
Sbjct: 1926 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1985

Query: 3921 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4100
            VEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC 
Sbjct: 1986 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 2045

Query: 4101 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            PNC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2046 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2104


>gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score =  520 bits (1340), Expect = e-144
 Identities = 287/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%)
 Frame = +3

Query: 2328 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2498
            C + I +F+  S +  +  D  +E    I   I    +N  C   +KRSL + T +G  +
Sbjct: 1114 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1173

Query: 2499 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2672
              D  S  +        K+K   +L++ G +    +  + +   K   +    ++   + 
Sbjct: 1174 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1231

Query: 2673 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2852
             CCVCG SN++  N L++C  C I++HQACYGI K+P+  W CR C+++  + VCVLCGY
Sbjct: 1232 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1291

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A R+   +K LL  W ++ E   K+       N     + D   +     F  
Sbjct: 1292 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1345

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
                              ++ +D   +  A+ ++ D  + +         +++  + N++
Sbjct: 1346 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1386

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            TAGV + +V QWVHMVCGLWTPGT+C NV TM  FDV GV   R   VCS+CNRPGG CI
Sbjct: 1387 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1446

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            QCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA      C+      + 
Sbjct: 1447 QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1504

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3743
             L P  + R   CARTEG+KG K    W     + +++T       V QEQ+NAW+HING
Sbjct: 1505 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1559

Query: 3744 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3920
            +KS  + + K P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV
Sbjct: 1560 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1619

Query: 3921 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4100
            VEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC 
Sbjct: 1620 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 1679

Query: 4101 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
            PNC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 1680 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 1738


>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  514 bits (1325), Expect = e-142
 Identities = 277/561 (49%), Positives = 347/561 (61%), Gaps = 1/561 (0%)
 Frame = +3

Query: 2544 SSKLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLV 2723
            S KL   N  E  G I      +      K R+    +   +  CCVCG S+++  N ++
Sbjct: 1568 SEKLCLENVKETQGPIDVSHEVKGKKSSTKCRKRKAFILDSDVFCCVCGGSDKDDFNCIL 1627

Query: 2724 QCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKS 2903
            +C  CLIK+HQACYG+ K PK  W CR C++++ +IVCVLCGY GGA+T A R+ N++K+
Sbjct: 1628 ECSQCLIKVHQACYGVLKAPKGRWCCRPCRADIKDIVCVLCGYSGGAMTRALRSRNIVKN 1687

Query: 2904 LLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVA 3083
            LL  WK+KK   S +      P   +    D L   S +   G  R   +  +S   P  
Sbjct: 1688 LLQTWKIKKGRKSLD------PFHLSDSKHDDLNGLSGKLGGGPSRLEKMDSISAMKPGT 1741

Query: 3084 CVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVC 3263
                      AN      DA S ++  +  V   + F+V NTITA V +P+VTQW+HMVC
Sbjct: 1742 LERVSRVMMKANT----LDATSIMRNADILV---DDFQVHNTITAAVLDPNVTQWLHMVC 1794

Query: 3264 GLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAH 3443
            GLW PGT+C NV TM  FDV GV  P+R  VCS+C RPGG CI+CRVA C + FHPWCAH
Sbjct: 1795 GLWMPGTRCPNVDTMSAFDVSGVSPPKRNTVCSICKRPGGSCIRCRVADCSVFFHPWCAH 1854

Query: 3444 RKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGNCARTE 3623
            +KGLLQS           FYGRC+ HA +     K V H  N  ++     +   CARTE
Sbjct: 1855 QKGLLQSEIEGVDNENVGFYGRCLFHAVNINCLTKPV-HLVNDKVEDHSDNKDPTCARTE 1913

Query: 3624 GYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDY 3800
            GYKG K  E     L+ Q+ +++   V QEQINAWLHING+KS +R ++K P  + + D 
Sbjct: 1914 GYKGRKK-EGLHYGLRGQSKDNSGCLVPQEQINAWLHINGQKSCTRGLIKPPASDTEYDC 1972

Query: 3801 RREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYY 3980
            R+EY RY+Q K WK+LVVYKSGIHALGLYT++FI +G MVVEYVGEIVGLRVADKREA Y
Sbjct: 1973 RKEYARYKQSKGWKQLVVYKSGIHALGLYTSQFIFRGAMVVEYVGEIVGLRVADKREAEY 2032

Query: 3981 HSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXX 4160
            HS  +++ E ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC                 
Sbjct: 2033 HSGRRIQYESACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITIRNEKKVVFFA 2092

Query: 4161 ERDINAGEEITYDYNFNHEDE 4223
            ERDIN GEEITYDY+FN+EDE
Sbjct: 2093 ERDINPGEEITYDYHFNNEDE 2113



 Score = 70.9 bits (172), Expect = 5e-09
 Identities = 46/110 (41%), Positives = 66/110 (60%), Gaps = 5/110 (4%)
 Frame = +3

Query: 759  EHQMSNVCSEGSDSPVSEFSD-AVGHTDMASGKLTETDVVDEGSGIGK-CSSDGIDNGVW 932
            E QMSNVCSE S + V+EFS     + D+ S + T  ++VDEGSGI K CSSD  + G+W
Sbjct: 1091 EQQMSNVCSESSAAVVTEFSGRCFVNLDLGSTRSTCDEIVDEGSGIEKCCSSDAHNAGMW 1150

Query: 933  TRSKQAYTGSGNHLLGTSVQLTNLSSDVYNGSKVKTSISFKR---PVNSP 1073
              +    +G+ + +LG S  L + S+D  N  KV++S+  K+   P  SP
Sbjct: 1151 AETAN-LSGNTDAVLGRSSTLPSHSTDPINNLKVRSSLRLKKVRLPFGSP 1199


>ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812602 isoform X6 [Glycine
            max]
          Length = 1870

 Score =  499 bits (1284), Expect = e-138
 Identities = 268/536 (50%), Positives = 329/536 (61%), Gaps = 2/536 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2852
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1371 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1430

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A  +  ++KSLL  W  +K+   KN                     S E F  
Sbjct: 1431 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1469

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1470 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1525

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1526 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1585

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +    + 
Sbjct: 1586 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDPLDE 1643

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3749
                + +  T  CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1644 IGSQEEKEFT--CARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1695

Query: 3750 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3929
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1696 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1755

Query: 3930 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4109
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1756 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1815

Query: 4110 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1816 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1870


>ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812602 isoform X5 [Glycine
            max]
          Length = 1872

 Score =  499 bits (1284), Expect = e-138
 Identities = 268/536 (50%), Positives = 329/536 (61%), Gaps = 2/536 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2852
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1373 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1432

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A  +  ++KSLL  W  +K+   KN                     S E F  
Sbjct: 1433 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1471

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1472 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1527

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1528 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1587

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +    + 
Sbjct: 1588 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDPLDE 1645

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3749
                + +  T  CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1646 IGSQEEKEFT--CARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1697

Query: 3750 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3929
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1698 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1757

Query: 3930 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4109
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1758 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1817

Query: 4110 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1818 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1872


>ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812602 isoform X3 [Glycine
            max]
          Length = 2006

 Score =  499 bits (1284), Expect = e-138
 Identities = 268/536 (50%), Positives = 329/536 (61%), Gaps = 2/536 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2852
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1507 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1566

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A  +  ++KSLL  W  +K+   KN                     S E F  
Sbjct: 1567 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1605

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1606 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1661

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1662 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1721

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +    + 
Sbjct: 1722 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDPLDE 1779

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3749
                + +  T  CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1780 IGSQEEKEFT--CARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1831

Query: 3750 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3929
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1832 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1891

Query: 3930 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4109
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1892 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1951

Query: 4110 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1952 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2006


>ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812602 isoform X2 [Glycine
            max]
          Length = 2007

 Score =  499 bits (1284), Expect = e-138
 Identities = 268/536 (50%), Positives = 329/536 (61%), Gaps = 2/536 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2852
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1508 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1567

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A  +  ++KSLL  W  +K+   KN                     S E F  
Sbjct: 1568 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1606

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1607 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1662

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1663 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1722

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +    + 
Sbjct: 1723 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDPLDE 1780

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3749
                + +  T  CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1781 IGSQEEKEFT--CARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1832

Query: 3750 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3929
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1833 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1892

Query: 3930 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4109
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1893 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1952

Query: 4110 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1953 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2007


>ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812602 isoform X1 [Glycine
            max]
          Length = 2008

 Score =  499 bits (1284), Expect = e-138
 Identities = 268/536 (50%), Positives = 329/536 (61%), Gaps = 2/536 (0%)
 Frame = +3

Query: 2676 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2852
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1509 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1568

Query: 2853 GGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 3032
            GGGA+T A  +  ++KSLL  W  +K+   KN                     S E F  
Sbjct: 1569 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1607

Query: 3033 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3212
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1608 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1663

Query: 3213 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3392
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1664 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1723

Query: 3393 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3572
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +    + 
Sbjct: 1724 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDPLDE 1781

Query: 3573 ALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3749
                + +  T  CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1782 IGSQEEKEFT--CARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1833

Query: 3750 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3929
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1834 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1893

Query: 3930 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4109
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1894 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1953

Query: 4110 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4277
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1954 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2008


>ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine
            max]
          Length = 2032

 Score =  493 bits (1268), Expect = e-136
 Identities = 317/860 (36%), Positives = 438/860 (50%), Gaps = 53/860 (6%)
 Frame = +3

Query: 1857 SSMKVSSLQRMKKCRLLERQSNMSEFSTECNRKTYDRVDTP-----KEVRKRSLCKLSNE 2021
            SS  +S   +M     L++ SN S F    N++ +    +        + K    K+  E
Sbjct: 1217 SSSSLSREMQMHSLSSLKKSSNKSSFVQPSNKQKHTAYSSKFLSCKNRLNKHQSFKVGYE 1276

Query: 2022 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--C 2147
            S S                K++   SSD     ++       +  + +E +N  L P  C
Sbjct: 1277 SESSSDAEFHTLPGVSGTKKLEKDLSSDCFEQFQM-------QELAYEEPENDKLRPFSC 1329

Query: 2148 NNQS-----------------SKMHIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLG 2276
              ++                 S  H+  + +    IV+++    +       ++    L 
Sbjct: 1330 RKENAHRITRPVVVCGKYGEISNGHLAREVQKPAKIVSLSKVLKSSKRCMGHTNGKPRLT 1389

Query: 2277 SGRLKIQHTNNELMSA-ICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNN---PA 2444
            S + K +  + E  S   C  P  K +  ++  T N    NET  D++++        PA
Sbjct: 1390 SKK-KWKRLSIETSSGHCCRNPGLKIKEHNE--TENTIFLNETNVDVSMEDLERGGKPPA 1446

Query: 2445 CNVSKKRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYS 2609
                K+ +     +   N+ +I     ++++R   S++    K    ++   C  +    
Sbjct: 1447 VYKGKRDAKAKQGDSVGNRANISLKVKNKEIRKQRSINELTAKETKVMDMTKCAQDQEPG 1506

Query: 2610 ENPLVKRKRREGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIP 2783
                  R   +G  ++S    +  CCVC  S  + +N L++C  CLI++HQACYG+S +P
Sbjct: 1507 LCGTKSRNSIQGHTSISTINSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGVSTLP 1566

Query: 2784 K-SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGN 2960
            K S W CR C++N  NI CVLCGYGGGA+T A  +  ++KSLL  W  +K+       G 
Sbjct: 1567 KKSSWCCRPCRTNSKNIACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKD-------GM 1619

Query: 2961 PCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETD 3140
            P               R        E+E D    SK       E   K K  + +    +
Sbjct: 1620 P---------------RDTTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTDLMN 1664

Query: 3141 ANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFD 3320
              S     +T  + +N FKV N+IT GV +P+V QW+HMVCGLWTP T+C NV TM  FD
Sbjct: 1665 QISTNHIPHTPTSFSN-FKVHNSITEGVLDPTVKQWIHMVCGLWTPRTRCPNVDTMSAFD 1723

Query: 3321 VFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXF 3500
            V GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH+K LLQS           F
Sbjct: 1724 VSGVSRPRADVVCSICNRWGGSCIECRIADCSVKFHPWCAHQKNLLQSETEGINDEKIGF 1783

Query: 3501 YGRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQT 3680
            YGRC+ H  + +  C  +    +     + +  T  CAR EGYKG      R +  Q   
Sbjct: 1784 YGRCMLHTIEPR--CLFIYDPLDEIGSQEQKEFT--CARVEGYKG-----RRWDGFQNNQ 1834

Query: 3681 FNDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVY 3857
                   V +EQ+NAW+HING+K  S+ + K P ++++ D R+EY RY+Q K WK LVVY
Sbjct: 1835 CQGGC-LVPEEQLNAWIHINGQKLCSQGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVY 1893

Query: 3858 KSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDK 4037
            KS IHALGLYT+ FIS+GEMVVEY+GEIVGLRVADKRE  Y S  K++ + ACYFFRIDK
Sbjct: 1894 KSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACYFFRIDK 1953

Query: 4038 ENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHE 4217
            E+IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYDY+FNHE
Sbjct: 1954 EHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHE 2013

Query: 4218 DEGKKIPCFCKSRICRRYLN 4277
            DEG KIPC+C S+ CRRY+N
Sbjct: 2014 DEG-KIPCYCYSKNCRRYMN 2032


>ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
            gi|223540953|gb|EEF42511.1| mixed-lineage leukemia
            protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  491 bits (1263), Expect = e-135
 Identities = 284/616 (46%), Positives = 359/616 (58%), Gaps = 14/616 (2%)
 Frame = +3

Query: 2454 SKKRSL-QSTAEGAMNQGDIVSEQVR----GSCSMSSKLKNFNALEDAGCIFEGSYSENP 2618
            ++KRSL + T +G  +   +VS +          +   L+N     D      GS   +P
Sbjct: 545  TRKRSLYELTLKGKSSSPKMVSRKKNFKYVPKMKLGKTLRNSEKSHD-----NGSQKVDP 599

Query: 2619 LVKRKRREGSD-AVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGW 2795
              KR  RE    +++  ++ C VC  SN++ +N L++C+ C I++HQACYG+S++PK  W
Sbjct: 600  --KRCAREQKHLSITDMDSFCSVCRSSNKDEVNCLLECRRCSIRVHQACYGVSRVPKGHW 657

Query: 2796 KCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNL 2975
             CR C+++  +IVCVLCGYGGGA+T A R+  ++K LL  W ++ E  +KN         
Sbjct: 658  YCRPCRTSAKDIVCVLCGYGGGAMTLALRSRTIVKGLLKAWNLEIESVAKN--------- 708

Query: 2976 PTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNI 3155
                      I SPE       E  +   S P P        +  N   +   T  N ++
Sbjct: 709  ---------AISSPEILH---HEMSMLHSSGPGPENRSYPVLRPVNIEPST-STVCNKDV 755

Query: 3156 QEG-----NTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFD 3320
            Q       N+   L+N  KV+N+ITAGV + +V QWVHMVCGLWTPGT+C NV TM  FD
Sbjct: 756  QNHLDILPNSLGHLSN-LKVNNSITAGVLDSTVKQWVHMVCGLWTPGTRCPNVNTMSAFD 814

Query: 3321 VFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXF 3500
            V G   PR   VCS+C+RPGG CIQCRVA C I FHPWCAH+KGLLQS           F
Sbjct: 815  VSGASCPRANVVCSICDRPGGSCIQCRVANCSIQFHPWCAHQKGLLQSEAEGVDNENVGF 874

Query: 3501 YGRCIAHAE--DAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQK 3674
            YGRC+ HA     +  C     E        P  +  +CARTEGYKG K  +        
Sbjct: 875  YGRCVLHATYPTIESACDSAIFEAGY-----PAEKEVSCARTEGYKGRKR-DGFWHNTNS 928

Query: 3675 QTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLV 3851
            Q+   +   V QEQ +AW+HING+KS ++ ++K P  E + D R+EY RY+Q K WK LV
Sbjct: 929  QSKGKSGCLVPQEQFDAWVHINGQKSCAQGILKLPMSEKEYDCRKEYTRYKQGKAWKHLV 988

Query: 3852 VYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRI 4031
            VYKSGIHALGLYTA FIS+GEMVVEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRI
Sbjct: 989  VYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQSGRKLQYKSACYFFRI 1048

Query: 4032 DKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFN 4211
            DKENIIDAT KGGIARFVNHSC PNC                 ERDI  GEEITYDY+FN
Sbjct: 1049 DKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAERDIYPGEEITYDYHFN 1108

Query: 4212 HEDEGKKIPCFCKSRI 4259
            HEDE +K   F   RI
Sbjct: 1109 HEDEVQKFWKFSAVRI 1124


>ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine
            max]
          Length = 2033

 Score =  489 bits (1258), Expect = e-135
 Identities = 318/863 (36%), Positives = 439/863 (50%), Gaps = 56/863 (6%)
 Frame = +3

Query: 1857 SSMKVSSLQRMKKCRLLERQSNMSEFSTECNRKTYDRVDTP-----KEVRKRSLCKLSNE 2021
            SS  +S   +M     L++ SN S F    N++ +    +        + K    K+  E
Sbjct: 1215 SSSSLSREMQMHSLSSLKKSSNKSSFVQPSNKQKHTAYSSKFLSCKNRLNKHQSFKVGYE 1274

Query: 2022 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--C 2147
            S S                K++   SSD     ++       +  + +E +N  L P  C
Sbjct: 1275 SESSSDAEFHTLPGVSGTKKLEKDLSSDCFEQFQM-------QELAYEEPENDKLRPFSC 1327

Query: 2148 NNQS-----------------SKMHIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLG 2276
              ++                 S  H+  + +    IV+++    +       ++    L 
Sbjct: 1328 RKENAHRITRPVVVCGKYGEISNGHLAREVQKPAKIVSLSKVLKSSKRCMGHTNGKPRLT 1387

Query: 2277 SGRLKIQHTNNELMSA-ICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNN---PA 2444
            S + K +  + E  S   C  P  K +  ++  T N    NET  D++++        PA
Sbjct: 1388 SKK-KWKRLSIETSSGHCCRNPGLKIKEHNE--TENTIFLNETNVDVSMEDLERGGKPPA 1444

Query: 2445 CNVSKKRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYS 2609
                K+ +     +   N+ +I     ++++R   S++    K    ++   C  +    
Sbjct: 1445 VYKGKRDAKAKQGDSVGNRANISLKVKNKEIRKQRSINELTAKETKVMDMTKCAQDQEPG 1504

Query: 2610 ENPLVKRKRREGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIP 2783
                  R   +G  ++S    +  CCVC  S  + +N L++C  CLI++HQACYG+S +P
Sbjct: 1505 LCGTKSRNSIQGHTSISTINSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGVSTLP 1564

Query: 2784 K-SGWKCRACKSNLTNIV---CVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNL 2951
            K S W CR C++N  NIV   CVLCGYGGGA+T A  +  ++KSLL  W  +K+      
Sbjct: 1565 KKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKD------ 1618

Query: 2952 KGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQF 3131
             G P               R        E+E D    SK       E   K K  + +  
Sbjct: 1619 -GMP---------------RDTTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTD 1662

Query: 3132 ETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMG 3311
              +  S     +T  + +N FKV N+IT GV +P+V QW+HMVCGLWTP T+C NV TM 
Sbjct: 1663 LMNQISTNHIPHTPTSFSN-FKVHNSITEGVLDPTVKQWIHMVCGLWTPRTRCPNVDTMS 1721

Query: 3312 VFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXX 3491
             FDV GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH+K LLQS         
Sbjct: 1722 AFDVSGVSRPRADVVCSICNRWGGSCIECRIADCSVKFHPWCAHQKNLLQSETEGINDEK 1781

Query: 3492 XXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQ 3671
              FYGRC+ H  + +  C  +    +     + +  T  CAR EGYKG      R +  Q
Sbjct: 1782 IGFYGRCMLHTIEPR--CLFIYDPLDEIGSQEQKEFT--CARVEGYKG-----RRWDGFQ 1832

Query: 3672 KQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRL 3848
                      V +EQ+NAW+HING+K  S+ + K P ++++ D R+EY RY+Q K WK L
Sbjct: 1833 NNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLPKFPDLDIEHDCRKEYARYKQAKGWKHL 1891

Query: 3849 VVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFR 4028
            VVYKS IHALGLYT+ FIS+GEMVVEY+GEIVGLRVADKRE  Y S  K++ + ACYFFR
Sbjct: 1892 VVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACYFFR 1951

Query: 4029 IDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNF 4208
            IDKE+IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYDY+F
Sbjct: 1952 IDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYDYHF 2011

Query: 4209 NHEDEGKKIPCFCKSRICRRYLN 4277
            NHEDEG KIPC+C S+ CRRY+N
Sbjct: 2012 NHEDEG-KIPCYCYSKNCRRYMN 2033


>ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine
            max]
          Length = 2035

 Score =  489 bits (1258), Expect = e-135
 Identities = 318/863 (36%), Positives = 439/863 (50%), Gaps = 56/863 (6%)
 Frame = +3

Query: 1857 SSMKVSSLQRMKKCRLLERQSNMSEFSTECNRKTYDRVDTP-----KEVRKRSLCKLSNE 2021
            SS  +S   +M     L++ SN S F    N++ +    +        + K    K+  E
Sbjct: 1217 SSSSLSREMQMHSLSSLKKSSNKSSFVQPSNKQKHTAYSSKFLSCKNRLNKHQSFKVGYE 1276

Query: 2022 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--C 2147
            S S                K++   SSD     ++       +  + +E +N  L P  C
Sbjct: 1277 SESSSDAEFHTLPGVSGTKKLEKDLSSDCFEQFQM-------QELAYEEPENDKLRPFSC 1329

Query: 2148 NNQS-----------------SKMHIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLG 2276
              ++                 S  H+  + +    IV+++    +       ++    L 
Sbjct: 1330 RKENAHRITRPVVVCGKYGEISNGHLAREVQKPAKIVSLSKVLKSSKRCMGHTNGKPRLT 1389

Query: 2277 SGRLKIQHTNNELMSA-ICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNN---PA 2444
            S + K +  + E  S   C  P  K +  ++  T N    NET  D++++        PA
Sbjct: 1390 SKK-KWKRLSIETSSGHCCRNPGLKIKEHNE--TENTIFLNETNVDVSMEDLERGGKPPA 1446

Query: 2445 CNVSKKRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYS 2609
                K+ +     +   N+ +I     ++++R   S++    K    ++   C  +    
Sbjct: 1447 VYKGKRDAKAKQGDSVGNRANISLKVKNKEIRKQRSINELTAKETKVMDMTKCAQDQEPG 1506

Query: 2610 ENPLVKRKRREGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIP 2783
                  R   +G  ++S    +  CCVC  S  + +N L++C  CLI++HQACYG+S +P
Sbjct: 1507 LCGTKSRNSIQGHTSISTINSDAFCCVCRRSTNDKINCLLECSRCLIRVHQACYGVSTLP 1566

Query: 2784 K-SGWKCRACKSNLTNIV---CVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNL 2951
            K S W CR C++N  NIV   CVLCGYGGGA+T A  +  ++KSLL  W  +K+      
Sbjct: 1567 KKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMTRAIMSHTIVKSLLKVWNCEKD------ 1620

Query: 2952 KGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQF 3131
             G P               R        E+E D    SK       E   K K  + +  
Sbjct: 1621 -GMP---------------RDTTSCEVLEKEIDAFPSSKDGLEVDQESVLKPKIVDTSTD 1664

Query: 3132 ETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMG 3311
              +  S     +T  + +N FKV N+IT GV +P+V QW+HMVCGLWTP T+C NV TM 
Sbjct: 1665 LMNQISTNHIPHTPTSFSN-FKVHNSITEGVLDPTVKQWIHMVCGLWTPRTRCPNVDTMS 1723

Query: 3312 VFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXX 3491
             FDV GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH+K LLQS         
Sbjct: 1724 AFDVSGVSRPRADVVCSICNRWGGSCIECRIADCSVKFHPWCAHQKNLLQSETEGINDEK 1783

Query: 3492 XXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQ 3671
              FYGRC+ H  + +  C  +    +     + +  T  CAR EGYKG      R +  Q
Sbjct: 1784 IGFYGRCMLHTIEPR--CLFIYDPLDEIGSQEQKEFT--CARVEGYKG-----RRWDGFQ 1834

Query: 3672 KQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRL 3848
                      V +EQ+NAW+HING+K  S+ + K P ++++ D R+EY RY+Q K WK L
Sbjct: 1835 NNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLPKFPDLDIEHDCRKEYARYKQAKGWKHL 1893

Query: 3849 VVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFR 4028
            VVYKS IHALGLYT+ FIS+GEMVVEY+GEIVGLRVADKRE  Y S  K++ + ACYFFR
Sbjct: 1894 VVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRVADKREKEYQSGRKLQYKSACYFFR 1953

Query: 4029 IDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNF 4208
            IDKE+IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYDY+F
Sbjct: 1954 IDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFLAERDIFPGEEITYDYHF 2013

Query: 4209 NHEDEGKKIPCFCKSRICRRYLN 4277
            NHEDEG KIPC+C S+ CRRY+N
Sbjct: 2014 NHEDEG-KIPCYCYSKNCRRYMN 2035


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  487 bits (1254), Expect = e-134
 Identities = 351/981 (35%), Positives = 474/981 (48%), Gaps = 29/981 (2%)
 Frame = +3

Query: 1422 RNKDMSVQHVQVKGDKLVIQLRQGALEKNAKQCILDDIEQLVERASSRKILEHEDPNFEN 1601
            R+KD  +Q+++ +G K+  + R+ ALE NA  C   D        SSR   E+ + N  +
Sbjct: 1266 RDKDKHLQNLE-QGLKIGKRKRELALELNAS-CSNSD--------SSRVRQENHNSNGTS 1315

Query: 1602 GQISRFKGTKIGEDRSKSSKYGRSVTASKPIKISNNKQ--IPKGGKMVSLRSILK--YPE 1769
               S+   +K     S S K G  VT +   + S+  +  I    K + LRS L   + +
Sbjct: 1316 QFTSQ--PSKSLMMLSTSRKSGTHVTGNCITQSSSKPRLHISSSAKKLLLRSDLHKLHDD 1373

Query: 1770 KQIGCQAKKSENLRSGYD-WERPIITRGETSSMKVSSLQRMKKCRLLERQSNMSEFSTEC 1946
            K+          L  G +  E P ++ G+T     SS           RQ  + E S + 
Sbjct: 1374 KESEVNNVFQTELNGGANNHELPEVSGGKTCKRDCSSNAF--------RQFQIQESSRKD 1425

Query: 1947 NRKT-YDRVDTPKEVRKRSLCKLSNESPSKMKNHFS---SDAMNDSRVFNPTRNFKTCSG 2114
             ++T Y+ VD  K    + + K+ +     +        +D  +  R+  P +       
Sbjct: 1426 TKRTKYNSVDGFKSTCSQQV-KIGHRKARPIVCGIYGELTDGSSTGRMSKPAKLVPLSRV 1484

Query: 2115 QETKNGSLLP--CNNQSSKMHIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLGSGRL 2288
              +    +LP  CN++SS M        +K +     C T       +   ++ +     
Sbjct: 1485 LNSSRKCILPKLCNSKSSSMR-------KKKLGGAAICNTYDLKTEKYKCHDAMV----- 1532

Query: 2289 KIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETE----DDINIKKTTNNPACNVS 2456
            K+  T+       C+    +         +  DV +E +    D I   +    P     
Sbjct: 1533 KVNDTSMRKKKKECSPGEREIHKELFSMEKQGDVQSEKDHQKLDSITHTQLQMKP--KEI 1590

Query: 2457 KKRSLQSTAEGAMNQGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSE--NPLVKR 2630
            +KRS+    E   + G           S  SK+ NF    D   +  G  S       K 
Sbjct: 1591 RKRSIYEFTEKGDDTGF--------KSSSVSKISNFRPANDGKLVNTGEDSGLCQHSAKN 1642

Query: 2631 KRREGSDAVSPGETP-CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRA 2807
              +E     +    P CCVCG SN++ +N L++C  C +++HQACYG+SK+PK  W CR 
Sbjct: 1643 STQEHRCHCNCDSDPICCVCGSSNQDEINILLECSQCSVRVHQACYGVSKVPKGCWSCRP 1702

Query: 2808 CKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSK 2987
            C+ +  +IVCVLCGYGGGA+T A R++ +  S+L  W ++ E   KN     C      K
Sbjct: 1703 CRMSSKDIVCVLCGYGGGAMTQALRSQTIAVSILRAWNIETECGPKN---ELCSIKTLQK 1759

Query: 2988 IADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGN 3167
             +  L       +R  E  S         P+A    +          +  D   N    +
Sbjct: 1760 DSTGLHCSG---YRHSESSSLFVSQQSGQPLAAAHCK------RGMSYRVDGVENSPSVS 1810

Query: 3168 TKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRR 3347
                   + KV N+IT G+ + +  QWVHMVCGLWTP T+C NV TM  FDV  V     
Sbjct: 1811 -------KTKVHNSITMGLVDSATKQWVHMVCGLWTPETRCPNVDTMSAFDVSCVPLSTD 1863

Query: 3348 KQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAE 3527
              VC +C R GG CIQCRV  C + FHPWCAH+KGLLQ+           FYGRC  HA 
Sbjct: 1864 DAVCCMCKRAGGSCIQCRVENCSVRFHPWCAHQKGLLQTEVEGVDNENVGFYGRCGLHAT 1923

Query: 3528 ----------DAKKQCKVVQHEKNLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQ 3677
                      D +  C     EK L            CARTEGYKG K    R     + 
Sbjct: 1924 HPIYKSEYPVDTEAGCL---DEKKLV-----------CARTEGYKGRKRDGFRHNYCDRS 1969

Query: 3678 TFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVV 3854
              +D    V QEQ+NAW +ING+KS ++ + K    E++ D R+EY RY+Q K WK LVV
Sbjct: 1970 KGSDGC-LVPQEQLNAWAYINGQKSCTQELPKLAISEIEHDSRKEYTRYKQAKLWKHLVV 2028

Query: 3855 YKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRID 4034
            YKSGIHALGLYT+ FIS+ EMVVEYVGEIVG RV+DKRE  Y S  K++ + ACYFFRID
Sbjct: 2029 YKSGIHALGLYTSRFISRDEMVVEYVGEIVGQRVSDKRENEYQSAKKLQYKSACYFFRID 2088

Query: 4035 KENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNH 4214
            KE+IIDAT KGGIARFVNHSCSPNC                 ERDI  GEEITYDY+FNH
Sbjct: 2089 KEHIIDATCKGGIARFVNHSCSPNCVAKVISVRNEKKVVFLAERDIFPGEEITYDYHFNH 2148

Query: 4215 EDEGKKIPCFCKSRICRRYLN 4277
            EDEGKKIPCFC S+ CRRYLN
Sbjct: 2149 EDEGKKIPCFCNSKNCRRYLN 2169


>gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris]
            gi|561034628|gb|ESW33158.1| hypothetical protein
            PHAVU_001G047700g [Phaseolus vulgaris]
          Length = 2002

 Score =  484 bits (1246), Expect = e-133
 Identities = 313/866 (36%), Positives = 436/866 (50%), Gaps = 59/866 (6%)
 Frame = +3

Query: 1857 SSMKVSSLQRMKKCRLLERQSNMSEFSTECNRKTYDRVDTP-----KEVRKRSLCKLSNE 2021
            SS  + +  +M     L++  N S F   CN++      +        +RK    K+++E
Sbjct: 1197 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1256

Query: 2022 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--C 2147
            S S                K++N+ +SD      +  P       S +E K   L P  C
Sbjct: 1257 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1309

Query: 2148 NNQSSKM----------------HIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLGS 2279
              ++                   H+  + +    IV++     +      ++     L S
Sbjct: 1310 RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1369

Query: 2280 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2459
             +   + +        C     K +    I T+N  + NE   D++++        + +K
Sbjct: 1370 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1426

Query: 2460 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2624
             ++ Q    G  N+ +++    ++ +R   S++    K     +   C  +    E  L 
Sbjct: 1427 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1481

Query: 2625 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2786
              KRR   +G   +S    +T CCVC  S+ + +N L++C  CLI++HQACYG+S +PK 
Sbjct: 1482 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1541

Query: 2787 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPC 2966
            S W CR C++N  NI CVLCGYGGGA+T A  +  ++KSLL  W  +K+D  K+      
Sbjct: 1542 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1595

Query: 2967 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3143
                      +      E +     ++D     KP    A  +    R + N  Q+    
Sbjct: 1596 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1643

Query: 3144 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3323
                    T   L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM  FDV
Sbjct: 1644 --------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1694

Query: 3324 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3503
             GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS           FY
Sbjct: 1695 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1754

Query: 3504 GRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGN-------CARTEGYKGCKSWEERKE 3662
            G C+ H             E +     DP  + G+       CAR EGYKG      R+ 
Sbjct: 1755 GSCMLHTI-----------EPSYLSIYDPIDKIGSQEEKEFTCARAEGYKG------RRW 1797

Query: 3663 ELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRW 3839
            +  +         V +EQ+NAW+HING+K  S+ + K   ++++ + R+EY RY+Q K W
Sbjct: 1798 DGFQNNHCQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGW 1857

Query: 3840 KRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACY 4019
            K LVVYKS IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE  Y S  K++ + ACY
Sbjct: 1858 KHLVVYKSRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACY 1917

Query: 4020 FFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYD 4199
            FFRIDKE+IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYD
Sbjct: 1918 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYD 1977

Query: 4200 YNFNHEDEGKKIPCFCKSRICRRYLN 4277
            Y+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1978 YHFNHEDEG-KIPCYCNSKNCRRYMN 2002


>gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris]
            gi|561034626|gb|ESW33156.1| hypothetical protein
            PHAVU_001G047700g [Phaseolus vulgaris]
          Length = 2000

 Score =  484 bits (1246), Expect = e-133
 Identities = 313/866 (36%), Positives = 436/866 (50%), Gaps = 59/866 (6%)
 Frame = +3

Query: 1857 SSMKVSSLQRMKKCRLLERQSNMSEFSTECNRKTYDRVDTP-----KEVRKRSLCKLSNE 2021
            SS  + +  +M     L++  N S F   CN++      +        +RK    K+++E
Sbjct: 1195 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1254

Query: 2022 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--C 2147
            S S                K++N+ +SD      +  P       S +E K   L P  C
Sbjct: 1255 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1307

Query: 2148 NNQSSKM----------------HIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLGS 2279
              ++                   H+  + +    IV++     +      ++     L S
Sbjct: 1308 RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1367

Query: 2280 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2459
             +   + +        C     K +    I T+N  + NE   D++++        + +K
Sbjct: 1368 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1424

Query: 2460 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2624
             ++ Q    G  N+ +++    ++ +R   S++    K     +   C  +    E  L 
Sbjct: 1425 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1479

Query: 2625 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2786
              KRR   +G   +S    +T CCVC  S+ + +N L++C  CLI++HQACYG+S +PK 
Sbjct: 1480 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1539

Query: 2787 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPC 2966
            S W CR C++N  NI CVLCGYGGGA+T A  +  ++KSLL  W  +K+D  K+      
Sbjct: 1540 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1593

Query: 2967 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3143
                      +      E +     ++D     KP    A  +    R + N  Q+    
Sbjct: 1594 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1641

Query: 3144 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3323
                    T   L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM  FDV
Sbjct: 1642 --------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1692

Query: 3324 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3503
             GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS           FY
Sbjct: 1693 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1752

Query: 3504 GRCIAHAEDAKKQCKVVQHEKNLALKPDPQTRTGN-------CARTEGYKGCKSWEERKE 3662
            G C+ H             E +     DP  + G+       CAR EGYKG      R+ 
Sbjct: 1753 GSCMLHTI-----------EPSYLSIYDPIDKIGSQEEKEFTCARAEGYKG------RRW 1795

Query: 3663 ELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRW 3839
            +  +         V +EQ+NAW+HING+K  S+ + K   ++++ + R+EY RY+Q K W
Sbjct: 1796 DGFQNNHCQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGW 1855

Query: 3840 KRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACY 4019
            K LVVYKS IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE  Y S  K++ + ACY
Sbjct: 1856 KHLVVYKSRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACY 1915

Query: 4020 FFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYD 4199
            FFRIDKE+IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYD
Sbjct: 1916 FFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYD 1975

Query: 4200 YNFNHEDEGKKIPCFCKSRICRRYLN 4277
            Y+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1976 YHFNHEDEG-KIPCYCNSKNCRRYMN 2000


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  483 bits (1242), Expect = e-133
 Identities = 259/520 (49%), Positives = 320/520 (61%), Gaps = 1/520 (0%)
 Frame = +3

Query: 2667 ETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLC 2846
            E+ CCVCG S+++  N L++C  CLIK+HQACYG+S+ PK  W CR C+++  NIVCVLC
Sbjct: 1545 ESFCCVCGSSDKDDTNNLLECNICLIKVHQACYGVSRAPKGHWYCRPCRTSSRNIVCVLC 1604

Query: 2847 GYGGGALTHAKRTENVIKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF 3026
            GYGGGA+T A R+  ++KSLL  W V+ E  + ++K     +L T    ++         
Sbjct: 1605 GYGGGAMTRALRSRTIVKSLLRVWNVETEWKALSVK-----DLETLTRLNS--------- 1650

Query: 3027 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3206
             G ERE    G S PM   C  +  K   +   + +   N ++   +  V    + KVDN
Sbjct: 1651 SGPEREE---GTSFPM---CQPENTKPLASVVCKMDMPYNVDVLRNSLCV---KKLKVDN 1701

Query: 3207 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3386
            +ITAG  + +  QWVHMVCGLWTPGT+C NV TM  FDV G   PR   VCS+CNRPGG 
Sbjct: 1702 SITAGFLDSTTKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGAPHPRADVVCSMCNRPGGS 1761

Query: 3387 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3566
            CI+CRV  C + FHPWCAH+KGLLQS           FYGRC  HA     +      + 
Sbjct: 1762 CIKCRVLNCSVRFHPWCAHQKGLLQSEVEGIDNENIGFYGRCARHATHPMCESDSDPADT 1821

Query: 3567 NLALKPDPQTRTGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3746
            +  +          CARTEGYKG K    R    Q +        V QEQ+NAW+HING+
Sbjct: 1822 D-RVAGGSAVEELTCARTEGYKGRKRDGVRHNYCQSK--GKVGCYVPQEQLNAWIHINGQ 1878

Query: 3747 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3923
            KS  + V + P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+ EMVV
Sbjct: 1879 KSCIQGVHRLPTSDIEHDCRKEYARYKQGKGWKHLVVYKSGIHALGLYTSRFISRSEMVV 1938

Query: 3924 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4103
            EYVGEIVG RVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC P
Sbjct: 1939 EYVGEIVGQRVADKRENEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLP 1998

Query: 4104 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDE 4223
            NC                 ERDI  GEEITYDY+FNHEDE
Sbjct: 1999 NCVAKVISIRNEKKVVFFAERDIFPGEEITYDYHFNHEDE 2038


Top