BLASTX nr result

ID: Ephedra28_contig00008629 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00008629
         (4282 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21104.3| unnamed protein product [Vitis vinifera]              542   e-151
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   528   e-147
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   528   e-147
ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citr...   528   e-147
gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]    522   e-145
gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma caca...   522   e-145
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   515   e-143
ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...   493   e-136
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   491   e-135
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   486   e-134
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   484   e-133
gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus...   483   e-133
gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus...   483   e-133

>emb|CBI21104.3| unnamed protein product [Vitis vinifera]
          Length = 1111

 Score =  542 bits (1396), Expect = e-151
 Identities = 284/569 (49%), Positives = 344/569 (60%), Gaps = 1/569 (0%)
 Frame = +1

Query: 2506 EDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMH 2685
            ED+      SY  N     K       +S  +  CCVCG SN++ +N L++C  CLI++H
Sbjct: 603  EDSKHSMSESYKVNSKKSIKEHRFESFISDTDAFCCVCGSSNKDEINCLLECSRCLIRVH 662

Query: 2686 QACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKE 2865
            QACYG+S++PK  W CR C+++  NIVCVLCGYGGGA+T A RT N+VKSLL  W ++ E
Sbjct: 663  QACYGVSRVPKGRWYCRPCRTSSKNIVCVLCGYGGGAMTRALRTRNIVKSLLKVWNIETE 722

Query: 2866 DNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKN 3045
               K+       ++P   + D L             +S  +GL                 
Sbjct: 723  SWPKS-------SVPPEALQDKL----------GTLDSSRSGLE---------------- 749

Query: 3046 ANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCV 3225
                                   N  F + NTITAG+ + +V QWVHMVCGLWTPGT+C 
Sbjct: 750  -----------------------NESFPIHNTITAGILDSTVKQWVHMVCGLWTPGTRCP 786

Query: 3226 NVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXX 3405
            NV TM  FDV G   PR   +CS+CNRPGG CI+CRV  C + FHPWCAHRKGLLQS   
Sbjct: 787  NVDTMSAFDVSGASRPRANVICSICNRPGGSCIKCRVLNCLVPFHPWCAHRKGLLQSEVE 846

Query: 3406 XXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEE 3585
                    FYGRC+ HA  A   C++     N+      E     CARTEGYKG K  E 
Sbjct: 847  GVDNENVGFYGRCMLHA--AHPSCELDSDPINIETDSTGEKEL-TCARTEGYKGRKQ-EG 902

Query: 3586 RKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQE 3762
             +  L  Q+  +    V QEQ+NAWLHING+KS ++ + K P  +V+ D R+E+ RY+Q 
Sbjct: 903  FRHNLNFQSNGNGGCLVPQEQLNAWLHINGQKSCTKGLPKTPISDVEYDCRKEFARYKQA 962

Query: 3763 KRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEG 3942
            K WK LVVYKSGIHALGLYT+ FIS+G MVVEYVGEIVGLRVADKRE+ Y S  K++ + 
Sbjct: 963  KGWKHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKT 1022

Query: 3943 ACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEI 4122
            ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC                 ERDIN GEEI
Sbjct: 1023 ACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEI 1082

Query: 4123 TYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            TYDY+FNHEDEGKKIPCFC SR CRRYLN
Sbjct: 1083 TYDYHFNHEDEGKKIPCFCNSRNCRRYLN 1111


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  528 bits (1360), Expect = e-147
 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 1607 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1666

Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958
            GGA+T A R+  +VK LL  W ++ +   KN             ++ A ++         
Sbjct: 1667 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1714

Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 1715 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1765

Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 1766 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1825

Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 1826 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1885

Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678
             +    + E     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 1886 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1941

Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 1942 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2001

Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 2002 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2061

Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2062 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2119


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  528 bits (1360), Expect = e-147
 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 1608 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1667

Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958
            GGA+T A R+  +VK LL  W ++ +   KN             ++ A ++         
Sbjct: 1668 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1715

Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 1716 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1766

Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 1767 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1826

Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 1827 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1886

Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678
             +    + E     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 1887 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1942

Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 1943 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2002

Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 2003 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2062

Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2063 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2120


>ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citrus clementina]
            gi|557553575|gb|ESR63589.1| hypothetical protein
            CICLE_v10010421mg [Citrus clementina]
          Length = 765

 Score =  528 bits (1360), Expect = e-147
 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787
            CCVCG SN++ +N L++C  C IK+HQACYG+SK+PK  W CR C++N  +IVCVLCGYG
Sbjct: 253  CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 312

Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958
            GGA+T A R+  +VK LL  W ++ +   KN             ++ A ++         
Sbjct: 313  GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 360

Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138
             G   ES +  +S+P+    +     + +   NQ +    S+   GN      N  KV N
Sbjct: 361  SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 411

Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318
            +ITAG  + +V QWVHMVCGLWTPGT+C NV TM  FDV G   P+   VCS+CNRPGG 
Sbjct: 412  SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 471

Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498
            CIQCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA     +      + 
Sbjct: 472  CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 531

Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678
             +    + E     CARTEGYKG K  +     L  Q+   +   V QEQ+NAW+HING+
Sbjct: 532  EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 587

Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855
            KSS   + K    +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV
Sbjct: 588  KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 647

Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035
            EYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P
Sbjct: 648  EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 707

Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            NC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 708  NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 765


>gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  522 bits (1344), Expect = e-145
 Identities = 289/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%)
 Frame = +1

Query: 2260 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2430
            C + I +F+  S +  +  D  +E    I   I    +N  C   +KRSL + T +G  +
Sbjct: 1480 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1539

Query: 2431 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2604
              D  S  +        K+K   +L++ G +    +  + +   K   +    ++   + 
Sbjct: 1540 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1597

Query: 2605 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2784
             CCVCG SN++  N L++C  C I++HQACYGI K+P+  W CR C+++  + VCVLCGY
Sbjct: 1598 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1657

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A R+   VK LL  W ++ E   K+       N     + D   +     F  
Sbjct: 1658 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1711

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
                              ++ +D   +  A+ ++ D  + +         +++  + N++
Sbjct: 1712 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1752

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            TAGV + +V QWVHMVCGLWTPGT+C NV TM  FDV GV   R   VCS+CNRPGG CI
Sbjct: 1753 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1812

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            QCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA      C+      + 
Sbjct: 1813 QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1870

Query: 3505 ALKPDPETRAGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3675
             L P  E R   CARTEG+KG K    W     + +++T       V QEQ+NAW+HING
Sbjct: 1871 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1925

Query: 3676 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3852
            +KS  + + K P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV
Sbjct: 1926 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1985

Query: 3853 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4032
            VEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC 
Sbjct: 1986 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 2045

Query: 4033 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            PNC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 2046 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2104


>gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score =  522 bits (1344), Expect = e-145
 Identities = 289/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%)
 Frame = +1

Query: 2260 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2430
            C + I +F+  S +  +  D  +E    I   I    +N  C   +KRSL + T +G  +
Sbjct: 1114 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1173

Query: 2431 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2604
              D  S  +        K+K   +L++ G +    +  + +   K   +    ++   + 
Sbjct: 1174 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1231

Query: 2605 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2784
             CCVCG SN++  N L++C  C I++HQACYGI K+P+  W CR C+++  + VCVLCGY
Sbjct: 1232 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1291

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A R+   VK LL  W ++ E   K+       N     + D   +     F  
Sbjct: 1292 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1345

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
                              ++ +D   +  A+ ++ D  + +         +++  + N++
Sbjct: 1346 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1386

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            TAGV + +V QWVHMVCGLWTPGT+C NV TM  FDV GV   R   VCS+CNRPGG CI
Sbjct: 1387 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1446

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            QCRV  C + FHPWCAH+KGLLQS           FYGRC+ HA      C+      + 
Sbjct: 1447 QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1504

Query: 3505 ALKPDPETRAGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3675
             L P  E R   CARTEG+KG K    W     + +++T       V QEQ+NAW+HING
Sbjct: 1505 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1559

Query: 3676 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3852
            +KS  + + K P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV
Sbjct: 1560 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1619

Query: 3853 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4032
            VEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC 
Sbjct: 1620 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 1679

Query: 4033 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
            PNC                 ERDI  GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN
Sbjct: 1680 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 1738


>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  515 bits (1327), Expect = e-143
 Identities = 278/561 (49%), Positives = 348/561 (62%), Gaps = 1/561 (0%)
 Frame = +1

Query: 2476 SSKLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLV 2655
            S KL   N  E  G I      +      K R+    +   +  CCVCG S+++  N ++
Sbjct: 1568 SEKLCLENVKETQGPIDVSHEVKGKKSSTKCRKRKAFILDSDVFCCVCGGSDKDDFNCIL 1627

Query: 2656 QCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKS 2835
            +C  CLIK+HQACYG+ K PK  W CR C++++ +IVCVLCGY GGA+T A R+ N+VK+
Sbjct: 1628 ECSQCLIKVHQACYGVLKAPKGRWCCRPCRADIKDIVCVLCGYSGGAMTRALRSRNIVKN 1687

Query: 2836 LLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVA 3015
            LL  WK+KK   S +      P   +    D L   S +   G  R   +  +S   P  
Sbjct: 1688 LLQTWKIKKGRKSLD------PFHLSDSKHDDLNGLSGKLGGGPSRLEKMDSISAMKPGT 1741

Query: 3016 CVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVC 3195
                      AN      DA S ++  +  V   + F+V NTITA V +P+VTQW+HMVC
Sbjct: 1742 LERVSRVMMKANT----LDATSIMRNADILV---DDFQVHNTITAAVLDPNVTQWLHMVC 1794

Query: 3196 GLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAH 3375
            GLW PGT+C NV TM  FDV GV  P+R  VCS+C RPGG CI+CRVA C + FHPWCAH
Sbjct: 1795 GLWMPGTRCPNVDTMSAFDVSGVSPPKRNTVCSICKRPGGSCIRCRVADCSVFFHPWCAH 1854

Query: 3376 RKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTE 3555
            +KGLLQS           FYGRC+ HA +     K V H  N  ++   + +   CARTE
Sbjct: 1855 QKGLLQSEIEGVDNENVGFYGRCLFHAVNINCLTKPV-HLVNDKVEDHSDNKDPTCARTE 1913

Query: 3556 GYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDY 3732
            GYKG K  E     L+ Q+ +++   V QEQINAWLHING+KS +R ++K P  + + D 
Sbjct: 1914 GYKGRKK-EGLHYGLRGQSKDNSGCLVPQEQINAWLHINGQKSCTRGLIKPPASDTEYDC 1972

Query: 3733 RREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYY 3912
            R+EY RY+Q K WK+LVVYKSGIHALGLYT++FI +G MVVEYVGEIVGLRVADKREA Y
Sbjct: 1973 RKEYARYKQSKGWKQLVVYKSGIHALGLYTSQFIFRGAMVVEYVGEIVGLRVADKREAEY 2032

Query: 3913 HSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXX 4092
            HS  +++ E ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC                 
Sbjct: 2033 HSGRRIQYESACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITIRNEKKVVFFA 2092

Query: 4093 ERDINAGEEITYDYNFNHEDE 4155
            ERDIN GEEITYDY+FN+EDE
Sbjct: 2093 ERDINPGEEITYDYHFNNEDE 2113



 Score = 70.9 bits (172), Expect = 5e-09
 Identities = 46/110 (41%), Positives = 66/110 (60%), Gaps = 5/110 (4%)
 Frame = +1

Query: 691  EHQMSNVCSEGSDSPVSEFSD-AVGHTDMASGKLTETDVVDEGSGIGK-CSSDGIDNGVW 864
            E QMSNVCSE S + V+EFS     + D+ S + T  ++VDEGSGI K CSSD  + G+W
Sbjct: 1091 EQQMSNVCSESSAAVVTEFSGRCFVNLDLGSTRSTCDEIVDEGSGIEKCCSSDAHNAGMW 1150

Query: 865  TRSKQAYTGSGNHLLGTSVQLTNLSSDVYNGSKVKTSISFKR---PVNSP 1005
              +    +G+ + +LG S  L + S+D  N  KV++S+  K+   P  SP
Sbjct: 1151 AETAN-LSGNTDAVLGRSSTLPSHSTDPINNLKVRSSLRLKKVRLPFGSP 1199


>ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812602 isoform X6 [Glycine
            max]
          Length = 1870

 Score =  500 bits (1287), Expect = e-138
 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1371 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1430

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A  +  +VKSLL  W  +K+   KN                     S E F  
Sbjct: 1431 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1469

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1470 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1525

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1526 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1585

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +     L
Sbjct: 1586 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1641

Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681
                  E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1642 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1695

Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1696 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1755

Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1756 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1815

Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1816 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1870


>ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812602 isoform X5 [Glycine
            max]
          Length = 1872

 Score =  500 bits (1287), Expect = e-138
 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1373 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1432

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A  +  +VKSLL  W  +K+   KN                     S E F  
Sbjct: 1433 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1471

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1472 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1527

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1528 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1587

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +     L
Sbjct: 1588 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1643

Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681
                  E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1644 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1697

Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1698 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1757

Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1758 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1817

Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1818 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1872


>ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812602 isoform X3 [Glycine
            max]
          Length = 2006

 Score =  500 bits (1287), Expect = e-138
 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1507 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1566

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A  +  +VKSLL  W  +K+   KN                     S E F  
Sbjct: 1567 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1605

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1606 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1661

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1662 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1721

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +     L
Sbjct: 1722 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1777

Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681
                  E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1778 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1831

Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1832 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1891

Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1892 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1951

Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1952 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2006


>ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812602 isoform X2 [Glycine
            max]
          Length = 2007

 Score =  500 bits (1287), Expect = e-138
 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1508 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1567

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A  +  +VKSLL  W  +K+   KN                     S E F  
Sbjct: 1568 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1606

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1607 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1662

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1663 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1722

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +     L
Sbjct: 1723 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1778

Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681
                  E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1779 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1832

Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1833 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1892

Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1893 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1952

Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1953 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2007


>ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812602 isoform X1 [Glycine
            max]
          Length = 2008

 Score =  500 bits (1287), Expect = e-138
 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%)
 Frame = +1

Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784
            CCVC  S+ + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIVCVLCGY
Sbjct: 1509 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1568

Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964
            GGGA+T A  +  +VKSLL  W  +K+   KN                     S E F  
Sbjct: 1569 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1607

Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144
             E+E D    SK       E   K K  + +       ++IQ   T V+    FKV N+I
Sbjct: 1608 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1663

Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324
            T  V +P+V QW+HMVCGLWTPGT+C NV TM  FDV GV  PR   VC +CNR GG CI
Sbjct: 1664 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1723

Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504
            +CR+A C I FHPWCAH+K LLQS           FYGRC  H  + +  C  +     L
Sbjct: 1724 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1779

Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681
                  E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K 
Sbjct: 1780 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1833

Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861
             SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY
Sbjct: 1834 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1893

Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041
            +GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC
Sbjct: 1894 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1953

Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                             ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1954 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2008


>ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine
            max]
          Length = 2032

 Score =  493 bits (1270), Expect = e-136
 Identities = 283/647 (43%), Positives = 366/647 (56%), Gaps = 12/647 (1%)
 Frame = +1

Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463
            T N    NET  D++++        PA    K+ +     +   N+ +I     ++++R 
Sbjct: 1420 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1479

Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634
              S++    K    ++   C  +          R   +G  ++S    +  CCVC  S  
Sbjct: 1480 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1539

Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGYGGGALTHAK 2811
            + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NI CVLCGYGGGA+T A 
Sbjct: 1540 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIACVLCGYGGGAMTRAI 1599

Query: 2812 RTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAG 2991
             +  +VKSLL  W  +K+       G P               R        E+E D   
Sbjct: 1600 MSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEIDAFP 1637

Query: 2992 LSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSV 3171
             SK       E   K K  + +    +  S     +T  + +N FKV N+IT GV +P+V
Sbjct: 1638 SSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLDPTV 1696

Query: 3172 TQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQI 3351
             QW+HMVCGLWTP T+C NV TM  FDV GV  PR   VCS+CNR GG CI+CR+A C +
Sbjct: 1697 KQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIADCSV 1756

Query: 3352 SFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETR 3531
             FHPWCAH+K LLQS           FYGRC+ H  + +  C  +     L      E +
Sbjct: 1757 KFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQEQK 1812

Query: 3532 AGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNP 3708
               CAR EGYKG      R +  Q          V +EQ+NAW+HING+K  S+ + K P
Sbjct: 1813 EFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLPKFP 1866

Query: 3709 GMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRV 3888
             ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVGLRV
Sbjct: 1867 DLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRV 1926

Query: 3889 ADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXX 4068
            ADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC         
Sbjct: 1927 ADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRH 1986

Query: 4069 XXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                    ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1987 EKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2032


>ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis]
            gi|223540953|gb|EEF42511.1| mixed-lineage leukemia
            protein, mll, putative [Ricinus communis]
          Length = 1125

 Score =  491 bits (1264), Expect = e-135
 Identities = 285/616 (46%), Positives = 359/616 (58%), Gaps = 14/616 (2%)
 Frame = +1

Query: 2386 SKKRSL-QSTAEGAMNQGDIVSEQVR----GSCSMSSKLKNFNALEDAGCIFEGSYSENP 2550
            ++KRSL + T +G  +   +VS +          +   L+N     D      GS   +P
Sbjct: 545  TRKRSLYELTLKGKSSSPKMVSRKKNFKYVPKMKLGKTLRNSEKSHD-----NGSQKVDP 599

Query: 2551 LVKRKRREGSD-AVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGW 2727
              KR  RE    +++  ++ C VC  SN++ +N L++C+ C I++HQACYG+S++PK  W
Sbjct: 600  --KRCAREQKHLSITDMDSFCSVCRSSNKDEVNCLLECRRCSIRVHQACYGVSRVPKGHW 657

Query: 2728 KCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNL 2907
             CR C+++  +IVCVLCGYGGGA+T A R+  +VK LL  W ++ E  +KN         
Sbjct: 658  YCRPCRTSAKDIVCVLCGYGGGAMTLALRSRTIVKGLLKAWNLEIESVAKN--------- 708

Query: 2908 PTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNI 3087
                      I SPE       E  +   S P P        +  N   +   T  N ++
Sbjct: 709  ---------AISSPEILH---HEMSMLHSSGPGPENRSYPVLRPVNIEPST-STVCNKDV 755

Query: 3088 QEG-----NTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFD 3252
            Q       N+   L+N  KV+N+ITAGV + +V QWVHMVCGLWTPGT+C NV TM  FD
Sbjct: 756  QNHLDILPNSLGHLSN-LKVNNSITAGVLDSTVKQWVHMVCGLWTPGTRCPNVNTMSAFD 814

Query: 3253 VFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXF 3432
            V G   PR   VCS+C+RPGG CIQCRVA C I FHPWCAH+KGLLQS           F
Sbjct: 815  VSGASCPRANVVCSICDRPGGSCIQCRVANCSIQFHPWCAHQKGLLQSEAEGVDNENVGF 874

Query: 3433 YGRCIAHAE--DAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQK 3606
            YGRC+ HA     +  C     E        P  +  +CARTEGYKG K  +        
Sbjct: 875  YGRCVLHATYPTIESACDSAIFEAGY-----PAEKEVSCARTEGYKGRKR-DGFWHNTNS 928

Query: 3607 QTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLV 3783
            Q+   +   V QEQ +AW+HING+KS ++ ++K P  E + D R+EY RY+Q K WK LV
Sbjct: 929  QSKGKSGCLVPQEQFDAWVHINGQKSCAQGILKLPMSEKEYDCRKEYTRYKQGKAWKHLV 988

Query: 3784 VYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRI 3963
            VYKSGIHALGLYTA FIS+GEMVVEYVGEIVGLRVADKRE  Y S  K++ + ACYFFRI
Sbjct: 989  VYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQSGRKLQYKSACYFFRI 1048

Query: 3964 DKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFN 4143
            DKENIIDAT KGGIARFVNHSC PNC                 ERDI  GEEITYDY+FN
Sbjct: 1049 DKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAERDIYPGEEITYDYHFN 1108

Query: 4144 HEDEGKKIPCFCKSRI 4191
            HEDE +K   F   RI
Sbjct: 1109 HEDEVQKFWKFSAVRI 1124


>ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine
            max]
          Length = 2033

 Score =  489 bits (1260), Expect = e-135
 Identities = 284/650 (43%), Positives = 367/650 (56%), Gaps = 15/650 (2%)
 Frame = +1

Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463
            T N    NET  D++++        PA    K+ +     +   N+ +I     ++++R 
Sbjct: 1418 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1477

Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634
              S++    K    ++   C  +          R   +G  ++S    +  CCVC  S  
Sbjct: 1478 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1537

Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIV---CVLCGYGGGALT 2802
            + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIV   CVLCGYGGGA+T
Sbjct: 1538 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMT 1597

Query: 2803 HAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESD 2982
             A  +  +VKSLL  W  +K+       G P               R        E+E D
Sbjct: 1598 RAIMSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEID 1635

Query: 2983 LAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGN 3162
                SK       E   K K  + +    +  S     +T  + +N FKV N+IT GV +
Sbjct: 1636 AFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLD 1694

Query: 3163 PSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAK 3342
            P+V QW+HMVCGLWTP T+C NV TM  FDV GV  PR   VCS+CNR GG CI+CR+A 
Sbjct: 1695 PTVKQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIAD 1754

Query: 3343 CQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDP 3522
            C + FHPWCAH+K LLQS           FYGRC+ H  + +  C  +     L      
Sbjct: 1755 CSVKFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQ 1810

Query: 3523 ETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVV 3699
            E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K  S+ + 
Sbjct: 1811 EQKEFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLP 1864

Query: 3700 KNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVG 3879
            K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVG
Sbjct: 1865 KFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVG 1924

Query: 3880 LRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXX 4059
            LRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC      
Sbjct: 1925 LRVADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVIT 1984

Query: 4060 XXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                       ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1985 VRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2033


>ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine
            max]
          Length = 2035

 Score =  489 bits (1260), Expect = e-135
 Identities = 284/650 (43%), Positives = 367/650 (56%), Gaps = 15/650 (2%)
 Frame = +1

Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463
            T N    NET  D++++        PA    K+ +     +   N+ +I     ++++R 
Sbjct: 1420 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1479

Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634
              S++    K    ++   C  +          R   +G  ++S    +  CCVC  S  
Sbjct: 1480 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1539

Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIV---CVLCGYGGGALT 2802
            + +N L++C  CLI++HQACYG+S +PK S W CR C++N  NIV   CVLCGYGGGA+T
Sbjct: 1540 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMT 1599

Query: 2803 HAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESD 2982
             A  +  +VKSLL  W  +K+       G P               R        E+E D
Sbjct: 1600 RAIMSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEID 1637

Query: 2983 LAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGN 3162
                SK       E   K K  + +    +  S     +T  + +N FKV N+IT GV +
Sbjct: 1638 AFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLD 1696

Query: 3163 PSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAK 3342
            P+V QW+HMVCGLWTP T+C NV TM  FDV GV  PR   VCS+CNR GG CI+CR+A 
Sbjct: 1697 PTVKQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIAD 1756

Query: 3343 CQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDP 3522
            C + FHPWCAH+K LLQS           FYGRC+ H  + +  C  +     L      
Sbjct: 1757 CSVKFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQ 1812

Query: 3523 ETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVV 3699
            E +   CAR EGYKG      R +  Q          V +EQ+NAW+HING+K  S+ + 
Sbjct: 1813 EQKEFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLP 1866

Query: 3700 KNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVG 3879
            K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVG
Sbjct: 1867 KFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVG 1926

Query: 3880 LRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXX 4059
            LRVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC      
Sbjct: 1927 LRVADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVIT 1986

Query: 4060 XXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209
                       ERDI  GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N
Sbjct: 1987 VRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2035


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  486 bits (1251), Expect = e-134
 Identities = 351/981 (35%), Positives = 473/981 (48%), Gaps = 29/981 (2%)
 Frame = +1

Query: 1354 RNKDMSVQHVQVKGDKLVIQLRQGALEKNAKQCILDDIEQLVERASSRKILEHEDPNFEN 1533
            R+KD  +Q+++ +G K+  + R+ ALE NA  C   D        SSR   E+ + N  +
Sbjct: 1266 RDKDKHLQNLE-QGLKIGKRKRELALELNAS-CSNSD--------SSRVRQENHNSNGTS 1315

Query: 1534 GQISRFKGTKIGEDRSKSSKYGRLVTAPKPIKISNNKQ--IPKGGKMVSLRSILK--YPE 1701
               S+   +K     S S K G  VT     + S+  +  I    K + LRS L   + +
Sbjct: 1316 QFTSQ--PSKSLMMLSTSRKSGTHVTGNCITQSSSKPRLHISSSAKKLLLRSDLHKLHDD 1373

Query: 1702 KQIRCQARKSENLRSGYD-WERPIITRGETSSMKVSSLQRMKKCRLLERQSNMSEFSSEC 1878
            K+          L  G +  E P ++ G+T     SS           RQ  + E S + 
Sbjct: 1374 KESEVNNVFQTELNGGANNHELPEVSGGKTCKRDCSSNAF--------RQFQIQESSRKD 1425

Query: 1879 NRKT-YDRVDTPKEVRKRSLCKLSNESPSKMKNHFS---SDAMNDSRVFNPTRNFKTCSG 2046
             ++T Y+ VD  K    + + K+ +     +        +D  +  R+  P +       
Sbjct: 1426 TKRTKYNSVDGFKSTCSQQV-KIGHRKARPIVCGIYGELTDGSSTGRMSKPAKLVPLSRV 1484

Query: 2047 QETKNGSLLP--CNNQSSKMNIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLGSGRL 2220
              +    +LP  CN++SS M        +K +     C T       +   ++ +     
Sbjct: 1485 LNSSRKCILPKLCNSKSSSMR-------KKKLGGAAICNTYDLKTEKYKCHDAMV----- 1532

Query: 2221 KIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETE----DDINIKKTTNNPACNVS 2388
            K+  T+       C+    +         +  DV +E +    D I   +    P     
Sbjct: 1533 KVNDTSMRKKKKECSPGEREIHKELFSMEKQGDVQSEKDHQKLDSITHTQLQMKP--KEI 1590

Query: 2389 KKRSLQSTAEGAMNQGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSE--NPLVKR 2562
            +KRS+    E   + G           S  SK+ NF    D   +  G  S       K 
Sbjct: 1591 RKRSIYEFTEKGDDTGF--------KSSSVSKISNFRPANDGKLVNTGEDSGLCQHSAKN 1642

Query: 2563 KRREGSDAVSPGETP-CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRA 2739
              +E     +    P CCVCG SN++ +N L++C  C +++HQACYG+SK+PK  W CR 
Sbjct: 1643 STQEHRCHCNCDSDPICCVCGSSNQDEINILLECSQCSVRVHQACYGVSKVPKGCWSCRP 1702

Query: 2740 CKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSK 2919
            C+ +  +IVCVLCGYGGGA+T A R++ +  S+L  W ++ E   KN     C      K
Sbjct: 1703 CRMSSKDIVCVLCGYGGGAMTQALRSQTIAVSILRAWNIETECGPKN---ELCSIKTLQK 1759

Query: 2920 IADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGN 3099
             +  L       +R  E  S         P+A    +          +  D   N    +
Sbjct: 1760 DSTGLHCSG---YRHSESSSLFVSQQSGQPLAAAHCK------RGMSYRVDGVENSPSVS 1810

Query: 3100 TKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRR 3279
                   + KV N+IT G+ + +  QWVHMVCGLWTP T+C NV TM  FDV  V     
Sbjct: 1811 -------KTKVHNSITMGLVDSATKQWVHMVCGLWTPETRCPNVDTMSAFDVSCVPLSTD 1863

Query: 3280 KQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAE 3459
              VC +C R GG CIQCRV  C + FHPWCAH+KGLLQ+           FYGRC  HA 
Sbjct: 1864 DAVCCMCKRAGGSCIQCRVENCSVRFHPWCAHQKGLLQTEVEGVDNENVGFYGRCGLHAT 1923

Query: 3460 ----------DAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQ 3609
                      D +  C     EK L            CARTEGYKG K    R     + 
Sbjct: 1924 HPIYKSEYPVDTEAGCL---DEKKLV-----------CARTEGYKGRKRDGFRHNYCDRS 1969

Query: 3610 TFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVV 3786
              +D    V QEQ+NAW +ING+KS ++ + K    E++ D R+EY RY+Q K WK LVV
Sbjct: 1970 KGSDGC-LVPQEQLNAWAYINGQKSCTQELPKLAISEIEHDSRKEYTRYKQAKLWKHLVV 2028

Query: 3787 YKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRID 3966
            YKSGIHALGLYT+ FIS+ EMVVEYVGEIVG RV+DKRE  Y S  K++ + ACYFFRID
Sbjct: 2029 YKSGIHALGLYTSRFISRDEMVVEYVGEIVGQRVSDKRENEYQSAKKLQYKSACYFFRID 2088

Query: 3967 KENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNH 4146
            KE+IIDAT KGGIARFVNHSCSPNC                 ERDI  GEEITYDY+FNH
Sbjct: 2089 KEHIIDATCKGGIARFVNHSCSPNCVAKVISVRNEKKVVFLAERDIFPGEEITYDYHFNH 2148

Query: 4147 EDEGKKIPCFCKSRICRRYLN 4209
            EDEGKKIPCFC S+ CRRYLN
Sbjct: 2149 EDEGKKIPCFCNSKNCRRYLN 2169


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  484 bits (1246), Expect = e-133
 Identities = 264/527 (50%), Positives = 325/527 (61%), Gaps = 8/527 (1%)
 Frame = +1

Query: 2599 ETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLC 2778
            E+ CCVCG S+++  N L++C  CLIK+HQACYG+S+ PK  W CR C+++  NIVCVLC
Sbjct: 1545 ESFCCVCGSSDKDDTNNLLECNICLIKVHQACYGVSRAPKGHWYCRPCRTSSRNIVCVLC 1604

Query: 2779 GYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF 2958
            GYGGGA+T A R+  +VKSLL  W V+ E  + ++K     +L T    ++         
Sbjct: 1605 GYGGGAMTRALRSRTIVKSLLRVWNVETEWKALSVK-----DLETLTRLNS--------- 1650

Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138
             G ERE    G S PM   C  +  K   +   + +   N ++   +  V    + KVDN
Sbjct: 1651 SGPEREE---GTSFPM---CQPENTKPLASVVCKMDMPYNVDVLRNSLCV---KKLKVDN 1701

Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318
            +ITAG  + +  QWVHMVCGLWTPGT+C NV TM  FDV G   PR   VCS+CNRPGG 
Sbjct: 1702 SITAGFLDSTTKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGAPHPRADVVCSMCNRPGGS 1761

Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498
            CI+CRV  C + FHPWCAH+KGLLQS           FYGRC  HA        + + + 
Sbjct: 1762 CIKCRVLNCSVRFHPWCAHQKGLLQSEVEGIDNENIGFYGRCARHATHP-----MCESDS 1816

Query: 3499 NLALKPDPETRAGN-------CARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINA 3657
            + A   D +  AG        CARTEGYKG K    R    Q +        V QEQ+NA
Sbjct: 1817 DPA---DTDRVAGGSAVEELTCARTEGYKGRKRDGVRHNYCQSK--GKVGCYVPQEQLNA 1871

Query: 3658 WLHINGRKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFI 3834
            W+HING+KS  + V + P  +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FI
Sbjct: 1872 WIHINGQKSCIQGVHRLPTSDIEHDCRKEYARYKQGKGWKHLVVYKSGIHALGLYTSRFI 1931

Query: 3835 SKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARF 4014
            S+ EMVVEYVGEIVG RVADKRE  Y S  K++ + ACYFFRIDKE+IIDATRKGGIARF
Sbjct: 1932 SRSEMVVEYVGEIVGQRVADKRENEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARF 1991

Query: 4015 VNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDE 4155
            VNHSC PNC                 ERDI  GEEITYDY+FNHEDE
Sbjct: 1992 VNHSCLPNCVAKVISIRNEKKVVFFAERDIFPGEEITYDYHFNHEDE 2038


>gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris]
            gi|561034628|gb|ESW33158.1| hypothetical protein
            PHAVU_001G047700g [Phaseolus vulgaris]
          Length = 2002

 Score =  483 bits (1244), Expect = e-133
 Identities = 315/859 (36%), Positives = 432/859 (50%), Gaps = 52/859 (6%)
 Frame = +1

Query: 1789 SSMKVSSLQRMKKCRLLERQSNMSEFSSECNRKTYDRVDTP-----KEVRKRSLCKLSNE 1953
            SS  + +  +M     L++  N S F   CN++      +        +RK    K+++E
Sbjct: 1197 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1256

Query: 1954 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--- 2076
            S S                K++N+ +SD      +  P       S +E K   L P   
Sbjct: 1257 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1309

Query: 2077 -------------CNNQSSKMNIQSKTESQKD--IVAITSCTTNKANHSIFSSANSSLGS 2211
                         C       N     E QK   IV++     +      ++     L S
Sbjct: 1310 RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1369

Query: 2212 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2391
             +   + +        C     K +    I T+N  + NE   D++++        + +K
Sbjct: 1370 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1426

Query: 2392 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2556
             ++ Q    G  N+ +++    ++ +R   S++    K     +   C  +    E  L 
Sbjct: 1427 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1481

Query: 2557 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2718
              KRR   +G   +S    +T CCVC  S+ + +N L++C  CLI++HQACYG+S +PK 
Sbjct: 1482 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1541

Query: 2719 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPC 2898
            S W CR C++N  NI CVLCGYGGGA+T A  +  +VKSLL  W  +K+D  K+      
Sbjct: 1542 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1595

Query: 2899 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3075
                      +      E +     ++D     KP    A  +    R + N  Q+    
Sbjct: 1596 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1643

Query: 3076 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3255
                    T   L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM  FDV
Sbjct: 1644 --------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1694

Query: 3256 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3435
             GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS           FY
Sbjct: 1695 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1754

Query: 3436 GRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTF 3615
            G C+ H  +          +K        E +   CAR EGYKG      R+ +  +   
Sbjct: 1755 GSCMLHTIEPSYLSIYDPIDKI----GSQEEKEFTCARAEGYKG------RRWDGFQNNH 1804

Query: 3616 NDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYK 3792
                  V +EQ+NAW+HING+K  S+ + K   ++++ + R+EY RY+Q K WK LVVYK
Sbjct: 1805 CQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGWKHLVVYK 1864

Query: 3793 SGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKE 3972
            S IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE
Sbjct: 1865 SRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACYFFRIDKE 1924

Query: 3973 NIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHED 4152
            +IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYDY+FNHED
Sbjct: 1925 HIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYDYHFNHED 1984

Query: 4153 EGKKIPCFCKSRICRRYLN 4209
            EG KIPC+C S+ CRRY+N
Sbjct: 1985 EG-KIPCYCNSKNCRRYMN 2002


>gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris]
            gi|561034626|gb|ESW33156.1| hypothetical protein
            PHAVU_001G047700g [Phaseolus vulgaris]
          Length = 2000

 Score =  483 bits (1244), Expect = e-133
 Identities = 315/859 (36%), Positives = 432/859 (50%), Gaps = 52/859 (6%)
 Frame = +1

Query: 1789 SSMKVSSLQRMKKCRLLERQSNMSEFSSECNRKTYDRVDTP-----KEVRKRSLCKLSNE 1953
            SS  + +  +M     L++  N S F   CN++      +        +RK    K+++E
Sbjct: 1195 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1254

Query: 1954 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--- 2076
            S S                K++N+ +SD      +  P       S +E K   L P   
Sbjct: 1255 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1307

Query: 2077 -------------CNNQSSKMNIQSKTESQKD--IVAITSCTTNKANHSIFSSANSSLGS 2211
                         C       N     E QK   IV++     +      ++     L S
Sbjct: 1308 RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1367

Query: 2212 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2391
             +   + +        C     K +    I T+N  + NE   D++++        + +K
Sbjct: 1368 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1424

Query: 2392 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2556
             ++ Q    G  N+ +++    ++ +R   S++    K     +   C  +    E  L 
Sbjct: 1425 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1479

Query: 2557 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2718
              KRR   +G   +S    +T CCVC  S+ + +N L++C  CLI++HQACYG+S +PK 
Sbjct: 1480 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1539

Query: 2719 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPC 2898
            S W CR C++N  NI CVLCGYGGGA+T A  +  +VKSLL  W  +K+D  K+      
Sbjct: 1540 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1593

Query: 2899 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3075
                      +      E +     ++D     KP    A  +    R + N  Q+    
Sbjct: 1594 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1641

Query: 3076 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3255
                    T   L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM  FDV
Sbjct: 1642 --------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1692

Query: 3256 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3435
             GV  PR   VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS           FY
Sbjct: 1693 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1752

Query: 3436 GRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTF 3615
            G C+ H  +          +K        E +   CAR EGYKG      R+ +  +   
Sbjct: 1753 GSCMLHTIEPSYLSIYDPIDKI----GSQEEKEFTCARAEGYKG------RRWDGFQNNH 1802

Query: 3616 NDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYK 3792
                  V +EQ+NAW+HING+K  S+ + K   ++++ + R+EY RY+Q K WK LVVYK
Sbjct: 1803 CQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGWKHLVVYK 1862

Query: 3793 SGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKE 3972
            S IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE  Y S  K++ + ACYFFRIDKE
Sbjct: 1863 SRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACYFFRIDKE 1922

Query: 3973 NIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHED 4152
            +IIDATRKGGIARFVNHSC PNC                 ERDI  GEEITYDY+FNHED
Sbjct: 1923 HIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYDYHFNHED 1982

Query: 4153 EGKKIPCFCKSRICRRYLN 4209
            EG KIPC+C S+ CRRY+N
Sbjct: 1983 EG-KIPCYCNSKNCRRYMN 2000

