BLASTX nr result
ID: Ephedra28_contig00008629
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00008629 (4282 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CBI21104.3| unnamed protein product [Vitis vinifera]              542   e-151
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   528   e-147
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   528   e-147
ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citr...   528   e-147
gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao]    522   e-145
gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma caca...   522   e-145
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   515   e-143
ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812...   500   e-138
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...   493   e-136
ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putativ...   491   e-135
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...   489   e-135
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   486   e-134
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   484   e-133
gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus...
483 e-133 gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus... 483 e-133 >emb|CBI21104.3| unnamed protein product [Vitis vinifera] Length = 1111 Score = 542 bits (1396), Expect = e-151 Identities = 284/569 (49%), Positives = 344/569 (60%), Gaps = 1/569 (0%) Frame = +1 Query: 2506 EDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMH 2685 ED+ SY N K +S + CCVCG SN++ +N L++C CLI++H Sbjct: 603 EDSKHSMSESYKVNSKKSIKEHRFESFISDTDAFCCVCGSSNKDEINCLLECSRCLIRVH 662 Query: 2686 QACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKE 2865 QACYG+S++PK W CR C+++ NIVCVLCGYGGGA+T A RT N+VKSLL W ++ E Sbjct: 663 QACYGVSRVPKGRWYCRPCRTSSKNIVCVLCGYGGGAMTRALRTRNIVKSLLKVWNIETE 722 Query: 2866 DNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKN 3045 K+ ++P + D L +S +GL Sbjct: 723 SWPKS-------SVPPEALQDKL----------GTLDSSRSGLE---------------- 749 Query: 3046 ANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCV 3225 N F + NTITAG+ + +V QWVHMVCGLWTPGT+C Sbjct: 750 -----------------------NESFPIHNTITAGILDSTVKQWVHMVCGLWTPGTRCP 786 Query: 3226 NVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXX 3405 NV TM FDV G PR +CS+CNRPGG CI+CRV C + FHPWCAHRKGLLQS Sbjct: 787 NVDTMSAFDVSGASRPRANVICSICNRPGGSCIKCRVLNCLVPFHPWCAHRKGLLQSEVE 846 Query: 3406 XXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEE 3585 FYGRC+ HA A C++ N+ E CARTEGYKG K E Sbjct: 847 GVDNENVGFYGRCMLHA--AHPSCELDSDPINIETDSTGEKEL-TCARTEGYKGRKQ-EG 902 Query: 3586 RKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQE 3762 + L Q+ + V QEQ+NAWLHING+KS ++ + K P +V+ D R+E+ RY+Q Sbjct: 903 FRHNLNFQSNGNGGCLVPQEQLNAWLHINGQKSCTKGLPKTPISDVEYDCRKEFARYKQA 962 Query: 3763 KRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEG 3942 K WK LVVYKSGIHALGLYT+ FIS+G MVVEYVGEIVGLRVADKRE+ Y S K++ + Sbjct: 963 KGWKHLVVYKSGIHALGLYTSRFISRGAMVVEYVGEIVGLRVADKRESDYQSGRKLQYKT 1022 Query: 3943 ACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEI 4122 ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC ERDIN GEEI 
Sbjct: 1023 ACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVISVRNEKKVVFFAERDINPGEEI 1082 Query: 4123 TYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 TYDY+FNHEDEGKKIPCFC SR CRRYLN Sbjct: 1083 TYDYHFNHEDEGKKIPCFCNSRNCRRYLN 1111 >ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus sinensis] Length = 2119 Score = 528 bits (1360), Expect = e-147 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787 CCVCG SN++ +N L++C C IK+HQACYG+SK+PK W CR C++N +IVCVLCGYG Sbjct: 1607 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1666 Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958 GGA+T A R+ +VK LL W ++ + KN ++ A ++ Sbjct: 1667 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1714 Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138 G ES + +S+P+ + + + NQ + S+ GN N KV N Sbjct: 1715 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1765 Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318 +ITAG + +V QWVHMVCGLWTPGT+C NV TM FDV G P+ VCS+CNRPGG Sbjct: 1766 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1825 Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498 CIQCRV C + FHPWCAH+KGLLQS FYGRC+ HA + + Sbjct: 1826 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1885 Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678 + + E CARTEGYKG K + L Q+ + V QEQ+NAW+HING+ Sbjct: 1886 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1941 Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855 KSS + K +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV Sbjct: 1942 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2001 Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035 EYVGEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P Sbjct: 
2002 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2061 Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 NC ERDI GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN Sbjct: 2062 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2119 >ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus sinensis] Length = 2120 Score = 528 bits (1360), Expect = e-147 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787 CCVCG SN++ +N L++C C IK+HQACYG+SK+PK W CR C++N +IVCVLCGYG Sbjct: 1608 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 1667 Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958 GGA+T A R+ +VK LL W ++ + KN ++ A ++ Sbjct: 1668 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 1715 Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138 G ES + +S+P+ + + + NQ + S+ GN N KV N Sbjct: 1716 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 1766 Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318 +ITAG + +V QWVHMVCGLWTPGT+C NV TM FDV G P+ VCS+CNRPGG Sbjct: 1767 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 1826 Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498 CIQCRV C + FHPWCAH+KGLLQS FYGRC+ HA + + Sbjct: 1827 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 1886 Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678 + + E CARTEGYKG K + L Q+ + V QEQ+NAW+HING+ Sbjct: 1887 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 1942 Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855 KSS + K +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV Sbjct: 1943 KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 2002 Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035 
EYVGEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P Sbjct: 2003 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 2062 Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 NC ERDI GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN Sbjct: 2063 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2120 >ref|XP_006450349.1| hypothetical protein CICLE_v10010421mg [Citrus clementina] gi|557553575|gb|ESR63589.1| hypothetical protein CICLE_v10010421mg [Citrus clementina] Length = 765 Score = 528 bits (1360), Expect = e-147 Identities = 274/538 (50%), Positives = 336/538 (62%), Gaps = 4/538 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYG 2787 CCVCG SN++ +N L++C C IK+HQACYG+SK+PK W CR C++N +IVCVLCGYG Sbjct: 253 CCVCGGSNKDEINCLIECSRCFIKVHQACYGVSKVPKGHWYCRPCRTNSRDIVCVLCGYG 312 Query: 2788 GGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF--- 2958 GGA+T A R+ +VK LL W ++ + KN ++ A ++ Sbjct: 313 GGAMTCALRSRTIVKGLLKAWNIETDSRHKNA------------VSSAQIMEDDLNMLHS 360 Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138 G ES + +S+P+ + + + NQ + S+ GN N KV N Sbjct: 361 SGPMLESSMLPVSRPVNTEPLSTAAWKMDF-PNQLDVLQKSS---GNA-----NNVKVHN 411 Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318 +ITAG + +V QWVHMVCGLWTPGT+C NV TM FDV G P+ VCS+CNRPGG Sbjct: 412 SITAGAFDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGASHPKANVVCSICNRPGGS 471 Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498 CIQCRV C + FHPWCAH+KGLLQS FYGRC+ HA + + Sbjct: 472 CIQCRVVNCSVKFHPWCAHQKGLLQSEVEGAENESVGFYGRCVLHATHPLCESGSDPFDI 531 Query: 3499 NLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGR 3678 + + E CARTEGYKG K + L Q+ + V QEQ+NAW+HING+ Sbjct: 532 EVVCSIEKEF---TCARTEGYKGRKR-DGFWHNLHGQSRGKSACLVPQEQLNAWIHINGQ 587 Query: 3679 KSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVV 3855 KSS + K +V+ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMVV Sbjct: 588 
KSSTNGLPKLTVSDVEYDCRKEYARYKQMKGWKHLVVYKSGIHALGLYTSRFISRGEMVV 647 Query: 3856 EYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSP 4035 EYVGEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDAT KGGIARFVNHSC P Sbjct: 648 EYVGEIVGLRVADKREIEYQSGRKLQYKSACYFFRIDKEHIIDATCKGGIARFVNHSCLP 707 Query: 4036 NCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 NC ERDI GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN Sbjct: 708 NCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 765 >gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 2104 Score = 522 bits (1344), Expect = e-145 Identities = 289/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%) Frame = +1 Query: 2260 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2430 C + I +F+ S + + D +E I I +N C +KRSL + T +G + Sbjct: 1480 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1539 Query: 2431 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2604 D S + K+K +L++ G + + + + K + ++ + Sbjct: 1540 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1597 Query: 2605 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2784 CCVCG SN++ N L++C C I++HQACYGI K+P+ W CR C+++ + VCVLCGY Sbjct: 1598 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1657 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A R+ VK LL W ++ E K+ N + D + F Sbjct: 1658 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1711 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 ++ +D + A+ ++ D + + +++ + N++ Sbjct: 1712 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1752 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 TAGV + +V QWVHMVCGLWTPGT+C NV TM FDV GV R VCS+CNRPGG CI Sbjct: 1753 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1812 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 QCRV C + FHPWCAH+KGLLQS FYGRC+ HA C+ + Sbjct: 1813 
QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1870 Query: 3505 ALKPDPETRAGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3675 L P E R CARTEG+KG K W + +++T V QEQ+NAW+HING Sbjct: 1871 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1925 Query: 3676 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3852 +KS + + K P +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV Sbjct: 1926 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1985 Query: 3853 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4032 VEYVGEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC Sbjct: 1986 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 2045 Query: 4033 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 PNC ERDI GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN Sbjct: 2046 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 2104 >gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782145|gb|EOY29401.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782148|gb|EOY29404.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782150|gb|EOY29406.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1738 Score = 522 bits (1344), Expect = e-145 Identities = 289/659 (43%), Positives = 379/659 (57%), Gaps = 9/659 (1%) Frame = +1 Query: 2260 CNTPIPKFEGGSKISTRNNDVGNETEDDI--NIKKTTNNPACNVSKKRSL-QSTAEGAMN 2430 C + I +F+ S + + D +E I I +N C +KRSL + T +G + Sbjct: 1114 CVSGIKQFDNNSFLLEKGKDDRSEKYCCIPDGIAYNRSNIRCKEIRKRSLYELTGKGKES 1173 Query: 2431 QGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSENPLVKRKR--REGSDAVSPGET 2604 D S + K+K +L++ G + + + + K + ++ + Sbjct: 1174 GSD--SHPLMEISKCMPKMKVRKSLKETGDVESHGHRSSNMNAEKSIMQTRCSSIVDSDV 1231 Query: 2605 PCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGY 2784 CCVCG SN++ N L++C C I++HQACYGI 
K+P+ W CR C+++ + VCVLCGY Sbjct: 1232 FCCVCGSSNKDEFNCLLECSRCSIRVHQACYGILKVPRGHWYCRPCRTSSKDTVCVLCGY 1291 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A R+ VK LL W ++ E K+ N + D + F Sbjct: 1292 GGGAMTQALRSRAFVKGLLKAWNIEAECGPKST------NYSAETVLDDQSLVVSNSFCN 1345 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 ++ +D + A+ ++ D + + +++ + N++ Sbjct: 1346 ------------------LQFKDLELSRTAS-WKLDVQNQLDIIRNSPCPDSKLNLYNSV 1386 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 TAGV + +V QWVHMVCGLWTPGT+C NV TM FDV GV R VCS+CNRPGG CI Sbjct: 1387 TAGVLDSTVKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGVSRKRENVVCSICNRPGGSCI 1446 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 QCRV C + FHPWCAH+KGLLQS FYGRC+ HA C+ + Sbjct: 1447 QCRVVDCSVRFHPWCAHQKGLLQSEVEGIDNENVGFYGRCMLHASHCT--CESGSEPTDA 1504 Query: 3505 ALKPDPETRAGNCARTEGYKGCKS---WEERKEELQKQTFNDNTRAVSQEQINAWLHING 3675 L P E R CARTEG+KG K W + +++T V QEQ+NAW+HING Sbjct: 1505 ELSPSRE-RESTCARTEGFKGRKQDGFWHNIYGQSKRKT----GCFVPQEQLNAWIHING 1559 Query: 3676 RKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMV 3852 +KS + + K P +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FIS+GEMV Sbjct: 1560 QKSCMQGLPKLPTSDMEYDCRKEYARYKQAKGWKHLVVYKSGIHALGLYTSRFISRGEMV 1619 Query: 3853 VEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCS 4032 VEYVGEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC Sbjct: 1620 VEYVGEIVGLRVADKRENEYESGRKVQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCL 1679 Query: 4033 PNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 PNC ERDI GEEITYDY+FNHEDEGKKIPCFC S+ CRRYLN Sbjct: 1680 PNCVAKVISVRNEKKVVFFAERDIYPGEEITYDYHFNHEDEGKKIPCFCNSKNCRRYLN 1738 >ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda] gi|548856405|gb|ERN14258.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda] Length = 2123 Score = 515 bits (1327), Expect = e-143 Identities = 278/561 (49%), Positives = 348/561 
(62%), Gaps = 1/561 (0%) Frame = +1 Query: 2476 SSKLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSPGETPCCVCGDSNEEGLNRLV 2655 S KL N E G I + K R+ + + CCVCG S+++ N ++ Sbjct: 1568 SEKLCLENVKETQGPIDVSHEVKGKKSSTKCRKRKAFILDSDVFCCVCGGSDKDDFNCIL 1627 Query: 2656 QCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKS 2835 +C CLIK+HQACYG+ K PK W CR C++++ +IVCVLCGY GGA+T A R+ N+VK+ Sbjct: 1628 ECSQCLIKVHQACYGVLKAPKGRWCCRPCRADIKDIVCVLCGYSGGAMTRALRSRNIVKN 1687 Query: 2836 LLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVA 3015 LL WK+KK S + P + D L S + G R + +S P Sbjct: 1688 LLQTWKIKKGRKSLD------PFHLSDSKHDDLNGLSGKLGGGPSRLEKMDSISAMKPGT 1741 Query: 3016 CVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVC 3195 AN DA S ++ + V + F+V NTITA V +P+VTQW+HMVC Sbjct: 1742 LERVSRVMMKANT----LDATSIMRNADILV---DDFQVHNTITAAVLDPNVTQWLHMVC 1794 Query: 3196 GLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAH 3375 GLW PGT+C NV TM FDV GV P+R VCS+C RPGG CI+CRVA C + FHPWCAH Sbjct: 1795 GLWMPGTRCPNVDTMSAFDVSGVSPPKRNTVCSICKRPGGSCIRCRVADCSVFFHPWCAH 1854 Query: 3376 RKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTE 3555 +KGLLQS FYGRC+ HA + K V H N ++ + + CARTE Sbjct: 1855 QKGLLQSEIEGVDNENVGFYGRCLFHAVNINCLTKPV-HLVNDKVEDHSDNKDPTCARTE 1913 Query: 3556 GYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDY 3732 GYKG K E L+ Q+ +++ V QEQINAWLHING+KS +R ++K P + + D Sbjct: 1914 GYKGRKK-EGLHYGLRGQSKDNSGCLVPQEQINAWLHINGQKSCTRGLIKPPASDTEYDC 1972 Query: 3733 RREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYY 3912 R+EY RY+Q K WK+LVVYKSGIHALGLYT++FI +G MVVEYVGEIVGLRVADKREA Y Sbjct: 1973 RKEYARYKQSKGWKQLVVYKSGIHALGLYTSQFIFRGAMVVEYVGEIVGLRVADKREAEY 2032 Query: 3913 HSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXX 4092 HS +++ E ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 2033 HSGRRIQYESACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITIRNEKKVVFFA 2092 Query: 4093 ERDINAGEEITYDYNFNHEDE 4155 ERDIN GEEITYDY+FN+EDE Sbjct: 2093 ERDINPGEEITYDYHFNNEDE 2113 Score = 70.9 bits (172), 
Expect = 5e-09 Identities = 46/110 (41%), Positives = 66/110 (60%), Gaps = 5/110 (4%) Frame = +1 Query: 691 EHQMSNVCSEGSDSPVSEFSD-AVGHTDMASGKLTETDVVDEGSGIGK-CSSDGIDNGVW 864 E QMSNVCSE S + V+EFS + D+ S + T ++VDEGSGI K CSSD + G+W Sbjct: 1091 EQQMSNVCSESSAAVVTEFSGRCFVNLDLGSTRSTCDEIVDEGSGIEKCCSSDAHNAGMW 1150 Query: 865 TRSKQAYTGSGNHLLGTSVQLTNLSSDVYNGSKVKTSISFKR---PVNSP 1005 + +G+ + +LG S L + S+D N KV++S+ K+ P SP Sbjct: 1151 AETAN-LSGNTDAVLGRSSTLPSHSTDPINNLKVRSSLRLKKVRLPFGSP 1199 >ref|XP_006596088.1| PREDICTED: uncharacterized protein LOC100812602 isoform X6 [Glycine max] Length = 1870 Score = 500 bits (1287), Expect = e-138 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784 CCVC S+ + +N L++C CLI++HQACYG+S +PK S W CR C++N NIVCVLCGY Sbjct: 1371 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1430 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A + +VKSLL W +K+ KN S E F Sbjct: 1431 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1469 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 E+E D SK E K K + + ++IQ T V+ FKV N+I Sbjct: 1470 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1525 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 T V +P+V QW+HMVCGLWTPGT+C NV TM FDV GV PR VC +CNR GG CI Sbjct: 1526 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1585 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 +CR+A C I FHPWCAH+K LLQS FYGRC H + + C + L Sbjct: 1586 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1641 Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681 E + CAR EGYKG R + Q V +EQ+NAW+HING+K Sbjct: 1642 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1695 Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861 SR + K P 
++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY Sbjct: 1696 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1755 Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041 +GEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1756 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1815 Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1816 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1870 >ref|XP_006596087.1| PREDICTED: uncharacterized protein LOC100812602 isoform X5 [Glycine max] Length = 1872 Score = 500 bits (1287), Expect = e-138 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784 CCVC S+ + +N L++C CLI++HQACYG+S +PK S W CR C++N NIVCVLCGY Sbjct: 1373 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1432 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A + +VKSLL W +K+ KN S E F Sbjct: 1433 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1471 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 E+E D SK E K K + + ++IQ T V+ FKV N+I Sbjct: 1472 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1527 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 T V +P+V QW+HMVCGLWTPGT+C NV TM FDV GV PR VC +CNR GG CI Sbjct: 1528 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1587 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 +CR+A C I FHPWCAH+K LLQS FYGRC H + + C + L Sbjct: 1588 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1643 Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681 E + CAR EGYKG R + Q V +EQ+NAW+HING+K Sbjct: 1644 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1697 Query: 3682 
SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861 SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY Sbjct: 1698 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1757 Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041 +GEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1758 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1817 Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1818 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 1872 >ref|XP_006596085.1| PREDICTED: uncharacterized protein LOC100812602 isoform X3 [Glycine max] Length = 2006 Score = 500 bits (1287), Expect = e-138 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784 CCVC S+ + +N L++C CLI++HQACYG+S +PK S W CR C++N NIVCVLCGY Sbjct: 1507 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1566 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A + +VKSLL W +K+ KN S E F Sbjct: 1567 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1605 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 E+E D SK E K K + + ++IQ T V+ FKV N+I Sbjct: 1606 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1661 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 T V +P+V QW+HMVCGLWTPGT+C NV TM FDV GV PR VC +CNR GG CI Sbjct: 1662 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1721 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 +CR+A C I FHPWCAH+K LLQS FYGRC H + + C + L Sbjct: 1722 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1777 Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681 E + CAR EGYKG R + Q V +EQ+NAW+HING+K Sbjct: 1778 
DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1831 Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861 SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY Sbjct: 1832 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1891 Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041 +GEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1892 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1951 Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1952 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2006 >ref|XP_006596084.1| PREDICTED: uncharacterized protein LOC100812602 isoform X2 [Glycine max] Length = 2007 Score = 500 bits (1287), Expect = e-138 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784 CCVC S+ + +N L++C CLI++HQACYG+S +PK S W CR C++N NIVCVLCGY Sbjct: 1508 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1567 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A + +VKSLL W +K+ KN S E F Sbjct: 1568 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1606 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 E+E D SK E K K + + ++IQ T V+ FKV N+I Sbjct: 1607 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1662 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 T V +P+V QW+HMVCGLWTPGT+C NV TM FDV GV PR VC +CNR GG CI Sbjct: 1663 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1722 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 +CR+A C I FHPWCAH+K LLQS FYGRC H + + C + L Sbjct: 1723 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1778 Query: 3505 ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 
3681 E + CAR EGYKG R + Q V +EQ+NAW+HING+K Sbjct: 1779 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1832 Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861 SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY Sbjct: 1833 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1892 Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041 +GEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1893 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1952 Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1953 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2007 >ref|XP_006596083.1| PREDICTED: uncharacterized protein LOC100812602 isoform X1 [Glycine max] Length = 2008 Score = 500 bits (1287), Expect = e-138 Identities = 270/536 (50%), Positives = 328/536 (61%), Gaps = 2/536 (0%) Frame = +1 Query: 2608 CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGY 2784 CCVC S+ + +N L++C CLI++HQACYG+S +PK S W CR C++N NIVCVLCGY Sbjct: 1509 CCVCRSSSNDKINYLLECSRCLIRVHQACYGVSSLPKKSSWCCRPCRTNSKNIVCVLCGY 1568 Query: 2785 GGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRG 2964 GGGA+T A + +VKSLL W +K+ KN S E F Sbjct: 1569 GGGAMTRAIMSHTIVKSLLKVWNGEKDGMPKNTT-------------------SHEVF-- 1607 Query: 2965 KERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTI 3144 E+E D SK E K K + + ++IQ T V+ FKV N+I Sbjct: 1608 -EKEIDAFLSSKDGQEVDQESVLKPKIVDTSTDLMKVTNHIQHTPTSVS---NFKVHNSI 1663 Query: 3145 TAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCI 3324 T V +P+V QW+HMVCGLWTPGT+C NV TM FDV GV PR VC +CNR GG CI Sbjct: 1664 TEAVLDPTVKQWIHMVCGLWTPGTRCPNVDTMSAFDVSGVSRPRADVVCYICNRWGGSCI 1723 Query: 3325 QCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNL 3504 +CR+A C I FHPWCAH+K LLQS FYGRC H + + C + L Sbjct: 1724 ECRIADCSIKFHPWCAHQKNLLQSETEGIDDEKIGFYGRCTLHIIEPR--CLPIYDP--L 1779 Query: 3505 
ALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK- 3681 E + CAR EGYKG R + Q V +EQ+NAW+HING+K Sbjct: 1780 DEIGSQEEKEFTCARAEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKL 1833 Query: 3682 SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEY 3861 SR + K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY Sbjct: 1834 CSRGLPKFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEY 1893 Query: 3862 VGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNC 4041 +GEIVGLRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1894 IGEIVGLRVADKREKEYQSGRKLQYKTACYFFRIDKEHIIDATRKGGIARFVNHSCLPNC 1953 Query: 4042 XXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1954 VAKVITVRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCNSKNCRRYMN 2008 >ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine max] Length = 2032 Score = 493 bits (1270), Expect = e-136 Identities = 283/647 (43%), Positives = 366/647 (56%), Gaps = 12/647 (1%) Frame = +1 Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463 T N NET D++++ PA K+ + + N+ +I ++++R Sbjct: 1420 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1479 Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634 S++ K ++ C + R +G ++S + CCVC S Sbjct: 1480 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1539 Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIVCVLCGYGGGALTHAK 2811 + +N L++C CLI++HQACYG+S +PK S W CR C++N NI CVLCGYGGGA+T A Sbjct: 1540 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIACVLCGYGGGAMTRAI 1599 Query: 2812 RTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESDLAG 2991 + +VKSLL W +K+ G P R E+E D Sbjct: 1600 MSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEIDAFP 1637 Query: 2992 LSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGNPSV 3171 SK E K K + + + S +T + +N FKV N+IT GV +P+V Sbjct: 1638 
SSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLDPTV 1696 Query: 3172 TQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQI 3351 QW+HMVCGLWTP T+C NV TM FDV GV PR VCS+CNR GG CI+CR+A C + Sbjct: 1697 KQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIADCSV 1756 Query: 3352 SFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDPETR 3531 FHPWCAH+K LLQS FYGRC+ H + + C + L E + Sbjct: 1757 KFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQEQK 1812 Query: 3532 AGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVVKNP 3708 CAR EGYKG R + Q V +EQ+NAW+HING+K S+ + K P Sbjct: 1813 EFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLPKFP 1866 Query: 3709 GMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRV 3888 ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVGLRV Sbjct: 1867 DLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVGLRV 1926 Query: 3889 ADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXX 4068 ADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1927 ADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVITVRH 1986 Query: 4069 XXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1987 EKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2032 >ref|XP_002519907.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis] gi|223540953|gb|EEF42511.1| mixed-lineage leukemia protein, mll, putative [Ricinus communis] Length = 1125 Score = 491 bits (1264), Expect = e-135 Identities = 285/616 (46%), Positives = 359/616 (58%), Gaps = 14/616 (2%) Frame = +1 Query: 2386 SKKRSL-QSTAEGAMNQGDIVSEQVR----GSCSMSSKLKNFNALEDAGCIFEGSYSENP 2550 ++KRSL + T +G + +VS + + L+N D GS +P Sbjct: 545 TRKRSLYELTLKGKSSSPKMVSRKKNFKYVPKMKLGKTLRNSEKSHD-----NGSQKVDP 599 Query: 2551 LVKRKRREGSD-AVSPGETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGW 2727 KR RE +++ ++ C VC SN++ +N L++C+ C I++HQACYG+S++PK W Sbjct: 600 --KRCAREQKHLSITDMDSFCSVCRSSNKDEVNCLLECRRCSIRVHQACYGVSRVPKGHW 657 Query: 2728 
KCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNL 2907 CR C+++ +IVCVLCGYGGGA+T A R+ +VK LL W ++ E +KN Sbjct: 658 YCRPCRTSAKDIVCVLCGYGGGAMTLALRSRTIVKGLLKAWNLEIESVAKN--------- 708 Query: 2908 PTSKIADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNI 3087 I SPE E + S P P + N + T N ++ Sbjct: 709 ---------AISSPEILH---HEMSMLHSSGPGPENRSYPVLRPVNIEPST-STVCNKDV 755 Query: 3088 QEG-----NTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFD 3252 Q N+ L+N KV+N+ITAGV + +V QWVHMVCGLWTPGT+C NV TM FD Sbjct: 756 QNHLDILPNSLGHLSN-LKVNNSITAGVLDSTVKQWVHMVCGLWTPGTRCPNVNTMSAFD 814 Query: 3253 VFGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXF 3432 V G PR VCS+C+RPGG CIQCRVA C I FHPWCAH+KGLLQS F Sbjct: 815 VSGASCPRANVVCSICDRPGGSCIQCRVANCSIQFHPWCAHQKGLLQSEAEGVDNENVGF 874 Query: 3433 YGRCIAHAE--DAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQK 3606 YGRC+ HA + C E P + +CARTEGYKG K + Sbjct: 875 YGRCVLHATYPTIESACDSAIFEAGY-----PAEKEVSCARTEGYKGRKR-DGFWHNTNS 928 Query: 3607 QTFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLV 3783 Q+ + V QEQ +AW+HING+KS ++ ++K P E + D R+EY RY+Q K WK LV Sbjct: 929 QSKGKSGCLVPQEQFDAWVHINGQKSCAQGILKLPMSEKEYDCRKEYTRYKQGKAWKHLV 988 Query: 3784 VYKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRI 3963 VYKSGIHALGLYTA FIS+GEMVVEYVGEIVGLRVADKRE Y S K++ + ACYFFRI Sbjct: 989 VYKSGIHALGLYTARFISRGEMVVEYVGEIVGLRVADKRENEYQSGRKLQYKSACYFFRI 1048 Query: 3964 DKENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFN 4143 DKENIIDAT KGGIARFVNHSC PNC ERDI GEEITYDY+FN Sbjct: 1049 DKENIIDATHKGGIARFVNHSCLPNCVAKVISVRNDKKVVFFAERDIYPGEEITYDYHFN 1108 Query: 4144 HEDEGKKIPCFCKSRI 4191 HEDE +K F RI Sbjct: 1109 HEDEVQKFWKFSAVRI 1124 >ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine max] Length = 2033 Score = 489 bits (1260), Expect = e-135 Identities = 284/650 (43%), Positives = 367/650 (56%), Gaps = 15/650 (2%) Frame = +1 Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463 T N NET D++++ PA 
K+ + + N+ +I ++++R Sbjct: 1418 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1477 Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634 S++ K ++ C + R +G ++S + CCVC S Sbjct: 1478 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1537 Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIV---CVLCGYGGGALT 2802 + +N L++C CLI++HQACYG+S +PK S W CR C++N NIV CVLCGYGGGA+T Sbjct: 1538 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMT 1597 Query: 2803 HAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESD 2982 A + +VKSLL W +K+ G P R E+E D Sbjct: 1598 RAIMSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEID 1635 Query: 2983 LAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGN 3162 SK E K K + + + S +T + +N FKV N+IT GV + Sbjct: 1636 AFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLD 1694 Query: 3163 PSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAK 3342 P+V QW+HMVCGLWTP T+C NV TM FDV GV PR VCS+CNR GG CI+CR+A Sbjct: 1695 PTVKQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIAD 1754 Query: 3343 CQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDP 3522 C + FHPWCAH+K LLQS FYGRC+ H + + C + L Sbjct: 1755 CSVKFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQ 1810 Query: 3523 ETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVV 3699 E + CAR EGYKG R + Q V +EQ+NAW+HING+K S+ + Sbjct: 1811 EQKEFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLP 1864 Query: 3700 KNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVG 3879 K P ++++ D R+EY RY+Q K WK LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVG Sbjct: 1865 KFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVG 1924 Query: 3880 LRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXX 4059 LRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1925 LRVADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVIT 1984 Query: 4060 XXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG 
KIPC+C S+ CRRY+N Sbjct: 1985 VRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2033 >ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine max] Length = 2035 Score = 489 bits (1260), Expect = e-135 Identities = 284/650 (43%), Positives = 367/650 (56%), Gaps = 15/650 (2%) Frame = +1 Query: 2305 TRNNDVGNETEDDINIKKTTNN---PACNVSKKRSLQSTAEGAMNQGDIV----SEQVRG 2463 T N NET D++++ PA K+ + + N+ +I ++++R Sbjct: 1420 TENTIFLNETNVDVSMEDLERGGKPPAVYKGKRDAKAKQGDSVGNRANISLKVKNKEIRK 1479 Query: 2464 SCSMSS-KLKNFNALEDAGCIFEGSYSENPLVKRKRREGSDAVSP--GETPCCVCGDSNE 2634 S++ K ++ C + R +G ++S + CCVC S Sbjct: 1480 QRSINELTAKETKVMDMTKCAQDQEPGLCGTKSRNSIQGHTSISTINSDAFCCVCRRSTN 1539 Query: 2635 EGLNRLVQCQSCLIKMHQACYGISKIPK-SGWKCRACKSNLTNIV---CVLCGYGGGALT 2802 + +N L++C CLI++HQACYG+S +PK S W CR C++N NIV CVLCGYGGGA+T Sbjct: 1540 DKINCLLECSRCLIRVHQACYGVSTLPKKSSWCCRPCRTNSKNIVYPACVLCGYGGGAMT 1599 Query: 2803 HAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQFRGKERESD 2982 A + +VKSLL W +K+ G P R E+E D Sbjct: 1600 RAIMSHTIVKSLLKVWNCEKD-------GMP---------------RDTTSCEVLEKEID 1637 Query: 2983 LAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDNTITAGVGN 3162 SK E K K + + + S +T + +N FKV N+IT GV + Sbjct: 1638 AFPSSKDGLEVDQESVLKPKIVDTSTDLMNQISTNHIPHTPTSFSN-FKVHNSITEGVLD 1696 Query: 3163 PSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGLCIQCRVAK 3342 P+V QW+HMVCGLWTP T+C NV TM FDV GV PR VCS+CNR GG CI+CR+A Sbjct: 1697 PTVKQWIHMVCGLWTPRTRCPNVDTMSAFDVSGVSRPRADVVCSICNRWGGSCIECRIAD 1756 Query: 3343 CQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEKNLALKPDP 3522 C + FHPWCAH+K LLQS FYGRC+ H + + C + L Sbjct: 1757 CSVKFHPWCAHQKNLLQSETEGINDEKIGFYGRCMLHTIEPR--CLFIYDP--LDEIGSQ 1812 Query: 3523 ETRAGNCARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINAWLHINGRK-SSRAVV 3699 E + CAR EGYKG R + Q V +EQ+NAW+HING+K S+ + Sbjct: 1813 EQKEFTCARVEGYKG-----RRWDGFQNNQCQGGC-LVPEEQLNAWIHINGQKLCSQGLP 1866 Query: 3700 KNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFISKGEMVVEYVGEIVG 3879 K P ++++ D R+EY RY+Q K WK 
LVVYKS IHALGLYT+ FIS+GEMVVEY+GEIVG Sbjct: 1867 KFPDLDIEHDCRKEYARYKQAKGWKHLVVYKSRIHALGLYTSRFISRGEMVVEYIGEIVG 1926 Query: 3880 LRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARFVNHSCSPNCXXXXXX 4059 LRVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARFVNHSC PNC Sbjct: 1927 LRVADKREKEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARFVNHSCLPNCVAKVIT 1986 Query: 4060 XXXXXXXXXXXERDINAGEEITYDYNFNHEDEGKKIPCFCKSRICRRYLN 4209 ERDI GEEITYDY+FNHEDEG KIPC+C S+ CRRY+N Sbjct: 1987 VRHEKKVVFLAERDIFPGEEITYDYHFNHEDEG-KIPCYCYSKNCRRYMN 2035 >ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca subsp. vesca] Length = 2169 Score = 486 bits (1251), Expect = e-134 Identities = 351/981 (35%), Positives = 473/981 (48%), Gaps = 29/981 (2%) Frame = +1 Query: 1354 RNKDMSVQHVQVKGDKLVIQLRQGALEKNAKQCILDDIEQLVERASSRKILEHEDPNFEN 1533 R+KD +Q+++ +G K+ + R+ ALE NA C D SSR E+ + N + Sbjct: 1266 RDKDKHLQNLE-QGLKIGKRKRELALELNAS-CSNSD--------SSRVRQENHNSNGTS 1315 Query: 1534 GQISRFKGTKIGEDRSKSSKYGRLVTAPKPIKISNNKQ--IPKGGKMVSLRSILK--YPE 1701 S+ +K S S K G VT + S+ + I K + LRS L + + Sbjct: 1316 QFTSQ--PSKSLMMLSTSRKSGTHVTGNCITQSSSKPRLHISSSAKKLLLRSDLHKLHDD 1373 Query: 1702 KQIRCQARKSENLRSGYD-WERPIITRGETSSMKVSSLQRMKKCRLLERQSNMSEFSSEC 1878 K+ L G + E P ++ G+T SS RQ + E S + Sbjct: 1374 KESEVNNVFQTELNGGANNHELPEVSGGKTCKRDCSSNAF--------RQFQIQESSRKD 1425 Query: 1879 NRKT-YDRVDTPKEVRKRSLCKLSNESPSKMKNHFS---SDAMNDSRVFNPTRNFKTCSG 2046 ++T Y+ VD K + + K+ + + +D + R+ P + Sbjct: 1426 TKRTKYNSVDGFKSTCSQQV-KIGHRKARPIVCGIYGELTDGSSTGRMSKPAKLVPLSRV 1484 Query: 2047 QETKNGSLLP--CNNQSSKMNIQSKTESQKDIVAITSCTTNKANHSIFSSANSSLGSGRL 2220 + +LP CN++SS M +K + C T + ++ + Sbjct: 1485 LNSSRKCILPKLCNSKSSSMR-------KKKLGGAAICNTYDLKTEKYKCHDAMV----- 1532 Query: 2221 KIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETE----DDINIKKTTNNPACNVS 2388 K+ T+ C+ + + DV +E + D I + P Sbjct: 1533 KVNDTSMRKKKKECSPGEREIHKELFSMEKQGDVQSEKDHQKLDSITHTQLQMKP--KEI 1590 Query: 2389 KKRSLQSTAEGAMNQGDIVSEQVRGSCSMSSKLKNFNALEDAGCIFEGSYSE--NPLVKR 2562 +KRS+ E + G S SK+ NF D + G S K 
Sbjct: 1591 RKRSIYEFTEKGDDTGF--------KSSSVSKISNFRPANDGKLVNTGEDSGLCQHSAKN 1642 Query: 2563 KRREGSDAVSPGETP-CCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRA 2739 +E + P CCVCG SN++ +N L++C C +++HQACYG+SK+PK W CR Sbjct: 1643 STQEHRCHCNCDSDPICCVCGSSNQDEINILLECSQCSVRVHQACYGVSKVPKGCWSCRP 1702 Query: 2740 CKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSK 2919 C+ + +IVCVLCGYGGGA+T A R++ + S+L W ++ E KN C K Sbjct: 1703 CRMSSKDIVCVLCGYGGGAMTQALRSQTIAVSILRAWNIETECGPKN---ELCSIKTLQK 1759 Query: 2920 IADALVIRSPEQFRGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGN 3099 + L +R E S P+A + + D N + Sbjct: 1760 DSTGLHCSG---YRHSESSSLFVSQQSGQPLAAAHCK------RGMSYRVDGVENSPSVS 1810 Query: 3100 TKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRR 3279 + KV N+IT G+ + + QWVHMVCGLWTP T+C NV TM FDV V Sbjct: 1811 -------KTKVHNSITMGLVDSATKQWVHMVCGLWTPETRCPNVDTMSAFDVSCVPLSTD 1863 Query: 3280 KQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAE 3459 VC +C R GG CIQCRV C + FHPWCAH+KGLLQ+ FYGRC HA Sbjct: 1864 DAVCCMCKRAGGSCIQCRVENCSVRFHPWCAHQKGLLQTEVEGVDNENVGFYGRCGLHAT 1923 Query: 3460 ----------DAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQ 3609 D + C EK L CARTEGYKG K R + Sbjct: 1924 HPIYKSEYPVDTEAGCL---DEKKLV-----------CARTEGYKGRKRDGFRHNYCDRS 1969 Query: 3610 TFNDNTRAVSQEQINAWLHINGRKS-SRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVV 3786 +D V QEQ+NAW +ING+KS ++ + K E++ D R+EY RY+Q K WK LVV Sbjct: 1970 KGSDGC-LVPQEQLNAWAYINGQKSCTQELPKLAISEIEHDSRKEYTRYKQAKLWKHLVV 2028 Query: 3787 YKSGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRID 3966 YKSGIHALGLYT+ FIS+ EMVVEYVGEIVG RV+DKRE Y S K++ + ACYFFRID Sbjct: 2029 YKSGIHALGLYTSRFISRDEMVVEYVGEIVGQRVSDKRENEYQSAKKLQYKSACYFFRID 2088 Query: 3967 KENIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNH 4146 KE+IIDAT KGGIARFVNHSCSPNC ERDI GEEITYDY+FNH Sbjct: 2089 KEHIIDATCKGGIARFVNHSCSPNCVAKVISVRNEKKVVFLAERDIFPGEEITYDYHFNH 2148 Query: 4147 EDEGKKIPCFCKSRICRRYLN 4209 EDEGKKIPCFC S+ CRRYLN Sbjct: 2149 EDEGKKIPCFCNSKNCRRYLN 2169 >gb|EXB80746.1| 
Histone-lysine N-methyltransferase ATX1 [Morus notabilis] Length = 2073 Score = 484 bits (1246), Expect = e-133 Identities = 264/527 (50%), Positives = 325/527 (61%), Gaps = 8/527 (1%) Frame = +1 Query: 2599 ETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPKSGWKCRACKSNLTNIVCVLC 2778 E+ CCVCG S+++ N L++C CLIK+HQACYG+S+ PK W CR C+++ NIVCVLC Sbjct: 1545 ESFCCVCGSSDKDDTNNLLECNICLIKVHQACYGVSRAPKGHWYCRPCRTSSRNIVCVLC 1604 Query: 2779 GYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPCPNLPTSKIADALVIRSPEQF 2958 GYGGGA+T A R+ +VKSLL W V+ E + ++K +L T ++ Sbjct: 1605 GYGGGAMTRALRSRTIVKSLLRVWNVETEWKALSVK-----DLETLTRLNS--------- 1650 Query: 2959 RGKERESDLAGLSKPMPVACVEKEDKRKNANANQFETDANSNIQEGNTKVALNNRFKVDN 3138 G ERE G S PM C + K + + + N ++ + V + KVDN Sbjct: 1651 SGPEREE---GTSFPM---CQPENTKPLASVVCKMDMPYNVDVLRNSLCV---KKLKVDN 1701 Query: 3139 TITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDVFGVCFPRRKQVCSVCNRPGGL 3318 +ITAG + + QWVHMVCGLWTPGT+C NV TM FDV G PR VCS+CNRPGG Sbjct: 1702 SITAGFLDSTTKQWVHMVCGLWTPGTRCPNVDTMSAFDVSGAPHPRADVVCSMCNRPGGS 1761 Query: 3319 CIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFYGRCIAHAEDAKKQCKVVQHEK 3498 CI+CRV C + FHPWCAH+KGLLQS FYGRC HA + + + Sbjct: 1762 CIKCRVLNCSVRFHPWCAHQKGLLQSEVEGIDNENIGFYGRCARHATHP-----MCESDS 1816 Query: 3499 NLALKPDPETRAGN-------CARTEGYKGCKSWEERKEELQKQTFNDNTRAVSQEQINA 3657 + A D + AG CARTEGYKG K R Q + V QEQ+NA Sbjct: 1817 DPA---DTDRVAGGSAVEELTCARTEGYKGRKRDGVRHNYCQSK--GKVGCYVPQEQLNA 1871 Query: 3658 WLHINGRKSS-RAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYKSGIHALGLYTAEFI 3834 W+HING+KS + V + P +++ D R+EY RY+Q K WK LVVYKSGIHALGLYT+ FI Sbjct: 1872 WIHINGQKSCIQGVHRLPTSDIEHDCRKEYARYKQGKGWKHLVVYKSGIHALGLYTSRFI 1931 Query: 3835 SKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKENIIDATRKGGIARF 4014 S+ EMVVEYVGEIVG RVADKRE Y S K++ + ACYFFRIDKE+IIDATRKGGIARF Sbjct: 1932 SRSEMVVEYVGEIVGQRVADKRENEYQSGRKLQYKSACYFFRIDKEHIIDATRKGGIARF 1991 Query: 4015 VNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHEDE 4155 VNHSC PNC ERDI GEEITYDY+FNHEDE Sbjct: 1992 VNHSCLPNCVAKVISIRNEKKVVFFAERDIFPGEEITYDYHFNHEDE 2038 
>gb|ESW33157.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris] gi|561034628|gb|ESW33158.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris] Length = 2002 Score = 483 bits (1244), Expect = e-133 Identities = 315/859 (36%), Positives = 432/859 (50%), Gaps = 52/859 (6%) Frame = +1 Query: 1789 SSMKVSSLQRMKKCRLLERQSNMSEFSSECNRKTYDRVDTP-----KEVRKRSLCKLSNE 1953 SS + + +M L++ N S F CN++ + +RK K+++E Sbjct: 1197 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1256 Query: 1954 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--- 2076 S S K++N+ +SD + P S +E K L P Sbjct: 1257 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1309 Query: 2077 -------------CNNQSSKMNIQSKTESQKD--IVAITSCTTNKANHSIFSSANSSLGS 2211 C N E QK IV++ + ++ L S Sbjct: 1310 RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1369 Query: 2212 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2391 + + + C K + I T+N + NE D++++ + +K Sbjct: 1370 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1426 Query: 2392 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2556 ++ Q G N+ +++ ++ +R S++ K + C + E L Sbjct: 1427 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1481 Query: 2557 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2718 KRR +G +S +T CCVC S+ + +N L++C CLI++HQACYG+S +PK Sbjct: 1482 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1541 Query: 2719 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPC 2898 S W CR C++N NI CVLCGYGGGA+T A + +VKSLL W +K+D K+ Sbjct: 1542 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1595 Query: 2899 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3075 + E + ++D KP A + R + N Q+ Sbjct: 1596 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1643 Query: 3076 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3255 T L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM FDV Sbjct: 1644 
--------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1694 Query: 3256 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3435 GV PR VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS FY Sbjct: 1695 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1754 Query: 3436 GRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTF 3615 G C+ H + +K E + CAR EGYKG R+ + + Sbjct: 1755 GSCMLHTIEPSYLSIYDPIDKI----GSQEEKEFTCARAEGYKG------RRWDGFQNNH 1804 Query: 3616 NDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYK 3792 V +EQ+NAW+HING+K S+ + K ++++ + R+EY RY+Q K WK LVVYK Sbjct: 1805 CQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGWKHLVVYK 1864 Query: 3793 SGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKE 3972 S IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE Y S K++ + ACYFFRIDKE Sbjct: 1865 SRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACYFFRIDKE 1924 Query: 3973 NIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHED 4152 +IIDATRKGGIARFVNHSC PNC ERDI GEEITYDY+FNHED Sbjct: 1925 HIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYDYHFNHED 1984 Query: 4153 EGKKIPCFCKSRICRRYLN 4209 EG KIPC+C S+ CRRY+N Sbjct: 1985 EG-KIPCYCNSKNCRRYMN 2002 >gb|ESW33155.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris] gi|561034626|gb|ESW33156.1| hypothetical protein PHAVU_001G047700g [Phaseolus vulgaris] Length = 2000 Score = 483 bits (1244), Expect = e-133 Identities = 315/859 (36%), Positives = 432/859 (50%), Gaps = 52/859 (6%) Frame = +1 Query: 1789 SSMKVSSLQRMKKCRLLERQSNMSEFSSECNRKTYDRVDTP-----KEVRKRSLCKLSNE 1953 SS + + +M L++ N S F CN++ + +RK K+++E Sbjct: 1195 SSSSLPNEMQMHSLSSLQKSFNKSSFVQPCNKRIQSAFSSKFNSCKNSLRKHLSYKVAHE 1254 Query: 1954 SPS----------------KMKNHFSSDAMNDSRVFNPTRNFKTCSGQETKNGSLLP--- 2076 S S K++N+ +SD + P S +E K L P Sbjct: 1255 SQSDSYAEFCTLPGVSGTKKLRNNLTSDCFEQFHMQEP-------SYEEPKKAELWPFLC 1307 Query: 2077 -------------CNNQSSKMNIQSKTESQKD--IVAITSCTTNKANHSIFSSANSSLGS 2211 C N E QK IV++ + ++ L S Sbjct: 1308 
RKENGHRITRPVVCGKYGEIRNGHLAKEVQKPAKIVSLNKVLKSSKRCMSYTKGKPRLTS 1367 Query: 2212 GRLKIQHTNNELMSAICNTPIPKFEGGSKISTRNNDVGNETEDDINIKKTTNNPACNVSK 2391 + + + C K + I T+N + NE D++++ + +K Sbjct: 1368 KKKWKRLSIGTDSEYCCGNRGLKVK--EHIETQNTIIYNEASVDMSLEDLERGGKQD-AK 1424 Query: 2392 KRSLQSTAEGAMNQGDIV----SEQVRGSCSMSS-KLKNFNALEDAGCIFEGSYSENPLV 2556 ++ Q G N+ +++ ++ +R S++ K + C + E L Sbjct: 1425 AKAKQGVRVG--NRENVLLKVKNKDIRKHRSINELTAKETKVTDMMSCAQD---REPGLC 1479 Query: 2557 KRKRR---EGSDAVSP--GETPCCVCGDSNEEGLNRLVQCQSCLIKMHQACYGISKIPK- 2718 KRR +G +S +T CCVC S+ + +N L++C CLI++HQACYG+S +PK Sbjct: 1480 STKRRNSIQGHTNISTIYSDTFCCVCRSSSNDKINCLLECCQCLIRVHQACYGVSTLPKK 1539 Query: 2719 SGWKCRACKSNLTNIVCVLCGYGGGALTHAKRTENVVKSLLHCWKVKKEDNSKNLKGNPC 2898 S W CR C++N NI CVLCGYGGGA+T A + +VKSLL W +K+D K+ Sbjct: 1540 SRWCCRPCRTNSKNIACVLCGYGGGAMTRATMSHTIVKSLLKVWNSEKDDMPKH------ 1593 Query: 2899 PNLPTSKIADALVIRSPEQFRGKERESDLAGLSKPMPV-ACVEKEDKRKNANANQFETDA 3075 + E + ++D KP A + R + N Q+ Sbjct: 1594 --------TTSCEFFGEEIYAFSSSKADQESALKPKIFDASTDLVKVRISTNNTQY---- 1641 Query: 3076 NSNIQEGNTKVALNNRFKVDNTITAGVGNPSVTQWVHMVCGLWTPGTKCVNVRTMGVFDV 3255 T L + FKV N+IT GV + +V QW+HMVCGLWTPGT+C NV TM FDV Sbjct: 1642 --------TPTTLYS-FKVHNSITEGVLDSTVKQWIHMVCGLWTPGTRCPNVDTMSAFDV 1692 Query: 3256 FGVCFPRRKQVCSVCNRPGGLCIQCRVAKCQISFHPWCAHRKGLLQSXXXXXXXXXXXFY 3435 GV PR VCS+CNR GG CI+CR+A C + FHPWCAH K LLQS FY Sbjct: 1693 SGVSRPRADVVCSICNRWGGSCIECRMADCSVKFHPWCAHLKNLLQSETEGIDDEKIGFY 1752 Query: 3436 GRCIAHAEDAKKQCKVVQHEKNLALKPDPETRAGNCARTEGYKGCKSWEERKEELQKQTF 3615 G C+ H + +K E + CAR EGYKG R+ + + Sbjct: 1753 GSCMLHTIEPSYLSIYDPIDKI----GSQEEKEFTCARAEGYKG------RRWDGFQNNH 1802 Query: 3616 NDNTRAVSQEQINAWLHINGRK-SSRAVVKNPGMEVKTDYRREYLRYRQEKRWKRLVVYK 3792 V +EQ+NAW+HING+K S+ + K ++++ + R+EY RY+Q K WK LVVYK Sbjct: 1803 CQGGCVVPEEQLNAWIHINGQKLCSQGLTKFSDLDMEHNCRKEYTRYKQAKGWKHLVVYK 1862 Query: 3793 SGIHALGLYTAEFISKGEMVVEYVGEIVGLRVADKREAYYHSRGKMRQEGACYFFRIDKE 3972 S IHALGLYT+ FIS+GE+VVEY+GEIVGLRVADKRE Y S K++ + ACYFFRIDKE Sbjct: 1863 
SRIHALGLYTSRFISRGEVVVEYIGEIVGLRVADKREKDYQSGKKLQDKSACYFFRIDKE 1922 Query: 3973 NIIDATRKGGIARFVNHSCSPNCXXXXXXXXXXXXXXXXXERDINAGEEITYDYNFNHED 4152 +IIDATRKGGIARFVNHSC PNC ERDI GEEITYDY+FNHED Sbjct: 1923 HIIDATRKGGIARFVNHSCLPNCVAKVITVRHEKKVVFFAERDIFPGEEITYDYHFNHED 1982 Query: 4153 EGKKIPCFCKSRICRRYLN 4209 EG KIPC+C S+ CRRY+N Sbjct: 1983 EG-KIPCYCNSKNCRRYMN 2000