BLASTX nr result
ID: Akebia26_contig00028962
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00028962
         (1100 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

emb|CBI21105.3| unnamed protein product [Vitis vinifera]               189  1e-45
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...    109  2e-21
ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Popu...    108  3e-21
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...    103  1e-19
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...    103  1e-19
ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, part...    103  1e-19
ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prun...     95  6e-17
ref|XP_002519906.1| conserved hypothetical protein [Ricinus comm...     86  4e-14
ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma...     65  4e-08
ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [...     65  4e-08
ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma...     65  4e-08
ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma...     65  4e-08
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...
65 5e-08 >emb|CBI21105.3| unnamed protein product [Vitis vinifera] Length = 1012 Score = 189 bits (481), Expect = 1e-45 Identities = 144/373 (38%), Positives = 196/373 (52%), Gaps = 14/373 (3%) Frame = +1 Query: 7 ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186 I + LD KLEQFRG AKSS++SM LS +T TE ++ +A N+V+++ R LH Sbjct: 611 INNALDAAKLEQFRGDAAKSSVISMLLSHLTTPTEGNMQSKAINNVVNDNGHFVPRSLHF 670 Query: 187 ESHSFDFRS-----NGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHN 351 ESH N ++ + R+ N + L F +DKGK TD SY A +++ Sbjct: 671 ESHIAKRDPVYSPWNSANGLERESNINDLSFHRYMDKGKRVGFVTDG-SYAATESTFGFY 729 Query: 352 KQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGN 531 KQM S FTG+ +H PS+ H+K S Y Q + DASNA N N K SC G+ Sbjct: 730 KQMGSSGTFTGVAGSDH-PSSSAVHDK-SCYSRQLLGMPPDASNASNSFNFSGKFSCLGS 787 Query: 532 SS-DPAFLRSANSSTVIAGAGSVMP-----MGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 S D F++S + G+G +P G SS +S+ PNLTP+ IGVSPY Sbjct: 788 SGLDNVFVKSISPP---MGSGINVPSQAVSTGFSSASSLSVPNLTPSLPTKESIGVSPYL 844 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +DEN S + HAI SL ++GR S ++Q GS D L S+EL+ Sbjct: 845 LDENFKLLALRHILELSNREHAITSLGMNQKEGRFSSSSDPKVQ--GSVVDTLTSDELKH 902 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050 G LT +QNASEV K LQS NH +EKL V+ +NW + T +G+ SK +D Sbjct: 903 GLKLTSEQNASEVPLKLLQSGGNHRMGGDMEKLVPVADQNNWFDISTFTQGIPLCSKGID 962 Query: 1051 MQNPPHE--SLTN 1083 Q+ P E SL+N Sbjct: 963 SQDLPCEQPSLSN 975 >ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda] gi|548856405|gb|ERN14258.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda] Length = 2123 Score = 109 bits (273), Expect = 2e-21 Identities = 105/380 (27%), Positives = 175/380 (46%), Gaps = 14/380 (3%) Frame = +1 Query: 1 PGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFST--STEKDLHFR-----APTNMVDNSC 159 PGI + L+ G M+K+S++SM LS + E+ L + AP ++V Sbjct: 568 PGIVNLLE--------GHMSKNSIMSMLLSPMENFGTNEEGLMLQPNSNMAPEHLVPKLI 619 Query: 160 
PSTSRILHGESHSFDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTS 339 S S++L ++ F +N S+ M R+L N++D K ++ + S ++ S Sbjct: 620 HSNSQLLKSGTNCFT--TNKSEMMERKL-------ANHIDAVKMSRDMPNGSSTFSSIGS 670 Query: 340 LFHNKQMADSSPFTGLVS-GNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516 H KQ DS V GNH S + + P++ + P+ I+ + RN S+ F K Sbjct: 671 TVHVKQTGDSLLHGISVGHGNHSNSVMLGGQSPAN-LPHPAIILSAEPDVRNTSDHFVKP 729 Query: 517 SCHGNSS--DPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPY 690 SC+ N++ +F A+ S G+ SVMP+ S N I NLT N G+ Sbjct: 730 SCNANANANPDSFFHRADDSAASTGS-SVMPVNFSGWNPIYLSNLTTILPNGDLTGLRHQ 788 Query: 691 FMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELR 870 DEN SKQ + A+ +QG+ + ST+++ S ++R E + Sbjct: 789 VSDENLRAPTLRSLPQVSKQDNKAATPCMNLDQGQFYCHSTVQLPNDYSQQERFGPEP-K 847 Query: 871 EGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNW---CNFLTSPRGVFNSK 1041 +GP L Q+ +E K + C D EKL+ ++G +N+ CN T+P + Sbjct: 848 QGPVLNGNQDTTEEQDKTTRFCCKGLLDGGREKLSCLTGPNNYCKCCNLTTAPSISLQPR 907 Query: 1042 ELDMQNPP-HESLTNKQPLL 1098 +D+ + H++ +QPLL Sbjct: 908 GIDVHSSHCHQNCCVEQPLL 927 >ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Populus trichocarpa] gi|550317856|gb|ERP49556.1| hypothetical protein POPTR_0018s02180g [Populus trichocarpa] Length = 868 Score = 108 bits (271), Expect = 3e-21 Identities = 111/370 (30%), Positives = 157/370 (42%), Gaps = 6/370 (1%) Frame = +1 Query: 7 ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186 I +T+++ K+E F+G +AKS+ V + F++ E + + R+ +N+V+++ LH Sbjct: 535 IKNTINVGKIENFKGQVAKST-VFLPFKHFNSPLEGNSYSRSTSNVVNSTEHIVHETLHS 593 Query: 187 ESHSFDFRSN----GSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNK 354 ESH+ + N G + + RQ GF DKGKG T S HN Sbjct: 594 ESHAVKYPGNVPLNGGNGLERQRTDPEFGFSRPRDKGKGVGCLTGNSFDETNLVSKMHNW 653 Query: 355 QMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGNS 534 + SS F+ +++GN + HEK + SSI +AS+A Sbjct: 654 KKNPSS-FSEVINGNICAAFPMMHEK-NHIPNHLSSIPLEASDA---------------- 695 Query: 535 SDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXX 714 
GS P S LTPA GI SPY +D+N Sbjct: 696 ------------------GSFFPSQAVPLGS----GLTPAMLKQDGISASPYLLDDNLRL 733 Query: 715 XXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTVK 894 SKQ H ++ L PEQ R +++Q S + AS R K Sbjct: 734 LAFRQILELSKQQHEMSPLGKNPEQDRC-----VKLQH--SLFEPAASGLNRHETTFISK 786 Query: 895 QNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ-NPPH 1068 QN SEV+ K QS V K A V+G+SNWCNF T +G F S+E D Q H Sbjct: 787 QNVSEVSMKSTQSTPTVKMGDDVAKFAHVTGLSNWCNFSTLTQGRPFYSQENDKQCQLSH 846 Query: 1069 ESLTNKQPLL 1098 L N+QP L Sbjct: 847 GHLQNEQPSL 856 >ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus sinensis] Length = 2119 Score = 103 bits (258), Expect = 1e-19 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183 GI + D KL++F G + K+S+V L+ ST+ E + + +A +MV S+ I+ Sbjct: 569 GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 622 Query: 184 GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336 H + + + D RQLN S LGF DKGKG D SY + Sbjct: 623 KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 681 Query: 337 SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516 KQ + G+ P + H+K Y Q S + DA +ARN N KV Sbjct: 682 VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 740 Query: 517 SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678 G+S +D FL S S S ++ M L+++ S+ + PA G G Sbjct: 741 PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 798 Query: 679 VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855 VSPY +D+N SKQ AI+SL E GR S + ++ G S A Sbjct: 799 VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 854 Query: 856 SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032 E GP +T ++++S VA S + +EK + ++ ++N C F T G Sbjct: 855 FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 914 Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089 S+E+D+Q PH+ +NKQ Sbjct: 915 
LSREIDLQCQFPHDPPSNKQ 934 >ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus sinensis] Length = 2120 Score = 103 bits (258), Expect = 1e-19 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183 GI + D KL++F G + K+S+V L+ ST+ E + + +A +MV S+ I+ Sbjct: 570 GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 623 Query: 184 GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336 H + + + D RQLN S LGF DKGKG D SY + Sbjct: 624 KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 682 Query: 337 SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516 KQ + G+ P + H+K Y Q S + DA +ARN N KV Sbjct: 683 VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 741 Query: 517 SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678 G+S +D FL S S S ++ M L+++ S+ + PA G G Sbjct: 742 PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 799 Query: 679 VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855 VSPY +D+N SKQ AI+SL E GR S + ++ G S A Sbjct: 800 VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 855 Query: 856 SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032 E GP +T ++++S VA S + +EK + ++ ++N C F T G Sbjct: 856 FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 915 Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089 S+E+D+Q PH+ +NKQ Sbjct: 916 LSREIDLQCQFPHDPPSNKQ 935 >ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, partial [Citrus clementina] gi|557553576|gb|ESR63590.1| hypothetical protein CICLE_v10010345mg, partial [Citrus clementina] Length = 938 Score = 103 bits (258), Expect = 1e-19 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183 GI + D KL++F G + K+S+V L+ ST+ E + + +A +MV S+ I+ Sbjct: 570 
GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 623 Query: 184 GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336 H + + + D RQLN S LGF DKGKG D SY + Sbjct: 624 KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 682 Query: 337 SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516 KQ + G+ P + H+K Y Q S + DA +ARN N KV Sbjct: 683 VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 741 Query: 517 SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678 G+S +D FL S S S ++ M L+++ S+ + PA G G Sbjct: 742 PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 799 Query: 679 VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855 VSPY +D+N SKQ AI+SL E GR S + ++ G S A Sbjct: 800 VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 855 Query: 856 SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032 E GP +T ++++S VA S + +EK + ++ ++N C F T G Sbjct: 856 FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 915 Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089 S+E+D+Q PH+ +NKQ Sbjct: 916 LSREIDLQCQFPHDPPSNKQ 935 >ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prunus persica] gi|462423471|gb|EMJ27734.1| hypothetical protein PRUPE_ppa025154mg [Prunus persica] Length = 893 Score = 94.7 bits (234), Expect = 6e-17 Identities = 110/371 (29%), Positives = 164/371 (44%), Gaps = 7/371 (1%) Frame = +1 Query: 7 ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186 I + +D ++E+ + + S++S +L+ + E + +A + + + LH Sbjct: 540 IGNAIDAARVEKSTSNLGQDSVIS-FLTNLNAPPEDNTRPKASKYICNVGEHAMQNTLHY 598 Query: 187 ESHSFDFR-----SNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHN 351 E S + NGS+ + RQL+ S LG +DK KG TD S+++ + Sbjct: 599 EPQSAKYGIVNVPRNGSNSVERQLDMSQLGSYRLIDKDKGVSFVTDD-SHLSKDLGFRNR 657 Query: 352 KQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGN 531 K+M SS F GL SG P LTAH K S Y Q S + D ++R SN KV GN Sbjct: 658 KEMEISSSFNGL-SGTSDPRFLTAH-KNSCYSHQLSGVAPDGPDSRKYSNFPDKVLYFGN 715 Query: 
532 SSDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXX 711 + ++ + G+G P S T S P LTPA S I VS D+N Sbjct: 716 RGQVGHVNHRPLASSV-GSGQTFP---SRTVSKGIP-LTPALSRENLIEVSTQLPDDNSR 770 Query: 712 XXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTV 891 SKQ HA+ SL +G IF S+ + S D AS + LT Sbjct: 771 LLALREIMELSKQHHALPSLPMNRGKG-IFDCSS---YMQNSLVDTSASGKQERKLSLTS 826 Query: 892 KQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQNP-P 1065 K SE K QS ++ ++ GV+ C+F T +G +SKE+D+++ Sbjct: 827 KNAVSEATIKSHQSGASC-------RIGSDEGVNTCCHFSTLKQGNALHSKEVDLKHQIS 879 Query: 1066 HESLTNKQPLL 1098 L N+QP L Sbjct: 880 FVPLCNEQPSL 890 >ref|XP_002519906.1| conserved hypothetical protein [Ricinus communis] gi|223540952|gb|EEF42510.1| conserved hypothetical protein [Ricinus communis] Length = 903 Score = 85.5 bits (210), Expect = 4e-14 Identities = 96/340 (28%), Positives = 139/340 (40%), Gaps = 14/340 (4%) Frame = +1 Query: 16 TLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPST-----SRIL 180 T+D +LE+ MAK S+VS++ H+ P + ++ S Sbjct: 581 TVDSTELEKLN--MAKPSVVSLFK-----------HYALPEGTPHSKATNSFEYVMSERR 627 Query: 181 HGESHSFDFRSN-----GSDEMGRQLNFSGLGFLNNVDKGK--GAQHNTDTCSYMAAKTS 339 H ESH+ F SN G + + Q FL D GK G N+ SY+ + Sbjct: 628 HCESHAVKFDSNNFSWNGGNSLDEQCIVPESVFLKPADNGKEVGCLANS---SYIKKASG 684 Query: 340 LFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVS 519 K M + S +T ++ + H+K + + S++ D S+A N S K Sbjct: 685 SNMQKWMGNPSSYTRAMNDATYSNFSFMHDKNRN-LYHSSNVPPDVSDAANFSVYLQKGP 743 Query: 520 CHGNSS--DPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 C GN D A L S +S +++ +P S+ S C P LT A N I + PY Sbjct: 744 CFGNGGLLDHAVLTSMDSRQILSSQS--VPKVSPSSTSTCIPGLTLAMLNRESICMGPYL 801 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +D+N SKQ HA++S + EQG S I+ Q S + SEE Sbjct: 802 LDDNQKLLALGQLLDLSKQQHAMSSFGRKIEQGNCSNSSNIKAQH--SFVEPSVSEEQTH 859 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVS 993 LT KQ SEV K Q C V+K +G S Sbjct: 860 VHDLTRKQEVSEVVMKLDQPCPPSKTVDDVDKSTSGTGKS 899 
>ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma cacao] gi|508782152|gb|EOY29408.1| Uncharacterized protein isoform 9 [Theobroma cacao] Length = 1619 Score = 65.5 bits (158), Expect = 4e-08 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180 G+ S +D KL++ RG KS +V + L Q E R +NM S P T Sbjct: 219 GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 272 Query: 181 HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345 H ES++ + + +GRQLN LGF DKG C+ A +L Sbjct: 273 HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 330 Query: 346 HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525 ++Q+ + TG+V P H S CQ S+I D + R+ N S Sbjct: 331 IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 382 Query: 526 GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 G+S +D A+LR S++ + S MG S P T S SP Sbjct: 383 GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 437 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +D++ SKQ HA +S+ E GR S +Q + S E R Sbjct: 438 LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 494 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050 G + K + E A + S EK ++G+++ C+F T +G+ S+E+D Sbjct: 495 GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 547 Query: 1051 M 1053 + Sbjct: 548 I 548 >ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] gi|508782151|gb|EOY29407.1| Uncharacterized protein isoform 8, partial [Theobroma cacao] Length = 2068 Score = 65.5 bits (158), Expect = 4e-08 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180 G+ S +D KL++ RG KS +V + L Q E R +NM S P T Sbjct: 585 GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 638 Query: 181 
HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345 H ES++ + + +GRQLN LGF DKG C+ A +L Sbjct: 639 HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 696 Query: 346 HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525 ++Q+ + TG+V P H S CQ S+I D + R+ N S Sbjct: 697 IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 748 Query: 526 GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 G+S +D A+LR S++ + S MG S P T S SP Sbjct: 749 GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 803 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +D++ SKQ HA +S+ E GR S +Q + S E R Sbjct: 804 LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 860 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050 G + K + E A + S EK ++G+++ C+F T +G+ S+E+D Sbjct: 861 GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 913 Query: 1051 M 1053 + Sbjct: 914 I 914 >ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508782146|gb|EOY29402.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 2104 Score = 65.5 bits (158), Expect = 4e-08 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180 G+ S +D KL++ RG KS +V + L Q E R +NM S P T Sbjct: 585 GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 638 Query: 181 HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345 H ES++ + + +GRQLN LGF DKG C+ A +L Sbjct: 639 HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 696 Query: 346 HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525 ++Q+ + TG+V P H S CQ S+I D + R+ N S Sbjct: 697 IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 748 Query: 526 GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 G+S +D A+LR S++ + S MG S P T S SP Sbjct: 749 
GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 803 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +D++ SKQ HA +S+ E GR S +Q + S E R Sbjct: 804 LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 860 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050 G + K + E A + S EK ++G+++ C+F T +G+ S+E+D Sbjct: 861 GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 913 Query: 1051 M 1053 + Sbjct: 914 I 914 >ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590572148|ref|XP_007011782.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590572172|ref|XP_007011784.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590572176|ref|XP_007011785.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590572180|ref|XP_007011786.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590572184|ref|XP_007011787.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782144|gb|EOY29400.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782145|gb|EOY29401.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782148|gb|EOY29404.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782150|gb|EOY29406.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1738 Score = 65.5 bits (158), Expect = 4e-08 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%) Frame = +1 Query: 4 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180 G+ S +D KL++ RG KS +V + L Q E R +NM S P T Sbjct: 219 GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 272 Query: 181 HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345 H ES++ + + +GRQLN LGF DKG C+ A +L Sbjct: 273 HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 330 Query: 346 
HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525 ++Q+ + TG+V P H S CQ S+I D + R+ N S Sbjct: 331 IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 382 Query: 526 GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693 G+S +D A+LR S++ + S MG S P T S SP Sbjct: 383 GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 437 Query: 694 MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873 +D++ SKQ HA +S+ E GR S +Q + S E R Sbjct: 438 LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 494 Query: 874 GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050 G + K + E A + S EK ++G+++ C+F T +G+ S+E+D Sbjct: 495 GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 547 Query: 1051 M 1053 + Sbjct: 548 I 548 >gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis] Length = 2073 Score = 65.1 bits (157), Expect = 5e-08 Identities = 83/352 (23%), Positives = 142/352 (40%), Gaps = 11/352 (3%) Frame = +1 Query: 34 LEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESH-----S 198 LE+ RG + +S++V L+ F+ E ++ + N+++ + + + E S Sbjct: 578 LEKSRGNLVQSAVVP--LTNFNLLAENNVQIKPSDNILNCLEHTANHTQYYEPRFAKCDS 635 Query: 199 FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPF 378 + N + + RQLN + + +DKGKG + ++ SY+ S H + F Sbjct: 636 SNVLWNSGNGLERQLNINEMSSHGLIDKGKGVKLISEG-SYLKDPGSRIHKE-------F 687 Query: 379 TGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGNSSDPAFLRS 558 S + +P A + SS + Q S++ +A R N + GN + + S Sbjct: 688 EFSTSRSQVP----ASQGSSSDLYQWSTVPLEAPEVRKLCNYPENIPSFGNCLNVDHV-S 742 Query: 559 ANSSTVIAGAGSVMPM-----GLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXX 723 S T G+G ++P G S + TP+ IGVSP+ +D+N Sbjct: 743 QRSFTSSVGSGIILPSQVVTKGHPLATSTHLLDQTPSLHREESIGVSPHLLDDNLRMLAL 802 Query: 724 XXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTVKQNA 903 SKQ HA S GR G S + +E A E+ GP + Sbjct: 803 RQILELSKQQHAFPSFGMNKRDGRCDGVSYL---HHSFAESPAAGEQF-NGPGPISSREV 858 Query: 904 
SEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDMQ 1056 SE K + K + G++ C+ T RG+ ++KE+ +Q Sbjct: 859 SEATAKARLGLAG-----ATSKFSGDEGMTGCCDLSTLIRGIPIHTKEIAVQ 905