BLASTX nr result

ID: Akebia26_contig00028962 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00028962
         (1100 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21105.3| unnamed protein product [Vitis vinifera]              189   1e-45
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   109   2e-21
ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Popu...   108   3e-21
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   103   1e-19
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   103   1e-19
ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, part...   103   1e-19
ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prun...    95   6e-17
ref|XP_002519906.1| conserved hypothetical protein [Ricinus comm...    86   4e-14
ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma...    65   4e-08
ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [...    65   4e-08
ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma...    65   4e-08
ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma...    65   4e-08
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...    65   5e-08

>emb|CBI21105.3| unnamed protein product [Vitis vinifera]
          Length = 1012

 Score =  189 bits (481), Expect = 1e-45
 Identities = 144/373 (38%), Positives = 196/373 (52%), Gaps = 14/373 (3%)
 Frame = +1

Query: 7    ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186
            I + LD  KLEQFRG  AKSS++SM LS  +T TE ++  +A  N+V+++     R LH 
Sbjct: 611  INNALDAAKLEQFRGDAAKSSVISMLLSHLTTPTEGNMQSKAINNVVNDNGHFVPRSLHF 670

Query: 187  ESHSFDFRS-----NGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHN 351
            ESH           N ++ + R+ N + L F   +DKGK     TD  SY A +++    
Sbjct: 671  ESHIAKRDPVYSPWNSANGLERESNINDLSFHRYMDKGKRVGFVTDG-SYAATESTFGFY 729

Query: 352  KQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGN 531
            KQM  S  FTG+   +H PS+   H+K S Y  Q   +  DASNA N  N   K SC G+
Sbjct: 730  KQMGSSGTFTGVAGSDH-PSSSAVHDK-SCYSRQLLGMPPDASNASNSFNFSGKFSCLGS 787

Query: 532  SS-DPAFLRSANSSTVIAGAGSVMP-----MGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            S  D  F++S +      G+G  +P      G SS +S+  PNLTP+      IGVSPY 
Sbjct: 788  SGLDNVFVKSISPP---MGSGINVPSQAVSTGFSSASSLSVPNLTPSLPTKESIGVSPYL 844

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +DEN            S + HAI SL    ++GR    S  ++Q  GS  D L S+EL+ 
Sbjct: 845  LDENFKLLALRHILELSNREHAITSLGMNQKEGRFSSSSDPKVQ--GSVVDTLTSDELKH 902

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050
            G  LT +QNASEV  K LQS  NH     +EKL  V+  +NW +  T  +G+   SK +D
Sbjct: 903  GLKLTSEQNASEVPLKLLQSGGNHRMGGDMEKLVPVADQNNWFDISTFTQGIPLCSKGID 962

Query: 1051 MQNPPHE--SLTN 1083
             Q+ P E  SL+N
Sbjct: 963  SQDLPCEQPSLSN 975


>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  109 bits (273), Expect = 2e-21
 Identities = 105/380 (27%), Positives = 175/380 (46%), Gaps = 14/380 (3%)
 Frame = +1

Query: 1    PGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFST--STEKDLHFR-----APTNMVDNSC 159
            PGI + L+        G M+K+S++SM LS      + E+ L  +     AP ++V    
Sbjct: 568  PGIVNLLE--------GHMSKNSIMSMLLSPMENFGTNEEGLMLQPNSNMAPEHLVPKLI 619

Query: 160  PSTSRILHGESHSFDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTS 339
             S S++L   ++ F   +N S+ M R+L        N++D  K ++   +  S  ++  S
Sbjct: 620  HSNSQLLKSGTNCFT--TNKSEMMERKL-------ANHIDAVKMSRDMPNGSSTFSSIGS 670

Query: 340  LFHNKQMADSSPFTGLVS-GNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516
              H KQ  DS      V  GNH  S +   + P++ +  P+ I+    + RN S+ F K 
Sbjct: 671  TVHVKQTGDSLLHGISVGHGNHSNSVMLGGQSPAN-LPHPAIILSAEPDVRNTSDHFVKP 729

Query: 517  SCHGNSS--DPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPY 690
            SC+ N++    +F   A+ S    G+ SVMP+  S  N I   NLT    N    G+   
Sbjct: 730  SCNANANANPDSFFHRADDSAASTGS-SVMPVNFSGWNPIYLSNLTTILPNGDLTGLRHQ 788

Query: 691  FMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELR 870
              DEN            SKQ +  A+     +QG+ +  ST+++    S ++R   E  +
Sbjct: 789  VSDENLRAPTLRSLPQVSKQDNKAATPCMNLDQGQFYCHSTVQLPNDYSQQERFGPEP-K 847

Query: 871  EGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNW---CNFLTSPRGVFNSK 1041
            +GP L   Q+ +E   K  + C     D   EKL+ ++G +N+   CN  T+P      +
Sbjct: 848  QGPVLNGNQDTTEEQDKTTRFCCKGLLDGGREKLSCLTGPNNYCKCCNLTTAPSISLQPR 907

Query: 1042 ELDMQNPP-HESLTNKQPLL 1098
             +D+ +   H++   +QPLL
Sbjct: 908  GIDVHSSHCHQNCCVEQPLL 927


>ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Populus trichocarpa]
            gi|550317856|gb|ERP49556.1| hypothetical protein
            POPTR_0018s02180g [Populus trichocarpa]
          Length = 868

 Score =  108 bits (271), Expect = 3e-21
 Identities = 111/370 (30%), Positives = 157/370 (42%), Gaps = 6/370 (1%)
 Frame = +1

Query: 7    ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186
            I +T+++ K+E F+G +AKS+ V +    F++  E + + R+ +N+V+++       LH 
Sbjct: 535  IKNTINVGKIENFKGQVAKST-VFLPFKHFNSPLEGNSYSRSTSNVVNSTEHIVHETLHS 593

Query: 187  ESHSFDFRSN----GSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNK 354
            ESH+  +  N    G + + RQ      GF    DKGKG    T          S  HN 
Sbjct: 594  ESHAVKYPGNVPLNGGNGLERQRTDPEFGFSRPRDKGKGVGCLTGNSFDETNLVSKMHNW 653

Query: 355  QMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGNS 534
            +   SS F+ +++GN   +    HEK +      SSI  +AS+A                
Sbjct: 654  KKNPSS-FSEVINGNICAAFPMMHEK-NHIPNHLSSIPLEASDA---------------- 695

Query: 535  SDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXX 714
                              GS  P       S     LTPA     GI  SPY +D+N   
Sbjct: 696  ------------------GSFFPSQAVPLGS----GLTPAMLKQDGISASPYLLDDNLRL 733

Query: 715  XXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTVK 894
                     SKQ H ++ L   PEQ R      +++Q   S  +  AS   R       K
Sbjct: 734  LAFRQILELSKQQHEMSPLGKNPEQDRC-----VKLQH--SLFEPAASGLNRHETTFISK 786

Query: 895  QNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ-NPPH 1068
            QN SEV+ K  QS         V K A V+G+SNWCNF T  +G  F S+E D Q    H
Sbjct: 787  QNVSEVSMKSTQSTPTVKMGDDVAKFAHVTGLSNWCNFSTLTQGRPFYSQENDKQCQLSH 846

Query: 1069 ESLTNKQPLL 1098
              L N+QP L
Sbjct: 847  GHLQNEQPSL 856


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  103 bits (258), Expect = 1e-19
 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183
            GI +  D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV     S+  I+ 
Sbjct: 569  GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 622

Query: 184  GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336
               H   + +  +         D   RQLN S LGF    DKGKG     D  SY    +
Sbjct: 623  KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 681

Query: 337  SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516
                 KQ          + G+  P +   H+K   Y  Q S +  DA +ARN  N   KV
Sbjct: 682  VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 740

Query: 517  SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678
               G+S  +D  FL S  S    S ++      M   L+++ S+    + PA     G G
Sbjct: 741  PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 798

Query: 679  VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855
            VSPY +D+N            SKQ  AI+SL    E GR    S + ++   G S    A
Sbjct: 799  VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 854

Query: 856  SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032
              E   GP +T ++++S VA     S +       +EK + ++ ++N C F T   G   
Sbjct: 855  FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 914

Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089
             S+E+D+Q   PH+  +NKQ
Sbjct: 915  LSREIDLQCQFPHDPPSNKQ 934


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  103 bits (258), Expect = 1e-19
 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183
            GI +  D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV     S+  I+ 
Sbjct: 570  GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 623

Query: 184  GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336
               H   + +  +         D   RQLN S LGF    DKGKG     D  SY    +
Sbjct: 624  KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 682

Query: 337  SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516
                 KQ          + G+  P +   H+K   Y  Q S +  DA +ARN  N   KV
Sbjct: 683  VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 741

Query: 517  SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678
               G+S  +D  FL S  S    S ++      M   L+++ S+    + PA     G G
Sbjct: 742  PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 799

Query: 679  VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855
            VSPY +D+N            SKQ  AI+SL    E GR    S + ++   G S    A
Sbjct: 800  VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 855

Query: 856  SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032
              E   GP +T ++++S VA     S +       +EK + ++ ++N C F T   G   
Sbjct: 856  FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 915

Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089
             S+E+D+Q   PH+  +NKQ
Sbjct: 916  LSREIDLQCQFPHDPPSNKQ 935


>ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, partial [Citrus clementina]
            gi|557553576|gb|ESR63590.1| hypothetical protein
            CICLE_v10010345mg, partial [Citrus clementina]
          Length = 938

 Score =  103 bits (258), Expect = 1e-19
 Identities = 113/380 (29%), Positives = 169/380 (44%), Gaps = 18/380 (4%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 183
            GI +  D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV     S+  I+ 
Sbjct: 570  GISNVTDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMV-----SSDHIIP 623

Query: 184  GESHSFDFRSNGS---------DEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKT 336
               H   + +  +         D   RQLN S LGF    DKGKG     D  SY    +
Sbjct: 624  KSVHCEPYSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDS 682

Query: 337  SLFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKV 516
                 KQ          + G+  P +   H+K   Y  Q S +  DA +ARN  N   KV
Sbjct: 683  VSNIEKQQESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKV 741

Query: 517  SCHGNS--SDPAFLRSANS----STVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIG 678
               G+S  +D  FL S  S    S ++      M   L+++ S+    + PA     G G
Sbjct: 742  PSLGSSRHTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTG 799

Query: 679  VSPYFMDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRR-GSSEDRLA 855
            VSPY +D+N            SKQ  AI+SL    E GR    S + ++   G S    A
Sbjct: 800  VSPYLLDDNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS----A 855

Query: 856  SEELREGPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VF 1032
              E   GP +T ++++S VA     S +       +EK + ++ ++N C F T   G   
Sbjct: 856  FGEQTPGPNITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPL 915

Query: 1033 NSKELDMQ-NPPHESLTNKQ 1089
             S+E+D+Q   PH+  +NKQ
Sbjct: 916  LSREIDLQCQFPHDPPSNKQ 935


>ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prunus persica]
            gi|462423471|gb|EMJ27734.1| hypothetical protein
            PRUPE_ppa025154mg [Prunus persica]
          Length = 893

 Score = 94.7 bits (234), Expect = 6e-17
 Identities = 110/371 (29%), Positives = 164/371 (44%), Gaps = 7/371 (1%)
 Frame = +1

Query: 7    ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 186
            I + +D  ++E+    + + S++S +L+  +   E +   +A   + +    +    LH 
Sbjct: 540  IGNAIDAARVEKSTSNLGQDSVIS-FLTNLNAPPEDNTRPKASKYICNVGEHAMQNTLHY 598

Query: 187  ESHSFDFR-----SNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHN 351
            E  S  +       NGS+ + RQL+ S LG    +DK KG    TD  S+++      + 
Sbjct: 599  EPQSAKYGIVNVPRNGSNSVERQLDMSQLGSYRLIDKDKGVSFVTDD-SHLSKDLGFRNR 657

Query: 352  KQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGN 531
            K+M  SS F GL SG   P  LTAH K S Y  Q S +  D  ++R  SN   KV   GN
Sbjct: 658  KEMEISSSFNGL-SGTSDPRFLTAH-KNSCYSHQLSGVAPDGPDSRKYSNFPDKVLYFGN 715

Query: 532  SSDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXX 711
                  +     ++ + G+G   P   S T S   P LTPA S    I VS    D+N  
Sbjct: 716  RGQVGHVNHRPLASSV-GSGQTFP---SRTVSKGIP-LTPALSRENLIEVSTQLPDDNSR 770

Query: 712  XXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTV 891
                      SKQ HA+ SL     +G IF  S+     + S  D  AS +      LT 
Sbjct: 771  LLALREIMELSKQHHALPSLPMNRGKG-IFDCSS---YMQNSLVDTSASGKQERKLSLTS 826

Query: 892  KQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQNP-P 1065
            K   SE   K  QS ++        ++    GV+  C+F T  +G   +SKE+D+++   
Sbjct: 827  KNAVSEATIKSHQSGASC-------RIGSDEGVNTCCHFSTLKQGNALHSKEVDLKHQIS 879

Query: 1066 HESLTNKQPLL 1098
               L N+QP L
Sbjct: 880  FVPLCNEQPSL 890


>ref|XP_002519906.1| conserved hypothetical protein [Ricinus communis]
            gi|223540952|gb|EEF42510.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 903

 Score = 85.5 bits (210), Expect = 4e-14
 Identities = 96/340 (28%), Positives = 139/340 (40%), Gaps = 14/340 (4%)
 Frame = +1

Query: 16   TLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPST-----SRIL 180
            T+D  +LE+    MAK S+VS++            H+  P     +   ++     S   
Sbjct: 581  TVDSTELEKLN--MAKPSVVSLFK-----------HYALPEGTPHSKATNSFEYVMSERR 627

Query: 181  HGESHSFDFRSN-----GSDEMGRQLNFSGLGFLNNVDKGK--GAQHNTDTCSYMAAKTS 339
            H ESH+  F SN     G + +  Q       FL   D GK  G   N+   SY+   + 
Sbjct: 628  HCESHAVKFDSNNFSWNGGNSLDEQCIVPESVFLKPADNGKEVGCLANS---SYIKKASG 684

Query: 340  LFHNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVS 519
                K M + S +T  ++     +    H+K  + +   S++  D S+A N S    K  
Sbjct: 685  SNMQKWMGNPSSYTRAMNDATYSNFSFMHDKNRN-LYHSSNVPPDVSDAANFSVYLQKGP 743

Query: 520  CHGNSS--DPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            C GN    D A L S +S  +++     +P    S+ S C P LT A  N   I + PY 
Sbjct: 744  CFGNGGLLDHAVLTSMDSRQILSSQS--VPKVSPSSTSTCIPGLTLAMLNRESICMGPYL 801

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +D+N            SKQ HA++S   + EQG     S I+ Q   S  +   SEE   
Sbjct: 802  LDDNQKLLALGQLLDLSKQQHAMSSFGRKIEQGNCSNSSNIKAQH--SFVEPSVSEEQTH 859

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVS 993
               LT KQ  SEV  K  Q C        V+K    +G S
Sbjct: 860  VHDLTRKQEVSEVVMKLDQPCPPSKTVDDVDKSTSGTGKS 899


>ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma cacao]
            gi|508782152|gb|EOY29408.1| Uncharacterized protein
            isoform 9 [Theobroma cacao]
          Length = 1619

 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180
            G+ S +D  KL++ RG   KS +V + L Q     E     R  +NM    S P T    
Sbjct: 219  GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 272

Query: 181  HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345
            H ES++      +      + +GRQLN   LGF    DKG         C+  A   +L 
Sbjct: 273  HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 330

Query: 346  HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525
             ++Q+ +    TG+V     P     H   S   CQ S+I  D  + R+  N     S  
Sbjct: 331  IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 382

Query: 526  GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            G+S  +D A+LR  S++  +      S   MG     S   P  T   S       SP  
Sbjct: 383  GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 437

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +D++            SKQ HA +S+    E GR    S   +Q       +  S E R 
Sbjct: 438  LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 494

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050
            G  +  K +  E A   + S          EK   ++G+++ C+F T  +G+   S+E+D
Sbjct: 495  GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 547

Query: 1051 M 1053
            +
Sbjct: 548  I 548


>ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508782151|gb|EOY29407.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 2068

 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180
            G+ S +D  KL++ RG   KS +V + L Q     E     R  +NM    S P T    
Sbjct: 585  GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 638

Query: 181  HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345
            H ES++      +      + +GRQLN   LGF    DKG         C+  A   +L 
Sbjct: 639  HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 696

Query: 346  HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525
             ++Q+ +    TG+V     P     H   S   CQ S+I  D  + R+  N     S  
Sbjct: 697  IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 748

Query: 526  GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            G+S  +D A+LR  S++  +      S   MG     S   P  T   S       SP  
Sbjct: 749  GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 803

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +D++            SKQ HA +S+    E GR    S   +Q       +  S E R 
Sbjct: 804  LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 860

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050
            G  +  K +  E A   + S          EK   ++G+++ C+F T  +G+   S+E+D
Sbjct: 861  GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 913

Query: 1051 M 1053
            +
Sbjct: 914  I 914


>ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508782146|gb|EOY29402.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 2104

 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180
            G+ S +D  KL++ RG   KS +V + L Q     E     R  +NM    S P T    
Sbjct: 585  GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 638

Query: 181  HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345
            H ES++      +      + +GRQLN   LGF    DKG         C+  A   +L 
Sbjct: 639  HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 696

Query: 346  HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525
             ++Q+ +    TG+V     P     H   S   CQ S+I  D  + R+  N     S  
Sbjct: 697  IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 748

Query: 526  GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            G+S  +D A+LR  S++  +      S   MG     S   P  T   S       SP  
Sbjct: 749  GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 803

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +D++            SKQ HA +S+    E GR    S   +Q       +  S E R 
Sbjct: 804  LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 860

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050
            G  +  K +  E A   + S          EK   ++G+++ C+F T  +G+   S+E+D
Sbjct: 861  GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 913

Query: 1051 M 1053
            +
Sbjct: 914  I 914


>ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590572148|ref|XP_007011782.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572172|ref|XP_007011784.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572176|ref|XP_007011785.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572180|ref|XP_007011786.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572184|ref|XP_007011787.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782144|gb|EOY29400.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score = 65.5 bits (158), Expect = 4e-08
 Identities = 97/361 (26%), Positives = 146/361 (40%), Gaps = 11/361 (3%)
 Frame = +1

Query: 4    GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDN-SCPSTSRIL 180
            G+ S +D  KL++ RG   KS +V + L Q     E     R  +NM    S P T    
Sbjct: 219  GVSSVMDATKLDKCRGDATKSLVVPL-LPQLPL--EGSARSRGASNMAGEFSMPKT---F 272

Query: 181  HGESHS-----FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLF 345
            H ES++      +      + +GRQLN   LGF    DKG         C+  A   +L 
Sbjct: 273  HCESNTTKCDPLNTPLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALR 330

Query: 346  HNKQMADSSPFTGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCH 525
             ++Q+ +    TG+V     P     H   S   CQ S+I  D  + R+  N     S  
Sbjct: 331  IHQQVENPRNVTGVV-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFI 382

Query: 526  GNS--SDPAFLR--SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 693
            G+S  +D A+LR  S++  +      S   MG     S   P  T   S       SP  
Sbjct: 383  GSSGYTDQAYLRMMSSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCL 437

Query: 694  MDENXXXXXXXXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELRE 873
            +D++            SKQ HA +S+    E GR    S   +Q       +  S E R 
Sbjct: 438  LDDSMRLLALRQILELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRH 494

Query: 874  GPYLTVKQNASEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELD 1050
            G  +  K +  E A   + S          EK   ++G+++ C+F T  +G+   S+E+D
Sbjct: 495  GAIVPSKLDVFEGAAASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVD 547

Query: 1051 M 1053
            +
Sbjct: 548  I 548


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score = 65.1 bits (157), Expect = 5e-08
 Identities = 83/352 (23%), Positives = 142/352 (40%), Gaps = 11/352 (3%)
 Frame = +1

Query: 34   LEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESH-----S 198
            LE+ RG + +S++V   L+ F+   E ++  +   N+++    + +   + E       S
Sbjct: 578  LEKSRGNLVQSAVVP--LTNFNLLAENNVQIKPSDNILNCLEHTANHTQYYEPRFAKCDS 635

Query: 199  FDFRSNGSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPF 378
             +   N  + + RQLN + +     +DKGKG +  ++  SY+    S  H +       F
Sbjct: 636  SNVLWNSGNGLERQLNINEMSSHGLIDKGKGVKLISEG-SYLKDPGSRIHKE-------F 687

Query: 379  TGLVSGNHIPSTLTAHEKPSSYVCQPSSIIQDASNARNQSNQFTKVSCHGNSSDPAFLRS 558
                S + +P    A +  SS + Q S++  +A   R   N    +   GN  +   + S
Sbjct: 688  EFSTSRSQVP----ASQGSSSDLYQWSTVPLEAPEVRKLCNYPENIPSFGNCLNVDHV-S 742

Query: 559  ANSSTVIAGAGSVMPM-----GLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXX 723
              S T   G+G ++P      G     S    + TP+      IGVSP+ +D+N      
Sbjct: 743  QRSFTSSVGSGIILPSQVVTKGHPLATSTHLLDQTPSLHREESIGVSPHLLDDNLRMLAL 802

Query: 724  XXXXXXSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSEDRLASEELREGPYLTVKQNA 903
                  SKQ HA  S       GR  G S +       +E   A E+   GP     +  
Sbjct: 803  RQILELSKQQHAFPSFGMNKRDGRCDGVSYL---HHSFAESPAAGEQF-NGPGPISSREV 858

Query: 904  SEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDMQ 1056
            SE   K     +         K +   G++  C+  T  RG+  ++KE+ +Q
Sbjct: 859  SEATAKARLGLAG-----ATSKFSGDEGMTGCCDLSTLIRGIPIHTKEIAVQ 905

