BLASTX nr result

ID: Akebia25_contig00028833 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00028833
         (1775 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21105.3| unnamed protein product [Vitis vinifera]              335   4e-89
ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [A...   236   2e-59
ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Popu...   194   1e-46
ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, part...   167   1e-38
ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613...   165   7e-38
ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613...   165   7e-38
ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prun...   163   3e-37
ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma...   160   1e-36
ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [...   160   1e-36
ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma...   160   1e-36
ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma...   160   1e-36
ref|XP_002519906.1| conserved hypothetical protein [Ricinus comm...   154   2e-34
gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus no...   135   6e-29
ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313...   103   2e-19
ref|XP_006345644.1| PREDICTED: uncharacterized protein LOC102579...    88   1e-14
ref|XP_006345643.1| PREDICTED: uncharacterized protein LOC102579...    88   1e-14
ref|XP_006345641.1| PREDICTED: uncharacterized protein LOC102579...    88   1e-14
ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816...    88   1e-14
ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816...    88   1e-14
ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816...    87   3e-14
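
The summary table above lists one hit per line: an identifier, a description (truncated with "..." when long), a bit score, and an E-value. A minimal sketch of how such lines could be read programmatically follows. The function name parse_summary_line and the returned field names are hypothetical, chosen for illustration only; the pattern is based solely on the line layout visible in this report and may need adjusting for other BLAST versions or output options.

```python
import re

# One line of the "Sequences producing significant alignments" table:
# identifier, description (possibly truncated), bit score, E-value.
# The layout is inferred from this report only (assumption).
SUMMARY_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>[0-9.]*e[-+]?\d+|\d+(?:\.\d+)?)\s*$"
)


def parse_summary_line(line):
    """Return a dict for a hit line, or None if the line is not a hit line."""
    m = SUMMARY_LINE.match(line)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Precaution: an E-value written without a mantissa (e.g. "e-100")
    # is read as 1e-100. No such value occurs in this report.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": float(evalue),
    }


if __name__ == "__main__":
    example = (
        "emb|CBI21105.3| unnamed protein product [Vitis vinifera]"
        "              335   4e-89"
    )
    print(parse_summary_line(example))
```

Lines that do not end in a score and an E-value, such as the table header, are returned as None, so the function can be applied to every line of the table without pre-filtering.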

>emb|CBI21105.3| unnamed protein product [Vitis vinifera]
          Length = 1012

 Score =  335 bits (859), Expect = 4e-89
 Identities = 243/608 (39%), Positives = 326/608 (53%), Gaps = 20/608 (3%)
 Frame = -1

Query: 1772 QPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDI 1593
            QPW+  V P N    Q+ +      V+   Q  N  +  + LLK  +GT+ + L +   +
Sbjct: 380  QPWDNVVFPKNPRTGQNSVND----VLHNKQWGNGSDRSNFLLKGSVGTSQSNLHA---L 432

Query: 1592 PLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLG 1413
              +QIM ES       MS  + +G  D+  QSIS Y+D +++SG SFI +  L N + LG
Sbjct: 433  ESNQIM-ESTRSRCSTMSKVVGRGGTDNDAQSISAYVDSISRSGTSFIYSPPLPNERTLG 491

Query: 1412 TDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQ 1233
             DSD+SR NNSRE  I++RDA+SSNIELRLGQP QQS T   SV+  MG R  DTLGD Q
Sbjct: 492  KDSDISRHNNSREGVILERDAVSSNIELRLGQPCQQSRTSRNSVLPVMGPRILDTLGDPQ 551

Query: 1232 KSLFQDPLIHN--------TVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPP 1077
            KS F + LIHN          N  V EE RQ  L+C+   T+ +S R  +   N  NH  
Sbjct: 552  KSFFPEQLIHNILDFFFYAAANSNVMEECRQ-YLQCATG-TSNSSARREQIPFNCVNHTF 609

Query: 1076 GICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 897
             I + LD  KLEQFRG  AKSS++SM LS  +T TE ++  +A  N+V+++     R LH
Sbjct: 610  EINNALDAAKLEQFRGDAAKSSVISMLLSHLTTPTEGNMQSKAINNVVNDNGHFVPRSLH 669

Query: 896  GESHSFDFRS-----NRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFH 732
             ESH           N ++ + R+ N + L F   +DKGK     TD  SY A +++   
Sbjct: 670  FESHIAKRDPVYSPWNSANGLERESNINDLSFHRYMDKGKRVGFVTDG-SYAATESTFGF 728

Query: 731  NKQMADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDG 552
             KQM  S  FTG+ G +H PS+   H+K S Y  Q   +  DASNA N  N   K SC G
Sbjct: 729  YKQMGSSGTFTGVAGSDH-PSSSAVHDK-SCYSRQLLGMPPDASNASNSFNFSGKFSCLG 786

Query: 551  NSS-DPAFLRSANSSTVIAGAGSVMP-----MGLSSTNSICRPNLTPASSNNVGIGVSPY 390
            +S  D  F++S +      G+G  +P      G SS +S+  PNLTP+      IGVSPY
Sbjct: 787  SSGLDNVFVKSISPP---MGSGINVPSQAVSTGFSSASSLSVPNLTPSLPTKESIGVSPY 843

Query: 389  FMDENXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELR 210
             +DEN           LS + HAI SL    ++GR    S  ++Q  GS  D L S+EL+
Sbjct: 844  LLDENFKLLALRHILELSNREHAITSLGMNQKEGRFSSSSDPKVQ--GSVVDTLTSDELK 901

Query: 209  EGPYLTVKQNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKEL 33
             G  LT +QN SEV  K LQS  NH     +EKL  V+  +NW +  T  +G+   SK +
Sbjct: 902  HGLKLTSEQNASEVPLKLLQSGGNHRMGGDMEKLVPVADQNNWFDISTFTQGIPLCSKGI 961

Query: 32   DMQNPPHE 9
            D Q+ P E
Sbjct: 962  DSQDLPCE 969
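
Each alignment in this report opens with a statistics line (Identities, Positives, Gaps) and, because the query is translated nucleotide sequence, the Query coordinates are in nucleotides while the Sbjct coordinates are in amino acids. The sketch below is an illustration under stated assumptions, not a reference parser: the names parse_hsp_stats, truncated_percent, and query_nt_span are hypothetical, and the Gaps field is treated as optional only as a precaution, since every alignment in this report includes it. The percentages printed in this report are consistent with truncation rather than rounding (243/608 is 39.97%, printed as 39%), which is what truncated_percent reproduces.

```python
import re

# Statistics line as printed in this report, for example:
#  Identities = 243/608 (39%), Positives = 326/608 (53%), Gaps = 20/608 (3%)
HSP_STATS = re.compile(
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))?"
)


def parse_hsp_stats(line):
    """Return counts from a statistics line, or None if it does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    gaps = m.group("gaps")
    return {
        "identities": int(m.group("ident")),
        "positives": int(m.group("pos")),
        "gaps": int(gaps) if gaps is not None else 0,
        "length": int(m.group("length")),
    }


def truncated_percent(count, total):
    """Integer percentage, truncated (not rounded), as observed in this report."""
    return count * 100 // total


def query_nt_span(start, end):
    """Nucleotides covered by one Query line; works for either strand."""
    return abs(start - end) + 1


if __name__ == "__main__":
    stats = parse_hsp_stats(
        " Identities = 243/608 (39%), Positives = 326/608 (53%), Gaps = 20/608 (3%)"
    )
    print(stats, truncated_percent(stats["identities"], stats["length"]))
    # First Query line of the first alignment runs from 1772 down to 1593.
    print(query_nt_span(1772, 1593) // 3, "codons")
```

With Frame = -1 the query coordinates decrease along the alignment, which is why query_nt_span takes the absolute difference; a full 60-residue alignment line corresponds to 180 nucleotides of the query.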


>ref|XP_006852791.1| hypothetical protein AMTR_s00033p00150780 [Amborella trichopoda]
            gi|548856405|gb|ERN14258.1| hypothetical protein
            AMTR_s00033p00150780 [Amborella trichopoda]
          Length = 2123

 Score =  236 bits (602), Expect = 2e-59
 Identities = 189/605 (31%), Positives = 293/605 (48%), Gaps = 21/605 (3%)
 Frame = -1

Query: 1772 QPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDI 1593
            QPWN   S N      ++LE  + +  +  + ++          S+     N  +  ++ 
Sbjct: 341  QPWNNANSFNYPRGGLAILESSASRTTEIVRPKDGDNSNLTSPSSMPAFVSNHTTHALN- 399

Query: 1592 PLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLG 1413
                      +PG      SL+KG +   YQSI DYI+F++K GN F++NQ   NLK   
Sbjct: 400  --------DTLPGPKVTRASLDKGSEHCEYQSIVDYIEFISKGGNPFVTNQRSTNLKSFN 451

Query: 1412 TDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQ 1233
              S   RCN +RE F++D+DA++SNIELRLGQPSQQS     S+ SS+ S++F+ +GD Q
Sbjct: 452  GGSTARRCNRTREVFMLDKDAMASNIELRLGQPSQQSQARNCSLPSSIRSQSFNAIGD-Q 510

Query: 1232 KSLFQDPLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTN--------HPP 1077
            KSLF + LI      R+ EESRQN LR  PS  +   +RE E + N  N          P
Sbjct: 511  KSLFCEQLIQRASGSRITEESRQNFLR--PSDLSAMKEREKESRLNSVNPVNRSTHVGEP 568

Query: 1076 GICSTLDLDKLEQFRGAMAKSSLVSMYLS---QFSTSTE----KDLHFRAPTNMVDNSCP 918
            GI + L+        G M+K+S++SM LS    F T+ E    +     AP ++V     
Sbjct: 569  GIVNLLE--------GHMSKNSIMSMLLSPMENFGTNEEGLMLQPNSNMAPEHLVPKLIH 620

Query: 917  STSRILHGESHSFDFRSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSL 738
            S S++L  +S +  F +N+S+ M R+L        N++D  K ++   +  S  ++  S 
Sbjct: 621  SNSQLL--KSGTNCFTTNKSEMMERKL-------ANHIDAVKMSRDMPNGSSTFSSIGST 671

Query: 737  FHNKQMADSSPFTGLVG-GNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVS 561
             H KQ  DS      VG GNH  S +   + P++ +   + I+    + RN S+ F K S
Sbjct: 672  VHVKQTGDSLLHGISVGHGNHSNSVMLGGQSPAN-LPHPAIILSAEPDVRNTSDHFVKPS 730

Query: 560  C--DGNSSDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYF 387
            C  + N++  +F   A+ S    G+ SVMP+  S  N I   NLT    N    G+    
Sbjct: 731  CNANANANPDSFFHRADDSAASTGS-SVMPVNFSGWNPIYLSNLTTILPNGDLTGLRHQV 789

Query: 386  MDENXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELRE 207
             DEN           +SKQ +  A+     +QG+ +  ST+++    S  +R   E  ++
Sbjct: 790  SDENLRAPTLRSLPQVSKQDNKAATPCMNLDQGQFYCHSTVQLPNDYSQQERFGPEP-KQ 848

Query: 206  GPYLTVKQNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNW---CNFLTSPRGVFNSKE 36
            GP L   Q+ +E   K  + C     D   EKL+ ++G +N+   CN  T+P      + 
Sbjct: 849  GPVLNGNQDTTEEQDKTTRFCCKGLLDGGREKLSCLTGPNNYCKCCNLTTAPSISLQPRG 908

Query: 35   LDMQN 21
            +D+ +
Sbjct: 909  IDVHS 913


>ref|XP_006371759.1| hypothetical protein POPTR_0018s02180g [Populus trichocarpa]
            gi|550317856|gb|ERP49556.1| hypothetical protein
            POPTR_0018s02180g [Populus trichocarpa]
          Length = 868

 Score =  194 bits (492), Expect = 1e-46
 Identities = 181/595 (30%), Positives = 270/595 (45%), Gaps = 11/595 (1%)
 Frame = -1

Query: 1775 GQPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEG------CDVLLKSLLGTTHNI 1614
            GQP +  V P N   + +L + P   V+   Q+RN Q G         LL +L G  +N 
Sbjct: 312  GQPIDSVVFPKNPLTDYNLNQNPVFDVLDK-QKRNGQGGNNFLGLAGTLLSNLHGVGNNT 370

Query: 1613 LSSQIDIPLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGL 1434
                 D  +S+            M T + KG  ++G QSIS YID + KSG+   +N  L
Sbjct: 371  PHGVTDSTISRCT---------IMPTFVGKG-PENGSQSISAYIDNIVKSGSFSTTNSAL 420

Query: 1433 GNLKFLGTDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTF 1254
             N + L   SDVSR  + +   I+D+DA SS+IELRLGQP++Q+ + G  V+S++G  + 
Sbjct: 421  QNARTLFRCSDVSRAKDEKHCVIIDKDAASSSIELRLGQPNEQNWSSGNPVLSAVGPPSC 480

Query: 1253 DTLGDCQKSLFQDPLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPG 1074
            ++L +  K   ++ +IH   +     ESRQ +   +     LNS RE   Q  L      
Sbjct: 481  NSLVNSHKPSTREQMIHYVTSCGGDGESRQGLPHVA---GLLNSARE---QDQLNYGCSA 534

Query: 1073 ICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHG 894
            I +T+++ K+E F+G +AKS+ V +    F++  E + + R+ +N+V+++       LH 
Sbjct: 535  IKNTINVGKIENFKGQVAKST-VFLPFKHFNSPLEGNSYSRSTSNVVNSTEHIVHETLHS 593

Query: 893  ESHSFDFRS----NRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNK 726
            ESH+  +      N  + + RQ      GF    DKGKG    T          S  HN 
Sbjct: 594  ESHAVKYPGNVPLNGGNGLERQRTDPEFGFSRPRDKGKGVGCLTGNSFDETNLVSKMHNW 653

Query: 725  QMADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS 546
            +   SS F+ ++ GN               +C +  ++ + ++  N  +     + D  S
Sbjct: 654  KKNPSS-FSEVINGN---------------ICAAFPMMHEKNHIPNHLSSIPLEASDAGS 697

Query: 545  SDPAFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXX 366
              P       S  V  G+G                 LTPA     GI  SPY +D+N   
Sbjct: 698  FFP-------SQAVPLGSG-----------------LTPAMLKQDGISASPYLLDDNLRL 733

Query: 365  XXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVK 186
                    LSKQ H ++ L   PEQ R      +++Q   S  +  AS   R       K
Sbjct: 734  LAFRQILELSKQQHEMSPLGKNPEQDR-----CVKLQH--SLFEPAASGLNRHETTFISK 786

Query: 185  QNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ 24
            QNVSEV+ K  QS         V K A V+G+SNWCNF T  +G  F S+E D Q
Sbjct: 787  QNVSEVSMKSTQSTPTVKMGDDVAKFAHVTGLSNWCNFSTLTQGRPFYSQENDKQ 841


>ref|XP_006450350.1| hypothetical protein CICLE_v10010345mg, partial [Citrus clementina]
            gi|557553576|gb|ESR63590.1| hypothetical protein
            CICLE_v10010345mg, partial [Citrus clementina]
          Length = 938

 Score =  167 bits (424), Expect = 1e-38
 Identities = 177/606 (29%), Positives = 268/606 (44%), Gaps = 17/606 (2%)
 Frame = -1

Query: 1775 GQPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQID 1596
            GQPW+  V P N + +++ +        +     N++E  +++++               
Sbjct: 370  GQPWDSIVYPKNPYTDKNSVID----AFRDKDHSNSRENTNLVMEC-------------- 411

Query: 1595 IPLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFL 1416
                            + S+       D G QSI  YID   KS +  I+N    N +  
Sbjct: 412  -----------QTSRCSTSSKFVDSGPDGGLQSIHAYIDSFLKSRDPCITNPAQ-NSRTY 459

Query: 1415 GTDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDC 1236
              + DVS+  N+ +  I +R A SSNIELRLGQP QQS + G SV      +  DT+   
Sbjct: 460  NENYDVSKIKNACDPVIAERVATSSNIELRLGQPYQQSQSSGNSVPLVTEPKLLDTVVAQ 519

Query: 1235 QKSLFQDPLIHNTV--NPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICST 1062
             +SLF + + +N      RVA   +    +CS     L+++  NE   N+  H  GI + 
Sbjct: 520  PRSLFLEQMTNNAAYCGERVALRQK---FQCSAGPANLSAR--NESNLNIGRHVFGISNV 574

Query: 1061 LDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV--DNSCPSTSRILHGES 888
             D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV  D+  P +   +H E 
Sbjct: 575  TDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMVSSDHIIPKS---VHCEP 630

Query: 887  HSFDFRSNR-----SDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQ 723
            +S      R      D   RQLN S LGF    DKGKG     D  SY    +     KQ
Sbjct: 631  YSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDSVSNIEKQ 689

Query: 722  MADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS- 546
                      +GG+  P +   H+K   Y  QSS +  DA +ARN  N   KV   G+S 
Sbjct: 690  QESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKVPSLGSSR 748

Query: 545  -SDPAFLRSA----NSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMD 381
             +D  FL S      SS ++      M   L+++ S+    + PA     G GVSPY +D
Sbjct: 749  HTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTGVSPYLLD 806

Query: 380  ENXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGP 201
            +N           LSKQ  AI+SL    E GR    S + ++     +   A  E   GP
Sbjct: 807  DNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS---AFGEQTPGP 863

Query: 200  YLTVKQNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ 24
             +T +++ S VA     S +       +EK + ++ ++N C F T   G    S+E+D+Q
Sbjct: 864  NITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPLLSREIDLQ 923

Query: 23   -NPPHE 9
               PH+
Sbjct: 924  CQFPHD 929


>ref|XP_006483425.1| PREDICTED: uncharacterized protein LOC102613578 isoform X2 [Citrus
            sinensis]
          Length = 2119

 Score =  165 bits (417), Expect = 7e-38
 Identities = 176/605 (29%), Positives = 267/605 (44%), Gaps = 16/605 (2%)
 Frame = -1

Query: 1775 GQPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQID 1596
            GQPW+  V P N + +++ +        +     N++E  +++++               
Sbjct: 370  GQPWDSIVYPKNPYTDKNSVID----AFRDKDHSNSRESTNLVMEC-------------- 411

Query: 1595 IPLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFL 1416
                            + S+       D G QSI  YID   KS +  I+N    N +  
Sbjct: 412  -----------QTSRCSTSSKFVDSGPDGGLQSIHAYIDSFLKSRDPCITNPAQ-NSRTY 459

Query: 1415 GTDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDC 1236
              + DVS+  N+ +  I +R A SSNIELRLGQP QQS + G SV      +  DT+   
Sbjct: 460  NENYDVSKIKNACDPVIAERVATSSNIELRLGQPYQQSQSSGNSVPLVTEPKLLDTVVAQ 519

Query: 1235 QKSLFQDPLIHNT-VNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTL 1059
             +SLF + + +N     RVA   +    +CS     L+++  N    N+  H  GI +  
Sbjct: 520  PRSLFLEQMTNNAYCGERVALRQK---FQCSAGPANLSAR--NVSNLNIGRHVFGISNVT 574

Query: 1058 DLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV--DNSCPSTSRILHGESH 885
            D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV  D+  P +   +H E +
Sbjct: 575  DTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMVSSDHIIPKS---VHCEPY 630

Query: 884  SFDFRSNR-----SDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQM 720
            S      R      D   RQLN S LGF    DKGKG     D  SY    +     KQ 
Sbjct: 631  SAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDSVSNIEKQQ 689

Query: 719  ADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS-- 546
                     +GG+  P +   H+K   Y  QSS +  DA +ARN  N   KV   G+S  
Sbjct: 690  ESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKVPSLGSSRH 748

Query: 545  SDPAFLRSA----NSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDE 378
            +D  FL S      SS ++      M   L+++ S+    + PA     G GVSPY +D+
Sbjct: 749  TDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTGVSPYLLDD 806

Query: 377  NXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPY 198
            N           LSKQ  AI+SL    E GR    S + ++     +   A  E   GP 
Sbjct: 807  NMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS---AFGEQTPGPN 863

Query: 197  LTVKQNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ- 24
            +T +++ S VA     S +       +EK + ++ ++N C F T   G    S+E+D+Q 
Sbjct: 864  ITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPLLSREIDLQC 923

Query: 23   NPPHE 9
              PH+
Sbjct: 924  QFPHD 928


>ref|XP_006483424.1| PREDICTED: uncharacterized protein LOC102613578 isoform X1 [Citrus
            sinensis]
          Length = 2120

 Score =  165 bits (417), Expect = 7e-38
 Identities = 176/606 (29%), Positives = 267/606 (44%), Gaps = 17/606 (2%)
 Frame = -1

Query: 1775 GQPWNGFVSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQID 1596
            GQPW+  V P N + +++ +        +     N++E  +++++               
Sbjct: 370  GQPWDSIVYPKNPYTDKNSVID----AFRDKDHSNSRESTNLVMEC-------------- 411

Query: 1595 IPLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFL 1416
                            + S+       D G QSI  YID   KS +  I+N    N +  
Sbjct: 412  -----------QTSRCSTSSKFVDSGPDGGLQSIHAYIDSFLKSRDPCITNPAQ-NSRTY 459

Query: 1415 GTDSDVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDC 1236
              + DVS+  N+ +  I +R A SSNIELRLGQP QQS + G SV      +  DT+   
Sbjct: 460  NENYDVSKIKNACDPVIAERVATSSNIELRLGQPYQQSQSSGNSVPLVTEPKLLDTVVAQ 519

Query: 1235 QKSLFQDPLIHNTV--NPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICST 1062
             +SLF + + +N      RVA   +    +CS     L+++  N    N+  H  GI + 
Sbjct: 520  PRSLFLEQMTNNAAYCGERVALRQK---FQCSAGPANLSAR--NVSNLNIGRHVFGISNV 574

Query: 1061 LDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV--DNSCPSTSRILHGES 888
             D  KL++F G + K+S+V   L+  ST+ E + + +A  +MV  D+  P +   +H E 
Sbjct: 575  TDTTKLDKFDGNVTKTSMVPS-LAHVSTAPEMNANSKANNHMVSSDHIIPKS---VHCEP 630

Query: 887  HSFDFRSNR-----SDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQ 723
            +S      R      D   RQLN S LGF    DKGKG     D  SY    +     KQ
Sbjct: 631  YSAKSNPVRVPWTVVDGSERQLNVSELGFFRIEDKGKGVGCTADG-SYAKIDSVSNIEKQ 689

Query: 722  MADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS- 546
                      +GG+  P +   H+K   Y  QSS +  DA +ARN  N   KV   G+S 
Sbjct: 690  QESRCTCPVAMGGSKDPCSSVVHDK-IYYSHQSSGVPPDAFDARNLFNYPEKVPSLGSSR 748

Query: 545  -SDPAFLRSA----NSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMD 381
             +D  FL S      SS ++      M   L+++ S+    + PA     G GVSPY +D
Sbjct: 749  HTDHLFLTSKGSPWGSSQLLQSQAVSMASPLATSASM--QGMAPAIPTVEGTGVSPYLLD 806

Query: 380  ENXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGP 201
            +N           LSKQ  AI+SL    E GR    S + ++     +   A  E   GP
Sbjct: 807  DNMRFLALRQILELSKQQQAISSLGMDQETGRTSNFSNVNIRPLVGPS---AFGEQTPGP 863

Query: 200  YLTVKQNVSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQ 24
             +T +++ S VA     S +       +EK + ++ ++N C F T   G    S+E+D+Q
Sbjct: 864  NITSQRDSSAVAMLSPTSSAYTKLGVNIEKSSPIADLNNSCEFSTWICGNPLLSREIDLQ 923

Query: 23   -NPPHE 9
               PH+
Sbjct: 924  CQFPHD 929


>ref|XP_007226535.1| hypothetical protein PRUPE_ppa025154mg [Prunus persica]
            gi|462423471|gb|EMJ27734.1| hypothetical protein
            PRUPE_ppa025154mg [Prunus persica]
          Length = 893

 Score =  163 bits (412), Expect = 3e-37
 Identities = 175/575 (30%), Positives = 263/575 (45%), Gaps = 17/575 (2%)
 Frame = -1

Query: 1694 MQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIMKESPMPGGLAMSTSLEKGRK 1515
            ++  QQRN Q+G  + LK   GT  + L    D     ++ E P    ++MS  +  G +
Sbjct: 337  VENKQQRNIQDGNTIFLKGFTGTPQSNLHGMAD----NLILERP----ISMSKLVGSGLQ 388

Query: 1514 DSGYQSISDYIDFLT-----------KSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAF 1368
            D G QS+S Y++ +            K GNS I++  L + + +G  S+  R  N+++  
Sbjct: 389  DGG-QSVSAYVESMKNGNSSIIYPAMKIGNSSITDPSLKDRRIMGKGSNFCRTVNAKDGA 447

Query: 1367 IMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNP 1188
               RDA  SNIELRLGQP Q   + G S   ++G    DTL +  KSLF + +I NT N 
Sbjct: 448  F--RDAAISNIELRLGQPYQLGQSSGNSNPPAVGPLLLDTLVNPLKSLFPEQMIPNT-NC 504

Query: 1187 RVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSL 1008
            R   E RQ++     S     S + +  Q N  N+   I + +D  ++E+    + + S+
Sbjct: 505  REEMEFRQSLYF---SAVPSASTKSDHKQLNRGNNAFVIGNAIDAARVEKSTSNLGQDSV 561

Query: 1007 VSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESHSFDF-----RSNRSDEMGR 843
            +S +L+  +   E +   +A   + +    +    LH E  S  +       N S+ + R
Sbjct: 562  IS-FLTNLNAPPEDNTRPKASKYICNVGEHAMQNTLHYEPQSAKYGIVNVPRNGSNSVER 620

Query: 842  QLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTL 663
            QL+ S LG    +DK KG    TD  S+++      + K+M  SS F GL  G   P  L
Sbjct: 621  QLDMSQLGSYRLIDKDKGVSFVTDD-SHLSKDLGFRNRKEMEISSSFNGL-SGTSDPRFL 678

Query: 662  TAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSSDPAFLRSANSSTVIAGAGSV 483
            TAH K S Y  Q S +  D  ++R  SN   KV   GN      +     ++ + G+G  
Sbjct: 679  TAH-KNSCYSHQLSGVAPDGPDSRKYSNFPDKVLYFGNRGQVGHVNHRPLASSV-GSGQT 736

Query: 482  MPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXXXLSKQGHAIASLET 303
             P   S T S   P LTPA S    I VS    D+N           LSKQ HA+ SL  
Sbjct: 737  FP---SRTVSKGIP-LTPALSRENLIEVSTQLPDDNSRLLALREIMELSKQHHALPSLPM 792

Query: 302  RPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVAGKPLQSCSNHYADK 123
               +G IF  S+     + S  D  AS +      LT K  VSE   K  QS ++     
Sbjct: 793  NRGKG-IFDCSS---YMQNSLVDTSASGKQERKLSLTSKNAVSEATIKSHQSGASC---- 844

Query: 122  VVEKLADVSGVSNWCNFLTSPRG-VFNSKELDMQN 21
               ++    GV+  C+F T  +G   +SKE+D+++
Sbjct: 845  ---RIGSDEGVNTCCHFSTLKQGNALHSKEVDLKH 876


>ref|XP_007011789.1| Uncharacterized protein isoform 9 [Theobroma cacao]
            gi|508782152|gb|EOY29408.1| Uncharacterized protein
            isoform 9 [Theobroma cacao]
          Length = 1619

 Score =  160 bits (406), Expect = 1e-36
 Identities = 173/587 (29%), Positives = 269/587 (45%), Gaps = 11/587 (1%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            +SP+N    Q+     +  ++   Q +  +   + LLK L+G + + L    D+   Q M
Sbjct: 5    MSPSNPQTGQN----SATGLLHNKQDQKIEGSSNFLLKHLIGASQSNLH---DVADGQRM 57

Query: 1574 KESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVS 1395
             E  +     MST + +   D+G QS+S +ID + K+GNS +++  L NL+ LG + DVS
Sbjct: 58   -ECAVTRSSTMSTFVGRD-SDNGCQSMSVWIDSILKTGNSSLAHSSLQNLRSLGQNYDVS 115

Query: 1394 RCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQD 1215
                + +  I DRDA SSN+EL+LGQP QQ+  +G + +  +  + F T+ D  KS + +
Sbjct: 116  AAKIADDGVISDRDATSSNVELKLGQPYQQNQPIGNTALPFIARKRFGTVVDPPKSCYPE 175

Query: 1214 PLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQF 1035
            P+IH+  N    EESRQ     + S  +  + R  +    L NH  G+ S +D  KL++ 
Sbjct: 176  PMIHH-ANFCGEEESRQYCHHDADS--SNRTARRQQSHLILGNHAFGVSSVMDATKLDKC 232

Query: 1034 RGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV-DNSCPSTSRILHGESHS-----FDF 873
            RG   KS +V + L Q     E     R  +NM  + S P T    H ES++      + 
Sbjct: 233  RGDATKSLVVPL-LPQL--PLEGSARSRGASNMAGEFSMPKT---FHCESNTTKCDPLNT 286

Query: 872  RSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGL 693
                 + +GRQLN   LGF    DKG         C+  A   +L  ++Q+ +    TG+
Sbjct: 287  PLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALRIHQQVENPRNVTGV 344

Query: 692  VGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS--SDPAFLR-- 525
            V     P     H   S   CQSS+I  D  + R+  N     S  G+S  +D A+LR  
Sbjct: 345  V-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFIGSSGYTDQAYLRMM 396

Query: 524  SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXX 345
            S++  +      S   MG     S   P  T   S       SP  +D++          
Sbjct: 397  SSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCLLDDSMRLLALRQIL 451

Query: 344  XLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVA 165
             LSKQ HA +S+    E GR    S   +Q     + +  S E R G  +  K +V E A
Sbjct: 452  ELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRHGAIVPSKLDVFEGA 508

Query: 164  GKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDM 27
               + S          EK   ++G+++ C+F T  +G+   S+E+D+
Sbjct: 509  AASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVDI 548


>ref|XP_007011788.1| Uncharacterized protein isoform 8, partial [Theobroma cacao]
            gi|508782151|gb|EOY29407.1| Uncharacterized protein
            isoform 8, partial [Theobroma cacao]
          Length = 2068

 Score =  160 bits (406), Expect = 1e-36
 Identities = 173/587 (29%), Positives = 269/587 (45%), Gaps = 11/587 (1%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            +SP+N    Q+     +  ++   Q +  +   + LLK L+G + + L    D+   Q M
Sbjct: 371  MSPSNPQTGQN----SATGLLHNKQDQKIEGSSNFLLKHLIGASQSNLH---DVADGQRM 423

Query: 1574 KESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVS 1395
             E  +     MST + +   D+G QS+S +ID + K+GNS +++  L NL+ LG + DVS
Sbjct: 424  -ECAVTRSSTMSTFVGRD-SDNGCQSMSVWIDSILKTGNSSLAHSSLQNLRSLGQNYDVS 481

Query: 1394 RCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQD 1215
                + +  I DRDA SSN+EL+LGQP QQ+  +G + +  +  + F T+ D  KS + +
Sbjct: 482  AAKIADDGVISDRDATSSNVELKLGQPYQQNQPIGNTALPFIARKRFGTVVDPPKSCYPE 541

Query: 1214 PLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQF 1035
            P+IH+  N    EESRQ     + S  +  + R  +    L NH  G+ S +D  KL++ 
Sbjct: 542  PMIHH-ANFCGEEESRQYCHHDADS--SNRTARRQQSHLILGNHAFGVSSVMDATKLDKC 598

Query: 1034 RGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV-DNSCPSTSRILHGESHS-----FDF 873
            RG   KS +V + L Q     E     R  +NM  + S P T    H ES++      + 
Sbjct: 599  RGDATKSLVVPL-LPQL--PLEGSARSRGASNMAGEFSMPKT---FHCESNTTKCDPLNT 652

Query: 872  RSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGL 693
                 + +GRQLN   LGF    DKG         C+  A   +L  ++Q+ +    TG+
Sbjct: 653  PLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALRIHQQVENPRNVTGV 710

Query: 692  VGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS--SDPAFLR-- 525
            V     P     H   S   CQSS+I  D  + R+  N     S  G+S  +D A+LR  
Sbjct: 711  V-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFIGSSGYTDQAYLRMM 762

Query: 524  SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXX 345
            S++  +      S   MG     S   P  T   S       SP  +D++          
Sbjct: 763  SSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCLLDDSMRLLALRQIL 817

Query: 344  XLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVA 165
             LSKQ HA +S+    E GR    S   +Q     + +  S E R G  +  K +V E A
Sbjct: 818  ELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRHGAIVPSKLDVFEGA 874

Query: 164  GKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDM 27
               + S          EK   ++G+++ C+F T  +G+   S+E+D+
Sbjct: 875  AASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVDI 914


>ref|XP_007011783.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508782146|gb|EOY29402.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 2104

 Score =  160 bits (406), Expect = 1e-36
 Identities = 173/587 (29%), Positives = 269/587 (45%), Gaps = 11/587 (1%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            +SP+N    Q+     +  ++   Q +  +   + LLK L+G + + L    D+   Q M
Sbjct: 371  MSPSNPQTGQN----SATGLLHNKQDQKIEGSSNFLLKHLIGASQSNLH---DVADGQRM 423

Query: 1574 KESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVS 1395
             E  +     MST + +   D+G QS+S +ID + K+GNS +++  L NL+ LG + DVS
Sbjct: 424  -ECAVTRSSTMSTFVGRD-SDNGCQSMSVWIDSILKTGNSSLAHSSLQNLRSLGQNYDVS 481

Query: 1394 RCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQD 1215
                + +  I DRDA SSN+EL+LGQP QQ+  +G + +  +  + F T+ D  KS + +
Sbjct: 482  AAKIADDGVISDRDATSSNVELKLGQPYQQNQPIGNTALPFIARKRFGTVVDPPKSCYPE 541

Query: 1214 PLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQF 1035
            P+IH+  N    EESRQ     + S  +  + R  +    L NH  G+ S +D  KL++ 
Sbjct: 542  PMIHH-ANFCGEEESRQYCHHDADS--SNRTARRQQSHLILGNHAFGVSSVMDATKLDKC 598

Query: 1034 RGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV-DNSCPSTSRILHGESHS-----FDF 873
            RG   KS +V + L Q     E     R  +NM  + S P T    H ES++      + 
Sbjct: 599  RGDATKSLVVPL-LPQL--PLEGSARSRGASNMAGEFSMPKT---FHCESNTTKCDPLNT 652

Query: 872  RSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGL 693
                 + +GRQLN   LGF    DKG         C+  A   +L  ++Q+ +    TG+
Sbjct: 653  PLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALRIHQQVENPRNVTGV 710

Query: 692  VGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS--SDPAFLR-- 525
            V     P     H   S   CQSS+I  D  + R+  N     S  G+S  +D A+LR  
Sbjct: 711  V-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFIGSSGYTDQAYLRMM 762

Query: 524  SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXX 345
            S++  +      S   MG     S   P  T   S       SP  +D++          
Sbjct: 763  SSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCLLDDSMRLLALRQIL 817

Query: 344  XLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVA 165
             LSKQ HA +S+    E GR    S   +Q     + +  S E R G  +  K +V E A
Sbjct: 818  ELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRHGAIVPSKLDVFEGA 874

Query: 164  GKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDM 27
               + S          EK   ++G+++ C+F T  +G+   S+E+D+
Sbjct: 875  AASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVDI 914


>ref|XP_007011781.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590572148|ref|XP_007011782.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572172|ref|XP_007011784.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572176|ref|XP_007011785.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572180|ref|XP_007011786.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590572184|ref|XP_007011787.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782144|gb|EOY29400.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782145|gb|EOY29401.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782147|gb|EOY29403.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782148|gb|EOY29404.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508782149|gb|EOY29405.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508782150|gb|EOY29406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1738

 Score =  160 bits (406), Expect = 1e-36
 Identities = 173/587 (29%), Positives = 269/587 (45%), Gaps = 11/587 (1%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            +SP+N    Q+     +  ++   Q +  +   + LLK L+G + + L    D+   Q M
Sbjct: 5    MSPSNPQTGQN----SATGLLHNKQDQKIEGSSNFLLKHLIGASQSNLH---DVADGQRM 57

Query: 1574 KESPMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVS 1395
             E  +     MST + +   D+G QS+S +ID + K+GNS +++  L NL+ LG + DVS
Sbjct: 58   -ECAVTRSSTMSTFVGRD-SDNGCQSMSVWIDSILKTGNSSLAHSSLQNLRSLGQNYDVS 115

Query: 1394 RCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQD 1215
                + +  I DRDA SSN+EL+LGQP QQ+  +G + +  +  + F T+ D  KS + +
Sbjct: 116  AAKIADDGVISDRDATSSNVELKLGQPYQQNQPIGNTALPFIARKRFGTVVDPPKSCYPE 175

Query: 1214 PLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQF 1035
            P+IH+  N    EESRQ     + S  +  + R  +    L NH  G+ S +D  KL++ 
Sbjct: 176  PMIHH-ANFCGEEESRQYCHHDADS--SNRTARRQQSHLILGNHAFGVSSVMDATKLDKC 232

Query: 1034 RGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMV-DNSCPSTSRILHGESHS-----FDF 873
            RG   KS +V + L Q     E     R  +NM  + S P T    H ES++      + 
Sbjct: 233  RGDATKSLVVPL-LPQL--PLEGSARSRGASNMAGEFSMPKT---FHCESNTTKCDPLNT 286

Query: 872  RSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGL 693
                 + +GRQLN   LGF    DKG         C+  A   +L  ++Q+ +    TG+
Sbjct: 287  PLTIGNTLGRQLNMPELGFCRLTDKGNAGSECVSFCT--ATDPALRIHQQVENPRNVTGV 344

Query: 692  VGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNS--SDPAFLR-- 525
            V     P     H   S   CQSS+I  D  + R+  N     S  G+S  +D A+LR  
Sbjct: 345  V-----PGFSAVHGMDS---CQSSNIHSDRFDERSCLNLPGNSSFIGSSGYTDQAYLRMM 396

Query: 524  SANSSTVIAGAGSVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXX 345
            S++  +      S   MG     S   P  T   S       SP  +D++          
Sbjct: 397  SSHLGSGQISQSSAASMGYQLATSTFIPGPTSTISQE-----SPCLLDDSMRLLALRQIL 451

Query: 344  XLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVA 165
             LSKQ HA +S+    E GR    S   +Q     + +  S E R G  +  K +V E A
Sbjct: 452  ELSKQ-HATSSVGMSHELGRFDRTSNPNVQHCLMESSK--SREDRHGAIVPSKLDVFEGA 508

Query: 164  GKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDM 27
               + S          EK   ++G+++ C+F T  +G+   S+E+D+
Sbjct: 509  AASVPS-------PAAEKSIPMTGLNSRCDFSTLTQGLSLCSREVDI 548


>ref|XP_002519906.1| conserved hypothetical protein [Ricinus communis]
            gi|223540952|gb|EEF42510.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 903

 Score =  154 bits (388), Expect = 2e-34
 Identities = 167/554 (30%), Positives = 245/554 (44%), Gaps = 11/554 (1%)
 Frame = -1

Query: 1715 EKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQ--IMKESPMPGGLAM 1542
            + P    +   QQRN Q+G    LK L+GT+ +   S  D  ++   I + S MP     
Sbjct: 376  QNPVIDALHDEQQRNGQDGNKFYLKGLVGTSLSNSCSVGDNHVTDCSISRCSTMPNFAGR 435

Query: 1541 STSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAFIM 1362
                     ++  QS+  YID + KSG+   ++  L N + L   SDV R  ++++   M
Sbjct: 436  GP-------ENVCQSM--YIDAILKSGSLATAHPALQNCRALVKSSDVGRGKDAQDGATM 486

Query: 1361 DRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNPRV 1182
            ++D   S+IEL+LGQP Q   + G  V+  +G + ++TL    K   Q+ LI+N V+ + 
Sbjct: 487  EKDGSPSSIELKLGQPYQHGQSPGNPVLPVIGPQFYNTLVSPHKPFSQEQLINN-VSCQG 545

Query: 1181 AEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSLVS 1002
             EESR    RC P    L+       Q +L     G   T+D  +LE+    MAK S+VS
Sbjct: 546  EEESR----RCLPHAAHLSDSTIRRKQDHLRYGNSGNDRTVDSTELEKLN--MAKPSVVS 599

Query: 1001 MYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESHSFDFRSNR-----SDEMGRQL 837
            ++  +     E   H +A TN  +      S   H ESH+  F SN       + +  Q 
Sbjct: 600  LF--KHYALPEGTPHSKA-TNSFEY---VMSERRHCESHAVKFDSNNFSWNGGNSLDEQC 653

Query: 836  NFSGLGFLNNVDKGK--GAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTL 663
                  FL   D GK  G   N+   SY+   +     K M + S +T  +      +  
Sbjct: 654  IVPESVFLKPADNGKEVGCLANS---SYIKKASGSNMQKWMGNPSSYTRAMNDATYSNFS 710

Query: 662  TAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSS--DPAFLRSANSSTVIAGAG 489
              H+K +  +  SS++  D S+A N S    K  C GN    D A L S +S  +++   
Sbjct: 711  FMHDK-NRNLYHSSNVPPDVSDAANFSVYLQKGPCFGNGGLLDHAVLTSMDSRQILSSQS 769

Query: 488  SVMPMGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXXXLSKQGHAIASL 309
              +P    S+ S C P LT A  N   I + PY +D+N           LSKQ HA++S 
Sbjct: 770  --VPKVSPSSTSTCIPGLTLAMLNRESICMGPYLLDDNQKLLALGQLLDLSKQQHAMSSF 827

Query: 308  ETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVAGKPLQSCSNHYA 129
              + EQG     S I+ Q   S  +   SEE      LT KQ VSEV  K  Q C     
Sbjct: 828  GRKIEQGNCSNSSNIKAQH--SFVEPSVSEEQTHVHDLTRKQEVSEVVMKLDQPCPPSKT 885

Query: 128  DKVVEKLADVSGVS 87
               V+K    +G S
Sbjct: 886  VDDVDKSTSGTGKS 899


>gb|EXB80746.1| Histone-lysine N-methyltransferase ATX1 [Morus notabilis]
          Length = 2073

 Score =  135 bits (340), Expect = 6e-29
 Identities = 155/589 (26%), Positives = 257/589 (43%), Gaps = 15/589 (2%)
 Frame = -1

Query: 1745 NNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIMKES 1566
            +N H  QS++   S    Q  Q+R+ Q   ++ LK L+ T+ + +   +    S++   +
Sbjct: 365  SNLHTNQSMVIDAS----QNKQKRDAQAS-NIPLKGLIDTSQSNMHPAVG---SRVTNST 416

Query: 1565 PMPGGLAMSTSLEKGRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCN 1386
                   +S S+  G +D G QSIS Y DF+ K+ +  I+   + +L+ +   SD +   
Sbjct: 417  -------VSKSVGSGLQD-GCQSISAYTDFILKNRDLSITRPSMQDLRTISQKSDFTMFK 468

Query: 1385 NSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLI 1206
            N+  +  + RDA  SNIEL+LGQP Q S     S   ++GS   DT+ +  K +F   +I
Sbjct: 469  NAPNSIFVGRDAAFSNIELKLGQPYQSSQNSKISDRQALGSHLLDTVINPSKLVFPGQMI 528

Query: 1205 HNTVNPRVAEESRQNIL----RCSPSYTALNSKRENECQSNLTNHPPGICSTLDLDKLEQ 1038
            HN+   +V  E  Q++      CSP     N KRE   Q NL N+     +      LE+
Sbjct: 529  HNSCRGKV--ELGQSLYFATGSCSP-----NMKREQN-QLNLGNNGFEGSNINSASILEK 580

Query: 1037 FRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESH-----SFDF 873
             RG + +S++V   L+ F+   E ++  +   N+++    + +   + E       S + 
Sbjct: 581  SRGNLVQSAVVP--LTNFNLLAENNVQIKPSDNILNCLEHTANHTQYYEPRFAKCDSSNV 638

Query: 872  RSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGL 693
              N  + + RQLN + +     +DKGKG +  ++  SY+    S  H +    +S     
Sbjct: 639  LWNSGNGLERQLNINEMSSHGLIDKGKGVKLISEG-SYLKDPGSRIHKEFEFSTS----- 692

Query: 692  VGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSSDPAFLRSANS 513
                   S + A +  SS + Q S++  +A   R   N    +   GN  +   + S  S
Sbjct: 693  ------RSQVPASQGSSSDLYQWSTVPLEAPEVRKLCNYPENIPSFGNCLNVDHV-SQRS 745

Query: 512  STVIAGAGSVMP-----MGLSSTNSICRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXX 348
             T   G+G ++P      G     S    + TP+      IGVSP+ +D+N         
Sbjct: 746  FTSSVGSGIILPSQVVTKGHPLATSTHLLDQTPSLHREESIGVSPHLLDDNLRMLALRQI 805

Query: 347  XXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEV 168
              LSKQ HA  S       GR  G S +      S A+  A+ E   GP     + VSE 
Sbjct: 806  LELSKQQHAFPSFGMNKRDGRCDGVSYL----HHSFAESPAAGEQFNGPGPISSREVSEA 861

Query: 167  AGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGV-FNSKELDMQ 24
              K     +         K +   G++  C+  T  RG+  ++KE+ +Q
Sbjct: 862  TAKARLGLAG-----ATSKFSGDEGMTGCCDLSTLIRGIPIHTKEIAVQ 905


>ref|XP_004292737.1| PREDICTED: uncharacterized protein LOC101313577 [Fragaria vesca
            subsp. vesca]
          Length = 2169

 Score =  103 bits (258), Expect = 2e-19
 Identities = 154/569 (27%), Positives = 244/569 (42%), Gaps = 18/569 (3%)
 Frame = -1

Query: 1673 NTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIMKESPMPGGLAMSTSLEKGRKDSGYQSI 1494
            N+Q+  +  LK+L GT+ + L    ++ + + M  S + G          G +DS  Q I
Sbjct: 453  NSQDSNNPFLKALTGTSQSNLQMADNMTMERAMATSKLVGN---------GAEDS-CQFI 502

Query: 1493 SDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAFIMDRDAISSNIELRLGQP 1314
            S Y   +     + I++  L   +  G +SD  R  N+R+     RDA  SNIELRLGQP
Sbjct: 503  SSYTGSVPN--RTSIAHPPLQERRINGKESDFRRIENTRDGAF--RDAAISNIELRLGQP 558

Query: 1313 SQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNPRVAEESRQ-NILRCSPSY 1137
             Q + T G + +S++G     T+ +  KSLF   +  +  N R   E  Q + L  +PS 
Sbjct: 559  YQLAQTSGNTDLSAVGPPLLGTVVNPMKSLFPQQMNASRANCREEVEFMQCDRLSANPSN 618

Query: 1136 TALNSKRENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFSTSTEKDLH 957
                S+  N  Q N  N+   I +  D ++        A++S++S+ L+   +  +++  
Sbjct: 619  P---SRNRNWNQLNHGNNAFVIRNGTDDER--------AQNSVISL-LTNLKSPCKENKP 666

Query: 956  FRAPTNMVDNSCPSTSRILHGESHSFD------FRSNRSDEMGRQLNFSGLGF--LNNVD 801
             +A  +M + S  S    LH E  S        +RS  + E  RQL+ S LG   LN+ D
Sbjct: 667  SKANNSMFNVSGNSMRNTLHSEPLSDKNDLATVWRSGGNSE--RQLDMSHLGSYKLNDND 724

Query: 800  KG-KGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTLTAHEKPSSYVCQS 624
            KG   A H     S +A        K+M  SS F  L  GN  P+  TAH + S Y  Q 
Sbjct: 725  KGLSSAAH----ASQLAKDLGFRIRKEMEVSSSFNRL-SGNGDPNFSTAH-RNSCYSHQL 778

Query: 623  SSIIQDASNARNQSNQFTKVSCDGNSS--DPAFLRSANSSTVIAGAGSVMPMGLSSTNSI 450
            S +      ++  SN   KV+   NS   D  +LR   SS         +  G+  + S 
Sbjct: 779  SGVPLGTPESKIMSNYPEKVNSLANSGQVDHVYLRPMASSMGSGIPTQAVSKGIPVSAST 838

Query: 449  CRPNLTPASSNNVGIGVSPYFMDENXXXXXXXXXXXLSKQGHAIASLETRPEQGRIFGPS 270
               +L P       +GV  +  D+            +SK    + S      +GR+   +
Sbjct: 839  SLADLIPPFYREEFVGVHTHLPDDTLQVHATRQMQEISK----LPSPSKNQGEGRVGCST 894

Query: 269  TIEMQRRGSSADRLASEELREGPYLTVKQNVSEVAGKPLQSCSNHYADKVV-----EKLA 105
             ++  R  +SA    S +L     L+ K +VSE    P      H +D        E  A
Sbjct: 895  YMQQSRVDTSASGKQSHKLS----LSDKHDVSEAGVNP------HPSDVTCRIGTDEGFA 944

Query: 104  DVSGVSNWCNFLTSPRG-VFNSKELDMQN 21
             ++GV+  C F    +G   + KE+ +++
Sbjct: 945  SLTGVNCCCQFSQYKQGNAIHFKEVGLKH 973


>ref|XP_006345644.1| PREDICTED: uncharacterized protein LOC102579293 isoform X4 [Solanum
            tuberosum]
          Length = 1457

 Score = 88.2 bits (217), Expect = 1e-14
 Identities = 87/363 (23%), Positives = 151/363 (41%), Gaps = 8/363 (2%)
 Frame = -1

Query: 1514 DSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAFIMDRDAISSNI 1335
            ++G   + DYI   +K  N+F S   L  LK  G +S + +         MD  ++SS+I
Sbjct: 126  ENGSSFLLDYIGSNSKDTNTFNSLPDLKILKSFGINSSLLQ---------MDGSSVSSSI 176

Query: 1334 ELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNPRVAEESRQNIL 1155
            ELRLG PSQQS  +G     +    +     + Q+ LF +PL+H  V  +  EES+ N  
Sbjct: 177  ELRLGHPSQQSKKLGTLAPQTFEYHSIVKPMEYQQPLFPEPLMHKAVESQAVEESKSN-- 234

Query: 1154 RCSPSYTALNSKR-ENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFST 978
                 +  LNS     +CQ +L N   G+ +     +         ++S+  +  +QF  
Sbjct: 235  -----FHMLNSSSISGKCQPDLVNSAYGLHNATSESR--------TRNSIFPVVQAQFKG 281

Query: 977  STEKDLHFRAPTNMVDNS----CPSTSRILHGESHSFDFRSNRSDEMGRQLNFSGLGFLN 810
             +E+ L+     NMV+ S         + L  + +  DF   R     ++ N      L 
Sbjct: 282  PSERLLYSEDIKNMVNGSHTPPREPQCKSLTLKCNQIDFHCARGKMTNKEFNVDTSCALG 341

Query: 809  NVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTLTAHEKPSSYVC 630
              +  K   +N+      A      ++K   +      +V G    + L  HEK + + C
Sbjct: 342  RGNSDKVVANNSVNLHSAAELNFGLYSKNYENKRTVKRIVEGLSHSNRLALHEK-NLHSC 400

Query: 629  QSSSIIQDASNARNQSNQFTK---VSCDGNSSDPAFLRSANSSTVIAGAGSVMPMGLSST 459
            +   I+ D  +A+N  N + K   +S DG   +          +  A     +P+G  S+
Sbjct: 401  KPCGIMMDMPDAQNTLNLYGKTSLISHDGPFDNGNIRSVCKPMSTAAPPSQAVPLGPLSS 460

Query: 458  NSI 450
            +++
Sbjct: 461  STL 463


>ref|XP_006345643.1| PREDICTED: uncharacterized protein LOC102579293 isoform X3 [Solanum
            tuberosum]
          Length = 1476

 Score = 88.2 bits (217), Expect = 1e-14
 Identities = 87/363 (23%), Positives = 151/363 (41%), Gaps = 8/363 (2%)
 Frame = -1

Query: 1514 DSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAFIMDRDAISSNI 1335
            ++G   + DYI   +K  N+F S   L  LK  G +S + +         MD  ++SS+I
Sbjct: 154  ENGSSFLLDYIGSNSKDTNTFNSLPDLKILKSFGINSSLLQ---------MDGSSVSSSI 204

Query: 1334 ELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNPRVAEESRQNIL 1155
            ELRLG PSQQS  +G     +    +     + Q+ LF +PL+H  V  +  EES+ N  
Sbjct: 205  ELRLGHPSQQSKKLGTLAPQTFEYHSIVKPMEYQQPLFPEPLMHKAVESQAVEESKSN-- 262

Query: 1154 RCSPSYTALNSKR-ENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFST 978
                 +  LNS     +CQ +L N   G+ +     +         ++S+  +  +QF  
Sbjct: 263  -----FHMLNSSSISGKCQPDLVNSAYGLHNATSESR--------TRNSIFPVVQAQFKG 309

Query: 977  STEKDLHFRAPTNMVDNS----CPSTSRILHGESHSFDFRSNRSDEMGRQLNFSGLGFLN 810
             +E+ L+     NMV+ S         + L  + +  DF   R     ++ N      L 
Sbjct: 310  PSERLLYSEDIKNMVNGSHTPPREPQCKSLTLKCNQIDFHCARGKMTNKEFNVDTSCALG 369

Query: 809  NVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTLTAHEKPSSYVC 630
              +  K   +N+      A      ++K   +      +V G    + L  HEK + + C
Sbjct: 370  RGNSDKVVANNSVNLHSAAELNFGLYSKNYENKRTVKRIVEGLSHSNRLALHEK-NLHSC 428

Query: 629  QSSSIIQDASNARNQSNQFTK---VSCDGNSSDPAFLRSANSSTVIAGAGSVMPMGLSST 459
            +   I+ D  +A+N  N + K   +S DG   +          +  A     +P+G  S+
Sbjct: 429  KPCGIMMDMPDAQNTLNLYGKTSLISHDGPFDNGNIRSVCKPMSTAAPPSQAVPLGPLSS 488

Query: 458  NSI 450
            +++
Sbjct: 489  STL 491


>ref|XP_006345641.1| PREDICTED: uncharacterized protein LOC102579293 isoform X1 [Solanum
            tuberosum]
          Length = 1485

 Score = 88.2 bits (217), Expect = 1e-14
 Identities = 87/363 (23%), Positives = 151/363 (41%), Gaps = 8/363 (2%)
 Frame = -1

Query: 1514 DSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDSDVSRCNNSREAFIMDRDAISSNI 1335
            ++G   + DYI   +K  N+F S   L  LK  G +S + +         MD  ++SS+I
Sbjct: 154  ENGSSFLLDYIGSNSKDTNTFNSLPDLKILKSFGINSSLLQ---------MDGSSVSSSI 204

Query: 1334 ELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSLFQDPLIHNTVNPRVAEESRQNIL 1155
            ELRLG PSQQS  +G     +    +     + Q+ LF +PL+H  V  +  EES+ N  
Sbjct: 205  ELRLGHPSQQSKKLGTLAPQTFEYHSIVKPMEYQQPLFPEPLMHKAVESQAVEESKSN-- 262

Query: 1154 RCSPSYTALNSKR-ENECQSNLTNHPPGICSTLDLDKLEQFRGAMAKSSLVSMYLSQFST 978
                 +  LNS     +CQ +L N   G+ +     +         ++S+  +  +QF  
Sbjct: 263  -----FHMLNSSSISGKCQPDLVNSAYGLHNATSESR--------TRNSIFPVVQAQFKG 309

Query: 977  STEKDLHFRAPTNMVDNS----CPSTSRILHGESHSFDFRSNRSDEMGRQLNFSGLGFLN 810
             +E+ L+     NMV+ S         + L  + +  DF   R     ++ N      L 
Sbjct: 310  PSERLLYSEDIKNMVNGSHTPPREPQCKSLTLKCNQIDFHCARGKMTNKEFNVDTSCALG 369

Query: 809  NVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVGGNHIPSTLTAHEKPSSYVC 630
              +  K   +N+      A      ++K   +      +V G    + L  HEK + + C
Sbjct: 370  RGNSDKVVANNSVNLHSAAELNFGLYSKNYENKRTVKRIVEGLSHSNRLALHEK-NLHSC 428

Query: 629  QSSSIIQDASNARNQSNQFTK---VSCDGNSSDPAFLRSANSSTVIAGAGSVMPMGLSST 459
            +   I+ D  +A+N  N + K   +S DG   +          +  A     +P+G  S+
Sbjct: 429  KPCGIMMDMPDAQNTLNLYGKTSLISHDGPFDNGNIRSVCKPMSTAAPPSQAVPLGPLSS 488

Query: 458  NSI 450
            +++
Sbjct: 489  STL 491


>ref|XP_003549306.2| PREDICTED: uncharacterized protein LOC100816713 isoform X1 [Glycine
            max]
          Length = 2032

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 146/578 (25%), Positives = 226/578 (39%), Gaps = 5/578 (0%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            V P N+HA+ +L        +   Q    Q+GC++ LK   G + N L  Q+        
Sbjct: 368  VFPKNAHADNNLFID----ALSGKQATTIQDGCNIPLKGFTGISQNSLYDQL-------- 415

Query: 1574 KESPMPGGLAMSTSLEK---GRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDS 1404
            K       LAM T+       + D G Q I  + D   + GN   ++  L     L  D 
Sbjct: 416  KNQLTVSNLAMYTTAPNFVGTQLDDGCQPIPPFFDSQKRKGNLSSAHSPLQIPASLLKDH 475

Query: 1403 DVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSL 1224
            D  +  N+ +  +  +DA SSNI+LRLGQP Q  N + +     +    F+ L    KS 
Sbjct: 476  DCIKKKNANDGLV-GKDAASSNIDLRLGQPPQTGNLLPSFAEPLL----FNALASPPKSQ 530

Query: 1223 FQDPLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENEC-QSNLTNHPPGICSTLDLDK 1047
                +I+N      A+ SR+  L+ + SY A + K   E  Q  L N+   + +      
Sbjct: 531  PLKQMINN------ADLSREEELQNNFSYAAGSIKMVQEMPQLKLNNYMSAVGNA----- 579

Query: 1046 LEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESHSFDFRS 867
                  A +++  V+  LS FS   + D      T   +N     S I+  + +S D+  
Sbjct: 580  ---SARARSETKNVAEGLS-FSPFLQFDNQSGGKTKASENLWNDESSIMPKKLYS-DY-- 632

Query: 866  NRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVG 687
                  GRQ N SG+    +++  KG     D  S +   +     + M   S     V 
Sbjct: 633  ---GHTGRQSNNSGIRTNKSLNNDKGVNFAKD--SGVKINSGFGIGQLMEYPSSIKRAVS 687

Query: 686  GNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSSDPAFLRSANSST 507
             + I   L  + K        SS+  D S   +  +    VS  G  +      +   S 
Sbjct: 688  ASDI---LVVNGK-----IHESSLPSDTSVCADILHGSNNVSFLGQEN-----HTPQRSI 734

Query: 506  VIAGAGSVMPMGLSSTNSICRPNLTP-ASSNNVGIGVSPYFMDENXXXXXXXXXXXLSKQ 330
               G    +P  +SS+ S    N TP       GI +  Y +DEN           LSKQ
Sbjct: 735  PFKGILKGLPHHVSSSVS----NQTPILPQQQQGINMDAYLLDENMRLLALSQILELSKQ 790

Query: 329  GHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVAGKPLQ 150
             HA+       +QGR    S ++  R  +S     SE+   G  L + QN          
Sbjct: 791  QHALYLKYINQKQGRSSCISKVQHYRCEAS----TSEQGTSGATLKLSQNRG-------- 838

Query: 149  SCSNHYADKVVEKLADVSGVSNWCNFLTSPRGVFNSKE 36
               NH +   +EKLA ++G++ +C+    P    +SKE
Sbjct: 839  IWGNHESTVGLEKLASLTGMNGYCHLSGLPPIPLHSKE 876


>ref|XP_006601169.1| PREDICTED: uncharacterized protein LOC100816713 isoform X2 [Glycine
            max]
          Length = 2035

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 146/578 (25%), Positives = 226/578 (39%), Gaps = 5/578 (0%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            V P N+HA+ +L        +   Q    Q+GC++ LK   G + N L  Q+        
Sbjct: 368  VFPKNAHADNNLFID----ALSGKQATTIQDGCNIPLKGFTGISQNSLYDQL-------- 415

Query: 1574 KESPMPGGLAMSTSLEK---GRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDS 1404
            K       LAM T+       + D G Q I  + D   + GN   ++  L     L  D 
Sbjct: 416  KNQLTVSNLAMYTTAPNFVGTQLDDGCQPIPPFFDSQKRKGNLSSAHSPLQIPASLLKDH 475

Query: 1403 DVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSL 1224
            D  +  N+ +  +  +DA SSNI+LRLGQP Q  N + +     +    F+ L    KS 
Sbjct: 476  DCIKKKNANDGLV-GKDAASSNIDLRLGQPPQTGNLLPSFAEPLL----FNALASPPKSQ 530

Query: 1223 FQDPLIHNTVNPRVAEESRQNILRCSPSYTALNSKRENEC-QSNLTNHPPGICSTLDLDK 1047
                +I+N      A+ SR+  L+ + SY A + K   E  Q  L N+   + +      
Sbjct: 531  PLKQMINN------ADLSREEELQNNFSYAAGSIKMVQEMPQLKLNNYMSAVGNA----- 579

Query: 1046 LEQFRGAMAKSSLVSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILHGESHSFDFRS 867
                  A +++  V+  LS FS   + D      T   +N     S I+  + +S D+  
Sbjct: 580  ---SARARSETKNVAEGLS-FSPFLQFDNQSGGKTKASENLWNDESSIMPKKLYS-DY-- 632

Query: 866  NRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMADSSPFTGLVG 687
                  GRQ N SG+    +++  KG     D  S +   +     + M   S     V 
Sbjct: 633  ---GHTGRQSNNSGIRTNKSLNNDKGVNFAKD--SGVKINSGFGIGQLMEYPSSIKRAVS 687

Query: 686  GNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSSDPAFLRSANSST 507
             + I   L  + K        SS+  D S   +  +    VS  G  +      +   S 
Sbjct: 688  ASDI---LVVNGK-----IHESSLPSDTSVCADILHGSNNVSFLGQEN-----HTPQRSI 734

Query: 506  VIAGAGSVMPMGLSSTNSICRPNLTP-ASSNNVGIGVSPYFMDENXXXXXXXXXXXLSKQ 330
               G    +P  +SS+ S    N TP       GI +  Y +DEN           LSKQ
Sbjct: 735  PFKGILKGLPHHVSSSVS----NQTPILPQQQQGINMDAYLLDENMRLLALSQILELSKQ 790

Query: 329  GHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQNVSEVAGKPLQ 150
             HA+       +QGR    S ++  R  +S     SE+   G  L + QN          
Sbjct: 791  QHALYLKYINQKQGRSSCISKVQHYRCEAS----TSEQGTSGATLKLSQNRG-------- 838

Query: 149  SCSNHYADKVVEKLADVSGVSNWCNFLTSPRGVFNSKE 36
               NH +   +EKLA ++G++ +C+    P    +SKE
Sbjct: 839  IWGNHESTVGLEKLASLTGMNGYCHLSGLPPIPLHSKE 876


>ref|XP_006601170.1| PREDICTED: uncharacterized protein LOC100816713 isoform X3 [Glycine
            max]
          Length = 2033

 Score = 86.7 bits (213), Expect = 3e-14
 Identities = 145/588 (24%), Positives = 224/588 (38%), Gaps = 15/588 (2%)
 Frame = -1

Query: 1754 VSPNNSHAEQSLLEKPSKKVMQTPQQRNTQEGCDVLLKSLLGTTHNILSSQIDIPLSQIM 1575
            V P N+HA+ +L        +   Q    Q+GC++ LK   G + N L  Q+        
Sbjct: 368  VFPKNAHADNNLFID----ALSGKQATTIQDGCNIPLKGFTGISQNSLYDQL-------- 415

Query: 1574 KESPMPGGLAMSTSLEK---GRKDSGYQSISDYIDFLTKSGNSFISNQGLGNLKFLGTDS 1404
            K       LAM T+       + D G Q I  + D   + GN   ++  L     L  D 
Sbjct: 416  KNQLTVSNLAMYTTAPNFVGTQLDDGCQPIPPFFDSQKRKGNLSSAHSPLQIPASLLKDH 475

Query: 1403 DVSRCNNSREAFIMDRDAISSNIELRLGQPSQQSNTMGASVMSSMGSRTFDTLGDCQKSL 1224
            D  +  N+ +  +  +DA SSNI+LRLGQP Q  N + +                     
Sbjct: 476  DCIKKKNANDGLV-GKDAASSNIDLRLGQPPQTGNLLPS--------------------- 513

Query: 1223 FQDPLIHNTV-NPRVAEESRQNILRCSPSYTALNSKRENECQSNLTNHPPGICSTLDLD- 1050
            F +PL+ N + +P  ++  +Q I          N  RE E Q+N +     I    ++  
Sbjct: 514  FAEPLLFNALASPPKSQPLKQMI---------NNLSREEELQNNFSYAAGSIKMVQEMPQ 564

Query: 1049 -KLEQFRGAMAKSSL--------VSMYLSQFSTSTEKDLHFRAPTNMVDNSCPSTSRILH 897
             KL  +  A+  +S         V+  LS FS   + D      T   +N     S I+ 
Sbjct: 565  LKLNNYMSAVGNASARARSETKNVAEGLS-FSPFLQFDNQSGGKTKASENLWNDESSIMP 623

Query: 896  GESHSFDFRSNRSDEMGRQLNFSGLGFLNNVDKGKGAQHNTDTCSYMAAKTSLFHNKQMA 717
             + +S D+        GRQ N SG+    +++  KG     D  S +   +     + M 
Sbjct: 624  KKLYS-DY-----GHTGRQSNNSGIRTNKSLNNDKGVNFAKD--SGVKINSGFGIGQLME 675

Query: 716  DSSPFTGLVGGNHIPSTLTAHEKPSSYVCQSSSIIQDASNARNQSNQFTKVSCDGNSSDP 537
              S     V  + I   L  + K        SS+  D S   +  +    VS  G  +  
Sbjct: 676  YPSSIKRAVSASDI---LVVNGK-----IHESSLPSDTSVCADILHGSNNVSFLGQEN-- 725

Query: 536  AFLRSANSSTVIAGAGSVMPMGLSSTNSICRPNLTP-ASSNNVGIGVSPYFMDENXXXXX 360
                +   S    G    +P  +SS+ S    N TP       GI +  Y +DEN     
Sbjct: 726  ---HTPQRSIPFKGILKGLPHHVSSSVS----NQTPILPQQQQGINMDAYLLDENMRLLA 778

Query: 359  XXXXXXLSKQGHAIASLETRPEQGRIFGPSTIEMQRRGSSADRLASEELREGPYLTVKQN 180
                  LSKQ HA+       +QGR    S ++  R  +S     SE+   G  L + QN
Sbjct: 779  LSQILELSKQQHALYLKYINQKQGRSSCISKVQHYRCEAS----TSEQGTSGATLKLSQN 834

Query: 179  VSEVAGKPLQSCSNHYADKVVEKLADVSGVSNWCNFLTSPRGVFNSKE 36
                         NH +   +EKLA ++G++ +C+    P    +SKE
Sbjct: 835  RG--------IWGNHESTVGLEKLASLTGMNGYCHLSGLPPIPLHSKE 874

