BLASTX nr result

ID: Akebia24_contig00011236 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011236
         (1218 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40233.3| unnamed protein product [Vitis vinifera]              146   2e-32
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]     135   3e-29
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...   134   8e-29
ref|XP_007218938.1| hypothetical protein PRUPE_ppa002306mg [Prun...   127   1e-26
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   117   1e-23
ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...   113   2e-22
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...   113   2e-22
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...   109   2e-21
emb|CBI17072.3| unnamed protein product [Vitis vinifera]              109   2e-21
emb|CAN76278.1| hypothetical protein VITISV_013226 [Vitis vinifera]   109   2e-21
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   106   2e-20
ref|XP_004172802.1| PREDICTED: uncharacterized LOC101207733, par...   105   5e-20
ref|XP_006606288.1| PREDICTED: micronuclear linker histone polyp...   101   7e-19
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...   101   7e-19
ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp...   101   7e-19
ref|XP_007143822.1| hypothetical protein PHAVU_007G104500g [Phas...   100   2e-18
ref|XP_007010395.1| Uncharacterized protein isoform 4 [Theobroma...    92   3e-16
ref|XP_007010393.1| Uncharacterized protein isoform 2 [Theobroma...    92   3e-16
ref|XP_007010392.1| Uncharacterized protein isoform 1 [Theobroma...    92   3e-16
ref|XP_004496186.1| PREDICTED: uncharacterized protein LOC101514...    75   4e-11

>emb|CBI40233.3| unnamed protein product [Vitis vinifera]
          Length = 682

 Score =  146 bits (368), Expect = 2e-32
 Identities = 102/234 (43%), Positives = 129/234 (55%), Gaps = 7/234 (2%)
 Frame = +1

Query: 340  KGESSGMQNELQVTTYHGTP-VLGGVLEALQRAKLSLKHELHRLPLPTQGGHMVRVTDTP 516
            KGESS  Q++        T   LGGVLEALQ+A+LSL+H+L+RLPL  +GG + R  +  
Sbjct: 460  KGESSRSQDKHYALVPRETSNELGGVLEALQQARLSLQHKLNRLPL-IEGGSIGRAIEPS 518

Query: 517  VPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSSLARYQQPISLP 696
             P+ +A +  EIPVGCAGLFRVP   Q G       N L    SDS SSL  Y      P
Sbjct: 519  FPSTRAWERVEIPVGCAGLFRVPADYQLGTATEA--NFLG---SDSQSSLKNYY-----P 568

Query: 697  SEANITNQTN--LLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRPLIDDHSMDNGMGL 870
                + N  +  L  PY   G  VP    +++SP  E GS I  LRP  D +S     GL
Sbjct: 569  DTGFVANPGDRFLTSPYLKTGSSVPTDDSFLTSPYRETGSRIPPLRPSFDYYS---DAGL 625

Query: 871  PASSRYTYP----YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLYDDQNRSN 1020
             AS+RYT+P    + DL+ RMP N GF RP  +   GIP++DH+  YDD  R N
Sbjct: 626  SASTRYTHPTYSSHPDLLYRMPFNEGFARPPRNSEVGIPSTDHFSFYDDHIRPN 679


>gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]
          Length = 654

 Score =  135 bits (341), Expect = 3e-29
 Identities = 117/385 (30%), Positives = 173/385 (44%), Gaps = 45/385 (11%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPH-- 174
            GN SD+TE+RDE++ +T       ++     +S    +    + +SK   NGFL P    
Sbjct: 287  GNHSDVTEDRDEVKAQTLYNVGIDIAQAVDAKSNKVDL---SKESSKPQSNGFLHPTRTR 343

Query: 175  -------LDIGCSHDPQCNGLMVNTEFSFPS------QENLETK----SNGKHY------ 285
                   +    + DP  +      EF+FP+      QE+LE +    S   H+      
Sbjct: 344  AAMGDLKVQASSNIDPVASRFQAQ-EFAFPTAKEKEAQESLENRDFRPSESPHHGQLLHR 402

Query: 286  --QDQSVQKSSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHE 456
               +Q   + +   A  S +K + SG QN+L     H  PV LGGVL+AL++AKLSL+ +
Sbjct: 403  SLPNQPFDRGALSDAGSSSHKRDFSGSQNDLYALVPHNPPVVLGGVLDALKQAKLSLQQK 462

Query: 457  LHRLPLP---TQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPT-------SLQSGX 606
            ++RLPL    TQ   + R  +   P  + GD  EIPVGC GLFR+PT       S Q+  
Sbjct: 463  INRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPTDFATVEASTQANF 522

Query: 607  XXXXXXNLLRPFYSDSGSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYIS 786
                    L P+Y D+  +L    +               L  PY       P   R+++
Sbjct: 523  LSSGSRLSLEPYYPDNKVALTAPDR--------------FLTSPYIESRSEFPPDVRFLT 568

Query: 787  SPNLEMGSGISSLRPLIDDH------SMDNGMGLPASSRYTYPYADLVPRMPPNNGFPRP 948
            S ++  GS  S+L    D H      S++     P    Y  P+ D +PR+P + G  RP
Sbjct: 569  SSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYP-PFPDSMPRIPSDEGLRRP 627

Query: 949  YPSVRS-GIPTSDHYPLYDDQNRSN 1020
            + S RS G+P  D +  YDD  R N
Sbjct: 628  FRSSRSFGLP-EDRFSFYDDHGRPN 651


>ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa]
            gi|222850857|gb|EEE88404.1| hypothetical protein
            POPTR_0008s02540g [Populus trichocarpa]
          Length = 684

 Score =  134 bits (337), Expect = 8e-29
 Identities = 116/382 (30%), Positives = 175/382 (45%), Gaps = 42/382 (10%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GNRSD+TEE  EI+ +  +   T+ +     +S V       E AS   PNG L P H++
Sbjct: 312  GNRSDVTEEGYEIKAQVQQHTGTVAAQSNRAKSEV-------EKASNIQPNGILRPSHVN 364

Query: 181  IGCSHDPQCNGLMVNT----EFSFPSQ-----ENLETKSNGKH---------------YQ 288
            IG   + + +    +     +F+F ++     EN E+  N  H               + 
Sbjct: 365  IGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHSSHDHPQSHSSHD 424

Query: 289  DQSVQKSSSF--HADGSFYKGESSGMQNELQVTTYH-GTPVLGGVLEALQRAKLSLKHEL 459
                Q ++SF  + D  F KG+ SG QNEL     H  +  LGGVL+AL+ A+ SL+ ++
Sbjct: 425  SPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRASNELGGVLDALKLARQSLQQKI 484

Query: 460  HRLPLPTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVP------TSLQSGXXXXXX 621
              LPL  +GG +    D  +P    GD  +IP+G AGLFR+P       S +        
Sbjct: 485  STLPL-IEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAEGSTRKNLDSTNA 543

Query: 622  XNLLRPFYSDSGSSLARYQQ-----PISLPSEANITNQTNLLGPYSGMGVGVPAGHRYIS 786
               LR +Y D+G   A   +     P +  S     +Q      YS  G   P   ++++
Sbjct: 544  GLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSATGSRFPTEDQFLA 603

Query: 787  SPNLEMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP----YADLVPRMPPNNGFPRPYP 954
            S ++E GS ISS RP    + +D     P S+RY+YP    Y   +P++P     P   P
Sbjct: 604  SQDVEAGSRISSQRPFFYPY-LDTVS--PPSARYSYPTNPSYPGPMPQLPSREP-PSFLP 659

Query: 955  SVRSGIPTSDHYPLYDDQNRSN 1020
            S  +G+P +DH+   D   R N
Sbjct: 660  STTAGVPPADHFSFPDYHIRPN 681


>ref|XP_007218938.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica]
            gi|462415400|gb|EMJ20137.1| hypothetical protein
            PRUPE_ppa002306mg [Prunus persica]
          Length = 690

 Score =  127 bits (318), Expect = 1e-26
 Identities = 118/375 (31%), Positives = 169/375 (45%), Gaps = 47/375 (12%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SDITEERDEI+ +T   A  +++  Q  +S  G VC   E   K   NGFLP  H+D
Sbjct: 309  GNHSDITEERDEIKAQTPCSAGVVVAQAQETKSEEGDVCLPKETF-KIQQNGFLPASHVD 367

Query: 181  IGCSHDPQCNGLMVNT---EFSFPSQ------ENLET----KSNGKH--------YQDQS 297
            +G   D      +  +   EF+FP++      E+LE      S+G H          ++S
Sbjct: 368  MGGLQDQLNKSTVAPSQVEEFAFPTENGKQNHESLENFARHPSHGSHPNPLVHGSAHNRS 427

Query: 298  VQKSSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELHRLPL 474
               SSS    G F+KG +SG +++L     H +   LGGVL+AL++AKLSL+  + RLPL
Sbjct: 428  SDASSSVAGSG-FHKGNASGSRSDLYALVPHDSQDRLGGVLDALKQAKLSLQQNMTRLPL 486

Query: 475  PTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDS 654
               G  + +  +  +P  K GD  EIPVGCAGLFR+PT             L        
Sbjct: 487  -VDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEEAATQSSFL-------- 537

Query: 655  GSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGIS----- 819
            GSS +    P +L + + +  +             + A  RY+ SP +E     S     
Sbjct: 538  GSSWSGRYCPETLVTSSFVETRPTF---------SMNAADRYVPSPYIETRQTFSTNATD 588

Query: 820  -----------------SLRPLIDDHSMDNGMGLPASSRY-TYPYADLVPRMPPNNGFPR 945
                             +  P +   S+D     PA +R+ + PY++     PP   +P 
Sbjct: 589  RFIPNAYVESRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPP---YPN 645

Query: 946  PYPSVRSGIP--TSD 984
             YPSV    P  TSD
Sbjct: 646  -YPSVPDRTPWITSD 659


>ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis]
            gi|223526443|gb|EEF28720.1| hypothetical protein
            RCOM_0152200 [Ricinus communis]
          Length = 665

 Score =  117 bits (293), Expect = 1e-23
 Identities = 117/353 (33%), Positives = 163/353 (46%), Gaps = 35/353 (9%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GNRSDITEER EIR     PA T     +G  S V       E  S   P+GFLP  H+D
Sbjct: 308  GNRSDITEERYEIREPAKGPATTNAIQTEGLLSVV-------EGVSNTQPHGFLPSSHVD 360

Query: 181  IGCSHDPQCNGLMVNTEFS-----FPSQENLETKSN-------------------GKHYQ 288
              C  + + +   V  EFS     FP  +  + + N                   G  Y 
Sbjct: 361  AVCLEERKSSIAPV-PEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASFGSQYS 419

Query: 289  DQSVQKSSSFHAD--GSFYKGES-SGMQNELQVTTYH-GTPVLGGVLEALQRAKLSLKHE 456
              S Q   SF ++   SF KG++ SG +NE      H  +  LGGVLEAL+ A+ SL+  
Sbjct: 420  SGS-QSVLSFPSNTGSSFNKGKATSGSENERCALVPHKASGGLGGVLEALEEARQSLQQR 478

Query: 457  LHRLPLPTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLR 636
            ++RLP  +    + +  ++ V    + D+ +IPVGC GLFR+PT            NLL 
Sbjct: 479  INRLP--SVATTVRKSVESSVSTTISRDEVQIPVGCVGLFRLPTDF--SVEGNTRANLLS 534

Query: 637  PFYSDSGSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGI 816
               S +  SL  +     +P+ A  +NQ  +  PY           +++SS  +  GS I
Sbjct: 535  ---SSAQLSLGNHYSDRGVPAAA--SNQF-VASPYLQGRSSSSTEDQFLSSQYVGGGSRI 588

Query: 817  SSLRPLIDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYP 954
             + +P  D + +D   GLP+SSRYTYP       Y DL+PR+P   G   P P
Sbjct: 589  PTPKPYFDPY-LDT--GLPSSSRYTYPNYPINTSYPDLMPRIPSREGSLAPVP 638


>ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|568878417|ref|XP_006492190.1| PREDICTED:
            uncharacterized protein LOC102610545 [Citrus sinensis]
            gi|557538863|gb|ESR49907.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 732

 Score =  113 bits (282), Expect = 2e-22
 Identities = 108/352 (30%), Positives = 151/352 (42%), Gaps = 42/352 (11%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN+SD+TEER+E +V+    A T+ S  Q  ++ V    H     S    NGFLPP   D
Sbjct: 314  GNQSDVTEEREESKVQVQRVAGTVNSQVQEAKTEV----HLSNQLSNTKSNGFLPPQSGD 369

Query: 181  IGCSHDPQCNGLMVNTEFSFPSQENLETKSNGKHY----------------QDQSVQKSS 312
              CS  P    L  +  F+  +++  +      HY                ++QS Q  S
Sbjct: 370  QKCSSTPASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTVS 429

Query: 313  SFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELHRLPLPTQGG 489
            S    GS  + E SG Q+E      H T      VLEAL++A+LSL+ ++  LP  T+  
Sbjct: 430  S--NTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLP-STESR 486

Query: 490  HMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSSLA 669
             + +V +  + A    D  EIPVGC+GLFRVPT            N L    SDS  SLA
Sbjct: 487  SVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDY---AVETSKANFL---VSDSRPSLA 540

Query: 670  RYQ--QPISLPSEANI-------------------TNQTNLLGPYSGMGVGVPAGHRYIS 786
             Y     I L S+                      T    L GP +       A +R ++
Sbjct: 541  NYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLT 600

Query: 787  SPNLEMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP----YADLVPRMPPN 930
                +  S +S +RP  D +      GLP+  +Y YP    Y D VP++P N
Sbjct: 601  RQYSDTRSRVSMMRPSFDSNL---DAGLPSFRQYMYPNFSSYPDQVPQVPRN 649


>ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina]
            gi|557538862|gb|ESR49906.1| hypothetical protein
            CICLE_v10030805mg [Citrus clementina]
          Length = 716

 Score =  113 bits (282), Expect = 2e-22
 Identities = 108/352 (30%), Positives = 151/352 (42%), Gaps = 42/352 (11%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN+SD+TEER+E +V+    A T+ S  Q  ++ V    H     S    NGFLPP   D
Sbjct: 298  GNQSDVTEEREESKVQVQRVAGTVNSQVQEAKTEV----HLSNQLSNTKSNGFLPPQSGD 353

Query: 181  IGCSHDPQCNGLMVNTEFSFPSQENLETKSNGKHY----------------QDQSVQKSS 312
              CS  P    L  +  F+  +++  +      HY                ++QS Q  S
Sbjct: 354  QKCSSTPASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTVS 413

Query: 313  SFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELHRLPLPTQGG 489
            S    GS  + E SG Q+E      H T      VLEAL++A+LSL+ ++  LP  T+  
Sbjct: 414  S--NTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLP-STESR 470

Query: 490  HMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSSLA 669
             + +V +  + A    D  EIPVGC+GLFRVPT            N L    SDS  SLA
Sbjct: 471  SVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDY---AVETSKANFL---VSDSRPSLA 524

Query: 670  RYQ--QPISLPSEANI-------------------TNQTNLLGPYSGMGVGVPAGHRYIS 786
             Y     I L S+                      T    L GP +       A +R ++
Sbjct: 525  NYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLT 584

Query: 787  SPNLEMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP----YADLVPRMPPN 930
                +  S +S +RP  D +      GLP+  +Y YP    Y D VP++P N
Sbjct: 585  RQYSDTRSRVSMMRPSFDSNL---DAGLPSFRQYMYPNFSSYPDQVPQVPRN 633


>ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca
            subsp. vesca]
          Length = 807

 Score =  109 bits (273), Expect = 2e-21
 Identities = 98/295 (33%), Positives = 143/295 (48%), Gaps = 21/295 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SDITEERDE++  T  PA+   S  Q  +S     C   E     L NG+LPP  ++
Sbjct: 317  GNHSDITEERDEMK--TPFPAEINASEAQEAKSEARDSCLFEEKMKTQL-NGYLPPSDVE 373

Query: 181  IGCSHDPQCNGLMVNT----EFSFPS------QENLETKSN----GKHYQ----DQSVQK 306
            +G   D      + +     EF+FP+      QE+LE  ++    G H+     + S  +
Sbjct: 374  MGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPSPGSHHDPLLLESSHNR 433

Query: 307  SSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELHRLPLPTQ 483
            SS   +DG      +SG +N+L     H +   LGGVL+AL++AKLSL+ ++ RLPL   
Sbjct: 434  SSVVSSDGGSSFHNASGSRNDLYALVPHDSQERLGGVLDALKQAKLSLQQKIIRLPLVDD 493

Query: 484  GGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSS 663
                  + + P+PA   G+  +IPVGCAGLFR+PT               +  Y   GSS
Sbjct: 494  TSVQESI-EPPIPAVTTGNRLDIPVGCAGLFRLPTDF------AVEEAATKHSYLGLGSS 546

Query: 664  L--ARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISS 822
            L  ARY     L   A+ T+Q  +   Y         G R+++SP +E    +S+
Sbjct: 547  LPSARYCPDKGL--AASSTDQF-VTSTYVETRPPYHVGDRFVASPYVENRRTVST 598


>emb|CBI17072.3| unnamed protein product [Vitis vinifera]
          Length = 539

 Score =  109 bits (273), Expect = 2e-21
 Identities = 107/366 (29%), Positives = 156/366 (42%), Gaps = 27/366 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+   +DE++  T +PA+ I S  +G +                 PNGFL    ++
Sbjct: 222  GNLSDV---KDEVKGLTLQPAEKITSQDRGVK-----------------PNGFLAATDIE 261

Query: 181  IGCSHDPQCNGLMVNTE------FSFPSQENLETKSNGKHYQDQSVQKS----------- 309
              C    QC+  + +        F+  +Q ++E  +N K  Q+    KS           
Sbjct: 262  TECLEPQQCSSTVSSESPSEFPGFAISNQRSIEANANWKQQQEGLEHKSLHPPSNVLPGR 321

Query: 310  -----SSFHADGSFYKGESSGMQNELQVTTY-HGTPVLGGVLEALQRAKLSLKHELHRLP 471
                 SS H  G+  KGESSG +N+LQV         LG VL+ALQ AKLSL ++L R P
Sbjct: 322  HSAQNSSSHMQGNLLKGESSGCKNKLQVKVLSEKANGLGSVLDALQSAKLSLSNKLSRWP 381

Query: 472  LPTQGGHMVRVTD----TPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRP 639
            L  + G + R  +     PVPA +A +   +PVG      +   LQ G        L   
Sbjct: 382  LSRENGQLRRALELEPPVPVPAMRARNGMAVPVGYGRPHGLTMDLQPGGVQPPASML--- 438

Query: 640  FYSDSGSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGIS 819
              S  G S   Y                     +  MGV V AG++  ++  +E    IS
Sbjct: 439  -GSSPGFSSTMY---------------------HPEMGVAVSAGNQLGTNHWMEPRPRIS 476

Query: 820  SLRPLIDDHSMDNGMGLPASSRYTYPYADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLY 999
            +LR   + H  D G   P   +Y++    L+ RMP +N   +P  S  +G+     Y LY
Sbjct: 477  TLRQNSEPHP-DTGRDSPGYGQYSH---ILMQRMPSDNKISKPLQS--NGMKRRS-YSLY 529

Query: 1000 DDQNRS 1017
            DD +RS
Sbjct: 530  DDHSRS 535


>emb|CAN76278.1| hypothetical protein VITISV_013226 [Vitis vinifera]
          Length = 580

 Score =  109 bits (273), Expect = 2e-21
 Identities = 107/366 (29%), Positives = 156/366 (42%), Gaps = 27/366 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+   +DE++  T +PA+ I S  +G +                 PNGFL    ++
Sbjct: 263  GNLSDV---KDEVKGLTLQPAEKITSQDRGVK-----------------PNGFLAATDIE 302

Query: 181  IGCSHDPQCNGLMVNTE------FSFPSQENLETKSNGKHYQDQSVQKS----------- 309
              C    QC+  + +        F+  +Q ++E  +N K  Q+    KS           
Sbjct: 303  TECLEPQQCSSTVSSESPSEFPGFAISNQRSIEANANWKQQQEGLEHKSLHPPSNVLPGR 362

Query: 310  -----SSFHADGSFYKGESSGMQNELQVTTY-HGTPVLGGVLEALQRAKLSLKHELHRLP 471
                 SS H  G+  KGESSG +N+LQV         LG VL+ALQ AKLSL ++L R P
Sbjct: 363  HSAQNSSSHMQGNLLKGESSGCKNKLQVKVLSEKANGLGSVLDALQSAKLSLSNKLSRWP 422

Query: 472  LPTQGGHMVRVTD----TPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRP 639
            L  + G + R  +     PVPA +A +   +PVG      +   LQ G        L   
Sbjct: 423  LSRENGQLRRALELEPPVPVPAMRARNGMAVPVGYGRPHGLTMDLQPGGVQPPASML--- 479

Query: 640  FYSDSGSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGIS 819
              S  G S   Y                     +  MGV V AG++  ++  +E    IS
Sbjct: 480  -GSSPGFSSTMY---------------------HPEMGVAVSAGNQLGTNHWMEPRPRIS 517

Query: 820  SLRPLIDDHSMDNGMGLPASSRYTYPYADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLY 999
            +LR   + H  D G   P   +Y++    L+ RMP +N   +P  S  +G+     Y LY
Sbjct: 518  TLRQNSEPHP-DTGRDSPGYGQYSH---ILMQRMPSDNKISKPLQS--NGMKRRS-YSLY 570

Query: 1000 DDQNRS 1017
            DD +RS
Sbjct: 571  DDHSRS 576


>ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus]
          Length = 671

 Score =  106 bits (265), Expect = 2e-20
 Identities = 113/389 (29%), Positives = 158/389 (40%), Gaps = 49/389 (12%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESG--VGRVCHGGEAASKALPNGFLPPP- 171
            GN SDITEERDE+R +        LS+    E+   V   C   +  S+A  NG  P   
Sbjct: 314  GNHSDITEERDEMRAQAPN-----LSNNPANEAKPQVAFDCDTRDL-SQAQTNGLGPSMC 367

Query: 172  HLDIGCSHDPQCNGLMVNT---EFSFP---------SQEN-LETKSNGKHYQDQSVQKSS 312
             +D+    D   N +  +    EF+FP         SQEN  +  S   H      ++  
Sbjct: 368  AVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQESQENSAQEPSCTSHLNHGLPERPL 427

Query: 313  SFHADGSFYKGESSGMQNELQVTTYHGTPVLGGVLEALQRAKLSLKHELHRLPLPTQGGH 492
            S H   + Y  E+    N+L     H  P L GVLEAL++AKLSL  ++ +LP       
Sbjct: 428  SSHGGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTKKIIKLPSVDGESE 487

Query: 493  MVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSSLAR 672
             +  +  P+   K GD  EIPVGCAGLFR+PT                  ++   SS A 
Sbjct: 488  SIDKSIGPLSIPKMGDRLEIPVGCAGLFRLPTD-----------------FAAEASSQAN 530

Query: 673  YQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRP------- 831
            +     L S + + + T+    Y G G  + A H+    P  EM    S LR        
Sbjct: 531  F-----LASSSQLRSPTH----YPGEGAALSANHQIF--PGHEMEDRSSFLRDSRLRSSG 579

Query: 832  -----------LIDDHSMDNGMGLPASSRYTYPYADLV----------PR-----MPPNN 933
                        + DH  +N    P    +   Y D V          PR     + PN+
Sbjct: 580  YRAGSGFTRDGFLTDHIPENRWKNPGQKHHFDQYFDAVQPSSYVHNYPPRPVSSNIHPND 639

Query: 934  GFPRPYPSVRSGIPTSDHYPLYDDQNRSN 1020
             F R +P   + +P ++ Y  YDDQ R N
Sbjct: 640  TFLRTFPGRSTEMPPTNQYSFYDDQFRPN 668


>ref|XP_004172802.1| PREDICTED: uncharacterized LOC101207733, partial [Cucumis sativus]
          Length = 477

 Score =  105 bits (261), Expect = 5e-20
 Identities = 112/389 (28%), Positives = 157/389 (40%), Gaps = 49/389 (12%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESG--VGRVCHGGEAASKALPNGFLPPP- 171
            GN SDITEERDE+R +        LS+    E+   V   C   +  S+A  NG  P   
Sbjct: 120  GNHSDITEERDEMRAQAPN-----LSNNPANEAKPQVAFDCDTRDL-SQAQTNGLGPSMC 173

Query: 172  HLDIGCSHDPQCNGLMVNT---EFSFP---------SQEN-LETKSNGKHYQDQSVQKSS 312
             +D+    D   N +  +    EF+FP         SQEN  +  S   H      ++  
Sbjct: 174  AVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQESQENSAQEPSCTSHLNHGLPERPL 233

Query: 313  SFHADGSFYKGESSGMQNELQVTTYHGTPVLGGVLEALQRAKLSLKHELHRLPLPTQGGH 492
            S H   + Y  E+    N+L     H  P L GVLEAL++AKLSL  ++ +LP       
Sbjct: 234  SSHGGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQAKLSLTKKIIKLPSVDGESE 293

Query: 493  MVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGSSLAR 672
             +  +  P+   K GD  EIPVGCAGLFR+PT                  ++   SS A 
Sbjct: 294  SIDKSIGPLSIPKVGDRLEIPVGCAGLFRLPTD-----------------FAAEASSQAN 336

Query: 673  YQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRP------- 831
            +     L S + + +  +    Y G G  + A H+    P  EM    S LR        
Sbjct: 337  F-----LASSSQLRSPAH----YPGEGAALSANHQIF--PGHEMEDRSSFLRDSRLRSSG 385

Query: 832  -----------LIDDHSMDNGMGLPASSRYTYPYADLV----------PR-----MPPNN 933
                        + DH  +N    P    +   Y D V          PR     + PN+
Sbjct: 386  YRAGSGFTRDGFLTDHIPENRWKNPGQKHHFDQYFDAVQPSSYVHNYPPRPVSSNIHPND 445

Query: 934  GFPRPYPSVRSGIPTSDHYPLYDDQNRSN 1020
             F R +P   + +P ++ Y  YDDQ R N
Sbjct: 446  TFLRTFPGRSTEMPPTNQYSFYDDQFRPN 474


>ref|XP_006606288.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X5
            [Glycine max]
          Length = 595

 Score =  101 bits (251), Expect = 7e-19
 Identities = 100/367 (27%), Positives = 153/367 (41%), Gaps = 29/367 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TE++DE +V     A  + S  Q  +     VC   E   KA     +P  H D
Sbjct: 259  GNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVCLS-EEKFKAEARDIMPKTHDD 317

Query: 181  IGCSHDPQCNGL----MVNTEFSFPSQENLETKSN---------------GKH-YQDQSV 300
             G   D +        ++  + S P  +  + +S+               G+H Y D   
Sbjct: 318  TGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDSKP 377

Query: 301  QKSSSFHADGSFYKGESSGMQNELQVTTYHGTP-VLGGVLEALQRAKLSLKHELHRLPLP 477
              S      G  ++ ++S  + +L     H  P    GVLE+L++A++SL+ EL RLPL 
Sbjct: 378  TYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLPLV 437

Query: 478  TQGGHMVRVTDTPVPAFKAGDDR-EIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDS 654
              G      T  P  +F   +DR E+PVGC+GLFR+PT    G       N+  P  +  
Sbjct: 438  ESG-----YTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDG--ATARFNVKDP-TAGF 489

Query: 655  GSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRPL 834
            GS+     + +S  S+           PY    + +PA  + ++   +E G    SL   
Sbjct: 490  GSNF-HLNRAMSRTSDGQFFPSL----PYPDTQLSLPANDQSLAIRYVENGPNGGSL--- 541

Query: 835  IDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYP 993
                         +SS+YTYP       Y +  P+MP  N   RPY S   G+P ++ + 
Sbjct: 542  -------------SSSKYTYPTFPINPSYQNATPQMPFGNEVSRPYSSSTVGVPLANRFS 588

Query: 994  LYDDQNR 1014
               D  R
Sbjct: 589  FNSDHLR 595


>ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4
            [Glycine max]
          Length = 641

 Score =  101 bits (251), Expect = 7e-19
 Identities = 100/367 (27%), Positives = 153/367 (41%), Gaps = 29/367 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TE++DE +V     A  + S  Q  +     VC   E   KA     +P  H D
Sbjct: 305  GNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVCLS-EEKFKAEARDIMPKTHDD 363

Query: 181  IGCSHDPQCNGL----MVNTEFSFPSQENLETKSN---------------GKH-YQDQSV 300
             G   D +        ++  + S P  +  + +S+               G+H Y D   
Sbjct: 364  TGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDSKP 423

Query: 301  QKSSSFHADGSFYKGESSGMQNELQVTTYHGTP-VLGGVLEALQRAKLSLKHELHRLPLP 477
              S      G  ++ ++S  + +L     H  P    GVLE+L++A++SL+ EL RLPL 
Sbjct: 424  TYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLPLV 483

Query: 478  TQGGHMVRVTDTPVPAFKAGDDR-EIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDS 654
              G      T  P  +F   +DR E+PVGC+GLFR+PT    G       N+  P  +  
Sbjct: 484  ESG-----YTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDG--ATARFNVKDP-TAGF 535

Query: 655  GSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRPL 834
            GS+     + +S  S+           PY    + +PA  + ++   +E G    SL   
Sbjct: 536  GSNF-HLNRAMSRTSDGQFFPSL----PYPDTQLSLPANDQSLAIRYVENGPNGGSL--- 587

Query: 835  IDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYP 993
                         +SS+YTYP       Y +  P+MP  N   RPY S   G+P ++ + 
Sbjct: 588  -------------SSSKYTYPTFPINPSYQNATPQMPFGNEVSRPYSSSTVGVPLANRFS 634

Query: 994  LYDDQNR 1014
               D  R
Sbjct: 635  FNSDHLR 641


>ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X2
            [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED:
            micronuclear linker histone polyprotein-like isoform X3
            [Glycine max]
          Length = 664

 Score =  101 bits (251), Expect = 7e-19
 Identities = 100/367 (27%), Positives = 153/367 (41%), Gaps = 29/367 (7%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TE++DE +V     A  + S  Q  +     VC   E   KA     +P  H D
Sbjct: 328  GNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVCLS-EEKFKAEARDIMPKTHDD 386

Query: 181  IGCSHDPQCNGL----MVNTEFSFPSQENLETKSN---------------GKH-YQDQSV 300
             G   D +        ++  + S P  +  + +S+               G+H Y D   
Sbjct: 387  TGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDSKP 446

Query: 301  QKSSSFHADGSFYKGESSGMQNELQVTTYHGTP-VLGGVLEALQRAKLSLKHELHRLPLP 477
              S      G  ++ ++S  + +L     H  P    GVLE+L++A++SL+ EL RLPL 
Sbjct: 447  TYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLPLV 506

Query: 478  TQGGHMVRVTDTPVPAFKAGDDR-EIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDS 654
              G      T  P  +F   +DR E+PVGC+GLFR+PT    G       N+  P  +  
Sbjct: 507  ESG-----YTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDG--ATARFNVKDP-TAGF 558

Query: 655  GSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGISSLRPL 834
            GS+     + +S  S+           PY    + +PA  + ++   +E G    SL   
Sbjct: 559  GSNF-HLNRAMSRTSDGQFFPSL----PYPDTQLSLPANDQSLAIRYVENGPNGGSL--- 610

Query: 835  IDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYP 993
                         +SS+YTYP       Y +  P+MP  N   RPY S   G+P ++ + 
Sbjct: 611  -------------SSSKYTYPTFPINPSYQNATPQMPFGNEVSRPYSSSTVGVPLANRFS 657

Query: 994  LYDDQNR 1014
               D  R
Sbjct: 658  FNSDHLR 664


>ref|XP_007143822.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris]
            gi|561017012|gb|ESW15816.1| hypothetical protein
            PHAVU_007G104500g [Phaseolus vulgaris]
          Length = 652

 Score =  100 bits (248), Expect = 2e-18
 Identities = 108/375 (28%), Positives = 167/375 (44%), Gaps = 37/375 (9%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TE++DE +V+    A  + S  +  +   G VC   E   KA     +P  H D
Sbjct: 310  GNHSDMTEDKDEGKVQIPYAAKVVTSKAEESKGEPGGVCLSEEKL-KAEGREIMPKKHDD 368

Query: 181  IGCSHDPQCNGLMVNTEFS---FPSQENLETKSNGK--------HYQDQSVQ-----KSS 312
                 + +      +T FS   F  QEN  +   G         H Q   +      + S
Sbjct: 369  TDVYRNQK------STTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSDMNHLDQGRHS 422

Query: 313  SFHAD--GSFYKGESSGMQNELQ-VTTYHGTPVLGGVLEALQRAKLSLKHELHRLPLPTQ 483
            SF  D  G  ++ ++S  Q +L  + T   +    GVLE+L++A++SL+ EL+RLP+  +
Sbjct: 423  SFPTDIHGVQHQHDASKNQKDLYALVTREQSHQFDGVLESLKQARISLQQELNRLPV-VE 481

Query: 484  GGHMVRVTDTPVPAFKAGDDR-EIPVGCAGLFRVPTSLQSGXXXXXXXNLLRPFYSDSGS 660
            GG+    T  P+P+    +DR EIP G +GLFR+PT                  +SD  +
Sbjct: 482  GGY----TAKPLPSVSKNEDRFEIPFGFSGLFRLPTD-----------------FSDEAT 520

Query: 661  SLARYQQP---ISLPSEANITNQTNLLG------PYSG-MGVGVPAGHRYISSPNLEMGS 810
                 + P          N T     +G      P+SG M +   A  + +++  LE GS
Sbjct: 521  PRFNVRDPTTGFGSNYHLNGTMSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGS 580

Query: 811  GISSLRPLIDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYPSVRSG 969
              SS +   D  S  NG G  +SS+Y+YP       Y +  P+MP  +   RPY +   G
Sbjct: 581  RFSSSQSPFDPFS--NG-GPLSSSKYSYPTFPINPSYQNATPQMPFGDEVSRPYSNSTVG 637

Query: 970  IPTSDHYPLYDDQNR 1014
            +P ++ +   DD  R
Sbjct: 638  VPLANRFSFNDDHLR 652


>ref|XP_007010395.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508727308|gb|EOY19205.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 709

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 113/403 (28%), Positives = 163/403 (40%), Gaps = 65/403 (16%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TEERDEI+ +    + T  S  QG E     +    E   K   N  +PP   D
Sbjct: 319  GNHSDVTEERDEIKAQAQYVSGTATSQVQGAEEE--HISFSAELP-KIHSNDLVPPSQAD 375

Query: 181  IGCSHD-------------PQCNG----LMVNTEFSFPSQENLETKSNGKHY-------- 285
            +    D             P   G     ++  E    S ++  + SN  H+        
Sbjct: 376  MDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNSPSNSSHHFAHPHDSP 435

Query: 286  QDQSVQKSSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELH 462
             +Q+VQ  SS    GS    E    +NEL     H T     GVL++L++A+LSL+ ++ 
Sbjct: 436  GNQAVQHISSDL--GSHSCRELPRNKNELYALVPHETSGRFTGVLDSLKQARLSLQQKIS 493

Query: 463  RLPLPTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSL---------------- 594
             L L  +G  + +  +T     K G+  EIP+GC+GLFRVPT +                
Sbjct: 494  TLSL-VEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFLGSSSQL 552

Query: 595  -------QSGXXXXXXXNLLRPFYSDSGSSLARYQQPISLPSEANITNQTNLLGPY---- 741
                     G       +LL   Y ++ SS +   QP+S        +     GPY    
Sbjct: 553  SLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVS--------SDRFFSGPYMYPR 604

Query: 742  --SGMGVGVPAGHRYISSPNL------EMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP 897
              S       A   YI    +      E GS +S+ +P  D  S++    LP+SS   YP
Sbjct: 605  TSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDP-SLE--PVLPSSSLQNYP 661

Query: 898  ----YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLYDDQNR 1014
                Y DLVP++    GFP  + + RS   T D +  YD   R
Sbjct: 662  TFPSYPDLVPQIHAKEGFP-AFHTTRSVGATPDWFSFYDSHFR 703


>ref|XP_007010393.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590567007|ref|XP_007010394.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508727306|gb|EOY19203.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508727307|gb|EOY19204.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 665

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 113/403 (28%), Positives = 163/403 (40%), Gaps = 65/403 (16%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TEERDEI+ +    + T  S  QG E     +    E   K   N  +PP   D
Sbjct: 275  GNHSDVTEERDEIKAQAQYVSGTATSQVQGAEEE--HISFSAELP-KIHSNDLVPPSQAD 331

Query: 181  IGCSHD-------------PQCNG----LMVNTEFSFPSQENLETKSNGKHY-------- 285
            +    D             P   G     ++  E    S ++  + SN  H+        
Sbjct: 332  MDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNSPSNSSHHFAHPHDSP 391

Query: 286  QDQSVQKSSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELH 462
             +Q+VQ  SS    GS    E    +NEL     H T     GVL++L++A+LSL+ ++ 
Sbjct: 392  GNQAVQHISSDL--GSHSCRELPRNKNELYALVPHETSGRFTGVLDSLKQARLSLQQKIS 449

Query: 463  RLPLPTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSL---------------- 594
             L L  +G  + +  +T     K G+  EIP+GC+GLFRVPT +                
Sbjct: 450  TLSL-VEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFLGSSSQL 508

Query: 595  -------QSGXXXXXXXNLLRPFYSDSGSSLARYQQPISLPSEANITNQTNLLGPY---- 741
                     G       +LL   Y ++ SS +   QP+S        +     GPY    
Sbjct: 509  SLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVS--------SDRFFSGPYMYPR 560

Query: 742  --SGMGVGVPAGHRYISSPNL------EMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP 897
              S       A   YI    +      E GS +S+ +P  D  S++    LP+SS   YP
Sbjct: 561  TSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDP-SLE--PVLPSSSLQNYP 617

Query: 898  ----YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLYDDQNR 1014
                Y DLVP++    GFP  + + RS   T D +  YD   R
Sbjct: 618  TFPSYPDLVPQIHAKEGFP-AFHTTRSVGATPDWFSFYDSHFR 659


>ref|XP_007010392.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508727305|gb|EOY19202.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 749

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 113/403 (28%), Positives = 163/403 (40%), Gaps = 65/403 (16%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TEERDEI+ +    + T  S  QG E     +    E   K   N  +PP   D
Sbjct: 359  GNHSDVTEERDEIKAQAQYVSGTATSQVQGAEEE--HISFSAELP-KIHSNDLVPPSQAD 415

Query: 181  IGCSHD-------------PQCNG----LMVNTEFSFPSQENLETKSNGKHY-------- 285
            +    D             P   G     ++  E    S ++  + SN  H+        
Sbjct: 416  MDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKENHHQSMQSNNSPSNSSHHFAHPHDSP 475

Query: 286  QDQSVQKSSSFHADGSFYKGESSGMQNELQVTTYHGTPV-LGGVLEALQRAKLSLKHELH 462
             +Q+VQ  SS    GS    E    +NEL     H T     GVL++L++A+LSL+ ++ 
Sbjct: 476  GNQAVQHISSDL--GSHSCRELPRNKNELYALVPHETSGRFTGVLDSLKQARLSLQQKIS 533

Query: 463  RLPLPTQGGHMVRVTDTPVPAFKAGDDREIPVGCAGLFRVPTSL---------------- 594
             L L  +G  + +  +T     K G+  EIP+GC+GLFRVPT +                
Sbjct: 534  TLSL-VEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFLGSSSQL 592

Query: 595  -------QSGXXXXXXXNLLRPFYSDSGSSLARYQQPISLPSEANITNQTNLLGPY---- 741
                     G       +LL   Y ++ SS +   QP+S        +     GPY    
Sbjct: 593  SLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVS--------SDRFFSGPYMYPR 644

Query: 742  --SGMGVGVPAGHRYISSPNL------EMGSGISSLRPLIDDHSMDNGMGLPASSRYTYP 897
              S       A   YI    +      E GS +S+ +P  D  S++    LP+SS   YP
Sbjct: 645  TSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDP-SLE--PVLPSSSLQNYP 701

Query: 898  ----YADLVPRMPPNNGFPRPYPSVRSGIPTSDHYPLYDDQNR 1014
                Y DLVP++    GFP  + + RS   T D +  YD   R
Sbjct: 702  TFPSYPDLVPQIHAKEGFP-AFHTTRSVGATPDWFSFYDSHFR 743


>ref|XP_004496186.1| PREDICTED: uncharacterized protein LOC101514253 isoform X5 [Cicer
            arietinum]
          Length = 645

 Score = 75.5 bits (184), Expect = 4e-11
 Identities = 89/372 (23%), Positives = 150/372 (40%), Gaps = 34/372 (9%)
 Frame = +1

Query: 1    GNRSDITEERDEIRVETAEPADTILSHGQGGESGVGRVCHGGEAASKALPNGFLPPPHLD 180
            GN SD+TE+++E + +    +  + S+ Q  ++  G V    E   K+     +P  + D
Sbjct: 294  GNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGV-RSSEEIFKSEARDVMPKSYDD 352

Query: 181  IGCSHDPQCNGLMVNTEFSFPSQENLETKSNGK--------HYQDQSVQKSS-------- 312
                ++        +   +   QENL +  NG         H Q   V            
Sbjct: 353  TSDYNNQNSPTFRTS---NLLGQENLHSPLNGNQTESSVNSHPQSSEVNYHDPHGRGYPD 409

Query: 313  --------SFHADGSFYKGESSGMQNELQVTTYHG-TPVLGGVLEALQRAKLSLKHELHR 465
                     +   GS ++ +SS  +N+L    +   +    G+LE+L++A+LSL+ EL+R
Sbjct: 410  SKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFREQSHEFNGILESLKQARLSLQQELNR 469

Query: 466  LPLPTQGGHMVRVTDTPVPAF--KAGDDREIPVGCAGLFRVPTSLQSGXXXXXXXNLLRP 639
            LPL       ++ +     AF  K+    +IPVG +GLFR+PT              +R 
Sbjct: 470  LPLVESSHKGIKPS-----AFVGKSEGRFDIPVGFSGLFRLPTDFSDEATSRFG---VRD 521

Query: 640  FYSDSGSSLARYQQPISLPSEANITNQTNLLGPYSGMGVGVPAGHRYISSPNLEMGSGIS 819
                 GS+     +  S  S+        +  PY G  + + A  +  ++  LE G    
Sbjct: 522  SAGGFGSNFYHNNRGTSRTSDVQF-----VANPYYGTRMSLSANDQAHTTRYLENGPISD 576

Query: 820  SLRPLIDDHSMDNGMGLPASSRYTYP-------YADLVPRMPPNNGFPRPYPSVRSGIPT 978
            S +   D     NG G P SS+  YP       Y    P+ P      +PY S  +G+P 
Sbjct: 577  SKKTPFDPFL--NG-GPPNSSKPVYPSFPVNPSYQVTSPQTPYGGELSKPYSSRPAGVPF 633

Query: 979  SDHYPLYDDQNR 1014
            +D +  + +  R
Sbjct: 634  ADQFSFHGNHLR 645

