BLASTX nr result

ID: Sinomenium22_contig00022807 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00022807
         (1740 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   449   e-123
ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   449   e-123
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              449   e-123
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   449   e-123
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     446   e-122
ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   445   e-122
ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma...   445   e-122
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   445   e-122
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   445   e-122
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   444   e-122
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   444   e-122
ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prun...   444   e-122
ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293...   441   e-121
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   440   e-120
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   438   e-120
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   432   e-118
ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [...   431   e-118
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   427   e-117
ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like i...   419   e-114
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   419   e-114

>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  449 bits (1155), Expect = e-123
 Identities = 243/497 (48%), Positives = 312/497 (62%), Gaps = 11/497 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQKSS+ S GV GT       P  NS +  +   +LQ    ++NI EN
Sbjct: 366  ASQHNKEWKPKSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAANLQDKFSRVNIHEN 425

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE++R  LTFGSF   FD +RN   G QA   +++S+ E  +       
Sbjct: 426  QNVIIAQHIRVPESDRCKLTFGSFGVEFDPSRNSTPGFQAVGISEESNRESAISLPASCP 485

Query: 350  XXXXXXC--GNELDLSKDQVKTSRSDSPVSA-SSEHPLPEKKQSSSTQDLEKYADFALVQ 520
                     G +++L  DQ + S SDSP +  +SEH LPEK  SSS  DL+ YAD  LV+
Sbjct: 486  ESSSEDAPGGKQIELLDDQARNSESDSPEAGLASEHQLPEK--SSSPPDLDNYADIGLVR 543

Query: 521  NNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 697
            N+ PS+ PSE      +P L S F A+DPQ GYD+ +F+P +D+  QGQ  PSP E  ++
Sbjct: 544  NSSPSYAPSESQQQQDHPELPS-FSAYDPQTGYDMSYFQPPIDETVQGQGQPSPREALTA 602

Query: 698  HGANIIPASTVAMIQPQPG-AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
            H  N IP ST+  +Q QP  AQ+YPQVH+S F N MPYRQF+SPV+VPPM +PGYS NP+
Sbjct: 603  HTGNHIPTSTMPTMQQQPPMAQMYPQVHVSPFTNLMPYRQFISPVYVPPMPMPGYSSNPA 662

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            Y HPSNG +Y LMPG  SH  A GLKYG   YKP+PSS+  GF  +T+  GYAINA G V
Sbjct: 663  YPHPSNGNSYMLMPGGGSHLNANGLKYGIQHYKPVPSSNPAGFGNFTSPSGYAINAPGVV 722

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G A   ED + MKYKDG  ++PN Q ++SEIWIQ PR++PGLQS+ YY+I GQ  +AAY+
Sbjct: 723  GSAAGLEDPSRMKYKDGNIYVPNPQAESSEIWIQNPRDLPGLQSSPYYNIPGQT-HAAYL 781

Query: 1229 PSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXX 1408
            PS TG+ASFN AA QS+ MQF G+Y  PQP  +A+PH L                     
Sbjct: 782  PSHTGHASFNAAAAQSSHMQFPGLYPPPQPTAMASPHHLGPVMGNNVGVGVAPSAPGAQV 841

Query: 1409 XXXXXXXINHLNWSTNF 1459
                   + HLNW+TNF
Sbjct: 842  GAYQQPQLGHLNWTTNF 858


>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  449 bits (1154), Expect = e-123
 Identities = 245/500 (49%), Positives = 308/500 (61%), Gaps = 14/500 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            APQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ    + +I EN
Sbjct: 368  APQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQASISEN 427

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+EP         
Sbjct: 428  QNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEPSASLSVSPP 481

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKYADFALVQNNV 529
                     ++DL    + +  +      +SEH LP+KK+SSS Q+LE YAD  LV+ + 
Sbjct: 482  ESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIGLVRESS 541

Query: 530  PSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGA 706
            PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPSP E  +SH A
Sbjct: 542  PSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEALASHTA 600

Query: 707  NIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
            N IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPMA+PGYS NP+
Sbjct: 601  NSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPA 660

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  GYAINA G V
Sbjct: 661  YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVV 720

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++  Q  +AAYM
Sbjct: 721  GSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQTPHAAYM 780

Query: 1229 PSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1399
            PS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L                  
Sbjct: 781  PSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNVGVGVAAAAPG 840

Query: 1400 XXXXXXXXXXINHLNWSTNF 1459
                      + HLNW+TNF
Sbjct: 841  PQVGAYQQPQLGHLNWTTNF 860


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  449 bits (1154), Expect = e-123
 Identities = 245/500 (49%), Positives = 308/500 (61%), Gaps = 14/500 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            APQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ    + +I EN
Sbjct: 317  APQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQASISEN 376

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+EP         
Sbjct: 377  QNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEPSASLSVSPP 430

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKYADFALVQNNV 529
                     ++DL    + +  +      +SEH LP+KK+SSS Q+LE YAD  LV+ + 
Sbjct: 431  ESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIGLVRESS 490

Query: 530  PSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGA 706
            PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPSP E  +SH A
Sbjct: 491  PSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEALASHTA 549

Query: 707  NIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
            N IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPMA+PGYS NP+
Sbjct: 550  NSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPA 609

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  GYAINA G V
Sbjct: 610  YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVV 669

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++  Q  +AAYM
Sbjct: 670  GSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQTPHAAYM 729

Query: 1229 PSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1399
            PS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L                  
Sbjct: 730  PSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNVGVGVAAAAPG 789

Query: 1400 XXXXXXXXXXINHLNWSTNF 1459
                      + HLNW+TNF
Sbjct: 790  PQVGAYQQPQLGHLNWTTNF 809


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  449 bits (1154), Expect = e-123
 Identities = 245/500 (49%), Positives = 308/500 (61%), Gaps = 14/500 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            APQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ    + +I EN
Sbjct: 422  APQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQASISEN 481

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+EP         
Sbjct: 482  QNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEPSASLSVSPP 535

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKYADFALVQNNV 529
                     ++DL    + +  +      +SEH LP+KK+SSS Q+LE YAD  LV+ + 
Sbjct: 536  ESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIGLVRESS 595

Query: 530  PSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGA 706
            PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPSP E  +SH A
Sbjct: 596  PSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEALASHTA 654

Query: 707  NIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
            N IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPMA+PGYS NP+
Sbjct: 655  NSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPA 714

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  GYAINA G V
Sbjct: 715  YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVV 774

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++  Q  +AAYM
Sbjct: 775  GSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQTPHAAYM 834

Query: 1229 PSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1399
            PS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L                  
Sbjct: 835  PSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNVGVGVAAAAPG 894

Query: 1400 XXXXXXXXXXINHLNWSTNF 1459
                      + HLNW+TNF
Sbjct: 895  PQVGAYQQPQLGHLNWTTNF 914


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  446 bits (1148), Expect = e-122
 Identities = 237/498 (47%), Positives = 310/498 (62%), Gaps = 12/498 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIE----VGHLQKINIFEN 169
            A Q NKEW+PKSSQK SL + GV GT    +S P  NS  S  E    +  L ++NI EN
Sbjct: 360  ASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAKVLEKLSRVNIHEN 419

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF   F+S  +   G QA  +  +S+ E          
Sbjct: 420  QNVIIAQHIRVPETDRCRLTFGSFGKEFESDSDLVNGYQA-GAIGESNGEAASSLSAPES 478

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSA-SSEHPLPEKKQSSSTQDLEKYADFALVQNN 526
                     ++DL+ +Q++ S SDSP S  +SE+  P+KK+S+S Q+L+ YAD  LVQ N
Sbjct: 479  SIGDASGSKQVDLTDEQIRNSGSDSPTSGGTSENQFPDKKESTSPQNLDNYADIGLVQGN 538

Query: 527  VPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVM--DDAFQGQHLPSPLEVASSH 700
             PS+         +P L   F A+D Q GYD P+FRP    D+A +GQ LP+P E  SSH
Sbjct: 539  SPSYAPADSQQPEHPEL-PGFSAYDSQTGYDFPYFRPASATDEAMRGQGLPTPQEAFSSH 597

Query: 701  GANIIPASTVAMIQPQ---PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNP 871
              N +P +T++M+Q Q   P AQ+YPQVH+S F N MPYRQFLSPV+VPPMA+PGYS +P
Sbjct: 598  NTNSVP-TTISMVQQQQQPPVAQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSSP 656

Query: 872  SYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGT 1051
            +Y HPSNG +Y LMPG  +H  A  LKYG  Q+KP+P+ +  GF  ++N  GYAIN  G 
Sbjct: 657  AYPHPSNGNSYLLMPGGGTHLNANSLKYGVQQFKPVPAGNPTGFGNFSNPNGYAINTPGV 716

Query: 1052 VGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAY 1225
            VGGA   EDS+ +KYKDG  ++PN Q +TSE+WIQ PRE+PGLQS  YY++ GQ  +AAY
Sbjct: 717  VGGATGLEDSSRIKYKDGNLYVPNPQAETSEMWIQNPRELPGLQSTPYYNMPGQSPHAAY 776

Query: 1226 MPSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXX 1405
            +PS TG+AS+N AA QS+ MQF G+YH PQPA +ANPH L                    
Sbjct: 777  LPSHTGHASYNAAAAQSSHMQFPGLYHPPQPAAIANPHHLGPAMGGNVGVGVAAAAPGAQ 836

Query: 1406 XXXXXXXXINHLNWSTNF 1459
                    + HLNW+TNF
Sbjct: 837  VGAYQQPQLGHLNWTTNF 854


>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  445 bits (1145), Expect = e-122
 Identities = 242/498 (48%), Positives = 311/498 (62%), Gaps = 12/498 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 356  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 415

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXXXX 343
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L     
Sbjct: 416  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAP 475

Query: 344  XXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQ 520
                     G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ
Sbjct: 476  DTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 535

Query: 521  NNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSH 700
            +N PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H
Sbjct: 536  DNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAH 595

Query: 701  GANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
             AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP+
Sbjct: 596  TANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA 654

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G V
Sbjct: 655  YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVV 714

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        YM
Sbjct: 715  GNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYM 772

Query: 1229 PSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXX 1405
            PS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                    
Sbjct: 773  PSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQ 831

Query: 1406 XXXXXXXXINHLNWSTNF 1459
                    + HLNW+TNF
Sbjct: 832  VGAYQQPQLGHLNWTTNF 849


>ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508779952|gb|EOY27208.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 761

 Score =  445 bits (1145), Expect = e-122
 Identities = 242/498 (48%), Positives = 311/498 (62%), Gaps = 12/498 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 268  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 327

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXXXX 343
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L     
Sbjct: 328  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAP 387

Query: 344  XXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQ 520
                     G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ
Sbjct: 388  DTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 447

Query: 521  NNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSH 700
            +N PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H
Sbjct: 448  DNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAH 507

Query: 701  GANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
             AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP+
Sbjct: 508  TANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA 566

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G V
Sbjct: 567  YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVV 626

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        YM
Sbjct: 627  GNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYM 684

Query: 1229 PSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXX 1405
            PS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                    
Sbjct: 685  PSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQ 743

Query: 1406 XXXXXXXXINHLNWSTNF 1459
                    + HLNW+TNF
Sbjct: 744  VGAYQQPQLGHLNWTTNF 761


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  445 bits (1144), Expect = e-122
 Identities = 241/496 (48%), Positives = 310/496 (62%), Gaps = 10/496 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 356  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 415

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E          
Sbjct: 416  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDDAAG- 474

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQNN 526
                   G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ+N
Sbjct: 475  -------GKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDN 527

Query: 527  VPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGA 706
             PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H A
Sbjct: 528  SPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTA 587

Query: 707  NIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSYS 880
            N+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP+Y 
Sbjct: 588  NV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYP 646

Query: 881  HPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVGG 1060
            HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G VG 
Sbjct: 647  HPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGN 706

Query: 1061 AISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMPS 1234
                EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        YMPS
Sbjct: 707  PTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYMPS 764

Query: 1235 QTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXXX 1411
             TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                      
Sbjct: 765  HTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVG 823

Query: 1412 XXXXXXINHLNWSTNF 1459
                  + HLNW+TNF
Sbjct: 824  AYQQPQLGHLNWTTNF 839


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  445 bits (1144), Expect = e-122
 Identities = 245/499 (49%), Positives = 312/499 (62%), Gaps = 13/499 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 358  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 417

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXXXX 343
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L     
Sbjct: 418  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAP 477

Query: 344  XXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQ 520
                     G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ
Sbjct: 478  DTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 537

Query: 521  NNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 697
            +N PS+ PSE       P L S   A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+
Sbjct: 538  DNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSA 597

Query: 698  HGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNP 871
            H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP
Sbjct: 598  HTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNP 656

Query: 872  SYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGT 1051
            +Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G 
Sbjct: 657  AYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGV 716

Query: 1052 VGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAY 1225
            VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        Y
Sbjct: 717  VGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGY 774

Query: 1226 MPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXX 1402
            MPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                   
Sbjct: 775  MPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGA 833

Query: 1403 XXXXXXXXXINHLNWSTNF 1459
                     + HLNW+TNF
Sbjct: 834  QVGAYQQPQLGHLNWTTNF 852


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  444 bits (1143), Expect = e-122
 Identities = 244/497 (49%), Positives = 311/497 (62%), Gaps = 11/497 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 358  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 417

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E          
Sbjct: 418  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDDAAG- 476

Query: 350  XXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQNN 526
                   G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ+N
Sbjct: 477  -------GKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDN 529

Query: 527  VPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHG 703
             PS+ PSE       P L S   A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H 
Sbjct: 530  SPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHT 589

Query: 704  ANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSY 877
            AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP+Y
Sbjct: 590  ANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAY 648

Query: 878  SHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVG 1057
             HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G VG
Sbjct: 649  PHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVG 708

Query: 1058 GAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMP 1231
                 EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        YMP
Sbjct: 709  NPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYMP 766

Query: 1232 SQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXX 1408
            S TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                     
Sbjct: 767  SHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQV 825

Query: 1409 XXXXXXXINHLNWSTNF 1459
                   + HLNW+TNF
Sbjct: 826  GAYQQPQLGHLNWTTNF 842


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  444 bits (1143), Expect = e-122
 Identities = 246/505 (48%), Positives = 313/505 (61%), Gaps = 19/505 (3%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+EN
Sbjct: 383  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 442

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSE--------PC 325
             +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E        P 
Sbjct: 443  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAARLVFSPN 502

Query: 326  LXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYA 502
            L              G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YA
Sbjct: 503  LSVSAPDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYA 562

Query: 503  DFALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSP 679
            D  LVQ+N PS+ PSE       P L S   A+DPQ GYD+P+FRP +D+  +GQ LPSP
Sbjct: 563  DIGLVQDNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSP 622

Query: 680  LEVASSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVP 853
             E  S+H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+P
Sbjct: 623  QEALSAHTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMP 681

Query: 854  GYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYA 1033
            GYS NP+Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYA
Sbjct: 682  GYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYA 741

Query: 1034 INAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQ 1207
            INA G VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++   
Sbjct: 742  INAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--P 799

Query: 1208 VLNAAYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXX 1384
                 YMPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L             
Sbjct: 800  QTPHGYMPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVA 858

Query: 1385 XXXXXXXXXXXXXXXINHLNWSTNF 1459
                           + HLNW+TNF
Sbjct: 859  PAAPGAQVGAYQQPQLGHLNWTTNF 883


>ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prunus persica]
            gi|462411120|gb|EMJ16169.1| hypothetical protein
            PRUPE_ppa001749mg [Prunus persica]
          Length = 771

 Score =  444 bits (1141), Expect = e-122
 Identities = 242/501 (48%), Positives = 301/501 (60%), Gaps = 15/501 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQK S  S GV GT    +SSP+ NS  S  E   LQ    ++N+++N
Sbjct: 278  ASQPNKEWKPKSSQKPSSNSPGVIGTPTKSVSSPD-NSKVSESEAAKLQDKLSRVNVYDN 336

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
             +V+I +N RVP+++R  LTFGS  +  DST N   G QA    ++S+ EP         
Sbjct: 337  SNVVIAQNIRVPDSDRFRLTFGSLGTELDSTGNMVNGFQA-GGTEESNGEPA----GSLS 391

Query: 350  XXXXXXCGNE------LDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSSTQDLEKYADF 508
                  C +E      +DL   QV+ S SDSP S A  E  LPEK  +SS Q L+ YAD 
Sbjct: 392  LSAPQSCSDEASGIKPVDLLDHQVRNSGSDSPASGAVPERQLPEKNDTSSPQTLDNYADI 451

Query: 509  ALVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEV 688
             LV++  PS+          P L   F AFDPQ  Y++P+FRP MD++ +GQ LPSP E 
Sbjct: 452  GLVRDTSPSYAPSDSQQQEQPEL-EGFSAFDPQTSYNIPYFRPHMDESVRGQGLPSPQEA 510

Query: 689  ASSHGANIIPASTVAMIQ--PQPGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYS 862
             SSH  N I ASTVAM+Q  P P AQ+YPQVH+S + N MPYRQFLSPV+VPPMAVPGYS
Sbjct: 511  LSSHNVNSIAASTVAMVQQQPPPVAQMYPQVHVSHYANLMPYRQFLSPVYVPPMAVPGYS 570

Query: 863  RNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINA 1042
             NP+Y H SNG +Y LMPG  SH  A  LKYG   +KP+P+ S  G+  +TN  GYAIN 
Sbjct: 571  SNPAYPHMSNGNSYLLMPGGGSHLNANSLKYGVQPFKPVPAGSPTGYGNFTNPNGYAING 630

Query: 1043 QGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN 1216
             G VGGA   EDS+ +KYKDG  ++ N Q +TSE+WIQ PRE PGLQS  YY++  Q  +
Sbjct: 631  PGVVGGASGLEDSSRIKYKDGNLYVANPQAETSEMWIQNPREHPGLQSTPYYNVPAQSPH 690

Query: 1217 AAYMPSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXX 1396
             AYMPS   +ASFN AA QS+ MQF G+YH PQPA + NPH L                 
Sbjct: 691  GAYMPSHAAHASFNAAAAQSSHMQFPGLYHPPQPAAIPNPHHLGPAMGGNVGVGVAAAAP 750

Query: 1397 XXXXXXXXXXXINHLNWSTNF 1459
                       +NH+NW TNF
Sbjct: 751  GAQVGAYQQPQLNHMNWQTNF 771


>ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293990 [Fragaria vesca
            subsp. vesca]
          Length = 915

 Score =  441 bits (1134), Expect = e-121
 Identities = 242/496 (48%), Positives = 312/496 (62%), Gaps = 10/496 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEV---GHLQKINIFENR 172
            A Q NKEW+PKSSQK S  + GV GT     S P+ + ++ +  V     L ++NI+EN 
Sbjct: 432  ASQPNKEWKPKSSQKPSSNNPGVIGTPTKSASPPDDSKVSESEAVQLQDKLARVNIYENC 491

Query: 173  HVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXXX 352
            +V+I +N RVPE++R  LTFGS   G +    +  GP      ++S+ EP          
Sbjct: 492  NVVIAQNIRVPESDRFRLTFGSL--GTELVNGFQAGP-----TEESNREPQASLSTSAPE 544

Query: 353  XXXXXCGNE-LDLSKDQVKTSRSD-SPVSASSEHPLPEKKQSSSTQDLEKYADFALVQNN 526
                    + +DL  DQV+ S SD S  SA  EH LPEK+++SS Q L+ YAD  LV++N
Sbjct: 545  SHSDEASTKPIDLLDDQVRNSGSDFSAPSAVPEH-LPEKRETSSPQSLDNYADIGLVRDN 603

Query: 527  VPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHG 703
             PS  PS+      +P  +  F AFDPQ GYD+P++RP MD++  GQ LPSP E  SSH 
Sbjct: 604  SPSFTPSDSQNQ--DPPEMQGFTAFDPQTGYDIPYYRPSMDESVHGQGLPSPQEALSSHN 661

Query: 704  ANIIPASTVAMIQPQPG--AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSY 877
            +N IPASTVAM+Q QP   AQ+YPQVH+S + N MPYRQ++SPV+VPPMAVPGYS NP+Y
Sbjct: 662  SNSIPASTVAMVQQQPPHVAQMYPQVHVSHYANMMPYRQYISPVYVPPMAVPGYSNNPAY 721

Query: 878  SHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVG 1057
             H SNG +Y LMPG +SH  A  LKYG  Q+KP+ + S  GF  +TN  GYA+NA G VG
Sbjct: 722  PHMSNGNSYLLMPGGASHLNANSLKYGVQQFKPV-AGSPTGFGNFTNPAGYAMNAPGVVG 780

Query: 1058 GAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMP 1231
            GA   EDS+ MKYKDG  ++PN Q +TSEIWIQ PRE PG+QSA YY++ GQ  +AAYMP
Sbjct: 781  GATGLEDSSRMKYKDGNLYVPNPQAETSEIWIQNPREHPGMQSAPYYNMPGQTPHAAYMP 840

Query: 1232 SQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXXX 1411
            S  G+ASFN AA QS+ MQ+ GMYH PQPA +A+PH +                      
Sbjct: 841  SHGGHASFNAAAAQSSHMQYPGMYHPPQPAAMASPHHM-GPAMPGNVGVGVAAAAPGAQA 899

Query: 1412 XXXXXXINHLNWSTNF 1459
                  +NH+NW+TNF
Sbjct: 900  YQQQPQLNHMNWTTNF 915


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  440 bits (1131), Expect = e-120
 Identities = 243/500 (48%), Positives = 310/500 (62%), Gaps = 14/500 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQK+S+ S GV GT     S P GNS +   +   +Q    ++NI+EN
Sbjct: 368  ATQHNKEWKPKSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESDATDMQEKLLRVNIYEN 427

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXXXX 343
            ++VII ++ RVPE +R  LTFGSF   FDS+RN   G QA    + S +E    L     
Sbjct: 428  QNVIIAQHIRVPETDRCRLTFGSFGVEFDSSRNMPSGFQAAGVTKDSKAESAASLSASAP 487

Query: 344  XXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSSTQDLEKYADFALVQ 520
                       +++L  +QV+ S SDSP S A SEH  P+K  SSS  +L+ YAD  LV+
Sbjct: 488  ESSSDDASGNKQVELLDEQVRNSGSDSPASGAVSEHQSPDK--SSSPPNLDNYADIGLVR 545

Query: 521  NNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSH 700
            ++ P   SE       P L S F A+DPQ  YD+ +FRP +D+  +GQ L S  E   SH
Sbjct: 546  DSSPFTSSESQHQQDPPELPS-FSAYDPQTVYDMSYFRPQIDETVRGQGLQSAQEALISH 604

Query: 701  GANIIPASTVAMIQPQ---PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNP 871
              + +PAS++ M+Q Q   P AQ+YPQVH+S + N MPYRQFLSPV+VP MA+PGYS NP
Sbjct: 605  RVDSMPASSIPMVQQQQQPPIAQMYPQVHVSHYTNLMPYRQFLSPVYVPQMAMPGYSSNP 664

Query: 872  SYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGT 1051
            +Y HPSNG +Y LMPG SSH +A GLKYG  Q+KP+P SS  GF  +T+  GYAINA G 
Sbjct: 665  AYPHPSNGSSYLLMPGGSSHLSANGLKYGIQQFKPVPGSSPTGFGNFTSPTGYAINAPGV 724

Query: 1052 VGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAY 1225
            VG A   EDS+ MKYKDG  ++PN Q +TSEIW+Q PRE+PGLQSA YY++ GQ  +AAY
Sbjct: 725  VGSATGLEDSSRMKYKDGNLYVPNPQAETSEIWVQNPRELPGLQSAPYYNMPGQSPHAAY 784

Query: 1226 MPSQTGNASFNVAATQSTQMQFSGMYHQPQPAP--LANPHALXXXXXXXXXXXXXXXXXX 1399
            +PS TG+ASFN AA QS+ MQFSG+Y  P P P  +ANPH L                  
Sbjct: 785  LPSHTGHASFNAAAAQSSHMQFSGLYPPPPPTPAAMANPHHLGPVMGGNVGVGVAPAAPG 844

Query: 1400 XXXXXXXXXXINHLNWSTNF 1459
                      + HLNW+TNF
Sbjct: 845  AQVGAYQQPQLGHLNWTTNF 864


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  438 bits (1127), Expect = e-120
 Identities = 241/498 (48%), Positives = 311/498 (62%), Gaps = 12/498 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQKSS+ S GV GT     S P  NS N  ++  +LQ    +INI EN
Sbjct: 364  ASQHNKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRINIHEN 423

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF  GFD+ R   +  QA   +++S+ E  +       
Sbjct: 424  QNVIIAQHIRVPETDRCKLTFGSFGVGFDAPRTPGF--QAVGISEESNGESAISLPASAP 481

Query: 350  XXXXXXC--GNELDLSKDQVKTSRSDSPV-SASSEHPLPEKKQSSSTQDLEKYADFALVQ 520
                     G +++L  DQ +   SDSP  S  SEHPLP    SSS  +L+ YAD  LV+
Sbjct: 482  DSSSDDASGGKQIELLDDQARNYGSDSPAASLESEHPLPVN--SSSPPNLDNYADIGLVR 539

Query: 521  NNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 697
            N+ PS+ PSE      +P L S F A+DPQ GYD+ +FRP +D+  +GQ LPSP E  ++
Sbjct: 540  NSSPSYAPSESQQQQDHPELPS-FSAYDPQTGYDISYFRPQIDETVRGQGLPSPQEALTT 598

Query: 698  HGANIIPASTVAMIQPQPG-AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 874
            H AN+ PAST++ +Q QP  AQ+YPQVH+SQF N +PYRQF+SPV+VPPM +PGYS +P+
Sbjct: 599  HTANV-PASTMSTVQQQPPMAQMYPQVHVSQFTNLVPYRQFISPVYVPPMPMPGYSSSPA 657

Query: 875  YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1054
            Y HPSNG +Y LMPG  SH  A GLKYG   YKP+P ++  GF  + +  GYAINA G V
Sbjct: 658  YPHPSNGNSYLLMPGGGSHLNANGLKYGIQHYKPVPGNNPAGFGNFVSPSGYAINAPGVV 717

Query: 1055 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1228
            G A   EDS+ MKYKDG  ++PN Q + SEIWIQ PREIPG+QSA YY++ GQ  + AY+
Sbjct: 718  GSATGLEDSSRMKYKDGNLYVPNPQAEASEIWIQNPREIPGMQSAPYYNMPGQT-HTAYL 776

Query: 1229 PSQTGNASFNVAATQSTQMQFSGMY-HQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXX 1405
            PS TG+ASFN AA QS+ MQF G+Y   PQP  + +PH L                    
Sbjct: 777  PSHTGHASFNAAAAQSSHMQFPGLYPPTPQPTAMPSPHHLGPVMGGNVGVGVAPSAPGAQ 836

Query: 1406 XXXXXXXXINHLNWSTNF 1459
                    + HLNW+TNF
Sbjct: 837  VGAYQQPQLGHLNWTTNF 854


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  432 bits (1110), Expect = e-118
 Identities = 234/501 (46%), Positives = 307/501 (61%), Gaps = 15/501 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V  LQ    ++NI EN
Sbjct: 366  ASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHEN 425

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++S+ E          
Sbjct: 426  QNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAAS 485

Query: 350  XXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSSSTQDLEKYADFALV 517
                        +D+  D V+ S S+SP S  +SEH LP+  K +SS QDL+ YAD  LV
Sbjct: 486  KTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLV 545

Query: 518  QNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 697
            ++  PS+P        + S L++FPA+D Q GYD+ +FRP MD++ +GQ LPSP E  +S
Sbjct: 546  RDTDPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALAS 605

Query: 698  HGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRN 868
            H AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +SPV+VP MA+PGYS N
Sbjct: 606  HSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSN 665

Query: 869  PSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQG 1048
            P+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF  +T+  GYAINA  
Sbjct: 666  PAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPS 725

Query: 1049 TVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN-A 1219
             VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+QS  YY++  Q  + A
Sbjct: 726  VVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAA 785

Query: 1220 AYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXX 1396
            AY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +                 
Sbjct: 786  AYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAAP 845

Query: 1397 XXXXXXXXXXXINHLNWSTNF 1459
                       + + NWS NF
Sbjct: 846  GAQVGAYQQPQLGNFNWSPNF 866


>ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 862

 Score =  431 bits (1108), Expect = e-118
 Identities = 234/501 (46%), Positives = 307/501 (61%), Gaps = 15/501 (2%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V  LQ    ++NI EN
Sbjct: 362  ASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNINEN 421

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++S+ E          
Sbjct: 422  QNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAAS 481

Query: 350  XXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSSSTQDLEKYADFALV 517
                        +D+  D V+ S S+SP S  +SEH LP+  K +SS QDL+ YAD  LV
Sbjct: 482  KTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLV 541

Query: 518  QNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 697
            ++  PS+P        + S L++FPA+D Q GYD+ +FRP MD++ +GQ LPSP E  +S
Sbjct: 542  RDTDPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALAS 601

Query: 698  HGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRN 868
            H AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +SPV+VP MA+PGYS N
Sbjct: 602  HSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSN 661

Query: 869  PSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQG 1048
            P+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF  +T+  GYAINA  
Sbjct: 662  PAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPS 721

Query: 1049 TVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN-A 1219
             VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+QS  YY++  Q  + A
Sbjct: 722  VVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAA 781

Query: 1220 AYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXX 1396
            AY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +                 
Sbjct: 782  AYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAAP 841

Query: 1397 XXXXXXXXXXXINHLNWSTNF 1459
                       + + NWS NF
Sbjct: 842  GAQVGAYQQPQLGNFNWSPNF 862


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  427 bits (1098), Expect = e-117
 Identities = 234/502 (46%), Positives = 307/502 (61%), Gaps = 16/502 (3%)
 Frame = +2

Query: 2    APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFEN 169
            A Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V  LQ    ++NI EN
Sbjct: 366  ASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHEN 425

Query: 170  RHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXX 349
            ++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++S+ E          
Sbjct: 426  QNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAAS 485

Query: 350  XXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSSSTQDLEKYADFALV 517
                        +D+  D V+ S S+SP S  +SEH LP+  K +SS QDL+ YAD  LV
Sbjct: 486  KTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLV 545

Query: 518  QNNVPSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVAS 694
            ++  PS+P        + S L++FP A+D Q GYD+ +FRP MD++ +GQ LPSP E  +
Sbjct: 546  RDTDPSYPLSESQQQQDSSELASFPQAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALA 605

Query: 695  SHGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSR 865
            SH AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +SPV+VP MA+PGYS 
Sbjct: 606  SHSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSS 665

Query: 866  NPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQ 1045
            NP+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF  +T+  GYAINA 
Sbjct: 666  NPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAP 725

Query: 1046 GTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN- 1216
              VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+QS  YY++  Q  + 
Sbjct: 726  SVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHA 785

Query: 1217 AAYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXX 1393
            AAY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +                
Sbjct: 786  AAYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAA 845

Query: 1394 XXXXXXXXXXXXINHLNWSTNF 1459
                        + + NWS NF
Sbjct: 846  PGAQVGAYQQPQLGNFNWSPNF 867


>ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 830

 Score =  419 bits (1077), Expect = e-114
 Identities = 229/503 (45%), Positives = 310/503 (61%), Gaps = 19/503 (3%)
 Frame = +2

Query: 8    QSNKEWRPKSSQKSSLMSHGVSGTD-------ANPISSPEGNSLNSNIEV-GHLQKINIF 163
            Q NKEW+PKSSQK +  S GV GT        A+P +   G+  ++  E+   L ++NI+
Sbjct: 333  QQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKLSQVNIY 392

Query: 164  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTR----NYAYGPQAHDSAQQSSSEPCLX 331
            EN++VII ++ RVPE +R  LTFG+  +  DS+R     +  G     + + ++S   L 
Sbjct: 393  ENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEKSNEELTAS---LT 449

Query: 332  XXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSSTQDLEKYADF 508
                           ++DL  + +++SRSDSPVS A+SE  LP+ K SS+TQ+L+ YA+ 
Sbjct: 450  VPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSNTQNLDNYANI 509

Query: 509  ALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLE 685
             LV+++ PS+ PSE      +   +  F A+DP  GYD+P+FRP +D+  +GQ L SP E
Sbjct: 510  GLVRDSSPSYAPSEPQQQDSHD--MPGFAAYDPPAGYDIPYFRPTIDETVRGQGLSSPQE 567

Query: 686  VASSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGY 859
               SH  N  PAST+AM+Q Q  P  Q+YPQVH+S F N MPYRQFLSPV+VPPMA+PGY
Sbjct: 568  ALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGY 627

Query: 860  SRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAIN 1039
            S NP Y HP+NG +Y LMPG  SH  A  LKYG  Q+KP+P+ S  GF  + N  GYA+ 
Sbjct: 628  SSNPPYPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGNFANPTGYAMI 687

Query: 1040 AQGTVGGAISFEDSTGMKYKDG-FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN 1216
              G VGGA + EDS+ +KYKD  ++PN Q +TSEIW+Q PR++PG+QS  YY++ GQ  +
Sbjct: 688  TPGVVGGATALEDSSRVKYKDNLYVPNPQAETSEIWLQNPRDLPGMQSTPYYNMPGQTPH 747

Query: 1217 AAYMPSQTGNASFNVAATQSTQMQFSGMYHQP-QPAPLANPHALXXXXXXXXXXXXXXXX 1393
            AAYMPS TG+ASFN AA QS+ MQF GMYH P QPA +A+PH L                
Sbjct: 748  AAYMPSHTGHASFNAAAAQSSHMQFPGMYHTPPQPAAMASPHHLGPPAIGNNVGVGVAAA 807

Query: 1394 XXXXXXXXXXXX-INHLNWSTNF 1459
                         + H+NW+TNF
Sbjct: 808  APGAQVGAYQQPQLGHINWTTNF 830


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  419 bits (1077), Expect = e-114
 Identities = 229/503 (45%), Positives = 310/503 (61%), Gaps = 19/503 (3%)
 Frame = +2

Query: 8    QSNKEWRPKSSQKSSLMSHGVSGTD-------ANPISSPEGNSLNSNIEV-GHLQKINIF 163
            Q NKEW+PKSSQK +  S GV GT        A+P +   G+  ++  E+   L ++NI+
Sbjct: 358  QQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKLSQVNIY 417

Query: 164  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTR----NYAYGPQAHDSAQQSSSEPCLX 331
            EN++VII ++ RVPE +R  LTFG+  +  DS+R     +  G     + + ++S   L 
Sbjct: 418  ENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEKSNEELTAS---LT 474

Query: 332  XXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSSTQDLEKYADF 508
                           ++DL  + +++SRSDSPVS A+SE  LP+ K SS+TQ+L+ YA+ 
Sbjct: 475  VPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSNTQNLDNYANI 534

Query: 509  ALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLE 685
             LV+++ PS+ PSE      +   +  F A+DP  GYD+P+FRP +D+  +GQ L SP E
Sbjct: 535  GLVRDSSPSYAPSEPQQQDSHD--MPGFAAYDPPAGYDIPYFRPTIDETVRGQGLSSPQE 592

Query: 686  VASSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGY 859
               SH  N  PAST+AM+Q Q  P  Q+YPQVH+S F N MPYRQFLSPV+VPPMA+PGY
Sbjct: 593  ALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGY 652

Query: 860  SRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAIN 1039
            S NP Y HP+NG +Y LMPG  SH  A  LKYG  Q+KP+P+ S  GF  + N  GYA+ 
Sbjct: 653  SSNPPYPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGNFANPTGYAMI 712

Query: 1040 AQGTVGGAISFEDSTGMKYKDG-FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN 1216
              G VGGA + EDS+ +KYKD  ++PN Q +TSEIW+Q PR++PG+QS  YY++ GQ  +
Sbjct: 713  TPGVVGGATALEDSSRVKYKDNLYVPNPQAETSEIWLQNPRDLPGMQSTPYYNMPGQTPH 772

Query: 1217 AAYMPSQTGNASFNVAATQSTQMQFSGMYHQP-QPAPLANPHALXXXXXXXXXXXXXXXX 1393
            AAYMPS TG+ASFN AA QS+ MQF GMYH P QPA +A+PH L                
Sbjct: 773  AAYMPSHTGHASFNAAAAQSSHMQFPGMYHTPPQPAAMASPHHLGPPAIGNNVGVGVAAA 832

Query: 1394 XXXXXXXXXXXX-INHLNWSTNF 1459
                         + H+NW+TNF
Sbjct: 833  APGAQVGAYQQPQLGHINWTTNF 855

