BLASTX nr result

ID: Sinomenium21_contig00011086 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00011086
         (2346 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   536   e-149
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   536   e-149
ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   512   e-142
ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma...   512   e-142
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   511   e-142
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   511   e-142
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   511   e-142
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              509   e-141
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   505   e-140
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     498   e-138
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   496   e-137
ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293...   496   e-137
ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   494   e-137
ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prun...   494   e-137
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   491   e-136
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   481   e-133
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   477   e-131
ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [...   471   e-130
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   466   e-128
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   466   e-128

>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  536 bits (1380), Expect = e-149
 Identities = 314/689 (45%), Positives = 400/689 (58%), Gaps = 35/689 (5%)
 Frame = +3

Query: 30   GKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYST 209
            G++S Q+LNG +D+       A+S+  +RKELL E + TIPN+ S+V  +  +DSQ YS 
Sbjct: 180  GRQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSA 239

Query: 210  TTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREVGVVGVWRQHXXXXXXXXXXX 386
            + L SN+S+VG+Y                + I GAIKREVGVVGV RQ            
Sbjct: 240  S-LASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAP 298

Query: 387  XXXXXXX-------------------KKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKS 509
                                       K  Q  QT VPD V+ S+P +RSF  +QY S+ 
Sbjct: 299  SSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRP 358

Query: 510  HQI-ANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ-- 680
            HQ    HQKAPQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ  
Sbjct: 359  HQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDK 418

Query: 681  --KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEP 854
              + +I EN++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+EP
Sbjct: 419  LSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEP 472

Query: 855  CLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKYA 1034
                              ++DL    + +  +      +SEH LP+KK+SSS Q+LE YA
Sbjct: 473  SASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYA 532

Query: 1035 DFALVQNNVPSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSP 1211
            D  LV+ + PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPSP
Sbjct: 533  DIGLVRESSPSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSP 591

Query: 1212 LEVASSHGANIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMA 1379
             E  +SH AN IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPMA
Sbjct: 592  QEALASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMA 651

Query: 1380 VPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPG 1559
            +PGYS NP+YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  G
Sbjct: 652  MPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTG 711

Query: 1560 YAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDIS 1733
            YAINA G VG A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++ 
Sbjct: 712  YAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMP 771

Query: 1734 GQVLNAAYMPSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXX 1904
             Q  +AAYMPS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L         
Sbjct: 772  AQTPHAAYMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNVG 831

Query: 1905 XXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                               + HLNW+TNF
Sbjct: 832  VGVAAAAPGPQVGAYQQPQLGHLNWTTNF 860


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  536 bits (1380), Expect = e-149
 Identities = 314/689 (45%), Positives = 400/689 (58%), Gaps = 35/689 (5%)
 Frame = +3

Query: 30   GKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYST 209
            G++S Q+LNG +D+       A+S+  +RKELL E + TIPN+ S+V  +  +DSQ YS 
Sbjct: 234  GRQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSA 293

Query: 210  TTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREVGVVGVWRQHXXXXXXXXXXX 386
            + L SN+S+VG+Y                + I GAIKREVGVVGV RQ            
Sbjct: 294  S-LASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAP 352

Query: 387  XXXXXXX-------------------KKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKS 509
                                       K  Q  QT VPD V+ S+P +RSF  +QY S+ 
Sbjct: 353  SSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRP 412

Query: 510  HQI-ANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ-- 680
            HQ    HQKAPQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ  
Sbjct: 413  HQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDK 472

Query: 681  --KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEP 854
              + +I EN++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+EP
Sbjct: 473  LSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEP 526

Query: 855  CLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKYA 1034
                              ++DL    + +  +      +SEH LP+KK+SSS Q+LE YA
Sbjct: 527  SASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYA 586

Query: 1035 DFALVQNNVPSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPSP 1211
            D  LV+ + PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPSP
Sbjct: 587  DIGLVRESSPSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSP 645

Query: 1212 LEVASSHGANIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMA 1379
             E  +SH AN IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPMA
Sbjct: 646  QEALASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMA 705

Query: 1380 VPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPG 1559
            +PGYS NP+YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  G
Sbjct: 706  MPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTG 765

Query: 1560 YAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDIS 1733
            YAINA G VG A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++ 
Sbjct: 766  YAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMP 825

Query: 1734 GQVLNAAYMPSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXX 1904
             Q  +AAYMPS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L         
Sbjct: 826  AQTPHAAYMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNVG 885

Query: 1905 XXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                               + HLNW+TNF
Sbjct: 886  VGVAAAAPGPQVGAYQQPQLGHLNWTTNF 914


>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  512 bits (1318), Expect = e-142
 Identities = 300/680 (44%), Positives = 395/680 (58%), Gaps = 31/680 (4%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 175  QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 233

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 234  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 293

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 294  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 528  QKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIF 695
            QKA Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+
Sbjct: 354  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 413

Query: 696  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXX 869
            EN +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L   
Sbjct: 414  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVS 473

Query: 870  XXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFAL 1046
                       G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  L
Sbjct: 474  APDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGL 533

Query: 1047 VQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVAS 1226
            VQ+N PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S
Sbjct: 534  VQDNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALS 593

Query: 1227 SHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRN 1400
            +H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS N
Sbjct: 594  AHTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSN 652

Query: 1401 PSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQG 1580
            P+Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G
Sbjct: 653  PAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPG 712

Query: 1581 TVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAA 1754
             VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        
Sbjct: 713  VVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHG 770

Query: 1755 YMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1931
            YMPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                  
Sbjct: 771  YMPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPG 829

Query: 1932 XXXXXXXXXXINHLNWSTNF 1991
                      + HLNW+TNF
Sbjct: 830  AQVGAYQQPQLGHLNWTTNF 849


>ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508779952|gb|EOY27208.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 761

 Score =  512 bits (1318), Expect = e-142
 Identities = 300/680 (44%), Positives = 395/680 (58%), Gaps = 31/680 (4%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 87   QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 145

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 146  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 205

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 206  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 265

Query: 528  QKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIF 695
            QKA Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+
Sbjct: 266  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 325

Query: 696  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXX 869
            EN +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L   
Sbjct: 326  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVS 385

Query: 870  XXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFAL 1046
                       G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  L
Sbjct: 386  APDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGL 445

Query: 1047 VQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVAS 1226
            VQ+N PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S
Sbjct: 446  VQDNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALS 505

Query: 1227 SHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRN 1400
            +H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS N
Sbjct: 506  AHTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSN 564

Query: 1401 PSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQG 1580
            P+Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G
Sbjct: 565  PAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPG 624

Query: 1581 TVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAA 1754
             VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        
Sbjct: 625  VVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHG 682

Query: 1755 YMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1931
            YMPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                  
Sbjct: 683  YMPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPG 741

Query: 1932 XXXXXXXXXXINHLNWSTNF 1991
                      + HLNW+TNF
Sbjct: 742  AQVGAYQQPQLGHLNWTTNF 761


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  511 bits (1317), Expect = e-142
 Identities = 299/678 (44%), Positives = 394/678 (58%), Gaps = 29/678 (4%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 175  QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 233

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 234  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 293

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 294  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 528  QKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIF 695
            QKA Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+
Sbjct: 354  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 413

Query: 696  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXX 875
            EN +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E        
Sbjct: 414  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDDAA 473

Query: 876  XXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQ 1052
                     G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ
Sbjct: 474  G--------GKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 525

Query: 1053 NNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSH 1232
            +N PS+         +P  L +F A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H
Sbjct: 526  DNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAH 585

Query: 1233 GANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPS 1406
             AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP+
Sbjct: 586  TANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA 644

Query: 1407 YSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTV 1586
            Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G V
Sbjct: 645  YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVV 704

Query: 1587 GGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYM 1760
            G     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        YM
Sbjct: 705  GNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYM 762

Query: 1761 PSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXX 1937
            PS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                    
Sbjct: 763  PSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQ 821

Query: 1938 XXXXXXXXINHLNWSTNF 1991
                    + HLNW+TNF
Sbjct: 822  VGAYQQPQLGHLNWTTNF 839


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  511 bits (1317), Expect = e-142
 Identities = 303/681 (44%), Positives = 396/681 (58%), Gaps = 32/681 (4%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 177  QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 235

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 296  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 355

Query: 528  QKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIF 695
            QKA Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+
Sbjct: 356  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 415

Query: 696  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPC--LXXX 869
            EN +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E    L   
Sbjct: 416  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVS 475

Query: 870  XXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFAL 1046
                       G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  L
Sbjct: 476  APDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGL 535

Query: 1047 VQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVA 1223
            VQ+N PS+ PSE       P L S   A+DPQ GYD+P+FRP +D+  +GQ LPSP E  
Sbjct: 536  VQDNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEAL 595

Query: 1224 SSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSR 1397
            S+H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS 
Sbjct: 596  SAHTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSS 654

Query: 1398 NPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQ 1577
            NP+Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA 
Sbjct: 655  NPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAP 714

Query: 1578 GTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNA 1751
            G VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++       
Sbjct: 715  GVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPH 772

Query: 1752 AYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXX 1928
             YMPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                 
Sbjct: 773  GYMPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAP 831

Query: 1929 XXXXXXXXXXXINHLNWSTNF 1991
                       + HLNW+TNF
Sbjct: 832  GAQVGAYQQPQLGHLNWTTNF 852


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  511 bits (1316), Expect = e-142
 Identities = 302/679 (44%), Positives = 395/679 (58%), Gaps = 30/679 (4%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 177  QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 235

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 296  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 355

Query: 528  QKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIF 695
            QKA Q NKEW+PK SQKSS+ + GV GT     S P  ++   + E   LQ    ++NI+
Sbjct: 356  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 415

Query: 696  ENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXX 875
            EN +VII ++ RVPE +R  LTFGSF   FDS RN+  G QA   A+ S+ E        
Sbjct: 416  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDDAA 475

Query: 876  XXXXXXXXCGNELDLSKDQVKTSRSDSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQ 1052
                     G  +++  DQ+  S SDSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ
Sbjct: 476  G--------GKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 527

Query: 1053 NNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASS 1229
            +N PS+ PSE       P L S   A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+
Sbjct: 528  DNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSA 587

Query: 1230 HGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNP 1403
            H AN+ PAST+ M+Q Q  P AQ+YPQVH+S F N MPYRQF+SP+++P MA+PGYS NP
Sbjct: 588  HTANV-PASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNP 646

Query: 1404 SYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGT 1583
            +Y HPSNG +Y LMPG SSH  A GLKYG  Q+KP+P+ S  GF  +T+  GYAINA G 
Sbjct: 647  AYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGV 706

Query: 1584 VGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAY 1757
            VG     EDS+ +KYKDG  ++PN Q DTS++WIQ PRE+PGLQSA YY++        Y
Sbjct: 707  VGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGY 764

Query: 1758 MPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXX 1934
            MPS TG+ASFN AA QS+ MQF G+YH  PQPA +ANPH L                   
Sbjct: 765  MPSHTGHASFNAAAAQSSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGA 823

Query: 1935 XXXXXXXXXINHLNWSTNF 1991
                     + HLNW+TNF
Sbjct: 824  QVGAYQQPQLGHLNWTTNF 842


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  509 bits (1311), Expect = e-141
 Identities = 293/630 (46%), Positives = 368/630 (58%), Gaps = 16/630 (2%)
 Frame = +3

Query: 150  PNSDSQVLGLNAHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREV 326
            P        +  +DSQ YS + L SN+S+VG+Y                + I GAIKREV
Sbjct: 204  PGIPQDANSMKPNDSQPYSAS-LASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREV 262

Query: 327  GVVGVWRQHXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSK 506
            GVVGV RQ                       Q  QT VPD V+ S+P +RSF  +QY S+
Sbjct: 263  GVVGVRRQSTENSSD----------------QPRQTTVPDHVIPSMPVNRSFLGNQYGSR 306

Query: 507  SHQI-ANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ- 680
             HQ    HQKAPQ NKEW+PKSSQKSS +  GV GT A  +S    NS +   E   LQ 
Sbjct: 307  PHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQD 366

Query: 681  ---KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSE 851
               + +I EN++VII ++ RVPE +R  LTFGSF + F S      G QA  +A + S+E
Sbjct: 367  KLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAE 420

Query: 852  PCLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSASSEHPLPEKKQSSSTQDLEKY 1031
            P                  ++DL    + +  +      +SEH LP+KK+SSS Q+LE Y
Sbjct: 421  PSASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENY 480

Query: 1032 ADFALVQNNVPSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDAFQGQHLPS 1208
            AD  LV+ + PS+  E         +L +FP A+DPQ GYD+P+FRP MD+  +GQ LPS
Sbjct: 481  ADIGLVRESSPSYTPESQQQQER-HVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPS 539

Query: 1209 PLEVASSHGANIIPASTVAMIQPQ----PGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPM 1376
            P E  +SH AN IPAS++AM+Q Q    P  Q+Y QVH+  F N MPYRQFLSPV+VPPM
Sbjct: 540  PQEALASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPM 599

Query: 1377 AVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSP 1556
            A+PGYS NP+YSHPSN  +Y LMPG SSH  A GLKYG  Q KP+P+ S  GF  +TN  
Sbjct: 600  AMPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPT 659

Query: 1557 GYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDI 1730
            GYAINA G VG A   EDS+ +KYKDG  ++PN Q +TSEIWIQ PRE+PGLQSA YY++
Sbjct: 660  GYAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNM 719

Query: 1731 SGQVLNAAYMPSQTGNASFN--VAATQSTQMQFSGMYH-QPQPAPLANPHALXXXXXXXX 1901
              Q  +AAYMPS TG+ASFN   AA QS+ MQF G+YH  PQPA +A+PH L        
Sbjct: 720  PAQTPHAAYMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGGNV 779

Query: 1902 XXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                + HLNW+TNF
Sbjct: 780  GVGVAAAAPGPQVGAYQQPQLGHLNWTTNF 809


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  505 bits (1300), Expect = e-140
 Identities = 300/696 (43%), Positives = 391/696 (56%), Gaps = 33/696 (4%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            S+G++   G + S Q  NG  DS SRH   A+SN   RK +  E    +P++ S++  + 
Sbjct: 174  SSGNVKHSGVRSSSQASNGPPDSQSRHTRDATSNFTDRKAMTEEKRAVVPSAASRIQVMK 233

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXX-AKIGAIKREVGVVGVWRQHXX 359
               S ++ + TL S+NS+VG+Y                 A +GAIKREVGVVG  RQ   
Sbjct: 234  P--SSQHHSATLASSNSVVGVYSSSMDPVHVPSPESRSSAAVGAIKREVGVVGGRRQSSE 291

Query: 360  XXXXXXXXXXXXXXXX------------------KKFVQSSQTAVPDSVMTSLPTSRSFA 485
                                               K  Q ++    +S M S+   RSF 
Sbjct: 292  NAVKNSSASSSSFSNSVLGRDGSLPESFQPFPTISKNDQVNEPVATESAMPSISVGRSFL 351

Query: 486  SSQYSSKSHQIANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIE 665
             +QYS        HQKA Q NKEW+PKSSQK+S+ S GV GT     S P GNS +   +
Sbjct: 352  GNQYSRTHQTAVGHQKATQHNKEWKPKSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESD 411

Query: 666  VGHLQ----KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSA 833
               +Q    ++NI+EN++VII ++ RVPE +R  LTFGSF   FDS+RN   G QA    
Sbjct: 412  ATDMQEKLLRVNIYENQNVIIAQHIRVPETDRCRLTFGSFGVEFDSSRNMPSGFQAAGVT 471

Query: 834  QQSSSEPC--LXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQS 1004
            + S +E    L                +++L  +QV+ S SDSP S A SEH  P+K  S
Sbjct: 472  KDSKAESAASLSASAPESSSDDASGNKQVELLDEQVRNSGSDSPASGAVSEHQSPDK--S 529

Query: 1005 SSTQDLEKYADFALVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDA 1184
            SS  +L+ YAD  LV+++ P   SE       P L S F A+DPQ  YD+ +FRP +D+ 
Sbjct: 530  SSPPNLDNYADIGLVRDSSPFTSSESQHQQDPPELPS-FSAYDPQTVYDMSYFRPQIDET 588

Query: 1185 FQGQHLPSPLEVASSHGANIIPASTVAMIQPQ---PGAQLYPQVHLSQFPNFMPYRQFLS 1355
             +GQ L S  E   SH  + +PAS++ M+Q Q   P AQ+YPQVH+S + N MPYRQFLS
Sbjct: 589  VRGQGLQSAQEALISHRVDSMPASSIPMVQQQQQPPIAQMYPQVHVSHYTNLMPYRQFLS 648

Query: 1356 PVFVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGF 1535
            PV+VP MA+PGYS NP+Y HPSNG +Y LMPG SSH +A GLKYG  Q+KP+P SS  GF
Sbjct: 649  PVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSANGLKYGIQQFKPVPGSSPTGF 708

Query: 1536 CTYTNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQ 1709
              +T+  GYAINA G VG A   EDS+ MKYKDG  ++PN Q +TSEIW+Q PRE+PGLQ
Sbjct: 709  GNFTSPTGYAINAPGVVGSATGLEDSSRMKYKDGNLYVPNPQAETSEIWVQNPRELPGLQ 768

Query: 1710 SASYYDISGQVLNAAYMPSQTGNASFNVAATQSTQMQFSGMYHQPQPAP--LANPHALXX 1883
            SA YY++ GQ  +AAY+PS TG+ASFN AA QS+ MQFSG+Y  P P P  +ANPH L  
Sbjct: 769  SAPYYNMPGQSPHAAYLPSHTGHASFNAAAAQSSHMQFSGLYPPPPPTPAAMANPHHLGP 828

Query: 1884 XXXXXXXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                      + HLNW+TNF
Sbjct: 829  VMGGNVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 864


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  498 bits (1281), Expect = e-138
 Identities = 294/691 (42%), Positives = 391/691 (56%), Gaps = 28/691 (4%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            S+ S      K S Q L G SDS  R AH   S  + RKE+  E   T  +  S+V    
Sbjct: 168  SSNSEKPTASKNSSQGLYGPSDSHLRIAHDIESTGLVRKEVSEEKRVTFSSVASRVQAGK 227

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQHXX 359
            A+++ R  +  + S++S +G+Y                +  +GAIKREVGVVGV RQ   
Sbjct: 228  ANNA-RSQSAMVASSSSAIGVYSSSTDPVHVPSPDSRSSGSVGAIKREVGVVGVRRQSSD 286

Query: 360  XXXXXXXXXXXXXXXX-----KKFVQSSQTA--------VPDSVMTSLPTSRSFASSQYS 500
                                  + +QS  T           +S++ S+  SRS  SS YS
Sbjct: 287  NSKSSVPSSSFSNSLLGGEGSAETLQSFSTISKNDEVGQASESILPSVSVSRSLLSSHYS 346

Query: 501  SKSH--QIANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIE--- 665
            ++    Q   HQKA Q NKEW+PKSSQK SL + GV GT    +S P  NS  S  E   
Sbjct: 347  NRQQHQQPVGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAK 406

Query: 666  -VGHLQKINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQS 842
             +  L ++NI EN++VII ++ RVPE +R  LTFGSF   F+S  +   G QA  +  +S
Sbjct: 407  VLEKLSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGKEFESDSDLVNGYQA-GAIGES 465

Query: 843  SSEPCLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVSA-SSEHPLPEKKQSSSTQD 1019
            + E                   ++DL+ +Q++ S SDSP S  +SE+  P+KK+S+S Q+
Sbjct: 466  NGEAASSLSAPESSIGDASGSKQVDLTDEQIRNSGSDSPTSGGTSENQFPDKKESTSPQN 525

Query: 1020 LEKYADFALVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVM--DDAFQG 1193
            L+ YAD  LVQ N PS+         +P L   F A+D Q GYD P+FRP    D+A +G
Sbjct: 526  LDNYADIGLVQGNSPSYAPADSQQPEHPEL-PGFSAYDSQTGYDFPYFRPASATDEAMRG 584

Query: 1194 QHLPSPLEVASSHGANIIPASTVAMIQPQ---PGAQLYPQVHLSQFPNFMPYRQFLSPVF 1364
            Q LP+P E  SSH  N +P +T++M+Q Q   P AQ+YPQVH+S F N MPYRQFLSPV+
Sbjct: 585  QGLPTPQEAFSSHNTNSVP-TTISMVQQQQQPPVAQMYPQVHVSHFANLMPYRQFLSPVY 643

Query: 1365 VPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTY 1544
            VPPMA+PGYS +P+Y HPSNG +Y LMPG  +H  A  LKYG  Q+KP+P+ +  GF  +
Sbjct: 644  VPPMAMPGYSSSPAYPHPSNGNSYLLMPGGGTHLNANSLKYGVQQFKPVPAGNPTGFGNF 703

Query: 1545 TNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSAS 1718
            +N  GYAIN  G VGGA   EDS+ +KYKDG  ++PN Q +TSE+WIQ PRE+PGLQS  
Sbjct: 704  SNPNGYAINTPGVVGGATGLEDSSRIKYKDGNLYVPNPQAETSEMWIQNPRELPGLQSTP 763

Query: 1719 YYDISGQVLNAAYMPSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXX 1898
            YY++ GQ  +AAY+PS TG+AS+N AA QS+ MQF G+YH PQPA +ANPH L       
Sbjct: 764  YYNMPGQSPHAAYLPSHTGHASYNAAAAQSSHMQFPGLYHPPQPAAIANPHHLGPAMGGN 823

Query: 1899 XXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                 + HLNW+TNF
Sbjct: 824  VGVGVAAAAPGAQVGAYQQPQLGHLNWTTNF 854


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  496 bits (1278), Expect = e-137
 Identities = 304/714 (42%), Positives = 397/714 (55%), Gaps = 65/714 (9%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q  NG S S +RHA  A+S+ I RKE+  E    IPN+  +   +  ++SQ ++ T   S
Sbjct: 175  QTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQS-S 233

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ----------------- 350
            ++S+VG+Y                +  +GAIKREVGVVGV RQ                 
Sbjct: 234  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 293

Query: 351  HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NH 527
            +                   +  Q S T+  +S+M  +  SRSF S+QY S+ +Q A  H
Sbjct: 294  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 528  QK---------------------------APQSNKEWRPKSSQKSSLMSHGVSGTDANPI 626
            QK                           A Q NKEW+PK SQKSS+ + GV GT     
Sbjct: 354  QKEASYCSAFHPFIDQISLWESLSCIFDAANQHNKEWKPKLSQKSSVNNPGVIGTPKKSA 413

Query: 627  SSPEGNSLNSNIEVGHLQ----KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDST 794
            S P  ++   + E   LQ    ++NI+EN +VII ++ RVPE +R  LTFGSF   FDS 
Sbjct: 414  SPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSL 473

Query: 795  RNYAYGPQAHDSAQQSSSE--------PCLXXXXXXXXXXXXXCGNELDLSKDQVKTSRS 950
            RN+  G QA   A+ S+ E        P L              G  +++  DQ+  S S
Sbjct: 474  RNFVPGFQATGVAEDSNGESAARLVFSPNLSVSAPDTSSDDAAGGKPIEILDDQIGNSGS 533

Query: 951  DSPVSAS-SEHPLPEKKQSSSTQDLEKYADFALVQNNVPSH-PSEGXXXXXNPSLLSNFP 1124
            DSP+S + SEH LP+ K +SS Q+L+ YAD  LVQ+N PS+ PSE       P L S   
Sbjct: 534  DSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQKQQDPPELPSFSQ 593

Query: 1125 AFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGANIIPASTVAMIQPQ--PGAQLY 1298
            A+DPQ GYD+P+FRP +D+  +GQ LPSP E  S+H AN+ PAST+ M+Q Q  P AQ+Y
Sbjct: 594  AYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQQQPPVAQMY 652

Query: 1299 PQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGG 1478
            PQVH+S F N MPYRQF+SP+++P MA+PGYS NP+Y HPSNG +Y LMPG SSH  A G
Sbjct: 653  PQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANG 712

Query: 1479 LKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNG 1652
            LKYG  Q+KP+P+ S  GF  +T+  GYAINA G VG     EDS+ +KYKDG  ++PN 
Sbjct: 713  LKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQ 772

Query: 1653 QMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMPSQTGNASFNVAATQSTQMQFSGM 1832
            Q DTS++WIQ PRE+PGLQSA YY++        YMPS TG+ASFN AA QS+ MQF G+
Sbjct: 773  QADTSDLWIQNPRELPGLQSAPYYNM--PQTPHGYMPSHTGHASFNAAAAQSSHMQFPGL 830

Query: 1833 YH-QPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
            YH  PQPA +ANPH L                            + HLNW+TNF
Sbjct: 831  YHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 883


>ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293990 [Fragaria vesca
            subsp. vesca]
          Length = 915

 Score =  496 bits (1278), Expect = e-137
 Identities = 301/676 (44%), Positives = 393/676 (58%), Gaps = 27/676 (3%)
 Frame = +3

Query: 45   QNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTTTLVS 224
            Q LNG +DS  R +   S+ +I RKE   E    +PNS S+V     ++SQ +S     S
Sbjct: 257  QALNGQTDSRIRTSDANSTGTI-RKETSAEKRVALPNSASRVQAGRPNNSQPHSA----S 311

Query: 225  NNSIVGLYXXXXXXXXXXXXXXXX-AKIGAIKREVGVVGVWRQHXXXXXXXXXXXXXXXX 401
            N S++G+Y                 A +GAIKREVGVVGV +Q                 
Sbjct: 312  NTSVIGVYSSSTDPVHVPSPDSRPSASVGAIKREVGVVGVRKQSSDNSKSAVPSSSFSNS 371

Query: 402  XXKK------FVQSSQTAVPD-------SVMTSLPTSRSFASSQYSSKSH-QIANHQK-- 533
               K      F   +  + PD       SVM S+P SR+F S+Q++ + H Q   HQK  
Sbjct: 372  LLGKEGTAESFRSLTGISKPDQLDQTSESVMPSIPVSRTFISNQHNVRPHQQPVGHQKDA 431

Query: 534  APQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEV---GHLQKINIFENR 704
            A Q NKEW+PKSSQK S  + GV GT     S P+ + ++ +  V     L ++NI+EN 
Sbjct: 432  ASQPNKEWKPKSSQKPSSNNPGVIGTPTKSASPPDDSKVSESEAVQLQDKLARVNIYENC 491

Query: 705  HVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXXX 884
            +V+I +N RVPE++R  LTFGS   G +    +  GP      ++S+ EP          
Sbjct: 492  NVVIAQNIRVPESDRFRLTFGSL--GTELVNGFQAGP-----TEESNREPQASLSTSAPE 544

Query: 885  XXXXXCGNE-LDLSKDQVKTSRSD-SPVSASSEHPLPEKKQSSSTQDLEKYADFALVQNN 1058
                    + +DL  DQV+ S SD S  SA  EH LPEK+++SS Q L+ YAD  LV++N
Sbjct: 545  SHSDEASTKPIDLLDDQVRNSGSDFSAPSAVPEH-LPEKRETSSPQSLDNYADIGLVRDN 603

Query: 1059 VPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHG 1235
             PS  PS+      +P  +  F AFDPQ GYD+P++RP MD++  GQ LPSP E  SSH 
Sbjct: 604  SPSFTPSDS--QNQDPPEMQGFTAFDPQTGYDIPYYRPSMDESVHGQGLPSPQEALSSHN 661

Query: 1236 ANIIPASTVAMIQPQPG--AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSY 1409
            +N IPASTVAM+Q QP   AQ+YPQVH+S + N MPYRQ++SPV+VPPMAVPGYS NP+Y
Sbjct: 662  SNSIPASTVAMVQQQPPHVAQMYPQVHVSHYANMMPYRQYISPVYVPPMAVPGYSNNPAY 721

Query: 1410 SHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVG 1589
             H SNG +Y LMPG +SH  A  LKYG  Q+KP+ + S  GF  +TN  GYA+NA G VG
Sbjct: 722  PHMSNGNSYLLMPGGASHLNANSLKYGVQQFKPV-AGSPTGFGNFTNPAGYAMNAPGVVG 780

Query: 1590 GAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMP 1763
            GA   EDS+ MKYKDG  ++PN Q +TSEIWIQ PRE PG+QSA YY++ GQ  +AAYMP
Sbjct: 781  GATGLEDSSRMKYKDGNLYVPNPQAETSEIWIQNPREHPGMQSAPYYNMPGQTPHAAYMP 840

Query: 1764 SQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXXX 1943
            S  G+ASFN AA QS+ MQ+ GMYH PQPA +A+PH +                      
Sbjct: 841  SHGGHASFNAAAAQSSHMQYPGMYHPPQPAAMASPHHM-GPAMPGNVGVGVAAAAPGAQA 899

Query: 1944 XXXXXXINHLNWSTNF 1991
                  +NH+NW+TNF
Sbjct: 900  YQQQPQLNHMNWTTNF 915


>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  494 bits (1273), Expect = e-137
 Identities = 294/681 (43%), Positives = 387/681 (56%), Gaps = 18/681 (2%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            S  +L     + S Q  NG +    R+   A S +  RK +  E   T  N+ +    + 
Sbjct: 183  SNNNLKPSNAQSSSQTSNGPTYPEPRYNRDAKSRAGDRKVVSEEKRSTASNATTSRAQVV 242

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXA-KIGAIKREVGVVGVWRQ--- 350
              ++ +    +L S+NS+VG+Y                +  +GAIKREVGVVG  RQ   
Sbjct: 243  KPNNSQQHDASLASSNSVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSEN 302

Query: 351  --HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSH-QIA 521
                                     Q  QTAV +S M S+P +RS   +QY+S+ H Q  
Sbjct: 303  AVKDLSSSNSFSESFHPLTAISNTDQVRQTAVIES-MPSVPVNRSLLHNQYNSRPHQQTV 361

Query: 522  NHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KIN 689
             + KA Q NKEW+PKSSQKSS+ S GV GT       P  NS +  +   +LQ    ++N
Sbjct: 362  GYPKASQHNKEWKPKSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAANLQDKFSRVN 421

Query: 690  IFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXX 869
            I EN++VII ++ RVPE++R  LTFGSF   FD +RN   G QA   +++S+ E  +   
Sbjct: 422  IHENQNVIIAQHIRVPESDRCKLTFGSFGVEFDPSRNSTPGFQAVGISEESNRESAISLP 481

Query: 870  XXXXXXXXXXC--GNELDLSKDQVKTSRSDSP-VSASSEHPLPEKKQSSSTQDLEKYADF 1040
                         G +++L  DQ + S SDSP    +SEH LPEK  SSS  DL+ YAD 
Sbjct: 482  ASCPESSSEDAPGGKQIELLDDQARNSESDSPEAGLASEHQLPEK--SSSPPDLDNYADI 539

Query: 1041 ALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLE 1217
             LV+N+ PS+ PSE      +P L S F A+DPQ GYD+ +F+P +D+  QGQ  PSP E
Sbjct: 540  GLVRNSSPSYAPSESQQQQDHPELPS-FSAYDPQTGYDMSYFQPPIDETVQGQGQPSPRE 598

Query: 1218 VASSHGANIIPASTVAMIQPQPG-AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYS 1394
              ++H  N IP ST+  +Q QP  AQ+YPQVH+S F N MPYRQF+SPV+VPPM +PGYS
Sbjct: 599  ALTAHTGNHIPTSTMPTMQQQPPMAQMYPQVHVSPFTNLMPYRQFISPVYVPPMPMPGYS 658

Query: 1395 RNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINA 1574
             NP+Y HPSNG +Y LMPG  SH  A GLKYG   YKP+PSS+  GF  +T+  GYAINA
Sbjct: 659  SNPAYPHPSNGNSYMLMPGGGSHLNANGLKYGIQHYKPVPSSNPAGFGNFTSPSGYAINA 718

Query: 1575 QGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLN 1748
             G VG A   ED + MKYKDG  ++PN Q ++SEIWIQ PR++PGLQS+ YY+I GQ  +
Sbjct: 719  PGVVGSAAGLEDPSRMKYKDGNIYVPNPQAESSEIWIQNPRDLPGLQSSPYYNIPGQT-H 777

Query: 1749 AAYMPSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXX 1928
            AAY+PS TG+ASFN AA QS+ MQF G+Y  PQP  +A+PH L                 
Sbjct: 778  AAYLPSHTGHASFNAAAAQSSHMQFPGLYPPPQPTAMASPHHLGPVMGNNVGVGVAPSAP 837

Query: 1929 XXXXXXXXXXXINHLNWSTNF 1991
                       + HLNW+TNF
Sbjct: 838  GAQVGAYQQPQLGHLNWTTNF 858


>ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prunus persica]
            gi|462411120|gb|EMJ16169.1| hypothetical protein
            PRUPE_ppa001749mg [Prunus persica]
          Length = 771

 Score =  494 bits (1272), Expect = e-137
 Identities = 296/680 (43%), Positives = 382/680 (56%), Gaps = 27/680 (3%)
 Frame = +3

Query: 33   KKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLNAHDSQRYSTT 212
            + S Q  NG +D   R +   ++ S+ RKE LVE   T+P +  +V  +   +SQ +S  
Sbjct: 101  QNSSQVSNGQTDPQIRTSDANATGSL-RKETLVEKRVTLPTAALRVQAVKPSNSQPHSAV 159

Query: 213  TLVSNNSIVGLYXXXXXXXXXXXXXXXX-AKIGAIKREVGVVGVWRQHXXXXXXXXXXXX 389
             +VS+NS+VGLY                 A +GAIKREVGV     ++            
Sbjct: 160  -VVSSNSVVGLYSSSTDPVHVPSPDSRPSASVGAIKREVGVRRQSSENSNSSAPSSSLSN 218

Query: 390  XXXXXX------KKFVQSSQT----AVPDSVMTSLPTSRSFASSQYSSKSHQI-ANHQKA 536
                        + F   S+T       +SVM S+  SR F S+Q++++ HQ    HQKA
Sbjct: 219  SLLGKEGSTESFRPFTGISKTDQVGQTSESVMPSVSVSRPFLSNQHNARPHQQPVGHQKA 278

Query: 537  PQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFENR 704
             Q NKEW+PKSSQK S  S GV GT    +SSP+ NS  S  E   LQ    ++N+++N 
Sbjct: 279  SQPNKEWKPKSSQKPSSNSPGVIGTPTKSVSSPD-NSKVSESEAAKLQDKLSRVNVYDNS 337

Query: 705  HVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXXX 884
            +V+I +N RVP+++R  LTFGS  +  DST N   G QA    ++S+ EP          
Sbjct: 338  NVVIAQNIRVPDSDRFRLTFGSLGTELDSTGNMVNGFQA-GGTEESNGEPA----GSLSL 392

Query: 885  XXXXXCGNE------LDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSSTQDLEKYADFA 1043
                 C +E      +DL   QV+ S SDSP S A  E  LPEK  +SS Q L+ YAD  
Sbjct: 393  SAPQSCSDEASGIKPVDLLDHQVRNSGSDSPASGAVPERQLPEKNDTSSPQTLDNYADIG 452

Query: 1044 LVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVA 1223
            LV++  PS+          P L   F AFDPQ  Y++P+FRP MD++ +GQ LPSP E  
Sbjct: 453  LVRDTSPSYAPSDSQQQEQPEL-EGFSAFDPQTSYNIPYFRPHMDESVRGQGLPSPQEAL 511

Query: 1224 SSHGANIIPASTVAMIQ--PQPGAQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSR 1397
            SSH  N I ASTVAM+Q  P P AQ+YPQVH+S + N MPYRQFLSPV+VPPMAVPGYS 
Sbjct: 512  SSHNVNSIAASTVAMVQQQPPPVAQMYPQVHVSHYANLMPYRQFLSPVYVPPMAVPGYSS 571

Query: 1398 NPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQ 1577
            NP+Y H SNG +Y LMPG  SH  A  LKYG   +KP+P+ S  G+  +TN  GYAIN  
Sbjct: 572  NPAYPHMSNGNSYLLMPGGGSHLNANSLKYGVQPFKPVPAGSPTGYGNFTNPNGYAINGP 631

Query: 1578 GTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNA 1751
            G VGGA   EDS+ +KYKDG  ++ N Q +TSE+WIQ PRE PGLQS  YY++  Q  + 
Sbjct: 632  GVVGGASGLEDSSRIKYKDGNLYVANPQAETSEMWIQNPREHPGLQSTPYYNVPAQSPHG 691

Query: 1752 AYMPSQTGNASFNVAATQSTQMQFSGMYHQPQPAPLANPHALXXXXXXXXXXXXXXXXXX 1931
            AYMPS   +ASFN AA QS+ MQF G+YH PQPA + NPH L                  
Sbjct: 692  AYMPSHAAHASFNAAAAQSSHMQFPGLYHPPQPAAIPNPHHLGPAMGGNVGVGVAAAAPG 751

Query: 1932 XXXXXXXXXXINHLNWSTNF 1991
                      +NH+NW TNF
Sbjct: 752  AQVGAYQQPQLNHMNWQTNF 771


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  491 bits (1265), Expect = e-136
 Identities = 296/674 (43%), Positives = 392/674 (58%), Gaps = 21/674 (3%)
 Frame = +3

Query: 33   KKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSD-SQVLGLNAHDSQRYST 209
            + S Q  NG  DS  RH   A+S+   RK +  E      N+  S+V    +++SQ+++ 
Sbjct: 190  RSSHQASNGPIDSEPRHNRDANSSVGDRKVVSEEKRSVASNATTSRVQVAKSNNSQQHNA 249

Query: 210  TTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREVGVVGVWRQHXXXXXXXXXXX 386
                S+N +VG+Y                + + GAIKREVGVVG  RQ            
Sbjct: 250  LQ-ASSNPVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSFENAVKDLSSS 308

Query: 387  XXXXXXXKKFV------QSSQTAVPDSVMTSLPTSRSFASSQYSSKSHQIA-NHQKAPQS 545
                   + F       Q SQTA  +  M S+P +RSF ++QY+++ HQ A  H KA Q 
Sbjct: 309  NSFSESFRPFTAISKTDQVSQTAAIEP-MPSVPVNRSFLNNQYNNRPHQQAVGHPKASQH 367

Query: 546  NKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVGHLQ----KINIFENRHVI 713
            NKEW+PKSSQKSS+ S GV GT     S P  NS N  ++  +LQ    +INI EN++VI
Sbjct: 368  NKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRINIHENQNVI 427

Query: 714  IPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQSSSEPCLXXXXXXXXXXX 893
            I ++ RVPE +R  LTFGSF  GFD+ R   +  QA   +++S+ E  +           
Sbjct: 428  IAQHIRVPETDRCKLTFGSFGVGFDAPRTPGF--QAVGISEESNGESAISLPASAPDSSS 485

Query: 894  XXC--GNELDLSKDQVKTSRSDSPV-SASSEHPLPEKKQSSSTQDLEKYADFALVQNNVP 1064
                 G +++L  DQ +   SDSP  S  SEHPLP    SSS  +L+ YAD  LV+N+ P
Sbjct: 486  DDASGGKQIELLDDQARNYGSDSPAASLESEHPLPVN--SSSPPNLDNYADIGLVRNSSP 543

Query: 1065 SH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAFQGQHLPSPLEVASSHGAN 1241
            S+ PSE      +P L S F A+DPQ GYD+ +FRP +D+  +GQ LPSP E  ++H AN
Sbjct: 544  SYAPSESQQQQDHPELPS-FSAYDPQTGYDISYFRPQIDETVRGQGLPSPQEALTTHTAN 602

Query: 1242 IIPASTVAMIQPQPG-AQLYPQVHLSQFPNFMPYRQFLSPVFVPPMAVPGYSRNPSYSHP 1418
            + PAST++ +Q QP  AQ+YPQVH+SQF N +PYRQF+SPV+VPPM +PGYS +P+Y HP
Sbjct: 603  V-PASTMSTVQQQPPMAQMYPQVHVSQFTNLVPYRQFISPVYVPPMPMPGYSSSPAYPHP 661

Query: 1419 SNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCTYTNSPGYAINAQGTVGGAI 1598
            SNG +Y LMPG  SH  A GLKYG   YKP+P ++  GF  + +  GYAINA G VG A 
Sbjct: 662  SNGNSYLLMPGGGSHLNANGLKYGIQHYKPVPGNNPAGFGNFVSPSGYAINAPGVVGSAT 721

Query: 1599 SFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQSASYYDISGQVLNAAYMPSQT 1772
              EDS+ MKYKDG  ++PN Q + SEIWIQ PREIPG+QSA YY++ GQ  + AY+PS T
Sbjct: 722  GLEDSSRMKYKDGNLYVPNPQAEASEIWIQNPREIPGMQSAPYYNMPGQT-HTAYLPSHT 780

Query: 1773 GNASFNVAATQSTQMQFSGMY-HQPQPAPLANPHALXXXXXXXXXXXXXXXXXXXXXXXX 1949
            G+ASFN AA QS+ MQF G+Y   PQP  + +PH L                        
Sbjct: 781  GHASFNAAAAQSSHMQFPGLYPPTPQPTAMPSPHHLGPVMGGNVGVGVAPSAPGAQVGAY 840

Query: 1950 XXXXINHLNWSTNF 1991
                + HLNW+TNF
Sbjct: 841  QQPQLGHLNWTTNF 854


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  481 bits (1239), Expect = e-133
 Identities = 288/695 (41%), Positives = 386/695 (55%), Gaps = 32/695 (4%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            +TGS    GG+   Q  NG ++   RHA+  +     R E   E   T     S V  + 
Sbjct: 178  TTGSEKPSGGRSFSQASNGSTNLHPRHAYDHNITGTDRIEPSAEKFTT-----SAVNFIQ 232

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKIGAIKREVGVVGVWRQ---- 350
             + ++ YS T L S+NS+ G +                + +GAIKREVGVVG  RQ    
Sbjct: 233  HNITEGYSAT-LASSNSVGGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDN 291

Query: 351  ------------HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQ 494
                                            K  Q +Q A  DS +  +P +R+  ++Q
Sbjct: 292  AVKDSTAPCSSFSNSILGRDNSDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQ 351

Query: 495  YSSKSHQIA-NHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVG 671
            Y+ +SHQ +  HQKA Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V 
Sbjct: 352  YTGRSHQQSVGHQKASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVA 411

Query: 672  HLQ----KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQ 839
             LQ    ++NI EN++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++
Sbjct: 412  KLQDELSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEE 471

Query: 840  SSSEPCLXXXXXXXXXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSS 1007
            S+ E                      +D+  D V+ S S+SP S  +SEH LP+  K +S
Sbjct: 472  SNGESAASLTGAASKTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDAS 531

Query: 1008 STQDLEKYADFALVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAF 1187
            S QDL+ YAD  LV++  PS+P        + S L++FPA+D Q GYD+ +FRP MD++ 
Sbjct: 532  SPQDLDGYADIGLVRDTDPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESV 591

Query: 1188 QGQHLPSPLEVASSHGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLSP 1358
            +GQ LPSP E  +SH AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +SP
Sbjct: 592  RGQGLPSPQEALASHSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISP 651

Query: 1359 VFVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFC 1538
            V+VP MA+PGYS NP+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF 
Sbjct: 652  VYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFG 711

Query: 1539 TYTNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQS 1712
             +T+  GYAINA   VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+QS
Sbjct: 712  NFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQS 771

Query: 1713 ASYYDISGQVLN-AAYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXX 1886
              YY++  Q  + AAY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +   
Sbjct: 772  GPYYNMPAQTPHAAAYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGPA 831

Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                     + + NWS NF
Sbjct: 832  MGGNVGVGVPPAAPGAQVGAYQQPQLGNFNWSPNF 866


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  477 bits (1227), Expect = e-131
 Identities = 288/696 (41%), Positives = 386/696 (55%), Gaps = 33/696 (4%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            +TGS    GG+   Q  NG ++   RHA+  +     R E   E   T     S V  + 
Sbjct: 178  TTGSEKPSGGRSFSQASNGSTNLHPRHAYDHNITGTDRIEPSAEKFTT-----SAVNFIQ 232

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKIGAIKREVGVVGVWRQ---- 350
             + ++ YS T L S+NS+ G +                + +GAIKREVGVVG  RQ    
Sbjct: 233  HNITEGYSAT-LASSNSVGGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDN 291

Query: 351  ------------HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQ 494
                                            K  Q +Q A  DS +  +P +R+  ++Q
Sbjct: 292  AVKDSTAPCSSFSNSILGRDNSDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQ 351

Query: 495  YSSKSHQIA-NHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVG 671
            Y+ +SHQ +  HQKA Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V 
Sbjct: 352  YTGRSHQQSVGHQKASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVA 411

Query: 672  HLQ----KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQ 839
             LQ    ++NI EN++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++
Sbjct: 412  KLQDELSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEE 471

Query: 840  SSSEPCLXXXXXXXXXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSS 1007
            S+ E                      +D+  D V+ S S+SP S  +SEH LP+  K +S
Sbjct: 472  SNGESAASLTGAASKTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDAS 531

Query: 1008 STQDLEKYADFALVQNNVPSHPSEGXXXXXNPSLLSNFP-AFDPQPGYDVPFFRPVMDDA 1184
            S QDL+ YAD  LV++  PS+P        + S L++FP A+D Q GYD+ +FRP MD++
Sbjct: 532  SPQDLDGYADIGLVRDTDPSYPLSESQQQQDSSELASFPQAYDSQTGYDMSYFRPTMDES 591

Query: 1185 FQGQHLPSPLEVASSHGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLS 1355
             +GQ LPSP E  +SH AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +S
Sbjct: 592  VRGQGLPSPQEALASHSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIIS 651

Query: 1356 PVFVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGF 1535
            PV+VP MA+PGYS NP+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF
Sbjct: 652  PVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGF 711

Query: 1536 CTYTNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQ 1709
              +T+  GYAINA   VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+Q
Sbjct: 712  GNFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQ 771

Query: 1710 SASYYDISGQVLN-AAYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXX 1883
            S  YY++  Q  + AAY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +  
Sbjct: 772  SGPYYNMPAQTPHAAAYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGP 831

Query: 1884 XXXXXXXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                      + + NWS NF
Sbjct: 832  AMGGNVGVGVPPAAPGAQVGAYQQPQLGNFNWSPNF 867


>ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 862

 Score =  471 bits (1213), Expect = e-130
 Identities = 284/695 (40%), Positives = 380/695 (54%), Gaps = 32/695 (4%)
 Frame = +3

Query: 3    STGSLIDCGGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEETIPNSDSQVLGLN 182
            +TGS    GG+   Q  NG ++   RHA+  +     R E   E   T        +   
Sbjct: 178  TTGSERPSGGRSFSQASNGSTNLHPRHAYDHNITGTDRIEPSAEKFTT------SAVNFI 231

Query: 183  AHDSQRYSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKIGAIKREVGVVGVWRQ---- 350
             H+     + TL S+NS+ G +                + +GAIKREVGVVG  RQ    
Sbjct: 232  QHNITEGHSATLASSNSVGGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDN 291

Query: 351  ------------HXXXXXXXXXXXXXXXXXXKKFVQSSQTAVPDSVMTSLPTSRSFASSQ 494
                                            K  Q +Q A  DS + +    R+  ++Q
Sbjct: 292  AVRDSTAPRSSFSNSILGRDNSDSFRPFPSISKADQINQIAATDSGVAN----RALFTNQ 347

Query: 495  YSSKSHQIA-NHQKAPQSNKEWRPKSSQKSSLMSHGVSGTDANPISSPEGNSLNSNIEVG 671
            Y+ +SHQ +  HQKA Q NKEW+PKSSQKS+++  GV GT     S P  +S +   +V 
Sbjct: 348  YTGRSHQQSVGHQKASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVA 407

Query: 672  HLQ----KINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTRNYAYGPQAHDSAQQ 839
             LQ    ++NI EN++VII ++ RVPE +R  LTFGSF   F+S+RN   G  A  SA++
Sbjct: 408  KLQDELSRVNINENQNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEE 467

Query: 840  SSSEPCLXXXXXXXXXXXXXCGNE--LDLSKDQVKTSRSDSPVSA-SSEHPLPEK-KQSS 1007
            S+ E                      +D+  D V+ S S+SP S  +SEH LP+  K +S
Sbjct: 468  SNGESAASLTGAASKTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDAS 527

Query: 1008 STQDLEKYADFALVQNNVPSHPSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAF 1187
            S QDL+ YAD  LV++  PS+P        + S L++FPA+D Q GYD+ +FRP MD++ 
Sbjct: 528  SPQDLDGYADIGLVRDTDPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESV 587

Query: 1188 QGQHLPSPLEVASSHGANIIPASTVAMIQPQPG---AQLYPQVHLSQFPNFMPYRQFLSP 1358
            +GQ LPSP E  +SH AN IPAS++AM+Q Q     AQ+YPQVH+S FPN MPYRQ +SP
Sbjct: 588  RGQGLPSPQEALASHSANSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISP 647

Query: 1359 VFVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFC 1538
            V+VP MA+PGYS NP+Y HPSNG +Y LMPG SSH +  GLKYG  Q+KP+P++S  GF 
Sbjct: 648  VYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFG 707

Query: 1539 TYTNSPGYAINAQGTVGGAISFEDSTGMKYKDG--FIPNGQMDTSEIWIQTPREIPGLQS 1712
             +T+  GYAINA   VG     EDS+ MKYKDG  ++ N Q DTSE+WI  PRE+PG+QS
Sbjct: 708  NFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQS 767

Query: 1713 ASYYDISGQVLN-AAYMPSQTGNASFNVAATQSTQMQFSGMYH-QPQPAPLANPHALXXX 1886
              YY++  Q  + AAY+PS  G+ASFN A  QS+ MQF GMYH   QP  +ANPH +   
Sbjct: 768  GPYYNMPAQTPHAAAYLPSHAGHASFNAAVPQSSHMQFPGMYHPTAQPPAMANPHHMGPA 827

Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXINHLNWSTNF 1991
                                     + + NWS NF
Sbjct: 828  MGGNVGVGVPPAAPGAQVGAYQQPQLGNFNWSPNF 862


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  466 bits (1200), Expect = e-128
 Identities = 280/693 (40%), Positives = 392/693 (56%), Gaps = 38/693 (5%)
 Frame = +3

Query: 27   GGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEET--IPNSDSQVLGLNAHDSQR 200
            G + S    NG SDS +R+   A  N I RK    + ++   I N+  +V  +  +++ +
Sbjct: 169  GSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQ 228

Query: 201  YSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREVGVVGVWRQHXXXXXXXX 377
             S + + S +S VG+Y                + + GAI+REVGVVGV RQ         
Sbjct: 229  NSAS-VASTSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQS 287

Query: 378  XXXXXXXXXXK---------------KFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSH 512
                      K               K  Q SQT V +  ++ +P SR   ++QY+++ H
Sbjct: 288  FAPSISYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNRPH 347

Query: 513  Q-IANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTD-------ANPISSPEGNSLNSNIEV 668
            Q +  HQ+  Q NKEW+PKSSQK +  S GV GT        A+P +   G+  ++  E+
Sbjct: 348  QQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTEL 407

Query: 669  -GHLQKINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTR----NYAYGPQAHDSA 833
               L ++NI+EN++VII ++ RVPE +R  LTFG+  +  DS+R     +  G     + 
Sbjct: 408  QDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEKSNE 467

Query: 834  QQSSSEPCLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSS 1010
            + ++S   L                ++DL  + +++SRSDSPVS A+SE  LP+ K SS+
Sbjct: 468  ELTAS---LTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSN 524

Query: 1011 TQDLEKYADFALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAF 1187
            TQ+L+ YA+  LV+++ PS+ PSE      +   +  F A+DP  GYD+P+FRP +D+  
Sbjct: 525  TQNLDNYANIGLVRDSSPSYAPSEPQQQDSHD--MPGFAAYDPPAGYDIPYFRPTIDETV 582

Query: 1188 QGQHLPSPLEVASSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPV 1361
            +GQ L SP E   SH  N  PAST+AM+Q Q  P  Q+YPQVH+S F N MPYRQFLSPV
Sbjct: 583  RGQGLSSPQEALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPV 642

Query: 1362 FVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCT 1541
            +VPPMA+PGYS NP Y HP+NG +Y LMPG  SH  A  LKYG  Q+KP+P+ S  GF  
Sbjct: 643  YVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGN 702

Query: 1542 YTNSPGYAINAQGTVGGAISFEDSTGMKYKDG-FIPNGQMDTSEIWIQTPREIPGLQSAS 1718
            + N  GYA+   G VGGA + EDS+ +KYKD  ++PN Q +TSEIW+Q PR++PG+QS  
Sbjct: 703  FANPTGYAMITPGVVGGATALEDSSRVKYKDNLYVPNPQAETSEIWLQNPRDLPGMQSTP 762

Query: 1719 YYDISGQVLNAAYMPSQTGNASFNVAATQSTQMQFSGMYHQP-QPAPLANPHALXXXXXX 1895
            YY++ GQ  +AAYMPS TG+ASFN AA QS+ MQF GMYH P QPA +A+PH L      
Sbjct: 763  YYNMPGQTPHAAYMPSHTGHASFNAAAAQSSHMQFPGMYHTPPQPAAMASPHHLGPPAIG 822

Query: 1896 XXXXXXXXXXXXXXXXXXXXXX-INHLNWSTNF 1991
                                   + H+NW+TNF
Sbjct: 823  NNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 855


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  466 bits (1200), Expect = e-128
 Identities = 280/693 (40%), Positives = 392/693 (56%), Gaps = 38/693 (5%)
 Frame = +3

Query: 27   GGKKSFQNLNGFSDSGSRHAHVASSNSIHRKELLVETEET--IPNSDSQVLGLNAHDSQR 200
            G + S    NG SDS +R+   A  N I RK    + ++   I N+  +V  +  +++ +
Sbjct: 177  GSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQ 236

Query: 201  YSTTTLVSNNSIVGLYXXXXXXXXXXXXXXXXAKI-GAIKREVGVVGVWRQHXXXXXXXX 377
             S + + S +S VG+Y                + + GAI+REVGVVGV RQ         
Sbjct: 237  NSAS-VASTSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQS 295

Query: 378  XXXXXXXXXXK---------------KFVQSSQTAVPDSVMTSLPTSRSFASSQYSSKSH 512
                      K               K  Q SQT V +  ++ +P SR   ++QY+++ H
Sbjct: 296  FAPSISYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNRPH 355

Query: 513  Q-IANHQKAPQSNKEWRPKSSQKSSLMSHGVSGTD-------ANPISSPEGNSLNSNIEV 668
            Q +  HQ+  Q NKEW+PKSSQK +  S GV GT        A+P +   G+  ++  E+
Sbjct: 356  QQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTEL 415

Query: 669  -GHLQKINIFENRHVIIPENFRVPEAERELLTFGSFESGFDSTR----NYAYGPQAHDSA 833
               L ++NI+EN++VII ++ RVPE +R  LTFG+  +  DS+R     +  G     + 
Sbjct: 416  QDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEKSNE 475

Query: 834  QQSSSEPCLXXXXXXXXXXXXXCGNELDLSKDQVKTSRSDSPVS-ASSEHPLPEKKQSSS 1010
            + ++S   L                ++DL  + +++SRSDSPVS A+SE  LP+ K SS+
Sbjct: 476  ELTAS---LTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSN 532

Query: 1011 TQDLEKYADFALVQNNVPSH-PSEGXXXXXNPSLLSNFPAFDPQPGYDVPFFRPVMDDAF 1187
            TQ+L+ YA+  LV+++ PS+ PSE      +   +  F A+DP  GYD+P+FRP +D+  
Sbjct: 533  TQNLDNYANIGLVRDSSPSYAPSEPQQQDSHD--MPGFAAYDPPAGYDIPYFRPTIDETV 590

Query: 1188 QGQHLPSPLEVASSHGANIIPASTVAMIQPQ--PGAQLYPQVHLSQFPNFMPYRQFLSPV 1361
            +GQ L SP E   SH  N  PAST+AM+Q Q  P  Q+YPQVH+S F N MPYRQFLSPV
Sbjct: 591  RGQGLSSPQEALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPV 650

Query: 1362 FVPPMAVPGYSRNPSYSHPSNGGNYFLMPGCSSHRAAGGLKYGASQYKPIPSSSSNGFCT 1541
            +VPPMA+PGYS NP Y HP+NG +Y LMPG  SH  A  LKYG  Q+KP+P+ S  GF  
Sbjct: 651  YVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGN 710

Query: 1542 YTNSPGYAINAQGTVGGAISFEDSTGMKYKDG-FIPNGQMDTSEIWIQTPREIPGLQSAS 1718
            + N  GYA+   G VGGA + EDS+ +KYKD  ++PN Q +TSEIW+Q PR++PG+QS  
Sbjct: 711  FANPTGYAMITPGVVGGATALEDSSRVKYKDNLYVPNPQAETSEIWLQNPRDLPGMQSTP 770

Query: 1719 YYDISGQVLNAAYMPSQTGNASFNVAATQSTQMQFSGMYHQP-QPAPLANPHALXXXXXX 1895
            YY++ GQ  +AAYMPS TG+ASFN AA QS+ MQF GMYH P QPA +A+PH L      
Sbjct: 771  YYNMPGQTPHAAYMPSHTGHASFNAAAAQSSHMQFPGMYHTPPQPAAMASPHHLGPPAIG 830

Query: 1896 XXXXXXXXXXXXXXXXXXXXXX-INHLNWSTNF 1991
                                   + H+NW+TNF
Sbjct: 831  NNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 863

