BLASTX nr result

ID: Cocculus23_contig00004639 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00004639
         (2556 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   610   e-172
ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma...   605   e-170
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   605   e-170
ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   603   e-169
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   601   e-169
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   595   e-167
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     595   e-167
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   594   e-167
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   592   e-166
ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   590   e-165
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   579   e-162
ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293...   579   e-162
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              578   e-162
ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [...   575   e-161
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   573   e-160
ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prun...   571   e-160
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   569   e-159
ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like i...   563   e-157
ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phas...   557   e-156
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   557   e-155

>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  610 bits (1574), Expect = e-172
 Identities = 352/781 (45%), Positives = 474/781 (60%), Gaps = 36/781 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKTKIRTYSGRNAQNGGHPRNGFSGTHQEFRVVRDKRGNQSTSRELXX 2371
            S +R E+ G      K R Y  R ++ G + RN   G ++EFRVVRD R NQ+ ++++  
Sbjct: 80   SRKRSENVG---QGMKFRPYPERGSRRGSYTRNTLPGVNREFRVVRDNRVNQNANKDMKT 136

Query: 2370 XXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGL 2200
                     N+QV      K STGT  +      +   Q  NG S S  RHA  A S+G+
Sbjct: 137  PFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQTSNGPSSSQTRHARDANSSGI 196

Query: 2199 DKKELLVETDRTVPNS--DSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS- 2029
            D+KE+  E    +PN+   SQA++ N   + Q  +     ++S+ G+Y SS+DPVHVPS 
Sbjct: 197  DRKEISEEKRNFIPNAVLRSQAVKPN---NSQAHAATQSSSSSVVGVYSSSTDPVHVPSP 253

Query: 2028 DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT---------------- 1897
            D+RSSG +GAIKREVGVVGVRRQP+ N                                 
Sbjct: 254  DSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSSEAFRSFPSIS 313

Query: 1896 -VVQSNQTAVADXXXXXXXXXXXS--NQY-SIKSYPTVNYQKAPQSNKEWRPKSSKKSSL 1729
               Q + T+  +              NQY S ++   + +QKA Q NKEW+PK S+KSS+
Sbjct: 314  RADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNKEWKPKLSQKSSV 373

Query: 1728 MSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWAL 1549
             + GV GT     S PA+++     E   LQ+ F + NI EN++VII +H++VPE +   
Sbjct: 374  NNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCR 433

Query: 1548 LTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLPDD 1375
            LTFGSFG  FDS RN+  G Q  G AE SN ES   L VSAP  S+ D + G  +++ DD
Sbjct: 434  LTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAPDTSSDDAAGGKPIEILDD 493

Query: 1374 QVRTSRSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDP 1195
            Q+  S SDS  S   SEH+LP+ +++S  Q+L++Y+D  L QDN+ S+ P+  Q +QQDP
Sbjct: 494  QIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQ-KQQDP 552

Query: 1194 SLLSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ- 1018
              L +F A+DP  GYD+P+F+  +D T +G+G PSPQE +S+H AN+ PAST+ M+ QQ 
Sbjct: 553  PELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQQQ 611

Query: 1017 -PVAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCG 841
             PVAQ+YPQVH+S F N MP RQF+SP++LP MA+PGYS NPAY HPSNGS+Y+LMPG  
Sbjct: 612  PPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGS 671

Query: 840  SHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG 667
            SHL A GLKY   Q+KP+P  SPTGF  +T+  GYA+N  G       +EDS+ +KYKDG
Sbjct: 672  SHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDG 731

Query: 666  --YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPA 493
              Y+PN+Q DTS++W+Q  RE+PGLQ A YYN+   TP+  Y+ +  GHASFN AA Q +
Sbjct: 732  NIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH-GYMPSHTGHASFNAAAAQSS 789

Query: 492  QIQYPGFYHPPQPP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSH 319
             +Q+PG YHPP  P  +ANPH L   M  NV VGVA A P AQ  AY Q Q+ H NW+++
Sbjct: 790  HMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTN 848

Query: 318  F 316
            F
Sbjct: 849  F 849


>ref|XP_007024586.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508779952|gb|EOY27208.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 761

 Score =  605 bits (1561), Expect = e-170
 Identities = 348/768 (45%), Positives = 469/768 (61%), Gaps = 38/768 (4%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQV 2332
            K R Y  R ++ G + RN    +G ++EFRVVRD R NQ+ ++++           N+QV
Sbjct: 2    KFRPYPERGSRRGSYTRNTLPDAGVNREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQV 61

Query: 2331 VPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTV 2161
                  K STGT  +      +   Q  NG S S  RHA  A S+G+D+KE+  E    +
Sbjct: 62   PVNVAEKGSTGTSSNQRPFSSRSLSQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFI 121

Query: 2160 PNS--DSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAIKR 1990
            PN+   SQA++ N   + Q  +     ++S+ G+Y SS+DPVHVPS D+RSSG +GAIKR
Sbjct: 122  PNAVLRSQAVKPN---NSQAHAATQSSSSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKR 178

Query: 1989 EVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT-----------------VVQSNQTAVADX 1861
            EVGVVGVRRQP+ N                                    Q + T+  + 
Sbjct: 179  EVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATES 238

Query: 1860 XXXXXXXXXXS--NQY-SIKSYPTVNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPI 1690
                         NQY S ++   + +QKA Q NKEW+PK S+KSS+ + GV GT     
Sbjct: 239  IMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSA 298

Query: 1689 SSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDST 1510
            S PA+++     E   LQ+ F + NI EN++VII +H++VPE +   LTFGSFG  FDS 
Sbjct: 299  SPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSL 358

Query: 1509 RNYAYGPQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSA 1336
            RN+  G Q  G AE SN ES   L VSAP  S+ D + G  +++ DDQ+  S SDS  S 
Sbjct: 359  RNFVPGFQATGVAEDSNGESAASLSVSAPDTSSDDAAGGKPIEILDDQIGNSGSDSPLSG 418

Query: 1335 APSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHP 1156
              SEH+LP+ +++S  Q+L++Y+D  L QDN+ S+ P+  Q +QQDP  L +F A+DP  
Sbjct: 419  TASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQ-KQQDPPELPSFSAYDPQT 477

Query: 1155 GYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ--PVAQLYPQVHLS 982
            GYD+P+F+  +D T +G+G PSPQE +S+H AN+ PAST+ M+ QQ  PVAQ+YPQVH+S
Sbjct: 478  GYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQQQPPVAQMYPQVHVS 536

Query: 981  QFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSAS 802
             F N MP RQF+SP++LP MA+PGYS NPAY HPSNGS+Y+LMPG  SHL A GLKY   
Sbjct: 537  HFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQ 596

Query: 801  QYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSE 634
            Q+KP+P  SPTGF  +T+  GYA+N  G       +EDS+ +KYKDG  Y+PN+Q DTS+
Sbjct: 597  QFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSD 656

Query: 633  IWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHPPQP 454
            +W+Q  RE+PGLQ A YYN+   TP+  Y+ +  GHASFN AA Q + +Q+PG YHPP  
Sbjct: 657  LWIQNPRELPGLQSAPYYNMP-QTPH-GYMPSHTGHASFNAAAAQSSHMQFPGLYHPPPQ 714

Query: 453  P--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            P  +ANPH L   M  NV VGVA A P AQ  AY Q Q+ H NW+++F
Sbjct: 715  PAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 761


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  605 bits (1560), Expect = e-170
 Identities = 351/783 (44%), Positives = 472/783 (60%), Gaps = 38/783 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKTKIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSREL 2377
            S +R E+ G      K R Y  R ++ G + RN    +G ++EFRVVRD R NQ+ ++++
Sbjct: 80   SRKRSENVG---QGMKFRPYPERGSRRGSYTRNTLPDAGVNREFRVVRDNRVNQNANKDM 136

Query: 2376 XXXXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSN 2206
                       N+QV      K STGT  +      +   Q  NG S S  RHA  A S+
Sbjct: 137  KTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQTSNGPSSSQTRHARDANSS 196

Query: 2205 GLDKKELLVETDRTVPNS--DSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVP 2032
            G+D+KE+  E    +PN+   SQA++ N   + Q  +     ++S+ G+Y SS+DPVHVP
Sbjct: 197  GIDRKEISEEKRNFIPNAVLRSQAVKPN---NSQAHAATQSSSSSVVGVYSSSTDPVHVP 253

Query: 2031 S-DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT-------------- 1897
            S D+RSSG +GAIKREVGVVGVRRQP+ N                               
Sbjct: 254  SPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSSEAFRSFPS 313

Query: 1896 ---VVQSNQTAVADXXXXXXXXXXXS--NQY-SIKSYPTVNYQKAPQSNKEWRPKSSKKS 1735
                 Q + T+  +              NQY S ++   + +QKA Q NKEW+PK S+KS
Sbjct: 314  ISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNKEWKPKLSQKS 373

Query: 1734 SLMSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEW 1555
            S+ + GV GT     S PA+++     E   LQ+ F + NI EN++VII +H++VPE + 
Sbjct: 374  SVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDR 433

Query: 1554 ALLTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLP 1381
              LTFGSFG  FDS RN+  G Q  G AE SN ES   L VSAP  S+ D + G  +++ 
Sbjct: 434  CRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAPDTSSDDAAGGKPIEIL 493

Query: 1380 DDQVRTSRSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQ 1201
            DDQ+  S SDS  S   SEH+LP+ +++S  Q+L++Y+D  L QDN+ S+ P+  Q QQ 
Sbjct: 494  DDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQKQQD 553

Query: 1200 DPSLLSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQ 1021
             P L S   A+DP  GYD+P+F+  +D T +G+G PSPQE +S+H AN+ PAST+ M+ Q
Sbjct: 554  PPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQ 612

Query: 1020 Q--PVAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPG 847
            Q  PVAQ+YPQVH+S F N MP RQF+SP++LP MA+PGYS NPAY HPSNGS+Y+LMPG
Sbjct: 613  QQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPG 672

Query: 846  CGSHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYK 673
              SHL A GLKY   Q+KP+P  SPTGF  +T+  GYA+N  G       +EDS+ +KYK
Sbjct: 673  GSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYK 732

Query: 672  DG--YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQ 499
            DG  Y+PN+Q DTS++W+Q  RE+PGLQ A YYN+   TP+  Y+ +  GHASFN AA Q
Sbjct: 733  DGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH-GYMPSHTGHASFNAAAAQ 790

Query: 498  PAQIQYPGFYHPPQPP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWS 325
             + +Q+PG YHPP  P  +ANPH L   M  NV VGVA A P AQ  AY Q Q+ H NW+
Sbjct: 791  SSHMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWT 849

Query: 324  SHF 316
            ++F
Sbjct: 850  TNF 852


>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  603 bits (1554), Expect = e-169
 Identities = 357/779 (45%), Positives = 472/779 (60%), Gaps = 49/779 (6%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGF-------SGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXX 2347
            K R++  RN + GG+ R+         +G  +EFRVVRD R NQ+T+R++          
Sbjct: 94   KFRSFPDRNVRRGGYSRSTLMVRILLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATS 153

Query: 2346 XNDQVVPYFPGK-SSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLV 2179
             N+QV+     K +STGT  +     G++S Q+LNG +D+       A S+G ++KELL 
Sbjct: 154  VNEQVISNISEKGNSTGTSNNQKPSSGRQSSQSLNGPTDARPGIPQDANSSGSNRKELLE 213

Query: 2178 ETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIG 2002
            E   T+PN+ S+       DS  YS+  L  N+S+ G+Y SSSDPVHVPS D+RSS  +G
Sbjct: 214  ERQATIPNAVSRVQAVKPNDSQPYSA-SLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVG 272

Query: 2001 AIKREVGVVGVRRQPT-------------------GNXXXXXXXXXXXXXXXXTVVQSNQ 1879
            AIKREVGVVGVRRQ T                   G                    Q  Q
Sbjct: 273  AIKREVGVVGVRRQSTENSVKHSSAPSSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQ 332

Query: 1878 TAVADXXXXXXXXXXXS--NQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTG 1708
            T V D              NQY  + +   V +QKAPQ NKEW+PKSS+KSS +  GV G
Sbjct: 333  TTVPDHVIPSMPVNRSFLGNQYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIG 392

Query: 1707 TDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFG 1528
            T A  +S  A+NS     E   LQ+   + +I ENQ+VII +H++VPE +   LTFGSFG
Sbjct: 393  TPAKSVSPRADNSKDLESETAKLQDKLSQASISENQNVIIAQHIRVPETDRCRLTFGSFG 452

Query: 1527 SGFDSTRNYAYGPQEPGSAEHSNSE--SCLRVSAPAASNADVSDGNKLDLPDDQVRTSRS 1354
            + F S      G Q  G+A+  ++E  + L VS P +S+ D S   ++DL DDQ   S +
Sbjct: 453  ADFAS------GFQAVGNADEPSAEPSASLSVSPPESSSDDGS--KQVDL-DDQYINSGT 503

Query: 1353 DSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFP 1174
             S  S   SEH+LP+K+ESS  Q+LENY+D  L ++++ S+ P     QQQ+  +L +FP
Sbjct: 504  ASPESGEASEHQLPDKKESSSPQNLENYADIGLVRESSPSYTPES--QQQQERHVLPSFP 561

Query: 1173 -AFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ----PVA 1009
             A+DP  GYD+P+F+  MD T++G+G PSPQE ++SH AN IPAS++AM+ QQ    PV 
Sbjct: 562  HAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEALASHTANSIPASSIAMVQQQQQQPPVP 621

Query: 1008 QLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLV 829
            Q+Y QVH+  F N MP RQFLSPV++PPMA+PGYS NPAYSHPSN ++YLLMPG  SHL 
Sbjct: 622  QMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLG 681

Query: 828  AGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YI 661
            A GLKY   Q KP+P  SPTGF  +TN  GYA+N  G    A  +EDS+ +KYKDG  Y+
Sbjct: 682  ANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVVGSATGLEDSSRLKYKDGNIYV 741

Query: 660  PNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFN--VAATQPAQI 487
            PN Q +TSEIW+Q  RE+PGLQ A YYN+   TP+AAY+ +  GHASFN   AA Q + +
Sbjct: 742  PNPQAETSEIWIQNPRELPGLQSAPYYNMPAQTPHAAYMPSHTGHASFNAAAAAAQSSHM 801

Query: 486  QYPGFYHPPQPP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            Q+PG YHPP  P  +A+PH L   M GNV VGVA+A P  Q  AY Q Q+ H NW+++F
Sbjct: 802  QFPGLYHPPPQPAAMASPHHLGPPMGGNVGVGVAAAAPGPQVGAYQQPQLGHLNWTTNF 860


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  601 bits (1549), Expect = e-169
 Identities = 347/779 (44%), Positives = 470/779 (60%), Gaps = 34/779 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKTKIRTYSGRNAQNGGHPRNGFSGTHQEFRVVRDKRGNQSTSRELXX 2371
            S +R E+ G      K R Y  R ++ G + RN   G ++EFRVVRD R NQ+ ++++  
Sbjct: 80   SRKRSENVG---QGMKFRPYPERGSRRGSYTRNTLPGVNREFRVVRDNRVNQNANKDMKT 136

Query: 2370 XXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGL 2200
                     N+QV      K STGT  +      +   Q  NG S S  RHA  A S+G+
Sbjct: 137  PFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQTSNGPSSSQTRHARDANSSGI 196

Query: 2199 DKKELLVETDRTVPNS--DSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS- 2029
            D+KE+  E    +PN+   SQA++ N   + Q  +     ++S+ G+Y SS+DPVHVPS 
Sbjct: 197  DRKEISEEKRNFIPNAVLRSQAVKPN---NSQAHAATQSSSSSVVGVYSSSTDPVHVPSP 253

Query: 2028 DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT---------------- 1897
            D+RSSG +GAIKREVGVVGVRRQP+ N                                 
Sbjct: 254  DSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSSEAFRSFPSIS 313

Query: 1896 -VVQSNQTAVADXXXXXXXXXXXS--NQY-SIKSYPTVNYQKAPQSNKEWRPKSSKKSSL 1729
               Q + T+  +              NQY S ++   + +QKA Q NKEW+PK S+KSS+
Sbjct: 314  RADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNKEWKPKLSQKSSV 373

Query: 1728 MSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWAL 1549
             + GV GT     S PA+++     E   LQ+ F + NI EN++VII +H++VPE +   
Sbjct: 374  NNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCR 433

Query: 1548 LTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESCLRVSAPAASNADVSDGNKLDLPDDQV 1369
            LTFGSFG  FDS RN+  G Q  G AE SN ES        A++ D + G  +++ DDQ+
Sbjct: 434  LTFGSFGVEFDSLRNFVPGFQATGVAEDSNGES--------AASDDAAGGKPIEILDDQI 485

Query: 1368 RTSRSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSL 1189
              S SDS  S   SEH+LP+ +++S  Q+L++Y+D  L QDN+ S+ P+  Q +QQDP  
Sbjct: 486  GNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQ-KQQDPPE 544

Query: 1188 LSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ--P 1015
            L +F A+DP  GYD+P+F+  +D T +G+G PSPQE +S+H AN+ PAST+ M+ QQ  P
Sbjct: 545  LPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQQQPP 603

Query: 1014 VAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSH 835
            VAQ+YPQVH+S F N MP RQF+SP++LP MA+PGYS NPAY HPSNGS+Y+LMPG  SH
Sbjct: 604  VAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSH 663

Query: 834  LVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG-- 667
            L A GLKY   Q+KP+P  SPTGF  +T+  GYA+N  G       +EDS+ +KYKDG  
Sbjct: 664  LNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNI 723

Query: 666  YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQI 487
            Y+PN+Q DTS++W+Q  RE+PGLQ A YYN+   TP+  Y+ +  GHASFN AA Q + +
Sbjct: 724  YVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH-GYMPSHTGHASFNAAAAQSSHM 781

Query: 486  QYPGFYHPPQPP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            Q+PG YHPP  P  +ANPH L   M  NV VGVA A P AQ  AY Q Q+ H NW+++F
Sbjct: 782  QFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 839


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  595 bits (1535), Expect = e-167
 Identities = 346/781 (44%), Positives = 468/781 (59%), Gaps = 36/781 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKTKIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSREL 2377
            S +R E+ G      K R Y  R ++ G + RN    +G ++EFRVVRD R NQ+ ++++
Sbjct: 80   SRKRSENVG---QGMKFRPYPERGSRRGSYTRNTLPDAGVNREFRVVRDNRVNQNANKDM 136

Query: 2376 XXXXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSN 2206
                       N+QV      K STGT  +      +   Q  NG S S  RHA  A S+
Sbjct: 137  KTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQTSNGPSSSQTRHARDANSS 196

Query: 2205 GLDKKELLVETDRTVPNS--DSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVP 2032
            G+D+KE+  E    +PN+   SQA++ N   + Q  +     ++S+ G+Y SS+DPVHVP
Sbjct: 197  GIDRKEISEEKRNFIPNAVLRSQAVKPN---NSQAHAATQSSSSSVVGVYSSSTDPVHVP 253

Query: 2031 S-DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT-------------- 1897
            S D+RSSG +GAIKREVGVVGVRRQP+ N                               
Sbjct: 254  SPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSSEAFRSFPS 313

Query: 1896 ---VVQSNQTAVADXXXXXXXXXXXS--NQY-SIKSYPTVNYQKAPQSNKEWRPKSSKKS 1735
                 Q + T+  +              NQY S ++   + +QKA Q NKEW+PK S+KS
Sbjct: 314  ISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNKEWKPKLSQKS 373

Query: 1734 SLMSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEW 1555
            S+ + GV GT     S PA+++     E   LQ+ F + NI EN++VII +H++VPE + 
Sbjct: 374  SVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDR 433

Query: 1554 ALLTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESCLRVSAPAASNADVSDGNKLDLPDD 1375
              LTFGSFG  FDS RN+  G Q  G AE SN ES        A++ D + G  +++ DD
Sbjct: 434  CRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGES--------AASDDAAGGKPIEILDD 485

Query: 1374 QVRTSRSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDP 1195
            Q+  S SDS  S   SEH+LP+ +++S  Q+L++Y+D  L QDN+ S+ P+  Q QQ  P
Sbjct: 486  QIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQKQQDPP 545

Query: 1194 SLLSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ- 1018
             L S   A+DP  GYD+P+F+  +D T +G+G PSPQE +S+H AN+ PAST+ M+ QQ 
Sbjct: 546  ELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTANV-PASTIPMMQQQQ 604

Query: 1017 -PVAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCG 841
             PVAQ+YPQVH+S F N MP RQF+SP++LP MA+PGYS NPAY HPSNGS+Y+LMPG  
Sbjct: 605  PPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGS 664

Query: 840  SHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG 667
            SHL A GLKY   Q+KP+P  SPTGF  +T+  GYA+N  G       +EDS+ +KYKDG
Sbjct: 665  SHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDG 724

Query: 666  --YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPA 493
              Y+PN+Q DTS++W+Q  RE+PGLQ A YYN+   TP+  Y+ +  GHASFN AA Q +
Sbjct: 725  NIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH-GYMPSHTGHASFNAAAAQSS 782

Query: 492  QIQYPGFYHPPQPP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSH 319
             +Q+PG YHPP  P  +ANPH L   M  NV VGVA A P AQ  AY Q Q+ H NW+++
Sbjct: 783  HMQFPGLYHPPPQPAAMANPH-LGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTN 841

Query: 318  F 316
            F
Sbjct: 842  F 842


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  595 bits (1533), Expect = e-167
 Identities = 340/770 (44%), Positives = 457/770 (59%), Gaps = 39/770 (5%)
 Frame = -1

Query: 2508 TKIRTYSGRNAQNGGHPRNGF-------SGTHQEFRVVRDKRGNQSTSRELXXXXXXXXX 2350
            +K+ T+S RNA+ GG+ RN         +G  +EFRVVRD R N+S +RE          
Sbjct: 94   SKVNTFSDRNARRGGYARNSLPDRIMLHAGVSREFRVVRDNRVNRSLNREAKPASASPTP 153

Query: 2349 XXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLV 2179
                + +    GK STG+         + S Q L G SDS  R AH   S GL +KE+  
Sbjct: 154  PSTFENIS---GKGSTGSSNSEKPTASKNSSQGLYGPSDSHLRIAHDIESTGLVRKEVSE 210

Query: 2178 ETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIG 2002
            E   T  +  S+     A ++   S++    +++IG +Y SS+DPVHVPS D+RSSG +G
Sbjct: 211  EKRVTFSSVASRVQAGKANNARSQSAMVASSSSAIG-VYSSSTDPVHVPSPDSRSSGSVG 269

Query: 2001 AIKREVGVVGVRRQPT------------------GNXXXXXXXXXXXXXXXXTVVQSNQT 1876
            AIKREVGVVGVRRQ +                  G                  V Q++++
Sbjct: 270  AIKREVGVVGVRRQSSDNSKSSVPSSSFSNSLLGGEGSAETLQSFSTISKNDEVGQASES 329

Query: 1875 AVADXXXXXXXXXXXSNQYSIKSYPTVNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDAN 1696
             +              +       P V +QKA Q NKEW+PKSS+K SL + GV GT   
Sbjct: 330  ILPSVSVSRSLLSSHYSNRQQHQQP-VGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTK 388

Query: 1695 PISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFD 1516
             +S PA NS  S  E   + E   + NI ENQ+VII +H++VPE +   LTFGSFG  F+
Sbjct: 389  SVSPPAHNSEVSESEPAKVLEKLSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGKEFE 448

Query: 1515 STRNYAYGPQEPGSAEHSNSESCLRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSA 1336
            S  +   G Q  G+   SN E+   +SAP +S  D S   ++DL D+Q+R S SDS  S 
Sbjct: 449  SDSDLVNGYQA-GAIGESNGEAASSLSAPESSIGDASGSKQVDLTDEQIRNSGSDSPTSG 507

Query: 1335 APSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHP 1156
              SE++ P+K+ES+  Q+L+NY+D  L Q N+ S+ PA    QQ +   L  F A+D   
Sbjct: 508  GTSENQFPDKKESTSPQNLDNYADIGLVQGNSPSYAPA--DSQQPEHPELPGFSAYDSQT 565

Query: 1155 GYDVPFFK--MGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ---PVAQLYPQV 991
            GYD P+F+     D  ++G+G P+PQE  SSH  N +P +T++M+ QQ   PVAQ+YPQV
Sbjct: 566  GYDFPYFRPASATDEAMRGQGLPTPQEAFSSHNTNSVP-TTISMVQQQQQPPVAQMYPQV 624

Query: 990  HLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKY 811
            H+S F N MP RQFLSPV++PPMA+PGYS +PAY HPSNG++YLLMPG G+HL A  LKY
Sbjct: 625  HVSHFANLMPYRQFLSPVYVPPMAMPGYSSSPAYPHPSNGNSYLLMPGGGTHLNANSLKY 684

Query: 810  SASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVD 643
               Q+KP+P  +PTGF  ++N  GYA+NT G   GA  +EDS+ +KYKDG  Y+PN Q +
Sbjct: 685  GVQQFKPVPAGNPTGFGNFSNPNGYAINTPGVVGGATGLEDSSRIKYKDGNLYVPNPQAE 744

Query: 642  TSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHP 463
            TSE+W+Q  RE+PGLQ   YYN+ G +P+AAYL +  GHAS+N AA Q + +Q+PG YHP
Sbjct: 745  TSEMWIQNPRELPGLQSTPYYNMPGQSPHAAYLPSHTGHASYNAAAAQSSHMQFPGLYHP 804

Query: 462  PQP-PLANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            PQP  +ANPH L   M GNV VGVA+A P AQ  AY Q Q+ H NW+++F
Sbjct: 805  PQPAAIANPHHLGPAMGGNVGVGVAAAAPGAQVGAYQQPQLGHLNWTTNF 854


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  594 bits (1531), Expect = e-167
 Identities = 350/752 (46%), Positives = 460/752 (61%), Gaps = 42/752 (5%)
 Frame = -1

Query: 2445 SGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQVVPYFPGK-SSTGTLIDH---GG 2278
            +G  +EFRVVRD R NQ+T+R++           N+QV+     K +STGT  +     G
Sbjct: 175  AGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSANEQVISNISEKGNSTGTSNNQKPSSG 234

Query: 2277 QKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSI 2098
            ++S Q+LNG +D+       A S+G ++KELL E   T+PN+ S+       DS  YS+ 
Sbjct: 235  RQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSA- 293

Query: 2097 PLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAIKREVGVVGVRRQPT----------- 1954
             L  N+S+ G+Y SSSDPVHVPS D+RSS  +GAIKREVGVVGVRRQ T           
Sbjct: 294  SLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAPS 353

Query: 1953 --------GNXXXXXXXXXXXXXXXXTVVQSNQTAVADXXXXXXXXXXXS--NQYSIKSY 1804
                    G                    Q  QT V D              NQY  + +
Sbjct: 354  SSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPH 413

Query: 1803 PT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPISSPAENSLKSSIEVDHLQEMF 1627
               V +QKAPQ NKEW+PKSS+KSS +  GV GT A  +S  A+NS     E   LQ+  
Sbjct: 414  QQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKL 473

Query: 1626 LKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSE-- 1453
             + +I ENQ+VII +H++VPE +   LTFGSFG+ F S      G Q  G+A+  ++E  
Sbjct: 474  SQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS------GFQAVGNADEPSAEPS 527

Query: 1452 SCLRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSAAPSEHRLPEKRESSGTQDLEN 1273
            + L VS P +S+ D S   ++DL DDQ   S + S  S   SEH+LP+K+ESS  Q+LEN
Sbjct: 528  ASLSVSPPESSSDDGS--KQVDL-DDQYINSGTASPESGEASEHQLPDKKESSSPQNLEN 584

Query: 1272 YSDFALAQDNASSHPPAGLQHQQQDPSLLSNFP-AFDPHPGYDVPFFKMGMDNTLQGRGS 1096
            Y+D  L ++++ S+ P     QQQ+  +L +FP A+DP  GYD+P+F+  MD T++G+G 
Sbjct: 585  YADIGLVRESSPSYTPES--QQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGL 642

Query: 1095 PSPQEVVSSHGANIIPASTVAMIHQQ----PVAQLYPQVHLSQFPNFMPCRQFLSPVFLP 928
            PSPQE ++SH AN IPAS++AM+ QQ    PV Q+Y QVH+  F N MP RQFLSPV++P
Sbjct: 643  PSPQEALASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVP 702

Query: 927  PMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQYKPIPPSSPTGFCTYTN 748
            PMA+PGYS NPAYSHPSN ++YLLMPG  SHL A GLKY   Q KP+P  SPTGF  +TN
Sbjct: 703  PMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTN 762

Query: 747  SPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSEIWVQTTREIPGLQPAAYY 580
              GYA+N  G    A  +EDS+ +KYKDG  Y+PN Q +TSEIW+Q  RE+PGLQ A YY
Sbjct: 763  PTGYAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYY 822

Query: 579  NISGHTPNAAYLQTQNGHASFN--VAATQPAQIQYPGFYHPPQPP--LANPHQLVHGMPG 412
            N+   TP+AAY+ +  GHASFN   AA Q + +Q+PG YHPP  P  +A+PH L   M G
Sbjct: 823  NMPAQTPHAAYMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASPHHLGPPMGG 882

Query: 411  NVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            NV VGVA+A P  Q  AY Q Q+ H NW+++F
Sbjct: 883  NVGVGVAAAAPGPQVGAYQQPQLGHLNWTTNF 914


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  592 bits (1525), Expect = e-166
 Identities = 348/770 (45%), Positives = 455/770 (59%), Gaps = 39/770 (5%)
 Frame = -1

Query: 2508 TKIRTYSGRNAQNGGHPRN---GFSGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXND 2338
            TK RT+S RN + GG+ R    G +G ++EFRVVRD R N +T+RE            ++
Sbjct: 101  TKFRTFSDRNTRQGGYIRAAVPGNAGINREFRVVRDNRVNLNTTREPKPAMQQGSISSDE 160

Query: 2337 QVVPYFPGKSSTGTL--IDHGG-QKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDR 2167
              +     K S+G+   + H G + S Q  NG  DS  RH   A SN  D+K +  E   
Sbjct: 161  LGISTVTEKGSSGSSGNVKHSGVRSSSQASNGPPDSQSRHTRDATSNFTDRKAMTEEKRA 220

Query: 2166 TVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAIKR 1990
             VP++ S+   +  + S Q+ S  L  +NS+ G+Y SS DPVHVPS ++RSS  +GAIKR
Sbjct: 221  VVPSAASRI--QVMKPSSQHHSATLASSNSVVGVYSSSMDPVHVPSPESRSSAAVGAIKR 278

Query: 1989 EVGVVGVRRQPT--------------GNXXXXXXXXXXXXXXXXTVVQSNQ------TAV 1870
            EVGVVG RRQ +               N                  +  N          
Sbjct: 279  EVGVVGGRRQSSENAVKNSSASSSSFSNSVLGRDGSLPESFQPFPTISKNDQVNEPVATE 338

Query: 1869 ADXXXXXXXXXXXSNQYSIKSYPTVNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPI 1690
            +             NQYS      V +QKA Q NKEW+PKSS+K+S+ S GV GT     
Sbjct: 339  SAMPSISVGRSFLGNQYSRTHQTAVGHQKATQHNKEWKPKSSQKASVGSPGVIGTPTKSS 398

Query: 1689 SSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDST 1510
            S PA NS     +   +QE  L+ NI ENQ+VII +H++VPE +   LTFGSFG  FDS+
Sbjct: 399  SPPAGNSKDLESDATDMQEKLLRVNIYENQNVIIAQHIRVPETDRCRLTFGSFGVEFDSS 458

Query: 1509 RNYAYGPQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSA 1336
            RN   G Q  G  + S +ES   L  SAP +S+ D S   +++L D+QVR S SDS  S 
Sbjct: 459  RNMPSGFQAAGVTKDSKAESAASLSASAPESSSDDASGNKQVELLDEQVRNSGSDSPASG 518

Query: 1335 APSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHP 1156
            A SEH+ P+K  SS   +L+NY+D  L +D+ S    +  QHQQ DP  L +F A+DP  
Sbjct: 519  AVSEHQSPDK--SSSPPNLDNYADIGLVRDS-SPFTSSESQHQQ-DPPELPSFSAYDPQT 574

Query: 1155 GYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ---PVAQLYPQVHL 985
             YD+ +F+  +D T++G+G  S QE + SH  + +PAS++ M+ QQ   P+AQ+YPQVH+
Sbjct: 575  VYDMSYFRPQIDETVRGQGLQSAQEALISHRVDSMPASSIPMVQQQQQPPIAQMYPQVHV 634

Query: 984  SQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSA 805
            S + N MP RQFLSPV++P MA+PGYS NPAY HPSNGS+YLLMPG  SHL A GLKY  
Sbjct: 635  SHYTNLMPYRQFLSPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSANGLKYGI 694

Query: 804  SQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTS 637
             Q+KP+P SSPTGF  +T+  GYA+N  G    A  +EDS+ MKYKDG  Y+PN Q +TS
Sbjct: 695  QQFKPVPGSSPTGFGNFTSPTGYAINAPGVVGSATGLEDSSRMKYKDGNLYVPNPQAETS 754

Query: 636  EIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHPPQ 457
            EIWVQ  RE+PGLQ A YYN+ G +P+AAYL +  GHASFN AA Q + +Q+ G Y PP 
Sbjct: 755  EIWVQNPRELPGLQSAPYYNMPGQSPHAAYLPSHTGHASFNAAAAQSSHMQFSGLYPPPP 814

Query: 456  P---PLANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            P    +ANPH L   M GNV VGVA A P AQ  AY Q Q+ H NW+++F
Sbjct: 815  PTPAAMANPHHLGPVMGGNVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 864


>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  590 bits (1521), Expect = e-165
 Identities = 338/750 (45%), Positives = 447/750 (59%), Gaps = 22/750 (2%)
 Frame = -1

Query: 2499 RTYSGRNAQNGGHPRN---GFSGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQ-- 2335
            RT+  R AQ GGH R    G  G ++EFRVVRD R NQ+ +RE             ++  
Sbjct: 113  RTFLDRYAQRGGHTRTDSIGNRGVNREFRVVRDNRINQNANREPKPALPQGSTSAKEKGS 172

Query: 2334 -VVPYFPGKSSTGTLIDHGGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTVP 2158
             V        S   L     Q S Q  NG +    R+   A+S   D+K +  E   T  
Sbjct: 173  GVTEKGSAGISNNNLKPSNAQSSSQTSNGPTYPEPRYNRDAKSRAGDRKVVSEEKRSTAS 232

Query: 2157 NSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAIKREVG 1981
            N+ +   +    ++ Q     L  +NS+ G+Y SS+DPVHVPS D+RSSG +GAIKREVG
Sbjct: 233  NATTSRAQVVKPNNSQQHDASLASSNSVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVG 292

Query: 1980 VVGVRRQPTGNXXXXXXXXXXXXXXXXTVVQSN-----QTAVADXXXXXXXXXXXS-NQY 1819
            VVG RRQ                        SN     QTAV +             NQY
Sbjct: 293  VVGGRRQSENAVKDLSSSNSFSESFHPLTAISNTDQVRQTAVIESMPSVPVNRSLLHNQY 352

Query: 1818 SIKSYP-TVNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPISSPAENSLKSSIEVDH 1642
            + + +  TV Y KA Q NKEW+PKSS+KSS+ S GV GT       P +NS    +   +
Sbjct: 353  NSRPHQQTVGYPKASQHNKEWKPKSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAAN 412

Query: 1641 LQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDSTRNYAYGPQEPGSAEHS 1462
            LQ+ F + NI ENQ+VII +H++VPE++   LTFGSFG  FD +RN   G Q  G +E S
Sbjct: 413  LQDKFSRVNIHENQNVIIAQHIRVPESDRCKLTFGSFGVEFDPSRNSTPGFQAVGISEES 472

Query: 1461 NSESCLRV--SAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSAAPSEHRLPEKRESSGT 1288
            N ES + +  S P +S+ D   G +++L DDQ R S SDS  +   SEH+LPEK  SS  
Sbjct: 473  NRESAISLPASCPESSSEDAPGGKQIELLDDQARNSESDSPEAGLASEHQLPEK--SSSP 530

Query: 1287 QDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHPGYDVPFFKMGMDNTLQ 1108
             DL+NY+D  L ++++ S+ P+  Q QQ  P L S F A+DP  GYD+ +F+  +D T+Q
Sbjct: 531  PDLDNYADIGLVRNSSPSYAPSESQQQQDHPELPS-FSAYDPQTGYDMSYFQPPIDETVQ 589

Query: 1107 GRGSPSPQEVVSSHGANIIPASTVAMIHQQP-VAQLYPQVHLSQFPNFMPCRQFLSPVFL 931
            G+G PSP+E +++H  N IP ST+  + QQP +AQ+YPQVH+S F N MP RQF+SPV++
Sbjct: 590  GQGQPSPREALTAHTGNHIPTSTMPTMQQQPPMAQMYPQVHVSPFTNLMPYRQFISPVYV 649

Query: 930  PPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQYKPIPPSSPTGFCTYT 751
            PPM +PGYS NPAY HPSNG++Y+LMPG GSHL A GLKY    YKP+P S+P GF  +T
Sbjct: 650  PPMPMPGYSSNPAYPHPSNGNSYMLMPGGGSHLNANGLKYGIQHYKPVPSSNPAGFGNFT 709

Query: 750  NSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSEIWVQTTREIPGLQPAAY 583
            +  GYA+N  G    A  +ED + MKYKDG  Y+PN Q ++SEIW+Q  R++PGLQ + Y
Sbjct: 710  SPSGYAINAPGVVGSAAGLEDPSRMKYKDGNIYVPNPQAESSEIWIQNPRDLPGLQSSPY 769

Query: 582  YNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHPPQP-PLANPHQLVHGMPGNV 406
            YNI G T +AAYL +  GHASFN AA Q + +Q+PG Y PPQP  +A+PH L   M  NV
Sbjct: 770  YNIPGQT-HAAYLPSHTGHASFNAAAAQSSHMQFPGLYPPPQPTAMASPHHLGPVMGNNV 828

Query: 405  RVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
             VGVA + P AQ  AY Q Q+ H NW+++F
Sbjct: 829  GVGVAPSAPGAQVGAYQQPQLGHLNWTTNF 858


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  579 bits (1492), Expect = e-162
 Identities = 338/769 (43%), Positives = 458/769 (59%), Gaps = 24/769 (3%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKTKIRTYSGRNAQNGGHPRN---GFSGTHQEFRVVRDKRGNQSTSRE 2380
            S++  +H+       +  T+S RNAQ GG+ R    G  G ++EFRVVRD R NQ+TSRE
Sbjct: 93   SVDSRKHSENFGQGMRPHTFSDRNAQRGGYTRTASPGNRGINREFRVVRDNRVNQNTSRE 152

Query: 2379 LXXXXXXXXXXXNDQVVPYFPGKSSTG---TLIDHGGQKSFQNLNGSSDSACRHAHVARS 2209
                         +Q       K STG    L     + S Q  NG  DS  RH   A S
Sbjct: 153  PKPALLHGSTSAKEQGSGVVTEKGSTGISSNLKPSDARSSHQASNGPIDSEPRHNRDANS 212

Query: 2208 NGLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS 2029
            +  D+K +  E      N+ +  ++    ++ Q  +     +N + G+Y SS+DPVHVPS
Sbjct: 213  SVGDRKVVSEEKRSVASNATTSRVQVAKSNNSQQHNALQASSNPVVGVYSSSTDPVHVPS 272

Query: 2028 -DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXTVVQSNQT-------A 1873
             D+RSSG +GAIKREVGVVG RRQ   N                     ++T       A
Sbjct: 273  PDSRSSGVVGAIKREVGVVGGRRQSFENAVKDLSSSNSFSESFRPFTAISKTDQVSQTAA 332

Query: 1872 VADXXXXXXXXXXXSNQYSIKSY-PTVNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDAN 1696
            +             +NQY+ + +   V + KA Q NKEW+PKSS+KSS+ S GV GT   
Sbjct: 333  IEPMPSVPVNRSFLNNQYNNRPHQQAVGHPKASQHNKEWKPKSSQKSSVTSPGVIGTPTK 392

Query: 1695 PISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFD 1516
              S P +NS    ++  +LQ+ F + NI ENQ+VII +H++VPE +   LTFGSFG GFD
Sbjct: 393  SSSPPTDNSKNMELDAANLQDKFSRINIHENQNVIIAQHIRVPETDRCKLTFGSFGVGFD 452

Query: 1515 STRNYAYGPQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTP 1342
            + R   +  Q  G +E SN ES   L  SAP +S+ D S G +++L DDQ R   SDS  
Sbjct: 453  APRTPGF--QAVGISEESNGESAISLPASAPDSSSDDASGGKQIELLDDQARNYGSDSPA 510

Query: 1341 SAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDP 1162
            ++  SEH LP    SS   +L+NY+D  L ++++ S+ P+  Q QQ  P L S F A+DP
Sbjct: 511  ASLESEHPLPV--NSSSPPNLDNYADIGLVRNSSPSYAPSESQQQQDHPELPS-FSAYDP 567

Query: 1161 HPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ-PVAQLYPQVHL 985
              GYD+ +F+  +D T++G+G PSPQE +++H AN +PAST++ + QQ P+AQ+YPQVH+
Sbjct: 568  QTGYDISYFRPQIDETVRGQGLPSPQEALTTHTAN-VPASTMSTVQQQPPMAQMYPQVHV 626

Query: 984  SQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSA 805
            SQF N +P RQF+SPV++PPM +PGYS +PAY HPSNG++YLLMPG GSHL A GLKY  
Sbjct: 627  SQFTNLVPYRQFISPVYVPPMPMPGYSSSPAYPHPSNGNSYLLMPGGGSHLNANGLKYGI 686

Query: 804  SQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTS 637
              YKP+P ++P GF  + +  GYA+N  G    A  +EDS+ MKYKDG  Y+PN Q + S
Sbjct: 687  QHYKPVPGNNPAGFGNFVSPSGYAINAPGVVGSATGLEDSSRMKYKDGNLYVPNPQAEAS 746

Query: 636  EIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHP-P 460
            EIW+Q  REIPG+Q A YYN+ G T + AYL +  GHASFN AA Q + +Q+PG Y P P
Sbjct: 747  EIWIQNPREIPGMQSAPYYNMPGQT-HTAYLPSHTGHASFNAAAAQSSHMQFPGLYPPTP 805

Query: 459  QP-PLANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            QP  + +PH L   M GNV VGVA + P AQ  AY Q Q+ H NW+++F
Sbjct: 806  QPTAMPSPHHLGPVMGGNVGVGVAPSAPGAQVGAYQQPQLGHLNWTTNF 854


>ref|XP_004303676.1| PREDICTED: uncharacterized protein LOC101293990 [Fragaria vesca
            subsp. vesca]
          Length = 915

 Score =  579 bits (1492), Expect = e-162
 Identities = 349/767 (45%), Positives = 458/767 (59%), Gaps = 40/767 (5%)
 Frame = -1

Query: 2496 TYSGRNAQNGGHPRNGF------SGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQ 2335
            ++S RN + GG+ R GF      +G  +EFRVVRD R N +   E            N+Q
Sbjct: 171  SFSDRNVRRGGYVRRGFPGISRGTGISREFRVVRDNRANHNMDGETKPASPQCTTSTNEQ 230

Query: 2334 VVPYFPGKSSTGTLIDHGGQKSF------QNLNGSSDSACRHAHVARSNGLDKKELLVET 2173
            V+     K  TG       QKSF      Q LNG +DS  R +  A S G  +KE   E 
Sbjct: 231  VISNVSEKGQTGI---SSNQKSFNRQHASQALNGQTDSRIRTSD-ANSTGTIRKETSAEK 286

Query: 2172 DRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAI 1996
               +PNS S+       +S  +S+     N S+ G+Y SS+DPVHVPS D+R S  +GAI
Sbjct: 287  RVALPNSASRVQAGRPNNSQPHSA----SNTSVIGVYSSSTDPVHVPSPDSRPSASVGAI 342

Query: 1995 KREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXTVV---------------QSNQTAVADX 1861
            KREVGVVGVR+Q + N                                  Q +QT+ +  
Sbjct: 343  KREVGVVGVRKQSSDNSKSAVPSSSFSNSLLGKEGTAESFRSLTGISKPDQLDQTSESVM 402

Query: 1860 XXXXXXXXXXSNQYSIKSYPT-VNYQK--APQSNKEWRPKSSKKSSLMSHGVTGTDANPI 1690
                      SNQ++++ +   V +QK  A Q NKEW+PKSS+K S  + GV GT     
Sbjct: 403  PSIPVSRTFISNQHNVRPHQQPVGHQKDAASQPNKEWKPKSSQKPSSNNPGVIGTPTKS- 461

Query: 1689 SSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDST 1510
            +SP ++S  S  E   LQ+   + NI EN +V+I ++++VPE++   LTFGS G+  +  
Sbjct: 462  ASPPDDSKVSESEAVQLQDKLARVNIYENCNVVIAQNIRVPESDRFRLTFGSLGT--ELV 519

Query: 1509 RNYAYGPQEPGSAEHSNSESCLRVSAPAASNADVSDGNKLDLPDDQVRTSRSD-STPSAA 1333
              +  GP E  + E    ++ L  SAP  S++D +    +DL DDQVR S SD S PSA 
Sbjct: 520  NGFQAGPTEESNRE---PQASLSTSAPE-SHSDEASTKPIDLLDDQVRNSGSDFSAPSAV 575

Query: 1332 PSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHPG 1153
            P EH LPEKRE+S  Q L+NY+D  L +DN+ S  P+    Q QDP  +  F AFDP  G
Sbjct: 576  P-EH-LPEKRETSSPQSLDNYADIGLVRDNSPSFTPS--DSQNQDPPEMQGFTAFDPQTG 631

Query: 1152 YDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQP--VAQLYPQVHLSQ 979
            YD+P+++  MD ++ G+G PSPQE +SSH +N IPASTVAM+ QQP  VAQ+YPQVH+S 
Sbjct: 632  YDIPYYRPSMDESVHGQGLPSPQEALSSHNSNSIPASTVAMVQQQPPHVAQMYPQVHVSH 691

Query: 978  FPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQ 799
            + N MP RQ++SPV++PPMAVPGYS NPAY H SNG++YLLMPG  SHL A  LKY   Q
Sbjct: 692  YANMMPYRQYISPVYVPPMAVPGYSNNPAYPHMSNGNSYLLMPGGASHLNANSLKYGVQQ 751

Query: 798  YKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSEI 631
            +KP+   SPTGF  +TN  GYA+N  G   GA  +EDS+ MKYKDG  Y+PN Q +TSEI
Sbjct: 752  FKPV-AGSPTGFGNFTNPAGYAMNAPGVVGGATGLEDSSRMKYKDGNLYVPNPQAETSEI 810

Query: 630  WVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHPPQP- 454
            W+Q  RE PG+Q A YYN+ G TP+AAY+ +  GHASFN AA Q + +QYPG YHPPQP 
Sbjct: 811  WIQNPREHPGMQSAPYYNMPGQTPHAAYMPSHGGHASFNAAAAQSSHMQYPGMYHPPQPA 870

Query: 453  PLANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQS-QINHFNWSSHF 316
             +A+PH +   MPGNV VGVA+A P AQ  AY Q  Q+NH NW+++F
Sbjct: 871  AMASPHHMGPAMPGNVGVGVAAAAPGAQ--AYQQQPQLNHMNWTTNF 915


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  578 bits (1491), Expect = e-162
 Identities = 350/769 (45%), Positives = 462/769 (60%), Gaps = 39/769 (5%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGFSG---THQ-------------EFRVVRDKRGNQSTSRELX 2374
            K R++  RN + GG+ R+   G   T+Q             EFRVVRD R NQ+T+R++ 
Sbjct: 94   KFRSFPDRNVRRGGYSRSTVPGNAKTYQFYHSILLDAGIGREFRVVRDNRVNQNTNRDMK 153

Query: 2373 XXXXXXXXXXNDQVVPYFPGK-SSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSN 2206
                      N+QV+     K +STGT  +     G++S Q+LNG +D+           
Sbjct: 154  PVSPQLATSVNEQVISNISEKGNSTGTSNNQKPSSGRQSSQSLNGPTDA----------- 202

Query: 2205 GLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS- 2029
                        R     D+ +++ N  DS  YS+  L  N+S+ G+Y SSSDPVHVPS 
Sbjct: 203  ------------RPGIPQDANSMKPN--DSQPYSA-SLASNSSVVGVYSSSSDPVHVPSP 247

Query: 2028 DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXTVVQSNQTAVADXXXXX 1849
            D+RSS  +GAIKREVGVVGVRRQ T N                   Q  QT V D     
Sbjct: 248  DSRSSAIVGAIKREVGVVGVRRQSTENSSD----------------QPRQTTVPDHVIPS 291

Query: 1848 XXXXXXS--NQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPISSPA 1678
                     NQY  + +   V +QKAPQ NKEW+PKSS+KSS +  GV GT A  +S  A
Sbjct: 292  MPVNRSFLGNQYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRA 351

Query: 1677 ENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDSTRNYA 1498
            +NS     E   LQ+   + +I ENQ+VII +H++VPE +   LTFGSFG+ F S     
Sbjct: 352  DNSKDLESETAKLQDKLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS----- 406

Query: 1497 YGPQEPGSAEHSNSE--SCLRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSAAPSE 1324
             G Q  G+A+  ++E  + L VS P +S+ D S   ++DL DDQ   S + S  S   SE
Sbjct: 407  -GFQAVGNADEPSAEPSASLSVSPPESSSDDGS--KQVDL-DDQYINSGTASPESGEASE 462

Query: 1323 HRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFP-AFDPHPGYD 1147
            H+LP+K+ESS  Q+LENY+D  L ++++ S+ P     QQQ+  +L +FP A+DP  GYD
Sbjct: 463  HQLPDKKESSSPQNLENYADIGLVRESSPSYTPES--QQQQERHVLPSFPHAYDPQAGYD 520

Query: 1146 VPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ----PVAQLYPQVHLSQ 979
            +P+F+  MD T++G+G PSPQE ++SH AN IPAS++AM+ QQ    PV Q+Y QVH+  
Sbjct: 521  IPYFRPTMDETVRGQGLPSPQEALASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPH 580

Query: 978  FPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQ 799
            F N MP RQFLSPV++PPMA+PGYS NPAYSHPSN ++YLLMPG  SHL A GLKY   Q
Sbjct: 581  FANLMPYRQFLSPVYVPPMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQ 640

Query: 798  YKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSEI 631
             KP+P  SPTGF  +TN  GYA+N  G    A  +EDS+ +KYKDG  Y+PN Q +TSEI
Sbjct: 641  LKPVPAGSPTGFGNFTNPTGYAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEI 700

Query: 630  WVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFN--VAATQPAQIQYPGFYHPPQ 457
            W+Q  RE+PGLQ A YYN+   TP+AAY+ +  GHASFN   AA Q + +Q+PG YHPP 
Sbjct: 701  WIQNPRELPGLQSAPYYNMPAQTPHAAYMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPP 760

Query: 456  PP--LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
             P  +A+PH L   M GNV VGVA+A P  Q  AY Q Q+ H NW+++F
Sbjct: 761  QPAAMASPHHLGPPMGGNVGVGVAAAAPGPQVGAYQQPQLGHLNWTTNF 809


>ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 862

 Score =  575 bits (1481), Expect = e-161
 Identities = 348/782 (44%), Positives = 459/782 (58%), Gaps = 37/782 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKT-KIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSRE 2380
            SLE       I  KT +IRTY+ RNA+  G+ RN    +G ++EFRVVRD R N   ++E
Sbjct: 91   SLEEPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINREFRVVRDNRVNPEANQE 150

Query: 2379 LXXXXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARS 2209
                        N++V       S TGT       GG+   Q  NGS++   RHA+    
Sbjct: 151  TKSPLPQSSISTNEKVTNVKEKGSPTGTTGSERPSGGRSFSQASNGSTNLHPRHAYDHNI 210

Query: 2208 NGLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS 2029
             G D+ E   E   T       A+     +  +  S  L  +NS+GG YFSS DPVHVPS
Sbjct: 211  TGTDRIEPSAEKFTT------SAVNFIQHNITEGHSATLASSNSVGG-YFSSKDPVHVPS 263

Query: 2028 -DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT--------------- 1897
             D+R+S  +GAIKREVGVVG  RQ + N                                
Sbjct: 264  PDSRASSAVGAIKREVGVVGGGRQCSDNAVRDSTAPRSSFSNSILGRDNSDSFRPFPSIS 323

Query: 1896 -VVQSNQTAVADXXXXXXXXXXXSNQYSIKSYP-TVNYQKAPQSNKEWRPKSSKKSSLMS 1723
               Q NQ A  D            NQY+ +S+  +V +QKA Q NKEW+PKSS+KS+++ 
Sbjct: 324  KADQINQIAATDSGVANRALFT--NQYTGRSHQQSVGHQKASQHNKEWKPKSSQKSNVIG 381

Query: 1722 HGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLT 1543
             GV GT     S P ++S     +V  LQ+   + NI ENQ+VII +H++VPE +   LT
Sbjct: 382  PGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNINENQNVIIAQHIRVPETDRCRLT 441

Query: 1542 FGSFGSGFDSTRNYAYGPQEPGSAEHSNSESCLRVSAPAA--SNADVSDGNKLDLPDDQV 1369
            FGSFG  F+S+RN   G    GSAE SN ES   ++  A+  S  DVS    +D+ DD V
Sbjct: 442  FGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSGNDVSGRKPVDILDDLV 501

Query: 1368 RTSRSDSTPSAAPSEHRLPEK-RESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPS 1192
            R S S+S  S   SEH+LP+  +++S  QDL+ Y+D  L +D   S+P +  Q QQQD S
Sbjct: 502  RNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTDPSYPLSESQ-QQQDSS 560

Query: 1191 LLSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIH--QQ 1018
             L++FPA+D   GYD+ +F+  MD +++G+G PSPQE ++SH AN IPAS++AM+   QQ
Sbjct: 561  ELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSANSIPASSIAMLQHQQQ 620

Query: 1017 P-VAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCG 841
            P +AQ+YPQVH+S FPN MP RQ +SPV++P MA+PGYS NPAY HPSNGS+YLLMPG  
Sbjct: 621  PQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGS 680

Query: 840  SHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG 667
            SHL   GLKY   Q+KP+P +SPTGF  +T+  GYA+N          +EDS+ MKYKDG
Sbjct: 681  SHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDG 740

Query: 666  --YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPN-AAYLQTQNGHASFNVAATQP 496
              Y+ N+Q DTSE+W+   RE+PG+Q   YYN+   TP+ AAYL +  GHASFN A  Q 
Sbjct: 741  NLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFNAAVPQS 800

Query: 495  AQIQYPGFYHP-PQPP-LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSS 322
            + +Q+PG YHP  QPP +ANPH +   M GNV VGV  A P AQ  AY Q Q+ +FNWS 
Sbjct: 801  SHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAAPGAQVGAYQQPQLGNFNWSP 860

Query: 321  HF 316
            +F
Sbjct: 861  NF 862


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  573 bits (1478), Expect = e-160
 Identities = 348/784 (44%), Positives = 459/784 (58%), Gaps = 39/784 (4%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKT-KIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSRE 2380
            SLE       I  KT +IRTY+ RNA+  G+ RN    +G ++EFRVVRD R N   ++E
Sbjct: 91   SLEEPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINREFRVVRDNRVNPEANQE 150

Query: 2379 LXXXXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARS 2209
                        N++V       S TGT       GG+   Q  NGS++   RHA+    
Sbjct: 151  TKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQASNGSTNLHPRHAYDHNI 210

Query: 2208 NGLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS 2029
             G D+ E   E   T       A+     +  +  S  L  +NS+GG YFSS DPVHVPS
Sbjct: 211  TGTDRIEPSAEKFTT------SAVNFIQHNITEGYSATLASSNSVGG-YFSSKDPVHVPS 263

Query: 2028 -DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT--------------- 1897
             D+R+S  +GAIKREVGVVG  RQ + N                                
Sbjct: 264  PDSRASSAVGAIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILGRDNSDSFRPFPSIS 323

Query: 1896 -VVQSNQTAVADXXXXXXXXXXXS--NQYSIKSYP-TVNYQKAPQSNKEWRPKSSKKSSL 1729
               Q NQ A  D              NQY+ +S+  +V +QKA Q NKEW+PKSS+KS++
Sbjct: 324  KADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQHNKEWKPKSSQKSNV 383

Query: 1728 MSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWAL 1549
            +  GV GT     S P ++S     +V  LQ+   + NI ENQ+VII +H++VPE +   
Sbjct: 384  IGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNVIIAQHIRVPETDRCR 443

Query: 1548 LTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESCLRVSAPAA--SNADVSDGNKLDLPDD 1375
            LTFGSFG  F+S+RN   G    GSAE SN ES   ++  A+  S  DVS    +D+ DD
Sbjct: 444  LTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSGNDVSGRKPVDILDD 503

Query: 1374 QVRTSRSDSTPSAAPSEHRLPEK-RESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQD 1198
             VR S S+S  S   SEH+LP+  +++S  QDL+ Y+D  L +D   S+P +  Q QQQD
Sbjct: 504  LVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTDPSYPLSESQ-QQQD 562

Query: 1197 PSLLSNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIH-- 1024
             S L++FPA+D   GYD+ +F+  MD +++G+G PSPQE ++SH AN IPAS++AM+   
Sbjct: 563  SSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSANSIPASSIAMLQHQ 622

Query: 1023 QQP-VAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPG 847
            QQP +AQ+YPQVH+S FPN MP RQ +SPV++P MA+PGYS NPAY HPSNGS+YLLMPG
Sbjct: 623  QQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPG 682

Query: 846  CGSHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYK 673
              SHL   GLKY   Q+KP+P +SPTGF  +T+  GYA+N          +EDS+ MKYK
Sbjct: 683  GSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKYK 742

Query: 672  DG--YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPN-AAYLQTQNGHASFNVAAT 502
            DG  Y+ N+Q DTSE+W+   RE+PG+Q   YYN+   TP+ AAYL +  GHASFN A  
Sbjct: 743  DGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFNAAVP 802

Query: 501  QPAQIQYPGFYHP-PQPP-LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNW 328
            Q + +Q+PG YHP  QPP +ANPH +   M GNV VGV  A P AQ  AY Q Q+ +FNW
Sbjct: 803  QSSHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAAPGAQVGAYQQPQLGNFNW 862

Query: 327  SSHF 316
            S +F
Sbjct: 863  SPNF 866


>ref|XP_007214970.1| hypothetical protein PRUPE_ppa001749mg [Prunus persica]
            gi|462411120|gb|EMJ16169.1| hypothetical protein
            PRUPE_ppa001749mg [Prunus persica]
          Length = 771

 Score =  571 bits (1471), Expect = e-160
 Identities = 338/761 (44%), Positives = 449/761 (59%), Gaps = 31/761 (4%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGFSGT--HQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQV 2332
            K  T + RN + GG+ R+G +GT   +EFRVVRD R N++ +RE            N+QV
Sbjct: 21   KSNTSADRNVRRGGYARSGVTGTGISREFRVVRDNRVNRNINRETKPDSPQCTTSTNEQV 80

Query: 2331 VPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTV 2161
                 GK  TG+         Q S Q  NG +D   R +  A + G  +KE LVE   T+
Sbjct: 81   -SNISGKGPTGSSSSQKPSSRQNSSQVSNGQTDPQIRTSD-ANATGSLRKETLVEKRVTL 138

Query: 2160 PNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS-DNRSSGKIGAIKREV 1984
            P +  +       +S  +S++ +V +NS+ GLY SS+DPVHVPS D+R S  +GAIKREV
Sbjct: 139  PTAALRVQAVKPSNSQPHSAV-VVSSNSVVGLYSSSTDPVHVPSPDSRPSASVGAIKREV 197

Query: 1983 GVVGVRRQPTGNXXXXXXXXXXXXXXXXT---------------VVQSNQTAVADXXXXX 1849
            GV   RRQ + N                                  Q  QT+ +      
Sbjct: 198  GV---RRQSSENSNSSAPSSSLSNSLLGKEGSTESFRPFTGISKTDQVGQTSESVMPSVS 254

Query: 1848 XXXXXXSNQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPISSPAEN 1672
                  SNQ++ + +   V +QKA Q NKEW+PKSS+K S  S GV GT    +SSP +N
Sbjct: 255  VSRPFLSNQHNARPHQQPVGHQKASQPNKEWKPKSSQKPSSNSPGVIGTPTKSVSSP-DN 313

Query: 1671 SLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDSTRNYAYG 1492
            S  S  E   LQ+   + N+ +N +V+I ++++VP+++   LTFGS G+  DST N   G
Sbjct: 314  SKVSESEAAKLQDKLSRVNVYDNSNVVIAQNIRVPDSDRFRLTFGSLGTELDSTGNMVNG 373

Query: 1491 PQEPGSAEHSNSESC--LRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSAAPSEHR 1318
             Q  G  E SN E    L +SAP + + + S    +DL D QVR S SDS  S A  E +
Sbjct: 374  FQA-GGTEESNGEPAGSLSLSAPQSCSDEASGIKPVDLLDHQVRNSGSDSPASGAVPERQ 432

Query: 1317 LPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHPGYDVPF 1138
            LPEK ++S  Q L+NY+D  L +D + S+ P+  Q Q+Q    L  F AFDP   Y++P+
Sbjct: 433  LPEKNDTSSPQTLDNYADIGLVRDTSPSYAPSDSQQQEQPE--LEGFSAFDPQTSYNIPY 490

Query: 1137 FKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQP--VAQLYPQVHLSQFPNFM 964
            F+  MD +++G+G PSPQE +SSH  N I ASTVAM+ QQP  VAQ+YPQVH+S + N M
Sbjct: 491  FRPHMDESVRGQGLPSPQEALSSHNVNSIAASTVAMVQQQPPPVAQMYPQVHVSHYANLM 550

Query: 963  PCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQYKPIP 784
            P RQFLSPV++PPMAVPGYS NPAY H SNG++YLLMPG GSHL A  LKY    +KP+P
Sbjct: 551  PYRQFLSPVYVPPMAVPGYSSNPAYPHMSNGNSYLLMPGGGSHLNANSLKYGVQPFKPVP 610

Query: 783  PSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG--YIPNRQVDTSEIWVQTT 616
              SPTG+  +TN  GYA+N  G   GA  +EDS+ +KYKDG  Y+ N Q +TSE+W+Q  
Sbjct: 611  AGSPTGYGNFTNPNGYAINGPGVVGGASGLEDSSRIKYKDGNLYVANPQAETSEMWIQNP 670

Query: 615  REIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYHPPQP-PLANP 439
            RE PGLQ   YYN+   +P+ AY+ +   HASFN AA Q + +Q+PG YHPPQP  + NP
Sbjct: 671  REHPGLQSTPYYNVPAQSPHGAYMPSHAAHASFNAAAAQSSHMQFPGLYHPPQPAAIPNP 730

Query: 438  HQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            H L   M GNV VGVA+A P AQ  AY Q Q+NH NW ++F
Sbjct: 731  HHLGPAMGGNVGVGVAAAAPGAQVGAYQQPQLNHMNWQTNF 771


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  569 bits (1466), Expect = e-159
 Identities = 348/785 (44%), Positives = 459/785 (58%), Gaps = 40/785 (5%)
 Frame = -1

Query: 2550 SLERGEHTGLIVYKT-KIRTYSGRNAQNGGHPRNGF--SGTHQEFRVVRDKRGNQSTSRE 2380
            SLE       I  KT +IRTY+ RNA+  G+ RN    +G ++EFRVVRD R N   ++E
Sbjct: 91   SLEEPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINREFRVVRDNRVNPEANQE 150

Query: 2379 LXXXXXXXXXXXNDQVVPYFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARS 2209
                        N++V       S TGT       GG+   Q  NGS++   RHA+    
Sbjct: 151  TKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQASNGSTNLHPRHAYDHNI 210

Query: 2208 NGLDKKELLVETDRTVPNSDSQALRKNARDSPQYSSIPLVPNNSIGGLYFSSSDPVHVPS 2029
             G D+ E   E   T       A+     +  +  S  L  +NS+GG YFSS DPVHVPS
Sbjct: 211  TGTDRIEPSAEKFTT------SAVNFIQHNITEGYSATLASSNSVGG-YFSSKDPVHVPS 263

Query: 2028 -DNRSSGKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT--------------- 1897
             D+R+S  +GAIKREVGVVG  RQ + N                                
Sbjct: 264  PDSRASSAVGAIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILGRDNSDSFRPFPSIS 323

Query: 1896 -VVQSNQTAVADXXXXXXXXXXXS--NQYSIKSYP-TVNYQKAPQSNKEWRPKSSKKSSL 1729
               Q NQ A  D              NQY+ +S+  +V +QKA Q NKEW+PKSS+KS++
Sbjct: 324  KADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQHNKEWKPKSSQKSNV 383

Query: 1728 MSHGVTGTDANPISSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWAL 1549
            +  GV GT     S P ++S     +V  LQ+   + NI ENQ+VII +H++VPE +   
Sbjct: 384  IGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNVIIAQHIRVPETDRCR 443

Query: 1548 LTFGSFGSGFDSTRNYAYGPQEPGSAEHSNSESCLRVSAPAA--SNADVSDGNKLDLPDD 1375
            LTFGSFG  F+S+RN   G    GSAE SN ES   ++  A+  S  DVS    +D+ DD
Sbjct: 444  LTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSGNDVSGRKPVDILDD 503

Query: 1374 QVRTSRSDSTPSAAPSEHRLPEK-RESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQD 1198
             VR S S+S  S   SEH+LP+  +++S  QDL+ Y+D  L +D   S+P +  Q QQQD
Sbjct: 504  LVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTDPSYPLSESQ-QQQD 562

Query: 1197 PSLLSNFP-AFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIH- 1024
             S L++FP A+D   GYD+ +F+  MD +++G+G PSPQE ++SH AN IPAS++AM+  
Sbjct: 563  SSELASFPQAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSANSIPASSIAMLQH 622

Query: 1023 -QQP-VAQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMP 850
             QQP +AQ+YPQVH+S FPN MP RQ +SPV++P MA+PGYS NPAY HPSNGS+YLLMP
Sbjct: 623  QQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMP 682

Query: 849  GCGSHLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKY 676
            G  SHL   GLKY   Q+KP+P +SPTGF  +T+  GYA+N          +EDS+ MKY
Sbjct: 683  GGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKY 742

Query: 675  KDG--YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPN-AAYLQTQNGHASFNVAA 505
            KDG  Y+ N+Q DTSE+W+   RE+PG+Q   YYN+   TP+ AAYL +  GHASFN A 
Sbjct: 743  KDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFNAAV 802

Query: 504  TQPAQIQYPGFYHP-PQPP-LANPHQLVHGMPGNVRVGVASAGPEAQGEAYLQSQINHFN 331
             Q + +Q+PG YHP  QPP +ANPH +   M GNV VGV  A P AQ  AY Q Q+ +FN
Sbjct: 803  PQSSHMQFPGMYHPTAQPPAMANPHHMGPAMGGNVGVGVPPAAPGAQVGAYQQPQLGNFN 862

Query: 330  WSSHF 316
            WS +F
Sbjct: 863  WSPNF 867


>ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 830

 Score =  563 bits (1451), Expect = e-157
 Identities = 338/763 (44%), Positives = 452/763 (59%), Gaps = 33/763 (4%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGFSGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQVVP 2326
            K    S RN +   + RN   G  +EFRVVRD R N    +E+            +Q+  
Sbjct: 102  KFNAPSERNVRRTNYSRNTLPGISKEFRVVRDNRVNH-IYKEVKPLTQQHSTSATEQLNV 160

Query: 2325 YFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTVPN 2155
              P K S+ T  +H   G + S    NG SDS  R+   A  N +D+K  +   D+    
Sbjct: 161  NTPDKGSS-TSTNHRSSGSRNSSLASNGPSDSHARYLKDAVPNIIDRK--IASEDK---- 213

Query: 2154 SDSQALRKNARDSPQYSSIPLVPNN------------SIGGLYFSSSDPVHVPS-DNRSS 2014
             D Q +  NA    Q    P+ PNN            S  G+Y SS+DPVHVPS D+RSS
Sbjct: 214  -DKQGMISNAAGRVQ----PIKPNNAHQNSASVASTSSAVGVYSSSTDPVHVPSPDSRSS 268

Query: 2013 GKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXTVVQSNQTAVADXXXXXXXXXX 1834
            G +GAI+REVGVVGVRRQ + N                   QS   +++           
Sbjct: 269  GVVGAIRREVGVVGVRRQSSDNKAK----------------QSFAPSISYVVGKDVSRPS 312

Query: 1833 XSNQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTGTDANPI----SSPAENS 1669
             +NQY+ + +   V +Q+  Q NKEW+PKSS+K +  S GV GT         S PAENS
Sbjct: 313  LNNQYNNRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENS 372

Query: 1668 LKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFGSGFDSTRNYAYGP 1489
                     LQ+   + NI ENQ+VII +H++VPE +   LTFG+ G+  DS+R  +   
Sbjct: 373  GDIESNTTELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKY- 431

Query: 1488 QEPGSAEHSNSE--SCLRVSAPAASNADVSDGNKLDLPDDQVRTSRSDSTPSAAPSEHRL 1315
               G++E SN E  + L V AP  S  DVS   ++DL D+ +R+SRSDS  S A SE +L
Sbjct: 432  HIIGASEKSNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQL 491

Query: 1314 PEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSNFPAFDPHPGYDVPFF 1135
            P+ ++SS TQ+L+NY++  L +D++ S+ P+  + QQQD   +  F A+DP  GYD+P+F
Sbjct: 492  PDNKDSSNTQNLDNYANIGLVRDSSPSYAPS--EPQQQDSHDMPGFAAYDPPAGYDIPYF 549

Query: 1134 KMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ--PVAQLYPQVHLSQFPNFMP 961
            +  +D T++G+G  SPQE + SH  N  PAST+AM+ QQ  PV Q+YPQVH+S F N MP
Sbjct: 550  RPTIDETVRGQGLSSPQEALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMP 609

Query: 960  CRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHLVAGGLKYSASQYKPIPP 781
             RQFLSPV++PPMA+PGYS NP Y HP+NGS+YLLMPG GSHL A  LKY   Q+KP+P 
Sbjct: 610  YRQFLSPVYVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPA 669

Query: 780  SSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG-YIPNRQVDTSEIWVQTTRE 610
             SPTGF  + N  GYA+ T G   GA A+EDS+ +KYKD  Y+PN Q +TSEIW+Q  R+
Sbjct: 670  GSPTGFGNFANPTGYAMITPGVVGGATALEDSSRVKYKDNLYVPNPQAETSEIWLQNPRD 729

Query: 609  IPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQYPGFYH-PPQP-PLANPH 436
            +PG+Q   YYN+ G TP+AAY+ +  GHASFN AA Q + +Q+PG YH PPQP  +A+PH
Sbjct: 730  LPGMQSTPYYNMPGQTPHAAYMPSHTGHASFNAAAAQSSHMQFPGMYHTPPQPAAMASPH 789

Query: 435  QLVHGMP---GNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
             L  G P    NV VGVA+A P AQ  AY Q Q+ H NW+++F
Sbjct: 790  HL--GPPAIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 830


>ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phaseolus vulgaris]
            gi|561008519|gb|ESW07468.1| hypothetical protein
            PHAVU_010G132600g [Phaseolus vulgaris]
          Length = 864

 Score =  557 bits (1436), Expect = e-156
 Identities = 335/782 (42%), Positives = 459/782 (58%), Gaps = 52/782 (6%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGFSGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQVVP 2326
            K  T S RN +   + RN   G  +EFRVVRD R N    +E+           ++++  
Sbjct: 100  KFHTPSERNVRRANYSRNTLPGISREFRVVRDNRVNY-IYKEVKPLSQQHLASASEELNV 158

Query: 2325 YFPGKSSTGTLIDH--GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTVPNS 2152
                K S+ +      G + S Q LNG SDS  R+   A  N +D+K    + D+     
Sbjct: 159  NLSEKGSSASTSHRSSGSRNSSQALNGPSDSFARYPKDAVPNIVDRKIASEDKDK----- 213

Query: 2151 DSQALRKNARDSPQYSSIPLVPNN------------SIGGLYFSSSDPVHVPS-DNRSSG 2011
            D Q++  NA +  Q    P+ PN+            S  G+Y SS+DPVHVPS D+RSS 
Sbjct: 214  DKQSMISNAAERVQ----PIKPNHIHQNPASVASSSSAVGVYSSSTDPVHVPSPDSRSSS 269

Query: 2010 KIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT---------------VVQSNQT 1876
             +GAI+REVGVVGVRRQP+ N                                  Q +QT
Sbjct: 270  VVGAIRREVGVVGVRRQPSDNKVKQSFAPSSSYVAGKDGTSADSFQPVGAVLKTEQFSQT 329

Query: 1875 AVADXXXXXXXXXXXS--NQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTGT 1705
             V +           S  NQY+ + +   V +Q+  Q NKEW+PKSS+K +  + GV GT
Sbjct: 330  KVTEPSLSGVPVSRPSVNNQYNGRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNNPGVIGT 389

Query: 1704 DANPISSP-AENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTFGSFG 1528
                 +SP AENS+    +   LQ+   + NI ENQ+VII +H+QVPE +   LTFG+ G
Sbjct: 390  PKKAAASPPAENSVDIESDAVELQDKLSQLNIYENQNVIIAQHIQVPETDRCRLTFGTIG 449

Query: 1527 SGFDSTR----NYAYGPQEPGSAEHSNSESCLRVSAPAASNADVSDGNKLDLPDDQVRTS 1360
            +  DS+R     +  GP E  + E + S   L V AP  S  DVS   ++DL D+ +R+S
Sbjct: 450  TEIDSSRLQSKYHIVGPSEKSNDELAAS---LAVPAPELSTDDVSGSKQVDLLDEHIRSS 506

Query: 1359 RSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLLSN 1180
             SDS  S APSE +LP+ ++SS TQ+L+NY++  L +D++ S+ P+  + QQQ+   +  
Sbjct: 507  GSDSPVSGAPSEQQLPDNKDSSNTQNLDNYANIGLVRDSSPSYAPS--EPQQQESHDMPG 564

Query: 1179 FPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ-----P 1015
            F A+DP  GYD+P+F+  +D T++G+G  SPQE + SHG N  PAST+AM+ QQ     P
Sbjct: 565  FAAYDPPTGYDIPYFRPTIDETVRGQGLSSPQEALISHGTNNTPASTIAMVQQQQQQQPP 624

Query: 1014 VAQLYPQVHLSQFPNFMPCRQFLSPVFLPP-MAVPGYSRNPAYSHPSNGSNYLLMPGCGS 838
            V Q+YPQ+H+S F N MP RQFLSPV++PP MA+PGYS NP Y HP+NG++Y+LMPG GS
Sbjct: 625  VPQMYPQMHVSHFANLMPYRQFLSPVYVPPPMAMPGYSSNPPYPHPTNGNSYVLMPGGGS 684

Query: 837  HLVAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG- 667
            HL A  LKY   QYKP+P  +P GF  + +  GYA+ T G   GA A+EDS+ +KYKD  
Sbjct: 685  HLNANNLKYGVQQYKPVPAGNPAGFGNFASPAGYAMITPGVVGGATALEDSSRVKYKDNL 744

Query: 666  YIPNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQI 487
            Y+PN Q +TSEIW+Q  R++PG+Q A YYN+ G TP+AAY+ +  GHASFN AA Q + +
Sbjct: 745  YVPNPQAETSEIWLQNPRDLPGMQSAPYYNMPGQTPHAAYMPSHTGHASFNAAAAQSSHM 804

Query: 486  QYPGFYH-PPQP-PLANPHQLVHGMP---GNVRVGVASAGPEAQGEAYLQSQINHFNWSS 322
            Q+PG YH PPQP  +A+PH L  G P    NV VGVA+A P AQ  AY Q Q+ H NW++
Sbjct: 805  QFPGMYHTPPQPAAMASPHHL--GPPSIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTT 862

Query: 321  HF 316
            +F
Sbjct: 863  NF 864


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  557 bits (1435), Expect = e-155
 Identities = 341/780 (43%), Positives = 453/780 (58%), Gaps = 50/780 (6%)
 Frame = -1

Query: 2505 KIRTYSGRNAQNGGHPRNGFSGTHQEFRVVRDKRGNQSTSRELXXXXXXXXXXXNDQVVP 2326
            K    S RN +   + RN   G  +EFRVVRD R N    +E+            +Q+  
Sbjct: 102  KFNAPSERNVRRTNYSRNTLPGISKEFRVVRDNRVNH-IYKEVKPLTQQHSTSATEQLNV 160

Query: 2325 YFPGKSSTGTLIDH---GGQKSFQNLNGSSDSACRHAHVARSNGLDKKELLVETDRTVPN 2155
              P K S+ T  +H   G + S    NG SDS  R+   A  N +D+K  +   D+    
Sbjct: 161  NTPDKGSS-TSTNHRSSGSRNSSLASNGPSDSHARYLKDAVPNIIDRK--IASEDK---- 213

Query: 2154 SDSQALRKNARDSPQYSSIPLVPNN------------SIGGLYFSSSDPVHVPS-DNRSS 2014
             D Q +  NA    Q    P+ PNN            S  G+Y SS+DPVHVPS D+RSS
Sbjct: 214  -DKQGMISNAAGRVQ----PIKPNNAHQNSASVASTSSAVGVYSSSTDPVHVPSPDSRSS 268

Query: 2013 GKIGAIKREVGVVGVRRQPTGNXXXXXXXXXXXXXXXXT---------------VVQSNQ 1879
            G +GAI+REVGVVGVRRQ + N                                  Q +Q
Sbjct: 269  GVVGAIRREVGVVGVRRQSSDNKAKQSFAPSISYVVGKDGTSADSFQSVGAVSKTEQFSQ 328

Query: 1878 TAVADXXXXXXXXXXXS--NQYSIKSYPT-VNYQKAPQSNKEWRPKSSKKSSLMSHGVTG 1708
            T V +           S  NQY+ + +   V +Q+  Q NKEW+PKSS+K +  S GV G
Sbjct: 329  TNVTEPSLSGMPVSRPSLNNQYNNRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIG 388

Query: 1707 TDANPI----SSPAENSLKSSIEVDHLQEMFLKCNIIENQHVIIPEHLQVPEAEWALLTF 1540
            T         S PAENS         LQ+   + NI ENQ+VII +H++VPE +   LTF
Sbjct: 389  TPKKAAVAAASPPAENSGDIESNTTELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTF 448

Query: 1539 GSFGSGFDSTRNYAYGPQEPGSAEHSNSE--SCLRVSAPAASNADVSDGNKLDLPDDQVR 1366
            G+ G+  DS+R  +      G++E SN E  + L V AP  S  DVS   ++DL D+ +R
Sbjct: 449  GTIGTELDSSRLQSKY-HIIGASEKSNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIR 507

Query: 1365 TSRSDSTPSAAPSEHRLPEKRESSGTQDLENYSDFALAQDNASSHPPAGLQHQQQDPSLL 1186
            +SRSDS  S A SE +LP+ ++SS TQ+L+NY++  L +D++ S+ P+  + QQQD   +
Sbjct: 508  SSRSDSPVSGAASEQQLPDNKDSSNTQNLDNYANIGLVRDSSPSYAPS--EPQQQDSHDM 565

Query: 1185 SNFPAFDPHPGYDVPFFKMGMDNTLQGRGSPSPQEVVSSHGANIIPASTVAMIHQQ--PV 1012
              F A+DP  GYD+P+F+  +D T++G+G  SPQE + SH  N  PAST+AM+ QQ  PV
Sbjct: 566  PGFAAYDPPAGYDIPYFRPTIDETVRGQGLSSPQEALISHATNNPPASTIAMVQQQQPPV 625

Query: 1011 AQLYPQVHLSQFPNFMPCRQFLSPVFLPPMAVPGYSRNPAYSHPSNGSNYLLMPGCGSHL 832
             Q+YPQVH+S F N MP RQFLSPV++PPMA+PGYS NP Y HP+NGS+YLLMPG GSHL
Sbjct: 626  PQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHL 685

Query: 831  VAGGLKYSASQYKPIPPSSPTGFCTYTNSPGYALNTQGT--GAIAIEDSTAMKYKDG-YI 661
             A  LKY   Q+KP+P  SPTGF  + N  GYA+ T G   GA A+EDS+ +KYKD  Y+
Sbjct: 686  NANNLKYGVQQFKPVPAGSPTGFGNFANPTGYAMITPGVVGGATALEDSSRVKYKDNLYV 745

Query: 660  PNRQVDTSEIWVQTTREIPGLQPAAYYNISGHTPNAAYLQTQNGHASFNVAATQPAQIQY 481
            PN Q +TSEIW+Q  R++PG+Q   YYN+ G TP+AAY+ +  GHASFN AA Q + +Q+
Sbjct: 746  PNPQAETSEIWLQNPRDLPGMQSTPYYNMPGQTPHAAYMPSHTGHASFNAAAAQSSHMQF 805

Query: 480  PGFYH-PPQP-PLANPHQLVHGMP---GNVRVGVASAGPEAQGEAYLQSQINHFNWSSHF 316
            PG YH PPQP  +A+PH L  G P    NV VGVA+A P AQ  AY Q Q+ H NW+++F
Sbjct: 806  PGMYHTPPQPAAMASPHHL--GPPAIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 863


Top