BLASTX nr result

ID: Angelica27_contig00016493

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00016493
         (3069 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017218848.1 PREDICTED: AF4/FMR2 family member 4-like [Daucus ...  1397   0.0  
XP_015866284.1 PREDICTED: dentin sialophosphoprotein [Ziziphus j...   711   0.0  
XP_007024431.2 PREDICTED: dentin sialophosphoprotein isoform X1 ...   687   0.0  
EOY27056.1 Dentin sialophosphoprotein-related, putative isoform ...   686   0.0  
EOY27055.1 Dentin sialophosphoprotein-related, putative isoform ...   686   0.0  
EOY27053.1 Dentin sialophosphoprotein-related, putative isoform ...   686   0.0  
XP_007024432.2 PREDICTED: dentin sialophosphoprotein isoform X2 ...   670   0.0  
EOY27054.1 Dentin sialophosphoprotein-related, putative isoform ...   669   0.0  
XP_018843337.1 PREDICTED: dentin sialophosphoprotein isoform X2 ...   657   0.0  
XP_018843336.1 PREDICTED: dentin sialophosphoprotein isoform X1 ...   657   0.0  
XP_011008457.1 PREDICTED: dentin sialophosphoprotein isoform X3 ...   650   0.0  
XP_011008456.1 PREDICTED: dentin sialophosphoprotein isoform X2 ...   644   0.0  
XP_011008458.1 PREDICTED: dentin sialophosphoprotein isoform X4 ...   643   0.0  
XP_011008455.1 PREDICTED: dentin sialophosphoprotein isoform X1 ...   643   0.0  
XP_017606848.1 PREDICTED: dentin sialophosphoprotein [Gossypium ...   630   0.0  
XP_016730003.1 PREDICTED: dentin sialophosphoprotein-like [Gossy...   626   0.0  
KHG19117.1 RNA polymerase II elongation factor ELL [Gossypium ar...   625   0.0  
XP_010274312.1 PREDICTED: dentin sialophosphoprotein [Nelumbo nu...   617   0.0  
XP_012445365.1 PREDICTED: dentin sialophosphoprotein [Gossypium ...   603   0.0  
GAV80268.1 Occludin_ELL domain-containing protein [Cephalotus fo...   603   0.0  
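Note: the summary table above has a fixed shape (accession, truncated description, bit score, E-value), so it can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output. The function name `parse_hit_row` and the regular expression are illustrative assumptions, and the two sample rows are copied from the table above.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# accession, free-text description (often truncated with "..."),
# bit score, E-value. The pattern is an assumption based on the rows above.
ROW = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")


def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    accession, description, bits, evalue = m.groups()
    # Some BLAST versions abbreviate E-values such as "1e-100" to "e-100";
    # handling that here is a precaution, not something seen in this report.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "accession": accession,
        "description": description,
        "bits": float(bits),
        "evalue": float(evalue),
    }


rows = [
    "XP_017218848.1 PREDICTED: AF4/FMR2 family member 4-like [Daucus ...  1397   0.0  ",
    "KHG19117.1 RNA polymerase II elongation factor ELL [Gossypium ar...   625   0.0  ",
]
hits = [parse_hit_row(r) for r in rows]
for hit in hits:
    print(hit["accession"], hit["bits"], hit["evalue"])
```

The descriptions stay truncated exactly as printed; the full titles are only available in the per-hit headers further down.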

>XP_017218848.1 PREDICTED: AF4/FMR2 family member 4-like [Daucus carota subsp.
            sativus] KZM89300.1 hypothetical protein DCAR_026375
            [Daucus carota subsp. sativus]
          Length = 1234

 Score = 1397 bits (3615), Expect = 0.0
 Identities = 736/985 (74%), Positives = 775/985 (78%), Gaps = 3/985 (0%)
 Frame = -3

Query: 2947 LHSNFHSASLNRPXXXXXXXXXXXXXXXXR-NSRKXXXXXXXXXXXXXXXAVEESYSLET 2771
            L SNFHSASLNRP                  NSRK               AVEES+SL T
Sbjct: 24   LQSNFHSASLNRPSGRRPSAGAAAGGSSASRNSRKTTTTPPSPSAAAPSAAVEESFSLVT 83

Query: 2770 GNPLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGGKDFRFTWSRE 2591
            GNPLDFAMIIRLTPDLVDEIKRVE QGGSARIKFDANAKNTSGNVIDVGGKDFRFTWSRE
Sbjct: 84   GNPLDFAMIIRLTPDLVDEIKRVEVQGGSARIKFDANAKNTSGNVIDVGGKDFRFTWSRE 143

Query: 2590 MGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSEEAERKSKSRQ 2411
            MGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSEEAERKSKSRQ
Sbjct: 144  MGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSEEAERKSKSRQ 203

Query: 2410 AIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPPGPPKSAHKHGLSL 2231
            AIILDHGNPSMKNQMKALAAAEAN+WKMPFKQKIEPPYKKRK+EPPPGPPKSAHKHG SL
Sbjct: 204  AIILDHGNPSMKNQMKALAAAEANTWKMPFKQKIEPPYKKRKSEPPPGPPKSAHKHGFSL 263

Query: 2230 SASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQATGKEKASSSEKGTP 2051
            S+SKGR S SPLPSTPEQ            +TRGYASVED M TQAT KEKASSSEKGTP
Sbjct: 264  SSSKGRMSRSPLPSTPEQPPASASPLGPSSITRGYASVEDTMPTQATSKEKASSSEKGTP 323

Query: 2050 SRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALEKSVGEYFPNSGRQI 1871
            SRV +TALLDK ARK +L VKPTDLRSMLIS LTENPRGMSLKALEKSVGEYFPNS RQI
Sbjct: 324  SRVASTALLDKSARKGSLVVKPTDLRSMLISFLTENPRGMSLKALEKSVGEYFPNSARQI 383

Query: 1870 EPIIRKIATYQAPGRYILKSEAELESLKKPVSESSPENN-QPPVSGNDRGDACDPKNAAS 1694
            EPIIRKIATYQ PGRYILKSEAELESLK PV  SSPENN Q PV GN+RGD  DPKN+AS
Sbjct: 384  EPIIRKIATYQTPGRYILKSEAELESLKNPVPGSSPENNQQAPVPGNNRGDTSDPKNSAS 443

Query: 1693 PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVPENSEGPVATXXXXXXXX 1514
            PKS AQ +EPVNLN EPGE VTV+EK D+P  SPDHY EEKVP+NSEG VAT        
Sbjct: 444  PKSRAQIDEPVNLNCEPGE-VTVSEKNDLP--SPDHYVEEKVPDNSEGHVATSTDSGSDS 500

Query: 1513 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDEEVDIMTSDD 1334
                                                             DEEVDIMTSDD
Sbjct: 501  DSESDSSDSESDSGSNSRGRSKSKSPVGSASGSSSDSESDASSNSKEGSDEEVDIMTSDD 560

Query: 1333 DKEPKDKLQT-PGQLPASSIPWRADGLLVQNKPDEMEDFHASEVVEIMENSPGYAHKSEI 1157
            DKEPKD LQ    +   S +PWR DGL+VQN PDEMEDFHASEVVEIMENSP YA KSEI
Sbjct: 561  DKEPKDNLQAHRPESHVSPLPWRPDGLMVQNIPDEMEDFHASEVVEIMENSPVYAQKSEI 620

Query: 1156 DPSNGSVSNKEGEKHAQVIKASAGNSSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQ 977
            D SNG VSNKEGEKHA VIKAS GNS V PESRVY ENLHRGRDKTARD  RHE+SDRYQ
Sbjct: 621  DASNGVVSNKEGEKHAPVIKASPGNSYVHPESRVYTENLHRGRDKTARDDLRHEESDRYQ 680

Query: 976  RKSEGKSKRRSDGKHSDDYAAHSEKVKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGP 797
            RKSEGKSKRRSD KH D YA HS+KVKAGS T+APMSEDT+ + SGSPGKCS D      
Sbjct: 681  RKSEGKSKRRSDDKHIDGYAVHSDKVKAGSATQAPMSEDTSFIFSGSPGKCSPD-----S 735

Query: 796  YKALHTHITPKAVKDRTDFSVHRTYNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRP 617
            YK+LH  ITPKAVK+RTD  VHR+YNQS+PGK+ISD Q SGPRP DISGRTK  S     
Sbjct: 736  YKSLHPQITPKAVKERTDSGVHRSYNQSHPGKSISDSQLSGPRPDDISGRTKAPS----- 790

Query: 616  GIYADNLGSNAKSTERSIQMPAGLPLHKEKVNRDKQSENGYNDEKRPPKNSREGAVDKNL 437
            GIYA+NLGSNAKSTERS+Q P  +PLHKEK+ RD QSE+GYN+EKRPPKN REGAVDKNL
Sbjct: 791  GIYAENLGSNAKSTERSLQTPEAIPLHKEKIKRDIQSEDGYNNEKRPPKNPREGAVDKNL 850

Query: 436  TATDSNIRKRGEFSGKMKEVGSFPNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSEL 257
            T TDS+ RKRGE  GK++EVGSFPNSH  YPS G +RSDMDRSPI+NERGP LRREPSEL
Sbjct: 851  TPTDSHSRKRGELPGKIREVGSFPNSHAGYPSKGGNRSDMDRSPIVNERGPYLRREPSEL 910

Query: 256  ELGELRDPLPEAAPGVAKQFDRKGSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPS 77
            ELGELRDPLPE  PG  KQFDRKGSFKQSENK+TSFDYWN DL KG+PAGRTGVDPMKPS
Sbjct: 911  ELGELRDPLPEETPGSTKQFDRKGSFKQSENKVTSFDYWNSDLGKGRPAGRTGVDPMKPS 970

Query: 76   PPNPDIGVVGNPKGSSKKRSPGHYE 2
            PPN D  VVGN KGSSKKRSPGHYE
Sbjct: 971  PPNSDFEVVGNLKGSSKKRSPGHYE 995
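
Note: the Identities/Positives/Gaps line of the alignment above can be checked with a few lines of arithmetic. The Python sketch below is illustrative and not part of the BLAST output. `parse_stats` is a hypothetical helper, and it assumes all three fields are present on the line. The printed percentages in this report are consistent with truncation rather than rounding (736/985 is about 74.7%, printed as 74%).

```python
import re

# Matches the statistics line printed under each hit header.
# Assumes Identities, Positives and Gaps are all present on the line.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\), "
    r"Gaps = (\d+)/(\d+) \((\d+)%\)"
)


def parse_stats(line):
    """Extract counts and printed percentages from one statistics line."""
    m = STATS.search(line)
    if m is None:
        raise ValueError("not an Identities/Positives/Gaps line")
    ident, length, ident_pct, pos, _, pos_pct, gaps, _, gaps_pct = map(int, m.groups())
    return {
        "identities": ident,
        "positives": pos,
        "gaps": gaps,
        "length": length,  # alignment length, including gap columns
        "printed_pct": (ident_pct, pos_pct, gaps_pct),
    }


line = " Identities = 736/985 (74%), Positives = 775/985 (78%), Gaps = 3/985 (0%)"
stats = parse_stats(line)

# Exact percentage versus the truncated value shown in the report.
exact_identity = 100 * stats["identities"] / stats["length"]
truncated = tuple(
    100 * stats[key] // stats["length"] for key in ("identities", "positives", "gaps")
)
print(f"{exact_identity:.2f}% identity, printed as {stats['printed_pct'][0]}%")
```

When filtering hits by identity, working from the raw counts avoids losing up to one percentage point to the truncated display value.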


>XP_015866284.1 PREDICTED: dentin sialophosphoprotein [Ziziphus jujuba]
          Length = 1240

 Score =  711 bits (1835), Expect = 0.0
 Identities = 436/941 (46%), Positives = 566/941 (60%), Gaps = 14/941 (1%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL +GN PL F+MIIRL PDLVDEIKRVEAQGG+ARIKFD+ A N +GNVIDVGG
Sbjct: 71   VEETFSLVSGNNPLAFSMIIRLAPDLVDEIKRVEAQGGTARIKFDSMANNPNGNVIDVGG 130

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE GDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE
Sbjct: 131  KEFRFTWSREFGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 190

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAE-PPPGP 2264
            EAERK KSR+AI+L+HGNPSMK+Q+K LAA E   W+  FKQK +PP+KKRK E P  GP
Sbjct: 191  EAERKHKSRKAIVLEHGNPSMKHQIKQLAAVETTPWR-SFKQKKDPPFKKRKVELPQGGP 249

Query: 2263 PKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQATG 2087
            PKS +K G+ S + +KGR S+SP+PS PEQ            +++ + S EDN+ TQ  G
Sbjct: 250  PKSTYKSGISSTTIAKGRHSSSPIPSPPEQSGPSASPLRTVNVSKIHTSTEDNVPTQLAG 309

Query: 2086 KEKA-SSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALEK 1910
            K+KA +SS++   SR  TT + +   RK N+G KPTDL+SML++LL ENP+GMSLKALEK
Sbjct: 310  KDKATASSDREILSR--TTGIRETVGRKGNIGAKPTDLQSMLVTLLMENPKGMSLKALEK 367

Query: 1909 SVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV-- 1742
            ++G+  PNS R+IEPII+KIA +QAPGRY LK   E ES KKP SE  SSPE N      
Sbjct: 368  AIGDSIPNSVRKIEPIIKKIAIFQAPGRYFLKPGVESESFKKPSSEGGSSPEENLNQTHI 427

Query: 1741 --SGNDRGDACDPKNAASPKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKV 1568
                 D   A +P       SH + EE   LNSE G      EKI++  HSPD + E+K 
Sbjct: 428  LEDNRDHMPAPEPHFEEKVPSH-ELEEQGQLNSELGGESNALEKINIQQHSPDIFGEKKG 486

Query: 1567 PENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1388
             +NSEG V +                                                  
Sbjct: 487  SDNSEGQVGSSSDSGSDSDSESDSSDSGSDSGSPSRSRSRSRSPVGSGSGSDSDSDSDAS 546

Query: 1387 XXXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRADGLLVQNKPDEMEDFHA 1214
                   DE+VDIMTSDDDKEPK K+Q   PG   +       D   V    DE +D H 
Sbjct: 547  SNSKEGSDEDVDIMTSDDDKEPKHKVQASEPGFSKSPITSRTPDAGPVDGGNDEKQDDHE 606

Query: 1213 SEVVEIMENSPGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPESRVYPENLHR 1034
             + VEI ++ P    ++ +     S  N+E EK A   K  + + + L + + Y  +L  
Sbjct: 607  FD-VEIEKDVPDAHQETGMAVFGSSAPNRENEKPADETKTFSPDGNELQKRQNYIVSLF- 664

Query: 1033 GRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGSVTEAPMSEDTN 854
              +   +D  ++EQSD  +R S+GK+KR  + KH D+ + ++++ K   + E  +S   +
Sbjct: 665  -DEGAVKDSSKYEQSDSTERISKGKNKRGLEVKHLDEKSEYTKRSKTDILPEPSLSGGRD 723

Query: 853  LLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD-RTDFSVHRTYNQSNPGKAISDFQPS 677
            +    S  K S DR ++ PY++ +   T  A +D  TD S  + YNQ+  GK+ SDFQ S
Sbjct: 724  VYVQESSHKLSSDRLMEDPYRSPNNQTTNGADRDGNTDISSQKGYNQAFSGKSSSDFQQS 783

Query: 676  GPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKEKVNRDKQSENG 497
            G R  + S + K    ++RP  Y ++LG   K +E+S  +  G+P+   K NRD Q E+G
Sbjct: 784  GRRSFNKSAKVKVPDLSERPDNYLESLGRARKYSEKSSNVNEGVPVQNFKFNRDAQYEDG 843

Query: 496  YNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHVSYPSNGRSRSD 320
            Y +EK+  +NS+E  A  K    +D   RK GE  GK KE G   +S          R+ 
Sbjct: 844  YVNEKKVMRNSKESSARSKQSVPSDLQNRKHGEVVGKFKEGGQISSSVGGSSPKDNGRTG 903

Query: 319  MDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQSENKLTSFDYW 140
             DRSPI+N R   L+RE S+LELGELR+PLPE AP V KQ +R  SFKQSEN+  + D W
Sbjct: 904  ADRSPIVNGRNSKLQREYSDLELGELREPLPEEAP-VKKQPER-SSFKQSENRSGASDNW 961

Query: 139  NPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRS 17
              D+SKGKPAG++ +   KP  P+       N +GS+KKR+
Sbjct: 962  ISDVSKGKPAGKSALGTGKPCSPDLSTRFQNNVEGSNKKRN 1002
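
Note: the query coordinates in the alignments above run from high to low because the hits lie on the minus strand (Frame = -3). The Python sketch below is illustrative and not part of the BLAST output. It assumes the usual convention that frame -1 starts at the last nucleotide of the query, and both function names are hypothetical. The numbers used are the query length (3069 letters) and the query coordinates of the first two alignments in this report.

```python
def minus_strand_frame(query_length, high_coord):
    """Frame (-1, -2 or -3) of a minus-strand translation starting at high_coord.

    Assumes frame -1 begins at the last nucleotide of the query, so the
    offset is counted from the 3' end of the plus strand.
    """
    offset = (query_length - high_coord) % 3
    return -(offset + 1)


def translated_residues(start, end):
    """Number of codons covered by a 1-based, inclusive nucleotide span."""
    nucleotides = abs(start - end) + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return nucleotides // 3


QUERY_LENGTH = 3069  # from the "(3069 letters)" line of this report

# First alignment: query 2947 down to 2. Second alignment: 2797 down to 17.
for start, end in [(2947, 2), (2797, 17)]:
    frame = minus_strand_frame(QUERY_LENGTH, start)
    residues = translated_residues(start, end)
    print(f"query {start}..{end}: frame {frame}, {residues} translated residues")
```

Both spans come out as frame -3, matching the Frame lines printed in the report. The residue counts are smaller than the alignment lengths on the Identities lines because those lengths also count gap columns.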


>XP_007024431.2 PREDICTED: dentin sialophosphoprotein isoform X1 [Theobroma cacao]
          Length = 1250

 Score =  687 bits (1773), Expect = 0.0
 Identities = 424/940 (45%), Positives = 558/940 (59%), Gaps = 30/940 (3%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVETAQAAV 242

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             GPPKS +K GL S S +KG  S SP+PS PE+            +++ ++ +ED M  Q
Sbjct: 243  GGPPKSGYKSGLISASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKAL
Sbjct: 303  VKSKENASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+VG+  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N    
Sbjct: 362  EKAVGDTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQT 421

Query: 1741 SGNDRGDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVP 1565
               +      P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  
Sbjct: 422  PAPEENHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKAS 481

Query: 1564 ENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1385
            +NSEG   +                                                   
Sbjct: 482  DNSEGQANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASS 541

Query: 1384 XXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHA 1214
                  DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   
Sbjct: 542  NSKEGSDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDG 600

Query: 1213 SEVVEIMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGN 1082
            S+ V+I  N           S G   + ++      +     + KEGEK  +  K S+ +
Sbjct: 601  SDAVDIEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSD 660

Query: 1081 SSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEK 902
               L E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  +++
Sbjct: 661  CDELQERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKR 720

Query: 901  VKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRT 725
            +K+ S+++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + 
Sbjct: 721  LKSESLSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKG 780

Query: 724  YNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGL 545
            YNQ  P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G 
Sbjct: 781  YNQVFPRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGH 838

Query: 544  PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSF 368
             + K+  +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G  
Sbjct: 839  FIQKDNPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQI 898

Query: 367  PNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRK 188
             +S+++ P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK
Sbjct: 899  SSSYINSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERK 956

Query: 187  GSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
             SFKQS +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 957  SSFKQSGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 996


>EOY27056.1 Dentin sialophosphoprotein-related, putative isoform 4 [Theobroma
            cacao]
          Length = 1222

 Score =  686 bits (1770), Expect = 0.0
 Identities = 424/940 (45%), Positives = 557/940 (59%), Gaps = 30/940 (3%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVETAQAAV 242

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             GPPKS +K GL S S +KG  S SP+PS PE+            +++ ++ +ED M  Q
Sbjct: 243  GGPPKSGYKSGLISASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKAL
Sbjct: 303  VKSKENASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+VG+  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N    
Sbjct: 362  EKAVGDTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQT 421

Query: 1741 SGNDRGDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVP 1565
               +      P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  
Sbjct: 422  PAPEENHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKAS 481

Query: 1564 ENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1385
            +NSEG   +                                                   
Sbjct: 482  DNSEGQANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASS 541

Query: 1384 XXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHA 1214
                  DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   
Sbjct: 542  NSKEGSDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDG 600

Query: 1213 SEVVEIMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGN 1082
            S+ V+I  N           S G   + ++      +     + KEGEK  +  K S+ +
Sbjct: 601  SDAVDIEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSD 660

Query: 1081 SSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEK 902
               L E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  +++
Sbjct: 661  CDELQERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKR 720

Query: 901  VKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRT 725
             K+ S+++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + 
Sbjct: 721  SKSESLSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKG 780

Query: 724  YNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGL 545
            YNQ  P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G 
Sbjct: 781  YNQVFPRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGH 838

Query: 544  PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSF 368
             + K+  +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G  
Sbjct: 839  FIQKDNPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQI 898

Query: 367  PNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRK 188
             +S+++ P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK
Sbjct: 899  SSSYINSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERK 956

Query: 187  GSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
             SFKQS +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 957  SSFKQSGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 996


>EOY27055.1 Dentin sialophosphoprotein-related, putative isoform 3 [Theobroma
            cacao]
          Length = 1222

 Score =  686 bits (1770), Expect = 0.0
 Identities = 424/940 (45%), Positives = 557/940 (59%), Gaps = 30/940 (3%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVETAQAAV 242

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             GPPKS +K GL S S +KG  S SP+PS PE+            +++ ++ +ED M  Q
Sbjct: 243  GGPPKSGYKSGLISASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKAL
Sbjct: 303  VKSKENASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+VG+  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N    
Sbjct: 362  EKAVGDTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQT 421

Query: 1741 SGNDRGDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVP 1565
               +      P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  
Sbjct: 422  PAPEENHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKAS 481

Query: 1564 ENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1385
            +NSEG   +                                                   
Sbjct: 482  DNSEGQANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASS 541

Query: 1384 XXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHA 1214
                  DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   
Sbjct: 542  NSKEGSDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDG 600

Query: 1213 SEVVEIMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGN 1082
            S+ V+I  N           S G   + ++      +     + KEGEK  +  K S+ +
Sbjct: 601  SDAVDIEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSD 660

Query: 1081 SSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEK 902
               L E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  +++
Sbjct: 661  CDELQERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKR 720

Query: 901  VKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRT 725
             K+ S+++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + 
Sbjct: 721  SKSESLSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKG 780

Query: 724  YNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGL 545
            YNQ  P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G 
Sbjct: 781  YNQVFPRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGH 838

Query: 544  PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSF 368
             + K+  +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G  
Sbjct: 839  FIQKDNPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQI 898

Query: 367  PNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRK 188
             +S+++ P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK
Sbjct: 899  SSSYINSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERK 956

Query: 187  GSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
             SFKQS +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 957  SSFKQSGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 996


>EOY27053.1 Dentin sialophosphoprotein-related, putative isoform 1 [Theobroma
            cacao]
          Length = 1250

 Score =  686 bits (1770), Expect = 0.0
 Identities = 424/940 (45%), Positives = 557/940 (59%), Gaps = 30/940 (3%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVETAQAAV 242

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             GPPKS +K GL S S +KG  S SP+PS PE+            +++ ++ +ED M  Q
Sbjct: 243  GGPPKSGYKSGLISASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKAL
Sbjct: 303  VKSKENASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+VG+  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N    
Sbjct: 362  EKAVGDTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQT 421

Query: 1741 SGNDRGDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVP 1565
               +      P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  
Sbjct: 422  PAPEENHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKAS 481

Query: 1564 ENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1385
            +NSEG   +                                                   
Sbjct: 482  DNSEGQANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASS 541

Query: 1384 XXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHA 1214
                  DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   
Sbjct: 542  NSKEGSDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDG 600

Query: 1213 SEVVEIMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGN 1082
            S+ V+I  N           S G   + ++      +     + KEGEK  +  K S+ +
Sbjct: 601  SDAVDIEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSD 660

Query: 1081 SSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEK 902
               L E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  +++
Sbjct: 661  CDELQERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKR 720

Query: 901  VKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRT 725
             K+ S+++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + 
Sbjct: 721  SKSESLSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKG 780

Query: 724  YNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGL 545
            YNQ  P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G 
Sbjct: 781  YNQVFPRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGH 838

Query: 544  PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSF 368
             + K+  +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G  
Sbjct: 839  FIQKDNPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQI 898

Query: 367  PNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRK 188
             +S+++ P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK
Sbjct: 899  SSSYINSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERK 956

Query: 187  GSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
             SFKQS +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 957  SSFKQSGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 996


>XP_007024432.2 PREDICTED: dentin sialophosphoprotein isoform X2 [Theobroma cacao]
          Length = 1233

 Score =  670 bits (1729), Expect = 0.0
 Identities = 415/935 (44%), Positives = 548/935 (58%), Gaps = 25/935 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPPGPP 2261
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVET----- 237

Query: 2260 KSAHKHGLSLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQATGKE 2081
                      S +KG  S SP+PS PE+            +++ ++ +ED M  Q   KE
Sbjct: 238  -------AQASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQVKSKE 290

Query: 2080 KASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALEKSVG 1901
             ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKALEK+VG
Sbjct: 291  NASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKALEKAVG 349

Query: 1900 EYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPVSGNDR 1727
            +  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N       + 
Sbjct: 350  DTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQTPAPEE 409

Query: 1726 GDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVPENSEG 1550
                 P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  +NSEG
Sbjct: 410  NHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKASDNSEG 469

Query: 1549 PVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1370
               +                                                        
Sbjct: 470  QANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASSNSKEG 529

Query: 1369 XDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHASEVVE 1199
             DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   S+ V+
Sbjct: 530  SDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDGSDAVD 588

Query: 1198 IMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGNSSVLP 1067
            I  N           S G   + ++      +     + KEGEK  +  K S+ +   L 
Sbjct: 589  IEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSDCDELQ 648

Query: 1066 ESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGS 887
            E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  ++++K+ S
Sbjct: 649  ERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKRLKSES 708

Query: 886  VTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRTYNQSN 710
            +++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + YNQ  
Sbjct: 709  LSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKGYNQVF 768

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKE 530
            P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G  + K+
Sbjct: 769  PRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGHFIQKD 826

Query: 529  KVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHV 353
              +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G   +S++
Sbjct: 827  NPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQISSSYI 886

Query: 352  SYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQ 173
            + P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK SFKQ
Sbjct: 887  NSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERKSSFKQ 944

Query: 172  SENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
            S +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 945  SGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 979


>EOY27054.1 Dentin sialophosphoprotein-related, putative isoform 2 [Theobroma
            cacao]
          Length = 1233

 Score =  669 bits (1726), Expect = 0.0
 Identities = 415/935 (44%), Positives = 547/935 (58%), Gaps = 25/935 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQGG+ARIKFD+   N SGNVIDVGG
Sbjct: 64   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGGTARIKFDSIPTNPSGNVIDVGG 123

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE  DLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDES  NHVKMRSE
Sbjct: 124  KEFRFTWSREFVDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESMTNHVKMRSE 183

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPPGPP 2261
            EAERK KSR+AI+LDHGNPSMKNQ+K LAAAEA+ WK  FK+K EP +KKRK E      
Sbjct: 184  EAERKHKSRKAIVLDHGNPSMKNQIKQLAAAEASPWKSHFKKK-EPAFKKRKVET----- 237

Query: 2260 KSAHKHGLSLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQATGKE 2081
                      S +KG  S SP+PS PE+            +++ ++ +ED M  Q   KE
Sbjct: 238  -------AQASNAKGGRSTSPIPSPPERSGAAASPIGIGNISKVHSGIEDVMPPQVKSKE 290

Query: 2080 KASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALEKSVG 1901
             ASSSEK  P+R  T A+ + P R+ N G KP DL+S+LI+LL ENP+GMSLKALEK+VG
Sbjct: 291  NASSSEKEIPTR-ATGAVREMPGRRGNFGPKPMDLQSLLITLLKENPKGMSLKALEKAVG 349

Query: 1900 EYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPVSGNDR 1727
            +  PNS R+IE I++KIAT+QAPGRY LK   EL+SLKKP SE  SSPE+N       + 
Sbjct: 350  DTIPNSARKIETIVKKIATFQAPGRYFLKPGVELDSLKKPSSESGSSPEDNHHQTPAPEE 409

Query: 1726 GDACDPKNAAS-PKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVPENSEG 1550
                 P   AS  +  ++ EE  +L+S+ G    V E+ID+  HSPD   + K  +NSEG
Sbjct: 410  NHDQTPAPVASIVEKVSEMEEQNHLDSKLGVESNVLEQIDIQQHSPDLGGDRKASDNSEG 469

Query: 1549 PVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1370
               +                                                        
Sbjct: 470  QANSASDSGSDSDSDSDSSDSGSDSGSRSRSRSRSGSPAGSGSGSSSDSESDASSNSKEG 529

Query: 1369 XDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHASEVVE 1199
             DE+VDIMTSDD KE K  LQ   PG +  S IPW+ +    +QN  DE +D   S+ V+
Sbjct: 530  SDEDVDIMTSDDYKETKQDLQASEPG-VVQSPIPWQTEHDRPLQNGMDENQDGDGSDAVD 588

Query: 1198 IMEN-----------SPGYAHKSEIDPSNGSV-----SNKEGEKHAQVIKASAGNSSVLP 1067
            I  N           S G   + ++      +     + KEGEK  +  K S+ +   L 
Sbjct: 589  IEGNGSDAVDVEGHGSDGVDIEKDLPEDEQQIGMAVSTRKEGEKPEEGAKPSSSDCDELQ 648

Query: 1066 ESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGS 887
            E + +  NL    +   +D  RHEQSD  +R  + KSKR SD KH D+ +  +++ K+ S
Sbjct: 649  ERQNFIGNLFDDAENLVKDSVRHEQSDNSERLPKAKSKRGSDLKHIDEKSERTKRSKSES 708

Query: 886  VTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK-DRTDFSVHRTYNQSN 710
            +++  +S   +    GS    S +RPID PY++    +  K  + +  DF   + YNQ  
Sbjct: 709  LSQPHVSGSRDPNFFGSIRNFSPNRPIDDPYQSSSVQMMNKGDREEHADFGSQKGYNQVF 768

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKE 530
            P K+ SDF  SG RP D     K + A +RP  + ++ G   K +E+S+    G  + K+
Sbjct: 769  PRKSSSDFHQSGRRPSDQGAWAKATIAAERPMKHTESSGHGRKFSEKSVH--EGHFIQKD 826

Query: 529  KVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHV 353
              +RD Q+E+G   +K+ P+N++E GA  KN   +D + RK GE  GK K+ G   +S++
Sbjct: 827  NPSRDTQNEDGLMKDKKLPRNTKEGGAGGKNAVPSDFHHRKLGETVGKFKDAGQISSSYI 886

Query: 352  SYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQ 173
            + P    SR   DR P +N +  +L+RE S LELGE+R+PL E  P + KQF+RK SFKQ
Sbjct: 887  NSPPKDNSRVTADRYP-VNGKSNMLQRELSHLELGEIREPLVEETP-IKKQFERKSSFKQ 944

Query: 172  SENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPN 68
            S +  ++ + +NPDLS+GK  G+T  D  KPSPPN
Sbjct: 945  SGSGPSTSENFNPDLSRGKSVGKTNWDSGKPSPPN 979


>XP_018843337.1 PREDICTED: dentin sialophosphoprotein isoform X2 [Juglans regia]
          Length = 1162

 Score =  657 bits (1694), Expect = 0.0
 Identities = 425/955 (44%), Positives = 562/955 (58%), Gaps = 28/955 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN P  FAMIIRL PDLVDEIKR+EAQ G+ARIKFD  A N+SGNVIDVGG
Sbjct: 73   VEENFSLVPGNNPPAFAMIIRLAPDLVDEIKRLEAQAGTARIKFDVTASNSSGNVIDVGG 132

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+F FT SRE GDLCDIYEE QSGEDGNGLLVESGCAW KLNV+RVLDEST  HVKM SE
Sbjct: 133  KEFSFTCSRETGDLCDIYEECQSGEDGNGLLVESGCAWWKLNVKRVLDESTTKHVKMMSE 192

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--G 2267
            EAERK K+R+AI+LD GNP  K+Q+K +AA E N+W+ P+KQK EP +KKRK EPP   G
Sbjct: 193  EAERKLKARKAIVLDPGNPPSKSQIKEIAAVETNAWR-PYKQKKEPAFKKRKVEPPQVGG 251

Query: 2266 PPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQAT 2090
              KSA+K GL S + +K R ++  L S PEQ            +++ +A++ED M +Q  
Sbjct: 252  LHKSAYKSGLSSTTTAKSRQTSPSLLSPPEQSSLPAAPLENANISKSHATIEDPMPSQLI 311

Query: 2089 GKEKAS-SSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
             K KA+ +SEK  P++ TT  + + P RK N+G KP DL+SMLI+LLTENP+GMSLKALE
Sbjct: 312  SKVKAAPTSEKEIPTK-TTNVVRETPGRKRNVGAKPMDLQSMLINLLTENPKGMSLKALE 370

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN--QPP 1745
            K++G+  PNS ++IEPII+KIAT+QAPGRY LK   ELE+LKKP+SE  SSPE+N  Q P
Sbjct: 371  KAIGDTVPNSVKKIEPIIKKIATFQAPGRYFLKPRVELETLKKPLSESGSSPEDNSHQMP 430

Query: 1744 VSGNDRGDACDPKNAASPK-----SHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYA 1580
             +    G       AA P      S  + EEP  L S+ GE  T    ++V  HSPD   
Sbjct: 431  TAEGKHGQT----RAAEPSFEEKVSPDELEEPGYLGSKLGE-TTALANMEVQQHSPDLLG 485

Query: 1579 EEKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1400
            E+   +NSE    +                                              
Sbjct: 486  EKMGSDNSEAQAGSSSNSGSDSDSESGSSDSASDSGSHSKSRSRSRSPVGSGSGSSSDSE 545

Query: 1399 XXXXXXXXXXXDEEVDIMTSDDDKEPKDKLQ--TPGQLPASSIPWRA-DGLLVQNKPDEM 1229
                       DE+VDIMTSDD+KE + KLQ   PG   +S IPW++  G  VQ+   E 
Sbjct: 546  SDASSNSKEGSDEDVDIMTSDDEKEAQRKLQPSEPG-FSSSPIPWKSTGGKTVQSGIAEK 604

Query: 1228 EDFHASEVVEIMENSPGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPESRVYP 1049
            +D   S+ VEI ++ P   H +E+      + +KEG K  +  K  + +   +PE   Y 
Sbjct: 605  QDDQGSDAVEIEKDFPNVEHGTEMAVETRPIPDKEGGKWIEETKPFSPDHDDVPEHPNYA 664

Query: 1048 ENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKA------GS 887
             +    R+   +D F+HEQSD  +R  +GK KR SD KH  + +  ++++KA        
Sbjct: 665  ISSSTERENAVKDSFKHEQSDSSERTLKGKPKRGSDVKHFKEKSDGAKRLKADVQPSISG 724

Query: 886  VTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD-RTDFSVHRTYNQSN 710
            V +A  SE ++ LS         DR    PYK      T +AV+D   D  + + YNQ  
Sbjct: 725  VRDAQFSESSHNLSP--------DRFGGNPYKGPTILATNRAVRDGNPDIVLQKGYNQVF 776

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKE 530
            PG++ SDFQ SG R  D S RTK     ++P   A+   S  K +ER++Q      + K+
Sbjct: 777  PGRSSSDFQQSGRRSFDKSTRTKVPDTAEKPDERAE---SGRKYSERNVQ---SFSVQKD 830

Query: 529  KVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGE---FSGKMKEVGSFPN 362
            K+ RD  ++ GY +EK+  KN +E G+  K     DS+ RK GE   FS  +  +GS P 
Sbjct: 831  KLYRDTPND-GYANEKKVSKNFKEGGSGGKQSVPFDSHYRKHGEGGQFSSSL--MGSSPR 887

Query: 361  SHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGS 182
             +++        + ++ SP++N +G IL+RE S+LELGELR+P PE  P V K+ + K S
Sbjct: 888  DNIT--------TGVNVSPVVNGKGSILQRELSDLELGELREPWPEETP-VKKKLESKNS 938

Query: 181  FKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRS 17
            FKQS+NK  S D WN D SKGKP G+T  +  KPSPPN + G+  N +GS+KKR+
Sbjct: 939  FKQSDNKPCSSDNWNSDSSKGKPVGKTTSESGKPSPPNLNSGLCSNLEGSNKKRN 993


>XP_018843336.1 PREDICTED: dentin sialophosphoprotein isoform X1 [Juglans regia]
          Length = 1216

 Score =  657 bits (1694), Expect = 0.0
 Identities = 425/955 (44%), Positives = 562/955 (58%), Gaps = 28/955 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN P  FAMIIRL PDLVDEIKR+EAQ G+ARIKFD  A N+SGNVIDVGG
Sbjct: 73   VEENFSLVPGNNPPAFAMIIRLAPDLVDEIKRLEAQAGTARIKFDVTASNSSGNVIDVGG 132

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+F FT SRE GDLCDIYEE QSGEDGNGLLVESGCAW KLNV+RVLDEST  HVKM SE
Sbjct: 133  KEFSFTCSRETGDLCDIYEECQSGEDGNGLLVESGCAWWKLNVKRVLDESTTKHVKMMSE 192

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--G 2267
            EAERK K+R+AI+LD GNP  K+Q+K +AA E N+W+ P+KQK EP +KKRK EPP   G
Sbjct: 193  EAERKLKARKAIVLDPGNPPSKSQIKEIAAVETNAWR-PYKQKKEPAFKKRKVEPPQVGG 251

Query: 2266 PPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQAT 2090
              KSA+K GL S + +K R ++  L S PEQ            +++ +A++ED M +Q  
Sbjct: 252  LHKSAYKSGLSSTTTAKSRQTSPSLLSPPEQSSLPAAPLENANISKSHATIEDPMPSQLI 311

Query: 2089 GKEKAS-SSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
             K KA+ +SEK  P++ TT  + + P RK N+G KP DL+SMLI+LLTENP+GMSLKALE
Sbjct: 312  SKVKAAPTSEKEIPTK-TTNVVRETPGRKRNVGAKPMDLQSMLINLLTENPKGMSLKALE 370

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN--QPP 1745
            K++G+  PNS ++IEPII+KIAT+QAPGRY LK   ELE+LKKP+SE  SSPE+N  Q P
Sbjct: 371  KAIGDTVPNSVKKIEPIIKKIATFQAPGRYFLKPRVELETLKKPLSESGSSPEDNSHQMP 430

Query: 1744 VSGNDRGDACDPKNAASPK-----SHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYA 1580
             +    G       AA P      S  + EEP  L S+ GE  T    ++V  HSPD   
Sbjct: 431  TAEGKHGQT----RAAEPSFEEKVSPDELEEPGYLGSKLGE-TTALANMEVQQHSPDLLG 485

Query: 1579 EEKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1400
            E+   +NSE    +                                              
Sbjct: 486  EKMGSDNSEAQAGSSSNSGSDSDSESGSSDSASDSGSHSKSRSRSRSPVGSGSGSSSDSE 545

Query: 1399 XXXXXXXXXXXDEEVDIMTSDDDKEPKDKLQ--TPGQLPASSIPWRA-DGLLVQNKPDEM 1229
                       DE+VDIMTSDD+KE + KLQ   PG   +S IPW++  G  VQ+   E 
Sbjct: 546  SDASSNSKEGSDEDVDIMTSDDEKEAQRKLQPSEPG-FSSSPIPWKSTGGKTVQSGIAEK 604

Query: 1228 EDFHASEVVEIMENSPGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPESRVYP 1049
            +D   S+ VEI ++ P   H +E+      + +KEG K  +  K  + +   +PE   Y 
Sbjct: 605  QDDQGSDAVEIEKDFPNVEHGTEMAVETRPIPDKEGGKWIEETKPFSPDHDDVPEHPNYA 664

Query: 1048 ENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKA------GS 887
             +    R+   +D F+HEQSD  +R  +GK KR SD KH  + +  ++++KA        
Sbjct: 665  ISSSTERENAVKDSFKHEQSDSSERTLKGKPKRGSDVKHFKEKSDGAKRLKADVQPSISG 724

Query: 886  VTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD-RTDFSVHRTYNQSN 710
            V +A  SE ++ LS         DR    PYK      T +AV+D   D  + + YNQ  
Sbjct: 725  VRDAQFSESSHNLSP--------DRFGGNPYKGPTILATNRAVRDGNPDIVLQKGYNQVF 776

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKE 530
            PG++ SDFQ SG R  D S RTK     ++P   A+   S  K +ER++Q      + K+
Sbjct: 777  PGRSSSDFQQSGRRSFDKSTRTKVPDTAEKPDERAE---SGRKYSERNVQ---SFSVQKD 830

Query: 529  KVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGE---FSGKMKEVGSFPN 362
            K+ RD  ++ GY +EK+  KN +E G+  K     DS+ RK GE   FS  +  +GS P 
Sbjct: 831  KLYRDTPND-GYANEKKVSKNFKEGGSGGKQSVPFDSHYRKHGEGGQFSSSL--MGSSPR 887

Query: 361  SHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGS 182
             +++        + ++ SP++N +G IL+RE S+LELGELR+P PE  P V K+ + K S
Sbjct: 888  DNIT--------TGVNVSPVVNGKGSILQRELSDLELGELREPWPEETP-VKKKLESKNS 938

Query: 181  FKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRS 17
            FKQS+NK  S D WN D SKGKP G+T  +  KPSPPN + G+  N +GS+KKR+
Sbjct: 939  FKQSDNKPCSSDNWNSDSSKGKPVGKTTSESGKPSPPNLNSGLCSNLEGSNKKRN 993


>XP_011008457.1 PREDICTED: dentin sialophosphoprotein isoform X3 [Populus euphratica]
          Length = 1244

 Score =  650 bits (1677), Expect = 0.0
 Identities = 414/960 (43%), Positives = 551/960 (57%), Gaps = 30/960 (3%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN PL FAMIIRL PDLVDEI+R+EAQGG+ARIKF + A N  GNVIDVGG
Sbjct: 67   VEETFSLIPGNNPLAFAMIIRLAPDLVDEIRRIEAQGGTARIKFGSMANNPDGNVIDVGG 126

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE+GDLCDIYEERQ G DGNGLLVESGCAWRK+NVQRVLDESTKNHVKM SE
Sbjct: 127  KEFRFTWSRELGDLCDIYEERQGGVDGNGLLVESGCAWRKVNVQRVLDESTKNHVKMLSE 186

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LD GNP+ K+Q+K LAA E+N W   FK+KIEP +KKRK EPP    
Sbjct: 187  EAERKFKSRKAIVLDQGNPAAKSQIKQLAAVESNPW---FKRKIEPSFKKRKVEPPQVGG 243

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             G PK+ +K  L S +  KGR S SPLPS PE             +T+ + S E+ + TQ
Sbjct: 244  GGFPKTTYKPALPSTAIVKGRLS-SPLPSPPEHSGAPASPFGTGSITKHHVSTEEYIPTQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE A+SSE   P++   +AL + P RK NLGVK  DL+SML++LL +NP+GMSLKAL
Sbjct: 303  MKNKENAASSENEIPAKF-NSALWETPGRKGNLGVKAMDLQSMLVNLLIQNPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+V    PNS ++IEPII+KIA +QAPGRYILK   E E +KKP SE  SSPE+N    
Sbjct: 362  EKAVSGTIPNSAKKIEPIIKKIANFQAPGRYILKPGMESEKVKKPSSESGSSPEDNHQQA 421

Query: 1741 SGNDRGDAC----DPKNAASPKS-HAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAE 1577
               +  D C    DP+   + K+    ++E V  NS+ GE     EK+D+   SPD + E
Sbjct: 422  HATE--DNCCQRPDPEPRFAEKNPSVASKELVRSNSKLGEESNALEKLDIDQSSPDLFGE 479

Query: 1576 EKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1397
            +KV +NSEG   +                                               
Sbjct: 480  KKVSDNSEGQAGSSSDSGSDSDSESDSSDSGSDSGSRSRSRSRSPVGSGTGSSSDSESDA 539

Query: 1396 XXXXXXXXXXDEEVDIMTSDDDKEPKDKLQTPGQLPASSIPWRA-----DGLLVQNKPDE 1232
                      D ++     D +   K +   PG L AS  PWR+     D  L  NK   
Sbjct: 540  SSNSKQGSDEDVDIMTSDDDKEPRHKLQTAEPGLL-ASPDPWRSVPNGIDKKLDGNKSAA 598

Query: 1231 ME-----------DFHASEVVEIMENSPGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAG 1085
            ++           + H S+ +E+ ++  G     +I  ++  V++KEGEK  Q  +++  
Sbjct: 599  VDIEGHESDAIEIEGHESDAIEVDKDLAGDEKDIKITKNDSLVTSKEGEKPLQGPESTFH 658

Query: 1084 NSSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSE 905
            +  ++ E +++  NL    D   RD FRHEQSD   R S+ KSKR  D K  D  +   +
Sbjct: 659  DHDMIQERQMFIGNLFDDDDNMVRDSFRHEQSDSSDRTSKSKSKRGLDAKPFDSKSERVK 718

Query: 904  KVKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKDRTDFSVHRT 725
            ++KA S++  P S+  +   SGSP     D+  +  YK     +  +A K  +DF   + 
Sbjct: 719  RLKAESLSRVPTSKGRDTQFSGSPH----DKHNEDMYKGPAIQVMDRADKQASDFGSEKL 774

Query: 724  YNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGL 545
            YNQ+  GK+  DFQ SG R  D + R K   A  R   +A+  G + K  E+   +    
Sbjct: 775  YNQAISGKSNPDFQQSGRRSSDQNARLKAQEAASR-SKHAEGSGISCKFPEKGSFVHEVF 833

Query: 544  PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFSGKMKEVGSF 368
             +H+EK +RD Q+E+ ++ EK+ P NS+E GA  K+  + DS+ RK+GE  G+ K+ G  
Sbjct: 834  SIHREKASRDTQNEDTFSKEKKVPINSKEGGAGGKHSASFDSHYRKQGEAFGRPKDPGQI 893

Query: 367  PNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRK 188
             NS+  +     +R+DM++  + + RG  L RE S+LELGELR+PL E  P V K+F+RK
Sbjct: 894  SNSNFGFSPKDSNRADMEKHRVASGRG--LHRELSDLELGELREPLLEETP-VKKRFERK 950

Query: 187  GSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRSPGH 8
            GSFK SENK ++ D  N D+ KGK  G+  +D  KPS PN   GV         KRSP H
Sbjct: 951  GSFKHSENKSSTSDNCNSDIHKGKSIGKVSLDSGKPS-PNLSAGV---------KRSPEH 1000


>XP_011008456.1 PREDICTED: dentin sialophosphoprotein isoform X2 [Populus euphratica]
          Length = 1253

 Score =  644 bits (1661), Expect = 0.0
 Identities = 415/969 (42%), Positives = 551/969 (56%), Gaps = 39/969 (4%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN PL FAMIIRL PDLVDEI+R+EAQGG+ARIKF + A N  GNVIDVGG
Sbjct: 67   VEETFSLIPGNNPLAFAMIIRLAPDLVDEIRRIEAQGGTARIKFGSMANNPDGNVIDVGG 126

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE+GDLCDIYEERQ G DGNGLLVESGCAWRK+NVQRVLDESTKNHVKM SE
Sbjct: 127  KEFRFTWSRELGDLCDIYEERQGGVDGNGLLVESGCAWRKVNVQRVLDESTKNHVKMLSE 186

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPPGP- 2264
            EAERK KSR+AI+LD GNP+ K+Q+K LAA E+N W   FK+KIEP +KKRK EPP G  
Sbjct: 187  EAERKFKSRKAIVLDQGNPAAKSQIKQLAAVESNPW---FKRKIEPSFKKRKVEPPQGGG 243

Query: 2263 --PKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQA 2093
              PK+ +K  L S +  KGR S SPLPS PE             +T+ + S E+ + TQ 
Sbjct: 244  GFPKTTYKPALPSTAIVKGRLS-SPLPSPPEHSGAPASPFGTGSITKHHVSTEEYIPTQM 302

Query: 2092 TGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
              KE A+SSE   P++   +AL + P RK NLGVK  DL+SML++LL +NP+GMSLKALE
Sbjct: 303  KNKENAASSENEIPAKF-NSALWETPGRKGNLGVKAMDLQSMLVNLLIQNPKGMSLKALE 361

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPVS 1739
            K+V    PNS ++IEPII+KIA +QAPGRYILK   E E +KKP SE  SSPE+N     
Sbjct: 362  KAVSGTIPNSAKKIEPIIKKIANFQAPGRYILKPGMESEKVKKPSSESGSSPEDNHQQAH 421

Query: 1738 GNDRGDAC----DPKNAASPKS-HAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEE 1574
              +  D C    DP+   + K+    ++E V  NS+ GE     EK+D+   SPD + E+
Sbjct: 422  ATE--DNCCQRPDPEPRFAEKNPSVASKELVRSNSKLGEESNALEKLDIDQSSPDLFGEK 479

Query: 1573 KVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1394
            KV +NSEG   +                                                
Sbjct: 480  KVSDNSEGQAGSSSDSGSDSDSESDSSDSGSDSGSRSRSRSRSPVGSGTGSSSDSESDAS 539

Query: 1393 XXXXXXXXXDEEVDIMTSDDDKEPKDKLQTPGQLPASSIPWRA-----DGLLVQNKPDEM 1229
                     D ++     D +   K +   PG L AS  PWR+     D  L  NK   +
Sbjct: 540  SNSKQGSDEDVDIMTSDDDKEPRHKLQTAEPGLL-ASPDPWRSVPNGIDKKLDGNKSAAV 598

Query: 1228 E-----------DFHASEVVEIM----------ENSPGYAHKSEIDPSNGSVSNKEGEKH 1112
            +           + H S+ +EI           ++  G     +I  ++  V++KEGEK 
Sbjct: 599  DIEGHGSAAVDIEGHESDAIEIEGHESDAIEVDKDLAGDEKDIKITKNDSLVTSKEGEKP 658

Query: 1111 AQVIKASAGNSSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKH 932
             Q  +++  +  ++ E +++  NL    D   RD FRHEQSD   R S+ KSKR  D K 
Sbjct: 659  LQGPESTFHDHDMIQERQMFIGNLFDDDDNMVRDSFRHEQSDSSDRTSKSKSKRGLDAKP 718

Query: 931  SDDYAAHSEKVKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD 752
             D  +   +++KA S++  P S+  +   SGSP     D+  +  YK     +  +A K 
Sbjct: 719  FDSKSERVKRLKAESLSRVPTSKGRDTQFSGSPH----DKHNEDMYKGPAIQVMDRADKQ 774

Query: 751  RTDFSVHRTYNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTE 572
             +DF   + YNQ+  GK+  DFQ SG R  D + R K   A  R   +A+  G + K  E
Sbjct: 775  ASDFGSEKLYNQAISGKSNPDFQQSGRRSSDQNARLKAQEAASR-SKHAEGSGISCKFPE 833

Query: 571  RSIQMPAGLPLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFS 395
            +   +     +H+EK +RD Q+E+ ++ EK+ P NS+E GA  K+  + DS+ RK+GE  
Sbjct: 834  KGSFVHEVFSIHREKASRDTQNEDTFSKEKKVPINSKEGGAGGKHSASFDSHYRKQGEAF 893

Query: 394  GKMKEVGSFPNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAP 215
            G+ K+ G   NS+  +     +R+DM++  + + RG  L RE S+LELGELR+PL E  P
Sbjct: 894  GRPKDPGQISNSNFGFSPKDSNRADMEKHRVASGRG--LHRELSDLELGELREPLLEETP 951

Query: 214  GVAKQFDRKGSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKG 35
             V K+F+RKGSFK SENK ++ D  N D+ KGK  G+  +D  KPS PN   GV      
Sbjct: 952  -VKKRFERKGSFKHSENKSSTSDNCNSDIHKGKSIGKVSLDSGKPS-PNLSAGV------ 1003

Query: 34   SSKKRSPGH 8
               KRSP H
Sbjct: 1004 ---KRSPEH 1009


>XP_011008458.1 PREDICTED: dentin sialophosphoprotein isoform X4 [Populus euphratica]
          Length = 1239

 Score =  643 bits (1658), Expect = 0.0
 Identities = 415/970 (42%), Positives = 551/970 (56%), Gaps = 40/970 (4%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN PL FAMIIRL PDLVDEI+R+EAQGG+ARIKF + A N  GNVIDVGG
Sbjct: 67   VEETFSLIPGNNPLAFAMIIRLAPDLVDEIRRIEAQGGTARIKFGSMANNPDGNVIDVGG 126

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE+GDLCDIYEERQ G DGNGLLVESGCAWRK+NVQRVLDESTKNHVKM SE
Sbjct: 127  KEFRFTWSRELGDLCDIYEERQGGVDGNGLLVESGCAWRKVNVQRVLDESTKNHVKMLSE 186

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LD GNP+ K+Q+K LAA E+N W   FK+KIEP +KKRK EPP    
Sbjct: 187  EAERKFKSRKAIVLDQGNPAAKSQIKQLAAVESNPW---FKRKIEPSFKKRKVEPPQVGG 243

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             G PK+ +K  L S +  KGR S SPLPS PE             +T+ + S E+ + TQ
Sbjct: 244  GGFPKTTYKPALPSTAIVKGRLS-SPLPSPPEHSGAPASPFGTGSITKHHVSTEEYIPTQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE A+SSE   P++   +AL + P RK NLGVK  DL+SML++LL +NP+GMSLKAL
Sbjct: 303  MKNKENAASSENEIPAKF-NSALWETPGRKGNLGVKAMDLQSMLVNLLIQNPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+V    PNS ++IEPII+KIA +QAPGRYILK   E E +KKP SE  SSPE+N    
Sbjct: 362  EKAVSGTIPNSAKKIEPIIKKIANFQAPGRYILKPGMESEKVKKPSSESGSSPEDNHQQA 421

Query: 1741 SGNDRGDAC----DPKNAASPKS-HAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAE 1577
               +  D C    DP+   + K+    ++E V  NS+ GE     EK+D+   SPD + E
Sbjct: 422  HATE--DNCCQRPDPEPRFAEKNPSVASKELVRSNSKLGEESNALEKLDIDQSSPDLFGE 479

Query: 1576 EKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1397
            +KV +NSEG   +                                               
Sbjct: 480  KKVSDNSEGQAGSSSDSGSDSDSESDSSDSGSDSGSRSRSRSRSPVGSGTGSSSDSESDA 539

Query: 1396 XXXXXXXXXXDEEVDIMTSDDDKEPKDKLQTPGQLPASSIPWRA-----DGLLVQNKPDE 1232
                      D ++     D +   K +   PG L AS  PWR+     D  L  NK   
Sbjct: 540  SSNSKQGSDEDVDIMTSDDDKEPRHKLQTAEPGLL-ASPDPWRSVPNGIDKKLDGNKSAA 598

Query: 1231 ME-----------DFHASEVVEIM----------ENSPGYAHKSEIDPSNGSVSNKEGEK 1115
            ++           + H S+ +EI           ++  G     +I  ++  V++KEGEK
Sbjct: 599  VDIEGHGSAAVDIEGHESDAIEIEGHESDAIEVDKDLAGDEKDIKITKNDSLVTSKEGEK 658

Query: 1114 HAQVIKASAGNSSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGK 935
              Q  +++  +  ++ E +++  NL    D   RD FRHEQSD   R S+ KSKR  D K
Sbjct: 659  PLQGPESTFHDHDMIQERQMFIGNLFDDDDNMVRDSFRHEQSDSSDRTSKSKSKRGLDAK 718

Query: 934  HSDDYAAHSEKVKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK 755
              D  +   +++KA S++  P S+  +   SGSP     D+  +  YK     +  +A K
Sbjct: 719  PFDSKSERVKRLKAESLSRVPTSKGRDTQFSGSPH----DKHNEDMYKGPAIQVMDRADK 774

Query: 754  DRTDFSVHRTYNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKST 575
              +DF   + YNQ+  GK+  DFQ SG R  D + R K   A  R   +A+  G + K  
Sbjct: 775  QASDFGSEKLYNQAISGKSNPDFQQSGRRSSDQNARLKAQEAASR-SKHAEGSGISCKFP 833

Query: 574  ERSIQMPAGLPLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEF 398
            E+   +     +H+EK +RD Q+E+ ++ EK+ P NS+E GA  K+  + DS+ RK+GE 
Sbjct: 834  EKGSFVHEVFSIHREKASRDTQNEDTFSKEKKVPINSKEGGAGGKHSASFDSHYRKQGEA 893

Query: 397  SGKMKEVGSFPNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAA 218
             G+ K+ G   NS+  +     +R+DM++  + + RG  L RE S+LELGELR+PL E  
Sbjct: 894  FGRPKDPGQISNSNFGFSPKDSNRADMEKHRVASGRG--LHRELSDLELGELREPLLEET 951

Query: 217  PGVAKQFDRKGSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPK 38
            P V K+F+RKGSFK SENK ++ D  N D+ KGK  G+  +D  KPS PN   GV     
Sbjct: 952  P-VKKRFERKGSFKHSENKSSTSDNCNSDIHKGKSIGKVSLDSGKPS-PNLSAGV----- 1004

Query: 37   GSSKKRSPGH 8
                KRSP H
Sbjct: 1005 ----KRSPEH 1010


>XP_011008455.1 PREDICTED: dentin sialophosphoprotein isoform X1 [Populus euphratica]
          Length = 1254

 Score =  643 bits (1658), Expect = 0.0
 Identities = 415/970 (42%), Positives = 551/970 (56%), Gaps = 40/970 (4%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEE++SL  GN PL FAMIIRL PDLVDEI+R+EAQGG+ARIKF + A N  GNVIDVGG
Sbjct: 67   VEETFSLIPGNNPLAFAMIIRLAPDLVDEIRRIEAQGGTARIKFGSMANNPDGNVIDVGG 126

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWSRE+GDLCDIYEERQ G DGNGLLVESGCAWRK+NVQRVLDESTKNHVKM SE
Sbjct: 127  KEFRFTWSRELGDLCDIYEERQGGVDGNGLLVESGCAWRKVNVQRVLDESTKNHVKMLSE 186

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            EAERK KSR+AI+LD GNP+ K+Q+K LAA E+N W   FK+KIEP +KKRK EPP    
Sbjct: 187  EAERKFKSRKAIVLDQGNPAAKSQIKQLAAVESNPW---FKRKIEPSFKKRKVEPPQVGG 243

Query: 2269 -GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             G PK+ +K  L S +  KGR S SPLPS PE             +T+ + S E+ + TQ
Sbjct: 244  GGFPKTTYKPALPSTAIVKGRLS-SPLPSPPEHSGAPASPFGTGSITKHHVSTEEYIPTQ 302

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE A+SSE   P++   +AL + P RK NLGVK  DL+SML++LL +NP+GMSLKAL
Sbjct: 303  MKNKENAASSENEIPAKF-NSALWETPGRKGNLGVKAMDLQSMLVNLLIQNPKGMSLKAL 361

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENNQPPV 1742
            EK+V    PNS ++IEPII+KIA +QAPGRYILK   E E +KKP SE  SSPE+N    
Sbjct: 362  EKAVSGTIPNSAKKIEPIIKKIANFQAPGRYILKPGMESEKVKKPSSESGSSPEDNHQQA 421

Query: 1741 SGNDRGDAC----DPKNAASPKS-HAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAE 1577
               +  D C    DP+   + K+    ++E V  NS+ GE     EK+D+   SPD + E
Sbjct: 422  HATE--DNCCQRPDPEPRFAEKNPSVASKELVRSNSKLGEESNALEKLDIDQSSPDLFGE 479

Query: 1576 EKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1397
            +KV +NSEG   +                                               
Sbjct: 480  KKVSDNSEGQAGSSSDSGSDSDSESDSSDSGSDSGSRSRSRSRSPVGSGTGSSSDSESDA 539

Query: 1396 XXXXXXXXXXDEEVDIMTSDDDKEPKDKLQTPGQLPASSIPWRA-----DGLLVQNKPDE 1232
                      D ++     D +   K +   PG L AS  PWR+     D  L  NK   
Sbjct: 540  SSNSKQGSDEDVDIMTSDDDKEPRHKLQTAEPGLL-ASPDPWRSVPNGIDKKLDGNKSAA 598

Query: 1231 ME-----------DFHASEVVEIM----------ENSPGYAHKSEIDPSNGSVSNKEGEK 1115
            ++           + H S+ +EI           ++  G     +I  ++  V++KEGEK
Sbjct: 599  VDIEGHGSAAVDIEGHESDAIEIEGHESDAIEVDKDLAGDEKDIKITKNDSLVTSKEGEK 658

Query: 1114 HAQVIKASAGNSSVLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGK 935
              Q  +++  +  ++ E +++  NL    D   RD FRHEQSD   R S+ KSKR  D K
Sbjct: 659  PLQGPESTFHDHDMIQERQMFIGNLFDDDDNMVRDSFRHEQSDSSDRTSKSKSKRGLDAK 718

Query: 934  HSDDYAAHSEKVKAGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVK 755
              D  +   +++KA S++  P S+  +   SGSP     D+  +  YK     +  +A K
Sbjct: 719  PFDSKSERVKRLKAESLSRVPTSKGRDTQFSGSPH----DKHNEDMYKGPAIQVMDRADK 774

Query: 754  DRTDFSVHRTYNQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKST 575
              +DF   + YNQ+  GK+  DFQ SG R  D + R K   A  R   +A+  G + K  
Sbjct: 775  QASDFGSEKLYNQAISGKSNPDFQQSGRRSSDQNARLKAQEAASR-SKHAEGSGISCKFP 833

Query: 574  ERSIQMPAGLPLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEF 398
            E+   +     +H+EK +RD Q+E+ ++ EK+ P NS+E GA  K+  + DS+ RK+GE 
Sbjct: 834  EKGSFVHEVFSIHREKASRDTQNEDTFSKEKKVPINSKEGGAGGKHSASFDSHYRKQGEA 893

Query: 397  SGKMKEVGSFPNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAA 218
             G+ K+ G   NS+  +     +R+DM++  + + RG  L RE S+LELGELR+PL E  
Sbjct: 894  FGRPKDPGQISNSNFGFSPKDSNRADMEKHRVASGRG--LHRELSDLELGELREPLLEET 951

Query: 217  PGVAKQFDRKGSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPK 38
            P V K+F+RKGSFK SENK ++ D  N D+ KGK  G+  +D  KPS PN   GV     
Sbjct: 952  P-VKKRFERKGSFKHSENKSSTSDNCNSDIHKGKSIGKVSLDSGKPS-PNLSAGV----- 1004

Query: 37   GSSKKRSPGH 8
                KRSP H
Sbjct: 1005 ----KRSPEH 1010


>XP_017606848.1 PREDICTED: dentin sialophosphoprotein [Gossypium arboreum]
          Length = 1225

 Score =  630 bits (1626), Expect = 0.0
 Identities = 412/955 (43%), Positives = 544/955 (56%), Gaps = 24/955 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQG +ARIKFD+   + +GNVIDVGG
Sbjct: 65   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGETARIKFDSIPTHPTGNVIDVGG 124

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWS E GDLCDIYEERQ GEDGNGLLVESGCAWRKLNVQRVLDEST NHVKMRSE
Sbjct: 125  KEFRFTWSPEFGDLCDIYEERQMGEDGNGLLVESGCAWRKLNVQRVLDESTTNHVKMRSE 184

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            E ERK KSR+AIILDHGNPSMKNQ+K + AAEA+ WK  FK+K E   KKRK  P     
Sbjct: 185  EFERKLKSRKAIILDHGNPSMKNQIKQMVAAEASPWKSHFKKK-ELAIKKRKETPQAAVG 243

Query: 2269 GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQA 2093
            GPPKS ++ GL S + +KGR S+SPLPS  E+             T+ +A  ED M +  
Sbjct: 244  GPPKSGYRPGLVSAAPAKGRRSSSPLPSPLERSDAAVSPSGLGNTTKTHAGSEDVMPSLV 303

Query: 2092 TGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
              KE  S S+K  PSR  T+A  +   R+ N G KPTDL+S+LISLL ENP+GMSLKALE
Sbjct: 304  KSKESISISDKEIPSR-ATSAGREMQERRGNFGPKPTDLKSLLISLLKENPKGMSLKALE 362

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN--QPP 1745
            K+VG+  PNS R+IEPI++KIAT+ APGRY LK   ELESLKK  SE  SSPE N  + P
Sbjct: 363  KAVGDTIPNSARKIEPILKKIATFHAPGRYFLKPGVELESLKKSSSESGSSPEGNRHEAP 422

Query: 1744 VSGNDRGDACDPKNAASPK-SHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKV 1568
                ++     P +  + K +H + EE  +L+S+        E+ID+   SPD   E K 
Sbjct: 423  APEENQDQTLAPVSFLAEKITHDEVEEQTHLDSKLTVGSDPMEQIDIQQLSPDLGGERKT 482

Query: 1567 PENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1388
             +NSEG   +                                                  
Sbjct: 483  SDNSEGQANSASDSGSDSDSDSDSSDSGSDSGSHSRSRSRSASPAASGSGSSSDSETDAS 542

Query: 1387 XXXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRA--DGLLVQNKPDEMEDF 1220
                   DE+VDIMTSDDDKE K  + T  PG L  S IPW+A  DG L  N  D  +D 
Sbjct: 543  SNSKEGSDEDVDIMTSDDDKETKQDMLTSEPGLL-TSPIPWQAELDGSL-HNGMDGNQDD 600

Query: 1219 HASEVVEIMENSPGY--------AHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPE 1064
              S  V+I    PG           + E +    + +NKEGEKH +  K S+ +     E
Sbjct: 601  DGSYAVDI--EGPGSDAVDIEKDLPEDEQEIGMAANTNKEGEKHKEGTKPSSFDLDEFQE 658

Query: 1063 SRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGSV 884
             + +  NL    +   ++  R+EQSD  ++  + KSKR SD KH D+ +  S+++K+ S+
Sbjct: 659  RQNFIGNLFDDTENIVKNSVRNEQSDYPEKSGKAKSKRGSDLKHIDEKSERSKRLKSESM 718

Query: 883  TEAPM--SEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKDRTDFSVHRTYNQSN 710
            ++ P+  S D    +S         R +D  Y++     + K  ++  DF          
Sbjct: 719  SQPPVSGSRDAEFFAS--------SRSVDDSYQS-----SNKGDREHADFQKENIL--VF 763

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKE 530
            P K+ +DF  SG R  D   R K  +  +RP  + ++LG  +K  E+++    G  + KE
Sbjct: 764  PQKSSTDFHQSGRRSSDQGARAKAVNTAERPLKHTESLGHGSKFAEKNVH--EGYIIQKE 821

Query: 529  KVNRDKQSENGYNDEKRPPKNSREGAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHVS 350
               RD Q+E+G   EK+   N++     KN   +D + RK GE  GK K+ G    S+++
Sbjct: 822  NPTRDAQNEDGVMKEKKLSTNTKNSG-GKNAVPSDFHHRKHGETFGKPKDSGQNSGSYIN 880

Query: 349  YPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQS 170
                  S+ + DR P  N +  +L+RE S LELGE+R+PL +  P V KQF++KGSFKQS
Sbjct: 881  SSPKDNSKVNEDRYP-ANGKSNVLQRELSHLELGEIREPLIDETP-VKKQFEKKGSFKQS 938

Query: 169  ENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRSPGHY 5
             ++ ++ +Y+NPDLS+GKPAG+T  D  KP  PN           S  KR+P H+
Sbjct: 939  GSRASTSEYFNPDLSRGKPAGKTNWDSGKPPSPNL----------SGLKRTPEHH 983


>XP_016730003.1 PREDICTED: dentin sialophosphoprotein-like [Gossypium hirsutum]
          Length = 1225

 Score =  626 bits (1615), Expect = 0.0
 Identities = 410/954 (42%), Positives = 541/954 (56%), Gaps = 23/954 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQG +ARIKFD+   + +GNVIDVGG
Sbjct: 65   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGETARIKFDSIPTHPTGNVIDVGG 124

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWS E GDLCDIYEERQ GEDGNGLLVESGCAWRKLNVQRVLDEST NHVKMRSE
Sbjct: 125  KEFRFTWSPEFGDLCDIYEERQMGEDGNGLLVESGCAWRKLNVQRVLDESTTNHVKMRSE 184

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            E ERK KSR+AIILDHGNPSMKNQ+K + AAEA+ WK  FK+K E   KKRK  P     
Sbjct: 185  EFERKLKSRKAIILDHGNPSMKNQIKQMVAAEASPWKSHFKKK-ELALKKRKETPQAAVG 243

Query: 2269 GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQA 2093
            GPPKS ++ GL S + +KGR S+SPLPS  E+             T+ +A  ED M +  
Sbjct: 244  GPPKSGYRPGLVSAAPAKGRLSSSPLPSPLERSDAAVSPSGLGNTTKTHAGSEDVMPSLV 303

Query: 2092 TGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
              KE  S S+K  PSR  T+A  +   R+ N G KPTDL+S+LISLL ENP+GMSLKALE
Sbjct: 304  KSKESISISDKEIPSR-ATSAGREMQERRGNFGPKPTDLKSLLISLLKENPKGMSLKALE 362

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN--QPP 1745
            K+VG+  PNS R+IEPI++KIAT+ APGRY LK   ELESLKK  SE  SSPE N  + P
Sbjct: 363  KAVGDTIPNSARKIEPILKKIATFHAPGRYFLKPGVELESLKKSSSESGSSPEGNRHEAP 422

Query: 1744 VSGNDRGDACDPKNAASPK-SHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKV 1568
                ++     P +  + K +H + EE  +L+S+        E+ID+   SPD   E K 
Sbjct: 423  APEENQDQTLAPVSFLAEKITHDEVEEQTHLDSKLTVGSDPMEQIDIQQLSPDLGGERKT 482

Query: 1567 PENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1388
             +NSEG   +                                                  
Sbjct: 483  SDNSEGQANSASDSGSDSDSDSDSSDSGSDSGSHSRSRSRSASPAASGSGSSSDSETDAS 542

Query: 1387 XXXXXXXDEEVDIMTSDDDKEPK-DKLQTPGQLPASSIPWRA--DGLLVQNKPDEMEDFH 1217
                   DE+VDIMTSDDDKE K D L +   L  S IPW+A  DG L  N  D  +D  
Sbjct: 543  SNSKEGSDEDVDIMTSDDDKETKQDMLISEPGLLTSPIPWQAELDGSL-HNGMDGNQDDD 601

Query: 1216 ASEVVEIMENSPGY--------AHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPES 1061
             S  V+I    PG           + E +    + +NKEGEKH +  K S+ +     E 
Sbjct: 602  GSYAVDI--EGPGSDAVDIEKDLPEDEQEIGMAANTNKEGEKHKEGTKPSSSDLDEFQER 659

Query: 1060 RVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGSVT 881
            + +  NL    +   ++  R+EQSD  ++  + KSKR SD KH D+ +  S+++K+ S++
Sbjct: 660  QNFIGNLFDDTENIVKNSVRNEQSDYPEKSGKAKSKRGSDLKHIDEKSERSKRLKSESMS 719

Query: 880  EAPM--SEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKDRTDFSVHRTYNQSNP 707
            + P+  S D    +S         R +D  Y++     + K  ++  DF          P
Sbjct: 720  QPPVSGSRDVEFFAS--------SRSVDDSYQS-----SNKGDREHADFQKENIV--VFP 764

Query: 706  GKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKEK 527
             K+ +DF  SG R  D   R K  +  +RP  + ++ G  +K  E+++    G  + KE 
Sbjct: 765  QKSSTDFHQSGRRSSDQGARAKAVNTAERPLKHTESSGHGSKFAEKNVH--EGYIIQKEN 822

Query: 526  VNRDKQSENGYNDEKRPPKNSREGAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHVSY 347
              RD Q+E+G   EK+   N++     KN   +D + RK GE  GK K+ G    S+V+ 
Sbjct: 823  PTRDAQNEDGVMKEKKLSTNTKNSG-GKNAVPSDFHHRKHGETFGKPKDSGQNSGSYVNS 881

Query: 346  PSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQSE 167
                 S+   DR P  N +  +L++E S LELGE+R+PL +  P V KQF++KGSFKQS 
Sbjct: 882  SPKDNSKVTEDRYP-ANGKSNVLQQELSHLELGEIREPLIDETP-VKKQFEKKGSFKQSG 939

Query: 166  NKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRSPGHY 5
            ++ ++ +Y+NPDLS+GKPAG+T  D  KP  PN           S  KR+P H+
Sbjct: 940  SRASTSEYFNPDLSRGKPAGKTNWDSGKPPSPNL----------SGLKRTPEHH 983


>KHG19117.1 RNA polymerase II elongation factor ELL [Gossypium arboreum]
          Length = 1229

 Score =  625 bits (1611), Expect = 0.0
 Identities = 412/959 (42%), Positives = 544/959 (56%), Gaps = 28/959 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQG +ARIKFD+   + +GNVIDVGG
Sbjct: 65   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGETARIKFDSIPTHPTGNVIDVGG 124

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWS E GDLCDIYEERQ GEDGNGLLVESGCAWRKLNVQRVLDEST NHVKMRSE
Sbjct: 125  KEFRFTWSPEFGDLCDIYEERQMGEDGNGLLVESGCAWRKLNVQRVLDESTTNHVKMRSE 184

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            E ERK KSR+AIILDHGNPSMKNQ+K + AAEA+ WK  FK+K E   KKRK  P     
Sbjct: 185  EFERKLKSRKAIILDHGNPSMKNQIKQMVAAEASPWKSHFKKK-ELAIKKRKETPQAAVG 243

Query: 2269 GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQA 2093
            GPPKS ++ GL S + +KGR S+SPLPS  E+             T+ +A  ED M +  
Sbjct: 244  GPPKSGYRPGLVSAAPAKGRRSSSPLPSPLERSDAAVSPSGLGNTTKTHAGSEDVMPSLV 303

Query: 2092 TGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLK--- 1922
              KE  S S+K  PSR  T+A  +   R+ N G KPTDL+S+LISLL ENP+GMSLK   
Sbjct: 304  KSKESISISDKEIPSR-ATSAGREMQERRGNFGPKPTDLKSLLISLLKENPKGMSLKASA 362

Query: 1921 -ALEKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN- 1754
             ALEK+VG+  PNS R+IEPI++KIAT+ APGRY LK   ELESLKK  SE  SSPE N 
Sbjct: 363  TALEKAVGDTIPNSARKIEPILKKIATFHAPGRYFLKPGVELESLKKSSSESGSSPEGNR 422

Query: 1753 -QPPVSGNDRGDACDPKNAASPK-SHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYA 1580
             + P    ++     P +  + K +H + EE  +L+S+        E+ID+   SPD   
Sbjct: 423  HEAPAPEENQDQTLAPVSFLAEKITHDEVEEQTHLDSKLTVGSDPMEQIDIQQLSPDLGG 482

Query: 1579 EEKVPENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1400
            E K  +NSEG   +                                              
Sbjct: 483  ERKTSDNSEGQANSASDSGSDSDSDSDSSDSGSDSGSHSRSRSRSASPAASGSGSSSDSE 542

Query: 1399 XXXXXXXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRA--DGLLVQNKPDE 1232
                       DE+VDIMTSDDDKE K  + T  PG L  S IPW+A  DG L  N  D 
Sbjct: 543  TDASSNSKEGSDEDVDIMTSDDDKETKQDMLTSEPGLL-TSPIPWQAELDGSL-HNGMDG 600

Query: 1231 MEDFHASEVVEIMENSPGY--------AHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSS 1076
             +D   S  V+I    PG           + E +    + +NKEGEKH +  K S+ +  
Sbjct: 601  NQDDDGSYAVDI--EGPGSDAVDIEKDLPEDEQEIGMAANTNKEGEKHKEGTKPSSFDLD 658

Query: 1075 VLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVK 896
               E + +  NL    +   ++  R+EQSD  ++  + KSKR SD KH D+ +  S+++K
Sbjct: 659  EFQERQNFIGNLFDDTENIVKNSVRNEQSDYPEKSGKAKSKRGSDLKHIDEKSERSKRLK 718

Query: 895  AGSVTEAPM--SEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKDRTDFSVHRTY 722
            + S+++ P+  S D    +S         R +D  Y++     + K  ++  DF      
Sbjct: 719  SESMSQPPVSGSRDAEFFAS--------SRSVDDSYQS-----SNKGDREHADFQKENIL 765

Query: 721  NQSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLP 542
                P K+ +DF  SG R  D   R K  +  +RP  + ++LG  +K  E+++    G  
Sbjct: 766  --VFPQKSSTDFHQSGRRSSDQGARAKAVNTAERPLKHTESLGHGSKFAEKNVH--EGYI 821

Query: 541  LHKEKVNRDKQSENGYNDEKRPPKNSREGAVDKNLTATDSNIRKRGEFSGKMKEVGSFPN 362
            + KE   RD Q+E+G   EK+   N++     KN   +D + RK GE  GK K+ G    
Sbjct: 822  IQKENPTRDAQNEDGVMKEKKLSTNTKNSG-GKNAVPSDFHHRKHGETFGKPKDSGQNSG 880

Query: 361  SHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGS 182
            S+++      S+ + DR P  N +  +L+RE S LELGE+R+PL +  P V KQF++KGS
Sbjct: 881  SYINSSPKDNSKVNEDRYP-ANGKSNVLQRELSHLELGEIREPLIDETP-VKKQFEKKGS 938

Query: 181  FKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRSPGHY 5
            FKQS ++ ++ +Y+NPDLS+GKPAG+T  D  KP  PN           S  KR+P H+
Sbjct: 939  FKQSGSRASTSEYFNPDLSRGKPAGKTNWDSGKPPSPNL----------SGLKRTPEHH 987


>XP_010274312.1 PREDICTED: dentin sialophosphoprotein [Nelumbo nucifera]
          Length = 1250

 Score =  617 bits (1592), Expect = 0.0
 Identities = 390/968 (40%), Positives = 537/968 (55%), Gaps = 41/968 (4%)
 Frame = -3

Query: 2794 EESYSLETGNPLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGGKD 2615
            EE++SL T +PL FAMIIRLTPD+VDEIKRVE+QGG+ARIKFD+N  N SGNVIDVGGK+
Sbjct: 66   EETFSLVTESPLAFAMIIRLTPDIVDEIKRVESQGGTARIKFDSNTNNMSGNVIDVGGKE 125

Query: 2614 FRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSEEA 2435
            FRFTWS+E  +LCDIYEERQ G DGNGLLVESGC+WRKLNVQR+LDESTKNHVKMRSEEA
Sbjct: 126  FRFTWSQEPANLCDIYEERQGGGDGNGLLVESGCSWRKLNVQRILDESTKNHVKMRSEEA 185

Query: 2434 ERKSKSRQAIILDHGNPSMKNQMKALAAA--EANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            ERK +SR++I+LDHGNPS+KN+MK LAAA  + NS  MPFK K EP +KKRK E      
Sbjct: 186  ERKLRSRKSIVLDHGNPSVKNEMKTLAAAAVDVNSRWMPFKSKKEPAFKKRKVESTQVAM 245

Query: 2269 -GPPKSAHKHGLSLSA-SKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQ 2096
             GPPKS  K GLS +A +KGR S SP+PS PEQ            LT+G+ S+E    T 
Sbjct: 246  GGPPKSVFKPGLSSTATAKGRPSVSPVPSPPEQFAASTSPLRSGNLTKGHTSIE-GAITP 304

Query: 2095 ATGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKAL 1916
               KE AS++E+    +    A  +    KV+LG KP DL++ LI+LL ENP+GM+LKAL
Sbjct: 305  VMSKENASNNEREITRKAVHGATREASGSKVSLGDKPMDLQNTLITLLMENPKGMTLKAL 364

Query: 1915 EKSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSESSPENNQPPVSG 1736
            EK+ G+  PN+ R+I+PI+++IAT+QAPGRY+LK   + E+ KKP SES    + P  + 
Sbjct: 365  EKAFGDTIPNAARKIDPILKRIATFQAPGRYLLKQGVDFETFKKPSSES---GSSPECTH 421

Query: 1735 NDRGDACDPKNAASPKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVPENS 1556
            +   D  D      P+  +Q    +  N        + EKID+P +SPD + ++K+ +NS
Sbjct: 422  SPDPDFPDKSGTKEPEQQSQLIPKIESN--------LVEKIDIPQNSPDLFGDKKLSDNS 473

Query: 1555 EGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1376
            EG   +                                                      
Sbjct: 474  EGQAGSSSDSGSDSDSESDSSDSSDSGSQSRSRSKSRSPGGSGSGSSSDSESDGSSSSKE 533

Query: 1375 XXXDEEVDIMTSDDDKEPKDKLQT--------PGQLPASSIPWRADGLLVQNKPDEME-- 1226
                +   + + DD KE + KL T        P Q   SS+P   +G++   +   M   
Sbjct: 534  GSDVDVDIMTSDDD-KEVEHKLPTHEPMFPTSPIQAGTSSLPGE-NGIVEAKQDGNMSDE 591

Query: 1225 ----DFHASEVVEIMENSPGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPESR 1058
                D H +E++EI +  P       ++ +     +++ ++                E++
Sbjct: 592  DITIDDHENEIIEITDPVPSKVEPGSLEETIPFPVSRDKKQ----------------ETQ 635

Query: 1057 VYPENLHRG--RDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGSV 884
             +   +H    R+   +D   +EQSD  +R S+ KSKR SD K+  + +  + + KAG  
Sbjct: 636  QFSHYVHHDDEREGAMKDDLGNEQSDSSERISKVKSKRGSDTKNFHEKSETAGRAKAGGS 695

Query: 883  TEAPMS-EDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD-RTDFSVHRTYNQSN 710
            ++ P+S  +   + SGSP   S DR    PYK      T +  +D   D    + YNQ+ 
Sbjct: 696  SQPPISMRNKETVFSGSPNDSSPDRFSQAPYKDQAVRTTERVGRDGNLDTRSQKGYNQTT 755

Query: 709  PGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERS------------ 566
             G+++ D Q  G R  D+S R K    +D+P  Y +NLG  AK +E+S            
Sbjct: 756  -GRSVVDIQKPGQRSADLSARGKTPDMSDKPSRYVENLGRGAKLSEKSSAFADESEVSVM 814

Query: 565  -IQMPAGL-PLHKEKVNRDKQSENGYNDEKRPPKNSRE-GAVDKNLTATDSNIRKRGEFS 395
                P G+ P+ K+KV+R+ Q  + Y  EK   K  +E G  DK    +DS+ R+ GE  
Sbjct: 815  GSMRPHGISPISKDKVHRETQGGDSYAYEKSQTKGGKESGYGDKAPMLSDSHYRRHGEQI 874

Query: 394  GKMKEVGSFPNSHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLP-EAA 218
             K K+ G    S +       +RSD D+SP++N +  +LRRE S+LELGELR+P+P E A
Sbjct: 875  RKFKD-GHMAYSQMDSSPKDNNRSDADKSPVVNGKSNMLRREYSDLELGELREPIPGEEA 933

Query: 217  PGVAKQFDRKGSFKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPK 38
             GV KQF+RKGSFK S++K  + D W  D SKG+ A +   +  K SPPN    V  N  
Sbjct: 934  KGVKKQFERKGSFKLSDSKQNASDNWTSDSSKGRTAAKAIQESRKQSPPNMRANVFNNQD 993

Query: 37   GSSKKRSP 14
             SS+KR+P
Sbjct: 994  SSSRKRTP 1001


>XP_012445365.1 PREDICTED: dentin sialophosphoprotein [Gossypium raimondii]
            KJB56865.1 hypothetical protein B456_009G139700
            [Gossypium raimondii]
          Length = 1213

 Score =  603 bits (1554), Expect = 0.0
 Identities = 401/953 (42%), Positives = 533/953 (55%), Gaps = 22/953 (2%)
 Frame = -3

Query: 2797 VEESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGG 2621
            VEES+SL +GN PL FAMIIRL PDLV+EI+R+EAQG +ARIKFD+   + +GNVIDVGG
Sbjct: 65   VEESFSLVSGNNPLAFAMIIRLAPDLVEEIRRLEAQGETARIKFDSIPTHPTGNVIDVGG 124

Query: 2620 KDFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSE 2441
            K+FRFTWS E GDLCDIYEERQ GEDGNGLLVESGCAWRKLNVQRVLDEST NHVKMRSE
Sbjct: 125  KEFRFTWSPEFGDLCDIYEERQMGEDGNGLLVESGCAWRKLNVQRVLDESTTNHVKMRSE 184

Query: 2440 EAERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--- 2270
            E ERK KSR+AIILDHGNPSMKNQ+K + AAEA+ WK  FK K+E   KKR   P     
Sbjct: 185  EFERKLKSRKAIILDHGNPSMKNQIKQMVAAEASPWKSHFK-KMELALKKRNDTPQAAAG 243

Query: 2269 GPPKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQA 2093
            GPPKS ++ GL S + +KGR S+SPLPS  E+             T+ +A  ED M +  
Sbjct: 244  GPPKSGYRPGLVSAATAKGRRSSSPLPSPLERSDAAVSPSGLGNTTKTHAGSEDVMPSLV 303

Query: 2092 TGKEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALE 1913
              KE  SSS+K  PSR  ++A  +   R+ N G KPTDL+S+LISLL ENP+GMSLKALE
Sbjct: 304  KSKESISSSDKEIPSR-ASSAGREMQERRGNFGPKPTDLQSLLISLLKENPKGMSLKALE 362

Query: 1912 KSVGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN--QPP 1745
            K+VG+  PNS R+IEPI++KIAT+ APGRY LK   ELESLKK  SE  SSPE N  + P
Sbjct: 363  KAVGDTIPNSARKIEPILKKIATFHAPGRYFLKPGVELESLKKSSSESGSSPEGNRHEAP 422

Query: 1744 VSGNDRGDACDPKNAASPKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVP 1565
                ++     P +  + K+  + EE  +L+S+        E+ID+   SPD   E K  
Sbjct: 423  APEENQDQTLAPVSFLAEKTD-EVEEQTHLDSKLTVASDPMEQIDIQQLSPDLGGERKAS 481

Query: 1564 ENSEGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1385
            +NSEG   +                                                   
Sbjct: 482  DNSEGQANSASDSGSDSDSDSDSSDSGSDSGSHSRSRSRSASPAASGSGSSSDSETDASS 541

Query: 1384 XXXXXXDEEVDIMTSDDDKEPKDKLQT--PGQLPASSIPWRAD-GLLVQNKPDEMEDFHA 1214
                  DE+VDIMTSDDDKE K  + T  PG L  S IPW+A+  + + N  D  +D   
Sbjct: 542  NSKEGSDEDVDIMTSDDDKETKQDMLTSEPGLL-TSPIPWQAEHDMSLHNGMDGNQDDDG 600

Query: 1213 SEVVEIMENSPGY--------AHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSSVLPESR 1058
            S  V+I    PG           + E +    + SNKEGEKH +  K S+ +     E +
Sbjct: 601  SYAVDI--EGPGSDAVDIEKDLPEDEQEIGMAANSNKEGEKHEEGTKPSSSDLDEFQERQ 658

Query: 1057 VYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVKAGSVTE 878
             +  NL    +   ++  R+EQS+  ++  + KSKR SD  H D+ +  S+++K+ S+++
Sbjct: 659  NFIGNLFDDTENIVKNSVRNEQSNYPEKSGKAKSKRGSDLTHIDEKSERSKRLKSESMSQ 718

Query: 877  APM--SEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKDRTDFSVHRTYNQSNPG 704
             P+  S D  L +S         R +D  Y++ +         DR      + Y    P 
Sbjct: 719  PPVSGSRDAELFAS--------SRSVDESYQSSNK-------GDREHADSQKGYILVFPQ 763

Query: 703  KAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPLHKEKV 524
            K+ +DF  SG R  D   R K  +  +RP  + ++ G  +K  E+++    G  + KE  
Sbjct: 764  KSSTDFHQSGRRSSDQGARGKAVNTAERPLKHTESSGHGSKFAEKNVH--EGYIIQKENP 821

Query: 523  NRDKQSENGYNDEKRPPKNSREGAVDKNLTATDSNIRKRGEFSGKMKEVGSFPNSHVSYP 344
             RD Q+E+G   EK+   N++     KN   +D + RK GE  GK K+ G    ++++  
Sbjct: 822  TRDAQNEDGVMKEKKLSTNTK-NIGGKNAVPSDFHHRKHGETFGKPKDSGQNSGTYINSS 880

Query: 343  SNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGSFKQSEN 164
                S+ + DR P  N +  +L+RE S LELGE+R+PL +  P V KQF++KGSFKQS +
Sbjct: 881  PKDNSKVNEDRYP-ANGKSNVLQRELSHLELGEIREPLIDETP-VKKQFEKKGSFKQSGS 938

Query: 163  KLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPDIGVVGNPKGSSKKRSPGHY 5
            +          LS+GKPAG+T  D  KP  PN           S  KR+P H+
Sbjct: 939  R----------LSRGKPAGKTNWDSGKPPSPNL----------SGLKRTPEHH 971


>GAV80268.1 Occludin_ELL domain-containing protein [Cephalotus follicularis]
          Length = 1228

 Score =  603 bits (1555), Expect = 0.0
 Identities = 397/940 (42%), Positives = 526/940 (55%), Gaps = 29/940 (3%)
 Frame = -3

Query: 2794 EESYSLETGN-PLDFAMIIRLTPDLVDEIKRVEAQGGSARIKFDANAKNTSGNVIDVGGK 2618
            EE+YSL +GN PL FAMIIRL PDLV+EIKRVE+ GG+A+IKFD+ A N +GNVI+VGGK
Sbjct: 66   EENYSLVSGNNPLAFAMIIRLAPDLVEEIKRVESLGGNAKIKFDSIATNPNGNVINVGGK 125

Query: 2617 DFRFTWSREMGDLCDIYEERQSGEDGNGLLVESGCAWRKLNVQRVLDESTKNHVKMRSEE 2438
            +FRFTWSRE GDLCDIYEERQSGEDGNGLLVESG +WRKLNVQRVLDESTKNHVK RSEE
Sbjct: 126  EFRFTWSREPGDLCDIYEERQSGEDGNGLLVESGSSWRKLNVQRVLDESTKNHVKKRSEE 185

Query: 2437 AERKSKSRQAIILDHGNPSMKNQMKALAAAEANSWKMPFKQKIEPPYKKRKAEPPP--GP 2264
            AERKSKSR+AI+LDHGNPSMK+Q+K LAA EAN WK  FKQK E P+KKRK +PP    P
Sbjct: 186  AERKSKSRKAIVLDHGNPSMKSQLKQLAAVEANPWK-NFKQKKESPFKKRKVDPPQVGAP 244

Query: 2263 PKSAHKHGL-SLSASKGRTSASPLPSTPEQXXXXXXXXXXXXLTRGYASVEDNMATQATG 2087
            PK ++K GL S   +K R SASPLPS PEQ            +++ + S ED        
Sbjct: 245  PKPSYKSGLVSTVNAKVRLSASPLPSPPEQSGAPASPIGAGNISKVHGSGEDAAHIPVKT 304

Query: 2086 KEKASSSEKGTPSRVTTTALLDKPARKVNLGVKPTDLRSMLISLLTENPRGMSLKALEKS 1907
            KE  +S EK  P+R  +T+L+ +      +  KP DL+SMLI+LL ENP+GMSLKALEK+
Sbjct: 305  KEIFASFEKEIPTR--STSLVRETPMGRGIRAKPMDLQSMLITLLKENPKGMSLKALEKA 362

Query: 1906 VGEYFPNSGRQIEPIIRKIATYQAPGRYILKSEAELESLKKPVSE--SSPENN-QPPVSG 1736
            +G+  PNS ++IEPII+KIAT+QAPGRY+LKS  +LES +K  SE  SSPE N  P  + 
Sbjct: 363  IGDTPPNSAKKIEPIIKKIATFQAPGRYLLKSGVDLESFRKASSECGSSPEENFHPTPAP 422

Query: 1735 NDRGDACDPKNAASPKSHAQTEEPVNLNSEPGEVVTVNEKIDVPPHSPDHYAEEKVPENS 1556
             D  D      +A   S  + EE V LNS+  E      K+D+   SPD + E+KV +  
Sbjct: 423  EDNQD----HTSAGKFSAVELEERVELNSKLEEESNALGKLDIQECSPDLFGEKKVSDEG 478

Query: 1555 EGPVATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1376
                ++                                                      
Sbjct: 479  HAGSSSDSSTDSDSDSDSSDSGSESGSQSRSRSKSKSRSPVGSGSGSSSDSETDASSHSK 538

Query: 1375 XXXDEEVDIMTSDDDKEPKDKLQTPGQLPASSIPWRADGLLVQNKP-DEMED-------- 1223
               DE+VDIM SDDDKEP  K+Q      +  I WR        K  DE +D        
Sbjct: 539  DGSDEDVDIM-SDDDKEPNHKIQA-----SEPIQWRPGNAKPAEKGIDEKQDDGGSDAID 592

Query: 1222 --FHASEVVEIMENS---------PGYAHKSEIDPSNGSVSNKEGEKHAQVIKASAGNSS 1076
               H S+ ++I ++          P  A + E+  +   V N + EK     K  + +  
Sbjct: 593  IEGHESDAIDIEDHGSDVDIERVLPDDAQEIEVPVNTNLVPNNDREKPVGGSKNFSSDHD 652

Query: 1075 VLPESRVYPENLHRGRDKTARDGFRHEQSDRYQRKSEGKSKRRSDGKHSDDYAAHSEKVK 896
             L E + +  NL   +D   +D  R E+SD  +R ++ KSKR  D K  D  +  +++++
Sbjct: 653  ELLERQNFIGNLFDDKDDMFKDNLRQERSDSSERITKSKSKR--DSKQYDKKSERTKRLR 710

Query: 895  AGSVTEAPMSEDTNLLSSGSPGKCSLDRPIDGPYKALHTHITPKAVKD-RTDFSVHRTYN 719
            A S+T+  +S   N   S SP   S +R I+  Y+    H+   A ++   D    + YN
Sbjct: 711  AESLTQPTVSGSRNAQYSESPRNLSPNRLIEDTYRGAAMHMMNIAGREGNADLPFQKGYN 770

Query: 718  QSNPGKAISDFQPSGPRPVDISGRTKGSSANDRPGIYADNLGSNAKSTERSIQMPAGLPL 539
            Q+  GK+ SD   SG R  + + ++K      RP  Y ++ G   K   ++        +
Sbjct: 771  QAFAGKSSSDSHRSGQRSSEHNQQSKAHDMEGRPNKYGESSGHGRKFAGKNSHAHEDFSI 830

Query: 538  HKEKVNRDKQSENGYNDEKRPPKNSREGAVDKNLTAT-DSNIRKRGEFSGKMKEVGSFPN 362
             KE  +RD Q+E  +   K+ P+N +EG      +A  +S+  K  E +GK  + G   +
Sbjct: 831  QKENASRDAQNEGSF--AKKVPRNPKEGGTGGKRSADFESHSNKSSEITGKSMDAGQISS 888

Query: 361  SHVSYPSNGRSRSDMDRSPIINERGPILRREPSELELGELRDPLPEAAPGVAKQFDRKGS 182
            S   Y   G SR+ +DR  + N +  +LRRE SELELGELRDPLPE    V KQ DRK  
Sbjct: 889  SRTGYSPKGNSRTSVDR-VLTNGQSSVLRREVSELELGELRDPLPEETL-VKKQIDRK-- 944

Query: 181  FKQSENKLTSFDYWNPDLSKGKPAGRTGVDPMKPSPPNPD 62
             K SEN+ ++ D    DLSKG+P G+   +P KPSPPNP+
Sbjct: 945  VKHSENRPSTSD---SDLSKGRPIGKG--NPKKPSPPNPN 979

