BLASTX nr result

ID: Akebia25_contig00000030 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00000030
         (2449 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007034174.1| Uncharacterized protein TCM_020195 [Theobrom...   129   5e-27
ref|XP_006375500.1| hypothetical protein POPTR_0014s14320g [Popu...   126   4e-26
ref|XP_002268062.2| PREDICTED: uncharacterized protein LOC100258...   125   9e-26
emb|CBI39381.3| unnamed protein product [Vitis vinifera]              125   9e-26
emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]   125   9e-26
ref|XP_002303009.1| hypothetical protein POPTR_0002s23700g [Popu...   122   1e-24
ref|XP_006585141.1| PREDICTED: dentin sialophosphoprotein-like i...   117   2e-23
ref|XP_006585140.1| PREDICTED: dentin sialophosphoprotein-like i...   117   2e-23
ref|XP_004247899.1| PREDICTED: uncharacterized protein LOC101252...   117   3e-23
ref|XP_007153592.1| hypothetical protein PHAVU_003G048600g [Phas...   116   4e-23
ref|XP_006358822.1| PREDICTED: dentin sialophosphoprotein-like [...   114   2e-22
ref|XP_006580135.1| PREDICTED: dentin sialophosphoprotein-like i...   112   8e-22
ref|XP_006580136.1| PREDICTED: dentin sialophosphoprotein-like i...   111   1e-21
gb|EXB73708.1| hypothetical protein L484_026874 [Morus notabilis]     108   9e-21
ref|XP_004504384.1| PREDICTED: serine-rich adhesin for platelets...   108   1e-20
ref|XP_002520635.1| hypothetical protein RCOM_1554430 [Ricinus c...   108   2e-20
ref|XP_004504383.1| PREDICTED: serine-rich adhesin for platelets...   105   1e-19
ref|XP_004147674.1| PREDICTED: uncharacterized protein LOC101215...   103   5e-19
ref|XP_004298230.1| PREDICTED: uncharacterized protein LOC101308...   100   2e-18
gb|EYU38966.1| hypothetical protein MIMGU_mgv1a004239mg [Mimulus...    96   6e-17

>ref|XP_007034174.1| Uncharacterized protein TCM_020195 [Theobroma cacao]
            gi|508713203|gb|EOY05100.1| Uncharacterized protein
            TCM_020195 [Theobroma cacao]
          Length = 1095

 Score =  129 bits (325), Expect = 5e-27
 Identities = 180/719 (25%), Positives = 293/719 (40%), Gaps = 87/719 (12%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSA+HRRAHKK+CG +EG+K+ +S +      SDD  LSD
Sbjct: 14   HESHGVHLCHKCGWPFPNQHPSARHRRAHKKICGNIEGYKLVDSGDITLSTASDDEALSD 73

Query: 2139 ED--------PKVEENQN-----GGIRRQISNRSEEEEFSDAVSEFAD------------ 2035
            ED        PKV E+ +      GI   +SNRSE+E FSDA  EF D            
Sbjct: 74   EDHQSPIAQVPKVLESDSLKKSISGI-GAMSNRSEDEVFSDAAMEFHDGGKGRQDSLDNA 132

Query: 2034 XXXXXXXXXXXXXXXXXXXXXXSNIIQPSNNPSE------GIQIQIPHTSQNITDQLDID 1873
                                  ++I+QP  NP++       +   IP       +  DI 
Sbjct: 133  SKADKIAEKDLTATISFKDCEDTDILQPPQNPADTSQNLNAVLENIPIMPSGTPEHQDIG 192

Query: 1872 EKFQHHS---SGS------TMGSNAGPVLDKSAELLAVEPTKETNKSTNLATPVNYGGSE 1720
              +   S   +GS      T       V ++S ++ AV+   E +         N  G  
Sbjct: 193  LSYSKDSDDRNGSACDVVLTKPETITGVSEESRKVSAVDRVAECSIERETDAIENEKGKL 252

Query: 1719 NANLENILTGNSVM---------ETMAFIEEKVDTKGKSDLDGSLSPVASPSKTIREASQ 1567
            N N    L G SV+         E+++  E +++  G SD   +   V S  +     + 
Sbjct: 253  NKN----LAGGSVLPSQHCGELSESVSVSERRLE--GTSDTVLTDDIVQSKEEFSNRLAS 306

Query: 1566 TDIEIQTMDRFT---GEAFSSDVTDAITTDTEPLDGSFSAIAASPSKAIDGFTGGGISLD 1396
              +  +  +  T   G     ++ D + ++ E     ++ I +   + I   +G    + 
Sbjct: 307  KIVMSENGEEETDGKGHPRKRNLMDVVASNCE-----YATITSEKREDITSESGLADKIV 361

Query: 1395 VTDVNTVIVENTKPLDVVVDLKEEETHSFGQNVSSNEFFHDSCTNSMEVEAAKHMGASVG 1216
              + NT  +   K +D  + LK+E T        S + F      +   ++A  + ++  
Sbjct: 362  ELEENTDKLALNKVID-NLSLKDEPTKLMD---VSADTFQMKTDPAQATDSATSVNSNEV 417

Query: 1215 FSQPEVDCAHEVDIDCPDDNVGDCVAKEVNTNLPALSERTVKDCKYP--ELVKSELSVTP 1042
            + + E +      +  PDD     +    N  +     +  K  K P  E + SE  +  
Sbjct: 418  YEKEEKENESVYVLSVPDD-----IPIVDNAEIKLEGFKDHKGVKLPLLEALASEEIIID 472

Query: 1041 DYGEAIRETEVKNSVSEEIPLGFLSSKSGDGDNISSLAARVHE--------GNPQILGRE 886
                   E EV++ VS+EI   F S++    +NI   ++++H+        G+ + + +E
Sbjct: 473  ------TEDEVRDHVSQEISDTFRSNQL--DENIKVDSSQMHDVEVSHKLGGDNEAMVKE 524

Query: 885  VIVEQVSVDCVADSEANDTTIGGLESHELEILPDSTSTDVKYIEKNQMVCSIEGKES-HS 709
            V+VE    D +  ++ +D             L      D    EK+  VCS+E ++  + 
Sbjct: 525  VLVEG-KADVLQINKGSDA------------LGSPVDADTSENEKDHKVCSLEEQQPVYV 571

Query: 708  QQDLPQT-----PIVVAVAGGTVLSPSDAVVHEAINAVSSQDDIAHENSRI-------DN 565
              DL QT      I V      +++P+DA   +  N V S D    E++RI        N
Sbjct: 572  SDDLHQTGFSGSMINVLPDVNPMVAPADAEARKLSNVVGSDDMGIPESTRIGAIDVAGSN 631

Query: 564  PDNCIREGTTEENS-----ITNKMVAPEA--AILLSESENLKD-----AEAIPLDINET 424
             D  I +G   EN+      TN    P+   A  L E +N  D     AE   +D+ E+
Sbjct: 632  EDRRIDDGNYVENTETLCESTNNSSLPQTNPASNLLEVDNSDDIGTRKAEKYDIDVVES 690


>ref|XP_006375500.1| hypothetical protein POPTR_0014s14320g [Populus trichocarpa]
            gi|550324184|gb|ERP53297.1| hypothetical protein
            POPTR_0014s14320g [Populus trichocarpa]
          Length = 1109

 Score =  126 bits (317), Expect = 4e-26
 Identities = 179/687 (26%), Positives = 275/687 (40%), Gaps = 37/687 (5%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG +VCHKCGW +PN HPSA+HRRAHKK+CG LEG+K  +SEE      SDD H SD
Sbjct: 16   HESHGVYVCHKCGWPFPNPHPSARHRRAHKKICGTLEGYKFVDSEETPLSALSDDDHGSD 75

Query: 2139 EDPK----------VEENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXX 1990
            EDPK          + E   GG+  + SNRSE++ F+DA++EF                 
Sbjct: 76   EDPKTPSPKGLERGINEKGCGGVGSR-SNRSEDDVFTDAIAEFP---------------- 118

Query: 1989 XXXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDID-EKFQHHSSGSTMGSNAGPVL 1813
                       +  ++P  G      HT      +++++  K    SS     +   P  
Sbjct: 119  -----------ESGSSPVTG-----EHTRDVKEPEINLEINKATAQSSEDGSITVISPPP 162

Query: 1812 DKSAELLAVEPTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDTKG- 1636
              SA+ + ++ T+           +N  GS   +L++    N+ + +M       D +G 
Sbjct: 163  SNSADHIQMQSTE--------VPVINLSGSAQESLDH--GSNATIASMT--RSLTDCRGE 210

Query: 1635 KSDLDGSLSPVASPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEPLDGSFSA 1456
            +SD + S    +S   +I    +T  +    ++ +G       TDA   +   LDG    
Sbjct: 211  ESDFEHSHDNGSSAWDSIPIKLETQTDASQENKKSGTVEDLPETDAKGNEETKLDGQLLD 270

Query: 1455 IAASPSKAIDGFTGGGISLDVTDVNTVIVENTKPLDVVVDLKE-EETHSFGQNVSSNEFF 1279
            +  S     D       S  + DV +  V    P   V+ LKE   T      +S N+  
Sbjct: 271  VVVSTD---DNAEDASESQKMEDVTSQPV----PAAEVLQLKEGGYTDDLASGMSLNDL- 322

Query: 1278 HDSCTNSMEVEAAKHMGASVGFSQ-----PEVDCAHEVDIDCPDDNVGD------CVAKE 1132
                  S EV  A+   +S+  +Q      E+D A  V+     DN G+       +   
Sbjct: 323  ------SPEVNLAEPAHSSISTAQIEGDTQEIDSAVYVNSAVSYDNKGEGNGNMHVLIVP 376

Query: 1131 VNTNLPALSERTVKDCKYPELVKSELSVTPDYGEAIRETEVKNSVSEEIPLGFLSSKSGD 952
             +  L A +E  VK  K  E  K    +  D  E      VK+S  +  P GF S    +
Sbjct: 377  NDLTLVADAENMVKGFKDLEGGKLPQLMNMDSFEV--SNNVKDSDLKNNPQGFNSRPLTE 434

Query: 951  GDNISSLAARVHEGN--PQILGREVIVE---QVSVDCVADSE--ANDTTIGGLESHELEI 793
               +S+    V   N  P+    + IVE   +   D    SE    D   G LE     I
Sbjct: 435  DTEVSASNMHVLNDNLEPKDGTSQHIVELPDEAEADMPQRSEVGVTDVVTGDLEK---SI 491

Query: 792  LPDSTSTDV--KYIEKNQMVCSIEGKESHSQQDLPQTPIVVAVAGGTVLSPSDAVVHEAI 619
               S   DV   + E + M  SIE    H+ +    T         TV+ P DA V +  
Sbjct: 492  SVHSPEEDVPRDHCETSSMTRSIE----HATKATSDT--------NTVVVPMDAEVRQT- 538

Query: 618  NAVSSQDDIAHENSRIDNPDNCIREG---TTEENSITNKMVAPEAAILLS-ESENLKDAE 451
            N +   D +   +    N      E     ++  SI+++      ++L   ++  L++ +
Sbjct: 539  NLIGMDDTVGENDKNKRNTKESFAENRIPPSKHASISSEQADQRNSVLGDVKAAGLEEGK 598

Query: 450  AIPLDINETVTEPLSDPQVSEGEDFRE 370
                + +E VTE  S   + E    RE
Sbjct: 599  IERCNASEIVTEGDSVSGLGEENLLRE 625


>ref|XP_002268062.2| PREDICTED: uncharacterized protein LOC100258866 [Vitis vinifera]
          Length = 1258

 Score =  125 bits (314), Expect = 9e-26
 Identities = 98/307 (31%), Positives = 141/307 (45%), Gaps = 18/307 (5%)
 Frame = -2

Query: 2316 ESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESE-ERNPFDGSDDGHLSD 2140
            ESHG H+CHKCGW +PN HPSAKHRRAHK+VCGK+EG+K+  SE   +     DD H SD
Sbjct: 16   ESHGVHLCHKCGWPFPNPHPSAKHRRAHKRVCGKVEGYKLVHSEGSTHSAVSDDDEHPSD 75

Query: 2139 EDPK------VEENQNG---GIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXX 1987
            +D K      VE ++NG   G   + SNR E+E FSDAV+EF+D                
Sbjct: 76   DDNKTPSPKNVETSKNGIGTGGIGERSNRMEDEVFSDAVTEFSD-------SGISPGIEQ 128

Query: 1986 XXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLDK 1807
                   +I        +G   + P    +IT    I E     S+    G       D 
Sbjct: 129  VLEDARESITNVEKVAKDGFDAKQPLEDNSITVAGSISEDLTRESTLWLSGDGN----DS 184

Query: 1806 SAELLAVEPTKETNK-----STNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDT 1642
            +  L A++P   T        TN    +         +E  L+GN     MA IE+K D 
Sbjct: 185  ACNLSAIKPETPTEAPQEDCKTNAVEGI---------MECPLSGNIGESPMALIEQKTDA 235

Query: 1641 --KGKSDLDGSLSPVA-SPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEPLD 1471
                + ++D  L  +A SP++   E S+  ++ +  D  T +    DV   + ++ +  D
Sbjct: 236  MENEEKNVDRKLLEIAVSPNENAGETSEAGLKSEKTDEKTLDPVEGDV--IVQSEEDQTD 293

Query: 1470 GSFSAIA 1450
            G  + I+
Sbjct: 294  GRGAKIS 300


>emb|CBI39381.3| unnamed protein product [Vitis vinifera]
          Length = 1127

 Score =  125 bits (314), Expect = 9e-26
 Identities = 98/307 (31%), Positives = 141/307 (45%), Gaps = 18/307 (5%)
 Frame = -2

Query: 2316 ESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESE-ERNPFDGSDDGHLSD 2140
            ESHG H+CHKCGW +PN HPSAKHRRAHK+VCGK+EG+K+  SE   +     DD H SD
Sbjct: 16   ESHGVHLCHKCGWPFPNPHPSAKHRRAHKRVCGKVEGYKLVHSEGSTHSAVSDDDEHPSD 75

Query: 2139 EDPK------VEENQNG---GIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXX 1987
            +D K      VE ++NG   G   + SNR E+E FSDAV+EF+D                
Sbjct: 76   DDNKTPSPKNVETSKNGIGTGGIGERSNRMEDEVFSDAVTEFSD-------SGISPGIEQ 128

Query: 1986 XXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLDK 1807
                   +I        +G   + P    +IT    I E     S+    G       D 
Sbjct: 129  VLEDARESITNVEKVAKDGFDAKQPLEDNSITVAGSISEDLTRESTLWLSGDGN----DS 184

Query: 1806 SAELLAVEPTKETNK-----STNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDT 1642
            +  L A++P   T        TN    +         +E  L+GN     MA IE+K D 
Sbjct: 185  ACNLSAIKPETPTEAPQEDCKTNAVEGI---------MECPLSGNIGESPMALIEQKTDA 235

Query: 1641 --KGKSDLDGSLSPVA-SPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEPLD 1471
                + ++D  L  +A SP++   E S+  ++ +  D  T +    DV   + ++ +  D
Sbjct: 236  MENEEKNVDRKLLEIAVSPNENAGETSEAGLKSEKTDEKTLDPVEGDV--IVQSEEDQTD 293

Query: 1470 GSFSAIA 1450
            G  + I+
Sbjct: 294  GRGAKIS 300


>emb|CAN80932.1| hypothetical protein VITISV_017362 [Vitis vinifera]
          Length = 1697

 Score =  125 bits (314), Expect = 9e-26
 Identities = 98/307 (31%), Positives = 141/307 (45%), Gaps = 18/307 (5%)
 Frame = -2

Query: 2316 ESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESE-ERNPFDGSDDGHLSD 2140
            ESHG H+CHKCGW +PN HPSAKHRRAHK+VCGK+EG+K+  SE   +     DD H SD
Sbjct: 16   ESHGVHLCHKCGWPFPNPHPSAKHRRAHKRVCGKVEGYKLVHSEGSTHSAVSDDDEHPSD 75

Query: 2139 EDPK------VEENQNG---GIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXX 1987
            +D K      VE ++NG   G   + SNR E+E FSDAV+EF+D                
Sbjct: 76   DDNKTPSPKNVETSKNGIGTGGIGERSNRMEDEVFSDAVTEFSD-------SGISPGIEQ 128

Query: 1986 XXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLDK 1807
                   +I        +G   + P    +IT    I E     S+    G       D 
Sbjct: 129  VLEDARESITNVEKVAKDGFDAKQPLEDNSITVAGSISEDLTRESTLWLSGDGN----DS 184

Query: 1806 SAELLAVEPTKETNK-----STNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDT 1642
            +  L A++P   T        TN    +         +E  L+GN     MA IE+K D 
Sbjct: 185  ACNLSAIKPETPTEAPQEDCKTNAVEGI---------MECPLSGNIGESPMALIEQKTDA 235

Query: 1641 --KGKSDLDGSLSPVA-SPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEPLD 1471
                + ++D  L  +A SP++   E S+  ++ +  D  T +    DV   + ++ +  D
Sbjct: 236  MENEEKNVDRKLLEIAVSPNENAGETSEAGLKSEKTDEKTLDPVEGDV--IVQSEEDQTD 293

Query: 1470 GSFSAIA 1450
            G  + I+
Sbjct: 294  GRGAKIS 300


>ref|XP_002303009.1| hypothetical protein POPTR_0002s23700g [Populus trichocarpa]
            gi|222844735|gb|EEE82282.1| hypothetical protein
            POPTR_0002s23700g [Populus trichocarpa]
          Length = 1025

 Score =  122 bits (305), Expect = 1e-24
 Identities = 167/673 (24%), Positives = 272/673 (40%), Gaps = 65/673 (9%)
 Frame = -2

Query: 2313 SHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSDED 2134
            SHG HVCH+CGW +P  HPSA+ +RAH K+CG LEG+K+ +SEE +    SDD ++SDE+
Sbjct: 18   SHGVHVCHRCGWPFPKPHPSARCKRAHNKICGTLEGYKVVDSEETSLSALSDDDNVSDEE 77

Query: 2133 PKV----------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXXX 1984
            P+            E  +GG+   ISNRSE+E F DAV+EF +                 
Sbjct: 78   PETPSPKGLERSSNEKGSGGV-GNISNRSEDEVFKDAVAEFPESGYSSVTGEHTRDVKEQ 136

Query: 1983 XXXXXSNII--------------QPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSG 1846
                  N                 P +N ++ IQ+Q   T   + +     ++   H S 
Sbjct: 137  EIGLEFNKATAQTSKDGSINVTGPPPSNSADPIQMQ--STEAPVCNLSGKAQESLDHDSN 194

Query: 1845 STMGSNAGPVLDKSAELLAVEPTKETNKSTNLATPVNYGGSENANLEN--ILTGNSVMET 1672
            ST+G    P++D   E    E + +   S   + P+      +A+ EN  I  G  + E+
Sbjct: 195  STIGFMTRPLIDCRDEESGFEYSHDNEGSACDSIPIKLETQTDASQENKKIGAGKDLSES 254

Query: 1671 MA--FIEEKVDTKGKSDLDGSLSPVAS-PSKTIREASQTDIEIQTMDRFTGEAFSSDVTD 1501
             A  +   K D    S++   L  +    S+ +  A    ++    D        +D++ 
Sbjct: 255  DAKGYEGTKFDAGEASEMVSKLQKMEDLTSEPVPTAESLKLKEGHADVLASGMSLNDLSS 314

Query: 1500 AITTDTEPLDGSFSAIAASPSKAIDGFTGGGI-SLDVTD-----------------VNTV 1375
             + +D EP+D SF A             GGG+  +D+TD                 V+ +
Sbjct: 315  EVKSD-EPVDSSFDAAQTK---------GGGVQEMDLTDYVNSTDSYDNKGEGDENVHVL 364

Query: 1374 IVENTKPL-----DVVVDLKEEETHSFGQ--NVSSNEFFHDSCTNSMEVEAAKHMGASVG 1216
            IV +  P+     ++V   K+ E     Q  NV S+E F     N+++    K   +  G
Sbjct: 365  IVPHDFPVVADAENMVKGFKDHEGGKLPQLINVDSSEVF-----NNVKDSGTKDNPS--G 417

Query: 1215 FSQPEVDCAHEV---DIDCPDDNV--GDCVAKEVNTNLPALSERTVKDCKYPELVKSELS 1051
            F+   +    +V   D+   DDNV      ++ +   LP  +E  V        +KSE+ 
Sbjct: 418  FNSRPLIKDTKVSTSDLHVLDDNVEPRGVASQLIVEELPDEAEDDVP-------LKSEVG 470

Query: 1050 VTPDYGEAIRETEVKNSVSEEIPLGFLSSKSGDGDNISSLAARVHEGNPQILGREVIVEQ 871
            VT D      E  +     EE+P         D    SSL + +      I     +V  
Sbjct: 471  VT-DVVVGDLEKSISVQSPEEVP--------RDHCETSSLTSYLEHTTNAISVTNTLVVP 521

Query: 870  VSVDCVADSEANDTTIGGLESHELEILPDSTSTDVKYIEKNQMVCSIEGKESHSQQDLPQ 691
            +      D+E   T +    +H+ + + +S+   V  I K   V      E+ ++  +P 
Sbjct: 522  I------DAEVRQTNLDDTGNHDKDKI-ESSEIAVNDINKRNAV------ENCAENRIPT 568

Query: 690  T-----PIVVAVAGGTVLSPSDAVVHEAINAVSSQDDIAHEN-SRIDNPDNCIREGTTEE 529
            +     P        ++L   +A  HE       +  I   N S+I+   + +  G  EE
Sbjct: 569  SGHASIPAEQVDRRNSILGDVNADAHE-------EGKIERCNVSKIETEGDSV-PGLGEE 620

Query: 528  NSITNKMVAPEAA 490
            N +      PE+A
Sbjct: 621  NLLREPKATPESA 633


>ref|XP_006585141.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 1053

 Score =  117 bits (293), Expect = 2e-23
 Identities = 165/683 (24%), Positives = 277/683 (40%), Gaps = 81/683 (11%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSAKHRRAHKK+CG +EG+K+  SE +   +GSDD H+SD
Sbjct: 14   HESHGVHLCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKLSASEGQPHLNGSDDEHVSD 73

Query: 2139 EDPKV----------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXX 1990
            +D K           +E  N G   +I  RSE+E FSDAV++F+D               
Sbjct: 74   DDHKTPGPKSLETGNKEKGNEGNGEKII-RSEDEVFSDAVADFSD--------------- 117

Query: 1989 XXXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLD 1810
                          + P    ++Q    S    +++DI E     SS     ++A  ++D
Sbjct: 118  ------------SGSIPEIKERLQDSLDSGADVERVDIKETKFSGSSEDKDFNDASQLID 165

Query: 1809 KSAELLAVE-PTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDTKGK 1633
            KS +   ++ P    N+S  L   V   G  +       T + +  ++A +  +V T   
Sbjct: 166  KSTDDSQIQNPNIFQNESVELGNMVELQGQLSGP-----TVDPLSSSIADLRTEVSTNVD 220

Query: 1632 SD-----LDGSLSPVAS------PSKTIREASQ-TDIEIQTMDRFTGEAFSSDVTDA--- 1498
            SD     L  SL   A       P K I      TD  + ++ + T      ++  A   
Sbjct: 221  SDVFFGLLSDSLPGKAEAMLDILPEKKIHAVENVTDCILISVAKETNLKEKDEINSAGDV 280

Query: 1497 --ITTDTEPLDG----SFSAIAASPSKAIDGFTGGGI----SLDVTDVNTV--IVENTKP 1354
              I   ++ + G      S IA S + ++D   G G       +  ++N+   +VE  + 
Sbjct: 281  IEIVESSDNVVGETCEGVSKIAVSDAISLDHQVGDGAVHLKENNGAEINSYRDVVEIVES 340

Query: 1353 LDVVVDLKEEETHSFG--QNVSSNEFFHDSCTNSMEVEAAK------------HMGASVG 1216
             D VV    EE         VS +    D   +  E   A+             + + V 
Sbjct: 341  SDKVVGEMSEEVSKIAVCDIVSLDHEVGDGAVHLKENNGAEFLSLLPPDNLPLELNSVVI 400

Query: 1215 FSQPEVDCAHEVDIDCPDDNVGDCVAKEVNTNLPALSERTVKDCKYPELVKSELSVTPDY 1036
             +  + D A+ V      D+       E N N+  L   T  D       +SE     D 
Sbjct: 401  TNDAQGDSAYVVQFATSSDDKILPEKGEGNVNVDLLP--TCDDISDEAHPQSEYGDFKDL 458

Query: 1035 GEAIRET--------------EVKNSVSEEIPLGFLSSKSGDGDNISSLAARVHEG---- 910
               + +               ++KN+V+EE    F +++  +  +I S    V +     
Sbjct: 459  EGVVYQNPFLQSSESLKYKGDDLKNNVTEENKFHFNANQLSEKSDILSPDMDVLDNSMKM 518

Query: 909  ---NPQILGREVIVEQVSVDCVADSEANDTTIGGLESHELEILPDSTSTDVKYIEKNQMV 739
               N +   +EV  EQ    C   S A  T    +ESH+     D++   +K  EKN+ +
Sbjct: 519  ELVNSEPTPKEVHAEQ----CTEVSPAQLT----VESHQRSDETDASMKAMK-TEKNE-I 568

Query: 738  CSIEGKESHSQQDLPQTPIVVAVAGGTVL-SPSDAVVHEAINAVSSQ-------DDIAHE 583
              +   E H   D+ +    +++   +++ S +++   E+  + +S+       D  +H 
Sbjct: 569  HMVHFSEEHGPDDVCKNSQQISLPEDSLMASSNESQRDESFRSATSETTRAINIDSTSHH 628

Query: 582  NSRIDNPDNCIREGTTEENSITN 514
              +I   ++   +G   E+++ N
Sbjct: 629  EEKITEINDVALDGKDVESNLEN 651


>ref|XP_006585140.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1086

 Score =  117 bits (293), Expect = 2e-23
 Identities = 165/683 (24%), Positives = 277/683 (40%), Gaps = 81/683 (11%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSAKHRRAHKK+CG +EG+K+  SE +   +GSDD H+SD
Sbjct: 14   HESHGVHLCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKLSASEGQPHLNGSDDEHVSD 73

Query: 2139 EDPKV----------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXX 1990
            +D K           +E  N G   +I  RSE+E FSDAV++F+D               
Sbjct: 74   DDHKTPGPKSLETGNKEKGNEGNGEKII-RSEDEVFSDAVADFSD--------------- 117

Query: 1989 XXXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLD 1810
                          + P    ++Q    S    +++DI E     SS     ++A  ++D
Sbjct: 118  ------------SGSIPEIKERLQDSLDSGADVERVDIKETKFSGSSEDKDFNDASQLID 165

Query: 1809 KSAELLAVE-PTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDTKGK 1633
            KS +   ++ P    N+S  L   V   G  +       T + +  ++A +  +V T   
Sbjct: 166  KSTDDSQIQNPNIFQNESVELGNMVELQGQLSGP-----TVDPLSSSIADLRTEVSTNVD 220

Query: 1632 SD-----LDGSLSPVAS------PSKTIREASQ-TDIEIQTMDRFTGEAFSSDVTDA--- 1498
            SD     L  SL   A       P K I      TD  + ++ + T      ++  A   
Sbjct: 221  SDVFFGLLSDSLPGKAEAMLDILPEKKIHAVENVTDCILISVAKETNLKEKDEINSAGDV 280

Query: 1497 --ITTDTEPLDG----SFSAIAASPSKAIDGFTGGGI----SLDVTDVNTV--IVENTKP 1354
              I   ++ + G      S IA S + ++D   G G       +  ++N+   +VE  + 
Sbjct: 281  IEIVESSDNVVGETCEGVSKIAVSDAISLDHQVGDGAVHLKENNGAEINSYRDVVEIVES 340

Query: 1353 LDVVVDLKEEETHSFG--QNVSSNEFFHDSCTNSMEVEAAK------------HMGASVG 1216
             D VV    EE         VS +    D   +  E   A+             + + V 
Sbjct: 341  SDKVVGEMSEEVSKIAVCDIVSLDHEVGDGAVHLKENNGAEFLSLLPPDNLPLELNSVVI 400

Query: 1215 FSQPEVDCAHEVDIDCPDDNVGDCVAKEVNTNLPALSERTVKDCKYPELVKSELSVTPDY 1036
             +  + D A+ V      D+       E N N+  L   T  D       +SE     D 
Sbjct: 401  TNDAQGDSAYVVQFATSSDDKILPEKGEGNVNVDLLP--TCDDISDEAHPQSEYGDFKDL 458

Query: 1035 GEAIRET--------------EVKNSVSEEIPLGFLSSKSGDGDNISSLAARVHEG---- 910
               + +               ++KN+V+EE    F +++  +  +I S    V +     
Sbjct: 459  EGVVYQNPFLQSSESLKYKGDDLKNNVTEENKFHFNANQLSEKSDILSPDMDVLDNSMKM 518

Query: 909  ---NPQILGREVIVEQVSVDCVADSEANDTTIGGLESHELEILPDSTSTDVKYIEKNQMV 739
               N +   +EV  EQ    C   S A  T    +ESH+     D++   +K  EKN+ +
Sbjct: 519  ELVNSEPTPKEVHAEQ----CTEVSPAQLT----VESHQRSDETDASMKAMK-TEKNE-I 568

Query: 738  CSIEGKESHSQQDLPQTPIVVAVAGGTVL-SPSDAVVHEAINAVSSQ-------DDIAHE 583
              +   E H   D+ +    +++   +++ S +++   E+  + +S+       D  +H 
Sbjct: 569  HMVHFSEEHGPDDVCKNSQQISLPEDSLMASSNESQRDESFRSATSETTRAINIDSTSHH 628

Query: 582  NSRIDNPDNCIREGTTEENSITN 514
              +I   ++   +G   E+++ N
Sbjct: 629  EEKITEINDVALDGKDVESNLEN 651


>ref|XP_004247899.1| PREDICTED: uncharacterized protein LOC101252226 [Solanum
            lycopersicum]
          Length = 998

 Score =  117 bits (292), Expect = 3e-23
 Identities = 150/624 (24%), Positives = 251/624 (40%), Gaps = 57/624 (9%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDG--SDDGHL 2146
            +E+HG+H+CHKC W +PN HPSA+HRRAHKKVCGK+EG+K  ESE  N      SDD H 
Sbjct: 14   HENHGTHLCHKCSWPFPNPHPSARHRRAHKKVCGKIEGYKFSESEAGNSTHSAVSDDEHH 73

Query: 2145 SDEDPKVEE------NQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXXX 1984
            SD D +         +   G     S RSE+E FSDA  EF+D                 
Sbjct: 74   SDGDQQTPSPIGKKISVKNGSSGDKSYRSEDETFSDAFMEFSD----------------- 116

Query: 1983 XXXXXSNIIQPSNNPSEGIQIQIPHT-SQNITDQLDIDEKFQHHSSGS---TMGSN--AG 1822
                        +  S G++ ++    S N+  + D DE  +  + G    ++  N    
Sbjct: 117  ------------SGISPGMEERLESVKSLNMNVKKDDDELLKGDAIGGISVSLNDNHLTA 164

Query: 1821 PVLD-KSAELLAVEPTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIE--EK 1651
             V D +S E    +P  + +  + L   V+     +A +++ + G++ M+ M   E  E 
Sbjct: 165  EVNDPESPESATNQPVADKSLGSKLDRSVDLQVDASA-VKSEIPGDASMQEMNAAESIEA 223

Query: 1650 VDTKGKSDLDGSLSPVASPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEPLD 1471
               +  SD    L  +   +     A   +  ++       E  S++  ++  +  +  +
Sbjct: 224  KQMQMSSDQPNDLKAIEDINANEVLADAVEASVEVSQSVVSEKTSNN--ESYESKPQEAE 281

Query: 1470 GSFSAIAASPSKAIDGFTGGGISLDVTDVNTVIVENTKPLDVVVDLKEEETHSF-GQNVS 1294
            G FS + +   +A D  T    +      N  + ++T   ++ +   E E  S  G NV 
Sbjct: 282  GKFSVVESKLLEAEDQATENVPNKAELQHNERVPDST---ELKLAFPEAEVKSLDGVNVD 338

Query: 1293 SNEFFHDSCTN-----SMEVEAAKHMGASVGFSQPEVDCAHEVDID---CPDDNVGDCVA 1138
             +   HD         S E+            S  E+DC  ++++       + + D   
Sbjct: 339  KDHERHDKAEQDEQRISTELSPNAPTLELEAVSPNEIDCGCQMELSDSFKAGEGMEDVHV 398

Query: 1137 KEVNTNLPAL-SERTVKDCKYPELVKSELSVTPDYGEAIRETEVKNSV--SEEIPLGFLS 967
              +  +LPAL +   +KD K     KS   +  D G +     VK+ V  + E+   F+ 
Sbjct: 399  MSLAKDLPALDNPELLKDFKDSNKYKSSFPL--DLGSSEEIFSVKDDVFAASEVTQSFVG 456

Query: 966  SKSGDGDNISSLAARVHEGNPQILGREVIVEQVSVDCVADSEANDTTI----------GG 817
            +   DG +ISS+A  +     Q+   +V V   ++   ++  +N               G
Sbjct: 457  TGRSDG-SISSVA--LDASGDQVSEEKVAVSAEAITDSSELSSNPNAFECGVSSILNSNG 513

Query: 816  LESHE------LEILPDSTSTDVKYIEKNQ---MVCSIEGKESHSQQDL----PQTPIVV 676
            L+  E      L     ST  D   +E+ +   +    E K  H + +L      TP+ +
Sbjct: 514  LQEPEDTSKNSLSDAKQSTEVDDPVVERTKETSLTMEEENKGGHPENELLANNETTPVAI 573

Query: 675  -----AVAGGTVLSPSDAVVHEAI 619
                 A+     L  SD   HE +
Sbjct: 574  SCLSEAIQTTVTLGGSDHGEHEKV 597


>ref|XP_007153592.1| hypothetical protein PHAVU_003G048600g [Phaseolus vulgaris]
            gi|561026946|gb|ESW25586.1| hypothetical protein
            PHAVU_003G048600g [Phaseolus vulgaris]
          Length = 1125

 Score =  116 bits (291), Expect = 4e-23
 Identities = 165/689 (23%), Positives = 273/689 (39%), Gaps = 105/689 (15%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSAKHRRAHKK+CG +EG+K+  SE R   +GSDD H+SD
Sbjct: 14   SESHGVHLCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKLSFSEGRPHLNGSDDEHVSD 73

Query: 2139 EDPKV---------------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXX 2005
            +D K                 E  N G   +   RSE+E FSDAV++F+D          
Sbjct: 74   DDHKTPGLVLPVSNSLDTGNNEKSNAGNGEKFI-RSEDEVFSDAVADFSD---------- 122

Query: 2004 XXXXXXXXXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDE-KFQHHSSGST--MG 1834
                               +NP    +++    S    +  DI E KF   SS       
Sbjct: 123  -----------------SGSNPDNKERLRDSLDSGADMEMGDIKEPKFSGPSSEDKDFNA 165

Query: 1833 SNAGPVLDKSAELLAVE-PTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIE 1657
            ++ GP++DKS +    + P    N+S  +   V   G  +    + LT +S  +      
Sbjct: 166  ADLGPLIDKSTDDCQTQNPNILQNESAGVGNTVGLQGQLSGPTVDPLT-SSTADLRTAES 224

Query: 1656 EKVDTKGKSDLDGSLSPVAS-------PSKTIREASQ-TDIEIQTMDRFT---------- 1531
              VD++    L     P+ +       P K I      TD  + ++ + T          
Sbjct: 225  TTVDSEVFLGLSSDSPPIKAEAMPDILPVKNIYAVDNVTDCSLMSVTKGTNLKEKDEINS 284

Query: 1530 -GEAF----SSD--------------VTDAITTDTEPLDGSFSAIAASPSKAIDGFTG-- 1414
             G+      SSD              V+D +  D +  DG+        +  ++   G  
Sbjct: 285  AGDVVEIEESSDYTVGETCEGVSNIVVSDVVCVDHQVGDGAVHLEEKDGTVHLEEKDGAI 344

Query: 1413 ------GGISLD-----VTDVNTVIVENTKPLDVVVDLKEEETHS--FGQNVSSNEFFHD 1273
                  G I L+     V++ N   VE  +P D VV    EE         VS +    D
Sbjct: 345  HLEEKDGAIHLEEKNGAVSNSNRDAVEIVEPSDNVVGKMSEEVSKTVVSDEVSLDNQVVD 404

Query: 1272 SCTNSMEVEAAKHMGASVGFSQP------------EVDCAHEVDIDCPDDNVGDCVAKEV 1129
               N  E   A+ +  S   S P                A+ V     +D+      +E 
Sbjct: 405  EAVNLKEKNEAEFLSLSSPDSLPLELNSTVIKNDAHGQSAYVVQSGTFNDDKILQSKEEG 464

Query: 1128 NTN---LPALSERTVKDCKYP----ELVKSELSVTPD----YGEAIR--ETEVKNSVSEE 988
            N N   LP  +++  ++ ++P    E  K  ++V       + E+++    ++K  V++E
Sbjct: 465  NANVDLLPTCNDKP-ENGEHPQTEYEDFKDHIAVVYQNPFLHSESLKYEGDDIKERVTQE 523

Query: 987  IPLGFLSSKSGDGDNISSLAARVHEGNPQI--LGREVIVEQVSVDCVADSEANDTTIGGL 814
                F +S+  +   + S    V   + ++  L  E I E++  +   D      T+   
Sbjct: 524  NKFHFNTSQFSEKSEVISPDIDVIGSSVKMEKLNSEPISEEMHAEECTDVSPVKLTV--- 580

Query: 813  ESHELEILPDSTSTDVKYIEKNQMVCSIEGKESHSQQDLPQTPIVVAVAGGTVLSPSD-- 640
            ES++    PD  S +    EKN+    I   E H   D+ +  + ++   G+++  S+  
Sbjct: 581  ESYQ---TPDVPSVNAMKTEKNES-HMIHFSEEHGPDDVYKNSVQISFPEGSLMGTSNES 636

Query: 639  -----AVVHEAINAVSSQDDIAHENSRID 568
                 + + E  + ++  D + H +  +D
Sbjct: 637  QREEGSAISETASVINVTDSLNHHHVPLD 665


>ref|XP_006358822.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum]
          Length = 838

 Score =  114 bits (285), Expect = 2e-22
 Identities = 132/525 (25%), Positives = 210/525 (40%), Gaps = 20/525 (3%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDG--SDDGHL 2146
            +E+HGSH+CHKC W +PN HPSA+HRRAHKKVCGK+EG+K+ ESE  N      SDD H 
Sbjct: 16   HENHGSHLCHKCSWPFPNPHPSARHRRAHKKVCGKIEGYKLSESEAGNSTHSAVSDDEHH 75

Query: 2145 SDEDPKV------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXXX 1984
            SD D +       + +   G     S RSE+E FSDAV EF+D                 
Sbjct: 76   SDGDQQTPSPIGKKTSVKDGSSGDKSYRSEDETFSDAVMEFSDSGISPGMEERPEGVKSL 135

Query: 1983 XXXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSNAGPVLDKS 1804
                     +     + G  I +    +++T +++  E  +  ++      + G  LD+S
Sbjct: 136  NTNVKKVDDELLKADAIG-GISVSVNDKHLTAEVNDPESPESATNQPVADKSLGSKLDRS 194

Query: 1803 AELLAVEPTKETNKSTNLATPVNYGGSENANLENILTGNSVMETMAFIEEKVDTKGKSDL 1624
             +L               A+ V    S +A+L+ +    S+        E    +  SD 
Sbjct: 195  VDLQVD------------ASAVKSEISGDASLQEMNAPESI--------EAKQMQMSSDQ 234

Query: 1623 DGSLSPVASPSKTIREASQTDIEIQTMDRFTGEAFSSDVTDAITTDTEP--LDGSFSAIA 1450
               L  +   +     A   +  +Q       ++  SD  +    +++P   +G FS + 
Sbjct: 235  PNDLKAIEDINANEGLADAVEASVQ-----VSQSVVSDTDEKTCYESKPQEAEGKFSVVE 289

Query: 1449 ASPSKAIDGFTGGGISLDVTDVNTVIVENTKPLDVVVDLKEEETHSF-GQNVSSNEFFHD 1273
            +   +A D  T      +  ++     EN    ++   L E E  S  G NV      HD
Sbjct: 290  SKLLEAEDQATEN--VPNKAELQHSERENPDSTELKFALSEAEVKSLDGVNVDKEHEQHD 347

Query: 1272 SCTN-----SMEVEAAKHMGASVGFSQPEVDCAHEV---DIDCPDDNVGDCVAKEVNTNL 1117
                     S+E+        S      E+D   ++   D    ++ + D     +  +L
Sbjct: 348  KAEQDKQRISIELSPNAPTLESKAVLSNEIDGGRQMELSDSSKAEEGMEDVHVVSLAKDL 407

Query: 1116 PAL-SERTVKDCKYPELVKSELSVTPDYGEAIRETEVKNSVSEEIPLGFLSSKSGDGDNI 940
            PA  +   +KD K     KS   +     E I   +     + E+   F+ +   DG +I
Sbjct: 408  PASDNPELLKDFKDYNKYKSSFPLDLGSSEEICSVKDDTVAASEVTQSFVGTGRSDG-SI 466

Query: 939  SSLAARVHEGNPQILGREVIVEQVSVDCVADSEANDTTIGGLESH 805
            SS+A             +V  +QVS + VA S    T   GL S+
Sbjct: 467  SSVAL------------DVSGDQVSEEKVAVSAEGITDSSGLSSN 499


>ref|XP_006580135.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1102

 Score =  112 bits (280), Expect = 8e-22
 Identities = 97/320 (30%), Positives = 141/320 (44%), Gaps = 35/320 (10%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSAKHRRAHKK+CG +EG+K   SE +   +GSDD H+SD
Sbjct: 14   HESHGVHLCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKRSASEGQPHLNGSDDEHVSD 73

Query: 2139 EDPKV----------------EENQNGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXX 2008
            +D K                 E+   G   + I  RSE+E FSDAV++F D         
Sbjct: 74   DDHKTPGLVVSGPKSLETGNNEKGNEGNGEKLI--RSEDEVFSDAVADFLDSGSNPEIKE 131

Query: 2007 XXXXXXXXXXXXXSNIIQPS--NNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMG 1834
                            I+ +  +  SEG        SQ I    D D + Q+ +      
Sbjct: 132  RLQDNLDSGANVERVDIKETKFSGSSEGKDFNAADASQFIDKSTD-DSQIQNLNIFQNES 190

Query: 1833 SNAGPVLDKSAELL--AVEPTK------ETNKSTNLATPVNYGGSENANLENILTGNSVM 1678
               G  ++   +L    V+P         T +ST + + V +G S ++     L G +  
Sbjct: 191  VEVGTAVELQGQLSCPTVDPLSSSIADLRTEESTIVDSDVFFGLSSDS-----LLGETEA 245

Query: 1677 ETMAFIEEKVDTKGKSDLDGSLSPVASPSK-----TIREASQTDIEIQTMDRFTGEAFSS 1513
                  E+K+    ++  D SL  VA  S       I  A      +++ D   GEA   
Sbjct: 246  MPDILPEKKIHAV-ENVTDCSLISVAKESNFKEKDEINSAVHVVEIVESSDNGVGEACEE 304

Query: 1512 ----DVTDAITTDTEPLDGS 1465
                 V+DA++ D +  DG+
Sbjct: 305  VSKIAVSDAVSLDYQVGDGA 324


>ref|XP_006580136.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 1100

 Score =  111 bits (278), Expect = 1e-21
 Identities = 55/111 (49%), Positives = 70/111 (63%), Gaps = 16/111 (14%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +ESHG H+CHKCGW +PN HPSAKHRRAHKK+CG +EG+K   SE +   +GSDD H+SD
Sbjct: 14   HESHGVHLCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKRSASEGQPHLNGSDDEHVSD 73

Query: 2139 EDPKV----------------EENQNGGIRRQISNRSEEEEFSDAVSEFAD 2035
            +D K                 E+   G   + I  RSE+E FSDAV++F D
Sbjct: 74   DDHKTPGLVVSGPKSLETGNNEKGNEGNGEKLI--RSEDEVFSDAVADFLD 122


>gb|EXB73708.1| hypothetical protein L484_026874 [Morus notabilis]
          Length = 1995

 Score =  108 bits (271), Expect = 9e-21
 Identities = 74/243 (30%), Positives = 112/243 (46%), Gaps = 15/243 (6%)
 Frame = -2

Query: 2310 HGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDG-HLSDED 2134
            HG H+CHKCGW YPNSHPSAKHRRAHK++CGK+EG+K+ + E     + SDD  HLSDED
Sbjct: 18   HGVHICHKCGWPYPNSHPSAKHRRAHKRICGKVEGYKLGDFEGSAHSNVSDDDEHLSDED 77

Query: 2133 PKVEENQ---------NGGIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXXXXX 1981
             K    Q           G  ++    S++E FSDA +EF D                  
Sbjct: 78   HKNPSRQVLGASSHEKGSGGTKEFPCGSKDEAFSDAAAEFLDGGDGGR------------ 125

Query: 1980 XXXXSNIIQPSNNPSEGIQIQIPHTSQNITDQLDIDEKFQHHSSGSTMGSN--AGPVLDK 1807
                      +   +E       +  +N+  + D+ +  +     ++  SN   G   D+
Sbjct: 126  ----------TQGRAEDAGESATNVEKNLKTESDMAQSVERGPVAASFNSNRMVGAFSDR 175

Query: 1806 SAELLAVEPTKETNKSTNLATPVNYGGSENANLENILTG---NSVMETMAFIEEKVDTKG 1636
              E LAV      N S +   P+     E+A+ EN  T    + V  ++   E++ + +G
Sbjct: 176  QTEGLAVLLDGNRNASVDDLHPIKSETLEDASPENQKTNTVDDVVDRSLKLAEQRSNLEG 235

Query: 1635 KSD 1627
            + +
Sbjct: 236  QKE 238


>ref|XP_004504384.1| PREDICTED: serine-rich adhesin for platelets-like isoform X2 [Cicer
            arietinum]
          Length = 926

 Score =  108 bits (270), Expect = 1e-20
 Identities = 49/103 (47%), Positives = 69/103 (66%), Gaps = 8/103 (7%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            NE+HG HVC+KCGW YPN HPSAK+RRAHKK+CG ++G+K+  S+E+  F+ SDD H + 
Sbjct: 14   NENHGVHVCNKCGWLYPNPHPSAKNRRAHKKICGTIQGYKLDLSQEQTLFNASDDDHKTP 73

Query: 2139 EDPKVEENQNG--------GIRRQISNRSEEEEFSDAVSEFAD 2035
                 E+  +G           R ++ RSE++ FSDA +EF+D
Sbjct: 74   PSSNNEKGNDGMNELLGSTKFSRAMTMRSEDDVFSDAAAEFSD 116


>ref|XP_002520635.1| hypothetical protein RCOM_1554430 [Ricinus communis]
            gi|223540196|gb|EEF41771.1| hypothetical protein
            RCOM_1554430 [Ricinus communis]
          Length = 160

 Score =  108 bits (269), Expect = 2e-20
 Identities = 53/104 (50%), Positives = 67/104 (64%), Gaps = 9/104 (8%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            +++HG HVCHKCGW +PN HPSAKHRRAHKK+CG +EG+K+ +SE       S+D H SD
Sbjct: 1    HDNHGVHVCHKCGWPFPNPHPSAKHRRAHKKICGTIEGYKLVQSEGSTHSTMSEDEHQSD 60

Query: 2139 EDPKV--------EENQNG-GIRRQISNRSEEEEFSDAVSEFAD 2035
            ED K           N+ G G     S  SE+E F+DAV+EF D
Sbjct: 61   EDHKTPSPQILERSSNEKGSGAIGDRSGISEDEVFADAVAEFPD 104


>ref|XP_004504383.1| PREDICTED: serine-rich adhesin for platelets-like isoform X1 [Cicer
            arietinum]
          Length = 931

 Score =  105 bits (262), Expect = 1e-19
 Identities = 51/108 (47%), Positives = 70/108 (64%), Gaps = 13/108 (12%)
 Frame = -2

Query: 2319 NESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSD 2140
            NE+HG HVC+KCGW YPN HPSAK+RRAHKK+CG ++G+K+  S+E+  F+ SDD H + 
Sbjct: 14   NENHGVHVCNKCGWLYPNPHPSAKNRRAHKKICGTIQGYKLDLSQEQTLFNASDDDHKTP 73

Query: 2139 EDPKV----EENQNGGIR---------RQISNRSEEEEFSDAVSEFAD 2035
                V     E  N G+          R ++ RSE++ FSDA +EF+D
Sbjct: 74   PSLVVSGSNNEKGNDGMNELLGSTKFSRAMTMRSEDDVFSDAAAEFSD 121


>ref|XP_004147674.1| PREDICTED: uncharacterized protein LOC101215780 [Cucumis sativus]
          Length = 1079

 Score =  103 bits (256), Expect = 5e-19
 Identities = 151/686 (22%), Positives = 264/686 (38%), Gaps = 60/686 (8%)
 Frame = -2

Query: 2331 REMHNESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDG 2152
            R+   E+HG HVC+KCGW +PN HPSAKHRRAHK+VCG +EGFK+ ESE         D 
Sbjct: 4    RDQRQENHGVHVCNKCGWPFPNPHPSAKHRRAHKRVCGTIEGFKLVESEANALLTVVSDD 63

Query: 2151 HLSDE--DPKVEENQNG----GIRRQISNRSEEEEFSDAVSEFADXXXXXXXXXXXXXXX 1990
             + D+   PKV   + G    G++ + S  SE+E FSDAV+EF++               
Sbjct: 64   DVDDKISSPKVLGGRCGDDSVGMKTK-SKESEDEVFSDAVAEFSESVGPKKPMGDALDSS 122

Query: 1989 XXXXXXXSNIIQPSNNPSEGIQIQIPHTSQN---ITDQLDIDEKFQH--------HSSGS 1843
                     I        + + + I  T+ N      +  ++++F +         SS S
Sbjct: 123  AAKMVVEEEISSSQTLKDKEVLV-IAETTINQSGCEQEKKVNQEFVNIETESKTPLSSSS 181

Query: 1842 TMGSNAGPVLDKSAELLAVEPTKETNKSTNLA-TPVNYGGSENANLENILT-----GNSV 1681
            T        +    E+  +   +ET  +  L     +   +EN N+EN +       N +
Sbjct: 182  TENQKDESSVAAETEIDQLGNEQETKVNRELVDLETSSTSTENQNVENSVVVETEQENKI 241

Query: 1680 METMAFIE----------EKVDTKGKSDLDGSLSPVASPSKTIREASQTDIEIQTMDRF- 1534
             +    +E            +D    +   G L P   P   +    Q    + + DR  
Sbjct: 242  NQLYGNLETNFRHENSMIPSIDHINTTTTTGDLYP-NDPDTIVTALEQPQYSLLSPDRIC 300

Query: 1533 ----------------TGEAFSSD-------VTDAITTDTEPL--DGSFSAIAASPSKAI 1429
                              E   SD       + + I   TEPL  DG+F  +  +     
Sbjct: 301  DDEDFDSCKNSTEVAAASEKIDSDESGPSPKMEETIEISTEPLAHDGTFQLVVDNDMSIH 360

Query: 1428 DGFTGGGISLDVTDVNTVIVENTKPLDVVVDLKEEETHSFGQNVSSNEFFHDSCTNSMEV 1249
                   +S    +  +V+V + KP+D+      + T+  G+ +       +SC+++  +
Sbjct: 361  SEIPQSVLS--AANPQSVVVSDVKPIDLT-----QVTYDTGKEL-------ESCSSNNLL 406

Query: 1248 EAAKHMGASVGFSQPEVDCAHEVDIDCPDDNVGDCVAKEVNTNLPALSERTVKDCKYPEL 1069
            E     G +     P V      D++  D                   E  V++ +  + 
Sbjct: 407  ETDIIKGENDNVHLPSVSS----DLNTLDH-----------------PEALVEELENHKE 445

Query: 1068 VKSELSVTPDYGEAIRETEVKNSVSEEIPLG-FLSSKSGDGDNISSLAARVHEGNPQILG 892
            VK    V  D    +  + +K+   + IP G + + ++   D ++S   ++ E   +   
Sbjct: 446  VKLTSCVVQDPHGGV--SGLKDKSKDPIPKGSYFNLQAEPFDQVASFDTKIMESRQK--- 500

Query: 891  REVIVEQVSVDCVADSEANDTTIGGLESHELEILPDSTSTDVKYIEKNQMVCSIEGKESH 712
            +E +V+ VSVD   D  ++     G E+ E+ I  ++ +  +K +        +   E H
Sbjct: 501  QEEVVKNVSVDVKGDCSSH----SGQEAAEIPI-QETNAAQIKNL--------LSENEGH 547

Query: 711  SQQDLPQTPIVVAVAGGTVLSPSDAVVHEAINAVSSQDDIAHENSRIDNPDNCIREGTTE 532
            S+  +      VA+  G++ S S           S  + +A   + +DN  + + E    
Sbjct: 548  SKSQILSD---VAIGIGSIPSAS---------LSSEVESVAPSKNSLDNLSDNVTEVLFS 595

Query: 531  ENSITNKMVAPEAAILLSESENLKDA 454
            E             +LL + EN + A
Sbjct: 596  E--------VERGEVLLQDDENKEGA 613


>ref|XP_004298230.1| PREDICTED: uncharacterized protein LOC101308865 [Fragaria vesca
            subsp. vesca]
          Length = 1195

 Score =  100 bits (250), Expect = 2e-18
 Identities = 54/104 (51%), Positives = 70/104 (67%), Gaps = 10/104 (9%)
 Frame = -2

Query: 2316 ESHGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNPFDGSDDGHLSDE 2137
            E HG HVC KCGW +PNSHPSA+HRRAHKK+CG +EG+K+  S +      SDD   SD+
Sbjct: 15   EGHGVHVCSKCGWPFPNSHPSARHRRAHKKICGSIEGYKLVGSAQ---LSVSDDDQHSDD 71

Query: 2136 DPK------VEENQ----NGGIRRQISNRSEEEEFSDAVSEFAD 2035
            DPK      VE++     +GGI  Q S +S++  FSDAV+EF+D
Sbjct: 72   DPKTPSLKVVEKSAYKKGSGGI-GQGSIQSDDGVFSDAVAEFSD 114


>gb|EYU38966.1| hypothetical protein MIMGU_mgv1a004239mg [Mimulus guttatus]
          Length = 538

 Score = 96.3 bits (238), Expect = 6e-17
 Identities = 49/104 (47%), Positives = 64/104 (61%), Gaps = 12/104 (11%)
 Frame = -2

Query: 2310 HGSHVCHKCGWSYPNSHPSAKHRRAHKKVCGKLEGFKIPESEERNP--FDGSDDGHLSDE 2137
            H  H+C +C W +PN HPSAKHRRAHK+VCG +EG+K+  SEE +      SDD H SD 
Sbjct: 15   HEVHICSRCKWPFPNPHPSAKHRRAHKRVCGTVEGYKLIHSEEEHDRHLSISDDEHASDS 74

Query: 2136 D----------PKVEENQNGGIRRQISNRSEEEEFSDAVSEFAD 2035
            +           K E+  +G      SNRSE++ FSDAV+EF+D
Sbjct: 75   ENHTPSPNLVKKKAEDFASGEGAGAKSNRSEDDVFSDAVTEFSD 118


Top