BLASTX nr result

ID: Astragalus22_contig00020156 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00020156
         (985 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU41757.1| hypothetical protein TSUD_13570, partial [Trifol...   292   8e-92
ref|XP_013467012.1| dentin sialophosphoprotein, putative [Medica...   284   1e-83
ref|XP_017412805.1| PREDICTED: uncharacterized protein LOC108324...   282   6e-83
dbj|BAT96681.1| hypothetical protein VIGAN_08365700 [Vigna angul...   282   6e-83
gb|KRH69071.1| hypothetical protein GLYMA_02G002200 [Glycine max]     274   5e-80
gb|KRH69073.1| hypothetical protein GLYMA_02G002200 [Glycine max]     274   6e-80
ref|XP_014619484.1| PREDICTED: dentin sialophosphoprotein-like i...   274   7e-80
ref|XP_006574501.1| PREDICTED: dentin sialophosphoprotein-like i...   274   7e-80
gb|KRH69074.1| hypothetical protein GLYMA_02G002200 [Glycine max]     274   8e-80
ref|XP_014511304.1| dentin sialophosphoprotein isoform X2 [Vigna...   273   1e-79
ref|XP_022640909.1| dentin sialophosphoprotein isoform X1 [Vigna...   273   1e-79
gb|KHN15387.1| hypothetical protein glysoja_039044 [Glycine soja]     273   2e-79
gb|OIW07722.1| hypothetical protein TanjilG_11849 [Lupinus angus...   265   2e-76
ref|XP_007145920.1| hypothetical protein PHAVU_007G278800g [Phas...   207   1e-56
ref|XP_019450674.1| PREDICTED: dentin sialophosphoprotein-like [...   194   1e-51
gb|PNX93220.1| hypothetical protein L195_g016371, partial [Trifo...   187   1e-49
ref|XP_017975924.1| PREDICTED: uncharacterized protein LOC186030...   159   1e-39
ref|XP_017975923.1| PREDICTED: uncharacterized protein LOC186030...   159   1e-39
ref|XP_017975922.1| PREDICTED: uncharacterized protein LOC186030...   159   1e-39
ref|XP_017975921.1| PREDICTED: uncharacterized protein LOC186030...   159   1e-39

>dbj|GAU41757.1| hypothetical protein TSUD_13570, partial [Trifolium subterraneum]
          Length = 517

 Score =  292 bits (747), Expect = 8e-92
 Identities = 172/324 (53%), Positives = 217/324 (66%), Gaps = 33/324 (10%)
 Frame = -1

Query: 979  VTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTNENPESSYAMVNAFE 800
            VTE GF LD  + +S K SDE +A P +++SVVPE E  ++  DTNE  ESS+ M+   E
Sbjct: 197  VTENGF-LDANVCDSIKVSDEGNAPPYEENSVVPEVEGSTVVEDTNEKMESSFVMIKKIE 255

Query: 799  ETDMSYQCNSFLETINQEEYFSLQNSTSLKQ--------TNESKYFDV-----TVPSLDL 659
            ET+M  QCNS + TINQEE FSLQNS+SL            +++ F V     +V SLD 
Sbjct: 256  ETNMLEQCNSDMITINQEETFSLQNSSSLLHIYSYNHGNVEQTESFTVASMPKSVQSLDF 315

Query: 658  VDEEAIEKESEKHSQQAD--------------NPNNSILANSGNEIR-----LSIESNSD 536
            VDEE IEKE E +SQ A+              + NNS+ AN G E R     LS ESNSD
Sbjct: 316  VDEEVIEKEREDYSQHAEATSLIVEEQTTSIESSNNSLFANGGYETRDSVTRLSTESNSD 375

Query: 535  NPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNSMLHNAE 356
            NP I+CQ+QKSPSFNL+LR EA++EESDQ PLL++  S N++L     +N+SNSM H+  
Sbjct: 376  NPHITCQMQKSPSFNLNLRTEAKREESDQIPLLHE--SANDNLLNKASLNLSNSMPHDEY 433

Query: 355  MPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEV-SSTSH 179
              +EEKIVTMERSYS++SK+ F   LK+EEEAHLLVM QTQ+N+ G+  EVKEV SSTS 
Sbjct: 434  DHIEEKIVTMERSYSEVSKSSFIGFLKEEEEAHLLVMEQTQDNNAGSKMEVKEVSSSTSP 493

Query: 178  DGKEKRKSRSYFFSSCMCCATVTN 107
             GK+KR  RS+FF++CMCCA V N
Sbjct: 494  KGKDKRNFRSFFFTNCMCCAAVPN 517


>ref|XP_013467012.1| dentin sialophosphoprotein, putative [Medicago truncatula]
 gb|KEH41047.1| dentin sialophosphoprotein, putative [Medicago truncatula]
          Length = 1257

 Score =  284 bits (727), Expect = 1e-83
 Identities = 175/329 (53%), Positives = 213/329 (64%), Gaps = 46/329 (13%)
 Frame = -1

Query: 955  DVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA--------------------DTNEN 836
            D Y++NS K SDESSA  E+ + VV E E V L +                    DT+E 
Sbjct: 932  DTYVSNSIKVSDESSASQEENNHVVHEVEPVFLISGSTDVDCIHGKGENNGNKVEDTDEK 991

Query: 835  PESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL------KQTNESKYFDVTV 674
             ESSY M++ FEET+M  QCNS + TI+QEE FSLQNS+SL      +Q N  +    T 
Sbjct: 992  TESSYVMLSKFEETNMLEQCNSDMLTISQEESFSLQNSSSLLHIYKYQQGNVEQTKSFTA 1051

Query: 673  PSLDLVDEEAIEKESEKHSQQAD---------------NPNNSILANSG-----NEIRLS 554
             ++   DEE IEKE E +SQ ++               + NNS  AN G     N  RLS
Sbjct: 1052 TTMLKSDEEEIEKEREDYSQHSEATSLIVEKLTTSTELSSNNSTFANGGYETRENVTRLS 1111

Query: 553  IESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS 374
             ESNSDNP I+CQ+QKSPSFNL+LR+E+R+EESDQ PLL  DKS+++SL     +NISNS
Sbjct: 1112 TESNSDNPNITCQMQKSPSFNLNLRMESRREESDQIPLL--DKSSDDSLPNKASLNISNS 1169

Query: 373  MLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEV 194
            M H+    +EEKIVTMERSYS+ISKA F   LK EEEAH+LVMAQTQ+ + G+  EVKEV
Sbjct: 1170 MSHDEYGLIEEKIVTMERSYSEISKASFIGFLK-EEEAHVLVMAQTQDINVGSKIEVKEV 1228

Query: 193  SSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
            SSTS  GKEKRKSRSYFF+SCMCCATV N
Sbjct: 1229 SSTSPKGKEKRKSRSYFFTSCMCCATVPN 1257


>ref|XP_017412805.1| PREDICTED: uncharacterized protein LOC108324370 [Vigna angularis]
          Length = 1180

 Score =  282 bits (721), Expect = 6e-83
 Identities = 182/374 (48%), Positives = 216/374 (57%), Gaps = 84/374 (22%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA---------------- 851
            KVTE G   D Y+NNSNKAS+ES A  E+KHS VPEA+ VSL                  
Sbjct: 810  KVTENGVPFDAYINNSNKASEESGATSEEKHSAVPEAKRVSLIEGLTDVDCRHDEGNENK 869

Query: 850  --DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL------------ 713
              +TNE PE S  M+N FE T+MS QCN  L TINQEE F LQN++SL            
Sbjct: 870  IEETNEKPEVSNVMINTFEGTEMSEQCNCDLVTINQEESFPLQNNSSLLHLYDDHQDNVK 929

Query: 712  ---------------KQTNESKYF----------DVTVPSLDLVDEEAIEKESEKHSQQA 608
                           KQTN++K            ++TVPSLDL ++EA EKE ++  +  
Sbjct: 930  QDRSLTATSMPNLGWKQTNKTKNSGHTIDNSNSSELTVPSLDLDNKEAFEKEDKEDPEHT 989

Query: 607  DNP---------------NNSILANSGNEIRLSI-----ESNSDNPIISCQIQKSPSFNL 488
            +                 N SI  N  NE + SI     ESN DNP  S Q+Q+SPSFNL
Sbjct: 990  EAELTTSTATMSIVEPYSNKSIFENGANETKESITRPSTESNPDNPNTSSQMQQSPSFNL 1049

Query: 487  DLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNSM---------LHNAEMPVEEKI 335
            +LR EAR  ESD+ PLL+Q+KS NES SK   +N+ NSM         LH+ EMPVEEKI
Sbjct: 1050 NLRKEARPGESDKIPLLHQNKSANESFSKQNSMNLINSMPHSQYEQCMLHSEEMPVEEKI 1109

Query: 334  VTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEKRKS 155
            VTMERSYS  SKAPF  +LK+EEEAHLL MA+TQ+NH GT      VSSTS   K+KRK 
Sbjct: 1110 VTMERSYSRKSKAPFIGLLKEEEEAHLLGMARTQDNHAGTK---NTVSSTS-PKKDKRKP 1165

Query: 154  RSYFFSSCMCCATV 113
            RS FFSSCMCCATV
Sbjct: 1166 RSSFFSSCMCCATV 1179


>dbj|BAT96681.1| hypothetical protein VIGAN_08365700 [Vigna angularis var. angularis]
          Length = 1181

 Score =  282 bits (721), Expect = 6e-83
 Identities = 182/374 (48%), Positives = 216/374 (57%), Gaps = 84/374 (22%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA---------------- 851
            KVTE G   D Y+NNSNKAS+ES A  E+KHS VPEA+ VSL                  
Sbjct: 811  KVTENGVPFDAYINNSNKASEESGATSEEKHSAVPEAKRVSLIEGLTDVDCRHDEGNENK 870

Query: 850  --DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL------------ 713
              +TNE PE S  M+N FE T+MS QCN  L TINQEE F LQN++SL            
Sbjct: 871  IEETNEKPEVSNVMINTFEGTEMSEQCNCDLVTINQEESFPLQNNSSLLHLYDDHQDNVK 930

Query: 712  ---------------KQTNESKYF----------DVTVPSLDLVDEEAIEKESEKHSQQA 608
                           KQTN++K            ++TVPSLDL ++EA EKE ++  +  
Sbjct: 931  QDRSLTATSMPNLGWKQTNKTKNSGHTIDNSNSSELTVPSLDLDNKEAFEKEDKEDPEHT 990

Query: 607  DNP---------------NNSILANSGNEIRLSI-----ESNSDNPIISCQIQKSPSFNL 488
            +                 N SI  N  NE + SI     ESN DNP  S Q+Q+SPSFNL
Sbjct: 991  EAELTTSTATMSIVEPYSNKSIFENGANETKESITRPSTESNPDNPNTSSQMQQSPSFNL 1050

Query: 487  DLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNSM---------LHNAEMPVEEKI 335
            +LR EAR  ESD+ PLL+Q+KS NES SK   +N+ NSM         LH+ EMPVEEKI
Sbjct: 1051 NLRKEARPGESDKIPLLHQNKSANESFSKQNSMNLINSMPHSQYEQCMLHSEEMPVEEKI 1110

Query: 334  VTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEKRKS 155
            VTMERSYS  SKAPF  +LK+EEEAHLL MA+TQ+NH GT      VSSTS   K+KRK 
Sbjct: 1111 VTMERSYSRKSKAPFIGLLKEEEEAHLLGMARTQDNHAGTK---NTVSSTS-PKKDKRKP 1166

Query: 154  RSYFFSSCMCCATV 113
            RS FFSSCMCCATV
Sbjct: 1167 RSSFFSSCMCCATV 1180


>gb|KRH69071.1| hypothetical protein GLYMA_02G002200 [Glycine max]
          Length = 1218

 Score =  274 bits (701), Expect = 5e-80
 Identities = 187/377 (49%), Positives = 217/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPEA+ VSL A             
Sbjct: 845  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEAKRVSLIAGLTVVDCRHEEGE 904

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 905  SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 964

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 965  KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1024

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1025 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1083

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1084 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1143

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1144 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1200

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1201 RKPRSSFFSSCICCATV 1217


>gb|KRH69073.1| hypothetical protein GLYMA_02G002200 [Glycine max]
          Length = 1266

 Score =  274 bits (701), Expect = 6e-80
 Identities = 187/377 (49%), Positives = 217/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPEA+ VSL A             
Sbjct: 893  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEAKRVSLIAGLTVVDCRHEEGE 952

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 953  SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 1012

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 1013 KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1072

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1073 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1131

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1132 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1191

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1192 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1248

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1249 RKPRSSFFSSCICCATV 1265


>ref|XP_014619484.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
 gb|KRH69072.1| hypothetical protein GLYMA_02G002200 [Glycine max]
          Length = 1286

 Score =  274 bits (701), Expect = 7e-80
 Identities = 187/377 (49%), Positives = 217/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPEA+ VSL A             
Sbjct: 913  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEAKRVSLIAGLTVVDCRHEEGE 972

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 973  SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 1032

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 1033 KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1092

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1093 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1151

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1152 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1211

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1212 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1268

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1269 RKPRSSFFSSCICCATV 1285


>ref|XP_006574501.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1287

 Score =  274 bits (701), Expect = 7e-80
 Identities = 187/377 (49%), Positives = 217/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPEA+ VSL A             
Sbjct: 914  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEAKRVSLIAGLTVVDCRHEEGE 973

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 974  SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 1033

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 1034 KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1093

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1094 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1152

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1153 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1212

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1213 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1269

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1270 RKPRSSFFSSCICCATV 1286


>gb|KRH69074.1| hypothetical protein GLYMA_02G002200 [Glycine max]
          Length = 1334

 Score =  274 bits (701), Expect = 8e-80
 Identities = 187/377 (49%), Positives = 217/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPEA+ VSL A             
Sbjct: 961  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEAKRVSLIAGLTVVDCRHEEGE 1020

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 1021 SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 1080

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 1081 KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1140

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1141 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1199

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1200 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1259

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1260 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1316

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1317 RKPRSSFFSSCICCATV 1333


>ref|XP_014511304.1| dentin sialophosphoprotein isoform X2 [Vigna radiata var. radiata]
          Length = 1171

 Score =  273 bits (697), Expect = 1e-79
 Identities = 181/375 (48%), Positives = 213/375 (56%), Gaps = 85/375 (22%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA---------------- 851
            KV E G   D Y+NNSNKA +ES A  E+KHS VPEA+ VSL                  
Sbjct: 800  KVIENGVPFDAYVNNSNKALEESGATSEEKHSAVPEAKRVSLIECLTDVNCRHEEGKENK 859

Query: 850  --DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL------------ 713
              +TNE PE S+ M+N FE T+MS QCN  L TINQEE   LQN++SL            
Sbjct: 860  IEETNEKPEVSHVMINTFEGTEMSEQCNCDLVTINQEESIPLQNNSSLLHFYDDHQDNVK 919

Query: 712  ---------------KQTNE----------SKYFDVTVPSL-DLVDEEAIEKESEKHSQQ 611
                           KQTN+          S   ++TVPSL DL +++A EKE ++  Q 
Sbjct: 920  QDRSFTATIMANLGWKQTNKTTNSGHTIDNSNSSELTVPSLLDLDNKDAFEKEDKEDPQH 979

Query: 610  ADNP---------------NNSILANSGNEIRLSI-----ESNSDNPIISCQIQKSPSFN 491
            A                  N SI  N  NE + SI     ESN DNP  S Q+QKSPSFN
Sbjct: 980  AAAELTTSTATMSIVELYSNKSIFENGANETKESIARPSTESNPDNPNTSSQMQKSPSFN 1039

Query: 490  LDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNSM---------LHNAEMPVEEK 338
            L+LR EAR  ESD+ PLL+Q+KS NES SK   +N+ NSM         LH+ EMPVEEK
Sbjct: 1040 LNLRKEARPGESDKIPLLHQNKSANESFSKQNSMNLINSMPHSQYEQCMLHSEEMPVEEK 1099

Query: 337  IVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEKRK 158
            IVTMERSYS  SKAPF  +LK+EEEAHLL MA+TQ+NH GT      VSSTS   K+KRK
Sbjct: 1100 IVTMERSYSGKSKAPFIGLLKEEEEAHLLGMARTQDNHAGTK---NTVSSTS-PKKDKRK 1155

Query: 157  SRSYFFSSCMCCATV 113
             RS FFSSCMCCATV
Sbjct: 1156 PRSSFFSSCMCCATV 1170


>ref|XP_022640909.1| dentin sialophosphoprotein isoform X1 [Vigna radiata var. radiata]
 ref|XP_022640910.1| dentin sialophosphoprotein isoform X1 [Vigna radiata var. radiata]
          Length = 1172

 Score =  273 bits (697), Expect = 1e-79
 Identities = 181/375 (48%), Positives = 213/375 (56%), Gaps = 85/375 (22%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA---------------- 851
            KV E G   D Y+NNSNKA +ES A  E+KHS VPEA+ VSL                  
Sbjct: 801  KVIENGVPFDAYVNNSNKALEESGATSEEKHSAVPEAKRVSLIECLTDVNCRHEEGKENK 860

Query: 850  --DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL------------ 713
              +TNE PE S+ M+N FE T+MS QCN  L TINQEE   LQN++SL            
Sbjct: 861  IEETNEKPEVSHVMINTFEGTEMSEQCNCDLVTINQEESIPLQNNSSLLHFYDDHQDNVK 920

Query: 712  ---------------KQTNE----------SKYFDVTVPSL-DLVDEEAIEKESEKHSQQ 611
                           KQTN+          S   ++TVPSL DL +++A EKE ++  Q 
Sbjct: 921  QDRSFTATIMANLGWKQTNKTTNSGHTIDNSNSSELTVPSLLDLDNKDAFEKEDKEDPQH 980

Query: 610  ADNP---------------NNSILANSGNEIRLSI-----ESNSDNPIISCQIQKSPSFN 491
            A                  N SI  N  NE + SI     ESN DNP  S Q+QKSPSFN
Sbjct: 981  AAAELTTSTATMSIVELYSNKSIFENGANETKESIARPSTESNPDNPNTSSQMQKSPSFN 1040

Query: 490  LDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNSM---------LHNAEMPVEEK 338
            L+LR EAR  ESD+ PLL+Q+KS NES SK   +N+ NSM         LH+ EMPVEEK
Sbjct: 1041 LNLRKEARPGESDKIPLLHQNKSANESFSKQNSMNLINSMPHSQYEQCMLHSEEMPVEEK 1100

Query: 337  IVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEKRK 158
            IVTMERSYS  SKAPF  +LK+EEEAHLL MA+TQ+NH GT      VSSTS   K+KRK
Sbjct: 1101 IVTMERSYSGKSKAPFIGLLKEEEEAHLLGMARTQDNHAGTK---NTVSSTS-PKKDKRK 1156

Query: 157  SRSYFFSSCMCCATV 113
             RS FFSSCMCCATV
Sbjct: 1157 PRSSFFSSCMCCATV 1171


>gb|KHN15387.1| hypothetical protein glysoja_039044 [Glycine soja]
          Length = 1290

 Score =  273 bits (697), Expect = 2e-79
 Identities = 186/377 (49%), Positives = 216/377 (57%), Gaps = 86/377 (22%)
 Frame = -1

Query: 985  PKVTEKG--FQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNADTN---------- 842
            PKVTE G  F +D Y+NN NKAS+ES  V +QKH VVPE + VSL A             
Sbjct: 917  PKVTENGVPFDVDAYVNNYNKASEESGTVSDQKHFVVPEVKRVSLIAGLTVVDCRHEEGE 976

Query: 841  ------ENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSL----------- 713
                  E  E+S  MVN F+ T MS QCNS L TINQEE F LQ ++SL           
Sbjct: 977  SYKNKIEESEASRVMVNTFDGTQMSEQCNSDLVTINQEESFPLQYNSSLLHICDDNQDNV 1036

Query: 712  ----------------KQTNESKYFDV--------TVPSLDLVDEEAIEKESE--KHSQQ 611
                            K  NE+K F +        TVPS+DLVD+EA EK  E  +H + 
Sbjct: 1037 KQDQSFTATSMPNSDWKLANETKNFAIDNHVSSKLTVPSVDLVDDEAFEKREEHLQHVKA 1096

Query: 610  ADN---------------PNN--SILANSGNEIR-----LSIESNSDNPIISCQIQKSPS 497
            A +               P++  SI AN G E R     LS ESN DN I SCQ+QKSPS
Sbjct: 1097 ASSAGAELTTSTATMSIEPSSIYSIFANGGYETRDSVTRLSTESNPDNSI-SCQMQKSPS 1155

Query: 496  FNLDLRIEARKEESDQTPLLYQDKSNNESLSKLTGINISNS---------MLHNAEMPVE 344
            FNL+LR EAR EESD+ PLL+QD S ++SLSK T  N+  S         MLH  EMPV+
Sbjct: 1156 FNLNLRKEARAEESDKIPLLHQDMSADKSLSKKTSQNLIKSVPHDDYEQCMLHIEEMPVQ 1215

Query: 343  EKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEK 164
            EKIVTMERSYS  SKAPF  +LK+EEEAHLL M Q Q+NH GT      VSSTSH  KEK
Sbjct: 1216 EKIVTMERSYSRKSKAPFIGLLKEEEEAHLLNMPQIQHNHVGTK---NAVSSTSHKRKEK 1272

Query: 163  RKSRSYFFSSCMCCATV 113
            RK RS FFSSC+CCATV
Sbjct: 1273 RKPRSSFFSSCICCATV 1289


>gb|OIW07722.1| hypothetical protein TanjilG_11849 [Lupinus angustifolius]
          Length = 1398

 Score =  265 bits (676), Expect = 2e-76
 Identities = 168/345 (48%), Positives = 205/345 (59%), Gaps = 52/345 (15%)
 Frame = -1

Query: 985  PKVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA--------------- 851
            PKV E G Q D ++NN  K SDE++ V E +HSVVPEAE VSL                 
Sbjct: 1058 PKVGEGGIQFDAHVNNC-KPSDETNFVSEPEHSVVPEAEMVSLIGGSNVVDCRHKSGENY 1116

Query: 850  -----DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSLKQTNESKYF 686
                 +TN   E+SYA     E T++S +CNS L T+NQEEYF+LQNS+SL    +S   
Sbjct: 1117 KTKMDETNGKSEASYADFETSEGTEISEECNSDLVTLNQEEYFTLQNSSSLLHIYDSS-- 1174

Query: 685  DVTVPSLDLVDEEAIEKESEKHSQQADNP---------------------NNSILANSGN 569
            +VTVPSLDLVD+E+ E E E +S   ++                      NNSI A+ G 
Sbjct: 1175 NVTVPSLDLVDDESFENEGEIYSLHIESTSLKEAKLTSSAATMSSEEPCSNNSIFASGGY 1234

Query: 568  EIR-----LSIESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKS--NNES 410
            E R      S ES SDNP  S  IQKSPSFNL+L+IE R EESDQ PL  + +   N  S
Sbjct: 1235 ETREMVTRFSTESESDNPNFSSLIQKSPSFNLNLQIEVRPEESDQAPLKLEIERIPNQAS 1294

Query: 409  LS----KLTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMA 242
            L+     +  +     ML N E+ VEEKIVTMERSYS+  KAPFT +LK EEE+HL VM 
Sbjct: 1295 LNLINNSMPNVEYGKCMLQNEEVAVEEKIVTMERSYSEKYKAPFTGLLK-EEESHLHVMP 1353

Query: 241  QTQNNHGGTNKEVKEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
            Q Q+NH G  K+VKEV STS  GKEKR++RS FFS+CMCC TV N
Sbjct: 1354 QIQDNHSGAMKDVKEVLSTSPKGKEKRRARSSFFSTCMCCTTVAN 1398


>ref|XP_007145920.1| hypothetical protein PHAVU_007G278800g [Phaseolus vulgaris]
 gb|ESW17914.1| hypothetical protein PHAVU_007G278800g [Phaseolus vulgaris]
          Length = 978

 Score =  207 bits (527), Expect = 1e-56
 Identities = 138/311 (44%), Positives = 175/311 (56%), Gaps = 49/311 (15%)
 Frame = -1

Query: 898  QKHSVVPEAETVSLNADTNENP---ESSYAMVNAFEETDMSYQCNSFLET---INQEEYF 737
            +++   P  +++   + + EN    +   +M ++ +E +M Y C+  +E+    N E   
Sbjct: 670  EENCKAPYGKSILSRSGSMENSHYYKPDQSMKDSLKENNMVYTCDVSIESNGECNGERNM 729

Query: 736  SLQNSTSLKQTNESK----------YFDVTV---PSLDLVDEEAIEKESEKHSQQADNP- 599
            SL ++ S   TN+             FD  +    S +L D+EA EKE ++  Q A+   
Sbjct: 730  SLDSNVSRLVTNDQVEEPKVTDNGVQFDAYINNPDSSELEDDEAFEKEGKEDPQHAEAAS 789

Query: 598  --------------------NNSILANSGNEIR-----LSIESNSDNPIISCQIQKSPSF 494
                                N  IL N G E R     LS ESN +NP  SCQ+QKSPSF
Sbjct: 790  YAGAELSTSTVTMSIIGPCSNKFILENGGYETRESITRLSTESNPENPNTSCQMQKSPSF 849

Query: 493  NLDLRIEARKEESDQTPLLYQDKSNNESLSK----LTGINISNSMLHNAEMPVEEKIVTM 326
            NL+LR EAR EESD+TPLL+Q+KS NES SK    +        MLH+ EMPVEEKIVTM
Sbjct: 850  NLNLRKEARPEESDKTPLLHQNKSANESFSKHINSMPHGEYEQCMLHSKEMPVEEKIVTM 909

Query: 325  ERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEVSSTSHDGKEKRKSRSY 146
            ERSYS  SKAPF  +LK+EEEAHLL MAQ Q NH GT      VSSTSH  +EKRK RS 
Sbjct: 910  ERSYSKKSKAPFIGLLKEEEEAHLLGMAQIQENHVGTK---NTVSSTSHKRQEKRKPRSS 966

Query: 145  FFSSCMCCATV 113
            FFSSCMCCATV
Sbjct: 967  FFSSCMCCATV 977


>ref|XP_019450674.1| PREDICTED: dentin sialophosphoprotein-like [Lupinus angustifolius]
          Length = 1334

 Score =  194 bits (492), Expect = 1e-51
 Identities = 140/345 (40%), Positives = 170/345 (49%), Gaps = 52/345 (15%)
 Frame = -1

Query: 985  PKVTEKGFQLDVYLNNSNKASDESSAVPEQKHSVVPEAETVSLNA--------------- 851
            PKV E G Q D ++NN  K SDE++ V E +HSVVPEAE VSL                 
Sbjct: 1038 PKVGEGGIQFDAHVNNC-KPSDETNFVSEPEHSVVPEAEMVSLIGGSNVVDCRHKSGENY 1096

Query: 850  -----DTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYFSLQNSTSLKQTNESKYF 686
                 +TN   E+SYA     E T++S +CNS L T N                      
Sbjct: 1097 KTKMDETNGKSEASYADFETSEGTEISEECNSDLVTSN---------------------- 1134

Query: 685  DVTVPSLDLVDEEAIEKESEKHSQQADNP---------------------NNSILANSGN 569
             VTVPSLDLVD+E+ E E E +S   ++                      NNSI A+ G 
Sbjct: 1135 -VTVPSLDLVDDESFENEGEIYSLHIESTSLKEAKLTSSAATMSSEEPCSNNSIFASGGY 1193

Query: 568  EIR-----LSIESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKS--NNES 410
            E R      S ES SDNP  S  IQKSPSFNL+L+IE R EESDQ PL  + +   N  S
Sbjct: 1194 ETREMVTRFSTESESDNPNFSSLIQKSPSFNLNLQIEVRPEESDQAPLKLEIERIPNQAS 1253

Query: 409  LS----KLTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMA 242
            L+     +  +     ML N E+ VEEKIVTMERSYS+                      
Sbjct: 1254 LNLINNSMPNVEYGKCMLQNEEVAVEEKIVTMERSYSE---------------------- 1291

Query: 241  QTQNNHGGTNKEVKEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
              + NH G  K+VKEV STS  GKEKR++RS FFS+CMCC TV N
Sbjct: 1292 --KYNHSGAMKDVKEVLSTSPKGKEKRRARSSFFSTCMCCTTVAN 1334


>gb|PNX93220.1| hypothetical protein L195_g016371, partial [Trifolium pratense]
          Length = 980

 Score =  187 bits (476), Expect = 1e-49
 Identities = 138/365 (37%), Positives = 197/365 (53%), Gaps = 74/365 (20%)
 Frame = -1

Query: 979  VTEKGFQLDVYLNNSNKASDES-------SAVPEQKHS-VVPE--------AETVSLNAD 848
            VTE GF  D Y++NS K SDE+       S VPE + S VV E        ++ +++N +
Sbjct: 620  VTENGFLFDAYVSNSIKVSDENASPEEEKSVVPEVEESTVVAETNMLEQCSSDMITINQE 679

Query: 847  TNENPESSYAMVNAF-----------------------EETDMSYQC-----NSFLETIN 752
               + ++S ++++ +                       ++ +  Y C      SF+  I+
Sbjct: 680  ETFSLQNSSSLLHIYSYNHGNVEQTESFTVASMPKSGRKQANSIYFCATVDLTSFVRFIS 739

Query: 751  ----------------------QEEYFSLQNSTSLKQTNE-SKYFDVT-VPSLDLVDEEA 644
                                  +++   L  +T  + T    K   ++ V    L   E 
Sbjct: 740  TDIFNFYFLCGLGGVGSLGERREQQRAKLPRTTKRRGTTSLHKILRISNVVFHSLGGPET 799

Query: 643  IEKESEKHSQQADNPNNSILANSGNEIR-----LSIESNSDNPIISCQIQKSPSFNLDLR 479
            +     +HS  A+  N+S+ AN G E R     LS ESN D+  I+C +QKSPSFNL+LR
Sbjct: 800  LACSCSQHS--AELSNSSLFANGGYETRDSVTRLSTESNPDDKNITCHMQKSPSFNLNLR 857

Query: 478  IEARKEESDQTPLLYQDKSNNESLSKLTGINISNSMLHNAEMPVEEKIVTMERSYSDISK 299
            IEA++EESDQ PLL +  S N+SLS    +N+SNSM H+    +EEKIVTMERSYS++SK
Sbjct: 858  IEAKQEESDQIPLLRE--SANDSLSNKASLNLSNSMPHDEYDHIEEKIVTMERSYSEVSK 915

Query: 298  APFTSILKDEEEAHLLVMAQTQNNHGGTNKEVKEV-SSTSHDGKEKRKSRSYFFSSCMCC 122
            + F   LK+EEEA LLVM QTQ+N+ G+  EVK V SSTS  GK++R  RS+FF++CMCC
Sbjct: 916  SSFIGFLKEEEEARLLVMEQTQDNNVGSKVEVKVVSSSTSPKGKDRRNFRSFFFTNCMCC 975

Query: 121  ATVTN 107
            ATV N
Sbjct: 976  ATVPN 980


>ref|XP_017975924.1| PREDICTED: uncharacterized protein LOC18603083 isoform X5 [Theobroma
            cacao]
          Length = 1027

 Score =  159 bits (402), Expect = 1e-39
 Identities = 120/342 (35%), Positives = 162/342 (47%), Gaps = 50/342 (14%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQK-------HSVVP------------EAETVS 860
            K  E G  + V +N  N A +E     ++K       H+V+             E+E  +
Sbjct: 686  KTVENGLLIHVPINYQNGAIEEQQKDSQKKEIHLVTDHAVISADIFPTDQKDKEESEEKN 745

Query: 859  LNADTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYF-----SLQNSTS------L 713
            +  +     E   ++ N     +   QC S    I Q E F     S QN T        
Sbjct: 746  IIEEMLAKIEDLNSIGNDTSNRESGEQCISHSYPIEQAEAFLSPSHSNQNLTEKSVISMA 805

Query: 712  KQTNESKYFDVTV-PSLDLVDEEAIEKESEKHSQQADNPNNSILANSG-----NEIRLSI 551
            +   E   +D++  P    V     +  +++ S+Q        +AN       +  RLS 
Sbjct: 806  ELAGEKPVWDLSNRPKTTPVTMAETKPSAKQCSEQRAIGETPAIANGDYYQQESVGRLST 865

Query: 550  ESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSK---------- 401
            ESNSDN  I  Q++KSPSF+LDLRI AR EESDQTPLLYQDK   +S S           
Sbjct: 866  ESNSDNMSIHAQMRKSPSFDLDLRIHARAEESDQTPLLYQDKPTIDSFSSQAADDTLGKP 925

Query: 400  LTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHG 221
            L       + LH   MPVEEK+VT+ER  S+ SK PF   L++EEEAH+L+  + Q+NH 
Sbjct: 926  LANTEHGKNSLHYEAMPVEEKVVTLERCDSEKSKTPFLGFLREEEEAHMLITPKKQDNHS 985

Query: 220  GTNKEV----KEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
               K      KEV+     GKEKRK R+  F +CMCCATV N
Sbjct: 986  AAKKATKVSPKEVTPAPPKGKEKRKPRTSLFGTCMCCATVIN 1027


>ref|XP_017975923.1| PREDICTED: uncharacterized protein LOC18603083 isoform X4 [Theobroma
            cacao]
          Length = 1162

 Score =  159 bits (402), Expect = 1e-39
 Identities = 120/342 (35%), Positives = 162/342 (47%), Gaps = 50/342 (14%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQK-------HSVVP------------EAETVS 860
            K  E G  + V +N  N A +E     ++K       H+V+             E+E  +
Sbjct: 821  KTVENGLLIHVPINYQNGAIEEQQKDSQKKEIHLVTDHAVISADIFPTDQKDKEESEEKN 880

Query: 859  LNADTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYF-----SLQNSTS------L 713
            +  +     E   ++ N     +   QC S    I Q E F     S QN T        
Sbjct: 881  IIEEMLAKIEDLNSIGNDTSNRESGEQCISHSYPIEQAEAFLSPSHSNQNLTEKSVISMA 940

Query: 712  KQTNESKYFDVTV-PSLDLVDEEAIEKESEKHSQQADNPNNSILANSG-----NEIRLSI 551
            +   E   +D++  P    V     +  +++ S+Q        +AN       +  RLS 
Sbjct: 941  ELAGEKPVWDLSNRPKTTPVTMAETKPSAKQCSEQRAIGETPAIANGDYYQQESVGRLST 1000

Query: 550  ESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSK---------- 401
            ESNSDN  I  Q++KSPSF+LDLRI AR EESDQTPLLYQDK   +S S           
Sbjct: 1001 ESNSDNMSIHAQMRKSPSFDLDLRIHARAEESDQTPLLYQDKPTIDSFSSQAADDTLGKP 1060

Query: 400  LTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHG 221
            L       + LH   MPVEEK+VT+ER  S+ SK PF   L++EEEAH+L+  + Q+NH 
Sbjct: 1061 LANTEHGKNSLHYEAMPVEEKVVTLERCDSEKSKTPFLGFLREEEEAHMLITPKKQDNHS 1120

Query: 220  GTNKEV----KEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
               K      KEV+     GKEKRK R+  F +CMCCATV N
Sbjct: 1121 AAKKATKVSPKEVTPAPPKGKEKRKPRTSLFGTCMCCATVIN 1162


>ref|XP_017975922.1| PREDICTED: uncharacterized protein LOC18603083 isoform X3 [Theobroma
            cacao]
          Length = 1227

 Score =  159 bits (402), Expect = 1e-39
 Identities = 120/342 (35%), Positives = 162/342 (47%), Gaps = 50/342 (14%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQK-------HSVVP------------EAETVS 860
            K  E G  + V +N  N A +E     ++K       H+V+             E+E  +
Sbjct: 886  KTVENGLLIHVPINYQNGAIEEQQKDSQKKEIHLVTDHAVISADIFPTDQKDKEESEEKN 945

Query: 859  LNADTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYF-----SLQNSTS------L 713
            +  +     E   ++ N     +   QC S    I Q E F     S QN T        
Sbjct: 946  IIEEMLAKIEDLNSIGNDTSNRESGEQCISHSYPIEQAEAFLSPSHSNQNLTEKSVISMA 1005

Query: 712  KQTNESKYFDVTV-PSLDLVDEEAIEKESEKHSQQADNPNNSILANSG-----NEIRLSI 551
            +   E   +D++  P    V     +  +++ S+Q        +AN       +  RLS 
Sbjct: 1006 ELAGEKPVWDLSNRPKTTPVTMAETKPSAKQCSEQRAIGETPAIANGDYYQQESVGRLST 1065

Query: 550  ESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSK---------- 401
            ESNSDN  I  Q++KSPSF+LDLRI AR EESDQTPLLYQDK   +S S           
Sbjct: 1066 ESNSDNMSIHAQMRKSPSFDLDLRIHARAEESDQTPLLYQDKPTIDSFSSQAADDTLGKP 1125

Query: 400  LTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHG 221
            L       + LH   MPVEEK+VT+ER  S+ SK PF   L++EEEAH+L+  + Q+NH 
Sbjct: 1126 LANTEHGKNSLHYEAMPVEEKVVTLERCDSEKSKTPFLGFLREEEEAHMLITPKKQDNHS 1185

Query: 220  GTNKEV----KEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
               K      KEV+     GKEKRK R+  F +CMCCATV N
Sbjct: 1186 AAKKATKVSPKEVTPAPPKGKEKRKPRTSLFGTCMCCATVIN 1227


>ref|XP_017975921.1| PREDICTED: uncharacterized protein LOC18603083 isoform X2 [Theobroma
            cacao]
          Length = 1227

 Score =  159 bits (402), Expect = 1e-39
 Identities = 120/342 (35%), Positives = 162/342 (47%), Gaps = 50/342 (14%)
 Frame = -1

Query: 982  KVTEKGFQLDVYLNNSNKASDESSAVPEQK-------HSVVP------------EAETVS 860
            K  E G  + V +N  N A +E     ++K       H+V+             E+E  +
Sbjct: 886  KTVENGLLIHVPINYQNGAIEEQQKDSQKKEIHLVTDHAVISADIFPTDQKDKEESEEKN 945

Query: 859  LNADTNENPESSYAMVNAFEETDMSYQCNSFLETINQEEYF-----SLQNSTS------L 713
            +  +     E   ++ N     +   QC S    I Q E F     S QN T        
Sbjct: 946  IIEEMLAKIEDLNSIGNDTSNRESGEQCISHSYPIEQAEAFLSPSHSNQNLTEKSVISMA 1005

Query: 712  KQTNESKYFDVTV-PSLDLVDEEAIEKESEKHSQQADNPNNSILANSG-----NEIRLSI 551
            +   E   +D++  P    V     +  +++ S+Q        +AN       +  RLS 
Sbjct: 1006 ELAGEKPVWDLSNRPKTTPVTMAETKPSAKQCSEQRAIGETPAIANGDYYQQESVGRLST 1065

Query: 550  ESNSDNPIISCQIQKSPSFNLDLRIEARKEESDQTPLLYQDKSNNESLSK---------- 401
            ESNSDN  I  Q++KSPSF+LDLRI AR EESDQTPLLYQDK   +S S           
Sbjct: 1066 ESNSDNMSIHAQMRKSPSFDLDLRIHARAEESDQTPLLYQDKPTIDSFSSQAADDTLGKP 1125

Query: 400  LTGINISNSMLHNAEMPVEEKIVTMERSYSDISKAPFTSILKDEEEAHLLVMAQTQNNHG 221
            L       + LH   MPVEEK+VT+ER  S+ SK PF   L++EEEAH+L+  + Q+NH 
Sbjct: 1126 LANTEHGKNSLHYEAMPVEEKVVTLERCDSEKSKTPFLGFLREEEEAHMLITPKKQDNHS 1185

Query: 220  GTNKEV----KEVSSTSHDGKEKRKSRSYFFSSCMCCATVTN 107
               K      KEV+     GKEKRK R+  F +CMCCATV N
Sbjct: 1186 AAKKATKVSPKEVTPAPPKGKEKRKPRTSLFGTCMCCATVIN 1227

