BLASTX nr result

ID: Ephedra27_contig00018025

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00018025
         (2298 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI35826.3| unnamed protein product [Vitis vinifera]              340   1e-90
ref|XP_002271999.1| PREDICTED: uncharacterized protein LOC100251...   340   1e-90
gb|EOY27339.1| Uncharacterized protein isoform 3 [Theobroma cacao]    330   1e-87
gb|EOY27337.1| Uncharacterized protein isoform 1 [Theobroma caca...   330   1e-87
ref|XP_006369111.1| hypothetical protein POPTR_0001s16550g [Popu...   326   3e-86
ref|XP_006426753.1| hypothetical protein CICLE_v10024713mg [Citr...   319   4e-84
ref|XP_006342942.1| PREDICTED: SAFB-like transcription modulator...   317   1e-83
ref|XP_006465838.1| PREDICTED: uncharacterized protein LOC102629...   317   2e-83
ref|XP_004236381.1| PREDICTED: uncharacterized protein LOC101252...   312   5e-82
ref|XP_004305768.1| PREDICTED: uncharacterized protein LOC101291...   306   3e-80
ref|XP_006385528.1| hypothetical protein POPTR_0003s06800g [Popu...   305   6e-80
ref|XP_006598844.1| PREDICTED: dentin sialophosphoprotein-like i...   304   1e-79
ref|XP_006843854.1| hypothetical protein AMTR_s00007p00263470 [A...   304   1e-79
ref|XP_004141819.1| PREDICTED: uncharacterized protein LOC101213...   301   1e-78
gb|ESW07394.1| hypothetical protein PHAVU_010G126300g [Phaseolus...   300   2e-78
gb|EOY27342.1| Uncharacterized protein isoform 6 [Theobroma cacao]    294   1e-76
gb|EOY27341.1| Uncharacterized protein isoform 5 [Theobroma cacao]    294   1e-76
ref|XP_006583175.1| PREDICTED: dentin sialophosphoprotein-like i...   293   2e-76
gb|EMJ18855.1| hypothetical protein PRUPE_ppa000250mg [Prunus pe...   293   2e-76
gb|EOY27340.1| Uncharacterized protein isoform 4 [Theobroma cacao]    283   2e-73

>emb|CBI35826.3| unnamed protein product [Vitis vinifera]
          Length = 1163

 Score =  340 bits (873), Expect = 1e-90
 Identities = 240/526 (45%), Positives = 301/526 (57%), Gaps = 12/526 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            +SS  RR   ENPLAQSVPNFSD RKENTKP +G       + K +     R+QL+S  R
Sbjct: 717  SSSGRRRAQSENPLAQSVPNFSDFRKENTKPSSG-------ISKVT----PRSQLRSIAR 765

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSY------NTMTDLNSSTDGVVLA 1900
             KS +++ +              KE+KPRR Q++RKS         ++DLNS  DGVVLA
Sbjct: 766  TKSNSDEMTLF------------KEEKPRRSQSLRKSSANPVESKDLSDLNS--DGVVLA 811

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE ++     +  +       ++KPFLRK           +AKLKAS ASE LKN
Sbjct: 812  PLKFDKEQTEQGLYDKFSKNV-----ESKPFLRKGNGIGPGAGASIAKLKASMASEALKN 866

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E+E  E+  + +D         + V   +E     E   + TAE      D  D  N   
Sbjct: 867  EEEFDESTFEVEDS-------VDMVKEEEEE----EEFETMTAE------DGTDMDNG-- 907

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             K  +  ESDK         +  S+  N       +   P   ++ PV +  A    G+V
Sbjct: 908  -KPRLSHESDK---------SGNSESENGDTLRSLSQVDPASVAELPVAVPSAFHTIGSV 957

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
              ESP +SP SWNS  HHS S   E SD+DAS DSP+GSPASWNSHSL+Q EA   D AR
Sbjct: 958  Q-ESPGESPVSWNSRMHHSFSYPNETSDIDASVDSPIGSPASWNSHSLTQTEA---DAAR 1013

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP++        S+KD  KGF+RLLKFG+K RG + +  DW S +TTSEGD
Sbjct: 1014 MRKKWGSAQKPILVANSSHNQSRKDVTKGFKRLLKFGRKHRGTESL-VDWIS-ATTSEGD 1071

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P+ + S++         E++ F E   + +L
Sbjct: 1072 DDTEDGRDPANRSSED-LRKSRMGFSQGHPS-DDSFN---------ESELFNEH--VQAL 1118

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SSIP  P NFKLR+DHL SGSSLKAPRSFFSLS+FRSKGSDSK R
Sbjct: 1119 HSSIPAPPANFKLREDHL-SGSSLKAPRSFFSLSSFRSKGSDSKPR 1163


>ref|XP_002271999.1| PREDICTED: uncharacterized protein LOC100251482 [Vitis vinifera]
          Length = 1409

 Score =  340 bits (873), Expect = 1e-90
 Identities = 240/526 (45%), Positives = 301/526 (57%), Gaps = 12/526 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            +SS  RR   ENPLAQSVPNFSD RKENTKP +G       + K +     R+QL+S  R
Sbjct: 963  SSSGRRRAQSENPLAQSVPNFSDFRKENTKPSSG-------ISKVT----PRSQLRSIAR 1011

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSY------NTMTDLNSSTDGVVLA 1900
             KS +++ +              KE+KPRR Q++RKS         ++DLNS  DGVVLA
Sbjct: 1012 TKSNSDEMTLF------------KEEKPRRSQSLRKSSANPVESKDLSDLNS--DGVVLA 1057

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE ++     +  +       ++KPFLRK           +AKLKAS ASE LKN
Sbjct: 1058 PLKFDKEQTEQGLYDKFSKNV-----ESKPFLRKGNGIGPGAGASIAKLKASMASEALKN 1112

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E+E  E+  + +D         + V   +E     E   + TAE      D  D  N   
Sbjct: 1113 EEEFDESTFEVEDS-------VDMVKEEEEE----EEFETMTAE------DGTDMDNG-- 1153

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             K  +  ESDK         +  S+  N       +   P   ++ PV +  A    G+V
Sbjct: 1154 -KPRLSHESDK---------SGNSESENGDTLRSLSQVDPASVAELPVAVPSAFHTIGSV 1203

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
              ESP +SP SWNS  HHS S   E SD+DAS DSP+GSPASWNSHSL+Q EA   D AR
Sbjct: 1204 Q-ESPGESPVSWNSRMHHSFSYPNETSDIDASVDSPIGSPASWNSHSLTQTEA---DAAR 1259

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP++        S+KD  KGF+RLLKFG+K RG + +  DW S +TTSEGD
Sbjct: 1260 MRKKWGSAQKPILVANSSHNQSRKDVTKGFKRLLKFGRKHRGTESL-VDWIS-ATTSEGD 1317

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P+ + S++         E++ F E   + +L
Sbjct: 1318 DDTEDGRDPANRSSED-LRKSRMGFSQGHPS-DDSFN---------ESELFNEH--VQAL 1364

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SSIP  P NFKLR+DHL SGSSLKAPRSFFSLS+FRSKGSDSK R
Sbjct: 1365 HSSIPAPPANFKLREDHL-SGSSLKAPRSFFSLSSFRSKGSDSKPR 1409


>gb|EOY27339.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 1431

 Score =  330 bits (847), Expect = 1e-87
 Identities = 235/526 (44%), Positives = 294/526 (55%), Gaps = 12/526 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            ASS  RR   ENPL QSVPNFSDLRKENTKP +G     S           R+Q+++  R
Sbjct: 986  ASSGRRRAQSENPLVQSVPNFSDLRKENTKPSSGAAKMTS-----------RSQVRNYAR 1034

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
             KS NE+ +             GK+D+PRR Q++RKS      ++ ++ LNS  DG+VLA
Sbjct: 1035 TKSTNEEIAL------------GKDDQPRRSQSLRKSSAGPVEFSDLSALNS--DGIVLA 1080

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE  +   QS   +   N ++  K FLRK           +AK KAS AS   K 
Sbjct: 1081 PLKFDKEQME---QSFSDKFLQNVET--KTFLRKGNGIGPGAGVNIAKFKASEASVTPKE 1135

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E E  E   + DD             SMD +  + E    +  ESM    DS D +N   
Sbjct: 1136 EGESDELAFEADD-------------SMDMAKEDEE----DELESMVVE-DSADMENG-- 1175

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             +  +  ESDK++        S S++ + +    +     +    A VP       +   
Sbjct: 1176 -RSRLSQESDKLDN-------SGSENGDCLRSLSQVDPASVAELPAAVPTTFHTAVS--- 1224

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPASWNSHSL+Q E    D AR
Sbjct: 1225 LQDSPEESPVSWNSRLHHPFSYPHETSDIDASMDSPIGSPASWNSHSLAQTEV---DAAR 1281

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP +        S++D  KGF+RLLKFG+KSRG D +  DW S +TTSEGD
Sbjct: 1282 MRKKWGSAQKPFLVANATHNQSRRDVTKGFKRLLKFGRKSRGTDSL-VDWIS-ATTSEGD 1339

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P          S  G  E++ F +Q  I SL
Sbjct: 1340 DDTEDGRDPANRSSED-LRKSRMGFSQGHP----------SDDGFNESELFNDQ--IQSL 1386

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SSIP  P NFKLR+DH+ SGSS+KAPRSFFSLS+FRSKGSDSK R
Sbjct: 1387 HSSIPAPPANFKLREDHM-SGSSIKAPRSFFSLSSFRSKGSDSKPR 1431


>gb|EOY27337.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508780082|gb|EOY27338.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1428

 Score =  330 bits (847), Expect = 1e-87
 Identities = 235/526 (44%), Positives = 294/526 (55%), Gaps = 12/526 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            ASS  RR   ENPL QSVPNFSDLRKENTKP +G     S           R+Q+++  R
Sbjct: 983  ASSGRRRAQSENPLVQSVPNFSDLRKENTKPSSGAAKMTS-----------RSQVRNYAR 1031

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
             KS NE+ +             GK+D+PRR Q++RKS      ++ ++ LNS  DG+VLA
Sbjct: 1032 TKSTNEEIAL------------GKDDQPRRSQSLRKSSAGPVEFSDLSALNS--DGIVLA 1077

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE  +   QS   +   N ++  K FLRK           +AK KAS AS   K 
Sbjct: 1078 PLKFDKEQME---QSFSDKFLQNVET--KTFLRKGNGIGPGAGVNIAKFKASEASVTPKE 1132

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E E  E   + DD             SMD +  + E    +  ESM    DS D +N   
Sbjct: 1133 EGESDELAFEADD-------------SMDMAKEDEE----DELESMVVE-DSADMENG-- 1172

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             +  +  ESDK++        S S++ + +    +     +    A VP       +   
Sbjct: 1173 -RSRLSQESDKLDN-------SGSENGDCLRSLSQVDPASVAELPAAVPTTFHTAVS--- 1221

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPASWNSHSL+Q E    D AR
Sbjct: 1222 LQDSPEESPVSWNSRLHHPFSYPHETSDIDASMDSPIGSPASWNSHSLAQTEV---DAAR 1278

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP +        S++D  KGF+RLLKFG+KSRG D +  DW S +TTSEGD
Sbjct: 1279 MRKKWGSAQKPFLVANATHNQSRRDVTKGFKRLLKFGRKSRGTDSL-VDWIS-ATTSEGD 1336

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P          S  G  E++ F +Q  I SL
Sbjct: 1337 DDTEDGRDPANRSSED-LRKSRMGFSQGHP----------SDDGFNESELFNDQ--IQSL 1383

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SSIP  P NFKLR+DH+ SGSS+KAPRSFFSLS+FRSKGSDSK R
Sbjct: 1384 HSSIPAPPANFKLREDHM-SGSSIKAPRSFFSLSSFRSKGSDSKPR 1428


>ref|XP_006369111.1| hypothetical protein POPTR_0001s16550g [Populus trichocarpa]
            gi|550347470|gb|ERP65680.1| hypothetical protein
            POPTR_0001s16550g [Populus trichocarpa]
          Length = 1242

 Score =  326 bits (835), Expect = 3e-86
 Identities = 231/525 (44%), Positives = 290/525 (55%), Gaps = 12/525 (2%)
 Frame = -1

Query: 2238 SSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNRN 2059
            SS  RR   ENPLAQSVPNFSD RKENTKP +G               A R Q+++  R+
Sbjct: 805  SSGRRRVQSENPLAQSVPNFSDFRKENTKPLSG-----------VSKAANRLQVRTYARS 853

Query: 2058 KSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLAP 1897
            KS +E+                KE+K +R Q++RKS      +  +  LNS    VVLAP
Sbjct: 854  KSSSEEIPLA------------KEEKNQRSQSLRKSSAGPIEFKDLPPLNSD---VVLAP 898

Query: 1896 LKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKNE 1717
            LK  KE ++     +  +       ++KPFLRK           VAKLKA  ASE LKNE
Sbjct: 899  LKFDKEQTEQIPYDKFSKNV-----ESKPFLRKGNGIGPGSGATVAKLKAMVASETLKNE 953

Query: 1716 DEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELDL 1537
            + E  A    D              S+DES    +     T        + +D  N  + 
Sbjct: 954  EFEESAFEAED--------------SVDESKEEEDEGLETT--------EIEDRANMDNG 991

Query: 1536 KKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNVM 1357
            K  + L+SDK+        TS S++  ++    +       SS A +P  + + +     
Sbjct: 992  KPRLSLDSDKMG-------TSGSENDESLRSISQIDP----SSVAELPASVPSTFHA--- 1037

Query: 1356 TESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIART 1177
             +SP +SP SWNS   H  S   E SD+DA  DSP+GSPASWNSHSL+Q EA   D+AR 
Sbjct: 1038 -DSPGESPVSWNSRMQHPFSYPHETSDIDAYVDSPIGSPASWNSHSLTQTEA---DVARM 1093

Query: 1176 RKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGDE 1015
            RKKWGSAQKP++        S+KD  KGF+RLLKFG+KSRGA+ +  DW S +TTSEGD+
Sbjct: 1094 RKKWGSAQKPILVANSSHNQSRKDVTKGFKRLLKFGRKSRGAEGL-VDWIS-ATTSEGDD 1151

Query: 1014 DIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSLR 835
            D EDGRD A RS+ED LRKSRMGF QG P          S  G  E++ F EQ  + +L 
Sbjct: 1152 DTEDGRDPANRSSED-LRKSRMGFSQGHP----------SDDGFNESELFNEQ--VQALH 1198

Query: 834  SSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            SSIP  P NFKLRDDHL SGSS+KAPRSFFSLS+FRSKGSDSK R
Sbjct: 1199 SSIPAPPANFKLRDDHL-SGSSIKAPRSFFSLSSFRSKGSDSKLR 1242


>ref|XP_006426753.1| hypothetical protein CICLE_v10024713mg [Citrus clementina]
            gi|557528743|gb|ESR39993.1| hypothetical protein
            CICLE_v10024713mg [Citrus clementina]
          Length = 1409

 Score =  319 bits (817), Expect = 4e-84
 Identities = 224/529 (42%), Positives = 294/529 (55%), Gaps = 15/529 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            A S  RR   ENPLAQSVPNFSDLRKENTKP +G            G  ATR+Q+++  R
Sbjct: 968  AGSGKRRLQSENPLAQSVPNFSDLRKENTKPSSG-----------IGKVATRSQVRNYAR 1016

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
            +KS +E+                KE+KPRR  +++K       ++ M  +N   DGVVLA
Sbjct: 1017 SKSTSEETPLV------------KEEKPRRSNSLKKGSTGPLEFSNMPPVNC--DGVVLA 1062

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE S+ +   +  +       ++KPFLR+           +AKLKAS+    L+N
Sbjct: 1063 PLKFDKEQSEQSLHDKYLKGV-----ESKPFLRRGNGIGPGSGASIAKLKASS----LRN 1113

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            ED+  +   Q           AE  G M                   A  D +D    ++
Sbjct: 1114 EDDYDDLAFQ-----------AEVSGDM-------------------AKEDEEDDLETME 1143

Query: 1539 LKK--DMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPY-A 1369
            +++  DMD    ++ + S     S S++ +++     ++  P   S A +P  + + + A
Sbjct: 1144 IEECNDMDNGKPRLSQESEKVVNSGSENGDSL----RSLSQPDPDSVAELPAAVPSTFHA 1199

Query: 1368 GNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSD 1189
               + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPA WNSHSL+Q EA   D
Sbjct: 1200 TGSLQDSPGESPMSWNSRMHHPFSYPHETSDIDASVDSPIGSPAYWNSHSLNQTEA---D 1256

Query: 1188 IARTRKKWGSAQKPVIA------VSQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTS 1027
             AR RKKWGSAQKP +A       S+KD  KGF+RLLKFG+K+RG + +  DW S +TTS
Sbjct: 1257 AARMRKKWGSAQKPFLASNSSSTQSRKDMTKGFKRLLKFGRKNRGTESL-VDWIS-ATTS 1314

Query: 1026 EGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSI 847
            EGD+D EDGRD   RS+ED  RKSRMGF Q  P          S  G  E++ F EQ  +
Sbjct: 1315 EGDDDTEDGRDPTSRSSED-FRKSRMGFLQSHP----------SDDGYNESELFNEQ--V 1361

Query: 846  NSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            + L SSIP  P NFKLR+DH+ SGSS+KAPRSFFSLSTFRSKGSDSK R
Sbjct: 1362 HGLHSSIPAPPANFKLREDHM-SGSSIKAPRSFFSLSTFRSKGSDSKPR 1409


>ref|XP_006342942.1| PREDICTED: SAFB-like transcription modulator-like [Solanum tuberosum]
          Length = 1342

 Score =  317 bits (813), Expect = 1e-83
 Identities = 220/536 (41%), Positives = 296/536 (55%), Gaps = 19/536 (3%)
 Frame = -1

Query: 2250 TGKASSNT---RRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQ 2080
            +GKAS+NT   RR   ENPLAQSVPNFSD+RKENTKP             S+ G  TR+Q
Sbjct: 893  SGKASNNTSGKRRIQSENPLAQSVPNFSDMRKENTKP------------SSTAGKTTRSQ 940

Query: 2079 LKSNNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSYNTMTDLNSST----DG 1912
             ++  R+KS +E+                KEDK R+ Q++RKS   + +   ++    DG
Sbjct: 941  SRNYTRSKSTSEEVPLI------------KEDKSRKPQSLRKSSANIVEFRETSTFDSDG 988

Query: 1911 VVLAPLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASE 1732
            VVL PLK  K+  + +             S +K  L+K          G+ K +ASA S+
Sbjct: 989  VVLTPLKCDKDEMERSIDK------FPKSSGSKTLLKKGKNTDFSSRGGLTKTRASAVSK 1042

Query: 1731 NLKNEDEECEALTQNDDEEASEIGKAET-----VGSMDESLGNNEHRRSNTAESMDAPVD 1567
             + + DE  + + + +D E     + E         + E+  N E R S+ +E ++    
Sbjct: 1043 IVDDNDEYDDMVFEPEDSEGMGPDEEEEEFEHMTAEIHENFDNGEPRLSHDSEKLE---- 1098

Query: 1566 SDDSQNELDLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLH 1387
            +  S+N   L+    + S                                 +S+A +P  
Sbjct: 1099 NSGSENGDVLRSFSQVNS---------------------------------ASEAVLPSM 1125

Query: 1386 LANPY-AGNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQ 1210
            ++N   +G ++ +SP +SP SWN+HAHH  S   E SDVDAS DSP+GSPASWNSHSLSQ
Sbjct: 1126 VSNKLLSGGLVQDSPGESPVSWNTHAHHPFSYPHEMSDVDASVDSPVGSPASWNSHSLSQ 1185

Query: 1209 MEASDSDIARTRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDW 1048
               +DSD AR RKKWG AQKP++        S+KD  +GF+R LKFG+K+RG D +  DW
Sbjct: 1186 ---TDSDAARMRKKWGMAQKPMLVANSSNNQSRKDMARGFKRFLKFGRKNRGTDNL-VDW 1241

Query: 1047 ASPSTTSEGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDA 868
             S +TTSEGD+D EDGRD + RS++D LRKSRMGF Q  P+ +  Y          EN+ 
Sbjct: 1242 IS-ATTSEGDDDTEDGRDPSNRSSDD-LRKSRMGFSQEHPSDDSFY----------ENEF 1289

Query: 867  FGEQNSINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            F EQ  + +LRSSIP  P NFKLR+D L SGSS+KAPRSFFSLSTFRSKGSDSK +
Sbjct: 1290 FSEQ--VQALRSSIPAPPANFKLREDQL-SGSSIKAPRSFFSLSTFRSKGSDSKPK 1342


>ref|XP_006465838.1| PREDICTED: uncharacterized protein LOC102629330 isoform X1 [Citrus
            sinensis]
          Length = 1419

 Score =  317 bits (811), Expect = 2e-83
 Identities = 223/529 (42%), Positives = 293/529 (55%), Gaps = 15/529 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            A S  RR   ENPLAQSVPNFSDLRKENTKP +G            G  ATR+Q+++  R
Sbjct: 978  AGSGKRRLQSENPLAQSVPNFSDLRKENTKPSSG-----------IGKVATRSQVRNYAR 1026

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
            +KS +E+                KE+KPRR  +++K       ++ M  +N   DGVVLA
Sbjct: 1027 SKSTSEETPLV------------KEEKPRRSNSLKKGSTGPLEFSDMPPVNC--DGVVLA 1072

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE S+ +   +  +       ++KPFLR+           +AKLKAS+    L+N
Sbjct: 1073 PLKFDKEQSEQSLHDKYLKGV-----ESKPFLRRGNGIGPGSGASIAKLKASS----LRN 1123

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            ED+  +   Q           AE  G M                   A  D +D    ++
Sbjct: 1124 EDDYDDLAFQ-----------AEVSGDM-------------------AKEDEEDDLETME 1153

Query: 1539 LKK--DMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPY-A 1369
            +++  DMD    ++ + S     S S++ +++     ++  P   S A +P  + + + A
Sbjct: 1154 IEECNDMDNGKPRLSQESEKVVNSGSENGDSL----RSLSQPDPDSVAELPAAVPSTFHA 1209

Query: 1368 GNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSD 1189
               + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPA WNSHSL+Q EA   D
Sbjct: 1210 TGSLQDSPGESPMSWNSRMHHPFSYPHETSDIDASVDSPIGSPAYWNSHSLNQTEA---D 1266

Query: 1188 IARTRKKWGSAQKPVIA------VSQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTS 1027
             AR RKKWGSAQKP +A       S+KD  KGF+RLL FG+K+RG + +  DW S +TTS
Sbjct: 1267 AARMRKKWGSAQKPFLASNSSSTQSRKDMTKGFKRLLNFGRKNRGTESL-VDWIS-ATTS 1324

Query: 1026 EGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSI 847
            EGD+D EDGRD   RS+ED  RKSRMGF Q  P          S  G  E++ F EQ  +
Sbjct: 1325 EGDDDTEDGRDPTSRSSED-FRKSRMGFLQSHP----------SDDGYNESELFNEQ--V 1371

Query: 846  NSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            + L SSIP  P NFKLR+DH+ SGSS+KAPRSFFSLSTFRSKGSDSK R
Sbjct: 1372 HGLHSSIPAPPANFKLREDHM-SGSSIKAPRSFFSLSTFRSKGSDSKPR 1419


>ref|XP_004236381.1| PREDICTED: uncharacterized protein LOC101252575 [Solanum
            lycopersicum]
          Length = 1326

 Score =  312 bits (799), Expect = 5e-82
 Identities = 218/536 (40%), Positives = 294/536 (54%), Gaps = 19/536 (3%)
 Frame = -1

Query: 2250 TGKASSNT---RRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQ 2080
            +GKAS+NT   RR   ENPLAQSVPNFSD+RKENTKP             S+ G  TR+Q
Sbjct: 877  SGKASNNTSGRRRIQSENPLAQSVPNFSDMRKENTKP------------SSAAGKTTRSQ 924

Query: 2079 LKSNNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSYNTMTDLNSST----DG 1912
             ++  R+KS +E+                KEDK R+ Q++RKS   + +   ++    DG
Sbjct: 925  SRNYARSKSTSEEVPLI------------KEDKSRKPQSLRKSSANIVEFRETSTFDSDG 972

Query: 1911 VVLAPLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASE 1732
            VVL PLK  K+  + +             S +K  ++K          G+ K + SA S+
Sbjct: 973  VVLTPLKFDKDEMERSIDK------FPKSSGSKTSVKKGKNTDFSSRGGLTKTRVSAVSK 1026

Query: 1731 NLKNEDEECEALTQNDDEEASEIGKAET-----VGSMDESLGNNEHRRSNTAESMDAPVD 1567
             + + DE  + +   +D E     + E       G + E+  N E R S+ +E ++    
Sbjct: 1027 IVDDNDEYDDMVFDPEDSEGMGPDEEEEDYETMTGEIHENFDNGEPRLSHDSEKLE---- 1082

Query: 1566 SDDSQNELDLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLH 1387
            +  S+N   L+    + S                                 +S+A +P  
Sbjct: 1083 NSGSENGDVLRSFSQVNS---------------------------------ASEAVLPSM 1109

Query: 1386 LANPY-AGNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQ 1210
            ++N   +G ++ +SP +SP SWN+HAHH  S   E SDVDAS DSP+GSPASWNSHSLSQ
Sbjct: 1110 VSNKLLSGGLVQDSPGESPVSWNTHAHHPFSYPHEMSDVDASVDSPVGSPASWNSHSLSQ 1169

Query: 1209 MEASDSDIARTRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDW 1048
               +DSD AR RKKWG AQKP++        S+KD  +GF+R LKFG+K+RG D +  DW
Sbjct: 1170 ---TDSDAARMRKKWGMAQKPMLVANSSHNQSRKDMARGFKRFLKFGRKNRGTDTL-VDW 1225

Query: 1047 ASPSTTSEGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDA 868
             S +TTSEGD+D EDGRD + RS++D LRKSRMGF Q   + +  Y          EN+ 
Sbjct: 1226 IS-ATTSEGDDDTEDGRDPSNRSSDD-LRKSRMGFSQDHQSDDSFY----------ENEY 1273

Query: 867  FGEQNSINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            F EQ  + +LRSSIP  P NFKLR+D L SGSS+KAPRSFFSLSTFRSKGSDSK +
Sbjct: 1274 FSEQ--VQALRSSIPAPPANFKLREDQL-SGSSIKAPRSFFSLSTFRSKGSDSKPK 1326


>ref|XP_004305768.1| PREDICTED: uncharacterized protein LOC101291165 [Fragaria vesca
            subsp. vesca]
          Length = 1344

 Score =  306 bits (784), Expect = 3e-80
 Identities = 225/528 (42%), Positives = 285/528 (53%), Gaps = 15/528 (2%)
 Frame = -1

Query: 2238 SSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNRN 2059
            S+  RR   +NPLAQSVPNFSDLRKENTKP +G      A+ K       R+Q++S +R+
Sbjct: 901  STGRRRLESDNPLAQSVPNFSDLRKENTKPSSG--VSKVAVSKIPA----RSQVRSYSRS 954

Query: 2058 KSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLAP 1897
            KS +E+ +              KE+K RR Q++RKS      +NT++ +NS  DGVVL P
Sbjct: 955  KSSSEEATMV------------KEEKSRRSQSLRKSSANPVEFNTLSSMNS--DGVVLVP 1000

Query: 1896 LKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKNE 1717
            L+  KE ++     +          ++K FLRK           ++KLK    SE +  E
Sbjct: 1001 LRFDKEQTEQGLFDKFPETV-----ESKSFLRKGNGIGTGSGVSISKLKGFTGSETMNIE 1055

Query: 1716 DEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELDL 1537
            +E  E        EA ++ K E     DE L           E M A  D D        
Sbjct: 1056 EEFDELAF-----EAEDMAKEE---EEDEEL-----------EMMSAEDDVDMDNG---- 1092

Query: 1536 KKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPY-AGNV 1360
            K     ESDK     F    S      A P           +S A +P+ + + + A   
Sbjct: 1093 KPRSSQESDKSSNSGFDNVNSVRSVSQADP-----------TSVAMLPVAVPSTFHAVGS 1141

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWN   HH  S   E SD+DAS DSPMGSPASWNSH LSQ   +D D AR
Sbjct: 1142 LPDSPGESPMSWNLQMHHPFSYQHETSDIDASVDSPMGSPASWNSHGLSQ---TDVDAAR 1198

Query: 1179 TRKKWGSAQKPVIAVS------QKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP++A +      +KD  KGF+RLLKFG+KSRG D ++ DW S +TTSEGD
Sbjct: 1199 MRKKWGSAQKPILATNSSQNQPRKDMTKGFKRLLKFGRKSRGTDNMA-DWIS-ATTSEGD 1256

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFG--EQNSIN 844
            +D EDGRD A RS+ED LRKSRMGF  G                   +D+F   E N   
Sbjct: 1257 DDTEDGRDPANRSSED-LRKSRMGFAHG------------------PDDSFNEIEFNERV 1297

Query: 843  SLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
               SSIP  PVNFKLR++H+ SGSS+KAPRSFFSLS+FRSKGSDSK R
Sbjct: 1298 QALSSIPSPPVNFKLREEHI-SGSSMKAPRSFFSLSSFRSKGSDSKLR 1344


>ref|XP_006385528.1| hypothetical protein POPTR_0003s06800g [Populus trichocarpa]
            gi|550342580|gb|ERP63325.1| hypothetical protein
            POPTR_0003s06800g [Populus trichocarpa]
          Length = 1210

 Score =  305 bits (781), Expect = 6e-80
 Identities = 219/526 (41%), Positives = 280/526 (53%), Gaps = 13/526 (2%)
 Frame = -1

Query: 2238 SSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNRN 2059
            SS  RR   ENPLAQSVPNFSD RKENTKP++G               A R+Q+++   +
Sbjct: 769  SSGRRRVQSENPLAQSVPNFSDFRKENTKPFSG-----------VSKAANRSQVRTYACS 817

Query: 2058 KSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLAP 1897
            KS +E+    +            E+K RR Q++RKS      +N    LNS  DGVVLAP
Sbjct: 818  KSSSEEIPLVN------------EEKNRRSQSLRKSSAGPIEFNDFPPLNS--DGVVLAP 863

Query: 1896 LKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKNE 1717
            LK  +          M     +   + KPFLRK           VA LK   A E+LK E
Sbjct: 864  LKFDQP-------EPMPYDKFSKNVETKPFLRKCNGIGPGSGATVATLKGMVAPESLKTE 916

Query: 1716 D-EECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            + EE     +   +EA E    E   +  E   N ++ +   ++  D  +    S+N   
Sbjct: 917  EFEESPFEAEESVDEAKEEEDEELETTEVEGCANMDNGKLRLSQDSDK-IGMSGSENGDS 975

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
            L+    ++   +  ++                             A VP   +  +A   
Sbjct: 976  LRSISQIDPSSVSELA-----------------------------ASVP---STFHALGS 1003

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DA  DSP+GSPASWNSHSL Q E   +D AR
Sbjct: 1004 LQDSPGESPVSWNSRMHHPFSYPHETSDIDAYVDSPIGSPASWNSHSLIQRE---TDAAR 1060

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP++        S+KD  KGF+RLLKFG+KSRGA+ +  DW S +TTSEGD
Sbjct: 1061 MRKKWGSAQKPILVANSFNNQSRKDVTKGFKRLLKFGRKSRGAESL-VDWIS-ATTSEGD 1118

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF  G P          S  G  E++ F EQ  +++L
Sbjct: 1119 DDTEDGRDPANRSSED-LRKSRMGFSHGHP----------SDDGLNESELFNEQ--VHTL 1165

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SSIP  P NFKLRDD L SGSS+KAPRSFFSL++FRSKGSDSK R
Sbjct: 1166 NSSIPAPPENFKLRDD-LMSGSSIKAPRSFFSLTSFRSKGSDSKLR 1210


>ref|XP_006598844.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1250

 Score =  304 bits (779), Expect = 1e-79
 Identities = 218/532 (40%), Positives = 288/532 (54%), Gaps = 14/532 (2%)
 Frame = -1

Query: 2253 VTGKASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLK 2074
            V+   SS  RRR  +NPLAQSVPNFSDLRKENTKP +G       + K+     TR+Q++
Sbjct: 811  VSVSRSSGGRRR--DNPLAQSVPNFSDLRKENTKPSSG-------VSKT-----TRSQVR 856

Query: 2073 SNNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDG 1912
            S +R+KS  E+             +G KE+K R+  ++RKS      +  ++ LNS  DG
Sbjct: 857  SYSRSKSTTEEM------------QGVKEEKSRQTLSLRKSSANPAEFKDLSPLNS--DG 902

Query: 1911 VVLAPLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASE 1732
            +VL+PLK   + SD+    +  R          PFL+K             ++KAS AS+
Sbjct: 903  IVLSPLKFDMDESDLGPYDQSPR----------PFLKKGNNIGSGSVGNAIQMKASTASD 952

Query: 1731 NLKNEDEECEALTQNDDEEAS--EIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDD 1558
              KN++ E     + D  + +  E    ET+   D +  NN                   
Sbjct: 953  TQKNKEFEDPEFDEEDSLQIAMDEHDDIETMAIEDVAYNNNG------------------ 994

Query: 1557 SQNELDLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLAN 1378
                   K  +  ES K          S        P  G  M     S+   V      
Sbjct: 995  -------KVSLSQESGKSGNSGSEIGDSARSLAQVDPISGGEMATGFTSTFNGV------ 1041

Query: 1377 PYAGNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEAS 1198
                  + +SP+ SP SWNS   H  S   E+SD+DAS DSP+GSPASWNSHSL+Q    
Sbjct: 1042 ----RSLQDSPVGSPVSWNSRTRHPFSYPHESSDIDASIDSPVGSPASWNSHSLNQ---G 1094

Query: 1197 DSDIARTRKKWGSAQKPVIAVS------QKDPPKGFRRLLKFGKKSRGADLVSTDWASPS 1036
            D+D +R RKKWGSAQKP +  +      +KD  KGF+RLLKFG+K+RG++ ++ DW S +
Sbjct: 1095 DNDASRMRKKWGSAQKPFLVANSSQNQPRKDVTKGFKRLLKFGRKTRGSESMA-DWIS-A 1152

Query: 1035 TTSEGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQ 856
            TTSEGD+D EDGRD A RS+ED LRKSRMGF  G P+ + S++         EN+ F EQ
Sbjct: 1153 TTSEGDDDTEDGRDLANRSSED-LRKSRMGFSHGHPS-DDSFN---------ENELFNEQ 1201

Query: 855  NSINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
              + SL+SSIP  P +FKLRDDH+ SGSS+KAP+SFFSLSTFRSKGSDSK R
Sbjct: 1202 --VQSLQSSIPAPPAHFKLRDDHI-SGSSIKAPKSFFSLSTFRSKGSDSKPR 1250


>ref|XP_006843854.1| hypothetical protein AMTR_s00007p00263470 [Amborella trichopoda]
            gi|548846222|gb|ERN05529.1| hypothetical protein
            AMTR_s00007p00263470 [Amborella trichopoda]
          Length = 1529

 Score =  304 bits (778), Expect = 1e-79
 Identities = 229/532 (43%), Positives = 294/532 (55%), Gaps = 18/532 (3%)
 Frame = -1

Query: 2241 ASSNTRRRSY-ENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGAT---RAQLK 2074
            +SS +RRR+  EN +AQSVPNFSD RKENTKP             S G G     R   K
Sbjct: 1083 SSSGSRRRTQTENIMAQSVPNFSDFRKENTKP------------SSVGTGKATLPRTNPK 1130

Query: 2073 SNNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSYNTMTDLN--SSTDGVVLA 1900
            +  R+KS +E+                KE+K +R Q++RKS  +  +L   SS +  VL 
Sbjct: 1131 TYTRSKSTSEEVIPVV-----------KEEKQKRTQSMRKSSASPGELKDLSSLNSEVLT 1179

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PL+  K+ S     S+   R   + ++A+PFLRK          GVAKLKA+  +E  K+
Sbjct: 1180 PLRFGKDQSQQLHFSKSPIRNGVSSAEAQPFLRKGNGIGPSAGPGVAKLKAAMTAETQKD 1239

Query: 1719 EDEECEALTQN--DDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSD-DSQN 1549
            ED++     +N  D  + S     E +G                A+S D P DS+ D + 
Sbjct: 1240 EDDKNGVSEENGVDVPDISPESDKEVIGI------------EKLADSEDFPADSEEDEEK 1287

Query: 1548 ELDLK----KDMDLESDKIE-RVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHL 1384
            E  L     K  DL SD  E R SF        D +A+                      
Sbjct: 1288 EGRLSHESFKSADLGSDSNEERRSFS-----QADDSAV---------------------- 1320

Query: 1383 ANPYAGNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQME 1204
                  N   ESP  S   W+S   H+ S  LEASDV  S DSP+GSPASWN++SLSQ+ 
Sbjct: 1321 ----GSNHYEESPAAS---WSSRRDHAFSYGLEASDV--SVDSPVGSPASWNTNSLSQIM 1371

Query: 1203 ASDSDIARTRKKWGSAQKPVIAV---SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPST 1033
             +D+ ++R RK+WGSAQKPV+     S+KD  KGF+RLLKFG+KSRGADL++TDW S +T
Sbjct: 1372 EADA-VSRMRKRWGSAQKPVLVTGSGSRKDVTKGFKRLLKFGRKSRGADLLATDWVS-AT 1429

Query: 1032 TSEGDEDIEDGRDYAVRSAEDLLRKSRMGFPQ-GQPAYERSYDYGSSLSGQAENDAFGEQ 856
            TSEGD+D EDGRD A RS+ED LRK+RMGF   G P+Y+          G  + ++  EQ
Sbjct: 1430 TSEGDDDTEDGRDPASRSSED-LRKTRMGFSHGGLPSYD----------GFNDGESLQEQ 1478

Query: 855  NSINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             +I SLRSSIP  P NFKLR+DHL SGSSLKAPRSFFSLS+FRSKGS+SK R
Sbjct: 1479 ATIQSLRSSIPAPPANFKLREDHL-SGSSLKAPRSFFSLSSFRSKGSESKPR 1529


>ref|XP_004141819.1| PREDICTED: uncharacterized protein LOC101213033 [Cucumis sativus]
            gi|449480667|ref|XP_004155962.1| PREDICTED:
            uncharacterized LOC101213033 [Cucumis sativus]
          Length = 1411

 Score =  301 bits (770), Expect = 1e-78
 Identities = 208/523 (39%), Positives = 288/523 (55%), Gaps = 9/523 (1%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            +SS  RR   EN LAQSVPNFS+LRKENTKP   + T             TR  +++ +R
Sbjct: 973  SSSGRRRGQTENLLAQSVPNFSELRKENTKPSERKST-------------TRPLVRNYSR 1019

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKSYNTMTDLNS----STDGVVLAPL 1894
             K+ NE+                KE+KPR  Q+ RK+  +  D       +TD VVLAPL
Sbjct: 1020 GKTSNEEPVI-------------KEEKPRIAQSSRKNSASAIDFKDILPLNTDNVVLAPL 1066

Query: 1893 KASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKNED 1714
               +E +D +   +  +       D+KPFLRK           +AKLKAS  SE  K+++
Sbjct: 1067 LLDEEQNDESIYDKYLKGI-----DSKPFLRKGNGIGPGAGTSIAKLKASMESETSKDDE 1121

Query: 1713 EECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELDLK 1534
            +  E   +  +    +  + E    M+  L + ++ +   ++       S +S +E++  
Sbjct: 1122 DYDEVAFEGSEIMPKQEEEEEGHEKMEMKLAHMDNGKLRLSQESGR---SSNSGSEIE-- 1176

Query: 1533 KDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNVMT 1354
                              + RS  H+ +           +S+ + +P  L + +   ++ 
Sbjct: 1177 -----------------NSMRSHSHSRVD----------HSTISELPSMLPSFHKAGLLQ 1209

Query: 1353 ESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIARTR 1174
            +SP +SP +WNS  HH  +   EASD+DA  DSP+GSPASWNSH+++Q E   +D+AR R
Sbjct: 1210 DSPGESPLAWNSRMHHPFAYPHEASDIDAYMDSPIGSPASWNSHNITQAE---TDVARMR 1266

Query: 1173 KKWGSAQKP-VIAVS----QKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGDEDI 1009
            KKWGSAQKP +IA S    +KD  KGF+RLLKFG+KSRG + +  DW S +TTSEGD+D 
Sbjct: 1267 KKWGSAQKPSLIATSSSQPRKDMAKGFKRLLKFGRKSRGTESM-VDWIS-ATTSEGDDDT 1324

Query: 1008 EDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSLRSS 829
            EDGRD A RS+ED LRKSRMGF +G               G  EN+ + EQ  +  L SS
Sbjct: 1325 EDGRDPASRSSED-LRKSRMGFSEGHD------------DGFNENELYCEQ--VQELHSS 1369

Query: 828  IPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
            IP  P NFKLR+DH+ SGSSLKAPRSFFSLSTFRSKG+D+ +R
Sbjct: 1370 IPAPPANFKLREDHM-SGSSLKAPRSFFSLSTFRSKGTDATSR 1411


>gb|ESW07394.1| hypothetical protein PHAVU_010G126300g [Phaseolus vulgaris]
          Length = 1257

 Score =  300 bits (767), Expect = 2e-78
 Identities = 217/529 (41%), Positives = 289/529 (54%), Gaps = 12/529 (2%)
 Frame = -1

Query: 2250 TGKASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKS 2071
            T  + S +  R  +NPLAQSVPNFSDLRKENTKP +G       + K+     TR Q++S
Sbjct: 817  TAVSVSRSSGRRRDNPLAQSVPNFSDLRKENTKPSSG-------VSKT-----TRTQVRS 864

Query: 2070 NNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGV 1909
             +R+KS  E+             +G KE+K R+ Q++RKS      +  ++ LN   DG+
Sbjct: 865  YSRSKSTTEEM------------QGVKEEKSRQAQSLRKSSANPAEFKDLSALNP--DGI 910

Query: 1908 VLAPLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASEN 1729
            VL+PLK   + +D+    +  R           FL+K             ++KAS AS+ 
Sbjct: 911  VLSPLKFDMDETDLGPYDQSPRS----------FLKKGNNIGSGSVGNAIRMKASMASDT 960

Query: 1728 LKNEDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQN 1549
             KN++         DD E  E          D+SL       +   + ++  V  D + N
Sbjct: 961  QKNKEF--------DDLEFDE----------DDSL----QMATEEQDDIETMVIKDIAYN 998

Query: 1548 ELDLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYA 1369
              + K  +  ES K          S        P  G  M     S+   V         
Sbjct: 999  N-NGKVSLSQESGKSGNSGSEIGDSTRSFAQVDPISGGEMASGFPSTFNGV--------- 1048

Query: 1368 GNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSD 1189
               + +SP++SP SWNS   H  S   E+SD+DAS DSP+GSPASWNSHSL+Q    D+D
Sbjct: 1049 -RSVQDSPVESPVSWNSRVPHPFSYPHESSDIDASVDSPIGSPASWNSHSLNQ---GDND 1104

Query: 1188 IARTRKKWGSAQKPVIAVS------QKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTS 1027
             AR RKKWGSAQKP +  +      +KD  KGF+RLLKFG+K+RG++ ++ DW S +TTS
Sbjct: 1105 AARMRKKWGSAQKPFLVANSSQNQPRKDVTKGFKRLLKFGRKTRGSESLA-DWIS-ATTS 1162

Query: 1026 EGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSI 847
            EGD+D EDGRD A RS+ED LRKSRMGF  G P+ + S++         EN+ F EQ  +
Sbjct: 1163 EGDDDTEDGRDLANRSSED-LRKSRMGFSHGHPS-DDSFN---------ENELFNEQ--V 1209

Query: 846  NSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             SL+SSIP  P +FKLRDDH+ SGSSLKAP+SFFSLSTFRSKGSDSK R
Sbjct: 1210 QSLQSSIPAPPAHFKLRDDHM-SGSSLKAPKSFFSLSTFRSKGSDSKPR 1257


>gb|EOY27342.1| Uncharacterized protein isoform 6 [Theobroma cacao]
          Length = 1415

 Score =  294 bits (753), Expect = 1e-76
 Identities = 216/505 (42%), Positives = 274/505 (54%), Gaps = 12/505 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            ASS  RR   ENPL QSVPNFSDLRKENTKP +G     S           R+Q+++  R
Sbjct: 983  ASSGRRRAQSENPLVQSVPNFSDLRKENTKPSSGAAKMTS-----------RSQVRNYAR 1031

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
             KS NE+ +             GK+D+PRR Q++RKS      ++ ++ LNS  DG+VLA
Sbjct: 1032 TKSTNEEIAL------------GKDDQPRRSQSLRKSSAGPVEFSDLSALNS--DGIVLA 1077

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE  +   QS   +   N ++  K FLRK           +AK KAS AS   K 
Sbjct: 1078 PLKFDKEQME---QSFSDKFLQNVET--KTFLRKGNGIGPGAGVNIAKFKASEASVTPKE 1132

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E E  E   + DD             SMD +  + E    +  ESM    DS D +N   
Sbjct: 1133 EGESDELAFEADD-------------SMDMAKEDEE----DELESMVVE-DSADMENG-- 1172

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             +  +  ESDK++        S S++ + +    +     +    A VP       +   
Sbjct: 1173 -RSRLSQESDKLDN-------SGSENGDCLRSLSQVDPASVAELPAAVPTTFHTAVS--- 1221

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPASWNSHSL+Q E    D AR
Sbjct: 1222 LQDSPEESPVSWNSRLHHPFSYPHETSDIDASMDSPIGSPASWNSHSLAQTEV---DAAR 1278

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP +        S++D  KGF+RLLKFG+KSRG D +  DW S +TTSEGD
Sbjct: 1279 MRKKWGSAQKPFLVANATHNQSRRDVTKGFKRLLKFGRKSRGTDSL-VDWIS-ATTSEGD 1336

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P          S  G  E++ F +Q  I SL
Sbjct: 1337 DDTEDGRDPANRSSED-LRKSRMGFSQGHP----------SDDGFNESELFNDQ--IQSL 1383

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLK 763
             SSIP  P NFKLR+DH+ SGSS+K
Sbjct: 1384 HSSIPAPPANFKLREDHM-SGSSIK 1407


>gb|EOY27341.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 1444

 Score =  294 bits (753), Expect = 1e-76
 Identities = 216/505 (42%), Positives = 274/505 (54%), Gaps = 12/505 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            ASS  RR   ENPL QSVPNFSDLRKENTKP +G     S           R+Q+++  R
Sbjct: 983  ASSGRRRAQSENPLVQSVPNFSDLRKENTKPSSGAAKMTS-----------RSQVRNYAR 1031

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
             KS NE+ +             GK+D+PRR Q++RKS      ++ ++ LNS  DG+VLA
Sbjct: 1032 TKSTNEEIAL------------GKDDQPRRSQSLRKSSAGPVEFSDLSALNS--DGIVLA 1077

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE  +   QS   +   N ++  K FLRK           +AK KAS AS   K 
Sbjct: 1078 PLKFDKEQME---QSFSDKFLQNVET--KTFLRKGNGIGPGAGVNIAKFKASEASVTPKE 1132

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E E  E   + DD             SMD +  + E    +  ESM    DS D +N   
Sbjct: 1133 EGESDELAFEADD-------------SMDMAKEDEE----DELESMVVE-DSADMENG-- 1172

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             +  +  ESDK++        S S++ + +    +     +    A VP       +   
Sbjct: 1173 -RSRLSQESDKLDN-------SGSENGDCLRSLSQVDPASVAELPAAVPTTFHTAVS--- 1221

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPASWNSHSL+Q E    D AR
Sbjct: 1222 LQDSPEESPVSWNSRLHHPFSYPHETSDIDASMDSPIGSPASWNSHSLAQTEV---DAAR 1278

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP +        S++D  KGF+RLLKFG+KSRG D +  DW S +TTSEGD
Sbjct: 1279 MRKKWGSAQKPFLVANATHNQSRRDVTKGFKRLLKFGRKSRGTDSL-VDWIS-ATTSEGD 1336

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+ED LRKSRMGF QG P          S  G  E++ F +Q  I SL
Sbjct: 1337 DDTEDGRDPANRSSED-LRKSRMGFSQGHP----------SDDGFNESELFNDQ--IQSL 1383

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLK 763
             SSIP  P NFKLR+DH+ SGSS+K
Sbjct: 1384 HSSIPAPPANFKLREDHM-SGSSIK 1407


>ref|XP_006583175.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 1250

 Score =  293 bits (750), Expect = 2e-76
 Identities = 218/535 (40%), Positives = 287/535 (53%), Gaps = 17/535 (3%)
 Frame = -1

Query: 2253 VTGKASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLK 2074
            V+   SS  RRR  ++PLAQSVPNFSDLRKENTKP        SA+ K+     TR Q++
Sbjct: 811  VSVSRSSGGRRR--DDPLAQSVPNFSDLRKENTKP-------SSAVSKT-----TRTQVR 856

Query: 2073 SNNRNKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDG 1912
            + +R+KS  E+             +G KE+K R+  ++RKS      +  ++ LNS  DG
Sbjct: 857  TYSRSKSTTEEI------------QGVKEEKSRQTLSLRKSSANPAEFKDLSHLNS--DG 902

Query: 1911 VVLAPLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASE 1732
            +VL+PLK          +S +G    + +S    FL+K             ++KAS  S+
Sbjct: 903  IVLSPLKFDM------GESHLGPYDQSPRS----FLKKGNNIGSGSVGNAIRMKASMVSD 952

Query: 1731 NLKNEDEECEALTQNDD-----EEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVD 1567
              KN++ +     + D      EE  +I   ET+   D +  NN                
Sbjct: 953  TQKNKEFDDLEFDEEDSLRMATEEQDDI---ETMAIKDVAYNNNG--------------- 994

Query: 1566 SDDSQNELDLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLH 1387
                      K  +  ES K          S        P  G  M     S+   V   
Sbjct: 995  ----------KVSLSQESGKSGNSGSEIGDSTRSLAQVDPISGGEMATGFPSTFNGV--- 1041

Query: 1386 LANPYAGNVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQM 1207
                     + +SP+ SP SWNS   H  S   E+SD+DAS DSP+GSPASWNSHSL+Q 
Sbjct: 1042 -------RSLQDSPVGSPVSWNSRVPHPFSYPHESSDIDASIDSPIGSPASWNSHSLNQ- 1093

Query: 1206 EASDSDIARTRKKWGSAQKPVIAVS------QKDPPKGFRRLLKFGKKSRGADLVSTDWA 1045
               D+D AR RKKWGSAQKP +  +      +KD  KGF+RLLKFG+K+RG++ ++ DW 
Sbjct: 1094 --GDNDAARMRKKWGSAQKPFLVANSSQNQPRKDVTKGFKRLLKFGRKTRGSESLA-DWI 1150

Query: 1044 SPSTTSEGDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAF 865
            S +TTSEGD+D EDGRD A RS+ED LRKSRMGF  G P+ + S++         EN+ F
Sbjct: 1151 S-ATTSEGDDDTEDGRDLANRSSED-LRKSRMGFSHGHPS-DDSFN---------ENELF 1198

Query: 864  GEQNSINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             EQ  + SL+SSIP  P +FKLRDDH+ SGSSLKAP+SFFSLSTFRSKGSDSK R
Sbjct: 1199 NEQ--VQSLQSSIPAPPAHFKLRDDHI-SGSSLKAPKSFFSLSTFRSKGSDSKPR 1250


>gb|EMJ18855.1| hypothetical protein PRUPE_ppa000250mg [Prunus persica]
          Length = 1402

 Score =  293 bits (750), Expect = 2e-76
 Identities = 223/531 (41%), Positives = 284/531 (53%), Gaps = 18/531 (3%)
 Frame = -1

Query: 2238 SSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNRN 2059
            SS  RR   ENPLAQSVPNFSD RKENTKP +G     +A+ K       R+Q+KS +R+
Sbjct: 988  SSGRRRPELENPLAQSVPNFSDFRKENTKPSSG--VSKTAVSKIPA----RSQVKSYSRS 1041

Query: 2058 KSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLAP 1897
            KS++E+  S             KE+KPRR Q+ RKS      +N ++ LNS  DGVVL P
Sbjct: 1042 KSISEEIMS-------------KEEKPRRSQSSRKSSANPVEFNNLSPLNS--DGVVLVP 1086

Query: 1896 LKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKNE 1717
                KE ++   +            ++K FLRK                +   S ++  E
Sbjct: 1087 F--DKEQTEHYDKFPK-------YVESKSFLRKGNGIGTG---------SGVNSVDMAKE 1128

Query: 1716 DEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNEL-- 1543
            +EE                        +E LGN          +++  VD D+ +  L  
Sbjct: 1129 EEE------------------------EEELGNM---------AVEDEVDMDNGKPRLSQ 1155

Query: 1542 DLKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPY-AG 1366
            + +K  +  SD ++ V      S SQ   A              S A +P  + + + A 
Sbjct: 1156 ESEKSGNSGSDNVDSVR-----SLSQVDPA--------------SVAELPAAVPSTFHAL 1196

Query: 1365 NVMTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDI 1186
              + +SP +SP SWN H HH  S   E SDVDASADSP+GSPASWNSH L+Q+   D D 
Sbjct: 1197 GSLPDSPGESPMSWNLHMHHPFSYPHETSDVDASADSPIGSPASWNSHGLTQI---DVDA 1253

Query: 1185 ARTRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSE 1024
            AR RKKWGSAQKP++A       S+KD  KGF+RLLKFG+KSRG D    DW S +TTSE
Sbjct: 1254 ARMRKKWGSAQKPILATNSAQNQSRKDMTKGFKRLLKFGRKSRGIDNTG-DWIS-ATTSE 1311

Query: 1023 GDEDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGE---QN 853
            GD+D EDGRD A R +ED LRKSRMGF QG                   +D+F E     
Sbjct: 1312 GDDDTEDGRDPANRLSED-LRKSRMGFMQG------------------TDDSFNESEFNE 1352

Query: 852  SINSLRSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
             + +LRSSIP  P+NFKLR+DHL SGSSLKAPRSFFSLS+FRSKGS+SK R
Sbjct: 1353 QVEALRSSIPAPPMNFKLREDHL-SGSSLKAPRSFFSLSSFRSKGSESKLR 1402


>gb|EOY27340.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 1400

 Score =  283 bits (724), Expect = 2e-73
 Identities = 213/526 (40%), Positives = 269/526 (51%), Gaps = 12/526 (2%)
 Frame = -1

Query: 2241 ASSNTRRRSYENPLAQSVPNFSDLRKENTKPYAGRPTQGSALEKSSGGGATRAQLKSNNR 2062
            ASS  RR   ENPL QSVPNFSDLRKENTKP +G     S           R+Q+++  R
Sbjct: 983  ASSGRRRAQSENPLVQSVPNFSDLRKENTKPSSGAAKMTS-----------RSQVRNYAR 1031

Query: 2061 NKSVNEDFSSTDGSLNLGPSRGGKEDKPRRGQAVRKS------YNTMTDLNSSTDGVVLA 1900
             KS NE+ +             GK+D+PRR Q++RKS      ++ ++ LNS  DG+VLA
Sbjct: 1032 TKSTNEEIAL------------GKDDQPRRSQSLRKSSAGPVEFSDLSALNS--DGIVLA 1077

Query: 1899 PLKASKEMSDVASQSRMGRRTANAQSDAKPFLRKXXXXXXXXXXGVAKLKASAASENLKN 1720
            PLK  KE  +   QS   +   N ++  K FLRK           +AK KAS AS   K 
Sbjct: 1078 PLKFDKEQME---QSFSDKFLQNVET--KTFLRKGNGIGPGAGVNIAKFKASEASVTPKE 1132

Query: 1719 EDEECEALTQNDDEEASEIGKAETVGSMDESLGNNEHRRSNTAESMDAPVDSDDSQNELD 1540
            E E  E   + DD             SMD +  + E    +  ESM    DS D +N   
Sbjct: 1133 EGESDELAFEADD-------------SMDMAKEDEE----DELESMVVE-DSADMENG-- 1172

Query: 1539 LKKDMDLESDKIERVSFHFETSRSQDHNAIPFEGETMHVPLYSSKAPVPLHLANPYAGNV 1360
             +  +  ESDK++        S S++ + +    +     +    A VP       +   
Sbjct: 1173 -RSRLSQESDKLDN-------SGSENGDCLRSLSQVDPASVAELPAAVPTTFHTAVS--- 1221

Query: 1359 MTESPLDSPFSWNSHAHHSLSQMLEASDVDASADSPMGSPASWNSHSLSQMEASDSDIAR 1180
            + +SP +SP SWNS  HH  S   E SD+DAS DSP+GSPASWNSHSL+Q E    D AR
Sbjct: 1222 LQDSPEESPVSWNSRLHHPFSYPHETSDIDASMDSPIGSPASWNSHSLAQTEV---DAAR 1278

Query: 1179 TRKKWGSAQKPVIAV------SQKDPPKGFRRLLKFGKKSRGADLVSTDWASPSTTSEGD 1018
             RKKWGSAQKP +        S++D  KGF+RLLKFG+KSRG D +  DW S +TTSEGD
Sbjct: 1279 MRKKWGSAQKPFLVANATHNQSRRDVTKGFKRLLKFGRKSRGTDSL-VDWIS-ATTSEGD 1336

Query: 1017 EDIEDGRDYAVRSAEDLLRKSRMGFPQGQPAYERSYDYGSSLSGQAENDAFGEQNSINSL 838
            +D EDGRD A RS+EDL RKSRMGF QG P+                +D F E    N  
Sbjct: 1337 DDTEDGRDPANRSSEDL-RKSRMGFSQGHPS----------------DDGFNESELFND- 1378

Query: 837  RSSIPIAPVNFKLRDDHLTSGSSLKAPRSFFSLSTFRSKGSDSKTR 700
                                    + PRSFFSLS+FRSKGSDSK R
Sbjct: 1379 ------------------------QTPRSFFSLSSFRSKGSDSKPR 1400

