BLASTX nr result

ID: Zanthoxylum22_contig00011197 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00011197
         (2146 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006490856.1| PREDICTED: uncharacterized protein LOC102608...  1035   0.0  
ref|XP_006490855.1| PREDICTED: uncharacterized protein LOC102608...  1035   0.0  
ref|XP_006445325.1| hypothetical protein CICLE_v10018476mg [Citr...  1035   0.0  
gb|KDO85708.1| hypothetical protein CISIN_1g0003872mg, partial [...  1031   0.0  
ref|XP_007052167.1| Nucleotidyltransferase family protein isofor...   692   0.0  
ref|XP_007052158.1| Nucleotidyltransferase family protein isofor...   692   0.0  
ref|XP_007052157.1| Nucleotidyltransferase family protein isofor...   692   0.0  
ref|XP_012083850.1| PREDICTED: uncharacterized protein LOC105643...   689   0.0  
gb|KDP28800.1| hypothetical protein JCGZ_14571 [Jatropha curcas]      689   0.0  
ref|XP_007052162.1| Nucleotidyltransferase family protein isofor...   664   0.0  
ref|XP_007052160.1| Nucleotidyltransferase family protein isofor...   664   0.0  
ref|XP_012489736.1| PREDICTED: uncharacterized protein LOC105802...   624   e-175
gb|KJB41055.1| hypothetical protein B456_007G088700 [Gossypium r...   624   e-175
ref|XP_011033759.1| PREDICTED: uncharacterized protein LOC105132...   612   e-172
ref|XP_011033752.1| PREDICTED: uncharacterized protein LOC105132...   612   e-172
ref|XP_011033743.1| PREDICTED: uncharacterized protein LOC105132...   612   e-172
ref|XP_011033716.1| PREDICTED: uncharacterized protein LOC105132...   612   e-172
emb|CAN65347.1| hypothetical protein VITISV_000637 [Vitis vinifera]   611   e-172
ref|XP_010661312.1| PREDICTED: uncharacterized protein LOC100265...   609   e-171
gb|KJB41060.1| hypothetical protein B456_007G088700 [Gossypium r...   607   e-170

>ref|XP_006490856.1| PREDICTED: uncharacterized protein LOC102608196 isoform X4 [Citrus
            sinensis]
          Length = 1278

 Score = 1035 bits (2676), Expect = 0.0
 Identities = 529/718 (73%), Positives = 564/718 (78%), Gaps = 4/718 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLELFGEGNFK SPN SKE                 RQN LPKSALDELSLDKPPK    
Sbjct: 111  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKPPKDPEG 170

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL D EK D M SDKVP +   KDI +E+STSEMEM VCHQ+HA+ALV G          
Sbjct: 171  ALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTK 230

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSII-MQAGVAKYDNLSIQNVSADNSTHFNVFA 1611
                   NCT +NPVPVKD K+ V +TSS I +Q  V KYD LS QNVS DNST  NV A
Sbjct: 231  TVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLA 290

Query: 1610 SNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTDSKVE 1431
            SN SSCTSAS P REGI TQSTQED V++S++SEC +FSNG ID QT+ FLQETTDSKVE
Sbjct: 291  SNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVE 350

Query: 1430 CNIIPPDMSARDLNNTLGNN--GIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVIQD 1257
            CNII PDM ARDL+N  GN+  GI+F  SFHES+TG+IS LPDKG+E LE+KKESAV QD
Sbjct: 351  CNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQD 410

Query: 1256 QRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQPF 1077
            QRNESFFGTALKSSLEC SYEWP IAPVYFPSISSH LPATDRLHLDVGH WHNH+RQPF
Sbjct: 411  QRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPF 470

Query: 1076 VPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXXXX 897
            VP +HQARN P DGGCNQILSQPLPMSLDWPPMV++VSGIAPSVTCNYDSGF+S+R    
Sbjct: 471  VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGF 530

Query: 896  XXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVSGI 717
               FATKGMQFN KTSDD+G  SGDFMDLPE TTTQE GDECDSHWLSEEELEVH VSGI
Sbjct: 531  QQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGI 590

Query: 716  DYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYSTN 537
            DYNQYFGGGVMYWNTSDHPG GFSRP        SWAWHEADIK AVDDMVAFSSSYSTN
Sbjct: 591  DYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTN 650

Query: 536  GLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLASF 357
            GLTSPTAASFC PFDPLGPGHQ FSYVVPGNEVPGKVLHSSS TTD ATEEE+SGS AS 
Sbjct: 651  GLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASL 710

Query: 356  SGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRPPS 177
            SGDVD+ A+D+           PN SRERSRSDFKRSH HKSPCVPPSRREQPRIKRPPS
Sbjct: 711  SGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPS 770

Query: 176  PVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PVVLCV              SRK RGFPTVRSGSSSPR+WGVRGWYH+GTTSEE CVR
Sbjct: 771  PVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVR 828


>ref|XP_006490855.1| PREDICTED: uncharacterized protein LOC102608196 isoform X3 [Citrus
            sinensis]
          Length = 1335

 Score = 1035 bits (2676), Expect = 0.0
 Identities = 529/718 (73%), Positives = 564/718 (78%), Gaps = 4/718 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLELFGEGNFK SPN SKE                 RQN LPKSALDELSLDKPPK    
Sbjct: 421  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKPPKDPEG 480

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL D EK D M SDKVP +   KDI +E+STSEMEM VCHQ+HA+ALV G          
Sbjct: 481  ALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTK 540

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSII-MQAGVAKYDNLSIQNVSADNSTHFNVFA 1611
                   NCT +NPVPVKD K+ V +TSS I +Q  V KYD LS QNVS DNST  NV A
Sbjct: 541  TVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLA 600

Query: 1610 SNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTDSKVE 1431
            SN SSCTSAS P REGI TQSTQED V++S++SEC +FSNG ID QT+ FLQETTDSKVE
Sbjct: 601  SNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVE 660

Query: 1430 CNIIPPDMSARDLNNTLGNN--GIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVIQD 1257
            CNII PDM ARDL+N  GN+  GI+F  SFHES+TG+IS LPDKG+E LE+KKESAV QD
Sbjct: 661  CNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQD 720

Query: 1256 QRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQPF 1077
            QRNESFFGTALKSSLEC SYEWP IAPVYFPSISSH LPATDRLHLDVGH WHNH+RQPF
Sbjct: 721  QRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPF 780

Query: 1076 VPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXXXX 897
            VP +HQARN P DGGCNQILSQPLPMSLDWPPMV++VSGIAPSVTCNYDSGF+S+R    
Sbjct: 781  VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGF 840

Query: 896  XXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVSGI 717
               FATKGMQFN KTSDD+G  SGDFMDLPE TTTQE GDECDSHWLSEEELEVH VSGI
Sbjct: 841  QQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGI 900

Query: 716  DYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYSTN 537
            DYNQYFGGGVMYWNTSDHPG GFSRP        SWAWHEADIK AVDDMVAFSSSYSTN
Sbjct: 901  DYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTN 960

Query: 536  GLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLASF 357
            GLTSPTAASFC PFDPLGPGHQ FSYVVPGNEVPGKVLHSSS TTD ATEEE+SGS AS 
Sbjct: 961  GLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASL 1020

Query: 356  SGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRPPS 177
            SGDVD+ A+D+           PN SRERSRSDFKRSH HKSPCVPPSRREQPRIKRPPS
Sbjct: 1021 SGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPS 1080

Query: 176  PVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PVVLCV              SRK RGFPTVRSGSSSPR+WGVRGWYH+GTTSEE CVR
Sbjct: 1081 PVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVR 1138


>ref|XP_006445325.1| hypothetical protein CICLE_v10018476mg [Citrus clementina]
            gi|568875545|ref|XP_006490853.1| PREDICTED:
            uncharacterized protein LOC102608196 isoform X1 [Citrus
            sinensis] gi|568875547|ref|XP_006490854.1| PREDICTED:
            uncharacterized protein LOC102608196 isoform X2 [Citrus
            sinensis] gi|557547587|gb|ESR58565.1| hypothetical
            protein CICLE_v10018476mg [Citrus clementina]
          Length = 1588

 Score = 1035 bits (2676), Expect = 0.0
 Identities = 529/718 (73%), Positives = 564/718 (78%), Gaps = 4/718 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLELFGEGNFK SPN SKE                 RQN LPKSALDELSLDKPPK    
Sbjct: 421  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKPPKDPEG 480

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL D EK D M SDKVP +   KDI +E+STSEMEM VCHQ+HA+ALV G          
Sbjct: 481  ALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTK 540

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSII-MQAGVAKYDNLSIQNVSADNSTHFNVFA 1611
                   NCT +NPVPVKD K+ V +TSS I +Q  V KYD LS QNVS DNST  NV A
Sbjct: 541  TVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLA 600

Query: 1610 SNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTDSKVE 1431
            SN SSCTSAS P REGI TQSTQED V++S++SEC +FSNG ID QT+ FLQETTDSKVE
Sbjct: 601  SNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVE 660

Query: 1430 CNIIPPDMSARDLNNTLGNN--GIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVIQD 1257
            CNII PDM ARDL+N  GN+  GI+F  SFHES+TG+IS LPDKG+E LE+KKESAV QD
Sbjct: 661  CNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQD 720

Query: 1256 QRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQPF 1077
            QRNESFFGTALKSSLEC SYEWP IAPVYFPSISSH LPATDRLHLDVGH WHNH+RQPF
Sbjct: 721  QRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPF 780

Query: 1076 VPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXXXX 897
            VP +HQARN P DGGCNQILSQPLPMSLDWPPMV++VSGIAPSVTCNYDSGF+S+R    
Sbjct: 781  VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGF 840

Query: 896  XXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVSGI 717
               FATKGMQFN KTSDD+G  SGDFMDLPE TTTQE GDECDSHWLSEEELEVH VSGI
Sbjct: 841  QQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGI 900

Query: 716  DYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYSTN 537
            DYNQYFGGGVMYWNTSDHPG GFSRP        SWAWHEADIK AVDDMVAFSSSYSTN
Sbjct: 901  DYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTN 960

Query: 536  GLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLASF 357
            GLTSPTAASFC PFDPLGPGHQ FSYVVPGNEVPGKVLHSSS TTD ATEEE+SGS AS 
Sbjct: 961  GLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASL 1020

Query: 356  SGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRPPS 177
            SGDVD+ A+D+           PN SRERSRSDFKRSH HKSPCVPPSRREQPRIKRPPS
Sbjct: 1021 SGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPS 1080

Query: 176  PVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PVVLCV              SRK RGFPTVRSGSSSPR+WGVRGWYH+GTTSEE CVR
Sbjct: 1081 PVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVR 1138


>gb|KDO85708.1| hypothetical protein CISIN_1g0003872mg, partial [Citrus sinensis]
            gi|641867025|gb|KDO85709.1| hypothetical protein
            CISIN_1g0003872mg, partial [Citrus sinensis]
            gi|641867026|gb|KDO85710.1| hypothetical protein
            CISIN_1g0003872mg, partial [Citrus sinensis]
          Length = 1457

 Score = 1031 bits (2666), Expect = 0.0
 Identities = 528/718 (73%), Positives = 563/718 (78%), Gaps = 4/718 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLELFGEGNFK SPN SKE                 RQN LPKSALDELSLDK PK    
Sbjct: 421  KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEG 480

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL D EK D M SDKVP +   KDI +E+STSEMEM VCHQ+HA+ALV G          
Sbjct: 481  ALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTK 540

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSII-MQAGVAKYDNLSIQNVSADNSTHFNVFA 1611
                   NCT +NPVPVKD K+ V +TSS I +Q  V KYD LS QNVS DNST  NV A
Sbjct: 541  TVKNKNKNCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLA 600

Query: 1610 SNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTDSKVE 1431
            SN SSCTSAS P REGI TQSTQED V++S++SEC +FSNG ID QT+ FLQETTDSKVE
Sbjct: 601  SNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVE 660

Query: 1430 CNIIPPDMSARDLNNTLGNN--GIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVIQD 1257
            CNII PDM ARDL+N  GN+  GI+F  SFHES+TG+IS LPDKG+E LE+KKESAV QD
Sbjct: 661  CNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQD 720

Query: 1256 QRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQPF 1077
            QRNESFFGTALKSSLEC SYEWP IAPVYFPSISSH LPATDRLHLDVGH WHNH+RQPF
Sbjct: 721  QRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPF 780

Query: 1076 VPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXXXX 897
            VP +HQARN P DGGCNQILSQPLPMSLDWPPMV++VSGIAPSVTCNYDSGF+S+R    
Sbjct: 781  VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGF 840

Query: 896  XXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVSGI 717
               FATKGMQFN KTSDD+G  SGDFMDLPE TTTQE GDECDSHWLSEEELEVH VSGI
Sbjct: 841  QQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGI 900

Query: 716  DYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYSTN 537
            DYNQYFGGGVMYWNTSDHPG GFSRP        SWAWHEADIK AVDDMVAFSSSYSTN
Sbjct: 901  DYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTN 960

Query: 536  GLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLASF 357
            GLTSPTAASFC PFDPLGPGHQ FSYVVPGNEVPGKVLHSSS TTD ATEEE+SGS AS 
Sbjct: 961  GLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASL 1020

Query: 356  SGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRPPS 177
            SGDVD+ A+D+           PN SRERSRSDFKRSH HKSPCVPPSRREQPRIKRPPS
Sbjct: 1021 SGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPS 1080

Query: 176  PVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PVVLCV              SRK RGFPTVRSGSSSPR+WGVRGWYH+GTTSEE CVR
Sbjct: 1081 PVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVR 1138


>ref|XP_007052167.1| Nucleotidyltransferase family protein isoform 11 [Theobroma cacao]
            gi|508704428|gb|EOX96324.1| Nucleotidyltransferase family
            protein isoform 11 [Theobroma cacao]
          Length = 1261

 Score =  692 bits (1786), Expect = 0.0
 Identities = 376/720 (52%), Positives = 448/720 (62%), Gaps = 6/720 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                 +Q  + K+ +++L  +KP K L  
Sbjct: 420  KLELLGEGNFNSSSDKSKDKFSACSRKKKGRSRNIKKQIPVAKAEVNDLLPEKPLKDLES 479

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
               + +K D  ES K+P +         + S+MEM     +H Q+L+ G           
Sbjct: 480  VSTNNKKADLKESSKMPVITHGKDVNRKTPSQMEM-----EHTQSLIGGKGRAAARKSRK 534

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPV--SKTSSIIMQAGVAK----YDNLSIQNVSADNSTHF 1623
                  +   +    +K SK  V  + TSS I Q          DNL+IQ V  D  +  
Sbjct: 535  EKNKNKHTCVNGTTELKTSKKAVIEASTSSFIFQDEATNSSGVLDNLNIQGVPTDTMSQS 594

Query: 1622 NVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTD 1443
            NV  SN S     + P RE I      +D  + S   E   +S  + + +     QE ++
Sbjct: 595  NVLESNSSPNRPHNQPFREEIAMNV--QDPEVGSTGQE--DYSKDVTENEFIATGQEDSN 650

Query: 1442 SKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVI 1263
             +VECN +PP +   + ++     GI+   S   SK    S  PD     L+VK+E +VI
Sbjct: 651  CRVECNRLPPIIPVPESDSVFTGEGINLQNSHSASKIQENSTSPDASGNTLDVKEEVSVI 710

Query: 1262 QDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQ 1083
            Q Q ++  + TA  SS +C SYEWP++AP YFPSI+SH   ATDRLHLDVGH WHNHIRQ
Sbjct: 711  QVQ-DKKLYDTAPTSSPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVGHNWHNHIRQ 769

Query: 1082 PFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXX 903
            PFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+ P +TCNY SGF+S R  
Sbjct: 770  PFVPTMHQARNPQIESGCNRILSRPMPMSLDWPPMVRSASGLTPPITCNYGSGFISRRQT 829

Query: 902  XXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVS 723
                 FA++  QFN K  DD+  YSGDF DLP+L  T E  DECDSHW+SEEE EVHAVS
Sbjct: 830  AFQQGFASQNFQFNTKNLDDERKYSGDFFDLPDLANTVELADECDSHWISEEEFEVHAVS 889

Query: 722  GIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYS 543
            GIDYNQYFGGGVMYWN SDHPG GFSRP        SWAWHEAD+  AVDDMVAFSSSYS
Sbjct: 890  GIDYNQYFGGGVMYWNPSDHPGTGFSRPPSLSSDDSSWAWHEADMSRAVDDMVAFSSSYS 949

Query: 542  TNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLA 363
            TNGLTSPTAA FC PF+PLGPGHQ  SYVVPGN+VPGKVLHS S T DAATEEE SGSLA
Sbjct: 950  TNGLTSPTAAPFCSPFEPLGPGHQAVSYVVPGNDVPGKVLHSPSPTPDAATEEEASGSLA 1009

Query: 362  SFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRP 183
            + S DV+    DS           PN SRERSRSDFKR H+HKSPCVPP+RREQPRIKRP
Sbjct: 1010 NLSSDVEGKTGDSLPYPILRPIIIPNISRERSRSDFKRGHDHKSPCVPPTRREQPRIKRP 1069

Query: 182  PSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG YHDGT SEEACVR
Sbjct: 1070 PSPVVLCVPRAPRPPPPSPVNDSRKQRGFPTVRSGSSSPRHWGMRGLYHDGTNSEEACVR 1129


>ref|XP_007052158.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
            gi|590723340|ref|XP_007052159.1| Nucleotidyltransferase
            family protein isoform 2 [Theobroma cacao]
            gi|590723353|ref|XP_007052163.1| Nucleotidyltransferase
            family protein isoform 2 [Theobroma cacao]
            gi|590723356|ref|XP_007052164.1| Nucleotidyltransferase
            family protein isoform 2 [Theobroma cacao]
            gi|508704419|gb|EOX96315.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
            gi|508704420|gb|EOX96316.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
            gi|508704424|gb|EOX96320.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
            gi|508704425|gb|EOX96321.1| Nucleotidyltransferase family
            protein isoform 2 [Theobroma cacao]
          Length = 1577

 Score =  692 bits (1786), Expect = 0.0
 Identities = 376/720 (52%), Positives = 448/720 (62%), Gaps = 6/720 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                 +Q  + K+ +++L  +KP K L  
Sbjct: 420  KLELLGEGNFNSSSDKSKDKFSACSRKKKGRSRNIKKQIPVAKAEVNDLLPEKPLKDLES 479

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
               + +K D  ES K+P +         + S+MEM     +H Q+L+ G           
Sbjct: 480  VSTNNKKADLKESSKMPVITHGKDVNRKTPSQMEM-----EHTQSLIGGKGRAAARKSRK 534

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPV--SKTSSIIMQAGVAK----YDNLSIQNVSADNSTHF 1623
                  +   +    +K SK  V  + TSS I Q          DNL+IQ V  D  +  
Sbjct: 535  EKNKNKHTCVNGTTELKTSKKAVIEASTSSFIFQDEATNSSGVLDNLNIQGVPTDTMSQS 594

Query: 1622 NVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTD 1443
            NV  SN S     + P RE I      +D  + S   E   +S  + + +     QE ++
Sbjct: 595  NVLESNSSPNRPHNQPFREEIAMNV--QDPEVGSTGQE--DYSKDVTENEFIATGQEDSN 650

Query: 1442 SKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVI 1263
             +VECN +PP +   + ++     GI+   S   SK    S  PD     L+VK+E +VI
Sbjct: 651  CRVECNRLPPIIPVPESDSVFTGEGINLQNSHSASKIQENSTSPDASGNTLDVKEEVSVI 710

Query: 1262 QDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQ 1083
            Q Q ++  + TA  SS +C SYEWP++AP YFPSI+SH   ATDRLHLDVGH WHNHIRQ
Sbjct: 711  QVQ-DKKLYDTAPTSSPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVGHNWHNHIRQ 769

Query: 1082 PFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXX 903
            PFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+ P +TCNY SGF+S R  
Sbjct: 770  PFVPTMHQARNPQIESGCNRILSRPMPMSLDWPPMVRSASGLTPPITCNYGSGFISRRQT 829

Query: 902  XXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVS 723
                 FA++  QFN K  DD+  YSGDF DLP+L  T E  DECDSHW+SEEE EVHAVS
Sbjct: 830  AFQQGFASQNFQFNTKNLDDERKYSGDFFDLPDLANTVELADECDSHWISEEEFEVHAVS 889

Query: 722  GIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYS 543
            GIDYNQYFGGGVMYWN SDHPG GFSRP        SWAWHEAD+  AVDDMVAFSSSYS
Sbjct: 890  GIDYNQYFGGGVMYWNPSDHPGTGFSRPPSLSSDDSSWAWHEADMSRAVDDMVAFSSSYS 949

Query: 542  TNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLA 363
            TNGLTSPTAA FC PF+PLGPGHQ  SYVVPGN+VPGKVLHS S T DAATEEE SGSLA
Sbjct: 950  TNGLTSPTAAPFCSPFEPLGPGHQAVSYVVPGNDVPGKVLHSPSPTPDAATEEEASGSLA 1009

Query: 362  SFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRP 183
            + S DV+    DS           PN SRERSRSDFKR H+HKSPCVPP+RREQPRIKRP
Sbjct: 1010 NLSSDVEGKTGDSLPYPILRPIIIPNISRERSRSDFKRGHDHKSPCVPPTRREQPRIKRP 1069

Query: 182  PSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG YHDGT SEEACVR
Sbjct: 1070 PSPVVLCVPRAPRPPPPSPVNDSRKQRGFPTVRSGSSSPRHWGMRGLYHDGTNSEEACVR 1129


>ref|XP_007052157.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
            gi|508704418|gb|EOX96314.1| Nucleotidyltransferase family
            protein isoform 1 [Theobroma cacao]
          Length = 1577

 Score =  692 bits (1786), Expect = 0.0
 Identities = 376/720 (52%), Positives = 448/720 (62%), Gaps = 6/720 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                 +Q  + K+ +++L  +KP K L  
Sbjct: 420  KLELLGEGNFNSSSDKSKDKFSACSRKKKGRSRNIKKQIPVAKAEVNDLLPEKPLKDLES 479

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
               + +K D  ES K+P +         + S+MEM     +H Q+L+ G           
Sbjct: 480  VSTNNKKADLKESSKMPVITHGKDVNRKTPSQMEM-----EHTQSLIGGKGRAAARKSRK 534

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPV--SKTSSIIMQAGVAK----YDNLSIQNVSADNSTHF 1623
                  +   +    +K SK  V  + TSS I Q          DNL+IQ V  D  +  
Sbjct: 535  EKNKNKHTCVNGTTELKTSKKAVIEASTSSFIFQDEATNSSGVLDNLNIQGVPTDTMSQS 594

Query: 1622 NVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTD 1443
            NV  SN S     + P RE I      +D  + S   E   +S  + + +     QE ++
Sbjct: 595  NVLESNSSPNRPHNQPFREEIAMNV--QDPEVGSTGQE--DYSKDVTENEFIATGQEDSN 650

Query: 1442 SKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVI 1263
             +VECN +PP +   + ++     GI+   S   SK    S  PD     L+VK+E +VI
Sbjct: 651  CRVECNRLPPIIPVPESDSVFTGEGINLQNSHSASKIQENSTSPDASGNTLDVKEEVSVI 710

Query: 1262 QDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQ 1083
            Q Q ++  + TA  SS +C SYEWP++AP YFPSI+SH   ATDRLHLDVGH WHNHIRQ
Sbjct: 711  QVQ-DKKLYDTAPTSSPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVGHNWHNHIRQ 769

Query: 1082 PFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXX 903
            PFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+ P +TCNY SGF+S R  
Sbjct: 770  PFVPTMHQARNPQIESGCNRILSRPMPMSLDWPPMVRSASGLTPPITCNYGSGFISRRQT 829

Query: 902  XXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVS 723
                 FA++  QFN K  DD+  YSGDF DLP+L  T E  DECDSHW+SEEE EVHAVS
Sbjct: 830  AFQQGFASQNFQFNTKNLDDERKYSGDFFDLPDLANTVELADECDSHWISEEEFEVHAVS 889

Query: 722  GIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYS 543
            GIDYNQYFGGGVMYWN SDHPG GFSRP        SWAWHEAD+  AVDDMVAFSSSYS
Sbjct: 890  GIDYNQYFGGGVMYWNPSDHPGTGFSRPPSLSSDDSSWAWHEADMSRAVDDMVAFSSSYS 949

Query: 542  TNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLA 363
            TNGLTSPTAA FC PF+PLGPGHQ  SYVVPGN+VPGKVLHS S T DAATEEE SGSLA
Sbjct: 950  TNGLTSPTAAPFCSPFEPLGPGHQAVSYVVPGNDVPGKVLHSPSPTPDAATEEEASGSLA 1009

Query: 362  SFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRP 183
            + S DV+    DS           PN SRERSRSDFKR H+HKSPCVPP+RREQPRIKRP
Sbjct: 1010 NLSSDVEGKTGDSLPYPILRPIIIPNISRERSRSDFKRGHDHKSPCVPPTRREQPRIKRP 1069

Query: 182  PSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG YHDGT SEEACVR
Sbjct: 1070 PSPVVLCVPRAPRPPPPSPVNDSRKQRGFPTVRSGSSSPRHWGMRGLYHDGTNSEEACVR 1129


>ref|XP_012083850.1| PREDICTED: uncharacterized protein LOC105643363 [Jatropha curcas]
          Length = 1526

 Score =  689 bits (1778), Expect = 0.0
 Identities = 380/727 (52%), Positives = 454/727 (62%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNFK S N  KE                 +        + E S +KP K   D
Sbjct: 422  KLELLGEGNFKSSTNKPKEKLSAGKRKKKGKTHSMKKSIPATGIGVRESSFNKPLKDHDD 481

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL  +E  +S    ++P+MP  ++I +++ +S +EM     +H+Q LV+G          
Sbjct: 482  ALTYSENMESTAVSELPNMPLGREIQEDTLSSAVEM-----EHSQGLVIGKGQTAARKNR 536

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSK--TSSIIMQAGVAKYD----NLSIQNVSADNSTH 1626
                     T +N V VK+++  V++    SII     AK D    N + QNVS D    
Sbjct: 537  KRKNKSKTSTLNNVVEVKNAESSVAEGPCMSIICSEEAAKLDMVSDNSATQNVSNDILVG 596

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECH------QFSNGMIDYQTKP 1464
               F  N +  TSAS P +EGI  QS QED V+      CH      Q SN M++ ++ P
Sbjct: 597  SESFVPNVNLNTSASEPTKEGIGVQSIQEDGVVGQNEGICHIGSEHEQSSNNMMEDESIP 656

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
               ET + K E ++    +    +N    N  I+F       K+ + S   D+ V  L V
Sbjct: 657  SRIETLNFKTETSVTSHVVPMLKINTNSSNEDINF----QNKKSKARSKFSDRSVRDLNV 712

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K E  +IQ Q N+ F G  L +S E  SYEWPN+APVYFPS++SH  PATDRLHLDVG  
Sbjct: 713  KDEPTLIQGQGNKKFNGARLTNSSEYISYEWPNLAPVYFPSLNSHLPPATDRLHLDVGCN 772

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            W NH+RQPFVP VHQARN  I+ G N+ LS+PL MSLDWPPMVRS  G+APS+TCNYDSG
Sbjct: 773  WQNHVRQPFVPTVHQARNSAIENGYNRTLSRPLQMSLDWPPMVRSNYGLAPSMTCNYDSG 832

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S R       F    MQFN KT+D++  YSGDF+D PE    QE  D+ +SHW+SEEE
Sbjct: 833  FISRRQSVFQQSFTAHNMQFNAKTTDEEKKYSGDFIDAPESANAQELMDDYESHWISEEE 892

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
            LEVHAVSGIDYNQYFGGGVMYWN SDHPG GFSRP        +WAWHEADI  AVDDMV
Sbjct: 893  LEVHAVSGIDYNQYFGGGVMYWNPSDHPGKGFSRPPSLSSDDSTWAWHEADINRAVDDMV 952

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYSTNGLTSPTAASFC PF+PLG GHQ   YV+PGNEV GKVLHSS+  TD+ATEE
Sbjct: 953  AFSSSYSTNGLTSPTAASFCSPFEPLGAGHQALGYVLPGNEVSGKVLHSSTTPTDSATEE 1012

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            EV+G+LA+ S DV+    DS           PN SRERSRSDFKRSH+HKSPCVPPSRRE
Sbjct: 1013 EVTGTLANLSVDVEGKVGDSLPYPILPPIIIPNMSRERSRSDFKRSHDHKSPCVPPSRRE 1072

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
            QPRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+W +RGWYH+GT 
Sbjct: 1073 QPRIKRPPSPVVLCVPRAPRPPPPSPVSGSRKHRGFPTVRSGSSSPRHWSMRGWYHEGTN 1132

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 1133 LEEACVR 1139


>gb|KDP28800.1| hypothetical protein JCGZ_14571 [Jatropha curcas]
          Length = 1591

 Score =  689 bits (1778), Expect = 0.0
 Identities = 380/727 (52%), Positives = 454/727 (62%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNFK S N  KE                 +        + E S +KP K   D
Sbjct: 422  KLELLGEGNFKSSTNKPKEKLSAGKRKKKGKTHSMKKSIPATGIGVRESSFNKPLKDHDD 481

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            AL  +E  +S    ++P+MP  ++I +++ +S +EM     +H+Q LV+G          
Sbjct: 482  ALTYSENMESTAVSELPNMPLGREIQEDTLSSAVEM-----EHSQGLVIGKGQTAARKNR 536

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSK--TSSIIMQAGVAKYD----NLSIQNVSADNSTH 1626
                     T +N V VK+++  V++    SII     AK D    N + QNVS D    
Sbjct: 537  KRKNKSKTSTLNNVVEVKNAESSVAEGPCMSIICSEEAAKLDMVSDNSATQNVSNDILVG 596

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECH------QFSNGMIDYQTKP 1464
               F  N +  TSAS P +EGI  QS QED V+      CH      Q SN M++ ++ P
Sbjct: 597  SESFVPNVNLNTSASEPTKEGIGVQSIQEDGVVGQNEGICHIGSEHEQSSNNMMEDESIP 656

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
               ET + K E ++    +    +N    N  I+F       K+ + S   D+ V  L V
Sbjct: 657  SRIETLNFKTETSVTSHVVPMLKINTNSSNEDINF----QNKKSKARSKFSDRSVRDLNV 712

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K E  +IQ Q N+ F G  L +S E  SYEWPN+APVYFPS++SH  PATDRLHLDVG  
Sbjct: 713  KDEPTLIQGQGNKKFNGARLTNSSEYISYEWPNLAPVYFPSLNSHLPPATDRLHLDVGCN 772

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            W NH+RQPFVP VHQARN  I+ G N+ LS+PL MSLDWPPMVRS  G+APS+TCNYDSG
Sbjct: 773  WQNHVRQPFVPTVHQARNSAIENGYNRTLSRPLQMSLDWPPMVRSNYGLAPSMTCNYDSG 832

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S R       F    MQFN KT+D++  YSGDF+D PE    QE  D+ +SHW+SEEE
Sbjct: 833  FISRRQSVFQQSFTAHNMQFNAKTTDEEKKYSGDFIDAPESANAQELMDDYESHWISEEE 892

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
            LEVHAVSGIDYNQYFGGGVMYWN SDHPG GFSRP        +WAWHEADI  AVDDMV
Sbjct: 893  LEVHAVSGIDYNQYFGGGVMYWNPSDHPGKGFSRPPSLSSDDSTWAWHEADINRAVDDMV 952

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYSTNGLTSPTAASFC PF+PLG GHQ   YV+PGNEV GKVLHSS+  TD+ATEE
Sbjct: 953  AFSSSYSTNGLTSPTAASFCSPFEPLGAGHQALGYVLPGNEVSGKVLHSSTTPTDSATEE 1012

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            EV+G+LA+ S DV+    DS           PN SRERSRSDFKRSH+HKSPCVPPSRRE
Sbjct: 1013 EVTGTLANLSVDVEGKVGDSLPYPILPPIIIPNMSRERSRSDFKRSHDHKSPCVPPSRRE 1072

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
            QPRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+W +RGWYH+GT 
Sbjct: 1073 QPRIKRPPSPVVLCVPRAPRPPPPSPVSGSRKHRGFPTVRSGSSSPRHWSMRGWYHEGTN 1132

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 1133 LEEACVR 1139


>ref|XP_007052162.1| Nucleotidyltransferase family protein isoform 6 [Theobroma cacao]
            gi|508704423|gb|EOX96319.1| Nucleotidyltransferase family
            protein isoform 6 [Theobroma cacao]
          Length = 1222

 Score =  664 bits (1712), Expect = 0.0
 Identities = 365/720 (50%), Positives = 431/720 (59%), Gaps = 6/720 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                 +Q  + K+ +++L  +KP K    
Sbjct: 420  KLELLGEGNFNSSSDKSKDKFSACSRKKKGRSRNIKKQIPVAKAEVNDLLPEKPLK---- 475

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
                                                    +H Q+L+ G           
Sbjct: 476  ----------------------------------------EHTQSLIGGKGRAAARKSRK 495

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPV--SKTSSIIMQAGVAK----YDNLSIQNVSADNSTHF 1623
                  +   +    +K SK  V  + TSS I Q          DNL+IQ V  D  +  
Sbjct: 496  EKNKNKHTCVNGTTELKTSKKAVIEASTSSFIFQDEATNSSGVLDNLNIQGVPTDTMSQS 555

Query: 1622 NVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTD 1443
            NV  SN S     + P RE I      +D  + S   E   +S  + + +     QE ++
Sbjct: 556  NVLESNSSPNRPHNQPFREEIAMNV--QDPEVGSTGQE--DYSKDVTENEFIATGQEDSN 611

Query: 1442 SKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVI 1263
             +VECN +PP +   + ++     GI+   S   SK    S  PD     L+VK+E +VI
Sbjct: 612  CRVECNRLPPIIPVPESDSVFTGEGINLQNSHSASKIQENSTSPDASGNTLDVKEEVSVI 671

Query: 1262 QDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQ 1083
            Q Q ++  + TA  SS +C SYEWP++AP YFPSI+SH   ATDRLHLDVGH WHNHIRQ
Sbjct: 672  QVQ-DKKLYDTAPTSSPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVGHNWHNHIRQ 730

Query: 1082 PFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXX 903
            PFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+ P +TCNY SGF+S R  
Sbjct: 731  PFVPTMHQARNPQIESGCNRILSRPMPMSLDWPPMVRSASGLTPPITCNYGSGFISRRQT 790

Query: 902  XXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVS 723
                 FA++  QFN K  DD+  YSGDF DLP+L  T E  DECDSHW+SEEE EVHAVS
Sbjct: 791  AFQQGFASQNFQFNTKNLDDERKYSGDFFDLPDLANTVELADECDSHWISEEEFEVHAVS 850

Query: 722  GIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYS 543
            GIDYNQYFGGGVMYWN SDHPG GFSRP        SWAWHEAD+  AVDDMVAFSSSYS
Sbjct: 851  GIDYNQYFGGGVMYWNPSDHPGTGFSRPPSLSSDDSSWAWHEADMSRAVDDMVAFSSSYS 910

Query: 542  TNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLA 363
            TNGLTSPTAA FC PF+PLGPGHQ  SYVVPGN+VPGKVLHS S T DAATEEE SGSLA
Sbjct: 911  TNGLTSPTAAPFCSPFEPLGPGHQAVSYVVPGNDVPGKVLHSPSPTPDAATEEEASGSLA 970

Query: 362  SFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRP 183
            + S DV+    DS           PN SRERSRSDFKR H+HKSPCVPP+RREQPRIKRP
Sbjct: 971  NLSSDVEGKTGDSLPYPILRPIIIPNISRERSRSDFKRGHDHKSPCVPPTRREQPRIKRP 1030

Query: 182  PSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG YHDGT SEEACVR
Sbjct: 1031 PSPVVLCVPRAPRPPPPSPVNDSRKQRGFPTVRSGSSSPRHWGMRGLYHDGTNSEEACVR 1090


>ref|XP_007052160.1| Nucleotidyltransferase family protein isoform 4 [Theobroma cacao]
            gi|590723347|ref|XP_007052161.1| Nucleotidyltransferase
            family protein isoform 4 [Theobroma cacao]
            gi|590723359|ref|XP_007052165.1| Nucleotidyltransferase
            family protein isoform 4 [Theobroma cacao]
            gi|590723369|ref|XP_007052166.1| Nucleotidyltransferase
            family protein isoform 4 [Theobroma cacao]
            gi|590723383|ref|XP_007052168.1| Nucleotidyltransferase
            family protein isoform 4 [Theobroma cacao]
            gi|508704421|gb|EOX96317.1| Nucleotidyltransferase family
            protein isoform 4 [Theobroma cacao]
            gi|508704422|gb|EOX96318.1| Nucleotidyltransferase family
            protein isoform 4 [Theobroma cacao]
            gi|508704426|gb|EOX96322.1| Nucleotidyltransferase family
            protein isoform 4 [Theobroma cacao]
            gi|508704427|gb|EOX96323.1| Nucleotidyltransferase family
            protein isoform 4 [Theobroma cacao]
            gi|508704429|gb|EOX96325.1| Nucleotidyltransferase family
            protein isoform 4 [Theobroma cacao]
          Length = 1538

 Score =  664 bits (1712), Expect = 0.0
 Identities = 365/720 (50%), Positives = 431/720 (59%), Gaps = 6/720 (0%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                 +Q  + K+ +++L  +KP K    
Sbjct: 420  KLELLGEGNFNSSSDKSKDKFSACSRKKKGRSRNIKKQIPVAKAEVNDLLPEKPLK---- 475

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
                                                    +H Q+L+ G           
Sbjct: 476  ----------------------------------------EHTQSLIGGKGRAAARKSRK 495

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPV--SKTSSIIMQAGVAK----YDNLSIQNVSADNSTHF 1623
                  +   +    +K SK  V  + TSS I Q          DNL+IQ V  D  +  
Sbjct: 496  EKNKNKHTCVNGTTELKTSKKAVIEASTSSFIFQDEATNSSGVLDNLNIQGVPTDTMSQS 555

Query: 1622 NVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETTD 1443
            NV  SN S     + P RE I      +D  + S   E   +S  + + +     QE ++
Sbjct: 556  NVLESNSSPNRPHNQPFREEIAMNV--QDPEVGSTGQE--DYSKDVTENEFIATGQEDSN 611

Query: 1442 SKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAVI 1263
             +VECN +PP +   + ++     GI+   S   SK    S  PD     L+VK+E +VI
Sbjct: 612  CRVECNRLPPIIPVPESDSVFTGEGINLQNSHSASKIQENSTSPDASGNTLDVKEEVSVI 671

Query: 1262 QDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHIWHNHIRQ 1083
            Q Q ++  + TA  SS +C SYEWP++AP YFPSI+SH   ATDRLHLDVGH WHNHIRQ
Sbjct: 672  QVQ-DKKLYDTAPTSSPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVGHNWHNHIRQ 730

Query: 1082 PFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSARXX 903
            PFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+ P +TCNY SGF+S R  
Sbjct: 731  PFVPTMHQARNPQIESGCNRILSRPMPMSLDWPPMVRSASGLTPPITCNYGSGFISRRQT 790

Query: 902  XXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHAVS 723
                 FA++  QFN K  DD+  YSGDF DLP+L  T E  DECDSHW+SEEE EVHAVS
Sbjct: 791  AFQQGFASQNFQFNTKNLDDERKYSGDFFDLPDLANTVELADECDSHWISEEEFEVHAVS 850

Query: 722  GIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSSYS 543
            GIDYNQYFGGGVMYWN SDHPG GFSRP        SWAWHEAD+  AVDDMVAFSSSYS
Sbjct: 851  GIDYNQYFGGGVMYWNPSDHPGTGFSRPPSLSSDDSSWAWHEADMSRAVDDMVAFSSSYS 910

Query: 542  TNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGSLA 363
            TNGLTSPTAA FC PF+PLGPGHQ  SYVVPGN+VPGKVLHS S T DAATEEE SGSLA
Sbjct: 911  TNGLTSPTAAPFCSPFEPLGPGHQAVSYVVPGNDVPGKVLHSPSPTPDAATEEEASGSLA 970

Query: 362  SFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIKRP 183
            + S DV+    DS           PN SRERSRSDFKR H+HKSPCVPP+RREQPRIKRP
Sbjct: 971  NLSSDVEGKTGDSLPYPILRPIIIPNISRERSRSDFKRGHDHKSPCVPPTRREQPRIKRP 1030

Query: 182  PSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEACVR 3
            PSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG YHDGT SEEACVR
Sbjct: 1031 PSPVVLCVPRAPRPPPPSPVNDSRKQRGFPTVRSGSSSPRHWGMRGLYHDGTNSEEACVR 1090


>ref|XP_012489736.1| PREDICTED: uncharacterized protein LOC105802572 [Gossypium raimondii]
            gi|763773936|gb|KJB41059.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
            gi|763773939|gb|KJB41062.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
          Length = 1569

 Score =  624 bits (1608), Expect = e-175
 Identities = 357/729 (48%), Positives = 432/729 (59%), Gaps = 16/729 (2%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                  QN + K  +D+    KP K L  
Sbjct: 422  KLELLGEGNFNSSSDKSKDQFSASSRKKKVKSRNIKNQNPVLKMEMDDHPPQKPLKDLEY 481

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
                 +K D MES K   +P  KD+  +S       A   +   +               
Sbjct: 482  KSTHNKKADLMESTKTHVIPHDKDVQTQSGVGGKGQAAARKSRKEK-------------- 527

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSI--IMQAGVAK----YDNLSIQN-VSADNST 1629
                       ++   VK SK  V+ +SS+  + Q    K     DNLS+++ V  D  +
Sbjct: 528  ---NKKKRSYINDTTEVKSSKKAVTGSSSLSFVSQDEATKSNGVLDNLSVEHSVPTDTIS 584

Query: 1628 HFNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECH------QFSNGMIDYQTK 1467
            H N+     S     +   +E I      +D  + S +  CH      Q S  +   +  
Sbjct: 585  HTNILEPISSPTEPDNQLFKEDIALHV--QDHEVGSTNGFCHKGTGHQQDSKDISANEII 642

Query: 1466 PFLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEV-- 1293
            P  QE+++ K ECN++PP               +  G+  +E     I      GV V  
Sbjct: 643  PTRQESSNYKRECNVLPPIAPKP--------GSVFIGEGINEHSASKIQENSPSGVSVNA 694

Query: 1292 LEVKKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDV 1113
            L++K+  +VIQ Q ++ F+ TA   + +C SYEWP++AP YFPSI+SH   ATDRLHLDV
Sbjct: 695  LDIKEGVSVIQVQ-DKKFYNTA--PTPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDV 751

Query: 1112 GHIWHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNY 933
            GH WHNHIRQPFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+APSVT NY
Sbjct: 752  GHNWHNHIRQPFVPTMHQARNPSIESGCNRILSRPMPMSLDWPPMVRSASGLAPSVTYNY 811

Query: 932  DSGFMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLS 753
            DSGF+S R       FA++  QFN+K+ +DD  YSGDF DLP+   T E  DE DSH++S
Sbjct: 812  DSGFISRRQTAFQQSFASQNFQFNMKSFEDDRKYSGDFFDLPDPANTSELADEYDSHYIS 871

Query: 752  EEELEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVD 573
            EEE EVHAVSGIDYNQYFGGGVMYWN SD PG GFSRP        SWAW EAD+  AVD
Sbjct: 872  EEEFEVHAVSGIDYNQYFGGGVMYWNPSDLPGTGFSRPPSLSSDDSSWAWREADMNRAVD 931

Query: 572  DMVAFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAA 393
            DMVAFSSSYSTNGLTSPTA  FC PFDPLGPGHQ  SYVVPGNEV  KVLHS+S T DAA
Sbjct: 932  DMVAFSSSYSTNGLTSPTATPFCSPFDPLGPGHQAVSYVVPGNEVSSKVLHSASATPDAA 991

Query: 392  TEEEVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPS 213
            TEEE SGS  + S DVDA   DS           PN SRERS+SDFKR H+HKSP V P+
Sbjct: 992  TEEEASGSFTNLSSDVDAKTGDSLPYPILRPIIIPNISRERSKSDFKRGHDHKSPRVAPT 1051

Query: 212  RREQPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHD 33
            RREQPRI+RPPSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG Y+D
Sbjct: 1052 RREQPRIRRPPSPVVLCVPRAPRPPPPSPVSDSRKQRGFPTVRSGSSSPRHWGMRGLYYD 1111

Query: 32   GTTSEEACV 6
            GT SE+ACV
Sbjct: 1112 GTNSEDACV 1120


>gb|KJB41055.1| hypothetical protein B456_007G088700 [Gossypium raimondii]
          Length = 1466

 Score =  624 bits (1608), Expect = e-175
 Identities = 357/729 (48%), Positives = 432/729 (59%), Gaps = 16/729 (2%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                  QN + K  +D+    KP K L  
Sbjct: 422  KLELLGEGNFNSSSDKSKDQFSASSRKKKVKSRNIKNQNPVLKMEMDDHPPQKPLKDLEY 481

Query: 1964 ALPDAEKGDSMESDKVPDMPK-KDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
                 +K D MES K   +P  KD+  +S       A   +   +               
Sbjct: 482  KSTHNKKADLMESTKTHVIPHDKDVQTQSGVGGKGQAAARKSRKEK-------------- 527

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSI--IMQAGVAK----YDNLSIQN-VSADNST 1629
                       ++   VK SK  V+ +SS+  + Q    K     DNLS+++ V  D  +
Sbjct: 528  ---NKKKRSYINDTTEVKSSKKAVTGSSSLSFVSQDEATKSNGVLDNLSVEHSVPTDTIS 584

Query: 1628 HFNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECH------QFSNGMIDYQTK 1467
            H N+     S     +   +E I      +D  + S +  CH      Q S  +   +  
Sbjct: 585  HTNILEPISSPTEPDNQLFKEDIALHV--QDHEVGSTNGFCHKGTGHQQDSKDISANEII 642

Query: 1466 PFLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEV-- 1293
            P  QE+++ K ECN++PP               +  G+  +E     I      GV V  
Sbjct: 643  PTRQESSNYKRECNVLPPIAPKP--------GSVFIGEGINEHSASKIQENSPSGVSVNA 694

Query: 1292 LEVKKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDV 1113
            L++K+  +VIQ Q ++ F+ TA   + +C SYEWP++AP YFPSI+SH   ATDRLHLDV
Sbjct: 695  LDIKEGVSVIQVQ-DKKFYNTA--PTPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDV 751

Query: 1112 GHIWHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNY 933
            GH WHNHIRQPFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+APSVT NY
Sbjct: 752  GHNWHNHIRQPFVPTMHQARNPSIESGCNRILSRPMPMSLDWPPMVRSASGLAPSVTYNY 811

Query: 932  DSGFMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLS 753
            DSGF+S R       FA++  QFN+K+ +DD  YSGDF DLP+   T E  DE DSH++S
Sbjct: 812  DSGFISRRQTAFQQSFASQNFQFNMKSFEDDRKYSGDFFDLPDPANTSELADEYDSHYIS 871

Query: 752  EEELEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVD 573
            EEE EVHAVSGIDYNQYFGGGVMYWN SD PG GFSRP        SWAW EAD+  AVD
Sbjct: 872  EEEFEVHAVSGIDYNQYFGGGVMYWNPSDLPGTGFSRPPSLSSDDSSWAWREADMNRAVD 931

Query: 572  DMVAFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAA 393
            DMVAFSSSYSTNGLTSPTA  FC PFDPLGPGHQ  SYVVPGNEV  KVLHS+S T DAA
Sbjct: 932  DMVAFSSSYSTNGLTSPTATPFCSPFDPLGPGHQAVSYVVPGNEVSSKVLHSASATPDAA 991

Query: 392  TEEEVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPS 213
            TEEE SGS  + S DVDA   DS           PN SRERS+SDFKR H+HKSP V P+
Sbjct: 992  TEEEASGSFTNLSSDVDAKTGDSLPYPILRPIIIPNISRERSKSDFKRGHDHKSPRVAPT 1051

Query: 212  RREQPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHD 33
            RREQPRI+RPPSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG Y+D
Sbjct: 1052 RREQPRIRRPPSPVVLCVPRAPRPPPPSPVSDSRKQRGFPTVRSGSSSPRHWGMRGLYYD 1111

Query: 32   GTTSEEACV 6
            GT SE+ACV
Sbjct: 1112 GTNSEDACV 1120


>ref|XP_011033759.1| PREDICTED: uncharacterized protein LOC105132110 isoform X4 [Populus
            euphratica]
          Length = 1212

 Score =  612 bits (1577), Expect = e-172
 Identities = 345/727 (47%), Positives = 431/727 (59%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            +LEL GEG  K S N   E                 + N  P  ++DE S  K  + +  
Sbjct: 55   RLELLGEGTSKSSANKPSEKLGGGSRRKKGRTHNMKKPNPAPVKSVDESSFKKLAEDIKC 114

Query: 1964 ALPDAEKGDSMESDKVPDMP-KKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            A    +K + MES+++P +P + +  ++ S+S +EM     +H Q LV            
Sbjct: 115  APTCIKKTELMESNEMPGIPHENENHRDISSSTVEM-----EHTQGLVHEKKRTAGRKNR 169

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSIIMQAGVAKY------DNLSIQNVSADNSTH 1626
                     + SNPV V+  ++ VS+  S  + +   +       DNL+ Q  S D+   
Sbjct: 170  KGRNKKKKSSFSNPVEVRKPEIAVSEACSFSVYSSDEEAKLCGLSDNLATQKASNDSLID 229

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHS------ECHQFSNGMIDYQTKP 1464
                        S + P RE I      ED  +           E ++ SNG +D ++ P
Sbjct: 230  -----------PSINEPTREEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMP 278

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
              +ET     + NII    + ++L     N G     SF   KT   S + +K V  L+V
Sbjct: 279  SRRETCCGASQ-NIIYQVATTKELITVSSNEGT----SFLNKKTEVKSDVGNKLVRTLDV 333

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SH  PAT RLHLDVGH 
Sbjct: 334  KEVPTLNRGEESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 393

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            WHNHI QPF+P VHQARN P++GG N++LSQPLPMSLDWPPMVRS  G+AP++TCNYDSG
Sbjct: 394  WHNHIHQPFLPTVHQARNSPVEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 453

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S         +  K MQ+  KT DD+   SGD +D  + T++QE  DE ++HW+SEEE
Sbjct: 454  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTDATSSQELMDEYENHWISEEE 513

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
             EVHAVSGIDYNQ+FGGGVMYW+ SDHPG GFSRP         W WHEA++  AVDDMV
Sbjct: 514  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 573

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYST GLTSPTAASFC  FDPL PGHQ   YV+ GNEVPGK + SS+  TDAA EE
Sbjct: 574  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAMLSST-MTDAAAEE 632

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            +VSGSLAS S D +    DS           PN SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 633  DVSGSLASLSSDAEGKTGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 692

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
             PRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+WGVRGWYHDGT 
Sbjct: 693  HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVRGWYHDGTN 752

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 753  LEEACVR 759


>ref|XP_011033752.1| PREDICTED: uncharacterized protein LOC105132110 isoform X3 [Populus
            euphratica]
          Length = 1439

 Score =  612 bits (1577), Expect = e-172
 Identities = 345/727 (47%), Positives = 431/727 (59%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            +LEL GEG  K S N   E                 + N  P  ++DE S  K  + +  
Sbjct: 413  RLELLGEGTSKSSANKPSEKLGGGSRRKKGRTHNMKKPNPAPVKSVDESSFKKLAEDIKC 472

Query: 1964 ALPDAEKGDSMESDKVPDMP-KKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            A    +K + MES+++P +P + +  ++ S+S +EM     +H Q LV            
Sbjct: 473  APTCIKKTELMESNEMPGIPHENENHRDISSSTVEM-----EHTQGLVHEKKRTAGRKNR 527

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSIIMQAGVAKY------DNLSIQNVSADNSTH 1626
                     + SNPV V+  ++ VS+  S  + +   +       DNL+ Q  S D+   
Sbjct: 528  KGRNKKKKSSFSNPVEVRKPEIAVSEACSFSVYSSDEEAKLCGLSDNLATQKASNDSLID 587

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHS------ECHQFSNGMIDYQTKP 1464
                        S + P RE I      ED  +           E ++ SNG +D ++ P
Sbjct: 588  -----------PSINEPTREEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMP 636

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
              +ET     + NII    + ++L     N G     SF   KT   S + +K V  L+V
Sbjct: 637  SRRETCCGASQ-NIIYQVATTKELITVSSNEGT----SFLNKKTEVKSDVGNKLVRTLDV 691

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SH  PAT RLHLDVGH 
Sbjct: 692  KEVPTLNRGEESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 751

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            WHNHI QPF+P VHQARN P++GG N++LSQPLPMSLDWPPMVRS  G+AP++TCNYDSG
Sbjct: 752  WHNHIHQPFLPTVHQARNSPVEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 811

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S         +  K MQ+  KT DD+   SGD +D  + T++QE  DE ++HW+SEEE
Sbjct: 812  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTDATSSQELMDEYENHWISEEE 871

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
             EVHAVSGIDYNQ+FGGGVMYW+ SDHPG GFSRP         W WHEA++  AVDDMV
Sbjct: 872  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 931

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYST GLTSPTAASFC  FDPL PGHQ   YV+ GNEVPGK + SS+  TDAA EE
Sbjct: 932  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAMLSST-MTDAAAEE 990

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            +VSGSLAS S D +    DS           PN SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 991  DVSGSLASLSSDAEGKTGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1050

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
             PRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+WGVRGWYHDGT 
Sbjct: 1051 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVRGWYHDGTN 1110

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 1111 LEEACVR 1117


>ref|XP_011033743.1| PREDICTED: uncharacterized protein LOC105132110 isoform X2 [Populus
            euphratica]
          Length = 1461

 Score =  612 bits (1577), Expect = e-172
 Identities = 345/727 (47%), Positives = 431/727 (59%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            +LEL GEG  K S N   E                 + N  P  ++DE S  K  + +  
Sbjct: 413  RLELLGEGTSKSSANKPSEKLGGGSRRKKGRTHNMKKPNPAPVKSVDESSFKKLAEDIKC 472

Query: 1964 ALPDAEKGDSMESDKVPDMP-KKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            A    +K + MES+++P +P + +  ++ S+S +EM     +H Q LV            
Sbjct: 473  APTCIKKTELMESNEMPGIPHENENHRDISSSTVEM-----EHTQGLVHEKKRTAGRKNR 527

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSIIMQAGVAKY------DNLSIQNVSADNSTH 1626
                     + SNPV V+  ++ VS+  S  + +   +       DNL+ Q  S D+   
Sbjct: 528  KGRNKKKKSSFSNPVEVRKPEIAVSEACSFSVYSSDEEAKLCGLSDNLATQKASNDSLID 587

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHS------ECHQFSNGMIDYQTKP 1464
                        S + P RE I      ED  +           E ++ SNG +D ++ P
Sbjct: 588  -----------PSINEPTREEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMP 636

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
              +ET     + NII    + ++L     N G     SF   KT   S + +K V  L+V
Sbjct: 637  SRRETCCGASQ-NIIYQVATTKELITVSSNEGT----SFLNKKTEVKSDVGNKLVRTLDV 691

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SH  PAT RLHLDVGH 
Sbjct: 692  KEVPTLNRGEESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 751

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            WHNHI QPF+P VHQARN P++GG N++LSQPLPMSLDWPPMVRS  G+AP++TCNYDSG
Sbjct: 752  WHNHIHQPFLPTVHQARNSPVEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 811

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S         +  K MQ+  KT DD+   SGD +D  + T++QE  DE ++HW+SEEE
Sbjct: 812  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTDATSSQELMDEYENHWISEEE 871

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
             EVHAVSGIDYNQ+FGGGVMYW+ SDHPG GFSRP         W WHEA++  AVDDMV
Sbjct: 872  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 931

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYST GLTSPTAASFC  FDPL PGHQ   YV+ GNEVPGK + SS+  TDAA EE
Sbjct: 932  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAMLSST-MTDAAAEE 990

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            +VSGSLAS S D +    DS           PN SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 991  DVSGSLASLSSDAEGKTGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1050

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
             PRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+WGVRGWYHDGT 
Sbjct: 1051 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVRGWYHDGTN 1110

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 1111 LEEACVR 1117


>ref|XP_011033716.1| PREDICTED: uncharacterized protein LOC105132110 isoform X1 [Populus
            euphratica] gi|743788518|ref|XP_011033726.1| PREDICTED:
            uncharacterized protein LOC105132110 isoform X1 [Populus
            euphratica] gi|743788522|ref|XP_011033734.1| PREDICTED:
            uncharacterized protein LOC105132110 isoform X1 [Populus
            euphratica]
          Length = 1570

 Score =  612 bits (1577), Expect = e-172
 Identities = 345/727 (47%), Positives = 431/727 (59%), Gaps = 13/727 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            +LEL GEG  K S N   E                 + N  P  ++DE S  K  + +  
Sbjct: 413  RLELLGEGTSKSSANKPSEKLGGGSRRKKGRTHNMKKPNPAPVKSVDESSFKKLAEDIKC 472

Query: 1964 ALPDAEKGDSMESDKVPDMP-KKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
            A    +K + MES+++P +P + +  ++ S+S +EM     +H Q LV            
Sbjct: 473  APTCIKKTELMESNEMPGIPHENENHRDISSSTVEM-----EHTQGLVHEKKRTAGRKNR 527

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTSSIIMQAGVAKY------DNLSIQNVSADNSTH 1626
                     + SNPV V+  ++ VS+  S  + +   +       DNL+ Q  S D+   
Sbjct: 528  KGRNKKKKSSFSNPVEVRKPEIAVSEACSFSVYSSDEEAKLCGLSDNLATQKASNDSLID 587

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHS------ECHQFSNGMIDYQTKP 1464
                        S + P RE I      ED  +           E ++ SNG +D ++ P
Sbjct: 588  -----------PSINEPTREEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMP 636

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEV 1284
              +ET     + NII    + ++L     N G     SF   KT   S + +K V  L+V
Sbjct: 637  SRRETCCGASQ-NIIYQVATTKELITVSSNEGT----SFLNKKTEVKSDVGNKLVRTLDV 691

Query: 1283 KKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVGHI 1104
            K+   + + + +E+F  +  K   +C SYEWP++ PVYFPSI+SH  PAT RLHLDVGH 
Sbjct: 692  KEVPTLNRGEESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 751

Query: 1103 WHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSG 924
            WHNHI QPF+P VHQARN P++GG N++LSQPLPMSLDWPPMVRS  G+AP++TCNYDSG
Sbjct: 752  WHNHIHQPFLPTVHQARNSPVEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 811

Query: 923  FMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEE 744
            F+S         +  K MQ+  KT DD+   SGD +D  + T++QE  DE ++HW+SEEE
Sbjct: 812  FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTDATSSQELMDEYENHWISEEE 871

Query: 743  LEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMV 564
             EVHAVSGIDYNQ+FGGGVMYW+ SDHPG GFSRP         W WHEA++  AVDDMV
Sbjct: 872  YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 931

Query: 563  AFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEE 384
            AFSSSYST GLTSPTAASFC  FDPL PGHQ   YV+ GNEVPGK + SS+  TDAA EE
Sbjct: 932  AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAMLSST-MTDAAAEE 990

Query: 383  EVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRRE 204
            +VSGSLAS S D +    DS           PN SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 991  DVSGSLASLSSDAEGKTGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1050

Query: 203  QPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTT 24
             PRIKRPPSPVVLCV              SRKHRGFPTVRSGSSSPR+WGVRGWYHDGT 
Sbjct: 1051 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVRGWYHDGTN 1110

Query: 23   SEEACVR 3
             EEACVR
Sbjct: 1111 LEEACVR 1117


>emb|CAN65347.1| hypothetical protein VITISV_000637 [Vitis vinifera]
          Length = 1500

 Score =  611 bits (1576), Expect = e-172
 Identities = 343/721 (47%), Positives = 426/721 (59%), Gaps = 8/721 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGN K  PN SKE                 + N +P+S  D+    KP K    
Sbjct: 439  KLELLGEGNLKSPPNKSKEKLGTGXRKKRGKTRNMKKLNPVPRSCGDBSKSLKPLKDHGC 498

Query: 1964 ALPDAEKGDSMESDKVP-DMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
             L  A+  D +ES+++  ++ + D+  E+S+S +EM          +  G          
Sbjct: 499  RLAYAKCVDFVESNRMAGELQQSDLRMEASSSVVEME-------NDMFSGKVQNAARKSR 551

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTS--SIIMQAGVAKY----DNLSIQNVSADNSTH 1626
                     +  +PV V+D +   ++ S  S+I Q+  +K     D+   +NV  D S  
Sbjct: 552  KERNKNRIYSLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIG 611

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETT 1446
             + F S+P  C   + P R   T QS +ED V+ S+                        
Sbjct: 612  CDKFISSP--CKPTNGPSRAETTAQSIREDPVVSSI------------------------ 645

Query: 1445 DSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAV 1266
                            +++       I F  S H S+T +   + DK ++  E+++E   
Sbjct: 646  ----------------EVDVAFSGEDIKFQNSEHLSETDT-KCVSDKPIKATELEEEIVQ 688

Query: 1265 IQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPA-TDRLHLDVGHIWHNHI 1089
             Q+Q    F  T   SS EC SYEWP +AP++F SI+S  LPA TDRLHLDVG  WHNH 
Sbjct: 689  NQEQERGKFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHF 748

Query: 1088 RQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSAR 909
             Q FVP++HQ RNPP+D GC+QILS+PLPMSLDWPPMVRS+S +APS+TCNYD GF+S  
Sbjct: 749  HQSFVPSIHQTRNPPLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRM 808

Query: 908  XXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHA 729
                   F    +Q N  TS+D+  YSGD MDL +LT  QE  DECDSHW+SEEE E+HA
Sbjct: 809  QSSFRQGFPAHNVQVNTATSEDERKYSGDLMDLSDLTNVQELADECDSHWISEEEFELHA 868

Query: 728  VSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSS 549
            VSG+DY+QYFGGGVMYWN+SDHPG+GFSRP        SWAWHEAD+  AVDDMVAFSSS
Sbjct: 869  VSGLDYSQYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSS 928

Query: 548  YSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGS 369
            YSTNGL SPTAASFC PFDPLG GHQ   YV+ GNE PGKVLHSSS + DA  EE+VSGS
Sbjct: 929  YSTNGLASPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGS 988

Query: 368  LASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIK 189
            LA+   DV+    D            PN SRERSRS+FKR+ + KSPCVPP+RREQPRIK
Sbjct: 989  LANLPVDVEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIK 1048

Query: 188  RPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEAC 9
            RPPSPVVLCV              SRK+RGFPTVRSGSSSPR+WG+RGWYHDG+  EEAC
Sbjct: 1049 RPPSPVVLCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEAC 1108

Query: 8    V 6
            V
Sbjct: 1109 V 1109


>ref|XP_010661312.1| PREDICTED: uncharacterized protein LOC100265029 isoform X1 [Vitis
            vinifera] gi|731420233|ref|XP_010661313.1| PREDICTED:
            uncharacterized protein LOC100265029 isoform X1 [Vitis
            vinifera] gi|731420235|ref|XP_010661314.1| PREDICTED:
            uncharacterized protein LOC100265029 isoform X1 [Vitis
            vinifera]
          Length = 1571

 Score =  609 bits (1570), Expect = e-171
 Identities = 341/721 (47%), Positives = 424/721 (58%), Gaps = 8/721 (1%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGN K  PN SKE                 + N +P+S  D+    KP K    
Sbjct: 452  KLELLGEGNLKSPPNKSKEKLGTGGRKKRGRTRNMKKLNPVPRSCGDDSKSLKPLKDHGC 511

Query: 1964 ALPDAEKGDSMESDKVP-DMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXX 1788
             L  A+  D +ES+++  ++ + D+  E+S+S +EM          +  G          
Sbjct: 512  GLAYAKCVDFVESNRMAGELQQSDLHMEASSSVVEME-------NDMFSGKVQNAARKSR 564

Query: 1787 XXXXXXXNCTCSNPVPVKDSKLPVSKTS--SIIMQAGVAKY----DNLSIQNVSADNSTH 1626
                     +  +PV V+D +   ++ S  S+I Q+  +K     D+   +NV  D S  
Sbjct: 565  KERNKNRIYSLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIG 624

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECHQFSNGMIDYQTKPFLQETT 1446
             + F S+P  C   + P R   T QS +ED V+ S+                        
Sbjct: 625  CDKFISSP--CKPTNGPSRAETTAQSIREDPVVSSI------------------------ 658

Query: 1445 DSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEVLEVKKESAV 1266
                            +++       I F  S H S+T +   + DK ++  E+++E   
Sbjct: 659  ----------------EVDVAFSGEDIKFQNSEHLSETDT-KCVSDKPIKATELEEEIVQ 701

Query: 1265 IQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPA-TDRLHLDVGHIWHNHI 1089
             Q+Q    F  T   SS EC SYEWP +AP++F SI+S  LPA TDRLHLDVG  WHNH 
Sbjct: 702  NQEQERGKFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHF 761

Query: 1088 RQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYDSGFMSAR 909
             Q FVP++HQ RNP +D GC+QILS+PLPMSLDWPPMVRS+S +APS+TCNYD GF+S  
Sbjct: 762  HQSFVPSIHQTRNPSLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRM 821

Query: 908  XXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSEEELEVHA 729
                   F    +Q N  TS+D+  YSGD MDL +LT  QE  DECDSHW+SEEE E+HA
Sbjct: 822  QSSFRQGFPAHNVQVNTATSEDERKYSGDLMDLSDLTNVQELADECDSHWISEEEFELHA 881

Query: 728  VSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDDMVAFSSS 549
            VSG+DY+QYFGGGVMYWN+SDHPG+GFSRP        SWAWHEAD+  AVDDMVAFSSS
Sbjct: 882  VSGLDYSQYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSS 941

Query: 548  YSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAATEEEVSGS 369
            YSTNGL SPTAASFC PFDPLG GHQ   YV+ GNE PGKVLHSSS + DA  EE+VSGS
Sbjct: 942  YSTNGLASPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGS 1001

Query: 368  LASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSRREQPRIK 189
            LA+   DV+    D            PN SRERSRS+FKR+ + KSPCVPP+RREQPRIK
Sbjct: 1002 LANLPVDVEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIK 1061

Query: 188  RPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDGTTSEEAC 9
            RPPSPVVLCV              SRK+RGFPTVRSGSSSPR+WG+RGWYHDG+  EEAC
Sbjct: 1062 RPPSPVVLCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEAC 1121

Query: 8    V 6
            V
Sbjct: 1122 V 1122


>gb|KJB41060.1| hypothetical protein B456_007G088700 [Gossypium raimondii]
            gi|763773938|gb|KJB41061.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
            gi|763773940|gb|KJB41063.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
            gi|763773941|gb|KJB41064.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
            gi|763773942|gb|KJB41065.1| hypothetical protein
            B456_007G088700 [Gossypium raimondii]
          Length = 1541

 Score =  607 bits (1565), Expect = e-170
 Identities = 352/728 (48%), Positives = 424/728 (58%), Gaps = 15/728 (2%)
 Frame = -3

Query: 2144 KLELFGEGNFKYSPNTSKENTXXXXXXXXXXXXXXXRQNTLPKSALDELSLDKPPKVLVD 1965
            KLEL GEGNF  S + SK+                  QN + K  +D+    KP KV   
Sbjct: 422  KLELLGEGNFNSSSDKSKDQFSASSRKKKVKSRNIKNQNPVLKMEMDDHPPQKPLKVQTQ 481

Query: 1964 ALPDAEKGDSMESDKVPDMPKKDIPKESSTSEMEMAVCHQKHAQALVVGXXXXXXXXXXX 1785
            +     KG +       +  KK     + T+E                            
Sbjct: 482  SGVGG-KGQAAARKSRKEKNKKKRSYINDTTE---------------------------- 512

Query: 1784 XXXXXXNCTCSNPVPVKDSKLPVSKTSSI--IMQAGVAK----YDNLSIQN-VSADNSTH 1626
                           VK SK  V+ +SS+  + Q    K     DNLS+++ V  D  +H
Sbjct: 513  ---------------VKSSKKAVTGSSSLSFVSQDEATKSNGVLDNLSVEHSVPTDTISH 557

Query: 1625 FNVFASNPSSCTSASAPEREGITTQSTQEDDVIDSLHSECH------QFSNGMIDYQTKP 1464
             N+     S     +   +E I      +D  + S +  CH      Q S  +   +  P
Sbjct: 558  TNILEPISSPTEPDNQLFKEDIALHV--QDHEVGSTNGFCHKGTGHQQDSKDISANEIIP 615

Query: 1463 FLQETTDSKVECNIIPPDMSARDLNNTLGNNGIHFGKSFHESKTGSISFLPDKGVEV--L 1290
              QE+++ K ECN++PP               +  G+  +E     I      GV V  L
Sbjct: 616  TRQESSNYKRECNVLPPIAPKP--------GSVFIGEGINEHSASKIQENSPSGVSVNAL 667

Query: 1289 EVKKESAVIQDQRNESFFGTALKSSLECTSYEWPNIAPVYFPSISSHFLPATDRLHLDVG 1110
            ++K+  +VIQ Q ++ F+ TA   + +C SYEWP++AP YFPSI+SH   ATDRLHLDVG
Sbjct: 668  DIKEGVSVIQVQ-DKKFYNTA--PTPQCLSYEWPSVAPFYFPSINSHVPAATDRLHLDVG 724

Query: 1109 HIWHNHIRQPFVPNVHQARNPPIDGGCNQILSQPLPMSLDWPPMVRSVSGIAPSVTCNYD 930
            H WHNHIRQPFVP +HQARNP I+ GCN+ILS+P+PMSLDWPPMVRS SG+APSVT NYD
Sbjct: 725  HNWHNHIRQPFVPTMHQARNPSIESGCNRILSRPMPMSLDWPPMVRSASGLAPSVTYNYD 784

Query: 929  SGFMSARXXXXXXXFATKGMQFNVKTSDDDGNYSGDFMDLPELTTTQEPGDECDSHWLSE 750
            SGF+S R       FA++  QFN+K+ +DD  YSGDF DLP+   T E  DE DSH++SE
Sbjct: 785  SGFISRRQTAFQQSFASQNFQFNMKSFEDDRKYSGDFFDLPDPANTSELADEYDSHYISE 844

Query: 749  EELEVHAVSGIDYNQYFGGGVMYWNTSDHPGAGFSRPTXXXXXXXSWAWHEADIKSAVDD 570
            EE EVHAVSGIDYNQYFGGGVMYWN SD PG GFSRP        SWAW EAD+  AVDD
Sbjct: 845  EEFEVHAVSGIDYNQYFGGGVMYWNPSDLPGTGFSRPPSLSSDDSSWAWREADMNRAVDD 904

Query: 569  MVAFSSSYSTNGLTSPTAASFCPPFDPLGPGHQTFSYVVPGNEVPGKVLHSSSKTTDAAT 390
            MVAFSSSYSTNGLTSPTA  FC PFDPLGPGHQ  SYVVPGNEV  KVLHS+S T DAAT
Sbjct: 905  MVAFSSSYSTNGLTSPTATPFCSPFDPLGPGHQAVSYVVPGNEVSSKVLHSASATPDAAT 964

Query: 389  EEEVSGSLASFSGDVDAMAVDSXXXXXXXXXXXPNFSRERSRSDFKRSHNHKSPCVPPSR 210
            EEE SGS  + S DVDA   DS           PN SRERS+SDFKR H+HKSP V P+R
Sbjct: 965  EEEASGSFTNLSSDVDAKTGDSLPYPILRPIIIPNISRERSKSDFKRGHDHKSPRVAPTR 1024

Query: 209  REQPRIKRPPSPVVLCVXXXXXXXXXXXXXXSRKHRGFPTVRSGSSSPRNWGVRGWYHDG 30
            REQPRI+RPPSPVVLCV              SRK RGFPTVRSGSSSPR+WG+RG Y+DG
Sbjct: 1025 REQPRIRRPPSPVVLCVPRAPRPPPPSPVSDSRKQRGFPTVRSGSSSPRHWGMRGLYYDG 1084

Query: 29   TTSEEACV 6
            T SE+ACV
Sbjct: 1085 TNSEDACV 1092

