BLASTX nr result

ID: Zanthoxylum22_contig00003235 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00003235
         (970 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KDO79651.1| hypothetical protein CISIN_1g000234mg [Citrus sin...   475   e-131
gb|KDO79650.1| hypothetical protein CISIN_1g000234mg [Citrus sin...   475   e-131
ref|XP_006476180.1| PREDICTED: uncharacterized protein LOC102626...   475   e-131
ref|XP_006476179.1| PREDICTED: uncharacterized protein LOC102626...   475   e-131
ref|XP_006450577.1| hypothetical protein CICLE_v100072353mg, par...   475   e-131
ref|XP_007013731.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_007013730.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_007013729.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_007013727.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_012462722.1| PREDICTED: uncharacterized protein LOC105782...   338   5e-90
ref|XP_010109047.1| hypothetical protein L484_007381 [Morus nota...   330   8e-88
ref|XP_008219843.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   330   1e-87
ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Popu...   326   2e-86
ref|XP_011026533.1| PREDICTED: uncharacterized protein LOC105127...   325   3e-86
ref|XP_012078606.1| PREDICTED: uncharacterized protein LOC105639...   325   3e-86
gb|KHG16466.1| DNA mismatch repair Msh6-1 -like protein [Gossypi...   323   2e-85
ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus c...   320   1e-84
ref|XP_012463288.1| PREDICTED: uncharacterized protein LOC105782...   312   3e-82
ref|XP_012463287.1| PREDICTED: uncharacterized protein LOC105782...   312   3e-82
ref|XP_008394009.1| PREDICTED: uncharacterized protein LOC103456...   310   1e-81

>gb|KDO79651.1| hypothetical protein CISIN_1g000234mg [Citrus sinensis]
          Length = 1579

 Score =  475 bits (1223), Expect = e-131
 Identities = 240/321 (74%), Positives = 266/321 (82%), Gaps = 1/321 (0%)
 Frame = +3

Query: 9    ESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWT 188
            ESTYE N  +C LE NMS S L  N+ VM K+AAS  C P A  +LE VSSSVCGD +WT
Sbjct: 1027 ESTYENNVPQCTLELNMSKS-LDYNMMVMSKDAASHECSPAATSKLEAVSSSVCGDESWT 1085

Query: 189  RTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPS-GDCDK 365
            R+PQ  RN   NVAGTSA+S+EPE+IG E+IVPLQKLQ HDPKSE+C LLPRPS GDCDK
Sbjct: 1086 RSPQICRNSSTNVAGTSASSQEPEQIGNEAIVPLQKLQYHDPKSEQCVLLPRPSSGDCDK 1145

Query: 366  TDTAAYYSPLNGTRVEIPTFNQFGKHDREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTW 545
            TDTA Y SPLN  RVEIPTF+QF KHDREYHS Q + DLNWNMNGG +PS NPTAPRST 
Sbjct: 1146 TDTA-YNSPLNSIRVEIPTFDQFEKHDREYHSVQCTTDLNWNMNGGIVPSLNPTAPRSTG 1204

Query: 546  HRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQK 725
            HRNRS SSFGYLA GWS  K D+AH++FG+ PKKPRTQVSYSLPFGG+ +SPK RVNHQK
Sbjct: 1205 HRNRSSSSFGYLAHGWSVEKADVAHSSFGSAPKKPRTQVSYSLPFGGY-YSPKNRVNHQK 1263

Query: 726  GLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNE 905
            GLP  RIRR++EKRLSDVSR S KNLELL CDAN+LI  GDKGWRECGAQI LELFEHNE
Sbjct: 1264 GLPHMRIRRANEKRLSDVSRVSKKNLELLPCDANVLIVHGDKGWRECGAQIALELFEHNE 1323

Query: 906  WKLAVKLAGTTRYSHKAHQFL 968
            WKLAVKL+GTTR+S+KAHQFL
Sbjct: 1324 WKLAVKLSGTTRFSYKAHQFL 1344


>gb|KDO79650.1| hypothetical protein CISIN_1g000234mg [Citrus sinensis]
          Length = 1816

 Score =  475 bits (1223), Expect = e-131
 Identities = 240/321 (74%), Positives = 266/321 (82%), Gaps = 1/321 (0%)
 Frame = +3

Query: 9    ESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWT 188
            ESTYE N  +C LE NMS S L  N+ VM K+AAS  C P A  +LE VSSSVCGD +WT
Sbjct: 1027 ESTYENNVPQCTLELNMSKS-LDYNMMVMSKDAASHECSPAATSKLEAVSSSVCGDESWT 1085

Query: 189  RTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPS-GDCDK 365
            R+PQ  RN   NVAGTSA+S+EPE+IG E+IVPLQKLQ HDPKSE+C LLPRPS GDCDK
Sbjct: 1086 RSPQICRNSSTNVAGTSASSQEPEQIGNEAIVPLQKLQYHDPKSEQCVLLPRPSSGDCDK 1145

Query: 366  TDTAAYYSPLNGTRVEIPTFNQFGKHDREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTW 545
            TDTA Y SPLN  RVEIPTF+QF KHDREYHS Q + DLNWNMNGG +PS NPTAPRST 
Sbjct: 1146 TDTA-YNSPLNSIRVEIPTFDQFEKHDREYHSVQCTTDLNWNMNGGIVPSLNPTAPRSTG 1204

Query: 546  HRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQK 725
            HRNRS SSFGYLA GWS  K D+AH++FG+ PKKPRTQVSYSLPFGG+ +SPK RVNHQK
Sbjct: 1205 HRNRSSSSFGYLAHGWSVEKADVAHSSFGSAPKKPRTQVSYSLPFGGY-YSPKNRVNHQK 1263

Query: 726  GLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNE 905
            GLP  RIRR++EKRLSDVSR S KNLELL CDAN+LI  GDKGWRECGAQI LELFEHNE
Sbjct: 1264 GLPHMRIRRANEKRLSDVSRVSKKNLELLPCDANVLIVHGDKGWRECGAQIALELFEHNE 1323

Query: 906  WKLAVKLAGTTRYSHKAHQFL 968
            WKLAVKL+GTTR+S+KAHQFL
Sbjct: 1324 WKLAVKLSGTTRFSYKAHQFL 1344


>ref|XP_006476180.1| PREDICTED: uncharacterized protein LOC102626885 isoform X2 [Citrus
            sinensis]
          Length = 1813

 Score =  475 bits (1223), Expect = e-131
 Identities = 240/321 (74%), Positives = 266/321 (82%), Gaps = 1/321 (0%)
 Frame = +3

Query: 9    ESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWT 188
            ESTYE N  +C LE NMS S L  N+ VM K+AAS  C P A  +LE VSSSVCGD +WT
Sbjct: 1024 ESTYENNVPQCTLELNMSKS-LDYNMMVMSKDAASHECSPAATSKLEAVSSSVCGDESWT 1082

Query: 189  RTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPS-GDCDK 365
            R+PQ  RN   NVAGTSA+S+EPE+IG E+IVPLQKLQ HDPKSE+C LLPRPS GDCDK
Sbjct: 1083 RSPQICRNSSTNVAGTSASSQEPEQIGNEAIVPLQKLQYHDPKSEQCVLLPRPSSGDCDK 1142

Query: 366  TDTAAYYSPLNGTRVEIPTFNQFGKHDREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTW 545
            TDTA Y SPLN  RVEIPTF+QF KHDREYHS Q + DLNWNMNGG +PS NPTAPRST 
Sbjct: 1143 TDTA-YNSPLNSIRVEIPTFDQFEKHDREYHSVQCTTDLNWNMNGGIVPSLNPTAPRSTG 1201

Query: 546  HRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQK 725
            HRNRS SSFGYLA GWS  K D+AH++FG+ PKKPRTQVSYSLPFGG+ +SPK RVNHQK
Sbjct: 1202 HRNRSSSSFGYLAHGWSVEKADVAHSSFGSAPKKPRTQVSYSLPFGGY-YSPKNRVNHQK 1260

Query: 726  GLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNE 905
            GLP  RIRR++EKRLSDVSR S KNLELL CDAN+LI  GDKGWRECGAQI LELFEHNE
Sbjct: 1261 GLPHMRIRRANEKRLSDVSRVSKKNLELLPCDANVLIVHGDKGWRECGAQIALELFEHNE 1320

Query: 906  WKLAVKLAGTTRYSHKAHQFL 968
            WKLAVKL+GTTR+S+KAHQFL
Sbjct: 1321 WKLAVKLSGTTRFSYKAHQFL 1341


>ref|XP_006476179.1| PREDICTED: uncharacterized protein LOC102626885 isoform X1 [Citrus
            sinensis]
          Length = 1816

 Score =  475 bits (1223), Expect = e-131
 Identities = 240/321 (74%), Positives = 266/321 (82%), Gaps = 1/321 (0%)
 Frame = +3

Query: 9    ESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWT 188
            ESTYE N  +C LE NMS S L  N+ VM K+AAS  C P A  +LE VSSSVCGD +WT
Sbjct: 1027 ESTYENNVPQCTLELNMSKS-LDYNMMVMSKDAASHECSPAATSKLEAVSSSVCGDESWT 1085

Query: 189  RTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPS-GDCDK 365
            R+PQ  RN   NVAGTSA+S+EPE+IG E+IVPLQKLQ HDPKSE+C LLPRPS GDCDK
Sbjct: 1086 RSPQICRNSSTNVAGTSASSQEPEQIGNEAIVPLQKLQYHDPKSEQCVLLPRPSSGDCDK 1145

Query: 366  TDTAAYYSPLNGTRVEIPTFNQFGKHDREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTW 545
            TDTA Y SPLN  RVEIPTF+QF KHDREYHS Q + DLNWNMNGG +PS NPTAPRST 
Sbjct: 1146 TDTA-YNSPLNSIRVEIPTFDQFEKHDREYHSVQCTTDLNWNMNGGIVPSLNPTAPRSTG 1204

Query: 546  HRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQK 725
            HRNRS SSFGYLA GWS  K D+AH++FG+ PKKPRTQVSYSLPFGG+ +SPK RVNHQK
Sbjct: 1205 HRNRSSSSFGYLAHGWSVEKADVAHSSFGSAPKKPRTQVSYSLPFGGY-YSPKNRVNHQK 1263

Query: 726  GLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNE 905
            GLP  RIRR++EKRLSDVSR S KNLELL CDAN+LI  GDKGWRECGAQI LELFEHNE
Sbjct: 1264 GLPHMRIRRANEKRLSDVSRVSKKNLELLPCDANVLIVHGDKGWRECGAQIALELFEHNE 1323

Query: 906  WKLAVKLAGTTRYSHKAHQFL 968
            WKLAVKL+GTTR+S+KAHQFL
Sbjct: 1324 WKLAVKLSGTTRFSYKAHQFL 1344


>ref|XP_006450577.1| hypothetical protein CICLE_v100072353mg, partial [Citrus clementina]
            gi|557553803|gb|ESR63817.1| hypothetical protein
            CICLE_v100072353mg, partial [Citrus clementina]
          Length = 595

 Score =  475 bits (1223), Expect = e-131
 Identities = 240/321 (74%), Positives = 266/321 (82%), Gaps = 1/321 (0%)
 Frame = +3

Query: 9    ESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWT 188
            ESTYE N  +C LE NMS S L  N+ VM K+AAS  C P A  +LE VSSSVCGD +WT
Sbjct: 87   ESTYENNVPQCTLELNMSKS-LDYNMMVMSKDAASHECSPAATSKLEAVSSSVCGDESWT 145

Query: 189  RTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPS-GDCDK 365
            R+PQ  RN   NVAGTSA+S+EPE+IG E+IVPLQKLQ HDPKSE+C LLPRPS GDCDK
Sbjct: 146  RSPQICRNSSTNVAGTSASSQEPEQIGNEAIVPLQKLQYHDPKSEQCVLLPRPSSGDCDK 205

Query: 366  TDTAAYYSPLNGTRVEIPTFNQFGKHDREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTW 545
            TDTA Y SPLN  RVEIPTF+QF KHDREYHS Q + DLNWNMNGG +PS NPTAPRST 
Sbjct: 206  TDTA-YNSPLNSIRVEIPTFDQFEKHDREYHSVQCTTDLNWNMNGGIVPSLNPTAPRSTG 264

Query: 546  HRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQK 725
            HRNRS SSFGYLA GWS  K D+AH++FG+ PKKPRTQVSYSLPFGG+ +SPK RVNHQK
Sbjct: 265  HRNRSSSSFGYLAHGWSVEKADVAHSSFGSAPKKPRTQVSYSLPFGGY-YSPKNRVNHQK 323

Query: 726  GLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNE 905
            GLP  RIRR++EKRLSDVSR S KNLELL CDAN+LI  GDKGWRECGAQI LELFEHNE
Sbjct: 324  GLPHMRIRRANEKRLSDVSRVSKKNLELLPCDANVLIVHGDKGWRECGAQIALELFEHNE 383

Query: 906  WKLAVKLAGTTRYSHKAHQFL 968
            WKLAVKL+GTTR+S+KAHQFL
Sbjct: 384  WKLAVKLSGTTRFSYKAHQFL 404


>ref|XP_007013731.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 5 [Theobroma cacao] gi|508784094|gb|EOY31350.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 5 [Theobroma cacao]
          Length = 1522

 Score =  372 bits (955), Expect = e-100
 Identities = 190/316 (60%), Positives = 229/316 (72%), Gaps = 2/316 (0%)
 Frame = +3

Query: 27   NASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTY 206
            N  +C+ +     S++  N+K   K+AASD        EL  +  SVCGD +W ++ Q Y
Sbjct: 919  NREDCV-DKRFDSSSVEKNLKASSKDAASDT-------ELTTLDLSVCGDEHWKKSSQKY 970

Query: 207  RNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYY 386
             NGD  + GT A+S EPEE+G  +IVPLQK QC   +SE+     +   D D+ + A   
Sbjct: 971  ENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSESEQLVSSSKSLVDGDRNN-AGSN 1029

Query: 387  SPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSI 563
            S LN  RVEIP+F+Q+  H D E    Q+S+DL WNMNGG IPSPNPTAPRSTWHRNRS 
Sbjct: 1030 SVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSS 1089

Query: 564  SS-FGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCT 740
            SS  GY A GWS+GK D  HNNFGN PKKPRTQVSYS+PFGG D+S K + +HQ+G P  
Sbjct: 1090 SSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHK 1149

Query: 741  RIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAV 920
            RIRR++EKR SDVSRGS KNLELLSCDANLLITLGD+GWRECGAQ+ LELF+HNEWKLAV
Sbjct: 1150 RIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWRECGAQVALELFDHNEWKLAV 1209

Query: 921  KLAGTTRYSHKAHQFL 968
            K++G+TRYSHKAHQFL
Sbjct: 1210 KVSGSTRYSHKAHQFL 1225


>ref|XP_007013730.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 4 [Theobroma cacao] gi|508784093|gb|EOY31349.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 4 [Theobroma cacao]
          Length = 1721

 Score =  372 bits (955), Expect = e-100
 Identities = 190/316 (60%), Positives = 229/316 (72%), Gaps = 2/316 (0%)
 Frame = +3

Query: 27   NASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTY 206
            N  +C+ +     S++  N+K   K+AASD        EL  +  SVCGD +W ++ Q Y
Sbjct: 919  NREDCV-DKRFDSSSVEKNLKASSKDAASDT-------ELTTLDLSVCGDEHWKKSSQKY 970

Query: 207  RNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYY 386
             NGD  + GT A+S EPEE+G  +IVPLQK QC   +SE+     +   D D+ + A   
Sbjct: 971  ENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSESEQLVSSSKSLVDGDRNN-AGSN 1029

Query: 387  SPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSI 563
            S LN  RVEIP+F+Q+  H D E    Q+S+DL WNMNGG IPSPNPTAPRSTWHRNRS 
Sbjct: 1030 SVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSS 1089

Query: 564  SS-FGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCT 740
            SS  GY A GWS+GK D  HNNFGN PKKPRTQVSYS+PFGG D+S K + +HQ+G P  
Sbjct: 1090 SSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHK 1149

Query: 741  RIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAV 920
            RIRR++EKR SDVSRGS KNLELLSCDANLLITLGD+GWRECGAQ+ LELF+HNEWKLAV
Sbjct: 1150 RIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWRECGAQVALELFDHNEWKLAV 1209

Query: 921  KLAGTTRYSHKAHQFL 968
            K++G+TRYSHKAHQFL
Sbjct: 1210 KVSGSTRYSHKAHQFL 1225


>ref|XP_007013729.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 3 [Theobroma cacao] gi|508784092|gb|EOY31348.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 3 [Theobroma cacao]
          Length = 1674

 Score =  372 bits (955), Expect = e-100
 Identities = 190/316 (60%), Positives = 229/316 (72%), Gaps = 2/316 (0%)
 Frame = +3

Query: 27   NASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTY 206
            N  +C+ +     S++  N+K   K+AASD        EL  +  SVCGD +W ++ Q Y
Sbjct: 900  NREDCV-DKRFDSSSVEKNLKASSKDAASDT-------ELTTLDLSVCGDEHWKKSSQKY 951

Query: 207  RNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYY 386
             NGD  + GT A+S EPEE+G  +IVPLQK QC   +SE+     +   D D+ + A   
Sbjct: 952  ENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSESEQLVSSSKSLVDGDRNN-AGSN 1010

Query: 387  SPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSI 563
            S LN  RVEIP+F+Q+  H D E    Q+S+DL WNMNGG IPSPNPTAPRSTWHRNRS 
Sbjct: 1011 SVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSS 1070

Query: 564  SS-FGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCT 740
            SS  GY A GWS+GK D  HNNFGN PKKPRTQVSYS+PFGG D+S K + +HQ+G P  
Sbjct: 1071 SSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHK 1130

Query: 741  RIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAV 920
            RIRR++EKR SDVSRGS KNLELLSCDANLLITLGD+GWRECGAQ+ LELF+HNEWKLAV
Sbjct: 1131 RIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWRECGAQVALELFDHNEWKLAV 1190

Query: 921  KLAGTTRYSHKAHQFL 968
            K++G+TRYSHKAHQFL
Sbjct: 1191 KVSGSTRYSHKAHQFL 1206


>ref|XP_007013727.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao]
            gi|590579224|ref|XP_007013728.1| Enhancer of
            polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508784090|gb|EOY31346.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
            gi|508784091|gb|EOY31347.1| Enhancer of polycomb-like
            transcription factor protein, putative isoform 1
            [Theobroma cacao]
          Length = 1693

 Score =  372 bits (955), Expect = e-100
 Identities = 190/316 (60%), Positives = 229/316 (72%), Gaps = 2/316 (0%)
 Frame = +3

Query: 27   NASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTY 206
            N  +C+ +     S++  N+K   K+AASD        EL  +  SVCGD +W ++ Q Y
Sbjct: 919  NREDCV-DKRFDSSSVEKNLKASSKDAASDT-------ELTTLDLSVCGDEHWKKSSQKY 970

Query: 207  RNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYY 386
             NGD  + GT A+S EPEE+G  +IVPLQK QC   +SE+     +   D D+ + A   
Sbjct: 971  ENGDQTIYGTFASSHEPEEVGATAIVPLQKQQCAHSESEQLVSSSKSLVDGDRNN-AGSN 1029

Query: 387  SPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSI 563
            S LN  RVEIP+F+Q+  H D E    Q+S+DL WNMNGG IPSPNPTAPRSTWHRNRS 
Sbjct: 1030 SVLNDIRVEIPSFDQYENHIDGELPGTQQSSDLTWNMNGGIIPSPNPTAPRSTWHRNRSS 1089

Query: 564  SS-FGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCT 740
            SS  GY A GWS+GK D  HNNFGN PKKPRTQVSYS+PFGG D+S K + +HQ+G P  
Sbjct: 1090 SSSIGYNAHGWSEGKADFFHNNFGNGPKKPRTQVSYSMPFGGLDYSSKNKGHHQRGPPHK 1149

Query: 741  RIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAV 920
            RIRR++EKR SDVSRGS KNLELLSCDANLLITLGD+GWRECGAQ+ LELF+HNEWKLAV
Sbjct: 1150 RIRRANEKRSSDVSRGSQKNLELLSCDANLLITLGDRGWRECGAQVALELFDHNEWKLAV 1209

Query: 921  KLAGTTRYSHKAHQFL 968
            K++G+TRYSHKAHQFL
Sbjct: 1210 KVSGSTRYSHKAHQFL 1225


>ref|XP_012462722.1| PREDICTED: uncharacterized protein LOC105782472 [Gossypium raimondii]
            gi|763740311|gb|KJB07810.1| hypothetical protein
            B456_001G045600 [Gossypium raimondii]
          Length = 1686

 Score =  338 bits (866), Expect = 5e-90
 Identities = 178/308 (57%), Positives = 222/308 (72%), Gaps = 2/308 (0%)
 Frame = +3

Query: 51   SNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSS-SVCGDGNWTRTPQTYRNGDLNV 227
            +N S S++  N+K   KE ASD          E+ S  SVCG+G   ++ + Y+N D  V
Sbjct: 935  NNNSESSVEKNLKASSKEVASDA---------ELTSDLSVCGNGCLKKSSREYKNNDQIV 985

Query: 228  AGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTR 407
             GT A S E  E+G  + VPLQK QC + ++++  L  +   D DK +TA+  S L+G R
Sbjct: 986  DGTFAGSHE-SEVGAIAFVPLQKQQCDNSETQQFVLSSKSPFDADK-ETASSGSILSGIR 1043

Query: 408  VEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSISSFGYLA 584
            VEIP F+Q+GKH D E  S ++S DL  NMNGG IPSPNPTAPRSTWHRNRS SS G+ A
Sbjct: 1044 VEIPPFDQYGKHVDSELPSTRQSTDLTLNMNGGIIPSPNPTAPRSTWHRNRSSSSIGFHA 1103

Query: 585  QGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEK 764
            +GWSDGK D  H+NFGN PKKPRTQVSYS+P G  D+S K +   Q+ LP  RIRR++EK
Sbjct: 1104 RGWSDGKADFFHSNFGNGPKKPRTQVSYSMPLGSLDYSSKSKGLQQRVLPHKRIRRANEK 1163

Query: 765  RLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRY 944
            R SDVSRGS +NL+LLSCDAN+LIT+GD+GWRECG Q VLELF+HNEWKLAVK++G+TRY
Sbjct: 1164 RSSDVSRGSQRNLDLLSCDANVLITIGDRGWRECGVQAVLELFDHNEWKLAVKVSGSTRY 1223

Query: 945  SHKAHQFL 968
            S+KAHQFL
Sbjct: 1224 SYKAHQFL 1231


>ref|XP_010109047.1| hypothetical protein L484_007381 [Morus notabilis]
            gi|587933845|gb|EXC20799.1| hypothetical protein
            L484_007381 [Morus notabilis]
          Length = 1690

 Score =  330 bits (847), Expect = 8e-88
 Identities = 182/331 (54%), Positives = 223/331 (67%), Gaps = 9/331 (2%)
 Frame = +3

Query: 3    DSESTYEKNAS------ECMLESNMSGS--TLANNVKVMPKEAASDGCFPVAKGELEVVS 158
            DSE   E + S        M E +  GS  +L  N K +  E ASDGCF   + EL    
Sbjct: 893  DSEEHLENSCSMTADDSSSMEEYSNKGSEMSLEENTKALSGEVASDGCFSSGRPELSN-G 951

Query: 159  SSVCGDGNWTRTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLL 338
             SVC D +  +  Q   NGD   AGTSA+S   ++I  ++ V LQ  + H  +S++  LL
Sbjct: 952  LSVCCDRDQIKASQPCHNGDAIAAGTSADSPVHKKIRTDATVQLQAWKGHHSESDQSALL 1011

Query: 339  PRPSGDCDKTDTAAYYSPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPS 515
             R   D DK++  +  S +NG  VEIP FNQF K  D E H AQ++ DL+WN NG    S
Sbjct: 1012 SRSLDDRDKSEKGSQ-SFVNGLSVEIPPFNQFEKSVDGELHGAQQATDLSWNTNGAIFSS 1070

Query: 516  PNPTAPRSTWHRNRSISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDH 695
            PNPTAPRSTWHRN+  SSFG+L+ GWSDGK D  +N FGN PKKPRTQVSY LPFGGFD 
Sbjct: 1071 PNPTAPRSTWHRNKQNSSFGHLSHGWSDGKADPVYNGFGNGPKKPRTQVSYLLPFGGFDC 1130

Query: 696  SPKIRVNHQKGLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQ 875
            SPK + + QKGLP  R+R++SEKR SDVSRGS +NLELLSCD N+LIT  D+GWRECGAQ
Sbjct: 1131 SPK-QKSIQKGLPSKRLRKASEKRSSDVSRGSQRNLELLSCDVNILITATDRGWRECGAQ 1189

Query: 876  IVLELFEHNEWKLAVKLAGTTRYSHKAHQFL 968
            +VLELF+ +EWKLAVKL+G T+YS+KAHQFL
Sbjct: 1190 VVLELFDDHEWKLAVKLSGVTKYSYKAHQFL 1220


>ref|XP_008219843.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103320015
            [Prunus mume]
          Length = 1780

 Score =  330 bits (845), Expect = 1e-87
 Identities = 177/302 (58%), Positives = 223/302 (73%), Gaps = 2/302 (0%)
 Frame = +3

Query: 69   TLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTYRNGDLNVAGTSANS 248
            T  NN+K  P  A SD  F  +K E E  + +VC +G WT++ Q Y++G L+VAG+S  +
Sbjct: 1027 THENNLKAPPGNATSDHSF--SKPETET-ALAVC-NGGWTKSSQHYQDGVLSVAGSSTVT 1082

Query: 249  REPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRVEIPTFN 428
              PE+ G +++V       H P+S++C L P+     +K+DT +  S LNG  VEIP+F+
Sbjct: 1083 VVPEKTGTDAVV-------HHPESDQCSLSPKHLVGKEKSDTDSQ-SFLNGLTVEIPSFD 1134

Query: 429  QFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNR-SISSFGYLAQGWSDG 602
            +F K  D E  SAQ+  D +WNM+G  IPSPNPTAPRSTWHR+R S SSFGYL+ GWSDG
Sbjct: 1135 RFEKPVDGEVQSAQQPTDCSWNMSGSIIPSPNPTAPRSTWHRSRNSSSSFGYLSHGWSDG 1194

Query: 603  KVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEKRLSDVS 782
            K D+ HN FGN PKKPRTQVSY+LP+GGFD S K R N QKG+P  RIRR++EKRLSDVS
Sbjct: 1195 KADLFHNGFGNGPKKPRTQVSYTLPYGGFDFSSKQR-NLQKGIPPKRIRRANEKRLSDVS 1253

Query: 783  RGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRYSHKAHQ 962
            RGS +NLE LSC+AN+LI   D+GWRECGA IVLELF+HNEWKLAVK++GTT+YS+KAHQ
Sbjct: 1254 RGSQRNLEQLSCEANVLINGSDRGWRECGAHIVLELFDHNEWKLAVKISGTTKYSYKAHQ 1313

Query: 963  FL 968
            FL
Sbjct: 1314 FL 1315


>ref|XP_002324830.2| hypothetical protein POPTR_0018s01030g [Populus trichocarpa]
            gi|550317762|gb|EEF03395.2| hypothetical protein
            POPTR_0018s01030g [Populus trichocarpa]
          Length = 1722

 Score =  326 bits (835), Expect = 2e-86
 Identities = 167/301 (55%), Positives = 216/301 (71%), Gaps = 1/301 (0%)
 Frame = +3

Query: 69   TLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTYRNGDLNVAGTSANS 248
            T  N+ K + + A  DGC   AK E + V  S+C  G+W ++    ++GD+NV   SA+ 
Sbjct: 963  TPGNDFKALTRGADYDGCISCAKPESQSVDVSICSGGDWKKSLSN-QSGDVNVE-ISASY 1020

Query: 249  REPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRVEIPTFN 428
            R+  E G  +IVPLQ L+C+  +S+ C LL R S + D+T  A  ++  NG  V+IP+ N
Sbjct: 1021 RDLGESGSGAIVPLQNLECNHSESQPCDLLSRLSINKDETG-AGSHALSNGITVDIPSVN 1079

Query: 429  QFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSISSFGYLAQGWSDGK 605
            QF +H ++E    Q+S+DL+WNMNGG IPSPNPTA RSTWHRNRS     + + GWS+G+
Sbjct: 1080 QFDQHVNKELQGVQQSSDLSWNMNGGVIPSPNPTARRSTWHRNRS----SFASFGWSEGR 1135

Query: 606  VDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEKRLSDVSR 785
             D   NNFGN PKKPRTQVSY+LPFGGFD+SP+ +   QKG P  RIR ++EKR S +SR
Sbjct: 1136 ADFLQNNFGNGPKKPRTQVSYALPFGGFDYSPRNKGYQQKGFPHKRIRTATEKRTSFISR 1195

Query: 786  GSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRYSHKAHQF 965
            GS + LELLSCDAN+LIT GDKGWRECG Q+VLELF+HNEW+L VKL+GTT+YS+KAHQF
Sbjct: 1196 GSERKLELLSCDANVLITNGDKGWRECGVQVVLELFDHNEWRLGVKLSGTTKYSYKAHQF 1255

Query: 966  L 968
            L
Sbjct: 1256 L 1256


>ref|XP_011026533.1| PREDICTED: uncharacterized protein LOC105127107 [Populus euphratica]
          Length = 1726

 Score =  325 bits (833), Expect = 3e-86
 Identities = 174/324 (53%), Positives = 226/324 (69%), Gaps = 2/324 (0%)
 Frame = +3

Query: 3    DSESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGN 182
            DS ++ E     C++       T  N++K M + A  DGC   AK E + V  S+CG G+
Sbjct: 947  DSCTSIEDCCKACLV------CTPGNDLKAMTRGADYDGCMSCAKPESQSVDVSICGGGD 1000

Query: 183  WTRTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCD 362
            W ++    + GD+NV   SA+ R+  E G  +IVPLQ L+ +  +S+ C +L   S + D
Sbjct: 1001 WKKSLSN-QGGDVNVE-ISASYRDLGESGSGAIVPLQNLESNHSESQPCDML---SVNKD 1055

Query: 363  KTDTAAYYSPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRS 539
            +T  A  ++  NG  V+IP+ NQF +H ++E    Q+S+DL+WNMNGG IPSPNPTA RS
Sbjct: 1056 ET-RAGSHALSNGITVDIPSVNQFDQHVNKELQGVQQSSDLSWNMNGGVIPSPNPTARRS 1114

Query: 540  TWHRNR-SISSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVN 716
            TWHRNR S +SFG     WS+G+ D   NNFGN PKKPRTQVSY+LPFGGFD+SP+ +  
Sbjct: 1115 TWHRNRNSFASFG-----WSEGRADFLQNNFGNGPKKPRTQVSYALPFGGFDYSPRNKGY 1169

Query: 717  HQKGLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFE 896
             QKG P  RIR ++EKR SD+SRGS +NLELLSCDAN+LIT GDKGWRECG Q+VLELF+
Sbjct: 1170 QQKGFPHKRIRTATEKRTSDISRGSERNLELLSCDANVLITNGDKGWRECGVQVVLELFD 1229

Query: 897  HNEWKLAVKLAGTTRYSHKAHQFL 968
            HNEW+L VKL+GTT+YS+KAHQFL
Sbjct: 1230 HNEWRLGVKLSGTTKYSYKAHQFL 1253


>ref|XP_012078606.1| PREDICTED: uncharacterized protein LOC105639237 [Jatropha curcas]
            gi|643722525|gb|KDP32275.1| hypothetical protein
            JCGZ_13200 [Jatropha curcas]
          Length = 1714

 Score =  325 bits (833), Expect = 3e-86
 Identities = 175/324 (54%), Positives = 221/324 (68%), Gaps = 2/324 (0%)
 Frame = +3

Query: 3    DSESTYEKNASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGN 182
            +S+S  ++ +S     +  S  T  NN KV  ++A  D C    K E + +  S    G+
Sbjct: 928  NSDSLLDECSSVEDYSNKDSEITSCNNFKVSSRDANCDECLSCGKAEPQAIGISANSVGD 987

Query: 183  WTRTPQTYRNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCD 362
            W  +     N   NV G +A+S++P +   ++I   QK   H   SE+  L  +P+ D  
Sbjct: 988  WMTSSPNNFNNVANV-GAAASSKDPGKFASDAIDVPQKQSSHHSGSEQQGLSVKPAADKC 1046

Query: 363  KTDTAAYYSPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRS 539
             T +   +S LNG  VEIP  NQF KH D+E H AQ+S DL+WNMNGG IPSPNPTA RS
Sbjct: 1047 STGS---HSLLNGITVEIPPVNQFDKHVDKELHGAQQSTDLSWNMNGGIIPSPNPTARRS 1103

Query: 540  TWHRNRSIS-SFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVN 716
            TWHR+RS S SFGYLA GWSDG+ D  HNNFGN PKKPRTQVSY+LPFGGFD+ PK + +
Sbjct: 1104 TWHRSRSSSTSFGYLAHGWSDGRGDFVHNNFGNGPKKPRTQVSYALPFGGFDYCPKNKSH 1163

Query: 717  HQKGLPCTRIRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFE 896
             QK +P  RIR +SEKR  DVSRGS +NLE LSC+AN+LIT GD+GWRE GAQ+V+ELF+
Sbjct: 1164 SQKAVPHKRIRTASEKRSLDVSRGSERNLE-LSCEANVLITHGDRGWREGGAQVVVELFD 1222

Query: 897  HNEWKLAVKLAGTTRYSHKAHQFL 968
            HNEWKLAVK++GTT+YS+KAHQFL
Sbjct: 1223 HNEWKLAVKISGTTKYSYKAHQFL 1246


>gb|KHG16466.1| DNA mismatch repair Msh6-1 -like protein [Gossypium arboreum]
          Length = 1632

 Score =  323 bits (827), Expect = 2e-85
 Identities = 171/315 (54%), Positives = 212/315 (67%), Gaps = 1/315 (0%)
 Frame = +3

Query: 27   NASECMLESNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTY 206
            N  +C+ +S    S L N +K   K A+          EL  +  SV  DG W ++ Q +
Sbjct: 871  NREDCVKKS--FESCLGNFLKASSKVASVT--------ELMTLDLSVSSDGRWRKSLQKH 920

Query: 207  RNGDLNVAGTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYY 386
             N D  V G+ A   +PEE+G  +I  L+K +C   +S + FL  +    C K   ++  
Sbjct: 921  ANSDQIVNGSPAIYHKPEEVGASAIDQLEKQKCDYSESRQPFLSSKVVDGCKKGSGSS-- 978

Query: 387  SPLNGTRVEIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSI 563
            S LNG RVE+P F+Q+  H D +  S QRS DL WNMNGG IP+PNPTAPRS WHRNRS 
Sbjct: 979  SVLNGIRVELPPFDQYKVHVDSKLPSTQRSTDLTWNMNGGVIPTPNPTAPRSYWHRNRSS 1038

Query: 564  SSFGYLAQGWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTR 743
            SS GY A  WSDGK D  HNNFGN PKKPRTQVSYS+PFGG D+S K   +HQ+GLP  R
Sbjct: 1039 SSIGYHAHRWSDGKADFFHNNFGNGPKKPRTQVSYSMPFGGLDYSSKNIGDHQRGLPHKR 1098

Query: 744  IRRSSEKRLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVK 923
            IRR++EKR SDVSRGS KN+EL+SC ANLL+TLGD+GWRECGAQ+ LE  + NEWKLAVK
Sbjct: 1099 IRRANEKRSSDVSRGSQKNMELVSCHANLLLTLGDRGWRECGAQVALERIDRNEWKLAVK 1158

Query: 924  LAGTTRYSHKAHQFL 968
            ++G+TR S+KAHQFL
Sbjct: 1159 MSGSTRCSYKAHQFL 1173


>ref|XP_002516604.1| hypothetical protein RCOM_0804080 [Ricinus communis]
            gi|223544424|gb|EEF45945.1| hypothetical protein
            RCOM_0804080 [Ricinus communis]
          Length = 1705

 Score =  320 bits (819), Expect = 1e-84
 Identities = 172/303 (56%), Positives = 215/303 (70%), Gaps = 2/303 (0%)
 Frame = +3

Query: 66   STLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTYRNGDLNVAGTSAN 245
            +T  NN K   ++   + C   A  E   V  SV   G+W +    ++N D++ A TSA 
Sbjct: 946  TTPDNNSKGSSRDVDCEECLFCANTEPLAVGVSVNTVGDWMKPSPKHQNSDVH-AETSAF 1004

Query: 246  SREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRVEIPTF 425
            S++  E+GR+ I  LQK +CH  ++E+   LP+PS D          + LNG RVEIP+ 
Sbjct: 1005 SKDSGELGRD-IASLQKWRCHHSEAEQNDALPKPSVD---------RALLNGIRVEIPSS 1054

Query: 426  NQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRS-ISSFGYLAQGWSD 599
            NQF K  D++   AQ+S DL+WNMNGG IPSPNPTA RSTWHRNRS ++S GY A GWSD
Sbjct: 1055 NQFDKQVDKDLDGAQQSTDLSWNMNGGIIPSPNPTARRSTWHRNRSNLASVGYNAHGWSD 1114

Query: 600  GKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEKRLSDV 779
            G+ D   NNF N PKKPRTQVSY+LPFG FD+S K + + QKG+P  RIR ++EKR SDV
Sbjct: 1115 GRGDFLQNNFRNGPKKPRTQVSYALPFGAFDYSSKSKGHSQKGIPHKRIRTANEKRSSDV 1174

Query: 780  SRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRYSHKAH 959
            SRGS +NLELLSC+AN+LITLGDKGWRE GAQ+VLEL +HNEWKLAVKL+GTT+YS+KAH
Sbjct: 1175 SRGSERNLELLSCEANVLITLGDKGWREYGAQVVLELSDHNEWKLAVKLSGTTKYSYKAH 1234

Query: 960  QFL 968
            QFL
Sbjct: 1235 QFL 1237


>ref|XP_012463288.1| PREDICTED: uncharacterized protein LOC105782825 isoform X2 [Gossypium
            raimondii]
          Length = 1631

 Score =  312 bits (799), Expect = 3e-82
 Identities = 166/294 (56%), Positives = 205/294 (69%), Gaps = 7/294 (2%)
 Frame = +3

Query: 108  ASDGCFPVAKG------ELEVVSSSVCGDGNWTRTPQTYRNGDLNVAGTSANSREPEEIG 269
            +S G FP A        EL  +  SV  DG W +  Q + N D  V G+ A   +PEE+G
Sbjct: 881  SSLGNFPKASSKVASVTELMTLDLSVSSDGRWRKYLQKHANSDQIVNGSPAIYHKPEEVG 940

Query: 270  RESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRVEIPTFNQFGKH-D 446
              +I  L+K +C   +S++ FL  +   D DK  + +  S LNG RVE+P F+Q+  H D
Sbjct: 941  ASAIGQLEKQKCDYSESQQPFLSSKVV-DGDKKGSGSS-SVLNGIRVELPPFDQYKNHVD 998

Query: 447  REYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSISSFGYLAQGWSDGKVDIAHNN 626
             +  S Q+S DL WNMNGG IP+PNPTA RS WH+NRS  S GY A   SDGKVDI HNN
Sbjct: 999  SKLPSTQQSTDLTWNMNGGVIPTPNPTASRSYWHQNRSSLSIGYHAHRSSDGKVDIFHNN 1058

Query: 627  FGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEKRLSDVSRGSGKNLE 806
            FGN PKKPRTQVSYS+PFGG D+S K    HQ+GLP  RIRR++EKR SDVSRGS +N+E
Sbjct: 1059 FGNGPKKPRTQVSYSMPFGGLDYSSKNIGYHQRGLPHKRIRRANEKRSSDVSRGSQRNME 1118

Query: 807  LLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRYSHKAHQFL 968
            L+SC ANLL+TLGD+GWRECGAQ+ LE F+HNEWKLAVK++G+TR S+KAHQFL
Sbjct: 1119 LVSCHANLLLTLGDRGWRECGAQVALERFDHNEWKLAVKMSGSTRCSYKAHQFL 1172


>ref|XP_012463287.1| PREDICTED: uncharacterized protein LOC105782825 isoform X1 [Gossypium
            raimondii] gi|763816407|gb|KJB83259.1| hypothetical
            protein B456_013G238100 [Gossypium raimondii]
          Length = 1674

 Score =  312 bits (799), Expect = 3e-82
 Identities = 166/294 (56%), Positives = 205/294 (69%), Gaps = 7/294 (2%)
 Frame = +3

Query: 108  ASDGCFPVAKG------ELEVVSSSVCGDGNWTRTPQTYRNGDLNVAGTSANSREPEEIG 269
            +S G FP A        EL  +  SV  DG W +  Q + N D  V G+ A   +PEE+G
Sbjct: 924  SSLGNFPKASSKVASVTELMTLDLSVSSDGRWRKYLQKHANSDQIVNGSPAIYHKPEEVG 983

Query: 270  RESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRVEIPTFNQFGKH-D 446
              +I  L+K +C   +S++ FL  +   D DK  + +  S LNG RVE+P F+Q+  H D
Sbjct: 984  ASAIGQLEKQKCDYSESQQPFLSSKVV-DGDKKGSGSS-SVLNGIRVELPPFDQYKNHVD 1041

Query: 447  REYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSISSFGYLAQGWSDGKVDIAHNN 626
             +  S Q+S DL WNMNGG IP+PNPTA RS WH+NRS  S GY A   SDGKVDI HNN
Sbjct: 1042 SKLPSTQQSTDLTWNMNGGVIPTPNPTASRSYWHQNRSSLSIGYHAHRSSDGKVDIFHNN 1101

Query: 627  FGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSSEKRLSDVSRGSGKNLE 806
            FGN PKKPRTQVSYS+PFGG D+S K    HQ+GLP  RIRR++EKR SDVSRGS +N+E
Sbjct: 1102 FGNGPKKPRTQVSYSMPFGGLDYSSKNIGYHQRGLPHKRIRRANEKRSSDVSRGSQRNME 1161

Query: 807  LLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRYSHKAHQFL 968
            L+SC ANLL+TLGD+GWRECGAQ+ LE F+HNEWKLAVK++G+TR S+KAHQFL
Sbjct: 1162 LVSCHANLLLTLGDRGWRECGAQVALERFDHNEWKLAVKMSGSTRCSYKAHQFL 1215


>ref|XP_008394009.1| PREDICTED: uncharacterized protein LOC103456143 [Malus domestica]
          Length = 1662

 Score =  310 bits (794), Expect = 1e-81
 Identities = 170/308 (55%), Positives = 212/308 (68%), Gaps = 2/308 (0%)
 Frame = +3

Query: 51   SNMSGSTLANNVKVMPKEAASDGCFPVAKGELEVVSSSVCGDGNWTRTPQTYRNGDLNVA 230
            S  S  T   N+K  P +A SDG       E  +   SVC  G  T + Q ++NG L V+
Sbjct: 898  SEGSKITPQKNLKAPPSDATSDGSCAKPDAENXI---SVC-HGARTNSSQHFQNGGLYVS 953

Query: 231  GTSANSREPEEIGRESIVPLQKLQCHDPKSERCFLLPRPSGDCDKTDTAAYYSPLNGTRV 410
             +S  +   E+ G + +V  + LQ H P+S++C L PRP    DK+DT +   P NG  V
Sbjct: 954  VSSGGTGVLEKTGTDEVVQSKVLQSHXPESDQCSLSPRPLVGRDKSDTDSQSFP-NGLTV 1012

Query: 411  EIPTFNQFGKH-DREYHSAQRSADLNWNMNGGNIPSPNPTAPRSTWHRNRSISSFGYLAQ 587
            EIP+F+ F K  D+E  SAQ+  D  WNMNG  IPSPNPTAPRST HRNR+ SS G+L+ 
Sbjct: 1013 EIPSFDXFEKPVDKEVQSAQQPTDFXWNMNGSIIPSPNPTAPRSTGHRNRNNSSLGHLSH 1072

Query: 588  GWSDGKVDIAHNNFGNTPKKPRTQVSYSLPFGGFDHSPKIRVNHQKGLPCTRIRRSS-EK 764
             WSDG  D+ HN FG+ PKKPRTQVSY+LP+GGFD S K R N QKGLP  RIRR++ EK
Sbjct: 1073 NWSDG-TDLFHNGFGSGPKKPRTQVSYTLPYGGFDFSSKQR-NLQKGLPHKRIRRANNEK 1130

Query: 765  RLSDVSRGSGKNLELLSCDANLLITLGDKGWRECGAQIVLELFEHNEWKLAVKLAGTTRY 944
            R SD SRGS +NLELLSC+AN+L+   D+GWRECGA +VLELF+HNEWKLAVK++GTT+Y
Sbjct: 1131 RSSDASRGSQRNLELLSCEANVLVNGSDRGWRECGAHVVLELFDHNEWKLAVKISGTTKY 1190

Query: 945  SHKAHQFL 968
            S+KAHQFL
Sbjct: 1191 SYKAHQFL 1198


Top