BLASTX nr result

ID: Angelica27_contig00000783 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00000783
         (1784 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017229799.1 PREDICTED: uncharacterized protein LOC108204730 [...   577   0.0  
KZN08854.1 hypothetical protein DCAR_001510 [Daucus carota subsp...   574   0.0  
CBI21048.3 unnamed protein product, partial [Vitis vinifera]          338   e-107
XP_002285150.2 PREDICTED: uncharacterized protein LOC100266444 [...   333   e-105
CAN68624.1 hypothetical protein VITISV_010682 [Vitis vinifera]        334   e-104
OAY53668.1 hypothetical protein MANES_03G014300 [Manihot esculenta]   307   8e-96
CDP13523.1 unnamed protein product [Coffea canephora]                 297   8e-92
OAY27387.1 hypothetical protein MANES_16G122300 [Manihot esculenta]   291   3e-89
APR63704.1 hypothetical protein [Populus tomentosa]                   288   5e-88
OMO61658.1 hypothetical protein CCACVL1_23335 [Corchorus capsula...   286   1e-87
XP_002309630.1 hypothetical protein POPTR_0006s27080g [Populus t...   285   5e-87
XP_007011854.2 PREDICTED: uncharacterized protein LOC18587787 is...   286   6e-87
OMO90819.1 hypothetical protein COLO4_18861 [Corchorus olitorius]     284   1e-86
XP_012076555.1 PREDICTED: uncharacterized protein LOC105637632 i...   284   1e-86
XP_011039682.1 PREDICTED: uncharacterized protein LOC105136151 [...   283   2e-86
EOY29473.1 Uncharacterized protein TCM_036994 isoform 3 [Theobro...   285   2e-86
XP_012442361.1 PREDICTED: uncharacterized protein LOC105767390 [...   282   4e-86
XP_002324862.2 hypothetical protein POPTR_0018s01770g [Populus t...   284   5e-86
XP_011036558.1 PREDICTED: uncharacterized protein LOC105134027 [...   282   8e-86
XP_017627999.1 PREDICTED: uncharacterized protein LOC108470968 [...   278   2e-84

>XP_017229799.1 PREDICTED: uncharacterized protein LOC108204730 [Daucus carota subsp.
            sativus]
          Length = 376

 Score =  577 bits (1487), Expect = 0.0
 Identities = 285/376 (75%), Positives = 315/376 (83%), Gaps = 2/376 (0%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG + YECVR+ WHSDRHQPIRGS IQEIFRVVHE+H PATKKNKEWQEKLP+VVL+
Sbjct: 1    MPRPGPKPYECVRKVWHSDRHQPIRGSFIQEIFRVVHEIHSPATKKNKEWQEKLPVVVLR 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AE+IMYSKANSEAEYMDLKTLWDRT DAINTIIRLDET ETGV+LQPCIEAALHLGCTPR
Sbjct: 61   AEDIMYSKANSEAEYMDLKTLWDRTNDAINTIIRLDETIETGVYLQPCIEAALHLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNH-GTNQIMSGYSDADKTSFSCLG 1143
            + SRSQRNITPSYYLSPI+PDKMTI SS LQNSVLGNH  TNQ MSG  DA KTSFS  G
Sbjct: 121  RASRSQRNITPSYYLSPINPDKMTIPSSSLQNSVLGNHRTTNQFMSGCLDAGKTSFSFFG 180

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAM 963
            MPS  PAV PL+FG+ +PKDSKFNFD  S+FN+ TP+K SSHSM AS I GKQNPYGK  
Sbjct: 181  MPSPGPAVKPLYFGNPKPKDSKFNFDVPSKFNLDTPTKFSSHSMKASGICGKQNPYGKIK 240

Query: 962  ASSKPIEADINDTFGNSHEIGCDLSLRLGCLGTPREIIETRIDNDLGNLSSRTPQGNKLI 783
            A  KP EAD+ D   +S+ I CDLSLRLGCLG+P E IETR+D DLGN S   PQG KL 
Sbjct: 241  ALGKPFEADVKDASCDSYGIDCDLSLRLGCLGSPGEKIETRVDKDLGNFSLMNPQGKKLT 300

Query: 782  DSSIQIDKSFFRNSYFCESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHP 606
            DSSIQIDKSFF+ S+ C+SVDTCST +I EAE LNLK+T  KRK++MD+P EDE FSW P
Sbjct: 301  DSSIQIDKSFFKTSHICDSVDTCSTHQINEAENLNLKETARKRKSIMDHPLEDEDFSWRP 360

Query: 605  KLPFRDYTWRARNAEP 558
            K+PF D+T RARNAEP
Sbjct: 361  KVPFHDFTARARNAEP 376


>KZN08854.1 hypothetical protein DCAR_001510 [Daucus carota subsp. sativus]
          Length = 376

 Score =  574 bits (1480), Expect = 0.0
 Identities = 284/376 (75%), Positives = 314/376 (83%), Gaps = 2/376 (0%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG + YECVR+ WHSDRHQPIRGS IQEIFRVVHE+H PATKKNKEWQEKLP+VVL+
Sbjct: 1    MPRPGPKPYECVRKVWHSDRHQPIRGSFIQEIFRVVHEIHSPATKKNKEWQEKLPVVVLR 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AE+IMYSKANSEAEYMDLKTLWDRT DAINTIIRLDET ETGV+LQPCIEAALHLGCTPR
Sbjct: 61   AEDIMYSKANSEAEYMDLKTLWDRTNDAINTIIRLDETIETGVYLQPCIEAALHLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNH-GTNQIMSGYSDADKTSFSCLG 1143
            + SRSQRNITPSYYLSPI+PDKMTI SS LQNSVLGNH  TNQ MSG  DA KTSFS  G
Sbjct: 121  RASRSQRNITPSYYLSPINPDKMTIPSSSLQNSVLGNHRTTNQFMSGCLDAGKTSFSFFG 180

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAM 963
            MPS  PAV PL+FG+ +PKDSKFNFD  S+FN+ TP+K SSHSM AS I GKQNPYGK  
Sbjct: 181  MPSPGPAVKPLYFGNPKPKDSKFNFDVPSKFNLDTPTKFSSHSMKASGICGKQNPYGKIK 240

Query: 962  ASSKPIEADINDTFGNSHEIGCDLSLRLGCLGTPREIIETRIDNDLGNLSSRTPQGNKLI 783
            A  KP EAD+ D   +S+ I CDLSLRLGCLG+P E IETR+D DLGN S   PQG KL 
Sbjct: 241  ALGKPFEADVKDASCDSYGIDCDLSLRLGCLGSPGEKIETRVDKDLGNFSLMNPQGKKLT 300

Query: 782  DSSIQIDKSFFRNSYFCESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHP 606
            DSSIQIDKSFF+ S+ C+SVDTCST +I EAE LNLK+T  KRK++MD+P EDE FSW P
Sbjct: 301  DSSIQIDKSFFKTSHICDSVDTCSTHQINEAENLNLKETARKRKSIMDHPLEDEDFSWRP 360

Query: 605  KLPFRDYTWRARNAEP 558
            K+PF D+T RARNA P
Sbjct: 361  KVPFHDFTARARNAGP 376


>CBI21048.3 unnamed protein product, partial [Vitis vinifera]
          Length = 451

 Score =  338 bits (866), Expect = e-107
 Identities = 206/434 (47%), Positives = 258/434 (59%), Gaps = 48/434 (11%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFRVV+E+H  ATKKNKEWQEKLPIVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+TETG FLQPCIEA+L+LGC  R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGT-NQIMSGYSDADKTSFSCLG 1143
            + SRSQRN  P  YL+P + + ++I+ S L+NS  GNH T +Q+MS Y+   K S   + 
Sbjct: 121  RASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIKPSSMSVI 180

Query: 1142 MPSHSP-----------------------------------------AVYPLFFGD-LQP 1089
             P   P                                         AVYPL+ G+ LQ 
Sbjct: 181  QPGLEPHSTAFHNNDCPTSKFLFSSENCPPSGNKCLQMEVYPASNVCAVYPLYDGNQLQC 240

Query: 1088 KDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAM-ASSKPIEADINDTFGNS 912
            ++S+  F        G  S   S+ M  + +   QN +  A+  + KP + D      NS
Sbjct: 241  EESQCGF--------GVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGHVTENS 292

Query: 911  HEIGCDLSLRLGCLGTPREIIETRIDNDLGNL-SSRTPQGNKLIDSSIQIDKS--FFRNS 741
             +I CDLSLRLG L  P   +E     +  ++ SS + +G+K  D S Q+DK   FF   
Sbjct: 293  PKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPQVDKQFPFFPRG 352

Query: 740  YFCESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
               + +D+C + R  E E LN++ T+ KRKAV+ YP ED +F   PKLP+     R RNA
Sbjct: 353  NTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQPKLPYNYLPGRMRNA 412

Query: 563  EP*GLVLCVIFTVA 522
            E   L  C ++ ++
Sbjct: 413  EMLYLTKCYLWNLS 426


>XP_002285150.2 PREDICTED: uncharacterized protein LOC100266444 [Vitis vinifera]
          Length = 414

 Score =  333 bits (855), Expect = e-105
 Identities = 203/420 (48%), Positives = 251/420 (59%), Gaps = 48/420 (11%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFRVV+E+H  ATKKNKEWQEKLPIVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+TETG FLQPCIEA+L+LGC  R
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGT-NQIMSGYSDADKTSFSCLG 1143
            + SRSQRN  P  YL+P + + ++I+ S L+NS  GNH T +Q+MS Y+   K S   + 
Sbjct: 121  RASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIKPSSMSVI 180

Query: 1142 MPSHSP-----------------------------------------AVYPLFFGD-LQP 1089
             P   P                                         AVYPL+ G+ LQ 
Sbjct: 181  QPGLEPHSTAFHNNDCPTSKFLFSSENCPPSGNKCLQMEVYPASNVCAVYPLYDGNQLQC 240

Query: 1088 KDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAM-ASSKPIEADINDTFGNS 912
            ++S+  F        G  S   S+ M  + +   QN +  A+  + KP + D      NS
Sbjct: 241  EESQCGF--------GVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGHVTENS 292

Query: 911  HEIGCDLSLRLGCLGTPREIIETRIDNDLGNL-SSRTPQGNKLIDSSIQIDKS--FFRNS 741
             +I CDLSLRLG L  P   +E     +  ++ SS + +G+K  D S Q+DK   FF   
Sbjct: 293  PKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPQVDKQFPFFPRG 352

Query: 740  YFCESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
               + +D+C + R  E E LN++ T+ KRKAV+ YP ED +F   PKLP+     R RNA
Sbjct: 353  NTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQPKLPYNYLPGRMRNA 412


>CAN68624.1 hypothetical protein VITISV_010682 [Vitis vinifera]
          Length = 526

 Score =  334 bits (857), Expect = e-104
 Identities = 203/426 (47%), Positives = 255/426 (59%), Gaps = 48/426 (11%)
 Frame = -1

Query: 1694 SFYQDMPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLP 1515
            +F + MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFRVV+E+H  ATKKNKEWQEKLP
Sbjct: 20   NFNKRMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLP 79

Query: 1514 IVVLKAEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHL 1335
            IVVLKAEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+TETG FLQPCIEA+L+L
Sbjct: 80   IVVLKAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNL 139

Query: 1334 GCTPRKTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGT-NQIMSGYSDADKTS 1158
            GC  R+ SRSQRN  P  YL+P + + ++I+ S L+NS  GNH T +Q+MS Y+   K S
Sbjct: 140  GCPQRRASRSQRNNNPRCYLTPSTQEPISISPSILENSPQGNHTTISQVMSRYATFIKPS 199

Query: 1157 FSCLGMPSHSP-----------------------------------------AVYPLFFG 1101
               +  P   P                                         AVYPL+ G
Sbjct: 200  SMSVIQPGLEPHSTAFHNNDCPTXKFLFSSENCPPSGNKCLQMEVYPASNLCAVYPLYDG 259

Query: 1100 D-LQPKDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAM-ASSKPIEADIND 927
            + LQ ++S+  F        G  S   S+ M  + +   QN +  A+  + KP + D   
Sbjct: 260  NQLQCEESQCGF--------GVQSHPKSNPMEPAGMGTIQNLFSYAIDPTKKPSQTDFGH 311

Query: 926  TFGNSHEIGCDLSLRLGCLGTPREIIETRIDNDLGNL-SSRTPQGNKLIDSSIQIDKS-- 756
               NS +I CDLSLRLG L  P   +E     +  ++ SS + +G+K  D S ++DK   
Sbjct: 312  VTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDLSPRVDKQFP 371

Query: 755  FFRNSYFCESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTW 579
            FF      + +D+C + R  E E LN++ T+ KRKAV+ YP ED +F   PKLP+     
Sbjct: 372  FFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQPKLPYNYLPG 431

Query: 578  RARNAE 561
            R RNA+
Sbjct: 432  RMRNAD 437


>OAY53668.1 hypothetical protein MANES_03G014300 [Manihot esculenta]
          Length = 393

 Score =  307 bits (787), Expect = 8e-96
 Identities = 189/406 (46%), Positives = 238/406 (58%), Gaps = 33/406 (8%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFRVV+EVH  ATKKNKEWQEKLP+VVL+
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEVHSSATKKNKEWQEKLPVVVLR 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEI+YSKANSEAEYMDLKTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIIYSKANSEAEYMDLKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN-QIMSGYSDADKTSF---- 1155
            + SRSQRN  P  YLS  S +  T +   + +SV  NH T+ Q +  Y +  K +F    
Sbjct: 121  RASRSQRNCNPRCYLSASSQEPNTFSPGIVNSSVQVNHKTSPQCIPNYLNFIKPTFVNST 180

Query: 1154 --------------------SCLGMPSHSP----AVYPLFFGD-LQPKDSKFNFDNCSRF 1050
                                 CL + + +     +VYPL+FG  ++P+            
Sbjct: 181  HLGSDKFLLATDNGCLSNFNQCLPVENRAVSRLCSVYPLYFGSCIEPQQGS------GLL 234

Query: 1049 NVGTPSKLSSHSMNASEIFGKQNPYG-KAMASSKPIEADINDTFGNSHEIGCDLSLRLGC 873
            +   PS      M   E    Q+P G    A  K  ++D  D      ++GCDLSLRLG 
Sbjct: 235  SKSVPSTWEPAKMGGIE----QSPLGCNEYADVKINQSDFKDISMQHQDVGCDLSLRLGS 290

Query: 872  LGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDKSF--FRNSYFCESVDTCSTDRI 699
            L       +     D+ ++ S   +G+K  +  +Q DK F  F       S+D+C + ++
Sbjct: 291  LSASLPSSQNWQLQDVEDVGS--GEGSKFNNQMLQTDKEFTLFARVDKDNSLDSCPS-KL 347

Query: 698  MEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNAE 561
             E   +N K   KRKAV  +P +D+   W PKLP +D T R R+A+
Sbjct: 348  SERVNINAKMK-KRKAVYGHPVDDQACHWQPKLPCKDLTCRMRSAD 392


>CDP13523.1 unnamed protein product [Coffea canephora]
          Length = 403

 Score =  297 bits (761), Expect = 8e-92
 Identities = 184/407 (45%), Positives = 234/407 (57%), Gaps = 34/407 (8%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E+H  ATKKNKEWQEKLPIVVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRIVNEIHSSATKKNKEWQEKLPIVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEY+D+ TL DR  DAINTIIR DE+TETG  LQPCIEAALHLGCTPR
Sbjct: 61   AEEIMYSKANSEAEYVDINTLRDRANDAINTIIRRDESTETGELLQPCIEAALHLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTNQIM---------------- 1188
            ++SRSQRNITP  YL+P  P+ ++++ + L N V GN+ TN                   
Sbjct: 121  RSSRSQRNITPRCYLNPEKPEAISVSLTNLDNKVQGNYATNCSFIPQSPNLLMNSADTCL 180

Query: 1187 ----SGYSDADKTSFSCLGMPSHSPAVYPLFFGDLQPKDSKFNFDNCSRFNVGTPSKLSS 1020
                SG    D         PS +    P     L    S +   + + F     SK +S
Sbjct: 181  NYDNSGIQKTDSEYPKLRNSPSINHQSRPRLAPSLSSSCSVYPLYHGNHFQFQDSSKSNS 240

Query: 1019 HSMNASE--IFGKQNPYGKAMASSKPIEADINDTFGNSHEIGCDLSLRLG-----CLG-- 867
            H M   +  +  +++  G+  A +   +A  +      H  GCDLSLRLG     CLG  
Sbjct: 241  HLMKTDKKGVMKRRSTCGQD-ALNVISQASSHYVSETPHGSGCDLSLRLGPLGVSCLGEE 299

Query: 866  --TPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDK--SFFRNSYFCESVDTCSTDRI 699
               P+E+       D G L +    G+K  D S Q DK  SFF      + +D+ S    
Sbjct: 300  NSCPQEV------EDGGGLGT-CKVGSKDNDLSSQSDKDFSFFPKPNGHDMLDSSSNKWS 352

Query: 698  MEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARNAE 561
             + + +N++ TL KRK  + +P ED ++ W  K PF+    R  +++
Sbjct: 353  HKTQNVNIEATLRKRKVAVSHPSEDRQYRWPLKFPFKQVNGRINSSD 399


>OAY27387.1 hypothetical protein MANES_16G122300 [Manihot esculenta]
          Length = 406

 Score =  291 bits (744), Expect = 3e-89
 Identities = 181/409 (44%), Positives = 231/409 (56%), Gaps = 37/409 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPR G R YECVRRAWHSDRHQPIRGSLIQEIFRVV+EVH  ATKKNKEWQEKLP+VVLK
Sbjct: 1    MPRSGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEVHGSATKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+TETG  LQPCIEAAL LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGELLQPCIEAALILGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTNQ-IMSGYSDADKTSFSCLG 1143
            + SRSQRN  P  YL P + +  T +S  + ++   NH T+   +  Y  A+  + + + 
Sbjct: 121  RASRSQRNCNPRCYLIPGTQEPNTFSSGIVNSTTRANHTTSPPCIPNY--ANFITPTIIN 178

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNF----DNCSRFNVGTP-------SKLSSHSMNASEI 996
                 P +  L + ++    +KF F     + + +N   P       S  S + +     
Sbjct: 179  STLLGPELQNLVYKNVAVTPNKFLFATDNSHLANYNQCLPAENRPVSSMCSVYPLYYGSC 238

Query: 995  FGKQNPYGKAMASSKPI---------------------EADINDTFGNSHEIGCDLSLRL 879
               Q   G    +++P+                     ++D +D+    HE+GCDLSLRL
Sbjct: 239  LKPQQDLGILSKAAEPVRVSGIEQNLFSYNEDPAVKINQSDPSDSLLEQHEVGCDLSLRL 298

Query: 878  GCLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSI-QIDKSF--FRNSYFCESVDTCST 708
            G L      ++ R   D+  + S   Q        + Q DK F  F       S+D+C +
Sbjct: 299  GSLSASLPSVQKRQLQDVEAVGSGYSQERSEFSHRMPQTDKEFSLFTTVNVDNSLDSCPS 358

Query: 707  DRIMEAEKLNLKKTLK-RKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
                  E +N+   +K RKAV  YP ED+ + W PKLP  D T R ++A
Sbjct: 359  KL---REDVNVDAQMKKRKAVFVYPVEDQAYCWQPKLPCNDLTSRMKSA 404


>APR63704.1 hypothetical protein [Populus tomentosa]
          Length = 407

 Score =  288 bits (736), Expect = 5e-88
 Identities = 185/412 (44%), Positives = 232/412 (56%), Gaps = 40/412 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E H   TKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYM+LKTLWDRT DAINTIIR DE+ ETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDESMETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN-QIMSGYSDADKTSFSCLG 1143
            + SRSQRN  PS+YL+P + +  T++S  + +++  N  +N  ++  YS   K       
Sbjct: 121  RASRSQRNCNPSFYLNPSTQEPNTLSSGSVHSAIQANRTSNSHVLPNYSSMVKPIIMNST 180

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNF--DNCSRFNVGTPSKLSSHSMNA------------ 1005
             P         F G      ++F F  DN    NV     L ++ + +            
Sbjct: 181  PPGSESQD---FVGQSNGTSNRFLFIDDNIPLSNVNQCLPLGNYRIPSLCSVYPLYYGSC 237

Query: 1004 ----------SEIF-GKQNPYGKAMASS----------KPIEADINDTFGNSHEIGCDLS 888
                       E F G   P   A+  +          K   AD  D+     EIGCDLS
Sbjct: 238  LEPQRGCGALPETFPGTMEPVKVAVMQNFFPCNEDIPVKTCHADHKDSPLQPQEIGCDLS 297

Query: 887  LRLGCLGTPREIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDKS--FFRNSYFCESVDT 717
            LRLG L  P   ++T+   D  +      Q G K+ D   Q DK   FF      +S+ +
Sbjct: 298  LRLGSLPAPMLSVKTKQLKDAKDGGHDCSQEGGKVDDWMPQADKELPFFTRVNLADSLVS 357

Query: 716  CSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
             S+      E +N+ +T+ KRKAV+D+  ED+ F W PKL     T R ++A
Sbjct: 358  HSS---KSREHVNIDETMKKRKAVLDHHVEDQ-FCWQPKLHCNQLTCRMKSA 405


>OMO61658.1 hypothetical protein CCACVL1_23335 [Corchorus capsularis]
          Length = 396

 Score =  286 bits (732), Expect = 1e-87
 Identities = 180/409 (44%), Positives = 239/409 (58%), Gaps = 37/409 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLIQEIFRVV+E+H  ATKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSE EYMDLKTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSELEYMDLKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSP--------------------------ISPDKMTIT--SSRLQN 1224
            +T RSQRN  P  YLS                           + P  M +T  SS  Q 
Sbjct: 121  RTLRSQRNCNPGCYLSMGAQEAENTSQGNLTTNSHCVASFPSFMKPTTMDVTHLSSESQK 180

Query: 1223 SVL--GNHGTNQI-MSGYSDADKTSFSCLGMPSHSP----AVYPLFFGDLQPKDSKFNFD 1065
             +    N  TN+  ++  +    ++  CL +  + P    ++YPL++G+  PK  +    
Sbjct: 181  HLADDSNCTTNKFPLTSENCPYLSNDQCLPVEKYPPTNMYSIYPLYYGN-HPKFEEL--- 236

Query: 1064 NCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAMASSKPI-EADINDTFGNSHEIGCDLS 888
               +   G   K  S+++  ++I    N +   + SS  I + ++ +T  N HEI CDLS
Sbjct: 237  ---QHGFGIFPKSISNTVEPAKISAIHNLFSSDVDSSNKINQTNVRNTSNNPHEIACDLS 293

Query: 887  LRLGCLGTPR-EIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDKSFFRNSYFCESVDTCS 711
            LRLG +G  R + IE      LG  S+ TP        SI    S F +S   + +++ S
Sbjct: 294  LRLGPVGNGRSQEIEDTGSTSLGWKSNLTP--------SIDNKFSSFPSSNRDDPLNSSS 345

Query: 710  TDRIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
             +  +E E +N+  T++++  +  P  D++F   PK+PF   + R ++A
Sbjct: 346  NECSVEGEHMNVGATMRKRKTVYGPSVDQQFCLPPKVPFSHLSGRMKSA 394


>XP_002309630.1 hypothetical protein POPTR_0006s27080g [Populus trichocarpa]
            EEE93153.1 hypothetical protein POPTR_0006s27080g
            [Populus trichocarpa]
          Length = 407

 Score =  285 bits (729), Expect = 5e-87
 Identities = 184/420 (43%), Positives = 235/420 (55%), Gaps = 48/420 (11%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E H   TKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYM+LKTLWDRT DAINTIIR DE+TE G  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDESTEIGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN-QIMSGYSDADK------- 1164
            + SRSQRN  PS+YLSP + +  T++S  + +++  N  +N  ++  YS   K       
Sbjct: 121  RASRSQRNCNPSFYLSPSTQEPNTLSSGSVHSAIQANRTSNSHVLPNYSSMVKPIIMNST 180

Query: 1163 ------------------------------TSFSCLGMPSHS----PAVYPLFFG-DLQP 1089
                                           +  CL + ++      +VYPL++G  L+P
Sbjct: 181  PPGSESQDFVGQSNGTSNRFLFIDDSIPLSNANQCLPLGNYRIPSLCSVYPLYYGCCLEP 240

Query: 1088 KDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPYG-KAMASSKPIEADINDTFGNS 912
            +              G   K    +M   ++   QN +        K   AD  D+    
Sbjct: 241  QR-----------GCGALPKTFPGTMEPVKVAVMQNFFPCNEDTPVKTCHADHKDSPLQP 289

Query: 911  HEIGCDLSLRLGCLGTPREIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDKS--FFRNS 741
             EIGCDLSLRLG L  P   ++T+   D  +      Q G K+ D   Q+DK   FF   
Sbjct: 290  QEIGCDLSLRLGSLPAPMLSVKTKQLKDAKDGGHDCSQEGGKVDDWMPQVDKELPFFTRV 349

Query: 740  YFCESVDTCSTDRIMEAEKLNLKKT-LKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
               + + + S+      E +N+ +T  KRKAV+D+  ED+ F W PKL     T R ++A
Sbjct: 350  NVADPLVSHSS---KSREHVNIDETKKKRKAVLDHHVEDQ-FCWQPKLHCNQLTCRMKSA 405


>XP_007011854.2 PREDICTED: uncharacterized protein LOC18587787 isoform X1 [Theobroma
            cacao] XP_017983375.1 PREDICTED: uncharacterized protein
            LOC18587787 isoform X1 [Theobroma cacao]
          Length = 447

 Score =  286 bits (732), Expect = 6e-87
 Identities = 175/411 (42%), Positives = 243/411 (59%), Gaps = 39/411 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLIQEIFRVV+E+H  ATKKNKEWQEKLP+VVLK
Sbjct: 43   MPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVVLK 102

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLK+LWDRT DAINTII+ DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 103  AEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCTPR 162

Query: 1319 KTSRSQRNITPSYYLSP--------------ISPDKMTITSSRLQNSVLG--NHGTNQIM 1188
            +T RSQRN  P  YLSP               +P+ M   S  ++++++   + G+    
Sbjct: 163  RTLRSQRNCNPRCYLSPGTQEAENTTQANLTTNPNFMASYSGFMKSTIMNVTHLGSESQK 222

Query: 1187 SGYSDADKTSFS---------------CLGMPSHSP----AVYPLFFGD-LQPKDSKFNF 1068
                D++ T++                CL M  + P    +VYPL++G+ LQ ++ +  F
Sbjct: 223  HIAQDSNCTTYKFPFASENGPLPSNSQCLPMEKYPPPNLYSVYPLYYGNHLQFEEMQHGF 282

Query: 1067 DNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAMASSKPI-EADINDTFGNSHEIGCDL 891
                    G   K  S+++  +++    N +   + SS  + + D+++T  N HE  CDL
Sbjct: 283  --------GIFPKSISNTVEPAKMGVIDNLFSSDVDSSNNMNQTDVSNTSNNPHENACDL 334

Query: 890  SLRLGCLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDK--SFFRNSYFCESVDT 717
            SLRLG L  P   +       + +  S + + N+  D +  IDK  S F  S   + +++
Sbjct: 335  SLRLGPLSIPCLSVGKSRPQVIEDTGSTSLEWNRFGDLTPSIDKMLSSFPRSNRDDPLNS 394

Query: 716  CSTDRIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
                  +E E +N+  T++++  +  P  D++F   PKLP+   T R ++A
Sbjct: 395  SLNRWSLEGEHVNVDATMRKRKTVYGPTVDQQFCLPPKLPYSHLTGRMKSA 445


>OMO90819.1 hypothetical protein COLO4_18861 [Corchorus olitorius]
          Length = 396

 Score =  284 bits (726), Expect = 1e-86
 Identities = 181/410 (44%), Positives = 236/410 (57%), Gaps = 38/410 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLIQEIFRVV+E+H  ATKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSE EYMDLKTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSELEYMDLKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDK----------------------------MTITSSRLQN 1224
            +T RSQRN  P  YLS  + +                             +T  SS  Q 
Sbjct: 121  RTLRSQRNCNPGCYLSMGAQEAENTSQGNLTTNSHCVASFPSFMKATTMDVTPLSSESQK 180

Query: 1223 SVL--GNHGTNQIMSGYSDADKTSFS-CLGMPSHSP----AVYPLFFGDLQPKDSKFNFD 1065
             V    N  TN+      +    S   CL +  + P    ++YPL++G+  PK  +    
Sbjct: 181  HVADDSNCTTNKFPFTSENCPYLSNDQCLPVEKYPPTNMYSIYPLYYGN-HPKFEEL--- 236

Query: 1064 NCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAMASSKPI-EADINDTFGNSHEIGCDLS 888
               +   G   K  S+++  ++I    N +   + SS  I + ++ +T  N HEI CDLS
Sbjct: 237  ---QHAFGVFPKSISNTVEPAKIGASHNLFSSDVDSSNKINQTNVRNTSNNPHEIACDLS 293

Query: 887  LRLGCLGTPR-EIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDKSFFRNSYFCESVDTC 714
            LRLG +G  R + IE      LG  S+ TP   NKL         S F +S   + +++ 
Sbjct: 294  LRLGPVGNGRSQEIEDTGSTSLGWKSNLTPSTDNKL---------SSFPSSNRDDPLNSS 344

Query: 713  STDRIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
            S +  +E E +N+  T++++  +  P  D++F   PK+PF   + R ++A
Sbjct: 345  SNECSVEGEHMNVGATMRKRKTVYGPSVDQQFCLPPKVPFSHLSGRMKSA 394


>XP_012076555.1 PREDICTED: uncharacterized protein LOC105637632 isoform X1 [Jatropha
            curcas] KDP33588.1 hypothetical protein JCGZ_07159
            [Jatropha curcas]
          Length = 408

 Score =  284 bits (727), Expect = 1e-86
 Identities = 181/409 (44%), Positives = 233/409 (56%), Gaps = 38/409 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFRVV+EVH  ATKKNKEWQEKLP+VVL+
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEVHSSATKKNKEWQEKLPVVVLR 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN-QIMSGYSDADKTSFSCLG 1143
            + SRSQRN  P  YLSP +    + +   + +++  NH  + Q +  YS+  K++   + 
Sbjct: 121  RASRSQRNCNPRCYLSPSTQQPNSSSPGIVNDTIRANHTASPQCIPNYSNFIKSTI--MN 178

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNF--DNC--SRFNVGTP-------SKLSSHSMNASEI 996
                   +  L   ++    +KF F  DN   S +N   P       S  S + +     
Sbjct: 179  STQLGSELQNLICQNISIASNKFLFRTDNSRLSNYNQYFPMENRSVSSLYSVYPLYYGNC 238

Query: 995  FGKQNPYG---KAMASS-KPIEADINDTFGNSHE--------------------IGCDLS 888
               QN  G   K + S  +P++  I     + +E                    IGCDLS
Sbjct: 239  LDHQNGLGILPKTLPSILEPVKVGIEQNLLSCNEDAIAKIDQKDPIDKPIEQLEIGCDLS 298

Query: 887  LRLGCLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDK--SFFRNSYFCESVDTC 714
            LRLG L      ++ R   D+ ++     +     +   Q+DK  S F       S D+C
Sbjct: 299  LRLGSLSAALPSMQNRHLQDVEDVGFGHSREGIKSNKMPQMDKELSLFNRGNMDYSSDSC 358

Query: 713  STDRIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARN 567
             ++ +   + L++    KRKAV  +P +D+ + W PKLP  D T R R+
Sbjct: 359  PSE-LGRHDSLDVMLR-KRKAVFGHPVDDQAYHWQPKLPCNDLTGRMRS 405


>XP_011039682.1 PREDICTED: uncharacterized protein LOC105136151 [Populus euphratica]
          Length = 407

 Score =  283 bits (725), Expect = 2e-86
 Identities = 186/413 (45%), Positives = 231/413 (55%), Gaps = 41/413 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E H   TKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYM+LKTLWDRT DAINTIIR DE+ ETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDESMETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNH-GTNQIMSGYSDADKTSFSCLG 1143
            + SRSQRN  PS+YLSP + +  T++S  + +++  N   T+ ++  YS   K     + 
Sbjct: 121  RASRSQRNCNPSFYLSPSTQEPNTLSSGSVHSAIQANRTSTSHVLPNYSSMVKP----II 176

Query: 1142 MPSHSPAVYPL-FFGDLQPKDSKFNF--DNCSRFNVG---------TPSKLSSHSMNASE 999
            M S  P      F G      ++F F  DN    NV           PS  S + +    
Sbjct: 177  MNSIPPGSESQDFVGQSNGTSNRFLFIDDNIPLSNVNQCLPLGNYRIPSLCSVYPLYYGS 236

Query: 998  IF--------------GKQNPYGKAMASS----------KPIEADINDTFGNSHEIGCDL 891
                            G   P   A+  +          K   AD  D+     EIGCDL
Sbjct: 237  CLESQRGCGALPETYPGTMEPVKVAVMQNFFPCNEDTPVKTCHADHKDSPLQPQEIGCDL 296

Query: 890  SLRLGCLGTPREIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDKS--FFRNSYFCESVD 720
            SLRLG L  P   ++T+   D  +      Q G K+ D   Q DK   FF      + + 
Sbjct: 297  SLRLGSLPAPMLSVKTKQLKDAKDGGHDCSQEGGKVDDWMPQADKELPFFTRVNVADPLV 356

Query: 719  TCSTDRIMEAEKLNL-KKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
            + S+      E +N+ ++  KRKAV+D+  ED+ F W PKL     T R ++A
Sbjct: 357  SHSS---KSREHVNIDERKKKRKAVLDHHVEDQ-FCWQPKLHCNQLTCRMKSA 405


>EOY29473.1 Uncharacterized protein TCM_036994 isoform 3 [Theobroma cacao]
          Length = 447

 Score =  285 bits (728), Expect = 2e-86
 Identities = 174/411 (42%), Positives = 243/411 (59%), Gaps = 39/411 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLIQEIFRVV+E+H  ATKKNKEWQEKLP+VVLK
Sbjct: 43   MPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVVLK 102

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLK+LWDRT DAINTII+ DE+TETG  LQPCIEAAL+LGCTPR
Sbjct: 103  AEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCTPR 162

Query: 1319 KTSRSQRNITPSYYLSP--------------ISPDKMTITSSRLQNSVLG--NHGTNQIM 1188
            +T RSQRN  P  YLSP               +P+ M   S  ++++++   + G+    
Sbjct: 163  RTLRSQRNCNPRCYLSPGTQEAENTTQANLTTNPNFMASYSGFMKSTIMNVTHLGSESQK 222

Query: 1187 SGYSDADKTSFS---------------CLGMPSHSP----AVYPLFFGD-LQPKDSKFNF 1068
                D++ T++                CL M  + P    +VYPL++G+ L+ ++ +  F
Sbjct: 223  HIAQDSNCTTYKFPFASENGPLPSNSQCLPMEKYPPPNLYSVYPLYYGNHLKFEEMQHGF 282

Query: 1067 DNCSRFNVGTPSKLSSHSMNASEIFGKQNPYGKAMASSKPI-EADINDTFGNSHEIGCDL 891
                    G   K  S+++  +++    N +   + SS  + + D+++T  N HE  CDL
Sbjct: 283  --------GIFPKSISNTVEPAKMGVIDNLFSSDVDSSNNMNQTDVSNTSNNPHENACDL 334

Query: 890  SLRLGCLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDK--SFFRNSYFCESVDT 717
            SLRLG L  P   +       + +  S + + N+  D +  IDK  S F  S   + +++
Sbjct: 335  SLRLGPLSIPCLSVGKSRPQVIEDTGSTSLEWNRFGDLTPSIDKMLSSFPRSNRDDPLNS 394

Query: 716  CSTDRIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRARNA 564
                  +E E +N+  T++++  +  P  D++F   PKLP+   T R ++A
Sbjct: 395  SLNRWSLEGEHVNVDATMRKRKTVYGPTVDQQFCLPPKLPYSHLTGRMKSA 445


>XP_012442361.1 PREDICTED: uncharacterized protein LOC105767390 [Gossypium raimondii]
            KJB55018.1 hypothetical protein B456_009G058400
            [Gossypium raimondii]
          Length = 399

 Score =  282 bits (722), Expect = 4e-86
 Identities = 175/405 (43%), Positives = 231/405 (57%), Gaps = 35/405 (8%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLIQEIFRVV+E+H  ATKKNKEWQEKLP VVLK
Sbjct: 1    MPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPDVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMD+KTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCT R
Sbjct: 61   AEEIMYSKANSEAEYMDIKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTAR 120

Query: 1319 KTSRSQRNITPSYYLSP-------------------ISPDKMTITSSRLQNSVLGNHGTN 1197
            +T RSQRN +P  YL+P                   ++ D   +  + +  + +G+    
Sbjct: 121  RTLRSQRNCSPRSYLNPGAQKAEGTTLGNLITNSHCMASDSSFLKHTTVNMTDMGSEAQK 180

Query: 1196 QI-MSGYSDADKTSFSCLGMP------SHSP---AVYPLFFGD---LQPKDSKFNFDNCS 1056
             I  +G    DK SF+    P       H P   +VYPLF+G+   ++ +   +     S
Sbjct: 181  HIAQNGNRGTDKFSFASNNSPLASNVEKHPPNTYSVYPLFYGNHLKVEEQRHGYGISPKS 240

Query: 1055 RFNVGTPSKLS-SHSMNASEIFGKQNPYGKAMASSKPIEADINDTFGNSHEIGCDLSLRL 879
              N   P+ +   HS+ + ++           +S+K  + D+ +T  N HEI CDLSLRL
Sbjct: 241  FSNTVEPAMMGVIHSLFSPDV----------DSSNKMNQTDVRNTSNNPHEIPCDLSLRL 290

Query: 878  GCLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDKSF--FRNSYFCESVDTCSTD 705
            G L TP          ++ N  S   + NK    +  ID+S      S     ++  S +
Sbjct: 291  GPLSTPCLSAGNSRHKEIKNTDSTFLEWNKFSYLTPPIDESLSSLPRSNRDAPLNPYSNE 350

Query: 704  RIMEAEKLNLKKTLKRKAVMDYPCEDERFSWHPKLPFRDYTWRAR 570
            R +E   +++  TL ++  +  P  D++F   PKLP  + T R +
Sbjct: 351  RNLEGGHMDVDATLSKRKTIYGPPVDQQFCLSPKLPCSELTGRMK 395


>XP_002324862.2 hypothetical protein POPTR_0018s01770g [Populus trichocarpa]
            EEF03427.2 hypothetical protein POPTR_0018s01770g
            [Populus trichocarpa]
          Length = 448

 Score =  284 bits (726), Expect = 5e-86
 Identities = 185/411 (45%), Positives = 227/411 (55%), Gaps = 37/411 (9%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E HCPATKKNKEWQEKLP+VVLK
Sbjct: 42   MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHCPATKKNKEWQEKLPVVVLK 101

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+ ETG  LQPCIEAAL+LGCTPR
Sbjct: 102  AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESLETGELLQPCIEAALNLGCTPR 161

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN-QIMSGYSDADKTSFSCLG 1143
            + SRSQRN    +YLSP + +  T++ + + N++  NH +N   +  YS+  K +     
Sbjct: 162  RASRSQRNCNLRFYLSPSTQESNTLSPAAVHNAIRANHISNSHCLRDYSNLVKPTI-MNS 220

Query: 1142 MPSHSPAVYPLFFGDLQPKDSKFNFDNCSRFNVG---------TPSKLSSHSMNASEIFG 990
             PS S +   +  G+       F  DN    NV           PS  S + +       
Sbjct: 221  APSGSESQDLVGQGNDTSNRFLFRSDNIPPSNVNRCLPLENYRIPSLCSVYPLYYGSCLE 280

Query: 989  KQNPYGK---------------AMASSKPIEADI---NDTFGNS-----HEIGCDLSLRL 879
             Q   G                A+ +  P   D        G+       EI CDLSLRL
Sbjct: 281  PQRGCGALPKTFPGTIEPVKVVAVQNFFPCNEDTPVRTSQVGHKDCLQPQEIECDLSLRL 340

Query: 878  GCLGTPREIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDK--SFFRNSYFCESVDTCST 708
            G +  P    +T+   D  +      Q G K  D   Q+DK  SFF      + VD   +
Sbjct: 341  GSILAPVPRAKTKQIKDAKDGGHDCSQEGGKFDDWMPQMDKELSFFPK---VDVVDPQVS 397

Query: 707  DRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARNAEP 558
                  E + +  T+ KRK V D+  ED++F W PKLP    T R ++  P
Sbjct: 398  HSSKSREHIIVDVTMKKRKLVFDHHVEDQQFLWQPKLPCNKLTGRMKSVGP 448


>XP_011036558.1 PREDICTED: uncharacterized protein LOC105134027 [Populus euphratica]
          Length = 407

 Score =  282 bits (721), Expect = 8e-86
 Identities = 180/419 (42%), Positives = 229/419 (54%), Gaps = 48/419 (11%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R YECVRRAWHSDRHQPIRGSLIQEIFR+V+E HC ATKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHCSATKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMDLKTLWDR  DAINTIIR DE+ ETG  LQPCIEAAL+LGCTPR
Sbjct: 61   AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESLETGELLQPCIEAALNLGCTPR 120

Query: 1319 KTSRSQRNITPSYYLSPISPDKMTITSSRLQNSVLGNHGTN------------------- 1197
            + SRSQRN    +YLSP + +  T++ + + N++  NH +N                   
Sbjct: 121  RASRSQRNCNLRFYLSPSTQESNTLSPAAVHNAIRANHISNSHCLRDYSNLVKPTIMNSA 180

Query: 1196 ------QIMSGYSDADKTSF-------------SCLGMPSHS----PAVYPLFFGD-LQP 1089
                  Q ++G  +     F              CL + ++      +VYPL++G  L+P
Sbjct: 181  PSGSESQDLAGQGNDTSNRFLFRTENIPPSNVNRCLPLENYRIPSLCSVYPLYYGSCLEP 240

Query: 1088 KDSKFNFDNCSRFNVGTPSKLSSHSMNASEIFGKQNPY-GKAMASSKPIEADINDTFGNS 912
            +              G P K    ++   ++   QN +        +  + D  D F   
Sbjct: 241  QR-----------GCGAPPKTVPGTIEPVKVAAVQNFFPSNGDFPVRTSQVDHKDCF-QP 288

Query: 911  HEIGCDLSLRLGCLGTPREIIETRIDNDLGNLSSRTPQ-GNKLIDSSIQIDKSFFRNSYF 735
             EI CDLSLRLG +  P    +T+   D  +      Q G K  D   Q+DK     S F
Sbjct: 289  QEIECDLSLRLGSILAPVPSAKTKQIKDAKDGGHDCSQEGGKFGDWMPQMDKEL---SCF 345

Query: 734  --CESVDTCSTDRIMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRARN 567
               + VD   +      E + +  T+ KRK V D+  ED++F W PKLP      R ++
Sbjct: 346  PKVDVVDPLVSHSSKSREHVTVDVTMKKRKLVFDHHVEDQQFLWQPKLPCNKLNGRMKS 404


>XP_017627999.1 PREDICTED: uncharacterized protein LOC108470968 [Gossypium arboreum]
            KHG13215.1 Histone acetyltransferase [Gossypium arboreum]
          Length = 396

 Score =  278 bits (710), Expect = 2e-84
 Identities = 177/405 (43%), Positives = 229/405 (56%), Gaps = 35/405 (8%)
 Frame = -1

Query: 1679 MPRPGARSYECVRRAWHSDRHQPIRGSLIQEIFRVVHEVHCPATKKNKEWQEKLPIVVLK 1500
            MPRPG R Y C RRAWHSDRHQP+RGSLI+EIFRVV+E+H  ATKKNKEWQEKLP+VVLK
Sbjct: 1    MPRPGPRPYVCERRAWHSDRHQPMRGSLIREIFRVVNEIHSSATKKNKEWQEKLPVVVLK 60

Query: 1499 AEEIMYSKANSEAEYMDLKTLWDRTIDAINTIIRLDETTETGVFLQPCIEAALHLGCTPR 1320
            AEEIMYSKANSEAEYMD+KTLWDRT DAINTIIR DE+TETG  LQPCIEAAL+LGCT R
Sbjct: 61   AEEIMYSKANSEAEYMDIKTLWDRTNDAINTIIRRDESTETGELLQPCIEAALNLGCTAR 120

Query: 1319 KTSRSQRNITPSYYLSPISPD-------------------------KMTITSSRLQNSVL 1215
            +T RSQRN +P  YL+  +                            MT   S  QN + 
Sbjct: 121  RTLRSQRNCSPRSYLNQKAEGTTQGNLITNSHCMASYSSFLKHTTMNMTDMGSEAQNHIA 180

Query: 1214 GNHGTNQIMSGYSDADKTSFSCLGMPSHSP---AVYPLFFGD---LQPKDSKFNFDNCSR 1053
             N  +N+    +     TS     +  H P   +VYPLF+G+   ++ +   +     S 
Sbjct: 181  QN--SNRGTDKFPFVSNTSPLASNVEKHPPNTYSVYPLFYGNHLKVEEQRHGYGISPKSF 238

Query: 1052 FNVGTPSKLS-SHSMNASEIFGKQNPYGKAMASSKPIEADINDTFGNSHEIGCDLSLRLG 876
             N   P+ +   HS+ + ++           +S+K  + D+ +T  N HEI CDLSLRLG
Sbjct: 239  SNKIEPAMMGVIHSLFSPDV----------DSSNKMNQTDVRNTSNNPHEIPCDLSLRLG 288

Query: 875  CLGTPREIIETRIDNDLGNLSSRTPQGNKLIDSSIQIDKSF--FRNSYFCESVDTCSTDR 702
             L TP          ++ N  S   + NK+   +  ID+S      S     ++  S +R
Sbjct: 289  PLSTPCLSAGNSRHKEIKNTDSTFLEWNKISYLTPPIDESLSSLPRSNRDAPLNPYSNER 348

Query: 701  IMEAEKLNLKKTL-KRKAVMDYPCEDERFSWHPKLPFRDYTWRAR 570
             +E   +N+  TL KRK +   P  D++F   PKLP  + T R +
Sbjct: 349  NLEGGHMNVDATLSKRKTIYGSPV-DQQFCLSPKLPCSELTGRMK 392


Top