BLASTX nr result

ID: Angelica27_contig00010578 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00010578
         (4285 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017222462.1 PREDICTED: trithorax group protein osa-like isofo...   811   0.0  
XP_017222464.1 PREDICTED: trithorax group protein osa-like isofo...   728   0.0  
KZM84839.1 hypothetical protein DCAR_027739 [Daucus carota subsp...   585   0.0  
KZM84843.1 hypothetical protein DCAR_027735 [Daucus carota subsp...   500   e-151
KZM84841.1 hypothetical protein DCAR_027737 [Daucus carota subsp...   393   e-112
EOY33856.1 Uncharacterized protein TCM_041704 isoform 7 [Theobro...   364   e-103
EOY33857.1 Uncharacterized protein TCM_041704 isoform 8 [Theobro...   361   e-103
XP_017982711.1 PREDICTED: chromatin modification-related protein...   366   e-101
EOY33851.1 Uncharacterized protein TCM_041704 isoform 2 [Theobro...   364   e-101
XP_002520450.1 PREDICTED: mediator of RNA polymerase II transcri...   339   1e-92
XP_018724276.1 PREDICTED: uncharacterized protein LOC104435304 [...   298   6e-89
XP_012068492.1 PREDICTED: uncharacterized protein LOC105631099 i...   319   7e-86
KDO66718.1 hypothetical protein CISIN_1g000597mg [Citrus sinensis]    317   3e-85
XP_012471229.1 PREDICTED: bromodomain-containing protein 4-like ...   312   9e-84
XP_006488440.1 PREDICTED: AT-rich interactive domain-containing ...   311   2e-83
XP_006424987.1 hypothetical protein CICLE_v10027683mg [Citrus cl...   311   3e-83
XP_008220075.1 PREDICTED: uncharacterized protein LOC103320208 [...   308   2e-82
XP_016740198.1 PREDICTED: AT-rich interactive domain-containing ...   305   5e-82
EOY33850.1 Uncharacterized protein TCM_041704 isoform 1 [Theobro...   306   8e-82
EOY33855.1 Uncharacterized protein TCM_041704 isoform 6 [Theobro...   306   9e-82

>XP_017222462.1 PREDICTED: trithorax group protein osa-like isoform X1 [Daucus carota
            subsp. sativus] XP_017222463.1 PREDICTED: trithorax group
            protein osa-like isoform X1 [Daucus carota subsp.
            sativus]
          Length = 1297

 Score =  811 bits (2094), Expect = 0.0
 Identities = 478/965 (49%), Positives = 556/965 (57%), Gaps = 86/965 (8%)
 Frame = +3

Query: 1386 VVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            VVP YQSH PVQP QQ++ A   YPM MHPS+G SFPPAAQFPQQ   + P   N SL N
Sbjct: 398  VVPGYQSHHPVQPQQQILPAPQHYPMPMHPSSG-SFPPAAQFPQQPPHLRPPPTNPSLPN 456

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            QQQ NL+  QSQ++GV                                          QA
Sbjct: 457  QQQANLMQSQSQIQGVPPAQHPHIYPQTPQQGYIGHQRPAGQPMQQPYQQYGQPPFPSQA 516

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
            SG V+GP HQ+PF  QPM  QS+ QGP QL Q + A P P  H SVQAHGMPP QP ++G
Sbjct: 517  SGSVQGPFHQIPFGQQPMQTQSQAQGPTQLQQSAVARPPPQMHGSVQAHGMPPQQPPSYG 576

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXL------------GQQF 2069
            GRP+APNQ   S PF QSGG FG     RP+                         GQQF
Sbjct: 577  GRPIAPNQTATSHPFAQSGGAFGGAPHSRPLPSSSVQQSEHQIFEGGIANQQQVPSGQQF 636

Query: 2070 SQSG------LGEENAAGQEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERR 2231
            SQS       +GE NAA Q  GSALN + G D+S    DSV  K   S+   +SGDEE  
Sbjct: 637  SQSDREIKHIMGEGNAAPQG-GSALNKTVGNDISGPEEDSVRAKAQDSEIRGKSGDEEHN 695

Query: 2232 IISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDP-LSGGNKIEAAALGERDGNV 2408
            I +EG   GT+     +V +AEVDA+K        G+ +P L    K +   L E DG+V
Sbjct: 696  ITTEGEKKGTRS----QVAEAEVDALK-------TGSSEPSLEKTGKEKTGTLNEMDGSV 744

Query: 2409 VRPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGRFQDK 2588
                     + KDST   TEA++G +KD+TN    ENKSSH QV+ QG A+GEY  F DK
Sbjct: 745  F--------AVKDSTSRQTEAFVGHKKDNTNVLANENKSSHGQVSQQGLAIGEYAGFHDK 796

Query: 2589 GFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQSGA----------------- 2717
            G  NSSN    TDQGR+ +P G YGP   QQR  MPS+ QSG                  
Sbjct: 797  GLPNSSNQAQLTDQGRYQMPSGTYGPPSQQQRHTMPSNSQSGPYVGAPPNALPGQGPAHL 856

Query: 2718 ---------------HPNE------------SFEGVPRRQHYQNNSTHSQPMFSRPLKAE 2816
                           HP+E            SF+GV R Q+YQNN    QP FSR  KAE
Sbjct: 857  KPQGPGLSGPLHQSLHPSEHFHQSGSSQSHESFQGVQRGQYYQNNPP--QPPFSRTNKAE 914

Query: 2817 PIEGSLHGPDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDAASG 2996
            P  G LHG D      NQR HH EGRYPDP+VSG+FDRG Y +Q LANE+RVPG  AA G
Sbjct: 915  PT-GPLHGSDNAGPLQNQRLHHLEGRYPDPNVSGSFDRGLY-EQTLANENRVPG--AALG 970

Query: 2997 LHVKSADGN---PFLSGPPGRSGQREYEDAPKQFPKPSVMGLEPSSNFGNGFSSRPGDYP 3167
            LH K+ + +    F +GP GR+ Q EYE A KQFPKP+        N GNG S R G+YP
Sbjct: 971  LHAKNVNDDHMKQFRTGPAGRNSQGEYEQALKQFPKPA--------NIGNG-SIRAGEYP 1021

Query: 3168 PHEFNFGAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFGV 3347
             HE     PS+FLPPY+S                            +QP FHGSGPGFGV
Sbjct: 1022 -HEHKSELPSKFLPPYNSD---------------------------SQPGFHGSGPGFGV 1053

Query: 3348 DHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVG 3527
            DH PPRSPGREFH +PSRGFG  SG P +Q GL D  HG  + AV+EG RS DIS DPVG
Sbjct: 1054 DHLPPRSPGREFHGIPSRGFGAQSGGPHNQPGL-DNVHGWGSHAVNEGPRSFDISSDPVG 1112

Query: 3528 KPFPEHFRNGNMGGQDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPGS--------- 3680
            K F +HFR+G+M GQDFIPNH+ RGELFGP+N PSH+R  +GFGT  DP           
Sbjct: 1113 KTFRDHFRSGDMAGQDFIPNHMRRGELFGPRNVPSHIRAVEGFGTFSDPRMGELNGHGGF 1172

Query: 3681 -----------NYPRIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRI 3827
                       N+PR+GEPG+RSS+SLH FP  GGF+ GN  S DRFRKRMP SMGWCRI
Sbjct: 1173 PYGESFAGNKLNHPRLGEPGFRSSFSLHEFPRPGGFYEGNLESIDRFRKRMPASMGWCRI 1232

Query: 3828 CRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQKNSKDHSSFEEGSRSRNAGNK 4007
            C+V+C+TVDGLDLHSQT EHQQR M+MV+SIK QNAKRQK SKD S  EEG RSRNAGN+
Sbjct: 1233 CKVNCDTVDGLDLHSQTPEHQQRTMEMVMSIK-QNAKRQKTSKDQSFVEEGIRSRNAGNR 1291

Query: 4008 GRGKK 4022
            GRGKK
Sbjct: 1292 GRGKK 1296



 Score =  356 bits (914), Expect = 9e-99
 Identities = 166/213 (77%), Positives = 179/213 (84%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECIVNI SLAGEYFCPVCRTLVYPNEALQSQCTHLYCK CLTY+VGTTKACPYDG
Sbjct: 1   MGFDNECIVNIQSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKLCLTYIVGTTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE+ SKPL+ESDKALAE+I KT VHCLFHRSGCSWQGPLS+CTSHCSGCSFGNSPVV
Sbjct: 61  YLVTEKDSKPLVESDKALAERIGKTPVHCLFHRSGCSWQGPLSECTSHCSGCSFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQI+HRQVQEHAQNCAGANP  QQTAE  KDAAT+V   T N SQ  SQ V  AS
Sbjct: 121 CNRCGVQIIHRQVQEHAQNCAGANPHVQQTAENPKDAATAVAVTTTNSSQATSQPVVSAS 180

Query: 678 HAAVSQIATAPPSVLNANPQVQANTSAAGTTAE 776
            A V Q  TAPP+  ++NP V    ++A  + E
Sbjct: 181 QALVPQTVTAPPATQDSNPHVHTIATSAAMSTE 213


>XP_017222464.1 PREDICTED: trithorax group protein osa-like isoform X2 [Daucus carota
            subsp. sativus]
          Length = 1267

 Score =  728 bits (1880), Expect = 0.0
 Identities = 450/965 (46%), Positives = 525/965 (54%), Gaps = 86/965 (8%)
 Frame = +3

Query: 1386 VVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            VVP YQSH PVQP QQ++ A   YPM MHPS+G SFPPAAQFPQQ   + P   N SL N
Sbjct: 398  VVPGYQSHHPVQPQQQILPAPQHYPMPMHPSSG-SFPPAAQFPQQPPHLRPPPTNPSLPN 456

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            QQQ NL+  QSQ++GV                                          QA
Sbjct: 457  QQQANLMQSQSQIQGVPPAQHPHIYPQTPQQGYIGHQRPAGQPMQQPYQQYGQPPFPSQA 516

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
            SG V+GP HQ+PF  QPM  QS+ QGP QL Q + A P P  H SVQAHGMPP QP ++G
Sbjct: 517  SGSVQGPFHQIPFGQQPMQTQSQAQGPTQLQQSAVARPPPQMHGSVQAHGMPPQQPPSYG 576

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRP------------IXXXXXXXXXXXXLGQQF 2069
            GRP+APNQ   S PF QSGG FG     RP            I             GQQF
Sbjct: 577  GRPIAPNQTATSHPFAQSGGAFGGAPHSRPLPSSSVQQSEHQIFEGGIANQQQVPSGQQF 636

Query: 2070 SQSG------LGEENAAGQEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERR 2231
            SQS       +GE NAA Q  GSALN + G D+S    DSV  K   S+   +SGDEE  
Sbjct: 637  SQSDREIKHIMGEGNAAPQG-GSALNKTVGNDISGPEEDSVRAKAQDSEIRGKSGDEEHN 695

Query: 2232 IISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDP-LSGGNKIEAAALGERDGNV 2408
            I +EG   GT+     +V +AEVDA+K        G+ +P L    K +   L E DG+V
Sbjct: 696  ITTEGEKKGTR----SQVAEAEVDALK-------TGSSEPSLEKTGKEKTGTLNEMDGSV 744

Query: 2409 VRPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGRFQDK 2588
                     + KDST   TEA++G +KD+TN    ENKSSH QV+ QG A+GEY  F DK
Sbjct: 745  F--------AVKDSTSRQTEAFVGHKKDNTNVLANENKSSHGQVSQQGLAIGEYAGFHDK 796

Query: 2589 GFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQSG------------------ 2714
            G  NSSN    TDQGR+ +P G YGP   QQR  MPS+ QSG                  
Sbjct: 797  GLPNSSNQAQLTDQGRYQMPSGTYGPPSQQQRHTMPSNSQSGPYVGAPPNALPGQGPAHL 856

Query: 2715 --------------AHP------------NESFEGVPRRQHYQNNSTHSQPMFSRPLKAE 2816
                           HP            +ESF+GV R Q+YQNN    QP FSR  KAE
Sbjct: 857  KPQGPGLSGPLHQSLHPSEHFHQSGSSQSHESFQGVQRGQYYQNNP--PQPPFSRTNKAE 914

Query: 2817 PIEGSLHGPDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDAASG 2996
            P  G LHG D      NQR HH EGRYPDP+VSG+FDRG Y +Q LANE+RVPG  AA G
Sbjct: 915  P-TGPLHGSDNAGPLQNQRLHHLEGRYPDPNVSGSFDRGLY-EQTLANENRVPG--AALG 970

Query: 2997 LHVKSADGN---PFLSGPPGRSGQREYEDAPKQFPKPSVMGLEPSSNFGNGFSSRPGDYP 3167
            LH K+ + +    F +GP GR+ Q EYE A KQFPKP        +N GNG         
Sbjct: 971  LHAKNVNDDHMKQFRTGPAGRNSQGEYEQALKQFPKP--------ANIGNG--------- 1013

Query: 3168 PHEFNFGAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFGV 3347
                                      +R  P GF  DH       R   +FHG       
Sbjct: 1014 -------------------------SIRAGP-GFGVDHLPPRSPGR---EFHG------- 1037

Query: 3348 DHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVG 3527
                          +PSRGFG  SG P +Q G LD  HG  + AV+EG RS DIS DPVG
Sbjct: 1038 --------------IPSRGFGAQSGGPHNQPG-LDNVHGWGSHAVNEGPRSFDISSDPVG 1082

Query: 3528 KPFPEHFRNGNMGGQDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPGS--------- 3680
            K F +HFR+G+M GQDFIPNH+ RGELFGP+N PSH+R  +GFGT  DP           
Sbjct: 1083 KTFRDHFRSGDMAGQDFIPNHMRRGELFGPRNVPSHIRAVEGFGTFSDPRMGELNGHGGF 1142

Query: 3681 -----------NYPRIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRI 3827
                       N+PR+GEPG+RSS+SLH FP  GGF+ GN  S DRFRKRMP SMGWCRI
Sbjct: 1143 PYGESFAGNKLNHPRLGEPGFRSSFSLHEFPRPGGFYEGNLESIDRFRKRMPASMGWCRI 1202

Query: 3828 CRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQKNSKDHSSFEEGSRSRNAGNK 4007
            C+V+C+TVDGLDLHSQT EHQQR M+MV+SIK QNAKRQK SKD S  EEG RSRNAGN+
Sbjct: 1203 CKVNCDTVDGLDLHSQTPEHQQRTMEMVMSIK-QNAKRQKTSKDQSFVEEGIRSRNAGNR 1261

Query: 4008 GRGKK 4022
            GRGKK
Sbjct: 1262 GRGKK 1266



 Score =  356 bits (914), Expect = 6e-99
 Identities = 166/213 (77%), Positives = 179/213 (84%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECIVNI SLAGEYFCPVCRTLVYPNEALQSQCTHLYCK CLTY+VGTTKACPYDG
Sbjct: 1   MGFDNECIVNIQSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKLCLTYIVGTTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE+ SKPL+ESDKALAE+I KT VHCLFHRSGCSWQGPLS+CTSHCSGCSFGNSPVV
Sbjct: 61  YLVTEKDSKPLVESDKALAERIGKTPVHCLFHRSGCSWQGPLSECTSHCSGCSFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQI+HRQVQEHAQNCAGANP  QQTAE  KDAAT+V   T N SQ  SQ V  AS
Sbjct: 121 CNRCGVQIIHRQVQEHAQNCAGANPHVQQTAENPKDAATAVAVTTTNSSQATSQPVVSAS 180

Query: 678 HAAVSQIATAPPSVLNANPQVQANTSAAGTTAE 776
            A V Q  TAPP+  ++NP V    ++A  + E
Sbjct: 181 QALVPQTVTAPPATQDSNPHVHTIATSAAMSTE 213


>KZM84839.1 hypothetical protein DCAR_027739 [Daucus carota subsp. sativus]
          Length = 1394

 Score =  585 bits (1509), Expect = 0.0
 Identities = 400/930 (43%), Positives = 490/930 (52%), Gaps = 52/930 (5%)
 Frame = +3

Query: 1386 VVPTYQSHPPVQPHQQLMQATPQ-YPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLT 1562
            V+P YQSHP VQP+ Q++QA+ Q Y M M PS+G   PP+AQFPQQS  + P Q N SL 
Sbjct: 557  VLPGYQSHPLVQPNYQMLQASQQHYSMPMQPSSG-PLPPSAQFPQQSPHIRPPQTNASLP 615

Query: 1563 NQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQ 1742
            +Q+Q      QSQ +G                                           Q
Sbjct: 616  SQEQ-----SQSQTQGFPPVHHPQHLLQGYIGHQRPAAPGGQPIQQHGQQSTLS-----Q 665

Query: 1743 ASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTH 1922
            AS  V       P A +PM    +PQGP Q   Q  A P PP H SVQAHGM P QP   
Sbjct: 666  ASISV-------PSALKPM----QPQGPIQ--SQQNARPPPPSHGSVQAHGMAPQQPPPS 712

Query: 1923 GGRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXL-GQQFSQSG------ 2081
            G RP  PNQ   S PFPQ       +S+P P+              G QFSQS       
Sbjct: 713  GSRPAVPNQTATSHPFPQCAP----HSRPLPLSSVQPSEQQQQVASGLQFSQSDREITHK 768

Query: 2082 LGEENAAGQEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERRIISEGGNNGT 2261
            LG  +AA Q VGS LN +AG DVS  G DSV +K   S+   +SGD E  I   G N G 
Sbjct: 769  LGGGSAAAQ-VGSTLNKTAGNDVSFPGEDSVRIKAFDSEIRGKSGDVEHNIRIVGENKGN 827

Query: 2262 QKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGN-KIEAAALGERDGNVVRPMQAEYSS 2438
            Q     +V +A VDA+K       +G+ +PL     K +AA   E  G+V         +
Sbjct: 828  QS----QVAEAVVDALK-------SGSSEPLMEKTVKEKAATPNEMHGSVF--------A 868

Query: 2439 GKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGRFQDKGFMNSSNSVP 2618
             KDST    E Y+G +KD++N    ENK SH QV+ QGPA+ +Y  F DKG   SSN  P
Sbjct: 869  VKDSTSRQRETYVGHKKDNSNVLAHENKLSHGQVSQQGPAMTQYSGFHDKGRPKSSNPTP 928

Query: 2619 HTDQGRHHLPPGPYGPSYHQQRPAMPSDFQSG---------------------------- 2714
             TDQGR+ +P GPYG S  QQR A+ S  QSG                            
Sbjct: 929  LTDQGRYQMPSGPYGLSSQQQRLAISSHSQSGSYIGAPPNALPGQGPAHLNLQGPGLSGP 988

Query: 2715 ------------AHPNESFEGVPRRQHYQNNSTHSQPMFSRPLKAEPIEGSLHGPDGVPM 2858
                        +H +ES +GV R Q+YQNNS+ SQP+FSR  KAEP  G LHG +    
Sbjct: 989  LHPIEHFRQSSSSHTHESLQGVQRGQYYQNNSS-SQPLFSRTNKAEP-TGPLHGSNNAGP 1046

Query: 2859 QHNQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDAASGLHVKSADG---NPF 3029
              N R  H EGRYPDP+VSG+F+RG + ++ LANE+RV G  AA GLHVK+ +    N F
Sbjct: 1047 LQNPRLRHLEGRYPDPNVSGSFNRGLH-EETLANENRVHG--AALGLHVKNVNDDHMNQF 1103

Query: 3030 LSGPPGRSGQREYEDAPKQFPKPSVMGLEPSSNFGNGFSSRPGDYPPHEFNFGAPSRFLP 3209
             +GP GR+GQ EYE A KQFPKP        +N GNG SS  GDY PHE      S+FLP
Sbjct: 1104 RTGPAGRNGQGEYEQALKQFPKP--------ANIGNG-SSEAGDY-PHEHISEFRSKFLP 1153

Query: 3210 PYHSGGAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFGVDHRPPRSPGREFHS 3389
            PY                           ++   P+FHGSGPGFGVDH PPRSP REF+ 
Sbjct: 1154 PY---------------------------TSNAPPEFHGSGPGFGVDHLPPRSPRREFYG 1186

Query: 3390 VPSRGFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGNMGG 3569
            +PS GFGG SG P +Q G LD  +G    A  EG RS  IS DP+G+ F +HF++G+M G
Sbjct: 1187 IPSHGFGGQSGGPHNQPG-LDNVNGWGPNAFPEGPRSFHISSDPIGRHFNDHFKSGDMAG 1245

Query: 3570 QDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPGSNYPRIGEPGYRSSYSLHGFPSDG 3749
            QDFIPNH+  G    P      L    GF  +     N  R+GEPG+RSS+     P  G
Sbjct: 1246 QDFIPNHMRVGFGTFPDPHMVELNGNGGFPYAQSYLGNL-RLGEPGFRSSFYHDKIPRPG 1304

Query: 3750 GFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQ 3929
            GF+ GN  S D FR RMP S   CRIC+V+C+ ++GLDLHS+T EH QR MDMV SIK  
Sbjct: 1305 GFYEGNVESSDSFRHRMPKSTVRCRICKVNCDGLEGLDLHSRTAEHLQRTMDMVTSIK-L 1363

Query: 3930 NAKRQKNSKDHSSFEEGSRSRNAGNKGRGK 4019
            +AKRQK  KD SS +EG + + AG + R K
Sbjct: 1364 HAKRQKIIKDRSSGQEGIKPKKAGKRRRKK 1393



 Score =  333 bits (853), Expect = 2e-90
 Identities = 159/213 (74%), Positives = 175/213 (82%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDN+CIVNI SLAGEYFC VCRTLVYP EALQSQCTHL+CKPCLTYVVGTTKACPYDG
Sbjct: 1   MGFDNDCIVNIQSLAGEYFCAVCRTLVYPTEALQSQCTHLFCKPCLTYVVGTTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVT++ SKPL+ESDKALAE+I KT VHCLFHRSGCSWQG LS+CTSH S C+FG SPVV
Sbjct: 61  YLVTDKDSKPLVESDKALAERIGKTPVHCLFHRSGCSWQGTLSECTSHRSDCAFGYSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+ ++HRQVQ HAQ CAGA    QQTA+ SKDAAT+V   TAN  Q +SQ VA  S
Sbjct: 121 CNRCGMLLLHRQVQNHAQICAGAKSHLQQTAQNSKDAATAVAVNTANSGQASSQPVAHTS 180

Query: 678 HAAVSQIATAPPSVLNANPQVQANTSAAGTTAE 776
            AAV QI  APP+   ANP VQA  S+AG TAE
Sbjct: 181 QAAVPQITVAPPTTQEANPYVQAIASSAGMTAE 213



 Score = 77.4 bits (189), Expect = 2e-10
 Identities = 45/82 (54%), Positives = 50/82 (60%)
 Frame = +3

Query: 531 QVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQASHAAVSQIATAP 710
           Q     QN  GA    QQTA+ SKDAAT+V   TAN  Q +SQ VA  S AAV QI  AP
Sbjct: 292 QAHSFVQN-QGAKSHLQQTAQNSKDAATAVAVNTANSGQASSQPVAHTSQAAVPQITMAP 350

Query: 711 PSVLNANPQVQANTSAAGTTAE 776
           P+   ANP VQA  S+AG TAE
Sbjct: 351 PTTQEANPYVQAIASSAGMTAE 372


>KZM84843.1 hypothetical protein DCAR_027735 [Daucus carota subsp. sativus]
          Length = 1224

 Score =  500 bits (1288), Expect = e-151
 Identities = 343/933 (36%), Positives = 440/933 (47%), Gaps = 54/933 (5%)
 Frame = +3

Query: 1386 VVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            VVP YQSH PVQP QQ++ A   YPM MHPS+G SFPPAAQFPQQ   + P   N SL N
Sbjct: 398  VVPGYQSHHPVQPQQQILPAPQHYPMPMHPSSG-SFPPAAQFPQQPPHLRPPPTNPSLPN 456

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            QQQ NL+  QSQ++GV                                          QA
Sbjct: 457  QQQANLMQSQSQIQGVPPAQHPHIYPQTPQQGYIGHQRPAGQPMQQPYQQYGQPPFPSQA 516

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
            SG V+GP HQ+PF  QPM  QS+ QGP QL Q + A P P  H SVQAHGMPP QP ++G
Sbjct: 517  SGSVQGPFHQIPFGQQPMQTQSQAQGPTQLQQSAVARPPPQMHGSVQAHGMPPQQPPSYG 576

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSGLGEENAAG 2105
            GRP+APNQ   S PF QSGG FG     RP+               Q S+  + E   A 
Sbjct: 577  GRPIAPNQTATSHPFAQSGGAFGGAPHSRPLPSSSV----------QQSEHQIFEGGIAN 626

Query: 2106 QEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERRIISEG-----GNNGTQKG 2270
            Q+                       +V +     +S  E + I+ EG     G +   K 
Sbjct: 627  QQ-----------------------QVPSGQQFSQSDREIKHIMGEGNAAPQGGSALNKT 663

Query: 2271 IMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERDGNVVRPMQAEYSSGKDS 2450
            +   +   E D+++   ++     +   SG  +      GE+ G   +  +AE  + K  
Sbjct: 664  VGNDISGPEEDSVRAKAQDS---EIRGKSGDEEHNITTEGEKKGTRSQVAEAEVDALKTG 720

Query: 2451 TLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGRFQDKGFMNSSNSVPHTDQ 2630
            +  P+    G+ K  T +    +  + +  T +        +  +   + + N   H   
Sbjct: 721  SSEPSLEKTGKEKTGTLNEMDGSVFAVKDSTSRQTEAFVGHKKDNTNVLANENKSSHGQV 780

Query: 2631 GRHHLPPGPYGPSYHQQRPAMPSDFQSGAHPNESFEGVPRRQHYQNNSTHSQPMFSRPLK 2810
             +  L  G Y                +G H     +G+P       NS++   +  +   
Sbjct: 781  SQQGLAIGEY----------------AGFHD----KGLP-------NSSNQAQLTDQGRY 813

Query: 2811 AEPIEGSLHGPDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDAA 2990
              P     +GP     +H    +   G Y                   A  + +PG   A
Sbjct: 814  QMP--SGTYGPPSQQQRHTMPSNSQSGPYVG-----------------APPNALPGQGPA 854

Query: 2991 SGLHVKSADGNPFLSGPPGRS--GQREYEDAPKQFPKPSVMGLEPSSNFGNGFSSRPGDY 3164
               H+K     P LSGP  +S      +  +       S  G++     G  + + P   
Sbjct: 855  ---HLKPQ--GPGLSGPLHQSLHPSEHFHQSGSSQSHESFQGVQR----GQYYQNNPPQP 905

Query: 3165 PPHEFNFGAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSAR-TQPDFHGS---G 3332
            P    N   P+         G  H +D      G  ++ R      R   P+  GS   G
Sbjct: 906  PFSRTNKAEPT---------GPLHGSD----NAGPLQNQRLHHLEGRYPDPNVSGSFDRG 952

Query: 3333 PGFGVDHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDIS 3512
            PGFGVDH PPRSPGREFH +PSRGFG  SG P +Q GL D  HG  + AV+EG RS DIS
Sbjct: 953  PGFGVDHLPPRSPGREFHGIPSRGFGAQSGGPHNQPGL-DNVHGWGSHAVNEGPRSFDIS 1011

Query: 3513 LDPVGKPFPEHFRNGNMGGQDFIPNHLHRGELFGPKNGPSHLRVG--------------- 3647
             DPVGK F +HFR+G+M GQDFIPNH+ RGELFGP+N PSH+R                 
Sbjct: 1012 SDPVGKTFRDHFRSGDMAGQDFIPNHMRRGELFGPRNVPSHIRAVXPNHMRRGELFGPRN 1071

Query: 3648 --------DGFGTSLDPGS--------------------NYPRIGEPGYRSSYSLHGFPS 3743
                    +GFGT  DP                      N+PR+GEPG+RSS+SLH FP 
Sbjct: 1072 VPSHIRAVEGFGTFSDPRMGELNGHGGFPYGESFAGNKLNHPRLGEPGFRSSFSLHEFPR 1131

Query: 3744 DGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIK 3923
             GGF+ GN  S DRFRKRMP SMGWCRIC+V+C+TVDGLDLHSQT EHQQR M+MV+SIK
Sbjct: 1132 PGGFYEGNLESIDRFRKRMPASMGWCRICKVNCDTVDGLDLHSQTPEHQQRTMEMVMSIK 1191

Query: 3924 QQNAKRQKNSKDHSSFEEGSRSRNAGNKGRGKK 4022
             QNAKRQK SKD S  EEG RSRNAGN+GRGKK
Sbjct: 1192 -QNAKRQKTSKDQSFVEEGIRSRNAGNRGRGKK 1223



 Score =  356 bits (914), Expect = 3e-99
 Identities = 166/213 (77%), Positives = 179/213 (84%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECIVNI SLAGEYFCPVCRTLVYPNEALQSQCTHLYCK CLTY+VGTTKACPYDG
Sbjct: 1   MGFDNECIVNIQSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKLCLTYIVGTTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE+ SKPL+ESDKALAE+I KT VHCLFHRSGCSWQGPLS+CTSHCSGCSFGNSPVV
Sbjct: 61  YLVTEKDSKPLVESDKALAERIGKTPVHCLFHRSGCSWQGPLSECTSHCSGCSFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQI+HRQVQEHAQNCAGANP  QQTAE  KDAAT+V   T N SQ  SQ V  AS
Sbjct: 121 CNRCGVQIIHRQVQEHAQNCAGANPHVQQTAENPKDAATAVAVTTTNSSQATSQPVVSAS 180

Query: 678 HAAVSQIATAPPSVLNANPQVQANTSAAGTTAE 776
            A V Q  TAPP+  ++NP V    ++A  + E
Sbjct: 181 QALVPQTVTAPPATQDSNPHVHTIATSAAMSTE 213


>KZM84841.1 hypothetical protein DCAR_027737 [Daucus carota subsp. sativus]
          Length = 1158

 Score =  393 bits (1010), Expect = e-112
 Identities = 326/918 (35%), Positives = 425/918 (46%), Gaps = 40/918 (4%)
 Frame = +3

Query: 1386 VVPTYQSHPPVQPHQQLMQATPQ-YPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLT 1562
            V+P YQSHP VQP+ Q++QA  Q Y M M PS+G   PP A FPQQS  + P Q   SL 
Sbjct: 397  VLPGYQSHPLVQPNYQMLQAPQQHYSMPMQPSSG-PLPPPAHFPQQSPHIRPPQTYASLP 455

Query: 1563 NQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQ 1742
            NQQQP+ +  QSQ++G                                           +
Sbjct: 456  NQQQPSQMQSQSQIQGFPPVHPQNLLQGYIGHH--------------------------R 489

Query: 1743 ASGPVRGPMHQLPFANQPMPVQS-----------RPQGPNQLLQQSGALPAPPPHNSVQA 1889
             + P   P+ Q     QP P Q+           +PQGP Q   Q  A P PP H SVQA
Sbjct: 490  PAAPAGQPIQQ--HGQQPTPSQASISVPSALKPMQPQGPIQ--SQQNARP-PPSHGSVQA 544

Query: 1890 HGMPPHQPQTHGGRPVAPNQATASQPFPQSGGTFG--VNSQPRPIXXXXXXXXXXXXL-G 2060
            HGM P QP  +G RP APNQ  +S PFPQSGGTFG   +SQP P+              G
Sbjct: 545  HGMAPQQPPFYGSRPAAPNQTASSHPFPQSGGTFGGAPHSQPLPLSSVQPSEQQQQVATG 604

Query: 2061 QQFSQSG------LGEENAAGQEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDE 2222
             QFSQS       LG  NAA Q VGS LN +AGKDVS  G DSV +K   S+   +SGD 
Sbjct: 605  LQFSQSDREITHKLGGGNAAAQ-VGSTLNKTAGKDVSFPGEDSVKIKAFDSEIRGKSGDV 663

Query: 2223 ERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGN-KIEAAALGERD 2399
            E  I   G N    KG   +V +A +DA K       +G+ +PL     K + A   E  
Sbjct: 664  EHNIKIVGEN----KGNWSQVAEAVIDASK-------SGSSEPLMEKTVKEKTATPNEMH 712

Query: 2400 GNVVRPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGRF 2579
            G+V   M                        D+ SR +E  + H++              
Sbjct: 713  GSVFAVM------------------------DSTSRQRETYAGHKR-------------- 734

Query: 2580 QDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQSGAHPNESFEGVPRRQH 2759
                  ++SN + H ++               QQ PA+     SG H     +G+P    
Sbjct: 735  ------DNSNVLAHENKLSQE--------QVSQQGPAITQ--YSGFHD----KGLPI--- 771

Query: 2760 YQNNSTHSQPMFSRPLKAEPIEGSLHGPDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQY 2939
                S+ S P+  R     P      G  G+P Q  +         P    SG++ RG  
Sbjct: 772  ----SSSSTPLTDRSRYQMP-----SGTYGLPSQQQRHT------MPSNSQSGSY-RG-- 813

Query: 2940 GQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRS--GQREYEDAPKQFPKPSVMGL 3113
                 A  + +PG   A  L+++     P LSGP  +S      +  +       S+ G+
Sbjct: 814  -----APPNALPGQGPAR-LNLQG----PGLSGPLQQSLHPSEHFRQSSSSHTHESLQGV 863

Query: 3114 EPSSNFGNGFSSRPGDYPPHEFNFGAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARG 3293
            +    + +  SS+P         F   ++  P     G  H         G N     + 
Sbjct: 864  QRGQYYQDNSSSQP--------LFLRTNKAEPT----GPLH---------GSNNAGPLQN 902

Query: 3294 FSARTQPDFHGSGPGFGVDHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDT 3473
               R   +FHGSGPGFGVD+ PPRSPGREF+ +PS GFGG SG P  Q GL D  +G  +
Sbjct: 903  PRLRHLEEFHGSGPGFGVDYPPPRSPGREFYGIPSHGFGGQSGGPHDQPGL-DNVNGWGS 961

Query: 3474 RAVHEGSRSSDISLDPVGKPFPEHFRNGNMGGQDFIPNHLHRGELFGPKNGPSHLRVGDG 3653
             A  EG RS  IS  PVG+ F +HF++G+M GQDFIPNH+  GE F P++ PSH+   +G
Sbjct: 962  NAFPEGPRSFHISSGPVGRNFSDHFKSGDMAGQDFIPNHMRVGERFCPRDVPSHISAVEG 1021

Query: 3654 FGTSLDP-------GSNYP---------RIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDR 3785
            FGT  DP          +P         R GEPG+RS Y+       GGF+ GN  S D 
Sbjct: 1022 FGTFPDPRMLELNGNGGFPFAESYLGNSRPGEPGFRSFYN-DEISRPGGFYEGNVESIDS 1080

Query: 3786 FRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQKNSKDHS 3965
            FR RM  S   C+IC+V+C+ ++GLDLHS+T EH QR MDMV SIK  +AKRQK  KD S
Sbjct: 1081 FRLRMSNSTVRCQICKVNCDGLEGLDLHSRTAEHLQRTMDMVTSIK-LHAKRQKILKDRS 1139

Query: 3966 SFEEGSRSRNAGNKGRGK 4019
            S +EG + + AG + R K
Sbjct: 1140 SGQEGIKPKKAGKRRRKK 1157



 Score =  329 bits (844), Expect = 3e-90
 Identities = 158/213 (74%), Positives = 175/213 (82%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDN+CIVNI SLAGEYFC VCRTLVYP EALQSQCTHL+CK CLTYVVGTTKACPYDG
Sbjct: 1   MGFDNDCIVNIQSLAGEYFCAVCRTLVYPTEALQSQCTHLFCKLCLTYVVGTTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVT++ SKPL+ESDKALAE+I KT VHCLFHRSGCSWQG LS+CTSH S C+FG SPVV
Sbjct: 61  YLVTDKDSKPLVESDKALAERIGKTPVHCLFHRSGCSWQGTLSECTSHRSDCAFGYSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+ ++HRQVQ+HAQ CAGA    QQTA+ SKDAAT+V   TAN  Q +SQ VA  S
Sbjct: 121 CNRCGMLLLHRQVQKHAQICAGAKSHLQQTAQNSKDAATAVAVNTANSGQASSQPVAHTS 180

Query: 678 HAAVSQIATAPPSVLNANPQVQANTSAAGTTAE 776
            AAV QI  APP+   ANP VQA  S+AG TAE
Sbjct: 181 QAAVPQITVAPPTTQEANPYVQAIASSAGMTAE 213


>EOY33856.1 Uncharacterized protein TCM_041704 isoform 7 [Theobroma cacao]
          Length = 975

 Score =  364 bits (934), Expect = e-103
 Identities = 319/1007 (31%), Positives = 436/1007 (43%), Gaps = 127/1007 (12%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 17   HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 75

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 76   PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 135

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 136  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 192

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 193  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 252

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 253  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 312

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 313  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 367

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD   L T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 368  DVHNGEHKVEEIKIKDGPSLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 424

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 425  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 484

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 485  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 544

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 545  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRQLDPRASGLDST 604

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S         N F      R  + ++E+  K FP+PS + 
Sbjct: 605  STFS---LRGERLKPVQDECS---------NQFPLDRGHRGDRGQFEEDLKHFPRPSHLD 652

Query: 3111 LEPSSNFGNGF-SSRPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+   SSRP D  PH F                         PSRFLPPY   
Sbjct: 653  NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPY--- 709

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
               HP+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 710  ---HPDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 759

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 760  PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 808

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 809  MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 868

Query: 3705 GYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTE 3884
            G+RSS+SL  FP+DGG + G  +SF+  RKR P SMGWCRIC++DCETV+GLDLHSQT E
Sbjct: 869  GFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTRE 928

Query: 3885 HQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            HQ+ AMDMV++IK QNAK+QK  S DHS   + S+S+N   +GR  K
Sbjct: 929  HQKMAMDMVVTIK-QNAKKQKLTSSDHSIRNDTSKSKNVKFEGRVNK 974


>EOY33857.1 Uncharacterized protein TCM_041704 isoform 8 [Theobroma cacao]
          Length = 972

 Score =  361 bits (927), Expect = e-103
 Identities = 318/1006 (31%), Positives = 435/1006 (43%), Gaps = 126/1006 (12%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 17   HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 75

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 76   PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 135

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 136  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 192

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 193  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 252

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 253  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 312

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 313  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 367

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD   L T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 368  DVHNGEHKVEEIKIKDGPSLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 424

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 425  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 484

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 485  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 544

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 545  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRQLDPRASGLDST 604

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S         N F      R  + ++E+  K FP+PS + 
Sbjct: 605  STFS---LRGERLKPVQDECS---------NQFPLDRGHRGDRGQFEEDLKHFPRPSHLD 652

Query: 3111 LEPSSNFGNGF-SSRPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+   SSRP D  PH F                         PSRFLPPY   
Sbjct: 653  NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPY--- 709

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
               HP+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 710  ---HPDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 759

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 760  PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 808

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 809  MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 868

Query: 3705 GYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTE 3884
            G+RSS+SL  FP+DGG + G  +SF+  RKR P SMGWCRIC++DCETV+GLDLHSQT E
Sbjct: 869  GFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTRE 928

Query: 3885 HQQRAMDMVISIKQQNAKRQKNSKDHSSFEEGSRSRNAGNKGRGKK 4022
            HQ+ AMDMV++IK QNAK+QK   DHS   + S+S+N   +GR  K
Sbjct: 929  HQKMAMDMVVTIK-QNAKKQK--LDHSIRNDTSKSKNVKFEGRVNK 971


>XP_017982711.1 PREDICTED: chromatin modification-related protein eaf-1 [Theobroma
            cacao]
          Length = 1408

 Score =  366 bits (939), Expect = e-101
 Identities = 322/1007 (31%), Positives = 437/1007 (43%), Gaps = 127/1007 (12%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 450  HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 508

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 509  PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 568

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 569  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 625

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 626  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 685

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 686  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 745

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 746  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 800

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD  LL T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 801  DVHNGEHKVEEIKIKDGPLLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 857

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 858  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 917

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 918  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 977

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 978  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRRLDPRASGLDST 1037

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S          P   G  G  GQ  +E+  K FP+PS + 
Sbjct: 1038 STFS---LRGERLKPVQDECSNQF-------PLDRGHRGDRGQ--FEEDLKHFPRPSHLD 1085

Query: 3111 LEPSSNFGNGFSS-RPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+  SS RP D  PH F                         PSRFLPPYH  
Sbjct: 1086 NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPYH-- 1143

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
                P+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 1144 ----PDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 1192

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 1193 PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 1241

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 1242 MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 1301

Query: 3705 GYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTE 3884
            G+RSS+SL  FP+DGG + G  +SF+  RKR P SMGWCRIC++DCETV+GLDLHSQT E
Sbjct: 1302 GFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTRE 1361

Query: 3885 HQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            HQ+ AMDMV++IK QNAK+QK  S DHS   + S+S+N   +GR  K
Sbjct: 1362 HQKMAMDMVVTIK-QNAKKQKLTSSDHSIRNDTSKSKNVKFEGRVNK 1407



 Score =  306 bits (784), Expect = 1e-81
 Identities = 149/219 (68%), Positives = 169/219 (77%), Gaps = 6/219 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+K LA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKMLADTIGKITVHCLYHRSGCTWQGPLSECTAHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC    PQAQQ A+  +D A + TTA A+ +Q  SQ     S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPSVQPQAQQ-AKGGQDTAATGTTA-ADQAQIASQTGTATS 178

Query: 678 HAAVSQIATA------PPSVLNANPQVQANTSAAGTTAE 776
            A  SQ  T+      P    N NP+ QA + AA  T+E
Sbjct: 179 QAQASQTTTSGTPGQEPNQQANPNPRSQAVSQAAAMTSE 217


>EOY33851.1 Uncharacterized protein TCM_041704 isoform 2 [Theobroma cacao]
            EOY33852.1 Uncharacterized protein TCM_041704 isoform 2
            [Theobroma cacao] EOY33853.1 Uncharacterized protein
            TCM_041704 isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  364 bits (934), Expect = e-101
 Identities = 319/1007 (31%), Positives = 436/1007 (43%), Gaps = 127/1007 (12%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 450  HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 508

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 509  PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 568

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 569  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 625

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 626  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 685

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 686  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 745

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 746  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 800

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD   L T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 801  DVHNGEHKVEEIKIKDGPSLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 857

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 858  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 917

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 918  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 977

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 978  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRQLDPRASGLDST 1037

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S         N F      R  + ++E+  K FP+PS + 
Sbjct: 1038 STFS---LRGERLKPVQDECS---------NQFPLDRGHRGDRGQFEEDLKHFPRPSHLD 1085

Query: 3111 LEPSSNFGNGF-SSRPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+   SSRP D  PH F                         PSRFLPPY   
Sbjct: 1086 NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPY--- 1142

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
               HP+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 1143 ---HPDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 1192

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 1193 PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 1241

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 1242 MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 1301

Query: 3705 GYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTE 3884
            G+RSS+SL  FP+DGG + G  +SF+  RKR P SMGWCRIC++DCETV+GLDLHSQT E
Sbjct: 1302 GFRSSFSLQEFPNDGGIYTGGMDSFENLRKRKPMSMGWCRICKIDCETVEGLDLHSQTRE 1361

Query: 3885 HQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            HQ+ AMDMV++IK QNAK+QK  S DHS   + S+S+N   +GR  K
Sbjct: 1362 HQKMAMDMVVTIK-QNAKKQKLTSSDHSIRNDTSKSKNVKFEGRVNK 1407



 Score =  306 bits (784), Expect = 1e-81
 Identities = 149/219 (68%), Positives = 169/219 (77%), Gaps = 6/219 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+K LA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKMLADTIGKITVHCLYHRSGCTWQGPLSECTAHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC    PQAQQ A+  +D A + TTA A+ +Q  SQ     S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPSVQPQAQQ-AKGGQDTAATGTTA-ADQAQIASQTGTATS 178

Query: 678 HAAVSQIATA------PPSVLNANPQVQANTSAAGTTAE 776
            A  SQ  T+      P    N NP+ QA + AA  T+E
Sbjct: 179 QAQASQTTTSGTPGQEPNQQANPNPRSQAVSQAAAMTSE 217


>XP_002520450.1 PREDICTED: mediator of RNA polymerase II transcription subunit 12
            [Ricinus communis] XP_015575503.1 PREDICTED: mediator of
            RNA polymerase II transcription subunit 12 [Ricinus
            communis] EEF41863.1 hypothetical protein RCOM_0731250
            [Ricinus communis]
          Length = 1329

 Score =  339 bits (869), Expect = 1e-92
 Identities = 303/973 (31%), Positives = 423/973 (43%), Gaps = 93/973 (9%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLT 1562
            H V  + S+P  QP QQL     Q+P+H   + GG   P  QFPQQS  + P Q++V + 
Sbjct: 426  HAVTGHHSYPQPQPQQQLQLGGLQHPVHY--AQGG---PQPQFPQQSPLLRPPQSHVPVQ 480

Query: 1563 NQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL- 1739
            N QQ  L+P   Q+  V                                           
Sbjct: 481  NPQQSGLLPSPGQVPNVPPAQQQPVQAHAQQPGLPVHQLPVMQSVQQPIHQQYVQQQPPF 540

Query: 1740 --QASGPVRGPMHQL-PFANQPMPVQS--RPQGPNQLLQQSGALPAPPPHNSVQAHGMPP 1904
              QA GPV+  +HQ   +  Q +   S  RPQGP+    Q       P  N    HG   
Sbjct: 541  PGQALGPVQNQVHQQGAYMQQHLHGHSQLRPQGPSHAYTQ-------PLQNVPLPHGTQA 593

Query: 1905 HQPQTHGGRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSGL 2084
            HQ Q  GGRP        + P P S     V  Q RP+                 +Q  L
Sbjct: 594  HQAQNLGGRPP---YGVPTYPHPHSS----VGMQVRPMQVGADQQSGNAFRAN--NQMQL 644

Query: 2085 GEENAAGQEVGSALNNSAGKDV--SNAGVDSVGVKVL---------ASDTGKESGDEERR 2231
              E  +G  +    +N  G D+   ++  DS   K +         AS  G +  D  + 
Sbjct: 645  SSEQPSGA-ISRPTSNRQGDDIIEKSSEADSSSQKNVRRDPNDLDVASGLGSDVSDL-KT 702

Query: 2232 IISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERDGNVV 2411
            +ISE              D+  ++ +K++ ++  +   D  +  N  +A   G +DG V+
Sbjct: 703  VISESNLKPVDD------DNKSINEVKEEPKKGNDDQKDISNTDN--DAEDKGVKDGPVM 754

Query: 2412 RPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQ-GPAVGEYGRFQDK 2588
            +           +  LP   ++    +D + ++Q  ++    VTPQ       +G+ Q +
Sbjct: 755  K-----------NRPLPEAEHL----EDQSMKSQRGRN----VTPQHSGGFILHGQVQGE 795

Query: 2589 GFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA--------MPSDFQSGAHPNESFEGV 2744
            G    S+S+P  +QG+   P  P+GPS  QQRP          P     G  P      V
Sbjct: 796  GLAQPSHSIPIAEQGKQQPPVIPHGPSALQQRPIGSSLLTAPPPGSLHHGQIPGHPSARV 855

Query: 2745 -----------PRRQHYQNNSTHSQPMFSRPLKAEPIEGSLHGPDGVPMQH--------- 2864
                       P           S P+  R      ++G+      +P Q          
Sbjct: 856  RPLGPGHIPHGPEVSSAGMTGLGSTPITGRGGSHYGLQGTYTQGHALPSQADRTPYGHDT 915

Query: 2865 ----NQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDAASGLHVKS------A 3014
                NQRP++ +G+  DP    +   G +   +  N +  PG D++S L ++       +
Sbjct: 916  DMFANQRPNYTDGKRLDPLGQQS---GMHSNAMRMNGA--PGMDSSSALGLRDDRFRPFS 970

Query: 3015 DG--NPFLSGPPGRS-GQREYEDAPKQFPKPSVMGLEPSSNFGNGFSS-RPGDYPP---- 3170
            D   NPF   P  R   +RE+E+  K F +PS +  + ++ FG  FSS RP D  P    
Sbjct: 971  DEYMNPFPKDPSQRIVDRREFEEDLKHFSRPSDLDTQSTTKFGANFSSSRPLDRGPLDKG 1030

Query: 3171 -HEFNFGA-----------PSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARG-FSARTQ 3311
             H  N+ +           PSRF PPYH  G  HPND+ ER  GF+++   R   S R  
Sbjct: 1031 LHGPNYDSGMKLESLGGPPPSRFFPPYHHDGLMHPNDIAERSIGFHDNTLGRQPDSVRAH 1090

Query: 3312 PDFHGSGPGFGVDHRP---PRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRAV 3482
            P+F G G  +   HR    PRSPGR++  V SRGFG I G        LD   GR++R  
Sbjct: 1091 PEFFGPGRRYDRRHRDGMAPRSPGRDYPGVSSRGFGAIPG--------LDDIDGRESRRF 1142

Query: 3483 HEGSRSSDISLDPVGKPFPEHFRNGNMGG--QDFIPNHLHRGELFGPKNGPSHLRVGDGF 3656
             +    S   +       P H R G   G  QD   NH  RGE  G  N  + L    GF
Sbjct: 1143 GDSFHGSRFPV------LPSHMRMGEFEGPSQDGFSNHFRRGEHLGHHNMRNRLGEPIGF 1196

Query: 3657 GTSLDPGS--------NY--PRIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPT 3806
            G    P          N+  PR+GEPG+RSS+S  GFP DGG +AG   SFD  R+R  +
Sbjct: 1197 GAFPGPAGMGDLSGTGNFFNPRLGEPGFRSSFSFKGFPGDGGIYAGELESFDNSRRRKSS 1256

Query: 3807 SMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGS 3983
            SMGWCRIC+VDCETV+GLDLHSQT EHQ+RAMDMV++IK QNAK+QK  + DHSS ++ S
Sbjct: 1257 SMGWCRICKVDCETVEGLDLHSQTREHQKRAMDMVVTIK-QNAKKQKLANNDHSSVDDAS 1315

Query: 3984 RSRNAGNKGRGKK 4022
            +S+N   +GRG K
Sbjct: 1316 KSKNTSIEGRGNK 1328



 Score =  293 bits (749), Expect = 2e-77
 Identities = 143/206 (69%), Positives = 161/206 (78%), Gaps = 3/206 (1%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCL+YVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLSYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL ES+KALAE I K  V+CL+HRSGC+WQGPLS+CTSHCS C+FGNSPVV
Sbjct: 61  YLVTEADSKPLSESNKALAETIGKITVYCLYHRSGCTWQGPLSECTSHCSECAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQIVHRQVQEHAQNC G  PQA   AE +KDAA + T A  + +Q  +Q  A  +
Sbjct: 121 CNRCGVQIVHRQVQEHAQNCPGVQPQAH--AEGAKDAAVTGTPAAGDQNQAATQ--AATT 176

Query: 678 HAAVSQIATAPP---SVLNANPQVQA 746
            A     A++ P   S   ANP  Q+
Sbjct: 177 SATTQTTASSTPGQGSNQQANPTTQS 202


>XP_018724276.1 PREDICTED: uncharacterized protein LOC104435304 [Eucalyptus
           grandis]
          Length = 235

 Score =  298 bits (763), Expect = 6e-89
 Identities = 145/217 (66%), Positives = 167/217 (76%), Gaps = 4/217 (1%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCL Y+ G+TKACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLAYIAGSTKACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPLIES+KALAE I K  VHCL+HRSGC+WQGPLS+C +HC+GC+FGNSPVV
Sbjct: 61  YLVTEADSKPLIESNKALAETISKIPVHCLYHRSGCTWQGPLSECVTHCAGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVA--Q 671
           CNRCG+QIVHRQVQEHAQ+C G +PQ  Q AE S++A+ + TT       TN    A  Q
Sbjct: 121 CNRCGIQIVHRQVQEHAQSCPGVHPQV-QPAEGSQEASGTNTTTAGQGQSTNQAGTASSQ 179

Query: 672 ASHAAVSQIATAPPSV--LNANPQVQANTSAAGTTAE 776
           A  A  SQ AT   S   +++   VQA++ AA  T E
Sbjct: 180 APEAQTSQTATPATSAQQISSTSLVQADSQAATLTPE 216


>XP_012068492.1 PREDICTED: uncharacterized protein LOC105631099 isoform X1 [Jatropha
            curcas] XP_012068499.1 PREDICTED: uncharacterized protein
            LOC105631099 isoform X1 [Jatropha curcas] KDP46575.1
            hypothetical protein JCGZ_08547 [Jatropha curcas]
          Length = 1364

 Score =  319 bits (817), Expect = 7e-86
 Identities = 286/986 (29%), Positives = 412/986 (41%), Gaps = 108/986 (10%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFPP-------AAQFPQQSLQMPPQ 1541
            + V  + S+   QP QQ+    PQ+ + M+P  GG  P          QFPQQ   + P 
Sbjct: 427  NAVTGHHSYSQPQPQQQVQLGGPQHAVLMYPQ-GGPHPQNQHPIQMPGQFPQQPPFLRPP 485

Query: 1542 QANVSLTNQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
            Q++V + N QQP L+P   Q+  V                                    
Sbjct: 486  QSHVPVQNSQQPGLLPSPGQVPNVPPAQQQPVQSHPHQPGLPVHQRPVLQSVQQQFHQQH 545

Query: 1722 XXXXXL-QASGPVRGPMHQL-PFANQPMPVQS--RPQGPNQLLQQSGALPAPPPHNSVQA 1889
                   QA GPV+  M Q   +  QP+ +QS  RP G  Q    S A P   PH     
Sbjct: 546  SQQPFSGQALGPVQNQMPQQGTYLQQPLHMQSQLRPHGLPQSFPPSHAYPQSIPH----- 600

Query: 1890 HGMPPHQPQTHGGRPVAPNQATASQPFP-----------------QSGGTFGVNSQPRPI 2018
             G PP+Q Q  GGRP+ P    +++P P                 QSG     N+Q +  
Sbjct: 601  -GTPPYQAQNLGGRPMMPPYGVSTKPHPPAPVGMQARAMQGGLGQQSGNALRTNNQDQLA 659

Query: 2019 XXXXXXXXXXXXLGQQFSQSGLGEENAAGQEVGSALNNSAGKD------VSNAGVDSVGV 2180
                          +Q  Q         G E  S  + +A +D       +  G DS  V
Sbjct: 660  SEQHTGATSRPMPERQGDQI-----IDKGSEAESTSHKNAKRDPYDLDVAAGIGADSGEV 714

Query: 2181 KVLASDTGKESGDEERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSG 2360
            K + S++  +  +++ + + E  +     G   +  +  ++ +KK+ +E G  +   +S 
Sbjct: 715  KTIKSESDLKPVNDDNKPMGEIKDISESLG--AENGEKFINQVKKEPKE-GTNDQKGVSN 771

Query: 2361 GNKIEAAALGERDGNVVRPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQV 2540
             N+ +           V    +E        +L T       ++  +   Q  KS    V
Sbjct: 772  ANRKQ-----------VEHFVSEDKETMGGPMLKTPPL----QEGDHLEDQSMKSQDRNV 816

Query: 2541 TPQ-GPAVGEYGRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQS-- 2711
            TPQ       +G+ Q +  +  S++VP  DQG+      P+G S  QQRP   S  Q+  
Sbjct: 817  TPQLSGGFPLHGQMQGENLLQPSHAVPIADQGKQEPSLTPHGHSTFQQRPVGSSSLQAPP 876

Query: 2712 ----------------------GAHPNESFEGVPRRQHYQNNSTHSQPMFSRPLKAEPIE 2825
                                  G  PN      P  +++Q                +P  
Sbjct: 877  PGPPRHTQLPGHPSAPLRPLGAGLTPNSGQPLNPPSEYFQQPLYQQSHGSDVSPTGDPGS 936

Query: 2826 GSL-------HGPDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQYGQQL---------LA 2957
             S+       +GP G   Q  +RP +       P+ +  F  G+    L         + 
Sbjct: 937  TSVFGRGPSHYGPQGPYTQGERRPSYGHESDMFPNHTANFMDGRRLDPLGHQPGMPPNMM 996

Query: 2958 NESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMGLE------P 3119
              +  PG D++SGL ++     P L  P     + E+E+  K FP+P  +G E      P
Sbjct: 997  RMNGAPGLDSSSGLELRDDRFRP-LPDPSRIVDRGEFEEDLKHFPRPPHLGTEKYGSHFP 1055

Query: 3120 SS--------NFGNGFSSRPGDYPPHEFNF-------GAPSRFLPPYHSGGAFHPNDVRE 3254
            SS        ++G     +P D  PH  N+        AP RF P  H G   HPND+ E
Sbjct: 1056 SSRALDRGPHSYGMDLPPKPLDNGPHGLNYDSVPPGGSAPPRFFPHRHDG-MMHPNDLGE 1114

Query: 3255 RPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVPSRGFGGISGA 3425
             P GF ++   R    R  PD  G  P +G   +D   PRSP R++      GFG   G 
Sbjct: 1115 IPAGFQDNVVGRQPDGRNHPDIFGPPPRYGRRHMDGMAPRSPSRDYP-----GFGAFHG- 1168

Query: 3426 PPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGNMGG--QDFIPNHLHR 3599
                   LD   GR++R   +    +   +       P H R     G  QD  P H  R
Sbjct: 1169 -------LDDIAGRESRRFGDSYHGNRFPV------LPSHLRRSEFEGHGQDVFPKHFRR 1215

Query: 3600 GELFGPKNGPSHLRVGDGFGTS-----LDPGSNY-PRIGEPGYRSSYSLHGFPSDGGFFA 3761
            GE   P   PSH+R   G+G S       PG+ + PR+GEPGYR+SYSL G P DGG + 
Sbjct: 1216 GEHLDPHELPSHIREPIGYGASRMGELTGPGNFFHPRLGEPGYRNSYSLKGIPGDGGNYT 1275

Query: 3762 GNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKR 3941
            G+  SFD  R+R  +SMGWCRIC+VDCETV+GLDLHSQT +HQ+ AMD+V++IK +NAK+
Sbjct: 1276 GDLESFDGSRRRKSSSMGWCRICKVDCETVEGLDLHSQTRDHQKMAMDVVLTIK-KNAKK 1334

Query: 3942 QKNS-KDHSSFEEGSRSRNAGNKGRG 4016
            QK +  DHSS ++ S+SRNA  +GRG
Sbjct: 1335 QKLAPSDHSSLDDTSKSRNASFEGRG 1360



 Score =  305 bits (782), Expect = 2e-81
 Identities = 148/214 (69%), Positives = 167/214 (78%), Gaps = 1/214 (0%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           +LVTE  SKPLIES+KALAE I K  VHCL+HRSGC+WQGPLS+CTSHCSGC+FGNSPVV
Sbjct: 61  FLVTEADSKPLIESNKALAETIGKITVHCLYHRSGCTWQGPLSECTSHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC G   Q+Q   E ++DAAT+ TTA A+ +Q  +Q  AQ S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPGV--QSQALTEATQDAATTGTTAAADQAQATTQ--AQTS 176

Query: 678 HAAVSQIATAPPS-VLNANPQVQANTSAAGTTAE 776
               S +    PS  +N   Q Q+   A   T E
Sbjct: 177 QTTTSSLPVPDPSQQVNPTTQPQSVVQATVPTTE 210


>KDO66718.1 hypothetical protein CISIN_1g000597mg [Citrus sinensis]
          Length = 1392

 Score =  317 bits (813), Expect = 3e-85
 Identities = 307/1017 (30%), Positives = 415/1017 (40%), Gaps = 139/1017 (13%)
 Frame = +3

Query: 1389 VPTYQSHPPVQPHQQLMQATP-QYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            V ++ S+   QPHQQ+  + P Q+PM++HP TG       QFPQQ+  M P Q++ +++N
Sbjct: 423  VTSHHSYSQPQPHQQIPLSGPLQHPMYVHPHTGAQSQMQNQFPQQTPSMRPAQSHATISN 482

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            Q     +P   Q+  +                                         +Q 
Sbjct: 483  QPLSTGLPPLGQVANIPPAQQLPVRPHAPQPGVPVSQHPVMQPVQQPMPYQY-----VQQ 537

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
              P  G   Q PF  QP   Q RPQ P Q LQ      + P  N    +GM  HQP+  G
Sbjct: 538  HLPFSGQHQQGPFV-QP---QLRPQRPPQSLQLHPPAYSQPLQNVAVINGMQSHQPRNLG 593

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSGLGEENAAG 2105
             +P+ PN    +Q + QS  +  V                      Q   S   +  A  
Sbjct: 594  -QPLTPNYGVHAQSYQQSATSLHVRPAQLGANQSSSNQSNLSWTSNQVQLSSEQQAGATS 652

Query: 2106 QEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERRI-----ISEGGNNGTQKG 2270
            +   S  N  A K       +S   K   +D     G E   +      SE         
Sbjct: 653  KPEMSEKNEVAVKIAHEREAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDE 712

Query: 2271 IMGKVDDAE--VDAMKKDMREDGNG----NLDPLSGGNKIEAA--ALGERDGNVVRPMQA 2426
            I  +V+D    VD   K+   D       N+ P++   K E      G++D   V   Q 
Sbjct: 713  IKTEVEDKTNVVDTSSKEFVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQE 772

Query: 2427 EYSSGKD---STLLPTEAYM-GRRKDDTNSRTQENKSSHEQVTPQGP-AVGEYGRFQDKG 2591
            E+S  K+     LL T     G +  + + + Q+ +   +    QGP AV   G+ Q  G
Sbjct: 773  EHSVSKEVQEEPLLKTSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGG 832

Query: 2592 FMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQS---GAHPN------------ 2726
            F+ S              PP  YG S  QQRPA PS FQ+   GA P             
Sbjct: 833  FVQS--------------PPSLYGSSTLQQRPAAPSIFQAPPPGAVPQTQAPTQFRPPMF 878

Query: 2727 -----------------------------ESFEG--VPRRQHYQNNSTHSQPMFSRPLKA 2813
                                          SFE   V  +  Y     H  P+   P ++
Sbjct: 879  KPEVPPGGIPVSGPAASFGRGPGHNGPHQHSFESPLVAPQGPYNLGHPHPSPVGGPPQRS 938

Query: 2814 EPIEG----------SLHGPDGV--------PMQHN----QRPHHFEGRYPDPHVSGAFD 2927
             P+ G            +GP G         PM+      QRP + +GR  D H  G+  
Sbjct: 939  VPLSGFDSHVGTMVGPAYGPGGPMDLKQPSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQ 998

Query: 2928 RGQYGQQ--LLANESRV---PGFDAASGLHVKSADG--NPFLSGPPGRSGQR-EYEDAPK 3083
            R   G      +N  R+   PG +          DG  NPF   P      R E+E+  K
Sbjct: 999  RSPLGPPSGTRSNMMRMNGGPGSELRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLK 1058

Query: 3084 QFPKPSVMGLEPSSNFGNGF-SSRPGDYPPHEFNFGAPSRFLPPYHSGGAFHPNDVRERP 3260
            QF +PS +  EP    G+ F  SRP D  PH +      R   P+  G ++ P  ++  P
Sbjct: 1059 QFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPR---PFERGLSYDPG-LKLDP 1114

Query: 3261 TG----------FNEDHRARGFSARTQPDFHGSGPGFGVDHR---PPRSPGREFHSVPSR 3401
             G          +++D   R  S+   PDF   G  +G  H     PRSP REF      
Sbjct: 1115 MGASAPSRFLPAYHDDAAGRSDSSHAHPDFPRPGRAYGRRHMGGLSPRSPFREFC----- 1169

Query: 3402 GFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPF--------PEHFRNG 3557
            GFGG+ G+      + +   GR+ R             DP+G  F        P H R G
Sbjct: 1170 GFGGLPGSLGGSRSVREDIGGREFRRFG----------DPIGNSFHDSRFPVLPSHLRRG 1219

Query: 3558 ---------NMGGQDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPG----------S 3680
                     ++ GQ+F+P+HL RGE  GP N    LR+G+  G    PG           
Sbjct: 1220 EFEGPGRTGDLIGQEFLPSHLRRGEPLGPHN----LRLGETVGLGGFPGPARMEELGGPG 1275

Query: 3681 NYP--RIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVD 3854
            N+P  R+GEPG+RSS+S  GFP+DGGF+ G+  S D  RKR P SMGWCRIC+VDCETVD
Sbjct: 1276 NFPPPRLGEPGFRSSFSRQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRICKVDCETVD 1335

Query: 3855 GLDLHSQTTEHQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            GLDLHSQT EHQ+ AMDMV+SIK QNAK+QK  S D  S ++ ++SRN    GRGKK
Sbjct: 1336 GLDLHSQTREHQKMAMDMVLSIK-QNAKKQKLTSGDRCSSDDANKSRNVNFDGRGKK 1391



 Score =  305 bits (781), Expect = 3e-81
 Identities = 152/218 (69%), Positives = 163/218 (74%), Gaps = 5/218 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTY+V TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYIVNTTQACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+KALAE I K  VHCLFHRSGC+WQGPLS+CTSHCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKALAETIGKITVHCLFHRSGCTWQGPLSECTSHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRC +QIVHRQVQEHAQNC G  PQA Q  E   DAA   T AT + SQ  +Q    AS
Sbjct: 121 CNRCAIQIVHRQVQEHAQNCPGVQPQASQ-PEGVHDAAAIGTAATGDQSQVATQAGLTAS 179

Query: 678 HAAVSQIATAPPSV-LNANP----QVQANTSAAGTTAE 776
                 IAT PP    N  P    Q  A   AA  TAE
Sbjct: 180 QVQTQTIATPPPGKDTNQQPSSMSQPLAVVQAAVPTAE 217


>XP_012471229.1 PREDICTED: bromodomain-containing protein 4-like [Gossypium
            raimondii] XP_012471230.1 PREDICTED:
            bromodomain-containing protein 4-like [Gossypium
            raimondii] KJB19951.1 hypothetical protein
            B456_003G125700 [Gossypium raimondii] KJB19952.1
            hypothetical protein B456_003G125700 [Gossypium
            raimondii] KJB19953.1 hypothetical protein
            B456_003G125700 [Gossypium raimondii] KJB19954.1
            hypothetical protein B456_003G125700 [Gossypium
            raimondii] KJB19955.1 hypothetical protein
            B456_003G125700 [Gossypium raimondii]
          Length = 1311

 Score =  312 bits (799), Expect = 9e-84
 Identities = 307/971 (31%), Positives = 406/971 (41%), Gaps = 93/971 (9%)
 Frame = +3

Query: 1389 VPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQAN 1550
            V  +QS+   QPHQQ+   TPQ PMHM P+ GG  P   PA     +PQQ  QM P Q++
Sbjct: 430  VTGHQSYSQSQPHQQMQLVTPQNPMHM-PAQGGLHPQQHPAEMQNSYPQQPPQMRPPQSH 488

Query: 1551 VSLTNQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1730
              + NQQQP L+P    M                                          
Sbjct: 489  SQIPNQQQPGLLPLPGPMLQ-QAHHHSLQHPLSVQTQSVMQPPTSLLSQQYMQQQQSLQP 547

Query: 1731 XXLQASGPVRGPMHQL-PFANQPMPVQS--RPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
               Q  G V+  MHQ  PF  Q   +QS  RP GP Q   Q       P  N   +H + 
Sbjct: 548  PSTQPMGLVQPQMHQQGPFVQQQQSLQSQIRPPGPPQSFLQPPHAYPQPQQNVAGSHAVQ 607

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSG 2081
            P+   T  GRP+ PN    SQP+PQS    G+  +P  +            L    +QSG
Sbjct: 608  PYPTPTLTGRPMTPNHGLQSQPYPQSAP--GMLVKPMQLGVNQPSSYQNNVLRTN-NQSG 664

Query: 2082 LGEE------------NAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGDE 2222
            L  +            + A Q+   +    A K+ +   V  S+G  V+ +++ K + D 
Sbjct: 665  LNSQPISEVPGDHGTLHVAEQKADLSSQGFAKKEDNELDVASSLGSDVVKTNSSKSNSDM 724

Query: 2223 ERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERDG 2402
            +       G+ G          D      ++  R D   N D  S  N ++  A+ ++  
Sbjct: 725  KSIDEKPAGDVGDNSSGF----DISTKLTQESRRTDLVLNRDTFSK-NMVKGEAIEDQKD 779

Query: 2403 --NVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKSSHEQVTPQ--GPA 2558
              NV R  + E +  KD  LL T    EA +G          Q  K   E++ PQ  G A
Sbjct: 780  VDNVER--KVEENKFKDGPLLKTPTLQEAKLGEE--------QNGKMQRERIQPQDQGTA 829

Query: 2559 VGEYGRFQDKGFMNSSNSVPHTDQGRHHLPPG---------PYGPSYHQQRPAMPSDFQS 2711
             G  G        N    +P + Q    + PG         PYG + +QQ+ A  +  Q+
Sbjct: 830  KGPTG--------NEFTGIPPSSQ----VQPGSFPQQPLQMPYGSNSNQQKSAASAMLQA 877

Query: 2712 ---GAHPNESFEGVPRRQHYQNNSTHSQPMFSRPLKAEPIEGSLHGPDGVPMQHNQRPHH 2882
               G  PN+     P +      +    P F R        G  +GP G    +NQ P  
Sbjct: 878  PPPGLPPNQVRPQGPGQTLVPPENF--APSFGR--------GPSYGPQG---PYNQGP-- 922

Query: 2883 FEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDA--ASGLHVKSADGNPFLSGPPGRSG 3056
                     VSGA  R   G+ LL      P  +A  + G      +G+     P     
Sbjct: 923  ---------VSGA-PRIPQGETLLHPPFGPPSLNAFDSHGAPSYGPEGHLVQQRPANMLN 972

Query: 3057 --QREYEDAPKQFPKPSVMGLEPSSNFGNGFSS-RPGDYPPHEF---------------- 3179
              Q ++++  KQF +PS +  EP   +G+ FSS R  D  PH F                
Sbjct: 973  FDQGQFDEDLKQFSRPSHLDTEPVPKYGSYFSSTRSIDRGPHGFAKDAGPWAHDKEPRGL 1032

Query: 3180 NF-----GAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG 3344
            NF       PSRFLPPYH      P+D  ERP G  ED   R       PDF G+  G+G
Sbjct: 1033 NFDPMIGSGPSRFLPPYH------PDDAGERPVGLPEDTLGR-------PDFLGTVTGYG 1079

Query: 3345 ---VDHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRA----------VH 3485
               +D    RSPGRE+  + S  FGG  G         D   GR+ R           +H
Sbjct: 1080 RHRMDGFISRSPGREYSGISSHRFGGYPG---------DEIDGRERRFNDRFSGFPGHIH 1130

Query: 3486 EGSRSSDISLDPVGKPFPEHFRNGNMGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFG 3659
             G   S   +        EHF      G D  P H  RGE FG  N P  LR+    GFG
Sbjct: 1131 RGGFESSDHM-------AEHF------GPDIRPPHFRRGEHFGRNNMPGQLRMEGPIGFG 1177

Query: 3660 T--------SLDPGSNY--PRIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTS 3809
                       D   N+  PR+GEPG+RSSYSL  FP DGG + G+ +SF+  RKR P S
Sbjct: 1178 DFSSHEQMGEFDGPGNFRQPRLGEPGFRSSYSLREFPIDGGIYTGDMDSFENLRKRKPVS 1237

Query: 3810 MGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQKNSKDHSSFEEGSRS 3989
            MGWCRIC+VDCETV+GLDLHSQT EHQ+ AMDMV  IKQ   K+++ S DHS   + ++S
Sbjct: 1238 MGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVAIIKQNAKKQKQTSSDHSLRNDSNKS 1297

Query: 3990 RNAGNKGRGKK 4022
            RNA  + R  K
Sbjct: 1298 RNAKFESRSNK 1308



 Score =  305 bits (781), Expect = 2e-81
 Identities = 151/215 (70%), Positives = 169/215 (78%), Gaps = 2/215 (0%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+KALA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKALADTIGKISVHCLYHRSGCTWQGPLSECTAHCSGCVFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQIVHRQVQEHAQNC    PQAQQ AE  ++ + S TTA A+ +Q  SQ  AQAS
Sbjct: 121 CNRCGVQIVHRQVQEHAQNCPRVQPQAQQ-AEGGQEISASGTTAAADQTQVASQ--AQAS 177

Query: 678 HAAVSQ--IATAPPSVLNANPQVQANTSAAGTTAE 776
            A  S   +    P   N NPQ QA +  A  ++E
Sbjct: 178 QATTSSTPVQGLNPQA-NPNPQSQAASQVAVVSSE 211


>XP_006488440.1 PREDICTED: AT-rich interactive domain-containing protein 1A-like
            [Citrus sinensis] XP_006488441.1 PREDICTED: AT-rich
            interactive domain-containing protein 1A-like [Citrus
            sinensis] XP_006488442.1 PREDICTED: AT-rich interactive
            domain-containing protein 1A-like [Citrus sinensis]
            XP_006488443.1 PREDICTED: AT-rich interactive
            domain-containing protein 1A-like [Citrus sinensis]
            XP_015388856.1 PREDICTED: AT-rich interactive
            domain-containing protein 1A-like [Citrus sinensis]
          Length = 1392

 Score =  311 bits (798), Expect = 2e-83
 Identities = 305/1017 (29%), Positives = 414/1017 (40%), Gaps = 139/1017 (13%)
 Frame = +3

Query: 1389 VPTYQSHPPVQPHQQLMQATP-QYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            V ++ S+   QPHQQ+  + P Q+PM++HP TG       QFPQQ+  M P Q++ +++N
Sbjct: 423  VTSHHSYSQPQPHQQIPLSGPLQHPMYVHPHTGAQSQMQNQFPQQTPSMRPAQSHATISN 482

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            Q     +P   Q+  +                                         +Q 
Sbjct: 483  QPLSTGLPPLGQVANIPPAQQLPVRPHAPQPGVPVSQHPVMQPVQQPMPYQY-----VQQ 537

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
              P  G   Q PF  QP   Q RPQ P Q LQ      + P  N    +GM  HQP+  G
Sbjct: 538  HLPFSGQHQQGPFV-QP---QLRPQRPPQSLQLHPPAYSQPLQNVAVINGMQSHQPRNLG 593

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSGLGEENAAG 2105
             +P+ PN    +Q + QS  +  V                      Q   S   +  A  
Sbjct: 594  -QPLTPNYGVHAQSYQQSATSLHVRPAQLGANQSSSNQSNLSWTSNQVQLSSEQQAGATS 652

Query: 2106 QEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERRI-----ISEGGNNGTQKG 2270
            +   S  N  A K       +S   K   +D     G E   +      SE         
Sbjct: 653  KPEMSEKNEVAVKIAHEREAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDE 712

Query: 2271 IMGKVDDAE--VDAMKKDMREDGNG----NLDPLSGGNKIEAA--ALGERDGNVVRPMQA 2426
            I  +V+D    VD   K+   D       N+ P++   K E      G++D   V   Q 
Sbjct: 713  IKTEVEDKTNVVDTSSKEFVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQE 772

Query: 2427 EYSSGKD---STLLPTEAYM-GRRKDDTNSRTQENKSSHEQVTPQGP-AVGEYGRFQDKG 2591
            E+S  K+     LL T     G +  + + + Q+ +   +    QGP AV   G+ Q  G
Sbjct: 773  EHSVSKEVQEEPLLKTSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGG 832

Query: 2592 FMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQS---GAHPNESFE-------- 2738
            F+ S+              P  YG S  QQRPA PS FQ+   GA P             
Sbjct: 833  FVQSA--------------PSLYGSSTLQQRPAAPSIFQAPPPGAVPQTQAPTQFRPPMF 878

Query: 2739 -------GVP----------------RRQH------------YQNNSTHSQPMFSRPLKA 2813
                   G+P                  QH            Y     H  P+   P ++
Sbjct: 879  KAEVPPGGIPVSGPAASFGRGPGHNGPHQHSFEPPLVAPQGPYNLGHPHPSPVGGPPQRS 938

Query: 2814 EPIEG----------SLHGPDGV--------PMQHN----QRPHHFEGRYPDPHVSGAFD 2927
             P+ G            +GP G         PM+      QRP + +GR  D H  G+  
Sbjct: 939  VPLSGFDSHVGTMVGPAYGPGGPMDLKQPSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQ 998

Query: 2928 RGQYGQQ--LLANESRV---PGFDAASGLHVKSADG--NPFLSGPPGRSGQR-EYEDAPK 3083
            R   G      +N  R+   PG +          DG  NPF   P      R E+E+  K
Sbjct: 999  RSPLGPPSGTRSNMMRMNGGPGSELRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLK 1058

Query: 3084 QFPKPSVMGLEPSSNFGNGF-SSRPGDYPPHEFNFGAPSRFLPPYHSGGAFHPNDVRERP 3260
            QF +PS +  EP    G+ F  SRP D  PH +      R   P+  G ++ P  ++  P
Sbjct: 1059 QFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPR---PFERGLSYDPG-LKLDP 1114

Query: 3261 TG----------FNEDHRARGFSARTQPDFHGSGPGFGVDHR---PPRSPGREFHSVPSR 3401
             G          +++D   R  S+   PDF   G  +G  H     PRS  REF      
Sbjct: 1115 MGASAPSRFLPAYHDDAAGRSDSSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFC----- 1169

Query: 3402 GFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPF--------PEHFRNG 3557
            GFGG+ G+      + +   GR+ R             DP+G  F        P H R G
Sbjct: 1170 GFGGLPGSLGGSRSVREDIGGREFRRFG----------DPIGNSFHDSRFPVLPSHLRRG 1219

Query: 3558 ---------NMGGQDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPG----------S 3680
                     ++ GQ+F+P+HL RGE  GP N    LR+G+  G    PG           
Sbjct: 1220 EFEGPGRTGDLIGQEFLPSHLRRGEPLGPHN----LRLGETVGLGGFPGPARMEELGGPG 1275

Query: 3681 NYP--RIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVD 3854
            N+P  R+GEPG+RSS+S  GFP+DGGF+ G+  S D  RKR P SMGWCRIC+VDCETVD
Sbjct: 1276 NFPPPRLGEPGFRSSFSRQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRICKVDCETVD 1335

Query: 3855 GLDLHSQTTEHQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            GLDLHSQT EHQ+ AMDMV+SIK QNAK+QK  S D  S ++ ++SRN    GRGKK
Sbjct: 1336 GLDLHSQTREHQKMAMDMVLSIK-QNAKKQKLTSGDRCSTDDANKSRNVNFDGRGKK 1391



 Score =  305 bits (781), Expect = 3e-81
 Identities = 152/218 (69%), Positives = 163/218 (74%), Gaps = 5/218 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTY+V TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYIVNTTQACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+KALAE I K  VHCLFHRSGC+WQGPLS+CTSHCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKALAETIGKITVHCLFHRSGCTWQGPLSECTSHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRC +QIVHRQVQEHAQNC G  PQA Q  E   DAA   T AT + SQ  +Q    AS
Sbjct: 121 CNRCAIQIVHRQVQEHAQNCPGVQPQASQ-PEGVHDAAAIGTAATGDQSQVATQAGLTAS 179

Query: 678 HAAVSQIATAPPSV-LNANP----QVQANTSAAGTTAE 776
                 IAT PP    N  P    Q  A   AA  TAE
Sbjct: 180 QVQTQTIATPPPGKDTNQQPSSMSQPLAVVQAAVPTAE 217


>XP_006424987.1 hypothetical protein CICLE_v10027683mg [Citrus clementina] ESR38227.1
            hypothetical protein CICLE_v10027683mg [Citrus
            clementina]
          Length = 1392

 Score =  311 bits (797), Expect = 3e-83
 Identities = 305/1017 (29%), Positives = 414/1017 (40%), Gaps = 139/1017 (13%)
 Frame = +3

Query: 1389 VPTYQSHPPVQPHQQLMQATP-QYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANVSLTN 1565
            V ++ S+   QPHQQ+  + P Q+PM++HP TG       QFPQQ+  M P Q++ +++N
Sbjct: 423  VTSHHSYSQPQPHQQIPLSGPLQHPMYVHPHTGAQSQMQNQFPQQTPSMRPAQSHATISN 482

Query: 1566 QQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQA 1745
            Q     +P   Q+  +                                         +Q 
Sbjct: 483  QPLSTGLPPLGQVANIPPAQQLPVRPHAPQPGVPVSQHPVMQPVQQPMPYQY-----VQQ 537

Query: 1746 SGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQPQTHG 1925
              P  G   Q PF  QP   Q RPQ P Q LQ      + P  N    +GM  HQP+  G
Sbjct: 538  HLPFSGQHQQGPFV-QP---QLRPQRPPQSLQLHPPAYSQPLQNVAVINGMQSHQPRNLG 593

Query: 1926 GRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXXXXLGQQFSQSGLGEENAAG 2105
             +P+ PN    +Q + QS  +  V                      Q   S   +  A  
Sbjct: 594  -QPLTPNYGVHAQSYQQSATSLHVRPAQLGANQSSSNQSNLFWTSNQVQLSSEQQAGATS 652

Query: 2106 QEVGSALNNSAGKDVSNAGVDSVGVKVLASDTGKESGDEERRI-----ISEGGNNGTQKG 2270
            +   S  N  A K       +S   K   +D     G E   +      SE         
Sbjct: 653  KPEMSEKNEVAVKIAHEREAESSSEKTAKTDNFDTPGPEAAAVGMKVPKSETDVKAAVDE 712

Query: 2271 IMGKVDDAE--VDAMKKDMREDGNG----NLDPLSGGNKIEAA--ALGERDGNVVRPMQA 2426
            I  +V+D    VD   K+   D       N+ P++   K E      G++D   V   Q 
Sbjct: 713  IKTEVEDKTNVVDTSSKEFVTDRESHIAENVQPINKMVKEEVIENVEGQKDSANVDIKQE 772

Query: 2427 EYSSGKD---STLLPTEAYM-GRRKDDTNSRTQENKSSHEQVTPQGP-AVGEYGRFQDKG 2591
            E+S  K+     LL T     G +  + + + Q+ +   +    QGP AV   G+ Q  G
Sbjct: 773  EHSVSKEVQEEPLLKTSTMQQGTQFGEQSEKVQKEQKVPQAQGAQGPGAVPPAGQAQAGG 832

Query: 2592 FMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQS---GAHPNESFE-------- 2738
            F+ S+              P  YG S  QQRPA PS FQ+   GA P             
Sbjct: 833  FVQSA--------------PSLYGSSTLQQRPAAPSIFQAPPPGAVPQTQAPTQFRPPMF 878

Query: 2739 -------GVP----------------RRQH------------YQNNSTHSQPMFSRPLKA 2813
                   G+P                  QH            Y     H  P+   P ++
Sbjct: 879  KAEVPPGGIPVSGPAASFGRGPGHNGPHQHSFEPPLVAPQGPYNLGHLHPSPVGGPPQRS 938

Query: 2814 EPIEG----------SLHGPDGV--------PMQHN----QRPHHFEGRYPDPHVSGAFD 2927
             P+ G            +GP G         PM+      QRP + +GR  D H  G+  
Sbjct: 939  VPLSGFDSHVGTMVGPAYGPGGPMDLKQPSNPMEAEMFTGQRPGYMDGRESDSHFPGSQQ 998

Query: 2928 RGQYGQQ--LLANESRV---PGFDAASGLHVKSADG--NPFLSGPPGRSGQR-EYEDAPK 3083
            R   G      +N  R+   PG +          DG  NPF   P      R E+E+  K
Sbjct: 999  RSPLGPPSGTRSNMMRMNGGPGSELRDERFKSFPDGRLNPFPVDPARSVIDRGEFEEDLK 1058

Query: 3084 QFPKPSVMGLEPSSNFGNGF-SSRPGDYPPHEFNFGAPSRFLPPYHSGGAFHPNDVRERP 3260
            QF +PS +  EP    G+ F  SRP D  PH +      R   P+  G ++ P  ++  P
Sbjct: 1059 QFSRPSHLDAEPVPKLGSHFLPSRPFDRGPHGYGMDMGPR---PFERGLSYDPG-LKLDP 1114

Query: 3261 TG----------FNEDHRARGFSARTQPDFHGSGPGFGVDHR---PPRSPGREFHSVPSR 3401
             G          +++D   R  S+   PDF   G  +G  H     PRS  REF      
Sbjct: 1115 MGASAPSRFLPAYHDDAAGRSDSSHAHPDFPRPGRAYGRRHMGGLSPRSSFREFC----- 1169

Query: 3402 GFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPF--------PEHFRNG 3557
            GFGG+ G+      + +   GR+ R             DP+G  F        P H R G
Sbjct: 1170 GFGGLPGSLGGSRSVREDIGGREFRRFG----------DPIGNSFHDSRFPVLPSHLRRG 1219

Query: 3558 ---------NMGGQDFIPNHLHRGELFGPKNGPSHLRVGDGFGTSLDPG----------S 3680
                     ++ GQ+F+P+HL RGE  GP N    LR+G+  G    PG           
Sbjct: 1220 EFEGPGRTGDLIGQEFLPSHLRRGEPLGPHN----LRLGETVGLGGFPGPARMEELGGPG 1275

Query: 3681 NYP--RIGEPGYRSSYSLHGFPSDGGFFAGNNNSFDRFRKRMPTSMGWCRICRVDCETVD 3854
            N+P  R+GEPG+RSS+S  GFP+DGGF+ G+  S D  RKR P SMGWCRIC+VDCETVD
Sbjct: 1276 NFPPPRLGEPGFRSSFSHQGFPNDGGFYTGDMESIDNSRKRKPPSMGWCRICKVDCETVD 1335

Query: 3855 GLDLHSQTTEHQQRAMDMVISIKQQNAKRQK-NSKDHSSFEEGSRSRNAGNKGRGKK 4022
            GLDLHSQT EHQ+ AMDMV+SIK QNAK+QK  S D  S ++ ++SRN    GRGKK
Sbjct: 1336 GLDLHSQTREHQKMAMDMVLSIK-QNAKKQKLTSGDRCSTDDANKSRNVNFDGRGKK 1391



 Score =  305 bits (781), Expect = 3e-81
 Identities = 152/218 (69%), Positives = 163/218 (74%), Gaps = 5/218 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTY+V TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYIVNTTQACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+KALAE I K  VHCLFHRSGC+WQGPLS+CTSHCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKALAETIGKITVHCLFHRSGCTWQGPLSECTSHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRC +QIVHRQVQEHAQNC G  PQA Q  E   DAA   T AT + SQ  +Q    AS
Sbjct: 121 CNRCAIQIVHRQVQEHAQNCPGVQPQASQ-PEGVHDAAAIGTAATGDQSQVATQAGLTAS 179

Query: 678 HAAVSQIATAPPSV-LNANP----QVQANTSAAGTTAE 776
                 IAT PP    N  P    Q  A   AA  TAE
Sbjct: 180 QVQTQTIATPPPGKDTNQQPSSMSQPLAVVQAAVPTAE 217


>XP_008220075.1 PREDICTED: uncharacterized protein LOC103320208 [Prunus mume]
          Length = 1353

 Score =  308 bits (789), Expect = 2e-82
 Identities = 151/217 (69%), Positives = 169/217 (77%), Gaps = 4/217 (1%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV +T+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSSTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  +KPLIES+K+LAE I K  VHCL+HRSGC+WQGPLS+CTSHCSGC+FGNSPVV
Sbjct: 61  YLVTEADAKPLIESNKSLAETIGKIAVHCLYHRSGCTWQGPLSECTSHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC G  PQAQQ AE + D + S T+ATA+ +Q  +Q     S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPGVQPQAQQ-AEGALDTSASGTSATADQTQAATQSGIATS 179

Query: 678 HAAVSQ----IATAPPSVLNANPQVQANTSAAGTTAE 776
            A VSQ     A  P     AN   QA   AA  +AE
Sbjct: 180 QAQVSQTTSVTAPGPDPNQKANSSSQAVVQAAVPSAE 216



 Score =  300 bits (768), Expect = 1e-79
 Identities = 303/978 (30%), Positives = 397/978 (40%), Gaps = 108/978 (11%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQ---PHQQLMQATPQYPMHMHPSTGGSFPPAAQFPQQSLQMPPQQANV 1553
            H+ P    H P+Q   P Q+ M        H    T       +QFPQQ   M P  ++ 
Sbjct: 447  HLYPQPHLHQPMQSGAPQQRTMHVQSHGMPHSQSQTPVQIQ--SQFPQQPPLMRPPPSHT 504

Query: 1554 SLTNQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1733
            ++ NQQQP L+P   Q++ +                                        
Sbjct: 505  TIPNQQQPALLPSPGQIQNMNPAQQQPVHSYGHPPGNTVHQRPHMQA------------- 551

Query: 1734 XLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMPPHQP 1913
             +Q   P +   HQ PF  Q  P Q RPQG + L  Q          N   + G+  H  
Sbjct: 552  -VQQPTPQQYFHHQ-PFVQQQPPTQLRPQGQSHLFPQHIHASTQSQQNVALSQGIQ-HTQ 608

Query: 1914 QTHGGRPVAPNQATASQPFPQSGGTFGVNSQP-RPIXXXXXXXXXXXXLGQQFSQSGLGE 2090
               GGRP+ P     SQ + Q+ G  GV  +P  P                   QSG   
Sbjct: 609  SNLGGRPMMPIHGVQSQTYAQTAG--GVYMRPMHPAANLPSTNQNSMVSTNNLVQSGANS 666

Query: 2091 -----ENAAGQEVGSALNNSAGKDVSNAGVDS-----VGVKVLASDTGKESGDEERRIIS 2240
                 E  A QE G +   +A K V + G  S       VK   S+T  +S D E +   
Sbjct: 667  GPTTSERQAEQESGLSAQQNAKKVVHDVGTASGVVADAEVKTAKSETDIKSIDNENK--- 723

Query: 2241 EGGNNGTQKGIMGKVDDAEVDAM-------KKDMREDG-NGNLDPLSGGNKIEAAALGER 2396
              G + T +G     +  ++ A+       K  ++E+G +G LD  S G   E  A G +
Sbjct: 724  PTGEDKTNQGDTSSKEIPDIHALENGESVSKSMLKEEGVDGTLDHSSNGKLGEVVAEGAK 783

Query: 2397 DGNVVRPMQAEYSSGKDSTLLPTEAYMGRRKDDTNSRTQENKSSHEQVTPQGPAVGEYGR 2576
            D +     Q E         +P+E    + +++     Q++ S + Q     P +G    
Sbjct: 784  DVSSSDVKQRELKE------IPSEE--AQLREEQGRMLQKDASGNPQ-----PFIGT--- 827

Query: 2577 FQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPAMPSDFQSGAHPNESFEGV---- 2744
              D+G+   S S P +DQG+H LP   +GP+   QRP  P   Q    P    +G     
Sbjct: 828  --DEGYQAVSTSAPISDQGKH-LPH--HGPTTIPQRPGAPLLLQVPPGPPHHTQGPGHHL 882

Query: 2745 --PRRQHYQNNSTHSQPMF-----SRPLKAEPIEGSLHGPDGVPMQHNQRPH--HFEGRY 2897
              P   H      HS   F     +    A     S HGP G     +  PH  + EG  
Sbjct: 883  RPPGPAHVPGQPFHSSEHFQPHGGNLGFGASSGRASQHGPQGSIELQSVTPHGPYNEGHL 942

Query: 2898 PDPHVSGAFDR---------------GQYGQQLLANESRVPGFDAASGLH---VKSADG- 3020
            P P  S AFD                G +   L  N +  P   +  G      K+  G 
Sbjct: 943  PFPPTS-AFDSQGGMMSRAAPIGQPSGIHPNMLRMNGTPGPDSSSTHGPRDERFKAFPGE 1001

Query: 3021 --NPFLSGPPGRSGQR-EYEDAPKQFPKPSVMGLEPSSNFGNGFSSRPGDYPPHEFNFG- 3188
              NPF   P      R E+ED  KQFP+PS +  EP + FGN +SSRP D  PH F +  
Sbjct: 1002 RLNPFPVDPTHHVIDRVEFEDDLKQFPRPSYLDSEPVAKFGN-YSSRPFDRAPHGFKYDS 1060

Query: 3189 ----------APSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPG 3338
                      APSRFL PY  GG+ H ND  +    F       G      PDF G    
Sbjct: 1061 GPHTDPAAGTAPSRFLSPYRLGGSVHGNDAGD----FGRMEPTHG-----HPDFVGRRL- 1110

Query: 3339 FGVDHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLD 3518
              VD   PRSP R++  +P  GF G             G    D R  H          D
Sbjct: 1111 --VDGLAPRSPVRDYTGLPPHGFRGF------------GPDDFDGREFHRFG-------D 1149

Query: 3519 PVGKPF--------PEHFRNGNMGG----------------QDFIPNHLHRGELFGPKN- 3623
            P+G  F        P HFR G   G                QD  P HL RG+  G    
Sbjct: 1150 PLGNQFHEGRFSSLPGHFRRGEFEGPGNLRMVDLRRNDFIGQDGHPGHLRRGDHLGHNLR 1209

Query: 3624 -----GPSHLRVGDGFGTSLDPGS---------NYPRIGEPGYRSSYSLHGFPSDGGFFA 3761
                 G  H R+GD  G    PG+         ++PR+GEPG+RSS+SL  FP+DG +  
Sbjct: 1210 EPLGFGSRHSRMGDMAG----PGNFESFRGNRPSHPRLGEPGFRSSFSLQRFPNDGTY-T 1264

Query: 3762 GNNNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKR 3941
            G+  SFD  RKR P SMGWCRIC+VDCETV+GLDLHSQT EHQ+ AMDMV SIK QNAK+
Sbjct: 1265 GDLESFDHSRKRKPASMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVRSIK-QNAKK 1323

Query: 3942 QK-NSKDHSSFEEGSRSR 3992
            QK  S D S  E+ ++S+
Sbjct: 1324 QKLTSSDQSLLEDANKSK 1341


>XP_016740198.1 PREDICTED: AT-rich interactive domain-containing protein 1A-like
           [Gossypium hirsutum]
          Length = 1153

 Score =  305 bits (780), Expect = 5e-82
 Identities = 151/215 (70%), Positives = 169/215 (78%), Gaps = 2/215 (0%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+KALA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKALADTIGKISVHCLYHRSGCTWQGPLSECTAHCSGCVFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCGVQIVHRQVQEHAQNC    PQAQQ AE  ++ + S TTA A+ +Q  SQ  AQAS
Sbjct: 121 CNRCGVQIVHRQVQEHAQNCPRVQPQAQQ-AEGGQEISASGTTAAADQTQVASQ--AQAS 177

Query: 678 HAAVSQ--IATAPPSVLNANPQVQANTSAAGTTAE 776
            A  S   +    P   N NPQ QA +  A  ++E
Sbjct: 178 QATTSSTPVQGFNPQA-NPNPQSQAASQVAVVSSE 211



 Score =  285 bits (730), Expect = 1e-75
 Identities = 299/985 (30%), Positives = 399/985 (40%), Gaps = 113/985 (11%)
 Frame = +3

Query: 1407 HPPVQPHQQLMQATPQYPMHMHPSTGGSFPPAAQ-----------------------FPQ 1517
            +P  QP  QL    PQ     H       P AAQ                       +PQ
Sbjct: 263  YPQTQPQPQLQ---PQLQAQTHSHLSVQVPVAAQPQNQAQANQQQQTHHTIAEMQNSYPQ 319

Query: 1518 QSLQMPPQQANVSLTNQQQPNLIPGQSQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1697
            Q  QM P Q++  + NQQQP L+P    M                               
Sbjct: 320  QPPQMRPPQSHAQIPNQQQPGLLPLPGPMLQ-QAHHHSLQHPLSVQTQSVMQPPTSLMSQ 378

Query: 1698 XXXXXXXXXXXXXLQASGPVRGPMHQL-PFANQPMPVQS--RPQGPNQLLQQSGALPAPP 1868
                          Q  G V+  MHQ  PF  Q   +QS  RP GP Q   Q       P
Sbjct: 379  QYMQQQQSLQPPSTQPMGLVQPQMHQQGPFVQQQQSLQSQIRPPGPPQSFLQPPHAYPQP 438

Query: 1869 PHNSVQAHGMPPHQPQTHGGRPVAPNQATASQPFPQSGGTFGVNSQPRPIXXXXXXXXXX 2048
              N   +H + P+   T  GRP+ PN    SQP+PQS    G+  +P  +          
Sbjct: 439  QQNVAGSHAVQPYPTPTLTGRPMTPNHGLQSQPYPQSAP--GMLVKPMQLGVNQPSSYQN 496

Query: 2049 XXLGQQFSQSGLGEE------------NAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVL 2189
              L    +QSGL  +            + A Q+   +    A K+ +   V  S+G  V+
Sbjct: 497  NVLRTN-NQSGLNSQPISEVPGDHGTLHVAEQKADLSSQGFAKKEDNEFDVASSLGSDVV 555

Query: 2190 ASDTGKESGDE---ERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSG 2360
             +++ K + D    + +   + G+N +   I  KV        ++  R D   N D  S 
Sbjct: 556  KTNSSKSNSDMKSIDEKPAGDVGDNSSGFDISTKVT-------QESRRTDLVLNRDTFSK 608

Query: 2361 GNKIEAAALGERDG--NVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENK 2522
             N ++  A+ ++    NV R  +AE +  KD  LL T    EA +G          Q  K
Sbjct: 609  -NMVKGEAIEDQKDVDNVER--KAEENKIKDGPLLKTPTLQEAKLGEE--------QNGK 657

Query: 2523 SSHEQVTPQ--GPAVGEYGRFQDKGFMNSSNSVPHTDQGRHHLPPG---------PYGPS 2669
               E++ PQ  G A G  G        N    +P + Q    + PG         PYG +
Sbjct: 658  MQRERIQPQDQGTAKGPTG--------NEFTGIPPSSQ----VQPGSFPQQPLQIPYGSN 705

Query: 2670 YHQQRPAMPSDFQS---GAHPNESFEGVPRRQHYQNNSTHSQPMFSRPLKAEPIEGSLHG 2840
             +QQ+ A  +  Q+   G  PN+     P +      +    P F R        G  +G
Sbjct: 706  SNQQKSAASAMLQAPPPGLPPNQVRPQGPGQTLVPPENF--APSFGR--------GPSYG 755

Query: 2841 PDGVPMQHNQRPHHFEGRYPDPHVSGAFDRGQYGQQLLANESRVPGFDA--ASGLHVKSA 3014
            P G    +NQ P           VSGA  R   G+ LL      P  +A  + G      
Sbjct: 756  PQG---PYNQGP-----------VSGA-PRIPQGETLLHPPFGPPSLNAFDSHGAPSYGP 800

Query: 3015 DGNPFLSGPPGRSG--QREYEDAPKQFPKPSVMGLEPSSNFGNGFSS-RPGDYPPHEF-- 3179
            +G+     P       Q ++++  KQF +PS++  EP   +G+ FSS R  D  PH F  
Sbjct: 801  EGHLVQQRPANMLNFDQGQFDEDLKQFSRPSLLDTEPVPKYGSYFSSTRSIDRGPHGFAK 860

Query: 3180 --------------NF-----GAPSRFLPPYHSGGAFHPNDVRERPTGFNEDHRARGFSA 3302
                          NF        SRFLPPYH      P+D  ERP G  ED   R    
Sbjct: 861  DAGPWAHDKEPRGLNFDPMIGSGSSRFLPPYH------PDDAGERPVGLPEDTLGR---- 910

Query: 3303 RTQPDFHGSGPGFG---VDHRPPRSPGREFHSVPSRGFGGISGAPPSQSGLLDGAHGRDT 3473
               PDF G+  G+G   +D    RSPGRE+  + S  FGG  G         D   GR+ 
Sbjct: 911  ---PDFLGTVTGYGRHRMDGFISRSPGREYSGISSHRFGGYPG---------DEIDGRER 958

Query: 3474 RA----------VHEGSRSSDISLDPVGKPFPEHFRNGNMGGQDFIPNHLHRGELFGPKN 3623
            R           +H G   S   +        EHF      G D  P H  RGE FG  N
Sbjct: 959  RFNDRFSGFPGHIHRGGFESSDHM-------AEHF------GPDIRPPHFRRGEHFGRNN 1005

Query: 3624 GPSHLRVGD--GFGT--------SLDPGSNY--PRIGEPGYRSSYSLHGFPSDGGFFAGN 3767
             P  LR+    GFG           D   N+  PR+GEPG+RSSYSL  FP DGG + G+
Sbjct: 1006 MPGQLRMEGPIGFGDFSSHEQMGEFDGPGNFRQPRLGEPGFRSSYSLREFPIDGGIYTGD 1065

Query: 3768 NNSFDRFRKRMPTSMGWCRICRVDCETVDGLDLHSQTTEHQQRAMDMVISIKQQNAKRQK 3947
             +SF+  RKR P SMGWCRIC+VDCETV+GLDLHSQT EHQ+ AMDMV  IKQ   K+++
Sbjct: 1066 MDSFENLRKRKPVSMGWCRICKVDCETVEGLDLHSQTREHQKMAMDMVAIIKQNAKKQKQ 1125

Query: 3948 NSKDHSSFEEGSRSRNAGNKGRGKK 4022
             S DHS   + ++SRNA  +    K
Sbjct: 1126 TSSDHSLRNDSNKSRNAKFESHSNK 1150


>EOY33850.1 Uncharacterized protein TCM_041704 isoform 1 [Theobroma cacao]
          Length = 1326

 Score =  306 bits (784), Expect = 8e-82
 Identities = 149/219 (68%), Positives = 169/219 (77%), Gaps = 6/219 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+K LA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKMLADTIGKITVHCLYHRSGCTWQGPLSECTAHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC    PQAQQ A+  +D A + TTA A+ +Q  SQ     S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPSVQPQAQQ-AKGGQDTAATGTTA-ADQAQIASQTGTATS 178

Query: 678 HAAVSQIATA------PPSVLNANPQVQANTSAAGTTAE 776
            A  SQ  T+      P    N NP+ QA + AA  T+E
Sbjct: 179 QAQASQTTTSGTPGQEPNQQANPNPRSQAVSQAAAMTSE 217



 Score =  261 bits (666), Expect = 4e-67
 Identities = 266/920 (28%), Positives = 370/920 (40%), Gaps = 126/920 (13%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 450  HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 508

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 509  PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 568

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 569  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 625

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 626  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 685

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 686  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 745

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 746  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 800

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD   L T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 801  DVHNGEHKVEEIKIKDGPSLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 857

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 858  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 917

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 918  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 977

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 978  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRQLDPRASGLDST 1037

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S         N F      R  + ++E+  K FP+PS + 
Sbjct: 1038 STFS---LRGERLKPVQDECS---------NQFPLDRGHRGDRGQFEEDLKHFPRPSHLD 1085

Query: 3111 LEPSSNFGNGF-SSRPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+   SSRP D  PH F                         PSRFLPPY   
Sbjct: 1086 NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPY--- 1142

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
               HP+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 1143 ---HPDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 1192

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 1193 PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 1241

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 1242 MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 1301

Query: 3705 GYRSSYSLHGFPSDGGFFAG 3764
            G+RSS+SL  FP+DGG + G
Sbjct: 1302 GFRSSFSLQEFPNDGGIYTG 1321


>EOY33855.1 Uncharacterized protein TCM_041704 isoform 6 [Theobroma cacao]
          Length = 1345

 Score =  306 bits (784), Expect = 9e-82
 Identities = 149/219 (68%), Positives = 169/219 (77%), Gaps = 6/219 (2%)
 Frame = +3

Query: 138 MGFDNECIVNIHSLAGEYFCPVCRTLVYPNEALQSQCTHLYCKPCLTYVVGTTKACPYDG 317
           MGFDNECI+NI SLAGEYFCPVCR LVYPNEALQSQCTHLYCKPCLTYVV TT+ACPYDG
Sbjct: 1   MGFDNECILNIQSLAGEYFCPVCRLLVYPNEALQSQCTHLYCKPCLTYVVSTTRACPYDG 60

Query: 318 YLVTEEHSKPLIESDKALAEKIDKTLVHCLFHRSGCSWQGPLSQCTSHCSGCSFGNSPVV 497
           YLVTE  SKPL+ES+K LA+ I K  VHCL+HRSGC+WQGPLS+CT+HCSGC+FGNSPVV
Sbjct: 61  YLVTEADSKPLVESNKMLADTIGKITVHCLYHRSGCTWQGPLSECTAHCSGCAFGNSPVV 120

Query: 498 CNRCGVQIVHRQVQEHAQNCAGANPQAQQTAETSKDAATSVTTATANLSQTNSQLVAQAS 677
           CNRCG+QIVHRQVQEHAQNC    PQAQQ A+  +D A + TTA A+ +Q  SQ     S
Sbjct: 121 CNRCGIQIVHRQVQEHAQNCPSVQPQAQQ-AKGGQDTAATGTTA-ADQAQIASQTGTATS 178

Query: 678 HAAVSQIATA------PPSVLNANPQVQANTSAAGTTAE 776
            A  SQ  T+      P    N NP+ QA + AA  T+E
Sbjct: 179 QAQASQTTTSGTPGQEPNQQANPNPRSQAVSQAAAMTSE 217



 Score =  258 bits (660), Expect = 2e-66
 Identities = 265/918 (28%), Positives = 369/918 (40%), Gaps = 126/918 (13%)
 Frame = +3

Query: 1383 HVVPTYQSHPPVQPHQQLMQATPQYPMHMHPSTGGSFP---PAAQ---FPQQSLQMPPQQ 1544
            H V  +QS+P  QPHQQ+   TPQ+PMH+H + GG  P   PA     +PQQ  QM P Q
Sbjct: 450  HAVTGHQSYPLSQPHQQMQLVTPQHPMHVH-AQGGLHPQQHPAQMQNSYPQQPPQMRPPQ 508

Query: 1545 ANVSLTNQQQPNLIPGQ-SQMRGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1721
             +V+++NQQQP L+P   S ++ V                                    
Sbjct: 509  PHVAISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYVQQQPLST 568

Query: 1722 XXXXXLQASGPVRGPMHQLPFANQPMPVQSRPQGPNQLLQQSGALPAPPPHNSVQAHGMP 1901
                 +Q     +GP  Q   + Q    QSRP GP     Q     A P  N   +H + 
Sbjct: 569  QPVGLVQPQMLQQGPFVQQQSSFQS---QSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVH 625

Query: 1902 PHQPQTHGGRPVAPNQATASQPFPQSGGTFGVN------SQPRPIXXXXXXXXXXXXLGQ 2063
             H      GRP+ PN    SQP+P S     V       +QP               +  
Sbjct: 626  FHPSHNLVGRPMTPNHGVQSQPYPHSAAGTPVKPVHLGANQPSSYQNNVFRTNNQSGVTS 685

Query: 2064 QFSQSGLGE----ENAAGQEVGSALNNSAGKDVSNAGV-DSVGVKVLASDTGKESGD--- 2219
            Q      G+    +N A QE  S+   +A K+ +   +  S+G  V   +T K   D   
Sbjct: 686  QPMSEVPGDHGTDKNVAEQEADSSSPGTARKEANELDMASSLGADVAEKNTAKLEADLKS 745

Query: 2220 EERRIISEGGNNGTQKGIMGKVDDAEVDAMKKDMREDGNGNLDPLSGGNKIEAAALGERD 2399
             + ++  + G++     I  K    E    ++ +  D   + DP+S  N +   A+ ++ 
Sbjct: 746  VDEKLTGDVGDDSNGVDISTK----ETPESRRTVGTDLEQHRDPVSK-NMVTCEAIEDQK 800

Query: 2400 GNVVRPMQAEYSSGKDSTLLPT----EAYMGRRKDDTNSRTQENKS-SHEQVTPQGPAVG 2564
                   + E    KD   L T    EA +G   ++ N + Q++K   H+Q TP+GPA  
Sbjct: 801  DVHNGEHKVEEIKIKDGPSLKTPPLQEAKLG---EEQNGKMQKDKILPHDQGTPKGPAGN 857

Query: 2565 EY------GRFQDKGFMNSSNSVPHTDQGRHHLPPGPYGPSYHQQRPA-----------M 2693
             +       + Q  G++  S+SVP+ DQGRH     PYG + +QQRPA           +
Sbjct: 858  GFRGIPPSSQVQPGGYLPPSHSVPNVDQGRHQPLQMPYGSNNNQQRPAVSAILQAPPPGL 917

Query: 2694 PSDFQS-GAHPNE-------------------SFEGVPRRQHYQNNSTHSQPMFS---RP 2804
            PS  Q+ G  PN+                   SF   P     Q       P  S   R 
Sbjct: 918  PSHAQTPGLPPNQFRPQGPGQALVPPENLPPGSFGRDPSNYGPQGPYNQGPPSLSGAPRI 977

Query: 2805 LKAEPIEG----------------SLHGPDGVPMQH--NQRPHHFEGRYPDPHVSGAFDR 2930
             + EP+ G                 L+GP+   +QH  N   +H + R  DP  SG    
Sbjct: 978  SQGEPLVGLSYGTPPLTAFDSHGAPLYGPESHSVQHSANMVDYHADNRQLDPRASGLDST 1037

Query: 2931 GQYGQQLLANESRVPGFDAASGLHVKSADGNPFLSGPPGRSGQREYEDAPKQFPKPSVMG 3110
              +    L  E   P  D  S         N F      R  + ++E+  K FP+PS + 
Sbjct: 1038 STFS---LRGERLKPVQDECS---------NQFPLDRGHRGDRGQFEEDLKHFPRPSHLD 1085

Query: 3111 LEPSSNFGNGF-SSRPGDYPPHEFNF---------------------GAPSRFLPPYHSG 3224
             EP   FG+   SSRP D  PH F                         PSRFLPPY   
Sbjct: 1086 NEPVPKFGSYISSSRPLDRGPHGFGMDMGPRAQEKEPHGFSFDPMIGSGPSRFLPPY--- 1142

Query: 3225 GAFHPNDVRERPTGFNEDHRARGFSARTQPDFHGSGPGFG---VDHRPPRSPGREFHSVP 3395
               HP+D  ERP G  +D   R       PDF G+ P +G   +D    RSPGRE+  + 
Sbjct: 1143 ---HPDDTGERPVGLPKDTLGR-------PDFLGTVPSYGRHRMDGFVSRSPGREYPGIS 1192

Query: 3396 SRGFGG-----ISGAPPSQSGLLDGAHGRDTRAVHEGSRSSDISLDPVGKPFPEHFRNGN 3560
              GFGG     I G     S    G  G     +H G   S   ++       EH R+ +
Sbjct: 1193 PHGFGGHPGDEIDGRERRFSDRFPGLPGH----LHRGGFESSDRME-------EHLRSRD 1241

Query: 3561 MGGQDFIPNHLHRGELFGPKNGPSHLRVGD--GFGTSLD---------PGS-NYPRIGEP 3704
            M  QD  P +  RGE  G  N P HLR+G+  GFG             PG+  +PR+GEP
Sbjct: 1242 MINQDNRPAYFRRGEHVGHHNMPGHLRLGEPIGFGDFSSHERIGEFGGPGNFRHPRLGEP 1301

Query: 3705 GYRSSYSLHGFPSDGGFF 3758
            G+RSS+SL  FP+DGG +
Sbjct: 1302 GFRSSFSLQEFPNDGGIY 1319

