BLASTX nr result

ID: Angelica27_contig00018988 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00018988
         (2290 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017247188.1 PREDICTED: cleavage and polyadenylation specifici...  1293   0.0  
XP_017235464.1 PREDICTED: cleavage and polyadenylation specifici...  1291   0.0  
XP_019081674.1 PREDICTED: cleavage and polyadenylation specifici...  1082   0.0  
CBI24510.3 unnamed protein product, partial [Vitis vinifera]         1082   0.0  
XP_002268371.1 PREDICTED: cleavage and polyadenylation specifici...  1082   0.0  
XP_017972870.1 PREDICTED: cleavage and polyadenylation specifici...  1066   0.0  
XP_017972865.1 PREDICTED: cleavage and polyadenylation specifici...  1066   0.0  
XP_017972864.1 PREDICTED: cleavage and polyadenylation specifici...  1066   0.0  
EOY22975.1 Cleavage and polyadenylation specificity factor 160 i...  1063   0.0  
EOY22974.1 Cleavage and polyadenylation specificity factor 160 i...  1063   0.0  
XP_006490256.1 PREDICTED: cleavage and polyadenylation specifici...  1059   0.0  
XP_006421760.1 hypothetical protein CICLE_v10004147mg [Citrus cl...  1058   0.0  
KDO65373.1 hypothetical protein CISIN_1g0005452mg, partial [Citr...  1055   0.0  
XP_006490255.1 PREDICTED: cleavage and polyadenylation specifici...  1055   0.0  
XP_006421759.1 hypothetical protein CICLE_v10004147mg [Citrus cl...  1053   0.0  
XP_007220310.1 hypothetical protein PRUPE_ppa000211mg [Prunus pe...  1052   0.0  
KDO65374.1 hypothetical protein CISIN_1g0005452mg, partial [Citr...  1050   0.0  
XP_018805301.1 PREDICTED: cleavage and polyadenylation specifici...  1049   0.0  
XP_018805300.1 PREDICTED: cleavage and polyadenylation specifici...  1049   0.0  
XP_018805299.1 PREDICTED: cleavage and polyadenylation specifici...  1049   0.0  

>XP_017247188.1 PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Daucus carota subsp. sativus] KZM96626.1
            hypothetical protein DCAR_016012 [Daucus carota subsp.
            sativus]
          Length = 1446

 Score = 1293 bits (3347), Expect = 0.0
 Identities = 640/668 (95%), Positives = 654/668 (97%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETENGRKENN 180
            ESGIL+VFDVPNF CVFSVDNFESGKAYLGDTFVQES NDS+ +LRKNSEETENGRKENN
Sbjct: 779  ESGILQVFDVPNFCCVFSVDNFESGKAYLGDTFVQESANDSQNHLRKNSEETENGRKENN 838

Query: 181  QRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVNL 360
            QR+KVVELAMHRWSGQHSRPFLFGIL DGTVLCYQAYLYEGSESSVK+E++VP HDSVNL
Sbjct: 839  QRIKVVELAMHRWSGQHSRPFLFGILTDGTVLCYQAYLYEGSESSVKIEEIVPVHDSVNL 898

Query: 361  NNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMIF 540
            NN SSSRLKNLRFARVPLDTYIKEEI PETP PRITTFKNVGGFPGLF+AGSRP+WFMIF
Sbjct: 899  NNASSSRLKNLRFARVPLDTYIKEEILPETPSPRITTFKNVGGFPGLFIAGSRPIWFMIF 958

Query: 541  RERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 720
            RERLRIHPQLCDGPIAAFT+LHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 
Sbjct: 959  RERLRIHPQLCDGPIAAFTILHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIS 1018

Query: 721  LKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDGTYTVEE 900
            LKGTPHQVTY AEKNLYPLIVSVPV+KPLNQVLSSLVDQEAGHQIEHDNFSSDGTY VEE
Sbjct: 1019 LKGTPHQVTYSAEKNLYPLIVSVPVVKPLNQVLSSLVDQEAGHQIEHDNFSSDGTYAVEE 1078

Query: 901  FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYVQGED 1080
            FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENE LLA+GTAYVQGED
Sbjct: 1079 FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENETLLAIGTAYVQGED 1138

Query: 1081 VAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG 1260
            VAGRGRVLLFSVER AESSQT+ISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG
Sbjct: 1139 VAGRGRVLLFSVERIAESSQTTISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG 1198

Query: 1261 SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF 1440
            SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF
Sbjct: 1199 SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF 1258

Query: 1441 ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1620
            ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM
Sbjct: 1259 ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1318

Query: 1621 LPTPDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG 1800
            LPTPDRTNAA +PDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG
Sbjct: 1319 LPTPDRTNAAAVPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG 1378

Query: 1801 LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVSNLND 1980
            LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVSNLND
Sbjct: 1379 LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVSNLND 1438

Query: 1981 LALGTSFL 2004
            LALGTSFL
Sbjct: 1439 LALGTSFL 1446


>XP_017235464.1 PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Daucus carota subsp. sativus] KZN04680.1
            hypothetical protein DCAR_005517 [Daucus carota subsp.
            sativus]
          Length = 1446

 Score = 1291 bits (3340), Expect = 0.0
 Identities = 637/667 (95%), Positives = 653/667 (97%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETENGRKENN 180
            ESGIL+VFDVPNF CVFSVDNFESGKAYLGDTFVQES NDS+ +LRKNSEETENGRKENN
Sbjct: 780  ESGILQVFDVPNFCCVFSVDNFESGKAYLGDTFVQESANDSQNHLRKNSEETENGRKENN 839

Query: 181  QRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVNL 360
            QR+KVVELAMHRWSGQHSRPFLFGIL DGTVLCYQAYLYEGSESSVK+E++VP HDSVNL
Sbjct: 840  QRIKVVELAMHRWSGQHSRPFLFGILTDGTVLCYQAYLYEGSESSVKIEEIVPVHDSVNL 899

Query: 361  NNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMIF 540
            NN SSSRLKNLRFARVPLDTYIKEEI PETP PRITTFKNVGGFPGLF+AGSRP+WFMIF
Sbjct: 900  NNASSSRLKNLRFARVPLDTYIKEEILPETPSPRITTFKNVGGFPGLFIAGSRPIWFMIF 959

Query: 541  RERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 720
            RERLRIHPQLCDGPIAAFT+LHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP
Sbjct: 960  RERLRIHPQLCDGPIAAFTILHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 1019

Query: 721  LKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDGTYTVEE 900
            LKGTPHQVTYFAEKNLYPLIVSVPV+KPLNQVLSSLVDQEAGHQIEHDNFSSDGTY VEE
Sbjct: 1020 LKGTPHQVTYFAEKNLYPLIVSVPVVKPLNQVLSSLVDQEAGHQIEHDNFSSDGTYAVEE 1079

Query: 901  FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYVQGED 1080
            FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENE LLA+GTAYVQGED
Sbjct: 1080 FEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENETLLAIGTAYVQGED 1139

Query: 1081 VAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG 1260
            VAGRGRVLLFSVER AESSQT+ISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG
Sbjct: 1140 VAGRGRVLLFSVERIAESSQTTISEVYSKELKGAISAVASLQGHLLIASGPKVILHKWTG 1199

Query: 1261 SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF 1440
            SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF
Sbjct: 1200 SDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSLDCF 1259

Query: 1441 ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1620
            ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM
Sbjct: 1260 ATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQM 1319

Query: 1621 LPTPDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG 1800
            LPTPDRTNAA +PDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG
Sbjct: 1320 LPTPDRTNAAAVPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVPHVAG 1379

Query: 1801 LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVSNLND 1980
            LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQ  +NLND
Sbjct: 1380 LNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQSGTNLND 1439

Query: 1981 LALGTSF 2001
            LALGT+F
Sbjct: 1440 LALGTNF 1446


>XP_019081674.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Vitis vinifera] XP_019081675.1 PREDICTED:
            cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Vitis vinifera]
          Length = 1449

 Score = 1082 bits (2798), Expect = 0.0
 Identities = 532/673 (79%), Positives = 594/673 (88%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFSVD F SG A+L DT + E   D++K + KNSEE  + GRKEN
Sbjct: 777  ESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKEN 836

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               +KVVELAM RWSGQHSRPFLFGIL DGT+LCY AYLYEG ES+ K E+ V   +S++
Sbjct: 837  AHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLS 896

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF RVPLDTY +EE    T  PR+T FKN+GG  GLF++GSRP+WFM+
Sbjct: 897  ISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMV 956

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRER+R+HPQLCDG I AFTVLHN+ CNHG+IYVT QG LKICQLP++  YDNYWPVQKI
Sbjct: 957  FRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKI 1016

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PLKGTPHQVTYFAEKNLYPLIVSVPV+KPLN VLSSLVDQEAGHQ+E+DN SSD    +Y
Sbjct: 1017 PLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSY 1076

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            +V+EFEVR+LEPEKSG PWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 1077 SVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV 1136

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFSV +N ++SQ  +SE+YSKELKGAISAVASLQGHLLIASGPK+ILH
Sbjct: 1137 QGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILH 1196

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L GVAF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQL LLAKDFGS
Sbjct: 1197 KWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGS 1256

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1257 LDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1316

Query: 1609 RLQMLP-TPDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQMLP + DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1317 RLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1376

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQF S GKAHRPGPD+IVDCELLC +EML  E+Q EIA QIGTTR QI+
Sbjct: 1377 PHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQIL 1436

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL+LGTSFL
Sbjct: 1437 SNLNDLSLGTSFL 1449


>CBI24510.3 unnamed protein product, partial [Vitis vinifera]
          Length = 1448

 Score = 1082 bits (2798), Expect = 0.0
 Identities = 532/673 (79%), Positives = 594/673 (88%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFSVD F SG A+L DT + E   D++K + KNSEE  + GRKEN
Sbjct: 776  ESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKEN 835

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               +KVVELAM RWSGQHSRPFLFGIL DGT+LCY AYLYEG ES+ K E+ V   +S++
Sbjct: 836  AHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLS 895

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF RVPLDTY +EE    T  PR+T FKN+GG  GLF++GSRP+WFM+
Sbjct: 896  ISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMV 955

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRER+R+HPQLCDG I AFTVLHN+ CNHG+IYVT QG LKICQLP++  YDNYWPVQKI
Sbjct: 956  FRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKI 1015

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PLKGTPHQVTYFAEKNLYPLIVSVPV+KPLN VLSSLVDQEAGHQ+E+DN SSD    +Y
Sbjct: 1016 PLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSY 1075

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            +V+EFEVR+LEPEKSG PWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 1076 SVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV 1135

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFSV +N ++SQ  +SE+YSKELKGAISAVASLQGHLLIASGPK+ILH
Sbjct: 1136 QGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILH 1195

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L GVAF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQL LLAKDFGS
Sbjct: 1196 KWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGS 1255

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1256 LDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1315

Query: 1609 RLQMLP-TPDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQMLP + DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1316 RLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1375

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQF S GKAHRPGPD+IVDCELLC +EML  E+Q EIA QIGTTR QI+
Sbjct: 1376 PHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQIL 1435

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL+LGTSFL
Sbjct: 1436 SNLNDLSLGTSFL 1448


>XP_002268371.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Vitis vinifera]
          Length = 1442

 Score = 1082 bits (2798), Expect = 0.0
 Identities = 532/673 (79%), Positives = 594/673 (88%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFSVD F SG A+L DT + E   D++K + KNSEE  + GRKEN
Sbjct: 770  ESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKEN 829

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               +KVVELAM RWSGQHSRPFLFGIL DGT+LCY AYLYEG ES+ K E+ V   +S++
Sbjct: 830  AHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLS 889

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF RVPLDTY +EE    T  PR+T FKN+GG  GLF++GSRP+WFM+
Sbjct: 890  ISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMV 949

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRER+R+HPQLCDG I AFTVLHN+ CNHG+IYVT QG LKICQLP++  YDNYWPVQKI
Sbjct: 950  FRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKI 1009

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PLKGTPHQVTYFAEKNLYPLIVSVPV+KPLN VLSSLVDQEAGHQ+E+DN SSD    +Y
Sbjct: 1010 PLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSY 1069

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            +V+EFEVR+LEPEKSG PWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 1070 SVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV 1129

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFSV +N ++SQ  +SE+YSKELKGAISAVASLQGHLLIASGPK+ILH
Sbjct: 1130 QGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILH 1189

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L GVAF+DAPPLYVVSLNIVKNFILLGDIH+SIYFLSWKEQGAQL LLAKDFGS
Sbjct: 1190 KWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGS 1249

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1250 LDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1309

Query: 1609 RLQMLP-TPDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQMLP + DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1310 RLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1369

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQF S GKAHRPGPD+IVDCELLC +EML  E+Q EIA QIGTTR QI+
Sbjct: 1370 PHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQIL 1429

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL+LGTSFL
Sbjct: 1430 SNLNDLSLGTSFL 1442


>XP_017972870.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X6 [Theobroma cacao]
          Length = 1198

 Score = 1066 bits (2757), Expect = 0.0
 Identities = 526/673 (78%), Positives = 589/673 (87%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFS++NF SG+  L D +  ES  DSEK + K+SEE T  GRKEN
Sbjct: 526  ESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKEN 585

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q +KVVELAM RWS  HSRPFLFGIL DGT+LCY AYL+EGSE++ KVED V   +SV 
Sbjct: 586  VQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVG 645

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            L+N ++SRL+NLRF R+PLD Y +EE+S  T   RIT FKN+ G+ G F++GSRP WFM+
Sbjct: 646  LSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMV 705

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQ+PS   YDNYWPVQKI
Sbjct: 706  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKI 765

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PL+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSLVDQE GHQ+++ N SSD    TY
Sbjct: 766  PLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTY 825

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TV+EFEVRILEPEKSGGPW+ + TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY+
Sbjct: 826  TVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYI 885

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRV+L S+ RN ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 886  QGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 945

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
             WTGS+L G+AFYDAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQL+LLAKDFGS
Sbjct: 946  NWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGS 1005

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1006 LDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1065

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1066 RLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1125

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELLC +EML LE+Q +IA+QIGTTRSQI+
Sbjct: 1126 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1185

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL LGTSFL
Sbjct: 1186 SNLNDLTLGTSFL 1198


>XP_017972865.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Theobroma cacao]
          Length = 1456

 Score = 1066 bits (2757), Expect = 0.0
 Identities = 526/673 (78%), Positives = 589/673 (87%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFS++NF SG+  L D +  ES  DSEK + K+SEE T  GRKEN
Sbjct: 784  ESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKEN 843

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q +KVVELAM RWS  HSRPFLFGIL DGT+LCY AYL+EGSE++ KVED V   +SV 
Sbjct: 844  VQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVG 903

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            L+N ++SRL+NLRF R+PLD Y +EE+S  T   RIT FKN+ G+ G F++GSRP WFM+
Sbjct: 904  LSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMV 963

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQ+PS   YDNYWPVQKI
Sbjct: 964  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKI 1023

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PL+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSLVDQE GHQ+++ N SSD    TY
Sbjct: 1024 PLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTY 1083

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TV+EFEVRILEPEKSGGPW+ + TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY+
Sbjct: 1084 TVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYI 1143

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRV+L S+ RN ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1144 QGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 1203

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
             WTGS+L G+AFYDAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQL+LLAKDFGS
Sbjct: 1204 NWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGS 1263

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1264 LDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1323

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1324 RLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1383

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELLC +EML LE+Q +IA+QIGTTRSQI+
Sbjct: 1384 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1443

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL LGTSFL
Sbjct: 1444 SNLNDLTLGTSFL 1456


>XP_017972864.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Theobroma cacao]
          Length = 1457

 Score = 1066 bits (2757), Expect = 0.0
 Identities = 526/673 (78%), Positives = 589/673 (87%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFS++NF SG+  L D +  ES  DSEK + K+SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q +KVVELAM RWS  HSRPFLFGIL DGT+LCY AYL+EGSE++ KVED V   +SV 
Sbjct: 845  VQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVG 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            L+N ++SRL+NLRF R+PLD Y +EE+S  T   RIT FKN+ G+ G F++GSRP WFM+
Sbjct: 905  LSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQ+PS   YDNYWPVQKI
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKI 1024

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PL+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSLVDQE GHQ+++ N SSD    TY
Sbjct: 1025 PLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTY 1084

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TV+EFEVRILEPEKSGGPW+ + TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY+
Sbjct: 1085 TVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYI 1144

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRV+L S+ RN ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1145 QGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 1204

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
             WTGS+L G+AFYDAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQL+LLAKDFGS
Sbjct: 1205 NWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGS 1264

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1265 LDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1324

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1325 RLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1384

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELLC +EML LE+Q +IA+QIGTTRSQI+
Sbjct: 1385 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1444

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL LGTSFL
Sbjct: 1445 SNLNDLTLGTSFL 1457


>EOY22975.1 Cleavage and polyadenylation specificity factor 160 isoform 2
            [Theobroma cacao]
          Length = 1257

 Score = 1063 bits (2749), Expect = 0.0
 Identities = 525/673 (78%), Positives = 588/673 (87%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFS++ F SG+  L D +  ES  DSEK + K+SEE T  GRKEN
Sbjct: 585  ESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKEN 644

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q +KVVELAM RWS  HSRPFLFGIL DGT+LCY AYL+EGSE++ KVED V   +SV 
Sbjct: 645  VQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVG 704

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            L+N ++SRL+NLRF R+PLD Y +EE+S  T   RIT FKN+ G+ G F++GSRP WFM+
Sbjct: 705  LSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMV 764

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQ+PS   YDNYWPVQKI
Sbjct: 765  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKI 824

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PL+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSLVDQE GHQ+++ N SSD    TY
Sbjct: 825  PLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTY 884

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TV+EFEVRILEPEKSGGPW+ + TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY+
Sbjct: 885  TVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYI 944

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRV+L S+ RN ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 945  QGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 1004

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
             WTGS+L G+AFYDAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQL+LLAKDFGS
Sbjct: 1005 NWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGS 1064

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1065 LDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1124

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1125 RLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1184

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELLC +EML LE+Q +IA+QIGTTRSQI+
Sbjct: 1185 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1244

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL LGTSFL
Sbjct: 1245 SNLNDLTLGTSFL 1257


>EOY22974.1 Cleavage and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao]
          Length = 1457

 Score = 1063 bits (2749), Expect = 0.0
 Identities = 525/673 (78%), Positives = 588/673 (87%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVFS++ F SG+  L D +  ES  DSEK + K+SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q +KVVELAM RWS  HSRPFLFGIL DGT+LCY AYL+EGSE++ KVED V   +SV 
Sbjct: 845  VQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVG 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            L+N ++SRL+NLRF R+PLD Y +EE+S  T   RIT FKN+ G+ G F++GSRP WFM+
Sbjct: 905  LSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQ+PS   YDNYWPVQKI
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKI 1024

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PL+GTPHQVTYFAE+NLYP+IVSVPV KP+NQVLSSLVDQE GHQ+++ N SSD    TY
Sbjct: 1025 PLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTY 1084

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TV+EFEVRILEPEKSGGPW+ + TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY+
Sbjct: 1085 TVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYI 1144

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRV+L S+ RN ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1145 QGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 1204

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
             WTGS+L G+AFYDAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQL+LLAKDFGS
Sbjct: 1205 NWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGS 1264

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1265 LDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1324

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT+A    DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1325 RLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1384

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELLC +EML LE+Q +IA+QIGTTRSQI+
Sbjct: 1385 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQIL 1444

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL LGTSFL
Sbjct: 1445 SNLNDLTLGTSFL 1457


>XP_006490256.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Citrus sinensis]
          Length = 1457

 Score = 1059 bits (2739), Expect = 0.0
 Identities = 526/673 (78%), Positives = 584/673 (86%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWSG HSRPFLF IL DGT+LCYQAYL+EG E++ K +D V    S++
Sbjct: 845  IHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRFAR+PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 905  VSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQKI
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 1024

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGTY 888
            PLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     TY
Sbjct: 1025 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1084

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 1085 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV 1144

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1145 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1204

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFGS
Sbjct: 1205 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1264

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1265 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1324

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+SV
Sbjct: 1325 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1384

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI+
Sbjct: 1385 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1444

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDLALGTSFL
Sbjct: 1445 SNLNDLALGTSFL 1457


>XP_006421760.1 hypothetical protein CICLE_v10004147mg [Citrus clementina] ESR35000.1
            hypothetical protein CICLE_v10004147mg [Citrus
            clementina]
          Length = 1457

 Score = 1058 bits (2735), Expect = 0.0
 Identities = 525/673 (78%), Positives = 584/673 (86%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWSG HSRPFLF IL DGT+LCYQAYL+EGSE++ K +D V    S++
Sbjct: 845  IHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLS 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF+R PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 905  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQKI
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 1024

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGTY 888
            PLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     TY
Sbjct: 1025 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1084

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+EN+ LLA+GTAYV
Sbjct: 1085 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAYV 1144

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1145 QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 1204

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFGS
Sbjct: 1205 KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 1264

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 1265 LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1324

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+SV
Sbjct: 1325 RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 1384

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI+
Sbjct: 1385 PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 1444

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDLALGTSFL
Sbjct: 1445 SNLNDLALGTSFL 1457


>KDO65373.1 hypothetical protein CISIN_1g0005452mg, partial [Citrus sinensis]
          Length = 890

 Score = 1055 bits (2727), Expect = 0.0
 Identities = 524/673 (77%), Positives = 582/673 (86%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 218  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 277

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWS  HSRPFLF IL DGT+LCYQAYL+EG E++ K +D V    S++
Sbjct: 278  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 337

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF+R PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 338  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 397

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQKI
Sbjct: 398  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKI 457

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGTY 888
            PLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     TY
Sbjct: 458  PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 517

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            TVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 518  TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV 577

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 578  QGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILH 637

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFGS
Sbjct: 638  KWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGS 697

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL
Sbjct: 698  LDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 757

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+SV
Sbjct: 758  RLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSV 817

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
            PHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI+
Sbjct: 818  PHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQIL 877

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDLALGTSFL
Sbjct: 878  SNLNDLALGTSFL 890


>XP_006490255.1 PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Citrus sinensis]
          Length = 1458

 Score = 1055 bits (2727), Expect = 0.0
 Identities = 526/674 (78%), Positives = 584/674 (86%), Gaps = 6/674 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWSG HSRPFLF IL DGT+LCYQAYL+EG E++ K +D V    S++
Sbjct: 845  IHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRFAR+PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 905  VSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQK- 714
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQK 
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 1024

Query: 715  IPLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGT 885
            IPLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     T
Sbjct: 1025 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1084

Query: 886  YTVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAY 1065
            YTVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY
Sbjct: 1085 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1144

Query: 1066 VQGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVIL 1245
            VQGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+IL
Sbjct: 1145 VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1204

Query: 1246 HKWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFG 1425
            HKWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFG
Sbjct: 1205 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1264

Query: 1426 SLDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1605
            SLDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1265 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1324

Query: 1606 LRLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVES 1782
            LRLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+S
Sbjct: 1325 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1384

Query: 1783 VPHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQI 1962
            VPHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI
Sbjct: 1385 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1444

Query: 1963 VSNLNDLALGTSFL 2004
            +SNLNDLALGTSFL
Sbjct: 1445 LSNLNDLALGTSFL 1458


>XP_006421759.1 hypothetical protein CICLE_v10004147mg [Citrus clementina] ESR34999.1
            hypothetical protein CICLE_v10004147mg [Citrus
            clementina]
          Length = 1458

 Score = 1053 bits (2723), Expect = 0.0
 Identities = 525/674 (77%), Positives = 584/674 (86%), Gaps = 6/674 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 785  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 844

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWSG HSRPFLF IL DGT+LCYQAYL+EGSE++ K +D V    S++
Sbjct: 845  IHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLS 904

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF+R PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 905  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 964

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQK- 714
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQK 
Sbjct: 965  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 1024

Query: 715  IPLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGT 885
            IPLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     T
Sbjct: 1025 IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1084

Query: 886  YTVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAY 1065
            YTVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+EN+ LLA+GTAY
Sbjct: 1085 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAY 1144

Query: 1066 VQGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVIL 1245
            VQGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+IL
Sbjct: 1145 VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 1204

Query: 1246 HKWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFG 1425
            HKWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFG
Sbjct: 1205 HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1264

Query: 1426 SLDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1605
            SLDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1265 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1324

Query: 1606 LRLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVES 1782
            LRLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+S
Sbjct: 1325 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1384

Query: 1783 VPHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQI 1962
            VPHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI
Sbjct: 1385 VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 1444

Query: 1963 VSNLNDLALGTSFL 2004
            +SNLNDLALGTSFL
Sbjct: 1445 LSNLNDLALGTSFL 1458


>XP_007220310.1 hypothetical protein PRUPE_ppa000211mg [Prunus persica] ONI25129.1
            hypothetical protein PRUPE_2G282700 [Prunus persica]
          Length = 1459

 Score = 1052 bits (2720), Expect = 0.0
 Identities = 518/673 (76%), Positives = 585/673 (86%), Gaps = 5/673 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETEN-GRKEN 177
            ESG LE+FDVPNF+CVFSVD F SG A+L DT +++   D +K + K+SEE    GRKEN
Sbjct: 787  ESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINKSSEEVSGQGRKEN 846

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
             Q MKVVELAM RWSGQHSRPFLFGIL DG +LCY AYL+EG E++ K ED     ++  
Sbjct: 847  IQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETASKTEDSASAQNTTG 906

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF RVPLDTY K++ S ET   R+T FKN+ G+ GLF++GSRP WFM+
Sbjct: 907  VSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQGLFLSGSRPAWFMV 966

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKI 717
            FRERLRIHPQLCDG + A TVLHNV CNHG+IYVT QG LKICQLP +  YDNYWPVQKI
Sbjct: 967  FRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPPITSYDNYWPVQKI 1026

Query: 718  PLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TY 888
            PLKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSLVDQE GHQ+E+ N SSD    TY
Sbjct: 1027 PLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLSSDELHRTY 1086

Query: 889  TVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYV 1068
            +V+EFE+RI+EP+KSGGPWQ + TIPMQ+SENALTVRVVTLFNTTT+ENE LLA+GTAYV
Sbjct: 1087 SVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLLAIGTAYV 1146

Query: 1069 QGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILH 1248
            QGEDVAGRGRVLLFS  ++A+++QT +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH
Sbjct: 1147 QGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILH 1206

Query: 1249 KWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGS 1428
            KW G++L GVAF+D PPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQGAQLTLLAKDFG+
Sbjct: 1207 KWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLTLLAKDFGN 1266

Query: 1429 LDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFL 1608
            LDCFATEFLIDGSTLSL V+D+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVG HVTKFL
Sbjct: 1267 LDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGTHVTKFL 1326

Query: 1609 RLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESV 1785
            RLQML T  DRT   P  DKTNR+ALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV++V
Sbjct: 1327 RLQMLSTSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAV 1386

Query: 1786 PHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIV 1965
             HVAGLNPR+FRQF S GKAHRPGPD+IVDCELL  +EML LE+Q EIANQIGTTRSQI 
Sbjct: 1387 HHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLEIANQIGTTRSQIF 1446

Query: 1966 SNLNDLALGTSFL 2004
            SNLNDL++GTSFL
Sbjct: 1447 SNLNDLSIGTSFL 1459


>KDO65374.1 hypothetical protein CISIN_1g0005452mg, partial [Citrus sinensis]
          Length = 891

 Score = 1050 bits (2715), Expect = 0.0
 Identities = 524/674 (77%), Positives = 582/674 (86%), Gaps = 6/674 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEE-TENGRKEN 177
            ESG LE+FDVPNF+CVF+VD F SG+ ++ DT+++E+  DSE  +  +SEE T  GRKEN
Sbjct: 218  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 277

Query: 178  NQRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVN 357
               MKVVELAM RWS  HSRPFLF IL DGT+LCYQAYL+EG E++ K +D V    S++
Sbjct: 278  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 337

Query: 358  LNNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMI 537
            ++N S+SRL+NLRF+R PLD Y +EE     P  RIT FKN+ G  G F++GSRP W M+
Sbjct: 338  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 397

Query: 538  FRERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQK- 714
            FRERLR+HPQLCDG I AFTVLHNV CNHG IYVT QG LKICQLPS   YDNYWPVQK 
Sbjct: 398  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 457

Query: 715  IPLKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSS---DGT 885
            IPLK TPHQ+TYFAEKNLYPLIVSVPV+KPLNQVLS L+DQE GHQI++ N SS     T
Sbjct: 458  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 517

Query: 886  YTVEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAY 1065
            YTVEE+EVRILEP+++GGPWQ R TIPMQSSENALTVRVVTLFNTTT+ENE LLA+GTAY
Sbjct: 518  YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 577

Query: 1066 VQGEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVIL 1245
            VQGEDVA RGRVLLFS  RNA++ Q  ++EVYSKELKGAISA+ASLQGHLLIASGPK+IL
Sbjct: 578  VQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIIL 637

Query: 1246 HKWTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFG 1425
            HKWTG++L G+AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQL LLAKDFG
Sbjct: 638  HKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 697

Query: 1426 SLDCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1605
            SLDCFATEFLIDGSTLSL VSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 698  SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 757

Query: 1606 LRLQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVES 1782
            LRLQML T  DRT AAP  DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQSLQKKLV+S
Sbjct: 758  LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 817

Query: 1783 VPHVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQI 1962
            VPHVAGLNPRSFRQFHS GKAHRPGPDSIVDCELL  +EML LE+Q EIA+Q GTTRSQI
Sbjct: 818  VPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQI 877

Query: 1963 VSNLNDLALGTSFL 2004
            +SNLNDLALGTSFL
Sbjct: 878  LSNLNDLALGTSFL 891


>XP_018805301.1 PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X4 [Juglans regia] XP_018805302.1
            PREDICTED: cleavage and polyadenylation specificity
            factor subunit 1-like isoform X4 [Juglans regia]
          Length = 1189

 Score = 1049 bits (2713), Expect = 0.0
 Identities = 521/672 (77%), Positives = 578/672 (86%), Gaps = 4/672 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETENGRKENN 180
            ESG LE+ DVPNF+CVFS + F SG   L D F+ E   D E   R + E T  GRKE+ 
Sbjct: 518  ESGALEILDVPNFNCVFSAEKFMSGNPLLVDAFMPEPAKDIEVTKRSSEEVTGQGRKEST 577

Query: 181  QRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVNL 360
            Q MKVVELAM RW+GQHSRPFLFGIL+DGT+LCY AYLYEG+ES+ +VED     +S  L
Sbjct: 578  QNMKVVELAMQRWAGQHSRPFLFGILSDGTILCYHAYLYEGAESNSRVEDSASVQNSGGL 637

Query: 361  NNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMIF 540
            ++ S+SRL+NLRF RVPLDTY +EE    +P  RIT FKN+GG  GLF++GSRP WFM+F
Sbjct: 638  SSISASRLRNLRFVRVPLDTYAREETPSGSPCQRITIFKNIGGHQGLFLSGSRPAWFMVF 697

Query: 541  RERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 720
            RERLR+HPQLCDG I AFTVLHNV CNHG+IYVT QG LKICQLPS+  YDNYWPVQKIP
Sbjct: 698  RERLRVHPQLCDGCIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSVSSYDNYWPVQKIP 757

Query: 721  LKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TYT 891
            LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSLVDQE GHQ+E+ N  SD    TYT
Sbjct: 758  LKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLGSDEQHRTYT 817

Query: 892  VEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYVQ 1071
            V+E+EVRILEPEKSGGPWQ   TIPMQSSENALTVRVVTL NT T+ENE LLA+GTAYVQ
Sbjct: 818  VDEYEVRILEPEKSGGPWQTMATIPMQSSENALTVRVVTLLNTITKENETLLAIGTAYVQ 877

Query: 1072 GEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILHK 1251
            GEDVA RGRVLLF+V +N ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH 
Sbjct: 878  GEDVAARGRVLLFAVGKNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHN 937

Query: 1252 WTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSL 1431
            WTG++L G+AF+DAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+QL+LLAKDFGSL
Sbjct: 938  WTGTELNGIAFFDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGSQLSLLAKDFGSL 997

Query: 1432 DCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1611
            DCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR
Sbjct: 998  DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1057

Query: 1612 LQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVP 1788
            LQML T  DR+ AAP  DK NRFALLFGTLDGS+GCIAPLDELTFRRLQSLQ+KLV++VP
Sbjct: 1058 LQMLSTSSDRSGAAPGSDKINRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVP 1117

Query: 1789 HVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVS 1968
            HVAGLNPRSFRQF + GKAHR GPDSIVDCELLC +EML LE+Q EIANQIGTTRS I+S
Sbjct: 1118 HVAGLNPRSFRQFRTNGKAHRSGPDSIVDCELLCNYEMLPLEEQLEIANQIGTTRSHILS 1177

Query: 1969 NLNDLALGTSFL 2004
            NL DL+LGTSFL
Sbjct: 1178 NLTDLSLGTSFL 1189


>XP_018805300.1 PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X3 [Juglans regia]
          Length = 1206

 Score = 1049 bits (2713), Expect = 0.0
 Identities = 521/672 (77%), Positives = 578/672 (86%), Gaps = 4/672 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETENGRKENN 180
            ESG LE+ DVPNF+CVFS + F SG   L D F+ E   D E   R + E T  GRKE+ 
Sbjct: 535  ESGALEILDVPNFNCVFSAEKFMSGNPLLVDAFMPEPAKDIEVTKRSSEEVTGQGRKEST 594

Query: 181  QRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVNL 360
            Q MKVVELAM RW+GQHSRPFLFGIL+DGT+LCY AYLYEG+ES+ +VED     +S  L
Sbjct: 595  QNMKVVELAMQRWAGQHSRPFLFGILSDGTILCYHAYLYEGAESNSRVEDSASVQNSGGL 654

Query: 361  NNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMIF 540
            ++ S+SRL+NLRF RVPLDTY +EE    +P  RIT FKN+GG  GLF++GSRP WFM+F
Sbjct: 655  SSISASRLRNLRFVRVPLDTYAREETPSGSPCQRITIFKNIGGHQGLFLSGSRPAWFMVF 714

Query: 541  RERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 720
            RERLR+HPQLCDG I AFTVLHNV CNHG+IYVT QG LKICQLPS+  YDNYWPVQKIP
Sbjct: 715  RERLRVHPQLCDGCIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSVSSYDNYWPVQKIP 774

Query: 721  LKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TYT 891
            LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSLVDQE GHQ+E+ N  SD    TYT
Sbjct: 775  LKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLGSDEQHRTYT 834

Query: 892  VEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYVQ 1071
            V+E+EVRILEPEKSGGPWQ   TIPMQSSENALTVRVVTL NT T+ENE LLA+GTAYVQ
Sbjct: 835  VDEYEVRILEPEKSGGPWQTMATIPMQSSENALTVRVVTLLNTITKENETLLAIGTAYVQ 894

Query: 1072 GEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILHK 1251
            GEDVA RGRVLLF+V +N ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH 
Sbjct: 895  GEDVAARGRVLLFAVGKNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHN 954

Query: 1252 WTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSL 1431
            WTG++L G+AF+DAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+QL+LLAKDFGSL
Sbjct: 955  WTGTELNGIAFFDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGSQLSLLAKDFGSL 1014

Query: 1432 DCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1611
            DCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR
Sbjct: 1015 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1074

Query: 1612 LQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVP 1788
            LQML T  DR+ AAP  DK NRFALLFGTLDGS+GCIAPLDELTFRRLQSLQ+KLV++VP
Sbjct: 1075 LQMLSTSSDRSGAAPGSDKINRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVP 1134

Query: 1789 HVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVS 1968
            HVAGLNPRSFRQF + GKAHR GPDSIVDCELLC +EML LE+Q EIANQIGTTRS I+S
Sbjct: 1135 HVAGLNPRSFRQFRTNGKAHRSGPDSIVDCELLCNYEMLPLEEQLEIANQIGTTRSHILS 1194

Query: 1969 NLNDLALGTSFL 2004
            NL DL+LGTSFL
Sbjct: 1195 NLTDLSLGTSFL 1206


>XP_018805299.1 PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X2 [Juglans regia]
          Length = 1333

 Score = 1049 bits (2713), Expect = 0.0
 Identities = 521/672 (77%), Positives = 578/672 (86%), Gaps = 4/672 (0%)
 Frame = +1

Query: 1    ESGILEVFDVPNFSCVFSVDNFESGKAYLGDTFVQESFNDSEKYLRKNSEETENGRKENN 180
            ESG LE+ DVPNF+CVFS + F SG   L D F+ E   D E   R + E T  GRKE+ 
Sbjct: 662  ESGALEILDVPNFNCVFSAEKFMSGNPLLVDAFMPEPAKDIEVTKRSSEEVTGQGRKEST 721

Query: 181  QRMKVVELAMHRWSGQHSRPFLFGILADGTVLCYQAYLYEGSESSVKVEDVVPGHDSVNL 360
            Q MKVVELAM RW+GQHSRPFLFGIL+DGT+LCY AYLYEG+ES+ +VED     +S  L
Sbjct: 722  QNMKVVELAMQRWAGQHSRPFLFGILSDGTILCYHAYLYEGAESNSRVEDSASVQNSGGL 781

Query: 361  NNTSSSRLKNLRFARVPLDTYIKEEISPETPYPRITTFKNVGGFPGLFVAGSRPMWFMIF 540
            ++ S+SRL+NLRF RVPLDTY +EE    +P  RIT FKN+GG  GLF++GSRP WFM+F
Sbjct: 782  SSISASRLRNLRFVRVPLDTYAREETPSGSPCQRITIFKNIGGHQGLFLSGSRPAWFMVF 841

Query: 541  RERLRIHPQLCDGPIAAFTVLHNVYCNHGIIYVTQQGTLKICQLPSLLCYDNYWPVQKIP 720
            RERLR+HPQLCDG I AFTVLHNV CNHG+IYVT QG LKICQLPS+  YDNYWPVQKIP
Sbjct: 842  RERLRVHPQLCDGCIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSVSSYDNYWPVQKIP 901

Query: 721  LKGTPHQVTYFAEKNLYPLIVSVPVIKPLNQVLSSLVDQEAGHQIEHDNFSSDG---TYT 891
            LKGTPHQVTYFAEKNLYPLIVSVPV KPLNQVLSSLVDQE GHQ+E+ N  SD    TYT
Sbjct: 902  LKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLGSDEQHRTYT 961

Query: 892  VEEFEVRILEPEKSGGPWQIRGTIPMQSSENALTVRVVTLFNTTTRENEILLAVGTAYVQ 1071
            V+E+EVRILEPEKSGGPWQ   TIPMQSSENALTVRVVTL NT T+ENE LLA+GTAYVQ
Sbjct: 962  VDEYEVRILEPEKSGGPWQTMATIPMQSSENALTVRVVTLLNTITKENETLLAIGTAYVQ 1021

Query: 1072 GEDVAGRGRVLLFSVERNAESSQTSISEVYSKELKGAISAVASLQGHLLIASGPKVILHK 1251
            GEDVA RGRVLLF+V +N ++ Q  +SEVYSKELKGAISA+ASLQGHLLIASGPK+ILH 
Sbjct: 1022 GEDVAARGRVLLFAVGKNTDNPQNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHN 1081

Query: 1252 WTGSDLTGVAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLTLLAKDFGSL 1431
            WTG++L G+AF+DAPPLYVVSLNIVKNFILLGD+HKSIYFLSWKEQG+QL+LLAKDFGSL
Sbjct: 1082 WTGTELNGIAFFDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGSQLSLLAKDFGSL 1141

Query: 1432 DCFATEFLIDGSTLSLTVSDDQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1611
            DCFATEFLIDGSTLSL VSDDQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR
Sbjct: 1142 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLR 1201

Query: 1612 LQMLPT-PDRTNAAPIPDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQKKLVESVP 1788
            LQML T  DR+ AAP  DK NRFALLFGTLDGS+GCIAPLDELTFRRLQSLQ+KLV++VP
Sbjct: 1202 LQMLSTSSDRSGAAPGSDKINRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVP 1261

Query: 1789 HVAGLNPRSFRQFHSKGKAHRPGPDSIVDCELLCQFEMLVLEQQHEIANQIGTTRSQIVS 1968
            HVAGLNPRSFRQF + GKAHR GPDSIVDCELLC +EML LE+Q EIANQIGTTRS I+S
Sbjct: 1262 HVAGLNPRSFRQFRTNGKAHRSGPDSIVDCELLCNYEMLPLEEQLEIANQIGTTRSHILS 1321

Query: 1969 NLNDLALGTSFL 2004
            NL DL+LGTSFL
Sbjct: 1322 NLTDLSLGTSFL 1333

