BLASTX nr result

ID: Ophiopogon27_contig00005582 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ophiopogon27_contig00005582
         (4853 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020260753.1| cleavage and polyadenylation specificity fac...  2454   0.0  
ref|XP_020260752.1| cleavage and polyadenylation specificity fac...  2450   0.0  
ref|XP_008791964.1| PREDICTED: cleavage and polyadenylation spec...  2295   0.0  
ref|XP_008791963.1| PREDICTED: cleavage and polyadenylation spec...  2291   0.0  
ref|XP_008791966.1| PREDICTED: cleavage and polyadenylation spec...  2284   0.0  
ref|XP_009383988.1| PREDICTED: cleavage and polyadenylation spec...  2284   0.0  
ref|XP_018676515.1| PREDICTED: cleavage and polyadenylation spec...  2277   0.0  
ref|XP_018676516.1| PREDICTED: cleavage and polyadenylation spec...  2275   0.0  
ref|XP_010918168.1| PREDICTED: cleavage and polyadenylation spec...  2275   0.0  
ref|XP_010918167.1| PREDICTED: cleavage and polyadenylation spec...  2270   0.0  
ref|XP_020084456.1| probable cleavage and polyadenylation specif...  2239   0.0  
ref|XP_020672482.1| cleavage and polyadenylation specificity fac...  2199   0.0  
gb|OAY63492.1| putative cleavage and polyadenylation specificity...  2199   0.0  
ref|XP_010277399.1| PREDICTED: cleavage and polyadenylation spec...  2165   0.0  
ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  2158   0.0  
emb|CBI24510.3| unnamed protein product, partial [Vitis vinifera]    2151   0.0  
ref|XP_019081674.1| PREDICTED: cleavage and polyadenylation spec...  2151   0.0  
ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation spec...  2137   0.0  
gb|EOY22974.1| Cleavage and polyadenylation specificity factor 1...  2133   0.0  
ref|XP_017972864.1| PREDICTED: cleavage and polyadenylation spec...  2132   0.0  

>ref|XP_020260753.1| cleavage and polyadenylation specificity factor subunit 1 isoform X2
            [Asparagus officinalis]
 gb|ONK71681.1| uncharacterized protein A4U43_C04F11250 [Asparagus officinalis]
          Length = 1454

 Score = 2454 bits (6361), Expect = 0.0
 Identities = 1226/1454 (84%), Positives = 1309/1454 (90%), Gaps = 5/1454 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHWPTGIENCASGFITH                 DLD + AP RRIGPIPNL
Sbjct: 1    MSYAAFKMMHWPTGIENCASGFITHSPADAAASVILPHLQPDDLDSDFAPTRRIGPIPNL 60

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+EVY VRVQEDD R   A  EPRGGGG+MDGL GARLELVCHYRLHGNV+SM 
Sbjct: 61   VVTAGNVVEVYIVRVQEDDARVSQAPIEPRGGGGIMDGLRGARLELVCHYRLHGNVQSMA 120

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLSLGGDE SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDWL+LK+GR SF
Sbjct: 121  VLSLGGDERSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWLFLKKGRESF 180

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTS-VCIESSFVL 3971
            A GP+VKADPLGRCGG LVYD QMIVLKAAQ GQGLV D+EP+S+GG+T+ V IESS+V+
Sbjct: 181  AMGPLVKADPLGRCGGALVYDLQMIVLKAAQTGQGLVTDDEPTSVGGSTAAVRIESSYVI 240

Query: 3970 NLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP 3791
            NLR+LDM+HVKDFTFVHGYIEPVMV LHE+EPTWAGRISWKHHTC+ISALSISTTLKQHP
Sbjct: 241  NLRELDMKHVKDFTFVHGYIEPVMVFLHEREPTWAGRISWKHHTCLISALSISTTLKQHP 300

Query: 3790 LIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMP 3611
            LIWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQS+SC+LGLN FA+QM++SSDMP
Sbjct: 301  LIWSAANLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSSSCALGLNSFAMQMDSSSDMP 360

Query: 3610 KTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGIT 3431
            K +FNVELDAAKATWLS DV +FSSK GELLLLTLV DGR+VQKLDL KSKASVLTSGIT
Sbjct: 361  KASFNVELDAAKATWLSPDVVIFSSKNGELLLLTLVSDGRIVQKLDLMKSKASVLTSGIT 420

Query: 3430 SIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 3251
            ++G+SFFFLGSRLGDSLLVQYSYGASSP+SAHVKDE  +IEGD+P  KRL+ MSSDSLQD
Sbjct: 421  TMGSSFFFLGSRLGDSLLVQYSYGASSPTSAHVKDEIAEIEGDVPAVKRLKRMSSDSLQD 480

Query: 3250 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 3071
            VAS EELSLYAT PNSSES QKVFSFAVRD+LINVGPLKDFSYGLRINADPNATG+AKQS
Sbjct: 481  VASAEELSLYATAPNSSESTQKVFSFAVRDTLINVGPLKDFSYGLRINADPNATGLAKQS 540

Query: 3070 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 2891
            NYELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIWTVYHKSSR HAVDSSKT TE
Sbjct: 541  NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSSRRHAVDSSKTLTE 600

Query: 2890 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 2711
            DDEYHAYLIISLETRTMVLETADDLGEVTESV YYVQGSTIAAGNLFGRRRVVQIFARGA
Sbjct: 601  DDEYHAYLIISLETRTMVLETADDLGEVTESVGYYVQGSTIAAGNLFGRRRVVQIFARGA 660

Query: 2710 RILDGSYMTQELSFVVH---XXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPS 2540
            RILDGSYMTQELSF ++                   SIADPYVLLKMTDGSIQLLAGD S
Sbjct: 661  RILDGSYMTQELSFAMNNSESASSASNSEPSTVSSVSIADPYVLLKMTDGSIQLLAGDTS 720

Query: 2539 TCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPD 2360
            TCTVS+NVPARFASLTD ++ACTLYHDK PEPWLRKASSDAWLSTG+AE ID +DGSYPD
Sbjct: 721  TCTVSVNVPARFASLTDPISACTLYHDKGPEPWLRKASSDAWLSTGIAEAIDGSDGSYPD 780

Query: 2359 QGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGLDETAKK 2180
            QGDIYCLVCYESG LEIFDVPNFRSVFS DKF+SGKSHLMD CI+P K K +  +ET KK
Sbjct: 781  QGDIYCLVCYESGILEIFDVPNFRSVFSVDKFMSGKSHLMDMCIVPQKTKPRNFEETIKK 840

Query: 2179 EPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVYPLST 2000
            EP QNLKV+EL MQRWSGQYSRPFLF ILNDGTML YHAYLYEG E APK E A+ P S+
Sbjct: 841  EPTQNLKVMELVMQRWSGQYSRPFLFAILNDGTMLSYHAYLYEGQENAPKAEDALSPHSS 900

Query: 1999 GDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSRPAWL 1820
             ++++TG SR RNLRF RVSID NI+E+SSN+T RPRITIFKNVGG QGLFL+GSRPAW 
Sbjct: 901  SNMNSTGTSRMRNLRFQRVSIDTNIQEDSSNVTVRPRITIFKNVGGYQGLFLTGSRPAWF 960

Query: 1819 MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNYWPVQ 1640
            MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPS CNYDNYWPVQ
Sbjct: 961  MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSACNYDNYWPVQ 1020

Query: 1639 KIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSASSDSLQK 1463
            KIPLR TPHQVTYF EK+LYPVIVSVPVVRPLSQVLSSLVDQD+ NQMDQD+ SSD LQ 
Sbjct: 1021 KIPLRGTPHQVTYFAEKSLYPVIVSVPVVRPLSQVLSSLVDQDSGNQMDQDTVSSDGLQN 1080

Query: 1462 SYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAIGTA 1283
            +Y IDEFEVRIFEPEKSGGHWEPKATIPMQ SENALTVR VTL+NSTT ENE+LLAIGTA
Sbjct: 1081 NYCIDEFEVRIFEPEKSGGHWEPKATIPMQTSENALTVRMVTLLNSTTKENESLLAIGTA 1140

Query: 1282 YVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGTKIT 1103
            YVQGED+AARGR+LLF+IGR++EN QNLVSEVYSKELKGAISALASLQGHLLIASG KI 
Sbjct: 1141 YVQGEDIAARGRVLLFSIGRNSENLQNLVSEVYSKELKGAISALASLQGHLLIASGPKII 1200

Query: 1102 LNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAKDFS 923
            L+KWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAKDF 
Sbjct: 1201 LHKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAKDFG 1260

Query: 922  SLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHVSKF 743
            SLDCFATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAH++KF
Sbjct: 1261 SLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLSRAEFHAGAHITKF 1320

Query: 742  LRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXLVDV 563
            LRLQMLPTSSDR+S  PGSDKTNRFA+LFGTLDGSIGCVAPLEE            LVD 
Sbjct: 1321 LRLQMLPTSSDRASGAPGSDKTNRFALLFGTLDGSIGCVAPLEELTFRRLQTLQKRLVDA 1380

Query: 562  VPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTRSQI 383
            VPHVCGLNPR+FRQF SNGKAHRPGPDNM+D ELLSHYE+LPLEEQLEIAHQIGTTRSQI
Sbjct: 1381 VPHVCGLNPRSFRQFHSNGKAHRPGPDNMVDFELLSHYELLPLEEQLEIAHQIGTTRSQI 1440

Query: 382  LSNLNDLSLGTSFL 341
            LSNLNDLSLGTSFL
Sbjct: 1441 LSNLNDLSLGTSFL 1454


>ref|XP_020260752.1| cleavage and polyadenylation specificity factor subunit 1 isoform X1
            [Asparagus officinalis]
          Length = 1455

 Score = 2450 bits (6349), Expect = 0.0
 Identities = 1226/1455 (84%), Positives = 1309/1455 (89%), Gaps = 6/1455 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHWPTGIENCASGFITH                 DLD + AP RRIGPIPNL
Sbjct: 1    MSYAAFKMMHWPTGIENCASGFITHSPADAAASVILPHLQPDDLDSDFAPTRRIGPIPNL 60

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+EVY VRVQEDD R   A  EPRGGGG+MDGL GARLELVCHYRLHGNV+SM 
Sbjct: 61   VVTAGNVVEVYIVRVQEDDARVSQAPIEPRGGGGIMDGLRGARLELVCHYRLHGNVQSMA 120

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLSLGGDE SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDWL+LK+GR SF
Sbjct: 121  VLSLGGDERSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWLFLKKGRESF 180

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTS-VCIESSFVL 3971
            A GP+VKADPLGRCGG LVYD QMIVLKAAQ GQGLV D+EP+S+GG+T+ V IESS+V+
Sbjct: 181  AMGPLVKADPLGRCGGALVYDLQMIVLKAAQTGQGLVTDDEPTSVGGSTAAVRIESSYVI 240

Query: 3970 NLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP 3791
            NLR+LDM+HVKDFTFVHGYIEPVMV LHE+EPTWAGRISWKHHTC+ISALSISTTLKQHP
Sbjct: 241  NLRELDMKHVKDFTFVHGYIEPVMVFLHEREPTWAGRISWKHHTCLISALSISTTLKQHP 300

Query: 3790 LIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMP 3611
            LIWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQS+SC+LGLN FA+QM++SSDMP
Sbjct: 301  LIWSAANLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSSSCALGLNSFAMQMDSSSDMP 360

Query: 3610 KTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGIT 3431
            K +FNVELDAAKATWLS DV +FSSK GELLLLTLV DGR+VQKLDL KSKASVLTSGIT
Sbjct: 361  KASFNVELDAAKATWLSPDVVIFSSKNGELLLLTLVSDGRIVQKLDLMKSKASVLTSGIT 420

Query: 3430 SIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 3251
            ++G+SFFFLGSRLGDSLLVQYSYGASSP+SAHVKDE  +IEGD+P  KRL+ MSSDSLQD
Sbjct: 421  TMGSSFFFLGSRLGDSLLVQYSYGASSPTSAHVKDEIAEIEGDVPAVKRLKRMSSDSLQD 480

Query: 3250 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 3071
            VAS EELSLYAT PNSSES QKVFSFAVRD+LINVGPLKDFSYGLRINADPNATG+AKQS
Sbjct: 481  VASAEELSLYATAPNSSESTQKVFSFAVRDTLINVGPLKDFSYGLRINADPNATGLAKQS 540

Query: 3070 NYELVCCSGHGKNGSLCVLQQSIRPELITE-VELPGCKGIWTVYHKSSRTHAVDSSKTTT 2894
            NYELVCCSGHGKNG+LCVLQQSIRPELITE VELPGCKGIWTVYHKSSR HAVDSSKT T
Sbjct: 541  NYELVCCSGHGKNGALCVLQQSIRPELITEQVELPGCKGIWTVYHKSSRRHAVDSSKTLT 600

Query: 2893 EDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARG 2714
            EDDEYHAYLIISLETRTMVLETADDLGEVTESV YYVQGSTIAAGNLFGRRRVVQIFARG
Sbjct: 601  EDDEYHAYLIISLETRTMVLETADDLGEVTESVGYYVQGSTIAAGNLFGRRRVVQIFARG 660

Query: 2713 ARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXS---IADPYVLLKMTDGSIQLLAGDP 2543
            ARILDGSYMTQELSF ++                S   IADPYVLLKMTDGSIQLLAGD 
Sbjct: 661  ARILDGSYMTQELSFAMNNSESASSASNSEPSTVSSVSIADPYVLLKMTDGSIQLLAGDT 720

Query: 2542 STCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYP 2363
            STCTVS+NVPARFASLTD ++ACTLYHDK PEPWLRKASSDAWLSTG+AE ID +DGSYP
Sbjct: 721  STCTVSVNVPARFASLTDPISACTLYHDKGPEPWLRKASSDAWLSTGIAEAIDGSDGSYP 780

Query: 2362 DQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGLDETAK 2183
            DQGDIYCLVCYESG LEIFDVPNFRSVFS DKF+SGKSHLMD CI+P K K +  +ET K
Sbjct: 781  DQGDIYCLVCYESGILEIFDVPNFRSVFSVDKFMSGKSHLMDMCIVPQKTKPRNFEETIK 840

Query: 2182 KEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVYPLS 2003
            KEP QNLKV+EL MQRWSGQYSRPFLF ILNDGTML YHAYLYEG E APK E A+ P S
Sbjct: 841  KEPTQNLKVMELVMQRWSGQYSRPFLFAILNDGTMLSYHAYLYEGQENAPKAEDALSPHS 900

Query: 2002 TGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSRPAW 1823
            + ++++TG SR RNLRF RVSID NI+E+SSN+T RPRITIFKNVGG QGLFL+GSRPAW
Sbjct: 901  SSNMNSTGTSRMRNLRFQRVSIDTNIQEDSSNVTVRPRITIFKNVGGYQGLFLTGSRPAW 960

Query: 1822 LMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNYWPV 1643
             MLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPS CNYDNYWPV
Sbjct: 961  FMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSACNYDNYWPV 1020

Query: 1642 QKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSASSDSLQ 1466
            QKIPLR TPHQVTYF EK+LYPVIVSVPVVRPLSQVLSSLVDQD+ NQMDQD+ SSD LQ
Sbjct: 1021 QKIPLRGTPHQVTYFAEKSLYPVIVSVPVVRPLSQVLSSLVDQDSGNQMDQDTVSSDGLQ 1080

Query: 1465 KSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAIGT 1286
             +Y IDEFEVRIFEPEKSGGHWEPKATIPMQ SENALTVR VTL+NSTT ENE+LLAIGT
Sbjct: 1081 NNYCIDEFEVRIFEPEKSGGHWEPKATIPMQTSENALTVRMVTLLNSTTKENESLLAIGT 1140

Query: 1285 AYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGTKI 1106
            AYVQGED+AARGR+LLF+IGR++EN QNLVSEVYSKELKGAISALASLQGHLLIASG KI
Sbjct: 1141 AYVQGEDIAARGRVLLFSIGRNSENLQNLVSEVYSKELKGAISALASLQGHLLIASGPKI 1200

Query: 1105 TLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAKDF 926
             L+KWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAKDF
Sbjct: 1201 ILHKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAKDF 1260

Query: 925  SSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHVSK 746
             SLDCFATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAH++K
Sbjct: 1261 GSLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLSRAEFHAGAHITK 1320

Query: 745  FLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXLVD 566
            FLRLQMLPTSSDR+S  PGSDKTNRFA+LFGTLDGSIGCVAPLEE            LVD
Sbjct: 1321 FLRLQMLPTSSDRASGAPGSDKTNRFALLFGTLDGSIGCVAPLEELTFRRLQTLQKRLVD 1380

Query: 565  VVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTRSQ 386
             VPHVCGLNPR+FRQF SNGKAHRPGPDNM+D ELLSHYE+LPLEEQLEIAHQIGTTRSQ
Sbjct: 1381 AVPHVCGLNPRSFRQFHSNGKAHRPGPDNMVDFELLSHYELLPLEEQLEIAHQIGTTRSQ 1440

Query: 385  ILSNLNDLSLGTSFL 341
            ILSNLNDLSLGTSFL
Sbjct: 1441 ILSNLNDLSLGTSFL 1455


>ref|XP_008791964.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Phoenix dactylifera]
          Length = 1455

 Score = 2295 bits (5948), Expect = 0.0
 Identities = 1142/1458 (78%), Positives = 1262/1458 (86%), Gaps = 9/1458 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYA+FKMMHW TGIENCA+GFITH                  L+ E +PKRRIGPIPNL
Sbjct: 1    MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADD--LESEWSPKRRIGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+E+Y VR+QEDD R        + GGG+MDGL GARLELVCHYRLHGNVESM 
Sbjct: 59   VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR  F
Sbjct: 119  ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP++KADPLGRCGG LVY  QMI+LK+AQAGQGLV D+EP+SLGGT    IESS+V+N
Sbjct: 179  ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+
Sbjct: 239  LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK
Sbjct: 299  IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 359  SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE  DIEGD P AKRLR MSSD+LQ+V
Sbjct: 419  IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSLY+T PNSSESAQK+FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN
Sbjct: 479  VSGEELSLYSTAPNSSESAQKIFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H  D SKT +ED
Sbjct: 539  YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 598

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR
Sbjct: 599  DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F  H                SIADP++LLKMTDGSIQLL GDPS CTV
Sbjct: 658  ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+
Sbjct: 718  SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2192
            YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+     T        +G+  
Sbjct: 778  YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
             AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E  V 
Sbjct: 838  QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P +  DISN   SR RNLRF RVS+DI  REES +   RPRIT+FKNVGG QGLFL+GSR
Sbjct: 898  PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y
Sbjct: 958  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD  +Q D++S SSD
Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1077

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA
Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1137

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            IGTAYVQGEDVAARGR+LL++  +++EN QNLV EVYSKELKGA+SALA+LQGHLLIASG
Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLVLEVYSKELKGAVSALAALQGHLLIASG 1197

Query: 1114 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 935
             KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLA
Sbjct: 1198 PKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLA 1257

Query: 934  KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 755
            KDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGAH
Sbjct: 1258 KDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGAH 1317

Query: 754  VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 575
            V+KFLRLQMLPTS+DR++A  GSDKTNRFA+LFGTLDGSIGC+APL+E            
Sbjct: 1318 VTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQRK 1377

Query: 574  LVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 395
            LVD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGTT
Sbjct: 1378 LVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGTT 1437

Query: 394  RSQILSNLNDLSLGTSFL 341
            RSQILSNLNDLSLGTSFL
Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455


>ref|XP_008791963.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Phoenix dactylifera]
          Length = 1456

 Score = 2291 bits (5936), Expect = 0.0
 Identities = 1142/1459 (78%), Positives = 1262/1459 (86%), Gaps = 10/1459 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYA+FKMMHW TGIENCA+GFITH                  L+ E +PKRRIGPIPNL
Sbjct: 1    MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADD--LESEWSPKRRIGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+E+Y VR+QEDD R        + GGG+MDGL GARLELVCHYRLHGNVESM 
Sbjct: 59   VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR  F
Sbjct: 119  ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP++KADPLGRCGG LVY  QMI+LK+AQAGQGLV D+EP+SLGGT    IESS+V+N
Sbjct: 179  ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+
Sbjct: 239  LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK
Sbjct: 299  IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 359  SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE  DIEGD P AKRLR MSSD+LQ+V
Sbjct: 419  IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSLY+T PNSSESAQK+FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN
Sbjct: 479  VSGEELSLYSTAPNSSESAQKIFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H  D SKT +ED
Sbjct: 539  YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 598

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR
Sbjct: 599  DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F  H                SIADP++LLKMTDGSIQLL GDPS CTV
Sbjct: 658  ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+
Sbjct: 718  SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2192
            YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+     T        +G+  
Sbjct: 778  YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
             AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E  V 
Sbjct: 838  QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P +  DISN   SR RNLRF RVS+DI  REES +   RPRIT+FKNVGG QGLFL+GSR
Sbjct: 898  PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y
Sbjct: 958  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD  +Q D++S SSD
Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1077

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA
Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1137

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 1118
            IGTAYVQGEDVAARGR+LL++  +++EN QNL V EVYSKELKGA+SALA+LQGHLLIAS
Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALAALQGHLLIAS 1197

Query: 1117 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 938
            G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL
Sbjct: 1198 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1257

Query: 937  AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 758
            AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA
Sbjct: 1258 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1317

Query: 757  HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 578
            HV+KFLRLQMLPTS+DR++A  GSDKTNRFA+LFGTLDGSIGC+APL+E           
Sbjct: 1318 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQR 1377

Query: 577  XLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 398
             LVD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGT
Sbjct: 1378 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGT 1437

Query: 397  TRSQILSNLNDLSLGTSFL 341
            TRSQILSNLNDLSLGTSFL
Sbjct: 1438 TRSQILSNLNDLSLGTSFL 1456


>ref|XP_008791966.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X3 [Phoenix dactylifera]
          Length = 1455

 Score = 2284 bits (5919), Expect = 0.0
 Identities = 1141/1459 (78%), Positives = 1261/1459 (86%), Gaps = 10/1459 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYA+FKMMHW TGIENCA+GFITH                  L+ E +PKRRIGPIPNL
Sbjct: 1    MSYASFKMMHWATGIENCAAGFITHSRADFAAQIPPIQADD--LESEWSPKRRIGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+E+Y VR+QEDD R        + GGG+MDGL GARLELVCHYRLHGNVESM 
Sbjct: 59   VVTAGNVLEIYLVRIQEDDGRPDARPGGEQRGGGMMDGLSGARLELVCHYRLHGNVESMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLRTSSMHCFEGPDW +LKRGR  F
Sbjct: 119  ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRTSSMHCFEGPDWHFLKRGRERF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP++KADPLGRCGG LVY  QMI+LK+AQAGQGLV D+EP+SLGGT    IESS+V+N
Sbjct: 179  ARGPLIKADPLGRCGGALVYGLQMIILKSAQAGQGLVGDDEPASLGGTFPARIESSYVIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTCMI ALSISTTLKQHP+
Sbjct: 239  LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMICALSISTTLKQHPM 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDA KLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK
Sbjct: 299  IWSAVNLPHDASKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 359  SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITT 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S+P++A++KDE  DIEGD P AKRLR MSSD+LQ+V
Sbjct: 419  IGSSFFFLGSRLGDSLLVQYSCGTSTPTTANMKDEVADIEGDGPSAKRLRRMSSDALQEV 478

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSLY+T PNSSESAQ +FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN
Sbjct: 479  VSGEELSLYSTAPNSSESAQ-IFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 537

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LC+LQQSIRPELITEVELPGCKGIWTVYHKSSR H  D SKT +ED
Sbjct: 538  YELVCCSGHGKNGALCMLQQSIRPELITEVELPGCKGIWTVYHKSSRGHTADPSKTMSED 597

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR
Sbjct: 598  DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 656

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F  H                SIADP++LLKMTDGSIQLL GDPS CTV
Sbjct: 657  ILDGSYMTQELNFGGHNSDSTTNSELSAVASVSIADPFILLKMTDGSIQLLVGDPSACTV 716

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S++VPA FA+ T S++ CTLYHDK PEPWLRK S+DAWLSTG+AE ID NDGSY DQGD+
Sbjct: 717  SVSVPAIFANSTKSISTCTLYHDKGPEPWLRKTSTDAWLSTGIAETIDGNDGSYHDQGDV 776

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2192
            YCLVCY++G L+IFDVP F+ VFS + FISGKSHL+DA I+     T        +G+  
Sbjct: 777  YCLVCYDNGRLDIFDVPTFKCVFSVENFISGKSHLVDAYILEPTISTQVNKEKMSEGVKV 836

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
             AKKE P+N+K+VELAM +WS QYSRPFLF ILNDGTMLCYHAY+YEG E A K E  V 
Sbjct: 837  QAKKETPENMKIVELAMHQWSSQYSRPFLFAILNDGTMLCYHAYIYEGPEHAAKVEDVVS 896

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P +  DISN   SR RNLRF RVS+DI  REES +   RPRIT+FKNVGG QGLFL+GSR
Sbjct: 897  PHNAIDISNISTSRLRNLRFVRVSLDITAREESPDSVMRPRITMFKNVGGYQGLFLTGSR 956

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y
Sbjct: 957  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1016

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVVRPL+QVLSSL DQD  +Q D++S SSD
Sbjct: 1017 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVRPLNQVLSSLADQDIGHQTDRESVSSD 1076

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK YS+DEFEVRI EPEKSGGHWE +A +PMQ SENALTVR +TL N+TT ENE+LLA
Sbjct: 1077 DLQKFYSVDEFEVRILEPEKSGGHWETRAILPMQTSENALTVRVITLFNTTTRENESLLA 1136

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 1118
            IGTAYVQGEDVAARGR+LL++  +++EN QNL V EVYSKELKGA+SALA+LQGHLLIAS
Sbjct: 1137 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALAALQGHLLIAS 1196

Query: 1117 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 938
            G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL
Sbjct: 1197 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1256

Query: 937  AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 758
            AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA
Sbjct: 1257 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1316

Query: 757  HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 578
            HV+KFLRLQMLPTS+DR++A  GSDKTNRFA+LFGTLDGSIGC+APL+E           
Sbjct: 1317 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQR 1376

Query: 577  XLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 398
             LVD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIA QIGT
Sbjct: 1377 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAQQIGT 1436

Query: 397  TRSQILSNLNDLSLGTSFL 341
            TRSQILSNLNDLSLGTSFL
Sbjct: 1437 TRSQILSNLNDLSLGTSFL 1455


>ref|XP_009383988.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Musa acuminata subsp. malaccensis]
          Length = 1458

 Score = 2284 bits (5918), Expect = 0.0
 Identities = 1135/1458 (77%), Positives = 1248/1458 (85%), Gaps = 9/1458 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHWPTGIE+CASGFITH                     E   +RR+GP+PNL
Sbjct: 1    MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTAAN++EVY VR+QEDD R      +  GGGG MDGL GARLELVCHYRLHGNVESM 
Sbjct: 61   VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF
Sbjct: 121  ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGPVVKADPLGRC G L+Y  QMIVLKAAQ GQGLV D+EP+S GG  SV IESS+V+N
Sbjct: 181  ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+
Sbjct: 241  LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E SS+MPK
Sbjct: 301  IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPEGSSEMPK 360

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
              F VELDAA  TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+
Sbjct: 361  AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 420

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE  D EGDL +AKRLR   SD+LQ+ 
Sbjct: 421  IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 480

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
            ASGEELSLY TTP+SSE+AQK FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN
Sbjct: 481  ASGEELSLYTTTPDSSETAQKFFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 540

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT  ED
Sbjct: 541  YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 600

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR
Sbjct: 601  DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 660

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F VH                SIADPYVLLKMTDGSIQLL GDPSTCTV
Sbjct: 661  ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 720

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI
Sbjct: 721  SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 780

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2192
            YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D         ++G+     DE   
Sbjct: 781  YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 840

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
            + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E  PK E AV 
Sbjct: 841  SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 900

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P  + ++ N  ASR  NLRF RV++D    EE+SN+ +RPRIT+FKNVGG QGLFLSGSR
Sbjct: 901  PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 960

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY
Sbjct: 961  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1020

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 1475
            WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD   + D DS +SD
Sbjct: 1021 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1080

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK Y++DEFE+RI E  KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+
Sbjct: 1081 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1140

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG
Sbjct: 1141 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1200

Query: 1114 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 935
             KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA
Sbjct: 1201 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1260

Query: 934  KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 755
            KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H
Sbjct: 1261 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1320

Query: 754  VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 575
            V+KFLRLQML +S DR++  PGSDKTNRFA+LF TLDGSIGC+APL+E            
Sbjct: 1321 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1380

Query: 574  LVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 395
            LVD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT
Sbjct: 1381 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1440

Query: 394  RSQILSNLNDLSLGTSFL 341
            RSQILSNLNDLSLGTSFL
Sbjct: 1441 RSQILSNLNDLSLGTSFL 1458


>ref|XP_018676515.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Musa acuminata subsp. malaccensis]
          Length = 1457

 Score = 2277 bits (5901), Expect = 0.0
 Identities = 1134/1458 (77%), Positives = 1247/1458 (85%), Gaps = 9/1458 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHWPTGIE+CASGFITH                     E   +RR+GP+PNL
Sbjct: 1    MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTAAN++EVY VR+QEDD R      +  GGGG MDGL GARLELVCHYRLHGNVESM 
Sbjct: 61   VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF
Sbjct: 121  ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGPVVKADPLGRC G L+Y  QMIVLKAAQ GQGLV D+EP+S GG  SV IESS+V+N
Sbjct: 181  ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+
Sbjct: 241  LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E SS+MPK
Sbjct: 301  IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPEGSSEMPK 360

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
              F VELDAA  TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+
Sbjct: 361  AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 420

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE  D EGDL +AKRLR   SD+LQ+ 
Sbjct: 421  IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 480

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
            ASGEELSLY TTP+SSE+AQ  FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN
Sbjct: 481  ASGEELSLYTTTPDSSETAQ-FFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 539

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT  ED
Sbjct: 540  YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 599

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR
Sbjct: 600  DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 659

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F VH                SIADPYVLLKMTDGSIQLL GDPSTCTV
Sbjct: 660  ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 719

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI
Sbjct: 720  SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 779

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2192
            YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D         ++G+     DE   
Sbjct: 780  YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 839

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
            + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E  PK E AV 
Sbjct: 840  SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 899

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P  + ++ N  ASR  NLRF RV++D    EE+SN+ +RPRIT+FKNVGG QGLFLSGSR
Sbjct: 900  PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 959

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY
Sbjct: 960  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1019

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 1475
            WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD   + D DS +SD
Sbjct: 1020 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1079

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK Y++DEFE+RI E  KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+
Sbjct: 1080 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1139

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG
Sbjct: 1140 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1199

Query: 1114 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 935
             KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA
Sbjct: 1200 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1259

Query: 934  KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 755
            KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H
Sbjct: 1260 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1319

Query: 754  VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 575
            V+KFLRLQML +S DR++  PGSDKTNRFA+LF TLDGSIGC+APL+E            
Sbjct: 1320 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1379

Query: 574  LVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 395
            LVD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT
Sbjct: 1380 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1439

Query: 394  RSQILSNLNDLSLGTSFL 341
            RSQILSNLNDLSLGTSFL
Sbjct: 1440 RSQILSNLNDLSLGTSFL 1457


>ref|XP_018676516.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X3 [Musa acuminata subsp. malaccensis]
          Length = 1455

 Score = 2275 bits (5896), Expect = 0.0
 Identities = 1133/1458 (77%), Positives = 1246/1458 (85%), Gaps = 9/1458 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHWPTGIE+CASGFITH                     E   +RR+GP+PNL
Sbjct: 1    MSYAAFKMMHWPTGIEHCASGFITHSPSDSSPPQIPPAQADDLDSSEWPTQRRVGPVPNL 60

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTAAN++EVY VR+QEDD R      +  GGGG MDGL GARLELVCHYRLHGNVESM 
Sbjct: 61   VVTAANILEVYLVRIQEDDDRIPRPTGDQHGGGGTMDGLAGARLELVCHYRLHGNVESMA 120

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ +KRRDSI+LAFQDAKITVLE+DDS+H LRTSSMHCFEGPDWLYLKRGR SF
Sbjct: 121  ILSLGVDDHNKRRDSIMLAFQDAKITVLEYDDSSHELRTSSMHCFEGPDWLYLKRGRESF 180

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGPVVKADPLGRC G L+Y  QMIVLKAAQ GQGLV D+EP+S GG  SV IESS+V+N
Sbjct: 181  ARGPVVKADPLGRCSGALIYGLQMIVLKAAQVGQGLVGDDEPTSTGGAVSVRIESSYVIN 240

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHP+
Sbjct: 241  LRDLDMKHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPM 300

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSA N+PHDA KLLAVPSPIGG+LVICAN+IHYHSQSA+CSL LN FA Q E   +MPK
Sbjct: 301  IWSASNIPHDANKLLAVPSPIGGVLVICANTIHYHSQSATCSLALNSFATQPE---EMPK 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
              F VELDAA  TWLS DVA+FSSKTG+LLLLTL+YDGRVVQ+L+L KSKASVLTSGIT+
Sbjct: 358  AKFTVELDAANVTWLSPDVAMFSSKTGDLLLLTLIYDGRVVQRLELMKSKASVLTSGITT 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQYS G S P+SA+ KDE  D EGDL +AKRLR   SD+LQ+ 
Sbjct: 418  IGSSFFFLGSRLGDSLLVQYSTGTSGPTSANGKDEVADTEGDLHLAKRLRRTPSDALQEF 477

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
            ASGEELSLY TTP+SSE+AQK FSF VRDSLINVGPLKDFSYGLRINADPNATGIAKQSN
Sbjct: 478  ASGEELSLYTTTPDSSETAQKFFSFIVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 537

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LCVLQQSIRPELITEVELPGCKGIW+VYHK SR HA DSSKT  ED
Sbjct: 538  YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWSVYHKGSRGHAADSSKTMMED 597

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLETRTMVLETADDLGEVTE+VDYYVQGSTIAAGNLFGRRRVVQIFARGAR
Sbjct: 598  DEYHAYLIISLETRTMVLETADDLGEVTETVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F VH                SIADPYVLLKMTDGSIQLL GDPSTCTV
Sbjct: 658  ILDGSYMTQELTFGVHNSELISNSEAPVVASVSIADPYVLLKMTDGSIQLLVGDPSTCTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S+NVPA FAS T+ ++AC LYHDK PEPWLRK S+DAWLSTG+AE +D +DG Y DQGDI
Sbjct: 718  SVNVPATFASSTELISACALYHDKGPEPWLRKTSTDAWLSTGIAEAVDGHDGLYNDQGDI 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKTKGL-----DE--- 2192
            YCLVCYESGTLEIFDV NF+ VFS D F+SGK+HL D         ++G+     DE   
Sbjct: 778  YCLVCYESGTLEIFDVSNFKCVFSVDNFVSGKTHLFDKYAREPSRNSQGIKSKVTDEANG 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
            + KKE PQ++K+VELAMQRWSGQYSRPFLFGIL+DGTMLCYHAYLYEG E  PK E AV 
Sbjct: 838  SVKKELPQDMKIVELAMQRWSGQYSRPFLFGILSDGTMLCYHAYLYEGLENTPKVEDAVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P  + ++ N  ASR  NLRF RV++D    EE+SN+ +RPRIT+FKNVGG QGLFLSGSR
Sbjct: 898  PHRSAEMGNVSASRLHNLRFIRVAVDTATSEEASNIVSRPRITVFKNVGGYQGLFLSGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLP+VCNYDNY
Sbjct: 958  PAWFMVCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPAVCNYDNY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQM-DQDSASSD 1475
            WPVQK+PLR TPHQVTY+ EKNLYP+IVSVPVV+PLSQVLSSLVDQD   + D DS +SD
Sbjct: 1018 WPVQKVPLRGTPHQVTYYSEKNLYPLIVSVPVVKPLSQVLSSLVDQDTVHLSDNDSVNSD 1077

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK Y++DEFE+RI E  KSGGHWE +ATIPMQ SENALTVR VTL N+TT ENETLL+
Sbjct: 1078 DLQKIYTVDEFEIRILESGKSGGHWETRATIPMQTSENALTVRVVTLFNTTTRENETLLS 1137

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            +GTAYVQGEDVAARGR+LL++ G++ EN QNLVSEVYSKELKGA+SALASLQGHLL+ASG
Sbjct: 1138 VGTAYVQGEDVAARGRMLLYSFGKNTENAQNLVSEVYSKELKGAVSALASLQGHLLVASG 1197

Query: 1114 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 935
             KITL+KWTG+EL G+AFYDAPLHVVSLNIVKNF+LLGDIHKSIYFL WKEQGAQLSLLA
Sbjct: 1198 PKITLHKWTGTELNGIAFYDAPLHVVSLNIVKNFVLLGDIHKSIYFLNWKEQGAQLSLLA 1257

Query: 934  KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 755
            KDF SLDC+ATEFLIDG TLSLVVSDD KNIQIFYYAP+ +ESWKGQKLL RAEFHAG H
Sbjct: 1258 KDFGSLDCYATEFLIDGSTLSLVVSDDDKNIQIFYYAPKTLESWKGQKLLPRAEFHAGTH 1317

Query: 754  VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 575
            V+KFLRLQML +S DR++  PGSDKTNRFA+LF TLDGSIGC+APL+E            
Sbjct: 1318 VTKFLRLQMLSSSFDRANTAPGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1377

Query: 574  LVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 395
            LVD VPH CGLNPR+FRQFRSNGKAHRPGPDNM+DCELLS YEMLPL++QLEIA QIGTT
Sbjct: 1378 LVDAVPHTCGLNPRSFRQFRSNGKAHRPGPDNMVDCELLSQYEMLPLDKQLEIAFQIGTT 1437

Query: 394  RSQILSNLNDLSLGTSFL 341
            RSQILSNLNDLSLGTSFL
Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455


>ref|XP_010918168.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Elaeis guineensis]
          Length = 1455

 Score = 2275 bits (5895), Expect = 0.0
 Identities = 1136/1458 (77%), Positives = 1252/1458 (85%), Gaps = 9/1458 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHW TGIENCA+GFITH                  L+ E +P RRIGPIPNL
Sbjct: 1    MSYAAFKMMHWATGIENCAAGFITHSLVDFAAQMPPIQADD--LESEWSPMRRIGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+E+Y VR+QEDD R        + G G+ DGL GARLELVCHYRLHGNVESM 
Sbjct: 59   VVTAGNVLEIYLVRIQEDDGRPAARPGGEQRGVGITDGLSGARLELVCHYRLHGNVESMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLR SSMHCFEGPDW +LKRGR  F
Sbjct: 119  ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRASSMHCFEGPDWHFLKRGRERF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VKADPLGRC G L+Y  QMI+LK+AQAGQGLV D+EP+S G T    IESS+V+N
Sbjct: 179  ARGPLVKADPLGRCCGALIYGLQMIILKSAQAGQGLVGDDEPASSGSTFPARIESSYVIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTC ISALSISTTLKQHP+
Sbjct: 239  LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCTISALSISTTLKQHPM 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK
Sbjct: 299  IWSAVNLPHDAYKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 359  SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITA 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQY  G S+P++A++KDE  DIE D   AKRLR MSSD+LQ+V
Sbjct: 419  IGSSFFFLGSRLGDSLLVQYGCGTSTPTTANMKDEVADIEVDGLSAKRLRRMSSDALQEV 478

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSLY+T PNSSESAQK FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN
Sbjct: 479  VSGEELSLYSTAPNSSESAQKSFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LCVLQQS+RPELITEVELPGCKGIWTVYHKSSR H  DSSKT  ED
Sbjct: 539  YELVCCSGHGKNGALCVLQQSVRPELITEVELPGCKGIWTVYHKSSRGHTADSSKTMPED 598

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR
Sbjct: 599  DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F  H                SIADPYVLLKMTDGSIQLL GDPS CTV
Sbjct: 658  ILDGSYMTQELNFGGHNSDSTTNSEVSAVASVSIADPYVLLKMTDGSIQLLVGDPSACTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S+NVP  FA+ T  ++ACTLYHDK PEPWLRK S+DAWLSTG+AEPID +DGSY DQGD+
Sbjct: 718  SVNVPPIFANSTKLISACTLYHDKGPEPWLRKTSTDAWLSTGIAEPIDGSDGSYHDQGDV 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2192
            YCLVCYE+G L+IFDVP F+ VFS + FISGK+HL+DA I      T        +G   
Sbjct: 778  YCLVCYENGRLDIFDVPTFKCVFSVENFISGKTHLVDAYIPEPTTSTQVNKEKMFEGAKV 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
             AKKE P+N+K+VELAMQRW GQYSRPFLF ILNDGTMLCYHAY+YEG E APK E  V 
Sbjct: 838  QAKKETPENMKIVELAMQRWFGQYSRPFLFAILNDGTMLCYHAYIYEGPENAPKVEDVVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P +  DISN  +SR RNLRF RV++DI  REES +   +PRI +FKNVGG QGLFL+GSR
Sbjct: 898  PHNGMDISNMSSSRLRNLRFIRVALDITAREESPDSVTQPRIMVFKNVGGYQGLFLTGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLR+HPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y
Sbjct: 958  PAWFMVCRERLRIHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVV+PL+QVLSSL DQD  +Q D++S +SD
Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVKPLNQVLSSLADQDIGHQTDRESINSD 1077

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK YS+DEFEVRI EPEKSGG WE +ATIPMQ SENALTVR +TL N+TT ENE+LLA
Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGCWETRATIPMQTSENALTVRVITLFNTTTRENESLLA 1137

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            IGTAYVQGEDVAARGR+LL++  +++EN QNLV EVYSKELKGA+SALASLQGHLLIASG
Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLVLEVYSKELKGAVSALASLQGHLLIASG 1197

Query: 1114 TKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLA 935
             KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLA
Sbjct: 1198 PKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLA 1257

Query: 934  KDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAH 755
            KDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGAH
Sbjct: 1258 KDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGAH 1317

Query: 754  VSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXX 575
            V+KFLRLQMLPTS+DR++A  GSDKTNRFA+LF TLDGSIGC+APL+E            
Sbjct: 1318 VTKFLRLQMLPTSADRTTAATGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQRK 1377

Query: 574  LVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTT 395
            LVD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIAHQIGTT
Sbjct: 1378 LVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAHQIGTT 1437

Query: 394  RSQILSNLNDLSLGTSFL 341
            RSQILSNLNDLSLGTSFL
Sbjct: 1438 RSQILSNLNDLSLGTSFL 1455


>ref|XP_010918167.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Elaeis guineensis]
          Length = 1456

 Score = 2270 bits (5883), Expect = 0.0
 Identities = 1136/1459 (77%), Positives = 1252/1459 (85%), Gaps = 10/1459 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHW TGIENCA+GFITH                  L+ E +P RRIGPIPNL
Sbjct: 1    MSYAAFKMMHWATGIENCAAGFITHSLVDFAAQMPPIQADD--LESEWSPMRRIGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+E+Y VR+QEDD R        + G G+ DGL GARLELVCHYRLHGNVESM 
Sbjct: 59   VVTAGNVLEIYLVRIQEDDGRPAARPGGEQRGVGITDGLSGARLELVCHYRLHGNVESMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LSLG D+ SKRRDSIVLAFQDAK+TVLE+DDSTHGLR SSMHCFEGPDW +LKRGR  F
Sbjct: 119  ILSLGVDDRSKRRDSIVLAFQDAKLTVLEYDDSTHGLRASSMHCFEGPDWHFLKRGRERF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VKADPLGRC G L+Y  QMI+LK+AQAGQGLV D+EP+S G T    IESS+V+N
Sbjct: 179  ARGPLVKADPLGRCCGALIYGLQMIILKSAQAGQGLVGDDEPASSGSTFPARIESSYVIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            L DLDM+HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHTC ISALSISTTLKQHP+
Sbjct: 239  LHDLDMKHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCTISALSISTTLKQHPM 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLLAVPSPIGG+LVICAN+IHYHSQSASCSLGLN FA Q ENSS+MPK
Sbjct: 299  IWSAVNLPHDAYKLLAVPSPIGGVLVICANTIHYHSQSASCSLGLNNFANQPENSSEMPK 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +N NVELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 359  SNINVELDAANATWLSHDVAMFSSKTGELLLLTLVYDGRVVQRLDLMKSKASVLTSGITA 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IG+SFFFLGSRLGDSLLVQY  G S+P++A++KDE  DIE D   AKRLR MSSD+LQ+V
Sbjct: 419  IGSSFFFLGSRLGDSLLVQYGCGTSTPTTANMKDEVADIEVDGLSAKRLRRMSSDALQEV 478

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSLY+T PNSSESAQK FSFAVRDSL+N+GPLKDFSY LRINADPNATGIAKQSN
Sbjct: 479  VSGEELSLYSTAPNSSESAQKSFSFAVRDSLVNIGPLKDFSYSLRINADPNATGIAKQSN 538

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LCVLQQS+RPELITEVELPGCKGIWTVYHKSSR H  DSSKT  ED
Sbjct: 539  YELVCCSGHGKNGALCVLQQSVRPELITEVELPGCKGIWTVYHKSSRGHTADSSKTMPED 598

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE RTMVLETADDLGEVTE+VDYY +GSTI+AGNLFGRRRVVQIFA GAR
Sbjct: 599  DEYHAYLIISLEARTMVLETADDLGEVTETVDYY-KGSTISAGNLFGRRRVVQIFAHGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL+F  H                SIADPYVLLKMTDGSIQLL GDPS CTV
Sbjct: 658  ILDGSYMTQELNFGGHNSDSTTNSEVSAVASVSIADPYVLLKMTDGSIQLLVGDPSACTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            S+NVP  FA+ T  ++ACTLYHDK PEPWLRK S+DAWLSTG+AEPID +DGSY DQGD+
Sbjct: 718  SVNVPPIFANSTKLISACTLYHDKGPEPWLRKTSTDAWLSTGIAEPIDGSDGSYHDQGDV 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACIIPHKAKT--------KGLDE 2192
            YCLVCYE+G L+IFDVP F+ VFS + FISGK+HL+DA I      T        +G   
Sbjct: 778  YCLVCYENGRLDIFDVPTFKCVFSVENFISGKTHLVDAYIPEPTTSTQVNKEKMFEGAKV 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
             AKKE P+N+K+VELAMQRW GQYSRPFLF ILNDGTMLCYHAY+YEG E APK E  V 
Sbjct: 838  QAKKETPENMKIVELAMQRWFGQYSRPFLFAILNDGTMLCYHAYIYEGPENAPKVEDVVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P +  DISN  +SR RNLRF RV++DI  REES +   +PRI +FKNVGG QGLFL+GSR
Sbjct: 898  PHNGMDISNMSSSRLRNLRFIRVALDITAREESPDSVTQPRIMVFKNVGGYQGLFLTGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLR+HPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG+LKICQLPSV NYD+Y
Sbjct: 958  PAWFMVCRERLRIHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGYLKICQLPSVFNYDSY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQK+PLR TPHQVTYF EKNLYP+I+SVPVV+PL+QVLSSL DQD  +Q D++S +SD
Sbjct: 1018 WPVQKVPLRGTPHQVTYFAEKNLYPLIISVPVVKPLNQVLSSLADQDIGHQTDRESINSD 1077

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             LQK YS+DEFEVRI EPEKSGG WE +ATIPMQ SENALTVR +TL N+TT ENE+LLA
Sbjct: 1078 DLQKFYSVDEFEVRILEPEKSGGCWETRATIPMQTSENALTVRVITLFNTTTRENESLLA 1137

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNL-VSEVYSKELKGAISALASLQGHLLIAS 1118
            IGTAYVQGEDVAARGR+LL++  +++EN QNL V EVYSKELKGA+SALASLQGHLLIAS
Sbjct: 1138 IGTAYVQGEDVAARGRVLLYSFSKNSENSQNLQVLEVYSKELKGAVSALASLQGHLLIAS 1197

Query: 1117 GTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 938
            G KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLL
Sbjct: 1198 GPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLL 1257

Query: 937  AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 758
            AKDF +LDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLL RAEFHAGA
Sbjct: 1258 AKDFGTLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKLLCRAEFHAGA 1317

Query: 757  HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 578
            HV+KFLRLQMLPTS+DR++A  GSDKTNRFA+LF TLDGSIGC+APL+E           
Sbjct: 1318 HVTKFLRLQMLPTSADRTTAATGSDKTNRFALLFSTLDGSIGCIAPLDELTFRRLQTLQR 1377

Query: 577  XLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 398
             LVD VPHVCGLNPR+FRQFR+NGKAHRPGPDNM+DCELLSHYEMLPL+EQLEIAHQIGT
Sbjct: 1378 KLVDAVPHVCGLNPRSFRQFRANGKAHRPGPDNMVDCELLSHYEMLPLDEQLEIAHQIGT 1437

Query: 397  TRSQILSNLNDLSLGTSFL 341
            TRSQILSNLNDLSLGTSFL
Sbjct: 1438 TRSQILSNLNDLSLGTSFL 1456


>ref|XP_020084456.1| probable cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Ananas comosus]
          Length = 1465

 Score = 2239 bits (5803), Expect = 0.0
 Identities = 1129/1470 (76%), Positives = 1247/1470 (84%), Gaps = 21/1470 (1%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAP---------- 4538
            MSYAAFKMMHWPTGIENCA+GF TH                   +    P          
Sbjct: 1    MSYAAFKMMHWPTGIENCAAGFFTHSPSSSSSSSAAAAEATAAAEIPPLPGDELEAAEWQ 60

Query: 4537 ----KRRIGPIPNLVVTAANVIEVYTVRVQEDDTRAVPAAAEPR---GGGGVMDGLCGAR 4379
                +R +G +P+LVVTA NV+E+Y VR QED+ RA  A+ E +   GGGGV+DG+ GAR
Sbjct: 61   QGRRRRGVGAVPDLVVTAGNVLEIYVVRAQEDEGRAPRASGEQKRGGGGGGVVDGISGAR 120

Query: 4378 LELVCHYRLHGNVESMTVLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMH 4199
            LELVCHYRLHGNVESM VLS+G D  S RRDSIVLAFQDAKITVLE+DDS HGLRTSSMH
Sbjct: 121  LELVCHYRLHGNVESMAVLSVGADNRSNRRDSIVLAFQDAKITVLEYDDSLHGLRTSSMH 180

Query: 4198 CFEGPDWLYLKRGRGSFARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPS 4019
            CFEGPDW YLKRGR SFARGP+VKADP GRCGG LVY  QMI+LKAAQAGQ LV D+EP+
Sbjct: 181  CFEGPDWQYLKRGRESFARGPIVKADPSGRCGGALVYGLQMIILKAAQAGQSLVGDDEPN 240

Query: 4018 SLGGTTSVCIESSFVLNLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHT 3839
            S GGT SV IESS+V+NLR+LDM HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHT
Sbjct: 241  SAGGTISVRIESSYVINLRELDMNHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHT 300

Query: 3838 CMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSL 3659
            CMISALS+STTLKQHP+IWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQSASCSL
Sbjct: 301  CMISALSVSTTLKQHPMIWSASNLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSASCSL 360

Query: 3658 GLNKFAVQMENSSDMPKTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQK 3479
             LN F+ Q + S +MPK+NF VELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGR+VQ+
Sbjct: 361  SLNSFSSQPDGSLEMPKSNFAVELDAAHATWLSHDVAMFSSKTGELLLLTLVYDGRIVQR 420

Query: 3478 LDLTKSKASVLTSGITSIGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDL 3299
            LDL KSKASVLTSG+T+IG+SFFFLGSRLGDSLLVQYS G S P+S+ VKDE+ DI+GD+
Sbjct: 421  LDLVKSKASVLTSGLTTIGSSFFFLGSRLGDSLLVQYSCGTSVPTSSQVKDEATDIDGDV 480

Query: 3298 PMAKRLRGMSSDSLQDVASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYG 3119
            P AKRLR MSSD+LQDV S EELSLY   PNSSESAQK FSFAVRDSLINVGPLKDFSYG
Sbjct: 481  PSAKRLRRMSSDALQDVTSVEELSLYNNAPNSSESAQKSFSFAVRDSLINVGPLKDFSYG 540

Query: 3118 LRINADPNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYH 2939
            LRINADPN+TG+AKQSNYELVCCSGHGKNG+LCVLQQSIRPELITEV L GCKGIWTVYH
Sbjct: 541  LRINADPNSTGLAKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVVLAGCKGIWTVYH 600

Query: 2938 KSSRTHAVDSSKTTTEDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAG 2759
            KSSR HA DSSKT TE+DEYHAYLIISLE+RTMVLETADDLGEVTE+VDYYV GSTIAAG
Sbjct: 601  KSSRGHATDSSKTMTENDEYHAYLIISLESRTMVLETADDLGEVTETVDYYVHGSTIAAG 660

Query: 2758 NLFGRRRVVQIFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKM 2579
            NLFGRRRV+QI+A+GARILDGSYMTQEL+FV H                SIADPYVLLKM
Sbjct: 661  NLFGRRRVIQIYAKGARILDGSYMTQELNFVAH-NSEQTSSELPTVASVSIADPYVLLKM 719

Query: 2578 TDGSIQLLAGDPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGV 2399
            TDGSIQLL GDP+ CTVS+N PA F+S T+ ++ACTLYHDK PEPWLRK S+DAWLSTGV
Sbjct: 720  TDGSIQLLLGDPAACTVSLNAPAIFSSSTEPISACTLYHDKGPEPWLRKTSTDAWLSTGV 779

Query: 2398 AEPIDENDGSYPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-IP 2222
            AEPID NDGSY D GDIYCLVCYE+G LEIFDVP+F+SV+S D F+SGK++L D     P
Sbjct: 780  AEPIDGNDGSYHDHGDIYCLVCYENGKLEIFDVPSFKSVYSVDNFVSGKTYLADTYTKDP 839

Query: 2221 HK-AKTKGLDETAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGG 2045
            +K   TKG      KEP QN++VVELAMQRWSG+YSRPFLFG+L+DGT+LCYHAY YEG 
Sbjct: 840  NKYPDTKGY---LNKEPVQNMRVVELAMQRWSGRYSRPFLFGMLSDGTILCYHAYFYEGT 896

Query: 2044 EIAPKNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVG 1865
            E A K    V P  + D S+   SR RNLRF RVSIDI  REE  N  +RPRIT+F NVG
Sbjct: 897  ENAVKGGDPVSPRGSADTSSMSISRLRNLRFLRVSIDITTREEMLNAVSRPRITVFNNVG 956

Query: 1864 GCQGLFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKIC 1685
            G QGLFLSGSRPAWLM+CRER+RVHPQLCDGSI AF VLHNVNCNHGLIYVTSQG+LKIC
Sbjct: 957  GYQGLFLSGSRPAWLMVCRERIRVHPQLCDGSIAAFAVLHNVNCNHGLIYVTSQGYLKIC 1016

Query: 1684 QLPSVCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-N 1508
            QLPS  NYDN+WPVQKIPL  TPHQVTY+ EKNLYP+I+SVPV+RPL+QVLSSL+DQ+ +
Sbjct: 1017 QLPSSFNYDNHWPVQKIPLLGTPHQVTYYAEKNLYPLILSVPVIRPLNQVLSSLLDQEMS 1076

Query: 1507 NQMDQDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVN 1328
             Q+D D+ +SD LQK+YS+DEFEVRI EP+KS GHW+ KAT+PMQ SENALTVR VTL N
Sbjct: 1077 QQIDNDNFNSDDLQKTYSVDEFEVRILEPDKS-GHWDTKATVPMQTSENALTVRIVTLFN 1135

Query: 1327 STTNENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALA 1148
            +TT ENE+L+AIGTAYVQGEDVAARGR+LLF+  + NE+ QNL+SEVYSKELKGA+S LA
Sbjct: 1136 TTTKENESLMAIGTAYVQGEDVAARGRVLLFSFAKTNESSQNLISEVYSKELKGAVSNLA 1195

Query: 1147 SLQGHLLIASGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIW 968
            SLQGHLLIASG KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFIL GD+HKSIYFL W
Sbjct: 1196 SLQGHLLIASGPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFILFGDVHKSIYFLNW 1255

Query: 967  KEQGAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKL 788
            KEQGAQLSLLAKDF  L+C+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKL
Sbjct: 1256 KEQGAQLSLLAKDFGPLNCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMLESWKGQKL 1315

Query: 787  LSRAEFHAGAHVSKFLRLQMLPTSSDRSSAT-PGSDKTNRFAILFGTLDGSIGCVAPLEE 611
            LSRAEFHAGA VSKFLRLQMLPTSSDRS AT PGS+KTNRFA+LF TLDGSIGC+APLEE
Sbjct: 1316 LSRAEFHAGALVSKFLRLQMLPTSSDRSGATVPGSEKTNRFALLFSTLDGSIGCIAPLEE 1375

Query: 610  XXXXXXXXXXXXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLE 431
                        LVD VPHVCGLNPR+FRQF SNGKAHRPGPD+M+D ELLSHYEMLPLE
Sbjct: 1376 LTFRRLQTLQRKLVDAVPHVCGLNPRSFRQFHSNGKAHRPGPDHMVDFELLSHYEMLPLE 1435

Query: 430  EQLEIAHQIGTTRSQILSNLNDLSLGTSFL 341
            EQLEIAHQIGTTRSQILSN NDLSLGTSFL
Sbjct: 1436 EQLEIAHQIGTTRSQILSNFNDLSLGTSFL 1465


>ref|XP_020672482.1| cleavage and polyadenylation specificity factor subunit 1 [Dendrobium
            catenatum]
          Length = 1453

 Score = 2199 bits (5698), Expect = 0.0
 Identities = 1100/1457 (75%), Positives = 1233/1457 (84%), Gaps = 8/1457 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFKMMHW TGIENCASGFITH                  L+ E   K+RIGPIPN+
Sbjct: 1    MSYAAFKMMHWATGIENCASGFITHSPAELAAQISPIQAVD--LESEWPAKKRIGPIPNV 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTAANV+E+YTVRV E+D R+   ++EPR GG VMDGL GA LELVCHYRLHGN+ESM 
Sbjct: 59   VVTAANVLELYTVRVHEEDGRSSQLSSEPRNGG-VMDGLSGAHLELVCHYRLHGNIESMV 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            +LS GGDE SKRRDSI+L+FQDAKITVLE+DD+ H LRTSSMHCFEGPDW YLKRGR SF
Sbjct: 118  ILSSGGDERSKRRDSILLSFQDAKITVLEYDDAAHELRTSSMHCFEGPDWHYLKRGRESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            +RGP +K DP+GRC G LVYD QM+++KAAQAGQGL  D+EPS++G + SV IESS+++N
Sbjct: 178  SRGPTMKIDPVGRCVGALVYDLQMVIMKAAQAGQGLAVDDEPSNMGASASVRIESSYIIN 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM HVKDFTF+HGYIEPVMVIL EKEPTWAGRI+WKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDMMHVKDFTFIHGYIEPVMVILQEKEPTWAGRITWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSA NLPHDAYKLL VPSPIGG+LVICANSIHYHSQS SCSLGLN F  Q+ENS DMPK
Sbjct: 298  IWSAANLPHDAYKLLPVPSPIGGVLVICANSIHYHSQSVSCSLGLNSFCAQLENSLDMPK 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +NF VELDAA ATWLSHDVA+FSSKTGELLLLTLV DGRVVQ+LDL KSKASVLTSGIT+
Sbjct: 358  SNFTVELDAANATWLSHDVAMFSSKTGELLLLTLVNDGRVVQRLDLMKSKASVLTSGITT 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            +G SFFFLGSRLGDSLLVQYS G S  +S H K+E+ D++ D P+AK+LR MSSDSLQD 
Sbjct: 418  VGCSFFFLGSRLGDSLLVQYSSGTSLSASVHAKEEAADVDSDAPLAKKLRRMSSDSLQDY 477

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             SGEELSL++TT NSSES QK F+FAVRDSLINVGPLKDF+YGLRINAD NATGI+KQSN
Sbjct: 478  TSGEELSLFSTTSNSSESTQKSFTFAVRDSLINVGPLKDFAYGLRINADLNATGISKQSN 537

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVC SGHGKNG++CV+Q S+RP+LITEVELPGCKGIWTVYHKSSR HAVD+ K + ED
Sbjct: 538  YELVCASGHGKNGAICVMQHSVRPDLITEVELPGCKGIWTVYHKSSRGHAVDALKPSKED 597

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DE+HAYLIISLE+RTMVLETADDLGEVTE VDYYVQG+TIAAGNLFGRRRVVQIFARGAR
Sbjct: 598  DEFHAYLIISLESRTMVLETADDLGEVTERVDYYVQGTTIAAGNLFGRRRVVQIFARGAR 657

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDGSYMTQEL F VH                SIADPYVLLKM DGSIQLL GD STCTV
Sbjct: 658  ILDGSYMTQELPFGVHNSDHNSSPDSCTVSSASIADPYVLLKMVDGSIQLLIGDLSTCTV 717

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            SI VP+  A+ TDSV+ACTLY DK P+ WLR+AS+DAWLS+G+AE ID NDGSY +QGDI
Sbjct: 718  SIYVPSILANSTDSVSACTLYIDKGPDSWLRRASTDAWLSSGIAETIDGNDGSYHEQGDI 777

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI----IPH---KAKT-KGLDE 2192
            YCL+CYESGTL+I+DVPNF+ VFS DKF+SGKSHL+D  +     P+   K+KT K   +
Sbjct: 778  YCLLCYESGTLDIYDVPNFKCVFSCDKFVSGKSHLVDRSVREPFKPYQKIKSKTNKESTD 837

Query: 2191 TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
              +KE  + +KVVEL MQRWSG Y RPFLF +LNDGTMLCY AYLYEG +   KNE  V 
Sbjct: 838  LGRKETTEKMKVVELVMQRWSGPYCRPFLFAVLNDGTMLCYQAYLYEGLDSYSKNEEVVS 897

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
            P ++ D S+  ASRFRNLRF RV  D   REE SN+  +P+IT+FKN+ G QGLFLSGSR
Sbjct: 898  PNNSVDPSSISASRFRNLRFIRVPTDFTAREEPSNMVIQPKITVFKNIAGYQGLFLSGSR 957

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            PAW M+CRERLRVHPQLCDGSI AFTVLHNVNCNHG IYVT QGFLKICQLPS  NYDNY
Sbjct: 958  PAWFMVCRERLRVHPQLCDGSIAAFTVLHNVNCNHGFIYVTFQGFLKICQLPSAFNYDNY 1017

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDNNQMDQDSASSDS 1472
            W V+KIPLR TPHQ+TY  EKNLY +I+S PV RP++QVLSS++DQD  QM++++ SSD 
Sbjct: 1018 WAVRKIPLRGTPHQLTYCAEKNLYSIILSFPVARPINQVLSSMLDQDIGQMERENTSSDV 1077

Query: 1471 LQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLAI 1292
            LQK+Y++D+FEVR+ EPEKSGG+W+ +ATIPMQ  ENALTVR VTL NS T ENE ++AI
Sbjct: 1078 LQKTYTLDDFEVRLLEPEKSGGNWDTRATIPMQPCENALTVRMVTLFNSATKENEAIMAI 1137

Query: 1291 GTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASGT 1112
            GTAYVQGEDVA+RGRILL +IG++ EN QNLV+EVYSKELKGAISA+ASLQGHLLIASG 
Sbjct: 1138 GTAYVQGEDVASRGRILLLSIGKNGENSQNLVTEVYSKELKGAISAVASLQGHLLIASGP 1197

Query: 1111 KITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLLAK 932
            KITL+KWTGSEL GVAFYD PL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSLLAK
Sbjct: 1198 KITLHKWTGSELTGVAFYDTPLYVVSLNIVKNFILLGDIHKSIYFLNWKEQGAQLSLLAK 1257

Query: 931  DFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGAHV 752
            DF SLDC+ATEFLIDG TLSLVVSDDQKNIQIFYYAP+M+ESWKGQKLLSRAEFHAGAHV
Sbjct: 1258 DFGSLDCYATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMVESWKGQKLLSRAEFHAGAHV 1317

Query: 751  SKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXXXL 572
            +KFLRLQMLPT SDR SAT GSDKTNRFA+LF TLDGSIGCVAPL+E            L
Sbjct: 1318 TKFLRLQMLPT-SDRMSATLGSDKTNRFALLFSTLDGSIGCVAPLDELTFRRLQTLQRKL 1376

Query: 571  VDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGTTR 392
            VD VPHVCGLNPR+FRQFRSNGKAH PGPDN++DCELLS YEMLPLE QLEIA+QIGTTR
Sbjct: 1377 VDSVPHVCGLNPRSFRQFRSNGKAHHPGPDNIVDCELLSQYEMLPLENQLEIANQIGTTR 1436

Query: 391  SQILSNLNDLSLGTSFL 341
            +QILSNL D SL TSFL
Sbjct: 1437 AQILSNLTDYSLSTSFL 1453


>gb|OAY63492.1| putative cleavage and polyadenylation specificity factor subunit 1
            [Ananas comosus]
          Length = 1469

 Score = 2199 bits (5698), Expect = 0.0
 Identities = 1121/1484 (75%), Positives = 1239/1484 (83%), Gaps = 35/1484 (2%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAP---------- 4538
            MSYAAFKMMHWPTGIENCA+GF TH                   +    P          
Sbjct: 1    MSYAAFKMMHWPTGIENCAAGFFTHSPSSSSSSSAAAAEATAAAEIPPLPGDELEAAEWQ 60

Query: 4537 ----KRRIGPIPNLVVTAANVIEVYTVRVQEDDTRAVPAAAEPR---GGGGVMDGLCGAR 4379
                +R +G +P+LVVTA NV+E+Y VR QED+ RA  A+ E +   GGGGV+DG+ GAR
Sbjct: 61   QGRRRRGVGAVPDLVVTAGNVLEIYVVRAQEDEGRAPRASGEQKRGGGGGGVVDGISGAR 120

Query: 4378 LELVCHYRLHGNVESMTVLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMH 4199
            LELVCHYRLHGNVESM VLS+G D  S RRDSIVLAFQDAKITVLE+DDS HGLRTSSMH
Sbjct: 121  LELVCHYRLHGNVESMAVLSVGADNRSNRRDSIVLAFQDAKITVLEYDDSLHGLRTSSMH 180

Query: 4198 CFEGPDWLYLKRGRGSFARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPS 4019
            CFEGPDW YLKRGR SFARGP+VKADP GRCGG LVY  QMI+LKAAQAGQ LV D+EP+
Sbjct: 181  CFEGPDWQYLKRGRESFARGPIVKADPSGRCGGALVYGLQMIILKAAQAGQSLVGDDEPN 240

Query: 4018 SLGGTTSVCIESSFVLNLRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHT 3839
            S GGT SV IESS+V+NLR+LDM HVKDFTFVHGYIEPVMVILHE+EPTWAGRISWKHHT
Sbjct: 241  SAGGTISVRIESSYVINLRELDMNHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHT 300

Query: 3838 CMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSL 3659
            CMISALS+STTLKQHP+IWSA NLPHDAYKLLAVPSPIGG+LVICANSIHYHSQSASCSL
Sbjct: 301  CMISALSVSTTLKQHPMIWSASNLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSASCSL 360

Query: 3658 GLNKFAVQMENSSDMPKTNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGR---- 3491
             LN F++Q + S +MPK+NF VELDAA ATWLSHDVA+FSSKTGELLLLTLVYDGR    
Sbjct: 361  SLNSFSLQPDGSLEMPKSNFAVELDAAHATWLSHDVAMFSSKTGELLLLTLVYDGRYNPK 420

Query: 3490 --------VVQKLDLTKSKASVLTSGITSIGNSFFFLGSRLGDSLLVQYSYGASSPSSAH 3335
                    +VQ+LDL KSKASVLTSG+T+IG+SFFFLGSRLGDSLLVQYS G S P+S+ 
Sbjct: 421  TLLGKLKTIVQRLDLVKSKASVLTSGLTTIGSSFFFLGSRLGDSLLVQYSCGTSVPTSSQ 480

Query: 3334 VKD--ESVDIEGDLPMAKRLRGMSSDSLQDVASGEELSLYATTPNSSESAQKVFSFAVRD 3161
            VKD  E+ DI+GD+P AKRLR MSSD+LQDV S EELSLY   PNSSESAQK FSFAVRD
Sbjct: 481  VKDEVEATDIDGDVPSAKRLRRMSSDALQDVTSVEELSLYNNAPNSSESAQKSFSFAVRD 540

Query: 3160 SLINVGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITE 2981
            SLINVGPLKDFSYGLRINADPN+TG+AKQSNYELVCCSGHGKNG+LCVLQQSIRPELITE
Sbjct: 541  SLINVGPLKDFSYGLRINADPNSTGLAKQSNYELVCCSGHGKNGALCVLQQSIRPELITE 600

Query: 2980 VELPGCKGIWTVYHKSSRTHAVDSSKTTTEDDEYHAYLIISLETRTMVLETADDLGEVTE 2801
            V L GCKGIWTVYHKSSR HA DSSKT TE+DEYHAYLIISLE+RTMVLETADDLGEVTE
Sbjct: 601  VVLAGCKGIWTVYHKSSRGHATDSSKTMTENDEYHAYLIISLESRTMVLETADDLGEVTE 660

Query: 2800 SVDYYVQGSTIAAGNLFGRRRVVQIFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXX 2621
            +VDYYV GSTIAAGNLFGRRRV+QI+A+GARILDGSYMTQEL+FV H             
Sbjct: 661  TVDYYVHGSTIAAGNLFGRRRVIQIYAKGARILDGSYMTQELNFVAH-NSEQTSTELPTV 719

Query: 2620 XXXSIADPYVLLKMTDGSIQLLAGDPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPW 2441
               SIADPYVLLKMTDGSIQLL GDP+ CTVS+N PA F+S T+ ++ACTLYHDK  EPW
Sbjct: 720  ASVSIADPYVLLKMTDGSIQLLLGDPAACTVSLNAPAIFSSSTEPISACTLYHDKGLEPW 779

Query: 2440 LRKASSDAWLSTGVAEPIDENDGSYPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFI 2261
            LRK S+DAWLSTGVAEPID NDGSY D GDIYCLVCYE+G LEIFDVP+F+SV+S D F+
Sbjct: 780  LRKTSTDAWLSTGVAEPIDGNDGSYHDHGDIYCLVCYENGKLEIFDVPSFKSVYSVDNFV 839

Query: 2260 SGKSHLMDACI-IPHK-AKTKGLDETAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILND 2087
            SGK++L+D     P+K   TKG      KEP QN++VVELAMQRWSG+YSRPFLFG+L+D
Sbjct: 840  SGKTYLVDTYTKDPNKYPDTKGY---LNKEPVQNMRVVELAMQRWSGRYSRPFLFGMLSD 896

Query: 2086 GTMLCYHAYLYEGGEIAPKNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSN 1907
            GT+LCYHAY YEG E A K    V P  + D S+   SR RNLRF RVSIDI  REE SN
Sbjct: 897  GTILCYHAYFYEGTENAVKGGDPVSPRGSADTSSMSISRLRNLRFLRVSIDITTREEMSN 956

Query: 1906 LTARPRITIFKNVGGCQGLFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNH 1727
              +RPRIT+F NVGG QGLFLSGSRPAWLM+CRER+RVHPQLCDGSI AF VLHNVNCNH
Sbjct: 957  AVSRPRITVFNNVGGYQGLFLSGSRPAWLMVCRERIRVHPQLCDGSIAAFAVLHNVNCNH 1016

Query: 1726 GLIYVTSQGFLKICQLPSVCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRP 1547
            GLIYVTSQG+LKICQLPS  NYDN+WPVQKIPL  TPHQVTY+ EKNLYP+I+SVPV+RP
Sbjct: 1017 GLIYVTSQGYLKICQLPSSFNYDNHWPVQKIPLLGTPHQVTYYAEKNLYPLILSVPVIRP 1076

Query: 1546 LSQVLSSLVDQD-NNQMDQDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQA 1370
            L+QVLSSL+DQ+ + Q+D D+ +SD LQK+YS+DEFEVRI EP+KS GHW+ KAT+PMQ 
Sbjct: 1077 LNQVLSSLLDQEMSQQIDNDNFNSDDLQKTYSVDEFEVRILEPDKS-GHWDTKATVPMQT 1135

Query: 1369 SENALTVRNVTLVNSTTNENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSE 1190
            SENALTVR VTL N+TT ENE+L+AIGTAYVQGEDVAARGR+LLF+  + NE+ QNL+SE
Sbjct: 1136 SENALTVRIVTLFNTTTKENESLMAIGTAYVQGEDVAARGRVLLFSFAKTNESSQNLISE 1195

Query: 1189 VYSKELKGAISALASLQGHLLIASGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFI 1010
            VYSKELKGA+S LASLQGHLLIASG KITL+KWTGSEL GVAFYDAPLHVVSLNIVKNFI
Sbjct: 1196 VYSKELKGAVSNLASLQGHLLIASGPKITLHKWTGSELNGVAFYDAPLHVVSLNIVKNFI 1255

Query: 1009 LLGDIHKSIYFLIWKEQGAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFY 830
            L GD+HKSIYFL WKEQGAQLSLLAKDF  L+C+ATEFLIDG TLSLVVSDDQKNIQ   
Sbjct: 1256 LFGDVHKSIYFLNWKEQGAQLSLLAKDFGPLNCYATEFLIDGSTLSLVVSDDQKNIQ--- 1312

Query: 829  YAPRMMESWKGQKLLSRAEFHAGAHVSKFLRLQMLPTSSDRSSAT-PGSDKTNRFAILFG 653
                   SWKGQKLLSRAEFHAGA VSKFLRLQMLPTSSDRS AT PGS+KTNRFA+LF 
Sbjct: 1313 -------SWKGQKLLSRAEFHAGALVSKFLRLQMLPTSSDRSGATVPGSEKTNRFALLFS 1365

Query: 652  TLDGSIGCVAPLEEXXXXXXXXXXXXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMI 473
            TLDGSIGC+APLEE            LVD VPHVCGLNPR+FRQF SNGKAHRPGPD+M+
Sbjct: 1366 TLDGSIGCIAPLEELTFRRLQTLQRKLVDAVPHVCGLNPRSFRQFHSNGKAHRPGPDHMV 1425

Query: 472  DCELLSHYEMLPLEEQLEIAHQIGTTRSQILSNLNDLSLGTSFL 341
            D ELLSHYEMLPLEEQLEIAHQIGTTRSQILSN NDLSLGTSFL
Sbjct: 1426 DFELLSHYEMLPLEEQLEIAHQIGTTRSQILSNFNDLSLGTSFL 1469


>ref|XP_010277399.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Nelumbo nucifera]
          Length = 1457

 Score = 2165 bits (5610), Expect = 0.0
 Identities = 1080/1460 (73%), Positives = 1224/1460 (83%), Gaps = 11/1460 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAAFK+MHW TGIENC +GFITH                   + E   ++  GPIPNL
Sbjct: 1    MSYAAFKVMHWATGIENCVAGFITHSSADAAPKLPPLQTDE--FESEWTARKETGPIPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA N++EVY VRVQE+D R+  ++ E + GG VM GL GA LELVC Y+LHGNVE+M 
Sbjct: 59   VVTAGNILEVYVVRVQEEDNRSSRSSTEAKRGG-VMAGLSGASLELVCSYKLHGNVETMA 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLS+GG + S++RDSI+LAFQDAKI+VLEFDDS HGL  SSMHCFEGP+W YLKRGR SF
Sbjct: 118  VLSMGGGDGSRKRDSIILAFQDAKISVLEFDDSIHGLCISSMHCFEGPEWHYLKRGRESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRCGG LVYD QMI+LK+AQAG G   D+E S  G T S  +ESS+V++
Sbjct: 178  ARGPIVKVDPQGRCGGVLVYDLQMIILKSAQAGYGFAGDDEASGSGSTISARVESSYVIS 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCM+SALSISTTLKQHPL
Sbjct: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMVSALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLLAVPSPIGG+LVI +N+IHYHSQS SC L LN FAV +++S D+P+
Sbjct: 298  IWSAVNLPHDAYKLLAVPSPIGGVLVIGSNTIHYHSQSVSCVLALNNFAVPLDSSQDIPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            ++FNVELDAA ATWL HDVA+ S+KTGELLLLTLVYDGRVVQ+L+L+KSKASVLTSGIT+
Sbjct: 358  SSFNVELDAANATWLLHDVAILSTKTGELLLLTLVYDGRVVQRLELSKSKASVLTSGITT 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYS--YGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQ 3254
            IGNSFFFLGSRLGDSLLVQY+   G S+ +S HVK+E  DIE D P  KRLR   SD LQ
Sbjct: 418  IGNSFFFLGSRLGDSLLVQYTCGMGTSTTTSGHVKEEVGDIETDAPSVKRLRRSPSDPLQ 477

Query: 3253 DVASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQ 3074
            D+  GEELSLY + PN+SES QK+FSF VRDSLINVGPLKDFSYGLR+NADPNATGIAKQ
Sbjct: 478  DIVGGEELSLYGSAPNNSESVQKIFSFTVRDSLINVGPLKDFSYGLRLNADPNATGIAKQ 537

Query: 3073 SNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTT 2894
            SNYELVCCSGHGKNG+LCVLQQSIRPE+ITEVELPGCKGIWTVYHK++R H  DSSK  +
Sbjct: 538  SNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNSDSSKMVS 597

Query: 2893 EDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARG 2714
            EDDEYHAYLIISLE+RTMVLETAD LGEVTE+V+YYV GST+ AGNLFGRRRVVQIFARG
Sbjct: 598  EDDEYHAYLIISLESRTMVLETADLLGEVTETVEYYVLGSTVTAGNLFGRRRVVQIFARG 657

Query: 2713 ARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTC 2534
            AR+LDGSYMTQ++S                    SIADPYVLL+M+DGSIQLL GDPSTC
Sbjct: 658  ARVLDGSYMTQDISLGSPNPGSSSGSDNLTVSSASIADPYVLLRMSDGSIQLLIGDPSTC 717

Query: 2533 TVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQG 2354
            TVS+ VPA F SL +S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID  DG+  DQG
Sbjct: 718  TVSVTVPAVFESLKESISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAPSDQG 777

Query: 2353 DIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-----IPHKAKTKGLDET 2189
            DIYCLVCYESGTLEIF+VP+F+ VFS DKF+SGK+HL+D  I      PH ++ K  DE 
Sbjct: 778  DIYCLVCYESGTLEIFEVPSFKCVFSVDKFVSGKTHLVDTVIGEPSKDPHVSRNKNSDEM 837

Query: 2188 A---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVA 2018
            A   KKE   N+KVVELAMQRW GQ++RPFLFGIL DGT+ CYHA+LYEG E + K E A
Sbjct: 838  AGKVKKENVLNMKVVELAMQRWLGQHTRPFLFGILTDGTVFCYHAFLYEGSENSLKTEEA 897

Query: 2017 VYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSG 1838
                ++  +S+   SR RNLRF RV ++   REE+S L+   RITIFKNVGG QGLF+SG
Sbjct: 898  TSLQNSVSLSSISTSRLRNLRFVRVPLESYTREETSGLSTCQRITIFKNVGGYQGLFVSG 957

Query: 1837 SRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYD 1658
            SRPAW M+CRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSV +YD
Sbjct: 958  SRPAWFMICRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVSSYD 1017

Query: 1657 NYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQDN-NQMDQDSAS 1481
            NYWPVQKIPL+ATPHQVTYF EKNLYP+IVS+PVV+PL+QVLSSLVDQ+  +Q+D D  S
Sbjct: 1018 NYWPVQKIPLKATPHQVTYFAEKNLYPLIVSIPVVKPLNQVLSSLVDQEGGHQIDHDGLS 1077

Query: 1480 SDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETL 1301
             D L ++Y++DEFEVRI EPEKSGG W+ K TIPMQ+ E+ALTVR VTL N+TT ENETL
Sbjct: 1078 PDELHRTYTVDEFEVRIMEPEKSGGPWQTKVTIPMQSCESALTVRMVTLFNTTTKENETL 1137

Query: 1300 LAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIA 1121
            LAIGTAYVQGEDVAARGR+LLF+IGR+ +NPQNLVSEVYSKELKGAISALASLQGHLLIA
Sbjct: 1138 LAIGTAYVQGEDVAARGRVLLFSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIA 1197

Query: 1120 SGTKITLNKWTGSELVGVAFYDAPLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 941
            SG KI L+KWTG+EL GVAF+DAPL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQL+L
Sbjct: 1198 SGPKIILHKWTGTELNGVAFFDAPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNL 1257

Query: 940  LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 761
            LAKDF +LDCFATEFLIDG TLSLVVSDDQKN+QIFYYAP+M ESWKG KLLSRAEFH G
Sbjct: 1258 LAKDFGNLDCFATEFLIDGTTLSLVVSDDQKNVQIFYYAPKMSESWKGHKLLSRAEFHVG 1317

Query: 760  AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 581
            AHV+KFLRLQMLPTSSDR++A P SDKTNRFA+LFGTLDGSIGC+APL+E          
Sbjct: 1318 AHVTKFLRLQMLPTSSDRTTAAPSSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQTLQ 1377

Query: 580  XXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 401
              L+D VPHV GLNPRAFRQF SNGKAHRPGP+N++DCELL HYEML LEEQL++AHQIG
Sbjct: 1378 RKLIDAVPHVAGLNPRAFRQFHSNGKAHRPGPENIVDCELLCHYEMLQLEEQLDVAHQIG 1437

Query: 400  TTRSQILSNLNDLSLGTSFL 341
            TTR QILSNLNDLSLGTSFL
Sbjct: 1438 TTREQILSNLNDLSLGTSFL 1457


>ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X2 [Vitis vinifera]
          Length = 1442

 Score = 2158 bits (5591), Expect = 0.0
 Identities = 1080/1459 (74%), Positives = 1228/1459 (84%), Gaps = 10/1459 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAA+KMMHWPTGIENCASGF+TH                  L+ E   KR+IGP+PNL
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDD--LESEWPTKRQIGPLPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            +VTAAN++EVY VRVQEDD+R   A+AE + GG VM G+ GA LELVC YRLHGNVE+MT
Sbjct: 59   IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VL  GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG  SF
Sbjct: 118  VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRC G LVY  QMI+LKA+QAG GLV DEE  S G   S  +ESS+V++
Sbjct: 178  ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV  +NS +MP+
Sbjct: 298  IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI +
Sbjct: 358  SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IGNS FFLGSRLGDSLLVQ+    +S  S+ VK+E  DIEGD+P AKRLR  SSD+LQD+
Sbjct: 418  IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             +GEELSLY + PNS+E++QK FSF+VRDS INVGPLKDF+YGLRINADP ATGIAKQSN
Sbjct: 474  VNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSN 533

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H  DS+K  T+D
Sbjct: 534  YELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKD 593

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGAR 2708
            DEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFGRRRVVQ++ARGAR
Sbjct: 594  DEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGAR 653

Query: 2707 ILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCTV 2528
            ILDG++MTQ+L                     SIADPYVLL+M+DG+IQLL GDPSTCTV
Sbjct: 654  ILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTV 703

Query: 2527 SINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGDI 2348
            SIN+PA F S   S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID  DG+  DQGDI
Sbjct: 704  SINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDI 763

Query: 2347 YCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDETA- 2186
            YC+V YESG LEIFDVPNF  VFS DKF+SG +HL+D  I+       K  +K  +E A 
Sbjct: 764  YCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEAD 823

Query: 2185 --KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAVY 2012
              +KE   N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E  PK E AV 
Sbjct: 824  QGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVS 883

Query: 2011 PLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGSR 1832
              ++  ISN  ASR RNLRF RV +D   REE+ + T  PR+T+FKN+GGCQGLFLSGSR
Sbjct: 884  AQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSR 943

Query: 1831 PAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDNY 1652
            P W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+V +YDNY
Sbjct: 944  PLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNY 1003

Query: 1651 WPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASSD 1475
            WPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+  +Q++ D+ SSD
Sbjct: 1004 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSD 1063

Query: 1474 SLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLLA 1295
             L +SYS+DEFEVR+ EPEKSG  W+ +ATIPMQ+SENALTVR VTL N+TT ENETLLA
Sbjct: 1064 ELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1123

Query: 1294 IGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIASG 1115
            IGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQGHLLIASG
Sbjct: 1124 IGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASG 1183

Query: 1114 TKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSLL 938
             KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQGAQL+LL
Sbjct: 1184 PKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLL 1243

Query: 937  AKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAGA 758
            AKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSRAEFH GA
Sbjct: 1244 AKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1303

Query: 757  HVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXXX 578
            HV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E           
Sbjct: 1304 HVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1363

Query: 577  XLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIGT 398
             LVD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLEIA QIGT
Sbjct: 1364 KLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGT 1423

Query: 397  TRSQILSNLNDLSLGTSFL 341
            TR QILSNLNDLSLGTSFL
Sbjct: 1424 TRMQILSNLNDLSLGTSFL 1442


>emb|CBI24510.3| unnamed protein product, partial [Vitis vinifera]
          Length = 1448

 Score = 2151 bits (5574), Expect = 0.0
 Identities = 1080/1465 (73%), Positives = 1228/1465 (83%), Gaps = 16/1465 (1%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAA+KMMHWPTGIENCASGF+TH                  L+ E   KR+IGP+PNL
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDD--LESEWPTKRQIGPLPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            +VTAAN++EVY VRVQEDD+R   A+AE + GG VM G+ GA LELVC YRLHGNVE+MT
Sbjct: 59   IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VL  GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG  SF
Sbjct: 118  VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRC G LVY  QMI+LKA+QAG GLV DEE  S G   S  +ESS+V++
Sbjct: 178  ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV  +NS +MP+
Sbjct: 298  IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI +
Sbjct: 358  SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IGNS FFLGSRLGDSLLVQ+    +S  S+ VK+E  DIEGD+P AKRLR  SSD+LQD+
Sbjct: 418  IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473

Query: 3247 ASGEELSLYATTPNSSESAQ------KVFSFAVRDSLINVGPLKDFSYGLRINADPNATG 3086
             +GEELSLY + PNS+E++Q      K FSF+VRDS INVGPLKDF+YGLRINADP ATG
Sbjct: 474  VNGEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATG 533

Query: 3085 IAKQSNYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSS 2906
            IAKQSNYELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H  DS+
Sbjct: 534  IAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADST 593

Query: 2905 KTTTEDDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQI 2726
            K  T+DDEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFGRRRVVQ+
Sbjct: 594  KMATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQV 653

Query: 2725 FARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGD 2546
            +ARGARILDG++MTQ+L                     SIADPYVLL+M+DG+IQLL GD
Sbjct: 654  YARGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGD 703

Query: 2545 PSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSY 2366
            PSTCTVSIN+PA F S   S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID  DG+ 
Sbjct: 704  PSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAA 763

Query: 2365 PDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKG 2201
             DQGDIYC+V YESG LEIFDVPNF  VFS DKF+SG +HL+D  I+       K  +K 
Sbjct: 764  QDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKN 823

Query: 2200 LDETA---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPK 2030
             +E A   +KE   N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E  PK
Sbjct: 824  SEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPK 883

Query: 2029 NEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGL 1850
             E AV   ++  ISN  ASR RNLRF RV +D   REE+ + T  PR+T+FKN+GGCQGL
Sbjct: 884  TEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGL 943

Query: 1849 FLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSV 1670
            FLSGSRP W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+V
Sbjct: 944  FLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAV 1003

Query: 1669 CNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQ 1493
             +YDNYWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+  +Q++ 
Sbjct: 1004 SSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEN 1063

Query: 1492 DSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNE 1313
            D+ SSD L +SYS+DEFEVR+ EPEKSG  W+ +ATIPMQ+SENALTVR VTL N+TT E
Sbjct: 1064 DNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1123

Query: 1312 NETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGH 1133
            NETLLAIGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQGH
Sbjct: 1124 NETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGH 1183

Query: 1132 LLIASGTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQG 956
            LLIASG KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQG
Sbjct: 1184 LLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQG 1243

Query: 955  AQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRA 776
            AQL+LLAKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSRA
Sbjct: 1244 AQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRA 1303

Query: 775  EFHAGAHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXX 596
            EFH GAHV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E     
Sbjct: 1304 EFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1363

Query: 595  XXXXXXXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEI 416
                   LVD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLEI
Sbjct: 1364 LQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEI 1423

Query: 415  AHQIGTTRSQILSNLNDLSLGTSFL 341
            A QIGTTR QILSNLNDLSLGTSFL
Sbjct: 1424 AQQIGTTRMQILSNLNDLSLGTSFL 1448


>ref|XP_019081674.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Vitis vinifera]
 ref|XP_019081675.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Vitis vinifera]
          Length = 1449

 Score = 2151 bits (5573), Expect = 0.0
 Identities = 1080/1466 (73%), Positives = 1228/1466 (83%), Gaps = 17/1466 (1%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAA+KMMHWPTGIENCASGF+TH                  L+ E   KR+IGP+PNL
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDD--LESEWPTKRQIGPLPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            +VTAAN++EVY VRVQEDD+R   A+AE + GG VM G+ GA LELVC YRLHGNVE+MT
Sbjct: 59   IVTAANILEVYMVRVQEDDSRESRASAETKRGG-VMAGISGAALELVCQYRLHGNVETMT 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VL  GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRTSSMHCFEGP+W +LKRG  SF
Sbjct: 118  VLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRC G LVY  QMI+LKA+QAG GLV DEE  S G   S  +ESS+V++
Sbjct: 178  ARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVIS 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+HVKDFTFVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLL VPSPIGG++VI ANSIHYHSQSASC+L LN +AV  +NS +MP+
Sbjct: 298  IWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            ++F+VELDAA ATWLS+DVA+ S+KTGELLLLTL YDGRVV +LDL+KS+ASVLTSGI +
Sbjct: 358  SSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAA 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQDV 3248
            IGNS FFLGSRLGDSLLVQ+    +S  S+ VK+E  DIEGD+P AKRLR  SSD+LQD+
Sbjct: 418  IGNSLFFLGSRLGDSLLVQF----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDM 473

Query: 3247 ASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQSN 3068
             +GEELSLY + PNS+E++QK FSF+VRDS INVGPLKDF+YGLRINADP ATGIAKQSN
Sbjct: 474  VNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSN 533

Query: 3067 YELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTED 2888
            YELVCCSGHGKNG+LC+LQQSIRPE+ITEVELPGCKGIWTVYHK++R H  DS+K  T+D
Sbjct: 534  YELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKD 593

Query: 2887 DEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFG-------RRRVVQ 2729
            DEYHAYLIISLE+RTMVLETAD LGEVTESVDYYVQG TI+AGNLFG       RRRVVQ
Sbjct: 594  DEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRLIFCLFRRRVVQ 653

Query: 2728 IFARGARILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAG 2549
            ++ARGARILDG++MTQ+L                     SIADPYVLL+M+DG+IQLL G
Sbjct: 654  VYARGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVG 703

Query: 2548 DPSTCTVSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGS 2369
            DPSTCTVSIN+PA F S   S++ACTLYHDK PEPWLRK S+DAWLSTG+ E ID  DG+
Sbjct: 704  DPSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGA 763

Query: 2368 YPDQGDIYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTK 2204
              DQGDIYC+V YESG LEIFDVPNF  VFS DKF+SG +HL+D  I+       K  +K
Sbjct: 764  AQDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSK 823

Query: 2203 GLDETA---KKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAP 2033
              +E A   +KE   N+KVVELAMQRWSGQ+SRPFLFGIL DGT+LCYHAYLYEG E  P
Sbjct: 824  NSEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTP 883

Query: 2032 KNEVAVYPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQG 1853
            K E AV   ++  ISN  ASR RNLRF RV +D   REE+ + T  PR+T+FKN+GGCQG
Sbjct: 884  KTEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQG 943

Query: 1852 LFLSGSRPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPS 1673
            LFLSGSRP W M+ RER+RVHPQLCDGSIVAFTVLHN+NCNHGLIYVTSQGFLKICQLP+
Sbjct: 944  LFLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPA 1003

Query: 1672 VCNYDNYWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMD 1496
            V +YDNYWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV++PL+ VLSSLVDQ+  +Q++
Sbjct: 1004 VSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLE 1063

Query: 1495 QDSASSDSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTN 1316
             D+ SSD L +SYS+DEFEVR+ EPEKSG  W+ +ATIPMQ+SENALTVR VTL N+TT 
Sbjct: 1064 NDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTK 1123

Query: 1315 ENETLLAIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQG 1136
            ENETLLAIGTAYVQGEDVAARGR+LLF++G++ +N QNLVSE+YSKELKGAISA+ASLQG
Sbjct: 1124 ENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQG 1183

Query: 1135 HLLIASGTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQ 959
            HLLIASG KI L+KWTG+EL GVAF+DA PL+VVSLNIVKNFILLGDIH+SIYFL WKEQ
Sbjct: 1184 HLLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQ 1243

Query: 958  GAQLSLLAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSR 779
            GAQL+LLAKDF SLDCFATEFLIDG TLSL+VSDDQKNIQIFYYAP+M ESWKGQKLLSR
Sbjct: 1244 GAQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSR 1303

Query: 778  AEFHAGAHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXX 599
            AEFH GAHV+KFLRLQMLP SSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E    
Sbjct: 1304 AEFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFR 1363

Query: 598  XXXXXXXXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLE 419
                    LVD VPHV GLNPR+FRQFRSNGKAHRPGPDN++DCELL HYEMLP EEQLE
Sbjct: 1364 RLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLE 1423

Query: 418  IAHQIGTTRSQILSNLNDLSLGTSFL 341
            IA QIGTTR QILSNLNDLSLGTSFL
Sbjct: 1424 IAQQIGTTRMQILSNLNDLSLGTSFL 1449


>ref|XP_015877866.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Ziziphus jujuba]
          Length = 1453

 Score = 2137 bits (5536), Expect = 0.0
 Identities = 1081/1460 (74%), Positives = 1222/1460 (83%), Gaps = 11/1460 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MS+AAFKMMHWPTGIENCASGFITH                 D D   A +R IGPIPNL
Sbjct: 1    MSFAAFKMMHWPTGIENCASGFITHSRADFVPRIPPIQNDDLDSDWS-ASRREIGPIPNL 59

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            VVTA NV+EVY VR+QE+  R+  A+ E R GG VMDGL GA LELVCHYRLHGNVE+M 
Sbjct: 60   VVTAGNVLEVYVVRIQEESNRSSRASGESRRGG-VMDGLSGASLELVCHYRLHGNVETMA 118

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLS GG E S+RRDSI+L+FQDAKI+VL+FDDSTHGLRTSSMHCFEGP WL+LKRGR SF
Sbjct: 119  VLSTGGGESSRRRDSIILSFQDAKISVLDFDDSTHGLRTSSMHCFEGPKWLHLKRGRESF 178

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRCGG LVYD QMI+LKAAQAG GLV DE+ SS GG  S  IESS+++N
Sbjct: 179  ARGPLVKVDPQGRCGGVLVYDFQMIILKAAQAGSGLVVDEDTSSSGGAVSAHIESSYIIN 238

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLDM+H+KDF FVHGYIEPVMVILHE+E TWAGR++WKHHTCM+SALSISTTLKQHPL
Sbjct: 239  LRDLDMKHIKDFIFVHGYIEPVMVILHERELTWAGRVAWKHHTCMVSALSISTTLKQHPL 298

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSA NLPHDAYKLLAVPSPIGG+LVI ANSIHYHSQS SC+L LN FAV +++S +MP+
Sbjct: 299  IWSAANLPHDAYKLLAVPSPIGGVLVIGANSIHYHSQSTSCALALNNFAVSVDSSQEMPR 358

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            ++FNVELDAA ATWL +DVA+ S+KTGELLLLT+VYDGRVVQ+LDL+KSKASVLTSGIT+
Sbjct: 359  SSFNVELDAANATWLLNDVALLSTKTGELLLLTIVYDGRVVQRLDLSKSKASVLTSGITT 418

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYG-ASSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 3251
            IGNS FFLGSRLGDSLLVQ++ G  SS  S+ +KDE  DIEGD P AKRLR +SSD+ QD
Sbjct: 419  IGNSLFFLGSRLGDSLLVQFTCGVGSSIMSSALKDEVGDIEGDAPSAKRLRRLSSDASQD 478

Query: 3250 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 3071
            +ASGEELSLY + PN++ESAQK FSFAVRDSLINVGP+KDFSYGLR+NAD NATGIAKQS
Sbjct: 479  MASGEELSLYGSAPNNTESAQKSFSFAVRDSLINVGPIKDFSYGLRVNADTNATGIAKQS 538

Query: 3070 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 2891
            NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+R H VDS+K+   
Sbjct: 539  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRGHNVDSAKSAAA 598

Query: 2890 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 2711
            DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA
Sbjct: 599  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 658

Query: 2710 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCT 2531
            RILDGS+MTQ+LS V                  SIADPYV+L+MTDGSI+LL GDPS+CT
Sbjct: 659  RILDGSFMTQDLSIVA--ANSESGSESATVLSVSIADPYVVLRMTDGSIRLLIGDPSSCT 716

Query: 2530 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2351
            VSI+ PA F S    ++ACTLYHD  PEPWLRK S+DAWLSTGV E +D  DGS  DQGD
Sbjct: 717  VSISTPAAFESSKKLISACTLYHDDGPEPWLRKTSTDAWLSTGVDEAVDGADGSLHDQGD 776

Query: 2350 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACI-----IPHKAKTKGLDET- 2189
            IYC+VCYESG+LEI+DVPNF  VFS +KFISGK +L+D  +      P K   +  ++  
Sbjct: 777  IYCVVCYESGSLEIYDVPNFNCVFSVEKFISGKMNLLDTLVEEQSKDPQKLMNRSSEDVS 836

Query: 2188 --AKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2015
              A+KE  QN+K+VELAMQRWSGQ+SRPFLFGIL+DGT+LCYHAYL+EG E A K E +V
Sbjct: 837  GQARKENVQNMKIVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLFEGPESASKTEDSV 896

Query: 2014 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 1835
               S   +SN  ASR RNLRF RV++D   +EE+ N T+  RI+IFKN+ G QGLFLSGS
Sbjct: 897  SAQSLSGLSNNSASRLRNLRFVRVALDTYAKEETPNATSCQRISIFKNIAGYQGLFLSGS 956

Query: 1834 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 1655
            RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQG LKICQLPS+ +YD+
Sbjct: 957  RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGILKICQLPSITSYDS 1016

Query: 1654 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 1478
            YWPVQKIPL+ TPHQVTYF EKNLYP+IVSVPV +PL+QV+SSL+DQ+  +Q +  + SS
Sbjct: 1017 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVISSLIDQEVGHQAENHNLSS 1076

Query: 1477 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 1298
            D L ++Y++DEFEVRI EPE SGG W+ KATIPMQ SENALTVR VTL N+TT ENETLL
Sbjct: 1077 DDLHRTYTVDEFEVRILEPEISGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLL 1136

Query: 1297 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 1118
            AIGTAYVQGEDVAARGR+LLF+IG    NPQNLVSEVY+K+LKGAISALASLQGHLL+AS
Sbjct: 1137 AIGTAYVQGEDVAARGRVLLFSIG---NNPQNLVSEVYTKDLKGAISALASLQGHLLMAS 1193

Query: 1117 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 941
            G KI L+KWTG EL  VAF+D  PL+VVSLNIVKNFILLGDIHKSIYFL WKEQGAQLSL
Sbjct: 1194 GPKIILHKWTGGELNAVAFFDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSL 1253

Query: 940  LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 761
            LAKDF SLDCFATEFLIDG TLSLVVSD++KNIQIFYYAP+M ESWKGQKLLSRAEFH G
Sbjct: 1254 LAKDFGSLDCFATEFLIDGSTLSLVVSDNRKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1313

Query: 760  AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 581
            AHV+K LRLQML T+SDR+     SDKTNRFA+LFGTLDGS+GC+APL+E          
Sbjct: 1314 AHVTKLLRLQMLSTTSDRTGTASVSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQSLQ 1373

Query: 580  XXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 401
              LVD V HV GLNPR+FRQFRSNGKAHRPGPD+++DCELL HYEMLPLEEQLEIAHQIG
Sbjct: 1374 KKLVDAVSHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQIG 1433

Query: 400  TTRSQILSNLNDLSLGTSFL 341
            TTRSQILSNLNDLSLGTSFL
Sbjct: 1434 TTRSQILSNLNDLSLGTSFL 1453


>gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao]
          Length = 1457

 Score = 2133 bits (5527), Expect = 0.0
 Identities = 1073/1460 (73%), Positives = 1215/1460 (83%), Gaps = 11/1460 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAA+KMMHWPTGIENCASGF+TH                  L+ E   +R IGP+PNL
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTED--LESEWPARRGIGPVPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            +VTAAN++E+Y VRVQE+  R    + E + GG V+DG+ G  LELVC+YRLHGNVESM 
Sbjct: 59   IVTAANLLEIYVVRVQEEGRREARNSTEVKRGG-VLDGVSGVSLELVCNYRLHGNVESMA 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLS+GG + S+RRDSI+LAF+DAKI+VLEFDDS HGLRT+SMHCFEGP+WL+LKRGR SF
Sbjct: 118  VLSIGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRCGG LVYD QMI+LKA+QAG G V +++    GG  S  +ESS+++N
Sbjct: 178  ARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIIN 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLD++H+KDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLLAVPSPIGG+LVI AN+IHYHSQSASC+L LN +A+ ++NS D+P+
Sbjct: 298  IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +NF+VELDAA ATWL +DVA+ S+KTGELLLLTL+YDGRVVQ+LDL+KSKASVLTS IT+
Sbjct: 358  SNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITT 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGAS-SPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 3251
            IGNS FFLGSRLGDSLLVQ+S G+  S   + +K+E  DIEGD+P+AKRLR  SSD+LQD
Sbjct: 418  IGNSLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQD 477

Query: 3250 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 3071
            +  GEELSLY + PN++ESAQK F FAVRDSL NVGPLKDFSYGLRINAD NATGIAKQS
Sbjct: 478  MVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQS 537

Query: 3070 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 2891
            NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVEL GCKGIWTVYHKS+R+H+ D SK T +
Sbjct: 538  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597

Query: 2890 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 2711
            DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA
Sbjct: 598  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 657

Query: 2710 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCT 2531
            RILDGS+MTQELS                    SIADPYVLL+MTDGSI LL GDP+TCT
Sbjct: 658  RILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCT 717

Query: 2530 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2351
            VSIN P  F      V+ACTLYHDK PEPWLRKAS+DAWLSTGV E ID  DG   DQGD
Sbjct: 718  VSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGD 777

Query: 2350 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDE-- 2192
            IYC+VCYESG LEIFDVPNF  VFS +KF SG++ L+DA  +       K   K  +E  
Sbjct: 778  IYCVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELT 837

Query: 2191 -TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2015
               +KE  QNLKVVELAMQRWS  +SRPFLFGIL DGT+LCYHAYL+EG E A K E +V
Sbjct: 838  GQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSV 897

Query: 2014 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 1835
               ++  +SN  ASR RNLRF R+ +D   REE SN T   RITIFKN+ G QG FLSGS
Sbjct: 898  VAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGS 957

Query: 1834 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 1655
            RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQ+PS  NYDN
Sbjct: 958  RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDN 1017

Query: 1654 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 1478
            YWPVQKIPLR TPHQVTYF E+NLYP+IVSVPV +P++QVLSSLVDQ+  +QMD  + SS
Sbjct: 1018 YWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSS 1077

Query: 1477 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 1298
            D LQ++Y++DEFEVRI EPEKSGG WE KATIPMQ+SENALTVR VTL N+TT ENE+LL
Sbjct: 1078 DELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLL 1137

Query: 1297 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 1118
            AIGTAY+QGEDVAARGR++L +IGR+ +N QNLVSEVYSKELKGAISALASLQGHLLIAS
Sbjct: 1138 AIGTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIAS 1197

Query: 1117 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 941
            G KI L+ WTGSEL G+AFYDA PL+VVSLNIVKNFILLGD+HKSIYFL WKEQGAQLSL
Sbjct: 1198 GPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSL 1257

Query: 940  LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 761
            LAKDF SLDCFATEFLIDG TLSL+VSD+QKNIQIFYYAP+M ESWKGQKLLSRAEFH G
Sbjct: 1258 LAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1317

Query: 760  AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 581
            AHV+KFLRLQML TSSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E          
Sbjct: 1318 AHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1377

Query: 580  XXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 401
              LVD VPHV GLNPR+FRQF SNGKAHRPGPD+++DCELL HYEMLPLEEQL+IAHQIG
Sbjct: 1378 KKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIG 1437

Query: 400  TTRSQILSNLNDLSLGTSFL 341
            TTRSQILSNLNDL+LGTSFL
Sbjct: 1438 TTRSQILSNLNDLTLGTSFL 1457


>ref|XP_017972864.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform X1 [Theobroma cacao]
          Length = 1457

 Score = 2132 bits (5525), Expect = 0.0
 Identities = 1072/1460 (73%), Positives = 1215/1460 (83%), Gaps = 11/1460 (0%)
 Frame = -2

Query: 4687 MSYAAFKMMHWPTGIENCASGFITHXXXXXXXXXXXXXXXXXDLDPELAPKRRIGPIPNL 4508
            MSYAA+KMMHWPTGI+NCASGF+TH                  L+ E   +R IGP+PNL
Sbjct: 1    MSYAAYKMMHWPTGIDNCASGFVTHCRADFTPQIPLNQTED--LESEWPARRGIGPVPNL 58

Query: 4507 VVTAANVIEVYTVRVQEDDTRAVPAAAEPRGGGGVMDGLCGARLELVCHYRLHGNVESMT 4328
            +VTAAN++E+Y VRVQE+  R    + E + GG V+DG+    LELVC+YRLHGNVESM 
Sbjct: 59   IVTAANLLEIYVVRVQEEGRREARNSTEVKRGG-VLDGVSRVSLELVCNYRLHGNVESMA 117

Query: 4327 VLSLGGDECSKRRDSIVLAFQDAKITVLEFDDSTHGLRTSSMHCFEGPDWLYLKRGRGSF 4148
            VLS+GG + S+RRDSI+LAFQDAKI+VLEFDDS HGLRT+SMHCFEGP+WL+LKRGR SF
Sbjct: 118  VLSIGGGDGSRRRDSIILAFQDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESF 177

Query: 4147 ARGPVVKADPLGRCGGTLVYDHQMIVLKAAQAGQGLVADEEPSSLGGTTSVCIESSFVLN 3968
            ARGP+VK DP GRCGG LVYD QMI+LKA+QAG G V +++    GG  S  +ESS+++N
Sbjct: 178  ARGPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIIN 237

Query: 3967 LRDLDMQHVKDFTFVHGYIEPVMVILHEKEPTWAGRISWKHHTCMISALSISTTLKQHPL 3788
            LRDLD++H+KDF FVHGYIEPVMVILHE+E TWAGR+SWKHHTCMISALSISTTLKQHPL
Sbjct: 238  LRDLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297

Query: 3787 IWSAVNLPHDAYKLLAVPSPIGGILVICANSIHYHSQSASCSLGLNKFAVQMENSSDMPK 3608
            IWSAVNLPHDAYKLLAVPSPIGG+LVI AN+IHYHSQSASC+L LN +A+ ++NS D+P+
Sbjct: 298  IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPR 357

Query: 3607 TNFNVELDAAKATWLSHDVAVFSSKTGELLLLTLVYDGRVVQKLDLTKSKASVLTSGITS 3428
            +NF+VELDAA ATWL +DVA+ S+KTGELLLLTL+YDGRVVQ+LDL+KSKASVLTS IT+
Sbjct: 358  SNFSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITT 417

Query: 3427 IGNSFFFLGSRLGDSLLVQYSYGA-SSPSSAHVKDESVDIEGDLPMAKRLRGMSSDSLQD 3251
            IGNS FFLGSRLGDSLLVQ+S G+ +S   + +K+E  DIEGD+P+AKRLR  SSD+LQD
Sbjct: 418  IGNSLFFLGSRLGDSLLVQFSGGSGASALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQD 477

Query: 3250 VASGEELSLYATTPNSSESAQKVFSFAVRDSLINVGPLKDFSYGLRINADPNATGIAKQS 3071
            +  GEELSLY + PN++ESAQK F FAVRDSL NVGPLKDFSYGLRINAD NATGIAKQS
Sbjct: 478  MVGGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQS 537

Query: 3070 NYELVCCSGHGKNGSLCVLQQSIRPELITEVELPGCKGIWTVYHKSSRTHAVDSSKTTTE 2891
            NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVEL GCKGIWTVYHKS+R+H+ D SK T +
Sbjct: 538  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDD 597

Query: 2890 DDEYHAYLIISLETRTMVLETADDLGEVTESVDYYVQGSTIAAGNLFGRRRVVQIFARGA 2711
            DDEYHAYLIISLE RTMVLETAD L EVTESVDYYVQG TIAAGNLFGRRRVVQ++ RGA
Sbjct: 598  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGA 657

Query: 2710 RILDGSYMTQELSFVVHXXXXXXXXXXXXXXXXSIADPYVLLKMTDGSIQLLAGDPSTCT 2531
            RILDGS+MTQELS                    SIADPYVLL+MTDGSI LL GDP+TCT
Sbjct: 658  RILDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCT 717

Query: 2530 VSINVPARFASLTDSVAACTLYHDKSPEPWLRKASSDAWLSTGVAEPIDENDGSYPDQGD 2351
            VSIN P  F      V+ACTLYHDK PEPWLRKAS+DAWLSTGV E ID  DG   DQGD
Sbjct: 718  VSINTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGD 777

Query: 2350 IYCLVCYESGTLEIFDVPNFRSVFSTDKFISGKSHLMDACII-----PHKAKTKGLDE-- 2192
            IYC+VCYESG LEIFDVPNF  VFS + F SG++ L+DA  +       K   K  +E  
Sbjct: 778  IYCVVCYESGALEIFDVPNFNCVFSMENFSSGRTRLVDAYTLESSKDSEKVINKSSEELT 837

Query: 2191 -TAKKEPPQNLKVVELAMQRWSGQYSRPFLFGILNDGTMLCYHAYLYEGGEIAPKNEVAV 2015
               +KE  QNLKVVELAMQRWS  +SRPFLFGIL DGT+LCYHAYL+EG E A K E +V
Sbjct: 838  GQGRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSV 897

Query: 2014 YPLSTGDISNTGASRFRNLRFYRVSIDINIREESSNLTARPRITIFKNVGGCQGLFLSGS 1835
               ++  +SN  ASR RNLRF R+ +D   REE SN T   RITIFKN+ G QG FLSGS
Sbjct: 898  VAQNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGS 957

Query: 1834 RPAWLMLCRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGFLKICQLPSVCNYDN 1655
            RPAW M+ RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQ+PS  NYDN
Sbjct: 958  RPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDN 1017

Query: 1654 YWPVQKIPLRATPHQVTYFLEKNLYPVIVSVPVVRPLSQVLSSLVDQD-NNQMDQDSASS 1478
            YWPVQKIPLR TPHQVTYF E+NLYP+IVSVPV +P++QVLSSLVDQ+  +QMD  + SS
Sbjct: 1018 YWPVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSS 1077

Query: 1477 DSLQKSYSIDEFEVRIFEPEKSGGHWEPKATIPMQASENALTVRNVTLVNSTTNENETLL 1298
            D LQ++Y++DEFEVRI EPEKSGG WE KATIPMQ+SENALTVR VTL N+TT ENE+LL
Sbjct: 1078 DELQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLL 1137

Query: 1297 AIGTAYVQGEDVAARGRILLFTIGRDNENPQNLVSEVYSKELKGAISALASLQGHLLIAS 1118
            AIGTAY+QGEDVAARGR++L +IGR+ +NPQNLVSEVYSKELKGAISALASLQGHLLIAS
Sbjct: 1138 AIGTAYIQGEDVAARGRVILCSIGRNTDNPQNLVSEVYSKELKGAISALASLQGHLLIAS 1197

Query: 1117 GTKITLNKWTGSELVGVAFYDA-PLHVVSLNIVKNFILLGDIHKSIYFLIWKEQGAQLSL 941
            G KI L+ WTGSEL G+AFYDA PL+VVSLNIVKNFILLGD+HKSIYFL WKEQGAQLSL
Sbjct: 1198 GPKIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSL 1257

Query: 940  LAKDFSSLDCFATEFLIDGGTLSLVVSDDQKNIQIFYYAPRMMESWKGQKLLSRAEFHAG 761
            LAKDF SLDCFATEFLIDG TLSL+VSD+QKNIQIFYYAP+M ESWKGQKLLSRAEFH G
Sbjct: 1258 LAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1317

Query: 760  AHVSKFLRLQMLPTSSDRSSATPGSDKTNRFAILFGTLDGSIGCVAPLEEXXXXXXXXXX 581
            AHV+KFLRLQML TSSDR+SAT GSDKTNRFA+LFGTLDGSIGC+APL+E          
Sbjct: 1318 AHVTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1377

Query: 580  XXLVDVVPHVCGLNPRAFRQFRSNGKAHRPGPDNMIDCELLSHYEMLPLEEQLEIAHQIG 401
              LVD VPHV GLNPR+FRQF SNGKAHRPGPD+++DCELL HYEMLPLEEQL+IAHQIG
Sbjct: 1378 KKLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIG 1437

Query: 400  TTRSQILSNLNDLSLGTSFL 341
            TTRSQILSNLNDL+LGTSFL
Sbjct: 1438 TTRSQILSNLNDLTLGTSFL 1457


Top