BLASTX nr result

ID: Ephedra27_contig00010537 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00010537
         (2888 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006827641.1| hypothetical protein AMTR_s00009p00247750 [A...  1065   0.0  
ref|XP_002268591.1| PREDICTED: cleavage and polyadenylation spec...  1053   0.0  
ref|XP_004499957.1| PREDICTED: cleavage and polyadenylation spec...  1019   0.0  
ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation spec...  1016   0.0  
gb|EOY23219.1| Cleavage and polyadenylation specificity factor 1...  1016   0.0  
ref|XP_006587302.1| PREDICTED: cleavage and polyadenylation spec...  1014   0.0  
ref|XP_006490412.1| PREDICTED: cleavage and polyadenylation spec...  1013   0.0  
ref|XP_006421948.1| hypothetical protein CICLE_v10004414mg [Citr...  1011   0.0  
ref|XP_002517902.1| cleavage and polyadenylation specificity fac...  1011   0.0  
gb|ESW24245.1| hypothetical protein PHAVU_004G114000g [Phaseolus...  1008   0.0  
ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation spec...  1004   0.0  
gb|EXC19142.1| Cleavage and polyadenylation specificity factor s...  1003   0.0  
ref|XP_006369487.1| Cleavage and polyadenylation specificity fac...  1001   0.0  
ref|XP_002330904.1| predicted protein [Populus trichocarpa]          1001   0.0  
ref|XP_004234405.1| PREDICTED: cleavage and polyadenylation spec...   999   0.0  
ref|XP_006353867.1| PREDICTED: cleavage and polyadenylation spec...   994   0.0  
ref|NP_001063978.1| Os09g0569400 [Oryza sativa Japonica Group] g...   993   0.0  
gb|EMJ21437.1| hypothetical protein PRUPE_ppa001928mg [Prunus pe...   985   0.0  
ref|XP_003565596.1| PREDICTED: cleavage and polyadenylation spec...   983   0.0  
ref|XP_003578687.1| PREDICTED: cleavage and polyadenylation spec...   981   0.0  

>ref|XP_006827641.1| hypothetical protein AMTR_s00009p00247750 [Amborella trichopoda]
            gi|548832261|gb|ERM95057.1| hypothetical protein
            AMTR_s00009p00247750 [Amborella trichopoda]
          Length = 737

 Score = 1065 bits (2753), Expect = 0.0
 Identities = 533/740 (72%), Positives = 621/740 (83%), Gaps = 9/740 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV +TPLSGVHSE+PLSYLL++DGF FLVDCGWNDFFDP+ L PLS+V+S++DAVL+
Sbjct: 1    MGTSVQLTPLSGVHSENPLSYLLSLDGFNFLVDCGWNDFFDPELLQPLSRVSSTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+FGL AP+Y TEPV++ GLLTMYDH+LSRR VSDFDLF+LDDID
Sbjct: 61   SHPDTVHLGALPYAMKQFGLSAPVYSTEPVHKLGLLTMYDHYLSRRQVSDFDLFSLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L YSQ+Y L+GKGEGIVITP+ AG LLGGT+WKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  AAFQNVTRLTYSQDYHLSGKGEGIVITPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVLESFVRPAVLITDAYNALNNQPS RQ  DQEFLD ILRTLRGDG VLLPVDTA
Sbjct: 181  RHLNGTVLESFVRPAVLITDAYNALNNQPSTRQR-DQEFLDAILRTLRGDGKVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LEQYW +HHL+YPIAFLTNVATST+++ KS LEWM DSI KSFEH+RDN F
Sbjct: 240  GRVLELILILEQYWTQHHLSYPIAFLTNVATSTIEYAKSSLEWMIDSIGKSFEHTRDNVF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK  N+++++KEL+++PEGPKVVLASMASLEEGFS DIF++WA D KNLV+FTER QFG
Sbjct: 300  VLKNFNIIINKKELEKLPEGPKVVLASMASLEEGFSHDIFVEWAVDSKNLVVFTERAQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQ +P PK VKVT+ KRVPL GEELKAYEEEQNR+K EEALKA+ SKE+D+K+S 
Sbjct: 360  TLARMLQVDPPPKVVKVTMHKRVPLVGEELKAYEEEQNRIKKEEALKASLSKEDDLKASC 419

Query: 1480 V--SDNTSDPMVIDSVAGVV-PEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEW 1319
            +    + SDPMVIDS  G++  E AS R   +RDVL DGF+P S+S++PMFPF E  +EW
Sbjct: 420  IVPDKSLSDPMVIDSAGGLISSEVASPRIVGYRDVLIDGFVPPSTSVSPMFPFYENSREW 479

Query: 1318 DEYGEVIDPENYVIKND---XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVH 1148
            D++GEVI+P++Y IK +                   D  + D+L D+K +KVVS+E+TV 
Sbjct: 480  DDFGEVINPDDYAIKEEDMLDPTSVAVLGGGLEDKFDEDSNDMLLDSKPSKVVSNELTVQ 539

Query: 1147 VKCSLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQ 968
            VKCSL Y DFEGRSD RSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK   SHVYAPQ
Sbjct: 540  VKCSLIYKDFEGRSDSRSIKTILAHVAPLKLVLVHGSAEATEHLKQHCLKNVCSHVYAPQ 599

Query: 967  IEETIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDDMLSLLPLVNDPP 788
            I ETIDVTSDL AYKV+LSE+LMS+VLFKKLG+YEIAWIDG+V + D ML+L+PL   PP
Sbjct: 600  IGETIDVTSDLCAYKVRLSERLMSNVLFKKLGDYEIAWIDGEVNETDGMLTLVPLSTGPP 659

Query: 787  LHKSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQKSGVHQIVIEG 608
            LHKSV VGDL+LADFKQ LASKGV AEF  G LRCG+ IT+RKVGDS  K    Q+ IEG
Sbjct: 660  LHKSVLVGDLKLADFKQFLASKGVPAEFSKGFLRCGENITLRKVGDS--KGATQQVGIEG 717

Query: 607  PLTEEYFKIRQYLYSQFYVL 548
            PLTEEY+KIR+ LYSQFY+L
Sbjct: 718  PLTEEYYKIRELLYSQFYLL 737


>ref|XP_002268591.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 2
            [Vitis vinifera] gi|302143847|emb|CBI22708.3| unnamed
            protein product [Vitis vinifera]
          Length = 740

 Score = 1053 bits (2722), Expect = 0.0
 Identities = 525/741 (70%), Positives = 617/741 (83%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FLVDCGWND FDP  L PL++VAS++DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSFLQPLARVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            +H DT H+GALPYA+K+ GL AP+Y TEPVYR GLLTMYD +LSR+ VSDFDLFTLDDID
Sbjct: 61   AHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSDFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L YSQNY L GKGEGIVI P+ AG LLGGTVWKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQNVTRLTYSQNYHLFGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            R LNGTVLESFVRPAVLITDAYNALNNQPSRRQ  DQEFLD+IL+TLRGDGNVLLPVDTA
Sbjct: 181  RLLNGTVLESFVRPAVLITDAYNALNNQPSRRQR-DQEFLDVILKTLRGDGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LEQYW +HHL YPI FLT VA+ST+D+VKSFLEWMSDSIAKSFEH+RDNAF
Sbjct: 240  GRVLELMLILEQYWTQHHLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK++ +L+S+ EL+++P+GPK+VLASMASLE GFS DIF++WA+D KNLV+F+ERGQF 
Sbjct: 300  LLKHVTLLISKSELEKVPDGPKIVLASMASLEAGFSHDIFVEWATDAKNLVLFSERGQFA 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVT+SKRVPL GEEL AYEEEQ R+K EEALKA+ SKE+++K+S 
Sbjct: 360  TLARMLQADPPPKAVKVTMSKRVPLVGEELAAYEEEQERIKKEEALKASLSKEDEMKASR 419

Query: 1480 VSDN-TSDPMVIDSVAGVVPEGAS----SRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             SDN   DPMVID+         +      HRD+L DGF+P S+S+APMFPF E   EWD
Sbjct: 420  GSDNKLGDPMVIDTTTPPASSDVAVPHVGGHRDILIDGFVPPSTSVAPMFPFYENSSEWD 479

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+PE+YVIK+ D                D  A  L+ DT  +KV+S+E+TV VKC
Sbjct: 480  DFGEVINPEDYVIKDEDMDQATMQVGDDLNGKLDEGAASLIFDTTPSKVISNELTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
             L Y+DFEGRSDGRSIKSI+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVYAPQI E
Sbjct: 540  MLVYMDFEGRSDGRSIKSILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQIGE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS+VLFKKLG+YE+AW+D +VGK +   LSLLPL   PP H
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEVAWVDAEVGKTESGSLSLLPLSTPPPSH 659

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQKSG---VHQIVIE 611
             +VFVGD+++ADFKQ LASKG+Q EF GG LRCG+Y+T+RKVGD+SQK G   + QIV+E
Sbjct: 660  DTVFVGDIKMADFKQFLASKGIQVEFSGGALRCGEYVTLRKVGDASQKGGGAIIQQIVME 719

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL +EY+KIR+YLYSQ+Y+L
Sbjct: 720  GPLCDEYYKIREYLYSQYYLL 740


>ref|XP_004499957.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Cicer arietinum]
          Length = 740

 Score = 1019 bits (2634), Expect = 0.0
 Identities = 517/741 (69%), Positives = 607/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FL+D GWND FDP  L PLSKVASS+DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDVGWNDNFDPSLLQPLSKVASSIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP++ TEPVYR GLLTMYDHFLSR+ +SDFDLFTLD ID
Sbjct: 61   SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDHFLSRKQISDFDLFTLDHID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ+VT L YSQN+ L+GKGEGIVI P+ AG LLGGT+WKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQSVTRLTYSQNHHLSGKGEGIVIAPHNAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVL SFVRPAVLITDAYNALNNQP RRQ  D+EF D++ +TLR  GNVLLPVDTA
Sbjct: 181  RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQK-DKEFGDILKKTLRAGGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LE YW+  +L YPI FLT VA+ST+D+VKSFLEWMSDSIAKSFE +R+N F
Sbjct: 240  GRVLELILMLESYWSDENLNYPIYFLTYVASSTIDYVKSFLEWMSDSIAKSFEQTRENIF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LKY+ +++++ + D  P+GPKVVLASMASLE GFS DIF++W +D KNLV+FTERGQFG
Sbjct: 300  LLKYVTLMVNKTDFDNAPDGPKVVLASMASLEAGFSHDIFVEWGNDVKNLVLFTERGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVTVSKRVPL GEEL AYEEEQNR+K EEALKA+  KEE++K+S 
Sbjct: 360  TLARMLQADPPPKAVKVTVSKRVPLVGEELIAYEEEQNRIKKEEALKASLLKEEELKASH 419

Query: 1480 VSD-NTSDPMVIDS-VAGVVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             +D NTSDPMVID+      PE    R   +RDV  DGF+P S+S+APMFP  E   EWD
Sbjct: 420  GADNNTSDPMVIDTGNKQPSPEATVQRNGGYRDVFIDGFVPPSTSVAPMFPCYENTSEWD 479

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P++YVIK+ D                D     L+ DTK +KV+SDE TV V+C
Sbjct: 480  DFGEVINPDDYVIKDEDMDQNANHVGGDINGKLDEGPASLILDTKPSKVLSDERTVQVRC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRSIK+I+ HVAPLKLVLVHGSAEAT+HL+QHCLK    HVYAPQIEE
Sbjct: 540  SLIYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATDHLKQHCLKNVCPHVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGK-NDDMLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSE+LMS+VLFKKLGEYEIAW+D +VGK  +DMLSLLP+   P  H
Sbjct: 600  TIDVTSDLCAYKVQLSERLMSNVLFKKLGEYEIAWVDAEVGKAENDMLSLLPVSGPPRPH 659

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL+LADFKQ L++KGV  EF GG LRCG+Y+TVRKVGD++QK   SG  QI+IE
Sbjct: 660  KSVLVGDLKLADFKQFLSTKGVPVEFAGGALRCGEYVTVRKVGDAAQKGAGSGTQQIIIE 719

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR YLYSQFY+L
Sbjct: 720  GPLCEDYYKIRDYLYSQFYLL 740


>ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Cucumis sativus]
          Length = 738

 Score = 1016 bits (2628), Expect = 0.0
 Identities = 513/740 (69%), Positives = 608/740 (82%), Gaps = 9/740 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL++VD F FL+DCGWND FDP  L PLS+VAS++DAVLI
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLI 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP++ TEPVYR GLLTMYD F++R+ VS+FDLFTLDDID
Sbjct: 61   SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ VT L YSQN+ L+GKGEGIVI P+ AG LLGGT+WKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGT+LESFVRPAVLITDAYNALNNQP RRQ  D+EF D I +TLR +GNVLLPVDTA
Sbjct: 181  RHLNGTILESFVRPAVLITDAYNALNNQPYRRQK-DKEFGDTIQKTLRANGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+  LE YW +  L YPI FLT VA+ST+D++KSFLEWMSD+IAKSFEH+R+NAF
Sbjct: 240  GRVLELIQILEWYWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK++ +L+++ ELD  P+GPKVVLASMASLE G+S DIF+DWA D KNLV+F+ERGQFG
Sbjct: 300  LLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVTVSKRVPL G+EL AYEEEQNR K EEALKA+  KEE  K+S 
Sbjct: 360  TLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASH 418

Query: 1480 VSDN-TSDPMVIDSVAGVVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWDE 1313
             +DN T DPM+ID+ + V P+  SS    +RD+L DGF+P S+ +APMFPF E    WD+
Sbjct: 419  GADNDTGDPMIIDASSNVAPDVGSSHGGAYRDILIDGFVPPSTGVAPMFPFYENTSAWDD 478

Query: 1312 YGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKCS 1136
            +GEVI+P++YVIK+ D                D  A +L+ D K +KVVS+E+TV VKCS
Sbjct: 479  FGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCS 538

Query: 1135 LNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEET 956
            L+Y+DFEGRSDGRSIKSI+ HVAPLKLVLVHG+AEATEHL+QHCLK    HVYAPQIEET
Sbjct: 539  LHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET 598

Query: 955  IDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLHK 779
            IDVTSDL AYKVQLSEKLMS+VLFKKLG+YEI W+D +VGK ++  LSLLPL   P  HK
Sbjct: 599  IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEITWLDAEVGKTENGTLSLLPLSKAPAPHK 658

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIEG 608
            SV VGDL++ADFKQ LASKG+Q EF GG LRCG+Y+T+RKV D+SQK   SG  Q+VIEG
Sbjct: 659  SVLVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKGGGSGTQQVVIEG 718

Query: 607  PLTEEYFKIRQYLYSQFYVL 548
            PL E+Y+KIR+ LYSQFY+L
Sbjct: 719  PLCEDYYKIRELLYSQFYLL 738


>gb|EOY23219.1| Cleavage and polyadenylation specificity factor 100 isoform 1
            [Theobroma cacao]
          Length = 742

 Score = 1016 bits (2627), Expect = 0.0
 Identities = 514/744 (69%), Positives = 613/744 (82%), Gaps = 13/744 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FL+DCGWND FDP  L PLS+VA ++DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDPSLLQPLSRVAPTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+FGL AP+Y TEPV+R GLLTMYD +LSR+ VS+F+LFTLDDID
Sbjct: 61   SHPDTLHLGALPYAMKQFGLSAPVYSTEPVFRLGLLTMYDQYLSRKQVSEFELFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L YSQNY L+GKGEGIVI P+ AG LLGGTVWKI+KDGEDVIYAVDFN RKE
Sbjct: 121  SAFQNVTRLTYSQNYHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVIYAVDFNRRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQ-TIDQEFLDMILRTLRGDGNVLLPVDT 2024
            +HLNGTVLESFVRPAVLITDAYNALNNQP ++Q   D++F+D I RTL   GNVLLPVDT
Sbjct: 181  KHLNGTVLESFVRPAVLITDAYNALNNQPPKQQRERDRDFVDTISRTLEAGGNVLLPVDT 240

Query: 2023 AGRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNA 1844
             GRVLELLL LE++WA   L YPI FLT V++ST+D+VKSFLEWMSD+IAKSFE SRDNA
Sbjct: 241  TGRVLELLLVLEEHWAMKSLNYPIFFLTYVSSSTIDYVKSFLEWMSDAIAKSFETSRDNA 300

Query: 1843 FQLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQF 1664
            F L+++ +L+S+ ELD++P+GPKVVLASMASLE GFS DIF++WA+D KNLV+FTERGQF
Sbjct: 301  FLLRHVTLLISKNELDKVPDGPKVVLASMASLEAGFSHDIFVEWAADVKNLVLFTERGQF 360

Query: 1663 GTLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSS 1484
            GTLARMLQA+P PKAVKV +S+RVPL GEEL A+EEEQNRLK EEALKA+  KEE+ K+S
Sbjct: 361  GTLARMLQADPPPKAVKVMMSRRVPLVGEELIAHEEEQNRLKKEEALKASLIKEEESKAS 420

Query: 1483 FVSD-NTSDPMVID------SVAGVVPEGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLK 1325
             V D ++SDPMVID      S+ G+   G  S +RD+L DGF+P S+S+APMFPF E   
Sbjct: 421  IVPDISSSDPMVIDTNNKHSSLDGLGQHG--SGYRDILIDGFVPPSTSVAPMFPFYENAS 478

Query: 1324 EWDEYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVH 1148
            +WD++GEVI+P++YVIK+ D                D  +  L+ DT  +KV+S+E+TV 
Sbjct: 479  DWDDFGEVINPDDYVIKDEDMDQAAMHVGGDMDGKVDEASASLIVDTTPSKVISNELTVQ 538

Query: 1147 VKCSLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQ 968
            VK SL Y+D+EGRSDGRS+KSI+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVYAPQ
Sbjct: 539  VKSSLIYMDYEGRSDGRSVKSILAHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQ 598

Query: 967  IEETIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDP 791
            IEETIDVTSDL AYKVQLSEKLMS+VLFKKLG+YEIAW+D +VGK + +MLSLLPL    
Sbjct: 599  IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENEMLSLLPLSTPA 658

Query: 790  PLHKSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQI 620
            P HKSV VGDL+LADFKQ LASKGV+ EF GG LRCG+Y+T+RKVG +SQK   SG  QI
Sbjct: 659  PPHKSVVVGDLKLADFKQFLASKGVKVEFAGGALRCGEYVTLRKVGFASQKGGGSGTQQI 718

Query: 619  VIEGPLTEEYFKIRQYLYSQFYVL 548
            +IEGPL E+Y+KIR YLYSQFY+L
Sbjct: 719  IIEGPLCEDYYKIRDYLYSQFYLL 742


>ref|XP_006587302.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like isoform X1 [Glycine max]
          Length = 739

 Score = 1014 bits (2623), Expect = 0.0
 Identities = 516/741 (69%), Positives = 604/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FLVDCGWND FDP  L PL++VAS++DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSHLQPLARVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP+Y TEPVYR GLLTMYD +LSR+ VS+FDLFTLDDID
Sbjct: 61   SHADTLHLGALPYAMKRLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ+VT L YSQN+  +GKGEGIVI P+ AG LLGGT+WKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVL SFVRPAVLITDAYNALNNQP RRQ  D+EF D++ +TLR  GNVLLPVDT 
Sbjct: 181  RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQN-DKEFGDILKKTLRAGGNVLLPVDTV 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LE YWA  +L YPI FLT VA+ST+D+VKSFLEWMSD+IAKSFE +R+N F
Sbjct: 240  GRVLELILMLELYWADENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKTRENIF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LKY+ +L+++ ELD  P+GPKVVLASMASLE GFS DIF++WA+D KNLV+FTERGQF 
Sbjct: 300  LLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQFA 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKV VSKRVPL GEEL AYEEEQNR+K +EALKA+  KEE++K+S 
Sbjct: 360  TLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRIK-KEALKASLMKEEELKTSH 418

Query: 1480 VSDN-TSDPMVIDSVAGVVPEGAS----SRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             +DN  SDPMVIDS    VP   +      +RD+  DGF+P S+S+AP+FP  E   EWD
Sbjct: 419  GADNDISDPMVIDSGNNHVPPEVTGPRGGGYRDIFIDGFVPPSTSVAPIFPCYENTSEWD 478

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P++YVIK+ D                D  A  L+ DTK +KVVSDE TV V+C
Sbjct: 479  DFGEVINPDDYVIKDEDMDQTAMHGGSDINGKLDEGAASLILDTKPSKVVSDERTVQVRC 538

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVYAPQIEE
Sbjct: 539  SLVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQIEE 598

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS+VLFKKLG+YEIAW+D  VGK + D LSLLP+    P H
Sbjct: 599  TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAVVGKTENDPLSLLPVSGAAPPH 658

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL+LAD KQ L+SKGVQ EF GG LRCG+Y+T+RKVGD+SQK   SG  QIVIE
Sbjct: 659  KSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQQIVIE 718

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR YLYSQFY+L
Sbjct: 719  GPLCEDYYKIRDYLYSQFYLL 739


>ref|XP_006490412.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like isoform X2 [Citrus sinensis]
          Length = 738

 Score = 1013 bits (2618), Expect = 0.0
 Identities = 507/740 (68%), Positives = 602/740 (81%), Gaps = 9/740 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSGV +E+PLSYL+++DGF FL+DCGWND FDP  L PLSKVAS++DAVL+
Sbjct: 1    MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP++ TEPVYR GLLTMYD +LSRR VS+FDLFTLDDID
Sbjct: 61   SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ+VT L YSQNY L+GKGEGIV+ P+ AG LLGGTVWKI+KDGEDVIYAVD+N RKE
Sbjct: 121  SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            +HLNGTVLESFVRPAVLITDAYNAL+NQP R+Q   + F D I +TLR  GNVLLPVD+A
Sbjct: 181  KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR--EMFQDAISKTLRAGGNVLLPVDSA 238

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL LE YWA+H L YPI FLT V++ST+D+VKSFLEWM DSI KSFE SRDNAF
Sbjct: 239  GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK++ +L+++ ELD  P+GPK+VLASMASLE GFS DIF++WASD KNLV+FTERGQFG
Sbjct: 299  LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVT+S+RVPL GEEL AYEEEQ RLK EEALKA+  KEE+ K+S 
Sbjct: 359  TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418

Query: 1480 VSDN--TSDPMVID---SVAGVVPEGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
              DN  + DPMVID   + A  V E    R+RD+L DGF+P S+S+APMFPF E   EWD
Sbjct: 419  GPDNNLSGDPMVIDANNANASAVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 478

Query: 1315 EYGEVIDPENYVIKNDXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKCS 1136
            ++GEVI+P++Y+IK++                D  +  L+ D K +KVVS+E+TV VKC 
Sbjct: 479  DFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCL 538

Query: 1135 LNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEET 956
            L ++D+EGR+DGRSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVY PQIEET
Sbjct: 539  LIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEET 598

Query: 955  IDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLHK 779
            IDVTSDL AYKVQLSEKLMS+VLFKKLG+YEIAW+D +VGK ++ MLSLLP+    P HK
Sbjct: 599  IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHK 658

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIEG 608
            SV VGDL++AD K  L+SKG+Q EF GG LRCG+Y+T+RKVG + QK   SG  QIVIEG
Sbjct: 659  SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 718

Query: 607  PLTEEYFKIRQYLYSQFYVL 548
            PL E+Y+KIR YLYSQFY+L
Sbjct: 719  PLCEDYYKIRAYLYSQFYLL 738


>ref|XP_006421948.1| hypothetical protein CICLE_v10004414mg [Citrus clementina]
            gi|568874619|ref|XP_006490411.1| PREDICTED: cleavage and
            polyadenylation specificity factor subunit 2-like isoform
            X1 [Citrus sinensis] gi|557523821|gb|ESR35188.1|
            hypothetical protein CICLE_v10004414mg [Citrus
            clementina]
          Length = 739

 Score = 1011 bits (2614), Expect = 0.0
 Identities = 506/741 (68%), Positives = 602/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSGV +E+PLSYL+++DGF FL+DCGWND FDP  L PLSKVAS++DAVL+
Sbjct: 1    MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP++ TEPVYR GLLTMYD +LSRR VS+FDLFTLDDID
Sbjct: 61   SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ+VT L YSQNY L+GKGEGIV+ P+ AG LLGGTVWKI+KDGEDVIYAVD+N RKE
Sbjct: 121  SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            +HLNGTVLESFVRPAVLITDAYNAL+NQP R+Q   + F D I +TLR  GNVLLPVD+A
Sbjct: 181  KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR--EMFQDAISKTLRAGGNVLLPVDSA 238

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL LE YWA+H L YPI FLT V++ST+D+VKSFLEWM DSI KSFE SRDNAF
Sbjct: 239  GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK++ +L+++ ELD  P+GPK+VLASMASLE GFS DIF++WASD KNLV+FTERGQFG
Sbjct: 299  LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVT+S+RVPL GEEL AYEEEQ RLK EEALKA+  KEE+ K+S 
Sbjct: 359  TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418

Query: 1480 VSDN--TSDPMVID----SVAGVVPEGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLKEW 1319
              DN  + DPMVID    + +  V E    R+RD+L DGF+P S+S+APMFPF E   EW
Sbjct: 419  GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478

Query: 1318 DEYGEVIDPENYVIKNDXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            D++GEVI+P++Y+IK++                D  +  L+ D K +KVVS+E+TV VKC
Sbjct: 479  DDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKC 538

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
             L ++D+EGR+DGRSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVY PQIEE
Sbjct: 539  LLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEE 598

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS+VLFKKLG+YEIAW+D +VGK ++ MLSLLP+    P H
Sbjct: 599  TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 658

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL++AD K  L+SKG+Q EF GG LRCG+Y+T+RKVG + QK   SG  QIVIE
Sbjct: 659  KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 718

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR YLYSQFY+L
Sbjct: 719  GPLCEDYYKIRAYLYSQFYLL 739


>ref|XP_002517902.1| cleavage and polyadenylation specificity factor, putative [Ricinus
            communis] gi|223542884|gb|EEF44420.1| cleavage and
            polyadenylation specificity factor, putative [Ricinus
            communis]
          Length = 740

 Score = 1011 bits (2613), Expect = 0.0
 Identities = 512/743 (68%), Positives = 606/743 (81%), Gaps = 12/743 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL+GV++E+PLSYL+++D F  L+DCGWND FDP  L PLS+VAS++DAVL+
Sbjct: 1    MGTSVQVTPLNGVYNENPLSYLISIDNFNLLIDCGWNDHFDPSLLQPLSRVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP+Y TEPVYR GLLTMYD +LSR++VS+FDLF+LDDID
Sbjct: 61   SHSDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKAVSEFDLFSLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQN+T L YSQN+ L+GKGEGIVI P+ AG LLGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  SAFQNITRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLD-MILRTLRGDGNVLLPVDT 2024
            RHLNGTVLESFVRPAVLITDAYNAL+NQP R+Q  D+EFL+  IL+TL   GNVLLPVDT
Sbjct: 181  RHLNGTVLESFVRPAVLITDAYNALSNQPPRQQR-DKEFLEKTILKTLEAGGNVLLPVDT 239

Query: 2023 AGRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNA 1844
            AGRVLELLL LEQ+WA   L YPI FLT V++ST+D+VKSFLEWMSDSIAKSFE SRDNA
Sbjct: 240  AGRVLELLLILEQFWAHRLLNYPIFFLTYVSSSTIDYVKSFLEWMSDSIAKSFETSRDNA 299

Query: 1843 FQLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQF 1664
            F LK++ +L+++ ELD  P  PKVVLASMASLE GFS DIF++WA+D KNLV+FTERGQF
Sbjct: 300  FLLKHVTLLINKNELDNAPNVPKVVLASMASLEAGFSHDIFVEWAADVKNLVLFTERGQF 359

Query: 1663 GTLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSS 1484
            GTLARMLQA+P PKAVKVT+S+RVPL G+EL AYEEEQ RLK EE L A+  KEE+ K S
Sbjct: 360  GTLARMLQADPPPKAVKVTMSRRVPLVGDELIAYEEEQKRLKKEEELNASMIKEEEAKVS 419

Query: 1483 FVSD-NTSDPMVID------SVAGVVPEGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLK 1325
               D N SDPM+ID      S+  V  +G    +RD+L DGF+P S+S+APMFPF E   
Sbjct: 420  HGPDSNLSDPMIIDASNNNASLDAVGSQGTG--YRDILFDGFVPPSTSVAPMFPFYENTT 477

Query: 1324 EWDEYGEVIDPENYVIKNDXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHV 1145
            EWD++GEVI+P++YVIK+D                D  +   + DTK +KVVS E+TV V
Sbjct: 478  EWDDFGEVINPDDYVIKDDDMDQPMHVGGDIDGKFDEGSASWILDTKPSKVVSSELTVQV 537

Query: 1144 KCSLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQI 965
            KCSL Y+D+EGRSDGRSIKSI+ HVAPLKLVLVHGSAE+TEHL+QHCLK    HVYAPQI
Sbjct: 538  KCSLIYMDYEGRSDGRSIKSILAHVAPLKLVLVHGSAESTEHLKQHCLKHVCPHVYAPQI 597

Query: 964  EETIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPP 788
            EETIDVTSDL AYKVQLSEKLMS+VLFKKLG++EIAW+D +VGK + D LSLLP+    P
Sbjct: 598  EETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDFEIAWVDAEVGKTESDALSLLPISTSAP 657

Query: 787  LHKSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIV 617
             HKSV VGDL++ADFKQ LASKGVQ EF GG LRCG+Y+T+RKVG+ +QK   SG  QIV
Sbjct: 658  PHKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGEYVTLRKVGNINQKGGGSGTQQIV 717

Query: 616  IEGPLTEEYFKIRQYLYSQFYVL 548
            IEGPL E+Y+KIR+YLYSQFY+L
Sbjct: 718  IEGPLCEDYYKIREYLYSQFYLL 740


>gb|ESW24245.1| hypothetical protein PHAVU_004G114000g [Phaseolus vulgaris]
          Length = 739

 Score = 1008 bits (2605), Expect = 0.0
 Identities = 517/741 (69%), Positives = 602/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++D F FL+DCGWND FDP  L PLS+VAS++DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDDFNFLIDCGWNDHFDPSLLQPLSRVASTIDAVLV 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH D  H+GALPYA+K+ GL AP+Y TEPVYR GLLTMYD +LSR+ VS+FDLFTLDDID
Sbjct: 61   SHADILHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQ+VT L YSQN+ L GKGEGIVI P+ AG LLGGTVWKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQSVTRLTYSQNHHLTGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGT L SFVRPAVLITDAYNALNNQP RRQ  D+EF D++ +TLR  GNVLLPVDTA
Sbjct: 181  RHLNGTALGSFVRPAVLITDAYNALNNQPYRRQN-DKEFGDILKKTLRAGGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LE YW+  +L YPI FLT VA+ST+D+VKSFLEWMSDSIAKSFE +R+N F
Sbjct: 240  GRVLELILMLESYWSDENLNYPIYFLTYVASSTIDYVKSFLEWMSDSIAKSFEKTRENIF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LKYI +L+++ ELD  PEGPKVVLASMASLE GFS DIF++WA+D KNLV+FTERGQF 
Sbjct: 300  LLKYITLLINKTELDNAPEGPKVVLASMASLEAGFSHDIFVEWANDMKNLVLFTERGQFA 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKV VSKRVPL GEEL AYEEEQNR+K +EALKA+  KEE++K+S 
Sbjct: 360  TLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRIK-KEALKASLMKEEELKTSH 418

Query: 1480 VSD-NTSDPMVIDSVAG-VVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             SD N SDPMV+DS    V PE A  R   +RD+  DGF+P S+S+APMFP  E   EWD
Sbjct: 419  GSDNNNSDPMVVDSGNNHVPPEVAGPRGGGYRDIYIDGFVPPSTSVAPMFPCYENTLEWD 478

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P++YVIK+ D                D  A  L+ DTK +KVVSDE TV VKC
Sbjct: 479  DFGEVINPDDYVIKDEDMNQIAMHGGGDINGKLDEGAAGLILDTKPSKVVSDERTVQVKC 538

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK    HV APQI+E
Sbjct: 539  SLVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVSAPQIDE 598

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPPLH 782
            TIDVTSDL AYKV LSEKLMS+VLFKKLG+YE+AW+D  VGK + D LS+LP+    P H
Sbjct: 599  TIDVTSDLCAYKVLLSEKLMSNVLFKKLGDYEVAWVDAVVGKTESDTLSVLPVSEAAPPH 658

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL+LAD KQ L+SKGVQ EF GG LRCG+Y+T+RKVGD++QK   SG  QIVIE
Sbjct: 659  KSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDATQKGGGSGAQQIVIE 718

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR YLYSQFY+L
Sbjct: 719  GPLCEDYYKIRDYLYSQFYLL 739


>ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like isoform 1 [Glycine max]
          Length = 738

 Score = 1004 bits (2595), Expect = 0.0
 Identities = 509/740 (68%), Positives = 599/740 (80%), Gaps = 9/740 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FLVDCGWND FDP  L PL++VAS++DAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSLLQPLARVASTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL AP+Y TEPVYR GLLTMYD +LSR+ VS+FDLFTLDDID
Sbjct: 61   SHADTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             +FQ+VT L YSQN+  +GKGEGIVI P+ AG LLGGT+WKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SSFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVL SFVRPAVLITDAYNALNNQP RRQ  D+EF D++ +TLR  GNVLLPVDT 
Sbjct: 181  RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQN-DKEFGDILKKTLREGGNVLLPVDTV 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LE YW   +L YPI FLT VA+ST+D+VKSFLEWMSD+IAKSFE +R+N F
Sbjct: 240  GRVLELILMLESYWTDENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKTRENIF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LKY+ +L+++ ELD  P+GPKVVLASMASLE GFS +IF++WA+D KNLV+FTERGQF 
Sbjct: 300  LLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIFVEWANDVKNLVLFTERGQFA 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKV VSKRV L GEEL AYEEEQNR+K +EALKA+  KEE+ K+S 
Sbjct: 360  TLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRIK-KEALKASLMKEEEFKTSH 418

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGAS----SRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             +D NTSD MVIDS    VP   S      +RD+  DGF+P  +S+APMFP  E   EWD
Sbjct: 419  GADNNTSDSMVIDSGNNHVPPEVSGPRGGGYRDIFIDGFVPPLTSVAPMFPCYENTSEWD 478

Query: 1315 EYGEVIDPENYVIKNDXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKCS 1136
            ++GEVI+P++YVIK++                D  A  L+ DTK +KVVSDE TV V+CS
Sbjct: 479  DFGEVINPDDYVIKDEDMDQTAMHGGDINGKLDEGAASLILDTKPSKVVSDERTVQVRCS 538

Query: 1135 LNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEET 956
            L Y+DFEGRSDGRSIK+I+ HVAPLKLVLVHGSAEATEHL+QHCLK    HVYAPQ+EET
Sbjct: 539  LVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQLEET 598

Query: 955  IDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPPLHK 779
            IDVTSDL AYKV LSEKLMS+VLFKKLG+YE+AW+D  VGK + D LSLLP+    P HK
Sbjct: 599  IDVTSDLCAYKVLLSEKLMSNVLFKKLGDYELAWVDAVVGKTENDPLSLLPVSGAAPPHK 658

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIEG 608
            SV VGDL+LAD KQ L+SKGVQ EF GG LRCG+Y+T+RKVGD+SQK   SG  QIVIEG
Sbjct: 659  SVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQQIVIEG 718

Query: 607  PLTEEYFKIRQYLYSQFYVL 548
            PL E+Y+KIR YLYSQFY+L
Sbjct: 719  PLCEDYYKIRDYLYSQFYLL 738


>gb|EXC19142.1| Cleavage and polyadenylation specificity factor subunit 2 [Morus
            notabilis]
          Length = 741

 Score = 1003 bits (2594), Expect = 0.0
 Identities = 511/742 (68%), Positives = 609/742 (82%), Gaps = 11/742 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FL+DCGWND  DP  L PL+KVAS+VDAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDHLDPSILQPLTKVASTVDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+FGL AP+Y TEPVYR GLLTMYD FL R+ VS+FDLFTLDDID
Sbjct: 61   SHADTLHLGALPYAMKQFGLSAPVYSTEPVYRLGLLTMYDQFLWRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L Y+QN+ L+GKGEGIVI+P+ AG LLGGTVWKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQNVTRLTYAQNHHLSGKGEGIVISPHVAGHLLGGTVWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            +HLNG    SFVRPAVLITDAYNALNNQP RRQ +D+EF D I +TLR DG VLLPVDTA
Sbjct: 181  KHLNGINPASFVRPAVLITDAYNALNNQPYRRQ-MDKEFTDTIKKTLRIDGKVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELL  LE  WA+  L+YPI FLT VA+ST+D+VKSFLEWMSDSIAKSFE +RDNAF
Sbjct: 240  GRVLELLQILESCWAEESLSYPIYFLTYVASSTIDYVKSFLEWMSDSIAKSFEKTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK++ +L+++ +L+  P+GPKVVLASMASLE GFS DIF++WA+D +NLV+FTERGQFG
Sbjct: 300  LLKHVTLLVNKTDLNNAPDGPKVVLASMASLEAGFSHDIFVEWATDARNLVLFTERGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVT+SKRVPL GEEL AYEEEQNR+K EEALKA+  KEE+ K+S 
Sbjct: 360  TLARMLQADPPPKAVKVTMSKRVPLVGEELIAYEEEQNRIKREEALKASLIKEEESKASH 419

Query: 1480 VSD-NTSDPMVID-SVAGVVPEGA---SSRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             +D N SDPMVID S+   +P+ A   S  +RDV  DGF+PSS+S+APMFPF E   EWD
Sbjct: 420  GTDINISDPMVIDASITNPLPDVAGPHSGGYRDVFIDGFVPSSTSVAPMFPFFETTSEWD 479

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P+NY+IK+ D                D  +  L+ DTK +KV+S+E+TV VKC
Sbjct: 480  DFGEVINPDNYIIKDEDMDQGAMHVSGDMDGKLDEASASLILDTKPSKVISNELTVPVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSD RSIKSI+ H+APLKLVLVHG+AEATEHL+QHC+KQ   HVYAPQIEE
Sbjct: 540  SLLYMDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCIKQVCPHVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLH 782
            TID+TSDL AYKVQLSEKLMS+VLFKKLG++E AW+D +VGK ++  LSLLPL +  P H
Sbjct: 600  TIDITSDLCAYKVQLSEKLMSNVLFKKLGDHETAWVDSEVGKTENGTLSLLPLSSAAPPH 659

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFM-GGHLRCGDYITVRKVGDSSQKS---GVHQIVI 614
            KSV VGDL++A+FKQ LA  GVQ EF  GG LRCG+Y+T+RKVGD+S K    G  QIVI
Sbjct: 660  KSVLVGDLKMANFKQFLADNGVQVEFAGGGALRCGEYVTLRKVGDASHKGGGPGTQQIVI 719

Query: 613  EGPLTEEYFKIRQYLYSQFYVL 548
            EGPL EEY+KIR+YLYSQF++L
Sbjct: 720  EGPLCEEYYKIREYLYSQFFLL 741


>ref|XP_006369487.1| Cleavage and polyadenylation specificity factor family protein
            [Populus trichocarpa] gi|550348036|gb|ERP66056.1|
            Cleavage and polyadenylation specificity factor family
            protein [Populus trichocarpa]
          Length = 740

 Score = 1001 bits (2588), Expect = 0.0
 Identities = 507/741 (68%), Positives = 605/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSGV++E+PLSYL+++DGF FL+DCGWND FDP  L PLSKVAS +DAVL+
Sbjct: 1    MGTSVQVTPLSGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASKIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            S+GD  H+GALP+A+K+FGL AP++ TEPVYR GLLTMYD   SR++VS+FDLF+LDDID
Sbjct: 61   SYGDMLHLGALPFAMKQFGLNAPVFSTEPVYRLGLLTMYDQSFSRKAVSEFDLFSLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQN T L YSQN+ L+GKGEGIVI P+ AG LLGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  SAFQNFTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVLESF RPAVLITDAYNALN+QPSR+Q  D++FL+ IL+TL G GNVLLPVD+A
Sbjct: 181  RHLNGTVLESFYRPAVLITDAYNALNSQPSRQQR-DKQFLETILKTLEGGGNVLLPVDSA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL LEQ+W +  L YPI FL+ V++ST+D++KSFLEWMSDSIAKSFE SRDNAF
Sbjct: 240  GRVLELLLILEQFWGQRFLNYPIFFLSYVSSSTIDYIKSFLEWMSDSIAKSFETSRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             +K++ +L+S+ ELD    GPKVVLAS+ASLE GFS DIF +WA+D KNLV+FTERGQFG
Sbjct: 300  LMKHVTLLISKDELDNASTGPKVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVK+T+S+RVPL G+EL AYEEEQ RLK EE LKA+  KEE+ K S 
Sbjct: 360  TLARMLQADPPPKAVKMTMSRRVPLVGDELIAYEEEQKRLKREEELKASLIKEEESKVSH 419

Query: 1480 VSDNT-SDPMVIDSVAGVVP----EGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
              DN  SDPMVIDS     P        S HRD+L DGF+P S+S+APMFPF E   EWD
Sbjct: 420  GPDNNLSDPMVIDSGNTHSPLDVVGSRGSGHRDILIDGFVPPSTSVAPMFPFYENSLEWD 479

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            E+GEVI+P++YV+++ D                D  +  L+ DTK +KVVS+E+TV VKC
Sbjct: 480  EFGEVINPDDYVVQDEDMDQAAMHVGADIDGKLDEGSASLILDTKPSKVVSNELTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+D+EGRSDGRSIKSI+ HVAPLKLV+VHGSAEATEHL+QH L      VYAPQIEE
Sbjct: 540  SLIYMDYEGRSDGRSIKSILTHVAPLKLVMVHGSAEATEHLKQHFLNIKNVQVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS+VLFKKLG+YE+AW+D +VGK ++ MLSLLP+ +  P H
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEVAWVDAEVGKTENGMLSLLPISSPAPPH 659

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL++ADFKQ LASKGVQ EF GG LRCG+Y+T+RKVG+ SQK   SG  QI+IE
Sbjct: 660  KSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGEYVTLRKVGNPSQKGGTSGTQQIIIE 719

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR+YLYSQFY+L
Sbjct: 720  GPLCEDYYKIREYLYSQFYLL 740


>ref|XP_002330904.1| predicted protein [Populus trichocarpa]
          Length = 740

 Score = 1001 bits (2588), Expect = 0.0
 Identities = 507/741 (68%), Positives = 605/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSGV++E+PLSYL+++DGF FL+DCGWND FDP  L PLSKVAS +DAVL+
Sbjct: 1    MGTSVQVTPLSGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASKIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            S+GD  H+GALP+A+K+FGL AP++ TEPVYR GLLTMYD   SR++VS+FDLF+LDDID
Sbjct: 61   SYGDMLHLGALPFAMKQFGLNAPVFSTEPVYRLGLLTMYDQSFSRKAVSEFDLFSLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQN T L YSQN+ L+GKGEGIVI P+ AG LLGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  SAFQNFTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVLESF RPAVLITDAYNALN+QPSR+Q  D++FL+ IL+TL G GNVLLPVD+A
Sbjct: 181  RHLNGTVLESFYRPAVLITDAYNALNSQPSRQQR-DKQFLETILKTLEGGGNVLLPVDSA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL LEQ+W +  L YPI FL+ V++ST+D++KSFLEWMSDSIAKSFE SRDNAF
Sbjct: 240  GRVLELLLILEQFWGQRFLNYPIFFLSYVSSSTIDYIKSFLEWMSDSIAKSFETSRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             +K++ +L+S+ ELD    GPKVVLAS+ASLE GFS DIF +WA+D KNLV+FTERGQFG
Sbjct: 300  LMKHVTLLISKDELDNASTGPKVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVK+T+S+RVPL G+EL AYEEEQ RLK EE LKA+  KEE+ K S 
Sbjct: 360  TLARMLQADPPPKAVKMTMSRRVPLVGDELIAYEEEQKRLKREEELKASLIKEEESKVSH 419

Query: 1480 VSDNT-SDPMVIDSVAGVVP----EGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
              DN  SDPMVIDS     P        S HRD+L DGF+P S+S+APMFPF E   EWD
Sbjct: 420  GPDNNLSDPMVIDSGNTHSPLDVVGSRGSGHRDILIDGFVPPSTSVAPMFPFYENSLEWD 479

Query: 1315 EYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            E+GEVI+P++YV+++ D                D  +  L+ DTK +KVVS+E+TV VKC
Sbjct: 480  EFGEVINPDDYVVQDEDMDQAAMHVGADIDGKLDEGSASLILDTKPSKVVSNELTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+D+EGRSDGRSIKSI+ HVAPLKLV+VHGSAEATEHL+QH L      VYAPQIEE
Sbjct: 540  SLIYMDYEGRSDGRSIKSILTHVAPLKLVMVHGSAEATEHLKQHFLNIKNVQVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS+VLFKKLG+YE+AW+D +VGK ++ MLSLLP+ +  P H
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEVAWVDAEVGKTENGMLSLLPISSPAPPH 659

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIVIE 611
            KSV VGDL++ADFKQ LASKGVQ EF GG LRCG+Y+T+RKVG+ SQK   SG  QI+IE
Sbjct: 660  KSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGEYVTLRKVGNPSQKGGASGTQQIIIE 719

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL E+Y+KIR+YLYSQFY+L
Sbjct: 720  GPLCEDYYKIREYLYSQFYLL 740


>ref|XP_004234405.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Solanum lycopersicum]
          Length = 739

 Score =  999 bits (2584), Expect = 0.0
 Identities = 508/741 (68%), Positives = 601/741 (81%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FLVDCGWND FD   L PLS+VAS+VDAVLI
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDTSLLQPLSRVASTVDAVLI 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL APIY TEPVYR GLLTMYD +LSR+ VS+FDLFTLDDID
Sbjct: 61   SHSDTFHLGALPYAMKQLGLSAPIYATEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L YSQN+ ++GKGEGIVI P  AG LLGGT W+I+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQNVTRLTYSQNHYMSGKGEGIVIAPLVAGHLLGGTTWRITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVLESFVRPAVLITDA+NALNNQP RRQ  DQEFLD I RTL   GNVLLPVDTA
Sbjct: 181  RHLNGTVLESFVRPAVLITDAFNALNNQPPRRQR-DQEFLDAIERTLNVGGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LEQ+W +  L+ PI FL+ V++ST+D+VKSFLEWMSDSIAKSFEH+RDNAF
Sbjct: 240  GRVLELILTLEQHWTQKQLSTPIYFLSYVSSSTIDYVKSFLEWMSDSIAKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             L+ I +++++  L+  P GPKVV+ASMASLE GFS D+F++WA+DPKNLV+FTERGQFG
Sbjct: 300  LLRKIKLVINKSALEEAP-GPKVVMASMASLEAGFSHDLFVEWAADPKNLVMFTERGQFG 358

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLAR+LQ++P PKAVKVT+S+R+PL GEEL AYEEEQNR+K EEALKA   KEE+ K+S 
Sbjct: 359  TLARILQSDPPPKAVKVTMSRRIPLVGEELAAYEEEQNRIKREEALKATLVKEEESKASV 418

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGASSRH----RDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             ++  T DPM +D+        AS  H    +DVL DGF+ +SSSIAPMFPF +   EWD
Sbjct: 419  GAEVVTDDPMAVDTNVTHPSSNASGLHSGAFKDVLIDGFVTTSSSIAPMFPFYDNTSEWD 478

Query: 1315 EYGEVIDPENYVIKND-XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P++YV+K+D                 D  + +L+ DT  +KV S E+TV VKC
Sbjct: 479  DFGEVINPDDYVVKDDNMEQSFMHVDGDLNGKLDEGSANLILDTTPSKVESSELTVQVKC 538

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRSIKSI+ HVAPLKLVLVHGSAEATEHL+QHCLK     VYAPQ+EE
Sbjct: 539  SLLYMDFEGRSDGRSIKSILAHVAPLKLVLVHGSAEATEHLKQHCLKHVCPQVYAPQLEE 598

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS VLFKKLG+YEIAW+D +VGK + DM SLLPL    P H
Sbjct: 599  TIDVTSDLCAYKVQLSEKLMSQVLFKKLGDYEIAWVDAEVGKTENDMFSLLPLSGPSPPH 658

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQKSG---VHQIVIE 611
            K+V VGDL+++DFKQ LASKGVQ EF GG LRCG+Y+T+RKVGD+SQK G   + QIV+E
Sbjct: 659  KTVLVGDLKMSDFKQFLASKGVQVEFGGGALRCGEYVTIRKVGDASQKVGGAAIQQIVLE 718

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL+EEY+KIR+YLYS FY L
Sbjct: 719  GPLSEEYYKIREYLYSHFYSL 739


>ref|XP_006353867.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Solanum tuberosum]
          Length = 739

 Score =  994 bits (2570), Expect = 0.0
 Identities = 505/741 (68%), Positives = 600/741 (80%), Gaps = 10/741 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV +E+PLSYL+++DGF FLVDCGWND FD   L PLS+VAS+VDAVLI
Sbjct: 1    MGTSVQVTPLCGVFNENPLSYLVSIDGFNFLVDCGWNDHFDTSLLQPLSRVASTVDAVLI 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K+ GL APIY TEPVYR GLLTMYD +LSR+ VS+FDLFTLDDID
Sbjct: 61   SHSDTFHLGALPYAMKQLGLSAPIYATEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L YSQN+ ++GKGEGIVI P  AG LLGGT W+I+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQNVTRLTYSQNHYMSGKGEGIVIAPLVAGHLLGGTTWRITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGTVLESFVRPAVLITDA+NALNNQP RRQ  DQEFLD I RT+   GNVLLPVDTA
Sbjct: 181  RHLNGTVLESFVRPAVLITDAFNALNNQPPRRQR-DQEFLDAIERTVNVGGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+L LEQ+W +  L+ PI FL+ V++ST+D+VKSFLEWMSDSIAKSFEH+RDNAF
Sbjct: 240  GRVLELILTLEQHWTQKQLSTPIYFLSYVSSSTIDYVKSFLEWMSDSIAKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             L+ I +++++  L+  P G KVV+ASMASLE GFS D+F++WA+DPKNLV+FTERGQFG
Sbjct: 300  LLRKIKLVINKSALEEAP-GSKVVMASMASLEAGFSHDLFVEWAADPKNLVMFTERGQFG 358

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLAR+LQ++P PKAVKVT+S+R+PL GEEL AYEEEQNR+K EEALKA   KEE+ K+S 
Sbjct: 359  TLARILQSDPPPKAVKVTMSRRIPLVGEELAAYEEEQNRIKREEALKATLVKEEESKASV 418

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGASSRH----RDVLCDGFIPSSSSIAPMFPFDEGLKEWD 1316
             ++  T+DPM +D+        AS  H    +DVL DGF+ +SSS+APMFPF +   EWD
Sbjct: 419  GAEVVTNDPMAVDTNVTHPSSNASGLHSGAFKDVLIDGFVTTSSSVAPMFPFYDNTSEWD 478

Query: 1315 EYGEVIDPENYVIKND-XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            ++GEVI+P++YV+K+D                 D  + +L+ DT  +KV S E+TV VKC
Sbjct: 479  DFGEVINPDDYVVKDDNMEQSLMHVDGDLNGKLDEGSANLILDTTPSKVESSELTVQVKC 538

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRSIKSI+ HVAPLKLVLVHGSAEATEHL+QHCLK     VYAPQ+EE
Sbjct: 539  SLLYMDFEGRSDGRSIKSILAHVAPLKLVLVHGSAEATEHLKQHCLKHVCPQVYAPQLEE 598

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKND-DMLSLLPLVNDPPLH 782
            TIDVTSDL AYKVQLSEKLMS VLFKKLG+YEIAW+D +VGK + DM SLLPL    P H
Sbjct: 599  TIDVTSDLCAYKVQLSEKLMSQVLFKKLGDYEIAWVDAEVGKTENDMFSLLPLSGPAPPH 658

Query: 781  KSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQKSG---VHQIVIE 611
            K+V VGDL+++DFKQ LASKGVQ EF GG LRCG+Y+T+RKVGD+SQK G   + QIV+E
Sbjct: 659  KTVLVGDLKMSDFKQFLASKGVQVEFGGGALRCGEYVTIRKVGDASQKVGGAAIQQIVLE 718

Query: 610  GPLTEEYFKIRQYLYSQFYVL 548
            GPL+EEY+KIR+YLYS FY L
Sbjct: 719  GPLSEEYYKIREYLYSHFYSL 739


>ref|NP_001063978.1| Os09g0569400 [Oryza sativa Japonica Group]
            gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ RecName:
            Full=Cleavage and polyadenylation specificity factor
            subunit 2; AltName: Full=Cleavage and polyadenylation
            specificity factor 100 kDa subunit; Short=CPSF 100 kDa
            subunit gi|52077178|dbj|BAD46223.1| putative cleavage and
            polyadenylation specificity factor [Oryza sativa Japonica
            Group] gi|113632211|dbj|BAF25892.1| Os09g0569400 [Oryza
            sativa Japonica Group]
          Length = 738

 Score =  993 bits (2567), Expect = 0.0
 Identities = 496/739 (67%), Positives = 595/739 (80%), Gaps = 8/739 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSG + E PL YLL VDGF FL+DCGW D  DP  L PL+KVA ++DAVL+
Sbjct: 1    MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALPYA+K  GL AP+Y TEPV+R G+LT+YD+F+SRR VSDFDLFTLDDID
Sbjct: 61   SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNV  LKYSQN+ L  KGEGIVI P+ AG  LGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGT L SFVRPAVLITDAYNALNN   +RQ  DQ+F+D +++ L G G+VLLP+DTA
Sbjct: 181  RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQ-DQDFIDALVKVLTGGGSVLLPIDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLE+LL LEQYWA+ HL YPI FLTNV+TSTVD+VKSFLEWM+DSI+KSFEH+RDNAF
Sbjct: 240  GRVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK +  ++++ EL+++ + PKVVLASMASLE GFS DIF+D A++ KNLV+FTE+GQFG
Sbjct: 300  LLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQ +P PKAVKVT+SKR+PL G+ELKAYEEEQ R+K EEALKA+ +KEE+ K+S 
Sbjct: 360  TLARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASL 419

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWDE 1313
             S+   SDPMVID+     P  A S+   + D+L DGF+P SSS+APMFPF E   EWD+
Sbjct: 420  GSNAKASDPMVIDASTSRKPSNAGSKFGGNVDILIDGFVPPSSSVAPMFPFFENTSEWDD 479

Query: 1312 YGEVIDPENYVIKND--XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            +GEVI+PE+Y++K +                  D  +  LL D+  +KV+S+E+TV VKC
Sbjct: 480  FGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRS+KS+I HVAPLKLVLVHGSAEATEHL+ HC K S  HVYAPQIEE
Sbjct: 540  SLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDDMLSLLPLVNDPPLHK 779
            TIDVTSDL AYKVQLSEKLMS+V+ KKLGE+EIAW+D +VGK DD L+LLP  + P  HK
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDDKLTLLPPSSTPAAHK 659

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK--SGVHQIVIEGP 605
            SV VGDL+LADFKQ LA+KG+Q EF GG LRCG+YIT+RK+GD+ QK  +G  QIVIEGP
Sbjct: 660  SVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITLRKIGDAGQKGSTGSQQIVIEGP 719

Query: 604  LTEEYFKIRQYLYSQFYVL 548
            L E+Y+KIR+ LYSQFY+L
Sbjct: 720  LCEDYYKIRELLYSQFYLL 738


>gb|EMJ21437.1| hypothetical protein PRUPE_ppa001928mg [Prunus persica]
          Length = 740

 Score =  985 bits (2547), Expect = 0.0
 Identities = 501/743 (67%), Positives = 599/743 (80%), Gaps = 12/743 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPL GV++E+PLSYL+++DGF FL+DCGWND FDP  L PLS+VAS+VDAVL+
Sbjct: 1    MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLEPLSRVASTVDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH DT H+GALP+A+K+ GL A +Y TEPVYR GLLTMYD +LSR+ VSDFDLFTLDDID
Sbjct: 61   SHPDTLHLGALPFAMKQLGLSAVVYSTEPVYRLGLLTMYDQYLSRKQVSDFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNVT L Y+QN+ L+GKGEGIVI+P+ +G LLGGTVWKI+KDGEDVIYAVDFNHRKE
Sbjct: 121  SAFQNVTRLTYAQNHHLSGKGEGIVISPHVSGHLLGGTVWKITKDGEDVIYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            +HLNG    SFVRPAVLITDAYNALNNQP RRQ  D+EF D I +TLR DGNVLLPVDTA
Sbjct: 181  KHLNGINQASFVRPAVLITDAYNALNNQPYRRQK-DKEFTDTIKKTLRSDGNVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLEL+  LE  WA  +L YPI FLT VA+ST+D+VKSFLEWMSDSIAKSFE +R+NAF
Sbjct: 240  GRVLELVQILESCWADENLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEKTRENAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             LK I +L+++ ELD  P+GPKVVLASMASLE GFS DIF++WA+DPKNLV+FTER QFG
Sbjct: 300  ILKRITLLVNKSELDNAPDGPKVVLASMASLEAGFSHDIFVEWATDPKNLVLFTERAQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQA+P PKAVKVT+S+RVPL GEEL AYEEEQNR++ +EALKA+  KEE+ KS+ 
Sbjct: 360  TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQNRIRKDEALKASLIKEEESKSAQ 419

Query: 1480 VSD-NTSDPMVIDS------VAGVVPEGASSRHRDVLCDGFIPSSSSIAPMFPFDEGLKE 1322
             +D +TSDP V+D+      +    P G    +RD+L DGF P S+S APMFPF E   +
Sbjct: 420  GADVSTSDPTVVDASNTHSLLDAAGPHGGG--YRDMLIDGFTPPSTSAAPMFPFYENNSD 477

Query: 1321 WDEYGEVIDPENYVIKN-DXXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHV 1145
            WD++GEVI+P++YVIK+ D                D  +  L+ DT+ +KVV+ E+TV V
Sbjct: 478  WDDFGEVINPDDYVIKDADMDQGAMHVGGDMDGKLDEGSASLILDTRPSKVVATELTVQV 537

Query: 1144 KCSLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQI 965
            KCSL Y+DFEGRSD RSIKSI+ H+APLKLVLVHG+AEATEHL+QHCL     HVYAPQI
Sbjct: 538  KCSLIYMDFEGRSDARSIKSILSHMAPLKLVLVHGTAEATEHLKQHCLTHVCPHVYAPQI 597

Query: 964  EETIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDD-MLSLLPLVNDPP 788
            EETIDVTSDL AYKVQLSEKLMS+VLFKKLG+YEIAW+D + GK ++  LSLLP+    P
Sbjct: 598  EETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDSEAGKTENGALSLLPISTPAP 657

Query: 787  LHKSVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK---SGVHQIV 617
             H+SV VGDL++A+FKQ L+  GVQ EF GG LRCG+Y+T+RKVGD+S K   SG  QIV
Sbjct: 658  PHESVLVGDLKMANFKQFLSDNGVQVEFAGGALRCGEYVTLRKVGDASHKGGGSGTQQIV 717

Query: 616  IEGPLTEEYFKIRQYLYSQFYVL 548
            IEGPL E+Y+KIR+YLYSQFY+L
Sbjct: 718  IEGPLCEDYYKIREYLYSQFYLL 740


>ref|XP_003565596.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Brachypodium distachyon]
          Length = 738

 Score =  983 bits (2540), Expect = 0.0
 Identities = 490/739 (66%), Positives = 597/739 (80%), Gaps = 8/739 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSG + E PL YLL VDGF FL+DCGW D  DP  L PL++VA ++DAVL+
Sbjct: 1    MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDHCDPSLLQPLARVAPTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH D  H+GALPYA+K  GL AP+Y TEPV+R GLLTMYD+FLSR  V+DFDLFTLDDID
Sbjct: 61   SHPDIMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDYFLSRWQVADFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNV  LKYSQN+ L  KGEGIVI P+ +G LLGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVSGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGT L SFVRPAVLITDAYNALNNQ  +RQ  DQ+F+D +++ L   G+VLLPVDTA
Sbjct: 181  RHLNGTALGSFVRPAVLITDAYNALNNQVYKRQQ-DQDFIDSMVKVLASGGSVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL +EQYWA+ HL YPI FLTNV+TSTVD+VKSFLEWMSDSI+KSFEH+RDNAF
Sbjct: 240  GRVLELLLIMEQYWAQRHLVYPIYFLTNVSTSTVDYVKSFLEWMSDSISKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             L+Y++++++++EL+++ + PKVVLASMASLE GFS DIF++ A++ KNLV+FTE+GQFG
Sbjct: 300  LLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEAKNLVLFTEKGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQ +P PKAVKVT+ KR+PL G+ELKAYEEEQ R+K EE LKA+ SK+E++K+S 
Sbjct: 360  TLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERIKKEELLKASLSKDEELKASH 419

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWDE 1313
             S+   SDPMV+D+ +      A S    + D+L DGF+PS++S APMFPF E   +WD+
Sbjct: 420  GSNAKASDPMVVDASSSRKSSNAGSHVGGNVDILIDGFVPSTTSFAPMFPFFENTADWDD 479

Query: 1312 YGEVIDPENYVIKND--XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            +GEVI+P++Y++K D                  D  +  LL D+  +KV+S+E+TV VKC
Sbjct: 480  FGEVINPDDYMMKQDEMDNNMMLGAGDGMDGKLDEGSARLLLDSAPSKVISNEMTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRS+KS+I HVAPLKLVLVHGSAEATEHL+ HC K S  HVYAPQIEE
Sbjct: 540  SLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCAKNSDLHVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDDMLSLLPLVNDPPLHK 779
            TIDVTSDL AYKVQLSEKLMS+V+ KKLGE+EIAW+D +VGK D+ L+LLP  + P  HK
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKVDEKLNLLPPSSTPSAHK 659

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQK--SGVHQIVIEGP 605
            SV VGDL+LADFKQ LA+KG+Q EF GG LRCG+YITVRK+GDS+QK  +G  QIVIEGP
Sbjct: 660  SVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITVRKIGDSNQKGSTGSQQIVIEGP 719

Query: 604  LTEEYFKIRQYLYSQFYVL 548
            L E+Y+KIR+ LYSQF++L
Sbjct: 720  LCEDYYKIRELLYSQFFLL 738


>ref|XP_003578687.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            2-like [Brachypodium distachyon]
          Length = 738

 Score =  981 bits (2536), Expect = 0.0
 Identities = 489/739 (66%), Positives = 596/739 (80%), Gaps = 8/739 (1%)
 Frame = -2

Query: 2740 MGTSVHVTPLSGVHSESPLSYLLTVDGFTFLVDCGWNDFFDPDQLLPLSKVASSVDAVLI 2561
            MGTSV VTPLSG + E PL YLL VDGF FL+DCGW D  DP  L PL++VA ++DAVL+
Sbjct: 1    MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDHCDPSLLQPLARVAPTIDAVLL 60

Query: 2560 SHGDTSHIGALPYAVKKFGLCAPIYCTEPVYRTGLLTMYDHFLSRRSVSDFDLFTLDDID 2381
            SH D  H+GALPYA+K  GL AP+Y TEPV+R GLLTMYD+FLSR  V+DFDLFTLDDID
Sbjct: 61   SHPDIMHLGALPYAMKHLGLSAPVYVTEPVFRLGLLTMYDYFLSRWQVADFDLFTLDDID 120

Query: 2380 VAFQNVTSLKYSQNYDLAGKGEGIVITPYAAGRLLGGTVWKISKDGEDVIYAVDFNHRKE 2201
             AFQNV  LKYSQN+ L  KGEGIVI P+ +G LLGGTVWKI+KDGEDV+YAVDFNHRKE
Sbjct: 121  AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVSGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 2200 RHLNGTVLESFVRPAVLITDAYNALNNQPSRRQTIDQEFLDMILRTLRGDGNVLLPVDTA 2021
            RHLNGT L SFVRPAVLITDAYNALNNQ  +RQ  DQ+F+D +++ L   G+VLLPVDTA
Sbjct: 181  RHLNGTALGSFVRPAVLITDAYNALNNQVYKRQQ-DQDFIDSMVKVLASGGSVLLPVDTA 239

Query: 2020 GRVLELLLCLEQYWAKHHLTYPIAFLTNVATSTVDFVKSFLEWMSDSIAKSFEHSRDNAF 1841
            GRVLELLL +EQYWA+ HL YPI FLTNV+TSTVD+VKSFLEWMSDSI+KSFEH+RDNAF
Sbjct: 240  GRVLELLLIMEQYWAQRHLVYPIYFLTNVSTSTVDYVKSFLEWMSDSISKSFEHTRDNAF 299

Query: 1840 QLKYINVLLSRKELDRMPEGPKVVLASMASLEEGFSRDIFIDWASDPKNLVIFTERGQFG 1661
             L+Y++++++++EL+++ + PKVVLASMASLE GFS DIF++ A++ KNLV+FTE+GQFG
Sbjct: 300  LLRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEAKNLVLFTEKGQFG 359

Query: 1660 TLARMLQAEPAPKAVKVTVSKRVPLRGEELKAYEEEQNRLKMEEALKANCSKEEDIKSSF 1481
            TLARMLQ +P PKAVKVT+ KR+PL G+ELKAYEEEQ R+K EE LKA+ SK+E++K+S 
Sbjct: 360  TLARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERIKKEELLKASLSKDEELKASH 419

Query: 1480 VSD-NTSDPMVIDSVAGVVPEGASSR---HRDVLCDGFIPSSSSIAPMFPFDEGLKEWDE 1313
             S+   SDPMV+D+ +      A S    + D+L DGF+PS++S+APMFPF E   +WD+
Sbjct: 420  GSNAKASDPMVVDASSSRKSSNAGSHVGGNVDILIDGFVPSTTSVAPMFPFFENTADWDD 479

Query: 1312 YGEVIDPENYVIKND--XXXXXXXXXXXXXXXEDNKAGDLLADTKATKVVSDEVTVHVKC 1139
            +GEVI+P++Y++K D                  D  +  LL D+  +KV+S+E+TV VKC
Sbjct: 480  FGEVINPDDYMMKQDEMDNNMMLGAGDGMDGKLDEGSARLLLDSAPSKVISNEMTVQVKC 539

Query: 1138 SLNYVDFEGRSDGRSIKSIIGHVAPLKLVLVHGSAEATEHLRQHCLKQSTSHVYAPQIEE 959
            SL Y+DFEGRSDGRS+KS+I HVAPLKLVLVHGSAEATEHL+ HC K S  HVYAPQIEE
Sbjct: 540  SLVYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCAKNSDLHVYAPQIEE 599

Query: 958  TIDVTSDLSAYKVQLSEKLMSSVLFKKLGEYEIAWIDGQVGKNDDMLSLLPLVNDPPLHK 779
            TIDVTSDL AYKVQLSEKLMS+V+ KKLGE+EIAW+D +VGK D+ L+LLP  + P  HK
Sbjct: 600  TIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKVDEKLNLLPPSSTPSAHK 659

Query: 778  SVFVGDLRLADFKQLLASKGVQAEFMGGHLRCGDYITVRKVGDSSQKSGV--HQIVIEGP 605
            SV VGDL+LADFKQ LA+KG+Q EF GG LRCG+YITVRK+GDS+QK      QIVIEGP
Sbjct: 660  SVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITVRKIGDSNQKGSTVSQQIVIEGP 719

Query: 604  LTEEYFKIRQYLYSQFYVL 548
            L E+Y+KIR+ LYSQF++L
Sbjct: 720  LCEDYYKIRELLYSQFFLL 738


Top