BLASTX nr result

ID: Forsythia21_contig00003339 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00003339
         (2615 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011096672.1| PREDICTED: 30-kDa cleavage and polyadenylati...   688   0.0  
ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylati...   668   0.0  
ref|XP_012830213.1| PREDICTED: 30-kDa cleavage and polyadenylati...   647   0.0  
ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylati...   638   e-180
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   637   e-179
ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylati...   636   e-179
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   636   e-179
ref|XP_009628296.1| PREDICTED: cleavage and polyadenylation spec...   634   e-178
ref|XP_011011896.1| PREDICTED: 30-kDa cleavage and polyadenylati...   632   e-178
ref|XP_009768488.1| PREDICTED: cleavage and polyadenylation spec...   632   e-178
ref|XP_009789024.1| PREDICTED: cleavage and polyadenylation spec...   631   e-178
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   630   e-177
ref|XP_011046740.1| PREDICTED: 30-kDa cleavage and polyadenylati...   629   e-177
ref|XP_008445183.1| PREDICTED: cleavage and polyadenylation spec...   628   e-177
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   624   e-176
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   623   e-175
gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sin...   622   e-175
ref|NP_001280880.1| cleavage and polyadenylation specificity fac...   621   e-175
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   620   e-174
ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylati...   620   e-174

>ref|XP_011096672.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Sesamum indicum]
          Length = 679

 Score =  688 bits (1776), Expect = 0.0
 Identities = 384/686 (55%), Positives = 425/686 (61%), Gaps = 16/686 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            M+DGEGGLSFDFEGG D   +  TA    I+SS     AS                   A
Sbjct: 1    MDDGEGGLSFDFEGGLDTGPSHPTASVPVIKSSGDANTASAAAANANYPSAVPTPATQAA 60

Query: 2154 DGTGEG-RRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMY 1978
            +G G G RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++DC+Y
Sbjct: 61   EGMGGGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 120

Query: 1977 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFL 1798
            KHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVL+KIQQ +S+NYGN  RF 
Sbjct: 121  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLRKIQQ-SSHNYGN--RFF 177

Query: 1797 QNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANM-RXXXXXXXXXXXXXXXXXXXXX 1621
            QNRNANYAQQTEK                  TES N+ R                     
Sbjct: 178  QNRNANYAQQTEKSQFPQGPNEANQVAKGSTTESGNLIRPPQGQLSQQTGNQGQLQNLPN 237

Query: 1620 XXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVENV 1441
                 A+R ATSLPQGTSRYF+VKSCN+ENLELSVQQGVWATQRSNE KLNEAF+SV+NV
Sbjct: 238  SQQNQASRNATSLPQGTSRYFVVKSCNKENLELSVQQGVWATQRSNEAKLNEAFESVDNV 297

Query: 1440 ILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKTR 1261
            ILIFSVNKTRHFQGCAKMTSRIGGS  GGNWKH HG+AHYGRNFAVKWLKL ELSF+KTR
Sbjct: 298  ILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHTHGSAHYGRNFAVKWLKLGELSFNKTR 357

Query: 1260 HLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKGV 1081
            HLRNP+NENL VKISRDCQELEPS+GEQLASLLYLEPDS+LMA+SL           KGV
Sbjct: 358  HLRNPYNENLQVKISRDCQELEPSVGEQLASLLYLEPDSDLMAVSLAAESKREEEKAKGV 417

Query: 1080 KPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAHG 901
              +NG+ENPDIVPF                S                 MMWPPHMPLA G
Sbjct: 418  NLENGNENPDIVPFEENEEEEEDESEEEDESLGQVFGAQGRGRGRG--MMWPPHMPLARG 475

Query: 900  ARPFPGIQGFPPNMMGGDGF----------PMPDLFGMAPRGFGRYGPR----FSGELXX 763
            AR F G++GFPPN++ GDGF          PMPD FGMAPR  G Y PR    F+G    
Sbjct: 476  ARAFHGMRGFPPNLVAGDGFSYGPMNPDGFPMPDPFGMAPRSLGPYAPRFSGDFAGPTSG 535

Query: 762  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGMXXXXXXXXXXXXXXNRP 583
                                                                     NR 
Sbjct: 536  MMFPVRPSGGFNMMMGPGRPPFMGGMGVGAAPAARANRPLGLAPFYMPPPTQPSQNSNRA 595

Query: 582  KRDQKVLSGGYRNDRFNSGSDAGLIGGSNDEAPNQPRGKAQSEDHYGAGNDFRKDESESE 403
            KRDQK  +   RN+      D  + G + DE   Q R K Q +DHY AG+  R DES+SE
Sbjct: 596  KRDQKAPTSD-RNEGSAHAKDQEMAGSAGDEGQYQQRAKVQ-QDHYSAGSSHRNDESDSE 653

Query: 402  DEAPRRSRHGEGKKKRHSMEVDATTL 325
            DEAPRRSRHGEGKKKR S+E D+  +
Sbjct: 654  DEAPRRSRHGEGKKKRRSLEADSNAV 679


>ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Sesamum indicum]
          Length = 688

 Score =  668 bits (1723), Expect = 0.0
 Identities = 347/533 (65%), Positives = 373/533 (69%), Gaps = 12/533 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            M+DGEGGLSFDFEGG D      TA    IQSS     AS                   A
Sbjct: 1    MDDGEGGLSFDFEGGLDTGPAHPTASVPVIQSSADAKTASAASGNPNNPSAGLVPAAQTA 60

Query: 2154 DGTGEG-RRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMY 1978
            +G G G RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++DC+Y
Sbjct: 61   EGMGGGARRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 120

Query: 1977 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFL 1798
            KHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQQLTSYN+GN+N+F 
Sbjct: 121  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLTSYNHGNTNKFF 180

Query: 1797 QNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANM-RXXXXXXXXXXXXXXXXXXXXX 1621
            QNRN  Y QQTEK                   ES+N+ +                     
Sbjct: 181  QNRNTTYTQQTEKTQLPQGPNGVNQAGKTNPIESSNINQQAQVQQSQQQGSQGQIQNTPG 240

Query: 1620 XXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVENV 1441
                 A+RTAT LPQGTSRYF+VKSCNRENLELSVQQGVWATQRSNE KLNEAF+SVENV
Sbjct: 241  GQQNQASRTATPLPQGTSRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFESVENV 300

Query: 1440 ILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKTR 1261
            ILIFSVNKTRHFQGCAKMTS+IGGS  GGNWKHAHGTAHYGRNFAVKWLKLCELSF KTR
Sbjct: 301  ILIFSVNKTRHFQGCAKMTSKIGGSVGGGNWKHAHGTAHYGRNFAVKWLKLCELSFDKTR 360

Query: 1260 HLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKGV 1081
            HL+NP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDS+LMA+SL           KGV
Sbjct: 361  HLKNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSDLMAVSLAAELKREEEKAKGV 420

Query: 1080 KPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAHG 901
              DNG+ENPDIVPF                                 GMMW PHMPLA G
Sbjct: 421  NLDNGTENPDIVPFEDNEEEEEEESEEEDE--SPGQVFGAQGRGRGRGMMWLPHMPLARG 478

Query: 900  ARPFPGIQGFPPNMMGG----------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            +RPF GI+GFPPNMM G          DGFPMPD FGMAPRGFG YGPRFSG+
Sbjct: 479  SRPFSGIRGFPPNMMSGDGFSYGPVNPDGFPMPDPFGMAPRGFGPYGPRFSGD 531



 Score = 97.8 bits (242), Expect = 4e-17
 Identities = 53/91 (58%), Positives = 60/91 (65%), Gaps = 3/91 (3%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSD---AGLIGGSNDEAPNQPRGKAQSEDHYGAGNDFRKD 418
           R KRD K      +ND  + G     +G  GG  DE  N PR KAQ EDHY AGN +R D
Sbjct: 598 RAKRDLKAPFND-KNDGPDQGKGQEISGSSGGHGDEGRNLPRLKAQQEDHYSAGNSYRND 656

Query: 417 ESESEDEAPRRSRHGEGKKKRHSMEVDATTL 325
           ESESEDEAPRRSRHGEGKKKR ++E D+T L
Sbjct: 657 ESESEDEAPRRSRHGEGKKKRRNLEADSTEL 687


>ref|XP_012830213.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Erythranthe guttatus] gi|604344484|gb|EYU43238.1|
            hypothetical protein MIMGU_mgv1a002387mg [Erythranthe
            guttata]
          Length = 681

 Score =  647 bits (1668), Expect = 0.0
 Identities = 335/541 (61%), Positives = 363/541 (67%), Gaps = 20/541 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            M+DGEGGLSFDFEGG D   +  TA    IQSS     AS                    
Sbjct: 1    MDDGEGGLSFDFEGGLDIGPSHPTASVPVIQSSANANTASAAAAAANPYNPSAAPVPATQ 60

Query: 2154 DGTGE---GRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDC 1984
               G    GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGECR++DC
Sbjct: 61   AAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDC 120

Query: 1983 MYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNR 1804
            +YKHTNED+KECNMYKLGFCPNGPDCRYRHAKL      VEEVLQKIQQLTSYNYG SN 
Sbjct: 121  VYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSNN 180

Query: 1803 FLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXXXX 1624
            F QNRN+N+AQQTEK                   E  N+                     
Sbjct: 181  FFQNRNSNFAQQTEKPQFPQGPNGTHQVGKTNAAEPGNLNQPAQQSQQPGSQGQLQSIPN 240

Query: 1623 XXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVEN 1444
                   +R AT LPQG SRYF+VKSCNRENLELSVQQGVWATQRSNE KLNEAF+SVEN
Sbjct: 241  DQQNQA-SRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFESVEN 299

Query: 1443 VILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKT 1264
            +ILIFSVNKTRHFQGCAKMTSRIGGS  GGNWKHAHGTAHYGRNFA+KWLKLCEL+F KT
Sbjct: 300  IILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLKLCELTFDKT 359

Query: 1263 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKG 1084
            RHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDS+LMAI++           KG
Sbjct: 360  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAELKREEEKAKG 419

Query: 1083 VKPDNGSENPDIVPF---XXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHM- 916
            V  DNG+ENPDIVPF                                    GMMW PHM 
Sbjct: 420  VNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGAQGRGVGRGMMWGPHMP 479

Query: 915  PLAHGARPFPGIQGFPPNMMGG-------------DGFPMPDLFGMAPRGFGRYGPRFSG 775
            PL  G RPFPG++GFPPNMMGG             DGFPM D FGM PRGFG++GPRF G
Sbjct: 480  PLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMHDPFGMVPRGFGQFGPRFGG 539

Query: 774  E 772
            +
Sbjct: 540  D 540



 Score = 63.2 bits (152), Expect = 1e-06
 Identities = 38/82 (46%), Positives = 46/82 (56%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLIGGSNDEAPNQPRGKAQSEDHYGAGNDFRKDESESE 403
           KRDQK      RND  + G    ++ GS++      RG A   +       +R DESESE
Sbjct: 607 KRDQKAPYSD-RNDVSDQGKGQEIVSGSSN------RGNAAKREE-----SYRNDESESE 654

Query: 402 DEAPRRSRHGEGKKKRHSMEVD 337
           DEAPRRSRHGEGKKKR   E +
Sbjct: 655 DEAPRRSRHGEGKKKRRGSEAE 676


>ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Cicer arietinum]
          Length = 677

 Score =  638 bits (1646), Expect = e-180
 Identities = 333/533 (62%), Positives = 359/533 (67%), Gaps = 12/533 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D     +  P+ A  S   P +                      
Sbjct: 1    MEDSEGVLSFDFEGGLD-----AAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAAV 55

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
             G   GRR+FRQTVCRHWLRSLCMKG+ACGFLHQYDKARMPVCRFFRLYGECR++DC+YK
Sbjct: 56   SGNIPGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 115

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK      P+EEVLQKIQ L SYN+ NS++F+Q
Sbjct: 116  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQ 175

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXXXXX 1621
             R ++Y QQ EK                     ES N++                     
Sbjct: 176  QRGSSYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQTQNLA 235

Query: 1620 XXXXXAT-RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVEN 1444
                    RTAT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSVEN
Sbjct: 236  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 295

Query: 1443 VILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKT 1264
            VILIFSVN+TRHFQGCAKMTSRIGGS +GGNWK+AHGTAHYGRNF+VKWLKLCELSFHKT
Sbjct: 296  VILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 355

Query: 1263 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKG 1084
            RHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+           KG
Sbjct: 356  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKG 415

Query: 1083 VKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAH 904
            V PDN  ENPDIVPF                S+               GMMWPPHMPL  
Sbjct: 416  VNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPLGR 475

Query: 903  GARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            GARP PG+QGF P MMG          DGF MPDLFGM PRGFG YGPRFSG+
Sbjct: 476  GARPMPGMQGFNPVMMGDGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGD 528



 Score = 64.3 bits (155), Expect = 5e-07
 Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLI-------GGSNDEAPNQPRGKAQSEDHYGAGNDFR 424
           KRDQ+      RNDR++SG + G         GG +DE   Q  G           N+FR
Sbjct: 603 KRDQRTND---RNDRYSSGQEQGKSQDMLSQSGGPDDEMQYQQSG--------APANNFR 651

Query: 423 KDESESEDEAPRRSRHGEGKKKR 355
            ++SESEDEAPRRSRHGEGKK++
Sbjct: 652 NEDSESEDEAPRRSRHGEGKKRK 674


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  637 bits (1643), Expect = e-179
 Identities = 337/541 (62%), Positives = 359/541 (66%), Gaps = 20/541 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTA-----PAFAIQSSTVPTAASGXXXXXXXXXXXXXX 2170
            MED EG LSFDFEGG D   + + A     P     SS   +A S               
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 2169 XXXXADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQK 1990
                  G   GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR++
Sbjct: 61   G-----GNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQ 115

Query: 1989 DCMYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNS 1810
            DC+YKHTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L SYNY +S
Sbjct: 116  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSS 175

Query: 1809 NRFLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXX 1636
            N+F Q R A+Y QQ EK                     ES N +                
Sbjct: 176  NKFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQSQ 235

Query: 1635 XXXXXXXXXXAT-RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAF 1459
                         RTAT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAF
Sbjct: 236  MQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAF 295

Query: 1458 DSVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCEL 1279
            DSVENVIL+FSVN+TRHFQGCAKMTSRIGGS +GGNWK+AHGTAHYGRNF+VKWLKLCEL
Sbjct: 296  DSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCEL 355

Query: 1278 SFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXX 1099
            SFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+        
Sbjct: 356  SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREE 415

Query: 1098 XXXKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPH 919
               KGV PDNG ENPDIVPF                S+               GMMWPPH
Sbjct: 416  EKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPH 475

Query: 918  MPLAHGARPFPGIQGFPPNMMGG------------DGFPMPDLFGMAPRGFGRYGPRFSG 775
            MPL  GARP PG+QGF P MMG             DGF MPDLFG+ PRGF  YGPRFSG
Sbjct: 476  MPLGRGARPMPGMQGFNPVMMGDGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSG 535

Query: 774  E 772
            +
Sbjct: 536  D 536



 Score = 77.0 bits (188), Expect = 7e-11
 Identities = 45/82 (54%), Positives = 52/82 (63%), Gaps = 7/82 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLI-------GGSNDEAPNQPRGKAQSEDHYGAGNDFR 424
           KRDQ+      RNDRF SGS+ G         GG +D+A  Q   K   +DH  A N+FR
Sbjct: 611 KRDQRTAD---RNDRFGSGSEQGKSQDMLSQSGGPDDDAQYQQGYKGNQDDH-PAVNNFR 666

Query: 423 KDESESEDEAPRRSRHGEGKKK 358
            D+SESEDEAPRRSRHGEGKKK
Sbjct: 667 NDDSESEDEAPRRSRHGEGKKK 688


>ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vitis vinifera]
          Length = 673

 Score =  636 bits (1641), Expect = e-179
 Identities = 332/533 (62%), Positives = 361/533 (67%), Gaps = 12/533 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D     +   A  IQS     AA+                    
Sbjct: 1    MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAA----------PSSVVSAEPT 50

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
             G   GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++DC+YK
Sbjct: 51   PGGAPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 110

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HTNEDIKECNMYKLGFCPNG DCRYRHAKL      +EEV QKIQQL+S+NYG+SNRF Q
Sbjct: 111  HTNEDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQ 170

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXXXXX 1621
            NRN  Y QQTEK                  +  E+ N++                     
Sbjct: 171  NRNP-YNQQTEKSQILQGSNAVNLGTVAKSSTTEAINVQQQQVQPPQQQVSQTPMQNLPN 229

Query: 1620 XXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVENV 1441
                 A +TA+ LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSVENV
Sbjct: 230  GLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENV 289

Query: 1440 ILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKTR 1261
            ILIFSVN+TRHFQGCAKMTS+IGG   GGNWK+AHGTAHYGRNF+VKWLKLCELSFHKTR
Sbjct: 290  ILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 349

Query: 1260 HLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKGV 1081
            HLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISL           KGV
Sbjct: 350  HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGV 409

Query: 1080 KPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAHG 901
             PDNG ENPDIVPF                S+               G+MWPPHMPLA G
Sbjct: 410  NPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLARG 469

Query: 900  ARPFPGIQGFPPNMMGG----------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            ARP P ++GFPP MMG           DGF MPD+FG+ PR F  YGPRFSG+
Sbjct: 470  ARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGD 522



 Score = 86.3 bits (212), Expect = 1e-13
 Identities = 46/81 (56%), Positives = 54/81 (66%), Gaps = 3/81 (3%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSDAGL---IGGSNDEAPNQPRGKAQSEDHYGAGNDFRKD 418
           R KRDQ+      RNDR++ GSD G    + G +DE       K+Q +D +G GN FR D
Sbjct: 594 RTKRDQRTPVND-RNDRYSGGSDQGRGQDMAGPDDETQYLQGLKSQQDDQFGGGNSFRND 652

Query: 417 ESESEDEAPRRSRHGEGKKKR 355
           ESESEDEAPRRSRHGEGKKKR
Sbjct: 653 ESESEDEAPRRSRHGEGKKKR 673


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  636 bits (1640), Expect = e-179
 Identities = 333/533 (62%), Positives = 356/533 (66%), Gaps = 12/533 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D   + + A           +AA+                    
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVG 60

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
             G   GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR++DC+YK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L SYNY +SN+F Q
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 180

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXXXXX 1621
             R A+Y QQ EK                     E  N +                     
Sbjct: 181  QRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 1620 XXXXXAT-RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVEN 1444
                    RTAT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSVEN
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 1443 VILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKT 1264
            VILIFSVN+TRHFQGCAKMTS+IGGS +GGNWK+AHGTAHYGRNF+VKWLKLCELSFHKT
Sbjct: 301  VILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1263 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKG 1084
            RHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+           KG
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1083 VKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAH 904
            V PDNG ENPDIVPF                S+               GMMWPPHMPL  
Sbjct: 421  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPLGR 480

Query: 903  GARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            GARP PG+QGF P MMG          DGF MPDLFG+ PRGF  YGPRFSG+
Sbjct: 481  GARPMPGMQGFNPVMMGDGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGD 533



 Score = 65.5 bits (158), Expect = 2e-07
 Identities = 40/82 (48%), Positives = 46/82 (56%), Gaps = 7/82 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLI-------GGSNDEAPNQPRGKAQSEDHYGAGNDFR 424
           KRDQ+      RNDRF SGS+ G         GG +D+   Q   K   +DH        
Sbjct: 608 KRDQRTAD---RNDRFGSGSEQGKSQDMLSQSGGPDDDPQYQQGYKGNQDDH-------- 656

Query: 423 KDESESEDEAPRRSRHGEGKKK 358
            D+SESEDEAPRRSRHGEGKKK
Sbjct: 657 PDDSESEDEAPRRSRHGEGKKK 678


>ref|XP_009628296.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Nicotiana tomentosiformis]
          Length = 691

 Score =  634 bits (1635), Expect = e-178
 Identities = 331/537 (61%), Positives = 360/537 (67%), Gaps = 15/537 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAI-QSSTVPTAASGXXXXXXXXXXXXXXXXXX 2158
            M++GEGGLSFDFEGG D   T  TA    + QSS    AA+                   
Sbjct: 1    MDEGEGGLSFDFEGGLDTGPTHPTASVPVMTQSSDHNIAAAAAPNANINQPPTVSAHVGG 60

Query: 2157 ADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMY 1978
              G    RR+FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRFFRLYGECR++DC+Y
Sbjct: 61   DVGFVGNRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVY 120

Query: 1977 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFL 1798
            KHT EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ L S NYG SNRF 
Sbjct: 121  KHTIEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLASNNYGYSNRFY 180

Query: 1797 QNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXXXXXX 1618
            QNRNANY+ Q EK                   E+  ++                      
Sbjct: 181  QNRNANYSTQAEKSQASQGQNGMGLAVKSTAAETPIIQQIQPHQQQVLQTQQQGGPTQTQ 240

Query: 1617 XXXXAT-----RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDS 1453
                       RTA  LPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDS
Sbjct: 241  IHPNGQQNQTDRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 300

Query: 1452 VENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSF 1273
            VENVILIFSVN+TRHFQGCAKMTSRIGG+A GGNWKH HGTAHYGRNF+VKWLKLCELSF
Sbjct: 301  VENVILIFSVNRTRHFQGCAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCELSF 360

Query: 1272 HKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXX 1093
             KT HLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAISL          
Sbjct: 361  QKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQEEK 420

Query: 1092 XKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMP 913
             KGV PDNG++NPDIVPF                S+               G++WPP MP
Sbjct: 421  AKGVNPDNGNDNPDIVPFEDNEEEEDEESEDEDESFDQGFGPAALGRGRGRGIVWPPIMP 480

Query: 912  LAHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGEL 769
            L HG RP PG++GFPP MMG          DGFPMPD FGM PR FG YGPRFS ++
Sbjct: 481  LGHGPRPLPGMRGFPPGMMGDGFSYGAMTPDGFPMPDHFGMGPRPFGPYGPRFSNDM 537



 Score = 94.0 bits (232), Expect = 6e-16
 Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 8/99 (8%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSD-------AGLIGGSNDEAPNQP-RGKAQSEDHYGAGN 433
           RPKR+Q+      RNDRF+SGSD       AG +GG   +  N P RGK + +  +GAGN
Sbjct: 596 RPKREQRAPVHD-RNDRFSSGSDQGKGQEMAGSVGGP--DGVNYPQRGKPEQDAQFGAGN 652

Query: 432 DFRKDESESEDEAPRRSRHGEGKKKRHSMEVDATTLSDQ 316
            F+ DESESEDEAPRRSRHG+GKKKR   + DA T S++
Sbjct: 653 SFKNDESESEDEAPRRSRHGDGKKKRRDTDDDAATASEK 691


>ref|XP_011011896.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Populus euphratica]
          Length = 692

 Score =  632 bits (1630), Expect = e-178
 Identities = 329/536 (61%), Positives = 360/536 (67%), Gaps = 15/536 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D   T  +A   AI S    +A +                   +
Sbjct: 1    MEDPEGVLSFDFEGGLDSGPTNPSASMAAIPSDNQGSAMAAAPNTATTGASTSNTTANNS 60

Query: 2154 DGTGE-----GRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQK 1990
              TG      GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++
Sbjct: 61   SDTGAADMQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 120

Query: 1989 DCMYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNS 1810
            DC+YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PV E +QK QQL SYNYGNS
Sbjct: 121  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEAVQKTQQLNSYNYGNS 180

Query: 1809 NRFLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXX 1630
            N+F Q R A + QQ EK                   ESAN++                  
Sbjct: 181  NKFFQQRTAGFPQQIEKAPITIIKPSGT--------ESANLQQQQQQPQTQAQAPNLPNG 232

Query: 1629 XXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSV 1450
                      R AT+LPQG SRYFIVKSCN ENLELSVQQGVWATQRSNEPKLNEAFDS 
Sbjct: 233  QQQPNPL--NRIATTLPQGISRYFIVKSCNLENLELSVQQGVWATQRSNEPKLNEAFDSA 290

Query: 1449 ENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFH 1270
            ENVILIFSVN+TRHFQGCAKMTS+IG S  GGNWK+AHGTAHYGRNF+VKWLKLCELSFH
Sbjct: 291  ENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 350

Query: 1269 KTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXX 1090
            KTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDS+LMAIS+           
Sbjct: 351  KTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAISVAAEAKREEEKE 410

Query: 1089 KGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPL 910
            KGV PD+G ENPDIVPF                ++               G+MWPPH+P+
Sbjct: 411  KGVTPDSGGENPDIVPFEDNEEEEEEESEEEVEAFGHPLGAAAQGRGRGRGIMWPPHIPI 470

Query: 909  AHGARPFPGIQGFPPNMMGGDGFP----------MPDLFGMAPRGFGRYGPRFSGE 772
            A GARP  G++ FPP MMG DGF           MPDLFG+APRGF  YGPRFSG+
Sbjct: 471  ARGARPIHGMRAFPPMMMGADGFSYGAVTPDSFGMPDLFGVAPRGFASYGPRFSGD 526



 Score = 73.6 bits (179), Expect = 8e-10
 Identities = 48/91 (52%), Positives = 52/91 (57%), Gaps = 3/91 (3%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDA--GLIGGSNDEAPN-QPRGKAQSEDHYGAGNDFRKDES 412
           KRDQ+      R DR N  SD   G  G SNDE    Q   K   ED +GA N  R +ES
Sbjct: 600 KRDQRAAPSD-RIDRHNIESDLVRGAAGESNDETRYPQETYKVSHEDQFGAVNSNRNNES 658

Query: 411 ESEDEAPRRSRHGEGKKKRHSMEVDATTLSD 319
            SEDEAPRRSRHGEGKKK+   E DA   SD
Sbjct: 659 GSEDEAPRRSRHGEGKKKQRGSEGDANPGSD 689


>ref|XP_009768488.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like isoform X1 [Nicotiana sylvestris]
          Length = 685

 Score =  632 bits (1629), Expect = e-178
 Identities = 331/541 (61%), Positives = 360/541 (66%), Gaps = 19/541 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSS----TVPTAASGXXXXXXXXXXXXXXX 2167
            M+DGEGGLSFDFEGG D   T  TA    IQSS    T  TA                  
Sbjct: 1    MDDGEGGLSFDFEGGLDTVPTHPTASVPVIQSSDHYNTTTTAGPAPSASTASVPTVGLGQ 60

Query: 2166 XXXADGTGEG-RRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQK 1990
                DG+  G RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGECR+ 
Sbjct: 61   GQLGDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREP 120

Query: 1989 DCMYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNS 1810
            DC+YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ LTSYNYG S
Sbjct: 121  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQNLTSYNYGYS 180

Query: 1809 NRFLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESAN-----MRXXXXXXXXXXXXX 1645
            NRF QNRNANY+ Q +K                   E                       
Sbjct: 181  NRFFQNRNANYSTQADKSPTPQVQNVMNQAVKSTAAEPPTGHQHQPHQQQVQQPQHHGAP 240

Query: 1644 XXXXXXXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNE 1465
                         A RTA  LPQGTSRYFIVKSCN ENLELSVQQGVWATQRSNE KLNE
Sbjct: 241  TQTQTLPNGQQNQADRTAIPLPQGTSRYFIVKSCNPENLELSVQQGVWATQRSNEAKLNE 300

Query: 1464 AFDSVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLC 1285
            AFDSVENVIL+FS+N+TRHFQG AKMTSRIGG++ GGNWKH HGTAHYGRNF+VKWLKLC
Sbjct: 301  AFDSVENVILVFSINRTRHFQGLAKMTSRIGGASKGGNWKHEHGTAHYGRNFSVKWLKLC 360

Query: 1284 ELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXX 1105
            ELSF KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLL LEPDSELMAIS+      
Sbjct: 361  ELSFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLCLEPDSELMAISIAAESKR 420

Query: 1104 XXXXXKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWP 925
                 KGV PDNG+ENPDIVPF                S+               G++WP
Sbjct: 421  EEERAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEDESFGQAFGPAAFGRGSGGGIVWP 480

Query: 924  PHMPLAHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            P +P   GARP+PG++GFPP MMG          DGFPMPDLFGM  R FG +GPRF G+
Sbjct: 481  PIVPFGRGARPYPGMRGFPPGMMGDGFSYGSMTPDGFPMPDLFGMGGRPFGPFGPRFPGD 540

Query: 771  L 769
            +
Sbjct: 541  I 541



 Score = 74.3 bits (181), Expect = 5e-10
 Identities = 49/94 (52%), Positives = 55/94 (58%), Gaps = 7/94 (7%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSD-------AGLIGGSNDEAPNQPRGKAQSEDHYGAGND 430
           R KRD K      RNDRF+SG D       AG +GG  DE    P+           GN 
Sbjct: 600 RVKRDPKA-PVNERNDRFSSGLDQGRGQEMAGSVGGP-DEGVRYPQ----------TGNS 647

Query: 429 FRKDESESEDEAPRRSRHGEGKKKRHSMEVDATT 328
           FR DESESEDEAPRRSR G+GKKK+ SM+ DATT
Sbjct: 648 FRNDESESEDEAPRRSRLGDGKKKKLSMDGDATT 681


>ref|XP_009789024.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Nicotiana sylvestris]
            gi|698484435|ref|XP_009789025.1| PREDICTED: cleavage and
            polyadenylation specificity factor CPSF30-like [Nicotiana
            sylvestris]
          Length = 690

 Score =  631 bits (1628), Expect = e-178
 Identities = 330/536 (61%), Positives = 358/536 (66%), Gaps = 14/536 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAI-QSSTVPTAASGXXXXXXXXXXXXXXXXXX 2158
            M++GEGGLSFDFEGG D   T  TA    + QSS    AA+                   
Sbjct: 1    MDEGEGGLSFDFEGGLDTGPTHPTASVPVMTQSSDHNIAAAAAPNANINQPPTVSAHVGG 60

Query: 2157 ADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMY 1978
              G    RR+FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRFFRLYGECR++DC+Y
Sbjct: 61   DVGFVGNRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVY 120

Query: 1977 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFL 1798
            KHT EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ L S NYG SNRF 
Sbjct: 121  KHTIEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLASNNYGYSNRFY 180

Query: 1797 QNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT----ESANMRXXXXXXXXXXXXXXXXXX 1630
            QNRNANY+ Q +K                       +                       
Sbjct: 181  QNRNANYSTQADKPQASQGQNGMGAVKSTATETPIIQQIQPHQQQALQTQQQGGTTQTQI 240

Query: 1629 XXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSV 1450
                    A RTA  LPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSV
Sbjct: 241  HPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 300

Query: 1449 ENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFH 1270
            ENVILIFSVN+TRHFQGCAKMTSRIGG+A GGNWKH HGTAHYGRNF+VKWLKLCELSF 
Sbjct: 301  ENVILIFSVNRTRHFQGCAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCELSFQ 360

Query: 1269 KTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXX 1090
            KT HLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAISL           
Sbjct: 361  KTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQEEKA 420

Query: 1089 KGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPL 910
            KGV PDNG++NPDIVPF                S+               G++WPP MPL
Sbjct: 421  KGVNPDNGNDNPDIVPFEDNEEEEDEESEDEDESFDQGFGPAALGRGRGRGIVWPPIMPL 480

Query: 909  AHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGEL 769
             HG RP PG++GFPP MMG          DGFPMPD FGM PR FG YGPRFS ++
Sbjct: 481  GHGPRPLPGMRGFPPGMMGDGFSYGAMTPDGFPMPDHFGMGPRPFGPYGPRFSNDM 536



 Score = 94.0 bits (232), Expect = 6e-16
 Identities = 53/99 (53%), Positives = 66/99 (66%), Gaps = 8/99 (8%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSD-------AGLIGGSNDEAPNQP-RGKAQSEDHYGAGN 433
           RPKR+Q+      RNDRF+SGSD       AG +GG   +  N P RGK + +  +GAGN
Sbjct: 595 RPKREQRAPVHD-RNDRFSSGSDQGKGQEMAGSVGGP--DGVNYPQRGKTEQDAQFGAGN 651

Query: 432 DFRKDESESEDEAPRRSRHGEGKKKRHSMEVDATTLSDQ 316
            F+ DESESEDEAPRRSRHG+GKKKR   + DA T S++
Sbjct: 652 GFKNDESESEDEAPRRSRHGDGKKKRRDTDDDAATASEK 690


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  630 bits (1626), Expect = e-177
 Identities = 330/534 (61%), Positives = 356/534 (66%), Gaps = 13/534 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D   + + AP+  +       AAS                    
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAV 60

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
            +  G  RR+FRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR++DC+YK
Sbjct: 61   NVPG--RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 118

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L SYNY +SN+F Q
Sbjct: 119  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 178

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXXXXX 1621
             R ++Y QQ EK                     ES N +                     
Sbjct: 179  QRGSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQIQNV 238

Query: 1620 XXXXXA--TRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVE 1447
                    +R AT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSVE
Sbjct: 239  ANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVE 298

Query: 1446 NVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHK 1267
            NVILIFSVN+TRHFQGCAKMTSRIGGS +GGNWK+AHGTAHYGRNF+VKWLKLCELSFHK
Sbjct: 299  NVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 358

Query: 1266 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXK 1087
            TRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELMA+S+           K
Sbjct: 359  TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAK 418

Query: 1086 GVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLA 907
            GV PDNG ENPDIVPF                S+               GMMWPPHMPL 
Sbjct: 419  GVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWPPHMPLP 478

Query: 906  HGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
             GARP PG+QGF P MMG          DGF MPDLF + PR F  YGPRFSG+
Sbjct: 479  RGARPMPGMQGFNPVMMGDGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGD 532



 Score = 76.3 bits (186), Expect = 1e-10
 Identities = 45/87 (51%), Positives = 53/87 (60%), Gaps = 7/87 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLI-------GGSNDEAPNQPRGKAQSEDHYGAGNDFR 424
           KRDQ+      RNDR+ SGS+ G         G  +D+   Q   KA  +DH  A N+FR
Sbjct: 607 KRDQRTTD---RNDRYGSGSEQGKSQDMLSQSGAPDDDMQYQQGYKANQDDH-PAVNNFR 662

Query: 423 KDESESEDEAPRRSRHGEGKKKRHSME 343
            D+SESEDEAPRRSRHGEGKKKR   E
Sbjct: 663 NDDSESEDEAPRRSRHGEGKKKRRGPE 689


>ref|XP_011046740.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Populus euphratica]
            gi|743934932|ref|XP_011011817.1| PREDICTED: 30-kDa
            cleavage and polyadenylation specificity factor 30-like
            [Populus euphratica]
          Length = 687

 Score =  629 bits (1622), Expect = e-177
 Identities = 330/531 (62%), Positives = 355/531 (66%), Gaps = 10/531 (1%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EG LSFDFEGG D       A   AI S     A++                   A
Sbjct: 1    MEDSEGVLSFDFEGGLDSGPANPIASIPAIPSDNYGAASAAAPNTTNTTTNTANNSNSGA 60

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
                 GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++DC+YK
Sbjct: 61   ADIQTGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEV+QKIQQL SYN  NSN+  Q
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVNSNKNFQ 180

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXXXXXXX 1615
             RNA ++QQ EK                   ESAN++                       
Sbjct: 181  QRNAGFSQQIEKSPNTIIKPSGT--------ESANVQQQQQQQQTQTPHLTNGQHQQPQQ 232

Query: 1614 XXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVENVIL 1435
                 R AT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEA DS +NVIL
Sbjct: 233  PNPLNRIATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEIKLNEALDSADNVIL 292

Query: 1434 IFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKTRHL 1255
            IFSVN+TRHFQGCAKMTS+IG S  GGNWK+AHGTAHYGRNF+VKWLKLCELSFHKTRHL
Sbjct: 293  IFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHL 352

Query: 1254 RNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKGVKP 1075
            RNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMA+SL           KGV P
Sbjct: 353  RNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSLAAEAKREEEKEKGVNP 412

Query: 1074 DNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMPLAHGAR 895
            D+G ENPDIVPF                S+               GMMWP H P+A GAR
Sbjct: 413  DSGCENPDIVPFEDNEEEEEEESEEEDESFGQPLGPAAQGRGRGRGMMWPSHNPMARGAR 472

Query: 894  PFPGIQGFPPNMMGGDGFP----------MPDLFGMAPRGFGRYGPRFSGE 772
            P PGI+GFPP MMG DGF           MPDLFG+A RGF  YGPRFSG+
Sbjct: 473  PIPGIRGFPPMMMGADGFSYGAVTPDSFGMPDLFGVASRGFPPYGPRFSGD 523



 Score = 84.3 bits (207), Expect = 4e-13
 Identities = 52/91 (57%), Positives = 56/91 (61%), Gaps = 3/91 (3%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDA--GLIGGSNDEAPN-QPRGKAQSEDHYGAGNDFRKDES 412
           KRDQ+  +   RNDR +  SD   G  G SNDE    Q   KA  ED +GA N  R DES
Sbjct: 597 KRDQRAAAND-RNDRHSVESDVVRGAAGESNDERKYLQETLKASHEDQFGAVNSIRNDES 655

Query: 411 ESEDEAPRRSRHGEGKKKRHSMEVDATTLSD 319
           ESEDEAPRRSRHGEGKKKR     DAT  SD
Sbjct: 656 ESEDEAPRRSRHGEGKKKRRGSGEDATPGSD 686


>ref|XP_008445183.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Cucumis melo]
          Length = 710

 Score =  628 bits (1620), Expect = e-177
 Identities = 328/542 (60%), Positives = 357/542 (65%), Gaps = 21/542 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSS--------TVPTAASGXXXXXXXXXXX 2179
            MED EG LSFDFEGG D   T   A A A  SS        + P   S            
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPTNPAAAAAASSSSLPLIPSDSSAPPPLSNSLPGSLGPTLA 60

Query: 2178 XXXXXXXADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGEC 1999
                       G  RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGEC
Sbjct: 61   PEPLGAPTANVGT-RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGEC 119

Query: 1998 RQKDCMYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNY 1819
            R++DC+YKHTNEDIKECNMYK GFCPNGPDCRYRHAKL      VEE+LQKIQ L SYNY
Sbjct: 120  REQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHLGSYNY 179

Query: 1818 GNSNRFLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXX 1645
            G+SN+F   R     QQ EK                  +  ESAN++             
Sbjct: 180  GSSNKFFSQRGVGLPQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQQQVQQPAQQTSQ 239

Query: 1644 XXXXXXXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNE 1465
                           RTATSLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNE
Sbjct: 240  TQIQSVSNGQPNQLNRTATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 299

Query: 1464 AFDSVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLC 1285
            AFDS +NVILIFSVN+TRHFQGCAKM SRIGGS SGGNWK+AHGTAHYG+NF++KWLKLC
Sbjct: 300  AFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLC 359

Query: 1284 ELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXX 1105
            ELSF KTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELMA+S+      
Sbjct: 360  ELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKR 419

Query: 1104 XXXXXKGVKPDNGSENPDIVPF-XXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMW 928
                 KGV PD G+ENPDIVPF                 S+               G+MW
Sbjct: 420  EEEKAKGVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGIMW 479

Query: 927  PPHMPLAHGARPFPGIQGFPPNMMG----------GDGFPMPDLFGMAPRGFGRYGPRFS 778
            PPHMP+  GARPF G+Q FPP MMG           DGFPMPD+FGMAPRGFG YGPRFS
Sbjct: 480  PPHMPMGRGARPFHGMQSFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPRFS 539

Query: 777  GE 772
            G+
Sbjct: 540  GD 541



 Score = 85.5 bits (210), Expect = 2e-13
 Identities = 48/94 (51%), Positives = 59/94 (62%), Gaps = 5/94 (5%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAG----LIGGSNDEAPNQPRG-KAQSEDHYGAGNDFRKD 418
           KRDQ+  +   RNDR+  G D      ++   +DE     +G KA  ++ YG G  FR +
Sbjct: 618 KRDQRGPTSD-RNDRYIVGPDQNKGQEMLSSGHDEGMQYKQGSKAYPDEQYGMGTTFRNE 676

Query: 417 ESESEDEAPRRSRHGEGKKKRHSMEVDATTLSDQ 316
           ESESEDEAPRRSRHGEGKKKR   E DAT +SDQ
Sbjct: 677 ESESEDEAPRRSRHGEGKKKRRGSEGDATAISDQ 710


>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  624 bits (1610), Expect = e-176
 Identities = 333/537 (62%), Positives = 362/537 (67%), Gaps = 16/537 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTA---ASGXXXXXXXXXXXXXXXX 2164
            M+D EGGLSFDFEGG D      TA    + S     A   ++                 
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 2163 XXADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDC 1984
                G G GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECR++DC
Sbjct: 61   AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1983 MYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNR 1804
            +YKHTNEDIKECNMYKLGFCPNG DCRYRHAKL     PVEEVLQKIQQL+SYNY   N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177

Query: 1803 FLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXX 1630
            F Q RN+ +AQQTEK                  +  ESANM                   
Sbjct: 178  FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQTQIQ 237

Query: 1629 XXXXXXXXAT-RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDS 1453
                       +TA  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDS
Sbjct: 238  NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 297

Query: 1452 VENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSF 1273
             ENVILIFSVN+TRHFQGCAKMTS+IGGS +GGNWK+AHGTAHYGRNF+VKWLKLCELSF
Sbjct: 298  AENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 357

Query: 1272 HKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXX 1093
            HKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+          
Sbjct: 358  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREEEK 417

Query: 1092 XKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHMP 913
             KGV  DNG ENPDIVPF                S+               G+MWPPHMP
Sbjct: 418  AKGVNSDNGGENPDIVPFEDNEEEEEEESEEEDESF----SAAAQGRGRGRGVMWPPHMP 473

Query: 912  LAHGARPFPGIQGFPPNMMGGDGFP----------MPDLFGMAPRGFGRYGPRFSGE 772
            LA GARP PG++GFPP MMGGDGF           +PDLFG APR F  YGPRFSG+
Sbjct: 474  LARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGD 529



 Score = 85.1 bits (209), Expect = 3e-13
 Identities = 51/96 (53%), Positives = 60/96 (62%), Gaps = 8/96 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSD-------AGLIGGSNDEAPNQPRG-KAQSEDHYGAGNDF 427
           KRDQ+  +    NDR+ +GS+       AG  G  +DE   Q  G KA  ED + AGN F
Sbjct: 606 KRDQRTPT----NDRYGAGSEQGRGQEMAGPGGRLDDETQYQQEGQKAHHEDQFAAGNSF 661

Query: 426 RKDESESEDEAPRRSRHGEGKKKRHSMEVDATTLSD 319
           R DESESEDEAPRRSR+GEGKKKR S+E D    SD
Sbjct: 662 RNDESESEDEAPRRSRYGEGKKKRRSLEGDDANGSD 697


>ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 692

 Score =  623 bits (1607), Expect = e-175
 Identities = 328/539 (60%), Positives = 359/539 (66%), Gaps = 17/539 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            M++GEGGL+FDFEGG D   T  TA    IQS    TAA+                    
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFD-HTAAAAPSANINPPTVSAAVGGQSD 59

Query: 2154 DGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMYK 1975
             G    RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGECR++DC+YK
Sbjct: 60   VGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYK 119

Query: 1974 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFLQ 1795
            HT EDIKECNMYKLGFCPNGPDCRYRHAK+     PVEE+LQKIQ L SYNYG SNRF Q
Sbjct: 120  HTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYGYSNRFNQ 179

Query: 1794 NRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXXXXXXX 1615
            NRNANY+ Q++K                  TE+  ++                       
Sbjct: 180  NRNANYSTQSDKSQASQAQNGMSLAVKSTATETPIIQQHQPNQQVQPPQLQGGPTQAQIH 239

Query: 1614 XXXAT----RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVE 1447
                     RTA  LPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFDSVE
Sbjct: 240  PNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE 299

Query: 1446 NVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHK 1267
            NVILIFSVN+TRHFQGC KMTSRIGG+A+GGNWKH HGTAHYGRNF+VKWLKLCELSF K
Sbjct: 300  NVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWLKLCELSFQK 359

Query: 1266 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXK 1087
            T HLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAISL           K
Sbjct: 360  THHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQEEKAK 419

Query: 1086 GVKPDNGSENPDIVPF----XXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPH 919
            GV PDNG +NPDIVPF                    S+               G+ WPP 
Sbjct: 420  GVNPDNGKDNPDIVPFEDNEEEEEEEEEEESEDEDESFDQGFGPAALGRGRGRGIAWPPI 479

Query: 918  MPLAHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGEL 769
            MP  HG RP PG++GFPP MMG          +GFPMPD FGM PR FG YGP FS +L
Sbjct: 480  MPFGHGPRPPPGMRGFPPGMMGDGFSYGAMTPEGFPMPDHFGMGPRPFGPYGPPFSSDL 538



 Score = 75.5 bits (184), Expect = 2e-10
 Identities = 42/95 (44%), Positives = 60/95 (63%), Gaps = 5/95 (5%)
 Frame = -3

Query: 588 RPKRDQKVLSGGYRNDRFNSGSDAGL-----IGGSNDEAPNQPRGKAQSEDHYGAGNDFR 424
           + KR+Q+      RNDRF+S    G      +GG   +  +   GK++ ++ +GAGN  +
Sbjct: 597 KAKREQRAPVSD-RNDRFSSDQGKGQEMMGSVGGP--DGVHMQIGKSEHDNQFGAGNSQK 653

Query: 423 KDESESEDEAPRRSRHGEGKKKRHSMEVDATTLSD 319
            +ESESEDEAPRRSRHG+GKKKR  ++ DA T S+
Sbjct: 654 NEESESEDEAPRRSRHGDGKKKRRDVDEDAATGSE 688


>gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sinensis]
          Length = 701

 Score =  622 bits (1605), Expect = e-175
 Identities = 329/537 (61%), Positives = 360/537 (67%), Gaps = 16/537 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EGGLSFDFEGG D      TA   AIQS +   AA+                   A
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAAPSSSGAAPDHA 60

Query: 2154 DGT---GEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDC 1984
                    GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECR++DC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1983 MYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNR 1804
            +YKHTNEDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQQ++SYN+GN N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1803 FLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANM--RXXXXXXXXXXXXXXXX 1636
              Q R A ++ QT+K                  +  ESAN+  +                
Sbjct: 181  HFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 1635 XXXXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFD 1456
                        R AT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 1455 SVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELS 1276
            S ENVILIFSVN+TRHFQGCAKMTS+IGGS  GGNWK+AHGTAHYGRNF+VKWLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1275 FHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXX 1096
            FHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELMAIS+         
Sbjct: 360  FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419

Query: 1095 XXKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHM 916
              KGV PDNG +NPDIVPF                                 GMMWP  M
Sbjct: 420  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEE----EESLGTASQGRGRGRGMMWPGPM 475

Query: 915  PLAHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            PLA GARP PG++GFPP M+G          DGFPMPDLFG+APR F  YGPRFSG+
Sbjct: 476  PLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGD 532



 Score = 84.7 bits (208), Expect = 3e-13
 Identities = 49/85 (57%), Positives = 54/85 (63%), Gaps = 8/85 (9%)
 Frame = -3

Query: 549 RNDRFNSGSDAGLIG-------GSNDEAPNQPRG-KAQSEDHYGAGNDFRKDESESEDEA 394
           RNDR+++GSD G          G +DE   Q  G KA  ED YG+ N FR DESESEDEA
Sbjct: 617 RNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEA 675

Query: 393 PRRSRHGEGKKKRHSMEVDATTLSD 319
           PRRSRHGEGKKKR   E DA   SD
Sbjct: 676 PRRSRHGEGKKKRRDSEGDAAASSD 700


>ref|NP_001280880.1| cleavage and polyadenylation specificity factor CPSF30 [Malus
            domestica] gi|597438311|gb|AHN05783.1| YTH
            domain-contained RNA binding protein 14 [Malus domestica]
          Length = 667

 Score =  621 bits (1602), Expect = e-175
 Identities = 323/533 (60%), Positives = 356/533 (66%), Gaps = 12/533 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPD-PCTTQSTA-PAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXX 2161
            MED +GGL+FDFEGG D P T  ++A PA  + +S      S                  
Sbjct: 1    MEDSDGGLNFDFEGGLDAPATVSASAGPANTVPTSNYSVMQSDSAVTGLGANQAAAAPQP 60

Query: 2160 XADGTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCM 1981
              +    G R++RQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECR++DC+
Sbjct: 61   NQNANRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 120

Query: 1980 YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRF 1801
            YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ LTSYNY NS++F
Sbjct: 121  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYNYNNSSKF 180

Query: 1800 LQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXTESANMRXXXXXXXXXXXXXXXXXXXXX 1621
             Q RNA + QQ +K                    +   +                     
Sbjct: 181  YQQRNAGFPQQGDKHQPAQGPNNFVGKPTTAEPGNVQQQQQQQLQQTQQHVGPTQTQTLP 240

Query: 1620 XXXXXAT-RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFDSVEN 1444
                    R+A  LPQGTSRYFIVKSCNRENLELSVQQG+WATQRSNE KLNEAFDS EN
Sbjct: 241  NGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRSNESKLNEAFDSAEN 300

Query: 1443 VILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELSFHKT 1264
            VILIFSVN+TRHFQGCAKM SRIGGS  GGNWK+AHGTAHYGRNF+VKWLKLCELSFHKT
Sbjct: 301  VILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1263 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXXXXKG 1084
            RHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAIS+           KG
Sbjct: 361  RHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAISIAAESKREEEKAKG 420

Query: 1083 VKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSY--XXXXXXXXXXXXXXXGMMWPPHMPL 910
            V P+NG ENPDIVPF                S+                 G+MWPPHM L
Sbjct: 421  VNPENGGENPDIVPFEDNEEEEEEESEDEEDSFGQVPGAGNDGRGRGRGGGVMWPPHMAL 480

Query: 909  AHGARPFPGIQGFPPNMMG-------GDGFPMPDLFGMAPRGFGRYGPRFSGE 772
              G RP PG+QGFPP MMG        DGF MP+ FGMAPRGF  YGPRFSG+
Sbjct: 481  PRGGRPMPGMQGFPPGMMGHDAMPYVPDGFVMPNPFGMAPRGFNPYGPRFSGD 533



 Score = 72.8 bits (177), Expect = 1e-09
 Identities = 34/59 (57%), Positives = 40/59 (67%)
 Frame = -3

Query: 504 GSNDEAPNQPRGKAQSEDHYGAGNDFRKDESESEDEAPRRSRHGEGKKKRHSMEVDATT 328
           G++ +   Q       E HYGAGN  R D+SESEDEAPRRSRHG+GKKKR   E DAT+
Sbjct: 607 GASTDRKGQDMSGPDDETHYGAGNSSRNDDSESEDEAPRRSRHGDGKKKRRDSEGDATS 665


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  620 bits (1600), Expect = e-174
 Identities = 328/537 (61%), Positives = 359/537 (66%), Gaps = 16/537 (2%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            MED EGGLSFDFEGG D      TA   AIQS +   AA+                   A
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2154 DGT---GEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDC 1984
                    GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECR++DC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1983 MYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNR 1804
            +YKHTNEDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQQ++SYN+GN N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1803 FLQNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANM--RXXXXXXXXXXXXXXXX 1636
              Q R A ++ Q +K                  +  ESAN+  +                
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 1635 XXXXXXXXXXATRTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNEAFD 1456
                        R AT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 1455 SVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLCELS 1276
            S ENVILIFSVN+TRHFQGCAKMTS+IGGS  GGNWK+AHGTAHYGRNF+VKWLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1275 FHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXXXXX 1096
            FHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELMAIS+         
Sbjct: 360  FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419

Query: 1095 XXKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWPPHM 916
              KGV PDNG +NPDIVPF                                 GMMWP  M
Sbjct: 420  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEE----EESLGTASQGRGRGRGMMWPGPM 475

Query: 915  PLAHGARPFPGIQGFPPNMMGG---------DGFPMPDLFGMAPRGFGRYGPRFSGE 772
            PLA GARP PG++GFPP M+G          DGFPMPDLFG+APR F  YGPRFSG+
Sbjct: 476  PLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGD 532



 Score = 84.7 bits (208), Expect = 3e-13
 Identities = 49/85 (57%), Positives = 54/85 (63%), Gaps = 8/85 (9%)
 Frame = -3

Query: 549 RNDRFNSGSDAGLIG-------GSNDEAPNQPRG-KAQSEDHYGAGNDFRKDESESEDEA 394
           RNDR+++GSD G          G +DE   Q  G KA  ED YG+ N FR DESESEDEA
Sbjct: 617 RNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEA 675

Query: 393 PRRSRHGEGKKKRHSMEVDATTLSD 319
           PRRSRHGEGKKKR   E DA   SD
Sbjct: 676 PRRSRHGEGKKKRRDSEGDAAASSD 700


>ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Gossypium raimondii] gi|763780831|gb|KJB47902.1|
            hypothetical protein B456_008G046800 [Gossypium
            raimondii]
          Length = 700

 Score =  620 bits (1599), Expect = e-174
 Identities = 334/541 (61%), Positives = 367/541 (67%), Gaps = 20/541 (3%)
 Frame = -3

Query: 2334 MEDGEGGLSFDFEGGPDPCTTQSTAPAFAIQSSTVPTAASGXXXXXXXXXXXXXXXXXXA 2155
            M+D EGGLSFDFEGG D      TA    + S   P+AA+                   A
Sbjct: 1    MDDAEGGLSFDFEGGLDAGPPAPTASMPVVNSD--PSAANNTNNFTAPGGVQASINDPVA 58

Query: 2154 D-GTGEGRRNFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECRQKDCMY 1978
            + G G GRR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECR++DC+Y
Sbjct: 59   NQGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVY 118

Query: 1977 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLTSYNYGNSNRFL 1798
            KHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQQL++YNY  +N+F 
Sbjct: 119  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNY--NNKFY 176

Query: 1797 QNRNANYAQQTEKXXXXXXXXXXXXXXXXXXT--ESANMRXXXXXXXXXXXXXXXXXXXX 1624
            Q RNA + QQTEK                  +  ES N++                    
Sbjct: 177  QQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQVSQ 236

Query: 1623 XXXXXXAT-------RTATSLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEPKLNE 1465
                           RTA  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE KLNE
Sbjct: 237  TQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 296

Query: 1464 AFDSVENVILIFSVNKTRHFQGCAKMTSRIGGSASGGNWKHAHGTAHYGRNFAVKWLKLC 1285
            AFDS ENVIL+FSVN+TRHFQGCAKMTS+IGGS +GGNWK+AHGTAHYGRNF+VKWLKLC
Sbjct: 297  AFDSAENVILVFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLC 356

Query: 1284 ELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLXXXXXX 1105
            ELSFHKTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAISL      
Sbjct: 357  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKR 416

Query: 1104 XXXXXKGVKPDNGSENPDIVPFXXXXXXXXXXXXXXXXSYXXXXXXXXXXXXXXXGMMWP 925
                 KGV  DN +ENPDIVPF                S+               G+MWP
Sbjct: 417  EEEKAKGVNSDN-AENPDIVPFEDNEEEEEEESEEEDESF----GAAAQGRGRGRGIMWP 471

Query: 924  PHMPLAHGARPFPGIQGFPPNMMGGDGFP----------MPDLFGMAPRGFGRYGPRFSG 775
            PHMPLA GARP PG++GFPP MMGGDGF           MPDLFG APR F  YGPRFSG
Sbjct: 472  PHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFG-APRPFAPYGPRFSG 530

Query: 774  E 772
            +
Sbjct: 531  D 531



 Score = 85.5 bits (210), Expect = 2e-13
 Identities = 50/96 (52%), Positives = 58/96 (60%), Gaps = 8/96 (8%)
 Frame = -3

Query: 582 KRDQKVLSGGYRNDRFNSGSDAGLI-------GGSNDEAPNQPRG-KAQSEDHYGAGNDF 427
           KRDQ+  +    NDR ++GS+ G         GG  D    Q  G KA  ED + AGN F
Sbjct: 608 KRDQRTPT----NDRSSAGSEQGRGQEMGGPGGGLEDGTQYQQEGQKAHHEDQFAAGNSF 663

Query: 426 RKDESESEDEAPRRSRHGEGKKKRHSMEVDATTLSD 319
           R D+SESEDEAPRRSRHGEGKKKR  +E D  T SD
Sbjct: 664 RNDDSESEDEAPRRSRHGEGKKKRRGLEGDVATASD 699

