BLASTX nr result

ID: Cornus23_contig00006661 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00006661
         (2461 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   895   0.0  
gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sin...   889   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   888   0.0  
ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylati...   886   0.0  
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   884   0.0  
gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium r...   882   0.0  
ref|XP_010092677.1| Cleavage and polyadenylation specificity fac...   854   0.0  
ref|XP_014518648.1| PREDICTED: 30-kDa cleavage and polyadenylati...   853   0.0  
ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation spec...   853   0.0  
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   851   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   847   0.0  
ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylati...   842   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   835   0.0  
ref|XP_010687042.1| PREDICTED: 30-kDa cleavage and polyadenylati...   822   0.0  
ref|XP_009628296.1| PREDICTED: cleavage and polyadenylation spec...   822   0.0  
ref|XP_009789024.1| PREDICTED: cleavage and polyadenylation spec...   819   0.0  
ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylati...   816   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   801   0.0  
ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylati...   792   0.0  
ref|XP_012830213.1| PREDICTED: 30-kDa cleavage and polyadenylati...   783   0.0  

>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  895 bits (2312), Expect = 0.0
 Identities = 467/709 (65%), Positives = 498/709 (70%), Gaps = 20/709 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA------- 2072
            MDD EGGLSFDFEGGLD GP+ P+AS+P + S                            
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 2071 ----GNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 1904
                G   GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1903 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNR 1727
            VYKHTNEDIKECNMYKLGFCPNG DCRYRHAKLPGPPPP+EEVLQKIQ L+ YNY   N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177

Query: 1726 FFQNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN 1547
            FFQ RN+ ++QQTE+SQ PQG N  +Q A  KPSTTE  NM                   
Sbjct: 178  FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQTQIQ 237

Query: 1546 -LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDT 1370
             +PN Q NQ NK A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+
Sbjct: 238  NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 297

Query: 1369 VENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1190
             ENVILIFSVNRTRHFQGCAKMTSKIGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSF
Sbjct: 298  AENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 357

Query: 1189 HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1010
            HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISV          
Sbjct: 358  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREEEK 417

Query: 1009 XKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMP 830
             KGVN DNG ENPDIVPFEDN                ++F   AQGRGRGRG++WPPHMP
Sbjct: 418  AKGVNSDNGGENPDIVPFEDNEEEEEEESEEED----ESFSAAAQGRGRGRGVMWPPHMP 473

Query: 829  LARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFT 650
            LARG ARP+PGMRGFPP+MMG DGFSYG VTPDGF +PDL+G  PR F PYGPRF GDFT
Sbjct: 474  LARG-ARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGDFT 531

Query: 649  GAASGMMFHGR-PSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 473
            G ASGMMF GR P                                               
Sbjct: 532  GPASGMMFPGRPPQPGAMFPAGGLGMMMGPGRAPFMGGMGPTGANPVRGGRPVSMPPMFP 591

Query: 472  XXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQ- 296
                       R VKRDQR   NDR   Y AGS+QG+GQ+MAG GG LD+ETQY QE Q 
Sbjct: 592  PPPAPSSQNSGRAVKRDQRTPTNDR---YGAGSEQGRGQEMAGPGGRLDDETQYQQEGQK 648

Query: 295  -----QFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
                 QF +GN+FRNDESESEDEAPRRSR+GEGKKKRRS E D    S+
Sbjct: 649  AHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGDDANGSD 697


>gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sinensis]
          Length = 701

 Score =  889 bits (2297), Expect = 0.0
 Identities = 464/709 (65%), Positives = 499/709 (70%), Gaps = 20/709 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNV---- 2063
            M+D EGGLSFDFEGGLD GP  P+AS PAIQS                    +G      
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAAPSSSGAAPDHA 60

Query: 2062 -------QGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 1904
                    GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1903 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNR 1727
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP +EEVLQKIQ ++ YN+GN N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1726 FFQNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN 1547
             FQ R A +S QT++SQF QG NA +Q A  K ST E  N+                   
Sbjct: 181  HFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 1546 --LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1373
              LPN   NQ N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 1372 TVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 1193
            + ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1192 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXX 1013
            FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELMAISV         
Sbjct: 360  FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419

Query: 1012 XXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHM 833
              KGVNPDNG +NPDIVPFEDN                ++ G  +QGRGRGRGM+WP  M
Sbjct: 420  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPGPM 475

Query: 832  PLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDF 653
            PLAR GARP+PGMRGFPP+M+GADGFSYG VTPDGFPMPDL+G+ PR F PYGPRF GDF
Sbjct: 476  PLAR-GARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGDF 533

Query: 652  TGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 473
            TG   GMMF GRP Q                                             
Sbjct: 534  TG-PGGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPF 592

Query: 472  XXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQE--- 302
                      +R  KRD R + NDRNDRYSAGSDQG+ Q+M G G G D+E QY QE   
Sbjct: 593  PNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSK 652

Query: 301  ---DQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
               + Q+GS  NFRNDESESEDEAPRRSRHGEGKKKRR SE DA  SS+
Sbjct: 653  ANQEDQYGS-RNFRNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSD 700


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  888 bits (2294), Expect = 0.0
 Identities = 463/709 (65%), Positives = 499/709 (70%), Gaps = 20/709 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNV---- 2063
            M+D EGGLSFDFEGGLD GP  P+AS PAIQS                    +G      
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2062 -------QGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 1904
                    GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1903 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNR 1727
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP +EEVLQKIQ ++ YN+GN N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1726 FFQNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN 1547
             FQ R A +S Q ++SQF QG NA +Q A  K ST E  N+                   
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 1546 --LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 1373
              LPN   NQ N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 1372 TVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 1193
            + ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1192 FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXX 1013
            FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELMAISV         
Sbjct: 360  FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419

Query: 1012 XXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHM 833
              KGVNPDNG +NPDIVPFEDN                ++ G  +QGRGRGRGM+WP  M
Sbjct: 420  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPGPM 475

Query: 832  PLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDF 653
            PLAR GARP+PGMRGFPP+M+GADGFSYG VTPDGFPMPDL+G+ PR F PYGPRF GDF
Sbjct: 476  PLAR-GARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGDF 533

Query: 652  TGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 473
            TG   GMMF GRP Q                                             
Sbjct: 534  TG-PGGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPF 592

Query: 472  XXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQE--- 302
                      +R+ KRD R + NDRNDRYSAGSDQG+ Q+M G G G D+E QY QE   
Sbjct: 593  PNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSK 652

Query: 301  ---DQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
               + Q+GS  NFRNDESESEDEAPRRSRHGEGKKKRR SE DA  SS+
Sbjct: 653  ANQEDQYGS-RNFRNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSD 700


>ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Gossypium raimondii] gi|763780831|gb|KJB47902.1|
            hypothetical protein B456_008G046800 [Gossypium
            raimondii]
          Length = 700

 Score =  886 bits (2290), Expect = 0.0
 Identities = 460/711 (64%), Positives = 497/711 (69%), Gaps = 22/711 (3%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA------- 2072
            MDD EGGLSFDFEGGLD GP  P+AS+P + S                            
Sbjct: 1    MDDAEGGLSFDFEGGLDAGPPAPTASMPVVNSDPSAANNTNNFTAPGGVQASINDPVANQ 60

Query: 2071 GNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKH 1892
            G   GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDCVYKH
Sbjct: 61   GGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKH 120

Query: 1891 TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQN 1715
            TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQ L AYNY  +N+F+Q 
Sbjct: 121  TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNY--NNKFYQQ 178

Query: 1714 RNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN---- 1547
            RNA + QQTE+SQ PQ  N  +Q A  KPS TE  N+                       
Sbjct: 179  RNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQVSQTQ 238

Query: 1546 ---LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1376
               +PN Q NQ N+ A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 239  IQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 298

Query: 1375 DTVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 1196
            D+ ENVIL+FSVNRTRHFQGCAKMTSKIGGSV GGNWKYAHGTAHYGRNFSVKWLKLCEL
Sbjct: 299  DSAENVILVFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCEL 358

Query: 1195 SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXX 1016
            SFHKTRHLRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAIS+        
Sbjct: 359  SFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKREE 418

Query: 1015 XXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPH 836
               KGVN DN  ENPDIVPFEDN                ++FG  AQGRGRGRG++WPPH
Sbjct: 419  EKAKGVNSDNA-ENPDIVPFEDNEEEEEEESEEED----ESFGAAAQGRGRGRGIMWPPH 473

Query: 835  MPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGD 656
            MPLARG ARP+PGMRGFPP+MMG DGFSYG VTPDGF MPDL+G  PR F PYGPRF GD
Sbjct: 474  MPLARG-ARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFG-APRPFAPYGPRFSGD 531

Query: 655  FTGAASGMMFHGR-PSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 479
            FTG ASGMMF GR P                                             
Sbjct: 532  FTGPASGMMFPGRPPQPGGMFPSGGIGMMMGPGRAPFMGGMGPTGANPARGGRPVGMPPM 591

Query: 478  XXXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQED 299
                         R +KRDQR   NDR+   SAGS+QG+GQ+M G GGGL++ TQY QE 
Sbjct: 592  FPLPPAPASQNSGRAIKRDQRTPTNDRS---SAGSEQGRGQEMGGPGGGLEDGTQYQQEG 648

Query: 298  Q------QFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
            Q      QF +GN+FRND+SESEDEAPRRSRHGEGKKKRR  E D  T+S+
Sbjct: 649  QKAHHEDQFAAGNSFRNDDSESEDEAPRRSRHGEGKKKRRGLEGDVATASD 699


>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  884 bits (2285), Expect = 0.0
 Identities = 460/698 (65%), Positives = 494/698 (70%), Gaps = 9/698 (1%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNVQGRR 2051
            M+D EGGLSFDFEGGLD GP  P+AS PA                            GRR
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAAAPSSSGAAPDHASAPVPHH-------SGRR 53

Query: 2050 SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 1871
            SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDCVYKHTNEDIKE
Sbjct: 54   SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTNEDIKE 113

Query: 1870 CNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNRFFQNRNANYSQ 1694
            CNMYKLGFCPNGPDCRYRH KLPGPPP +EEVLQKIQ ++ YN+GN N+ FQ R A +S 
Sbjct: 114  CNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRGA-FSH 172

Query: 1693 QTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN--LPNDQQNQV 1520
            QT++SQF QG NA +Q A  K ST E  N+                     LPN   NQ 
Sbjct: 173  QTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQMQNLPNGLPNQT 232

Query: 1519 NKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVILIFSV 1340
            N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ ENVILIFSV
Sbjct: 233  NRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAENVILIFSV 292

Query: 1339 NRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPY 1160
            NRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPY
Sbjct: 293  NRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPY 352

Query: 1159 NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVNPDNGT 980
            NENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELMAISV           KGVNPDNG 
Sbjct: 353  NENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKAKGVNPDNGG 412

Query: 979  ENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARGGARPLP 800
            +NPDIVPFEDN                ++ G  +QGRGRGRGM+WP  MPLAR GARP+P
Sbjct: 413  DNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPGPMPLAR-GARPVP 467

Query: 799  GMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAASGMMFHG 620
            GMRGFPP+M+GADGFSYG VTPDGFPMPDL+G+ PR F PYGPRF GDFTG   GMMF G
Sbjct: 468  GMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTG-PGGMMFPG 525

Query: 619  RPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXN 440
            RP Q                                                       +
Sbjct: 526  RPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQSSQNSS 585

Query: 439  RMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQE------DQQFGSGN 278
            R  KRD R + NDRNDRYSAGSDQG+ Q+M G G G D+E QY QE      + Q+GS  
Sbjct: 586  RAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGS-R 644

Query: 277  NFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
            NFRNDESESEDEAPRRSRHGEGKKKRR SE DA  SS+
Sbjct: 645  NFRNDESESEDEAPRRSRHGEGKKKRRDSEGDAAASSD 682


>gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium raimondii]
          Length = 701

 Score =  882 bits (2278), Expect = 0.0
 Identities = 460/712 (64%), Positives = 497/712 (69%), Gaps = 23/712 (3%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA------- 2072
            MDD EGGLSFDFEGGLD GP  P+AS+P + S                            
Sbjct: 1    MDDAEGGLSFDFEGGLDAGPPAPTASMPVVNSDPSAANNTNNFTAPGGVQASINDPVANQ 60

Query: 2071 GNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKH 1892
            G   GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQDCVYKH
Sbjct: 61   GGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKH 120

Query: 1891 TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQN 1715
            TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQ L AYNY  +N+F+Q 
Sbjct: 121  TNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNY--NNKFYQQ 178

Query: 1714 RNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN---- 1547
            RNA + QQTE+SQ PQ  N  +Q A  KPS TE  N+                       
Sbjct: 179  RNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQVSQTQ 238

Query: 1546 ---LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1376
               +PN Q NQ N+ A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 239  IQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 298

Query: 1375 DTVENVILIFSVNRTRHFQ-GCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1199
            D+ ENVIL+FSVNRTRHFQ GCAKMTSKIGGSV GGNWKYAHGTAHYGRNFSVKWLKLCE
Sbjct: 299  DSAENVILVFSVNRTRHFQVGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCE 358

Query: 1198 LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXX 1019
            LSFHKTRHLRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAIS+       
Sbjct: 359  LSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRE 418

Query: 1018 XXXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPP 839
                KGVN DN  ENPDIVPFEDN                ++FG  AQGRGRGRG++WPP
Sbjct: 419  EEKAKGVNSDNA-ENPDIVPFEDNEEEEEEESEEED----ESFGAAAQGRGRGRGIMWPP 473

Query: 838  HMPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPG 659
            HMPLARG ARP+PGMRGFPP+MMG DGFSYG VTPDGF MPDL+G  PR F PYGPRF G
Sbjct: 474  HMPLARG-ARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFG-APRPFAPYGPRFSG 531

Query: 658  DFTGAASGMMFHGR-PSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 482
            DFTG ASGMMF GR P                                            
Sbjct: 532  DFTGPASGMMFPGRPPQPGGMFPSGGIGMMMGPGRAPFMGGMGPTGANPARGGRPVGMPP 591

Query: 481  XXXXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQE 302
                          R +KRDQR   NDR+   SAGS+QG+GQ+M G GGGL++ TQY QE
Sbjct: 592  MFPLPPAPASQNSGRAIKRDQRTPTNDRS---SAGSEQGRGQEMGGPGGGLEDGTQYQQE 648

Query: 301  DQ------QFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
             Q      QF +GN+FRND+SESEDEAPRRSRHGEGKKKRR  E D  T+S+
Sbjct: 649  GQKAHHEDQFAAGNSFRNDDSESEDEAPRRSRHGEGKKKRRGLEGDVATASD 700


>ref|XP_010092677.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis] gi|587862159|gb|EXB51974.1| Cleavage and
            polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  854 bits (2207), Expect = 0.0
 Identities = 446/712 (62%), Positives = 489/712 (68%), Gaps = 23/712 (3%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTG----PSN----------PSASVPAIQSXXXXXXXXXXXXXX 2093
            M+D EG LSFDFEGGLDT     P N          P +S  A  +              
Sbjct: 1    MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSADPT 60

Query: 2092 XXXXXXAGNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 1913
                  A N    RSFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFRLYGECRE
Sbjct: 61   SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120

Query: 1912 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGN 1736
            QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPP +EEVLQKIQHL+ YNY +
Sbjct: 121  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY-H 179

Query: 1735 SNRFFQNRNAN-YSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXX 1559
            SN+FFQ RNA  ++Q  E+   P G NA SQ    KPS  E  N+               
Sbjct: 180  SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239

Query: 1558 XXXN--LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1385
                  +     NQ N+   PLP GISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 240  QNQIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 299

Query: 1384 EAFDTVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKL 1205
            EAFD  ENVILIFSVNRTRHFQGCAKM S+IGGS+ GGNWKYAHGTAHYGRNFSVKWLKL
Sbjct: 300  EAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKWLKL 359

Query: 1204 CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXX 1025
            CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+     
Sbjct: 360  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESK 419

Query: 1024 XXXXXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIW 845
                  KGV+PDNG ENPDIVPFEDN             SF Q  G   QGRGRGRG++W
Sbjct: 420  REEEKAKGVDPDNGGENPDIVPFEDNEEDEEEESEDEEESFSQVLGAN-QGRGRGRGVMW 478

Query: 844  PPHMPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRF 665
            PPHMPL+R GARP+P M+GFPPVM+GADG  YG VTPDGFPMPDL+ +GPRAF PYGPRF
Sbjct: 479  PPHMPLSR-GARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPYGPRF 537

Query: 664  PGDFTGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 485
            PGDF G  SGMMF GRP+Q                                         
Sbjct: 538  PGDFMGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPCMGGMGVQGTSPARPMRPGAM 597

Query: 484  XXXXXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYH- 308
                          NR  +RDQR  ANDRN+RY AGSDQ +GQ+M+G  GG +++  Y  
Sbjct: 598  PPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGAGSDQVRGQEMSGPAGGPEDDAHYQL 657

Query: 307  ----QEDQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
                +++ Q+G+GN+FRNDESESEDEAPRRSRHG+GKKKRRSSE DA T S+
Sbjct: 658  GAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGKKKRRSSEEDAATGSD 709


>ref|XP_014518648.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vigna radiata var. radiata]
          Length = 696

 Score =  853 bits (2205), Expect = 0.0
 Identities = 443/701 (63%), Positives = 482/701 (68%), Gaps = 12/701 (1%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSA-SVPAIQSXXXXXXXXXXXXXXXXXXXXAG----- 2069
            M+D EG LSFDFEGGLDT PS  +A S P +Q                            
Sbjct: 1    MEDSEGVLSFDFEGGLDTVPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPVPSTADPAAV 60

Query: 2068 NVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 1889
            NV GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCVYKHT
Sbjct: 61   NVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHT 120

Query: 1888 NEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQNR 1712
            NEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPP+EEVLQKIQHL +YNY +SN+FFQ R
Sbjct: 121  NEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQR 180

Query: 1711 NANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPN-MXXXXXXXXXXXXXXXXXXNLPND 1535
             ++Y+QQ E+SQ PQG+N+ +Q    KP   E  N                    N+ N 
Sbjct: 181  GSSYAQQAEKSQLPQGTNSTNQVVTGKPLPAESGNAQPQQQVQQSQQQVSQSQMQNVANG 240

Query: 1534 QQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVI 1355
            Q NQ +++ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+ ENVI
Sbjct: 241  QPNQASRSATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSXENVI 300

Query: 1354 LIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH 1175
            LIFSVNRTRHFQGCAKMTS+IGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH
Sbjct: 301  LIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH 360

Query: 1174 LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVN 995
            LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPD ELMA+SV           KGVN
Sbjct: 361  LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKGVN 420

Query: 994  PDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARGG 815
            PDNG ENPDIVPFEDN             SFG   GP  QGRGRGRGM+WPPHMPL R G
Sbjct: 421  PDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPLGR-G 479

Query: 814  ARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAASG 635
            ARP+PGM+GF PVMMG DG SYG V PDGF MPDL+G+GPRAF PYGPRF GDF G  + 
Sbjct: 480  ARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFGVGPRAFAPYGPRFSGDFGGPPAA 538

Query: 634  MMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 455
            MMF GRPSQ                                                   
Sbjct: 539  MMFRGRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVAGANPARGGRPVNMPPMFPPPPPLP 598

Query: 454  XXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQ----EDQQFG 287
                    +  +    NDR   Y +GS+QGK QDM    G  D++TQY Q       +  
Sbjct: 599  QNTNRLAKRDQRATDRNDR---YGSGSEQGKSQDMLSQSGAPDDDTQYQQGYKANQDEHP 655

Query: 286  SGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
            + NNFRND+SESEDEAPRRSRHGEGKKKRR  E D  T+ N
Sbjct: 656  AVNNFRNDDSESEDEAPRRSRHGEGKKKRRGPE-DVNTNYN 695


>ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Nelumbo nucifera]
          Length = 715

 Score =  853 bits (2204), Expect = 0.0
 Identities = 449/718 (62%), Positives = 491/718 (68%), Gaps = 29/718 (4%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNVQGRR 2051
            M+D EG LSFDFEGGLD GP+NP+ S P I +                    AG   GRR
Sbjct: 1    MEDPEGVLSFDFEGGLDNGPTNPTPSAPLIPADSSIAAAANSAVAPAVVEPVAGGHAGRR 60

Query: 2050 SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 1871
            SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR+YGECREQDCVYKHTNEDIKE
Sbjct: 61   SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTNEDIKE 120

Query: 1870 CNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQNRNANYSQ 1694
            CNMYK GFCPNGPDCRYRHAK PGPPPP+EEV QKIQHL ++NYG+SNRFFQ R  +Y  
Sbjct: 121  CNMYKFGFCPNGPDCRYRHAKQPGPPPPVEEVFQKIQHLGSFNYGSSNRFFQQRIGSYVP 180

Query: 1693 QTERSQFPQGSNAHSQAAPAKPSTT-EPPNMXXXXXXXXXXXXXXXXXXN---LPNDQQ- 1529
            Q+ERSQFPQGS+  +Q   +KPST  E PN+                  N   + N Q  
Sbjct: 181  QSERSQFPQGSSNVNQGIASKPSTAAESPNVQQQQQQSQIQQPQQQQQVNQTQMQNPQNG 240

Query: 1528 --NQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVI 1355
              NQ ++ ATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+VENVI
Sbjct: 241  LPNQASRTATPLPQGSSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVI 300

Query: 1354 LIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH 1175
            LIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH
Sbjct: 301  LIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRH 360

Query: 1174 LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVN 995
            LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISV           KGVN
Sbjct: 361  LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKGVN 420

Query: 994  PDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARGG 815
            PD G +N DIVPFEDN             SFGQA    AQGRGRGRG++WPPHMPLARGG
Sbjct: 421  PDEGADNHDIVPFEDNEDEEEEESEEEDESFGQAIN-AAQGRGRGRGVMWPPHMPLARGG 479

Query: 814  ARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAAS- 638
             RP+PG+RGFPPVMMGADGFSYGAVTPDGF MPDL+G+ PRAF PYGPRF GDFTG    
Sbjct: 480  -RPIPGIRGFPPVMMGADGFSYGAVTPDGFSMPDLFGIAPRAFAPYGPRFSGDFTGLGQS 538

Query: 637  ---------------GMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 503
                           GM+FHGRPSQ                                   
Sbjct: 539  AAMGFNPIDGTGPTPGMVFHGRPSQPGAVFPPSGLGMMMGPGRAPFMGGMGIGAAPPRAS 598

Query: 502  XXXXXXXXXXXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDE 323
                                  + K  +R    +      +   +G+   M+G  GG ++
Sbjct: 599  RPIGMPPFRPPAPPLPQSSSRVVNKDQRRPTDRNDRYSAGSDQGKGQEMAMSG--GGPED 656

Query: 322  ETQYH-----QEDQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
            E +Y      Q D  F  GN+FRNDESESEDEAPRRSRHGEGKK+RR+ E DA+  S+
Sbjct: 657  EMKYQPGMRTQHDDSFAVGNSFRNDESESEDEAPRRSRHGEGKKRRRALEGDASLVSD 714


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  851 bits (2198), Expect = 0.0
 Identities = 443/702 (63%), Positives = 481/702 (68%), Gaps = 13/702 (1%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSA-SVPAIQ-----SXXXXXXXXXXXXXXXXXXXXAG 2069
            M+D EG LSFDFEGGLDT PS  +A S P +Q     +                    A 
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAV 60

Query: 2068 NVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 1889
            NV GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCVYKHT
Sbjct: 61   NVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHT 120

Query: 1888 NEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQNR 1712
            NEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPP+EEVLQKIQHL +YNY +SN+FFQ R
Sbjct: 121  NEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQR 180

Query: 1711 NANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN--LPN 1538
             ++Y+QQ E+SQ PQG+N+ +Q    KP   E  N                      + N
Sbjct: 181  GSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQIQNVAN 240

Query: 1537 DQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENV 1358
             Q NQ ++AATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+VENV
Sbjct: 241  GQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENV 300

Query: 1357 ILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 1178
            ILIFSVNRTRHFQGCAKMTS+IGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR
Sbjct: 301  ILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 360

Query: 1177 HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGV 998
            HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPD ELMA+SV           KGV
Sbjct: 361  HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKGV 420

Query: 997  NPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARG 818
            NPDNG ENPDIVPFEDN             SFG   GP  QGRGRGRGM+WPPHMPL R 
Sbjct: 421  NPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWPPHMPLPR- 479

Query: 817  GARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAAS 638
            GARP+PGM+GF PVMMG DG SYG V PDGF MPDL+ +GPRAF PYGPRF GDF G  +
Sbjct: 480  GARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFGGPPA 538

Query: 637  GMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 458
             MMF GRPSQ                                                  
Sbjct: 539  AMMFRGRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVAGANPPRGGRPVNMPPMFPPPPPL 598

Query: 457  XXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQ----EDQQF 290
                     +  +    NDR   Y +GS+QGK QDM    G  D++ QY Q         
Sbjct: 599  PQNTNRLAKRDQRTTDRNDR---YGSGSEQGKSQDMLSQSGAPDDDMQYQQGYKANQDDH 655

Query: 289  GSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
             + NNFRND+SESEDEAPRRSRHGEGKKKRR  E D  T+ N
Sbjct: 656  PAVNNFRNDDSESEDEAPRRSRHGEGKKKRRGPE-DVNTNYN 696


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max] gi|947062499|gb|KRH11760.1|
            hypothetical protein GLYMA_15G128500 [Glycine max]
          Length = 691

 Score =  847 bits (2187), Expect = 0.0
 Identities = 437/695 (62%), Positives = 476/695 (68%), Gaps = 17/695 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPA--------IQSXXXXXXXXXXXXXXXXXXXX 2075
            M+D EG LSFDFEGGLD  PS+ +A+VP+          +                    
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 2074 AGNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 1895
             GNV GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 1894 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQ 1718
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPP+EEVLQKIQHL +YNY +SN+FFQ
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180

Query: 1717 NRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPN-MXXXXXXXXXXXXXXXXXXNLP 1541
             R A+Y+QQ E+ Q PQG+N+ +Q    KP   E  N                    N+ 
Sbjct: 181  QRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 1540 NDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVEN 1361
            N Q NQ N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+VEN
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 1360 VILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1181
            VIL+FSVNRTRHFQGCAKMTS+IGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT
Sbjct: 301  VILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1180 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKG 1001
            RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISV           KG
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1000 VNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLAR 821
            VNPDNG ENPDIVPFEDN             SF    GP  QGRGRGRGM+WPPHMPL R
Sbjct: 421  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPHMPLGR 480

Query: 820  GGARPLPGMRGFPPVMMGADGFSY---GAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFT 650
             GARP+PGM+GF PVMMG DG SY   G V PDGF MPDL+G+GPR F PYGPRF GDF 
Sbjct: 481  -GARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFG 538

Query: 649  GAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 470
            G  + MMF GRPSQ                                              
Sbjct: 539  GPPAAMMFRGRPSQPGMFPSGGFGMMMNPGRGPFMGGMGVGGANPPRGGRPVNMPPMFPP 598

Query: 469  XXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQ----E 302
                         +  + A  ND   R+ +GS+QGK QDM    GG D++ QY Q     
Sbjct: 599  PPPLPQNANRAAKRDQRTADRND---RFGSGSEQGKSQDMLSQSGGPDDDAQYQQGYKGN 655

Query: 301  DQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRR 197
                 + NNFRND+SESEDEAPRRSRHGEGKKK +
Sbjct: 656  QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKKHK 690


>ref|XP_011085214.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Sesamum indicum]
          Length = 688

 Score =  842 bits (2174), Expect = 0.0
 Identities = 433/695 (62%), Positives = 479/695 (68%), Gaps = 10/695 (1%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA----GNV 2063
            MDDGEGGLSFDFEGGLDTGP++P+ASVP IQS                            
Sbjct: 1    MDDGEGGLSFDFEGGLDTGPAHPTASVPVIQSSADAKTASAASGNPNNPSAGLVPAAQTA 60

Query: 2062 QG-----RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 1898
            +G     RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY
Sbjct: 61   EGMGGGARRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 120

Query: 1897 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFF 1721
            KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQ L +YN+GN+N+FF
Sbjct: 121  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLTSYNHGNTNKFF 180

Query: 1720 QNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXNLP 1541
            QNRN  Y+QQTE++Q PQG N  +QA    P  +   N                     P
Sbjct: 181  QNRNTTYTQQTEKTQLPQGPNGVNQAGKTNPIESSNINQQAQVQQSQQQGSQGQIQNT-P 239

Query: 1540 NDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVEN 1361
              QQNQ ++ ATPLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKLNEAF++VEN
Sbjct: 240  GGQQNQASRTATPLPQGTSRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFESVEN 299

Query: 1360 VILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1181
            VILIFSVN+TRHFQGCAKMTSKIGGSVGGGNWK+AHGTAHYGRNF+VKWLKLCELSF KT
Sbjct: 300  VILIFSVNKTRHFQGCAKMTSKIGGSVGGGNWKHAHGTAHYGRNFAVKWLKLCELSFDKT 359

Query: 1180 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKG 1001
            RHL+NPYNENLPVKISRDCQELEPS+GEQLASLLYLEPDS+LMA+S+           KG
Sbjct: 360  RHLKNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSDLMAVSLAAELKREEEKAKG 419

Query: 1000 VNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLAR 821
            VN DNGTENPDIVPFEDN             S GQ FG  AQGRGRGRGM+W PHMPLAR
Sbjct: 420  VNLDNGTENPDIVPFEDNEEEEEEESEEEDESPGQVFG--AQGRGRGRGMMWLPHMPLAR 477

Query: 820  GGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAA 641
             G+RP  G+RGFPP MM  DGFSYG V PDGFPMPD +GM PR FGPYGPRF GDF G A
Sbjct: 478  -GSRPFSGIRGFPPNMMSGDGFSYGPVNPDGFPMPDPFGMAPRGFGPYGPRFSGDFAGPA 536

Query: 640  SGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 461
             GMMF GRPS                                                  
Sbjct: 537  PGMMFPGRPSGGFGMMMGPGRAPFMGGMGVGAAAAARAGRTVGMAPFYPPPPPSQQSQNS 596

Query: 460  XXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQQFGSG 281
                   +    D+    +    +  +GS  G G +      G +      Q++  + +G
Sbjct: 597  NRAKRDLKAPFNDKNDGPDQGKGQEISGSSGGHGDE------GRNLPRLKAQQEDHYSAG 650

Query: 280  NNFRNDESESEDEAPRRSRHGEGKKKRRSSEADAT 176
            N++RNDESESEDEAPRRSRHGEGKKKRR+ EAD+T
Sbjct: 651  NSYRNDESESEDEAPRRSRHGEGKKKRRNLEADST 685


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max] gi|947088097|gb|KRH36762.1|
            hypothetical protein GLYMA_09G022200 [Glycine max]
          Length = 681

 Score =  835 bits (2157), Expect = 0.0
 Identities = 432/688 (62%), Positives = 471/688 (68%), Gaps = 10/688 (1%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSAS-----VP---AIQSXXXXXXXXXXXXXXXXXXXX 2075
            M+D EG LSFDFEGGLD  PS+ +A+     +P   +  +                    
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVG 60

Query: 2074 AGNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 1895
             GNV GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 1894 HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQ 1718
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPP+EEVLQKIQHL +YNY +SN+FFQ
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 180

Query: 1717 NRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPN-MXXXXXXXXXXXXXXXXXXNLP 1541
             R A+Y+QQ E+   PQG+N+ +Q     P   E  N                    N+ 
Sbjct: 181  QRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 1540 NDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVEN 1361
            N Q NQ N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+VEN
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 1360 VILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1181
            VILIFSVNRTRHFQGCAKMTSKIGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT
Sbjct: 301  VILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1180 RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKG 1001
            RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISV           KG
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1000 VNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLAR 821
            VNPDNG ENPDIVPFEDN             SFG   GP  QGRGRGRGM+WPPHMPL R
Sbjct: 421  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPLGR 480

Query: 820  GGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAA 641
             GARP+PGM+GF PVMMG DG SYG V PDGF MPDL+G+GPR F PYGPRF GDF G  
Sbjct: 481  -GARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGGPP 538

Query: 640  SGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 461
            + MMF GRPSQ                                                 
Sbjct: 539  AAMMFRGRPSQPGMFPGGGFGMMLNPGRGPFMGGIGVGGANPPRGGRPVNMPPMFPPPPP 598

Query: 460  XXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQQFGSG 281
                      +  + A  ND   R+ +GS+QGK QDM    GG D++ QY    Q +   
Sbjct: 599  LPQNANRAAKRDQRTADRND---RFGSGSEQGKSQDMLSQSGGPDDDPQY---QQGYKGN 652

Query: 280  NNFRNDESESEDEAPRRSRHGEGKKKRR 197
             +   D+SESEDEAPRRSRHGEGKKK +
Sbjct: 653  QDDHPDDSESEDEAPRRSRHGEGKKKHK 680


>ref|XP_010687042.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Beta vulgaris subsp. vulgaris]
            gi|870851989|gb|KMT03954.1| hypothetical protein
            BVRB_8g187630 [Beta vulgaris subsp. vulgaris]
          Length = 680

 Score =  822 bits (2122), Expect = 0.0
 Identities = 426/685 (62%), Positives = 471/685 (68%), Gaps = 2/685 (0%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNVQGRR 2051
            M+D EGGLSFDFEG LD  P+ P+AS P IQ                     +G    RR
Sbjct: 1    MEDTEGGLSFDFEGNLDAAPNIPTASNPVIQPDPNAAPSSGPAAPSPPADPASGQ-GNRR 59

Query: 2050 SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 1871
            SFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE
Sbjct: 60   SFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 119

Query: 1870 CNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQNRNANYSQ 1694
            CNMYKLGFCPNGPDCRYRHAK PGPPPP++EVLQKIQ L +Y+YG SNRFFQ RN NYSQ
Sbjct: 120  CNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYSYGASNRFFQQRNTNYSQ 179

Query: 1693 QTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXNLPNDQQNQVNK 1514
            Q +RSQFPQG+N+ +Q A  KP+ TE  NM                   LP++  NQ  +
Sbjct: 180  QADRSQFPQGANSTNQGAVPKPTATESSNMQQQLQQLPLAGQDQLQN--LPSNPSNQTGR 237

Query: 1513 AATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVILIFSVNR 1334
             ATPLPQG++RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+VE+VIL+FSVNR
Sbjct: 238  IATPLPQGLTRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILVFSVNR 297

Query: 1333 TRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNE 1154
            TRHFQGCAKMTSKIG + GGGNWK+AHGTAHYGRNFSVKWLKLCEL+F+KTRHLRNPYNE
Sbjct: 298  TRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVKWLKLCELTFNKTRHLRNPYNE 357

Query: 1153 NLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVNPDNGTEN 974
            NLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA              KGV+ +NG EN
Sbjct: 358  NLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLSAAESKREEEKAKGVDIENGAEN 417

Query: 973  PDIVPFEDNXXXXXXXXXXXXXS-FGQAFGPTAQGRGRGRGMIWPPHMPLARGGARPLPG 797
            PDIVPF+DN               FGQA G   QGRGRGRGM+WPP+ P+ RG  RP+ G
Sbjct: 418  PDIVPFDDNEEEEEEEESEEEDESFGQAPGLAIQGRGRGRGMMWPPNFPMGRG-VRPMQG 476

Query: 796  MRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAASGMMFHGR 617
            MR FPP MMG DGF+YG   PDGFPMPD +GM PR F PYGPRF GDFT  A GMMF   
Sbjct: 477  MRAFPPGMMGVDGFTYGP-GPDGFPMPDPFGMAPRPFMPYGPRFSGDFTSPAPGMMF--- 532

Query: 616  PSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNR 437
            P +                                                       NR
Sbjct: 533  PGRPSQPGGVLPGGGFGMMMGPGRAPFMPGGMGMGGRGGRPMGMPPIFPPQPGPPQGGNR 592

Query: 436  MVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQQFGSGNNFRNDES 257
              KRD R   ND  + + +G +QGK QDM G  GG   + QY Q  ++  S NN  NDES
Sbjct: 593  GPKRDLRGPGNDWGETFGSGPEQGKLQDMGGGRGG---DPQYQQGTEKIVSCNNVTNDES 649

Query: 256  ESEDEAPRRSRHGEGKKKRRSSEAD 182
            ESEDEAPRRSRHGEGKKKRRS + D
Sbjct: 650  ESEDEAPRRSRHGEGKKKRRSLDGD 674


>ref|XP_009628296.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Nicotiana tomentosiformis]
          Length = 691

 Score =  822 bits (2122), Expect = 0.0
 Identities = 430/705 (60%), Positives = 472/705 (66%), Gaps = 15/705 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAI---------QSXXXXXXXXXXXXXXXXXXX 2078
            MD+GEGGLSFDFEGGLDTGP++P+ASVP +          +                   
Sbjct: 1    MDEGEGGLSFDFEGGLDTGPTHPTASVPVMTQSSDHNIAAAAAPNANINQPPTVSAHVGG 60

Query: 2077 XAGNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 1898
              G V  RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVY
Sbjct: 61   DVGFVGNRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVY 120

Query: 1897 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNRFF 1721
            KHT EDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQHLA  NYG SNRF+
Sbjct: 121  KHTIEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLASNNYGYSNRFY 180

Query: 1720 QNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXNL- 1544
            QNRNANYS Q E+SQ  QG N    A   K +  E P +                     
Sbjct: 181  QNRNANYSTQAEKSQASQGQNGMGLA--VKSTAAETPIIQQIQPHQQQVLQTQQQGGPTQ 238

Query: 1543 ----PNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1376
                PN QQNQ ++ A  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 239  TQIHPNGQQNQTDRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 298

Query: 1375 DTVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 1196
            D+VENVILIFSVNRTRHFQGCAKMTS+IGG+  GGNWK+ HGTAHYGRNFSVKWLKLCEL
Sbjct: 299  DSVENVILIFSVNRTRHFQGCAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCEL 358

Query: 1195 SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXX 1016
            SF KT HLRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAIS+        
Sbjct: 359  SFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQE 418

Query: 1015 XXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPH 836
               KGVNPDNG +NPDIVPFEDN             SF Q FGP A GRGRGRG++WPP 
Sbjct: 419  EKAKGVNPDNGNDNPDIVPFEDNEEEEDEESEDEDESFDQGFGPAALGRGRGRGIVWPPI 478

Query: 835  MPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGD 656
            MPL   G RPLPGMRGFPP MMG DGFSYGA+TPDGFPMPD +GMGPR FGPYGPRF  D
Sbjct: 479  MPLGH-GPRPLPGMRGFPPGMMG-DGFSYGAMTPDGFPMPDHFGMGPRPFGPYGPRFSND 536

Query: 655  FTGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 476
                   MMFHGRP                                              
Sbjct: 537  -------MMFHGRPPAGGFGMMMGPGRPPFMGGMGPGATGPPRAGRAVGMHPSFVPPSSQ 589

Query: 475  XXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQ 296
                        R    D+    +  +D+   G  Q     + G  G    +    ++D 
Sbjct: 590  PSQNPYRPKREQRAPVHDRNDRFSSGSDQ---GKGQEMAGSVGGPDGVNYPQRGKPEQDA 646

Query: 295  QFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSNQ 161
            QFG+GN+F+NDESESEDEAPRRSRHG+GKKKRR ++ DA T+S +
Sbjct: 647  QFGAGNSFKNDESESEDEAPRRSRHGDGKKKRRDTDDDAATASEK 691


>ref|XP_009789024.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Nicotiana sylvestris]
            gi|698484435|ref|XP_009789025.1| PREDICTED: cleavage and
            polyadenylation specificity factor CPSF30-like [Nicotiana
            sylvestris]
          Length = 690

 Score =  819 bits (2115), Expect = 0.0
 Identities = 428/705 (60%), Positives = 470/705 (66%), Gaps = 15/705 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAI---------QSXXXXXXXXXXXXXXXXXXX 2078
            MD+GEGGLSFDFEGGLDTGP++P+ASVP +          +                   
Sbjct: 1    MDEGEGGLSFDFEGGLDTGPTHPTASVPVMTQSSDHNIAAAAAPNANINQPPTVSAHVGG 60

Query: 2077 XAGNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 1898
              G V  RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVY
Sbjct: 61   DVGFVGNRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVY 120

Query: 1897 KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNRFF 1721
            KHT EDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQHLA  NYG SNRF+
Sbjct: 121  KHTIEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLASNNYGYSNRFY 180

Query: 1720 QNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXNL- 1544
            QNRNANYS Q ++ Q  QG N        K + TE P +                     
Sbjct: 181  QNRNANYSTQADKPQASQGQNG---MGAVKSTATETPIIQQIQPHQQQALQTQQQGGTTQ 237

Query: 1543 ----PNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1376
                PN QQNQ ++ A  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 238  TQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 297

Query: 1375 DTVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 1196
            D+VENVILIFSVNRTRHFQGCAKMTS+IGG+  GGNWK+ HGTAHYGRNFSVKWLKLCEL
Sbjct: 298  DSVENVILIFSVNRTRHFQGCAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCEL 357

Query: 1195 SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXX 1016
            SF KT HLRNPYNENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMAIS+        
Sbjct: 358  SFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQE 417

Query: 1015 XXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPH 836
               KGVNPDNG +NPDIVPFEDN             SF Q FGP A GRGRGRG++WPP 
Sbjct: 418  EKAKGVNPDNGNDNPDIVPFEDNEEEEDEESEDEDESFDQGFGPAALGRGRGRGIVWPPI 477

Query: 835  MPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGD 656
            MPL   G RPLPGMRGFPP MMG DGFSYGA+TPDGFPMPD +GMGPR FGPYGPRF  D
Sbjct: 478  MPLGH-GPRPLPGMRGFPPGMMG-DGFSYGAMTPDGFPMPDHFGMGPRPFGPYGPRFSND 535

Query: 655  FTGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 476
                   MMFHGRP                                              
Sbjct: 536  -------MMFHGRPPAGGFGMMMGPGRPPFMGGMGPGATGPPRAGRAVGMHPSFVPPSSQ 588

Query: 475  XXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQ 296
                        R    D+    +  +D+   G  Q     + G  G    +    ++D 
Sbjct: 589  PSQNPYRPKREQRAPVHDRNDRFSSGSDQ---GKGQEMAGSVGGPDGVNYPQRGKTEQDA 645

Query: 295  QFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSNQ 161
            QFG+GN F+NDESESEDEAPRRSRHG+GKKKRR ++ DA T+S +
Sbjct: 646  QFGAGNGFKNDESESEDEAPRRSRHGDGKKKRRDTDDDAATASEK 690


>ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vitis vinifera]
          Length = 673

 Score =  816 bits (2108), Expect = 0.0
 Identities = 407/539 (75%), Positives = 427/539 (79%), Gaps = 1/539 (0%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNVQGRR 2051
            M+D EG LSFDFEGGLD  P   +   P IQS                     G   GRR
Sbjct: 1    MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAAPSSVVSAEPTP--GGAPGRR 58

Query: 2050 SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 1871
            SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE
Sbjct: 59   SFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 118

Query: 1870 CNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHLA-YNYGNSNRFFQNRNANYSQ 1694
            CNMYKLGFCPNG DCRYRHAKLPGPPP +EEV QKIQ L+ +NYG+SNRF+QNRN  Y+Q
Sbjct: 119  CNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNP-YNQ 177

Query: 1693 QTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXNLPNDQQNQVNK 1514
            QTE+SQ  QGSNA +    AK STTE  N+                  NLPN   NQ NK
Sbjct: 178  QTEKSQILQGSNAVNLGTVAKSSTTEAINVQQQQVQPPQQQVSQTPMQNLPNGLPNQANK 237

Query: 1513 AATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVILIFSVNR 1334
             A+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+VENVILIFSVNR
Sbjct: 238  TASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFSVNR 297

Query: 1333 TRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNE 1154
            TRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNE
Sbjct: 298  TRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNE 357

Query: 1153 NLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVNPDNGTEN 974
            NLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+           KGVNPDNG EN
Sbjct: 358  NLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGVNPDNGGEN 417

Query: 973  PDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARGGARPLPGM 794
            PDIVPFEDN             SFGQA GP AQGRGRGRG++WPPHMPLAR GARP+P M
Sbjct: 418  PDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLAR-GARPIPSM 476

Query: 793  RGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAASGMMFHGR 617
            RGFPPVMMGADGFSY AV PDGF MPD++G+GPRAF PYGPRF GDFTG ASGMMF GR
Sbjct: 477  RGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDFTGPASGMMFPGR 535



 Score =  121 bits (303), Expect = 3e-24
 Identities = 61/82 (74%), Positives = 65/82 (79%), Gaps = 5/82 (6%)
 Frame = -1

Query: 430 KRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQY-----HQEDQQFGSGNNFRN 266
           KRDQR   NDRNDRYS GSDQG+GQDMAG     D+ETQY      Q+D QFG GN+FRN
Sbjct: 596 KRDQRTPVNDRNDRYSGGSDQGRGQDMAGP----DDETQYLQGLKSQQDDQFGGGNSFRN 651

Query: 265 DESESEDEAPRRSRHGEGKKKR 200
           DESESEDEAPRRSRHGEGKKKR
Sbjct: 652 DESESEDEAPRRSRHGEGKKKR 673


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  801 bits (2068), Expect = 0.0
 Identities = 402/561 (71%), Positives = 426/561 (75%), Gaps = 20/561 (3%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDT-GPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA------ 2072
            MDD +GGLSFDFEGGLD+ GP+NP+AS+PAI S                           
Sbjct: 1    MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAA 60

Query: 2071 ----GNVQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 1904
                 N  GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC
Sbjct: 61   AAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 120

Query: 1903 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNR 1727
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPP+EEVLQKIQ L +YNYG+SN+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSNK 180

Query: 1726 FFQNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN 1547
            FFQ R A + Q  ++SQF QG N   Q   AKP  TE  N+                   
Sbjct: 181  FFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQQQ 240

Query: 1546 --------LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1391
                    LPN Q NQ N+ A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK
Sbjct: 241  ATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 300

Query: 1390 LNEAFDTVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWL 1211
            LNEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIG SVGGGNWKYAHGTAHYGRNFSVKWL
Sbjct: 301  LNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWL 360

Query: 1210 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXX 1031
            KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPS+G QLA LLY EPDSELMAIS+   
Sbjct: 361  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLAAE 420

Query: 1030 XXXXXXXXKGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGM 851
                    KGVNP+NG +NPDIVPFEDN             SFGQA G   QGRGRGRG+
Sbjct: 421  AKREEEKAKGVNPENGGDNPDIVPFEDNEEEEEEESEEEEESFGQALGAPGQGRGRGRGI 480

Query: 850  IWPPHMPLARGGARPLPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGP 671
            IW PHMPLAR GARP+PGMRGFPP+MMGAD FSYG VTPDGF MPDL+G+ PR F PY P
Sbjct: 481  IW-PHMPLAR-GARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYAP 538

Query: 670  RFPGDFTGAASGMMFHGRPSQ 608
            RF GDFTGAASGMMF GRP Q
Sbjct: 539  RFSGDFTGAASGMMFPGRPPQ 559



 Score =  111 bits (277), Expect = 4e-21
 Identities = 61/98 (62%), Positives = 69/98 (70%), Gaps = 6/98 (6%)
 Frame = -1

Query: 439 RMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQE------DQQFGSGN 278
           R VKRDQR  ANDR   YS GSDQG+      + G  D+E +Y QE      + QFG+GN
Sbjct: 612 RPVKRDQRMTANDR---YSTGSDQGRN-----TAGEPDDEARYQQEGLKASHEDQFGAGN 663

Query: 277 NFRNDESESEDEAPRRSRHGEGKKKRRSSEADATTSSN 164
           +FRNDESESEDEAPRRSRHGEGKKKRR SE DAT  S+
Sbjct: 664 SFRNDESESEDEAPRRSRHGEGKKKRRGSEGDATPGSD 701


>ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Cicer arietinum]
          Length = 677

 Score =  792 bits (2045), Expect = 0.0
 Identities = 396/546 (72%), Positives = 422/546 (77%), Gaps = 5/546 (0%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGP-SNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXA--GNVQ 2060
            M+D EG LSFDFEGGLD  P S  + SVPA  S                    A  GN+ 
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAAVSGNIP 60

Query: 2059 GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNED 1880
            GRRSFRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMPVCRFFRLYGECREQDCVYKHTNED
Sbjct: 61   GRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTNED 120

Query: 1879 IKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNRFFQNRNAN 1703
            IKECNMYKLGFCPNGPDCRYRHAK PGPPPPIEEVLQKIQHL +YN+ NS++F Q R ++
Sbjct: 121  IKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRGSS 180

Query: 1702 YSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN-LPNDQQN 1526
            Y+QQ E+SQFPQG N+ +Q    KP   E  N+                    L N Q N
Sbjct: 181  YTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQTQNLANGQPN 240

Query: 1525 QVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTVENVILIF 1346
            Q N+ ATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+VENVILIF
Sbjct: 241  QANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENVILIF 300

Query: 1345 SVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN 1166
            SVNRTRHFQGCAKMTS+IGGSV GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN
Sbjct: 301  SVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN 360

Query: 1165 PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXXKGVNPDN 986
            PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS+           KGVNPDN
Sbjct: 361  PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKGVNPDN 420

Query: 985  GTENPDIVPFEDNXXXXXXXXXXXXXSFGQAFGPTAQGRGRGRGMIWPPHMPLARGGARP 806
              ENPDIVPFEDN             SF QA  P  QGRGRGRGM+WPPHMPL R GARP
Sbjct: 421  AGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPLGR-GARP 479

Query: 805  LPGMRGFPPVMMGADGFSYGAVTPDGFPMPDLYGMGPRAFGPYGPRFPGDFTGAASGMMF 626
            +PGM+GF PVMMG DG SYG   PDGF MPDL+GMGPR FGPYGPRF GDF G  + MMF
Sbjct: 480  MPGMQGFNPVMMG-DGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFAGPPAAMMF 538

Query: 625  HGRPSQ 608
             GRPSQ
Sbjct: 539  RGRPSQ 544



 Score =  103 bits (257), Expect = 7e-19
 Identities = 51/80 (63%), Positives = 59/80 (73%)
 Frame = -1

Query: 439 RMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQYHQEDQQFGSGNNFRNDE 260
           R+ KRDQR   NDRNDRYS+G +QGK QDM    GG D+E QY Q        NNFRN++
Sbjct: 600 RIAKRDQRT--NDRNDRYSSGQEQGKSQDMLSQSGGPDDEMQYQQSG---APANNFRNED 654

Query: 259 SESEDEAPRRSRHGEGKKKR 200
           SESEDEAPRRSRHGEGKK++
Sbjct: 655 SESEDEAPRRSRHGEGKKRK 674


>ref|XP_012830213.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Erythranthe guttatus] gi|604344484|gb|EYU43238.1|
            hypothetical protein MIMGU_mgv1a002387mg [Erythranthe
            guttata]
          Length = 681

 Score =  783 bits (2022), Expect = 0.0
 Identities = 422/703 (60%), Positives = 463/703 (65%), Gaps = 20/703 (2%)
 Frame = -1

Query: 2230 MDDGEGGLSFDFEGGLDTGPSNPSASVPAIQSXXXXXXXXXXXXXXXXXXXXAGNVQ--- 2060
            MDDGEGGLSFDFEGGLD GPS+P+ASVP IQS                    A  V    
Sbjct: 1    MDDGEGGLSFDFEGGLDIGPSHPTASVPVIQSSANANTASAAAAAANPYNPSAAPVPATQ 60

Query: 2059 --------GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 1904
                    GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFRLYGECREQDC
Sbjct: 61   AAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDC 120

Query: 1903 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPIEEVLQKIQHL-AYNYGNSNR 1727
            VYKHTNED+KECNMYKLGFCPNGPDCRYRHAKLPGPPP +EEVLQKIQ L +YNYG SN 
Sbjct: 121  VYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSNN 180

Query: 1726 FFQNRNANYSQQTERSQFPQGSNAHSQAAPAKPSTTEPPNMXXXXXXXXXXXXXXXXXXN 1547
            FFQNRN+N++QQTE+ QFPQG N   Q    K +  EP N+                   
Sbjct: 181  FFQNRNSNFAQQTEKPQFPQGPNGTHQVG--KTNAAEPGNLNQPAQQSQQPGSQGQLQS- 237

Query: 1546 LPNDQQNQVNKAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTV 1367
            +PNDQQNQ ++ ATPLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKLNEAF++V
Sbjct: 238  IPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFESV 297

Query: 1366 ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1187
            EN+ILIFSVN+TRHFQGCAKMTS+IGGSVGGGNWK+AHGTAHYGRNF++KWLKLCEL+F 
Sbjct: 298  ENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLKLCELTFD 357

Query: 1186 KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1007
            KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDS+LMAI++           
Sbjct: 358  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAELKREEEKA 417

Query: 1006 KGVNPDNGTENPDIVPFEDNXXXXXXXXXXXXXSF-----GQAFGPTAQGRGRGRGMIWP 842
            KGVN DNG ENPDIVPFEDN                    GQAFG  AQGRG GRGM+W 
Sbjct: 418  KGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFG--AQGRGVGRGMMWG 475

Query: 841  PHMPLARGGARPLPGMRGFPPVMMGADGFSYGAVTP---DGFPMPDLYGMGPRAFGPYGP 671
            PHMP    G RP PG+RGFPP MMG DGF YG   P   DGFPM D +GM         P
Sbjct: 476  PHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMHDPFGMV--------P 527

Query: 670  RFPGDFTGAASGMMFHGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 491
            R  G F G   G  F G  S                                        
Sbjct: 528  RGFGQF-GPRFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPF 586

Query: 490  XXXXXXXXXXXXXXXXNRMVKRDQRAAANDRNDRYSAGSDQGKGQDMAGSGGGLDEETQY 311
                            +  VKRDQ+A  +DRND     SDQGKGQ++           + 
Sbjct: 587  FPPPPPPVAAQPPPQNSNWVKRDQKAPYSDRNDV----SDQGKGQEIVSGSSNRGNAAKR 642

Query: 310  HQEDQQFGSGNNFRNDESESEDEAPRRSRHGEGKKKRRSSEAD 182
             +         ++RNDESESEDEAPRRSRHGEGKKKRR SEA+
Sbjct: 643  EE---------SYRNDESESEDEAPRRSRHGEGKKKRRGSEAE 676

