BLASTX nr result

ID: Catharanthus22_contig00012232 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00012232
         (2511 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOX96971.1| Cleavage and polyadenylation specificity factor 3...   728   0.0  
ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   724   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   706   0.0  
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   654   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   614   e-173
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   610   e-172
gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus...   609   e-171
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   607   e-171
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   606   e-170
ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec...   603   e-170
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   602   e-169
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   600   e-169
ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec...   599   e-168
ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation spec...   599   e-168
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   587   e-165
gb|EPS64393.1| hypothetical protein M569_10389, partial [Genlise...   582   e-163
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   580   e-162
gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus pe...   580   e-162
emb|CBI30994.3| unnamed protein product [Vitis vinifera]              570   e-161
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   571   e-160

>gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  728 bits (1880), Expect = 0.0
 Identities = 412/720 (57%), Positives = 447/720 (62%), Gaps = 25/720 (3%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSA--------NAAPGAPAITXXXXX 2250
            MDD EGGLSFDFEGGLD GP  PTAS PV+    S+A        +A PGA   +     
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 2249 XXXXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR 2070
                               RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR
Sbjct: 61   AAVGGGGAG----------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR 110

Query: 2069 LYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQL 1890
            L+GECREQDCVYKHTNEDIKECNMYKLGFCPNG DCRYRHAKL     PVEEVLQKIQQL
Sbjct: 111  LFGECREQDCVYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQL 170

Query: 1889 NSYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQVS--KPAVVESXXXXXXXXXXXX 1716
            +SYNY   N+F+Q RN  ++ Q EKSQ PQ  N  NQ +  KP+  ES            
Sbjct: 171  SSYNY---NKFFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQP 227

Query: 1715 XXXXXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRS 1536
                       N+ NGQ NQAN+TA PLPQG SRYFIVKSCNRENLELSVQQGVWATQRS
Sbjct: 228  QQQVSQTQIQ-NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRS 286

Query: 1535 NEAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFS 1356
            NEAKLNEAFDS ENVILIFS+NRTRHFQGCAKMTS+IGGSVA GNWK+AHGT HYGRNFS
Sbjct: 287  NEAKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFS 346

Query: 1355 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIW 1176
            VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI 
Sbjct: 347  VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS 406

Query: 1175 IAAXXXXXXXXXKGLNTDNRSENPDIVLFXXXXXXXXXXXXXXXXSXXXXXXXXXXXXXX 996
            +AA         KG+N+DN  ENPDIV F                               
Sbjct: 407  VAAELKREEEKAKGVNSDNGGENPDIVPF----EDNEEEEEEESEEEDESFSAAAQGRGR 462

Query: 995  XXXXXXXXXMPLAXXXXXXXXXXXXXXXXXXXXXXXXXXVTPDGFPMPDLFGMASRPFGP 816
                     MPLA                          VTPDGF +PDLFG A RPF P
Sbjct: 463  GRGVMWPPHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPP 521

Query: 815  YGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXGVSPGSQ-TRPV 639
            YGPRFSGDF GP  GM+FPGRP Q                            G+   R  
Sbjct: 522  YGPRFSGDFTGPASGMMFPGRPPQPGAMFPAGGLGMMMGPGRAPFMGGMGPTGANPVRGG 581

Query: 638  RPIGTXXXXXXXXXXXXXXSNR-TKRDQKASATERNDRFSSGSDQGKGQEFAGSVG---- 474
            RP+                S R  KRDQ+   T  NDR+ +GS+QG+GQE AG  G    
Sbjct: 582  RPVSMPPMFPPPPAPSSQNSGRAVKRDQR---TPTNDRYGAGSEQGRGQEMAGPGGRLDD 638

Query: 473  ---------GPQEADHLAAGKTFRNDESDSEDEAPRRSRHGEGKKKRRSVEGDSTTNASD 321
                          D  AAG +FRNDES+SEDEAPRRSR+GEGKKKRRS+EGD   N SD
Sbjct: 639  ETQYQQEGQKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGDD-ANGSD 697


>ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Vitis vinifera]
          Length = 673

 Score =  724 bits (1869), Expect = 0.0
 Identities = 396/692 (57%), Positives = 433/692 (62%), Gaps = 10/692 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXX 2226
            M+D EG LSFDFEGGLD  P       P+IQ   ++A AAP +                 
Sbjct: 1    MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAAPSSVVSAEPTPGGAPG---- 56

Query: 2225 XXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 2046
                       RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ
Sbjct: 57   -----------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 105

Query: 2045 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNT 1866
            DCVYKHTNEDIKECNMYKLGFCPNG DCRYRHAKL      +EEV QKIQQL+S+NYG++
Sbjct: 106  DCVYKHTNEDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSS 165

Query: 1865 NRFYQNRNPNYSHQAEKSQFPQMPNASN--QVSKPAVVESXXXXXXXXXXXXXXXXXXXX 1692
            NRFYQNRNP Y+ Q EKSQ  Q  NA N   V+K +  E+                    
Sbjct: 166  NRFYQNRNP-YNQQTEKSQILQGSNAVNLGTVAKSSTTEAINVQQQQVQPPQQQVSQTPM 224

Query: 1691 XXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1512
               NL NG  NQAN+TA+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA
Sbjct: 225  Q--NLPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 282

Query: 1511 FDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCE 1332
            FDSVENVILIFS+NRTRHFQGCAKMTS+IGG V  GNWK+AHGT HYGRNFSVKWLKLCE
Sbjct: 283  FDSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 342

Query: 1331 LSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXX 1152
            LSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI +AA     
Sbjct: 343  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKRE 402

Query: 1151 XXXXKGLNTDNRSENPDIVLFXXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXX 972
                KG+N DN  ENPDIV F                S                      
Sbjct: 403  EEKAKGVNPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPP 462

Query: 971  XMPLAXXXXXXXXXXXXXXXXXXXXXXXXXXVTPDGFPMPDLFGMASRPFGPYGPRFSGD 792
             MPLA                          V PDGF MPD+FG+  R F PYGPRFSGD
Sbjct: 463  HMPLARGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGD 522

Query: 791  FAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXGVSPGSQTRPVRPIGTXXXX 612
            F GP  GM+FPGR                           GV   + TR  RP+G     
Sbjct: 523  FTGPASGMMFPGRGQPGAVFPASGYGMMMGPGRAPFMGGMGVPAAAPTRAGRPVG-MPPM 581

Query: 611  XXXXXXXXXXSNRTKRDQKASATERNDRFSSGSDQGKGQEFAGSVG--------GPQEAD 456
                      +NRTKRDQ+    +RNDR+S GSDQG+GQ+ AG             Q+ D
Sbjct: 582  FPPPPPPNSQNNRTKRDQRTPVNDRNDRYSGGSDQGRGQDMAGPDDETQYLQGLKSQQDD 641

Query: 455  HLAAGKTFRNDESDSEDEAPRRSRHGEGKKKR 360
                G +FRNDES+SEDEAPRRSRHGEGKKKR
Sbjct: 642  QFGGGNSFRNDESESEDEAPRRSRHGEGKKKR 673


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  706 bits (1821), Expect = 0.0
 Identities = 403/713 (56%), Positives = 438/713 (61%), Gaps = 23/713 (3%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDV-GPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXX 2229
            MDD +GGLSFDFEGGLD  GPT+PTAS P I   +++A AA    +I             
Sbjct: 1    MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAA 60

Query: 2228 XXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 2049
                        RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE
Sbjct: 61   AAAANNQAG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 117

Query: 2048 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGN 1869
            QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQQLNSYNYG+
Sbjct: 118  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGS 177

Query: 1868 TNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXXXXX 1695
            +N+F+Q R   +   A+KSQF Q PN   Q   +KP   ES                   
Sbjct: 178  SNKFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQS 237

Query: 1694 XXXP------NLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSN 1533
                      NL NGQ NQANRTA PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSN
Sbjct: 238  QQQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSN 297

Query: 1532 EAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSV 1353
            EAKLNEAFDS ENVILIFS+NRTRHFQGCAKMTS+IG SV  GNWK+AHGT HYGRNFSV
Sbjct: 298  EAKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSV 357

Query: 1352 KWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWI 1173
            KWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE SVG QLA LLY EPDSELMAI +
Sbjct: 358  KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISL 417

Query: 1172 AAXXXXXXXXXKGLNTDNRSENPDIVLFXXXXXXXXXXXXXXXXSXXXXXXXXXXXXXXX 993
            AA         KG+N +N  +NPDIV F                                
Sbjct: 418  AAEAKREEEKAKGVNPENGGDNPDIVPF-EDNEEEEEEESEEEEESFGQALGAPGQGRGR 476

Query: 992  XXXXXXXXMPLAXXXXXXXXXXXXXXXXXXXXXXXXXXVTPDGFPMPDLFGMASRPFGPY 813
                    MPLA                          VTPDGF MPDLFG+A R F PY
Sbjct: 477  GRGIIWPHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPY 536

Query: 812  GPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXGVSPGSQTRPVRP 633
             PRFSGDF G   GM+FPGRP Q                        G+ P S T P+R 
Sbjct: 537  APRFSGDFTGAASGMMFPGRPPQ-PGGVFPNGGFGMMMGPGRAPFMGGMGPNS-TNPLR- 593

Query: 632  IGTXXXXXXXXXXXXXXSNR-TKRDQKASATERNDRFSSGSDQGKGQEFAGSVGGPQE-- 462
             G                 R  KRDQ+ +A   NDR+S+GSDQG+      + G P +  
Sbjct: 594  -GNWPGGMPFPPLPTPSPQRPVKRDQRMTA---NDRYSTGSDQGR-----NTAGEPDDEA 644

Query: 461  -----------ADHLAAGKTFRNDESDSEDEAPRRSRHGEGKKKRRSVEGDST 336
                        D   AG +FRNDES+SEDEAPRRSRHGEGKKKRR  EGD+T
Sbjct: 645  RYQQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEGDAT 697


>ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa]
            gi|550349048|gb|EEE85138.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 669

 Score =  654 bits (1688), Expect = 0.0
 Identities = 381/713 (53%), Positives = 413/713 (57%), Gaps = 18/713 (2%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXX 2226
            M+D EG LSFDFEGGLD GP +P AS P I   D+   A   AP  T             
Sbjct: 1    MEDSEGVLSFDFEGGLDSGPANPIASIPAIPS-DNYGAATAAAPNTTNTTTNTTNNSNSG 59

Query: 2225 XXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 2046
                       RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ
Sbjct: 60   AADIQAG----RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 115

Query: 2045 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNT 1866
            DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEV+QKIQQLNSYN   +
Sbjct: 116  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTS 175

Query: 1865 NRFYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXXXXXXXXX 1686
            N+ +Q RN  +S Q EKS         N + KP+  ES                      
Sbjct: 176  NKNFQQRNAGFSQQIEKSP--------NTIIKPSGTESANVQQQQQQQQQTQTP------ 221

Query: 1685 PNLLNGQQNQA------NRTATPLPQGTSR-----------YFIVKSCNRENLELSVQQG 1557
             +L NGQ  Q       NR ATPLPQG S            YFIVKSCNRENLELSVQQG
Sbjct: 222  -HLTNGQHQQPQQPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQG 280

Query: 1556 VWATQRSNEAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTG 1377
            VWATQRSNE KLNEA DS +NVILIFS+NRTRHFQGCAKM S+IG SV  GNWK+AHGT 
Sbjct: 281  VWATQRSNEIKLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTA 340

Query: 1376 HYGRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPD 1197
            HYGRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELE S+GEQLASLLYLEPD
Sbjct: 341  HYGRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPD 400

Query: 1196 SELMAIWIAAXXXXXXXXXKGLNTDNRSENPDIVLFXXXXXXXXXXXXXXXXSXXXXXXX 1017
            SELMA+ +AA         KG+N D+  ENPDIV F                S       
Sbjct: 401  SELMAVSLAAEAKREEEKEKGVNPDSGGENPDIVPFEDNEEEEEEESEEEEESFGQPLGP 460

Query: 1016 XXXXXXXXXXXXXXXXMPLAXXXXXXXXXXXXXXXXXXXXXXXXXXVTPDGFPMPDLFGM 837
                             P+A                          VTPD F MPDLFG+
Sbjct: 461  AAQGRGRGRGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGMPDLFGV 520

Query: 836  ASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQ-XXXXXXXXXXXXXXXXXXXXXXXXGVSP 660
            ASR F PYGPRFSGDF G   GM+FPGRPSQ                         G +P
Sbjct: 521  ASRGFPPYGPRFSGDFTGAASGMMFPGRPSQPGAVFPAGGFGMMMGPGRPPFIGGMGPTP 580

Query: 659  GSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGSDQGKGQEFAGS 480
             +  R  RP G               S   KRDQ+A+A +RNDR                
Sbjct: 581  SNLLRGPRPGG--MFAPFPAPSSQNNSRSVKRDQRAAANDRNDRH--------------- 623

Query: 479  VGGPQEADHLAAGKTFRNDESDSEDEAPRRSRHGEGKKKRRSVEGDSTTNASD 321
                   +   A  + RNDES+SEDEAPRRSRHGEGKKKRR   GD  T  S+
Sbjct: 624  -------NQFGAVNSIRNDESESEDEAPRRSRHGEGKKKRRG-SGDDATPGSE 668


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  614 bits (1584), Expect = e-173
 Identities = 310/441 (70%), Positives = 332/441 (75%), Gaps = 2/441 (0%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXX 2226
            M+D EGGLSFDFEGGLD GP  PTASNP IQ   ++A AA  A A               
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANA--NHAALSSSGAAPD 58

Query: 2225 XXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 2046
                     + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQ
Sbjct: 59   HASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQ 118

Query: 2045 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNT 1866
            DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQQ++SYN+GN 
Sbjct: 119  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNP 178

Query: 1865 NRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXXXXXX 1692
            N+ +Q R   +SHQ +KSQF Q PNA NQ    K +  ES                    
Sbjct: 179  NKLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT 237

Query: 1691 XXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1512
               NL NG  NQ NR ATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA
Sbjct: 238  QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 297

Query: 1511 FDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCE 1332
            FDS ENVILIFS+NRTRHFQGCAKMTS+IGGSV  GNWK+AHGT HYGRNFSVKWLKLCE
Sbjct: 298  FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 357

Query: 1331 LSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXX 1152
            LSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLA+LLYLEPDSELMAI +AA     
Sbjct: 358  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKRE 417

Query: 1151 XXXXKGLNTDNRSENPDIVLF 1089
                KG+N DN  +NPDIV F
Sbjct: 418  EEKAKGVNPDNGGDNPDIVPF 438



 Score =  164 bits (414), Expect = 2e-37
 Identities = 91/198 (45%), Positives = 109/198 (55%), Gaps = 13/198 (6%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXX 696
            TPDGFPMPDLFG+A RPF PYGPRFSGDF GPG GM+FPGRP Q                
Sbjct: 505  TPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG-GMMFPGRPPQPGSVFPPNGFGGMMMG 563

Query: 695  XXXXXXXXGVSP-GSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSS 519
                    G+ P  +  R  RP+G               S   KRD + S  +RNDR+S+
Sbjct: 564  PGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSA 623

Query: 518  GSDQGKGQEFAGSVGGPQE------------ADHLAAGKTFRNDESDSEDEAPRRSRHGE 375
            GSDQG+ QE  G   GP +             +     + FRNDES+SEDEAPRRSRHGE
Sbjct: 624  GSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGE 683

Query: 374  GKKKRRSVEGDSTTNASD 321
            GKKKRR  EGD+  ++ +
Sbjct: 684  GKKKRRDSEGDAAASSDN 701


>ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 692

 Score =  610 bits (1574), Expect = e-172
 Identities = 310/441 (70%), Positives = 329/441 (74%), Gaps = 2/441 (0%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXX 2226
            MD+GEGGL+FDFEGGLD GPTHPTAS PVIQ  D +A AAP A                 
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAAPSA------NINPPTVSAAV 54

Query: 2225 XXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 2046
                       RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFRLYGECREQ
Sbjct: 55   GGQSDVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQ 114

Query: 2045 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNT 1866
            DCVYKHT EDIKECNMYKLGFCPNGPDCRYRHAK+     PVEE+LQKIQ L SYNYG +
Sbjct: 115  DCVYKHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYGYS 174

Query: 1865 NRFYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXXXXXXXXX 1686
            NRF QNRN NYS Q++KSQ  Q  N  +   K    E+                      
Sbjct: 175  NRFNQNRNANYSTQSDKSQASQAQNGMSLAVKSTATETPIIQQHQPNQQVQPPQLQGGPT 234

Query: 1685 PNLL--NGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1512
               +  NGQQNQA+RTA  LPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA
Sbjct: 235  QAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 294

Query: 1511 FDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCE 1332
            FDSVENVILIFS+NRTRHFQGC KMTSRIGG+   GNWKH HGT HYGRNFSVKWLKLCE
Sbjct: 295  FDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWLKLCE 354

Query: 1331 LSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXX 1152
            LSF KT HLRNP+NENLPVKISRDCQELE SVGEQLASLLYLEPDSELMAI +AA     
Sbjct: 355  LSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQ 414

Query: 1151 XXXXKGLNTDNRSENPDIVLF 1089
                KG+N DN  +NPDIV F
Sbjct: 415  EEKAKGVNPDNGKDNPDIVPF 435



 Score =  136 bits (343), Expect = 4e-29
 Identities = 87/204 (42%), Positives = 109/204 (53%), Gaps = 19/204 (9%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGD--FAGPGPG-----MIFPGRPSQXXXXXXXXX 717
            TP+GFPMPD FGM  RPFGPYGP FS D  F G  P      M+ PGRP           
Sbjct: 510  TPEGFPMPDHFGMGPRPFGPYGPPFSSDLMFHGRPPAGGFGMMMGPGRPP---------- 559

Query: 716  XXXXXXXXXXXXXXXGVSPGSQTRPV--RPIGTXXXXXXXXXXXXXXSNRTKRDQKASAT 543
                           G+ PG+   P   R +G                 + KR+Q+A  +
Sbjct: 560  ------------FMGGMGPGATGPPRAGRAVGMHPSFVPPSSQPSQYPYKAKREQRAPVS 607

Query: 542  ERNDRFSSGSDQGKGQEFAGSVGGPQ-------EADH---LAAGKTFRNDESDSEDEAPR 393
            +RNDRFSS  DQGKGQE  GSVGGP        +++H     AG + +N+ES+SEDEAPR
Sbjct: 608  DRNDRFSS--DQGKGQEMMGSVGGPDGVHMQIGKSEHDNQFGAGNSQKNEESESEDEAPR 665

Query: 392  RSRHGEGKKKRRSVEGDSTTNASD 321
            RSRHG+GKKKRR V+ D+ T + +
Sbjct: 666  RSRHGDGKKKRRDVDEDAATGSEN 689


>gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  609 bits (1571), Expect = e-171
 Identities = 310/445 (69%), Positives = 335/445 (75%), Gaps = 6/445 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTA-SNPVIQHPDSSANAAP---GAPAITXXXXXXXXX 2238
            M+D EG LSFDFEGGLD  P+   A S P++QH  S+A +A    G PA T         
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAV 60

Query: 2237 XXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGE 2058
                           RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGE
Sbjct: 61   NVPG-----------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGE 109

Query: 2057 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYN 1878
            CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L SYN
Sbjct: 110  CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYN 169

Query: 1877 YGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXX 1704
            Y ++N+F+Q R  +Y+ QAEKSQ PQ  N++NQ    KP   ES                
Sbjct: 170  YNSSNKFFQQRGSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQ 229

Query: 1703 XXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1524
                   N+ NGQ NQA+R ATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE+K
Sbjct: 230  VSQNQIQNVANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESK 289

Query: 1523 LNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWL 1344
            LNEAFDSVENVILIFS+NRTRHFQGCAKMTSRIGGSVA GNWK+AHGT HYGRNFSVKWL
Sbjct: 290  LNEAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWL 349

Query: 1343 KLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAX 1164
            KLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPD ELMA+ +AA 
Sbjct: 350  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAE 409

Query: 1163 XXXXXXXXKGLNTDNRSENPDIVLF 1089
                    KG+N DN  ENPDIV F
Sbjct: 410  SKREEEKAKGVNPDNGGENPDIVPF 434



 Score =  139 bits (351), Expect = 5e-30
 Identities = 83/192 (43%), Positives = 97/192 (50%), Gaps = 11/192 (5%)
 Frame = -2

Query: 872  PDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXX 693
            PDGF MPDLF +  R F PYGPRFSGDF GP   M+F GRPSQ                 
Sbjct: 506  PDGFGMPDLFSVGPRAFAPYGPRFSGDFGGPPAAMMFRGRPSQPGMFPGGGFGMMMNPGR 565

Query: 692  XXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGS 513
                   GV+  +  R  RP+                +   KRDQ+   T+RNDR+ SGS
Sbjct: 566  GPFMGGMGVAGANPPRGGRPVNMPPMFPPPPPLPQNTNRLAKRDQR--TTDRNDRYGSGS 623

Query: 512  DQGKGQEFAGSVGGPQE-----------ADHLAAGKTFRNDESDSEDEAPRRSRHGEGKK 366
            +QGK Q+     G P +            D   A   FRND+S+SEDEAPRRSRHGEGKK
Sbjct: 624  EQGKSQDMLSQSGAPDDDMQYQQGYKANQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKK 683

Query: 365  KRRSVEGDSTTN 330
            KRR  E D  TN
Sbjct: 684  KRRGPE-DVNTN 694


>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  607 bits (1566), Expect = e-171
 Identities = 308/441 (69%), Positives = 329/441 (74%), Gaps = 2/441 (0%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXX 2226
            M+D EGGLSFDFEGGLD GP  PTASNP      SS+ AAP   +               
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAAA--PSSSGAAPDHASAPVPHHSG------- 51

Query: 2225 XXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 2046
                       RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL+GECREQ
Sbjct: 52   -----------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQ 100

Query: 2045 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNT 1866
            DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQQ++SYN+GN 
Sbjct: 101  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNP 160

Query: 1865 NRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXXXXXX 1692
            N+ +Q R   +SHQ +KSQF Q PNA NQ    K +  ES                    
Sbjct: 161  NKHFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT 219

Query: 1691 XXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1512
               NL NG  NQ NR ATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA
Sbjct: 220  QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 279

Query: 1511 FDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCE 1332
            FDS ENVILIFS+NRTRHFQGCAKMTS+IGGSV  GNWK+AHGT HYGRNFSVKWLKLCE
Sbjct: 280  FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCE 339

Query: 1331 LSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXX 1152
            LSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLA+LLYLEPDSELMAI +AA     
Sbjct: 340  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKRE 399

Query: 1151 XXXXKGLNTDNRSENPDIVLF 1089
                KG+N DN  +NPDIV F
Sbjct: 400  EEKAKGVNPDNGGDNPDIVPF 420



 Score =  164 bits (416), Expect = 1e-37
 Identities = 91/198 (45%), Positives = 109/198 (55%), Gaps = 13/198 (6%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXX 696
            TPDGFPMPDLFG+A RPF PYGPRFSGDF GPG GM+FPGRP Q                
Sbjct: 487  TPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG-GMMFPGRPPQPGSVFPPNGFGGMMMG 545

Query: 695  XXXXXXXXGVSP-GSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSS 519
                    G+ P  +  R  RP+G               S   KRD + S  +RNDR+S+
Sbjct: 546  PGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSA 605

Query: 518  GSDQGKGQEFAGSVGGPQE------------ADHLAAGKTFRNDESDSEDEAPRRSRHGE 375
            GSDQG+ QE  G   GP +             +     + FRNDES+SEDEAPRRSRHGE
Sbjct: 606  GSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGE 665

Query: 374  GKKKRRSVEGDSTTNASD 321
            GKKKRR  EGD+  ++ +
Sbjct: 666  GKKKRRDSEGDAAASSDN 683


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  606 bits (1563), Expect = e-170
 Identities = 311/449 (69%), Positives = 335/449 (74%), Gaps = 10/449 (2%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTA---SNPVIQHPDSSA-----NAAPGAPAITXXXXX 2250
            M+D EG LSFDFEGGLD  P+   A   S P++QH  S+A     N    APA +     
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 2249 XXXXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFR 2070
                               RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR
Sbjct: 61   GGNVPG-------------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR 107

Query: 2069 LYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQL 1890
            LYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L
Sbjct: 108  LYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHL 167

Query: 1889 NSYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXX 1716
             SYNY ++N+F+Q R  +Y+ QAEK Q PQ  N++NQ    KP   ES            
Sbjct: 168  FSYNYNSSNKFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQS 227

Query: 1715 XXXXXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRS 1536
                       N+ NGQ NQANRTATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRS
Sbjct: 228  QQQVNQSQMQ-NVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRS 286

Query: 1535 NEAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFS 1356
            NE+KLNEAFDSVENVIL+FS+NRTRHFQGCAKMTSRIGGSVA GNWK+AHGT HYGRNFS
Sbjct: 287  NESKLNEAFDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFS 346

Query: 1355 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIW 1176
            VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI 
Sbjct: 347  VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS 406

Query: 1175 IAAXXXXXXXXXKGLNTDNRSENPDIVLF 1089
            +AA         KG+N DN  ENPDIV F
Sbjct: 407  VAAESKREEEKAKGVNPDNGGENPDIVPF 435



 Score =  138 bits (347), Expect = 1e-29
 Identities = 79/183 (43%), Positives = 93/183 (50%), Gaps = 11/183 (6%)
 Frame = -2

Query: 872  PDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXX 693
            PDGF MPDLFG+  R F PYGPRFSGDF GP   M+F GRPSQ                 
Sbjct: 510  PDGFGMPDLFGVGPRGFAPYGPRFSGDFGGPPAAMMFRGRPSQPGMFPSGGFGMMMNPGR 569

Query: 692  XXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGS 513
                   GV   +  R  RP+                +   KRDQ+ +  +RNDRF SGS
Sbjct: 570  GPFMGGMGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTA--DRNDRFGSGS 627

Query: 512  DQGKGQEFAGSVGGPQE-----------ADHLAAGKTFRNDESDSEDEAPRRSRHGEGKK 366
            +QGK Q+     GGP +            D   A   FRND+S+SEDEAPRRSRHGEGKK
Sbjct: 628  EQGKSQDMLSQSGGPDDDAQYQQGYKGNQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKK 687

Query: 365  KRR 357
            K +
Sbjct: 688  KHK 690


>ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 689

 Score =  603 bits (1556), Expect = e-170
 Identities = 309/448 (68%), Positives = 330/448 (73%), Gaps = 9/448 (2%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAA-------PGAPAITXXXXXX 2247
            MD+GEGGL+FDFEGGLD GPTHPTAS PVIQ  D +A AA       P  PA+       
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAASSANINPPTVPAV------- 53

Query: 2246 XXXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL 2067
                              RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFRL
Sbjct: 54   -------GGQGDVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRL 106

Query: 2066 YGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLN 1887
            YGECREQDCVYKHT EDIKECNMYKLGFCPNGPDCRYRHAK+     PVEE+LQKIQ L 
Sbjct: 107  YGECREQDCVYKHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLA 166

Query: 1886 SYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXX 1707
            S NYG +NRF QNRN NYS Q +KSQ  Q  N ++   K    E+               
Sbjct: 167  SNNYGYSNRFNQNRNANYSTQTDKSQASQAQNGTSLAVKSTATETPIIQQHQPHQQVQPP 226

Query: 1706 XXXXXXXPNLL--NGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSN 1533
                      +  NGQQNQA+RTA  LPQGTSRYFIVKSCNRENLELSVQQGVWATQRSN
Sbjct: 227  QLQGGPTQAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSN 286

Query: 1532 EAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSV 1353
            EAKLNEAFDSVENVILIFS+NRTRHFQGC KMTSRIGG+   GNWKH HGT HYGRNFS+
Sbjct: 287  EAKLNEAFDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSL 346

Query: 1352 KWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWI 1173
            KWLKLCELSF KT HLRNP+NENLPVKISRDCQELE SVGEQLASLLYLEPDSELMAI +
Sbjct: 347  KWLKLCELSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISL 406

Query: 1172 AAXXXXXXXXXKGLNTDNRSENPDIVLF 1089
            AA         KG+N DN  +NPDIV F
Sbjct: 407  AAESKRLEEKAKGVNPDNGKDNPDIVPF 434



 Score =  130 bits (326), Expect = 4e-27
 Identities = 86/204 (42%), Positives = 105/204 (51%), Gaps = 19/204 (9%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGD--FAGPGPG-----MIFPGRPSQXXXXXXXXX 717
            TP+GFPM D FGM  RPF PYGPRFS D  F G  P      MI PGRP           
Sbjct: 507  TPEGFPMTDHFGMGPRPFPPYGPRFSSDLMFHGRPPAGGFGMMIGPGRPP---------- 556

Query: 716  XXXXXXXXXXXXXXXGVSPGSQTRPV--RPIGTXXXXXXXXXXXXXXSNRTKRDQKASAT 543
                           G+ PG+   P   R +                  R KR+Q+A  +
Sbjct: 557  ------------FVGGMGPGATGPPRAGRAVRMHPSFIPPSSQPSQYPYRAKREQRAPVS 604

Query: 542  ERNDRFSSGSDQGKGQEFAGSVGGPQ-------EADH---LAAGKTFRNDESDSEDEAPR 393
            +RNDRFSS  DQGKGQE  GSV GP        +++H     AG + +ND S+SEDEAPR
Sbjct: 605  DRNDRFSS--DQGKGQEMMGSVNGPDGVHMQIGKSEHDNQFGAGNSLKNDGSESEDEAPR 662

Query: 392  RSRHGEGKKKRRSVEGDSTTNASD 321
            RSRHG+GKKKRR V+ D+ T + +
Sbjct: 663  RSRHGDGKKKRRDVDEDAATGSEN 686


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  602 bits (1551), Expect = e-169
 Identities = 308/447 (68%), Positives = 333/447 (74%), Gaps = 8/447 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTA--SNPVIQHPDSSA-----NAAPGAPAITXXXXXX 2247
            M+D EG LSFDFEGGLD  P+   A  S P+I H  S+A     N  P APA +      
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVG 60

Query: 2246 XXXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRL 2067
                              RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL
Sbjct: 61   GGNVPG------------RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRL 108

Query: 2066 YGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLN 1887
            YGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ L 
Sbjct: 109  YGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLY 168

Query: 1886 SYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ-VSKPAVVESXXXXXXXXXXXXXX 1710
            SYNY ++N+F+Q R  +Y+ QAEK   PQ  N++NQ V+   +                 
Sbjct: 169  SYNYNSSNKFFQQRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQ 228

Query: 1709 XXXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNE 1530
                     N+ NGQ NQANRTATPLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE
Sbjct: 229  QQVNQSQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE 288

Query: 1529 AKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVK 1350
            +KLNEAFDSVENVILIFS+NRTRHFQGCAKMTS+IGGSVA GNWK+AHGT HYGRNFSVK
Sbjct: 289  SKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVK 348

Query: 1349 WLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIA 1170
            WLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI +A
Sbjct: 349  WLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVA 408

Query: 1169 AXXXXXXXXXKGLNTDNRSENPDIVLF 1089
            A         KG+N DN  ENPDIV F
Sbjct: 409  AESKREEEKAKGVNPDNGGENPDIVPF 435



 Score =  132 bits (331), Expect = 1e-27
 Identities = 75/176 (42%), Positives = 90/176 (51%), Gaps = 4/176 (2%)
 Frame = -2

Query: 872  PDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXX 693
            PDGF MPDLFG+  R F PYGPRFSGDF GP   M+F GRPSQ                 
Sbjct: 507  PDGFGMPDLFGVGPRGFAPYGPRFSGDFGGPPAAMMFRGRPSQPGMFPGGGFGMMLNPGR 566

Query: 692  XXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGS 513
                   GV   +  R  RP+                +   KRDQ+ +  +RNDRF SGS
Sbjct: 567  GPFMGGIGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTA--DRNDRFGSGS 624

Query: 512  DQGKGQEFAGSVGGPQEADHLAAG----KTFRNDESDSEDEAPRRSRHGEGKKKRR 357
            +QGK Q+     GGP +      G    +    D+S+SEDEAPRRSRHGEGKKK +
Sbjct: 625  EQGKSQDMLSQSGGPDDDPQYQQGYKGNQDDHPDDSESEDEAPRRSRHGEGKKKHK 680


>ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cicer arietinum]
          Length = 677

 Score =  600 bits (1548), Expect = e-169
 Identities = 310/450 (68%), Positives = 334/450 (74%), Gaps = 11/450 (2%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTH------PTASNPVIQHPDSS---ANAAPGAPAITXXXX 2253
            M+D EG LSFDFEGGLD  P        P   +  I HPDSS   + ++ GA  ++    
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAPVSGNIP 60

Query: 2252 XXXXXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFF 2073
                                RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMPVCRFF
Sbjct: 61   G-------------------RRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFF 101

Query: 2072 RLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQ 1893
            RLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK      P+EEVLQKIQ 
Sbjct: 102  RLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQH 161

Query: 1892 LNSYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXX 1719
            L SYN+ N+++F Q R  +Y+ Q EKSQFPQ  N++NQ    KP   ES           
Sbjct: 162  LYSYNFNNSHKFIQQRGSSYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQ 221

Query: 1718 XXXXXXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQR 1539
                        NL NGQ NQANRTATPLPQG SRYFIVKSCNRENLELSVQQGVWATQR
Sbjct: 222  SQQQVSQIQTQ-NLANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQR 280

Query: 1538 SNEAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNF 1359
            SNE+KLNEAFDSVENVILIFS+NRTRHFQGCAKMTSRIGGSVA GNWK+AHGT HYGRNF
Sbjct: 281  SNESKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNF 340

Query: 1358 SVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAI 1179
            SVKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI
Sbjct: 341  SVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAI 400

Query: 1178 WIAAXXXXXXXXXKGLNTDNRSENPDIVLF 1089
             IAA         KG+N DN  ENPDIV F
Sbjct: 401  SIAAESKREEEKAKGVNPDNAGENPDIVPF 430



 Score =  139 bits (349), Expect = 8e-30
 Identities = 79/175 (45%), Positives = 94/175 (53%), Gaps = 4/175 (2%)
 Frame = -2

Query: 872  PDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXX 693
            PDGF MPDLFGM  R FGPYGPRFSGDFAGP   M+F GRPSQ                 
Sbjct: 502  PDGFGMPDLFGMGPRGFGPYGPRFSGDFAGPPAAMMFRGRPSQPGMFPGGGFGMMMNPGR 561

Query: 692  XXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGS 513
                   GV   +  R  RP+                +   KRDQ+ +  +RNDR+SSG 
Sbjct: 562  GPFMGGMGVPGPNPPRGGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTN--DRNDRYSSGQ 619

Query: 512  DQGKGQEFAGSVGGP----QEADHLAAGKTFRNDESDSEDEAPRRSRHGEGKKKR 360
            +QGK Q+     GGP    Q     A    FRN++S+SEDEAPRRSRHGEGKK++
Sbjct: 620  EQGKSQDMLSQSGGPDDEMQYQQSGAPANNFRNEDSESEDEAPRRSRHGEGKKRK 674


>ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 677

 Score =  599 bits (1544), Expect = e-168
 Identities = 304/440 (69%), Positives = 323/440 (73%), Gaps = 1/440 (0%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDS-SANAAPGAPAITXXXXXXXXXXXX 2229
            MDDGEGGL+FDFEGGLD GPTHPTAS PV+Q     +   AP A                
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVLQSAGHITTGPAPNASVALVPPGGGVGQGGD 60

Query: 2228 XXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 2049
                        RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE
Sbjct: 61   GSFVG------NRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 114

Query: 2048 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGN 1869
            QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PV EVLQ+IQ L SY Y  
Sbjct: 115  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTSYGY-- 172

Query: 1868 TNRFYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXXXXXXXX 1689
            +NRF+QNRN NYS QA+KSQ PQ+PN  NQ  K    E                      
Sbjct: 173  SNRFFQNRNTNYSTQADKSQIPQVPNVMNQAVKSTAAEPPIGQPHQPHQQQVQQPQHQGA 232

Query: 1688 XPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1509
                     +Q N+ A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 233  PTQTQTLPSSQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 292

Query: 1508 DSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCEL 1329
            DSVENVIL+FSINRTRHFQG AKMTSRIGG+   GNWKH HGT HYGRNFS+KWLKLCEL
Sbjct: 293  DSVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLCEL 352

Query: 1328 SFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXXX 1149
            SF KTRHLRNP+NENLPVKISRDCQELE SVGEQLASLLY+EPDSELMA+ +AA      
Sbjct: 353  SFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKREE 412

Query: 1148 XXXKGLNTDNRSENPDIVLF 1089
               KG+N DN +ENPDIV F
Sbjct: 413  ERAKGVNPDNGNENPDIVPF 432



 Score =  155 bits (391), Expect = 1e-34
 Identities = 93/192 (48%), Positives = 109/192 (56%), Gaps = 11/192 (5%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGDF-------AGPGPGMIF-PGRPSQXXXXXXXX 720
            TPDGFPMPD +GM  RPFGP+GPRF GD        A  G GM+  PGRP          
Sbjct: 505  TPDGFPMPDPYGMGGRPFGPFGPRFPGDMMFHSRPPAAGGFGMMMGPGRPP--------- 555

Query: 719  XXXXXXXXXXXXXXXXGVSPGSQTRPV--RPIGTXXXXXXXXXXXXXXSNRTKRDQKASA 546
                            G+ PG+   P   RP+G                 R K+DQ+A  
Sbjct: 556  -------------FMGGMGPGAPGPPRGGRPMGIHPSFIPPTPPPSQNP-RVKKDQRAPF 601

Query: 545  TERNDRFSSGSDQGKGQEFAGSVGGPQEADHL-AAGKTFRNDESDSEDEAPRRSRHGEGK 369
             ERNDRFSSG DQG+GQE AGSVGGP E  H      +FRNDES+SEDEAPRRSRHG+GK
Sbjct: 602  NERNDRFSSGPDQGRGQEIAGSVGGPAEGVHYPQTENSFRNDESESEDEAPRRSRHGDGK 661

Query: 368  KKRRSVEGDSTT 333
            KK+ S++GD+TT
Sbjct: 662  KKKNSMDGDATT 673


>ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 671

 Score =  599 bits (1544), Expect = e-168
 Identities = 306/442 (69%), Positives = 324/442 (73%), Gaps = 3/442 (0%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHPTASNPVIQH---PDSSANAAPGAPAITXXXXXXXXXX 2235
            MDDGEGGL+FDFEGGLD GPTHPTAS PVIQ    P++S    P    +           
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVIQAGPAPNASVAVVPPGGGV----------- 49

Query: 2234 XXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGEC 2055
                          RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGEC
Sbjct: 50   ---GLGGDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGEC 106

Query: 2054 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNY 1875
            REQDCVYKHTNEDIKECNM+KLGFCPNGPDCRYRHAK+     PV EVLQKIQ L S+ Y
Sbjct: 107  REQDCVYKHTNEDIKECNMFKLGFCPNGPDCRYRHAKMPGPPPPVVEVLQKIQNLTSHGY 166

Query: 1874 GNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXXXXXX 1695
              +NRF+QNRN NYS QA+KSQ PQ+PN  NQ  K    E                    
Sbjct: 167  --SNRFFQNRNTNYSTQADKSQIPQVPNVMNQAVKSTATEPPIGQPHQPHQQQVQQPQHQ 224

Query: 1694 XXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1515
                        Q N+ A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE
Sbjct: 225  GPPTQTQTLPGTQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 284

Query: 1514 AFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLC 1335
            AFDSVENVILIFSINRTRHFQG AKMTSRIGG+   GNWKH HGT HYGRNFSVKWLKLC
Sbjct: 285  AFDSVENVILIFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLC 344

Query: 1334 ELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXX 1155
            ELSF KTRHLRNP+NENLPVKISRDCQELE SVGEQLASLLY+EPDSELMAI +AA    
Sbjct: 345  ELSFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAISLAAESKR 404

Query: 1154 XXXXXKGLNTDNRSENPDIVLF 1089
                 KG+N DN +ENPDIV F
Sbjct: 405  EEERAKGVNPDNGNENPDIVPF 426



 Score =  149 bits (375), Expect = 8e-33
 Identities = 91/192 (47%), Positives = 108/192 (56%), Gaps = 11/192 (5%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGDF-------AGPGPGMIF-PGRPSQXXXXXXXX 720
            TPDGFPMPD +GM  RPFGP+GPRF GD        A  G GM+  P RP          
Sbjct: 499  TPDGFPMPDPYGMGGRPFGPFGPRFPGDMMFHSRPPAAGGFGMMMGPARPP--------- 549

Query: 719  XXXXXXXXXXXXXXXXGVSPGSQTRPV--RPIGTXXXXXXXXXXXXXXSNRTKRDQKASA 546
                            G+ PG+   P   RP+G                 R K+DQ+A  
Sbjct: 550  -------------FMGGMGPGAPGPPRGGRPMGMHPSFTPPPPPPSQNP-RVKKDQRAPF 595

Query: 545  TERNDRFSSGSDQGKGQEFAGSVGGPQEADHLAAGK-TFRNDESDSEDEAPRRSRHGEGK 369
             ERNDRFSSG DQG+GQE AGSV GP E  H    + +FRNDES+SEDEAPRRSRHG+GK
Sbjct: 596  NERNDRFSSGPDQGRGQETAGSVVGPDEGVHYPQTENSFRNDESESEDEAPRRSRHGDGK 655

Query: 368  KKRRSVEGDSTT 333
            KK+ S++GD+TT
Sbjct: 656  KKKNSMDGDATT 667


>ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis sativus]
          Length = 707

 Score =  587 bits (1514), Expect = e-165
 Identities = 301/444 (67%), Positives = 329/444 (74%), Gaps = 5/444 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVGPTHP--TASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXX 2232
            M+D EG LSFDFEGGLD GPT+P  T+S P+I   +S ++A P A A++           
Sbjct: 1    MEDSEGVLSFDFEGGLDAGPTNPAATSSLPII---NSDSSAPPAASAVSNPLSGALGPAV 57

Query: 2231 XXXXXXXXXXXN-QRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGEC 2055
                          RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFRLYGEC
Sbjct: 58   SAEPTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGEC 117

Query: 2054 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNY 1875
            REQDCVYKHTNEDIKECNMYK GFCPNGPDCRYRHAKL     P+EE+LQKIQ L SYNY
Sbjct: 118  REQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNY 177

Query: 1874 GNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXXX 1701
            G +N+F+  R    S Q EKSQFPQ+P    Q    KP+  ES                 
Sbjct: 178  GPSNKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAESVNVQQQQGQQSAPQASQ 237

Query: 1700 XXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1521
                  +L NGQ NQ NR AT LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 238  TPVQ--SLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 295

Query: 1520 NEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLK 1341
            NEAFDS +NVILIFS+NRTRHFQGCAKM SRIGGSV+ GNWK+AHGT HYG+NFS+KWLK
Sbjct: 296  NEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKWLK 355

Query: 1340 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXX 1161
            LCELSF KTRHLRNP+NENLPVKISRDCQELE SVGEQLASLLYLEPD ELMA+ +AA  
Sbjct: 356  LCELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAAES 415

Query: 1160 XXXXXXXKGLNTDNRSENPDIVLF 1089
                   KG+N D  SENPDIV F
Sbjct: 416  KREEEKAKGVNPDIGSENPDIVPF 439



 Score =  140 bits (354), Expect = 2e-30
 Identities = 88/195 (45%), Positives = 100/195 (51%), Gaps = 15/195 (7%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGP--RFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXX 702
            TPDGFPMPD+FGM  R FGPYGP  RFSGDF GP   M+F GRPSQ              
Sbjct: 512  TPDGFPMPDIFGMTPRGFGPYGPTPRFSGDFMGPPTAMMFRGRPSQPAAMFPPSGFGMMM 571

Query: 701  XXXXXXXXXXG-VSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRT-KRDQKASATERNDR 528
                        V+  +  RP RP+G                NR  KRDQ+      NDR
Sbjct: 572  GQGRGPFMGGMGVAGANPARPGRPVGVSPLYPPPAVPSSQNMNRAIKRDQRGLT---NDR 628

Query: 527  FSSGSDQGKGQEFAGSVGGPQEADH-----------LAAGKTFRNDESDSEDEAPRRSRH 381
            +  G DQ KG E   S G  +E  +              G TFRN+ES+SEDEAPRRSRH
Sbjct: 629  YIVGMDQNKGVEIQSS-GRDEEMQYKQGSKAYSDEQYGTGTTFRNEESESEDEAPRRSRH 687

Query: 380  GEGKKKRRSVEGDST 336
            GEGKKKRR  EGD+T
Sbjct: 688  GEGKKKRRGSEGDAT 702


>gb|EPS64393.1| hypothetical protein M569_10389, partial [Genlisea aurea]
          Length = 655

 Score =  582 bits (1500), Expect = e-163
 Identities = 291/438 (66%), Positives = 323/438 (73%), Gaps = 1/438 (0%)
 Frame = -2

Query: 2399 DGEGGLSFDFEGGLDVGPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXXXXXXXX 2220
            D EGGLSFDFEGGLD GP   T S P  Q   +SA    G    +               
Sbjct: 2    DDEGGLSFDFEGGLDTGPGQITGSLPTGQ---ASAADGQGHSVSSASNIYPSTAPASAGQ 58

Query: 2219 XXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 2040
                     RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC
Sbjct: 59   ASDGAGGGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 118

Query: 2039 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNTNR 1860
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQ++QQL+S NYGN N+
Sbjct: 119  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQRVQQLSSNNYGNLNK 178

Query: 1859 FYQNRNPNYSHQAEKSQFPQMPNASNQVSKPAVVESXXXXXXXXXXXXXXXXXXXXXXPN 1680
            ++ NR   +SHQ++KSQFPQ+ N +N ++K    +S                       N
Sbjct: 179  YFPNRTTAFSHQSDKSQFPQVQNGANHLTKSGTADSASAHPQSQQAQQPLPQSSQAQIQN 238

Query: 1679 LLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 1500
                QQ QANR ATPLPQGTSRYF+VKSCNRENLELSVQQGVWATQRSNEAKLNEAF+S+
Sbjct: 239  APINQQTQANRVATPLPQGTSRYFVVKSCNRENLELSVQQGVWATQRSNEAKLNEAFESI 298

Query: 1499 ENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWLKLCELSFH 1320
            ENVILIFS+N+TRHFQGCAKM SRIGG +  GNWKHA+GT HYGRNF+VKWLKL ELSF 
Sbjct: 299  ENVILIFSVNKTRHFQGCAKMASRIGGFIGGGNWKHANGTAHYGRNFAVKWLKLSELSFD 358

Query: 1319 KTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAXXXXXXXXX 1140
            KTRHLRNP+NENLPVKISRDCQELE SVGEQLASLLYLEPDS+L A+ +AA         
Sbjct: 359  KTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSDLTAVLLAAETKREQEKA 418

Query: 1139 KGLNTDN-RSENPDIVLF 1089
            +G+  DN  +E+PDIV F
Sbjct: 419  RGVTVDNGTAEDPDIVPF 436



 Score = 63.9 bits (154), Expect = 3e-07
 Identities = 28/38 (73%), Positives = 29/38 (76%)
 Frame = -2

Query: 869 DGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPG 756
           DGFPM D FGMA R FGPY PRF GDFA P PGM+F G
Sbjct: 498 DGFPMVDPFGMAPRSFGPYAPRFPGDFAVPNPGMMFSG 535


>gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  580 bits (1494), Expect = e-162
 Identities = 298/446 (66%), Positives = 327/446 (73%), Gaps = 7/446 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVG----PTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXX 2238
            M+D EG LSFDFEGGLD      P +  A++  + HPDSSA AA    A +         
Sbjct: 1    MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSADPT 60

Query: 2237 XXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGE 2058
                          + RSFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFRLYGE
Sbjct: 61   SGGGGGASNPG---RGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGE 117

Query: 2057 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYN 1878
            CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL      VEEVLQKIQ L+SYN
Sbjct: 118  CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYN 177

Query: 1877 YGNTNRFYQNRNPN-YSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXX 1707
            Y ++N+F+Q RN   ++   EK   P  PNA +Q  V KP+++ES               
Sbjct: 178  Y-HSNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQ 236

Query: 1706 XXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEA 1527
                    N+  G  NQANRT  PLP G SRYFIVKSCNRENLELSVQQGVWATQRSNEA
Sbjct: 237  PVGQNQIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEA 296

Query: 1526 KLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKW 1347
            KLNEAFD  ENVILIFS+NRTRHFQGCAKM SRIGGS++ GNWK+AHGT HYGRNFSVKW
Sbjct: 297  KLNEAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKW 356

Query: 1346 LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAA 1167
            LKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI +AA
Sbjct: 357  LKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAA 416

Query: 1166 XXXXXXXXXKGLNTDNRSENPDIVLF 1089
                     KG++ DN  ENPDIV F
Sbjct: 417  ESKREEEKAKGVDPDNGGENPDIVPF 442



 Score =  158 bits (399), Expect = 1e-35
 Identities = 87/196 (44%), Positives = 107/196 (54%), Gaps = 13/196 (6%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQ-XXXXXXXXXXXXXXX 699
            TPDGFPMPDLF +  R F PYGPRF GDF GP  GM+F GRP+Q                
Sbjct: 513  TPDGFPMPDLFNVGPRAFNPYGPRFPGDFMGPTSGMMFRGRPTQPGAVFPGGGFGMMMGP 572

Query: 698  XXXXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSS 519
                     GV   S  RP+RP                 +   +RDQ+  A +RN+R+ +
Sbjct: 573  GRAPCMGGMGVQGTSPARPMRPGAMPPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGA 632

Query: 518  GSDQGKGQEFAGSVGGPQ------------EADHLAAGKTFRNDESDSEDEAPRRSRHGE 375
            GSDQ +GQE +G  GGP+            + D   AG +FRNDES+SEDEAPRRSRHG+
Sbjct: 633  GSDQVRGQEMSGPAGGPEDDAHYQLGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGD 692

Query: 374  GKKKRRSVEGDSTTNA 327
            GKKKRRS E D+ T +
Sbjct: 693  GKKKRRSSEEDAATGS 708


>gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica]
          Length = 695

 Score =  580 bits (1494), Expect = e-162
 Identities = 299/445 (67%), Positives = 325/445 (73%), Gaps = 6/445 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDV----GPTHPTASNPVIQHPDSSANAAPGAPAITXXXXXXXXX 2238
            M+D +G ++FDFEGGLD     GPT+P   +  +   DS   A    PA           
Sbjct: 1    MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNHPNP 60

Query: 2237 XXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGE 2058
                            RS+RQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFRLYGE
Sbjct: 61   NRSGG-----------RSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGE 109

Query: 2057 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNSYN 1878
            CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ LNSYN
Sbjct: 110  CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYN 169

Query: 1877 YGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQ--VSKPAVVESXXXXXXXXXXXXXXXX 1704
            Y  +N+FYQ RN  +  QA+K Q  Q PN+  Q  V KP+  ES                
Sbjct: 170  YNTSNKFYQQRNAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQV 229

Query: 1703 XXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1524
                   NL NG  NQANR+A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE+K
Sbjct: 230  GHTQTQ-NLPNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESK 287

Query: 1523 LNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKWL 1344
            LNEAFDS ENVILIFS+NRTRHFQGCAKM SRIGGSV+ GNWK+AHG+ HYGRNFSVKWL
Sbjct: 288  LNEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWL 347

Query: 1343 KLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAAX 1164
            KLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMA+ IAA 
Sbjct: 348  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAE 407

Query: 1163 XXXXXXXXKGLNTDNRSENPDIVLF 1089
                    KG+N +N  ENPDIV F
Sbjct: 408  SKREEEKAKGVNPENGGENPDIVPF 432



 Score =  146 bits (368), Expect = 5e-32
 Identities = 85/192 (44%), Positives = 100/192 (52%), Gaps = 12/192 (6%)
 Frame = -2

Query: 872  PDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXXXX 693
            PDGF MP+ FG+  R F PYGPRFSGDF GP PGM+F GRP Q                 
Sbjct: 505  PDGFGMPNPFGVGPRGFNPYGPRFSGDFTGPTPGMMFRGRPQQ-PGFPPGGYGMMMGPGR 563

Query: 692  XXXXXXXGVSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFSSGS 513
                   GV   +  RP RP G               +   KRD +  + +RN+R+S+GS
Sbjct: 564  APFMGGMGVGGANPGRPGRPTG--MSPMFPPPSSQNTNRMQKRDPRGPSNDRNERYSAGS 621

Query: 512  DQGKGQEFAGSVGGPQE------------ADHLAAGKTFRNDESDSEDEAPRRSRHGEGK 369
             QGKGQE  G  GGP +             D   AG   RND+S+SEDEAPRRSRHGEGK
Sbjct: 622  GQGKGQEIPGLAGGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAPRRSRHGEGK 681

Query: 368  KKRRSVEGDSTT 333
            KK R  EGD T+
Sbjct: 682  KKGRGSEGDVTS 693


>emb|CBI30994.3| unnamed protein product [Vitis vinifera]
          Length = 485

 Score =  570 bits (1469), Expect(2) = e-161
 Identities = 299/461 (64%), Positives = 319/461 (69%), Gaps = 2/461 (0%)
 Frame = -2

Query: 2141 MKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDC 1962
            MKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECNMYKLGFCPNG DC
Sbjct: 1    MKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGSDC 60

Query: 1961 RYRHAKLXXXXXPVEEVLQKIQQLNSYNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASN 1782
            RYRHAKL      +EEV QKIQQL+S+NYG++NRFYQNRNP Y+ Q EKSQ  Q  NA N
Sbjct: 61   RYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNP-YNQQTEKSQILQGSNAVN 119

Query: 1781 --QVSKPAVVESXXXXXXXXXXXXXXXXXXXXXXPNLLNGQQNQANRTATPLPQGTSRYF 1608
               V+K +  E+                       NL NG  NQAN+TA+PLPQG SRYF
Sbjct: 120  LGTVAKSSTTEAINVQQQQVQPPQQQVSQTPMQ--NLPNGLPNQANKTASPLPQGISRYF 177

Query: 1607 IVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFSINRTRHFQGCAKMTSR 1428
            IVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFS+NRTRHFQGCAKMTS+
Sbjct: 178  IVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSK 237

Query: 1427 IGGSVASGNWKHAHGTGHYGRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQEL 1248
            IGG V  GNWK+AHGT HYGRNFSVKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQEL
Sbjct: 238  IGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQEL 297

Query: 1247 EASVGEQLASLLYLEPDSELMAIWIAAXXXXXXXXXKGLNTDNRSENPDIVLFXXXXXXX 1068
            E S+GEQLASLLYLEPDSELMAI +AA         KG+N DN  ENPDIV F       
Sbjct: 298  EPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGVNPDNGGENPDIVPFEDNEEEE 357

Query: 1067 XXXXXXXXXSXXXXXXXXXXXXXXXXXXXXXXXMPLAXXXXXXXXXXXXXXXXXXXXXXX 888
                     S                       MPLA                       
Sbjct: 358  EEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLARGARPIPSMRGFPPVMMGADGFS 417

Query: 887  XXXVTPDGFPMPDLFGMASRPFGPYGPRFSGDFAGPGPGMI 765
               V PDGF MPD+FG+  R F PYGPRFSGDF GP  GMI
Sbjct: 418  YSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDFTGPASGMI 458



 Score = 29.6 bits (65), Expect(2) = e-161
 Identities = 12/25 (48%), Positives = 17/25 (68%)
 Frame = -1

Query: 657 IPNKTCSTNWYASNVPPSILTTLSK 583
           IPN + S +WYASNVP +    L++
Sbjct: 458 IPNSSWSASWYASNVPTASTPKLAE 482


>ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Fragaria vesca subsp. vesca]
          Length = 689

 Score =  571 bits (1472), Expect = e-160
 Identities = 297/446 (66%), Positives = 327/446 (73%), Gaps = 7/446 (1%)
 Frame = -2

Query: 2405 MDDGEGGLSFDFEGGLDVG----PTHP--TASNPVIQHPDSSANAAPGAPAITXXXXXXX 2244
            M+D +G L+FDFEGGLD      PTH    +S P+     +S      APA         
Sbjct: 1    MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAAPA--------- 51

Query: 2243 XXXXXXXXXXXXXXXNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLY 2064
                           + R+SFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMPVCRFFR+Y
Sbjct: 52   ------PQPDPNVNPSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMY 105

Query: 2063 GECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQQLNS 1884
            GECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ LNS
Sbjct: 106  GECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNS 165

Query: 1883 YNYGNTNRFYQNRNPNYSHQAEKSQFPQMPNASNQVS-KPAVVESXXXXXXXXXXXXXXX 1707
            YNY N+N+F Q RN  +  Q ++SQ  Q+ N+ NQV  +P+  ES               
Sbjct: 166  YNYNNSNKFSQPRNGGFPQQHDRSQPAQVTNSFNQVVVRPSAAESANVQQPQQFQQTQQP 225

Query: 1706 XXXXXXXPNLLNGQQNQANRTATPLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEA 1527
                    ++ NG  +QANR A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNE+
Sbjct: 226  VAQTQAQ-SVPNGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNES 284

Query: 1526 KLNEAFDSVENVILIFSINRTRHFQGCAKMTSRIGGSVASGNWKHAHGTGHYGRNFSVKW 1347
            KLNEAFDS ENVILIFS+NRTRHFQGCAKM SRIGGSV+ GNWK+AHGT HYGRNFSVKW
Sbjct: 285  KLNEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKW 344

Query: 1346 LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASVGEQLASLLYLEPDSELMAIWIAA 1167
            LKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAI IAA
Sbjct: 345  LKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAA 404

Query: 1166 XXXXXXXXXKGLNTDNRSENPDIVLF 1089
                     KG+N +N  ENPDIV F
Sbjct: 405  ESKREEEKAKGVNPENGGENPDIVPF 430



 Score =  133 bits (335), Expect = 3e-28
 Identities = 82/194 (42%), Positives = 98/194 (50%), Gaps = 13/194 (6%)
 Frame = -2

Query: 875  TPDGFPMPDLFGMAS-RPFGPYGPRFSGDFAGPGPGMIFPGRPSQXXXXXXXXXXXXXXX 699
            TPDGF MP+ FGM   R F PYGPRFSGDF GP PGM+F GRP Q               
Sbjct: 499  TPDGFVMPNPFGMGGPRGFNPYGPRFSGDFGGPNPGMMFRGRPPQPGGMFPPGPYGMMMG 558

Query: 698  XXXXXXXXXG-VSPGSQTRPVRPIGTXXXXXXXXXXXXXXSNRTKRDQKASATERNDRFS 522
                       V   +  R  RP G               +   KRD + S  +RN+R+S
Sbjct: 559  PGRGPFMGGMGVGGNNPARGGRP-GGMPPMFPPHPPSQNNNRLQKRDPRGSGNDRNERYS 617

Query: 521  SGSDQGKGQEFAGSVGGPQEADHL-----------AAGKTFRNDESDSEDEAPRRSRHGE 375
            +GS  GK  +     GGP + +H             AG   RND+S+SEDEAPRRSRHGE
Sbjct: 618  AGSGHGKEMQ----AGGPDDENHYQHSSKSYQEDYGAGNNGRNDDSESEDEAPRRSRHGE 673

Query: 374  GKKKRRSVEGDSTT 333
            GKKKRR  EGD+T+
Sbjct: 674  GKKKRRDSEGDATS 687


Top