BLASTX nr result

ID: Anemarrhena21_contig00012265 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00012265
         (2705 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008799098.1| PREDICTED: zinc finger CCCH domain-containin...   830   0.0  
ref|XP_010926956.1| PREDICTED: zinc finger CCCH domain-containin...   818   0.0  
ref|XP_008775232.1| PREDICTED: zinc finger CCCH domain-containin...   816   0.0  
ref|XP_010941538.1| PREDICTED: zinc finger CCCH domain-containin...   814   0.0  
ref|XP_010941539.1| PREDICTED: zinc finger CCCH domain-containin...   795   0.0  
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   747   0.0  
ref|XP_009419568.1| PREDICTED: zinc finger CCCH domain-containin...   746   0.0  
ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation spec...   744   0.0  
ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylati...   734   0.0  
ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylati...   731   0.0  
gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium r...   729   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   729   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   716   0.0  
gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sin...   714   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   713   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   712   0.0  
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   712   0.0  
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   711   0.0  
ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylati...   706   0.0  
ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation spec...   703   0.0  

>ref|XP_008799098.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like
            [Phoenix dactylifera]
          Length = 697

 Score =  830 bits (2143), Expect = 0.0
 Identities = 447/688 (64%), Positives = 483/688 (70%), Gaps = 29/688 (4%)
 Frame = -3

Query: 2493 MDD-EGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            MDD +GALSFDFEGGLD A A AP + A                    A  G  +  G  
Sbjct: 1    MDDADGALSFDFEGGLD-AGAPAPASSA-------PTSLMASDPTVAAANAGAAAGPGPS 52

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
              +  GGG  RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQD
Sbjct: 53   DLAGGGGGPGRRTFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 112

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQHLSSFNYGS N
Sbjct: 113  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLSSFNYGSSN 172

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777
            RF+QH+NTGY+QQAEKPQF  GS  ANQ   VK                           
Sbjct: 173  RFYQHRNTGYNQQAEKPQFSQGSAGANQNAAVKPPISVEPPNVQPPQSQIQQSQQQPPQP 232

Query: 1776 XXXQ---NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 1606
                   NI NGL N A RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ+SNEAK
Sbjct: 233  TTENPVQNISNGLLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQKSNEAK 292

Query: 1605 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWL 1426
            LNEAFESSENVILIFS+NRTRHFQGCAKMTSKIGG+IGGGNWKY HGTAHYGRNFSVKWL
Sbjct: 293  LNEAFESSENVILIFSINRTRHFQGCAKMTSKIGGYIGGGNWKYAHGTAHYGRNFSVKWL 352

Query: 1425 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXX 1246
            KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDG+LM MLI   
Sbjct: 353  KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGELMAMLIAAE 412

Query: 1245 XXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQ 1066
                    KGVSTDDA+DNPDIVLF               D                MWQ
Sbjct: 413  SKREEEKAKGVSTDDATDNPDIVLF-EDNEEEEEEESEEEDESSGQGAQGRGRGRGMMWQ 471

Query: 1065 THMPMVSGGRPM--LRGFPPVMMGADGFGYPEGFGTLDLFGVPPRGVFAPY-GPRFSGDF 895
             HMP+  GGRPM  +RGFPPVMMGADGFGY + F   D FG+PPR VFAP+ GPRFSGDF
Sbjct: 472  PHMPLGRGGRPMHGVRGFPPVMMGADGFGYGDCFAAPDPFGIPPR-VFAPFGGPRFSGDF 530

Query: 894  AG----AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLXX 727
            +G    +GL+FPGRPPQPGAVFP+GGLGMMMG   RAPFMGGM  + G GR  RP+G+  
Sbjct: 531  SGTGPMSGLVFPGRPPQPGAVFPMGGLGMMMGPC-RAPFMGGM-PMGGAGRPNRPMGV-- 586

Query: 726  XXXXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHEV-GAGNGVG-------- 583
                       N+R VK+DQRR     +DR++P  + G KG E+ G  NG+         
Sbjct: 587  SPFLHPPPPPPNSRAVKRDQRRPASDRSDRHDPGSDQGSKGQEMTGPSNGIDGDMAYHHG 646

Query: 582  ------DKFGTKSSLQNDESESEDEAAP 517
                  DKF    S QND+SESEDEAAP
Sbjct: 647  AKVQPEDKFVAGDSFQNDDSESEDEAAP 674


>ref|XP_010926956.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Elaeis
            guineensis]
          Length = 686

 Score =  818 bits (2114), Expect = 0.0
 Identities = 444/689 (64%), Positives = 480/689 (69%), Gaps = 30/689 (4%)
 Frame = -3

Query: 2493 MDD-EGALSFDFEGGLD-NAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGD 2320
            MDD +GALSFDFEGGLD  A A A + PA                      T      GD
Sbjct: 1    MDDADGALSFDFEGGLDAGAPAHASSAPASLMPSDPTVAAANAG-------TAAAPGPGD 53

Query: 2319 PAASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQ 2140
            P A   GGG  RR+FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQ
Sbjct: 54   PVA---GGGPGRRTFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 110

Query: 2139 DCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSG 1960
            DCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KL GPPPPVEEV QKIQHLSSFNYGS 
Sbjct: 111  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLLGPPPPVEEVLQKIQHLSSFNYGSS 170

Query: 1959 NRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXX 1780
            NRFFQH+NTGY+QQAEK QF  GS ++NQ   V+                          
Sbjct: 171  NRFFQHRNTGYNQQAEKAQFVQGSAVSNQNAAVRPPPSVEPPNVQQPQSQIQQSQQQPPQ 230

Query: 1779 XXXXQ---NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEA 1609
                    NI NGL N A RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ+SNEA
Sbjct: 231  PTTENPVQNISNGLLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQKSNEA 290

Query: 1608 KLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKW 1429
            KLNEAFESSENVILIFS+NRTRHFQGCAKMTSKIGG+IGGGNWKY HGTAHYGRNFSVKW
Sbjct: 291  KLNEAFESSENVILIFSINRTRHFQGCAKMTSKIGGYIGGGNWKYAHGTAHYGRNFSVKW 350

Query: 1428 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXX 1249
            LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM MLI  
Sbjct: 351  LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAMLIAA 410

Query: 1248 XXXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMW 1069
                     KGVSTDDA+DNPDIVLF               D                MW
Sbjct: 411  ESKREEEKAKGVSTDDATDNPDIVLF-EDNEEEEEEESEEEDESSGQGSQGRGRGRGMMW 469

Query: 1068 QTHMPMVSGGRPML--RGFPPVMMGADGFGYPEGFGTLDLFGVPPRGVFAPY-GPRFSGD 898
            Q HMP+V GGRPML  RGF PVMMGADGFGY + F   DLFG+PPR VFAP+ GPRFSGD
Sbjct: 470  QPHMPLVRGGRPMLGVRGFHPVMMGADGFGYGDCFAAPDLFGIPPR-VFAPFGGPRFSGD 528

Query: 897  FAG----AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLX 730
            F+     +GL+FPGRPPQPGAVFP+GGLGMMMG  GRAPFMGGM  + G GR+ RP+G+ 
Sbjct: 529  FSATGPMSGLVFPGRPPQPGAVFPMGGLGMMMG-PGRAPFMGGM-PMGGAGRASRPMGVS 586

Query: 729  XXXXXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHE-VGAGNGVG------- 583
                        N+R  K+DQRR     +DR+EP L+   K  E +G  NG         
Sbjct: 587  PFLHPPPPPPPPNSRPAKRDQRRPASDRSDRHEPVLDQVNKVQEMMGPSNGADGDMGYHR 646

Query: 582  -------DKFGTKSSLQNDESESEDEAAP 517
                   DKF +  + QND+SESE EAAP
Sbjct: 647  GAKVQSEDKFVSGDNFQNDDSESEGEAAP 675


>ref|XP_008775232.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like
            [Phoenix dactylifera]
          Length = 696

 Score =  816 bits (2107), Expect = 0.0
 Identities = 442/686 (64%), Positives = 479/686 (69%), Gaps = 28/686 (4%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAA-ASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPA 2314
            D EGALSFDFEGGLD  A A A + PA                             GD A
Sbjct: 3    DAEGALSFDFEGGLDTGAPAHASSAPASLMPSDPTAAAANAGAVATPVA-------GD-A 54

Query: 2313 ASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDC 2134
            AS  G    RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR+YGECREQDC
Sbjct: 55   ASTGGNIPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRIYGECREQDC 114

Query: 2133 VYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGNR 1954
            VYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPV+EVFQKIQHLS+FNYGS NR
Sbjct: 115  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVDEVFQKIQHLSAFNYGSSNR 174

Query: 1953 FFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1774
            +FQH+NT Y+QQ+E+PQ   GS +ANQ    K                            
Sbjct: 175  YFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPPIPVELSNVQQPQSQIQQSQQPPQPPA 234

Query: 1773 XXQ--NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLN 1600
              Q  +I NGL   A RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLN
Sbjct: 235  DNQVQHISNGLSKQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLN 294

Query: 1599 EAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKL 1420
            EAFESSENVILIFS+NRTRHFQGCAKMTSKIGG++GGGNWKY HGTAHYGRNFSVKWLKL
Sbjct: 295  EAFESSENVILIFSINRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRNFSVKWLKL 354

Query: 1419 CELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXX 1240
            CELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM MLI     
Sbjct: 355  CELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAMLIAAESK 414

Query: 1239 XXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTH 1060
                  KGVSTD+A+DNPDIVLF               D                MWQ H
Sbjct: 415  CEEEKAKGVSTDEAADNPDIVLF-EDNEEEEDEESEEEDESSGQSAQGRGRGRGMMWQPH 473

Query: 1059 MPMVSGGRPML--RGFPPVMMGADGFGYPEGFGTLDLFGVPPRGVFAPY-GPRFSGDFAG 889
            MP V GGRPML  RGFPPVMMGADGFGY +GF T D+FGVPPR VF PY GPRFSGDF+G
Sbjct: 474  MPPVRGGRPMLGVRGFPPVMMGADGFGYGDGFATPDIFGVPPR-VFGPYGGPRFSGDFSG 532

Query: 888  ----AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLXXXX 721
                +GL+FPGRPPQP A+FP+GGLGMMMG  GRAPFMGGM  + GVGR+ RP+G+    
Sbjct: 533  TGSMSGLVFPGRPPQPNAIFPMGGLGMMMG-PGRAPFMGGM-VMRGVGRATRPMGV--PP 588

Query: 720  XXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHEV-GAGNGVGD--------- 580
                     N R  K+DQRR     +D +EP  + G KG E+ G  +GV D         
Sbjct: 589  FLHPPPPLPNTRAAKRDQRRPASDWSDMHEPGSDQGSKGQEMTGPSHGVDDEMVSHHGAK 648

Query: 579  -----KFGTKSSLQNDESESEDEAAP 517
                 KF + +S QND SESEDEAAP
Sbjct: 649  AQTEGKFVSANSFQND-SESEDEAAP 673


>ref|XP_010941538.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like isoform
            X1 [Elaeis guineensis]
          Length = 683

 Score =  814 bits (2102), Expect = 0.0
 Identities = 445/693 (64%), Positives = 479/693 (69%), Gaps = 35/693 (5%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLD-NAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPA 2314
            D EGALSFDFEGGLD    A A + PA                      T   +  G  A
Sbjct: 3    DPEGALSFDFEGGLDAGGPAHASSAPA---------------SLMPSDPTAAAANAGAVA 47

Query: 2313 ASAAG-----GGN--QRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG 2155
               AG     GGN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG
Sbjct: 48   PPVAGDAAPSGGNIQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG 107

Query: 2154 ECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSF 1975
            ECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEVFQKIQHLS+F
Sbjct: 108  ECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVFQKIQHLSAF 167

Query: 1974 NY-GSGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXX 1798
            NY GS NR+FQH+NT Y+QQ+E+PQ   GS +ANQ    K                    
Sbjct: 168  NYYGSSNRYFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPIPVEPSNVQQPQTQIQQSQ 227

Query: 1797 XXXXXXXXXXQ-NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR 1621
                        NI N L N A RTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR
Sbjct: 228  PPPQPPPENQVQNISNALLNQATRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR 287

Query: 1620 SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNF 1441
            SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGG++GGGNWKY HGTAHYGRNF
Sbjct: 288  SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRNF 347

Query: 1440 SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEM 1261
            SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM M
Sbjct: 348  SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAM 407

Query: 1260 LIXXXXXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXX 1081
            LI           KGVSTD+A+DNPDIVLF               +              
Sbjct: 408  LIAAESKRDEEKAKGVSTDEAADNPDIVLF-EDNEEEEDEESEEEEESGGQSAQGRGRGR 466

Query: 1080 XXMWQTHMPMVSGGRPML--RGFPPVMMGADGFGYPEGFGTLDLFGVPPRGVFAPY-GPR 910
              MWQ HMP+V GGRPML  RGFPPVMMGADGFGY +GF   D+FG+PPR VF PY GPR
Sbjct: 467  GMMWQPHMPLVRGGRPMLGVRGFPPVMMGADGFGYGDGFAAPDIFGIPPR-VFGPYAGPR 525

Query: 909  FSGDFAG----AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRP 742
            F GDF+G    +GL+FPGRPPQPGA+FP+GGLGMMMG  GRAPFMGG + + GVGRS RP
Sbjct: 526  FPGDFSGTGPMSGLVFPGRPPQPGAIFPMGGLGMMMG-PGRAPFMGG-SVMGGVGRSTRP 583

Query: 741  IGLXXXXXXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHEV-GAGNGVG--- 583
            +G+             N R  K+DQRR     +DR EP  + G KG E+ G  NGV    
Sbjct: 584  MGV--PPFLHPPPPPPNTRAPKRDQRRPASDWSDRLEPGSDQGSKGQELTGPSNGVDDEM 641

Query: 582  -----------DKFGTKSSLQNDESESEDEAAP 517
                       DKF   +S QND SESEDEAAP
Sbjct: 642  GYHHGARAQTEDKFVAANSFQND-SESEDEAAP 673


>ref|XP_010941539.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like isoform
            X2 [Elaeis guineensis]
          Length = 677

 Score =  795 bits (2054), Expect = 0.0
 Identities = 439/693 (63%), Positives = 473/693 (68%), Gaps = 35/693 (5%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLD-NAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPA 2314
            D EGALSFDFEGGLD    A A + PA                      T   +  G  A
Sbjct: 3    DPEGALSFDFEGGLDAGGPAHASSAPA---------------SLMPSDPTAAAANAGAVA 47

Query: 2313 ASAAG-----GGN--QRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG 2155
               AG     GGN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG
Sbjct: 48   PPVAGDAAPSGGNIQGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG 107

Query: 2154 ECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSF 1975
            ECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEVFQKIQHLS+F
Sbjct: 108  ECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVFQKIQHLSAF 167

Query: 1974 NY-GSGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXX 1798
            NY GS NR+FQH+NT Y+QQ+E+PQ   GS +ANQ    K                    
Sbjct: 168  NYYGSSNRYFQHRNTSYNQQSERPQLSQGSAVANQNAAAKPIPVEPSNVQQPQTQIQQSQ 227

Query: 1797 XXXXXXXXXXQ-NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQR 1621
                        NI N L N A RTASPLPQGQS      SCNRENLEISVQQGVWATQR
Sbjct: 228  PPPQPPPENQVQNISNALLNQATRTASPLPQGQS------SCNRENLEISVQQGVWATQR 281

Query: 1620 SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNF 1441
            SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGG++GGGNWKY HGTAHYGRNF
Sbjct: 282  SNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGYVGGGNWKYAHGTAHYGRNF 341

Query: 1440 SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEM 1261
            SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPD +LM M
Sbjct: 342  SVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDSELMAM 401

Query: 1260 LIXXXXXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXX 1081
            LI           KGVSTD+A+DNPDIVLF               +              
Sbjct: 402  LIAAESKRDEEKAKGVSTDEAADNPDIVLF-EDNEEEEDEESEEEEESGGQSAQGRGRGR 460

Query: 1080 XXMWQTHMPMVSGGRPML--RGFPPVMMGADGFGYPEGFGTLDLFGVPPRGVFAPY-GPR 910
              MWQ HMP+V GGRPML  RGFPPVMMGADGFGY +GF   D+FG+PPR VF PY GPR
Sbjct: 461  GMMWQPHMPLVRGGRPMLGVRGFPPVMMGADGFGYGDGFAAPDIFGIPPR-VFGPYAGPR 519

Query: 909  FSGDFAG----AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRP 742
            F GDF+G    +GL+FPGRPPQPGA+FP+GGLGMMMG  GRAPFMGG + + GVGRS RP
Sbjct: 520  FPGDFSGTGPMSGLVFPGRPPQPGAIFPMGGLGMMMG-PGRAPFMGG-SVMGGVGRSTRP 577

Query: 741  IGLXXXXXXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHEV-GAGNGVG--- 583
            +G+             N R  K+DQRR     +DR EP  + G KG E+ G  NGV    
Sbjct: 578  MGV--PPFLHPPPPPPNTRAPKRDQRRPASDWSDRLEPGSDQGSKGQELTGPSNGVDDEM 635

Query: 582  -----------DKFGTKSSLQNDESESEDEAAP 517
                       DKF   +S QND SESEDEAAP
Sbjct: 636  GYHHGARAQTEDKFVAANSFQND-SESEDEAAP 667


>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  747 bits (1929), Expect = 0.0
 Identities = 403/683 (59%), Positives = 448/683 (65%), Gaps = 27/683 (3%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPAA 2311
            D EG LSFDFEGGLD A  +APT                             +   DPAA
Sbjct: 3    DSEGGLSFDFEGGLD-AGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPAA 61

Query: 2310 SAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 2131
            +  GGG  RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQDCV
Sbjct: 62   AVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCV 121

Query: 2130 YKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGNRF 1951
            YKHTN+DIKECNMYKLGFCPNG DCRYRH KLPGPPPPVEEV QKIQ LSS+NY   N+F
Sbjct: 122  YKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NKF 178

Query: 1950 FQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1771
            FQ +N+G++QQ EK Q P G    NQ    K                             
Sbjct: 179  FQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQTQIQ- 237

Query: 1770 XQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNEAF 1591
              N+PNG  N AN+TA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNEAF
Sbjct: 238  --NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 295

Query: 1590 ESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLCEL 1411
            +S+ENVILIFSVNRTRHFQGCAKMTSKIGG + GGNWKY HGTAHYGRNFSVKWLKLCEL
Sbjct: 296  DSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCEL 355

Query: 1410 SFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXXXX 1231
            SF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + +        
Sbjct: 356  SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREE 415

Query: 1230 XXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTHMPM 1051
               KGV++D+  +NPDIV F               D                MW  HMP+
Sbjct: 416  EKAKGVNSDNGGENPDIVPF-EDNEEEEEEESEEEDESFSAAAQGRGRGRGVMWPPHMPL 474

Query: 1050 VSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYGPRFSGDFAG 889
              G RPM  +RGFPP+MMG DGF Y    P+GFG  DLFG P    F PYGPRFSGDF G
Sbjct: 475  ARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFGAP--RPFPPYGPRFSGDFTG 532

Query: 888  --AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVG--RSGRPIGLXXXX 721
              +G++FPGRPPQPGA+FP GGLGMMMG  GRAPFMGGM   TG    R GRP+ +    
Sbjct: 533  PASGMMFPGRPPQPGAMFPAGGLGMMMG-PGRAPFMGGMGP-TGANPVRGGRPVSMPPMF 590

Query: 720  XXXXXXXXSNN-RIVKKDQRRLTNDRYEPALNHGGKGHEVGAGNG--------------- 589
                     N+ R VK+DQR  TNDRY  A +  G+G E+    G               
Sbjct: 591  PPPPAPSSQNSGRAVKRDQRTPTNDRY-GAGSEQGRGQEMAGPGGRLDDETQYQQEGQKA 649

Query: 588  -VGDKFGTKSSLQNDESESEDEA 523
               D+F   +S +NDESESEDEA
Sbjct: 650  HHEDQFAAGNSFRNDESESEDEA 672


>ref|XP_009419568.1| PREDICTED: zinc finger CCCH domain-containing protein 45-like [Musa
            acuminata subsp. malaccensis]
          Length = 700

 Score =  746 bits (1927), Expect = 0.0
 Identities = 410/701 (58%), Positives = 455/701 (64%), Gaps = 43/701 (6%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAAS----------APTNPAVXXXXXXXXXXXXXXXXXXXAQTG 2341
            + EG+L+FDFEGGLD AA S          AP++P                        G
Sbjct: 3    EPEGSLNFDFEGGLDVAAPSVAAVAASGPLAPSDPTAAAA-----------------SAG 45

Query: 2340 LGSFNGDPAASAAGGGNQ--RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFF 2167
              S +G     A  GGN   RRSFRQTVCRHWLR LCMKGDACGFLHQYDK RMPVCRFF
Sbjct: 46   ASSPSGTADRMAVAGGNVSGRRSFRQTVCRHWLRGLCMKGDACGFLHQYDKDRMPVCRFF 105

Query: 2166 RLYGECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQH 1987
            R YGECREQDCVYKHTN+DIKECNMYK GFCPNGPDCRYRH KLPGPPPPVEEV QKIQH
Sbjct: 106  RQYGECREQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQH 165

Query: 1986 LSSFNYGSGNRFFQHKNTG--YSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXX 1813
            L+S  YGS NRF+ H+N    Y+QQ +K Q     GL NQ T VK               
Sbjct: 166  LNSA-YGSSNRFYHHRNNNNSYNQQPDKNQLSSTPGLPNQNTGVKPVSSFEPSDVKLPQS 224

Query: 1812 XXXXXXXXXXXXXXXQ---------NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENL 1660
                                     +I N L N   RTASPLPQGQSRYFIVKSCNRENL
Sbjct: 225  LVQQSEQQQQQQQQLPIPSLENQVPSISNALSNQTVRTASPLPQGQSRYFIVKSCNRENL 284

Query: 1659 EISVQQGVWATQRSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNW 1480
            EISVQQG+WATQRSNEAKLNEAFES+ENVILIFS+N+TRHFQGC KMTS+IGGF+GGGNW
Sbjct: 285  EISVQQGMWATQRSNEAKLNEAFESTENVILIFSINKTRHFQGCGKMTSRIGGFVGGGNW 344

Query: 1479 KYVHGTAHYGRNFSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLAS 1300
            KY HGTAHYGRNFSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLAS
Sbjct: 345  KYSHGTAHYGRNFSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLAS 404

Query: 1299 LLYLEPDGQLMEMLIXXXXXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDX 1120
            LLYLEPD +LM ML+           KG   D+A+DNPDIVLF               D 
Sbjct: 405  LLYLEPDSELMAMLVAAESKRDEEKAKGGGADEATDNPDIVLFEDNEEEESEEEESEEDD 464

Query: 1119 XXXXXXXXXXXXXXXMWQTHMPMVSGGRPML--RGFPPVMMGADGFGYPEGFGTLDLFGV 946
                           MWQ HMP+V GGRPML  RGFPP+MMGADGFGY +GF T DLFG 
Sbjct: 465  ESGQAAHGRGRGRGMMWQPHMPLVRGGRPMLGVRGFPPIMMGADGFGYGDGFSTPDLFG- 523

Query: 945  PPRGVFAPY-GPRFSGDFAGAGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAI 769
             PR +F  + GPRFSGDF+ AGL+F GRPPQPGAVFP+G +GMMMG  GRAPFMGGM  +
Sbjct: 524  -PR-IFPQFGGPRFSGDFS-AGLVFSGRPPQPGAVFPMGNIGMMMG-PGRAPFMGGM-PM 578

Query: 768  TGVGRSGRPIGLXXXXXXXXXXXXSNNRIVKKDQRRLT---NDRYEPALNHGGKGHEV-G 601
             G+GR+ RP+G+             N+R  K+D RR     NDRYE   + G +   + G
Sbjct: 579  AGMGRANRPVGV-PPFLHPPPAPPLNSRAAKRDHRRPVSDRNDRYETGSDQGNRSQVMAG 637

Query: 600  AGNGVGD-------------KFGTKSSLQNDESESEDEAAP 517
            A  G  D             K+G   S QN+  +S DE AP
Sbjct: 638  AVGGADDDGAYWQGERASDHKYGPGKSFQNESEKSMDEIAP 678


>ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Nelumbo nucifera]
          Length = 715

 Score =  744 bits (1921), Expect = 0.0
 Identities = 409/707 (57%), Positives = 447/707 (63%), Gaps = 51/707 (7%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPAA 2311
            D EG LSFDFEGGLDN     PTNP                     A + + +      A
Sbjct: 3    DPEGVLSFDFEGGLDNG----PTNPT-------------PSAPLIPADSSIAAAANSAVA 45

Query: 2310 SA-----AGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR 2146
             A     AGG   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFR+YGECR
Sbjct: 46   PAVVEPVAGGHAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECR 105

Query: 2145 EQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYG 1966
            EQDCVYKHTN+DIKECNMYK GFCPNGPDCRYRH K PGPPPPVEEVFQKIQHL SFNYG
Sbjct: 106  EQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKQPGPPPPVEEVFQKIQHLGSFNYG 165

Query: 1965 SGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXX 1786
            S NRFFQ +   Y  Q+E+ QFP GS   NQ    K                        
Sbjct: 166  SSNRFFQQRIGSYVPQSERSQFPQGSSNVNQGIASKPSTAAESPNVQQQQQQSQIQQPQQ 225

Query: 1785 XXXXXXQ---NIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSN 1615
                      N  NGLPN A+RTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSN
Sbjct: 226  QQQVNQTQMQNPQNGLPNQASRTATPLPQGSSRYFIVKSCNRENLELSVQQGVWATQRSN 285

Query: 1614 EAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSV 1435
            EAKLNEAF+S ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKY HGTAHYGRNFSV
Sbjct: 286  EAKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSV 345

Query: 1434 KWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLI 1255
            KWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + +
Sbjct: 346  KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISV 405

Query: 1254 XXXXXXXXXXXKGVSTDDASDNPDIVLF--XXXXXXXXXXXXXXXDXXXXXXXXXXXXXX 1081
                       KGV+ D+ +DN DIV F                                
Sbjct: 406  AAESKREEEKAKGVNPDEGADNHDIVPFEDNEDEEEEESEEEDESFGQAINAAQGRGRGR 465

Query: 1080 XXMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPY 919
              MW  HMP+  GGRP+  +RGFPPVMMGADGF Y    P+GF   DLFG+ PR  FAPY
Sbjct: 466  GVMWPPHMPLARGGRPIPGIRGFPPVMMGADGFSYGAVTPDGFSMPDLFGIAPR-AFAPY 524

Query: 918  GPRFSGDFAG------------------AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAP 793
            GPRFSGDF G                   G++F GRP QPGAVFP  GLGMMMG  GRAP
Sbjct: 525  GPRFSGDFTGLGQSAAMGFNPIDGTGPTPGMVFHGRPSQPGAVFPPSGLGMMMG-PGRAP 583

Query: 792  FMGGMAAITGVGRSGRPIGLXXXXXXXXXXXXSNNRIVKKDQRRLT--NDRYEPALNHGG 619
            FMGGM       R+ RPIG+            S++R+V KDQRR T  NDRY  A +  G
Sbjct: 584  FMGGMGIGAAPPRASRPIGMPPFRPPAPPLPQSSSRVVNKDQRRPTDRNDRYS-AGSDQG 642

Query: 618  KGHEVGAGNG---------------VGDKFGTKSSLQNDESESEDEA 523
            KG E+    G                 D F   +S +NDESESEDEA
Sbjct: 643  KGQEMAMSGGGPEDEMKYQPGMRTQHDDSFAVGNSFRNDESESEDEA 689


>ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Gossypium raimondii] gi|763780831|gb|KJB47902.1|
            hypothetical protein B456_008G046800 [Gossypium
            raimondii]
          Length = 700

 Score =  734 bits (1895), Expect = 0.0
 Identities = 402/688 (58%), Positives = 453/688 (65%), Gaps = 31/688 (4%)
 Frame = -3

Query: 2493 MDD-EGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            MDD EG LSFDFEGGLD    + P  P                     A  G+ +   DP
Sbjct: 1    MDDAEGGLSFDFEGGLD----AGPPAPTASMPVVNSDPSAANNTNNFTAPGGVQASINDP 56

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
             A+  GGG  RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD
Sbjct: 57   VANQ-GGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 115

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ LS++NY   N
Sbjct: 116  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNY--NN 173

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVK---XXXXXXXXXXXXXXXXXXXXXXXX 1786
            +F+Q +N G+ QQ EK Q P      NQ    K                           
Sbjct: 174  KFYQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQ 233

Query: 1785 XXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 1606
                  QN+PNG  N ANRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAK
Sbjct: 234  VSQTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 293

Query: 1605 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWL 1426
            LNEAF+S+ENVIL+FSVNRTRHFQGCAKMTSKIGG + GGNWKY HGTAHYGRNFSVKWL
Sbjct: 294  LNEAFDSAENVILVFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWL 353

Query: 1425 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXX 1246
            KLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +GEQLASLLYLEPD +LM + +   
Sbjct: 354  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAE 413

Query: 1245 XXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQ 1066
                    KGV++D+A +NPDIV F               D                MW 
Sbjct: 414  SKREEEKAKGVNSDNA-ENPDIVPF-EDNEEEEEEESEEEDESFGAAAQGRGRGRGIMWP 471

Query: 1065 THMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYGPRFS 904
             HMP+  G RPM  +RGFPP+MMG DGF Y    P+GFG  DLFG P    FAPYGPRFS
Sbjct: 472  PHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFGAP--RPFAPYGPRFS 529

Query: 903  GDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGV--GRSGRPIG 736
            GDF G  +G++FPGRPPQPG +FP GG+GMMMG  GRAPFMGGM   TG    R GRP+G
Sbjct: 530  GDFTGPASGMMFPGRPPQPGGMFPSGGIGMMMG-PGRAPFMGGMGP-TGANPARGGRPVG 587

Query: 735  LXXXXXXXXXXXXSNN-RIVKKDQRRLTNDRYEPALNHGGKGHEVGA-GNGV-------- 586
            +             N+ R +K+DQR  TNDR   A +  G+G E+G  G G+        
Sbjct: 588  MPPMFPLPPAPASQNSGRAIKRDQRTPTNDR-SSAGSEQGRGQEMGGPGGGLEDGTQYQQ 646

Query: 585  -------GDKFGTKSSLQNDESESEDEA 523
                    D+F   +S +ND+SESEDEA
Sbjct: 647  EGQKAHHEDQFAAGNSFRNDDSESEDEA 674


>ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vitis vinifera]
          Length = 673

 Score =  731 bits (1887), Expect = 0.0
 Identities = 400/681 (58%), Positives = 443/681 (65%), Gaps = 25/681 (3%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPAA 2311
            D EG LSFDFEGGLD A  +A T   +                               A 
Sbjct: 3    DAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAAPSSVVS--------------AE 48

Query: 2310 SAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 2131
               GG   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQDCV
Sbjct: 49   PTPGGAPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 108

Query: 2130 YKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGNRF 1951
            YKHTN+DIKECNMYKLGFCPNG DCRYRH KLPGPPP +EEVFQKIQ LSSFNYGS NRF
Sbjct: 109  YKHTNEDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRF 168

Query: 1950 FQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1771
            +Q++N  Y+QQ EK Q   GS   N  T  K                             
Sbjct: 169  YQNRNP-YNQQTEKSQILQGSNAVNLGTVAK----SSTTEAINVQQQQVQPPQQQVSQTP 223

Query: 1770 XQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNEAF 1591
             QN+PNGLPN AN+TASPLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNEAF
Sbjct: 224  MQNLPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 283

Query: 1590 ESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLCEL 1411
            +S ENVILIFSVNRTRHFQGCAKMTSKIGGF+GGGNWKY HGTAHYGRNFSVKWLKLCEL
Sbjct: 284  DSVENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 343

Query: 1410 SFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXXXX 1231
            SF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + +        
Sbjct: 344  SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREE 403

Query: 1230 XXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTH 1060
               KGV+ D+  +NPDIV F                                   MW  H
Sbjct: 404  EKAKGVNPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPH 463

Query: 1059 MPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYGPRFSGD 898
            MP+  G RP+  +RGFPPVMMGADGF Y    P+GF   D+FGV PR  F PYGPRFSGD
Sbjct: 464  MPLARGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPR-AFPPYGPRFSGD 522

Query: 897  FAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAA-ITGVGRSGRPIGLXX 727
            F G  +G++FPGR  QPGAVFP  G GMMMG  GRAPFMGGM        R+GRP+G+  
Sbjct: 523  FTGPASGMMFPGR-GQPGAVFPASGYGMMMG-PGRAPFMGGMGVPAAAPTRAGRPVGMPP 580

Query: 726  XXXXXXXXXXSNNRIVKKDQRRLTNDRYE--PALNHGGKGHEV-----------GAGNGV 586
                       NNR  K+DQR   NDR +     +  G+G ++           G  +  
Sbjct: 581  MFPPPPPPNSQNNR-TKRDQRTPVNDRNDRYSGGSDQGRGQDMAGPDDETQYLQGLKSQQ 639

Query: 585  GDKFGTKSSLQNDESESEDEA 523
             D+FG  +S +NDESESEDEA
Sbjct: 640  DDQFGGGNSFRNDESESEDEA 660


>gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium raimondii]
          Length = 701

 Score =  729 bits (1883), Expect = 0.0
 Identities = 402/689 (58%), Positives = 453/689 (65%), Gaps = 32/689 (4%)
 Frame = -3

Query: 2493 MDD-EGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            MDD EG LSFDFEGGLD    + P  P                     A  G+ +   DP
Sbjct: 1    MDDAEGGLSFDFEGGLD----AGPPAPTASMPVVNSDPSAANNTNNFTAPGGVQASINDP 56

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
             A+  GGG  RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD
Sbjct: 57   VANQ-GGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 115

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ LS++NY   N
Sbjct: 116  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNY--NN 173

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVK---XXXXXXXXXXXXXXXXXXXXXXXX 1786
            +F+Q +N G+ QQ EK Q P      NQ    K                           
Sbjct: 174  KFYQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQ 233

Query: 1785 XXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 1606
                  QN+PNG  N ANRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAK
Sbjct: 234  VSQTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 293

Query: 1605 LNEAFESSENVILIFSVNRTRHFQ-GCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKW 1429
            LNEAF+S+ENVIL+FSVNRTRHFQ GCAKMTSKIGG + GGNWKY HGTAHYGRNFSVKW
Sbjct: 294  LNEAFDSAENVILVFSVNRTRHFQVGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKW 353

Query: 1428 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXX 1249
            LKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +GEQLASLLYLEPD +LM + +  
Sbjct: 354  LKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAA 413

Query: 1248 XXXXXXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMW 1069
                     KGV++D+A +NPDIV F               D                MW
Sbjct: 414  ESKREEEKAKGVNSDNA-ENPDIVPF-EDNEEEEEEESEEEDESFGAAAQGRGRGRGIMW 471

Query: 1068 QTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYGPRF 907
              HMP+  G RPM  +RGFPP+MMG DGF Y    P+GFG  DLFG P    FAPYGPRF
Sbjct: 472  PPHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFGAP--RPFAPYGPRF 529

Query: 906  SGDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGV--GRSGRPI 739
            SGDF G  +G++FPGRPPQPG +FP GG+GMMMG  GRAPFMGGM   TG    R GRP+
Sbjct: 530  SGDFTGPASGMMFPGRPPQPGGMFPSGGIGMMMG-PGRAPFMGGMGP-TGANPARGGRPV 587

Query: 738  GLXXXXXXXXXXXXSNN-RIVKKDQRRLTNDRYEPALNHGGKGHEVGA-GNGV------- 586
            G+             N+ R +K+DQR  TNDR   A +  G+G E+G  G G+       
Sbjct: 588  GMPPMFPLPPAPASQNSGRAIKRDQRTPTNDR-SSAGSEQGRGQEMGGPGGGLEDGTQYQ 646

Query: 585  --------GDKFGTKSSLQNDESESEDEA 523
                     D+F   +S +ND+SESEDEA
Sbjct: 647  QEGQKAHHEDQFAAGNSFRNDDSESEDEA 675


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  729 bits (1883), Expect = 0.0
 Identities = 396/685 (57%), Positives = 442/685 (64%), Gaps = 29/685 (4%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDPAA 2311
            D +G LSFDFEGGLD+   S PTNP                     +     S N   +A
Sbjct: 3    DTDGGLSFDFEGGLDS---SGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASA 59

Query: 2310 SAAGGGNQ--RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
            +AA   NQ  RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRLYGECREQD
Sbjct: 60   AAAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 119

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ L+S+NYGS N
Sbjct: 120  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSN 179

Query: 1956 RFFQHKNTGYSQQAEKPQFPH-----GSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXX 1792
            +FFQ +  G+ Q A+K QF       G G+A +P   +                      
Sbjct: 180  KFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTE-SANVQQPQQQQPQPGQGQQSQ 238

Query: 1791 XXXXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNE 1612
                    QN+PNG PN ANRTA PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE
Sbjct: 239  QQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE 298

Query: 1611 AKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVK 1432
            AKLNEAF+S+ENVILIFSVNRTRHFQGCAKMTSKIG  +GGGNWKY HGTAHYGRNFSVK
Sbjct: 299  AKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVK 358

Query: 1431 WLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIX 1252
            WLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP +G QLA LLY EPD +LM + + 
Sbjct: 359  WLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLA 418

Query: 1251 XXXXXXXXXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXXXXXX 1081
                      KGV+ ++  DNPDIV F                                 
Sbjct: 419  AEAKREEEKAKGVNPENGGDNPDIVPFEDNEEEEEEESEEEEESFGQALGAPGQGRGRGR 478

Query: 1080 XXMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPY 919
              +W  HMP+  G RP+  +RGFPP+MMGAD F Y    P+GFG  DLFGV PRG F PY
Sbjct: 479  GIIW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRG-FTPY 536

Query: 918  GPRFSGDFAGA--GLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAA-ITGVGRSG 748
             PRFSGDF GA  G++FPGRPPQPG VFP GG GMMMG  GRAPFMGGM    T   R  
Sbjct: 537  APRFSGDFTGAASGMMFPGRPPQPGGVFPNGGFGMMMG-PGRAPFMGGMGPNSTNPLRGN 595

Query: 747  RPIGLXXXXXXXXXXXXSNNRIVKKDQRRLTNDRYEPALNHG----------GKGHEVGA 598
             P G+               R VK+DQR   NDRY    + G           +  + G 
Sbjct: 596  WPGGMPFPPLPTPSP----QRPVKRDQRMTANDRYSTGSDQGRNTAGEPDDEARYQQEGL 651

Query: 597  GNGVGDKFGTKSSLQNDESESEDEA 523
                 D+FG  +S +NDESESEDEA
Sbjct: 652  KASHEDQFGAGNSFRNDESESEDEA 676


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  716 bits (1847), Expect = 0.0
 Identities = 387/683 (56%), Positives = 445/683 (65%), Gaps = 27/683 (3%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPT--NPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            D EG LSFDFEGGLD A    PT  NPA+                     +   +   D 
Sbjct: 3    DSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAA--PDH 59

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
            A++     + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD
Sbjct: 60   ASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G+ N
Sbjct: 120  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777
            + FQ +   +S Q +K QF  G    NQ    K                           
Sbjct: 180  KLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQ 238

Query: 1776 XXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 1597
                N+PNGLPN  NR A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNE
Sbjct: 239  MQ--NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 296

Query: 1596 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLC 1417
            AF+S+ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKY HGTAHYGRNFSVKWLKLC
Sbjct: 297  AFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLC 356

Query: 1416 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXX 1237
            ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLA+LLYLEPD +LM + +      
Sbjct: 357  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKR 416

Query: 1236 XXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTHM 1057
                 KGV+ D+  DNPDIV F               +                MW   M
Sbjct: 417  EEEKAKGVNPDNGGDNPDIVPF-EDNEEEEEEESEEEEESLGTASQGRGRGRGMMWPGPM 475

Query: 1056 PMVSGGRPM--LRGFPPVMMGADGFGY---PEGFGTLDLFGVPPRGVFAPYGPRFSGDFA 892
            P+  G RP+  +RGFPP+M+GADGF Y   P+GF   DLFGV PR  FAPYGPRFSGDF 
Sbjct: 476  PLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPR-PFAPYGPRFSGDFT 534

Query: 891  G-AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLXXXXXX 715
            G  G++FPGRPPQPG+VFP  G G MM   GR PFMGGM       R GRP+G+      
Sbjct: 535  GPGGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPN 594

Query: 714  XXXXXXSNNRIVKKDQRRLTNDRYE--PALNHGGKGHEVGAGNGVG-------------- 583
                  +++R+ K+D R   NDR +   A +  G+  E+G G G G              
Sbjct: 595  QPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMG-GPGRGPDDEVQYQQEGSKA 653

Query: 582  ---DKFGTKSSLQNDESESEDEA 523
               D++G++ + +NDESESEDEA
Sbjct: 654  NQEDQYGSR-NFRNDESESEDEA 675


>gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sinensis]
          Length = 701

 Score =  714 bits (1843), Expect = 0.0
 Identities = 387/683 (56%), Positives = 444/683 (65%), Gaps = 27/683 (3%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPT--NPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            D EG LSFDFEGGLD A    PT  NPA+                     +   +   D 
Sbjct: 3    DSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAAPSSSGAA--PDH 59

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
            A++     + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD
Sbjct: 60   ASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G+ N
Sbjct: 120  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777
            + FQ +   +S Q +K QF  G    NQ    K                           
Sbjct: 180  KHFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQ 238

Query: 1776 XXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 1597
                N+PNGLPN  NR A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNE
Sbjct: 239  MQ--NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 296

Query: 1596 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLC 1417
            AF+S+ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKY HGTAHYGRNFSVKWLKLC
Sbjct: 297  AFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLC 356

Query: 1416 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXX 1237
            ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLA+LLYLEPD +LM + +      
Sbjct: 357  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKR 416

Query: 1236 XXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTHM 1057
                 KGV+ D+  DNPDIV F               +                MW   M
Sbjct: 417  EEEKAKGVNPDNGGDNPDIVPF-EDNEEEEEEESEEEEESLGTASQGRGRGRGMMWPGPM 475

Query: 1056 PMVSGGRPM--LRGFPPVMMGADGFGY---PEGFGTLDLFGVPPRGVFAPYGPRFSGDFA 892
            P+  G RP+  +RGFPP+M+GADGF Y   P+GF   DLFGV PR  FAPYGPRFSGDF 
Sbjct: 476  PLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPR-PFAPYGPRFSGDFT 534

Query: 891  G-AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLXXXXXX 715
            G  G++FPGRPPQPG+VFP  G G MM   GR PFMGGM       R GRP+G+      
Sbjct: 535  GPGGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPN 594

Query: 714  XXXXXXSNNRIVKKDQRRLTNDRYE--PALNHGGKGHEVGAGNGVG-------------- 583
                  +++R  K+D R   NDR +   A +  G+  E+G G G G              
Sbjct: 595  QPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMG-GPGRGPDDEVQYQQEGSKA 653

Query: 582  ---DKFGTKSSLQNDESESEDEA 523
               D++G++ + +NDESESEDEA
Sbjct: 654  NQEDQYGSR-NFRNDESESEDEA 675


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  713 bits (1841), Expect = 0.0
 Identities = 395/695 (56%), Positives = 440/695 (63%), Gaps = 39/695 (5%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLD----NAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNG 2323
            D EG LSFDFEGGLD    +AAA+ P+ P V                        G    
Sbjct: 3    DSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNG----------GHAAP 52

Query: 2322 DPAASAAGGGNQ--RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGEC 2149
             P+ +   GGN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGEC
Sbjct: 53   APSTADPAGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGEC 112

Query: 2148 REQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNY 1969
            REQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQHL S+NY
Sbjct: 113  REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNY 172

Query: 1968 GSGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXX 1789
             S N+FFQ +   Y+QQAEKPQ P G+   NQ    K                       
Sbjct: 173  NSSNKFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVN 232

Query: 1788 XXXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEA 1609
                    N+ NG PN ANRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+
Sbjct: 233  QSQMQ---NVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNES 289

Query: 1608 KLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKW 1429
            KLNEAF+S ENVIL+FSVNRTRHFQGCAKMTS+IGG + GGNWKY HGTAHYGRNFSVKW
Sbjct: 290  KLNEAFDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKW 349

Query: 1428 LKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXX 1249
            LKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + +  
Sbjct: 350  LKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAA 409

Query: 1248 XXXXXXXXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXXXXXXX 1078
                     KGV+ D+  +NPDIV F                                  
Sbjct: 410  ESKREEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRG 469

Query: 1077 XMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY-------PEGFGTLDLFGVPPRGVFA 925
             MW  HMP+  G RPM  ++GF PVMMG DG  Y       P+GFG  DLFGV PRG FA
Sbjct: 470  MMWPPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMPDLFGVGPRG-FA 527

Query: 924  PYGPRFSGDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVG-- 757
            PYGPRFSGDF G  A ++F GRP QPG +FP GG GMMM N GR PFMGGM    GVG  
Sbjct: 528  PYGPRFSGDFGGPPAAMMFRGRPSQPG-MFPSGGFGMMM-NPGRGPFMGGM----GVGGA 581

Query: 756  ---RSGRPIGLXXXXXXXXXXXXSNNRIVKKDQRRL-TNDRYEPALNHGGKGHEVGAGNG 589
               R GRP+ +            + NR  K+DQR    NDR+      G     +    G
Sbjct: 582  NPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDMLSQSGG 641

Query: 588  VGD----KFGTK---------SSLQNDESESEDEA 523
              D    + G K         ++ +ND+SESEDEA
Sbjct: 642  PDDDAQYQQGYKGNQDDHPAVNNFRNDDSESEDEA 676


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  712 bits (1839), Expect = 0.0
 Identities = 398/690 (57%), Positives = 440/690 (63%), Gaps = 34/690 (4%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNA---AASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGD 2320
            D EG LSFDFEGGLD A   AA+AP+ P +                           NG 
Sbjct: 3    DSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVS---------------NGG 47

Query: 2319 PAASA------AGGGNQ--RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR 2164
            PAA A       GGGN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR
Sbjct: 48   PAAPAPSAVDPVGGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFR 107

Query: 2163 LYGECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHL 1984
            LYGECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQHL
Sbjct: 108  LYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHL 167

Query: 1983 SSFNYGSGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXX 1804
             S+NY S N+FFQ +   Y+QQAEKP  P G+   NQ                       
Sbjct: 168  YSYNYNSSNKFFQQRGASYNQQAEKPLLPQGNNSTNQGVT---GNPLPAELGNAQPQQQV 224

Query: 1803 XXXXXXXXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQ 1624
                        QN+ NG PN ANRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQ
Sbjct: 225  QQSQQQVNQSQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQ 284

Query: 1623 RSNEAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRN 1444
            RSNE+KLNEAF+S ENVILIFSVNRTRHFQGCAKMTSKIGG + GGNWKY HGTAHYGRN
Sbjct: 285  RSNESKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRN 344

Query: 1443 FSVKWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLME 1264
            FSVKWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM 
Sbjct: 345  FSVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMA 404

Query: 1263 MLIXXXXXXXXXXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXX 1093
            + +           KGV+ D+  +NPDIV F                             
Sbjct: 405  ISVAAESKREEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGR 464

Query: 1092 XXXXXXMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGV 931
                  MW  HMP+  G RPM  ++GF PVMMG DG  Y    P+GFG  DLFGV PRG 
Sbjct: 465  GRGRGMMWPPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMPDLFGVGPRG- 522

Query: 930  FAPYGPRFSGDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVG 757
            FAPYGPRFSGDF G  A ++F GRP QPG +FP GG GMM+ N GR PFMGG+    GVG
Sbjct: 523  FAPYGPRFSGDFGGPPAAMMFRGRPSQPG-MFPGGGFGMML-NPGRGPFMGGI----GVG 576

Query: 756  -----RSGRPIGLXXXXXXXXXXXXSNNRIVKKDQRRL-TNDRYEPALNHGGKGHEVGAG 595
                 R GRP+ +            + NR  K+DQR    NDR+      G     +   
Sbjct: 577  GANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDMLSQS 636

Query: 594  NGVGD----KFGTKSSLQN--DESESEDEA 523
             G  D    + G K +  +  D+SESEDEA
Sbjct: 637  GGPDDDPQYQQGYKGNQDDHPDDSESEDEA 666


>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  712 bits (1837), Expect = 0.0
 Identities = 387/683 (56%), Positives = 441/683 (64%), Gaps = 27/683 (3%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPT--NPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            D EG LSFDFEGGLD A    PT  NPA                              D 
Sbjct: 3    DSEGGLSFDFEGGLD-AGPGMPTASNPAAAPSSSGAAP--------------------DH 41

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
            A++     + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPVCRFFRL+GECREQD
Sbjct: 42   ASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 101

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G+ N
Sbjct: 102  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 161

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777
            + FQ +   +S Q +K QF  G    NQ    K                           
Sbjct: 162  KHFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQ 220

Query: 1776 XXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 1597
                N+PNGLPN  NR A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAKLNE
Sbjct: 221  MQ--NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 278

Query: 1596 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLC 1417
            AF+S+ENVILIFSVNRTRHFQGCAKMTSKIGG +GGGNWKY HGTAHYGRNFSVKWLKLC
Sbjct: 279  AFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLC 338

Query: 1416 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXX 1237
            ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLA+LLYLEPD +LM + +      
Sbjct: 339  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKR 398

Query: 1236 XXXXXKGVSTDDASDNPDIVLFXXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQTHM 1057
                 KGV+ D+  DNPDIV F               +                MW   M
Sbjct: 399  EEEKAKGVNPDNGGDNPDIVPF-EDNEEEEEEESEEEEESLGTASQGRGRGRGMMWPGPM 457

Query: 1056 PMVSGGRPM--LRGFPPVMMGADGFGY---PEGFGTLDLFGVPPRGVFAPYGPRFSGDFA 892
            P+  G RP+  +RGFPP+M+GADGF Y   P+GF   DLFGV PR  FAPYGPRFSGDF 
Sbjct: 458  PLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPR-PFAPYGPRFSGDFT 516

Query: 891  G-AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVGRSGRPIGLXXXXXX 715
            G  G++FPGRPPQPG+VFP  G G MM   GR PFMGGM       R GRP+G+      
Sbjct: 517  GPGGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPN 576

Query: 714  XXXXXXSNNRIVKKDQRRLTNDRYE--PALNHGGKGHEVGAGNGVG-------------- 583
                  +++R  K+D R   NDR +   A +  G+  E+G G G G              
Sbjct: 577  QPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMG-GPGRGPDDEVQYQQEGSKA 635

Query: 582  ---DKFGTKSSLQNDESESEDEA 523
               D++G++ + +NDESESEDEA
Sbjct: 636  NQEDQYGSR-NFRNDESESEDEA 657


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  711 bits (1835), Expect = 0.0
 Identities = 393/685 (57%), Positives = 442/685 (64%), Gaps = 29/685 (4%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNA--AASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNGDP 2317
            D EG LSFDFEGGLD A  AA+AP+ P V                     +G      +P
Sbjct: 3    DSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGT-----EP 57

Query: 2316 AASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 2137
            AA    G   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD
Sbjct: 58   AAVNVPG---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQD 114

Query: 2136 CVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYGSGN 1957
            CVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQHL S+NY S N
Sbjct: 115  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSN 174

Query: 1956 RFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXXXXX 1777
            +FFQ + + Y+QQAEK Q P G+   NQ    K                           
Sbjct: 175  KFFQQRGSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQ 234

Query: 1776 XXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAKLNE 1597
                N+ NG PN A+R A+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNE+KLNE
Sbjct: 235  IQ--NVANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNE 292

Query: 1596 AFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWLKLC 1417
            AF+S ENVILIFSVNRTRHFQGCAKMTS+IGG + GGNWKY HGTAHYGRNFSVKWLKLC
Sbjct: 293  AFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLC 352

Query: 1416 ELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXXXXX 1237
            ELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPDG+LM + +      
Sbjct: 353  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKR 412

Query: 1236 XXXXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXXXXXXXXMWQ 1066
                 KGV+ D+  +NPDIV F                                   MW 
Sbjct: 413  EEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWP 472

Query: 1065 THMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYGPRFS 904
             HMP+  G RPM  ++GF PVMMG DG  Y    P+GFG  DLF V PR  FAPYGPRFS
Sbjct: 473  PHMPLPRGARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFSVGPR-AFAPYGPRFS 530

Query: 903  GDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGM-AAITGVGRSGRPIGL 733
            GDF G  A ++F GRP QPG +FP GG GMMM N GR PFMGGM  A     R GRP+ +
Sbjct: 531  GDFGGPPAAMMFRGRPSQPG-MFPGGGFGMMM-NPGRGPFMGGMGVAGANPPRGGRPVNM 588

Query: 732  XXXXXXXXXXXXSNNRIVKKDQRRL-TNDRYEPALNHGGKGHEVGAGNGVGD-----KFG 571
                        + NR+ K+DQR    NDRY    +  GK  ++ + +G  D     + G
Sbjct: 589  PPMFPPPPPLPQNTNRLAKRDQRTTDRNDRYGSG-SEQGKSQDMLSQSGAPDDDMQYQQG 647

Query: 570  TK---------SSLQNDESESEDEA 523
             K         ++ +ND+SESEDEA
Sbjct: 648  YKANQDDHPAVNNFRNDDSESEDEA 672


>ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Cicer arietinum]
          Length = 677

 Score =  706 bits (1822), Expect = 0.0
 Identities = 388/684 (56%), Positives = 437/684 (63%), Gaps = 28/684 (4%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASA--------PTNPAVXXXXXXXXXXXXXXXXXXXAQTGLG 2335
            D EG LSFDFEGGLD A  SA        P+ P V                         
Sbjct: 3    DSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPP----------------- 45

Query: 2334 SFNGDPAASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYG 2155
            S + + AA+ +G    RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDKARMPVCRFFRLYG
Sbjct: 46   SISSNGAAAVSGNIPGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYG 105

Query: 2154 ECREQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSF 1975
            ECREQDCVYKHTN+DIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQHL S+
Sbjct: 106  ECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSY 165

Query: 1974 NYGSGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXX 1795
            N+ + ++F Q + + Y+QQ EK QFP G   ANQ    K                     
Sbjct: 166  NFNNSHKFIQQRGSSYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQ 225

Query: 1794 XXXXXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSN 1615
                      N+ NG PN ANRTA+PLPQG SRYFIVKSCNRENLE+SVQQGVWATQRSN
Sbjct: 226  VSQIQTQ---NLANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSN 282

Query: 1614 EAKLNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSV 1435
            E+KLNEAF+S ENVILIFSVNRTRHFQGCAKMTS+IGG + GGNWKY HGTAHYGRNFSV
Sbjct: 283  ESKLNEAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSV 342

Query: 1434 KWLKLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLI 1255
            KWLKLCELSF+KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPD +LM + I
Sbjct: 343  KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISI 402

Query: 1254 XXXXXXXXXXXKGVSTDDASDNPDIVLF---XXXXXXXXXXXXXXXDXXXXXXXXXXXXX 1084
                       KGV+ D+A +NPDIV F                                
Sbjct: 403  AAESKREEEKAKGVNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRG 462

Query: 1083 XXXMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAP 922
               MW  HMP+  G RPM  ++GF PVMMG DG  Y    P+GFG  DLFG+ PRG F P
Sbjct: 463  RGMMWPPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPGAPDGFGMPDLFGMGPRG-FGP 520

Query: 921  YGPRFSGDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGVG--R 754
            YGPRFSGDFAG  A ++F GRP QPG +FP GG GMMM N GR PFMGGM  + G    R
Sbjct: 521  YGPRFSGDFAGPPAAMMFRGRPSQPG-MFPGGGFGMMM-NPGRGPFMGGM-GVPGPNPPR 577

Query: 753  SGRPIGLXXXXXXXXXXXXSNNRIVKKDQR-RLTNDRYEPALNHGGKGHEVGAGNGVGDK 577
             GRP+ +            + NRI K+DQR    NDRY      G     +    G  D+
Sbjct: 578  GGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTNDRNDRYSSGQEQGKSQDMLSQSGGPDDE 637

Query: 576  FGTKSS------LQNDESESEDEA 523
               + S       +N++SESEDEA
Sbjct: 638  MQYQQSGAPANNFRNEDSESEDEA 661


>ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis melo]
          Length = 708

 Score =  703 bits (1814), Expect = 0.0
 Identities = 380/691 (54%), Positives = 433/691 (62%), Gaps = 35/691 (5%)
 Frame = -3

Query: 2490 DDEGALSFDFEGGLDNAAASAPTNPAVXXXXXXXXXXXXXXXXXXXAQTGLGSFNG---- 2323
            D EG LSFDFEGGLD    + PTNPA                        L    G    
Sbjct: 3    DSEGVLSFDFEGGLD----AGPTNPAATSSLPLINSDSSAPPAASAVSNSLSGALGPAVS 58

Query: 2322 -DPAASAAGGGNQRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECR 2146
             +P  +  G    RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFRLYGECR
Sbjct: 59   AEPPGAPPGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECR 118

Query: 2145 EQDCVYKHTNDDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQHLSSFNYG 1966
            EQDCVYKHTN+DIKECNMYK GFCPNGPDCRYRH KLPGPPPPVEE+ QKIQHL S+NYG
Sbjct: 119  EQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGSYNYG 178

Query: 1965 SGNRFFQHKNTGYSQQAEKPQFPHGSGLANQPTEVKXXXXXXXXXXXXXXXXXXXXXXXX 1786
              N+FF  +  G SQQ EK QFP    +  Q    K                        
Sbjct: 179  PSNKFFTQRGVGLSQQNEKSQFPQVPAITTQGVTGK----PSAAESANVQQQQGQQSAPQ 234

Query: 1785 XXXXXXQNIPNGLPNPANRTASPLPQGQSRYFIVKSCNRENLEISVQQGVWATQRSNEAK 1606
                  QN+ NG PN  NR A+ LPQG SRYFIVKSCNRENLE+SVQQGVWATQRSNEAK
Sbjct: 235  ASQTPVQNLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 294

Query: 1605 LNEAFESSENVILIFSVNRTRHFQGCAKMTSKIGGFIGGGNWKYVHGTAHYGRNFSVKWL 1426
            LNEAF++++NVILIFSVNRTRHFQGCAKM S+IGG + GGNWKY HGTAHYG+NFS+KWL
Sbjct: 295  LNEAFDTADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWL 354

Query: 1425 KLCELSFNKTHHLRNPYNDNLPVKISRDCQELEPFIGEQLASLLYLEPDGQLMEMLIXXX 1246
            KLCELSF KT HLRNPYN+NLPVKISRDCQELEP IGEQLASLLYLEPDG+LM + I   
Sbjct: 355  KLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAE 414

Query: 1245 XXXXXXXXKGVSTDDASDNPDIVLF----XXXXXXXXXXXXXXXDXXXXXXXXXXXXXXX 1078
                    KGV+ D  S+NPDIV F                                   
Sbjct: 415  SKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPPQGRGRGRG 474

Query: 1077 XMWQTHMPMVSGGRPM--LRGFPPVMMGADGFGY----PEGFGTLDLFGVPPRGVFAPYG 916
             MW   MP+  G RP   ++GFPP MMG DG  Y    P+GF   D+FG+ PRG F PYG
Sbjct: 475  MMWPPQMPIGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRG-FGPYG 533

Query: 915  --PRFSGDFAG--AGLIFPGRPPQPGAVFPIGGLGMMMGNAGRAPFMGGMAAITGV--GR 754
              PRFS DF G    ++F GRP QPGA+FP GG GMMMG     PFMGGM  +TG    R
Sbjct: 534  PTPRFSSDFMGPPTAMMFRGRPSQPGAMFPPGGFGMMMGQGRGGPFMGGM-GVTGANPAR 592

Query: 753  SGRPIGLXXXXXXXXXXXXSN-NRIVKKDQRRLTNDRYEPALNHGGKGHEV--------- 604
             GRP+G+             N NR +K+DQR LTND+Y   ++   KG E+         
Sbjct: 593  PGRPVGVSPLYPPPAVPSSQNMNRAIKRDQRGLTNDKYIVGIDQ-NKGLEIQSSGRDDEM 651

Query: 603  ----GAGNGVGDKFGTKSSLQNDESESEDEA 523
                G+     +++GT ++ +N+ESESEDEA
Sbjct: 652  QYKQGSKAYSDEQYGTGTTFRNEESESEDEA 682

