BLASTX nr result

ID: Akebia24_contig00012893 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00012893
         (2417 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   825   0.0  
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   823   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   794   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   788   0.0  
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   786   0.0  
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   783   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   778   0.0  
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   775   0.0  
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   775   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   774   0.0  
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   744   0.0  
ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun...   729   0.0  
ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr...   727   0.0  
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   722   0.0  
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   715   0.0  
gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus...   711   0.0  
ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec...   709   0.0  
ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation spec...   704   0.0  
gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malu...   702   0.0  
ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec...   698   0.0  

>ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Vitis vinifera]
          Length = 673

 Score =  825 bits (2131), Expect = 0.0
 Identities = 441/714 (61%), Positives = 482/714 (67%), Gaps = 21/714 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            MED EGVLSFDFEGGLD AP   +   PL+ +D + AA+        P   V ++P PG 
Sbjct: 1    MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAA-------PSSVVSAEPTPGG 53

Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928
             PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVYKHTN
Sbjct: 54   APGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTN 113

Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748
            EDIKECNMYKLGFCPNG DCRYRH KLPGPPP +EEVFQKIQ LSSFNYG  NRF+QNRN
Sbjct: 114  EDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRN 173

Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568
              Y QQ E+ Q  QGSN +N G  AK STT E+ N+                       N
Sbjct: 174  P-YNQQTEKSQILQGSNAVNLGTVAKSSTT-EAINVQQQQVQPPQQQVSQTPMQ-----N 226

Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388
            +P+G+PNQ NKTA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ 
Sbjct: 227  LPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 286

Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208
            ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKY+HG+AHYGRNFSVKWLKLCELSFH
Sbjct: 287  ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 346

Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028
            KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+           
Sbjct: 347  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKA 406

Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPPHMTL 851
            KGVNPD+G ENPDIVPF                S+GQALG A QGRGRGRG++WPPHM L
Sbjct: 407  KGVNPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPL 466

Query: 850  XXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTG 671
                          PVMMGADGF+Y  + PDG  M P++F + PR FP YGPRF  DFT 
Sbjct: 467  ARGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAM-PDIFGVGPRAFPPYGPRFSGDFT- 524

Query: 670  LGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXXXXMV 491
                           G  SGMMF GR  QPG +FP ASG GM+                 
Sbjct: 525  ---------------GPASGMMFPGR-GQPGAVFP-ASGYGMMMGPGRAPFMGGMGVPAA 567

Query: 490  A--------------TPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMAS 353
            A               PP   N                    NDRYS GSD G+GQ+MA 
Sbjct: 568  APTRAGRPVGMPPMFPPPPPPN---SQNNRTKRDQRTPVNDRNDRYSGGSDQGRGQDMAG 624

Query: 352  PGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209
            P D T Y +      GLK+Q        N +RNDESESEDEAPRRSRHGEGKK+
Sbjct: 625  PDDETQYLQ------GLKSQQDDQFGGGNSFRNDESESEDEAPRRSRHGEGKKK 672


>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  823 bits (2125), Expect = 0.0
 Identities = 441/715 (61%), Positives = 488/715 (68%), Gaps = 22/715 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAA---SGNPALVPGPGQSVISDP- 2120
            M+D EG LSFDFEGGLD  P+ P+A++P+V +DPS AA   S N + VPG   +  +DP 
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 2119 --VPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946
              V G   GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQDC
Sbjct: 61   AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766
            VYKHTNEDIKECNMYKLGFCPNG DCRYRH KLPGPPPPVEEV QKIQ LSS+NY   N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177

Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586
            FFQ RN+G+AQQ E+ Q PQG N +NQG   KPSTT ES NM                  
Sbjct: 178  FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTT-ESANMHPQQQVQQPQQQVSQTQI 236

Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406
                 N+P+G  NQ NKTA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 237  Q----NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 292

Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226
            EAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKY+HG+AHYGRNFSVKWLKL
Sbjct: 293  EAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 352

Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046
            CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV     
Sbjct: 353  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELK 412

Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWP 866
                  KGVN D+G ENPDIVPF                S+     A QGRGRGRG++WP
Sbjct: 413  REEEKAKGVNSDNGGENPDIVPFEDNEEEEEEESEEEDESFS---AAAQGRGRGRGVMWP 469

Query: 865  PHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFP 686
            PHM L              P+MMG DGF+YG +TPDG  +P +LF  APR FP YGPRF 
Sbjct: 470  PHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVP-DLFG-APRPFPPYGPRFS 527

Query: 685  PDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXX 506
             DFTG                  SGMMF GRP QPG +FPA  GLGM+            
Sbjct: 528  GDFTGPA----------------SGMMFPGRPPQPGAMFPAG-GLGMMMGPGRAPFMGGM 570

Query: 505  XXXM---------VATPPVHANXXXXXXXXXXXXXXXXXXP-TNDRYSYGSDPGKGQEMA 356
                         V+ PP+                       TNDRY  GS+ G+GQEMA
Sbjct: 571  GPTGANPVRGGRPVSMPPMFPPPPAPSSQNSGRAVKRDQRTPTNDRYGAGSEQGRGQEMA 630

Query: 355  SPGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209
             PG R   DET+YQ  G KA  +      N +RNDESESEDEAPRRSR+GEGKK+
Sbjct: 631  GPGGRLD-DETQYQQEGQKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKK 684


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  794 bits (2051), Expect = 0.0
 Identities = 430/716 (60%), Positives = 481/716 (67%), Gaps = 23/716 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASG-----NPALVPGPGQSV--I 2129
            MED EG LSFDFEGGLD  P  P+A+ P + +D + AA+      N A +   G +    
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949
            S PVP ++ GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQD
Sbjct: 61   SAPVP-HHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119

Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769
            CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN
Sbjct: 120  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179

Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589
            + FQ R A ++ QI++ QF QG N +NQG A K S+TAES N+                 
Sbjct: 180  KLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTT 237

Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409
                  N+P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 238  QMQ---NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229
            NEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049
            LCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLA+LLYLEPDSELMAISV    
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEA 414

Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLW 869
                   KGVNPD+G +NPDIVPF                S G    A+QGRGRGRGM+W
Sbjct: 415  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGT---ASQGRGRGRGMMW 471

Query: 868  PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689
            P  M L              P+M+GADGF+YG +TPDG PM P+LF +APR F  YGPRF
Sbjct: 472  PGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRF 529

Query: 688  PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXX 533
              DFTG G                 GMMF GRP QPG +FP     GM+           
Sbjct: 530  SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 572

Query: 532  XXXXXXXXXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362
                         V  PP   N                       NDRYS GSD G+ QE
Sbjct: 573  MGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQE 632

Query: 361  MASPGDRTTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209
            M  PG R   DE +YQ  G KA  ++ Y     RNDESESEDEAPRRSRHGEGKK+
Sbjct: 633  MGGPG-RGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 687


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  788 bits (2035), Expect = 0.0
 Identities = 424/719 (58%), Positives = 483/719 (67%), Gaps = 26/719 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTA-PSNPSAAVPLVPTD--PSIAASGNPALVPGPGQSVISDPV 2117
            M+D +G LSFDFEGGLD++ P+NP+A++P +P+D   ++AA+ N ++VP    +   DP 
Sbjct: 1    MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSN---DPA 57

Query: 2116 PG------NYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECRE 1955
                    N  GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECRE
Sbjct: 58   SAAAAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECRE 117

Query: 1954 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGF 1775
            QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ L+S+NYG 
Sbjct: 118  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGS 177

Query: 1774 PNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXX 1595
             N+FFQ R AG+ Q  ++ QF QG N + QG+AAKP  T ES N+               
Sbjct: 178  SNKFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGT-ESANVQQPQQQQPQPGQGQQ 236

Query: 1594 XXXXXXXQ---NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424
                       N+P+G PNQ N+TA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRS
Sbjct: 237  SQQQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRS 296

Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244
            NEAKLNEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIG  VGGGNWKY+HG+AHYGRNFS
Sbjct: 297  NEAKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFS 356

Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064
            VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+G QLA LLY EPDSELMAIS
Sbjct: 357  VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAIS 416

Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGAT-QGRGR 887
            +           KGVNP++G +NPDIVPF                S+GQALGA  QGRGR
Sbjct: 417  LAAEAKREEEKAKGVNPENGGDNPDIVPFEDNEEEEEEESEEEEESFGQALGAPGQGRGR 476

Query: 886  GRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFP 707
            GRG++W PHM L              P+MMGAD F+YG +TPDG  M P+LF +APR F 
Sbjct: 477  GRGIIW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGM-PDLFGVAPRGFT 534

Query: 706  SYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXX 527
             Y PRF  DFT                G  SGMMF GRP QPG +FP   G GM+     
Sbjct: 535  PYAPRFSGDFT----------------GAASGMMFPGRPPQPGGVFP-NGGFGMMMGPGR 577

Query: 526  XXXXXXXXXXMVATPPVHAN-------XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKG 368
                        +T P+  N                           NDRYS GSD  +G
Sbjct: 578  APFMGGMGPN--STNPLRGNWPGGMPFPPLPTPSPQRPVKRDQRMTANDRYSTGSD--QG 633

Query: 367  QEMASPGDRTTYDETKYQPGGLKAQSK------NIYRNDESESEDEAPRRSRHGEGKKR 209
            +  A   D    DE +YQ  GLKA  +      N +RNDESESEDEAPRRSRHGEGKK+
Sbjct: 634  RNTAGEPD----DEARYQQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKK 688


>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  786 bits (2029), Expect = 0.0
 Identities = 426/709 (60%), Positives = 472/709 (66%), Gaps = 16/709 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            MED EG LSFDFEGGLD  P  P+A+ P      S AA  +            S PVP +
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAAAPSSSGAAPDHA-----------SAPVP-H 48

Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928
            + GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQDCVYKHTN
Sbjct: 49   HSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTN 108

Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748
            EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN+ FQ R 
Sbjct: 109  EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRG 168

Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568
            A ++ Q ++ QF QG N +NQG A K S+TAES N+                       N
Sbjct: 169  A-FSHQTDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTTQMQ---N 223

Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388
            +P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+ 
Sbjct: 224  LPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA 283

Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208
            ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLKLCELSFH
Sbjct: 284  ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 343

Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028
            KTRHLRNP+NENLPVKISRDCQELE SIGEQLA+LLYLEPDSELMAISV           
Sbjct: 344  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKA 403

Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWPPHMTLX 848
            KGVNPD+G +NPDIVPF                S G    A+QGRGRGRGM+WP  M L 
Sbjct: 404  KGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGT---ASQGRGRGRGMMWPGPMPLA 460

Query: 847  XXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTGL 668
                         P+M+GADGF+YG +TPDG PM P+LF +APR F  YGPRF  DFTG 
Sbjct: 461  RGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRFSGDFTGP 518

Query: 667  GQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXXXXXXXXX 512
            G                 GMMF GRP QPG +FP     GM+                  
Sbjct: 519  G-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATN 561

Query: 511  XXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDR 341
                  V  PP   N                       NDRYS GSD G+ QEM  PG R
Sbjct: 562  PRGGRPVGVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPG-R 620

Query: 340  TTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209
               DE +YQ  G KA  ++ Y     RNDESESEDEAPRRSRHGEGKK+
Sbjct: 621  GPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 669


>gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  783 bits (2021), Expect = 0.0
 Identities = 425/722 (58%), Positives = 473/722 (65%), Gaps = 29/722 (4%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTA-----PSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISD 2123
            MED EGVLSFDFEGGLDT      P+  +A+  L+  D S AA+ N   +     +V +D
Sbjct: 1    MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNN--LAASNSAVSAD 58

Query: 2122 PVPG-----NYPGR-RSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGEC 1961
            P  G     + PGR RSFRQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFRLYGEC
Sbjct: 59   PTSGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGEC 118

Query: 1960 REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNY 1781
            REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP VEEV QKIQ+LSS+NY
Sbjct: 119  REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY 178

Query: 1780 GFPNRFFQNRNAG-YAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXX 1604
               N+FFQ RNAG +AQ  E+P  P G N ++QGV  KPS   ES N+            
Sbjct: 179  -HSNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSIL-ESANVQQPQQQVQPSQQ 236

Query: 1603 XXXXXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424
                       N+  G+PNQ N+T  PLP G+SRYFIVKSCNRENLELSVQQGVWATQRS
Sbjct: 237  PVGQNQIQ---NVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRS 293

Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244
            NEAKLNEAFD  ENVILIFSVNRTRHFQGCAKM S+IGG + GGNWKY+HG+AHYGRNFS
Sbjct: 294  NEAKLNEAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFS 353

Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064
            VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS
Sbjct: 354  VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAIS 413

Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRG 884
            +           KGV+PD+G ENPDIVPF                S+ Q LGA QGRGRG
Sbjct: 414  LAAESKREEEKAKGVDPDNGGENPDIVPFEDNEEDEEEESEDEEESFSQVLGANQGRGRG 473

Query: 883  RGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPS 704
            RG++WPPHM L              PVM+GADG  YG +TPDG PM P+LFN+ PR F  
Sbjct: 474  RGVMWPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPM-PDLFNVGPRAFNP 532

Query: 703  YGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXX 524
            YGPRFP DF                 G TSGMMF GRP+QPG +FP   G GM+      
Sbjct: 533  YGPRFPGDF----------------MGPTSGMMFRGRPTQPGAVFP-GGGFGMMMGPGRA 575

Query: 523  XXXXXXXXXMV---------ATPPVHANXXXXXXXXXXXXXXXXXXPTND---RYSYGSD 380
                                A PP+                       ND   RY  GSD
Sbjct: 576  PCMGGMGVQGTSPARPMRPGAMPPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGAGSD 635

Query: 379  PGKGQEMASPGDRTTYDETKYQPGGLKAQ-----SKNIYRNDESESEDEAPRRSRHGEGK 215
              +GQEM+ P      D+  YQ G    Q     + N +RNDESESEDEAPRRSRHG+GK
Sbjct: 636  QVRGQEMSGPAGGPE-DDAHYQLGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGK 694

Query: 214  KR 209
            K+
Sbjct: 695  KK 696


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  778 bits (2009), Expect = 0.0
 Identities = 427/719 (59%), Positives = 473/719 (65%), Gaps = 26/719 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVP---LVPTDPSIAAS-----GNPALVPGPGQSV 2132
            MED EGVLSFDFEGGLD APS+ +AAVP   LV  D S AAS     G+ A  P      
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPST---- 56

Query: 2131 ISDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQ 1952
             +DP  GN PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQ
Sbjct: 57   -ADPAGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQ 115

Query: 1951 DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFP 1772
            DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY   
Sbjct: 116  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSS 175

Query: 1771 NRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXX 1592
            N+FFQ R A Y QQ E+PQ PQG+N+ NQGV  KP   AES N                 
Sbjct: 176  NKFFQQRGASYNQQAEKPQLPQGTNSTNQGVTGKP-LPAESGNAQPQQQVQQSQQQVNQS 234

Query: 1591 XXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1412
                   N+ +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+K
Sbjct: 235  QMQ----NVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESK 290

Query: 1411 LNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWL 1232
            LNEAFD+ ENVIL+FSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWL
Sbjct: 291  LNEAFDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWL 350

Query: 1231 KLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXX 1052
            KLCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV   
Sbjct: 351  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAE 410

Query: 1051 XXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGM 875
                    KGVNPD+G ENPDIVPF                S+   +G A QGRGRGRGM
Sbjct: 411  SKREEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGM 470

Query: 874  LWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITP---DGLPMPPELFNMAPRVFPS 704
            +WPPHM L              PVMMG DG +YG + P   DG  MP +LF + PR F  
Sbjct: 471  MWPPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMP-DLFGVGPRGFAP 528

Query: 703  YGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXX 524
            YGPRF  DF                 G  + MMF GRPSQPG +FP + G GM+      
Sbjct: 529  YGPRFSGDF----------------GGPPAAMMFRGRPSQPG-MFP-SGGFGMMMNPGRG 570

Query: 523  XXXXXXXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPG 374
                         P    PV+                            NDR+  GS+ G
Sbjct: 571  PFMGGMGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQG 630

Query: 373  KGQEMASPGDRTTYDETKYQPGGLKAQ----SKNIYRNDESESEDEAPRRSRHGEGKKR 209
            K Q+M S       D+ +YQ G    Q    + N +RND+SESEDEAPRRSRHGEGKK+
Sbjct: 631  KSQDMLSQSGGPD-DDAQYQQGYKGNQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 688


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  775 bits (2002), Expect = 0.0
 Identities = 422/714 (59%), Positives = 469/714 (65%), Gaps = 21/714 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSA-AVPLVPTDPSIAAS-----GNPALVPGPGQSVIS 2126
            MED EGVLSFDFEGGLDTAPS  +A + PLV  D S AAS     G PA  P       +
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSG-----T 55

Query: 2125 DPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946
            +P   N PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQDC
Sbjct: 56   EPAAVNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDC 115

Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY   N+
Sbjct: 116  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNK 175

Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586
            FFQ R + Y QQ E+ Q PQG+N+ NQGV  KP   AES N                   
Sbjct: 176  FFQQRGSSYTQQAEKSQLPQGTNSTNQGVTGKP-LPAESGNAQPQQQVQQSQQQQVSQNQ 234

Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406
                 N+ +G PNQ ++ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLN
Sbjct: 235  IQ---NVANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLN 291

Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226
            EAFD+ ENVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWLKL
Sbjct: 292  EAFDSVENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 351

Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046
            CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPD ELMA+SV     
Sbjct: 352  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESK 411

Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLW 869
                  KGVNPD+G ENPDIVPF                S+G  +G A QGRGRGRGM+W
Sbjct: 412  REEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMW 471

Query: 868  PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689
            PPHM L              PVMMG DG +YG + PDG  MP +LF++ PR F  YGPRF
Sbjct: 472  PPHMPLPRGARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMP-DLFSVGPRAFAPYGPRF 529

Query: 688  PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXX 509
              DF                 G  + MMF GRPSQPG +FP   G GM+           
Sbjct: 530  SGDF----------------GGPPAAMMFRGRPSQPG-MFP-GGGFGMMMNPGRGPFMGG 571

Query: 508  XXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEM 359
                    P    PV+                            NDRY  GS+ GK Q+M
Sbjct: 572  MGVAGANPPRGGRPVNMPPMFPPPPPLPQNTNRLAKRDQRTTDRNDRYGSGSEQGKSQDM 631

Query: 358  ASPGDRTTYDETKYQPGGLKAQ----SKNIYRNDESESEDEAPRRSRHGEGKKR 209
             S       D+ +YQ G    Q    + N +RND+SESEDEAPRRSRHGEGKK+
Sbjct: 632  LSQSGAPD-DDMQYQQGYKANQDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 684


>ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cicer arietinum]
          Length = 677

 Score =  775 bits (2001), Expect = 0.0
 Identities = 422/706 (59%), Positives = 466/706 (66%), Gaps = 11/706 (1%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            MED EGVLSFDFEGGLD AP  PSAA   VP  PS       + +P    S  + PV GN
Sbjct: 1    MEDSEGVLSFDFEGGLDAAP--PSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAPVSGN 58

Query: 2107 YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHTN 1928
             PGRRSFRQTVCRHWLR LCMKGEACGFLHQYDK+RMP+CRFFRLYGECREQDCVYKHTN
Sbjct: 59   IPGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTN 118

Query: 1927 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNRN 1748
            EDIKECNMYKLGFCPNGPDCRYRH K PGPPPP+EEV QKIQ+L S+N+   ++F Q R 
Sbjct: 119  EDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRG 178

Query: 1747 AGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQN 1568
            + Y QQ+E+ QFPQG N+ NQGVA KP   AES N+                       N
Sbjct: 179  SSYTQQVEKSQFPQGINSANQGVAGKP-LAAESGNVQQQQQVQQSQQQVSQIQTQ----N 233

Query: 1567 IPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDTT 1388
            + +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFD+ 
Sbjct: 234  LANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSV 293

Query: 1387 ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSFH 1208
            ENVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKY+HG+AHYGRNFSVKWLKLCELSFH
Sbjct: 294  ENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 353

Query: 1207 KTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1028
            KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+           
Sbjct: 354  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKA 413

Query: 1027 KGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQA-LGATQGRGRGRGMLWPPHMTL 851
            KGVNPD+  ENPDIVPF                S+ QA +   QGRGRGRGM+WPPHM L
Sbjct: 414  KGVNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPL 473

Query: 850  XXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPDFTG 671
                          PVMMG DG +YG   PDG  M P+LF M PR F  YGPRF  DF  
Sbjct: 474  GRGARPMPGMQGFNPVMMG-DGLSYGPGAPDGFGM-PDLFGMGPRGFGPYGPRFSGDF-- 529

Query: 670  LGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVT---------XXXXXXX 518
                          AG  + MMF GRPSQPG +FP   G GM+                 
Sbjct: 530  --------------AGPPAAMMFRGRPSQPG-MFP-GGGFGMMMNPGRGPFMGGMGVPGP 573

Query: 517  XXXXXXXMVATPPVH-ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDR 341
                    +  PP+                        NDRYS G + GK Q+M S    
Sbjct: 574  NPPRGGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTNDRNDRYSSGQEQGKSQDMLSQSGG 633

Query: 340  TTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR*G 203
               DE +YQ  G  A   N +RN++SESEDEAPRRSRHGEGKKR G
Sbjct: 634  PD-DEMQYQQSGAPA---NNFRNEDSESEDEAPRRSRHGEGKKRKG 675


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  774 bits (1998), Expect = 0.0
 Identities = 422/710 (59%), Positives = 464/710 (65%), Gaps = 17/710 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAV--PLVPTDPSIAAS----GNPALVPGPGQSVIS 2126
            MED EGVLSFDFEGGLD APS+ +AA   PL+P D S AAS    G PA    P  S + 
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPA---APAPSAVD 57

Query: 2125 DPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946
                GN PGRRSFRQTVCRHWLR LCMKG+ACGFLHQYDK+RMP+CRFFRLYGECREQDC
Sbjct: 58   PVGGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDC 117

Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPPPVEEV QKIQ+L S+NY   N+
Sbjct: 118  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNK 177

Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586
            FFQ R A Y QQ E+P  PQG+N+ NQGV   P      P                    
Sbjct: 178  FFQQRGASYNQQAEKPLLPQGNNSTNQGVTGNPL-----PAELGNAQPQQQVQQSQQQVN 232

Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1406
                QN+ +G PNQ N+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLN
Sbjct: 233  QSQMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLN 292

Query: 1405 EAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKL 1226
            EAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKY+HG+AHYGRNFSVKWLKL
Sbjct: 293  EAFDSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 352

Query: 1225 CELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXX 1046
            CELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAISV     
Sbjct: 353  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESK 412

Query: 1045 XXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLW 869
                  KGVNPD+G ENPDIVPF                S+G  +G A QGRGRGRGM+W
Sbjct: 413  REEEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMW 472

Query: 868  PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689
            PPHM L              PVMMG DG +YG + PDG  MP +LF + PR F  YGPRF
Sbjct: 473  PPHMPLGRGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMP-DLFGVGPRGFAPYGPRF 530

Query: 688  PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXX 509
              DF                 G  + MMF GRPSQPG +FP   G GM+           
Sbjct: 531  SGDF----------------GGPPAAMMFRGRPSQPG-MFP-GGGFGMMLNPGRGPFMGG 572

Query: 508  XXXXMVATP----PVH------ANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEM 359
                    P    PV+                            NDR+  GS+ GK Q+M
Sbjct: 573  IGVGGANPPRGGRPVNMPPMFPPPPPLPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDM 632

Query: 358  ASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
             S       D+ +YQ G    Q  +    D+SESEDEAPRRSRHGEGKK+
Sbjct: 633  LSQSGGPD-DDPQYQQGYKGNQDDH---PDDSESEDEAPRRSRHGEGKKK 678


>ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis sativus]
          Length = 707

 Score =  744 bits (1921), Expect = 0.0
 Identities = 409/722 (56%), Positives = 461/722 (63%), Gaps = 29/722 (4%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSA--AVPLVPTD----PSIAASGNPALVPGPGQSVIS 2126
            MED EGVLSFDFEGGLD  P+NP+A  ++P++ +D    P+ +A  NP L    G +V +
Sbjct: 1    MEDSEGVLSFDFEGGLDAGPTNPAATSSLPIINSDSSAPPAASAVSNP-LSGALGPAVSA 59

Query: 2125 DPVP---GNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECRE 1955
            +P     GN   RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECRE
Sbjct: 60   EPTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECRE 119

Query: 1954 QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGF 1775
            QDCVYKHTNEDIKECNMYK GFCPNGPDCRYRH KLPGPPPP+EE+ QKIQ+L S+NYG 
Sbjct: 120  QDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYGP 179

Query: 1774 PNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXX 1595
             N+FF  R  G +QQ E+ QFPQ    + QGV  KPS  AES N+               
Sbjct: 180  SNKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSA-AESVNVQQQQGQQSAPQASQT 238

Query: 1594 XXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEA 1415
                    ++ +G PNQ N+ A  LPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEA
Sbjct: 239  PVQ-----SLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEA 293

Query: 1414 KLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKW 1235
            KLNEAFD+ +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HG+ HYG+NFS+KW
Sbjct: 294  KLNEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKW 353

Query: 1234 LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXX 1055
            LKLCELSF KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPD ELMA+SV  
Sbjct: 354  LKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAA 413

Query: 1054 XXXXXXXXXKGVNPDDGVENPDIVPF-XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGR 881
                     KGVNPD G ENPDIVPF                 S+GQ+ G   QGRGRGR
Sbjct: 414  ESKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGRGR 473

Query: 880  GMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSY 701
            GM+WPPHM +              P MMG DG +YG +TPDG PM P++F M PR F  Y
Sbjct: 474  GMMWPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPM-PDIFGMTPRGFGPY 532

Query: 700  G--PRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXX 527
            G  PRF  DF                 G  + MMF GRPSQP  +FP  SG GM+     
Sbjct: 533  GPTPRFSGDF----------------MGPPTAMMFRGRPSQPAAMFP-PSGFGMMMGQGR 575

Query: 526  XXXXXXXXXXMV----------ATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDP 377
                                   +P                        TNDRY  G D 
Sbjct: 576  GPFMGGMGVAGANPARPGRPVGVSPLYPPPAVPSSQNMNRAIKRDQRGLTNDRYIVGMDQ 635

Query: 376  GKGQEMASPGDRTTYDETKYQPGGLKAQSKNIY------RNDESESEDEAPRRSRHGEGK 215
             KG E+ S G     DE      G KA S   Y      RN+ESESEDEAPRRSRHGEGK
Sbjct: 636  NKGVEIQSSG----RDEEMQYKQGSKAYSDEQYGTGTTFRNEESESEDEAPRRSRHGEGK 691

Query: 214  KR 209
            K+
Sbjct: 692  KK 693


>ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica]
            gi|462410040|gb|EMJ15374.1| hypothetical protein
            PRUPE_ppa019072mg [Prunus persica]
          Length = 695

 Score =  729 bits (1881), Expect = 0.0
 Identities = 410/720 (56%), Positives = 459/720 (63%), Gaps = 27/720 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLD-TAPSNPSAAVP----LVPTDPSIAA-SGNPALV-PGPGQSVI 2129
            MED +G ++FDFEGGLD TA + P+   P    L+ +D  +AA   NPA   P P     
Sbjct: 1    MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNH--- 57

Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949
              P P N  G RS+RQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFRLYGECREQD
Sbjct: 58   --PNP-NRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQD 114

Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769
            CVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+NY   N
Sbjct: 115  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSN 174

Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589
            +F+Q RNAG+ QQ ++ Q  QG N++ QGV  KPST  ES N+                 
Sbjct: 175  KFYQQRNAGFPQQADKYQSAQGPNSVYQGVVGKPST-GESANVHQQQQVQQTQQQVGHTQ 233

Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409
                  N+P+G+ NQ N++A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KL
Sbjct: 234  TQ----NLPNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKL 288

Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229
            NEAFD+ ENVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HGSAHYGRNFSVKWLK
Sbjct: 289  NEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLK 348

Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049
            LCELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMA+S+    
Sbjct: 349  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAES 408

Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGR-GM 875
                   KGVNP++G ENPDIVPF                S+G   G   +GRGRGR G+
Sbjct: 409  KREEEKAKGVNPENGGENPDIVPFEDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGI 468

Query: 874  LWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGP 695
            +WPPHM L              P MMGAD   YG   PDG  M P  F + PR F  YGP
Sbjct: 469  MWPPHMPLARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGM-PNPFGVGPRGFNPYGP 526

Query: 694  RFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGM---------- 545
            RF  DFT                G T GMMF GRP QPG  FP   G GM          
Sbjct: 527  RFSGDFT----------------GPTPGMMFRGRPQQPG--FP-PGGYGMMMGPGRAPFM 567

Query: 544  ----VTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDP 377
                V                +  PP   N                    N+RYS GS  
Sbjct: 568  GGMGVGGANPGRPGRPTGMSPMFPPPSSQN----TNRMQKRDPRGPSNDRNERYSAGSGQ 623

Query: 376  GKGQEM----ASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
            GKGQE+      P D   Y +        +  + N  RND+SESEDEAPRRSRHGEGKK+
Sbjct: 624  GKGQEIPGLAGGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAPRRSRHGEGKKK 683


>ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551536|gb|ESR62165.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 672

 Score =  727 bits (1877), Expect = 0.0
 Identities = 403/716 (56%), Positives = 453/716 (63%), Gaps = 23/716 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASG-----NPALVPGPGQSV--I 2129
            MED EG LSFDFEGGLD  P  P+A+ P + +D + AA+      N A +   G +    
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2128 SDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949
            S PVP ++ GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRL+GECREQD
Sbjct: 61   SAPVP-HHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQD 119

Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769
            CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP VEEV QKIQ +SS+N+G PN
Sbjct: 120  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPN 179

Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589
            + FQ R A ++ QI++ QF QG N +NQG A K S+TAES N+                 
Sbjct: 180  KLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGK-SSTAESANVHQQQLVQQPQQQGTQTT 237

Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409
                  N+P+G+PNQ N+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 238  QMQ---NLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229
            NEAFD+ ENVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKY+HG+AHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049
            LCELSFHKTRHLRNP+NENLPVK                             AISV    
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVK-----------------------------AISVAAEA 385

Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLW 869
                   KGVNPD+G +NPDIVPF                S G A   +QGRGRGRGM+W
Sbjct: 386  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESLGTA---SQGRGRGRGMMW 442

Query: 868  PPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRF 689
            P  M L              P+M+GADGF+YG +TPDG PM P+LF +APR F  YGPRF
Sbjct: 443  PGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPM-PDLFGVAPRPFAPYGPRF 500

Query: 688  PPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV--------TXX 533
              DFTG G                 GMMF GRP QPG +FP     GM+           
Sbjct: 501  SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 543

Query: 532  XXXXXXXXXXXXMVATPPVHAN---XXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362
                         V  PP   N                       NDRYS GSD G+ QE
Sbjct: 544  MGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQE 603

Query: 361  MASPGDRTTYDETKYQPGGLKAQSKNIY-----RNDESESEDEAPRRSRHGEGKKR 209
            M  PG R   DE +YQ  G KA  ++ Y     RNDESESEDEAPRRSRHGEGKK+
Sbjct: 604  MGGPG-RGPDDEVQYQQEGSKANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKK 658


>ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa]
            gi|550349048|gb|EEE85138.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 669

 Score =  722 bits (1864), Expect = 0.0
 Identities = 405/711 (56%), Positives = 449/711 (63%), Gaps = 18/711 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            MED EGVLSFDFEGGLD+ P+NP A++P +P+D   AA+   A  P    +  +     N
Sbjct: 1    MEDSEGVLSFDFEGGLDSGPANPIASIPAIPSDNYGAAT---AAAPNTTNTTTNTTNNSN 57

Query: 2107 ------YPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDC 1946
                    GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDC
Sbjct: 58   SGAADIQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 117

Query: 1945 VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNR 1766
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ L+S+N    N+
Sbjct: 118  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSNK 177

Query: 1765 FFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXX 1586
             FQ RNAG++QQIE+       NTI      KPS T ES N+                  
Sbjct: 178  NFQQRNAGFSQQIEK-----SPNTI-----IKPSGT-ESANVQQQQQQQQQTQTPHLTNG 226

Query: 1585 XXXXQNIPDGMPNQGNKTALPLPQGLSR-----------YFIVKSCNRENLELSVQQGVW 1439
                       PN  N+ A PLPQG+S            YFIVKSCNRENLELSVQQGVW
Sbjct: 227  QHQQPQ----QPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQGVW 282

Query: 1438 ATQRSNEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHY 1259
            ATQRSNE KLNEA D+ +NVILIFSVNRTRHFQGCAKM SKIG  VGGGNWKY+HG+AHY
Sbjct: 283  ATQRSNEIKLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTAHY 342

Query: 1258 GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSE 1079
            GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELE SIGEQLASLLYLEPDSE
Sbjct: 343  GRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSE 402

Query: 1078 LMAISVXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALG-AT 902
            LMA+S+           KGVNPD G ENPDIVPF                S+GQ LG A 
Sbjct: 403  LMAVSLAAEAKREEEKEKGVNPDSGGENPDIVPFEDNEEEEEEESEEEEESFGQPLGPAA 462

Query: 901  QGRGRGRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMA 722
            QGRGRGRGM+WP H  +              P+MMGADGF+YG +TPD   M P+LF +A
Sbjct: 463  QGRGRGRGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGM-PDLFGVA 521

Query: 721  PRVFPSYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMV 542
             R FP YGPRF  DFT                G  SGMMF GRPSQPG +FP A G GM+
Sbjct: 522  SRGFPPYGPRFSGDFT----------------GAASGMMFPGRPSQPGAVFP-AGGFGMM 564

Query: 541  TXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQE 362
                                P  +N                    N+  S      K  +
Sbjct: 565  MGPGRPPFIG-------GMGPTPSNLLRGPRPGGMFAPFPAPSSQNNSRSV-----KRDQ 612

Query: 361  MASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
             A+  DR   ++   Q G +     N  RNDESESEDEAPRRSRHGEGKK+
Sbjct: 613  RAAANDR---NDRHNQFGAV-----NSIRNDESESEDEAPRRSRHGEGKKK 655


>ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Fragaria vesca subsp. vesca]
          Length = 689

 Score =  715 bits (1845), Expect = 0.0
 Identities = 396/714 (55%), Positives = 444/714 (62%), Gaps = 21/714 (2%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDP----SIAASGNPALVPGPGQSVISDP 2120
            MEDP+GVL+FDFEGGLD+A  +      L  + P    S A+       P P       P
Sbjct: 1    MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAAPAP------QP 54

Query: 2119 VPGNYP-GRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCV 1943
             P   P GR+SFRQTVCRHWLR LCMKGEACGFLHQYDKSRMP+CRFFR+YGECREQDCV
Sbjct: 55   DPNVNPSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCV 114

Query: 1942 YKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRF 1763
            YKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+NY   N+F
Sbjct: 115  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKF 174

Query: 1762 FQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXX 1583
             Q RN G+ QQ +R Q  Q +N+ NQ V  +PS  AES N+                   
Sbjct: 175  SQPRNGGFPQQHDRSQPAQVTNSFNQ-VVVRPSA-AESANVQQPQQFQQTQQPVAQTQAQ 232

Query: 1582 XXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1403
                ++P+G+ +Q N+ ALPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNE
Sbjct: 233  ----SVPNGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNE 288

Query: 1402 AFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLC 1223
            AFD+ ENVILIFSVNRTRHFQGCAKM S+IGG V GGNWKY+HG+AHYGRNFSVKWLKLC
Sbjct: 289  AFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLC 348

Query: 1222 ELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXX 1043
            ELSFHKTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDSELMAIS+      
Sbjct: 349  ELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKR 408

Query: 1042 XXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGATQGRGRGRGMLWPP 863
                 KGVNP++G ENPDIVPF                 Y    GA + RGRGR ++WPP
Sbjct: 409  EEEKAKGVNPENGGENPDIVPF-EDNEEEEEEESDDEEDYQVPGGAIENRGRGR-VMWPP 466

Query: 862  HMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPP 683
            HM L              P MMG D   YG +TPDG  MP       PR F  YGPRF  
Sbjct: 467  HMPLGGRGGRPMPGMQGFPGMMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSG 526

Query: 682  DFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAA--------------SGLGM 545
            DF                 G   GMMF GRP QPG +FP                 G+G+
Sbjct: 527  DF----------------GGPNPGMMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGV 570

Query: 544  VTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQ 365
                            M    P   N                    N+RYS GS  GK  
Sbjct: 571  GGNNPARGGRPGGMPPMFPPHPPSQN----NNRLQKRDPRGSGNDRNERYSAGSGHGKEM 626

Query: 364  EMASPGDRTTYDET--KYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
            +   P D   Y  +   YQ       + N  RND+SESEDEAPRRSRHGEGKK+
Sbjct: 627  QAGGPDDENHYQHSSKSYQE---DYGAGNNGRNDDSESEDEAPRRSRHGEGKKK 677


>gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus guttatus]
          Length = 681

 Score =  711 bits (1834), Expect = 0.0
 Identities = 399/731 (54%), Positives = 451/731 (61%), Gaps = 26/731 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPG- 2111
            M+D EG LSFDFEGGLD  PS+P+A+VP++ +  +   +   A    P  +  + PVP  
Sbjct: 1    MDDGEGGLSFDFEGGLDIGPSHPTASVPVIQSSANANTASAAAAAANP-YNPSAAPVPAT 59

Query: 2110 ------NYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQD 1949
                  N  GRRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECREQD
Sbjct: 60   QAAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQD 119

Query: 1948 CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPN 1769
            CVYKHTNED+KECNMYKLGFCPNGPDCRYRH KLPGPPP VEEV QKIQ L+S+NYG  N
Sbjct: 120  CVYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSN 179

Query: 1768 RFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXX 1589
             FFQNRN+ +AQQ E+PQFPQG N  +Q      +  AE  N+                 
Sbjct: 180  NFFQNRNSNFAQQTEKPQFPQGPNGTHQ---VGKTNAAEPGNLNQPAQQSQQPGSQGQLQ 236

Query: 1588 XXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1409
                  +IP+   NQ ++ A PLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 237  ------SIPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKL 290

Query: 1408 NEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLK 1229
            NEAF++ EN+ILIFSVN+TRHFQGCAKMTS+IGG VGGGNWK++HG+AHYGRNF++KWLK
Sbjct: 291  NEAFESVENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLK 350

Query: 1228 LCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXX 1049
            LCEL+F KTRHLRNP+NENLPVKISRDCQELE SIGEQLASLLYLEPDS+LMAI++    
Sbjct: 351  LCELTFDKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAEL 410

Query: 1048 XXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSY-----GQALGATQGRGRG 884
                   KGVN D+G ENPDIVPF                       GQA GA QGRG G
Sbjct: 411  KREEEKAKGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGA-QGRGVG 469

Query: 883  RGMLWPPHMT-LXXXXXXXXXXXXXXPVMMGADGFTYGTITP---DGLPMPPELFNMAPR 716
            RGM+W PHM  L              P MMG DGF YG   P   DG PM  + F M PR
Sbjct: 470  RGMMWGPHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMH-DPFGMVPR 528

Query: 715  VFPSYGPRFPPDFTGLGQSSAM-------GFAPLGGAGLTS--GMMFHGRP-SQPGPIFP 566
             F  +GPRF  DF G      M       GF P+ G G     G    GRP   P P FP
Sbjct: 529  GFGQFGPRFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPFFP 588

Query: 565  AASGLGMVTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYG 386
                                       PPV A                     ND     
Sbjct: 589  PPP------------------------PPVAAQPPPQNSNWVKRDQKAPYSDRNDV---- 620

Query: 385  SDPGKGQEMASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR* 206
            SD GKGQE+ S               G  A+ +  YRNDESESEDEAPRRSRHGEGKK+ 
Sbjct: 621  SDQGKGQEIVSGSSNR----------GNAAKREESYRNDESESEDEAPRRSRHGEGKKKR 670

Query: 205  GS*HREMEREY 173
                 E + E+
Sbjct: 671  RGSEAETDGEF 681


>ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 677

 Score =  709 bits (1829), Expect = 0.0
 Identities = 395/705 (56%), Positives = 456/705 (64%), Gaps = 12/705 (1%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNP----ALVPGPGQSVISDP 2120
            M+D EG L+FDFEGGLDT P++P+A+VP++ +   I     P    ALVP PG  V    
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVLQSAGHITTGPAPNASVALVP-PGGGV-GQG 58

Query: 2119 VPGNYPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCV 1943
              G++ G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCV
Sbjct: 59   GDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 118

Query: 1942 YKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRF 1763
            YKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPV EV Q+IQNL+S  YG+ NRF
Sbjct: 119  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTS--YGYSNRF 176

Query: 1762 FQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXX 1583
            FQNRN  Y+ Q ++ Q PQ  N +NQ V    ST AE P                     
Sbjct: 177  FQNRNTNYSTQADKSQIPQVPNVMNQAVK---STAAEPPIGQPHQPHQQQVQQPQHQGAP 233

Query: 1582 XXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1403
               Q +P    +Q N+ A+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE
Sbjct: 234  TQTQTLPS---SQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 290

Query: 1402 AFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLC 1223
            AFD+ ENVIL+FS+NRTRHFQG AKMTS+IGG   GGNWK+ HG+AHYGRNFS+KWLKLC
Sbjct: 291  AFDSVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLC 350

Query: 1222 ELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXX 1043
            ELSF KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLY+EPDSELMA+S+      
Sbjct: 351  ELSFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKR 410

Query: 1042 XXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXS--YGQALG-ATQGRGRGRGML 872
                 KGVNPD+G ENPDIVPF                   +GQA G A  GRGRGRG++
Sbjct: 411  EEERAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGRGIV 470

Query: 871  WPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPR 692
            WPP +                P MM +DGF+YG++TPDG PMP + + M  R F  +GPR
Sbjct: 471  WPPLVPFGRGARPFPGMRGFPPGMM-SDGFSYGSMTPDGFPMP-DPYGMGGRPFGPFGPR 528

Query: 691  FPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXX 512
            FP D     +  A G     G G+   MM  GRP   G + P A G              
Sbjct: 529  FPGDMMFHSRPPAAG-----GFGM---MMGPGRPPFMGGMGPGAPG-----PPRGGRPMG 575

Query: 511  XXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMAS----PGD 344
                 +  TPP   N                    NDR+S G D G+GQE+A     P +
Sbjct: 576  IHPSFIPPTPPPSQNPRVKKDQRAPFNER------NDRFSSGPDQGRGQEIAGSVGGPAE 629

Query: 343  RTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
               Y +T+           N +RNDESESEDEAPRRSRHG+GKK+
Sbjct: 630  GVHYPQTE-----------NSFRNDESESEDEAPRRSRHGDGKKK 663


>ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 671

 Score =  704 bits (1818), Expect = 0.0
 Identities = 391/698 (56%), Positives = 451/698 (64%), Gaps = 5/698 (0%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            M+D EG L+FDFEGGLDT P++P+A+VP++   P+  AS    + PG G  +  D   G+
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVIQAGPAPNASV-AVVPPGGGVGLGGD---GS 56

Query: 2107 YPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 1931
            + G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGECREQDCVYKHT
Sbjct: 57   FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 116

Query: 1930 NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNR 1751
            NEDIKECNM+KLGFCPNGPDCRYRH K+PGPPPPV EV QKIQNL+S  +G+ NRFFQNR
Sbjct: 117  NEDIKECNMFKLGFCPNGPDCRYRHAKMPGPPPPVVEVLQKIQNLTS--HGYSNRFFQNR 174

Query: 1750 NAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQ 1571
            N  Y+ Q ++ Q PQ  N +NQ V    ST  E P                        Q
Sbjct: 175  NTNYSTQADKSQIPQVPNVMNQAVK---STATEPPIGQPHQPHQQQVQQPQHQGPPTQTQ 231

Query: 1570 NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDT 1391
             +P     Q N+ A+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+
Sbjct: 232  TLPG---TQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 288

Query: 1390 TENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSF 1211
             ENVILIFS+NRTRHFQG AKMTS+IGG   GGNWK+ HG+AHYGRNFSVKWLKLCELSF
Sbjct: 289  VENVILIFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCELSF 348

Query: 1210 HKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1031
             KTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLY+EPDSELMAIS+          
Sbjct: 349  QKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAISLAAESKREEER 408

Query: 1030 XKGVNPDDGVENPDIVPF---XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPP 863
             KGVNPD+G ENPDIVPF                    +GQALG A   RGRGRG++WPP
Sbjct: 409  AKGVNPDNGNENPDIVPFEDNEEEEEEESEEEDEEDEGFGQALGPAALDRGRGRGIVWPP 468

Query: 862  HMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPP 683
             +                 +M  +DGF+YG++TPDG PM P+ + M  R F  +GPRFP 
Sbjct: 469  LVPFRGARPFPGMRGFPPGIM--SDGFSYGSMTPDGFPM-PDPYGMGGRPFGPFGPRFPG 525

Query: 682  DFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXX 503
            D     +  A      GG G+   MM   RP   G + P A G                 
Sbjct: 526  DMMFHSRPPA-----AGGFGM---MMGPARPPFMGGMGPGAPG-----PPRGGRPMGMHP 572

Query: 502  XXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDRTTYDET 323
                  PP   N                    NDR+S G D G+GQE A  G     DE 
Sbjct: 573  SFTPPPPPPSQN------PRVKKDQRAPFNERNDRFSSGPDQGRGQETA--GSVVGPDEG 624

Query: 322  KYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
             + P     Q++N +RNDESESEDEAPRRSRHG+GKK+
Sbjct: 625  VHYP-----QTENSFRNDESESEDEAPRRSRHGDGKKK 657


>gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malus domestica]
          Length = 667

 Score =  702 bits (1812), Expect = 0.0
 Identities = 392/718 (54%), Positives = 444/718 (61%), Gaps = 25/718 (3%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLD-----TAPSNPSAAVP-----LVPTDPSIAASG--NPALVPGP 2144
            MED +G L+FDFEGGLD     +A + P+  VP     ++ +D ++   G    A  P P
Sbjct: 1    MEDSDGGLNFDFEGGLDAPATVSASAGPANTVPTSNYSVMQSDSAVTGLGANQAAAAPQP 60

Query: 2143 GQSVISDPVPGNYPGRRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGE 1964
             Q+        N  G RS+RQTVCRHWLR LCMKG+ACGFLHQYDKSRMP+CRFFRLYGE
Sbjct: 61   NQNA-------NRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGE 113

Query: 1963 CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFN 1784
            CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPPVEEV QKIQ+L+S+N
Sbjct: 114  CREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYN 173

Query: 1783 YGFPNRFFQNRNAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXX 1604
            Y   ++F+Q RNAG+ QQ ++ Q  QG N        KP TTAE  N+            
Sbjct: 174  YNNSSKFYQQRNAGFPQQGDKHQPAQGPNNF----VGKP-TTAEPGNVQQQQQQQLQQTQ 228

Query: 1603 XXXXXXXXXXQNIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRS 1424
                        +P+G+ NQ N++ALPLPQG SRYFIVKSCNRENLELSVQQG+WATQRS
Sbjct: 229  QHVGPTQTQ--TLPNGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRS 286

Query: 1423 NEAKLNEAFDTTENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFS 1244
            NE+KLNEAFD+ ENVILIFSVNRTRHFQGCAKM S+IGG VGGGNWKY+HG+AHYGRNFS
Sbjct: 287  NESKLNEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFS 346

Query: 1243 VKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAIS 1064
            VKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAIS
Sbjct: 347  VKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAIS 406

Query: 1063 VXXXXXXXXXXXKGVNPDDGVENPDIVPFXXXXXXXXXXXXXXXXSYGQALGA---TQGR 893
            +           KGVNP++G ENPDIVPF                S+GQ  GA    +GR
Sbjct: 407  IAAESKREEEKAKGVNPENGGENPDIVPFEDNEEEEEEESEDEEDSFGQVPGAGNDGRGR 466

Query: 892  GRGRGMLWPPHMTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRV 713
            GRG G++WPPHM L              P MMG D   Y    PDG  M P  F MAPR 
Sbjct: 467  GRGGGVMWPPHMALPRGGRPMPGMQGFPPGMMGHDAMPY---VPDGFVM-PNPFGMAPRG 522

Query: 712  FPSYGPRFPPDFTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFP----------A 563
            F  YGPRF  DFT                G   GMMF GRP QPG  FP           
Sbjct: 523  FNPYGPRFSGDFT----------------GPNPGMMFRGRPQQPG--FPPGGFGIMGPGR 564

Query: 562  ASGLGMVTXXXXXXXXXXXXXXMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGS 383
            A  +G +                   PP   N                           S
Sbjct: 565  APFMGGIHPGRGGRPTGMSPMFPPPPPPSSQNPNRMPKRDPRG---------------AS 609

Query: 382  DPGKGQEMASPGDRTTYDETKYQPGGLKAQSKNIYRNDESESEDEAPRRSRHGEGKKR 209
               KGQ+M+ P D T Y             + N  RND+SESEDEAPRRSRHG+GKK+
Sbjct: 610  TDRKGQDMSGPDDETHYG------------AGNSSRNDDSESEDEAPRRSRHGDGKKK 655


>ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 689

 Score =  698 bits (1801), Expect = 0.0
 Identities = 385/701 (54%), Positives = 447/701 (63%), Gaps = 8/701 (1%)
 Frame = -2

Query: 2287 MEDPEGVLSFDFEGGLDTAPSNPSAAVPLVPTDPSIAASGNPALVPGPGQSVISDPVPGN 2108
            M++ EG L+FDFEGGLDT P++P+A+VP++ +    AA+ + A +  P    +       
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAASSANINPPTVPAVGGQGDVG 60

Query: 2107 YPG-RRSFRQTVCRHWLRGLCMKGEACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 1931
            + G RRSFRQTVCRHWLR LCMKG+ACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT
Sbjct: 61   FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 120

Query: 1930 NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPVEEVFQKIQNLSSFNYGFPNRFFQNR 1751
             EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPPVEE+ QKIQ+L+S NYG+ NRF QNR
Sbjct: 121  IEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSNRFNQNR 180

Query: 1750 NAGYAQQIERPQFPQGSNTINQGVAAKPSTTAESPNMXXXXXXXXXXXXXXXXXXXXXXQ 1571
            NA Y+ Q ++ Q  Q  N  +  V    ST  E+P +                       
Sbjct: 181  NANYSTQTDKSQASQAQNGTSLAVK---STATETPIIQQHQPHQQVQPPQLQGGPTQAQI 237

Query: 1570 NIPDGMPNQGNKTALPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDT 1391
            + P+G  NQ ++TA+ LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD+
Sbjct: 238  H-PNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 296

Query: 1390 TENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYSHGSAHYGRNFSVKWLKLCELSF 1211
             ENVILIFSVNRTRHFQGC KMTS+IGG   GGNWK+ HG+AHYGRNFS+KWLKLCELSF
Sbjct: 297  VENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKLCELSF 356

Query: 1210 HKTRHLRNPFNENLPVKISRDCQELEASIGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1031
             KT HLRNP+NENLPVKISRDCQELE S+GEQLASLLYLEPDSELMAIS+          
Sbjct: 357  QKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRLEEK 416

Query: 1030 XKGVNPDDGVENPDIVPF--XXXXXXXXXXXXXXXXSYGQALG-ATQGRGRGRGMLWPPH 860
             KGVNPD+G +NPDIVPF                  ++ Q  G A  GRGRGRG+ WPP 
Sbjct: 417  AKGVNPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGIAWPPI 476

Query: 859  MTLXXXXXXXXXXXXXXPVMMGADGFTYGTITPDGLPMPPELFNMAPRVFPSYGPRFPPD 680
            M                P MMG DGF+YG +TP+G PM  + F M PR FP YGPRF  D
Sbjct: 477  MPFGHGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPM-TDHFGMGPRPFPPYGPRFSSD 534

Query: 679  FTGLGQSSAMGFAPLGGAGLTSGMMFHGRPSQPGPIFPAASGLGMVTXXXXXXXXXXXXX 500
                G+       P GG G+   M+  GRP   G + P A+G                  
Sbjct: 535  LMFHGR------PPAGGFGM---MIGPGRPPFVGGMGPGATGPPRAGRAVRMHPSFIPPS 585

Query: 499  XMVATPPVHANXXXXXXXXXXXXXXXXXXPTNDRYSYGSDPGKGQEMASPGDRTTYDETK 320
               +  P  A                     NDR+S  SD GKGQEM   G     D   
Sbjct: 586  SQPSQYPYRAK----------REQRAPVSDRNDRFS--SDQGKGQEMM--GSVNGPDGVH 631

Query: 319  YQPGGLKAQSK----NIYRNDESESEDEAPRRSRHGEGKKR 209
             Q G  +  ++    N  +ND SESEDEAPRRSRHG+GKK+
Sbjct: 632  MQIGKSEHDNQFGAGNSLKNDGSESEDEAPRRSRHGDGKKK 672


Top