BLASTX nr result

ID: Achyranthes23_contig00005322 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00005322
         (2499 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   768   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   758   0.0  
gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus pe...   725   0.0  
gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus...   723   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   723   0.0  
ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   723   0.0  
gb|EOX96971.1| Cleavage and polyadenylation specificity factor 3...   722   0.0  
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   719   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   717   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   714   0.0  
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   704   0.0  
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   689   0.0  
ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr...   689   0.0  
ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec...   687   0.0  
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   685   0.0  
ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation spec...   676   0.0  
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   664   0.0  
ref|XP_002893618.1| hypothetical protein ARALYDRAFT_890588 [Arab...   652   0.0  
ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec...   650   0.0  
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   647   0.0  

>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  768 bits (1983), Expect = 0.0
 Identities = 408/684 (59%), Positives = 449/684 (65%), Gaps = 25/684 (3%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPDHXXXXXXXXXXXXVDQGNRRSFRQTVC 257
            MED+EGGLSFDFEG LD  P +PTASNP   P                   RRSFRQTVC
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAAAPSSSGAAPDHASAPVPHHSGRRSFRQTVC 60

Query: 258  RHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECNMYKLG 437
            RHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRL+GECREQDCVYKHTNEDIKECNMYKLG
Sbjct: 61   RHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTNEDIKECNMYKLG 120

Query: 438  FCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNTNHSHQPDKSQS 617
            FCPNGPDCRYRH K PGPPP V+EVLQKIQQ++SYN+G  N+    R    SHQ DKSQ 
Sbjct: 121  FCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRGA-FSHQTDKSQF 179

Query: 618  LQGANATNQGAVPKPTASDSSNM--------PQPAGQD--QVQNLPSNPSNQTGRPATPL 767
             QG NA NQGA  K + ++S+N+        PQ  G    Q+QNLP+   NQT R ATPL
Sbjct: 180  SQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQMQNLPNGLPNQTNRNATPL 239

Query: 768  PQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILIFSVNRTRHFQ 947
            PQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS E+VILIFSVNRTRHFQ
Sbjct: 240  PQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSAENVILIFSVNRTRHFQ 299

Query: 948  GCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLRNPFNENLPVK 1127
            GCAKMTSKIG + GGGNWK+AHGTAHYGRNFSV WLKLCELSF+KTRHLRNP+NENLPVK
Sbjct: 300  GCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNENLPVK 359

Query: 1128 ISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNIDNGADNPDIVP 1307
            ISRDCQELEPS+GEQLA+LLYLEPDSELMA  V            GVN DNG DNPDIVP
Sbjct: 360  ISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKAKGVNPDNGGDNPDIVP 419

Query: 1308 FDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXXXXXXXXXXXX 1487
            F+DN                  ++   A Q       MMW                    
Sbjct: 420  FEDNEEEEEEESEEE------EESLGTASQGRGRGRGMMWPGPMPLARGARPVPGMRGFP 473

Query: 1488 XXXXXVDGFTYGPRPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPAPGMMFPGRPSQ--- 1658
                  DGF+YG  PDGFP+PD FGV PRPF PYGPRFSGDFT P  GMMFPGRP Q   
Sbjct: 474  PMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGPG-GMMFPGRPPQPGS 532

Query: 1659 -XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQSGNRGPKRDQ 1835
                                                             Q+ +R  KRD 
Sbjct: 533  VFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPFPNQPQSSQNSSRAAKRDV 592

Query: 1836 RGSGNDWGETFGLGPEQGKLLDVGG-GRG--EEPQYQ--GS-----EKLVSRNITNDESE 1985
            RGS ND  + +  G +QG+  ++GG GRG  +E QYQ  GS     ++  SRN  NDESE
Sbjct: 593  RGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRNFRNDESE 652

Query: 1986 SEDEAPRRSRHGEG-KKHRSMDGD 2054
            SEDEAPRRSRHGEG KK R  +GD
Sbjct: 653  SEDEAPRRSRHGEGKKKRRDSEGD 676


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  758 bits (1958), Expect = 0.0
 Identities = 410/702 (58%), Positives = 452/702 (64%), Gaps = 43/702 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPD-------------HXXXXXXXXXXXXV 218
            MED+EGGLSFDFEG LD  P +PTASNP +Q D             H             
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 219  D-----QGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDC 383
                     RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 384  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNR 563
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPP V+EVLQKIQQ++SYN+G  N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 564  FSHHRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNM--------PQPAGQD--QV 713
                R    SHQ DKSQ  QG NA NQGA  K + ++S+N+        PQ  G    Q+
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 714  QNLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 893
            QNLP+   NQT R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 894  SVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELS 1073
            S E+VILIFSVNRTRHFQGCAKMTSKIG + GGGNWK+AHGTAHYGRNFSV WLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1074 FNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXX 1253
            F+KTRHLRNP+NENLPVKISRDCQELEPS+GEQLA+LLYLEPDSELMA  V         
Sbjct: 360  FHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEE 419

Query: 1254 XXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXX 1433
               GVN DNG DNPDIVPF+DN                  ++   A Q       MMW  
Sbjct: 420  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEE------EESLGTASQGRGRGRGMMWPG 473

Query: 1434 XXXXXXXXXXXXXXXXXXXXXXXVDGFTYGPRPDGFPLPDPFGVGPRPFVPYGPRFSGDF 1613
                                    DGF+YG  PDGFP+PD FGV PRPF PYGPRFSGDF
Sbjct: 474  PMPLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDF 533

Query: 1614 TNPAPGMMFPGRPSQ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1781
            T P  GMMFPGRP Q                                             
Sbjct: 534  TGPG-GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPF 592

Query: 1782 XXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGG-GRG--EEPQYQ--GS- 1943
                   Q+ +R  KRD RGS ND  + +  G +QG+  ++GG GRG  +E QYQ  GS 
Sbjct: 593  PNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSK 652

Query: 1944 ----EKLVSRNITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
                ++  SRN  NDESESEDEAPRRSRHGEG KK R  +GD
Sbjct: 653  ANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKKRRDSEGD 694


>gb|EMJ15374.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica]
          Length = 695

 Score =  725 bits (1872), Expect = 0.0
 Identities = 387/693 (55%), Positives = 433/693 (62%), Gaps = 34/693 (4%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLD-TAPNIPT----ASNPVVQPDHXXXXXXXXXXXXVDQGNR--- 233
            MED++G ++FDFEG LD TA   PT     SN ++Q D               Q N    
Sbjct: 1    MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNHPNP 60

Query: 234  -----RSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 398
                 RS+RQTVCRHWLRSLCMKG++CGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT
Sbjct: 61   NRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 120

Query: 399  NEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHR 578
            NEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQ L SYNY  SN+F   R
Sbjct: 121  NEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSNKFYQQR 180

Query: 579  NTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNM---------PQPAGQDQVQNLPSN 731
            N     Q DK QS QG N+  QG V KP+  +S+N+          Q  G  Q QNLP+ 
Sbjct: 181  NAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQVGHTQTQNLPNG 240

Query: 732  PSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVI 911
             +NQ  R A PLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS E+VI
Sbjct: 241  LANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAENVI 299

Query: 912  LIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRH 1091
            LIFSVNRTRHFQGCAKM S+IG +  GGNWK+AHG+AHYGRNFSV WLKLCELSF+KTRH
Sbjct: 300  LIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLKLCELSFHKTRH 359

Query: 1092 LRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVN 1271
            LRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +            GVN
Sbjct: 360  LRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAESKREEEKAKGVN 419

Query: 1272 IDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQ-XXXXXXXMMWXXXXXXX 1448
             +NG +NPDIVPF+DN               +    P V  +        +MW       
Sbjct: 420  PENGGENPDIVPFEDN--EEEEEEESDDEEESFGPVPGVGNEGRGRGRGGIMWPPHMPLA 477

Query: 1449 XXXXXXXXXXXXXXXXXXVDGFTYGPRPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPAP 1628
                               D   YGP PDGF +P+PFGVGPR F PYGPRFSGDFT P P
Sbjct: 478  RGGRPMPGMQGFPPGMMGADAMPYGPAPDGFGMPNPFGVGPRGFNPYGPRFSGDFTGPTP 537

Query: 1629 GMMFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQS 1808
            GMMF GRP Q                                                Q+
Sbjct: 538  GMMFRGRPQQPGFPPGGYGMMMGPGRAPFMGGMGVGGANPGRPGRPTGMSPMFPPPSSQN 597

Query: 1809 GNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQGSEKL-------VS 1958
             NR  KRD RG  ND  E +  G  QGK   +  + GG  +E +YQ + K          
Sbjct: 598  TNRMQKRDPRGPSNDRNERYSAGSGQGKGQEIPGLAGGPDDEARYQQASKAYREDQYGAG 657

Query: 1959 RNITNDESESEDEAPRRSRHGEGKKH-RSMDGD 2054
             N  ND+SESEDEAPRRSRHGEGKK  R  +GD
Sbjct: 658  NNSRNDDSESEDEAPRRSRHGEGKKKGRGSEGD 690


>gb|ESW19498.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  723 bits (1867), Expect = 0.0
 Identities = 392/690 (56%), Positives = 434/690 (62%), Gaps = 36/690 (5%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTA-SNPVVQPDHXXXXXXXXXXXX------------V 218
            MED+EG LSFDFEG LDTAP+   A S P+VQ D                         V
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAV 60

Query: 219  DQGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 398
            +   RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDK+RMPVCRFFRLYGECREQDCVYKHT
Sbjct: 61   NVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHT 120

Query: 399  NEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHR 578
            NEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQ L SYNY +SN+F   R
Sbjct: 121  NEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQR 180

Query: 579  NTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSN----------MPQPAGQDQVQNLPS 728
             ++++ Q +KSQ  QG N+TNQG   KP  ++S N            Q   Q+Q+QN+ +
Sbjct: 181  GSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQIQNVAN 240

Query: 729  NPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHV 908
               NQ  R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVE+V
Sbjct: 241  GQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENV 300

Query: 909  ILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTR 1088
            ILIFSVNRTRHFQGCAKMTS+IG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF+KTR
Sbjct: 301  ILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTR 360

Query: 1089 HLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGV 1268
            HLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPD ELMA  V            GV
Sbjct: 361  HLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKGV 420

Query: 1269 NIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXX 1448
            N DNG +NPDIVPF+DN                    P  A Q       MMW       
Sbjct: 421  NPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGP--AGQGRGRGRGMMWPPHMPLP 478

Query: 1449 XXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPA 1625
                               DG +YGP  PDGF +PD F VGPR F PYGPRFSGDF  P 
Sbjct: 479  RGARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFGGPP 537

Query: 1626 PGMMFPGRPSQ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1796
              MMF GRPSQ                                                 
Sbjct: 538  AAMMFRGRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVAGANPPRGGRPVNMPPMFPPPPP 597

Query: 1797 XLQSGNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQGSEKL----- 1952
              Q+ NR  KRDQR +  D  + +G G EQGK   +L   G   ++ QYQ   K      
Sbjct: 598  LPQNTNRLAKRDQRTT--DRNDRYGSGSEQGKSQDMLSQSGAPDDDMQYQQGYKANQDDH 655

Query: 1953 -VSRNITNDESESEDEAPRRSRHGEGKKHR 2039
                N  ND+SESEDEAPRRSRHGEGKK R
Sbjct: 656  PAVNNFRNDDSESEDEAPRRSRHGEGKKKR 685


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  723 bits (1867), Expect = 0.0
 Identities = 392/696 (56%), Positives = 434/696 (62%), Gaps = 40/696 (5%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTA---SNPVVQPD------------HXXXXXXXXXXX 212
            MED+EG LSFDFEG LD AP+   A   S P+VQ D            H           
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 213  XVDQGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 392
              +   RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDK+RMPVCRFFRLYGECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 393  HTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSH 572
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQ L SYNY +SN+F  
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180

Query: 573  HRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMP---------QPAGQDQVQNLP 725
             R  +++ Q +K Q  QG N+TNQG   KP  ++S N           Q   Q Q+QN+ 
Sbjct: 181  QRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 726  SNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEH 905
            +   NQ  R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVE+
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 906  VILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKT 1085
            VIL+FSVNRTRHFQGCAKMTS+IG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF+KT
Sbjct: 301  VILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1086 RHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXG 1265
            RHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  V            G
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1266 VNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXX 1445
            VN DNG +NPDIVPF+DN               + S     A Q       MMW      
Sbjct: 421  VNPDNGGENPDIVPFEDN--EEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPHMPL 478

Query: 1446 XXXXXXXXXXXXXXXXXXXVDGFTYGP----RPDGFPLPDPFGVGPRPFVPYGPRFSGDF 1613
                                DG +YGP     PDGF +PD FGVGPR F PYGPRFSGDF
Sbjct: 479  GRGARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDF 537

Query: 1614 TNPAPGMMFPGRPSQ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1784
              P   MMF GRPSQ                                             
Sbjct: 538  GGPPAAMMFRGRPSQPGMFPSGGFGMMMNPGRGPFMGGMGVGGANPPRGGRPVNMPPMFP 597

Query: 1785 XXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQGSEK-- 1949
                  Q+ NR  KRDQR +  D  + FG G EQGK   +L   GG  ++ QYQ   K  
Sbjct: 598  PPPPLPQNANRAAKRDQRTA--DRNDRFGSGSEQGKSQDMLSQSGGPDDDAQYQQGYKGN 655

Query: 1950 ----LVSRNITNDESESEDEAPRRSRHGEGKKHRSM 2045
                    N  ND+SESEDEAPRRSRHGEGKK   +
Sbjct: 656  QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKKHKL 691


>ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Vitis vinifera]
          Length = 673

 Score =  723 bits (1866), Expect = 0.0
 Identities = 386/676 (57%), Positives = 429/676 (63%), Gaps = 22/676 (3%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPDHXXXXXXXXXXXXVDQG-----NRRSF 242
            MED EG LSFDFEG LD AP       P++Q D              +        RRSF
Sbjct: 1    MEDAEGVLSFDFEGGLDAAPGTAATVAPLIQSDATAAAAAPSSVVSAEPTPGGAPGRRSF 60

Query: 243  RQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECN 422
            RQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECN
Sbjct: 61   RQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKECN 120

Query: 423  MYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNTNHSHQP 602
            MYKLGFCPNG DCRYRHAK PGPPP ++EV QKIQQL+S+NYG+SNRF  +RN  ++ Q 
Sbjct: 121  MYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNP-YNQQT 179

Query: 603  DKSQSLQGANATNQGAVPKPTASDSSNMPQP--------AGQDQVQNLPSNPSNQTGRPA 758
            +KSQ LQG+NA N G V K + +++ N+ Q           Q  +QNLP+   NQ  + A
Sbjct: 180  EKSQILQGSNAVNLGTVAKSSTTEAINVQQQQVQPPQQQVSQTPMQNLPNGLPNQANKTA 239

Query: 759  TPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILIFSVNRTR 938
            +PLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE+VILIFSVNRTR
Sbjct: 240  SPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFSVNRTR 299

Query: 939  HFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLRNPFNENL 1118
            HFQGCAKMTSKIG   GGGNWK+AHGTAHYGRNFSV WLKLCELSF+KTRHLRNP+NENL
Sbjct: 300  HFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNENL 359

Query: 1119 PVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNIDNGADNPD 1298
            PVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +            GVN DNG +NPD
Sbjct: 360  PVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAKGVNPDNGGENPD 419

Query: 1299 IVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXXXXXXXXX 1478
            IVPF+DN               +  QA   A Q       +MW                 
Sbjct: 420  IVPFEDN--EEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLARGARPIPSMR 477

Query: 1479 XXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPAPGMMFPGR-- 1649
                     DGF+Y    PDGF +PD FGVGPR F PYGPRFSGDFT PA GMMFPGR  
Sbjct: 478  GFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDFTGPASGMMFPGRGQ 537

Query: 1650 PSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQSGNRGPKR 1829
            P                                                   S N   KR
Sbjct: 538  PGAVFPASGYGMMMGPGRAPFMGGMGVPAAAPTRAGRPVGMPPMFPPPPPPNSQNNRTKR 597

Query: 1830 DQRGSGNDWGETFGLGPEQGKLLDVGGGRGEEPQYQGSEKLV------SRNITNDESESE 1991
            DQR   ND  + +  G +QG+  D+ G   E    QG +           +  NDESESE
Sbjct: 598  DQRTPVNDRNDRYSGGSDQGRGQDMAGPDDETQYLQGLKSQQDDQFGGGNSFRNDESESE 657

Query: 1992 DEAPRRSRHGEGKKHR 2039
            DEAPRRSRHGEGKK R
Sbjct: 658  DEAPRRSRHGEGKKKR 673


>gb|EOX96971.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  722 bits (1863), Expect = 0.0
 Identities = 391/701 (55%), Positives = 436/701 (62%), Gaps = 42/701 (5%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPDHXXXXXXXXXXXXVDQG---------- 227
            M+D+EGGLSFDFEG LD  P  PTAS PVV  D                G          
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 228  --------NRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDC 383
                     RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 384  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNR 563
            VYKHTNEDIKECNMYKLGFCPNG DCRYRHAK PGPPPPV+EVLQKIQQL+SYNY   N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177

Query: 564  FSHHRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNM---------PQPAGQDQVQ 716
            F   RN+  + Q +KSQ  QG N  NQGA  KP+ ++S+NM          Q   Q Q+Q
Sbjct: 178  FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQTQIQ 237

Query: 717  NLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 896
            N+P+  SNQ  + A PLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS
Sbjct: 238  NVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 297

Query: 897  VEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSF 1076
             E+VILIFSVNRTRHFQGCAKMTSKIG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF
Sbjct: 298  AENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 357

Query: 1077 NKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXX 1256
            +KTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  V          
Sbjct: 358  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKREEEK 417

Query: 1257 XXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXX 1436
              GVN DNG +NPDIVPF+DN                  ++ + A Q       +MW   
Sbjct: 418  AKGVNSDNGGENPDIVPFEDNEEEEEEESEEE------DESFSAAAQGRGRGRGVMWPPH 471

Query: 1437 XXXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDF 1613
                                   DGF+YGP  PDGF +PD FG  PRPF PYGPRFSGDF
Sbjct: 472  MPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGDF 530

Query: 1614 TNPAPGMMFPGRPSQ--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1787
            T PA GMMFPGRP Q                                             
Sbjct: 531  TGPASGMMFPGRPPQPGAMFPAGGLGMMMGPGRAPFMGGMGPTGANPVRGGRPVSMPPMF 590

Query: 1788 XXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQ------- 1937
                  S     +  +R       + +G G EQG+   +   GG   +E QYQ       
Sbjct: 591  PPPPAPSSQNSGRAVKRDQRTPTNDRYGAGSEQGRGQEMAGPGGRLDDETQYQQEGQKAH 650

Query: 1938 -GSEKLVSRNITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
               +     +  NDESESEDEAPRRSR+GEG KK RS++GD
Sbjct: 651  HEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGD 691


>ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cicer arietinum]
          Length = 677

 Score =  719 bits (1855), Expect = 0.0
 Identities = 388/680 (57%), Positives = 435/680 (63%), Gaps = 26/680 (3%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAP------NIPTA-SNPVVQPDHXXXXXXXXXXXXVDQGN-- 230
            MED+EG LSFDFEG LD AP      ++P   S P+V PD                GN  
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAPVSGNIP 60

Query: 231  -RRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNED 407
             RRSFRQTVCRHWLRSLCMKG++CGFLHQYDK+RMPVCRFFRLYGECREQDCVYKHTNED
Sbjct: 61   GRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTNED 120

Query: 408  IKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNTN 587
            IKECNMYKLGFCPNGPDCRYRHAK PGPPPP++EVLQKIQ L SYN+  S++F   R ++
Sbjct: 121  IKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRGSS 180

Query: 588  HSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQP---------AGQDQVQNLPSNPSN 740
            ++ Q +KSQ  QG N+ NQG   KP A++S N+ Q            Q Q QNL +   N
Sbjct: 181  YTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQTQNLANGQPN 240

Query: 741  QTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILIF 920
            Q  R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVE+VILIF
Sbjct: 241  QANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVENVILIF 300

Query: 921  SVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLRN 1100
            SVNRTRHFQGCAKMTS+IG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF+KTRHLRN
Sbjct: 301  SVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRN 360

Query: 1101 PFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNIDN 1280
            P+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +            GVN DN
Sbjct: 361  PYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKGVNPDN 420

Query: 1281 GADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXXX 1460
              +NPDIVPF+DN               +  QA     Q       MMW           
Sbjct: 421  AGENPDIVPFEDN--EEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPLGRGAR 478

Query: 1461 XXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPAPGMM 1637
                           DG +YGP  PDGF +PD FG+GPR F PYGPRFSGDF  P   MM
Sbjct: 479  PMPGMQGFNPVMMG-DGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFAGPPAAMM 537

Query: 1638 FPGRPSQ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQS 1808
            F GRPSQ                                                   Q+
Sbjct: 538  FRGRPSQPGMFPGGGFGMMMNPGRGPFMGGMGVPGPNPPRGGRPLNMPPMFPPPPPPPQN 597

Query: 1809 GNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQGSEKLVSRNITNDE 1979
             NR  KRDQR   ND  + +  G EQGK   +L   GG  +E QYQ S    + N  N++
Sbjct: 598  VNRIAKRDQR--TNDRNDRYSSGQEQGKSQDMLSQSGGPDDEMQYQQS-GAPANNFRNED 654

Query: 1980 SESEDEAPRRSRHGEGKKHR 2039
            SESEDEAPRRSRHGEGKK +
Sbjct: 655  SESEDEAPRRSRHGEGKKRK 674


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  717 bits (1852), Expect = 0.0
 Identities = 391/703 (55%), Positives = 430/703 (61%), Gaps = 44/703 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTA-PNIPTASNPVVQPDHXXXXXXXXXXXXV------------ 218
            M+DT+GGLSFDFEG LD++ P  PTAS P +  D+            V            
Sbjct: 1    MDDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAA 60

Query: 219  -----DQGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDC 383
                 +Q  RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDC
Sbjct: 61   AAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 120

Query: 384  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNR 563
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQQL SYNYG+SN+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSNK 180

Query: 564  FSHHRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQP---------------- 695
            F   R        DKSQ  QG N   QG   KP  ++S+N+ QP                
Sbjct: 181  FFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQQQ 240

Query: 696  AGQDQVQNLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAK 875
            A Q   QNLP+   NQ  R A PLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAK
Sbjct: 241  ATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAK 300

Query: 876  LNEAFDSVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWL 1055
            LNEAFDS E+VILIFSVNRTRHFQGCAKMTSKIG + GGGNWK+AHGTAHYGRNFSV WL
Sbjct: 301  LNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWL 360

Query: 1056 KLCELSFNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXX 1235
            KLCELSF+KTRHLRNP+NENLPVKISRDCQELEPSVG QLA LLY EPDSELMA  +   
Sbjct: 361  KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLAAE 420

Query: 1236 XXXXXXXXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXX 1415
                     GVN +NG DNPDIVPF+DN               +  QA     Q      
Sbjct: 421  AKREEEKAKGVNPENGGDNPDIVPFEDN--EEEEEEESEEEEESFGQALGAPGQGRGRGR 478

Query: 1416 XMMWXXXXXXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYG 1592
             ++W                          D F+YGP  PDGF +PD FGV PR F PY 
Sbjct: 479  GIIW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYA 537

Query: 1593 PRFSGDFTNPAPGMMFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1772
            PRFSGDFT  A GMMFPGRP Q                                      
Sbjct: 538  PRFSGDFTGAASGMMFPGRPPQPGGVFPNGGFGMMMGPGRAPFMGGMGPNSTNPLRGNWP 597

Query: 1773 XXXXXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGGGRGEEPQYQGSEKL 1952
                       S  R  KRDQR + ND    +  G +QG+  +  G   +E +YQ     
Sbjct: 598  GGMPFPPLPTPSPQRPVKRDQRMTAND---RYSTGSDQGR--NTAGEPDDEARYQQEGLK 652

Query: 1953 VS--------RNITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
             S         +  NDESESEDEAPRRSRHGEG KK R  +GD
Sbjct: 653  ASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEGD 695


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  714 bits (1844), Expect = 0.0
 Identities = 390/683 (57%), Positives = 431/683 (63%), Gaps = 31/683 (4%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTA--SNPVVQPDHXXXXXXXXXXXX----------VD 221
            MED+EG LSFDFEG LD AP+   A  S P++  D                       V 
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVG 60

Query: 222  QGN---RRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 392
             GN   RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDK+RMPVCRFFRLYGECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 393  HTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSH 572
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQ L SYNY +SN+F  
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 180

Query: 573  HRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMP---------QPAGQDQVQNLP 725
             R  +++ Q +K    QG N+TNQG    P  ++  N           Q   Q Q+QN+ 
Sbjct: 181  QRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 726  SNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEH 905
            +   NQ  R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDSVE+
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 906  VILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKT 1085
            VILIFSVNRTRHFQGCAKMTSKIG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF+KT
Sbjct: 301  VILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1086 RHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXG 1265
            RHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  V            G
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1266 VNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXX 1445
            VN DNG +NPDIVPF+DN                    P  A Q       MMW      
Sbjct: 421  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGP--AGQGRGRGRGMMWPPHMPL 478

Query: 1446 XXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNP 1622
                                DG +YGP  PDGF +PD FGVGPR F PYGPRFSGDF  P
Sbjct: 479  GRGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGGP 537

Query: 1623 APGMMFPGRPSQ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1793
               MMF GRPSQ                                                
Sbjct: 538  PAAMMFRGRPSQPGMFPGGGFGMMLNPGRGPFMGGIGVGGANPPRGGRPVNMPPMFPPPP 597

Query: 1794 XXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGK---LLDVGGGRGEEPQYQGSEKLVSRN 1964
               Q+ NR  KRDQR +  D  + FG G EQGK   +L   GG  ++PQYQ   K  +++
Sbjct: 598  PLPQNANRAAKRDQRTA--DRNDRFGSGSEQGKSQDMLSQSGGPDDDPQYQQGYK-GNQD 654

Query: 1965 ITNDESESEDEAPRRSRHGEGKK 2033
               D+SESEDEAPRRSRHGEGKK
Sbjct: 655  DHPDDSESEDEAPRRSRHGEGKK 677


>ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Fragaria vesca subsp. vesca]
          Length = 689

 Score =  704 bits (1818), Expect = 0.0
 Identities = 385/691 (55%), Positives = 436/691 (63%), Gaps = 32/691 (4%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAP-NIPT----ASNPVVQPDHXXXXXXXXXXXX------VDQ 224
            MED +G L+FDFEG LD+A  + PT    AS+  +Q D                   V+ 
Sbjct: 1    MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAAPAPQPDPNVNP 60

Query: 225  GNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNE 404
              R+SFRQTVCRHWLRSLCMKG++CGFLHQYDKSRMPVCRFFR+YGECREQDCVYKHTNE
Sbjct: 61   SGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTNE 120

Query: 405  DIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNT 584
            DIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQ L SYNY  SN+FS  RN 
Sbjct: 121  DIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKFSQPRNG 180

Query: 585  NHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQP---------AGQDQVQNLPSNPS 737
                Q D+SQ  Q  N+ NQ  V +P+A++S+N+ QP           Q Q Q++P+  +
Sbjct: 181  GFPQQHDRSQPAQVTNSFNQ-VVVRPSAAESANVQQPQQFQQTQQPVAQTQAQSVPNGLA 239

Query: 738  NQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILI 917
            +Q  R A PLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS E+VILI
Sbjct: 240  SQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAENVILI 299

Query: 918  FSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLR 1097
            FSVNRTRHFQGCAKM S+IG +  GGNWK+AHGTAHYGRNFSV WLKLCELSF+KTRHLR
Sbjct: 300  FSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKTRHLR 359

Query: 1098 NPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNID 1277
            NP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +            GVN +
Sbjct: 360  NPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAKGVNPE 419

Query: 1278 NGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXX 1457
            NG +NPDIVPF+DN                  Q P  A++       +MW          
Sbjct: 420  NGGENPDIVPFEDNEEEEEEESDDEEDY----QVPGGAIE-NRGRGRVMWPPHMPLGGRG 474

Query: 1458 XXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGV-GPRPFVPYGPRFSGDFTNPAPG 1631
                            D   YGP  PDGF +P+PFG+ GPR F PYGPRFSGDF  P PG
Sbjct: 475  GRPMPGMQGFPGMMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSGDFGGPNPG 534

Query: 1632 MMFPGRPSQ---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL 1802
            MMF GRP Q                                                   
Sbjct: 535  MMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGVGGNNPARGGRPGGMPPMFPPHPPS 594

Query: 1803 QSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGGGRGEEPQYQGSEKL------VSRN 1964
            Q+ NR  KRD RGSGND  E +  G   GK +   GG  +E  YQ S K          N
Sbjct: 595  QNNNRLQKRDPRGSGNDRNERYSAGSGHGKEMQ-AGGPDDENHYQHSSKSYQEDYGAGNN 653

Query: 1965 ITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
              ND+SESEDEAPRRSRHGEG KK R  +GD
Sbjct: 654  GRNDDSESEDEAPRRSRHGEGKKKRRDSEGD 684


>gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  689 bits (1779), Expect = 0.0
 Identities = 380/701 (54%), Positives = 426/701 (60%), Gaps = 47/701 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTA-----PNIPTASNPVVQPDHXXXXXXXXXXXX--------- 215
            MED+EG LSFDFEG LDT      PN   AS  ++ PD                      
Sbjct: 1    MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSADPT 60

Query: 216  -------VDQGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECRE 374
                    + G  RSFRQTVCRHWLRSLCMKG++CGFLHQYDKSRMPVCRFFRLYGECRE
Sbjct: 61   SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120

Query: 375  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGA 554
            QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPP V+EVLQKIQ L+SYNY  
Sbjct: 121  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNYH- 179

Query: 555  SNRFSHHRNTNHSHQPDKSQSLQ-GANATNQGAVPKPTASDSSNMPQP----------AG 701
            SN+F   RN     Q  +   L  G NA +QG V KP+  +S+N+ QP           G
Sbjct: 180  SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239

Query: 702  QDQVQNLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 881
            Q+Q+QN+ +   NQ  R   PLP GI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 240  QNQIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 299

Query: 882  EAFDSVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKL 1061
            EAFD  E+VILIFSVNRTRHFQGCAKM S+IG +  GGNWK+AHGTAHYGRNFSV WLKL
Sbjct: 300  EAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKWLKL 359

Query: 1062 CELSFNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXX 1241
            CELSF+KTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +     
Sbjct: 360  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESK 419

Query: 1242 XXXXXXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXM 1421
                   GV+ DNG +NPDIVPF+DN               + SQ    A Q       +
Sbjct: 420  REEEKAKGVDPDNGGENPDIVPFEDN--EEDEEEESEDEEESFSQVLG-ANQGRGRGRGV 476

Query: 1422 MWXXXXXXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPR 1598
            MW                          DG  YGP  PDGFP+PD F VGPR F PYGPR
Sbjct: 477  MWPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPYGPR 536

Query: 1599 FSGDFTNPAPGMMFPGRPSQ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1766
            F GDF  P  GMMF GRP+Q                                        
Sbjct: 537  FPGDFMGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPCMGGMGVQGTSPARPMRPGA 596

Query: 1767 XXXXXXXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGGGRG---EEPQYQ 1937
                        Q+ NR P+RDQRG  ND  E +G G +Q +  ++ G  G   ++  YQ
Sbjct: 597  MPPMFQQPPPPSQNMNRPPRRDQRGLANDRNERYGAGSDQVRGQEMSGPAGGPEDDAHYQ 656

Query: 1938 GSEKL-------VSRNITNDESESEDEAPRRSRHGEGKKHR 2039
               K           +  NDESESEDEAPRRSRHG+GKK R
Sbjct: 657  LGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGKKKR 697


>ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551536|gb|ESR62165.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 672

 Score =  689 bits (1777), Expect = 0.0
 Identities = 383/702 (54%), Positives = 423/702 (60%), Gaps = 43/702 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPD-------------HXXXXXXXXXXXXV 218
            MED+EGGLSFDFEG LD  P +PTASNP +Q D             H             
Sbjct: 1    MEDSEGGLSFDFEGGLDAGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 219  D-----QGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDC 383
                     RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 384  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNR 563
            VYKHTNEDIKECNMYKLGFCPNGPDCRYRH K PGPPP V+EVLQKIQQ++SYN+G  N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 564  FSHHRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNM--------PQPAGQD--QV 713
                R    SHQ DKSQ  QG NA NQGA  K + ++S+N+        PQ  G    Q+
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTTQM 239

Query: 714  QNLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 893
            QNLP+   NQT R ATPLPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD
Sbjct: 240  QNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 299

Query: 894  SVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELS 1073
            S E+VILIFSVNRTRHFQGCAKMTSKIG + GGGNWK+AHGTAHYGRNFSV WLKLCELS
Sbjct: 300  SAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELS 359

Query: 1074 FNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXX 1253
            F+KTRHLRNP+NENLPVK                             A  V         
Sbjct: 360  FHKTRHLRNPYNENLPVK-----------------------------AISVAAEAKREEE 390

Query: 1254 XXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXX 1433
               GVN DNG DNPDIVPF+DN                  ++   A Q       MMW  
Sbjct: 391  KAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEE------EESLGTASQGRGRGRGMMWPG 444

Query: 1434 XXXXXXXXXXXXXXXXXXXXXXXVDGFTYGPRPDGFPLPDPFGVGPRPFVPYGPRFSGDF 1613
                                    DGF+YG  PDGFP+PD FGV PRPF PYGPRFSGDF
Sbjct: 445  PMPLARGARPVPGMRGFPPMMIGADGFSYGVTPDGFPMPDLFGVAPRPFAPYGPRFSGDF 504

Query: 1614 TNPAPGMMFPGRPSQ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1781
            T P  GMMFPGRP Q                                             
Sbjct: 505  TGPG-GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGGRPVGVPPPF 563

Query: 1782 XXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGG-GRG--EEPQYQ--GS- 1943
                   Q+ +R  KRD RGS ND  + +  G +QG+  ++GG GRG  +E QYQ  GS 
Sbjct: 564  PNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQYQQEGSK 623

Query: 1944 ----EKLVSRNITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
                ++  SRN  NDESESEDEAPRRSRHGEG KK R  +GD
Sbjct: 624  ANQEDQYGSRNFRNDESESEDEAPRRSRHGEGKKKRRDSEGD 665


>ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 677

 Score =  687 bits (1772), Expect = 0.0
 Identities = 387/696 (55%), Positives = 426/696 (61%), Gaps = 37/696 (5%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQ--------PDHXXXXXXXXXXXXVDQG-- 227
            M+D EGGL+FDFEG LDT P  PTAS PV+Q        P              V QG  
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVLQSAGHITTGPAPNASVALVPPGGGVGQGGD 60

Query: 228  -----NRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 392
                 NRRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDCVYK
Sbjct: 61   GSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120

Query: 393  HTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSH 572
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV EVLQ+IQ LTSY Y  SNRF  
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTSYGY--SNRFFQ 178

Query: 573  HRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQPAGQDQVQN---------LP 725
            +RNTN+S Q DKSQ  Q  N  NQ AV    A      P    Q QVQ            
Sbjct: 179  NRNTNYSTQADKSQIPQVPNVMNQ-AVKSTAAEPPIGQPHQPHQQQVQQPQHQGAPTQTQ 237

Query: 726  SNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEH 905
            + PS+Q  + A PLPQG +RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE+
Sbjct: 238  TLPSSQQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEN 297

Query: 906  VILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKT 1085
            VIL+FS+NRTRHFQG AKMTS+IG  A GGNWKH HGTAHYGRNFS+ WLKLCELSF KT
Sbjct: 298  VILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWLKLCELSFQKT 357

Query: 1086 RHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXG 1265
            RHLRNP+NENLPVKISRDCQELE SVGEQLASLLY+EPDSELMA  +            G
Sbjct: 358  RHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAESKREEERAKG 417

Query: 1266 VNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXX 1445
            VN DNG +NPDIVPF+DN                  QA   A         ++W      
Sbjct: 418  VNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGRGIVWPPLVPF 477

Query: 1446 XXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDF--- 1613
                                DGF+YG   PDGFP+PDP+G+G RPF P+GPRF GD    
Sbjct: 478  GRGARPFPGMRGFPPGMMS-DGFSYGSMTPDGFPMPDPYGMGGRPFGPFGPRFPGDMMFH 536

Query: 1614 -TNPAPG----MMFPGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1778
               PA G    MM PGRP                                          
Sbjct: 537  SRPPAAGGFGMMMGPGRP------------------PFMGGMGPGAPGPPRGGRPMGIHP 578

Query: 1779 XXXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVG---GGRGEEPQYQGSEK 1949
                     S N   K+DQR   N+  + F  GP+QG+  ++    GG  E   Y  +E 
Sbjct: 579  SFIPPTPPPSQNPRVKKDQRAPFNERNDRFSSGPDQGRGQEIAGSVGGPAEGVHYPQTE- 637

Query: 1950 LVSRNITNDESESEDEAPRRSRHGEGKKHR-SMDGD 2054
                +  NDESESEDEAPRRSRHG+GKK + SMDGD
Sbjct: 638  ---NSFRNDESESEDEAPRRSRHGDGKKKKNSMDGD 670


>ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis sativus]
          Length = 707

 Score =  685 bits (1768), Expect = 0.0
 Identities = 379/704 (53%), Positives = 428/704 (60%), Gaps = 45/704 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIP--TASNPVVQPDHXXXXXXXXXXXXV----------- 218
            MED+EG LSFDFEG LD  P  P  T+S P++  D             +           
Sbjct: 1    MEDSEGVLSFDFEGGLDAGPTNPAATSSLPIINSDSSAPPAASAVSNPLSGALGPAVSAE 60

Query: 219  -------DQGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQ 377
                   + GNRRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMP+CRFFRLYGECREQ
Sbjct: 61   PTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQ 120

Query: 378  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGAS 557
            DCVYKHTNEDIKECNMYK GFCPNGPDCRYRHAK PGPPPP++E+LQKIQ L SYNYG S
Sbjct: 121  DCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYGPS 180

Query: 558  NRFSHHRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQPAGQDQ--------V 713
            N+F   R    S Q +KSQ  Q      QG   KP+A++S N+ Q  GQ          V
Sbjct: 181  NKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAESVNVQQQQGQQSAPQASQTPV 240

Query: 714  QNLPSNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 893
            Q+L +   NQ  R AT LPQGI+RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD
Sbjct: 241  QSLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFD 300

Query: 894  SVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELS 1073
            S ++VILIFSVNRTRHFQGCAKM S+IG +  GGNWK+AHGT HYG+NFS+ WLKLCELS
Sbjct: 301  SADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKWLKLCELS 360

Query: 1074 FNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXX 1253
            F KTRHLRNP+NENLPVKISRDCQELEPSVGEQLASLLYLEPD ELMA  V         
Sbjct: 361  FQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAAESKREEE 420

Query: 1254 XXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXX 1433
               GVN D G++NPDIVPF+DN               +  Q+  +  Q       MMW  
Sbjct: 421  KAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEE-SFGQSAGLPPQGRGRGRGMMWPP 479

Query: 1434 XXXXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGP--RFS 1604
                                    DG +YGP  PDGFP+PD FG+ PR F PYGP  RFS
Sbjct: 480  HMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMTPRGFGPYGPTPRFS 539

Query: 1605 GDFTNPAPGMMF---PGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1775
            GDF  P   MMF   P +P+                                        
Sbjct: 540  GDFMGPPTAMMFRGRPSQPAAMFPPSGFGMMMGQGRGPFMGGMGVAGANPARPGRPVGVS 599

Query: 1776 XXXXXXXX--LQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDV-GGGRGEEPQYQGSE 1946
                       Q+ NR  KRDQRG  ND    + +G +Q K +++   GR EE QY+   
Sbjct: 600  PLYPPPAVPSSQNMNRAIKRDQRGLTND---RYIVGMDQNKGVEIQSSGRDEEMQYKQGS 656

Query: 1947 KLVS-------RNITNDESESEDEAPRRSRHGEG-KKHRSMDGD 2054
            K  S           N+ESESEDEAPRRSRHGEG KK R  +GD
Sbjct: 657  KAYSDEQYGTGTTFRNEESESEDEAPRRSRHGEGKKKRRGSEGD 700


>ref|XP_004233145.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 671

 Score =  676 bits (1745), Expect = 0.0
 Identities = 377/682 (55%), Positives = 418/682 (61%), Gaps = 23/682 (3%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQ----PDHXXXXXXXXXXXXVDQ-----GN 230
            M+D EGGL+FDFEG LDT P  PTAS PV+Q    P+             +       GN
Sbjct: 1    MDDGEGGLNFDFEGGLDTGPTHPTASVPVIQAGPAPNASVAVVPPGGGVGLGGDGSFVGN 60

Query: 231  RRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDI 410
            RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDI
Sbjct: 61   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDI 120

Query: 411  KECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNTNH 590
            KECNM+KLGFCPNGPDCRYRHAK PGPPPPV EVLQKIQ LTS+ Y  SNRF  +RNTN+
Sbjct: 121  KECNMFKLGFCPNGPDCRYRHAKMPGPPPPVVEVLQKIQNLTSHGY--SNRFFQNRNTNY 178

Query: 591  SHQPDKSQSLQGANATNQGA--------VPKPTASDSSNMPQPAGQDQVQNLPSNPSNQT 746
            S Q DKSQ  Q  N  NQ          + +P       + QP  Q       + P  Q 
Sbjct: 179  STQADKSQIPQVPNVMNQAVKSTATEPPIGQPHQPHQQQVQQPQHQGPPTQTQTLPGTQQ 238

Query: 747  GRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILIFSV 926
             + A PLPQG +RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE+VILIFS+
Sbjct: 239  NQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENVILIFSI 298

Query: 927  NRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLRNPF 1106
            NRTRHFQG AKMTS+IG  A GGNWKH HGTAHYGRNFSV WLKLCELSF KTRHLRNP+
Sbjct: 299  NRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSVKWLKLCELSFQKTRHLRNPY 358

Query: 1107 NENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNIDNGA 1286
            NENLPVKISRDCQELE SVGEQLASLLY+EPDSELMA  +            GVN DNG 
Sbjct: 359  NENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAISLAAESKREEERAKGVNPDNGN 418

Query: 1287 DNPDIVPFDDN-XXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXXXX 1463
            +NPDIVPF+DN                   QA   A         ++W            
Sbjct: 419  ENPDIVPFEDNEEEEEEESEEEDEEDEGFGQALGPAALDRGRGRGIVWPPLVPFRGARPF 478

Query: 1464 XXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPAPGMMF 1640
                          DGF+YG   PDGFP+PDP+G+G RPF P+GPRF GD       MMF
Sbjct: 479  PGMRGFPPGIMS--DGFSYGSMTPDGFPMPDPYGMGGRPFGPFGPRFPGD-------MMF 529

Query: 1641 PGRPSQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQSGNRG 1820
              RP                                                   S N  
Sbjct: 530  HSRPPAAGGFGMMMGPARPPFMGGMGPGAPGPPRGGRPMGMHPSFTPPPPP---PSQNPR 586

Query: 1821 PKRDQRGSGNDWGETFGLGPEQGKLLDVGG---GRGEEPQYQGSEKLVSRNITNDESESE 1991
             K+DQR   N+  + F  GP+QG+  +  G   G  E   Y  +E     +  NDESESE
Sbjct: 587  VKKDQRAPFNERNDRFSSGPDQGRGQETAGSVVGPDEGVHYPQTE----NSFRNDESESE 642

Query: 1992 DEAPRRSRHGEGKKHR-SMDGD 2054
            DEAPRRSRHG+GKK + SMDGD
Sbjct: 643  DEAPRRSRHGDGKKKKNSMDGD 664


>ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa]
            gi|550349048|gb|EEE85138.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 669

 Score =  664 bits (1713), Expect = 0.0
 Identities = 371/697 (53%), Positives = 409/697 (58%), Gaps = 38/697 (5%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPDHXXXXXXXXXXXXVD------------ 221
            MED+EG LSFDFEG LD+ P  P AS P +  D+                          
Sbjct: 1    MEDSEGVLSFDFEGGLDSGPANPIASIPAIPSDNYGAATAAAPNTTNTTTNTTNNSNSGA 60

Query: 222  ---QGNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 392
               Q  RRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMPVCRFFRLYGECREQDCVYK
Sbjct: 61   ADIQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 120

Query: 393  HTNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSH 572
            HTNEDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+EV+QKIQQL SYN   SN+   
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSNKNFQ 180

Query: 573  HRNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQPAGQDQVQNLPSNPSNQTGR 752
             RN   S Q +KS +           + KP+ ++S+N+ Q   Q Q    P   + Q  +
Sbjct: 181  QRNAGFSQQIEKSPN----------TIIKPSGTESANVQQQQQQQQQTQTPHLTNGQHQQ 230

Query: 753  P---------ATPLPQGITR-----------YFIVKSCNRENLELSVQQGVWATQRSNEA 872
            P         ATPLPQGI+            YFIVKSCNRENLELSVQQGVWATQRSNE 
Sbjct: 231  PQQPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENLELSVQQGVWATQRSNEI 290

Query: 873  KLNEAFDSVEHVILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTW 1052
            KLNEA DS ++VILIFSVNRTRHFQGCAKM SKIG + GGGNWK+AHGTAHYGRNFSV W
Sbjct: 291  KLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNWKYAHGTAHYGRNFSVKW 350

Query: 1053 LKLCELSFNKTRHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXX 1232
            LKLCELSF+KTRHLRNPFNENLPVKISRDCQELEPS+GEQLASLLYLEPDSELMA  +  
Sbjct: 351  LKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSLAA 410

Query: 1233 XXXXXXXXXXGVNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXX 1412
                      GVN D+G +NPDIVPF+DN               +  Q    A Q     
Sbjct: 411  EAKREEEKEKGVNPDSGGENPDIVPFEDN--EEEEEEESEEEEESFGQPLGPAAQGRGRG 468

Query: 1413 XXMMWXXXXXXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPY 1589
              MMW                          DGF+YG   PD F +PD FGV  R F PY
Sbjct: 469  RGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSFGMPDLFGVASRGFPPY 528

Query: 1590 GPRFSGDFTNPAPGMMFPGRPSQ--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1763
            GPRFSGDFT  A GMMFPGRPSQ                                     
Sbjct: 529  GPRFSGDFTGAASGMMFPGRPSQPGAVFPAGGFGMMMGPGRPPFIGGMGPTPSNLLRGPR 588

Query: 1764 XXXXXXXXXXXXLQSGNRGPKRDQRGSGNDWGETFGLGPEQGKLLDVGGGRGEEPQYQGS 1943
                         Q+ +R  KRDQR + ND                    R +     G+
Sbjct: 589  PGGMFAPFPAPSSQNNSRSVKRDQRAAAND--------------------RNDRHNQFGA 628

Query: 1944 EKLVSRNITNDESESEDEAPRRSRHGEGKKHRSMDGD 2054
                  +I NDESESEDEAPRRSRHGEGKK R   GD
Sbjct: 629  ----VNSIRNDESESEDEAPRRSRHGEGKKKRRGSGD 661


>ref|XP_002893618.1| hypothetical protein ARALYDRAFT_890588 [Arabidopsis lyrata subsp.
            lyrata] gi|297339460|gb|EFH69877.1| hypothetical protein
            ARALYDRAFT_890588 [Arabidopsis lyrata subsp. lyrata]
          Length = 631

 Score =  652 bits (1682), Expect = 0.0
 Identities = 368/683 (53%), Positives = 408/683 (59%), Gaps = 29/683 (4%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQPDHXXXXXXXXXXXX-------VDQGNRR 236
            MED +G LSFDFEG LD+ P  P+AS PV  PD+                      G  R
Sbjct: 1    MEDADG-LSFDFEGGLDSGPAQPSASVPVAPPDNSSSAAVNVAPTYDHSSATVAGAGRGR 59

Query: 237  SFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNEDIKE 416
            SFRQTVCRHWLR LCMKGD+CGFLHQYDK+RMP+CRFFRLYGECREQDCVYKHTNEDIKE
Sbjct: 60   SFRQTVCRHWLRGLCMKGDACGFLHQYDKARMPICRFFRLYGECREQDCVYKHTNEDIKE 119

Query: 417  CNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHRNTNHSH 596
            CNMYKLGFCPNGPDCRYRHAK PGPPPPV+EVLQKIQQLTSYNYG  NRF   RN     
Sbjct: 120  CNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLTSYNYGP-NRFYQPRNVAPQL 178

Query: 597  QPDKSQSLQGANATNQGAVPKPTASDSSNMPQPA-GQDQV-QNLPSNPSNQTGRPATPLP 770
            Q DK Q         QG   +          QP   Q QV Q    NP++QT R + PLP
Sbjct: 179  Q-DKPQG----QVLTQGQPQEAGNLQQQQQQQPQQSQHQVSQTQIPNPADQTNRTSHPLP 233

Query: 771  QGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHVILIFSVNRTRHFQG 950
            QG+ RYF+VKSCNREN ELSVQQGVWATQRSNE+KLNEAFDSVE+VILIFSVNRTRHFQG
Sbjct: 234  QGVNRYFVVKSCNRENFELSVQQGVWATQRSNESKLNEAFDSVENVILIFSVNRTRHFQG 293

Query: 951  CAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTRHLRNPFNENLPVKI 1130
            CAKMTS+IG   GGGNWKH HGTA YGRNFSV WLKLCELSF+KTR+LRNP+NENLPVKI
Sbjct: 294  CAKMTSRIGSYIGGGNWKHEHGTAQYGRNFSVKWLKLCELSFHKTRNLRNPYNENLPVKI 353

Query: 1131 SRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGVNIDNGADNPDIVPF 1310
            SRDCQELEPSVGEQLASLLYLEPDS+LMA  +            GVN ++ A+NPDIVPF
Sbjct: 354  SRDCQELEPSVGEQLASLLYLEPDSDLMAISIAAEAKREEEKAKGVNPESRAENPDIVPF 413

Query: 1311 DDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXXXXXXXXXXXXXXXX 1490
            +DN               +++  P    Q       MMW                     
Sbjct: 414  EDNEEEEEEEDESEEEEESMAGGP----QGRGRGRGMMWPPQMPLGRGIRPMPGMGGFPL 469

Query: 1491 XXXXV-DGFTYGPRPDGF-PLPDPFGVGPRPFVPYGPRFSGDFTNPAPGMMFPGRPSQXX 1664
                  D F YG  P G+  +PDPFG+GPRPF PYGPRF GDF  P PGMMFPGRP Q  
Sbjct: 470  GVMGPGDAFPYG--PGGYNGMPDPFGMGPRPFGPYGPRFGGDFRGPVPGMMFPGRPPQ-- 525

Query: 1665 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLQSGNRGPKRDQRG- 1841
                                                         +  G RGP     G 
Sbjct: 526  -------------------------------------QFPHGGYGMMGGGRGPHMGGMGN 548

Query: 1842 --------------SGNDWGETFGLGPEQGKLLDVGGGRGEEPQYQGSEKL-VSRNITND 1976
                          S    G T    PE+     VG  +  +      E+  V  ++ N+
Sbjct: 549  APRGGRPMYYPPATSSARPGPTNRKTPERSDERGVGADQQNQDTSHDMEQFEVGNSLRNE 608

Query: 1977 ESES--EDEAPRRSRHGEGKKHR 2039
            ESES  EDEAPRRSRHGEGKK R
Sbjct: 609  ESESEDEDEAPRRSRHGEGKKRR 631


>ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 689

 Score =  650 bits (1678), Expect = 0.0
 Identities = 337/549 (61%), Positives = 365/549 (66%), Gaps = 24/549 (4%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQP-DHXXXXXXXXXXXXVDQ---------- 224
            M++ EGGL+FDFEG LDT P  PTAS PV+Q  DH                         
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAASSANINPPTVPAVGGQGDVG 60

Query: 225  --GNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHT 398
              GNRRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMP+CRFFRLYGECREQDCVYKHT
Sbjct: 61   FVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKHT 120

Query: 399  NEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHHR 578
             EDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+E+LQKIQ L S NYG SNRF+ +R
Sbjct: 121  IEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSNRFNQNR 180

Query: 579  NTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQP----------AGQDQVQNLPS 728
            N N+S Q DKSQ+ Q  N T+       T +      QP           G  Q Q  P+
Sbjct: 181  NANYSTQTDKSQASQAQNGTSLAVKSTATETPIIQQHQPHQQVQPPQLQGGPTQAQIHPN 240

Query: 729  NPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEHV 908
               NQ  R A  LPQG +RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE+V
Sbjct: 241  GQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVENV 300

Query: 909  ILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKTR 1088
            ILIFSVNRTRHFQGC KMTS+IG  A GGNWKH HGTAHYGRNFS+ WLKLCELSF KT 
Sbjct: 301  ILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKLCELSFQKTH 360

Query: 1089 HLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXGV 1268
            HLRNP+NENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMA  +            GV
Sbjct: 361  HLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRLEEKAKGV 420

Query: 1269 NIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNISQAPAVAMQXXXXXXXMMWXXXXXXX 1448
            N DNG DNPDIVPF+DN               N  Q    A         + W       
Sbjct: 421  NPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGIAWPPIMPFG 480

Query: 1449 XXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDFTNPA 1625
                               DGF+YG   P+GFP+ D FG+GPRPF PYGPRFS D     
Sbjct: 481  HGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPMTDHFGMGPRPFPPYGPRFSSD----- 534

Query: 1626 PGMMFPGRP 1652
              +MF GRP
Sbjct: 535  --LMFHGRP 541


>ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 692

 Score =  647 bits (1670), Expect = 0.0
 Identities = 342/559 (61%), Positives = 369/559 (66%), Gaps = 34/559 (6%)
 Frame = +3

Query: 78   MEDTEGGLSFDFEGNLDTAPNIPTASNPVVQP-DHXXXXXXXXXXXXVDQ---------- 224
            M++ EGGL+FDFEG LDT P  PTAS PV+Q  DH                         
Sbjct: 1    MDEGEGGLNFDFEGGLDTGPTHPTASVPVIQSFDHTAAAAPSANINPPTVSAAVGGQSDV 60

Query: 225  ---GNRRSFRQTVCRHWLRSLCMKGDSCGFLHQYDKSRMPVCRFFRLYGECREQDCVYKH 395
               GNRRSFRQTVCRHWLRSLCMKGD+CGFLHQYDKSRMP+CRFFRLYGECREQDCVYKH
Sbjct: 61   GFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVYKH 120

Query: 396  TNEDIKECNMYKLGFCPNGPDCRYRHAKQPGPPPPVDEVLQKIQQLTSYNYGASNRFSHH 575
            T EDIKECNMYKLGFCPNGPDCRYRHAK PGPPPPV+E+LQKIQ L SYNYG SNRF+ +
Sbjct: 121  TIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYGYSNRFNQN 180

Query: 576  RNTNHSHQPDKSQSLQGANATNQGAVPKPTASDSSNMPQP----------AGQDQVQNLP 725
            RN N+S Q DKSQ+ Q  N  +       T +      QP           G  Q Q  P
Sbjct: 181  RNANYSTQSDKSQASQAQNGMSLAVKSTATETPIIQQHQPNQQVQPPQLQGGPTQAQIHP 240

Query: 726  SNPSNQTGRPATPLPQGITRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEH 905
            +   NQ  R A  LPQG +RYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE+
Sbjct: 241  NGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVEN 300

Query: 906  VILIFSVNRTRHFQGCAKMTSKIGETAGGGNWKHAHGTAHYGRNFSVTWLKLCELSFNKT 1085
            VILIFSVNRTRHFQGC KMTS+IG  A GGNWKH HGTAHYGRNFSV WLKLCELSF KT
Sbjct: 301  VILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWLKLCELSFQKT 360

Query: 1086 RHLRNPFNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMATLVXXXXXXXXXXXXG 1265
             HLRNP+NENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMA  +            G
Sbjct: 361  HHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKRQEEKAKG 420

Query: 1266 VNIDNGADNPDIVPFDDNXXXXXXXXXXXXXXXNIS--QAPAVAMQXXXXXXXMMWXXXX 1439
            VN DNG DNPDIVPF+DN               + S  Q    A         + W    
Sbjct: 421  VNPDNGKDNPDIVPFEDNEEEEEEEEEEESEDEDESFDQGFGPAALGRGRGRGIAWPPIM 480

Query: 1440 XXXXXXXXXXXXXXXXXXXXXVDGFTYGP-RPDGFPLPDPFGVGPRPFVPYGPRFSGDF- 1613
                                  DGF+YG   P+GFP+PD FG+GPRPF PYGP FS D  
Sbjct: 481  PFGHGPRPPPGMRGFPPGMMG-DGFSYGAMTPEGFPMPDHFGMGPRPFGPYGPPFSSDLM 539

Query: 1614 ---TNPAPG---MMFPGRP 1652
                 PA G   MM PGRP
Sbjct: 540  FHGRPPAGGFGMMMGPGRP 558

