BLASTX nr result

ID: Akebia24_contig00015567 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00015567
         (2507 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation spec...   824   0.0  
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   810   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   790   0.0  
ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation spec...   786   0.0  
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   784   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   781   0.0  
ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation spec...   780   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   779   0.0  
gb|EXB51974.1| Cleavage and polyadenylation specificity factor C...   776   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   773   0.0  
ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation spec...   759   0.0  
ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun...   734   0.0  
ref|XP_002300333.2| zinc finger family protein [Populus trichoca...   728   0.0  
ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation spec...   726   0.0  
gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus...   722   0.0  
gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malu...   716   0.0  
ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citr...   713   0.0  
ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation spec...   701   0.0  
ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation spec...   700   0.0  
ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation spec...   696   0.0  

>ref|XP_002281594.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Vitis vinifera]
          Length = 673

 Score =  824 bits (2129), Expect = 0.0
 Identities = 442/712 (62%), Positives = 483/712 (67%), Gaps = 14/712 (1%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            +D EGVLSFDFEGGLD AP +  + A PLI +D++  +               EP  G  
Sbjct: 2    EDAEGVLSFDFEGGLDAAPGTAATVA-PLIQSDATAAAAAPSSVVSA------EPTPGGA 54

Query: 2147 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 1968
              RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCVYKH+NE
Sbjct: 55   PGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVYKHTNE 114

Query: 1967 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1788
            DIKECNMYKLGFCPNG DCRYRH KLPGPPP  EEV QKIQ  S+FNYGSSNRF+Q RN 
Sbjct: 115  DIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFYQNRNP 174

Query: 1787 SYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIETQN 1608
             Y  QTE+SQ  QGSN VN     K STT                             QN
Sbjct: 175  -YNQQTEKSQILQGSNAVNLGTVAKSSTTE-------AINVQQQQVQPPQQQVSQTPMQN 226

Query: 1607 LQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSI 1428
            L N LP +ANKTA+PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS+
Sbjct: 227  LPNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSV 286

Query: 1427 DNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1248
            +NVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH
Sbjct: 287  ENVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 346

Query: 1247 KTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1068
            KTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+           
Sbjct: 347  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKA 406

Query: 1067 KGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGMMWAPHMP 891
            KGVN D+  ENPDIVPF                 SF Q L  AAQ      G+MW PHMP
Sbjct: 407  KGVNPDNGGENPDIVPF-EDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMP 465

Query: 890  LARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSG 711
            LARGARP+P +RGFPPVMMG DGF+Y A+ PDGF MPD+FG+ PRAF PYGPRFSGD   
Sbjct: 466  LARGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDF-- 523

Query: 710  LGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSV 531
                          TGP SGMMF GR  QPG VF                       +  
Sbjct: 524  --------------TGPASGMMFPGR-GQPGAVF--PASGYGMMMGPGRAPFMGGMGVPA 566

Query: 530  AAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD-------------MGQEMA 390
            AAP RA               P +QNN     K+DQR P++              GQ+MA
Sbjct: 567  AAPTRAGRPVGMPPMFPPPPPPNSQNNR---TKRDQRTPVNDRNDRYSGGSDQGRGQDMA 623

Query: 389  GPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRK 234
            GP   D+ +Y  G+K Q +D FGG NSFRNDESESEDEAPRRSRHGEGKK++
Sbjct: 624  GPD--DETQYLQGLKSQQDDQFGGGNSFRNDESESEDEAPRRSRHGEGKKKR 673


>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  810 bits (2092), Expect = 0.0
 Identities = 442/727 (60%), Positives = 489/727 (67%), Gaps = 20/727 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVI----SNTXXXXXXXXXXSLLEPV 2160
            DD EG LSFDFEGGLD A P+ P+A++P++ +D S      SN           S  +P 
Sbjct: 2    DDSEGGLSFDFEGGLD-AGPAAPTASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTNDPA 60

Query: 2159 A---GNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 1989
            A   G    RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC
Sbjct: 61   AAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1988 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1809
            VYKH+NEDIKECNMYKLGFCPNG DCRYRH KLPGPPPP EEV+QKIQ  S++NY   N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNY---NK 177

Query: 1808 FFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXX 1629
            FFQQRN+ +  QTE+SQ PQG N VNQ    K STT                        
Sbjct: 178  FFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQT--- 234

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
               + QN+ N    +ANKTA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 235  ---QIQNVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 291

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAFDS +NVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 292  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLK 351

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCELSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV    
Sbjct: 352  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAEL 411

Query: 1088 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMM 909
                   KGVN D+  ENPDIVPF                 SFS   +AAQ      G+M
Sbjct: 412  KREEEKAKGVNSDNGGENPDIVPF-EDNEEEEEEESEEEDESFS---AAAQGRGRGRGVM 467

Query: 908  WAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRF 729
            W PHMPLARGARPMPG+RGFPP+MMGGDGF+YG +TPDGF +PDLFG APR F PYGPRF
Sbjct: 468  WPPHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRF 526

Query: 728  SGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXX 549
            SGD                 TGP SGMMF GRP QPG +F                    
Sbjct: 527  SGDF----------------TGPASGMMFPGRPPQPGAMF--PAGGLGMMMGPGRAPFMG 568

Query: 548  XXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD----------MGQ 399
                + A PVR                P +Q N+ R VK+DQR P +           GQ
Sbjct: 569  GMGPTGANPVRGGRPVSMPPMFPPPPAPSSQ-NSGRAVKRDQRTPTNDRYGAGSEQGRGQ 627

Query: 398  EMAGPG--MLDDGKY-QSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDS 228
            EMAGPG  + D+ +Y Q G K   ED F   NSFRNDESESEDEAPRRSR+GEGKK++ S
Sbjct: 628  EMAGPGGRLDDETQYQQEGQKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRS 687

Query: 227  EVDEQQN 207
               +  N
Sbjct: 688  LEGDDAN 694


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  790 bits (2041), Expect = 0.0
 Identities = 426/721 (59%), Positives = 478/721 (66%), Gaps = 18/721 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPV---- 2160
            DD +G LSFDFEGGLD++ P+NP+A++P I +D++               S  +P     
Sbjct: 2    DDTDGGLSFDFEGGLDSSGPTNPTASIPAIPSDNTAAVAAATNNSIVPNVSSNDPASAAA 61

Query: 2159 --AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCV 1986
              A N   RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDCV
Sbjct: 62   AAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCV 121

Query: 1985 YKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRF 1806
            YKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQ  +++NYGSSN+F
Sbjct: 122  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSSNKF 181

Query: 1805 FQQRNASYTNQTERSQFPQGSNIVNQVVAVKQ-STTADXXXXXXXXXXXXXXXXXXXXXX 1629
            FQQR A +    ++SQF QG N + Q +A K   T +                       
Sbjct: 182  FQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQQQA 241

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
                TQNL N  P +AN+TA PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 242  TQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 301

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAFDS +NVILIFSVNRTRHFQGCAKMTSKIG  VGGGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 302  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVKWLK 361

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCELSFHKTRHLRNPYNENLPVKISRDCQELEP VG QLA LLY EPDSELMAIS+    
Sbjct: 362  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLAAEA 421

Query: 1088 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSA-AQXXXXXXGM 912
                   KGVN ++  +NPDIVPF                 SF Q L A  Q      G+
Sbjct: 422  KREEEKAKGVNPENGGDNPDIVPF-EDNEEEEEEESEEEEESFGQALGAPGQGRGRGRGI 480

Query: 911  MWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPR 732
            +W PHMPLARGARP+PG+RGFPP+MMG D F+YG +TPDGF MPDLFG+APR F PY PR
Sbjct: 481  IW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYAPR 539

Query: 731  FSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXX 552
            FSGD                 TG  SGMMF GRP QPG VF                   
Sbjct: 540  FSGDF----------------TGAASGMMFPGRPPQPGGVF--PNGGFGMMMGPGRAPFM 581

Query: 551  XXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL--------DMGQE 396
                 +   P+R +              PL   +  R VK+DQR           D G+ 
Sbjct: 582  GGMGPNSTNPLRGN------WPGGMPFPPLPTPSPQRPVKRDQRMTANDRYSTGSDQGRN 635

Query: 395  MAGPGMLDDGKY-QSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEV 222
             AG    D+ +Y Q G+K   ED FG  NSFRNDESESEDEAPRRSRHGEG KKR+ SE 
Sbjct: 636  TAGEPD-DEARYQQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEG 694

Query: 221  D 219
            D
Sbjct: 695  D 695


>ref|XP_004486563.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cicer arietinum]
          Length = 677

 Score =  786 bits (2029), Expect = 0.0
 Identities = 418/706 (59%), Positives = 463/706 (65%), Gaps = 8/706 (1%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            +D EGVLSFDFEGGLD APPS  + +VP  A  S  I +           +   PV+GN 
Sbjct: 2    EDSEGVLSFDFEGGLDAAPPSAATVSVP--APPSGPIVHPDSSLPPSISSNGAAPVSGNI 59

Query: 2147 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 1968
              RR FRQTVCRHWLRSLCMKG+ACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH+NE
Sbjct: 60   PGRRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTNE 119

Query: 1967 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1788
            DIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH  ++N+ +S++F QQR +
Sbjct: 120  DIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHKFIQQRGS 179

Query: 1787 SYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIETQN 1608
            SYT Q E+SQFPQG N  NQ VA K                               +TQN
Sbjct: 180  SYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQI------QTQN 233

Query: 1607 LQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSI 1428
            L N  P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS+
Sbjct: 234  LANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSV 293

Query: 1427 DNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1248
            +NVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFH
Sbjct: 294  ENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 353

Query: 1247 KTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1068
            KTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+           
Sbjct: 354  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKA 413

Query: 1067 KGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPL 888
            KGVN D+  ENPDIVPF                      +   Q      GMMW PHMPL
Sbjct: 414  KGVNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMPL 473

Query: 887  ARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGL 708
             RGARPMPG++GF PVMM GDG +YG   PDGF MPDLFGM PR F PYGPRFSGD +  
Sbjct: 474  GRGARPMPGMQGFNPVMM-GDGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFA-- 530

Query: 707  GQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVA 528
                          GP + MMF GRP+QPG                          M V 
Sbjct: 531  --------------GPPAAMMFRGRPSQPG-----MFPGGGFGMMMNPGRGPFMGGMGVP 571

Query: 527  APVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRR-----PLDMGQEMAGPGMLDDGK 363
             P                  P    N NR+ K+DQR          GQE    G   D  
Sbjct: 572  GPNPPRGGRPLNMPPMFPPPPPPPQNVNRIAKRDQRTNDRNDRYSSGQEQ---GKSQDML 628

Query: 362  YQSG---IKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRK 234
             QSG    ++Q + S    N+FRN++SESEDEAPRRSRHGEGKKRK
Sbjct: 629  SQSGGPDDEMQYQQSGAPANNFRNEDSESEDEAPRRSRHGEGKKRK 674


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  784 bits (2025), Expect = 0.0
 Identities = 416/723 (57%), Positives = 465/723 (64%), Gaps = 16/723 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSL-LEPVAGN 2151
            +D EGVLSFDFEGGLDTAP +  + + PL+  DSS  ++               EP A N
Sbjct: 2    EDSEGVLSFDFEGGLDTAPSAAAAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPAAVN 61

Query: 2150 NIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSN 1971
               RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH+N
Sbjct: 62   VPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKHTN 121

Query: 1970 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRN 1791
            EDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH  ++NY SSN+FFQQR 
Sbjct: 122  EDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQRG 181

Query: 1790 ASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIETQ 1611
            +SYT Q E+SQ PQG+N  NQ V  K                               + Q
Sbjct: 182  SSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQN-----QIQ 236

Query: 1610 NLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1431
            N+ N  P +A++ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS
Sbjct: 237  NVANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDS 296

Query: 1430 IDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1251
            ++NVILIFSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSF
Sbjct: 297  VENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 356

Query: 1250 HKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1071
            HKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPD ELMA+SV          
Sbjct: 357  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEK 416

Query: 1070 XKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMP 891
             KGVN D+  ENPDIVPF                        A Q      GMMW PHMP
Sbjct: 417  AKGVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWPPHMP 476

Query: 890  LARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSG 711
            L RGARPMPG++GF PVMM GDG +YG + PDGF MPDLF + PRAFAPYGPRFSGD   
Sbjct: 477  LPRGARPMPGMQGFNPVMM-GDGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFG- 534

Query: 710  LGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSV 531
                           GP + MMF GRP+QPG                          ++ 
Sbjct: 535  ---------------GPPAAMMFRGRPSQPG---MFPGGGFGMMMNPGRGPFMGGMGVAG 576

Query: 530  AAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RPLDMGQE 396
            A P R                     N NR+ K+DQR               +  DM  +
Sbjct: 577  ANPPRGGRPVNMPPMFPPPPP--LPQNTNRLAKRDQRTTDRNDRYGSGSEQGKSQDMLSQ 634

Query: 395  MAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDE 216
               P   DD +YQ G K   +D     N+FRND+SESEDEAPRRSRHGEGKK++    D 
Sbjct: 635  SGAPD--DDMQYQQGYKAN-QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKKRRGPEDV 691

Query: 215  QQN 207
              N
Sbjct: 692  NTN 694


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  781 bits (2016), Expect = 0.0
 Identities = 426/727 (58%), Positives = 478/727 (65%), Gaps = 24/727 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVIS-------NTXXXXXXXXXXSLL 2169
            +D EG LSFDFEGGLD A P  P+A+ P I +DS+  +       N              
Sbjct: 2    EDSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2168 EPVAGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 1989
                 ++  RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1988 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1809
            VYKH+NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP  EEV+QKIQ  S++N+G+ N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1808 FFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXX 1629
             FQQR A +++Q ++SQF QG N VNQ  A K ST                         
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT-- 237

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
               + QNL N LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 238  ---QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAFDS +NVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCELSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLA+LLYLEPDSELMAISV    
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEA 414

Query: 1088 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMM 909
                   KGVN D+  +NPDIVPF                       +A+Q      GMM
Sbjct: 415  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMM 470

Query: 908  WAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRF 729
            W   MPLARGARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRF
Sbjct: 471  WPGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRF 529

Query: 728  SGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXX 549
            SGD +G G                 GMMF GRP QPG+VF                    
Sbjct: 530  SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 572

Query: 548  XXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG 402
                   A                   P +  N++RV K+D R  +           D G
Sbjct: 573  MG----PAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQG 628

Query: 401  --QEMAGPGMLDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KK 240
              QEM GPG   D +    Q G K   ED +G RN FRNDESESEDEAPRRSRHGEG KK
Sbjct: 629  RAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKK 687

Query: 239  RKDSEVD 219
            R+DSE D
Sbjct: 688  RRDSEGD 694


>ref|XP_006468290.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Citrus sinensis]
          Length = 683

 Score =  780 bits (2013), Expect = 0.0
 Identities = 427/720 (59%), Positives = 477/720 (66%), Gaps = 17/720 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            +D EG LSFDFEGGLD A P  P+A+ P  A  SS  +                PV  ++
Sbjct: 2    EDSEGGLSFDFEGGLD-AGPGMPTASNPAAAPSSSGAAPDHASA----------PVPHHS 50

Query: 2147 IARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSNE 1968
              RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDCVYKH+NE
Sbjct: 51   -GRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCVYKHTNE 109

Query: 1967 DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRNA 1788
            DIKECNMYKLGFCPNGPDCRYRHVKLPGPPP  EEV+QKIQ  S++N+G+ N+ FQQR A
Sbjct: 110  DIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNKHFQQRGA 169

Query: 1787 SYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIETQN 1608
             +++QT++SQF QG N VNQ  A K ST                            + QN
Sbjct: 170  -FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT-----QMQN 223

Query: 1607 LQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSI 1428
            L N LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 
Sbjct: 224  LPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSA 283

Query: 1427 DNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1248
            +NVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH
Sbjct: 284  ENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 343

Query: 1247 KTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXXXXX 1068
            KTRHLRNPYNENLPVKISRDCQELEP +GEQLA+LLYLEPDSELMAISV           
Sbjct: 344  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEAKREEEKA 403

Query: 1067 KGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMPL 888
            KGVN D+  +NPDIVPF                       +A+Q      GMMW   MPL
Sbjct: 404  KGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMMWPGPMPL 459

Query: 887  ARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDLSGL 708
            ARGARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRFSGD +G 
Sbjct: 460  ARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRFSGDFTGP 518

Query: 707  GQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXMSVA 528
            G                 GMMF GRP QPG+VF                           
Sbjct: 519  G-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMG----P 557

Query: 527  APVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG--QEMAG 387
            A                   P +  N++R  K+D R  +           D G  QEM G
Sbjct: 558  AATNPRGGRPVGVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGG 617

Query: 386  PGMLDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEVD 219
            PG   D +    Q G K   ED +G RN FRNDESESEDEAPRRSRHGEG KKR+DSE D
Sbjct: 618  PGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKKRRDSEGD 676


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 691

 Score =  779 bits (2011), Expect = 0.0
 Identities = 419/719 (58%), Positives = 463/719 (64%), Gaps = 22/719 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVP---LIATDSSVISNTXXXXXXXXXXS-LLEPV 2160
            +D EGVLSFDFEGGLD AP S+ +AAVP   L+  DSS  ++               +P 
Sbjct: 2    EDSEGVLSFDFEGGLDAAP-SSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 2159 AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 1980
             GN   RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 1979 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1800
            H+NEDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH  ++NY SSN+FFQ
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180

Query: 1799 QRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXI 1620
            QR ASY  Q E+ Q PQG+N  NQ V      T                           
Sbjct: 181  QRGASYNQQAEKPQLPQGTNSTNQGV------TGKPLPAESGNAQPQQQVQQSQQQVNQS 234

Query: 1619 ETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1440
            + QN+ N  P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEA
Sbjct: 235  QMQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEA 294

Query: 1439 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1260
            FDS++NVIL+FSVNRTRHFQGCAKMTS+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCE
Sbjct: 295  FDSVENVILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCE 354

Query: 1259 LSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXX 1080
            LSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV       
Sbjct: 355  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKRE 414

Query: 1079 XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAP 900
                KGVN D+  ENPDIVPF                        A Q      GMMW P
Sbjct: 415  EEKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPP 474

Query: 899  HMPLARGARPMPGLRGFPPVMMGGDGFTY---GAITPDGFPMPDLFGMAPRAFAPYGPRF 729
            HMPL RGARPMPG++GF PVMM GDG +Y   G + PDGF MPDLFG+ PR FAPYGPRF
Sbjct: 475  HMPLGRGARPMPGMQGFNPVMM-GDGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRF 533

Query: 728  SGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXX 549
            SGD                  GP + MMF GRP+QPG                       
Sbjct: 534  SGDFG----------------GPPAAMMFRGRPSQPG---MFPSGGFGMMMNPGRGPFMG 574

Query: 548  XXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RP 414
               +  A P R                     N NR  K+DQR               + 
Sbjct: 575  GMGVGGANPPRGGRPVNMPPMFPPPPP--LPQNANRAAKRDQRTADRNDRFGSGSEQGKS 632

Query: 413  LDMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR 237
             DM  +  GP   DD +YQ G K   +D     N+FRND+SESEDEAPRRSRHGEGKK+
Sbjct: 633  QDMLSQSGGPD--DDAQYQQGYKGN-QDDHPAVNNFRNDDSESEDEAPRRSRHGEGKKK 688


>gb|EXB51974.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  776 bits (2003), Expect = 0.0
 Identities = 425/731 (58%), Positives = 474/731 (64%), Gaps = 28/731 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTA----PPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPV 2160
            +D EGVLSFDFEGGLDT     PP+  +A+  LI  DSS  + +          S  +P 
Sbjct: 2    EDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSA-DPT 60

Query: 2159 AG-----NNIAR-RCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECRE 1998
            +G     +N  R R FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+RLYGECRE
Sbjct: 61   SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120

Query: 1997 QDCVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGS 1818
            QDCVYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPP  EEV+QKIQH S++NY  
Sbjct: 121  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNY-H 179

Query: 1817 SNRFFQQRNAS-YTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXX 1641
            SN+FFQQRNA  +    E+   P G N V+Q V  K S                      
Sbjct: 180  SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239

Query: 1640 XXXXXXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSN 1461
                   + QN+   LP +AN+T  PLP G+SRYFIVKSCNRENLELSVQQGVWATQRSN
Sbjct: 240  QN-----QIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSN 294

Query: 1460 EAKLNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSV 1281
            EAKLNEAFD  +NVILIFSVNRTRHFQGCAKM S+IGG + GGNWKYAHGTAHYGRNFSV
Sbjct: 295  EAKLNEAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSV 354

Query: 1280 KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISV 1101
            KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+
Sbjct: 355  KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISL 414

Query: 1100 XXXXXXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXX 921
                       KGV+ D+  ENPDIVPF                 SFSQ L A Q     
Sbjct: 415  AAESKREEEKAKGVDPDNGGENPDIVPF-EDNEEDEEEESEDEEESFSQVLGANQGRGRG 473

Query: 920  XGMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPY 741
             G+MW PHMPL+RGARPMP ++GFPPVM+G DG  YG +TPDGFPMPDLF + PRAF PY
Sbjct: 474  RGVMWPPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPY 533

Query: 740  GPRFSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVF-XXXXXXXXXXXXXXX 564
            GPRF GD                  GPTSGMMF GRP QPG VF                
Sbjct: 534  GPRFPGDF----------------MGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPC 577

Query: 563  XXXXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD-------- 408
                     S A P+R                     N NR  ++DQR   +        
Sbjct: 578  MGGMGVQGTSPARPMRPGAMPPMFQQPPP-----PSQNMNRPPRRDQRGLANDRNERYGA 632

Query: 407  -----MGQEMAGP--GMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGE 249
                  GQEM+GP  G  DD  YQ G K + ED +G  NSFRNDESESEDEAPRRSRHG+
Sbjct: 633  GSDQVRGQEMSGPAGGPEDDAHYQLGAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGD 692

Query: 248  G-KKRKDSEVD 219
            G KKR+ SE D
Sbjct: 693  GKKKRRSSEED 703


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max]
          Length = 681

 Score =  773 bits (1996), Expect = 0.0
 Identities = 416/715 (58%), Positives = 458/715 (64%), Gaps = 18/715 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSA-AVPLIATDSSVISNTXXXXXXXXXXS-LLEPVAG 2154
            +D EGVLSFDFEGGLD AP S  +A + PLI  DSS  ++              ++PV G
Sbjct: 2    EDSEGVLSFDFEGGLDAAPSSAAAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDPVGG 61

Query: 2153 NNI-ARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKH 1977
             N+  RR FRQTVCRHWLRSLCMKGDACGFLHQYDKARMP+CRF+RLYGECREQDCVYKH
Sbjct: 62   GNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYKH 121

Query: 1976 SNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQ 1797
            +NEDIKECNMYKLGFCPNGPDCRYRH K PGPPPP EEV+QKIQH  ++NY SSN+FFQQ
Sbjct: 122  TNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQQ 181

Query: 1796 RNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIE 1617
            R ASY  Q E+   PQG+N  NQ V      T +                         +
Sbjct: 182  RGASYNQQAEKPLLPQGNNSTNQGV------TGNPLPAELGNAQPQQQVQQSQQQVNQSQ 235

Query: 1616 TQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1437
             QN+ N  P +AN+TATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAF
Sbjct: 236  MQNVANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAF 295

Query: 1436 DSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 1257
            DS++NVILIFSVNRTRHFQGCAKMTSKIGG V GGNWKYAHGTAHYGRNFSVKWLKLCEL
Sbjct: 296  DSVENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCEL 355

Query: 1256 SFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXX 1077
            SFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAISV        
Sbjct: 356  SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREE 415

Query: 1076 XXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPH 897
               KGVN D+  ENPDIVPF                        A Q      GMMW PH
Sbjct: 416  EKAKGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPH 475

Query: 896  MPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFSGDL 717
            MPL RGARPMPG++GF PVMM GDG +YG + PDGF MPDLFG+ PR FAPYGPRFSGD 
Sbjct: 476  MPLGRGARPMPGMQGFNPVMM-GDGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDF 534

Query: 716  SGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXM 537
                             GP + MMF GRP+QPG                          +
Sbjct: 535  G----------------GPPAAMMFRGRPSQPG---MFPGGGFGMMLNPGRGPFMGGIGV 575

Query: 536  SVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------------RPLDMG 402
              A P R                     N NR  K+DQR               +  DM 
Sbjct: 576  GGANPPRGGRPVNMPPMFPPPPP--LPQNANRAAKRDQRTADRNDRFGSGSEQGKSQDML 633

Query: 401  QEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR 237
             +  GP   DD +YQ G K        G      D+SESEDEAPRRSRHGEGKK+
Sbjct: 634  SQSGGPD--DDPQYQQGYK--------GNQDDHPDDSESEDEAPRRSRHGEGKKK 678


>ref|XP_004141524.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis sativus]
          Length = 707

 Score =  759 bits (1961), Expect = 0.0
 Identities = 412/728 (56%), Positives = 468/728 (64%), Gaps = 25/728 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSA--AVPLIATDSSV------ISNTXXXXXXXXXXSL 2172
            +D EGVLSFDFEGGLD A P+NP+A  ++P+I +DSS       +SN           + 
Sbjct: 2    EDSEGVLSFDFEGGLD-AGPTNPAATSSLPIINSDSSAPPAASAVSNPLSGALGPAVSA- 59

Query: 2171 LEPVA---GNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECR 2001
             EP     GN   RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECR
Sbjct: 60   -EPTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECR 118

Query: 2000 EQDCVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYG 1821
            EQDCVYKH+NEDIKECNMYK GFCPNGPDCRYRH KLPGPPPP EE++QKIQH  ++NYG
Sbjct: 119  EQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYG 178

Query: 1820 SSNRFFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXX 1641
             SN+FF QR    + Q E+SQFPQ   +V Q V  K S                      
Sbjct: 179  PSNKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAESVNVQQQQGQQSAPQASQT 238

Query: 1640 XXXXXXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSN 1461
                     Q+L N  P + N+ AT LPQG+SRYFIVKSCNRENLELSVQQGVWATQRSN
Sbjct: 239  P-------VQSLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSN 291

Query: 1460 EAKLNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSV 1281
            EAKLNEAFDS DNVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGT HYG+NFS+
Sbjct: 292  EAKLNEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSL 351

Query: 1280 KWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISV 1101
            KWLKLCELSF KTRHLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPD ELMA+SV
Sbjct: 352  KWLKLCELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSV 411

Query: 1100 XXXXXXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXX 924
                       KGVN D  +ENPDIVPF                 SF Q+     Q    
Sbjct: 412  AAESKREEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGR 471

Query: 923  XXGMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAP 744
              GMMW PHMP+ RGARP  G++GFPP MMG DG +YG +TPDGFPMPD+FGM PR F P
Sbjct: 472  GRGMMWPPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMTPRGFGP 531

Query: 743  YG--PRFSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXX 570
            YG  PRFSGD                  GP + MMF GRP+QP  +F             
Sbjct: 532  YGPTPRFSGDF----------------MGPPTAMMFRGRPSQPAAMF--PPSGFGMMMGQ 573

Query: 569  XXXXXXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR---------- 420
                      ++ A P R                P +Q N NR +K+DQR          
Sbjct: 574  GRGPFMGGMGVAGANPARPGRPVGVSPLYPPPAVPSSQ-NMNRAIKRDQRGLTNDRYIVG 632

Query: 419  RPLDMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-K 243
               + G E+   G  ++ +Y+ G K   ++ +G   +FRN+ESESEDEAPRRSRHGEG K
Sbjct: 633  MDQNKGVEIQSSGRDEEMQYKQGSKAYSDEQYGTGTTFRNEESESEDEAPRRSRHGEGKK 692

Query: 242  KRKDSEVD 219
            KR+ SE D
Sbjct: 693  KRRGSEGD 700


>ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica]
            gi|462410040|gb|EMJ15374.1| hypothetical protein
            PRUPE_ppa019072mg [Prunus persica]
          Length = 695

 Score =  734 bits (1895), Expect = 0.0
 Identities = 400/725 (55%), Positives = 458/725 (63%), Gaps = 22/725 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVP----LIATDSSVISNTXXXXXXXXXXSLLEPV 2160
            +D +G ++FDFEGGLD    + P+   P    L+ +DS V +            +   P 
Sbjct: 2    EDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNHPNP- 60

Query: 2159 AGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYK 1980
              N    R +RQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+RLYGECREQDCVYK
Sbjct: 61   --NRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQDCVYK 118

Query: 1979 HSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQ 1800
            H+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +SN+F+Q
Sbjct: 119  HTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSNKFYQ 178

Query: 1799 QRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXI 1620
            QRNA +  Q ++ Q  QG N V Q V  K ST                            
Sbjct: 179  QRNAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQVGHT------ 232

Query: 1619 ETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1440
            +TQNL N L  +AN++A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEA
Sbjct: 233  QTQNLPNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEA 291

Query: 1439 FDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCE 1260
            FDS +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHG+AHYGRNFSVKWLKLCE
Sbjct: 292  FDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLKLCE 351

Query: 1259 LSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXX 1080
            LSFHKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMA+S+       
Sbjct: 352  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAESKRE 411

Query: 1079 XXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQ--XXXXXXGMMW 906
                KGVN ++  ENPDIVPF                 SF                G+MW
Sbjct: 412  EEKAKGVNPENGGENPDIVPF-EDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGIMW 470

Query: 905  APHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRFS 726
             PHMPLARG RPMPG++GFPP MMG D   YG   PDGF MP+ FG+ PR F PYGPRFS
Sbjct: 471  PPHMPLARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGMPNPFGVGPRGFNPYGPRFS 529

Query: 725  GDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXX 546
            GD                 TGPT GMMF GRP QPG                        
Sbjct: 530  GDF----------------TGPTPGMMFRGRPQQPG----FPPGGYGMMMGPGRAPFMGG 569

Query: 545  XXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLD-------------M 405
              +  A P R                  +  N NR+ K+D R P +              
Sbjct: 570  MGVGGANPGRPGRPTGMSPMFPPP----SSQNTNRMQKRDPRGPSNDRNERYSAGSGQGK 625

Query: 404  GQEMAG--PGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKR-K 234
            GQE+ G   G  D+ +YQ   K   ED +G  N+ RND+SESEDEAPRRSRHGEGKK+ +
Sbjct: 626  GQEIPGLAGGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAPRRSRHGEGKKKGR 685

Query: 233  DSEVD 219
             SE D
Sbjct: 686  GSEGD 690


>ref|XP_002300333.2| zinc finger family protein [Populus trichocarpa]
            gi|550349048|gb|EEE85138.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 669

 Score =  728 bits (1879), Expect = 0.0
 Identities = 407/741 (54%), Positives = 457/741 (61%), Gaps = 37/741 (4%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDS--------SVISNTXXXXXXXXXXSL 2172
            +D EGVLSFDFEGGLD+ P +NP A++P I +D+           +NT            
Sbjct: 2    EDSEGVLSFDFEGGLDSGP-ANPIASIPAIPSDNYGAATAAAPNTTNTTTNTTNNSNSGA 60

Query: 2171 LEPVAGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQD 1992
             +  AG    RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQD
Sbjct: 61   ADIQAG----RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 116

Query: 1991 CVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSN 1812
            CVYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEVVQKIQ  +++N  +SN
Sbjct: 117  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVVQKIQQLNSYNGVTSN 176

Query: 1811 RFFQQRNASYTNQTERSQF----PQGS---NIVNQVVAVKQSTTADXXXXXXXXXXXXXX 1653
            + FQQRNA ++ Q E+S      P G+   N+  Q    +Q+ T                
Sbjct: 177  KNFQQRNAGFSQQIEKSPNTIIKPSGTESANVQQQQQQQQQTQTPHLTNG---------- 226

Query: 1652 XXXXXXXXXXIETQNLQNSLPTEANKTATPLPQGLSR-----------YFIVKSCNRENL 1506
                         Q+ Q   P   N+ ATPLPQG+S            YFIVKSCNRENL
Sbjct: 227  -------------QHQQPQQPNPLNRIATPLPQGISSFFSCVSPSQFVYFIVKSCNRENL 273

Query: 1505 ELSVQQGVWATQRSNEAKLNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNW 1326
            ELSVQQGVWATQRSNE KLNEA DS DNVILIFSVNRTRHFQGCAKM SKIG  VGGGNW
Sbjct: 274  ELSVQQGVWATQRSNEIKLNEALDSADNVILIFSVNRTRHFQGCAKMASKIGASVGGGNW 333

Query: 1325 KYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLAS 1146
            KYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELEP +GEQLAS
Sbjct: 334  KYAHGTAHYGRNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLAS 393

Query: 1145 LLYLEPDSELMAISVXXXXXXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXX 966
            LLYLEPDSELMA+S+           KGVN D   ENPDIVPF                 
Sbjct: 394  LLYLEPDSELMAVSLAAEAKREEEKEKGVNPDSGGENPDIVPF-EDNEEEEEEESEEEEE 452

Query: 965  SFSQTLS-AAQXXXXXXGMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGF 789
            SF Q L  AAQ      GMMW  H P+ARGARP+PG+RGFPP+MMG DGF+YGA+TPD F
Sbjct: 453  SFGQPLGPAAQGRGRGRGMMWPSHNPMARGARPIPGIRGFPPMMMGADGFSYGAVTPDSF 512

Query: 788  PMPDLFGMAPRAFAPYGPRFSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVF 609
             MPDLFG+A R F PYGPRFSGD                 TG  SGMMF GRP+QPG VF
Sbjct: 513  GMPDLFGVASRGFPPYGPRFSGDF----------------TGAASGMMFPGRPSQPGAVF 556

Query: 608  XXXXXXXXXXXXXXXXXXXXXXXMS----------VAAPVRASXXXXXXXXXXXXXXPLA 459
                                    S          + AP  A                 +
Sbjct: 557  PAGGFGMMMGPGRPPFIGGMGPTPSNLLRGPRPGGMFAPFPAP----------------S 600

Query: 458  QNNNNRVVKKDQRRPLDMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESED 279
              NN+R VK+DQR   +   +                     + FG  NS RNDESESED
Sbjct: 601  SQNNSRSVKRDQRAAANDRNDR-------------------HNQFGAVNSIRNDESESED 641

Query: 278  EAPRRSRHGEGKKRKDSEVDE 216
            EAPRRSRHGEGKK++    D+
Sbjct: 642  EAPRRSRHGEGKKKRRGSGDD 662


>ref|XP_004295608.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Fragaria vesca subsp. vesca]
          Length = 689

 Score =  726 bits (1873), Expect = 0.0
 Identities = 395/718 (55%), Positives = 454/718 (63%), Gaps = 15/718 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAG-N 2151
            +D +GVL+FDFEGGLD+A  S P+     +A+ + + S++             +P    N
Sbjct: 2    EDPDGVLNFDFEGGLDSAAVSAPTHTG--LASSAPIQSDSFASQPKNQAAPAPQPDPNVN 59

Query: 2150 NIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVYKHSN 1971
               R+ FRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRF+R+YGECREQDCVYKH+N
Sbjct: 60   PSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKHTN 119

Query: 1970 EDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFFQQRN 1791
            EDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +SN+F Q RN
Sbjct: 120  EDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKFSQPRN 179

Query: 1790 ASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXXIETQ 1611
              +  Q +RSQ  Q +N  NQVV    +  +                          + Q
Sbjct: 180  GGFPQQHDRSQPAQVTNSFNQVVVRPSAAES-------ANVQQPQQFQQTQQPVAQTQAQ 232

Query: 1610 NLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1431
            ++ N L ++AN+ A PLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS
Sbjct: 233  SVPNGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDS 292

Query: 1430 IDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1251
             +NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSF
Sbjct: 293  AENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSF 352

Query: 1250 HKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXXXXXX 1071
            HKTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDSELMAIS+          
Sbjct: 353  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEK 412

Query: 1070 XKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMMWAPHMP 891
             KGVN ++  ENPDIVPF                    Q    A        +MW PHMP
Sbjct: 413  AKGVNPENGGENPDIVPFEDNEEEEEEESDDEEDY---QVPGGAIENRGRGRVMWPPHMP 469

Query: 890  L-ARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGM-APRAFAPYGPRFSGDL 717
            L  RG RPMPG++GFP  MMG D   YG +TPDGF MP+ FGM  PR F PYGPRFSGD 
Sbjct: 470  LGGRGGRPMPGMQGFPG-MMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSGDF 528

Query: 716  SGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXXXXXM 537
                             GP  GMMF GRP QPG +F                        
Sbjct: 529  G----------------GPNPGMMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGVGG 572

Query: 536  SVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR-----------RPLDMGQEMA 390
            +   P R                     NNNR+ K+D R                G+EM 
Sbjct: 573  N--NPARGGRPGGMPPMFPPHP---PSQNNNRLQKRDPRGSGNDRNERYSAGSGHGKEMQ 627

Query: 389  GPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKRKDSEVD 219
              G  D+  YQ   K   ED +G  N+ RND+SESEDEAPRRSRHGEG KKR+DSE D
Sbjct: 628  AGGPDDENHYQHSSKSYQED-YGAGNNGRNDDSESEDEAPRRSRHGEGKKKRRDSEGD 684


>gb|EYU43238.1| hypothetical protein MIMGU_mgv1a002387mg [Mimulus guttatus]
          Length = 681

 Score =  722 bits (1863), Expect = 0.0
 Identities = 394/730 (53%), Positives = 456/730 (62%), Gaps = 27/730 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAG-- 2154
            DD EG LSFDFEGGLD  P S+P+A+VP+I + ++  + +              PV    
Sbjct: 2    DDGEGGLSFDFEGGLDIGP-SHPTASVPVIQSSANANTASAAAAAANPYNPSAAPVPATQ 60

Query: 2153 -----NNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 1989
                 NN  RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDC
Sbjct: 61   AAEGMNNGGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDC 120

Query: 1988 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1809
            VYKH+NED+KECNMYKLGFCPNGPDCRYRH KLPGPPP  EEV+QKIQ  +++NYG SN 
Sbjct: 121  VYKHTNEDVKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQQLTSYNYGKSNN 180

Query: 1808 FFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXX 1629
            FFQ RN+++  QTE+ QFPQG N  +QV     +   +                      
Sbjct: 181  FFQNRNSNFAQQTEKPQFPQGPNGTHQVGKTNAAEPGNLNQPAQQSQQPGSQG------- 233

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
               + Q++ N    +A++ ATPLPQG SRYF+VKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 234  ---QLQSIPNDQQNQASRNATPLPQGASRYFVVKSCNRENLELSVQQGVWATQRSNEAKL 290

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAF+S++N+ILIFSVN+TRHFQGCAKMTS+IGG VGGGNWK+AHGTAHYGRNF++KWLK
Sbjct: 291  NEAFESVENIILIFSVNKTRHFQGCAKMTSRIGGSVGGGNWKHAHGTAHYGRNFALKWLK 350

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCEL+F KTRHLRNPYNENLPVKISRDCQELEP +GEQLASLLYLEPDS+LMAI++    
Sbjct: 351  LCELTFDKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSDLMAIAIAAEL 410

Query: 1088 XXXXXXXKGVNLDDETENPDIVPF---XXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXX 918
                   KGVN+D+  ENPDIVPF                     F      AQ      
Sbjct: 411  KREEEKAKGVNIDNGAENPDIVPFEDNEEEEEEEEEEEESEDEDEFPGQAFGAQGRGVGR 470

Query: 917  GMMWAPHM-PLARGARPMPGLRGFPPVMMGGDGFTYGAITP---DGFPMPDLFGMAPRAF 750
            GMMW PHM PL RG RP PG+RGFPP MMGGDGF YG   P   DGFPM D FGM PR F
Sbjct: 471  GMMWGPHMPPLGRGPRPFPGVRGFPPNMMGGDGFPYGHGPPLNHDGFPMHDPFGMVPRGF 530

Query: 749  APYGPRFSGDLSGLGQSSAM-------GFTPI--DGTGPTSGMMFHGRP-NQPGNVFXXX 600
              +GPRF GD +G      M       GF P+   G GP  G    GRP   P   F   
Sbjct: 531  GQFGPRFGGDFAGPASGPMMFAGRPPGGFGPMMGQGRGPFMGGGRGGRPVGMPPPFFPPP 590

Query: 599  XXXXXXXXXXXXXXXXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQR 420
                                     PV A                     N+  VK+DQ+
Sbjct: 591  -----------------------PPPVAAQPPP----------------QNSNWVKRDQK 611

Query: 419  RPLDMGQEMAGPGMLDDGKYQSGIKVQCEDSFGGR--NSFRNDESESEDEAPRRSRHGEG 246
             P     +++     D GK Q  +          +   S+RNDESESEDEAPRRSRHGEG
Sbjct: 612  APYSDRNDVS-----DQGKGQEIVSGSSNRGNAAKREESYRNDESESEDEAPRRSRHGEG 666

Query: 245  -KKRKDSEVD 219
             KKR+ SE +
Sbjct: 667  KKKRRGSEAE 676


>gb|AHN05783.1| YTH domain-contained RNA binding protein 14 [Malus domestica]
          Length = 667

 Score =  716 bits (1847), Expect = 0.0
 Identities = 398/726 (54%), Positives = 454/726 (62%), Gaps = 23/726 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDT----APPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPV 2160
            +D +G L+FDFEGGLD     +  + P+  VP   ++ SV+ +           +   P 
Sbjct: 2    EDSDGGLNFDFEGGLDAPATVSASAGPANTVP--TSNYSVMQSDSAVTGLGANQAAAAPQ 59

Query: 2159 AGNNIAR---RCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 1989
               N  R   R +RQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQDC
Sbjct: 60   PNQNANRTGGRSYRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDC 119

Query: 1988 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1809
            VYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP EEV+QKIQH +++NY +S++
Sbjct: 120  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLTSYNYNNSSK 179

Query: 1808 FFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXX 1629
            F+QQRNA +  Q ++ Q  QG N       V + TTA+                      
Sbjct: 180  FYQQRNAGFPQQGDKHQPAQGPNNF-----VGKPTTAEPGNVQQQQQQQLQQTQQHVGPT 234

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
               +TQ L N L  +AN++A PLPQG SRYFIVKSCNRENLELSVQQG+WATQRSNE+KL
Sbjct: 235  ---QTQTLPNGLANQANRSALPLPQGTSRYFIVKSCNRENLELSVQQGLWATQRSNESKL 291

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAFDS +NVILIFSVNRTRHFQGCAKM S+IGG VGGGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 292  NEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 351

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCELSFHKTRHLRNPYNENLPVKISRDCQELE  VGEQLASLLYLEPDSELMAIS+    
Sbjct: 352  LCELSFHKTRHLRNPYNENLPVKISRDCQELELSVGEQLASLLYLEPDSELMAISIAAES 411

Query: 1088 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSA---AQXXXXXX 918
                   KGVN ++  ENPDIVPF                 SF Q   A    +      
Sbjct: 412  KREEEKAKGVNPENGGENPDIVPF-EDNEEEEEEESEDEEDSFGQVPGAGNDGRGRGRGG 470

Query: 917  GMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYG 738
            G+MW PHM L RG RPMPG++GFPP MMG D   Y    PDGF MP+ FGMAPR F PYG
Sbjct: 471  GVMWPPHMALPRGGRPMPGMQGFPPGMMGHDAMPY---VPDGFVMPNPFGMAPRGFNPYG 527

Query: 737  PRFSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXX 558
            PRFSGD                 TGP  GMMF GRP QPG                    
Sbjct: 528  PRFSGDF----------------TGPNPGMMFRGRPQQPG-------------------- 551

Query: 557  XXXXXXMSVAAPVRASXXXXXXXXXXXXXXPL----------AQNNNNRVVKKDQRRPLD 408
                    +  P RA                +          +  N NR+ K+D R    
Sbjct: 552  -FPPGGFGIMGPGRAPFMGGIHPGRGGRPTGMSPMFPPPPPPSSQNPNRMPKRDPRGAST 610

Query: 407  --MGQEMAGPGMLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KKR 237
               GQ+M+GP   DD           E  +G  NS RND+SESEDEAPRRSRHG+G KKR
Sbjct: 611  DRKGQDMSGP---DD-----------ETHYGAGNSSRNDDSESEDEAPRRSRHGDGKKKR 656

Query: 236  KDSEVD 219
            +DSE D
Sbjct: 657  RDSEGD 662


>ref|XP_006448925.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551536|gb|ESR62165.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 672

 Score =  713 bits (1841), Expect = 0.0
 Identities = 400/727 (55%), Positives = 450/727 (61%), Gaps = 24/727 (3%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVIS-------NTXXXXXXXXXXSLL 2169
            +D EG LSFDFEGGLD A P  P+A+ P I +DS+  +       N              
Sbjct: 2    EDSEGGLSFDFEGGLD-AGPGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGAAPDHA 60

Query: 2168 EPVAGNNIARRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDC 1989
                 ++  RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RL+GECREQDC
Sbjct: 61   SAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDC 120

Query: 1988 VYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNR 1809
            VYKH+NEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPP  EEV+QKIQ  S++N+G+ N+
Sbjct: 121  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNHGNPNK 180

Query: 1808 FFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXX 1629
             FQQR A +++Q ++SQF QG N VNQ  A K ST                         
Sbjct: 181  LFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGTQTT-- 237

Query: 1628 XXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1449
               + QNL N LP + N+ ATPLPQG+SRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 238  ---QMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1448 NEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLK 1269
            NEAFDS +NVILIFSVNRTRHFQGCAKMTSKIGG VGGGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1268 LCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXX 1089
            LCELSFHKTRHLRNPYNENLPVK                             AISV    
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVK-----------------------------AISVAAEA 385

Query: 1088 XXXXXXXKGVNLDDETENPDIVPFXXXXXXXXXXXXXXXXXSFSQTLSAAQXXXXXXGMM 909
                   KGVN D+  +NPDIVPF                       +A+Q      GMM
Sbjct: 386  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEE----ESLGTASQGRGRGRGMM 441

Query: 908  WAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPRF 729
            W   MPLARGARP+PG+RGFPP+M+G DGF+YG +TPDGFPMPDLFG+APR FAPYGPRF
Sbjct: 442  WPGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRF 500

Query: 728  SGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXXXXX 549
            SGD +G G                 GMMF GRP QPG+VF                    
Sbjct: 501  SGDFTGPG-----------------GMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGG 543

Query: 548  XXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPL-----------DMG 402
                   A                   P +  N++RV K+D R  +           D G
Sbjct: 544  MG----PAATNPRGGRPVGVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQG 599

Query: 401  --QEMAGPGMLDDGK---YQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEG-KK 240
              QEM GPG   D +    Q G K   ED +G RN FRNDESESEDEAPRRSRHGEG KK
Sbjct: 600  RAQEMGGPGRGPDDEVQYQQEGSKANQEDQYGSRN-FRNDESESEDEAPRRSRHGEGKKK 658

Query: 239  RKDSEVD 219
            R+DSE D
Sbjct: 659  RRDSEGD 665


>ref|XP_006359103.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 692

 Score =  701 bits (1810), Expect = 0.0
 Identities = 391/715 (54%), Positives = 451/715 (63%), Gaps = 11/715 (1%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            D+ EG L+FDFEGGLDT P ++P+A+VP+I +     +             +   V G +
Sbjct: 2    DEGEGGLNFDFEGGLDTGP-THPTASVPVIQSFDHTAAAAPSANINPPT--VSAAVGGQS 58

Query: 2147 IA-----RRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCVY 1983
                   RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDCVY
Sbjct: 59   DVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCVY 118

Query: 1982 KHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRFF 1803
            KH+ EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPP EE++QKIQH +++NYG SNRF 
Sbjct: 119  KHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASYNYGYSNRFN 178

Query: 1802 QQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXXX 1623
            Q RNA+Y+ Q+++SQ  Q  N ++  +AVK + T                          
Sbjct: 179  QNRNANYSTQSDKSQASQAQNGMS--LAVKSTATETPIIQQHQPNQQVQPPQLQGGPT-- 234

Query: 1622 IETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1443
             + Q   N    +A++TA  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE
Sbjct: 235  -QAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 293

Query: 1442 AFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLC 1263
            AFDS++NVILIFSVNRTRHFQGC KMTS+IGG   GGNWK+ HGTAHYGRNFSVKWLKLC
Sbjct: 294  AFDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSVKWLKLC 353

Query: 1262 ELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXXX 1083
            ELSF KT HLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPDSELMAIS+      
Sbjct: 354  ELSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESKR 413

Query: 1082 XXXXXKGVNLDDETENPDIVPF---XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXG 915
                 KGVN D+  +NPDIVPF                    SF Q    AA       G
Sbjct: 414  QEEKAKGVNPDNGKDNPDIVPFEDNEEEEEEEEEEESEDEDESFDQGFGPAALGRGRGRG 473

Query: 914  MMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGP 735
            + W P MP   G RP PG+RGFPP MM GDGF+YGA+TP+GFPMPD FGM PR F PYGP
Sbjct: 474  IAWPPIMPFGHGPRPPPGMRGFPPGMM-GDGFSYGAMTPEGFPMPDHFGMGPRPFGPYGP 532

Query: 734  RFSGDLSGLGQSSAMGFTPIDGTG--PTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXX 561
             FS DL   G+  A GF  + G G  P  G M  G    P                    
Sbjct: 533  PFSSDLMFHGRPPAGGFGMMMGPGRPPFMGGMGPGATGPPRAGRAVGMHPSFVPPSSQPS 592

Query: 560  XXXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPG 381
                       APV                     ++ N     DQ +    GQEM G  
Sbjct: 593  QYPYKAKREQRAPV---------------------SDRNDRFSSDQGK----GQEMMGSV 627

Query: 380  MLDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDE 216
               DG +    K + ++ FG  NS +N+ESESEDEAPRRSRHG+GKK++  +VDE
Sbjct: 628  GGPDGVHMQIGKSEHDNQFGAGNSQKNEESESEDEAPRRSRHGDGKKKR-RDVDE 681


>ref|XP_004231555.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum lycopersicum]
          Length = 689

 Score =  700 bits (1807), Expect = 0.0
 Identities = 391/714 (54%), Positives = 447/714 (62%), Gaps = 10/714 (1%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            D+ EG L+FDFEGGLDT P ++P+A+VP+I +      +T              P  G  
Sbjct: 2    DEGEGGLNFDFEGGLDTGP-THPTASVPVIQS----FDHTAAAASSANINPPTVPAVGGQ 56

Query: 2147 IA------RRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQDCV 1986
                    RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMPICRF+RLYGECREQDCV
Sbjct: 57   GDVGFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECREQDCV 116

Query: 1985 YKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSNRF 1806
            YKH+ EDIKECNMYKLGFCPNGPDCRYRH K+PGPPPP EE++QKIQH ++ NYG SNRF
Sbjct: 117  YKHTIEDIKECNMYKLGFCPNGPDCRYRHAKMPGPPPPVEEILQKIQHLASNNYGYSNRF 176

Query: 1805 FQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXXXX 1626
             Q RNA+Y+ QT++SQ  Q  N  +  +AVK + T                         
Sbjct: 177  NQNRNANYSTQTDKSQASQAQNGTS--LAVKSTATETPIIQQHQPHQQVQPPQLQGGPT- 233

Query: 1625 XIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1446
              + Q   N    +A++TA  LPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 234  --QAQIHPNGQQNQADRTAVVLPQGTSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 291

Query: 1445 EAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKL 1266
            EAFDS++NVILIFSVNRTRHFQGC KMTS+IGG   GGNWK+ HGTAHYGRNFS+KWLKL
Sbjct: 292  EAFDSVENVILIFSVNRTRHFQGCGKMTSRIGGAANGGNWKHEHGTAHYGRNFSLKWLKL 351

Query: 1265 CELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXXXX 1086
            CELSF KT HLRNPYNENLPVKISRDCQELEP VGEQLASLLYLEPDSELMAIS+     
Sbjct: 352  CELSFQKTHHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESK 411

Query: 1085 XXXXXXKGVNLDDETENPDIVPF-XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXXGM 912
                  KGVN D+  +NPDIVPF                  +F Q    AA       G+
Sbjct: 412  RLEEKAKGVNPDNGKDNPDIVPFEDNEEEEDEEEESEDEDENFDQGFGPAALGRGRGRGI 471

Query: 911  MWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYGPR 732
             W P MP   G RP PG+RGFPP MM GDGF+YGA+TP+GFPM D FGM PR F PYGPR
Sbjct: 472  AWPPIMPFGHGPRPPPGMRGFPPGMM-GDGFSYGAMTPEGFPMTDHFGMGPRPFPPYGPR 530

Query: 731  FSGDLSGLGQSSAMGFTPIDGTG--PTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXX 558
            FS DL   G+  A GF  + G G  P  G M  G    P                     
Sbjct: 531  FSSDLMFHGRPPAGGFGMMIGPGRPPFVGGMGPGATGPPRAGRAVRMHPSFIPPSSQPSQ 590

Query: 557  XXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPGM 378
                      APV                     ++ N     DQ +    GQEM G   
Sbjct: 591  YPYRAKREQRAPV---------------------SDRNDRFSSDQGK----GQEMMGSVN 625

Query: 377  LDDGKYQSGIKVQCEDSFGGRNSFRNDESESEDEAPRRSRHGEGKKRKDSEVDE 216
              DG +    K + ++ FG  NS +ND SESEDEAPRRSRHG+GKK++  +VDE
Sbjct: 626  GPDGVHMQIGKSEHDNQFGAGNSLKNDGSESEDEAPRRSRHGDGKKKR-RDVDE 678


>ref|XP_006352991.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Solanum tuberosum]
          Length = 677

 Score =  696 bits (1795), Expect = 0.0
 Identities = 390/720 (54%), Positives = 452/720 (62%), Gaps = 20/720 (2%)
 Frame = -1

Query: 2327 DDQEGVLSFDFEGGLDTAPPSNPSAAVPLIATDSSVISNTXXXXXXXXXXSLLEPVAGNN 2148
            DD EG L+FDFEGGLDT P ++P+A+VP++ +   + +             L+ P  G  
Sbjct: 2    DDGEGGLNFDFEGGLDTGP-THPTASVPVLQSAGHITTGPAPNASVA----LVPPGGGVG 56

Query: 2147 IA--------RRCFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPICRFYRLYGECREQD 1992
                      RR FRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRF+RLYGECREQD
Sbjct: 57   QGGDGSFVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQD 116

Query: 1991 CVYKHSNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPPFEEVVQKIQHQSTFNYGSSN 1812
            CVYKH+NEDIKECNMYKLGFCPNGPDCRYRH KLPGPPPP  EV+Q+IQ+ ++  YG SN
Sbjct: 117  CVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVVEVLQRIQNLTS--YGYSN 174

Query: 1811 RFFQQRNASYTNQTERSQFPQGSNIVNQVVAVKQSTTADXXXXXXXXXXXXXXXXXXXXX 1632
            RFFQ RN +Y+ Q ++SQ PQ  N++NQ V     +TA                      
Sbjct: 175  RFFQNRNTNYSTQADKSQIPQVPNVMNQAV----KSTAAEPPIGQPHQPHQQQVQQPQHQ 230

Query: 1631 XXXIETQNLQNSLPTEANKTATPLPQGLSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 1452
                +TQ L +S   + N+ A PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAK
Sbjct: 231  GAPTQTQTLPSS---QQNQAAIPLPQGPSRYFIVKSCNRENLELSVQQGVWATQRSNEAK 287

Query: 1451 LNEAFDSIDNVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWL 1272
            LNEAFDS++NVIL+FS+NRTRHFQG AKMTS+IGG   GGNWK+ HGTAHYGRNFS+KWL
Sbjct: 288  LNEAFDSVENVILVFSINRTRHFQGLAKMTSRIGGAAKGGNWKHEHGTAHYGRNFSLKWL 347

Query: 1271 KLCELSFHKTRHLRNPYNENLPVKISRDCQELEPLVGEQLASLLYLEPDSELMAISVXXX 1092
            KLCELSF KTRHLRNPYNENLPVKISRDCQELE  VGEQLASLLY+EPDSELMA+S+   
Sbjct: 348  KLCELSFQKTRHLRNPYNENLPVKISRDCQELEISVGEQLASLLYVEPDSELMAVSLAAE 407

Query: 1091 XXXXXXXXKGVNLDDETENPDIVPF-XXXXXXXXXXXXXXXXXSFSQTLS-AAQXXXXXX 918
                    KGVN D+  ENPDIVPF                   F Q    AA       
Sbjct: 408  SKREEERAKGVNPDNGNENPDIVPFEDNEEEEEEESEEEEEDEGFGQAFGPAALGRGRGR 467

Query: 917  GMMWAPHMPLARGARPMPGLRGFPPVMMGGDGFTYGAITPDGFPMPDLFGMAPRAFAPYG 738
            G++W P +P  RGARP PG+RGFPP MM  DGF+YG++TPDGFPMPD +GM  R F P+G
Sbjct: 468  GIVWPPLVPFGRGARPFPGMRGFPPGMM-SDGFSYGSMTPDGFPMPDPYGMGGRPFGPFG 526

Query: 737  PRFSGDLSGLGQSSAMGFTPIDGTGPTSGMMFHGRPNQPGNVFXXXXXXXXXXXXXXXXX 558
            PRF GD+    +  A G     G G    MM  GRP   G +                  
Sbjct: 527  PRFPGDMMFHSRPPAAG-----GFGM---MMGPGRPPFMGGM------------------ 560

Query: 557  XXXXXXMSVAAPVRASXXXXXXXXXXXXXXPLAQNNNNRVVKKDQRRPLDMGQEMAGPGM 378
                       P R                P +QN     VKKDQR P +   +    G 
Sbjct: 561  -----GPGAPGPPRGGRPMGIHPSFIPPTPPPSQNPR---VKKDQRAPFNERNDRFSSGP 612

Query: 377  LDDGKYQSGIKVQCEDSFGG----------RNSFRNDESESEDEAPRRSRHGEGKKRKDS 228
             D G+ Q     +   S GG           NSFRNDESESEDEAPRRSRHG+GKK+K+S
Sbjct: 613  -DQGRGQ-----EIAGSVGGPAEGVHYPQTENSFRNDESESEDEAPRRSRHGDGKKKKNS 666

