BLASTX nr result

ID: Ziziphus21_contig00000331 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ziziphus21_contig00000331
         (2672 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010092677.1| Cleavage and polyadenylation specificity fac...   936   0.0  
ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prun...   920   0.0  
ref|XP_008225626.1| PREDICTED: cleavage and polyadenylation spec...   912   0.0  
ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation spec...   885   0.0  
ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylati...   882   0.0  
ref|XP_007041140.1| Cleavage and polyadenylation specificity fac...   882   0.0  
ref|XP_014518648.1| PREDICTED: 30-kDa cleavage and polyadenylati...   875   0.0  
ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phas...   874   0.0  
ref|XP_008445183.1| PREDICTED: cleavage and polyadenylation spec...   872   0.0  
ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylati...   872   0.0  
ref|XP_004295608.1| PREDICTED: 30-kDa cleavage and polyadenylati...   871   0.0  
gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium r...   867   0.0  
ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation spec...   858   0.0  
ref|XP_002523201.1| conserved hypothetical protein [Ricinus comm...   852   0.0  
ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation spec...   850   0.0  
ref|XP_004141524.1| PREDICTED: 30-kDa cleavage and polyadenylati...   850   0.0  
ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylati...   849   0.0  
ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citr...   845   0.0  
gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sin...   843   0.0  
ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation spec...   838   0.0  

>ref|XP_010092677.1| Cleavage and polyadenylation specificity factor CPSF30 [Morus
            notabilis] gi|587862159|gb|EXB51974.1| Cleavage and
            polyadenylation specificity factor CPSF30 [Morus
            notabilis]
          Length = 710

 Score =  936 bits (2420), Expect = 0.0
 Identities = 492/713 (69%), Positives = 522/713 (73%), Gaps = 11/713 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAA--TTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPT-DPS 2317
            MEDSEGVLSFDFEGGLD  A     N   AS  LI  D S               + DP+
Sbjct: 1    MEDSEGVLSFDFEGGLDTTAGGCPPNAAAASAALIHPDSSAAAASNNLAASNSAVSADPT 60

Query: 2316 VPG----VNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECRE 2149
              G     NP   RSFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+CRFFR++GECRE
Sbjct: 61   SGGGGGASNPGRGRSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECRE 120

Query: 2148 QDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNT 1969
            QDCVYKHT+EDIKECNMYKLGFCPNGPDCRYRHAKL      VEEVLQKIQ+L+SYNY+ 
Sbjct: 121  QDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPSVEEVLQKIQHLSSYNYH- 179

Query: 1968 SNKFFQQRNAG-FSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQ 1792
            SNKFFQQRNAG F+Q  EK  L  G  AV+QGVVGKPS +ES N                
Sbjct: 180  SNKFFQQRNAGGFAQLGEKPLLPLGPNAVSQGVVGKPSILESANVQQPQQQVQPSQQPVG 239

Query: 1791 -NPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1615
             N I NV  GLPNQANRT +PLP GISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 240  QNQIQNVFTGLPNQANRTVAPLPPGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 299

Query: 1614 EAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKL 1435
            EAFD  ENVILIFSVNRTRHFQGCAKM+SRIGGS+SGGNWKYAHGTAHYGRNFSVKWLKL
Sbjct: 300  EAFDCAENVILIFSVNRTRHFQGCAKMISRIGGSISGGNWKYAHGTAHYGRNFSVKWLKL 359

Query: 1434 CELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXX 1255
            CELSFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM         
Sbjct: 360  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESK 419

Query: 1254 XXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMW 1075
                  KGV+PDN GENPDIVPF                S SQV G ANQ      GVMW
Sbjct: 420  REEEKAKGVDPDNGGENPDIVPFEDNEEDEEEESEDEEESFSQVLG-ANQGRGRGRGVMW 478

Query: 1074 PPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFS 895
            PPHMPL+RGARPMP MQGFPPVM+GADGSPYGPVTPDGF MPDLF VGPRAFNPYGPRF 
Sbjct: 479  PPHMPLSRGARPMPSMQGFPPVMIGADGSPYGPVTPDGFPMPDLFNVGPRAFNPYGPRFP 538

Query: 894  SDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXX 715
             DFMGP+SGMMFRGRPTQPG+V            GRAP MGGMGVQGT+P R +R     
Sbjct: 539  GDFMGPTSGMMFRGRPTQPGAVFPGGGFGMMMGPGRAPCMGGMGVQGTSPARPMRPGAMP 598

Query: 714  XXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQ 541
                   P S QN NR  +RDQRG ANDRNER+  GSDQ++GQE  G AGGP+D+AHYQ 
Sbjct: 599  PMFQQPPPPS-QNMNRPPRRDQRGLANDRNERYGAGSDQVRGQEMSGPAGGPEDDAHYQL 657

Query: 540  GLKPHQEDQYGAGNSFRNDESESEDEAPXXXXXXXXXXXXXXXXXXGATGSDH 382
            G K  QEDQYGAGNSFRNDESESEDEAP                   ATGSDH
Sbjct: 658  GAKARQEDQYGAGNSFRNDESESEDEAPRRSRHGDGKKKRRSSEEDAATGSDH 710


>ref|XP_007214175.1| hypothetical protein PRUPE_ppa019072mg [Prunus persica]
            gi|462410040|gb|EMJ15374.1| hypothetical protein
            PRUPE_ppa019072mg [Prunus persica]
          Length = 695

 Score =  920 bits (2378), Expect = 0.0
 Identities = 474/682 (69%), Positives = 507/682 (74%), Gaps = 5/682 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAAT--TNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSV 2314
            MEDS+G ++FDFEGGLDA AA   TNPG  S  L+QSD                   P+ 
Sbjct: 1    MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAP---QPNH 57

Query: 2313 PGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVY 2134
            P  N +  RS+RQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+CRFFR++GECREQDCVY
Sbjct: 58   PNPNRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 117

Query: 2133 KHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFF 1954
            KHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ+LNSYNYNTSNKF+
Sbjct: 118  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNTSNKFY 177

Query: 1953 QQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVNV 1774
            QQRNAGF QQA+K Q AQG  +V QGVVGKPS  ES N                    N+
Sbjct: 178  QQRNAGFPQQADKYQSAQGPNSVYQGVVGKPSTGESANVHQQQQVQQTQQQVGHTQTQNL 237

Query: 1773 PNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTE 1594
            PNGL NQANR+A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS E
Sbjct: 238  PNGLANQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAE 296

Query: 1593 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 1414
            NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHG+AHYGRNFSVKWLKLCELSFHK
Sbjct: 297  NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYGRNFSVKWLKLCELSFHK 356

Query: 1413 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXK 1234
            TRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM               K
Sbjct: 357  TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAVSIAAESKREEEKAK 416

Query: 1233 GVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQ-XXXXXXGVMWPPHMPL 1057
            GVNP+N GENPDIVPF                S   VPG  N+       G+MWPPHMPL
Sbjct: 417  GVNPENGGENPDIVPFEDNEEEEEEESDDEEESFGPVPGVGNEGRGRGRGGIMWPPHMPL 476

Query: 1056 ARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMGP 877
            ARG RPMPGMQGFPP MMGAD  PYGP  PDGF MP+ FGVGPR FNPYGPRFS DF GP
Sbjct: 477  ARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGMPNPFGVGPRGFNPYGPRFSGDFTGP 535

Query: 876  SSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXXX 697
            + GMMFRGRP QPG              GRAPFMGGMGV G NP R  R           
Sbjct: 536  TPGMMFRGRPQQPG--FPPGGYGMMMGPGRAPFMGGMGVGGANPGRPGRPTGMSPMFPPP 593

Query: 696  XPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPHQ 523
               S QNTNR+ KRD RGP+NDRNER+S GS Q KGQE  G AGGPDDEA YQQ  K ++
Sbjct: 594  ---SSQNTNRMQKRDPRGPSNDRNERYSAGSGQGKGQEIPGLAGGPDDEARYQQASKAYR 650

Query: 522  EDQYGAGNSFRNDESESEDEAP 457
            EDQYGAGN+ RND+SESEDEAP
Sbjct: 651  EDQYGAGNNSRNDDSESEDEAP 672


>ref|XP_008225626.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Prunus mume]
          Length = 715

 Score =  912 bits (2357), Expect = 0.0
 Identities = 473/699 (67%), Positives = 508/699 (72%), Gaps = 22/699 (3%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAAT--TNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPS- 2317
            MEDS+G ++FDFEGGLDA AA   TNPG  S  L+QSD                P  P+ 
Sbjct: 1    MEDSDGDINFDFEGGLDATAAAGPTNPGPPSNSLMQSDSGVAAVDTNPAAAAPQPNHPNP 60

Query: 2316 ----------------VPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPI 2185
                            +   N +  RS+RQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+
Sbjct: 61   NRSGGRSYRQTVCRHWLANPNRSGGRSYRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPV 120

Query: 2184 CRFFRMFGECREQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQ 2005
            CRFFR++GECREQDCVYKHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQ
Sbjct: 121  CRFFRLYGECREQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQ 180

Query: 2004 KIQNLNSYNYNTSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXX 1825
            KIQ+LNSYNYNTSNKF+QQRNAGF QQA+K Q AQG  ++ QGVVGKPS  ES N     
Sbjct: 181  KIQHLNSYNYNTSNKFYQQRNAGFPQQADKYQSAQGPNSIYQGVVGKPSTGESANVHQQQ 240

Query: 1824 XXXXXXXXXXQNPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWA 1645
                           N+PNGL NQANR+A PLPQGISRYFIVKSCNRENLELSVQQGVWA
Sbjct: 241  QVQQTQQQVGHTQTQNLPNGLVNQANRSA-PLPQGISRYFIVKSCNRENLELSVQQGVWA 299

Query: 1644 TQRSNEAKLNEAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYG 1465
            TQRSNE+KLNEAFDS ENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHG+AHYG
Sbjct: 300  TQRSNESKLNEAFDSAENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGSAHYG 359

Query: 1464 RNFSVKWLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSEL 1285
            RNFSVKWLKLCELSFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSEL
Sbjct: 360  RNFSVKWLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSEL 419

Query: 1284 MXXXXXXXXXXXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQ 1105
            M               KGVNP+N GENPDIVPF                S   VPG  N+
Sbjct: 420  MAVSIAAESKREEEKAKGVNPENGGENPDIVPFEDNEEEEEEESDDEEESFGPVPGVGNE 479

Query: 1104 -XXXXXXGVMWPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGP 928
                   G+MWPPHMPLARG RPMPGMQGFPP MMGAD  PYGP  PDGF MP+ FGVGP
Sbjct: 480  GRGRGRGGIMWPPHMPLARGGRPMPGMQGFPPGMMGADAMPYGP-APDGFGMPNPFGVGP 538

Query: 927  RAFNPYGPRFSSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTN 748
            R FNPYGPRFS DF GP+ GMMFRGRP QPG              GRAPFMGGMGV G N
Sbjct: 539  RGFNPYGPRFSGDFTGPTPGMMFRGRPQQPG--FPPGGYGMMMGPGRAPFMGGMGVGGAN 596

Query: 747  PNRAVRXXXXXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQA 574
            P R  R              S QNTNR+ KRD RGP+NDRNER+S GS Q KGQE  G A
Sbjct: 597  PGRPGRPTGMSPMFPPP---SSQNTNRMQKRDPRGPSNDRNERYSAGSGQGKGQEIPGSA 653

Query: 573  GGPDDEAHYQQGLKPHQEDQYGAGNSFRNDESESEDEAP 457
            GGPDDEA YQQ  K ++EDQYGAGN+ RND+SESEDEAP
Sbjct: 654  GGPDDEARYQQASKAYREDQYGAGNNSRNDDSESEDEAP 692


>ref|XP_003546247.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max] gi|947062499|gb|KRH11760.1|
            hypothetical protein GLYMA_15G128500 [Glycine max]
          Length = 691

 Score =  885 bits (2287), Expect = 0.0
 Identities = 463/683 (67%), Positives = 499/683 (73%), Gaps = 6/683 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVP- 2311
            MEDSEGVLSFDFEGGLDAA ++      SGPL+Q D S                  + P 
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSAAAAVPSGPLVQHDSSAAASAVSNGGHAAPAPSTADPA 60

Query: 2310 GVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVYK 2131
            G N   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFR++GECREQDCVYK
Sbjct: 61   GGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 120

Query: 2130 HTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFFQ 1951
            HT+EDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ+L SYNYN+SNKFFQ
Sbjct: 121  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLFSYNYNSSNKFFQ 180

Query: 1950 QRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVNVP 1771
            QR A ++QQAEK QL QG+ + NQGV GKP   ES NA              Q+ + NV 
Sbjct: 181  QRGASYNQQAEKPQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQVNQSQMQNVA 240

Query: 1770 NGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTEN 1591
            NG PNQANRTA+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS EN
Sbjct: 241  NGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVEN 300

Query: 1590 VILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1411
            VIL+FSVNRTRHFQGCAKM SRIGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT
Sbjct: 301  VILVFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 360

Query: 1410 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXKG 1231
            RHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM               KG
Sbjct: 361  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKAKG 420

Query: 1230 VNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPLAR 1051
            VNPDN GENPDIVPF                S S   G A Q      G+MWPPHMPL R
Sbjct: 421  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFSHGVGPAGQGRGRGRGMMWPPHMPLGR 480

Query: 1050 GARPMPGMQGFPPVMMGADG---SPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMG 880
            GARPMPGMQGF PVMMG DG    P GPV PDGF MPDLFGVGPR F PYGPRFS DF G
Sbjct: 481  GARPMPGMQGFNPVMMG-DGLSYGPVGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGG 539

Query: 879  PSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXX 700
            P + MMFRGRP+QPG +            GR PFMGGMGV G NP R  R          
Sbjct: 540  PPAAMMFRGRPSQPG-MFPSGGFGMMMNPGRGPFMGGMGVGGANPPRGGRPVNMPPMFPP 598

Query: 699  XXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPH 526
              PL  QN NR  KRDQR    DRN+RF  GS+Q K Q+   Q+GGPDD+A YQQG K +
Sbjct: 599  PPPLP-QNANRAAKRDQR--TADRNDRFGSGSEQGKSQDMLSQSGGPDDDAQYQQGYKGN 655

Query: 525  QEDQYGAGNSFRNDESESEDEAP 457
            Q+D + A N+FRND+SESEDEAP
Sbjct: 656  QDD-HPAVNNFRNDDSESEDEAP 677


>ref|XP_002281594.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vitis vinifera]
          Length = 673

 Score =  882 bits (2279), Expect = 0.0
 Identities = 460/679 (67%), Positives = 498/679 (73%), Gaps = 2/679 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTAS--GPLIQSDPSXXXXXXXXXXXXXXPTDPSV 2314
            MED+EGVLSFDFEGGLDAA     PGTA+   PLIQSD +                +P+ 
Sbjct: 1    MEDAEGVLSFDFEGGLDAA-----PGTAATVAPLIQSDATAAAAAPSSVVS----AEPT- 50

Query: 2313 PGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVY 2134
            PG  P  RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR++GECREQDCVY
Sbjct: 51   PGGAPG-RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQDCVY 109

Query: 2133 KHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFF 1954
            KHT+EDIKECNMYKLGFCPNG DCRYRHAKL      +EEV QKIQ L+S+NY +SN+F+
Sbjct: 110  KHTNEDIKECNMYKLGFCPNGSDCRYRHAKLPGPPPTMEEVFQKIQQLSSFNYGSSNRFY 169

Query: 1953 QQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVNV 1774
            Q RN  ++QQ EK+Q+ QGS AVN G V K S  E+ N                 P+ N+
Sbjct: 170  QNRNP-YNQQTEKSQILQGSNAVNLGTVAKSSTTEAINVQQQQVQPPQQQVSQ-TPMQNL 227

Query: 1773 PNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTE 1594
            PNGLPNQAN+TASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS E
Sbjct: 228  PNGLPNQANKTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSVE 287

Query: 1593 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 1414
            NVILIFSVNRTRHFQGCAKM S+IGG V GGNWKYAHGTAHYGRNFSVKWLKLCELSFHK
Sbjct: 288  NVILIFSVNRTRHFQGCAKMTSKIGGFVGGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 347

Query: 1413 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXK 1234
            TRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM               K
Sbjct: 348  TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISLAAESKREEEKAK 407

Query: 1233 GVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPLA 1054
            GVNPDN GENPDIVPF                S  Q  G A Q      G+MWPPHMPLA
Sbjct: 408  GVNPDNGGENPDIVPFEDNEEEEEEESEEEEESFGQALGPAAQGRGRGRGIMWPPHMPLA 467

Query: 1053 RGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMGPS 874
            RGARP+P M+GFPPVMMGADG  Y  V PDGFAMPD+FGVGPRAF PYGPRFS DF GP+
Sbjct: 468  RGARPIPSMRGFPPVMMGADGFSYSAVPPDGFAMPDIFGVGPRAFPPYGPRFSGDFTGPA 527

Query: 873  SGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXXXX 694
            SGMMF GR  QPG+V            GRAPFMGGMGV    P RA R            
Sbjct: 528  SGMMFPGR-GQPGAVFPASGYGMMMGPGRAPFMGGMGVPAAAPTRAGRPVGMPPMFPPPP 586

Query: 693  PLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHYQQGLKPHQEDQ 514
            P + QN    TKRDQR P NDRN+R+S GSDQ +GQ+    GPDDE  Y QGLK  Q+DQ
Sbjct: 587  PPNSQNNR--TKRDQRTPVNDRNDRYSGGSDQGRGQD--MAGPDDETQYLQGLKSQQDDQ 642

Query: 513  YGAGNSFRNDESESEDEAP 457
            +G GNSFRNDESESEDEAP
Sbjct: 643  FGGGNSFRNDESESEDEAP 661


>ref|XP_007041140.1| Cleavage and polyadenylation specificity factor 30 [Theobroma cacao]
            gi|508705075|gb|EOX96971.1| Cleavage and polyadenylation
            specificity factor 30 [Theobroma cacao]
          Length = 698

 Score =  882 bits (2278), Expect = 0.0
 Identities = 468/712 (65%), Positives = 495/712 (69%), Gaps = 10/712 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPS--- 2317
            M+DSEG LSFDFEGGLDA  A     TAS P++ SDPS                 P+   
Sbjct: 1    MDDSEGGLSFDFEGGLDAGPAAP---TASMPVVNSDPSAAANNNSNNNSAVPGAAPTSTN 57

Query: 2316 ----VPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECRE 2149
                  G   A RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+FGECRE
Sbjct: 58   DPAAAVGGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECRE 117

Query: 2148 QDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNT 1969
            QDCVYKHT+EDIKECNMYKLGFCPNG DCRYRHAKL     PVEEVLQKIQ L+SYNYN 
Sbjct: 118  QDCVYKHTNEDIKECNMYKLGFCPNGADCRYRHAKLPGPPPPVEEVLQKIQQLSSYNYN- 176

Query: 1968 SNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQN 1789
              KFFQQRN+GF+QQ EK+Q+ QG   VNQG  GKPS  ES N               Q 
Sbjct: 177  --KFFQQRNSGFAQQTEKSQIPQGQNNVNQGAGGKPSTTESANMHPQQQVQQPQQQVSQT 234

Query: 1788 PIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 1609
             I NVPNG  NQAN+TA PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA
Sbjct: 235  QIQNVPNGQSNQANKTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEA 294

Query: 1608 FDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCE 1429
            FDS ENVILIFSVNRTRHFQGCAKM S+IGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCE
Sbjct: 295  FDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCE 354

Query: 1428 LSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXX 1249
            LSFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM           
Sbjct: 355  LSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAELKRE 414

Query: 1248 XXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPP 1069
                KGVN DN GENPDIVPF                S S    AA Q      GVMWPP
Sbjct: 415  EEKAKGVNSDNGGENPDIVPFEDNEEEEEEESEEEDESFS----AAAQGRGRGRGVMWPP 470

Query: 1068 HMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSD 889
            HMPLARGARPMPGM+GFPP+MMG DG  YGPVTPDGF +PDLFG  PR F PYGPRFS D
Sbjct: 471  HMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGVPDLFG-APRPFPPYGPRFSGD 529

Query: 888  FMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXX 709
            F GP+SGMMF GRP QPG++            GRAPFMGGMG  G NP R  R       
Sbjct: 530  FTGPASGMMFPGRPPQPGAMFPAGGLGMMMGPGRAPFMGGMGPTGANPVRGGRPVSMPPM 589

Query: 708  XXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQ-G 538
                   S QN+ R  KRDQR P ND   R+  GS+Q +GQE  G  G  DDE  YQQ G
Sbjct: 590  FPPPPAPSSQNSGRAVKRDQRTPTND---RYGAGSEQGRGQEMAGPGGRLDDETQYQQEG 646

Query: 537  LKPHQEDQYGAGNSFRNDESESEDEAPXXXXXXXXXXXXXXXXXXGATGSDH 382
             K H EDQ+ AGNSFRNDESESEDEAP                   A GSDH
Sbjct: 647  QKAHHEDQFAAGNSFRNDESESEDEAPRRSRYGEGKKKRRSLEGDDANGSDH 698


>ref|XP_014518648.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Vigna radiata var. radiata]
          Length = 696

 Score =  875 bits (2262), Expect = 0.0
 Identities = 456/680 (67%), Positives = 496/680 (72%), Gaps = 3/680 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVPG 2308
            MEDSEGVLSFDFEGGLD   +       SGPL+Q D S                  + P 
Sbjct: 1    MEDSEGVLSFDFEGGLDTVPSAA--AAPSGPLVQHDSSAAASAVSNGGPPAPVPSTADPA 58

Query: 2307 -VNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVYK 2131
             VN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFR++GECREQDCVYK
Sbjct: 59   AVNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 118

Query: 2130 HTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFFQ 1951
            HT+EDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ+L SYNYN+SNKFFQ
Sbjct: 119  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 178

Query: 1950 QRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVNVP 1771
            QR + ++QQAEK+QL QG+ + NQ V GKP   ES NA              Q+ + NV 
Sbjct: 179  QRGSSYAQQAEKSQLPQGTNSTNQVVTGKPLPAESGNAQPQQQVQQSQQQVSQSQMQNVA 238

Query: 1770 NGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTEN 1591
            NG PNQA+R+A+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS EN
Sbjct: 239  NGQPNQASRSATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSXEN 298

Query: 1590 VILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 1411
            VILIFSVNRTRHFQGCAKM SRIGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT
Sbjct: 299  VILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHKT 358

Query: 1410 RHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXKG 1231
            RHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELM               KG
Sbjct: 359  RHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAKG 418

Query: 1230 VNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPLAR 1051
            VNPDN GENPDIVPF                S     G A Q      G+MWPPHMPL R
Sbjct: 419  VNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPLGR 478

Query: 1050 GARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMGPSS 871
            GARPMPGMQGF PVMMG DG  YGPV PDGF MPDLFGVGPRAF PYGPRFS DF GP +
Sbjct: 479  GARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFGVGPRAFAPYGPRFSGDFGGPPA 537

Query: 870  GMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXXXXP 691
             MMFRGRP+QPG +            GR PFMGGMGV G NP R  R            P
Sbjct: 538  AMMFRGRPSQPG-MFPGGGFGMMMNPGRGPFMGGMGVAGANPARGGRPVNMPPMFPPPPP 596

Query: 690  LSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPHQED 517
            L  QNTNR+ KRDQR  A DRN+R+  GS+Q K Q+   Q+G PDD+  YQQG K +Q D
Sbjct: 597  LP-QNTNRLAKRDQR--ATDRNDRYGSGSEQGKSQDMLSQSGAPDDDTQYQQGYKANQ-D 652

Query: 516  QYGAGNSFRNDESESEDEAP 457
            ++ A N+FRND+SESEDEAP
Sbjct: 653  EHPAVNNFRNDDSESEDEAP 672


>ref|XP_007147504.1| hypothetical protein PHAVU_006G130200g [Phaseolus vulgaris]
            gi|561020727|gb|ESW19498.1| hypothetical protein
            PHAVU_006G130200g [Phaseolus vulgaris]
          Length = 697

 Score =  874 bits (2259), Expect = 0.0
 Identities = 457/681 (67%), Positives = 494/681 (72%), Gaps = 4/681 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVPG 2308
            MEDSEGVLSFDFEGGLD A +       SGPL+Q D S                  + P 
Sbjct: 1    MEDSEGVLSFDFEGGLDTAPSAA--AAPSGPLVQHDSSAAASAVSNGGPPAPTPSGTEPA 58

Query: 2307 -VNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVYK 2131
             VN   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFR++GECREQDCVYK
Sbjct: 59   AVNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCVYK 118

Query: 2130 HTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFFQ 1951
            HT+EDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ+L SYNYN+SNKFFQ
Sbjct: 119  HTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKFFQ 178

Query: 1950 QRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQ-NPIVNV 1774
            QR + ++QQAEK+QL QG+ + NQGV GKP   ES NA                N I NV
Sbjct: 179  QRGSSYTQQAEKSQLPQGTNSTNQGVTGKPLPAESGNAQPQQQVQQSQQQQVSQNQIQNV 238

Query: 1773 PNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTE 1594
             NG PNQA+R A+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS E
Sbjct: 239  ANGQPNQASRAATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSVE 298

Query: 1593 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 1414
            NVILIFSVNRTRHFQGCAKM SRIGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCELSFHK
Sbjct: 299  NVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 358

Query: 1413 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXK 1234
            TRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELM               K
Sbjct: 359  TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSVAAESKREEEKAK 418

Query: 1233 GVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPLA 1054
            GVNPDN GENPDIVPF                S     G A Q      G+MWPPHMPL 
Sbjct: 419  GVNPDNGGENPDIVPFEDNEEEEEEESDEEDESFGHGVGPAGQGRGRGRGMMWPPHMPLP 478

Query: 1053 RGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMGPS 874
            RGARPMPGMQGF PVMMG DG  YGPV PDGF MPDLF VGPRAF PYGPRFS DF GP 
Sbjct: 479  RGARPMPGMQGFNPVMMG-DGLSYGPVAPDGFGMPDLFSVGPRAFAPYGPRFSGDFGGPP 537

Query: 873  SGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXXXX 694
            + MMFRGRP+QPG +            GR PFMGGMGV G NP R  R            
Sbjct: 538  AAMMFRGRPSQPG-MFPGGGFGMMMNPGRGPFMGGMGVAGANPPRGGRPVNMPPMFPPPP 596

Query: 693  PLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPHQE 520
            PL  QNTNR+ KRDQR    DRN+R+  GS+Q K Q+   Q+G PDD+  YQQG K +Q+
Sbjct: 597  PLP-QNTNRLAKRDQR--TTDRNDRYGSGSEQGKSQDMLSQSGAPDDDMQYQQGYKANQD 653

Query: 519  DQYGAGNSFRNDESESEDEAP 457
            D + A N+FRND+SESEDEAP
Sbjct: 654  D-HPAVNNFRNDDSESEDEAP 673


>ref|XP_008445183.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Cucumis melo]
          Length = 710

 Score =  872 bits (2254), Expect = 0.0
 Identities = 450/688 (65%), Positives = 486/688 (70%), Gaps = 11/688 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASG------PLIQSDPSXXXXXXXXXXXXXXPT 2326
            MEDSEGVLSFDFEGGLDAA   TNP  A+       PLI SD S              PT
Sbjct: 1    MEDSEGVLSFDFEGGLDAAP--TNPAAAAAASSSSLPLIPSDSSAPPPLSNSLPGSLGPT 58

Query: 2325 ---DP-SVPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGE 2158
               +P   P  N  +RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFR++GE
Sbjct: 59   LAPEPLGAPTANVGTRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGE 118

Query: 2157 CREQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYN 1978
            CREQDCVYKHT+EDIKECNMYK GFCPNGPDCRYRHAKL      VEE+LQKIQ+L SYN
Sbjct: 119  CREQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPSVEEILQKIQHLGSYN 178

Query: 1977 YNTSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXX 1798
            Y +SNKFF QR  G  QQ EK+Q  QG   V QGV+GKPS  ES N              
Sbjct: 179  YGSSNKFFSQRGVGLPQQNEKSQFPQGPAPVTQGVIGKPSTAESANVQQQQVQQPAQQTS 238

Query: 1797 XQNPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1618
                I +V NG PNQ NRTA+ LPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 239  QTQ-IQSVSNGQPNQLNRTATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 297

Query: 1617 NEAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLK 1438
            NEAFDS +NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYG+NFS+KWLK
Sbjct: 298  NEAFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLK 357

Query: 1437 LCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXX 1258
            LCELSF KTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELM        
Sbjct: 358  LCELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAES 417

Query: 1257 XXXXXXXKGVNPDNSGENPDIVPF-XXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGV 1081
                   KGVNPD   ENPDIVPF                 S  Q  G   Q      G+
Sbjct: 418  KREEEKAKGVNPDIGNENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPAQGRGRGRGI 477

Query: 1080 MWPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPR 901
            MWPPHMP+ RGARP  GMQ FPP MMG DG  YGPVTPDGF MPD+FG+ PR F PYGPR
Sbjct: 478  MWPPHMPMGRGARPFHGMQSFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPR 537

Query: 900  FSSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXX 721
            FS DFMGP S MMFRGRP+QPG++            GR PFMGGMGV GT+P R  R   
Sbjct: 538  FSGDFMGPPSAMMFRGRPSQPGAMFTPGGFGMMMGQGRGPFMGGMGVTGTSPARPGRPVG 597

Query: 720  XXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHYQQ 541
                       S QN NR  KRDQRGP +DRN+R+ VG DQ KGQE  + G D+   Y+Q
Sbjct: 598  VSPLYPPPAVPSAQNINRAIKRDQRGPTSDRNDRYIVGPDQNKGQEMLSSGHDEGMQYKQ 657

Query: 540  GLKPHQEDQYGAGNSFRNDESESEDEAP 457
            G K + ++QYG G +FRN+ESESEDEAP
Sbjct: 658  GSKAYPDEQYGMGTTFRNEESESEDEAP 685


>ref|XP_012436534.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Gossypium raimondii] gi|763780831|gb|KJB47902.1|
            hypothetical protein B456_008G046800 [Gossypium
            raimondii]
          Length = 700

 Score =  872 bits (2253), Expect = 0.0
 Identities = 463/714 (64%), Positives = 495/714 (69%), Gaps = 12/714 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVPG 2308
            M+D+EG LSFDFEGGLDA        TAS P++ SDPS                  + P 
Sbjct: 1    MDDAEGGLSFDFEGGLDAGPPAP---TASMPVVNSDPSAANNTNNFTAPGGVQASINDPV 57

Query: 2307 VNP---ASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCV 2137
             N    A RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+FGECREQDCV
Sbjct: 58   ANQGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCV 117

Query: 2136 YKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKF 1957
            YKHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ L++YNYN  NKF
Sbjct: 118  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNYN--NKF 175

Query: 1956 FQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNP--- 1786
            +QQRNAGF QQ EK+Q+ Q    VNQG  GKPSA ESTN                     
Sbjct: 176  YQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQVS 235

Query: 1785 ---IVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1615
               I NVPNG  NQANRTA PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 236  QTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 295

Query: 1614 EAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKL 1435
            EAFDS ENVIL+FSVNRTRHFQGCAKM S+IGGSV+GGNWKYAHGTAHYGRNFSVKWLKL
Sbjct: 296  EAFDSAENVILVFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKL 355

Query: 1434 CELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXX 1255
            CELSFHKTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELM         
Sbjct: 356  CELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAESK 415

Query: 1254 XXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMW 1075
                  KGVN DN+ ENPDIVPF                S     GAA Q      G+MW
Sbjct: 416  REEEKAKGVNSDNA-ENPDIVPFEDNEEEEEEESEEEDESF----GAAAQGRGRGRGIMW 470

Query: 1074 PPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFS 895
            PPHMPLARGARPMPGM+GFPP+MMG DG  YGPVTPDGF MPDLFG  PR F PYGPRFS
Sbjct: 471  PPHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFG-APRPFAPYGPRFS 529

Query: 894  SDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXX 715
             DF GP+SGMMF GRP QPG +            GRAPFMGGMG  G NP R  R     
Sbjct: 530  GDFTGPASGMMFPGRPPQPGGMFPSGGIGMMMGPGRAPFMGGMGPTGANPARGGRPVGMP 589

Query: 714  XXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQ 541
                     + QN+ R  KRDQR P NDR+   S GS+Q +GQE  G  GG +D   YQQ
Sbjct: 590  PMFPLPPAPASQNSGRAIKRDQRTPTNDRS---SAGSEQGRGQEMGGPGGGLEDGTQYQQ 646

Query: 540  -GLKPHQEDQYGAGNSFRNDESESEDEAPXXXXXXXXXXXXXXXXXXGATGSDH 382
             G K H EDQ+ AGNSFRND+SESEDEAP                   AT SDH
Sbjct: 647  EGQKAHHEDQFAAGNSFRNDDSESEDEAPRRSRHGEGKKKRRGLEGDVATASDH 700


>ref|XP_004295608.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Fragaria vesca subsp. vesca]
          Length = 689

 Score =  871 bits (2251), Expect = 0.0
 Identities = 460/681 (67%), Positives = 493/681 (72%), Gaps = 4/681 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAA--ATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSV 2314
            MED +GVL+FDFEGGLD+AA  A T+ G AS   IQSD                      
Sbjct: 1    MEDPDGVLNFDFEGGLDSAAVSAPTHTGLASSAPIQSDSFASQPKNQAAPAPQPD----- 55

Query: 2313 PGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVY 2134
            P VNP+ R+SFRQTVCRHWLRSLCMKG+ACGFLHQYDKSRMP+CRFFRM+GECREQDCVY
Sbjct: 56   PNVNPSGRKSFRQTVCRHWLRSLCMKGEACGFLHQYDKSRMPVCRFFRMYGECREQDCVY 115

Query: 2133 KHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFF 1954
            KHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ+LNSYNYN SNKF 
Sbjct: 116  KHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQHLNSYNYNNSNKFS 175

Query: 1953 QQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVNV 1774
            Q RN GF QQ +++Q AQ + + NQ VV +PSA ES N               Q    +V
Sbjct: 176  QPRNGGFPQQHDRSQPAQVTNSFNQVVV-RPSAAESANVQQPQQFQQTQQPVAQTQAQSV 234

Query: 1773 PNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDSTE 1594
            PNGL +QANR A PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS E
Sbjct: 235  PNGLASQANRAALPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSAE 294

Query: 1593 NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 1414
            NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK
Sbjct: 295  NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFHK 354

Query: 1413 TRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXXK 1234
            TRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM               K
Sbjct: 355  TRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEKAK 414

Query: 1233 GVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPL- 1057
            GVNP+N GENPDIVPF                   QVPG A +       VMWPPHMPL 
Sbjct: 415  GVNPENGGENPDIVPFEDNEEEEEEESDDEEDY--QVPGGAIE-NRGRGRVMWPPHMPLG 471

Query: 1056 ARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGV-GPRAFNPYGPRFSSDFMG 880
             RG RPMPGMQGFP  MMG D  PYGPVTPDGF MP+ FG+ GPR FNPYGPRFS DF G
Sbjct: 472  GRGGRPMPGMQGFPG-MMGPDAMPYGPVTPDGFVMPNPFGMGGPRGFNPYGPRFSGDFGG 530

Query: 879  PSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXX 700
            P+ GMMFRGRP QPG +            GR PFMGGMGV G NP R  R          
Sbjct: 531  PNPGMMFRGRPPQPGGMFPPGPYGMMMGPGRGPFMGGMGVGGNNPARGGRPGGMPPMFPP 590

Query: 699  XXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHYQQGLKPHQE 520
                  QN NR+ KRD RG  NDRNER+S GS    G+E QAGGPDDE HYQ   K +QE
Sbjct: 591  HP--PSQNNNRLQKRDPRGSGNDRNERYSAGSGH--GKEMQAGGPDDENHYQHSSKSYQE 646

Query: 519  DQYGAGNSFRNDESESEDEAP 457
            D YGAGN+ RND+SESEDEAP
Sbjct: 647  D-YGAGNNGRNDDSESEDEAP 666


>gb|KJB47903.1| hypothetical protein B456_008G046800 [Gossypium raimondii]
          Length = 701

 Score =  867 bits (2241), Expect = 0.0
 Identities = 464/715 (64%), Positives = 496/715 (69%), Gaps = 13/715 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVPG 2308
            M+D+EG LSFDFEGGLDA        TAS P++ SDPS                  + P 
Sbjct: 1    MDDAEGGLSFDFEGGLDAGPPAP---TASMPVVNSDPSAANNTNNFTAPGGVQASINDPV 57

Query: 2307 VNP---ASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCV 2137
             N    A RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+FGECREQDCV
Sbjct: 58   ANQGGGAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGECREQDCV 117

Query: 2136 YKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKF 1957
            YKHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ L++YNYN  NKF
Sbjct: 118  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLSAYNYN--NKF 175

Query: 1956 FQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNA------XXXXXXXXXXXXXX 1795
            +QQRNAGF QQ EK+Q+ Q    VNQG  GKPSA ESTN                     
Sbjct: 176  YQQRNAGFPQQTEKSQIPQAQNNVNQGAAGKPSATESTNVQQQQLQQQQQQIQQPQQQVS 235

Query: 1794 QNPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 1615
            Q  I NVPNG  NQANRTA PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN
Sbjct: 236  QTQIQNVPNGQSNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLN 295

Query: 1614 EAFDSTENVILIFSVNRTRHFQ-GCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLK 1438
            EAFDS ENVIL+FSVNRTRHFQ GCAKM S+IGGSV+GGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 296  EAFDSAENVILVFSVNRTRHFQVGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLK 355

Query: 1437 LCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXX 1258
            LCELSFHKTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPDSELM        
Sbjct: 356  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDSELMAISLAAES 415

Query: 1257 XXXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVM 1078
                   KGVN DN+ ENPDIVPF                S     GAA Q      G+M
Sbjct: 416  KREEEKAKGVNSDNA-ENPDIVPFEDNEEEEEEESEEEDESF----GAAAQGRGRGRGIM 470

Query: 1077 WPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRF 898
            WPPHMPLARGARPMPGM+GFPP+MMG DG  YGPVTPDGF MPDLFG  PR F PYGPRF
Sbjct: 471  WPPHMPLARGARPMPGMRGFPPMMMGGDGFSYGPVTPDGFGMPDLFG-APRPFAPYGPRF 529

Query: 897  SSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXX 718
            S DF GP+SGMMF GRP QPG +            GRAPFMGGMG  G NP R  R    
Sbjct: 530  SGDFTGPASGMMFPGRPPQPGGMFPSGGIGMMMGPGRAPFMGGMGPTGANPARGGRPVGM 589

Query: 717  XXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQ 544
                      + QN+ R  KRDQR P NDR+   S GS+Q +GQE  G  GG +D   YQ
Sbjct: 590  PPMFPLPPAPASQNSGRAIKRDQRTPTNDRS---SAGSEQGRGQEMGGPGGGLEDGTQYQ 646

Query: 543  Q-GLKPHQEDQYGAGNSFRNDESESEDEAPXXXXXXXXXXXXXXXXXXGATGSDH 382
            Q G K H EDQ+ AGNSFRND+SESEDEAP                   AT SDH
Sbjct: 647  QEGQKAHHEDQFAAGNSFRNDDSESEDEAPRRSRHGEGKKKRRGLEGDVATASDH 701


>ref|XP_003534764.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Glycine max] gi|947088097|gb|KRH36762.1|
            hypothetical protein GLYMA_09G022200 [Glycine max]
          Length = 681

 Score =  858 bits (2216), Expect = 0.0
 Identities = 454/682 (66%), Positives = 487/682 (71%), Gaps = 5/682 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXP---TDPS 2317
            MEDSEGVLSFDFEGGLDAA ++      SGPLI  D S                   DP 
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPSSA-AAAPSGPLIPHDSSAAASAVSNGGPAAPAPSAVDP- 58

Query: 2316 VPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCV 2137
            V G N   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDK+RMP+CRFFR++GECREQDCV
Sbjct: 59   VGGGNVPGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKARMPVCRFFRLYGECREQDCV 118

Query: 2136 YKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKF 1957
            YKHT+EDIKECNMYKLGFCPNGPDCRYRHAK      PVEEVLQKIQ+L SYNYN+SNKF
Sbjct: 119  YKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPVEEVLQKIQHLYSYNYNSSNKF 178

Query: 1956 FQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIVN 1777
            FQQR A ++QQAEK  L QG+ + NQGV G P   E  NA              Q+ + N
Sbjct: 179  FQQRGASYNQQAEKPLLPQGNNSTNQGVTGNPLPAELGNAQPQQQVQQSQQQVNQSQMQN 238

Query: 1776 VPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDST 1597
            V NG PNQANRTA+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS 
Sbjct: 239  VANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDSV 298

Query: 1596 ENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 1417
            ENVILIFSVNRTRHFQGCAKM S+IGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCELSFH
Sbjct: 299  ENVILIFSVNRTRHFQGCAKMTSKIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSFH 358

Query: 1416 KTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXXX 1237
            KTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM               
Sbjct: 359  KTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREEEKA 418

Query: 1236 KGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMPL 1057
            KGVNPDN GENPDIVPF                S     G A Q      G+MWPPHMPL
Sbjct: 419  KGVNPDNGGENPDIVPFEDNEEEEEEESDEEEESFGHGVGPAGQGRGRGRGMMWPPHMPL 478

Query: 1056 ARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMGP 877
             RGARPMPGMQGF PVMMG DG  YGPV PDGF MPDLFGVGPR F PYGPRFS DF GP
Sbjct: 479  GRGARPMPGMQGFNPVMMG-DGLSYGPVGPDGFGMPDLFGVGPRGFAPYGPRFSGDFGGP 537

Query: 876  SSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXXX 697
             + MMFRGRP+QPG +            GR PFMGG+GV G NP R  R           
Sbjct: 538  PAAMMFRGRPSQPG-MFPGGGFGMMLNPGRGPFMGGIGVGGANPPRGGRPVNMPPMFPPP 596

Query: 696  XPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPHQ 523
             PL  QN NR  KRDQR    DRN+RF  GS+Q K Q+   Q+GGPDD+  YQQG K +Q
Sbjct: 597  PPLP-QNANRAAKRDQR--TADRNDRFGSGSEQGKSQDMLSQSGGPDDDPQYQQGYKGNQ 653

Query: 522  EDQYGAGNSFRNDESESEDEAP 457
            +D          D+SESEDEAP
Sbjct: 654  DD--------HPDDSESEDEAP 667


>ref|XP_002523201.1| conserved hypothetical protein [Ricinus communis]
            gi|223537608|gb|EEF39232.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 702

 Score =  852 bits (2200), Expect = 0.0
 Identities = 455/716 (63%), Positives = 489/716 (68%), Gaps = 14/716 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPT--DPSV 2314
            M+D++G LSFDFEGGLD++  T NP TAS P I SD +               +  DP+ 
Sbjct: 1    MDDTDGGLSFDFEGGLDSSGPT-NP-TASIPAIPSDNTAAVAAATNNSIVPNVSSNDPAS 58

Query: 2313 PGV----NPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQ 2146
                   N A RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR++GECREQ
Sbjct: 59   AAAAAANNQAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLYGECREQ 118

Query: 2145 DCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTS 1966
            DCVYKHT+EDIKECNMYKLGFCPNGPDCRYRHAKL     PVEEVLQKIQ LNSYNY +S
Sbjct: 119  DCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKLPGPPPPVEEVLQKIQQLNSYNYGSS 178

Query: 1965 NKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQN- 1789
            NKFFQQR AGF Q A+K+Q +QG   + QG+  KP   ES N               Q+ 
Sbjct: 179  NKFFQQRGAGFQQHADKSQFSQGPNNMGQGMAAKPPGTESANVQQPQQQQPQPGQGQQSQ 238

Query: 1788 ------PIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE 1627
                  P  N+PNG PNQANRTA PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE
Sbjct: 239  QQATQTPTQNLPNGQPNQANRTAIPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE 298

Query: 1626 AKLNEAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVK 1447
            AKLNEAFDS ENVILIFSVNRTRHFQGCAKM S+IG SV GGNWKYAHGTAHYGRNFSVK
Sbjct: 299  AKLNEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGASVGGGNWKYAHGTAHYGRNFSVK 358

Query: 1446 WLKLCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXX 1267
            WLKLCELSFHKTRHLRNP+NENLPVKISRDCQELEPS+G QLA LLY EPDSELM     
Sbjct: 359  WLKLCELSFHKTRHLRNPYNENLPVKISRDCQELEPSVGGQLACLLYDEPDSELMAISLA 418

Query: 1266 XXXXXXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXX 1087
                      KGVNP+N G+NPDIVPF                S  Q  GA  Q      
Sbjct: 419  AEAKREEEKAKGVNPENGGDNPDIVPFEDNEEEEEEESEEEEESFGQALGAPGQGRGRGR 478

Query: 1086 GVMWPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYG 907
            G++W PHMPLARGARP+PGM+GFPP+MMGAD   YGPVTPDGF MPDLFGV PR F PY 
Sbjct: 479  GIIW-PHMPLARGARPIPGMRGFPPMMMGADSFSYGPVTPDGFGMPDLFGVAPRGFTPYA 537

Query: 906  PRFSSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRX 727
            PRFS DF G +SGMMF GRP QPG V            GRAPFMGGMG   TNP R    
Sbjct: 538  PRFSGDFTGAASGMMFPGRPPQPGGVFPNGGFGMMMGPGRAPFMGGMGPNSTNPLRG--- 594

Query: 726  XXXXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHY 547
                       PL   +  R  KRDQR  AND   R+S GSDQ       AG PDDEA Y
Sbjct: 595  --NWPGGMPFPPLPTPSPQRPVKRDQRMTAND---RYSTGSDQ---GRNTAGEPDDEARY 646

Query: 546  QQ-GLKPHQEDQYGAGNSFRNDESESEDEAPXXXXXXXXXXXXXXXXXXGATGSDH 382
            QQ GLK   EDQ+GAGNSFRNDESESEDEAP                     GSDH
Sbjct: 647  QQEGLKASHEDQFGAGNSFRNDESESEDEAPRRSRHGEGKKKRRGSEGDATPGSDH 702


>ref|XP_008459517.1| PREDICTED: cleavage and polyadenylation specificity factor
            CPSF30-like [Cucumis melo]
          Length = 708

 Score =  850 bits (2197), Expect = 0.0
 Identities = 448/689 (65%), Positives = 482/689 (69%), Gaps = 12/689 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPG-TASGPLIQSDPSXXXXXXXXXXXXXXPTDPSV- 2314
            MEDSEGVLSFDFEGGLDA    TNP  T+S PLI SD S                 P+V 
Sbjct: 1    MEDSEGVLSFDFEGGLDAGP--TNPAATSSLPLINSDSSAPPAASAVSNSLSGALGPAVS 58

Query: 2313 ---PGVNPAS---RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECR 2152
               PG  P +   RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFR++GECR
Sbjct: 59   AEPPGAPPGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECR 118

Query: 2151 EQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYN 1972
            EQDCVYKHT+EDIKECNMYK GFCPNGPDCRYRHAKL     PVEE+LQKIQ+L SYNY 
Sbjct: 119  EQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPVEEILQKIQHLGSYNYG 178

Query: 1971 TSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQ 1792
             SNKFF QR  G SQQ EK+Q  Q      QGV GKPSA ES N                
Sbjct: 179  PSNKFFTQRGVGLSQQNEKSQFPQVPAITTQGVTGKPSAAESANVQQQQGQQSAPQASQ- 237

Query: 1791 NPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1612
             P+ N+ NG PNQ NR A+ LPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE
Sbjct: 238  TPVQNLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 297

Query: 1611 AFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLC 1432
            AFD+ +NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYG+NFS+KWLKLC
Sbjct: 298  AFDTADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGQNFSLKWLKLC 357

Query: 1431 ELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXX 1252
            ELSF KTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPD ELM          
Sbjct: 358  ELSFQKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDGELMAVSIAAESKR 417

Query: 1251 XXXXXKGVNPDNSGENPDIVPF-XXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMW 1075
                 KGVNPD   ENPDIVPF                 S  Q  G   Q      G+MW
Sbjct: 418  EEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSVGLPPQGRGRGRGMMW 477

Query: 1074 PPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYG--PR 901
            PP MP+ RGARP  GMQGFPP MMG DG  YGPVTPDGF MPD+FG+ PR F PYG  PR
Sbjct: 478  PPQMPIGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMAPRGFGPYGPTPR 537

Query: 900  FSSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGR-APFMGGMGVQGTNPNRAVRXX 724
            FSSDFMGP + MMFRGRP+QPG++            GR  PFMGGMGV G NP R  R  
Sbjct: 538  FSSDFMGPPTAMMFRGRPSQPGAMFPPGGFGMMMGQGRGGPFMGGMGVTGANPARPGRPV 597

Query: 723  XXXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHYQ 544
                        S QN NR  KRDQRG  ND   ++ VG DQ KG E Q+ G DDE  Y+
Sbjct: 598  GVSPLYPPPAVPSSQNMNRAIKRDQRGLTND---KYIVGIDQNKGLEIQSSGRDDEMQYK 654

Query: 543  QGLKPHQEDQYGAGNSFRNDESESEDEAP 457
            QG K + ++QYG G +FRN+ESESEDEAP
Sbjct: 655  QGSKAYSDEQYGTGTTFRNEESESEDEAP 683


>ref|XP_004141524.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor
            30-like [Cucumis sativus] gi|700197436|gb|KGN52613.1|
            hypothetical protein Csa_5G647360 [Cucumis sativus]
          Length = 707

 Score =  850 bits (2196), Expect = 0.0
 Identities = 443/688 (64%), Positives = 480/688 (69%), Gaps = 11/688 (1%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPG-TASGPLIQSDPSXXXXXXXXXXXXXXPTDPSV- 2314
            MEDSEGVLSFDFEGGLDA    TNP  T+S P+I SD S                 P+V 
Sbjct: 1    MEDSEGVLSFDFEGGLDAGP--TNPAATSSLPIINSDSSAPPAASAVSNPLSGALGPAVS 58

Query: 2313 ------PGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECR 2152
                  P  N  +RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFR++GECR
Sbjct: 59   AEPTGAPHGNVGNRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRLYGECR 118

Query: 2151 EQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYN 1972
            EQDCVYKHT+EDIKECNMYK GFCPNGPDCRYRHAKL     P+EE+LQKIQ+L SYNY 
Sbjct: 119  EQDCVYKHTNEDIKECNMYKFGFCPNGPDCRYRHAKLPGPPPPLEEILQKIQHLGSYNYG 178

Query: 1971 TSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQ 1792
             SNKFF QR  G SQQ EK+Q  Q    V QGV GKPSA ES N                
Sbjct: 179  PSNKFFTQRGVGLSQQNEKSQFPQVPALVTQGVTGKPSAAESVNVQQQQGQQSAPQASQ- 237

Query: 1791 NPIVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 1612
             P+ ++ NG PNQ NR A+ LPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE
Sbjct: 238  TPVQSLSNGQPNQLNRNATSLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNE 297

Query: 1611 AFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLC 1432
            AFDS +NVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGT HYG+NFS+KWLKLC
Sbjct: 298  AFDSADNVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTPHYGQNFSLKWLKLC 357

Query: 1431 ELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXX 1252
            ELSF KTRHLRNP+NENLPVKISRDCQELEPS+GEQLASLLYLEPD ELM          
Sbjct: 358  ELSFQKTRHLRNPYNENLPVKISRDCQELEPSVGEQLASLLYLEPDGELMAVSVAAESKR 417

Query: 1251 XXXXXKGVNPDNSGENPDIVPF-XXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMW 1075
                 KGVNPD   ENPDIVPF                 S  Q  G   Q      G+MW
Sbjct: 418  EEEKAKGVNPDIGSENPDIVPFEDNEEEEEEESEEEEEESFGQSAGLPPQGRGRGRGMMW 477

Query: 1074 PPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYG--PR 901
            PPHMP+ RGARP  GMQGFPP MMG DG  YGPVTPDGF MPD+FG+ PR F PYG  PR
Sbjct: 478  PPHMPMGRGARPFHGMQGFPPGMMGPDGLSYGPVTPDGFPMPDIFGMTPRGFGPYGPTPR 537

Query: 900  FSSDFMGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXX 721
            FS DFMGP + MMFRGRP+QP ++            GR PFMGGMGV G NP R  R   
Sbjct: 538  FSGDFMGPPTAMMFRGRPSQPAAMFPPSGFGMMMGQGRGPFMGGMGVAGANPARPGRPVG 597

Query: 720  XXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQEGQAGGPDDEAHYQQ 541
                       S QN NR  KRDQRG  ND   R+ VG DQ KG E Q+ G D+E  Y+Q
Sbjct: 598  VSPLYPPPAVPSSQNMNRAIKRDQRGLTND---RYIVGMDQNKGVEIQSSGRDEEMQYKQ 654

Query: 540  GLKPHQEDQYGAGNSFRNDESESEDEAP 457
            G K + ++QYG G +FRN+ESESEDEAP
Sbjct: 655  GSKAYSDEQYGTGTTFRNEESESEDEAP 682


>ref|XP_012569987.1| PREDICTED: 30-kDa cleavage and polyadenylation specificity factor 30
            [Cicer arietinum]
          Length = 677

 Score =  849 bits (2193), Expect = 0.0
 Identities = 446/683 (65%), Positives = 486/683 (71%), Gaps = 6/683 (0%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAA---AATTN-PGTASGPLIQSDPSXXXXXXXXXXXXXXPTDP 2320
            MEDSEGVLSFDFEGGLDAA   AAT + P   SGP++  D S                  
Sbjct: 1    MEDSEGVLSFDFEGGLDAAPPSAATVSVPAPPSGPIVHPDSSLPPSISSNGAAA---VSG 57

Query: 2319 SVPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDC 2140
            ++PG     RRSFRQTVCRHWLRSLCMKG+ACGFLHQYDK+RMP+CRFFR++GECREQDC
Sbjct: 58   NIPG-----RRSFRQTVCRHWLRSLCMKGEACGFLHQYDKARMPVCRFFRLYGECREQDC 112

Query: 2139 VYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNK 1960
            VYKHT+EDIKECNMYKLGFCPNGPDCRYRHAK      P+EEVLQKIQ+L SYN+N S+K
Sbjct: 113  VYKHTNEDIKECNMYKLGFCPNGPDCRYRHAKSPGPPPPIEEVLQKIQHLYSYNFNNSHK 172

Query: 1959 FFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXXQNPIV 1780
            F QQR + ++QQ EK+Q  QG  + NQGV GKP A ES N               Q    
Sbjct: 173  FIQQRGSSYTQQVEKSQFPQGINSANQGVAGKPLAAESGNVQQQQQVQQSQQQVSQIQTQ 232

Query: 1779 NVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAFDS 1600
            N+ NG PNQANRTA+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNE+KLNEAFDS
Sbjct: 233  NLANGQPNQANRTATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNESKLNEAFDS 292

Query: 1599 TENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCELSF 1420
             ENVILIFSVNRTRHFQGCAKM SRIGGSV+GGNWKYAHGTAHYGRNFSVKWLKLCELSF
Sbjct: 293  VENVILIFSVNRTRHFQGCAKMTSRIGGSVAGGNWKYAHGTAHYGRNFSVKWLKLCELSF 352

Query: 1419 HKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXXXX 1240
            HKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM              
Sbjct: 353  HKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISIAAESKREEEK 412

Query: 1239 XKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPHMP 1060
             KGVNPDN+GENPDIVPF                S  Q      Q      G+MWPPHMP
Sbjct: 413  AKGVNPDNAGENPDIVPFEDNEEEEEEESDEEEESFVQAVVPVGQGRGRGRGMMWPPHMP 472

Query: 1059 LARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDFMG 880
            L RGARPMPGMQGF PVMMG DG  YGP  PDGF MPDLFG+GPR F PYGPRFS DF G
Sbjct: 473  LGRGARPMPGMQGFNPVMMG-DGLSYGPGAPDGFGMPDLFGMGPRGFGPYGPRFSGDFAG 531

Query: 879  PSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXXXXXXXXX 700
            P + MMFRGRP+QPG +            GR PFMGGMGV G NP R  R          
Sbjct: 532  PPAAMMFRGRPSQPG-MFPGGGFGMMMNPGRGPFMGGMGVPGPNPPRGGRPLNMPPMFPP 590

Query: 699  XXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHYQQGLKPH 526
              P   QN NR+ KRDQR   NDRN+R+S G +Q K Q+   Q+GGPDDE  YQQ   P 
Sbjct: 591  PPP-PPQNVNRIAKRDQR--TNDRNDRYSSGQEQGKSQDMLSQSGGPDDEMQYQQSGAP- 646

Query: 525  QEDQYGAGNSFRNDESESEDEAP 457
                    N+FRN++SESEDEAP
Sbjct: 647  -------ANNFRNEDSESEDEAP 662


>ref|XP_006448924.1| hypothetical protein CICLE_v10014454mg [Citrus clementina]
            gi|557551535|gb|ESR62164.1| hypothetical protein
            CICLE_v10014454mg [Citrus clementina]
          Length = 701

 Score =  845 bits (2182), Expect = 0.0
 Identities = 455/691 (65%), Positives = 488/691 (70%), Gaps = 14/691 (2%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPG--TASGPLIQSDPSXXXXXXXXXXXXXXPTDP-- 2320
            MEDSEG LSFDFEGGLDA      PG  TAS P IQSD +               +    
Sbjct: 1    MEDSEGGLSFDFEGGLDAG-----PGMPTASNPAIQSDSTAAAAAAAANANHAALSSSGA 55

Query: 2319 -----SVPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGEC 2155
                 S P  + + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+FGEC
Sbjct: 56   APDHASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGEC 115

Query: 2154 REQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNY 1975
            REQDCVYKHT+EDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQ ++SYN+
Sbjct: 116  REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNH 175

Query: 1974 NTSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXX 1795
               NK FQQR A FS Q +K+Q +QG  AVNQG  GK S  ES N               
Sbjct: 176  GNPNKLFQQRGA-FSHQIDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGT 234

Query: 1794 QNP-IVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1618
            Q   + N+PNGLPNQ NR A+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 235  QTTQMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1617 NEAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLK 1438
            NEAFDS ENVILIFSVNRTRHFQGCAKM S+IGGSV GGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1437 LCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXX 1258
            LCELSFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELM        
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEA 414

Query: 1257 XXXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVM 1078
                   KGVNPDN G+NPDIVPF                SL    G A+Q      G+M
Sbjct: 415  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESL----GTASQGRGRGRGMM 470

Query: 1077 WPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRF 898
            WP  MPLARGARP+PGM+GFPP+M+GADG  YG VTPDGF MPDLFGV PR F PYGPRF
Sbjct: 471  WPGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRF 529

Query: 897  SSDFMGPSSGMMFRGRPTQPGSV-XXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXX 721
            S DF GP  GMMF GRP QPGSV             GR PFMGGMG   TNP        
Sbjct: 530  SGDFTGP-GGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGG--RPV 586

Query: 720  XXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHY 547
                     P S QN++RV KRD RG  NDRN+R+S GSDQ + QE  G   GPDDE  Y
Sbjct: 587  GVPPPFPNQPQSSQNSSRVAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQY 646

Query: 546  QQ-GLKPHQEDQYGAGNSFRNDESESEDEAP 457
            QQ G K +QEDQYG+ N FRNDESESEDEAP
Sbjct: 647  QQEGSKANQEDQYGSRN-FRNDESESEDEAP 676


>gb|KDO75297.1| hypothetical protein CISIN_1g005338mg [Citrus sinensis]
          Length = 701

 Score =  843 bits (2178), Expect = 0.0
 Identities = 455/691 (65%), Positives = 488/691 (70%), Gaps = 14/691 (2%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPG--TASGPLIQSDPSXXXXXXXXXXXXXXPTDP-- 2320
            MEDSEG LSFDFEGGLDA      PG  TAS P IQSD +              P+    
Sbjct: 1    MEDSEGGLSFDFEGGLDAG-----PGMPTASNPAIQSDSTAAAAAAAANANHAAPSSSGA 55

Query: 2319 -----SVPGVNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGEC 2155
                 S P  + + RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFR+FGEC
Sbjct: 56   APDHASAPVPHHSGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRLFGEC 115

Query: 2154 REQDCVYKHTHEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNY 1975
            REQDCVYKHT+EDIKECNMYKLGFCPNGPDCRYRH KL      VEEVLQKIQ ++SYN+
Sbjct: 116  REQDCVYKHTNEDIKECNMYKLGFCPNGPDCRYRHVKLPGPPPSVEEVLQKIQQISSYNH 175

Query: 1974 NTSNKFFQQRNAGFSQQAEKTQLAQGSTAVNQGVVGKPSAMESTNAXXXXXXXXXXXXXX 1795
               NK FQQR A FS Q +K+Q +QG  AVNQG  GK S  ES N               
Sbjct: 176  GNPNKHFQQRGA-FSHQTDKSQFSQGPNAVNQGAAGKSSTAESANVHQQQLVQQPQQQGT 234

Query: 1794 QNP-IVNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 1618
            Q   + N+PNGLPNQ NR A+PLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL
Sbjct: 235  QTTQMQNLPNGLPNQTNRNATPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKL 294

Query: 1617 NEAFDSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLK 1438
            NEAFDS ENVILIFSVNRTRHFQGCAKM S+IGGSV GGNWKYAHGTAHYGRNFSVKWLK
Sbjct: 295  NEAFDSAENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLK 354

Query: 1437 LCELSFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXX 1258
            LCELSFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLA+LLYLEPDSELM        
Sbjct: 355  LCELSFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLAALLYLEPDSELMAISVAAEA 414

Query: 1257 XXXXXXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVM 1078
                   KGVNPDN G+NPDIVPF                SL    G A+Q      G+M
Sbjct: 415  KREEEKAKGVNPDNGGDNPDIVPFEDNEEEEEEESEEEEESL----GTASQGRGRGRGMM 470

Query: 1077 WPPHMPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRF 898
            WP  MPLARGARP+PGM+GFPP+M+GADG  YG VTPDGF MPDLFGV PR F PYGPRF
Sbjct: 471  WPGPMPLARGARPVPGMRGFPPMMIGADGFSYG-VTPDGFPMPDLFGVAPRPFAPYGPRF 529

Query: 897  SSDFMGPSSGMMFRGRPTQPGSV-XXXXXXXXXXXXGRAPFMGGMGVQGTNPNRAVRXXX 721
            S DF GP  GMMF GRP QPGSV             GR PFMGGMG   TNP        
Sbjct: 530  SGDFTGP-GGMMFPGRPPQPGSVFPPNGFGGMMMGPGRPPFMGGMGPAATNPRGG--RPV 586

Query: 720  XXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--GQAGGPDDEAHY 547
                     P S QN++R  KRD RG  NDRN+R+S GSDQ + QE  G   GPDDE  Y
Sbjct: 587  GVPPPFPNQPQSSQNSSRAAKRDVRGSINDRNDRYSAGSDQGRAQEMGGPGRGPDDEVQY 646

Query: 546  QQ-GLKPHQEDQYGAGNSFRNDESESEDEAP 457
            QQ G K +QEDQYG+ N FRNDESESEDEAP
Sbjct: 647  QQEGSKANQEDQYGSRN-FRNDESESEDEAP 676


>ref|XP_010241185.1| PREDICTED: cleavage and polyadenylation specificity factor CPSF30
            [Nelumbo nucifera]
          Length = 715

 Score =  838 bits (2165), Expect = 0.0
 Identities = 448/701 (63%), Positives = 490/701 (69%), Gaps = 24/701 (3%)
 Frame = -3

Query: 2487 MEDSEGVLSFDFEGGLDAAAATTNPGTASGPLIQSDPSXXXXXXXXXXXXXXPTDPSVPG 2308
            MED EGVLSFDFEGGLD     TNP T S PLI +D S                +P   G
Sbjct: 1    MEDPEGVLSFDFEGGLDNGP--TNP-TPSAPLIPADSSIAAAANSAVAPAV--VEPVAGG 55

Query: 2307 VNPASRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPICRFFRMFGECREQDCVYKH 2128
               A RRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMP+CRFFRM+GECREQDCVYKH
Sbjct: 56   --HAGRRSFRQTVCRHWLRSLCMKGDACGFLHQYDKSRMPVCRFFRMYGECREQDCVYKH 113

Query: 2127 THEDIKECNMYKLGFCPNGPDCRYRHAKLXXXXXPVEEVLQKIQNLNSYNYNTSNKFFQQ 1948
            T+EDIKECNMYK GFCPNGPDCRYRHAK      PVEEV QKIQ+L S+NY +SN+FFQQ
Sbjct: 114  TNEDIKECNMYKFGFCPNGPDCRYRHAKQPGPPPPVEEVFQKIQHLGSFNYGSSNRFFQQ 173

Query: 1947 RNAGFSQQAEKTQLAQGSTAVNQGVVGKPS-AMESTNAXXXXXXXXXXXXXXQNPI---- 1783
            R   +  Q+E++Q  QGS+ VNQG+  KPS A ES N               Q  +    
Sbjct: 174  RIGSYVPQSERSQFPQGSSNVNQGIASKPSTAAESPNVQQQQQQSQIQQPQQQQQVNQTQ 233

Query: 1782 -VNVPNGLPNQANRTASPLPQGISRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 1606
              N  NGLPNQA+RTA+PLPQG SRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF
Sbjct: 234  MQNPQNGLPNQASRTATPLPQGSSRYFIVKSCNRENLELSVQQGVWATQRSNEAKLNEAF 293

Query: 1605 DSTENVILIFSVNRTRHFQGCAKMMSRIGGSVSGGNWKYAHGTAHYGRNFSVKWLKLCEL 1426
            DS ENVILIFSVNRTRHFQGCAKM S+IGGSV GGNWKYAHGTAHYGRNFSVKWLKLCEL
Sbjct: 294  DSVENVILIFSVNRTRHFQGCAKMTSKIGGSVGGGNWKYAHGTAHYGRNFSVKWLKLCEL 353

Query: 1425 SFHKTRHLRNPFNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMXXXXXXXXXXXX 1246
            SFHKTRHLRNP+NENLPVKISRDCQELEPSIGEQLASLLYLEPDSELM            
Sbjct: 354  SFHKTRHLRNPYNENLPVKISRDCQELEPSIGEQLASLLYLEPDSELMAISVAAESKREE 413

Query: 1245 XXXKGVNPDNSGENPDIVPFXXXXXXXXXXXXXXXXSLSQVPGAANQXXXXXXGVMWPPH 1066
               KGVNPD   +N DIVPF                S  Q   AA Q      GVMWPPH
Sbjct: 414  EKAKGVNPDEGADNHDIVPFEDNEDEEEEESEEEDESFGQAINAA-QGRGRGRGVMWPPH 472

Query: 1065 MPLARGARPMPGMQGFPPVMMGADGSPYGPVTPDGFAMPDLFGVGPRAFNPYGPRFSSDF 886
            MPLARG RP+PG++GFPPVMMGADG  YG VTPDGF+MPDLFG+ PRAF PYGPRFS DF
Sbjct: 473  MPLARGGRPIPGIRGFPPVMMGADGFSYGAVTPDGFSMPDLFGIAPRAFAPYGPRFSGDF 532

Query: 885  ----------------MGPSSGMMFRGRPTQPGSVXXXXXXXXXXXXGRAPFMGGMGVQG 754
                             GP+ GM+F GRP+QPG+V            GRAPFMGGMG+ G
Sbjct: 533  TGLGQSAAMGFNPIDGTGPTPGMVFHGRPSQPGAVFPPSGLGMMMGPGRAPFMGGMGI-G 591

Query: 753  TNPNRAVRXXXXXXXXXXXXPLSLQNTNRVTKRDQRGPANDRNERFSVGSDQLKGQE--G 580
              P RA R            PL  Q+++RV  +DQR P  DRN+R+S GSDQ KGQE   
Sbjct: 592  AAPPRASRPIGMPPFRPPAPPLP-QSSSRVVNKDQRRP-TDRNDRYSAGSDQGKGQEMAM 649

Query: 579  QAGGPDDEAHYQQGLKPHQEDQYGAGNSFRNDESESEDEAP 457
              GGP+DE  YQ G++   +D +  GNSFRNDESESEDEAP
Sbjct: 650  SGGGPEDEMKYQPGMRTQHDDSFAVGNSFRNDESESEDEAP 690

