BLASTX nr result

ID: Rauwolfia21_contig00001145 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00001145
         (3824 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus pe...  1884   0.0  
ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  1872   0.0  
gb|EOY22975.1| Cleavage and polyadenylation specificity factor 1...  1871   0.0  
gb|EOY22974.1| Cleavage and polyadenylation specificity factor 1...  1871   0.0  
emb|CBI24510.3| unnamed protein product [Vitis vinifera]             1865   0.0  
ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec...  1860   0.0  
ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec...  1846   0.0  
ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec...  1836   0.0  
ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr...  1835   0.0  
ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec...  1831   0.0  
ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr...  1830   0.0  
ref|XP_002510905.1| cleavage and polyadenylation specificity fac...  1821   0.0  
ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec...  1812   0.0  
ref|XP_006587381.1| PREDICTED: cleavage and polyadenylation spec...  1810   0.0  
ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec...  1810   0.0  
gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus...  1804   0.0  
gb|EXC20897.1| Cleavage and polyadenylation specificity factor s...  1798   0.0  
ref|XP_002318462.2| cleavage and polyadenylation specificity fac...  1795   0.0  
ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec...  1789   0.0  
ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec...  1789   0.0  

>gb|EMJ21509.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica]
          Length = 1459

 Score = 1884 bits (4881), Expect = 0.0
 Identities = 924/1190 (77%), Positives = 1046/1190 (87%), Gaps = 8/1190 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 270  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 329

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            +IHYHSQS+SC LALN++A+S+D SQ+M RSSF VELD ANATWL NDVA+LSTKTGELL
Sbjct: 330  SIHYHSQSASCALALNSYAVSADNSQEMPRSSFTVELDTANATWLLNDVALLSTKTGELL 389

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542
             LTLVYDGR+VQRL+LSKS+A+VLTS IT +G+S FFLGSRLGDSLLVQFTCG G   L 
Sbjct: 390  LLTLVYDGRVVQRLDLSKSKASVLTSGITKVGNSLFFLGSRLGDSLLVQFTCGVGGSVLS 449

Query: 543  --VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +K+EVGDIEG APLAKRLR SSSDALQDMV+GEELSLYGS PNNA SAQ+SFSF VRD
Sbjct: 450  SDMKDEVGDIEGDAPLAKRLRMSSSDALQDMVSGEELSLYGSAPNNAESAQKSFSFAVRD 509

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 510  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHKN R    DSSKIAA DDE+HAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 570  VELPGCKGIWTVYHKNARGHNADSSKIAASDDEFHAYLIISLEARTMVLETADLLSEVTE 629

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRV+QVY  GARILDG+FMTQ+LSF   NSE+GSGSE   V
Sbjct: 630  SVDYFVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQDLSFGTSNSEMGSGSESSTV 689

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSI DPYVLLRM+DG I+LLVGDPS CTVS   P  FESSK+SISACTLYHDKGPEPW
Sbjct: 690  LSVSIVDPYVLLRMSDGGIRLLVGDPSLCTVSTSIPAAFESSKKSISACTLYHDKGPEPW 749

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTGI EAIDG+D  SHDQGD+YC+VCYESG LEIFDVPNF+ V+SVDKFV
Sbjct: 750  LRKTSTDAWLSTGIDEAIDGADGVSHDQGDVYCVVCYESGSLEIFDVPNFNCVFSVDKFV 809

Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG A+L+D ++ DP     K+  K++++V+  G+KE IQN+++VELAM RW GQHSRPFL
Sbjct: 810  SGNAHLIDTLMRDPPKDPQKLINKSSEEVSGQGRKENIQNMKVVELAMQRWSGQHSRPFL 869

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            FGIL+DG ILCYH+Y+F+  + ASK E++ ++Q    + ++S SRLRNLRF+RV LDTYA
Sbjct: 870  FGILNDGMILCYHAYLFEGPETASKTEDSASAQNTTGVSNLSASRLRNLRFVRVPLDTYA 929

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            +++TS  T CQR+TIFKN+ G QGLF+SGSRP+W+M+FRERLRIHPQLCDG++VA TVLH
Sbjct: 930  KKDTSNETSCQRMTIFKNIAGYQGLFLSGSRPAWFMVFRERLRIHPQLCDGSVVAVTVLH 989

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330
            NVNCNHG IYVTSQG LKICQLP   SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS
Sbjct: 990  NVNCNHGLIYVTSQGILKICQLPPITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVS 1049

Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498
             PV KPLNQVLSSLVDQE G Q+EN          +Y V+EFE+RIMEPD+SGGPWQT+A
Sbjct: 1050 VPVHKPLNQVLSSLVDQEVGHQVENHNLSSDELHRTYSVDEFEIRIMEPDKSGGPWQTKA 1109

Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678
            TIPMQ+SENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVA RGRVLLFS  K+ DN+
Sbjct: 1110 TIPMQTSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAGRGRVLLFSAGKSADNT 1169

Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858
            QTLVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNGVAFFD PPLYVVS+
Sbjct: 1170 QTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGVAFFDVPPLYVVSL 1229

Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038
            NIVKNFILLGD+HKSIYFL+WKEQG+QL LL+KDFG LDC ATEFLIDGSTLSL+V+DEQ
Sbjct: 1230 NIVKNFILLGDVHKSIYFLSWKEQGAQLTLLAKDFGNLDCFATEFLIDGSTLSLVVADEQ 1289

Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218
            KNIQ+FYYAPK+  SWKGQKLLSRAEFH+G HVTKF+RLQ+L TSSD+    PGSDK NR
Sbjct: 1290 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGTHVTKFLRLQMLSTSSDRTGTNPGSDKTNR 1349

Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398
            + LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPR+FRQF+SNG+AHRP
Sbjct: 1350 YALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVHHVAGLNPRAFRQFQSNGKAHRP 1409

Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            GPD IVDCELL HYEMLPLEEQLEIA Q+GTTR+QI SNLN+L++GTSFL
Sbjct: 1410 GPDTIVDCELLSHYEMLPLEEQLEIANQIGTTRSQIFSNLNDLSIGTSFL 1459


>ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 1872 bits (4849), Expect = 0.0
 Identities = 922/1188 (77%), Positives = 1042/1188 (87%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLL VPSPIGGV+V+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            +IHYHSQS+SC LALNN+A+S+D SQ+M RSSF+VELDAANATWL+NDVAMLSTKTGELL
Sbjct: 328  SIHYHSQSASCALALNNYAVSADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELL 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542
             LTL YDGR+V RL+LSKSRA+VLTS I  IG+S FFLGSRLGDSLLVQFT     L   
Sbjct: 388  LLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGNSLFFLGSRLGDSLLVQFT---SILSSS 444

Query: 543  VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRDSL 722
            VKEEVGDIEG  P AKRLR+SSSDALQDMVNGEELSLYGS PN+  ++Q++FSF+VRDS 
Sbjct: 445  VKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEELSLYGSAPNSTETSQKTFSFSVRDSF 504

Query: 723  INIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQES 902
            IN+GPLKDF+ GLR NAD  A+G AKQSNYELVCCSGHGKNGALC+LQ+SIRPEMIT+  
Sbjct: 505  INVGPLKDFAYGLRINADPKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVE 564

Query: 903  LPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTENV 1082
            LPGCKG+WTVYHKN R    DS+K+A  DDEYHAYLIISLE+RTMVL+TA+ L EVTE+V
Sbjct: 565  LPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHAYLIISLESRTMVLETADLLGEVTESV 624

Query: 1083 DYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMVSS 1262
            DYY QG T++AGNLFGRRRV+QVYA GARILDGAFMTQ+L            SE   V S
Sbjct: 625  DYYVQGCTISAGNLFGRRRVVQVYARGARILDGAFMTQDLPI----------SESSTVLS 674

Query: 1263 VSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPWLR 1442
            VSIADPYVLLRM+DG+IQLLVGDPS CTVSI  P +FESSK+SISACTLYHDKGPEPWLR
Sbjct: 675  VSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIPAVFESSKKSISACTLYHDKGPEPWLR 734

Query: 1443 RTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFVSG 1622
            +TSTDAWLSTGIGEAIDG+D A+ DQGDIYC+V YESG LEIFDVPNF+ V+SVDKF+SG
Sbjct: 735  KTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSG 794

Query: 1623 KANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
             A+LVD +IL+P++   KV  K +++    G+KE   NI++VELAM RW GQHSRPFLFG
Sbjct: 795  NAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFG 854

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            IL+DGTILCYH+Y+++  ++  K EEAV++Q   +I ++S SRLRNLRF+RV LDTY RE
Sbjct: 855  ILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTRE 914

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            E   GT   R+T+FKN+GGCQGLF+SGSRP W+M+FRER+R+HPQLCDG+IVAFTVLHN+
Sbjct: 915  EALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNI 974

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCNHG IYVTSQG LKICQLP+  SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS P
Sbjct: 975  NCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVP 1034

Query: 2337 VLKPLNQVLSSLVDQEAGQQIENDV----SFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            VLKPLN VLSSLVDQEAG Q+END         SY V+EFEVR++EP++SG PWQTRATI
Sbjct: 1035 VLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATI 1094

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV KN DNSQ 
Sbjct: 1095 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQN 1154

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSE+YSKELKGAISA+ASLQGHLLIASGPKI LHKW G+ELNGVAFFDAPPLYVVS+NI
Sbjct: 1155 LVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLYVVSLNI 1214

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFILLGDIH+SIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSD+QKN
Sbjct: 1215 VKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLIVSDDQKN 1274

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+LP SSD+ +A  GSDK NRF 
Sbjct: 1275 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPASSDRTSATQGSDKTNRFA 1334

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQFRSNG+AHRPGP
Sbjct: 1335 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGP 1394

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            DNIVDCELLCHYEMLP EEQLEIAQQ+GTTR QI+SNLN+L+LGTSFL
Sbjct: 1395 DNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSNLNDLSLGTSFL 1442


>gb|EOY22975.1| Cleavage and polyadenylation specificity factor 160 isoform 2
            [Theobroma cacao]
          Length = 1257

 Score = 1871 bits (4847), Expect = 0.0
 Identities = 914/1190 (76%), Positives = 1045/1190 (87%), Gaps = 8/1190 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 68   LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 127

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+AIS D SQD+ RS+F+VELDAANATWL NDVA+LSTKTGELL
Sbjct: 128  TIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELL 187

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LTL+YDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQF+ G+G  +LP
Sbjct: 188  LLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGVSALP 247

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+KEEVGDIEG  PLAKRLRRSSSDALQDMV GEELSLYGS PNN  SAQ++F F VRD
Sbjct: 248  SGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRD 307

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 308  SLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 367

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              L GCKG+WTVYHK+ RS   D SK+  DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 368  VELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTE 427

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRV+QVY  GARILDG+FMTQELS    NSE   GSE   V
Sbjct: 428  SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTV 487

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLLRMTDGSI LLVGDP+ CTVSI  P  FE SK+ +SACTLYHDKGPEPW
Sbjct: 488  ISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPW 547

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+ STDAWLSTG+GE+IDG+D   HDQGDIYC+VCYESG LEIFDVPNF+ V+S++KF 
Sbjct: 548  LRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFA 607

Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+  LVDA  L+ +  + KV  K+++++T  G+KE +QN+++VELAM RW   HSRPFL
Sbjct: 608  SGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFL 667

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            FGIL+DGTILCYH+Y+F+ S+ ASKVE++V +Q    + +++ SRLRNLRFIR+ LD Y 
Sbjct: 668  FGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYT 727

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REE S GT  QRITIFKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 728  REEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 787

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330
            NVNCNHGFIYVTSQG LKICQ+PS  +YDNYWPVQKIPL+GTPHQVTYFA++NLYP+IVS
Sbjct: 788  NVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVS 847

Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498
             PV KP+NQVLSSLVDQE G Q++N        + +Y V+EFEVRI+EP++SGGPW+T+A
Sbjct: 848  VPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKA 907

Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678
            TIPMQSSENALTVRVVTL+N+TT+ENE+LLAIGTAY+QGEDVAARGRV+L S+ +N DN 
Sbjct: 908  TIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNL 967

Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858
            Q LVSEVYSKELKGAISALASLQGHLLIASGPKI LH W GSELNG+AF+DAPPLYVVS+
Sbjct: 968  QNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSL 1027

Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038
            NIVKNFILLGD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQ
Sbjct: 1028 NIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQ 1087

Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218
            KNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ +A  GSDK NR
Sbjct: 1088 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNR 1147

Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398
            F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQF SNG+AHRP
Sbjct: 1148 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRP 1207

Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            GPD+IVDCELLCHYEMLPLEEQL+IA Q+GTTR+QI+SNLN+LTLGTSFL
Sbjct: 1208 GPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLTLGTSFL 1257


>gb|EOY22974.1| Cleavage and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao]
          Length = 1457

 Score = 1871 bits (4847), Expect = 0.0
 Identities = 914/1190 (76%), Positives = 1045/1190 (87%), Gaps = 8/1190 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+AIS D SQD+ RS+F+VELDAANATWL NDVA+LSTKTGELL
Sbjct: 328  TIHYHSQSASCALALNNYAISVDNSQDLPRSNFSVELDAANATWLLNDVALLSTKTGELL 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LTL+YDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQF+ G+G  +LP
Sbjct: 388  LLTLIYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFSGGSGVSALP 447

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+KEEVGDIEG  PLAKRLRRSSSDALQDMV GEELSLYGS PNN  SAQ++F F VRD
Sbjct: 448  SGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVGGEELSLYGSAPNNTESAQKTFLFAVRD 507

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 508  SLTNVGPLKDFSYGLRINADVNATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              L GCKG+WTVYHK+ RS   D SK+  DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 568  VELTGCKGIWTVYHKSTRSHSADLSKVTDDDDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRV+QVY  GARILDG+FMTQELS    NSE   GSE   V
Sbjct: 628  SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQELSIPSPNSESSPGSENSTV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLLRMTDGSI LLVGDP+ CTVSI  P  FE SK+ +SACTLYHDKGPEPW
Sbjct: 688  ISVSIADPYVLLRMTDGSILLLVGDPATCTVSINTPTAFEGSKKMVSACTLYHDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+ STDAWLSTG+GE+IDG+D   HDQGDIYC+VCYESG LEIFDVPNF+ V+S++KF 
Sbjct: 748  LRKASTDAWLSTGVGESIDGADGGPHDQGDIYCVVCYESGALEIFDVPNFNCVFSMEKFA 807

Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+  LVDA  L+ +  + KV  K+++++T  G+KE +QN+++VELAM RW   HSRPFL
Sbjct: 808  SGRTRLVDAYTLESSKDSEKVINKSSEELTGQGRKENVQNLKVVELAMQRWSANHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            FGIL+DGTILCYH+Y+F+ S+ ASKVE++V +Q    + +++ SRLRNLRFIR+ LD Y 
Sbjct: 868  FGILTDGTILCYHAYLFEGSENASKVEDSVVAQNSVGLSNINASRLRNLRFIRIPLDAYT 927

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REE S GT  QRITIFKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 928  REEMSNGTLSQRITIFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 987

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330
            NVNCNHGFIYVTSQG LKICQ+PS  +YDNYWPVQKIPL+GTPHQVTYFA++NLYP+IVS
Sbjct: 988  NVNCNHGFIYVTSQGILKICQIPSASNYDNYWPVQKIPLRGTPHQVTYFAERNLYPIIVS 1047

Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498
             PV KP+NQVLSSLVDQE G Q++N        + +Y V+EFEVRI+EP++SGGPW+T+A
Sbjct: 1048 VPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDELQRTYTVDEFEVRILEPEKSGGPWETKA 1107

Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678
            TIPMQSSENALTVRVVTL+N+TT+ENE+LLAIGTAY+QGEDVAARGRV+L S+ +N DN 
Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENESLLAIGTAYIQGEDVAARGRVILCSIGRNTDNL 1167

Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858
            Q LVSEVYSKELKGAISALASLQGHLLIASGPKI LH W GSELNG+AF+DAPPLYVVS+
Sbjct: 1168 QNLVSEVYSKELKGAISALASLQGHLLIASGPKIILHNWTGSELNGIAFYDAPPLYVVSL 1227

Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038
            NIVKNFILLGD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQ
Sbjct: 1228 NIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQ 1287

Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218
            KNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+ +A  GSDK NR
Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSSDRTSATAGSDKTNR 1347

Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398
            F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQF SNG+AHRP
Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFHSNGKAHRP 1407

Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            GPD+IVDCELLCHYEMLPLEEQL+IA Q+GTTR+QI+SNLN+LTLGTSFL
Sbjct: 1408 GPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLNDLTLGTSFL 1457


>emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 1865 bits (4832), Expect = 0.0
 Identities = 922/1194 (77%), Positives = 1042/1194 (87%), Gaps = 12/1194 (1%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLL VPSPIGGV+V+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            +IHYHSQS+SC LALNN+A+S+D SQ+M RSSF+VELDAANATWL+NDVAMLSTKTGELL
Sbjct: 328  SIHYHSQSASCALALNNYAVSADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELL 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPLG 542
             LTL YDGR+V RL+LSKSRA+VLTS I  IG+S FFLGSRLGDSLLVQFT     L   
Sbjct: 388  LLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGNSLFFLGSRLGDSLLVQFT---SILSSS 444

Query: 543  VKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQ------ESFSF 704
            VKEEVGDIEG  P AKRLR+SSSDALQDMVNGEELSLYGS PN+  ++Q      ++FSF
Sbjct: 445  VKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEELSLYGSAPNSTETSQVEAQVGKTFSF 504

Query: 705  TVRDSLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPE 884
            +VRDS IN+GPLKDF+ GLR NAD  A+G AKQSNYELVCCSGHGKNGALC+LQ+SIRPE
Sbjct: 505  SVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPE 564

Query: 885  MITQESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLE 1064
            MIT+  LPGCKG+WTVYHKN R    DS+K+A  DDEYHAYLIISLE+RTMVL+TA+ L 
Sbjct: 565  MITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHAYLIISLESRTMVLETADLLG 624

Query: 1065 EVTENVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSE 1244
            EVTE+VDYY QG T++AGNLFGRRRV+QVYA GARILDGAFMTQ+L            SE
Sbjct: 625  EVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGAFMTQDLPI----------SE 674

Query: 1245 GQMVSSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKG 1424
               V SVSIADPYVLLRM+DG+IQLLVGDPS CTVSI  P +FESSK+SISACTLYHDKG
Sbjct: 675  SSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIPAVFESSKKSISACTLYHDKG 734

Query: 1425 PEPWLRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSV 1604
            PEPWLR+TSTDAWLSTGIGEAIDG+D A+ DQGDIYC+V YESG LEIFDVPNF+ V+SV
Sbjct: 735  PEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVSYESGDLEIFDVPNFNCVFSV 794

Query: 1605 DKFVSGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHS 1778
            DKF+SG A+LVD +IL+P++   KV  K +++    G+KE   NI++VELAM RW GQHS
Sbjct: 795  DKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKENAHNIKVVELAMQRWSGQHS 854

Query: 1779 RPFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTL 1958
            RPFLFGIL+DGTILCYH+Y+++  ++  K EEAV++Q   +I ++S SRLRNLRF+RV L
Sbjct: 855  RPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSLSISNVSASRLRNLRFVRVPL 914

Query: 1959 DTYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAF 2138
            DTY REE   GT   R+T+FKN+GGCQGLF+SGSRP W+M+FRER+R+HPQLCDG+IVAF
Sbjct: 915  DTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFMVFRERIRVHPQLCDGSIVAF 974

Query: 2139 TVLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYP 2318
            TVLHN+NCNHG IYVTSQG LKICQLP+  SYDNYWPVQKIPLKGTPHQVTYFA+KNLYP
Sbjct: 975  TVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQKIPLKGTPHQVTYFAEKNLYP 1034

Query: 2319 LIVSFPVLKPLNQVLSSLVDQEAGQQIENDV----SFEGSYPVEEFEVRIMEPDRSGGPW 2486
            LIVS PVLKPLN VLSSLVDQEAG Q+END         SY V+EFEVR++EP++SG PW
Sbjct: 1035 LIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRSYSVDEFEVRVLEPEKSGAPW 1094

Query: 2487 QTRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKN 2666
            QTRATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV KN
Sbjct: 1095 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSVGKN 1154

Query: 2667 NDNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLY 2846
             DNSQ LVSE+YSKELKGAISA+ASLQGHLLIASGPKI LHKW G+ELNGVAFFDAPPLY
Sbjct: 1155 TDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWTGTELNGVAFFDAPPLY 1214

Query: 2847 VVSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMV 3026
            VVS+NIVKNFILLGDIH+SIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+V
Sbjct: 1215 VVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLIV 1274

Query: 3027 SDEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSD 3206
            SD+QKNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+LP SSD+ +A  GSD
Sbjct: 1275 SDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPASSDRTSATQGSD 1334

Query: 3207 KINRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGR 3386
            K NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNPRSFRQFRSNG+
Sbjct: 1335 KTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFRSNGK 1394

Query: 3387 AHRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            AHRPGPDNIVDCELLCHYEMLP EEQLEIAQQ+GTTR QI+SNLN+L+LGTSFL
Sbjct: 1395 AHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQILSNLNDLSLGTSFL 1448


>ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum tuberosum]
          Length = 1447

 Score = 1860 bits (4817), Expect = 0.0
 Identities = 914/1188 (76%), Positives = 1050/1188 (88%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT +GRVSWKHHTCMVSA SISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 262  LTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWSAANLPHDAYKLLAVPSPIGGVLVIGAN 321

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQSSSC LALNN A   D SQ+M RSSFNVELDAANATWLT+DVAMLSTKTGELL
Sbjct: 322  TIHYHSQSSSCSLALNNFAFFGDNSQEMPRSSFNVELDAANATWLTSDVAMLSTKTGELL 381

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LT++YDGRIVQ+L+LSKSRA+VLTS ITTIG S FFLGSRLGDSLLVQF+CG G  +LP
Sbjct: 382  LLTIIYDGRIVQKLDLSKSRASVLTSGITTIGDSLFFLGSRLGDSLLVQFSCGLGGSNLP 441

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             GV+EEVGDIE  AP AKRLR SSSDALQDM+NGEELSLYG+ PNNA SAQ++FSF VRD
Sbjct: 442  PGVQEEVGDIESDAPSAKRLRMSSSDALQDMINGEELSLYGTAPNNAQSAQKTFSFAVRD 501

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS G+R NAD NA+G AKQSNYELVCCSGHGKNG+LCVLQ+SIRPE ITQ
Sbjct: 502  SLINVGPLKDFSYGMRINADLNATGIAKQSNYELVCCSGHGKNGSLCVLQQSIRPETITQ 561

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
            E+LPGCKG+WTVYHKN R  + +SS++A ++DEYHAYLIISLE RTMVLQTANNLEEVTE
Sbjct: 562  EALPGCKGIWTVYHKNTRIHLSESSRMADEEDEYHAYLIISLEARTMVLQTANNLEEVTE 621

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            NVDYY QG+T+AAGNLFGRRRVIQV+A GARILDGAFMTQELSFK  N E GS S+  +V
Sbjct: 622  NVDYYVQGTTLAAGNLFGRRRVIQVFAHGARILDGAFMTQELSFKASNVESGSSSDTSIV 681

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
            +SVSIADPYVLLRMT+GS+QLLVGDPS+C+VS+  P +FESSK+SISACTLYHDKGPEPW
Sbjct: 682  ASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSLTVPSVFESSKKSISACTLYHDKGPEPW 741

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLS+G+GEAIDG+D  + DQGD+YC+VCYE+G LEIFDVPNF+ V+SVDKF+
Sbjct: 742  LRKTSTDAWLSSGMGEAIDGADGVTQDQGDVYCVVCYENGTLEIFDVPNFTCVFSVDKFI 801

Query: 1617 SGKANLVDAIILDPTDSAH-KVKTAQDVTAPGKKEQIQNIRI--VELAMHRWPGQHSRPF 1787
            SG+  LVD  + D  +  H   K  +DV  PG+KE  ++++I  VEL MHRW G+HSRPF
Sbjct: 802  SGRTYLVDTFMQDSVNGLHAHSKNTEDVIRPGQKENSKDVKINVVELMMHRWIGKHSRPF 861

Query: 1788 LFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTY 1967
            LFGIL+DGTIL YH+YVF+ S+ +SKVE +V+SQ   ++ S + SRLRNLRF+RV +D Y
Sbjct: 862  LFGILADGTILSYHAYVFEGSENSSKVEGSVSSQNSISLSSTNASRLRNLRFVRVPVDNY 921

Query: 1968 AREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVL 2147
            AREE   GT+ QR+ ++KN+GG QG+F++GSRPSW+M+FRERLRIHPQLCDG IVAFTVL
Sbjct: 922  AREEMPSGTQLQRMNVYKNIGGSQGIFLTGSRPSWFMVFRERLRIHPQLCDGPIVAFTVL 981

Query: 2148 HNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIV 2327
            HNVNCNHG IYVT+ G LKICQLPSFLSYDNYWPVQKIPLKGTPHQV YFA+KN+Y +IV
Sbjct: 982  HNVNCNHGLIYVTALGTLKICQLPSFLSYDNYWPVQKIPLKGTPHQVAYFAEKNVYSVIV 1041

Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEND-VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            S PVLKPLNQVLS++ DQE GQQ + D +++EGSYP+EEFEVRI+EP++SGG W+TRA+I
Sbjct: 1042 SVPVLKPLNQVLSTIADQEVGQQFDPDNLNYEGSYPIEEFEVRIVEPEKSGGLWKTRASI 1101

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL N+TTRENETLLA+GTAYVQGEDVAARGRVLLFS+++  DNS+T
Sbjct: 1102 PMQSSENALTVRMVTLLNTTTRENETLLAVGTAYVQGEDVAARGRVLLFSIDRTADNSRT 1161

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAI ALASLQGHLLIASGPKI LHKW GSELNGVAF D PPL+ VS+NI
Sbjct: 1162 LVSEVYSKELKGAIPALASLQGHLLIASGPKIILHKWTGSELNGVAFCDYPPLHAVSLNI 1221

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFILLGDIHKSI F++WKE   QL+LL+KDF  LDCLATEFLIDGSTLSL+VSD+QKN
Sbjct: 1222 VKNFILLGDIHKSISFVSWKE--PQLSLLAKDFSPLDCLATEFLIDGSTLSLVVSDDQKN 1279

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            +Q+FYYAPK+  SWKGQKLLSRAEFH+G+ +TKF+RLQLLPT+S++    PGSDK NRF 
Sbjct: 1280 VQIFYYAPKVSESWKGQKLLSRAEFHVGSRITKFLRLQLLPTTSERTATTPGSDKTNRFA 1339

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
             +FGTL+GS+GCI+PLDELTFRRLQSLQ+KLV AV H+AGLNPRSFRQFRSNG+AHRPGP
Sbjct: 1340 TVFGTLEGSLGCIAPLDELTFRRLQSLQKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGP 1399

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            DNIVDCELL HYEMLPLEEQLEIAQQ+GTTR QIMSNLN++ LGTSFL
Sbjct: 1400 DNIVDCELLSHYEMLPLEEQLEIAQQIGTTRMQIMSNLNDMILGTSFL 1447


>ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum lycopersicum]
          Length = 1447

 Score = 1846 bits (4782), Expect = 0.0
 Identities = 908/1188 (76%), Positives = 1046/1188 (88%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT +GRVSWKHHTCMVSA SISTTLKQ PLIWSATNLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 262  LTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWSATNLPHDAYKLLAVPSPIGGVLVIGAN 321

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQSSSC LALNN     D SQ+M RSS NVELDAANATWLT+DVAMLSTKTGELL
Sbjct: 322  TIHYHSQSSSCSLALNNFVFFGDNSQEMPRSSINVELDAANATWLTSDVAMLSTKTGELL 381

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LT++YDGRIVQ+L+LSKSRA+VLTS ITTIG S FFLGSRLGDSLLVQF+ G G  +LP
Sbjct: 382  LLTIIYDGRIVQKLDLSKSRASVLTSGITTIGDSLFFLGSRLGDSLLVQFSSGLGGSNLP 441

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             GV+EEVGDIE  AP AKRLR SSSDALQDM+NGEELSLYG+ PNNA SAQ++FSF VRD
Sbjct: 442  PGVQEEVGDIESDAPSAKRLRMSSSDALQDMINGEELSLYGTAPNNAQSAQKTFSFAVRD 501

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS G+R NAD NA+G AKQSNYELVCCSGHGKNG+L VLQ+SIRPE ITQ
Sbjct: 502  SLINVGPLKDFSYGMRINADLNATGIAKQSNYELVCCSGHGKNGSLSVLQQSIRPETITQ 561

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
             SLPGCKG+WTVYHKN R  + +SS++A ++DEYHAYLIISLE RTMVLQTANNLEEVTE
Sbjct: 562  VSLPGCKGIWTVYHKNTRIHLSESSRMADEEDEYHAYLIISLEARTMVLQTANNLEEVTE 621

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            NVDYY QG+T+AAGNLFGRRRVIQV+A GARILDGAFMTQELSFK  N E GS S+  +V
Sbjct: 622  NVDYYVQGTTLAAGNLFGRRRVIQVFAHGARILDGAFMTQELSFKASNVESGSSSDTSIV 681

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
            +SVSIADPYVLLRMT+GS+QLLVGDPS+C+VS+  P +FESSK+SISACTLYHDKGPEPW
Sbjct: 682  ASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSLTVPSVFESSKKSISACTLYHDKGPEPW 741

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLS+G+GEAIDG+D    DQGD+YC+VCYE+G LEIFDVP+F+ V+SVDKF+
Sbjct: 742  LRKTSTDAWLSSGMGEAIDGADGVIQDQGDVYCVVCYENGTLEIFDVPSFTCVFSVDKFI 801

Query: 1617 SGKANLVDAIILDPTDSAH-KVKTAQDVTAPGKKEQIQNIRI--VELAMHRWPGQHSRPF 1787
            SG+  LVD  + D  +  H   K  +DV  PG+KE  ++++I  VEL MHRW G+HSRPF
Sbjct: 802  SGRTYLVDTFMQDSVNGLHAHSKNTEDVIRPGQKENSKDVKINVVELMMHRWIGKHSRPF 861

Query: 1788 LFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTY 1967
            LFGIL+DGTIL YH+YVF+ S+ +SKV+ +V+SQ   ++ S + SRLRNLRF+RV +D Y
Sbjct: 862  LFGILADGTILSYHAYVFEGSENSSKVDGSVSSQNSISLSSTNASRLRNLRFVRVPVDNY 921

Query: 1968 AREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVL 2147
            AREE   G++ QR+ ++KN+GG QG+F++GSRPSW+M+FRERLRIHPQLCDG IVAFTVL
Sbjct: 922  AREEMPSGSQLQRMNVYKNIGGSQGIFLTGSRPSWFMVFRERLRIHPQLCDGPIVAFTVL 981

Query: 2148 HNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIV 2327
            HNVNCNHG IYVT+ G LKICQLPSFLSYDNYWPVQKIPLKGTPHQV YFA+KN+Y +IV
Sbjct: 982  HNVNCNHGLIYVTALGTLKICQLPSFLSYDNYWPVQKIPLKGTPHQVAYFAEKNVYSVIV 1041

Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEND-VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            S PVLKPLNQVLSS+ DQE GQQ + D +++EGSYP+EEFEVRI+EP++SGGPW+TRA+I
Sbjct: 1042 SVPVLKPLNQVLSSIADQEVGQQFDPDNLNYEGSYPIEEFEVRILEPEKSGGPWKTRASI 1101

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL+N+ T+ENETLLA+GTAYVQGEDVAARGRVLLFS+++  DNS+T
Sbjct: 1102 PMQSSENALTVRMVTLFNTKTKENETLLAVGTAYVQGEDVAARGRVLLFSIDRTADNSRT 1161

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAI ALASLQGHLLIASGPKI LHKW GSELNGVAF D PPL+ VS+NI
Sbjct: 1162 LVSEVYSKELKGAIPALASLQGHLLIASGPKIILHKWTGSELNGVAFCDYPPLHAVSLNI 1221

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFILLGDIHKSI F++WKE   QL+LL+KDF  LDCLATEFLIDGSTLSL+VSD+QKN
Sbjct: 1222 VKNFILLGDIHKSISFVSWKE--PQLSLLAKDFSPLDCLATEFLIDGSTLSLVVSDDQKN 1279

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            +Q+FYYAPK+  SWKGQKLLSRAEFH+G+ +TKF+RLQLLPT+S++    PGSDK NRF 
Sbjct: 1280 VQIFYYAPKVSESWKGQKLLSRAEFHVGSRITKFLRLQLLPTTSERTATTPGSDKTNRFA 1339

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
             +FGTL+GS+GCI+PLDELTFRRLQSLQ+KLV AV H+AGLNPRSFRQFRSNG+AHRPGP
Sbjct: 1340 TVFGTLEGSLGCIAPLDELTFRRLQSLQKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGP 1399

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            DNIVDCELL HYEMLPLEEQLEIAQQ+GTTR QIMSNLN++ LGTSFL
Sbjct: 1400 DNIVDCELLSHYEMLPLEEQLEIAQQIGTTRMQIMSNLNDMILGTSFL 1447


>ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X2 [Citrus sinensis]
          Length = 1457

 Score = 1836 bits (4756), Expect = 0.0
 Identities = 905/1190 (76%), Positives = 1035/1190 (86%), Gaps = 8/1190 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+
Sbjct: 328  TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LT+VYDGR+VQRL+LSK+  +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+  L 
Sbjct: 388  LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+KEE GDIE  AP  KRLRRSSSDALQDMVNGEELSLYGS  NN  SAQ++FSF VRD
Sbjct: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 508  SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ R    DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 568  VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRVIQV+  GARILDG++MTQ+LSF   NSE GSGSE   V
Sbjct: 628  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLL M+DGSI+LLVGDPS CTVS+  P   ESSK+ +S+CTLYHDKGPEPW
Sbjct: 688  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D    DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV
Sbjct: 748  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807

Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+ ++VD  + +   DS  ++ ++ ++ T  G+KE I ++++VELAM RW G HSRPFL
Sbjct: 808  SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            F IL+DGTILCY +Y+F+  +  SK ++ V++ +  ++ ++S SRLRNLRF R+ LD Y 
Sbjct: 868  FAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYT 927

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REET  G  CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 928  REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330
            NVNCNHGFIYVTSQG LKICQLPS  +YDNYWPVQKIPLK TPHQ+TYFA+KNLYPLIVS
Sbjct: 988  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1047

Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498
             PVLKPLNQVLS L+DQE G QI+N     V    +Y VEE+EVRI+EPDR+GGPWQTRA
Sbjct: 1048 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1107

Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678
            TIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFS  +N DN 
Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1167

Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858
            Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS+
Sbjct: 1168 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1227

Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038
            NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDEQ
Sbjct: 1228 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1287

Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218
            KNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+  A PGSDK NR
Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1347

Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398
            F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHRP
Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1407

Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            GPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL
Sbjct: 1408 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1457


>ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523633|gb|ESR35000.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1457

 Score = 1835 bits (4752), Expect = 0.0
 Identities = 906/1190 (76%), Positives = 1034/1190 (86%), Gaps = 8/1190 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+
Sbjct: 328  TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LT+VYDGR+VQRL+LSK+  +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+  L 
Sbjct: 388  LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G KEE GDIE  AP  KRLRRSSSDALQDMVNGEELSLYGS  NN  SAQ++FSF VRD
Sbjct: 448  SGPKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 508  SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ R    DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 568  VELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRVIQV+  GARILDG++MTQ+LSF   NSE GSGSE   V
Sbjct: 628  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLL M+DGSI+LLVGDPS CTVS+  P   ESSK+ +SACTLYHDKGPEPW
Sbjct: 688  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSACTLYHDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D    DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV
Sbjct: 748  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807

Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+ ++VD  + +   DS  ++ ++ ++ T  G+KE I ++++VELAM RW G HSRPFL
Sbjct: 808  SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            F IL+DGTILCY +Y+F+ S+  SK ++ V++ +  ++ ++S SRLRNLRF R  LD Y 
Sbjct: 868  FAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT 927

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REET  G  CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 928  REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVS 2330
            NVNCNHGFIYVTSQG LKICQLPS  +YDNYWPVQKIPLK TPHQ+TYFA+KNLYPLIVS
Sbjct: 988  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKIPLKATPHQITYFAEKNLYPLIVS 1047

Query: 2331 FPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTRA 2498
             PVLKPLNQVLS L+DQE G QI+N     V    +Y VEE+EVRI+EPDR+GGPWQTRA
Sbjct: 1048 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1107

Query: 2499 TIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNS 2678
            TIPMQSSENALTVRVVTL+N+TT+EN+TLLAIGTAYVQGEDVAARGRVLLFS  +N DN 
Sbjct: 1108 TIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1167

Query: 2679 QTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSM 2858
            Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS+
Sbjct: 1168 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1227

Query: 2859 NIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQ 3038
            NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDEQ
Sbjct: 1228 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1287

Query: 3039 KNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINR 3218
            KNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+  A PGSDK NR
Sbjct: 1288 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1347

Query: 3219 FGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRP 3398
            F LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHRP
Sbjct: 1348 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1407

Query: 3399 GPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            GPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL
Sbjct: 1408 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1457


>ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Citrus sinensis]
          Length = 1458

 Score = 1832 bits (4744), Expect = 0.0
 Identities = 905/1191 (75%), Positives = 1035/1191 (86%), Gaps = 9/1191 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+
Sbjct: 328  TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LT+VYDGR+VQRL+LSK+  +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+  L 
Sbjct: 388  LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+KEE GDIE  AP  KRLRRSSSDALQDMVNGEELSLYGS  NN  SAQ++FSF VRD
Sbjct: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 508  SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ R    DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 568  VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRVIQV+  GARILDG++MTQ+LSF   NSE GSGSE   V
Sbjct: 628  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLL M+DGSI+LLVGDPS CTVS+  P   ESSK+ +S+CTLYHDKGPEPW
Sbjct: 688  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D    DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV
Sbjct: 748  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807

Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+ ++VD  + +   DS  ++ ++ ++ T  G+KE I ++++VELAM RW G HSRPFL
Sbjct: 808  SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            F IL+DGTILCY +Y+F+  +  SK ++ V++ +  ++ ++S SRLRNLRF R+ LD Y 
Sbjct: 868  FAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFARIPLDAYT 927

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REET  G  CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 928  REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327
            NVNCNHGFIYVTSQG LKICQLPS  +YDNYWPVQK IPLK TPHQ+TYFA+KNLYPLIV
Sbjct: 988  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1047

Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495
            S PVLKPLNQVLS L+DQE G QI+N     V    +Y VEE+EVRI+EPDR+GGPWQTR
Sbjct: 1048 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1107

Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDN 2675
            ATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFS  +N DN
Sbjct: 1108 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1167

Query: 2676 SQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVS 2855
             Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS
Sbjct: 1168 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1227

Query: 2856 MNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDE 3035
            +NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDE
Sbjct: 1228 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1287

Query: 3036 QKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKIN 3215
            QKNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+  A PGSDK N
Sbjct: 1288 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1347

Query: 3216 RFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHR 3395
            RF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHR
Sbjct: 1348 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1407

Query: 3396 PGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            PGPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL
Sbjct: 1408 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1458


>ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523632|gb|ESR34999.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1458

 Score = 1830 bits (4740), Expect = 0.0
 Identities = 906/1191 (76%), Positives = 1034/1191 (86%), Gaps = 9/1191 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A+S D SQ++ RSSF+VELDAA+ATWL NDVA+LSTKTG+L+
Sbjct: 328  TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LT+VYDGR+VQRL+LSK+  +VLTS+ITTIG+S FFLGSRLGDSLLVQFTCG+G+  L 
Sbjct: 388  LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G KEE GDIE  AP  KRLRRSSSDALQDMVNGEELSLYGS  NN  SAQ++FSF VRD
Sbjct: 448  SGPKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+NIGPLKDFS GLR NAD +A+G +KQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 508  SLVNIGPLKDFSYGLRINADASATGISKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 567

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ R    DSS++AA DDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 568  VELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRVIQV+  GARILDG++MTQ+LSF   NSE GSGSE   V
Sbjct: 628  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLL M+DGSI+LLVGDPS CTVS+  P   ESSK+ +SACTLYHDKGPEPW
Sbjct: 688  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSACTLYHDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D    DQGDIY +VCYESG LEIFDVPNF+ V++VDKFV
Sbjct: 748  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 807

Query: 1617 SGKANLVDAIILDPT-DSAHKVKTA-QDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+ ++VD  + +   DS  ++ ++ ++ T  G+KE I ++++VELAM RW G HSRPFL
Sbjct: 808  SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSGHHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
            F IL+DGTILCY +Y+F+ S+  SK ++ V++ +  ++ ++S SRLRNLRF R  LD Y 
Sbjct: 868  FAILTDGTILCYQAYLFEGSENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT 927

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REET  G  CQRITIFKN+ G QG F+SGSRP W M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 928  REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 987

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327
            NVNCNHGFIYVTSQG LKICQLPS  +YDNYWPVQK IPLK TPHQ+TYFA+KNLYPLIV
Sbjct: 988  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1047

Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495
            S PVLKPLNQVLS L+DQE G QI+N     V    +Y VEE+EVRI+EPDR+GGPWQTR
Sbjct: 1048 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1107

Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDN 2675
            ATIPMQSSENALTVRVVTL+N+TT+EN+TLLAIGTAYVQGEDVAARGRVLLFS  +N DN
Sbjct: 1108 ATIPMQSSENALTVRVVTLFNTTTKENDTLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1167

Query: 2676 SQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVS 2855
             Q LV+EVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AF+DAPPLYVVS
Sbjct: 1168 PQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVS 1227

Query: 2856 MNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDE 3035
            +NIVKNFILLGDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL+VSDE
Sbjct: 1228 LNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDE 1287

Query: 3036 QKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKIN 3215
            QKNIQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L TSSD+  A PGSDK N
Sbjct: 1288 QKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTN 1347

Query: 3216 RFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHR 3395
            RF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVD+V H+AGLNPRSFRQF SNG+AHR
Sbjct: 1348 RFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1407

Query: 3396 PGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            PGPD+IVDCELL HYEMLPLEEQLEIA Q GTTR+QI+SNLN+L LGTSFL
Sbjct: 1408 PGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1458


>ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage
            and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 1821 bits (4717), Expect = 0.0
 Identities = 901/1193 (75%), Positives = 1033/1193 (86%), Gaps = 11/1193 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ  LIWS  NLPHDAYKLLAVP PIGGVLV+CAN
Sbjct: 269  LTWAGRVSWKHHTCMISALSISTTLKQPTLIWSVVNLPHDAYKLLAVPPPIGGVLVICAN 328

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHS+S++  LALNN+A+S D SQ++ R+SF+VELDA  A WL NDVA+LS K GELL
Sbjct: 329  TIHYHSESATYALALNNYAVSIDSSQELPRASFSVELDAVKAAWLLNDVALLSAKNGELL 388

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             L+LVYDGR+VQRL+LSKS+A+VLTS+ITTIG+S FFLGSRLGDSLLVQFT G G   + 
Sbjct: 389  LLSLVYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRLGDSLLVQFTNGLGPSVVS 448

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+KEEVG+IEG  P AKRL+RS+SD LQDMV+GEELSLYGS  NN  SAQ+SFSF VRD
Sbjct: 449  SGLKEEVGEIEGDVPSAKRLKRSASDGLQDMVSGEELSLYGSTANNTESAQKSFSFAVRD 508

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR+N D +A+G AKQSNY+LVCCSGHGKNG LC+L++SIRPEMIT+
Sbjct: 509  SLINVGPLKDFSYGLRSNYDASATGIAKQSNYDLVCCSGHGKNGTLCILRQSIRPEMITE 568

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGC+G+WTVYHKN R    D SK+AA  DEYHAYLIIS+E RTMVL+TA+ L EVTE
Sbjct: 569  VDLPGCRGIWTVYHKNARGHNVDLSKMAAAADEYHAYLIISMEARTMVLETADLLSEVTE 628

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDY+ QG T+AAGNLFGRRRVIQV+  GARILDG+FMTQ+LS    NSE   GSE   V
Sbjct: 629  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSFMTQDLSIGSSNSESSPGSESATV 688

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
            SSVSIADPYVL++MTDGSI+LL+GD S C VSI  P  FE+S+RS+SACTLYHDKGPEPW
Sbjct: 689  SSVSIADPYVLIKMTDGSIRLLIGDSSTCMVSINTPSAFENSERSVSACTLYHDKGPEPW 748

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAAS---HDQGDIYCIVCYESGVLEIFDVPNFSSVYSVD 1607
            LR+ STDAWLSTG+ EAIDG+++A    HDQGDIYCIVCYESG LEIFDVPNF+ V+SVD
Sbjct: 749  LRKASTDAWLSTGVSEAIDGAESADGGPHDQGDIYCIVCYESGALEIFDVPNFNRVFSVD 808

Query: 1608 KFVSGKANLVDAIILDPT-DSAHKV-KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSR 1781
            KFVSGK +L DA + +P  DS  K  + +++V   G+KE   N++ VELAM RW G HSR
Sbjct: 809  KFVSGKTHLADAYVREPPKDSQEKTNRISEEVAGLGRKENAHNMKAVELAMQRWSGHHSR 868

Query: 1782 PFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLD 1961
            PFLFG+L+DGTILCYH+Y+F++ DA SK E++V++Q P  + S+S SRLRNLRF+RV LD
Sbjct: 869  PFLFGVLTDGTILCYHAYLFEAPDATSKTEDSVSAQNPVGLGSISASRLRNLRFVRVPLD 928

Query: 1962 TYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFT 2141
            +Y +EETS    CQRITIF N+ G QG F+ GSRP+W+M+FRERLR+HPQLCDG+IVAFT
Sbjct: 929  SYIKEETSTENSCQRITIFNNISGHQGFFLLGSRPAWFMVFRERLRVHPQLCDGSIVAFT 988

Query: 2142 VLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPL 2321
            VLHNVNCNHG IYVTSQGNLKICQLPSF +YDNYWPVQKIPLKGTPHQVTYF +KNLYPL
Sbjct: 989  VLHNVNCNHGLIYVTSQGNLKICQLPSFSNYDNYWPVQKIPLKGTPHQVTYFPEKNLYPL 1048

Query: 2322 IVSFPVLKPLNQVLSSLVDQEAGQQIEN-DVSFEG---SYPVEEFEVRIMEPDRSGGPWQ 2489
            IVS PV KP+NQVLSSLVDQE G QIEN ++S +    +Y VEEFEVRI+E +  GGPWQ
Sbjct: 1049 IVSVPVHKPVNQVLSSLVDQEVGHQIENHNLSSDELLQTYSVEEFEVRILESENGGGPWQ 1108

Query: 2490 TRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNN 2669
            T+ATIPMQSSENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVAARGRVLLFSV K+ 
Sbjct: 1109 TKATIPMQSSENALTVRVVTLFNATTKENETLLAIGTAYVQGEDVAARGRVLLFSVVKST 1168

Query: 2670 DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYV 2849
            +NSQ LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNGVAF+DAPPLYV
Sbjct: 1169 ENSQVLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGVAFYDAPPLYV 1228

Query: 2850 VSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVS 3029
             SMNIVKNFILLGDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSL+VS
Sbjct: 1229 ASMNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVS 1288

Query: 3030 DEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDK 3209
            DEQKNIQ+FYYAPK+L SWKGQKLLSRAEFH+GAH+TKF+RL +L TSSD+  A PG DK
Sbjct: 1289 DEQKNIQIFYYAPKMLESWKGQKLLSRAEFHVGAHITKFIRLSMLSTSSDRSGAAPGPDK 1348

Query: 3210 INRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRA 3389
             NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQRKLVDAV H+AGLNPRSFRQFRS+G+ 
Sbjct: 1349 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKV 1408

Query: 3390 HRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            HRPGP++IVDCELL H+EMLPLEEQLEIAQQVGTTR QI+SNLN+L+LGTSFL
Sbjct: 1409 HRPGPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNLNDLSLGTSFL 1461


>ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 1812 bits (4694), Expect = 0.0
 Identities = 897/1188 (75%), Positives = 1037/1188 (87%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 264  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVISAN 323

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALN++A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL
Sbjct: 324  TIHYHSQSASCALALNSYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 383

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LTLVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G   L 
Sbjct: 384  LLTLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMLS 443

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +KEEVGDIE  AP +KRLRRS SDALQDMV+GEELSLYGS PN   SAQ+SFSF VRD
Sbjct: 444  SNLKEEVGDIEADAP-SKRLRRSPSDALQDMVSGEELSLYGSAPNRTESAQKSFSFAVRD 502

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+
Sbjct: 503  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 562

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 563  VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 622

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGR RVIQVY  GARILDG+FMTQ++SF   N E GS S+  + 
Sbjct: 623  SVDYYVQGKTLAAGNLFGRCRVIQVYERGARILDGSFMTQDVSFGASNLESGSASDSAIA 682

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P  FESSK S+S+CTLYHDKGPEPW
Sbjct: 683  LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 742

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GE IDG+D A+ D GDIYC+VC+++G LEIFDVPNF+ V+SV+ F+
Sbjct: 743  LRKTSTDAWLSTGVGETIDGTDGAAQDHGDIYCVVCFDNGNLEIFDVPNFNCVFSVENFM 802

Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
            SGK++LVDA++ +    + K      V   G+KE I ++++VELAM RW GQHSRPFLFG
Sbjct: 803  SGKSHLVDALMKEVLKDS-KQGDRDGVINQGRKENIPDMKVVELAMQRWSGQHSRPFLFG 861

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            ILSDGTILCYH+Y+++S D+ SKVE++ ++     + S + SRLRNLRF+RV LD YARE
Sbjct: 862  ILSDGTILCYHAYLYESPDSTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYARE 921

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            +TS G  CQ+ITIFKN+G  +G F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV
Sbjct: 922  DTSNGPPCQQITIFKNIGSYEGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 981

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCN G IYVTSQG LKICQLPS  +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP
Sbjct: 982  NCNQGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1041

Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            VLKPLNQV+ SLVDQ+   Q E    N       YP++EFEVRIMEP++SGGPWQT+ATI
Sbjct: 1042 VLKPLNQVI-SLVDQDINHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1100

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN QT
Sbjct: 1101 PMQSSENALTVRMVTLVNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQT 1160

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI
Sbjct: 1161 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1220

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N
Sbjct: 1221 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1280

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+  AVPGSDK NRF 
Sbjct: 1281 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGAVPGSDKTNRFA 1339

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP
Sbjct: 1340 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1399

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            D+IVDCELLCHYEMLPLEEQLEIA QVGTTR+QI+SNL++L+LGTSFL
Sbjct: 1400 DSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1447


>ref|XP_006587381.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X2 [Glycine max]
          Length = 1217

 Score = 1810 bits (4688), Expect = 0.0
 Identities = 895/1188 (75%), Positives = 1036/1188 (87%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSW HHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 34   LTWAGRVSWTHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 93

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL
Sbjct: 94   TIHYHSQSASCALALNNYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 153

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             L LVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G   + 
Sbjct: 154  LLMLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMMS 213

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +KEEVGDIE  AP +KRLRRS SDALQDMV+GEELSLYGS  N   SAQ+SFSF VRD
Sbjct: 214  SNLKEEVGDIEVDAP-SKRLRRSPSDALQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 272

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+
Sbjct: 273  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 332

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 333  VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 392

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRVIQVY  GARILDG+FMTQ++SF   NSE GS SE  + 
Sbjct: 393  SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSESGSASESAIA 452

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P  FESSK S+S+CTLYHDKGPEPW
Sbjct: 453  LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 512

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFD+PNF+ V+SV+ F+
Sbjct: 513  LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDIPNFNCVFSVENFM 572

Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
            SGK++LVDA++ +    + K      V   G+K+ I N+++VELAM RW GQHSRPFLFG
Sbjct: 573  SGKSHLVDALMKEVLKDS-KQGDRDGVVNQGRKDNIPNMKVVELAMQRWSGQHSRPFLFG 631

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            ILSDGTILCYH+Y+++S D  SKVE++ ++     + S + SRLRNLRF+RV LD Y RE
Sbjct: 632  ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYPRE 691

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            +TS G+ CQ+ITIFKN+G  QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV
Sbjct: 692  DTSNGSPCQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 751

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCNHG IYVTSQG LKICQLPS  +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP
Sbjct: 752  NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 811

Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            VLKPLNQV+ SLVDQ+   Q E    N       YP++EFEVRIMEP++SGGPWQT+ATI
Sbjct: 812  VLKPLNQVI-SLVDQDFNHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 870

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ K  DN QT
Sbjct: 871  PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKITDNPQT 930

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI
Sbjct: 931  LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 990

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N
Sbjct: 991  VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1050

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+  +VPGSDK NRF 
Sbjct: 1051 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGSVPGSDKTNRFA 1109

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP
Sbjct: 1110 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1169

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            D+IVDCELLCHYEMLPLEEQLEIA Q+GTTR+QI+SNL++L+LGTSFL
Sbjct: 1170 DSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLSLGTSFL 1217


>ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Glycine max]
          Length = 1449

 Score = 1810 bits (4688), Expect = 0.0
 Identities = 895/1188 (75%), Positives = 1036/1188 (87%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSW HHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 266  LTWAGRVSWTHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 325

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALNN+A++ D SQ++ RSSFNVELDAANATWL +DVA+LSTKTGELL
Sbjct: 326  TIHYHSQSASCALALNNYAVTLDSSQEIPRSSFNVELDAANATWLLSDVALLSTKTGELL 385

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             L LVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G   + 
Sbjct: 386  LLMLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGVSMMS 445

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +KEEVGDIE  AP +KRLRRS SDALQDMV+GEELSLYGS  N   SAQ+SFSF VRD
Sbjct: 446  SNLKEEVGDIEVDAP-SKRLRRSPSDALQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 504

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+
Sbjct: 505  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 564

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 565  VELPGCKGIWTVYHKSTRSHNADSSKMADDDDEYHAYLIISLEARTMVLETADLLSEVTE 624

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRVIQVY  GARILDG+FMTQ++SF   NSE GS SE  + 
Sbjct: 625  SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSESGSASESAIA 684

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADP+VLLRM+DGSI+LL+GDPS CT+S+ +P  FESSK S+S+CTLYHDKGPEPW
Sbjct: 685  LSVSIADPFVLLRMSDGSIRLLIGDPSTCTISVTSPASFESSKGSVSSCTLYHDKGPEPW 744

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFD+PNF+ V+SV+ F+
Sbjct: 745  LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDIPNFNCVFSVENFM 804

Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
            SGK++LVDA++ +    + K      V   G+K+ I N+++VELAM RW GQHSRPFLFG
Sbjct: 805  SGKSHLVDALMKEVLKDS-KQGDRDGVVNQGRKDNIPNMKVVELAMQRWSGQHSRPFLFG 863

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            ILSDGTILCYH+Y+++S D  SKVE++ ++     + S + SRLRNLRF+RV LD Y RE
Sbjct: 864  ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLSSTNVSRLRNLRFVRVPLDAYPRE 923

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            +TS G+ CQ+ITIFKN+G  QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV
Sbjct: 924  DTSNGSPCQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 983

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCNHG IYVTSQG LKICQLPS  +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP
Sbjct: 984  NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1043

Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            VLKPLNQV+ SLVDQ+   Q E    N       YP++EFEVRIMEP++SGGPWQT+ATI
Sbjct: 1044 VLKPLNQVI-SLVDQDFNHQNESQNMNPDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1102

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ K  DN QT
Sbjct: 1103 PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKITDNPQT 1162

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI
Sbjct: 1163 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1222

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSD+ +N
Sbjct: 1223 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDDNRN 1282

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+L T SD+  +VPGSDK NRF 
Sbjct: 1283 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLST-SDRAGSVPGSDKTNRFA 1341

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDGSIGCI+PLDE+TFRRLQSLQRKLVDAV H+AGLNPR+FR FRSNG+AHRPGP
Sbjct: 1342 LLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGP 1401

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            D+IVDCELLCHYEMLPLEEQLEIA Q+GTTR+QI+SNL++L+LGTSFL
Sbjct: 1402 DSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLSLGTSFL 1449


>gb|ESW24391.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris]
          Length = 1445

 Score = 1804 bits (4672), Expect = 0.0
 Identities = 888/1188 (74%), Positives = 1039/1188 (87%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 262  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 321

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            T+HYHSQS+SC LALN++A+S D SQ++ RSSFNVELD+ANATWL +DVA+LSTKTGELL
Sbjct: 322  TVHYHSQSASCALALNSYAVSLDNSQEIPRSSFNVELDSANATWLLSDVALLSTKTGELL 381

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LTLVYDGR+VQRL+LSKS+A+VL+S ITTIG+S FFL SRLGDS+LVQF+CG+G   L 
Sbjct: 382  LLTLVYDGRVVQRLDLSKSKASVLSSGITTIGNSLFFLASRLGDSMLVQFSCGSGGSMLS 441

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +KEEVGDIE  AP +KRLRRS SD LQD+V+GEELSLYGS PN   SAQ+SFSF VRD
Sbjct: 442  SNLKEEVGDIEADAP-SKRLRRSPSDTLQDVVSGEELSLYGSAPNRTESAQKSFSFAVRD 500

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+
Sbjct: 501  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 560

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+A DDDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 561  VELPGCKGIWTVYHKSTRSHNTDSSKLADDDDEYHAYLIISLEARTMVLETADLLSEVTE 620

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRVIQVY  GARILDG+FMTQ+++F   NSE  S SE  + 
Sbjct: 621  SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVTFGASNSESASASESAIA 680

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADP+VLLRM+DGS++LL+GDP  CT+S+ +P  FES+K S+S+CTLYHDKGPEPW
Sbjct: 681  LSVSIADPFVLLRMSDGSVRLLIGDPITCTISVTSPASFESTKGSVSSCTLYHDKGPEPW 740

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VC+++G LEIFDVPNF+ V+SV  F+
Sbjct: 741  LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCFDNGNLEIFDVPNFNCVFSVGNFM 800

Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
            SGK++LVDA++ +    + K      V   G+KE + ++++VELAM RW GQHSRPFLFG
Sbjct: 801  SGKSHLVDALMKEVLKDSKK-GDRDGVIIQGRKENVPDMKVVELAMQRWSGQHSRPFLFG 859

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            ILSDGTILCYH+Y+++S D  SKVE++ ++     + + + SRLRNLRF+RV+LD YARE
Sbjct: 860  ILSDGTILCYHAYLYESPDGTSKVEDSASAGGSIGLGTTNISRLRNLRFVRVSLDAYARE 919

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            ETS G+  Q+ITIFKN+G  QG F+SGSRP+W M+ RERLR+HPQLCDG+IVAFTVLHNV
Sbjct: 920  ETSNGSLHQQITIFKNIGSYQGFFLSGSRPAWVMVLRERLRVHPQLCDGSIVAFTVLHNV 979

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCNHG IYVTSQG LKICQLPS  +YD+YWPVQKIPLK TPHQVTYFA+KNLYPLIVSFP
Sbjct: 980  NCNHGLIYVTSQGVLKICQLPSGSNYDSYWPVQKIPLKATPHQVTYFAEKNLYPLIVSFP 1039

Query: 2337 VLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            VLKPL+QV+ SLVDQ+   Q E    N       YP++EFEVRIMEP++SGGPWQT+ATI
Sbjct: 1040 VLKPLSQVI-SLVDQDVNHQNESQNMNSDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATI 1098

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQSSENALTVR+VTL N+T++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN Q+
Sbjct: 1099 PMQSSENALTVRMVTLLNTTSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQS 1158

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+ELNG+AFFDAPPL+VVS+NI
Sbjct: 1159 LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWNGTELNGIAFFDAPPLHVVSLNI 1218

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFIL+GDIHKSIYFL+WKEQG+QL+LL+KDF  LDC ATEFLIDGSTLSLMVSD+++N
Sbjct: 1219 VKNFILIGDIHKSIYFLSWKEQGAQLSLLAKDFSSLDCFATEFLIDGSTLSLMVSDDKRN 1278

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+FYYAPK+  SWKGQKLLSRAEFH+GAHVTKF+RLQ+LPT SD+  + PGSDK NRF 
Sbjct: 1279 IQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPT-SDRAGSAPGSDKTNRFA 1337

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDGSIGCI+PLDE+TFRRLQSLQ+KLVDAVAH+AGLNPR+FR+F+SNG+AHRPGP
Sbjct: 1338 LLFGTLDGSIGCIAPLDEITFRRLQSLQKKLVDAVAHVAGLNPRAFRKFQSNGKAHRPGP 1397

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            D+IVDCELLCHYEMLPLEEQLEIA QVGTTR+QI+SNL++L+LGTSFL
Sbjct: 1398 DSIVDCELLCHYEMLPLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1445


>gb|EXC20897.1| Cleavage and polyadenylation specificity factor subunit 1 [Morus
            notabilis]
          Length = 1479

 Score = 1798 bits (4658), Expect = 0.0
 Identities = 893/1220 (73%), Positives = 1036/1220 (84%), Gaps = 38/1220 (3%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRV WKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+CAN
Sbjct: 270  LTWAGRVLWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVICAN 329

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            T+HY SQS+SC LALN++A+S D SQ+M R+ F+VELDAANATWL+NDV +LSTK GELL
Sbjct: 330  TLHYQSQSNSCTLALNSYAVSVDSSQEMRRAPFSVELDAANATWLSNDVVLLSTKAGELL 389

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LTLVYDGR+VQRL+LSKS+A+VLTS ITTIG+S FFLGSRLGDSLLVQFT G G+  L 
Sbjct: 390  LLTLVYDGRVVQRLDLSKSKASVLTSGITTIGNSLFFLGSRLGDSLLVQFTYGLGTSMLS 449

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
             G+K+EVGDIEG A LAKRLRRSSSD LQDM +GEELSLY S PNN+ S Q+SFSFTVRD
Sbjct: 450  SGLKDEVGDIEGDAHLAKRLRRSSSDVLQDMTSGEELSLYVSAPNNSESTQKSFSFTVRD 509

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 510  SLVNVGPLKDFSYGLRINADPNATGVAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+ A DDEYHAYLIISLE RTMVL+TA+ L EVTE
Sbjct: 570  VELPGCKGIWTVYHKSTRSH--DSSKLVAADDEYHAYLIISLEARTMVLETADLLTEVTE 627

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRV+QVY  GARILDG+FMTQ+LSF    SE  SGSE  +V
Sbjct: 628  SVDYYVQGRTIAAGNLFGRRRVVQVYERGARILDGSFMTQDLSFGPAPSESSSGSENAVV 687

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
            +SVSIADPYV+LRM+DGSI+LLVGDP++CTVS+  P  FESSK  ISACTLY DKGPEPW
Sbjct: 688  TSVSIADPYVVLRMSDGSIRLLVGDPTSCTVSVSTPADFESSKSIISACTLYRDKGPEPW 747

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+ EAIDG+D    DQGDIYC+VCYESG L+I+DVP+F+ V+SVD F+
Sbjct: 748  LRKTSTDAWLSTGVDEAIDGADETLQDQGDIYCVVCYESGSLDIYDVPSFNYVFSVDNFI 807

Query: 1617 SGKANLVDAIILDPTDSAHKV--KTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFL 1790
            SG+ +LVDA + +      K   K +++    G+KE +QN++IVELAM RW G+HSRPFL
Sbjct: 808  SGRPHLVDAFVQEQPKDLQKATNKNSEESAGQGRKENVQNMKIVELAMQRWSGKHSRPFL 867

Query: 1791 FGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYA 1970
             GIL+DG+ILCYH+Y+F+  ++ S+ E++V+S      ++ SGSRLRNLRF+RV LD+YA
Sbjct: 868  LGILTDGSILCYHAYLFEGPESTSRTEDSVSS------RNSSGSRLRNLRFVRVPLDSYA 921

Query: 1971 REETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLH 2150
            REETS G  CQRI++FKN+ G QGLF+SGSRP+W+M+FRERLR+HPQLCDG+IVAFTVLH
Sbjct: 922  REETSDGMPCQRISVFKNIAGYQGLFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLH 981

Query: 2151 NVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQK-IPLKGTPHQVTYFADKNLYPLIV 2327
            NVNCNHGFIYVTS+G LKICQLPS  SYDNYWPVQK IPLKGTPHQVTYFA++NLYPLIV
Sbjct: 982  NVNCNHGFIYVTSEGILKICQLPSITSYDNYWPVQKVIPLKGTPHQVTYFAERNLYPLIV 1041

Query: 2328 SFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQTR 2495
            S PV KPLNQV+SSL+DQE G Q EN          +Y ++EFEVRI+EP+RSGGPWQT+
Sbjct: 1042 SVPVPKPLNQVMSSLLDQEVGHQFENPNLSPDDLNRTYTIDEFEVRILEPERSGGPWQTK 1101

Query: 2496 ATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLL--------- 2648
             TIPMQSSENALT+RVVTL+N+TT ENETLLAIGTAYVQGEDVAARGR++L         
Sbjct: 1102 VTIPMQSSENALTIRVVTLFNTTTNENETLLAIGTAYVQGEDVAARGRIILRALAPWWER 1161

Query: 2649 --------------------FSVEKNNDNSQTLVSEVYSKELKGAISALASLQGHLLIAS 2768
                                F ++  + + Q  VSE+YSKELKGAISALASLQGHLLIAS
Sbjct: 1162 LHLHPGSRVQIPEMASPSGVFKIDSADFHLQ--VSEIYSKELKGAISALASLQGHLLIAS 1219

Query: 2769 GPKITLHKWIGSELNGVAFFDAPPLYVVSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNL 2948
            GPKI LHKW G+ELNG+AFFDAPPLYVVS+NIVKNFIL+GD+HKSIYFL+WKEQG+QL+L
Sbjct: 1220 GPKIILHKWTGTELNGIAFFDAPPLYVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLSL 1279

Query: 2949 LSKDFGFLDCLATEFLIDGSTLSLMVSDEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIG 3128
            L+KDFG LDC ATEFLIDGSTLSL+VSD+QKNIQ+FYYAPK+  SWKGQ+LLSRAEFH+G
Sbjct: 1280 LAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNIQIFYYAPKMSESWKGQRLLSRAEFHVG 1339

Query: 3129 AHVTKFMRLQLLPTSSDQINAVPGSDKINRFGLLFGTLDGSIGCISPLDELTFRRLQSLQ 3308
            AHVTKF+RLQ+LPTS+D+  + PGSDK NRF LLFG LDGSIGCI+PLDELTFRRLQSLQ
Sbjct: 1340 AHVTKFLRLQMLPTSTDRTGSTPGSDKTNRFALLFGALDGSIGCIAPLDELTFRRLQSLQ 1399

Query: 3309 RKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVG 3488
            +KLVDAV H+AGLNPRSFRQF SNG+AHRPGPD+IVDCELLCHYEMLPLEEQLEIA  +G
Sbjct: 1400 KKLVDAVPHVAGLNPRSFRQFCSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHLIG 1459

Query: 3489 TTRTQIMSNLNELTLGTSFL 3548
            TTR+QI+SNLN+L LGTSFL
Sbjct: 1460 TTRSQILSNLNDLFLGTSFL 1479


>ref|XP_002318462.2| cleavage and polyadenylation specificity factor family protein
            [Populus trichocarpa] gi|550326263|gb|EEE96682.2|
            cleavage and polyadenylation specificity factor family
            protein [Populus trichocarpa]
          Length = 1455

 Score = 1795 bits (4648), Expect = 0.0
 Identities = 892/1193 (74%), Positives = 1027/1193 (86%), Gaps = 11/1193 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRV WKHHTCM+SALSISTTLKQ  LIWS  NLPHDAYKLLAVPSPIGGVLV+  N
Sbjct: 268  LTWAGRVVWKHHTCMISALSISTTLKQPTLIWSIGNLPHDAYKLLAVPSPIGGVLVIGVN 327

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHS+S+SC LALN++A S D SQ++ R++F+VELDAANATWL  DVA+LSTKTGELL
Sbjct: 328  TIHYHSESASCALALNSYAASVDSSQELPRATFSVELDAANATWLLKDVALLSTKTGELL 387

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGSLPL- 539
             LTLVYDGR+VQRL+LSKS+A+VLTS+ITT+G+SFFFLGSRLGDSLLVQFT G GS  L 
Sbjct: 388  LLTLVYDGRVVQRLDLSKSKASVLTSDITTLGNSFFFLGSRLGDSLLVQFTSGLGSSMLS 447

Query: 540  -GVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQ-----ESFS 701
             G+KEEVGDIEG  P AKRL+ SSSDALQDMV+GEELSLY S PNNA S+Q     ++FS
Sbjct: 448  PGLKEEVGDIEGDLPSAKRLKVSSSDALQDMVSGEELSLYSSAPNNAESSQVVSVIKTFS 507

Query: 702  FTVRDSLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRP 881
            FTVRDSLIN+GPLKDF+ GLR NAD NA+G +KQSNYELVCCSGHGKNGALCVLQ+SIRP
Sbjct: 508  FTVRDSLINVGPLKDFAYGLRINADANATGISKQSNYELVCCSGHGKNGALCVLQQSIRP 567

Query: 882  EMITQESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNL 1061
            EMIT+  LPGCKG+WTVYHKN RS   DS K+A+DD EYHAYLIIS+E RTMVL+TA++L
Sbjct: 568  EMITEVELPGCKGIWTVYHKNARSHSVDSLKMASDD-EYHAYLIISMEARTMVLETADHL 626

Query: 1062 EEVTENVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGS 1241
             EVTE+VDY+ QG T+AAGNLFGRRRV+QV+  GARILDG+FMTQ+LSF   NSE G  S
Sbjct: 627  TEVTESVDYFVQGRTIAAGNLFGRRRVVQVFERGARILDGSFMTQDLSFGGSNSETGR-S 685

Query: 1242 EGQMVSSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDK 1421
            E   V  VSI DPYVL+RM DGSIQ+LVGDPSACTVS+  P  F+SS +S+SACTLYHDK
Sbjct: 686  ESSTVMHVSIVDPYVLVRMADGSIQILVGDPSACTVSVNTPSAFQSSTKSVSACTLYHDK 745

Query: 1422 GPEPWLRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYS 1601
            GPEPWLR+TSTDAWLSTGI EAIDG+D+ +H+QGDIYC+VCYE+G LEIFDVPNF+SV+ 
Sbjct: 746  GPEPWLRKTSTDAWLSTGISEAIDGADSGAHEQGDIYCVVCYETGALEIFDVPNFNSVFF 805

Query: 1602 VDKFVSGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSR 1781
            VDKFVSGK +L+D    +P     K    ++V   G+KE  QN+++VEL M RW G+HSR
Sbjct: 806  VDKFVSGKTHLLDTCTGEPAKDMMK-GVKEEVAGAGRKESTQNMKVVELTMLRWSGRHSR 864

Query: 1782 PFLFGILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLD 1961
            PFLFGIL+DGTILCYH+Y+F+  D  SK+E++V++Q      ++S SRLRNLRF+RV LD
Sbjct: 865  PFLFGILTDGTILCYHAYLFEGPDGTSKLEDSVSAQNSVGASTISASRLRNLRFVRVPLD 924

Query: 1962 TYAREETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFT 2141
            TY REETS  T CQRIT FKN+ G QG F+SGSRP+W+M+FRERLR+HPQLCDG+IVAFT
Sbjct: 925  TYTREETSSETSCQRITTFKNISGYQGFFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFT 984

Query: 2142 VLHNVNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPL 2321
            VLH VNCNHG IYVTSQGNLKIC L S  SYDNYWPVQKIPLKGTPHQVTYFA++NLYPL
Sbjct: 985  VLHTVNCNHGLIYVTSQGNLKICHLSSVSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPL 1044

Query: 2322 IVSFPVLKPLNQVLSSLVDQEAGQQIEN----DVSFEGSYPVEEFEVRIMEPDRSGGPWQ 2489
            IVS PV KP+NQVLSSLVDQE G QIEN          +Y V+EFEVRI+EP  S GPWQ
Sbjct: 1045 IVSVPVQKPVNQVLSSLVDQEVGHQIENHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQ 1102

Query: 2490 TRATIPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNN 2669
             +ATIPMQ+SENALTVR+V+L+N++T+ENETLLA+GTAYVQGEDVAARGR+LLFSV KN 
Sbjct: 1103 VKATIPMQTSENALTVRMVSLFNTSTKENETLLAVGTAYVQGEDVAARGRILLFSVVKNP 1162

Query: 2670 DNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYV 2849
            +NSQ LVSEVYSKELKGAISALASLQGHLLIASGPKI LHKW G+EL GVAF DAPPLYV
Sbjct: 1163 ENSQILVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELTGVAFSDAPPLYV 1222

Query: 2850 VSMNIVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVS 3029
            VS+NIVKNFILLGDIHKSIYFL+WKEQG+QL+LL+KDF  LDC +TEFLIDGSTLSL+VS
Sbjct: 1223 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVS 1282

Query: 3030 DEQKNIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDK 3209
            DEQKN+Q+FYYAPK+  SWKGQKLLSRAEFH+GA VTKFMRLQ+L  S D+  A P SDK
Sbjct: 1283 DEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGALVTKFMRLQMLSPSLDRSGAAPVSDK 1342

Query: 3210 INRFGLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRA 3389
             NRF LLFGTLDGSIGCI+PLDELTFRRLQSLQ+KLVDAV H+AGLNP+SFRQFRS+G+A
Sbjct: 1343 TNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKA 1402

Query: 3390 HRPGPDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            HRPGP++IVDCE+L +YEM+PLEEQ+EIAQQ+GTTR QI+SNLN+LTLGTSFL
Sbjct: 1403 HRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIGTTRAQILSNLNDLTLGTSFL 1455


>ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cicer arietinum]
          Length = 1447

 Score = 1789 bits (4633), Expect = 0.0
 Identities = 885/1189 (74%), Positives = 1028/1189 (86%), Gaps = 7/1189 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 266  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAVNLPHDAYKLLAVPSPIGGVLVIGAN 325

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            TIHYHSQS+SC LALN++A+S D SQ+M RSSFNVELDAANATWL NDVA+LSTKTGELL
Sbjct: 326  TIHYHSQSASCALALNSYAVSVDNSQEMPRSSFNVELDAANATWLLNDVALLSTKTGELL 385

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAG--SLP 536
             LTL+YDGR+VQRL+LSKS+A+VL+S +TTIG+S FFL SRLGDS+LVQF+ G+G   L 
Sbjct: 386  LLTLIYDGRVVQRLDLSKSKASVLSSGVTTIGNSLFFLASRLGDSMLVQFSSGSGVSMLS 445

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +KEEVGD +  A  AKR+RRS SD LQDMV+GEELSLYGS  N   SAQ+SFSF VRD
Sbjct: 446  SNLKEEVGDFDVDASSAKRMRRSPSDTLQDMVSGEELSLYGSATNRTESAQKSFSFAVRD 505

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SLIN+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNG+LCVL++SIRPE+IT+
Sbjct: 506  SLINVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGSLCVLRQSIRPEVITE 565

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
              LPGCKG+WTVYHK+ RS   DSSK+A D+DEYHAYLIISLE+RTMVL+TA+ L EVTE
Sbjct: 566  VELPGCKGIWTVYHKSTRSLNADSSKLADDEDEYHAYLIISLESRTMVLETADLLSEVTE 625

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
            +VDYY QG T+AAGNLFGRRRVIQVY  GARILDG+FMTQ++SF   NSE   GSE  + 
Sbjct: 626  SVDYYVQGKTLAAGNLFGRRRVIQVYERGARILDGSFMTQDVSFGASNSEANYGSESALA 685

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSIADPYVLL+M+DGS++LLVGDPS CT+S+ +P  FESSK S+S CTLYHDKGPEPW
Sbjct: 686  LSVSIADPYVLLKMSDGSVRLLVGDPSTCTISVTSPASFESSKGSVSTCTLYHDKGPEPW 745

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR+TSTDAWLSTG+GEAIDG+D A+ D GDIYC+VCYE+  LEIFDVPNFS V+SV+ F+
Sbjct: 746  LRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIYCVVCYENDSLEIFDVPNFSCVFSVENFL 805

Query: 1617 SGKANLVDAIILD-PTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLF 1793
            SGK++LVDA+  + P DS    K +  V + G+K+ + N+++VELAM RW G+H RPFLF
Sbjct: 806  SGKSHLVDALTKEVPKDSQKGDKVSDGVVSQGRKDAL-NMKVVELAMQRWSGKHGRPFLF 864

Query: 1794 GILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYAR 1973
            GILSDGT LCYH+Y+++S D  SKVE++V++     + + S SRLRNLRF+RV LD +AR
Sbjct: 865  GILSDGTTLCYHAYLYESPDGTSKVEDSVSA----GLSNSSVSRLRNLRFVRVPLDVHAR 920

Query: 1974 EETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHN 2153
            EETS G  CQ+I IFKN+G  +G F+SGSRP+W ML RERLR+HPQLCDG+IVAFTVLHN
Sbjct: 921  EETSNGPPCQQINIFKNIGSYEGFFLSGSRPAWVMLLRERLRVHPQLCDGSIVAFTVLHN 980

Query: 2154 VNCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSF 2333
            VNCNHG IYVTSQG LKICQLPS  +YD YWPVQK+PLK TPHQVTYFA+KNLYPLIVS+
Sbjct: 981  VNCNHGLIYVTSQGVLKICQLPSGSNYDCYWPVQKVPLKATPHQVTYFAEKNLYPLIVSY 1040

Query: 2334 PVLKPLNQVLSSLVDQEAGQQIE----NDVSFEGSYPVEEFEVRIMEPDRSGGPWQTRAT 2501
            PV KPLNQV+ +LVDQ+A Q  E    N+      Y +EEFEVRIMEP++SGGPWQ +AT
Sbjct: 1041 PVPKPLNQVI-ALVDQDANQLTESQNLNNDEQSHLYTIEEFEVRIMEPEKSGGPWQLKAT 1099

Query: 2502 IPMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQ 2681
            IPMQSSENALTVR+VTL N++++ENETLLAIGTAYVQGEDVAARGR+LLFS+ KN DN Q
Sbjct: 1100 IPMQSSENALTVRMVTLMNTSSKENETLLAIGTAYVQGEDVAARGRILLFSLGKNTDNPQ 1159

Query: 2682 TLVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMN 2861
             LVSEVYSKELKGAISALA+LQGHLL+ASGPKI LHKW G+ELNGVAFFD PPL+VVS+N
Sbjct: 1160 NLVSEVYSKELKGAISALAALQGHLLVASGPKIILHKWTGTELNGVAFFDVPPLHVVSLN 1219

Query: 2862 IVKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQK 3041
            IVKNFIL+GD+HKSIYFL+WKEQG+QL+LL+KDFG LDC ATEFLIDGSTLSLMVSDEQK
Sbjct: 1220 IVKNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLMVSDEQK 1279

Query: 3042 NIQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRF 3221
            NIQ+FYYAPK+  SWKGQKLLSRAEFH+GAH+TKF+RLQ+L T SD+  + PGSDK NRF
Sbjct: 1280 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHITKFLRLQMLST-SDKTGSGPGSDKTNRF 1338

Query: 3222 GLLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPG 3401
             LLFGTLDGSIGCI+PLDE+TFRRLQSLQ+KLVDAV H+AGLNPR+FR F SNG+AHRPG
Sbjct: 1339 ALLFGTLDGSIGCIAPLDEITFRRLQSLQKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPG 1398

Query: 3402 PDNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            PD+IVDCELLCHYEML LEEQLEIA QVGTTR+QI+SNL++L+LGTSFL
Sbjct: 1399 PDSIVDCELLCHYEMLQLEEQLEIAHQVGTTRSQILSNLSDLSLGTSFL 1447


>ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Fragaria vesca subsp. vesca]
          Length = 1439

 Score = 1789 bits (4633), Expect = 0.0
 Identities = 886/1188 (74%), Positives = 1019/1188 (85%), Gaps = 6/1188 (0%)
 Frame = +3

Query: 3    LT*AGRVSWKHHTCMVSALSISTTLKQQPLIWSATNLPHDAYKLLAVPSPIGGVLVLCAN 182
            LT AGRVSWKHHTCM+SALSISTTLKQ PLIWSA NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 270  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAINLPHDAYKLLAVPSPIGGVLVISAN 329

Query: 183  TIHYHSQSSSCVLALNNHAISSDGSQDMLRSSFNVELDAANATWLTNDVAMLSTKTGELL 362
            +IHYHSQS+SC LALN++A S D SQ+M RSSF VELDAANA+WL+NDV +LSTKTGELL
Sbjct: 330  SIHYHSQSASCALALNSYAGSVDSSQEMPRSSFTVELDAANASWLSNDVILLSTKTGELL 389

Query: 363  SLTLVYDGRIVQRLELSKSRAAVLTSEITTIGSSFFFLGSRLGDSLLVQFTCGAGS--LP 536
             LTLVYDGR+V RL+LSKS+A+VLTS I T+G+S FFLGSRLGDSLLVQFT G G+  L 
Sbjct: 390  LLTLVYDGRVVHRLDLSKSKASVLTSGIATVGNSLFFLGSRLGDSLLVQFTSGVGASMLS 449

Query: 537  LGVKEEVGDIEGAAPLAKRLRRSSSDALQDMVNGEELSLYGSGPNNAHSAQESFSFTVRD 716
              +K+EVGDIEG AP AKRLR SSSDALQDM++GEELSLYGS  NNA SAQ SFSF VRD
Sbjct: 450  ADLKDEVGDIEGDAPSAKRLRMSSSDALQDMISGEELSLYGSAQNNAESAQRSFSFAVRD 509

Query: 717  SLINIGPLKDFSNGLRNNADHNASGFAKQSNYELVCCSGHGKNGALCVLQKSIRPEMITQ 896
            SL+N+GPLKDFS GLR NAD NA+G AKQSNYELVCCSGHGKNGALCVL++SIRPEMIT+
Sbjct: 510  SLVNVGPLKDFSYGLRINADANATGIAKQSNYELVCCSGHGKNGALCVLRQSIRPEMITE 569

Query: 897  ESLPGCKGVWTVYHKNVRSQMGDSSKIAADDDEYHAYLIISLETRTMVLQTANNLEEVTE 1076
             +LPGCKG+WTVYHKN R    +S      DDEYHA+LIISLE RTMVL+TA++L EVT+
Sbjct: 570  VALPGCKGIWTVYHKNARGHNAESY-----DDEYHAFLIISLEARTMVLETADHLSEVTD 624

Query: 1077 NVDYYTQGSTVAAGNLFGRRRVIQVYASGARILDGAFMTQELSFKVQNSEIGSGSEGQMV 1256
             VDY+ QG T+AAGNLFGRRRV+Q+Y  GARIL+G +MTQ+LSF   NSE GSGSE   V
Sbjct: 625  KVDYFLQGRTIAAGNLFGRRRVVQIYERGARILEGYYMTQDLSFGASNSESGSGSESATV 684

Query: 1257 SSVSIADPYVLLRMTDGSIQLLVGDPSACTVSIVNPEIFESSKRSISACTLYHDKGPEPW 1436
             SVSI DPYVLLRM+DG I+LLVGDPS+CTVS+ NP  FESSK+ +SACTLYHD+GPEPW
Sbjct: 685  LSVSIVDPYVLLRMSDGGIRLLVGDPSSCTVSVSNPAAFESSKKLVSACTLYHDEGPEPW 744

Query: 1437 LRRTSTDAWLSTGIGEAIDGSDAASHDQGDIYCIVCYESGVLEIFDVPNFSSVYSVDKFV 1616
            LR++STDAWLSTGI EAIDG     HDQGD+YC++CYESG LEIFDVPNF+ V+SV+KFV
Sbjct: 745  LRKSSTDAWLSTGIDEAIDG---VLHDQGDVYCVICYESGSLEIFDVPNFNCVFSVEKFV 801

Query: 1617 SGKANLVDAIILDPTDSAHKVKTAQDVTAPGKKEQIQNIRIVELAMHRWPGQHSRPFLFG 1796
            SGK  LVD  + DP     K +++++V+   +KE++QN+R+VEL M RW GQHSRPFLFG
Sbjct: 802  SGKPLLVDTFMGDP----QKSQSSEEVSGLSRKEKLQNMRVVELTMQRWSGQHSRPFLFG 857

Query: 1797 ILSDGTILCYHSYVFDSSDAASKVEEAVTSQKPQNIKSMSGSRLRNLRFIRVTLDTYARE 1976
            IL+DG I CYH+Y+++S D+ SK E + +SQ      + + SRLRNLRF+RV LDTY+R 
Sbjct: 858  ILNDGMIFCYHAYLYESMDSTSKTEVSASSQ------NTTASRLRNLRFVRVPLDTYSRN 911

Query: 1977 ETSPGTRCQRITIFKNVGGCQGLFISGSRPSWYMLFRERLRIHPQLCDGAIVAFTVLHNV 2156
            + S GT CQR+T+FKN+ G QGLF++GSRP+W M+FRER+R+HPQLCDG+IVAFTVLHNV
Sbjct: 912  DLSNGTSCQRMTVFKNIAGNQGLFLAGSRPAWLMVFRERIRVHPQLCDGSIVAFTVLHNV 971

Query: 2157 NCNHGFIYVTSQGNLKICQLPSFLSYDNYWPVQKIPLKGTPHQVTYFADKNLYPLIVSFP 2336
            NCNHG IYVTS+G +KICQLPS  SYDNYWPVQKIPLKGTPHQVTYFA+KNLYPLIVS P
Sbjct: 972  NCNHGLIYVTSEGIMKICQLPSITSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSIP 1031

Query: 2337 VLKPLNQVLSSLVDQEAGQQIEND----VSFEGSYPVEEFEVRIMEPDRSGGPWQTRATI 2504
            V KPLNQVLSSLVDQE   Q+EN          +Y V+EFEVRIMEP++SGGPWQTRATI
Sbjct: 1032 VQKPLNQVLSSLVDQEFSHQVENHNLSPEELHRTYTVDEFEVRIMEPEKSGGPWQTRATI 1091

Query: 2505 PMQSSENALTVRVVTLYNSTTRENETLLAIGTAYVQGEDVAARGRVLLFSVEKNNDNSQT 2684
            PMQ+SENALTVRVVTL+N+TT+ENETLLAIGTAYVQGEDVA RGRVLLFS E N DN Q 
Sbjct: 1092 PMQTSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAGRGRVLLFSAENNVDNPQN 1151

Query: 2685 LVSEVYSKELKGAISALASLQGHLLIASGPKITLHKWIGSELNGVAFFDAPPLYVVSMNI 2864
            LVSEV+SKELKGAISALASLQG+LLIASGPKI LHKW GS+L G+AFFD PPLYVVS+NI
Sbjct: 1152 LVSEVFSKELKGAISALASLQGNLLIASGPKIILHKWTGSDLTGIAFFDVPPLYVVSLNI 1211

Query: 2865 VKNFILLGDIHKSIYFLNWKEQGSQLNLLSKDFGFLDCLATEFLIDGSTLSLMVSDEQKN 3044
            VKNFIL+GDIHKSIYFL+WKEQG+QLNLL+KDFG LDC ATEFLIDGSTLSL V+D QKN
Sbjct: 1212 VKNFILIGDIHKSIYFLSWKEQGAQLNLLAKDFGNLDCFATEFLIDGSTLSLAVADAQKN 1271

Query: 3045 IQVFYYAPKLLNSWKGQKLLSRAEFHIGAHVTKFMRLQLLPTSSDQINAVPGSDKINRFG 3224
            IQ+ YYAPK+  SW+GQKLL+RAEFH+GAHVTKF+RLQ+L TSSD+    PGSDK  R+ 
Sbjct: 1272 IQILYYAPKISESWRGQKLLTRAEFHVGAHVTKFLRLQMLSTSSDRTGKNPGSDKTVRYA 1331

Query: 3225 LLFGTLDGSIGCISPLDELTFRRLQSLQRKLVDAVAHIAGLNPRSFRQFRSNGRAHRPGP 3404
            LLFGTLDG IG I+PL+ELTFRRLQSLQ KLVDAV H+AGLNPRSFRQFRSNG+AHRPGP
Sbjct: 1332 LLFGTLDGGIGSIAPLEELTFRRLQSLQNKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGP 1391

Query: 3405 DNIVDCELLCHYEMLPLEEQLEIAQQVGTTRTQIMSNLNELTLGTSFL 3548
            D+IVDCELL HYEML LEEQLEIAQQ+GTTR QI+SNL++L+LGTSFL
Sbjct: 1392 DSIVDCELLFHYEMLSLEEQLEIAQQIGTTRLQILSNLDDLSLGTSFL 1439


Top