BLASTX nr result

ID: Akebia23_contig00003492 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00003492
         (4895 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  2379   0.0  
ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prun...  2375   0.0  
emb|CBI24510.3| unnamed protein product [Vitis vinifera]             2372   0.0  
ref|XP_007038473.1| Cleavage and polyadenylation specificity fac...  2341   0.0  
ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr...  2321   0.0  
ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec...  2321   0.0  
ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr...  2317   0.0  
ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec...  2316   0.0  
gb|EXC20897.1| Cleavage and polyadenylation specificity factor s...  2286   0.0  
ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec...  2283   0.0  
ref|XP_002318462.2| cleavage and polyadenylation specificity fac...  2266   0.0  
ref|XP_002510905.1| cleavage and polyadenylation specificity fac...  2262   0.0  
ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec...  2260   0.0  
ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec...  2249   0.0  
ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phas...  2236   0.0  
ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec...  2227   0.0  
ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation spec...  2145   0.0  
ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec...  2142   0.0  
ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec...  2140   0.0  
ref|XP_006851814.1| hypothetical protein AMTR_s00041p00037520 [A...  2130   0.0  

>ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 2379 bits (6165), Expect = 0.0
 Identities = 1171/1454 (80%), Positives = 1302/1454 (89%), Gaps = 3/1454 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MSYAA+KMMHWPTGIENCASGF++H  +DFAPQI  I  +DL+S+WP ++ IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA NILEVY++RVQE+ D R SR+ +E KRGG MAG+SGA+LELVC Y+LHGNVETM VL
Sbjct: 61   TAANILEVYMVRVQED-DSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
               G D  +RRDSIILAFQDAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRG ESFAR
Sbjct: 120  PSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRC GVLVYGLQMIILK                        +ESSYVISLR
Sbjct: 180  GPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SAVNLPHDAYKL+ VPSPIGGV+VI ANSIHYHSQS SCALA+N+YAV+ DNSQE+PRSS
Sbjct: 300  SAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAANATWLS+DV MLSTKTGELLLLT+ YDGRVV RL+LSKS+ASVLTSG+  IG
Sbjct: 360  FSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVPKEEVGDIEGDVPSAKRLRMASSDALQDIVSGEE 1463
            NS  FLGSRLGDS+LVQFT  + +SV KEEVGDIEGDVPSAKRLR +SSDALQD+V+GEE
Sbjct: 420  NSLFFLGSRLGDSLLVQFTSILSSSV-KEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478

Query: 1464 LSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVC 1643
            LSLY SAPN++E +QKTFSF+VRDS IN+GPLKDF+YGLRINADP ATGIAKQSNYELVC
Sbjct: 479  LSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVC 538

Query: 1644 CSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDDEYHA 1823
            CSGHGKNGALC+LQQSIRPE+ITEVELPGCKGIWTVYHK+ RGHNADS+KM + DDEYHA
Sbjct: 539  CSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHA 598

Query: 1824 FLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARVLDGS 2003
            +LIISLESRTMVLETADLLGEVTESVDYYVQG TI+AGNLFGRRRV+QV+ARGAR+LDG+
Sbjct: 599  YLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGA 658

Query: 2004 YMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVSISIP 2183
            +MTQDL     ++                  PYVLLRM+DG+IQLLVGDPSTCTVSI+IP
Sbjct: 659  FMTQDLPISESST----------VLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIP 708

Query: 2184 PAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIYCVVC 2363
              FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADGA  DQGDIYCVV 
Sbjct: 709  AVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVS 768

Query: 2364 YESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQN--SKN-EVVAGQVKKE 2534
            YESG LEIFDVP+F CVFSV+KF+SG  HLVDT I E ++D+Q   SKN E  A Q +KE
Sbjct: 769  YESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKE 828

Query: 2535 TAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQNTV 2714
             A N+K+VELAMQRW+G+H+RP+LFG+LTDGT+LCYHAYLYEG EST K EE+VS QN++
Sbjct: 829  NAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSL 888

Query: 2715 DLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAWFM 2894
             +S++S SRLRNLRFVRVPL+TYTREE  + +   RMTVFKN+GG QGLFLSGSRP WFM
Sbjct: 889  SISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFM 948

Query: 2895 VCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPVQK 3074
            V RER+RVHPQLCDGSIVAF VLHN+NCNHGLIYVTS+GFLKICQLP+V SYDNYWPVQK
Sbjct: 949  VFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQK 1008

Query: 3075 IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELHKT 3254
            IPLKGTPHQVTYFAEKNLYPLIVSVPV++PLN VLSSLVDQ+AGHQ+E+DN S DELH++
Sbjct: 1009 IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRS 1068

Query: 3255 YTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGTAY 3434
            Y+VDE+EVR+LEPEKSGAPWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLAIGTAY
Sbjct: 1069 YSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1128

Query: 3435 VQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKVIL 3614
            VQGEDVAARGRVLLFSVG+NTDNSQNLVSE+YSKELKGAISA+ASLQGHLLIASGPK+IL
Sbjct: 1129 VQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIIL 1188

Query: 3615 HKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFG 3794
            HKWTG EL GVAF+D PPLYVVSLNIVKNFIL+GDIH+SIYFLSWKEQGAQL+LLAKDFG
Sbjct: 1189 HKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFG 1248

Query: 3795 NLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 3974
            +LDCF+TEFLIDGSTLSLIVSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1249 SLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1308

Query: 3975 LRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDS 4154
            LRLQMLP SSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKKLVD+
Sbjct: 1309 LRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDA 1368

Query: 4155 VPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQI 4334
            VPH+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLP E+QLE+A QIGTTR QI
Sbjct: 1369 VPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQI 1428

Query: 4335 LTNLNDLSLGTSFL 4376
            L+NLNDLSLGTSFL
Sbjct: 1429 LSNLNDLSLGTSFL 1442


>ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica]
            gi|462416772|gb|EMJ21509.1| hypothetical protein
            PRUPE_ppa000211mg [Prunus persica]
          Length = 1459

 Score = 2375 bits (6156), Expect = 0.0
 Identities = 1158/1459 (79%), Positives = 1294/1459 (88%), Gaps = 8/1459 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWP-ARKGIGPIPNLV 200
            MS+AA+KMMHWPTGIENCASGFISH  SDF P+IP I  EDL+S+WP +R+ IGPIP+LV
Sbjct: 1    MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRIPPIQTEDLESEWPTSRREIGPIPDLV 60

Query: 201  VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            VTAGN+LEVYV+RVQEE+  R  R+  E KRGG M G+SGASLELVCHY+LHGNV TMAV
Sbjct: 61   VTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMAV 120

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            LS  G DG +RRDSIIL F+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HL+RGRESFA
Sbjct: 121  LSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            RGPLVKVDPQGRCG +LVYGLQMIILK                        IESSY+++L
Sbjct: 181  RGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISSRIESSYIVNL 240

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RD+DMKHVKDF+F+HGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI
Sbjct: 241  RDMDMKHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSAVNLPHDAYKL++VPSPIGGVLVI ANSIHYHSQS SCALA+NSYAV+ DNSQE+PRS
Sbjct: 301  WSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEMPRS 360

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            SF VELD ANATWL +DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVLTSG+T +
Sbjct: 361  SFTVELDTANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITKV 420

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            GNS  FLGSRLGDS+LVQFTCG+G SV     K+EVGDIEGD P AKRLRM+SSDALQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPLAKRLRMSSSDALQDM 480

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQSN
Sbjct: 481  VSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 540

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHK+ARGHNADSSK+ + D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAASD 600

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DE+HA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRV+QV+ RGAR
Sbjct: 601  DEFHAYLIISLEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVVQVYERGAR 660

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDGS+MTQDLSFG  NSE                 PYVLLRM+DG I+LLVGDPS CTV
Sbjct: 661  ILDGSFMTQDLSFGTSNSEMGSGSESSTVLSVSIVDPYVLLRMSDGGIRLLVGDPSLCTV 720

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S SIP AFESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGADG  HDQGD+
Sbjct: 721  STSIPAAFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIDEAIDGADGVSHDQGDV 780

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAG 2519
            YCVVCYESG+LEIFDVP+F CVFSV+KF+SG  HL+DT +R+  KD Q   N  +E V+G
Sbjct: 781  YCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINKSSEEVSG 840

Query: 2520 QVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVS 2699
            Q +KE  QNMK+VELAMQRW+G+H+RP+LFG+L DG +LCYHAYL+EG E+ +K E+S S
Sbjct: 841  QGRKENIQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETASKTEDSAS 900

Query: 2700 GQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSR 2879
             QNT  +S++S SRLRNLRFVRVPL+TY +++TS  + CQRMT+FKN+ GYQGLFLSGSR
Sbjct: 901  AQNTTGVSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQGLFLSGSR 960

Query: 2880 PAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNY 3059
            PAWFMV RERLR+HPQLCDGS+VA  VLHNVNCNHGLIYVTS+G LKICQLP + SYDNY
Sbjct: 961  PAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPPITSYDNY 1020

Query: 3060 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239
            WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPV +PLNQVLSSLVDQ+ GHQVE+ N S D
Sbjct: 1021 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLSSD 1080

Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419
            ELH+TY+VDE+E+RI+EP+KSG PWQT+ATIPMQ SENALTVR+VTLFNTT+KENETLLA
Sbjct: 1081 ELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLLA 1140

Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599
            IGTAYVQGEDVA RGRVLLFS G++ DN+Q LVSEVYSKELKGAISALASLQGHLLIASG
Sbjct: 1141 IGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQGHLLIASG 1200

Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779
            PK+ILHKW G EL GVAF+D PPLYVVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQL+LL
Sbjct: 1201 PKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLTLL 1260

Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959
            AKDFGNLDCF+TEFLIDGSTLSL+V+DEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVG 
Sbjct: 1261 AKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGT 1320

Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139
            HVTKFLRLQML TSSDRTG  PGSDKTNR+ALLFGTLDGS+GCIAPLDELTFRRLQ+LQK
Sbjct: 1321 HVTKFLRLQMLSTSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1380

Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319
            KLVD+V H+AGLNPRAFR F+SNGKAHRPGPD ++DCELL HYEMLPLE+QLE+A+QIGT
Sbjct: 1381 KLVDAVHHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLEIANQIGT 1440

Query: 4320 TRTQILTNLNDLSLGTSFL 4376
            TR+QI +NLNDLS+GTSFL
Sbjct: 1441 TRSQIFSNLNDLSIGTSFL 1459


>emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 2372 bits (6148), Expect = 0.0
 Identities = 1171/1460 (80%), Positives = 1302/1460 (89%), Gaps = 9/1460 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MSYAA+KMMHWPTGIENCASGF++H  +DFAPQI  I  +DL+S+WP ++ IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA NILEVY++RVQE+ D R SR+ +E KRGG MAG+SGA+LELVC Y+LHGNVETM VL
Sbjct: 61   TAANILEVYMVRVQED-DSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
               G D  +RRDSIILAFQDAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRG ESFAR
Sbjct: 120  PSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRC GVLVYGLQMIILK                        +ESSYVISLR
Sbjct: 180  GPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SAVNLPHDAYKL+ VPSPIGGV+VI ANSIHYHSQS SCALA+N+YAV+ DNSQE+PRSS
Sbjct: 300  SAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAANATWLS+DV MLSTKTGELLLLT+ YDGRVV RL+LSKS+ASVLTSG+  IG
Sbjct: 360  FSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVPKEEVGDIEGDVPSAKRLRMASSDALQDIVSGEE 1463
            NS  FLGSRLGDS+LVQFT  + +SV KEEVGDIEGDVPSAKRLR +SSDALQD+V+GEE
Sbjct: 420  NSLFFLGSRLGDSLLVQFTSILSSSV-KEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478

Query: 1464 LSLYSSAPNTSELAQ------KTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQS 1625
            LSLY SAPN++E +Q      KTFSF+VRDS IN+GPLKDF+YGLRINADP ATGIAKQS
Sbjct: 479  LSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQS 538

Query: 1626 NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSL 1805
            NYELVCCSGHGKNGALC+LQQSIRPE+ITEVELPGCKGIWTVYHK+ RGHNADS+KM + 
Sbjct: 539  NYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATK 598

Query: 1806 DDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGA 1985
            DDEYHA+LIISLESRTMVLETADLLGEVTESVDYYVQG TI+AGNLFGRRRV+QV+ARGA
Sbjct: 599  DDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGA 658

Query: 1986 RVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCT 2165
            R+LDG++MTQDL     ++                  PYVLLRM+DG+IQLLVGDPSTCT
Sbjct: 659  RILDGAFMTQDLPISESST----------VLSVSIADPYVLLRMSDGNIQLLVGDPSTCT 708

Query: 2166 VSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGD 2345
            VSI+IP  FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADGA  DQGD
Sbjct: 709  VSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGD 768

Query: 2346 IYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQN--SKN-EVVA 2516
            IYCVV YESG LEIFDVP+F CVFSV+KF+SG  HLVDT I E ++D+Q   SKN E  A
Sbjct: 769  IYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEA 828

Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696
             Q +KE A N+K+VELAMQRW+G+H+RP+LFG+LTDGT+LCYHAYLYEG EST K EE+V
Sbjct: 829  DQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAV 888

Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876
            S QN++ +S++S SRLRNLRFVRVPL+TYTREE  + +   RMTVFKN+GG QGLFLSGS
Sbjct: 889  SAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGS 948

Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056
            RP WFMV RER+RVHPQLCDGSIVAF VLHN+NCNHGLIYVTS+GFLKICQLP+V SYDN
Sbjct: 949  RPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDN 1008

Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236
            YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPV++PLN VLSSLVDQ+AGHQ+E+DN S 
Sbjct: 1009 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSS 1068

Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416
            DELH++Y+VDE+EVR+LEPEKSGAPWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLL
Sbjct: 1069 DELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1128

Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596
            AIGTAYVQGEDVAARGRVLLFSVG+NTDNSQNLVSE+YSKELKGAISA+ASLQGHLLIAS
Sbjct: 1129 AIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIAS 1188

Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776
            GPK+ILHKWTG EL GVAF+D PPLYVVSLNIVKNFIL+GDIH+SIYFLSWKEQGAQL+L
Sbjct: 1189 GPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNL 1248

Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956
            LAKDFG+LDCF+TEFLIDGSTLSLIVSD+QKN+QIFYYAPKMSESWKGQKLLSRAEFHVG
Sbjct: 1249 LAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1308

Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136
            AHVTKFLRLQMLP SSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQ
Sbjct: 1309 AHVTKFLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1368

Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316
            KKLVD+VPH+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLP E+QLE+A QIG
Sbjct: 1369 KKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIG 1428

Query: 4317 TTRTQILTNLNDLSLGTSFL 4376
            TTR QIL+NLNDLSLGTSFL
Sbjct: 1429 TTRMQILSNLNDLSLGTSFL 1448


>ref|XP_007038473.1| Cleavage and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao] gi|508775718|gb|EOY22974.1| Cleavage
            and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao]
          Length = 1457

 Score = 2342 bits (6068), Expect = 0.0
 Identities = 1142/1458 (78%), Positives = 1290/1458 (88%), Gaps = 7/1458 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MSYAA+KMMHWPTGIENCASGF++HC +DF PQIP    EDL+S+WPAR+GIGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA N+LE+YV+RVQEE   R +R+ +E KRGG + G+SG SLELVC+Y+LHGNVE+MAVL
Sbjct: 61   TAANLLEIYVVRVQEE-GRREARNSTEVKRGGVLDGVSGVSLELVCNYRLHGNVESMAVL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S+ G DG +RRDSIILAF+DAKISVLEFDDSI+GLRT+SMHCFEGPEW HLKRGRESFAR
Sbjct: 120  SIGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRCGGVLVY LQMIILK                        +ESSY+I+LR
Sbjct: 180  GPLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLD+KH+KDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SAVNLPHDAYKL++VPSPIGGVLVI AN+IHYHSQS SCALA+N+YA++VDNSQ+LPRS+
Sbjct: 300  SAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSN 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAANATWL +DV +LSTKTGELLLLT++YDGRVVQRL+LSKSKASVLTS +TTIG
Sbjct: 360  FSVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451
            NS  FLGSRLGDS+LVQF+ G G S      KEEVGDIEGDVP AKRLR +SSDALQD+V
Sbjct: 420  NSLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMV 479

Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631
             GEELSLY SAPN +E AQKTF FAVRDSL N+GPLKDFSYGLRINAD NATGIAKQSNY
Sbjct: 480  GGEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNY 539

Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811
            ELVCCSGHGKNGALCVL+QSIRPE+ITEVEL GCKGIWTVYHKS R H+AD SK+   DD
Sbjct: 540  ELVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDD 599

Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991
            EYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG TIAAGNLFGRRRV+QV+ RGAR+
Sbjct: 600  EYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARI 659

Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171
            LDGS+MTQ+LS  +PNSE                 PYVLLRMTDGSI LLVGDP+TCTVS
Sbjct: 660  LDGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVS 719

Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351
            I+ P AFE SK  ++ACTLYHDKGPEPWLRK STDAWLSTG+GE+IDGADG  HDQGDIY
Sbjct: 720  INTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIY 779

Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522
            CVVCYESG LEIFDVP+F CVFS+EKF SG T LVD +  E++KDS+   N  +E + GQ
Sbjct: 780  CVVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQ 839

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +KE  QN+K+VELAMQRW+  H+RP+LFG+LTDGT+LCYHAYL+EG E+ +K+E+SV  
Sbjct: 840  GRKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVA 899

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
            QN+V LS+I+ SRLRNLRF+R+PL+ YTREE S  +  QR+T+FKN+ GYQG FLSGSRP
Sbjct: 900  QNSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRP 959

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
            AWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQ+PS  +YDNYW
Sbjct: 960  AWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYW 1019

Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242
            PVQKIPL+GTPHQVTYFAE+NLYP+IVSVPV +P+NQVLSSLVDQ+ GHQ+++ N S DE
Sbjct: 1020 PVQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDE 1079

Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422
            L +TYTVDE+EVRILEPEKSG PW+T+ATIPMQ+SENALTVR+VTLFNTT+KENE+LLAI
Sbjct: 1080 LQRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAI 1139

Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602
            GTAY+QGEDVAARGRV+L S+GRNTDN QNLVSEVYSKELKGAISALASLQGHLLIASGP
Sbjct: 1140 GTAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGP 1199

Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782
            K+ILH WTG+EL G+AFYD PPLYVVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQLSLLA
Sbjct: 1200 KIILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLA 1259

Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962
            KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH
Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319

Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142
            VTKFLRLQML TSSDRT AT GSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK
Sbjct: 1320 VTKFLRLQMLSTSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379

Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322
            LVD+VPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QL++AHQIGTT
Sbjct: 1380 LVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTT 1439

Query: 4323 RTQILTNLNDLSLGTSFL 4376
            R+QIL+NLNDL+LGTSFL
Sbjct: 1440 RSQILSNLNDLTLGTSFL 1457


>ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523633|gb|ESR35000.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1457

 Score = 2321 bits (6016), Expect = 0.0
 Identities = 1133/1458 (77%), Positives = 1281/1458 (87%), Gaps = 7/1458 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA+KMMHWPTGI NC SGFI+H  +D+ PQIP I  E+LDS+ P+++GIGP+PNLVV
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA N++E+YV+RVQEE   + S++  E KR   M G+S ASLELVCHY+LHGNVE++A+L
Sbjct: 61   TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S  G+D  +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR
Sbjct: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRCGGVLVYGLQMIILK                        IESS+VI+LR
Sbjct: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS
Sbjct: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+  SVLTS +TTIG
Sbjct: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDIV 1451
            NS  FLGSRLGDS+LVQFTCG GTS+    PKEE GDIE D PS KRLR +SSDALQD+V
Sbjct: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMV 479

Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631
            +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY
Sbjct: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539

Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811
            ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHN DSS+M + DD
Sbjct: 540  ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDD 599

Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991
            EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+
Sbjct: 600  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659

Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171
            LDGSYMTQDLSFG  NSE                 PYVLL M+DGSI+LLVGDPSTCTVS
Sbjct: 660  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719

Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351
            +  P A ESSK  ++ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY
Sbjct: 720  VQTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779

Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522
             VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE  KDS+   NS +E   GQ
Sbjct: 780  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +KE   +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS 
Sbjct: 840  GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVST 899

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
              ++ +S++S SRLRNLRF R PL+ YTREET   +PCQR+T+FKN+ G+QG FLSGSRP
Sbjct: 900  SRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
             W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS  +YDNYW
Sbjct: 960  CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019

Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242
            PVQKIPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S  +
Sbjct: 1020 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1079

Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422
            LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KEN+TLLAI
Sbjct: 1080 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAI 1139

Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602
            GTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASGP
Sbjct: 1140 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1199

Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782
            K+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLA
Sbjct: 1200 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1259

Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962
            KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH
Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319

Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142
            VTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK
Sbjct: 1320 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379

Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322
            LVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GTT
Sbjct: 1380 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1439

Query: 4323 RTQILTNLNDLSLGTSFL 4376
            R+QIL+NLNDL+LGTSFL
Sbjct: 1440 RSQILSNLNDLALGTSFL 1457


>ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X2 [Citrus sinensis]
          Length = 1457

 Score = 2321 bits (6014), Expect = 0.0
 Identities = 1133/1458 (77%), Positives = 1282/1458 (87%), Gaps = 7/1458 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA+KMMHWPTGI NC SGFI+H  +D+ PQIP I  E+LDS+ P+++GIGP+PNLVV
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA N++E+YV+RVQEE   + S++  E KR   M G+S ASLELVCHY+LHGNVE++A+L
Sbjct: 61   TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S  G+D  +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR
Sbjct: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRCGGVLVYGLQMIILK                        IESS+VI+LR
Sbjct: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS
Sbjct: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+  SVLTS +TTIG
Sbjct: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451
            NS  FLGSRLGDS+LVQFTCG GTS+     KEE GDIE D PS KRLR +SSDALQD+V
Sbjct: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479

Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631
            +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY
Sbjct: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539

Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811
            ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHNADSS+M + DD
Sbjct: 540  ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 599

Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991
            EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+
Sbjct: 600  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659

Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171
            LDGSYMTQDLSFG  NSE                 PYVLL M+DGSI+LLVGDPSTCTVS
Sbjct: 660  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719

Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351
            +  P A ESSK  +++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY
Sbjct: 720  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779

Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522
             VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE  KDS+   NS +E   GQ
Sbjct: 780  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +KE   +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS 
Sbjct: 840  GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 899

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
              ++ +S++S SRLRNLRF R+PL+ YTREET   +PCQR+T+FKN+ G+QG FLSGSRP
Sbjct: 900  SRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
             W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS  +YDNYW
Sbjct: 960  CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019

Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242
            PVQKIPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S  +
Sbjct: 1020 PVQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1079

Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422
            LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLAI
Sbjct: 1080 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1139

Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602
            GTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASGP
Sbjct: 1140 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1199

Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782
            K+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLA
Sbjct: 1200 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1259

Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962
            KDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGAH
Sbjct: 1260 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1319

Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142
            VTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKK
Sbjct: 1320 VTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKK 1379

Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322
            LVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GTT
Sbjct: 1380 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1439

Query: 4323 RTQILTNLNDLSLGTSFL 4376
            R+QIL+NLNDL+LGTSFL
Sbjct: 1440 RSQILSNLNDLALGTSFL 1457


>ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523632|gb|ESR34999.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1458

 Score = 2317 bits (6004), Expect = 0.0
 Identities = 1133/1459 (77%), Positives = 1281/1459 (87%), Gaps = 8/1459 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA+KMMHWPTGI NC SGFI+H  +D+ PQIP I  E+LDS+ P+++GIGP+PNLVV
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA N++E+YV+RVQEE   + S++  E KR   M G+S ASLELVCHY+LHGNVE++A+L
Sbjct: 61   TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S  G+D  +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR
Sbjct: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRCGGVLVYGLQMIILK                        IESS+VI+LR
Sbjct: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS
Sbjct: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+  SVLTS +TTIG
Sbjct: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDIV 1451
            NS  FLGSRLGDS+LVQFTCG GTS+    PKEE GDIE D PS KRLR +SSDALQD+V
Sbjct: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMV 479

Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631
            +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY
Sbjct: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539

Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811
            ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHN DSS+M + DD
Sbjct: 540  ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDD 599

Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991
            EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+
Sbjct: 600  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659

Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171
            LDGSYMTQDLSFG  NSE                 PYVLL M+DGSI+LLVGDPSTCTVS
Sbjct: 660  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719

Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351
            +  P A ESSK  ++ACTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY
Sbjct: 720  VQTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779

Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522
             VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE  KDS+   NS +E   GQ
Sbjct: 780  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +KE   +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS 
Sbjct: 840  GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVST 899

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
              ++ +S++S SRLRNLRF R PL+ YTREET   +PCQR+T+FKN+ G+QG FLSGSRP
Sbjct: 900  SRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
             W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS  +YDNYW
Sbjct: 960  CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019

Query: 3063 PVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239
            PVQK IPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S  
Sbjct: 1020 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1079

Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419
            +LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KEN+TLLA
Sbjct: 1080 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLA 1139

Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599
            IGTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASG
Sbjct: 1140 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1199

Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779
            PK+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL
Sbjct: 1200 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1259

Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959
            AKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGA
Sbjct: 1260 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1319

Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139
            HVTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQK
Sbjct: 1320 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1379

Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319
            KLVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GT
Sbjct: 1380 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1439

Query: 4320 TRTQILTNLNDLSLGTSFL 4376
            TR+QIL+NLNDL+LGTSFL
Sbjct: 1440 TRSQILSNLNDLALGTSFL 1458


>ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Citrus sinensis]
          Length = 1458

 Score = 2316 bits (6002), Expect = 0.0
 Identities = 1133/1459 (77%), Positives = 1282/1459 (87%), Gaps = 8/1459 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA+KMMHWPTGI NC SGFI+H  +D+ PQIP I  E+LDS+ P+++GIGP+PNLVV
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TA N++E+YV+RVQEE   + S++  E KR   M G+S ASLELVCHY+LHGNVE++A+L
Sbjct: 61   TAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S  G+D  +RRDSIILAF+DAKISVLEFDDSI+GLR +SMHCFE PEW HLKRGRESFAR
Sbjct: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GPLVKVDPQGRCGGVLVYGLQMIILK                        IESS+VI+LR
Sbjct: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLDMKHVKDF FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLIW
Sbjct: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SA+NLPHDAYKL++VPSPIGGVLV+GAN+IHYHSQS SCALA+N+YAV++D+SQELPRSS
Sbjct: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            F+VELDAA+ATWL +DV +LSTKTG+L+LLTVVYDGRVVQRL+LSK+  SVLTS +TTIG
Sbjct: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDIV 1451
            NS  FLGSRLGDS+LVQFTCG GTS+     KEE GDIE D PS KRLR +SSDALQD+V
Sbjct: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479

Query: 1452 SGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNY 1631
            +GEELSLY SA N +E AQKTFSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNY
Sbjct: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539

Query: 1632 ELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDD 1811
            ELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS+RGHNADSS+M + DD
Sbjct: 540  ELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 599

Query: 1812 EYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARV 1991
            EYHA+LIISLE+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR+
Sbjct: 600  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 659

Query: 1992 LDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVS 2171
            LDGSYMTQDLSFG  NSE                 PYVLL M+DGSI+LLVGDPSTCTVS
Sbjct: 660  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 719

Query: 2172 ISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIY 2351
            +  P A ESSK  +++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY
Sbjct: 720  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 779

Query: 2352 CVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAGQ 2522
             VVCYESG LEIFDVP+F CVF+V+KF+SG TH+VDT++RE  KDS+   NS +E   GQ
Sbjct: 780  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 839

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +KE   +MK+VELAMQRW+G H+RP+LF +LTDGT+LCY AYL+EG E+T+K ++ VS 
Sbjct: 840  GRKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 899

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
              ++ +S++S SRLRNLRF R+PL+ YTREET   +PCQR+T+FKN+ G+QG FLSGSRP
Sbjct: 900  SRSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRP 959

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
             W MV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTS+G LKICQLPS  +YDNYW
Sbjct: 960  CWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 1019

Query: 3063 PVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPD 3239
            PVQK IPLK TPHQ+TYFAEKNLYPLIVSVPV++PLNQVLS L+DQ+ GHQ+++ N S  
Sbjct: 1020 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1079

Query: 3240 ELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLA 3419
            +LH+TYTV+EYEVRILEP+++G PWQTRATIPMQ+SENALTVR+VTLFNTT+KENETLLA
Sbjct: 1080 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLA 1139

Query: 3420 IGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASG 3599
            IGTAYVQGEDVAARGRVLLFS GRN DN QNLV+EVYSKELKGAISALASLQGHLLIASG
Sbjct: 1140 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASG 1199

Query: 3600 PKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLL 3779
            PK+ILHKWTG EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LL
Sbjct: 1200 PKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLL 1259

Query: 3780 AKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGA 3959
            AKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHVGA
Sbjct: 1260 AKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1319

Query: 3960 HVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQK 4139
            HVTKFLRLQML TSSDRTGA PGSDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQK
Sbjct: 1320 HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1379

Query: 4140 KLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGT 4319
            KLVDSVPH+AGLNPR+FR F SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ GT
Sbjct: 1380 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1439

Query: 4320 TRTQILTNLNDLSLGTSFL 4376
            TR+QIL+NLNDL+LGTSFL
Sbjct: 1440 TRSQILSNLNDLALGTSFL 1458


>gb|EXC20897.1| Cleavage and polyadenylation specificity factor subunit 1 [Morus
            notabilis]
          Length = 1479

 Score = 2286 bits (5923), Expect = 0.0
 Identities = 1132/1487 (76%), Positives = 1283/1487 (86%), Gaps = 36/1487 (2%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPA-RKGIGPIPNLV 200
            MS+AA+KMMHWPTGIENCA+GF+SH  +DF P+IP I ++DLDSDWPA R+  GP+PNLV
Sbjct: 1    MSFAAYKMMHWPTGIENCAAGFVSHSRADFVPRIPPIQSDDLDSDWPAGRRETGPVPNLV 60

Query: 201  VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            VTAGN+LEVYV+R+QEE+D R SR+P+E++RGG M GLSGASLELVCHY+LHGNV+T+AV
Sbjct: 61   VTAGNVLEVYVVRLQEEDDTRSSRAPAESRRGGLMDGLSGASLELVCHYRLHGNVQTIAV 120

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            LS  G DG +RRDSIIL+FQDAKISVLEFDDSI+GLRTSSMHCFEGPEW +LKRGRESFA
Sbjct: 121  LSSGGGDGSRRRDSIILSFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWLYLKRGRESFA 180

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            RGPLVKVDPQGRC GVL Y +QMI+LK                        IESSY+I+L
Sbjct: 181  RGPLVKVDPQGRCAGVLAYNIQMIMLKAAQAGSGLVGEEDALGSGGAVSARIESSYIINL 240

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RDLDMKH+KDF FVHGYIEPVMVILHERELTWAGR+ WKHHTCMISALSISTTLKQ PLI
Sbjct: 241  RDLDMKHIKDFVFVHGYIEPVMVILHERELTWAGRVLWKHHTCMISALSISTTLKQHPLI 300

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSAVNLPHDAYKL++VPSPIGGVLVI AN++HY SQS SC LA+NSYAV+VD+SQE+ R+
Sbjct: 301  WSAVNLPHDAYKLLAVPSPIGGVLVICANTLHYQSQSNSCTLALNSYAVSVDSSQEMRRA 360

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
             F+VELDAANATWLS+DVV+LSTK GELLLLT+VYDGRVVQRL+LSKSKASVLTSG+TTI
Sbjct: 361  PFSVELDAANATWLSNDVVLLSTKAGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITTI 420

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSV----PKEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            GNS  FLGSRLGDS+LVQFT G+GTS+     K+EVGDIEGD   AKRLR +SSD LQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTYGLGTSMLSSGLKDEVGDIEGDAHLAKRLRRSSSDVLQDM 480

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
             SGEELSLY SAPN SE  QK+FSF VRDSL+N+GPLKDFSYGLRINADPNATG+AKQSN
Sbjct: 481  TSGEELSLYVSAPNNSESTQKSFSFTVRDSLVNVGPLKDFSYGLRINADPNATGVAKQSN 540

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNGALCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R H  DSSK+V+ D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRSH--DSSKLVAAD 598

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG TIAAGNLFGRRRV+QV+ RGAR
Sbjct: 599  DEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGAR 658

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDGS+MTQDLSFG   SE                 PYV+LRM+DGSI+LLVGDP++CTV
Sbjct: 659  ILDGSFMTQDLSFGPAPSESSSGSENAVVTSVSIADPYVVLRMSDGSIRLLVGDPTSCTV 718

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S+S P  FESSK  I+ACTLY DKGPEPWLRKTSTDAWLSTG+ EAIDGAD  L DQGDI
Sbjct: 719  SVSTPADFESSKSIISACTLYRDKGPEPWLRKTSTDAWLSTGVDEAIDGADETLQDQGDI 778

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQ---NSKNEVVAG 2519
            YCVVCYESG+L+I+DVPSF  VFSV+ FISG  HLVD F++E  KD Q   N  +E  AG
Sbjct: 779  YCVVCYESGSLDIYDVPSFNYVFSVDNFISGRPHLVDAFVQEQPKDLQKATNKNSEESAG 838

Query: 2520 QVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVS 2699
            Q +KE  QNMKIVELAMQRW+G+H+RP+L G+LTDG++LCYHAYL+EG EST++ E+SVS
Sbjct: 839  QGRKENVQNMKIVELAMQRWSGKHSRPFLLGILTDGSILCYHAYLFEGPESTSRTEDSVS 898

Query: 2700 GQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSR 2879
             +N+      S SRLRNLRFVRVPL++Y REETS   PCQR++VFKN+ GYQGLFLSGSR
Sbjct: 899  SRNS------SGSRLRNLRFVRVPLDSYAREETSDGMPCQRISVFKNIAGYQGLFLSGSR 952

Query: 2880 PAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNY 3059
            PAWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHG IYVTSEG LKICQLPS+ SYDNY
Sbjct: 953  PAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSEGILKICQLPSITSYDNY 1012

Query: 3060 WPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236
            WPVQK IPLKGTPHQVTYFAE+NLYPLIVSVPV +PLNQV+SSL+DQ+ GHQ E+ N SP
Sbjct: 1013 WPVQKVIPLKGTPHQVTYFAERNLYPLIVSVPVPKPLNQVMSSLLDQEVGHQFENPNLSP 1072

Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416
            D+L++TYT+DE+EVRILEPE+SG PWQT+ TIPMQ+SENALT+R+VTLFNTT+ ENETLL
Sbjct: 1073 DDLNRTYTIDEFEVRILEPERSGGPWQTKVTIPMQSSENALTIRVVTLFNTTTNENETLL 1132

Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGR---------------------------NTDNSQNL 3515
            AIGTAYVQGEDVAARGR++L ++                             ++ +    
Sbjct: 1133 AIGTAYVQGEDVAARGRIILRALAPWWERLHLHPGSRVQIPEMASPSGVFKIDSADFHLQ 1192

Query: 3516 VSEVYSKELKGAISALASLQGHLLIASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIV 3695
            VSE+YSKELKGAISALASLQGHLLIASGPK+ILHKWTG EL G+AF+D PPLYVVSLNIV
Sbjct: 1193 VSEIYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFFDAPPLYVVSLNIV 1252

Query: 3696 KNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNV 3875
            KNFIL+GD+HKSIYFLSWKEQGAQLSLLAKDFG+LDCF+TEFLIDGSTLSL+VSD+QKN+
Sbjct: 1253 KNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNI 1312

Query: 3876 QIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFAL 4055
            QIFYYAPKMSESWKGQ+LLSRAEFHVGAHVTKFLRLQMLPTS+DRTG+TPGSDKTNRFAL
Sbjct: 1313 QIFYYAPKMSESWKGQRLLSRAEFHVGAHVTKFLRLQMLPTSTDRTGSTPGSDKTNRFAL 1372

Query: 4056 LFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPD 4235
            LFG LDGS+GCIAPLDELTFRRLQ+LQKKLVD+VPH+AGLNPR+FR F SNGKAHRPGPD
Sbjct: 1373 LFGALDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFCSNGKAHRPGPD 1432

Query: 4236 NMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQILTNLNDLSLGTSFL 4376
            +++DCELL HYEMLPLE+QLE+AH IGTTR+QIL+NLNDL LGTSFL
Sbjct: 1433 SIVDCELLCHYEMLPLEEQLEIAHLIGTTRSQILSNLNDLFLGTSFL 1479


>ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Fragaria vesca subsp. vesca]
          Length = 1439

 Score = 2283 bits (5917), Expect = 0.0
 Identities = 1125/1456 (77%), Positives = 1271/1456 (87%), Gaps = 5/1456 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPA-RKGIGPIPNLV 200
            MSYAA KMMHWPTGIENCA+GFI+H  +DF P+IP I  +DLDSDWPA R+ IGP+PNLV
Sbjct: 1    MSYAAHKMMHWPTGIENCAAGFITHSRADFVPRIPQIQTDDLDSDWPAPRREIGPVPNLV 60

Query: 201  VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            VTA N+LEVYV+RVQE++  R SR+  E+KRGG M G++GASLELVCHY+LHGNV TMAV
Sbjct: 61   VTAANVLEVYVVRVQEQDTARGSRASGESKRGGLMDGVAGASLELVCHYRLHGNVMTMAV 120

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            LS  G DG KRRD+IIL F+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HL+RGRESFA
Sbjct: 121  LSSGGGDGSKRRDAIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            RGP VKVDPQGRCGGVLVY LQ+IILK                        +ESSY+ISL
Sbjct: 181  RGPSVKVDPQGRCGGVLVYDLQLIILKAAQGGYGLVGDDDGFASGAAISARVESSYIISL 240

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RD+DMKHVKDF+FVHGYIEPV+VILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI
Sbjct: 241  RDMDMKHVKDFTFVHGYIEPVLVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSA+NLPHDAYKL++VPSPIGGVLVI ANSIHYHSQS SCALA+NSYA +VD+SQE+PRS
Sbjct: 301  WSAINLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAGSVDSSQEMPRS 360

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            SF VELDAANA+WLS+DV++LSTKTGELLLLT+VYDGRVV RL+LSKSKASVLTSG+ T+
Sbjct: 361  SFTVELDAANASWLSNDVILLSTKTGELLLLTLVYDGRVVHRLDLSKSKASVLTSGIATV 420

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            GNS  FLGSRLGDS+LVQFT G+G S+     K+EVGDIEGD PSAKRLRM+SSDALQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTSGVGASMLSADLKDEVGDIEGDAPSAKRLRMSSSDALQDM 480

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            +SGEELSLY SA N +E AQ++FSFAVRDSL+N+GPLKDFSYGLRINAD NATGIAKQSN
Sbjct: 481  ISGEELSLYGSAQNNAESAQRSFSFAVRDSLVNVGPLKDFSYGLRINADANATGIAKQSN 540

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNGALCVL+QSIRPE+ITEV LPGCKGIWTVYHK+ARGHNA+S      D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVALPGCKGIWTVYHKNARGHNAES-----YD 595

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHAFLIISLE+RTMVLETAD L EVT+ VDY++QG TIAAGNLFGRRRV+Q++ RGAR
Sbjct: 596  DEYHAFLIISLEARTMVLETADHLSEVTDKVDYFLQGRTIAAGNLFGRRRVVQIYERGAR 655

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +L+G YMTQDLSFGA NSE                 PYVLLRM+DG I+LLVGDPS+CTV
Sbjct: 656  ILEGYYMTQDLSFGASNSESGSGSESATVLSVSIVDPYVLLRMSDGGIRLLVGDPSSCTV 715

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S+S P AFESSK  ++ACTLYHD+GPEPWLRK+STDAWLSTG+ EAIDG    LHDQGD+
Sbjct: 716  SVSNPAAFESSKKLVSACTLYHDEGPEPWLRKSSTDAWLSTGIDEAIDGV---LHDQGDV 772

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQVK 2528
            YCV+CYESG+LEIFDVP+F CVFSVEKF+SG   LVDTF+ +  K SQ+S  E V+G  +
Sbjct: 773  YCVICYESGSLEIFDVPNFNCVFSVEKFVSGKPLLVDTFMGDPQK-SQSS--EEVSGLSR 829

Query: 2529 KETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQN 2708
            KE  QNM++VEL MQRW+G+H+RP+LFG+L DG + CYHAYLYE  +ST+K E S S QN
Sbjct: 830  KEKLQNMRVVELTMQRWSGQHSRPFLFGILNDGMIFCYHAYLYESMDSTSKTEVSASSQN 889

Query: 2709 TVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAW 2888
            T      + SRLRNLRFVRVPL+TY+R + S  + CQRMTVFKN+ G QGLFL+GSRPAW
Sbjct: 890  T------TASRLRNLRFVRVPLDTYSRNDLSNGTSCQRMTVFKNIAGNQGLFLAGSRPAW 943

Query: 2889 FMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPV 3068
             MV RER+RVHPQLCDGSIVAF VLHNVNCNHGLIYVTSEG +KICQLPS+ SYDNYWPV
Sbjct: 944  LMVFRERIRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSEGIMKICQLPSITSYDNYWPV 1003

Query: 3069 QKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELH 3248
            QKIPLKGTPHQVTYFAEKNLYPLIVS+PV +PLNQVLSSLVDQ+  HQVE+ N SP+ELH
Sbjct: 1004 QKIPLKGTPHQVTYFAEKNLYPLIVSIPVQKPLNQVLSSLVDQEFSHQVENHNLSPEELH 1063

Query: 3249 KTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGT 3428
            +TYTVDE+EVRI+EPEKSG PWQTRATIPMQ SENALTVR+VTLFNTT+KENETLLAIGT
Sbjct: 1064 RTYTVDEFEVRIMEPEKSGGPWQTRATIPMQTSENALTVRVVTLFNTTTKENETLLAIGT 1123

Query: 3429 AYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKV 3608
            AYVQGEDVA RGRVLLFS   N DN QNLVSEV+SKELKGAISALASLQG+LLIASGPK+
Sbjct: 1124 AYVQGEDVAGRGRVLLFSAENNVDNPQNLVSEVFSKELKGAISALASLQGNLLIASGPKI 1183

Query: 3609 ILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKD 3788
            ILHKWTG++LTG+AF+D PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQL+LLAKD
Sbjct: 1184 ILHKWTGSDLTGIAFFDVPPLYVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLNLLAKD 1243

Query: 3789 FGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 3968
            FGNLDCF+TEFLIDGSTLSL V+D QKN+QI YYAPK+SESW+GQKLL+RAEFHVGAHVT
Sbjct: 1244 FGNLDCFATEFLIDGSTLSLAVADAQKNIQILYYAPKISESWRGQKLLTRAEFHVGAHVT 1303

Query: 3969 KFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLV 4148
            KFLRLQML TSSDRTG  PGSDKT R+ALLFGTLDG +G IAPL+ELTFRRLQ+LQ KLV
Sbjct: 1304 KFLRLQMLSTSSDRTGKNPGSDKTVRYALLFGTLDGGIGSIAPLEELTFRRLQSLQNKLV 1363

Query: 4149 DSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRT 4328
            D+VPH+AGLNPR+FR FRSNGKAHRPGPD+++DCELL+HYEML LE+QLE+A QIGTTR 
Sbjct: 1364 DAVPHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLFHYEMLSLEEQLEIAQQIGTTRL 1423

Query: 4329 QILTNLNDLSLGTSFL 4376
            QIL+NL+DLSLGTSFL
Sbjct: 1424 QILSNLDDLSLGTSFL 1439


>ref|XP_002318462.2| cleavage and polyadenylation specificity factor family protein
            [Populus trichocarpa] gi|550326263|gb|EEE96682.2|
            cleavage and polyadenylation specificity factor family
            protein [Populus trichocarpa]
          Length = 1455

 Score = 2266 bits (5871), Expect = 0.0
 Identities = 1123/1464 (76%), Positives = 1271/1464 (86%), Gaps = 13/1464 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARK----GIGPIP 191
            MSYAA+KMMHWPT I+ C SGF++H  S+ A  +P +  +DLDSDWP+R+    GIGP P
Sbjct: 1    MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59

Query: 192  NLVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVET 371
            NL+V +GN+LE+YV+RVQEE     +RS  E KRGG M G++GASLELVCHY+LHGNVE+
Sbjct: 60   NLIVASGNVLELYVVRVQEEG----ARSSGELKRGGVMDGVAGASLELVCHYRLHGNVES 115

Query: 372  MAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRE 551
            M VLS+EG D  +RRDSIILAF+DAKISVLEFDDSI+GLRTSSMHCFEGP+W HLKRGRE
Sbjct: 116  MGVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRE 175

Query: 552  SFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYV 731
            SFARGPLVKVDPQGRCGGVLVY LQMIILK                       HI SSY+
Sbjct: 176  SFARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYI 235

Query: 732  ISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQ 911
            I+LRDLDMKHVKDF FVH YIEPV+V+LHERELTWAGR+ WKHHTCMISALSISTTLKQ 
Sbjct: 236  INLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQP 295

Query: 912  PLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQEL 1091
             LIWS  NLPHDAYKL++VPSPIGGVLVIG N+IHYHS+S SCALA+NSYA +VD+SQEL
Sbjct: 296  TLIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQEL 355

Query: 1092 PRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGV 1271
            PR++F+VELDAANATWL  DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVLTS +
Sbjct: 356  PRATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDI 415

Query: 1272 TTIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDAL 1439
            TT+GNSF FLGSRLGDS+LVQFT G+G+S+     KEEVGDIEGD+PSAKRL+++SSDAL
Sbjct: 416  TTLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDAL 475

Query: 1440 QDIVSGEELSLYSSAPNTSELAQ-----KTFSFAVRDSLINIGPLKDFSYGLRINADPNA 1604
            QD+VSGEELSLYSSAPN +E +Q     KTFSF VRDSLIN+GPLKDF+YGLRINAD NA
Sbjct: 476  QDMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANA 535

Query: 1605 TGIAKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNAD 1784
            TGI+KQSNYELVCCSGHGKNGALCVLQQSIRPE+ITEVELPGCKGIWTVYHK+AR H+ D
Sbjct: 536  TGISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARSHSVD 595

Query: 1785 SSKMVSLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVI 1964
            S KM S DDEYHA+LIIS+E+RTMVLETAD L EVTESVDY+VQG TIAAGNLFGRRRV+
Sbjct: 596  SLKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVV 654

Query: 1965 QVFARGARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLV 2144
            QVF RGAR+LDGS+MTQDLSFG  NSE                 PYVL+RM DGSIQ+LV
Sbjct: 655  QVFERGARILDGSFMTQDLSFGGSNSETGRSESSTVMHVSIVD-PYVLVRMADGSIQILV 713

Query: 2145 GDPSTCTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADG 2324
            GDPS CTVS++ P AF+SS  S++ACTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD 
Sbjct: 714  GDPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADS 773

Query: 2325 ALHDQGDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKN 2504
              H+QGDIYCVVCYE+G LEIFDVP+F  VF V+KF+SG THL+DT   E AKD      
Sbjct: 774  GAHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDMMKGVK 833

Query: 2505 EVVAGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKI 2684
            E VAG  +KE+ QNMK+VEL M RW+G H+RP+LFG+LTDGT+LCYHAYL+EG + T+K+
Sbjct: 834  EEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTSKL 893

Query: 2685 EESVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLF 2864
            E+SVS QN+V  S+IS SRLRNLRFVRVPL+TYTREETS+ + CQR+T FKN+ GYQG F
Sbjct: 894  EDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQGFF 953

Query: 2865 LSGSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVL 3044
            LSGSRPAWFMV RERLRVHPQLCDGSIVAF VLH VNCNHGLIYVTS+G LKIC L SV 
Sbjct: 954  LSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSSVS 1013

Query: 3045 SYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHD 3224
            SYDNYWPVQKIPLKGTPHQVTYFAE+NLYPLIVSVPV +P+NQVLSSLVDQ+ GHQ+E+ 
Sbjct: 1014 SYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIENH 1073

Query: 3225 NFSPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKEN 3404
            N S +E+H+TY+VDE+EVRILEP  S  PWQ +ATIPMQ SENALTVRMV+LFNT++KEN
Sbjct: 1074 NLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTKEN 1131

Query: 3405 ETLLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHL 3584
            ETLLA+GTAYVQGEDVAARGR+LLFSV +N +NSQ LVSEVYSKELKGAISALASLQGHL
Sbjct: 1132 ETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQGHL 1191

Query: 3585 LIASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGA 3764
            LIASGPK+ILHKWTG ELTGVAF D PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGA
Sbjct: 1192 LIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1251

Query: 3765 QLSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAE 3944
            QLSLLAKDF +LDCFSTEFLIDGSTLSL+VSDEQKNVQIFYYAPKMSESWKGQKLLSRAE
Sbjct: 1252 QLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSRAE 1311

Query: 3945 FHVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRL 4124
            FHVGA VTKF+RLQML  S DR+GA P SDKTNRFALLFGTLDGS+GCIAPLDELTFRRL
Sbjct: 1312 FHVGALVTKFMRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRL 1371

Query: 4125 QTLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMA 4304
            Q+LQKKLVD+VPH+AGLNP++FR FRS+GKAHRPGP++++DCE+L +YEM+PLE+Q+E+A
Sbjct: 1372 QSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIA 1431

Query: 4305 HQIGTTRTQILTNLNDLSLGTSFL 4376
             QIGTTR QIL+NLNDL+LGTSFL
Sbjct: 1432 QQIGTTRAQILSNLNDLTLGTSFL 1455


>ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage
            and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 2262 bits (5861), Expect = 0.0
 Identities = 1109/1462 (75%), Positives = 1268/1462 (86%), Gaps = 11/1462 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWP-ARKGIGPIPNLV 200
            MSYAA+KM+HWPTGIE+CASG+I+H  +DF PQIP I  ++LDS+WP +++GIGP+PNL+
Sbjct: 1    MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60

Query: 201  VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            VTAG++LEVYV+RVQE+   R SRS  E KRGG M G+SGASLELVCHY+LHGNVE+M V
Sbjct: 61   VTAGSVLEVYVVRVQED-GSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVV 119

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            L  EG D  +RRDSIILAF+DAKISVLEFDDSI+GLRTSSMHCFEGPEW HLKRGRESFA
Sbjct: 120  LPTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFA 179

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            RGPL+KVDPQGRCGG+LVY +QMIIL+                        ++SSYVI+L
Sbjct: 180  RGPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINL 239

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RD+DMKHVKDF F+H YIEPV+VILHERELTWAGR+SWKHHTCMISALSISTTLKQ  LI
Sbjct: 240  RDMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLI 299

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WS VNLPHDAYKL++VP PIGGVLVI AN+IHYHS+S + ALA+N+YAV++D+SQELPR+
Sbjct: 300  WSVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRA 359

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            SF+VELDA  A WL +DV +LS K GELLLL++VYDGRVVQRL+LSKSKASVLTS +TTI
Sbjct: 360  SFSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTI 419

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            GNS  FLGSRLGDS+LVQFT G+G SV     KEEVG+IEGDVPSAKRL+ ++SD LQD+
Sbjct: 420  GNSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDM 479

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            VSGEELSLY S  N +E AQK+FSFAVRDSLIN+GPLKDFSYGLR N D +ATGIAKQSN
Sbjct: 480  VSGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSN 539

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            Y+LVCCSGHGKNG LC+L+QSIRPE+ITEV+LPGC+GIWTVYHK+ARGHN D SKM +  
Sbjct: 540  YDLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAA 599

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHA+LIIS+E+RTMVLETADLL EVTESVDY+VQG TIAAGNLFGRRRVIQVF RGAR
Sbjct: 600  DEYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 659

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDGS+MTQDLS G+ NSE                 PYVL++MTDGSI+LL+GD STC V
Sbjct: 660  ILDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMV 719

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGA---DGALHDQ 2339
            SI+ P AFE+S+ S++ACTLYHDKGPEPWLRK STDAWLSTG+ EAIDGA   DG  HDQ
Sbjct: 720  SINTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQ 779

Query: 2340 GDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEV--- 2510
            GDIYC+VCYESG LEIFDVP+F  VFSV+KF+SG THL D ++RE  KDSQ   N +   
Sbjct: 780  GDIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEE 839

Query: 2511 VAGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEE 2690
            VAG  +KE A NMK VELAMQRW+G H+RP+LFG+LTDGT+LCYHAYL+E  ++T+K E+
Sbjct: 840  VAGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTED 899

Query: 2691 SVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLS 2870
            SVS QN V L SIS SRLRNLRFVRVPL++Y +EETS  + CQR+T+F N+ G+QG FL 
Sbjct: 900  SVSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLL 959

Query: 2871 GSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSY 3050
            GSRPAWFMV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS  +Y
Sbjct: 960  GSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNY 1019

Query: 3051 DNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNF 3230
            DNYWPVQKIPLKGTPHQVTYF EKNLYPLIVSVPV +P+NQVLSSLVDQ+ GHQ+E+ N 
Sbjct: 1020 DNYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNL 1079

Query: 3231 SPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENET 3410
            S DEL +TY+V+E+EVRILE E  G PWQT+ATIPMQ+SENALTVR+VTLFN T+KENET
Sbjct: 1080 SSDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENET 1139

Query: 3411 LLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLI 3590
            LLAIGTAYVQGEDVAARGRVLLFSV ++T+NSQ LVSEVYSKELKGAISALASLQGHLLI
Sbjct: 1140 LLAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLI 1199

Query: 3591 ASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQL 3770
            ASGPK+ILHKWTG EL GVAFYD PPLYV S+NIVKNFIL+GDIHKSIYFLSWKEQGAQL
Sbjct: 1200 ASGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQL 1259

Query: 3771 SLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFH 3950
            SLLAKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKM ESWKGQKLLSRAEFH
Sbjct: 1260 SLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFH 1319

Query: 3951 VGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQT 4130
            VGAH+TKF+RL ML TSSDR+GA PG DKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+
Sbjct: 1320 VGAHITKFIRLSMLSTSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1379

Query: 4131 LQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQ 4310
            LQ+KLVD+VPH+AGLNPR+FR FRS+GK HRPGP++++DCELL H+EMLPLE+QLE+A Q
Sbjct: 1380 LQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQQ 1439

Query: 4311 IGTTRTQILTNLNDLSLGTSFL 4376
            +GTTR QIL+NLNDLSLGTSFL
Sbjct: 1440 VGTTRAQILSNLNDLSLGTSFL 1461


>ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 2260 bits (5856), Expect = 0.0
 Identities = 1108/1457 (76%), Positives = 1267/1457 (86%), Gaps = 6/1457 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKG--IGPIPNL 197
            MS+AA+KMM  PTGI+NCA+GF++H  SDF P  P    +DLD++WP+R    +G +PNL
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVPLQP----DDLDAEWPSRPRHHVGSLPNL 56

Query: 198  VVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMA 377
            VVTA N+LEVY +R+QE++  + +   ++++RG  + G++GASLELVCHY+LHGNVETMA
Sbjct: 57   VVTAANVLEVYAVRLQEDQPPKAA---ADSRRGALLDGIAGASLELVCHYRLHGNVETMA 113

Query: 378  VLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESF 557
            VLS+ G D  +RRDSI+L F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE F
Sbjct: 114  VLSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 173

Query: 558  ARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVIS 737
            ARGP+VKVDPQGRCGGVL+Y LQMIILK                        IESSY+I+
Sbjct: 174  ARGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMIN 233

Query: 738  LRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPL 917
            LRDLDM+HVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PL
Sbjct: 234  LRDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 293

Query: 918  IWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPR 1097
            IWSAVNLPHDAYKL++VPSPIGGVLVI AN+IHYHSQS SCALA+NSYAV +D+SQE+PR
Sbjct: 294  IWSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPR 353

Query: 1098 SSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTT 1277
            SSFNVELDAANATWL  DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVL+SG+TT
Sbjct: 354  SSFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITT 413

Query: 1278 IGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQD 1445
            IGNS  FL SRLGDSMLVQF+CG G S+     KEEVGDIE D PS KRLR + SDALQD
Sbjct: 414  IGNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQD 472

Query: 1446 IVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQS 1625
            +VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQS
Sbjct: 473  MVSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 532

Query: 1626 NYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSL 1805
            NYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HNADSSKM   
Sbjct: 533  NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 592

Query: 1806 DDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGA 1985
            DDEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGR RVIQV+ RGA
Sbjct: 593  DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGA 652

Query: 1986 RVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCT 2165
            R+LDGS+MTQD+SFGA N E                 P+VLLRM+DGSI+LL+GDPSTCT
Sbjct: 653  RILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 712

Query: 2166 VSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGD 2345
            +S++ P +FESSK S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GE IDG DGA  D GD
Sbjct: 713  ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGD 772

Query: 2346 IYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQV 2525
            IYCVVC+++G LEIFDVP+F CVFSVE F+SG +HLVD  ++E  KDS+    + V  Q 
Sbjct: 773  IYCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSKQGDRDGVINQG 832

Query: 2526 KKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQ 2705
            +KE   +MK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE  +ST+K+E+S S  
Sbjct: 833  RKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSASAG 892

Query: 2706 NTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPA 2885
             ++ LSS + SRLRNLRFVRVPL+ Y RE+TS   PCQ++T+FKN+G Y+G FLSGSRPA
Sbjct: 893  GSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSRPA 952

Query: 2886 WFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWP 3065
            W MV RERLRVHPQLCDGSIVAF VLHNVNCN GLIYVTS+G LKICQLPS  +YD+YWP
Sbjct: 953  WVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSYWP 1012

Query: 3066 VQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDEL 3245
            VQKIPLK TPHQVTYFAEKNLYPLIVS PV++PLNQV+ SLVDQD  HQ E  N +PDE 
Sbjct: 1013 VQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDINHQNESQNMNPDEQ 1071

Query: 3246 HKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIG 3425
            ++ Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAIG
Sbjct: 1072 NRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLAIG 1131

Query: 3426 TAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPK 3605
            TAYVQGEDVAARGR+LLFS+G+NTDN Q LVSEVYSKELKGAISALASLQGHLLIASGPK
Sbjct: 1132 TAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASGPK 1191

Query: 3606 VILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAK 3785
            +ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAK
Sbjct: 1192 IILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLAK 1251

Query: 3786 DFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 3965
            DFG+LDCF+TEFLIDGSTLSL+VSD+ +N+QIFYYAPKMSESWKGQKLLSRAEFHVGAHV
Sbjct: 1252 DFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1311

Query: 3966 TKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKL 4145
            TKFLRLQML T SDR GA PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQ+KL
Sbjct: 1312 TKFLRLQMLST-SDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKL 1370

Query: 4146 VDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTR 4325
            VD+VPH+AGLNPRAFR+FRSNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ+GTTR
Sbjct: 1371 VDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTR 1430

Query: 4326 TQILTNLNDLSLGTSFL 4376
            +QIL+NL+DLSLGTSFL
Sbjct: 1431 SQILSNLSDLSLGTSFL 1447


>ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Glycine max]
          Length = 1449

 Score = 2249 bits (5829), Expect = 0.0
 Identities = 1105/1458 (75%), Positives = 1267/1458 (86%), Gaps = 7/1458 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDS-DWPARKG--IGPIPN 194
            MS+AA+KMM  PTGI+NCA+GF++H  SDF P  P    +DLD+ +WP+R    +GP+PN
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVPLQP----DDLDAAEWPSRPRHHVGPLPN 56

Query: 195  LVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETM 374
            LVVTA N+LEVY +R+QE++  +   +  +++RG  + G++GASLEL CHY+LHGNVETM
Sbjct: 57   LVVTAANVLEVYAVRLQEDQQPK--DASDDSRRGTLLDGIAGASLELECHYRLHGNVETM 114

Query: 375  AVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRES 554
            AVLS+ G D  ++RDSIIL F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE 
Sbjct: 115  AVLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQ 174

Query: 555  FARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVI 734
            FARGP+VK+DPQGRCGGVL+Y LQMIILK                        IESSY+I
Sbjct: 175  FARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMI 234

Query: 735  SLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQP 914
            +LRDLDM+HVKDF+FV+GYIEPVMVILHERELTWAGR+SW HHTCMISALSISTTLKQ P
Sbjct: 235  NLRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHP 294

Query: 915  LIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELP 1094
            LIWSAVNLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SCALA+N+YAV +D+SQE+P
Sbjct: 295  LIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIP 354

Query: 1095 RSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVT 1274
            RSSFNVELDAANATWL  DV +LSTKTGELLLL +VYDGRVVQRL+LSKSKASVL+SG+T
Sbjct: 355  RSSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGIT 414

Query: 1275 TIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQ 1442
            TIGNS  FL SRLGDSMLVQF+CG G S+     KEEVGDIE D PS KRLR + SDALQ
Sbjct: 415  TIGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQ 473

Query: 1443 DIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQ 1622
            D+VSGEELSLY SA N +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQ
Sbjct: 474  DMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQ 533

Query: 1623 SNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVS 1802
            SNYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HNADSSKM  
Sbjct: 534  SNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMAD 593

Query: 1803 LDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARG 1982
             DDEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ RG
Sbjct: 594  DDDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERG 653

Query: 1983 ARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTC 2162
            AR+LDGS+MTQD+SFGA NSE                 P+VLLRM+DGSI+LL+GDPSTC
Sbjct: 654  ARILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTC 713

Query: 2163 TVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQG 2342
            T+S++ P +FESSK S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA  D G
Sbjct: 714  TISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHG 773

Query: 2343 DIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQ 2522
            DIYCVVC+++G LEIFD+P+F CVFSVE F+SG +HLVD  ++E  KDS+    + V  Q
Sbjct: 774  DIYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSKQGDRDGVVNQ 833

Query: 2523 VKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSG 2702
             +K+   NMK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE  + T+K+E+S S 
Sbjct: 834  GRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASA 893

Query: 2703 QNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRP 2882
              ++ LSS + SRLRNLRFVRVPL+ Y RE+TS  SPCQ++T+FKN+G YQG FLSGSRP
Sbjct: 894  GGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGSRP 953

Query: 2883 AWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYW 3062
            AW MV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS  +YD+YW
Sbjct: 954  AWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYW 1013

Query: 3063 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDE 3242
            PVQKIPLK TPHQVTYFAEKNLYPLIVS PV++PLNQV+ SLVDQD  HQ E  N +PDE
Sbjct: 1014 PVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDFNHQNESQNMNPDE 1072

Query: 3243 LHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAI 3422
             ++ Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAI
Sbjct: 1073 QNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAI 1132

Query: 3423 GTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGP 3602
            GTAYVQGEDVAARGR+LLFS+G+ TDN Q LVSEVYSKELKGAISALASLQGHLLIASGP
Sbjct: 1133 GTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIASGP 1192

Query: 3603 KVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLA 3782
            K+ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLA
Sbjct: 1193 KIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLA 1252

Query: 3783 KDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAH 3962
            KDFG+LDCF+TEFLIDGSTLSL+VSD+ +N+QIFYYAPKMSESWKGQKLLSRAEFHVGAH
Sbjct: 1253 KDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1312

Query: 3963 VTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKK 4142
            VTKFLRLQML T SDR G+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQ+K
Sbjct: 1313 VTKFLRLQMLST-SDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRK 1371

Query: 4143 LVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTT 4322
            LVD+VPH+AGLNPRAFR+FRSNGKAHRPGPD+++DCELL HYEMLPLE+QLE+A+QIGTT
Sbjct: 1372 LVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTT 1431

Query: 4323 RTQILTNLNDLSLGTSFL 4376
            R+QIL+NL+DLSLGTSFL
Sbjct: 1432 RSQILSNLSDLSLGTSFL 1449


>ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris]
            gi|561025706|gb|ESW24391.1| hypothetical protein
            PHAVU_004G126600g [Phaseolus vulgaris]
          Length = 1445

 Score = 2236 bits (5793), Expect = 0.0
 Identities = 1095/1456 (75%), Positives = 1260/1456 (86%), Gaps = 5/1456 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPAR-KGIGPIPNLV 200
            MS+AA+KMM   TGI+NCA+GF++H  +D  P  P    EDLD++WP+R + +GP+PNLV
Sbjct: 1    MSFAAYKMMQCSTGIDNCAAGFLTHSRADSVPLQP----EDLDAEWPSRPRRVGPLPNLV 56

Query: 201  VTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            VTA N+LEVY +R+QE++  + +    + +RG  + G+ GASLELVCHY+LHGNVETMAV
Sbjct: 57   VTAANVLEVYTVRIQEDQPPKAA----DPRRGTLLDGIDGASLELVCHYRLHGNVETMAV 112

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            LS+ G D  ++RDSIIL F DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE FA
Sbjct: 113  LSIGGGDASRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 172

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            RGP+VKVDPQGRCGG L+Y LQMIILK                        IESSY+I+L
Sbjct: 173  RGPVVKVDPQGRCGGTLIYDLQMIILKATQAGSGLVGDDDALGFSGAVAARIESSYMINL 232

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RDLDM+HVKDF+FVHGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ PLI
Sbjct: 233  RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 292

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSAVNLPHDAYKL++VPSPIGGVLVIGAN++HYHSQS SCALA+NSYAV++DNSQE+PRS
Sbjct: 293  WSAVNLPHDAYKLLAVPSPIGGVLVIGANTVHYHSQSASCALALNSYAVSLDNSQEIPRS 352

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            SFNVELD+ANATWL  DV +LSTKTGELLLLT+VYDGRVVQRL+LSKSKASVL+SG+TTI
Sbjct: 353  SFNVELDSANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 412

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            GNS  FL SRLGDSMLVQF+CG G S+     KEEVGDIE D PS KRLR + SD LQD+
Sbjct: 413  GNSLFFLASRLGDSMLVQFSCGSGGSMLSSNLKEEVGDIEADAPS-KRLRRSPSDTLQDV 471

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            VSGEELSLY SAPN +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAKQSN
Sbjct: 472  VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 531

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R HN DSSK+   D
Sbjct: 532  YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNTDSSKLADDD 591

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHA+LIISLE+RTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ RGAR
Sbjct: 592  DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGAR 651

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDGS+MTQD++FGA NSE                 P+VLLRM+DGS++LL+GDP TCT+
Sbjct: 652  ILDGSFMTQDVTFGASNSESASASESAIALSVSIADPFVLLRMSDGSVRLLIGDPITCTI 711

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S++ P +FES+K S+++CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA  D GDI
Sbjct: 712  SVTSPASFESTKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGDI 771

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVVAGQVK 2528
            YCVVC+++G LEIFDVP+F CVFSV  F+SG +HLVD  ++E  KDS+    + V  Q +
Sbjct: 772  YCVVCFDNGNLEIFDVPNFNCVFSVGNFMSGKSHLVDALMKEVLKDSKKGDRDGVIIQGR 831

Query: 2529 KETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQN 2708
            KE   +MK+VELAMQRW+G+H+RP+LFG+L+DGT+LCYHAYLYE  + T+K+E+S S   
Sbjct: 832  KENVPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASAGG 891

Query: 2709 TVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAW 2888
            ++ L + + SRLRNLRFVRV L+ Y REETS  S  Q++T+FKN+G YQG FLSGSRPAW
Sbjct: 892  SIGLGTTNISRLRNLRFVRVSLDAYAREETSNGSLHQQITIFKNIGSYQGFFLSGSRPAW 951

Query: 2889 FMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPV 3068
             MV RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS  +YD+YWPV
Sbjct: 952  VMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYWPV 1011

Query: 3069 QKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELH 3248
            QKIPLK TPHQVTYFAEKNLYPLIVS PV++PL+QV+ SLVDQD  HQ E  N + DE +
Sbjct: 1012 QKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLSQVI-SLVDQDVNHQNESQNMNSDEQN 1070

Query: 3249 KTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGT 3428
            + Y +DE+EVRI+EPEKSG PWQT+ATIPMQ+SENALTVRMVTL NTTSKENETLLAIGT
Sbjct: 1071 RFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAIGT 1130

Query: 3429 AYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKV 3608
            AYVQGEDVAARGR+LLFS+G+NTDN Q+LVSEVYSKELKGAISALASLQGHLLIASGPK+
Sbjct: 1131 AYVQGEDVAARGRILLFSLGKNTDNPQSLVSEVYSKELKGAISALASLQGHLLIASGPKI 1190

Query: 3609 ILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKD 3788
            ILHKW G EL G+AF+D PPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAKD
Sbjct: 1191 ILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLAKD 1250

Query: 3789 FGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 3968
            F +LDCF+TEFLIDGSTLSL+VSD+++N+QIFYYAPKMSESWKGQKLLSRAEFHVGAHVT
Sbjct: 1251 FSSLDCFATEFLIDGSTLSLMVSDDKRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVT 1310

Query: 3969 KFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLV 4148
            KFLRLQMLPT SDR G+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+LQKKLV
Sbjct: 1311 KFLRLQMLPT-SDRAGSAPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQKKLV 1369

Query: 4149 DSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRT 4328
            D+V H+AGLNPRAFR F+SNGKAHRPGPD+++DCELL HYEMLPLE+QLE+AHQ+GTTR+
Sbjct: 1370 DAVAHVAGLNPRAFRKFQSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVGTTRS 1429

Query: 4329 QILTNLNDLSLGTSFL 4376
            QIL+NL+DLSLGTSFL
Sbjct: 1430 QILSNLSDLSLGTSFL 1445


>ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cicer arietinum]
          Length = 1447

 Score = 2227 bits (5771), Expect = 0.0
 Identities = 1102/1461 (75%), Positives = 1256/1461 (85%), Gaps = 10/1461 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITN---EDLDSDW-PARKGIGPIP 191
            MS+AA+KMM WPTGI+NCASGF++H  SD  P+IP I +   +D+DSDW P  + + P+P
Sbjct: 1    MSFAAYKMMQWPTGIQNCASGFLTHSRSDSTPRIPPIQHNDDDDIDSDWVPQPRDLAPLP 60

Query: 192  NLVVTAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVET 371
            NLV+TA NILEVY +R+Q++        P  +     + GL+GASLELVCHY+LHGNVE+
Sbjct: 61   NLVITAANILEVYTVRIQQDP-------PKSSADPRVLDGLAGASLELVCHYRLHGNVES 113

Query: 372  MAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRE 551
            +AVLS+ G D  +RRDSIIL F+DAKISVLE+DDSI+GLRTSS+HCFEGPEW HLKRGRE
Sbjct: 114  VAVLSVGGGDASRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGRE 173

Query: 552  SFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYV 731
             FARGP+ KVDPQGRCGGVLVY LQMIILK                        IESSY+
Sbjct: 174  HFARGPVAKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVGEDDVLGSGGAVAARIESSYM 233

Query: 732  ISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQ 911
            I+LRDLDM+HVKDF+F+HGYIEPVMVILHERELTWAGR+SWKHHTCMISALSISTTLKQ 
Sbjct: 234  INLRDLDMRHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 293

Query: 912  PLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQEL 1091
            PLIWSAVNLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SCALA+NSYAV+VDNSQE+
Sbjct: 294  PLIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSVDNSQEM 353

Query: 1092 PRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGV 1271
            PRSSFNVELDAANATWL +DV +LSTKTGELLLLT++YDGRVVQRL+LSKSKASVL+SGV
Sbjct: 354  PRSSFNVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLSSGV 413

Query: 1272 TTIGNSFIFLGSRLGDSMLVQFTCGMGTSVP----KEEVGDIEGDVPSAKRLRMASSDAL 1439
            TTIGNS  FL SRLGDSMLVQF+ G G S+     KEEVGD + D  SAKR+R + SD L
Sbjct: 414  TTIGNSLFFLASRLGDSMLVQFSSGSGVSMLSSNLKEEVGDFDVDASSAKRMRRSPSDTL 473

Query: 1440 QDIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAK 1619
            QD+VSGEELSLY SA N +E AQK+FSFAVRDSLIN+GPLKDFSYGLRINAD NATGIAK
Sbjct: 474  QDMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAK 533

Query: 1620 QSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMV 1799
            QSNYELVCCSGHGKNG+LCVL+QSIRPE+ITEVELPGCKGIWTVYHKS R  NADSSK+ 
Sbjct: 534  QSNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSLNADSSKLA 593

Query: 1800 SLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFAR 1979
              +DEYHA+LIISLESRTMVLETADLL EVTESVDYYVQG T+AAGNLFGRRRVIQV+ R
Sbjct: 594  DDEDEYHAYLIISLESRTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYER 653

Query: 1980 GARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPST 2159
            GAR+LDGS+MTQD+SFGA NSE                 PYVLL+M+DGS++LLVGDPST
Sbjct: 654  GARILDGSFMTQDVSFGASNSEANYGSESALALSVSIADPYVLLKMSDGSVRLLVGDPST 713

Query: 2160 CTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQ 2339
            CT+S++ P +FESSK S++ CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDG DGA  D 
Sbjct: 714  CTISVTSPASFESSKGSVSTCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDH 773

Query: 2340 GDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSK--NEVV 2513
            GDIYCVVCYE+ +LEIFDVP+F CVFSVE F+SG +HLVD   +E  KDSQ     ++ V
Sbjct: 774  GDIYCVVCYENDSLEIFDVPNFSCVFSVENFLSGKSHLVDALTKEVPKDSQKGDKVSDGV 833

Query: 2514 AGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEES 2693
              Q +K+ A NMK+VELAMQRW+G+H RP+LFG+L+DGT LCYHAYLYE  + T+K+E+S
Sbjct: 834  VSQGRKD-ALNMKVVELAMQRWSGKHGRPFLFGILSDGTTLCYHAYLYESPDGTSKVEDS 892

Query: 2694 VSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSG 2873
            VS      LS+ S SRLRNLRFVRVPL+ + REETS   PCQ++ +FKN+G Y+G FLSG
Sbjct: 893  VSA----GLSNSSVSRLRNLRFVRVPLDVHAREETSNGPPCQQINIFKNIGSYEGFFLSG 948

Query: 2874 SRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYD 3053
            SRPAW M+ RERLRVHPQLCDGSIVAF VLHNVNCNHGLIYVTS+G LKICQLPS  +YD
Sbjct: 949  SRPAWVMLLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYD 1008

Query: 3054 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFS 3233
             YWPVQK+PLK TPHQVTYFAEKNLYPLIVS PV +PLNQV++ LVDQDA    E  N +
Sbjct: 1009 CYWPVQKVPLKATPHQVTYFAEKNLYPLIVSYPVPKPLNQVIA-LVDQDANQLTESQNLN 1067

Query: 3234 PDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETL 3413
             DE    YT++E+EVRI+EPEKSG PWQ +ATIPMQ+SENALTVRMVTL NT+SKENETL
Sbjct: 1068 NDEQSHLYTIEEFEVRIMEPEKSGGPWQLKATIPMQSSENALTVRMVTLMNTSSKENETL 1127

Query: 3414 LAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIA 3593
            LAIGTAYVQGEDVAARGR+LLFS+G+NTDN QNLVSEVYSKELKGAISALA+LQGHLL+A
Sbjct: 1128 LAIGTAYVQGEDVAARGRILLFSLGKNTDNPQNLVSEVYSKELKGAISALAALQGHLLVA 1187

Query: 3594 SGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLS 3773
            SGPK+ILHKWTG EL GVAF+D PPL+VVSLNIVKNFIL+GD+HKSIYFLSWKEQGAQLS
Sbjct: 1188 SGPKIILHKWTGTELNGVAFFDVPPLHVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLS 1247

Query: 3774 LLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHV 3953
            LLAKDFG+LDCF+TEFLIDGSTLSL+VSDEQKN+QIFYYAPKMSESWKGQKLLSRAEFHV
Sbjct: 1248 LLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1307

Query: 3954 GAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTL 4133
            GAH+TKFLRLQML T SD+TG+ PGSDKTNRFALLFGTLDGS+GCIAPLDE+TFRRLQ+L
Sbjct: 1308 GAHITKFLRLQMLST-SDKTGSGPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSL 1366

Query: 4134 QKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQI 4313
            QKKLVD+VPH+AGLNPRAFR+F SNGKAHRPGPD+++DCELL HYEML LE+QLE+AHQ+
Sbjct: 1367 QKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPGPDSIVDCELLCHYEMLQLEEQLEIAHQV 1426

Query: 4314 GTTRTQILTNLNDLSLGTSFL 4376
            GTTR+QIL+NL+DLSLGTSFL
Sbjct: 1427 GTTRSQILSNLSDLSLGTSFL 1447


>ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score = 2145 bits (5559), Expect = 0.0
 Identities = 1076/1508 (71%), Positives = 1242/1508 (82%), Gaps = 57/1508 (3%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA++MMHWPTGIENC S +I+H  +DF P + S  ++DLDSDW  R+ IGP+PNLVV
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVETMAVL 383
            TAGN+LEVYV+RV EE  GR S+S  E +RGG M G+SGASLELVCHY+LHGNVE+MA+L
Sbjct: 60   TAGNVLEVYVVRVLEE-GGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAIL 118

Query: 384  SMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFAR 563
            S  G DG K+RDSIIL FQ+AKISVLEFDDS + LRTSSMHCF+GP+W HLKRGRESFAR
Sbjct: 119  SSRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFAR 178

Query: 564  GPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISLR 743
            GP+VKVDPQGRCGGVLVYGLQMIILK                        +ESSY+I+LR
Sbjct: 179  GPVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLR 238

Query: 744  DLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLIW 923
            DLD+KHVKDF FVHGYIEPVMVILHE+ELTWAGR+SWKHHTCM+SALSISTTLKQ PLIW
Sbjct: 239  DLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIW 298

Query: 924  SAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRSS 1103
            SA NLPHDAYKL++VPSPIGGVLVI ANSIHY+SQS SC LA+N+YAV+ D+SQ++PRS+
Sbjct: 299  SASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSN 358

Query: 1104 FNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTIG 1283
            FNVELDAANATWL +DV +LSTKTGELLLL +VYDGRVVQRL+LSKSKASVLTSG+ +IG
Sbjct: 359  FNVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIG 418

Query: 1284 NSFIFLGSRLGDSMLVQFTCGMGTS----------------------------------- 1358
            NS  FLGSRLGDS+LVQF+CG+G+S                                   
Sbjct: 419  NSLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESK 478

Query: 1359 ----------------VPKEEVGDIEGDVPSAKRL---RMASSD-ALQDI-VSGEELSLY 1475
                            V  + V  +E D+ +   L   R+A  D  L D  V G+ELSLY
Sbjct: 479  PTQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLY 538

Query: 1476 SSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH 1655
             SA N +E AQK FSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH
Sbjct: 539  GSAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGH 598

Query: 1656 GKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLDDEYHAFLII 1835
            GKNGALC+L+QSIRPE+ITEVELPGCKGIWTVYHK+ RG  ADSS+MV  DDEYHA+LII
Sbjct: 599  GKNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLII 658

Query: 1836 SLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGARVLDGSYMTQ 2015
            SLE+RTMVL T +LL EVTESVDY+V G TIAAGNLFGRRRVIQV+  GAR+LDGS+MTQ
Sbjct: 659  SLEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQ 718

Query: 2016 DLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTVSISIPPAFE 2195
            DL+     +E                 PYVLL MTDGSI+LLVGD S+C+VS+S P AF 
Sbjct: 719  DLNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFG 778

Query: 2196 SSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDIYCVVCYESG 2375
            SSK  +++CTLY DKG EPWLR TSTDAWLSTG+GE IDG DG+L DQGDIYCV CY++G
Sbjct: 779  SSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNG 838

Query: 2376 TLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKN-EVVAGQVKKETAQNMK 2552
             LEIFDVP+F  VF V+KF+SG +HLVD  I +  K S+  +N + +    + E++QNMK
Sbjct: 839  DLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNSQELISHGRNESSQNMK 898

Query: 2553 IVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESVSGQNTVDLSSIS 2732
            ++E+AMQRW+G+H+RP+LFG+LTDGT+LCYHAYL+E  +S +KI++SVS  N+V  S++S
Sbjct: 899  VIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNMS 958

Query: 2733 TSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGSRPAWFMVCRERL 2912
            +SRLRNLRF+RVPL+   RE+    +   R+++FKN+ GYQGLFL GSRPAWFMV RERL
Sbjct: 959  SSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRERL 1018

Query: 2913 RVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDNYWPVQKIPLKGT 3092
            RVHPQLCDG IVAFAVLHNVNCNHGLIYVTS+G LKICQLPS  +YDNYWPVQK+PLKGT
Sbjct: 1019 RVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKVPLKGT 1078

Query: 3093 PHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSPDELHKTYTVDEY 3272
            PHQVTYF EKNLYP+I+S PV +PLNQVLSS+VDQD GH VE+ N S DEL +TY+V+E+
Sbjct: 1079 PHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVEEF 1137

Query: 3273 EVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLLAIGTAYVQGEDV 3452
            E+RILEPEKSG PWQTRATI M +SENALT+R+VTL NTT+KENETLLA+GTAYVQGEDV
Sbjct: 1138 EIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGEDV 1197

Query: 3453 AARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIASGPKVILHKWTGA 3632
            AARGRVLLFSVG++ DNSQ LVSEVYSKELKGAISALASLQGHLLIASGPK+ILHKWTGA
Sbjct: 1198 AARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGA 1257

Query: 3633 ELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSLLAKDFGNLDCFS 3812
            EL G+AFYD PPLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQLSLLAKDFG+LDC++
Sbjct: 1258 ELNGIAFYDVPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCYA 1317

Query: 3813 TEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML 3992
            TEFLIDGSTLSL VSD+QKN+QIFYYAPK +ESWKGQKLLSRAEFHVGAHVTKFLRLQML
Sbjct: 1318 TEFLIDGSTLSLTVSDDQKNIQIFYYAPKSTESWKGQKLLSRAEFHVGAHVTKFLRLQML 1377

Query: 3993 PTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQKKLVDSVPHMAG 4172
             TSSD+  +T  SDKTNRFALLFGTLDGS+GCIAPLDELTFRRLQ+LQKKL D+VPH+ G
Sbjct: 1378 STSSDKACSTV-SDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLGDAVPHVGG 1436

Query: 4173 LNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIGTTRTQILTNLND 4352
            LNPR+FR F SNGK HR GPD+++DCELL HYEMLPLE+QL++AHQIGTTR+QIL+NLND
Sbjct: 1437 LNPRSFRQFHSNGKVHRRGPDSIVDCELLCHYEMLPLEEQLDIAHQIGTTRSQILSNLND 1496

Query: 4353 LSLGTSFL 4376
            LSLGTSFL
Sbjct: 1497 LSLGTSFL 1504


>ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum lycopersicum]
          Length = 1447

 Score = 2142 bits (5550), Expect = 0.0
 Identities = 1054/1460 (72%), Positives = 1245/1460 (85%), Gaps = 9/1460 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA K MH PTGIENCASGFI+H ++D  PQI +    D+DSDWPA K IGP+PNLVV
Sbjct: 1    MSFAACKTMHCPTGIENCASGFITHSAADITPQIQTA---DVDSDWPATKPIGPVPNLVV 57

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEA-KRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            +AGN+L+VY+IRV++      SR  +E  KRGG +AG+S ASLELVC Y+LHGN+ +M V
Sbjct: 58   SAGNVLDVYLIRVEQAS----SRDAAEVVKRGGLVAGISAASLELVCTYRLHGNIYSMGV 113

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            ++  G+DG KRRDSIIL+F+DAK+SVLEFDD+ +GLRTSSMH FEGP+W HLKRGRESF 
Sbjct: 114  ITAGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWFHLKRGRESFD 173

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            +GP++KVDPQGRC GV  +  QMI+LK                        IESSY+I+L
Sbjct: 174  KGPIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGASA-RIESSYIITL 232

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RDLD++HVKDF+F+HGYIEPVMVILHERELTW+GR+SWKHHTCM+SA SISTTLKQ PLI
Sbjct: 233  RDLDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLI 292

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSA NLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SC+LA+N++    DNSQE+PRS
Sbjct: 293  WSATNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFVFFGDNSQEMPRS 352

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            S NVELDAANATWL+ DV MLSTKTGELLLLT++YDGR+VQ+L+LSKS+ASVLTSG+TTI
Sbjct: 353  SINVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTI 412

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGM-GTSVP---KEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            G+S  FLGSRLGDS+LVQF+ G+ G+++P   +EEVGDIE D PSAKRLRM+SSDALQD+
Sbjct: 413  GDSLFFLGSRLGDSLLVQFSSGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDM 472

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            ++GEELSLY +APN ++ AQKTFSFAVRDSLIN+GPLKDFSYG+RINAD NATGIAKQSN
Sbjct: 473  INGEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSN 532

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNG+L VLQQSIRPE IT+V LPGCKGIWTVYHK+ R H ++SS+M   +
Sbjct: 533  YELVCCSGHGKNGSLSVLQQSIRPETITQVSLPGCKGIWTVYHKNTRIHLSESSRMADEE 592

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHA+LIISLE+RTMVL+TA+ L EVTE+VDYYVQG+T+AAGNLFGRRRVIQVFA GAR
Sbjct: 593  DEYHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGAR 652

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDG++MTQ+LSF A N E                 PYVLLRMT+GS+QLLVGDPS+C+V
Sbjct: 653  ILDGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSV 712

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S+++P  FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLS+GMGEAIDGADG + DQGD+
Sbjct: 713  SLTVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVIQDQGDV 772

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAK----DSQNSKNEVVA 2516
            YCVVCYE+GTLEIFDVPSF CVFSV+KFISG T+LVDTF++++       S+N+++ +  
Sbjct: 773  YCVVCYENGTLEIFDVPSFTCVFSVDKFISGRTYLVDTFMQDSVNGLHAHSKNTEDVIRP 832

Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696
            GQ +      + +VEL M RW G+H+RP+LFG+L DGT+L YHAY++EG E+++K++ SV
Sbjct: 833  GQKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVDGSV 892

Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876
            S QN++ LSS + SRLRNLRFVRVP++ Y REE  + S  QRM V+KN+GG QG+FL+GS
Sbjct: 893  SSQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGSQLQRMNVYKNIGGSQGIFLTGS 952

Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056
            RP+WFMV RERLR+HPQLCDG IVAF VLHNVNCNHGLIYVT+ G LKICQLPS LSYDN
Sbjct: 953  RPSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDN 1012

Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236
            YWPVQKIPLKGTPHQV YFAEKN+Y +IVSVPV++PLNQVLSS+ DQ+ G Q + DN + 
Sbjct: 1013 YWPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSSIADQEVGQQFDPDNLN- 1071

Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416
                 +Y ++E+EVRILEPEKSG PW+TRA+IPMQ+SENALTVRMVTLFNT +KENETLL
Sbjct: 1072 --YEGSYPIEEFEVRILEPEKSGGPWKTRASIPMQSSENALTVRMVTLFNTKTKENETLL 1129

Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596
            A+GTAYVQGEDVAARGRVLLFS+ R  DNS+ LVSEVYSKELKGAI ALASLQGHLLIAS
Sbjct: 1130 AVGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIAS 1189

Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776
            GPK+ILHKWTG+EL GVAF D PPL+ VSLNIVKNFIL+GDIHKSI F+SWKE   QLSL
Sbjct: 1190 GPKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKE--PQLSL 1247

Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956
            LAKDF  LDC +TEFLIDGSTLSL+VSD+QKNVQIFYYAPK+SESWKGQKLLSRAEFHVG
Sbjct: 1248 LAKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVG 1307

Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136
            + +TKFLRLQ+LPT+S+RT  TPGSDKTNRFA +FGTL+GS+GCIAPLDELTFRRLQ+LQ
Sbjct: 1308 SRITKFLRLQLLPTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSLQ 1367

Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316
            KKLV +V H+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLPLE+QLE+A QIG
Sbjct: 1368 KKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQIG 1427

Query: 4317 TTRTQILTNLNDLSLGTSFL 4376
            TTR QI++NLND+ LGTSFL
Sbjct: 1428 TTRMQIMSNLNDMILGTSFL 1447


>ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum tuberosum]
          Length = 1447

 Score = 2140 bits (5544), Expect = 0.0
 Identities = 1051/1460 (71%), Positives = 1246/1460 (85%), Gaps = 9/1460 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDWPARKGIGPIPNLVV 203
            MS+AA K MH PTGIENCASGFI+H +++  PQI +    D+DSDWPA K +GP+PNLVV
Sbjct: 1    MSFAACKTMHCPTGIENCASGFITHSAAEITPQIRTA---DVDSDWPATKPVGPMPNLVV 57

Query: 204  TAGNILEVYVIRVQEEEDGRVSRSPSEA-KRGGFMAGLSGASLELVCHYKLHGNVETMAV 380
            +AGN+LEVY+IR+++      SR  +E  KRGG MAG+S ASLELVC Y+LHGN+ +M V
Sbjct: 58   SAGNVLEVYLIRIEQAS----SRDAAEVVKRGGLMAGISAASLELVCTYRLHGNIYSMGV 113

Query: 381  LSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGRESFA 560
            ++  G+DG KRRDSIIL+F+DAK+SVLEFDD+ +GLRTSSMH FEGP+W HLKRGRESF 
Sbjct: 114  ITAGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWLHLKRGRESFD 173

Query: 561  RGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSYVISL 740
            +GP++KVDPQGRC GV  +  QMI+LK                        IESSY+I+L
Sbjct: 174  KGPIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGASA-RIESSYIITL 232

Query: 741  RDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQQPLI 920
            RDLD++HVKDF+F+HGYIEPVMVILHERELTW+GR+SWKHHTCM+SA SISTTLKQ PLI
Sbjct: 233  RDLDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLI 292

Query: 921  WSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQELPRS 1100
            WSA NLPHDAYKL++VPSPIGGVLVIGAN+IHYHSQS SC+LA+N++A   DNSQE+PRS
Sbjct: 293  WSAANLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFAFFGDNSQEMPRS 352

Query: 1101 SFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSGVTTI 1280
            SFNVELDAANATWL+ DV MLSTKTGELLLLT++YDGR+VQ+L+LSKS+ASVLTSG+TTI
Sbjct: 353  SFNVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTI 412

Query: 1281 GNSFIFLGSRLGDSMLVQFTCGM-GTSVP---KEEVGDIEGDVPSAKRLRMASSDALQDI 1448
            G+S  FLGSRLGDS+LVQF+CG+ G+++P   +EEVGDIE D PSAKRLRM+SSDALQD+
Sbjct: 413  GDSLFFLGSRLGDSLLVQFSCGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDM 472

Query: 1449 VSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSN 1628
            ++GEELSLY +APN ++ AQKTFSFAVRDSLIN+GPLKDFSYG+RINAD NATGIAKQSN
Sbjct: 473  INGEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSN 532

Query: 1629 YELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSKMVSLD 1808
            YELVCCSGHGKNG+LCVLQQSIRPE IT+  LPGCKGIWTVYHK+ R H ++SS+M   +
Sbjct: 533  YELVCCSGHGKNGSLCVLQQSIRPETITQEALPGCKGIWTVYHKNTRIHLSESSRMADEE 592

Query: 1809 DEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVFARGAR 1988
            DEYHA+LIISLE+RTMVL+TA+ L EVTE+VDYYVQG+T+AAGNLFGRRRVIQVFA GAR
Sbjct: 593  DEYHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGAR 652

Query: 1989 VLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDPSTCTV 2168
            +LDG++MTQ+LSF A N E                 PYVLLRMT+GS+QLLVGDPS+C+V
Sbjct: 653  ILDGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSV 712

Query: 2169 SISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALHDQGDI 2348
            S+++P  FESSK SI+ACTLYHDKGPEPWLRKTSTDAWLS+GMGEAIDGADG   DQGD+
Sbjct: 713  SLTVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVTQDQGDV 772

Query: 2349 YCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAK----DSQNSKNEVVA 2516
            YCVVCYE+GTLEIFDVP+F CVFSV+KFISG T+LVDTF++++       S+N+++ +  
Sbjct: 773  YCVVCYENGTLEIFDVPNFTCVFSVDKFISGRTYLVDTFMQDSVNGLHAHSKNTEDVIRP 832

Query: 2517 GQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIEESV 2696
            GQ +      + +VEL M RW G+H+RP+LFG+L DGT+L YHAY++EG E+++K+E SV
Sbjct: 833  GQKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVEGSV 892

Query: 2697 SGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFLSGS 2876
            S QN++ LSS + SRLRNLRFVRVP++ Y REE  + +  QRM V+KN+GG QG+FL+GS
Sbjct: 893  SSQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGTQLQRMNVYKNIGGSQGIFLTGS 952

Query: 2877 RPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLSYDN 3056
            RP+WFMV RERLR+HPQLCDG IVAF VLHNVNCNHGLIYVT+ G LKICQLPS LSYDN
Sbjct: 953  RPSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDN 1012

Query: 3057 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDNFSP 3236
            YWPVQKIPLKGTPHQV YFAEKN+Y +IVSVPV++PLNQVLS++ DQ+ G Q + DN + 
Sbjct: 1013 YWPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSTIADQEVGQQFDPDNLN- 1071

Query: 3237 DELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENETLL 3416
                 +Y ++E+EVRI+EPEKSG  W+TRA+IPMQ+SENALTVRMVTL NTT++ENETLL
Sbjct: 1072 --YEGSYPIEEFEVRIVEPEKSGGLWKTRASIPMQSSENALTVRMVTLLNTTTRENETLL 1129

Query: 3417 AIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLLIAS 3596
            A+GTAYVQGEDVAARGRVLLFS+ R  DNS+ LVSEVYSKELKGAI ALASLQGHLLIAS
Sbjct: 1130 AVGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIAS 1189

Query: 3597 GPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQLSL 3776
            GPK+ILHKWTG+EL GVAF D PPL+ VSLNIVKNFIL+GDIHKSI F+SWKE   QLSL
Sbjct: 1190 GPKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKE--PQLSL 1247

Query: 3777 LAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEFHVG 3956
            LAKDF  LDC +TEFLIDGSTLSL+VSD+QKNVQIFYYAPK+SESWKGQKLLSRAEFHVG
Sbjct: 1248 LAKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVG 1307

Query: 3957 AHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQTLQ 4136
            + +TKFLRLQ+LPT+S+RT  TPGSDKTNRFA +FGTL+GS+GCIAPLDELTFRRLQ+LQ
Sbjct: 1308 SRITKFLRLQLLPTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSLQ 1367

Query: 4137 KKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAHQIG 4316
            KKLV +V H+AGLNPR+FR FRSNGKAHRPGPDN++DCELL HYEMLPLE+QLE+A QIG
Sbjct: 1368 KKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQIG 1427

Query: 4317 TTRTQILTNLNDLSLGTSFL 4376
            TTR QI++NLND+ LGTSFL
Sbjct: 1428 TTRMQIMSNLNDMILGTSFL 1447


>ref|XP_006851814.1| hypothetical protein AMTR_s00041p00037520 [Amborella trichopoda]
            gi|548855397|gb|ERN13281.1| hypothetical protein
            AMTR_s00041p00037520 [Amborella trichopoda]
          Length = 1459

 Score = 2130 bits (5520), Expect = 0.0
 Identities = 1039/1463 (71%), Positives = 1232/1463 (84%), Gaps = 12/1463 (0%)
 Frame = +3

Query: 24   MSYAAFKMMHWPTGIENCASGFISHCSSDFAPQIPSITNEDLDSDW---PARKGIGPIPN 194
            MS++AFK+MHW T +ENCASG+I+H  SDFA  +P+I N+D DS+W   P +  IGPIPN
Sbjct: 1    MSFSAFKLMHWATAVENCASGYITHSFSDFAASVPAIQNDDPDSEWDQPPVKARIGPIPN 60

Query: 195  LVVTAGNILEVYVIRVQEEED--GRVSRSPSEAKRGGFMAGLSGASLELVCHYKLHGNVE 368
            LV++A N+LEVYVIRVQEE+    R S + S +K GG +AG+SGA LELVCHY+LHGNVE
Sbjct: 61   LVISAANVLEVYVIRVQEEDSKASRHSGASSGSKGGGVVAGVSGAWLELVCHYRLHGNVE 120

Query: 369  TMAVLSMEGSDGCKRRDSIILAFQDAKISVLEFDDSINGLRTSSMHCFEGPEWHHLKRGR 548
            TMA+LS    DG ++RDSIIL F++AKISVLE+DDS++GLRTSSMHCFEGPEW +LKRGR
Sbjct: 121  TMAILSAGSDDGRRKRDSIILTFKEAKISVLEYDDSLHGLRTSSMHCFEGPEWQYLKRGR 180

Query: 549  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKXXXXXXXXXXXXXXXXXXXXXXXHIESSY 728
            E+FARGPLV VDPQGRCGGVL+   QM+ILK                        IESSY
Sbjct: 181  ENFARGPLVYVDPQGRCGGVLICDSQMLILKASKAGYGFVGDDDANGPTGTVSIRIESSY 240

Query: 729  VISLRDLDMKHVKDFSFVHGYIEPVMVILHERELTWAGRISWKHHTCMISALSISTTLKQ 908
            VISLR+LDMKHVKDF FVHGYIEPVMVILHE+ELTWAGR+SWKHHTCMISA SISTTLKQ
Sbjct: 241  VISLRELDMKHVKDFVFVHGYIEPVMVILHEKELTWAGRLSWKHHTCMISAFSISTTLKQ 300

Query: 909  QPLIWSAVNLPHDAYKLISVPSPIGGVLVIGANSIHYHSQSVSCALAVNSYAVAVDNSQE 1088
             PLIWSA NLPHDAYKL+SVPSPIGGVLVI ANSIHYHSQS+SCALA+N +AVA +NSQE
Sbjct: 301  HPLIWSASNLPHDAYKLLSVPSPIGGVLVICANSIHYHSQSMSCALALNDFAVAGENSQE 360

Query: 1089 LPRSSFNVELDAANATWLSHDVVMLSTKTGELLLLTVVYDGRVVQRLELSKSKASVLTSG 1268
            +PRS+ NVELDAA+ATWLS+DV + S KTGELLLLT+VYDGR VQRLELSKS+ASVLTSG
Sbjct: 361  MPRSNVNVELDAAHATWLSNDVALFSIKTGELLLLTLVYDGRTVQRLELSKSRASVLTSG 420

Query: 1269 VTTIGNSFIFLGSRLGDSMLVQFTCGMGTSV-----PKEEVGDIEGDVPSAKRLRMASSD 1433
            +TTI N F FLGSRLGDS+LVQF  G   S+      K+E  +IEGD P+AKRLR ASSD
Sbjct: 421  ITTISNHFFFLGSRLGDSLLVQFNSGASQSILTSRQGKDEGVEIEGDAPAAKRLRRASSD 480

Query: 1434 ALQDIVSGEELSLYSSAPNTSELAQKTFSFAVRDSLINIGPLKDFSYGLRINADPNATGI 1613
              QD+   EELSLY SAPN  + AQK+FSFAVRDSL+N+GPLKDFSYGLRINADPNA+G+
Sbjct: 481  VSQDV--NEELSLYVSAPNNLDPAQKSFSFAVRDSLLNVGPLKDFSYGLRINADPNASGV 538

Query: 1614 AKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVELPGCKGIWTVYHKSARGHNADSSK 1793
            AKQSNYELV CSGHGKNGALC+LQQSIRPEL+TEVEL G KGIWTVYHK+ R H+ DSSK
Sbjct: 539  AKQSNYELVSCSGHGKNGALCLLQQSIRPELVTEVELTGVKGIWTVYHKNPRSHSTDSSK 598

Query: 1794 MVSLDDEYHAFLIISLESRTMVLETADLLGEVTESVDYYVQGSTIAAGNLFGRRRVIQVF 1973
            + +  DE+HA+LIISLESRTMVLETADLLGEVTESVDYYVQGSTI AGNLFGRRRV+Q++
Sbjct: 599  VAAEGDEFHAYLIISLESRTMVLETADLLGEVTESVDYYVQGSTIVAGNLFGRRRVVQIY 658

Query: 1974 ARGARVLDGSYMTQDLSFGAPNSEXXXXXXXXXXXXXXXXXPYVLLRMTDGSIQLLVGDP 2153
             RGAR+LDG+YMTQDL FG  +++                 PYVLLRM DGSIQLL GDP
Sbjct: 659  VRGARILDGAYMTQDLPFGHSSTDSTSNSEASTVSSASIADPYVLLRMVDGSIQLLTGDP 718

Query: 2154 STCTVSISIPPAFESSKDSIAACTLYHDKGPEPWLRKTSTDAWLSTGMGEAIDGADGALH 2333
            +TCT++ S+P  FE+  D I ACT YHDKGPEPWLRK S DAWLS+G+ EA+DG+DG+ H
Sbjct: 719  ATCTITSSLPAVFENLSDPITACTFYHDKGPEPWLRKASPDAWLSSGVAEALDGSDGSQH 778

Query: 2334 DQGDIYCVVCYESGTLEIFDVPSFKCVFSVEKFISGTTHLVDTFIRETAKDSQNSKNEVV 2513
            DQGDIYC+VCYESG LEIFDVPSFKC+FSV++F++  THL D   +E+ KD   +K+   
Sbjct: 779  DQGDIYCLVCYESGRLEIFDVPSFKCLFSVDRFLTVRTHLFDAHHQESFKDLHKAKSSYS 838

Query: 2514 --AGQVKKETAQNMKIVELAMQRWAGEHTRPYLFGMLTDGTMLCYHAYLYEGQESTTKIE 2687
              + Q++KE  +NMK+VEL M RW+G++ RP+LF  L DG+ LCYHAYLYEGQ++ TK++
Sbjct: 839  EDSDQLEKEAVKNMKVVELCMHRWSGQYGRPFLFAGLADGSTLCYHAYLYEGQDNMTKLD 898

Query: 2688 ESVSGQNTVDLSSISTSRLRNLRFVRVPLETYTREETSAVSPCQRMTVFKNVGGYQGLFL 2867
            +S S +N V+ S+ STSRLRNLRF+RV L++ TREE+  V   +++  F N+GG+QG F+
Sbjct: 899  DSNSAENVVEASNTSTSRLRNLRFIRVSLQSLTREESPGVVSYRKIITFMNIGGHQGAFI 958

Query: 2868 SGSRPAWFMVCRERLRVHPQLCDGSIVAFAVLHNVNCNHGLIYVTSEGFLKICQLPSVLS 3047
            +GSRPAW +VCRERLR+H QLCDG IVAF VLHNVNCN+G IYVTS+GFLKIC+LPS L+
Sbjct: 959  AGSRPAWLIVCRERLRIHHQLCDGPIVAFTVLHNVNCNYGCIYVTSQGFLKICRLPSWLN 1018

Query: 3048 YDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVVRPLNQVLSSLVDQDAGHQVEHDN 3227
            YDNYWPVQKIPL+GTPHQVTY AEKNLY LI+S+PV +P +QVL+SLVD +  HQ EHDN
Sbjct: 1019 YDNYWPVQKIPLRGTPHQVTYIAEKNLYALILSIPVAKPSSQVLTSLVDHEVSHQSEHDN 1078

Query: 3228 FSPDELHKTYTVDEYEVRILEPEKSGAPWQTRATIPMQNSENALTVRMVTLFNTTSKENE 3407
             +P++L+++Y +DE+EVRILEP K+G PW+T+ATIPMQ+SE+A+T+R+V+L NT +KE +
Sbjct: 1079 LNPEDLNRSYFIDEFEVRILEPAKAGGPWETKATIPMQSSEHAITIRLVSLSNTLTKETD 1138

Query: 3408 TLLAIGTAYVQGEDVAARGRVLLFSVGRNTDNSQNLVSEVYSKELKGAISALASLQGHLL 3587
            TLLA+GTAYVQGEDVAARGR++L+SVG+N DN QN V+EV+SKELKGAISALA LQGHLL
Sbjct: 1139 TLLAVGTAYVQGEDVAARGRIILYSVGQNADNPQNWVTEVHSKELKGAISALAPLQGHLL 1198

Query: 3588 IASGPKVILHKWTGAELTGVAFYDTPPLYVVSLNIVKNFILVGDIHKSIYFLSWKEQGAQ 3767
            IASGPK++LHKW G ELT VAF+D  PLYVVSLNIVKNFIL+GDIHKSIYFLSWKEQGAQ
Sbjct: 1199 IASGPKIVLHKWNGTELTAVAFFDA-PLYVVSLNIVKNFILMGDIHKSIYFLSWKEQGAQ 1257

Query: 3768 LSLLAKDFGNLDCFSTEFLIDGSTLSLIVSDEQKNVQIFYYAPKMSESWKGQKLLSRAEF 3947
            LSLLAKDFG+L+C++TEFLIDGSTLSL+VSD+QKN+QIFYYAPK+ ESWKGQKLL +AEF
Sbjct: 1258 LSLLAKDFGSLNCYTTEFLIDGSTLSLVVSDDQKNIQIFYYAPKLMESWKGQKLLPKAEF 1317

Query: 3948 HVGAHVTKFLRLQMLPTSSDRTGATPGSDKTNRFALLFGTLDGSVGCIAPLDELTFRRLQ 4127
            HVG+HVT+ +RLQMLPT SDRT A+ G+DKTNRFALLFGTLDGS+GCIAPL+ELTFRRLQ
Sbjct: 1318 HVGSHVTRLMRLQMLPT-SDRTSASSGTDKTNRFALLFGTLDGSIGCIAPLEELTFRRLQ 1376

Query: 4128 TLQKKLVDSVPHMAGLNPRAFRMFRSNGKAHRPGPDNMIDCELLYHYEMLPLEDQLEMAH 4307
             LQ+KLVD VPH  GLNPRAFR FRSNGKAH+PGPDNM+DCELL HYEMLPLE+QL++AH
Sbjct: 1377 MLQRKLVDRVPHACGLNPRAFRQFRSNGKAHKPGPDNMVDCELLCHYEMLPLEEQLDIAH 1436

Query: 4308 QIGTTRTQILTNLNDLSLGTSFL 4376
            QIGTTR QI+TNL+DL+LGTSFL
Sbjct: 1437 QIGTTRAQIVTNLSDLTLGTSFL 1459


Top