BLASTX nr result

ID: Paeonia24_contig00002825 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00002825
         (5122 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation spec...  2432   0.0  
emb|CBI24510.3| unnamed protein product [Vitis vinifera]             2426   0.0  
ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prun...  2411   0.0  
ref|XP_007038473.1| Cleavage and polyadenylation specificity fac...  2410   0.0  
ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation spec...  2350   0.0  
ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation spec...  2346   0.0  
ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citr...  2342   0.0  
ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citr...  2337   0.0  
ref|XP_002510905.1| cleavage and polyadenylation specificity fac...  2313   0.0  
gb|EXC20897.1| Cleavage and polyadenylation specificity factor s...  2302   0.0  
ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation spec...  2302   0.0  
ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation spec...  2268   0.0  
ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phas...  2264   0.0  
ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation spec...  2256   0.0  
ref|XP_002318462.2| cleavage and polyadenylation specificity fac...  2255   0.0  
ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation spec...  2222   0.0  
ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arab...  2181   0.0  
ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation spec...  2170   0.0  
ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation spec...  2170   0.0  
ref|NP_199979.2| cleavage and polyadenylation specificity factor...  2163   0.0  

>ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 2432 bits (6304), Expect = 0.0
 Identities = 1208/1456 (82%), Positives = 1313/1456 (90%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MSYAAYKMMHWPTGIENCASGF+TH  ADFAPQI  +QTDDLES+WP +R IGP+PNLI+
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA N+LEVY++RVQE+D R+SR S E KRGGV+AGISGAALELVC YRLHGNVETM VL 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GGGD  RRRDSIILAFQDAKISVLEFDDS+HGLRTSSMH FEGPEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC+GVLVYGLQMIILK +QAG GLVGD++AL+SG AVSARVESSY+ISLRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLL VPSPIGGV+VISAN+IHYHSQSASCALA+NN+AV+AD+SQ+MPRSS 
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD ANA WLSNDVAMLSTKTGELLLLTL YDGRVVHRLDL+KSRASVLTSGI  IG+
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVNGEE 3271
            SLFFLGSRLGDSLLVQ+T  +  SS VKEEVGDIEGD PSAKRLR+SSSDALQD+VNGEE
Sbjct: 421  SLFFLGSRLGDSLLVQFTSIL--SSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478

Query: 3270 LSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYELVC 3091
            LSLYGSAPN+ E++QK FSF+VRDS INVGPLKDF+YGLRINADP A GIAKQSNYELVC
Sbjct: 479  LSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYELVC 538

Query: 3090 CSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDEYHA 2911
            CSGHGKNGALC+LQQSI PE+ITEVEL GC+GIWTVYHKNTRGHNADS+KM+  DDEYHA
Sbjct: 539  CSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDEYHA 598

Query: 2910 YLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRILDGS 2731
            YLIISLESRTMVLET D+LGEVTESVDYYVQG TI+AGNLFGRRRVVQV+ARG+RILDG+
Sbjct: 599  YLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARILDGA 658

Query: 2730 FMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSINIP 2551
            FMTQDL I          SESSTV   SIADPYVLLRM+DG+IQLLVGDPSTCTVSINIP
Sbjct: 659  FMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSINIP 708

Query: 2550 SVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYCVVC 2371
            +VFESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+GEAIDGADGA  DQGDIYCVV 
Sbjct: 709  AVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYCVVS 768

Query: 2370 YESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQARKE 2191
            YESG LEIFDVPNF+CVFSV  FMSG  +LVDT + EPS+D +   + NSEE A Q RKE
Sbjct: 769  YESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQGRKE 828

Query: 2190 NTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQNSV 2011
            N  N+KVVE+ MQRWSG HS PFLFGILTDGTILCYHAYL+EGPE+T K EEA+S QNS+
Sbjct: 829  NAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQNSL 888

Query: 2010 NLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPAWFM 1831
            ++++ SASRLRNLRFVRV LDTYTREE  +G TS RMTVFKN+GG QGLFLSGSRP WFM
Sbjct: 889  SISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPLWFM 948

Query: 1830 VVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWPVQK 1651
            V RER+RVHPQ+CDGSIVAFTVLHN+NCNHGLIYVTS+G LKICQLP+V+SYDNYWPVQK
Sbjct: 949  VFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWPVQK 1008

Query: 1650 IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDLHQT 1471
            IPLKGTPHQVTYFAEKNLYPLIVSV VLKPLN V+SSLVDQEA HQ+ENDNLSS++LH++
Sbjct: 1009 IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDELHRS 1068

Query: 1470 YVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIGTAY 1291
            Y VDEFEVR+LEPEKSG PWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAIGTAY
Sbjct: 1069 YSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY 1128

Query: 1290 LQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPKITL 1111
            +QGEDVA RGRVLLFSVG+NTDN+QNLVSE++SKE KGAISA+ASLQGHLL+ASGPKI L
Sbjct: 1129 VQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIIL 1188

Query: 1110 YKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAKDYA 931
            +KW  TEL  VAFFD PPL+VVSLNIVKNFILLGDIH+SIYFLSWKEQG QL+LLAKD+ 
Sbjct: 1189 HKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFG 1248

Query: 930  SLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHVTKF 751
            SLDCFATEFLIDGSTLSL+VSDDQKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHVTKF
Sbjct: 1249 SLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1308

Query: 750  QRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQRKLV 571
             RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+KLV
Sbjct: 1309 LRLQML--PASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLV 1366

Query: 570  DAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGTTRS 391
            DAV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELLCHYEMLP EEQLEIA QIGTTR 
Sbjct: 1367 DAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRM 1426

Query: 390  QIISNLNDLSLGTSFL 343
            QI+SNLNDLSLGTSFL
Sbjct: 1427 QILSNLNDLSLGTSFL 1442


>emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 2426 bits (6287), Expect = 0.0
 Identities = 1208/1462 (82%), Positives = 1313/1462 (89%), Gaps = 6/1462 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MSYAAYKMMHWPTGIENCASGF+TH  ADFAPQI  +QTDDLES+WP +R IGP+PNLI+
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA N+LEVY++RVQE+D R+SR S E KRGGV+AGISGAALELVC YRLHGNVETM VL 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GGGD  RRRDSIILAFQDAKISVLEFDDS+HGLRTSSMH FEGPEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC+GVLVYGLQMIILK +QAG GLVGD++AL+SG AVSARVESSY+ISLRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLL VPSPIGGV+VISAN+IHYHSQSASCALA+NN+AV+AD+SQ+MPRSS 
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD ANA WLSNDVAMLSTKTGELLLLTL YDGRVVHRLDL+KSRASVLTSGI  IG+
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVNGEE 3271
            SLFFLGSRLGDSLLVQ+T  +  SS VKEEVGDIEGD PSAKRLR+SSSDALQD+VNGEE
Sbjct: 421  SLFFLGSRLGDSLLVQFTSIL--SSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVNGEE 478

Query: 3270 LSLYGSAPNNAESAQ------KNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109
            LSLYGSAPN+ E++Q      K FSF+VRDS INVGPLKDF+YGLRINADP A GIAKQS
Sbjct: 479  LSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQS 538

Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929
            NYELVCCSGHGKNGALC+LQQSI PE+ITEVEL GC+GIWTVYHKNTRGHNADS+KM+  
Sbjct: 539  NYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATK 598

Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749
            DDEYHAYLIISLESRTMVLET D+LGEVTESVDYYVQG TI+AGNLFGRRRVVQV+ARG+
Sbjct: 599  DDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGA 658

Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569
            RILDG+FMTQDL I          SESSTV   SIADPYVLLRM+DG+IQLLVGDPSTCT
Sbjct: 659  RILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCT 708

Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389
            VSINIP+VFESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+GEAIDGADGA  DQGD
Sbjct: 709  VSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGD 768

Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209
            IYCVV YESG LEIFDVPNF+CVFSV  FMSG  +LVDT + EPS+D +   + NSEE A
Sbjct: 769  IYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEA 828

Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029
             Q RKEN  N+KVVE+ MQRWSG HS PFLFGILTDGTILCYHAYL+EGPE+T K EEA+
Sbjct: 829  DQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAV 888

Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849
            S QNS+++++ SASRLRNLRFVRV LDTYTREE  +G TS RMTVFKN+GG QGLFLSGS
Sbjct: 889  SAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGS 948

Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669
            RP WFMV RER+RVHPQ+CDGSIVAFTVLHN+NCNHGLIYVTS+G LKICQLP+V+SYDN
Sbjct: 949  RPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDN 1008

Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489
            YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSV VLKPLN V+SSLVDQEA HQ+ENDNLSS
Sbjct: 1009 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSS 1068

Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309
            ++LH++Y VDEFEVR+LEPEKSG PWQTRATIPMQSSENALTVRVVTL+N TTKENETLL
Sbjct: 1069 DELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1128

Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129
            AIGTAY+QGEDVA RGRVLLFSVG+NTDN+QNLVSE++SKE KGAISA+ASLQGHLL+AS
Sbjct: 1129 AIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIAS 1188

Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949
            GPKI L+KW  TEL  VAFFD PPL+VVSLNIVKNFILLGDIH+SIYFLSWKEQG QL+L
Sbjct: 1189 GPKIILHKWTGTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNL 1248

Query: 948  LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769
            LAKD+ SLDCFATEFLIDGSTLSL+VSDDQKN+QIFYYAPK SESWKGQKLLSRAEFHVG
Sbjct: 1249 LAKDFGSLDCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1308

Query: 768  AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589
            AHVTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQS
Sbjct: 1309 AHVTKFLRLQML--PASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQS 1366

Query: 588  LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409
            LQ+KLVDAV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELLCHYEMLP EEQLEIA Q
Sbjct: 1367 LQKKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQ 1426

Query: 408  IGTTRSQIISNLNDLSLGTSFL 343
            IGTTR QI+SNLNDLSLGTSFL
Sbjct: 1427 IGTTRMQILSNLNDLSLGTSFL 1448


>ref|XP_007220310.1| hypothetical protein PRUPE_ppa000211mg [Prunus persica]
            gi|462416772|gb|EMJ21509.1| hypothetical protein
            PRUPE_ppa000211mg [Prunus persica]
          Length = 1459

 Score = 2411 bits (6249), Expect = 0.0
 Identities = 1180/1461 (80%), Positives = 1316/1461 (90%), Gaps = 5/1461 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWP-ARRGIGPVPNLI 4534
            MS+AAYKMMHWPTGIENCASGFI+H  +DF P+IP +QT+DLES+WP +RR IGP+P+L+
Sbjct: 1    MSFAAYKMMHWPTGIENCASGFISHSRSDFVPRIPPIQTEDLESEWPTSRREIGPIPDLV 60

Query: 4533 ITAGNVLEVYIIRVQEED-VRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357
            +TAGNVLEVY++RVQEED  R  R SGE KRGG++ G+SGA+LELVCHYRLHGNV TMAV
Sbjct: 61   VTAGNVLEVYVVRVQEEDGTRGPRASGEPKRGGLMDGVSGASLELVCHYRLHGNVVTMAV 120

Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177
            LS GGGDG RRRDSIIL F+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHL+RGRESFA
Sbjct: 121  LSSGGGDGSRRRDSIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180

Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997
            RGPLVK DPQGRC  +LVYGLQMIILK +Q GSGLVGDDD+  SGGA+S+R+ESSYI++L
Sbjct: 181  RGPLVKVDPQGRCGSILVYGLQMIILKASQGGSGLVGDDDSFGSGGAISSRIESSYIVNL 240

Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817
            RD+ MKHVKDF F+HGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI
Sbjct: 241  RDMDMKHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300

Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637
            WSA+NLPHDAYKLLAVPSPIGGVLVISAN+IHYHSQSASCALA+N++AV+AD+SQ+MPRS
Sbjct: 301  WSAVNLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAVSADNSQEMPRS 360

Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457
            S +VELD ANA WL NDVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVLTSGIT +
Sbjct: 361  SFTVELDTANATWLLNDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITKV 420

Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286
            G+SLFFLGSRLGDSLLVQ+TCGVG S   S +K+EVGDIEGDAP AKRLR SSSDALQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTCGVGGSVLSSDMKDEVGDIEGDAPLAKRLRMSSSDALQDM 480

Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106
            V+GEELSLYGSAPNNAESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSN
Sbjct: 481  VSGEELSLYGSAPNNAESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 540

Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926
            YELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHKN RGHNADSSK++A D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKNARGHNADSSKIAASD 600

Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746
            DE+HAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRVVQV+ RG+R
Sbjct: 601  DEFHAYLIISLEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVVQVYERGAR 660

Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566
            ILDGSFMTQDLS GT N+E G GSESSTV   SI DPYVLLRM+DG I+LLVGDPS CTV
Sbjct: 661  ILDGSFMTQDLSFGTSNSEMGSGSESSTVLSVSIVDPYVLLRMSDGGIRLLVGDPSLCTV 720

Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386
            S +IP+ FESSKK IS CTLYHDKGPEPWLRK STDAWLSTG+ EAIDGADG  HDQGD+
Sbjct: 721  STSIPAAFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIDEAIDGADGVSHDQGDV 780

Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206
            YCVVCYESG+LEIFDVPNF+CVFSV  F+SG  +L+DT +R+P KDP+   N +SEEV+G
Sbjct: 781  YCVVCYESGSLEIFDVPNFNCVFSVDKFVSGNAHLIDTLMRDPPKDPQKLINKSSEEVSG 840

Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
            Q RKEN +NMKVVE+ MQRWSG HS PFLFGIL DG ILCYHAYLFEGPE  SK E++ S
Sbjct: 841  QGRKENIQNMKVVELAMQRWSGQHSRPFLFGILNDGMILCYHAYLFEGPETASKTEDSAS 900

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
             QN+  +++ SASRLRNLRFVRV LDTY +++T    + QRMT+FKN+ GYQGLFLSGSR
Sbjct: 901  AQNTTGVSNLSASRLRNLRFVRVPLDTYAKKDTSNETSCQRMTIFKNIAGYQGLFLSGSR 960

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            PAWFMV RERLR+HPQ+CDGS+VA TVLHNVNCNHGLIYVTS+G+LKICQLP +TSYDNY
Sbjct: 961  PAWFMVFRERLRIHPQLCDGSVVAVTVLHNVNCNHGLIYVTSQGILKICQLPPITSYDNY 1020

Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486
            WPVQKIPLKGTPHQVTYFAEKNLYPLIVSV V KPLNQV+SSLVDQE  HQ+EN NLSS+
Sbjct: 1021 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVPVHKPLNQVLSSLVDQEVGHQVENHNLSSD 1080

Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306
            +LH+TY VDEFE+RI+EP+KSGGPWQT+ATIPMQ+SENALTVRVVTL+N TTKENETLLA
Sbjct: 1081 ELHRTYSVDEFEIRIMEPDKSGGPWQTKATIPMQTSENALTVRVVTLFNTTTKENETLLA 1140

Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126
            IGTAY+QGEDVAGRGRVLLFS G++ DNTQ LVSEV+SKE KGAISALASLQGHLL+ASG
Sbjct: 1141 IGTAYVQGEDVAGRGRVLLFSAGKSADNTQTLVSEVYSKELKGAISALASLQGHLLIASG 1200

Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946
            PKI L+KWN TEL  VAFFDVPPL+VVSLNIVKNFILLGD+HKSIYFLSWKEQG QL+LL
Sbjct: 1201 PKIILHKWNGTELNGVAFFDVPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLTLL 1260

Query: 945  AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766
            AKD+ +LDCFATEFLIDGSTLSL+V+D+QKN+QIFYYAPK SESWKGQKLLSRAEFHVG 
Sbjct: 1261 AKDFGNLDCFATEFLIDGSTLSLVVADEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGT 1320

Query: 765  HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586
            HVTKF RLQML               DKTNR+ALLF TLDGSIGCIAPLD LTFRRLQSL
Sbjct: 1321 HVTKFLRLQML--STSSDRTGTNPGSDKTNRYALLFGTLDGSIGCIAPLDELTFRRLQSL 1378

Query: 585  QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406
            Q+KLVDAVHHVAGLNPR+FRQF+S+GKAH+PGPD IVDCELL HYEMLPLEEQLEIA+QI
Sbjct: 1379 QKKLVDAVHHVAGLNPRAFRQFQSNGKAHRPGPDTIVDCELLSHYEMLPLEEQLEIANQI 1438

Query: 405  GTTRSQIISNLNDLSLGTSFL 343
            GTTRSQI SNLNDLS+GTSFL
Sbjct: 1439 GTTRSQIFSNLNDLSIGTSFL 1459


>ref|XP_007038473.1| Cleavage and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao] gi|508775718|gb|EOY22974.1| Cleavage
            and polyadenylation specificity factor 160 isoform 1
            [Theobroma cacao]
          Length = 1457

 Score = 2410 bits (6247), Expect = 0.0
 Identities = 1184/1459 (81%), Positives = 1309/1459 (89%), Gaps = 3/1459 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MSYAAYKMMHWPTGIENCASGF+THC ADF PQIP  QT+DLES+WPARRGIGPVPNLI+
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHCRADFTPQIPLNQTEDLESEWPARRGIGPVPNLIV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA N+LE+Y++RVQEE  R++R S E KRGGV+ G+SG +LELVC+YRLHGNVE+MAVLS
Sbjct: 61   TAANLLEIYVVRVQEEGRREARNSTEVKRGGVLDGVSGVSLELVCNYRLHGNVESMAVLS 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
            IGGGDG RRRDSIILAF+DAKISVLEFDDS+HGLRT+SMH FEGPEWLHLKRGRESFARG
Sbjct: 121  IGGGDGSRRRDSIILAFKDAKISVLEFDDSIHGLRTTSMHCFEGPEWLHLKRGRESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC GVLVY LQMIILK +QAGSG VG+DDA  SGGAVSARVESSYII+LRD
Sbjct: 181  PLVKVDPQGRCGGVLVYDLQMIILKASQAGSGFVGEDDAFGSGGAVSARVESSYIINLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L +KH+KDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDVKHIKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALA+NN+A++ D+SQD+PRS+ 
Sbjct: 301  AVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNNYAISVDNSQDLPRSNF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD ANA WL NDVA+LSTKTGELLLLTL+YDGRVV RLDL+KS+ASVLTS ITTIG+
Sbjct: 361  SVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLTSDITTIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ++ G G S   SG+KEEVGDIEGD P AKRLRRSSSDALQD+V 
Sbjct: 421  SLFFLGSRLGDSLLVQFSGGSGVSALPSGLKEEVGDIEGDVPLAKRLRRSSSDALQDMVG 480

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYGSAPNN ESAQK F FAVRDS+ NVGPLKDFSYGLRINAD NA GIAKQSNYE
Sbjct: 481  GEELSLYGSAPNNTESAQKTFLFAVRDSLTNVGPLKDFSYGLRINADVNATGIAKQSNYE 540

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR H+AD SK++  DDE
Sbjct: 541  LVCCSGHGKNGALCVLRQSIRPEMITEVELTGCKGIWTVYHKSTRSHSADLSKVTDDDDE 600

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRVVQV+ RG+RIL
Sbjct: 601  YHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGARIL 660

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DGSFMTQ+LSI +PN+ES  GSE+STV   SIADPYVLLRMTDGSI LLVGDP+TCTVSI
Sbjct: 661  DGSFMTQELSIPSPNSESSPGSENSTVISVSIADPYVLLRMTDGSILLLVGDPATCTVSI 720

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
            N P+ FE SKK++S CTLYHDKGPEPWLRKASTDAWLSTGVGE+IDGADG PHDQGDIYC
Sbjct: 721  NTPTAFEGSKKMVSACTLYHDKGPEPWLRKASTDAWLSTGVGESIDGADGGPHDQGDIYC 780

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYESG LEIFDVPNF+CVFS+  F SG+  LVD    E SKD +   N +SEE+ GQ 
Sbjct: 781  VVCYESGALEIFDVPNFNCVFSMEKFASGRTRLVDAYTLESSKDSEKVINKSSEELTGQG 840

Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020
            RKEN +N+KVVE+ MQRWS  HS PFLFGILTDGTILCYHAYLFEG EN SK+E+++  Q
Sbjct: 841  RKENVQNLKVVELAMQRWSANHSRPFLFGILTDGTILCYHAYLFEGSENASKVEDSVVAQ 900

Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840
            NSV L++ +ASRLRNLRF+R+ LD YTREE   G  SQR+T+FKN+ GYQG FLSGSRPA
Sbjct: 901  NSVGLSNINASRLRNLRFIRIPLDAYTREEMSNGTLSQRITIFKNISGYQGFFLSGSRPA 960

Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660
            WFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQ+PS ++YDNYWP
Sbjct: 961  WFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQIPSASNYDNYWP 1020

Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480
            VQKIPL+GTPHQVTYFAE+NLYP+IVSV V KP+NQV+SSLVDQE  HQ++N NLSS++L
Sbjct: 1021 VQKIPLRGTPHQVTYFAERNLYPIIVSVPVHKPVNQVLSSLVDQEVGHQMDNHNLSSDEL 1080

Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300
             +TY VDEFEVRILEPEKSGGPW+T+ATIPMQSSENALTVRVVTL+N TTKENE+LLAIG
Sbjct: 1081 QRTYTVDEFEVRILEPEKSGGPWETKATIPMQSSENALTVRVVTLFNTTTKENESLLAIG 1140

Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120
            TAY+QGEDVA RGRV+L S+GRNTDN QNLVSEV+SKE KGAISALASLQGHLL+ASGPK
Sbjct: 1141 TAYIQGEDVAARGRVILCSIGRNTDNLQNLVSEVYSKELKGAISALASLQGHLLIASGPK 1200

Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940
            I L+ W  +EL  +AF+D PPL+VVSLNIVKNFILLGD+HKSIYFLSWKEQG QLSLLAK
Sbjct: 1201 IILHNWTGSELNGIAFYDAPPLYVVSLNIVKNFILLGDVHKSIYFLSWKEQGAQLSLLAK 1260

Query: 939  DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760
            D+ SLDCFATEFLIDGSTLSLMVSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV
Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320

Query: 759  TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580
            TKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+
Sbjct: 1321 TKFLRLQML--STSSDRTSATAGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378

Query: 579  KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400
            KLVDAV HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELLCHYEMLPLEEQL+IAHQIGT
Sbjct: 1379 KLVDAVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLDIAHQIGT 1438

Query: 399  TRSQIISNLNDLSLGTSFL 343
            TRSQI+SNLNDL+LGTSFL
Sbjct: 1439 TRSQILSNLNDLTLGTSFL 1457


>ref|XP_006490256.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X2 [Citrus sinensis]
          Length = 1457

 Score = 2350 bits (6091), Expect = 0.0
 Identities = 1152/1459 (78%), Positives = 1291/1459 (88%), Gaps = 3/1459 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AAYKMMHWPTGI NC SGFITH  AD+ PQIP +QT++L+S+ P++RGIGPVPNL++
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA NV+E+Y++RVQEE  ++S+ SGE KR  ++ GIS A+LELVCHYRLHGNVE++A+LS
Sbjct: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG D  RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG
Sbjct: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D   SGG  SAR+ESS++I+LRD
Sbjct: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS 
Sbjct: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+  SVLTS ITTIG+
Sbjct: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ+TCG G S   SG+KEE GDIE DAPS KRLRRSSSDALQD+VN
Sbjct: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE
Sbjct: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSS+M+A DDE
Sbjct: 541  LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 600

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL
Sbjct: 601  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DGS+MTQDLS G  N+ESG GSE+STV   SIADPYVLL M+DGSI+LLVGDPSTCTVS+
Sbjct: 661  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
              P+  ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY 
Sbjct: 721  QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYESG LEIFDVPNF+CVF+V  F+SG+ ++VDT +RE  KD +   N++SEE  GQ 
Sbjct: 781  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840

Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020
            RKEN  +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEGPENTSK ++ +S  
Sbjct: 841  RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 900

Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840
             S+++++ SASRLRNLRF R+ LD YTREETP G   QR+T+FKN+ G+QG FLSGSRP 
Sbjct: 901  RSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960

Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660
            W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP
Sbjct: 961  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020

Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480
            VQKIPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE  HQI+N NLSS DL
Sbjct: 1021 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1080

Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300
            H+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAIG
Sbjct: 1081 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIG 1140

Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120
            TAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGPK
Sbjct: 1141 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1200

Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940
            I L+KW  TEL  +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLAK
Sbjct: 1201 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1260

Query: 939  DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760
            D+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV
Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320

Query: 759  TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580
            TKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+
Sbjct: 1321 TKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378

Query: 579  KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400
            KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ GT
Sbjct: 1379 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1438

Query: 399  TRSQIISNLNDLSLGTSFL 343
            TRSQI+SNLNDL+LGTSFL
Sbjct: 1439 TRSQILSNLNDLALGTSFL 1457


>ref|XP_006490255.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Citrus sinensis]
          Length = 1458

 Score = 2346 bits (6079), Expect = 0.0
 Identities = 1152/1460 (78%), Positives = 1291/1460 (88%), Gaps = 4/1460 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AAYKMMHWPTGI NC SGFITH  AD+ PQIP +QT++L+S+ P++RGIGPVPNL++
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA NV+E+Y++RVQEE  ++S+ SGE KR  ++ GIS A+LELVCHYRLHGNVE++A+LS
Sbjct: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG D  RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG
Sbjct: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D   SGG  SAR+ESS++I+LRD
Sbjct: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS 
Sbjct: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+  SVLTS ITTIG+
Sbjct: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ+TCG G S   SG+KEE GDIE DAPS KRLRRSSSDALQD+VN
Sbjct: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE
Sbjct: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSS+M+A DDE
Sbjct: 541  LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 600

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL
Sbjct: 601  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DGS+MTQDLS G  N+ESG GSE+STV   SIADPYVLL M+DGSI+LLVGDPSTCTVS+
Sbjct: 661  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
              P+  ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY 
Sbjct: 721  QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYESG LEIFDVPNF+CVF+V  F+SG+ ++VDT +RE  KD +   N++SEE  GQ 
Sbjct: 781  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840

Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020
            RKEN  +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEGPENTSK ++ +S  
Sbjct: 841  RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 900

Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840
             S+++++ SASRLRNLRF R+ LD YTREETP G   QR+T+FKN+ G+QG FLSGSRP 
Sbjct: 901  RSLSVSNVSASRLRNLRFARIPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960

Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660
            W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP
Sbjct: 961  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020

Query: 1659 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483
            VQK IPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE  HQI+N NLSS D
Sbjct: 1021 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1080

Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303
            LH+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKENETLLAI
Sbjct: 1081 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1140

Query: 1302 GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123
            GTAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGP
Sbjct: 1141 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1200

Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943
            KI L+KW  TEL  +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLA
Sbjct: 1201 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1260

Query: 942  KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763
            KD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAH
Sbjct: 1261 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1320

Query: 762  VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583
            VTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ
Sbjct: 1321 VTKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1378

Query: 582  RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403
            +KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ G
Sbjct: 1379 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1438

Query: 402  TTRSQIISNLNDLSLGTSFL 343
            TTRSQI+SNLNDL+LGTSFL
Sbjct: 1439 TTRSQILSNLNDLALGTSFL 1458


>ref|XP_006421760.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523633|gb|ESR35000.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1457

 Score = 2342 bits (6069), Expect = 0.0
 Identities = 1149/1459 (78%), Positives = 1287/1459 (88%), Gaps = 3/1459 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AAYKMMHWPTGI NC SGFITH  AD+ PQIP +QT++L+S+ P++RGIGPVPNL++
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA NV+E+Y++RVQEE  ++S+ SGE KR  ++ GIS A+LELVCHYRLHGNVE++A+LS
Sbjct: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG D  RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG
Sbjct: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D   SGG  SAR+ESS++I+LRD
Sbjct: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS 
Sbjct: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+  SVLTS ITTIG+
Sbjct: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ+TCG G S   SG KEE GDIE DAPS KRLRRSSSDALQD+VN
Sbjct: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE
Sbjct: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHN DSS+M+A DDE
Sbjct: 541  LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDE 600

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL
Sbjct: 601  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DGS+MTQDLS G  N+ESG GSE+STV   SIADPYVLL M+DGSI+LLVGDPSTCTVS+
Sbjct: 661  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
              P+  ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY 
Sbjct: 721  QTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYESG LEIFDVPNF+CVF+V  F+SG+ ++VDT +RE  KD +   N++SEE  GQ 
Sbjct: 781  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840

Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020
            RKEN  +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEG ENTSK ++ +S  
Sbjct: 841  RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTS 900

Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840
             S+++++ SASRLRNLRF R  LD YTREETP G   QR+T+FKN+ G+QG FLSGSRP 
Sbjct: 901  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960

Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660
            W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP
Sbjct: 961  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020

Query: 1659 VQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSNDL 1480
            VQKIPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE  HQI+N NLSS DL
Sbjct: 1021 VQKIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1080

Query: 1479 HQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAIG 1300
            H+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKEN+TLLAIG
Sbjct: 1081 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAIG 1140

Query: 1299 TAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGPK 1120
            TAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGPK
Sbjct: 1141 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPK 1200

Query: 1119 ITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLAK 940
            I L+KW  TEL  +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLAK
Sbjct: 1201 IILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAK 1260

Query: 939  DYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAHV 760
            D+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAHV
Sbjct: 1261 DFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHV 1320

Query: 759  TKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQR 580
            TKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ+
Sbjct: 1321 TKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQK 1378

Query: 579  KLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIGT 400
            KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ GT
Sbjct: 1379 KLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGT 1438

Query: 399  TRSQIISNLNDLSLGTSFL 343
            TRSQI+SNLNDL+LGTSFL
Sbjct: 1439 TRSQILSNLNDLALGTSFL 1457


>ref|XP_006421759.1| hypothetical protein CICLE_v10004147mg [Citrus clementina]
            gi|557523632|gb|ESR34999.1| hypothetical protein
            CICLE_v10004147mg [Citrus clementina]
          Length = 1458

 Score = 2337 bits (6057), Expect = 0.0
 Identities = 1149/1460 (78%), Positives = 1287/1460 (88%), Gaps = 4/1460 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AAYKMMHWPTGI NC SGFITH  AD+ PQIP +QT++L+S+ P++RGIGPVPNL++
Sbjct: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            TA NV+E+Y++RVQEE  ++S+ SGE KR  ++ GIS A+LELVCHYRLHGNVE++A+LS
Sbjct: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG D  RRRDSIILAF+DAKISVLEFDDS+HGLR +SMH FE PEWLHLKRGRESFARG
Sbjct: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            PLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGD+D   SGG  SAR+ESS++I+LRD
Sbjct: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L MKHVKDFIFVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+AV+ DSSQ++PRSS 
Sbjct: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            SVELD A+A WL NDVA+LSTKTG+L+LLT+VYDGRVV RLDL+K+  SVLTS ITTIG+
Sbjct: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ+TCG G S   SG KEE GDIE DAPS KRLRRSSSDALQD+VN
Sbjct: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGPKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYGSA NN ESAQK FSFAVRDS++N+GPLKDFSYGLRINAD +A GI+KQSNYE
Sbjct: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHN DSS+M+A DDE
Sbjct: 541  LVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNTDSSRMAAYDDE 600

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RIL
Sbjct: 601  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 660

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DGS+MTQDLS G  N+ESG GSE+STV   SIADPYVLL M+DGSI+LLVGDPSTCTVS+
Sbjct: 661  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 720

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
              P+  ESSKK +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDGADG P DQGDIY 
Sbjct: 721  QTPAAIESSKKPVSACTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 780

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYESG LEIFDVPNF+CVF+V  F+SG+ ++VDT +RE  KD +   N++SEE  GQ 
Sbjct: 781  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 840

Query: 2199 RKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISGQ 2020
            RKEN  +MKVVE+ MQRWSG HS PFLF ILTDGTILCY AYLFEG ENTSK ++ +S  
Sbjct: 841  RKENIHSMKVVELAMQRWSGHHSRPFLFAILTDGTILCYQAYLFEGSENTSKSDDPVSTS 900

Query: 2019 NSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRPA 1840
             S+++++ SASRLRNLRF R  LD YTREETP G   QR+T+FKN+ G+QG FLSGSRP 
Sbjct: 901  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 960

Query: 1839 WFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYWP 1660
            W MV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTS+G+LKICQLPS ++YDNYWP
Sbjct: 961  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 1020

Query: 1659 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483
            VQK IPLK TPHQ+TYFAEKNLYPLIVSV VLKPLNQV+S L+DQE  HQI+N NLSS D
Sbjct: 1021 VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1080

Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303
            LH+TY V+E+EVRILEP+++GGPWQTRATIPMQSSENALTVRVVTL+N TTKEN+TLLAI
Sbjct: 1081 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENDTLLAI 1140

Query: 1302 GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123
            GTAY+QGEDVA RGRVLLFS GRN DN QNLV+EV+SKE KGAISALASLQGHLL+ASGP
Sbjct: 1141 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGP 1200

Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943
            KI L+KW  TEL  +AF+D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQG QL+LLA
Sbjct: 1201 KIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLA 1260

Query: 942  KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763
            KD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHVGAH
Sbjct: 1261 KDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1320

Query: 762  VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583
            VTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQSLQ
Sbjct: 1321 VTKFLRLQML--ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1378

Query: 582  RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403
            +KLVD+V HVAGLNPRSFRQF S+GKAH+PGPD+IVDCELL HYEMLPLEEQLEIAHQ G
Sbjct: 1379 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1438

Query: 402  TTRSQIISNLNDLSLGTSFL 343
            TTRSQI+SNLNDL+LGTSFL
Sbjct: 1439 TTRSQILSNLNDLALGTSFL 1458


>ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis] gi|223550020|gb|EEF51507.1| cleavage
            and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 2313 bits (5995), Expect = 0.0
 Identities = 1146/1463 (78%), Positives = 1286/1463 (87%), Gaps = 7/1463 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWP-ARRGIGPVPNLI 4534
            MSYAAYKM+HWPTGIE+CASG+ITH  ADF PQIP +QTD+L+S+WP ++RGIGP+PNLI
Sbjct: 1    MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60

Query: 4533 ITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVL 4354
            +TAG+VLEVY++RVQE+  R+SR S E KRGG++ G+SGA+LELVCHYRLHGNVE+M VL
Sbjct: 61   VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120

Query: 4353 SIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFAR 4174
               GGD  RRRDSIILAF+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHLKRGRESFAR
Sbjct: 121  PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180

Query: 4173 GPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLR 3994
            GPL+K DPQGRC G+LVY +QMIIL+ AQA SGLVGDDDAL+SGG++SARV+SSY+I+LR
Sbjct: 181  GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240

Query: 3993 DLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIW 3814
            D+ MKHVKDFIF+H YIEPV+VILHERELTW+GR+SWKHHTC ISALSISTTLKQ  LIW
Sbjct: 241  DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300

Query: 3813 SAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSS 3634
            S +NLPHDAYKLLAVP PIGGVLVI ANTIHYHS+SA+ ALA+NN+AV+ DSSQ++PR+S
Sbjct: 301  SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360

Query: 3633 ISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIG 3454
             SVELD   AAWL NDVA+LS K GELLLL+LVYDGRVV RLDL+KS+ASVLTS ITTIG
Sbjct: 361  FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420

Query: 3453 SSLFFLGSRLGDSLLVQYTCGVG---ASSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIV 3283
            +SLFFLGSRLGDSLLVQ+T G+G    SSG+KEEVG+IEGD PSAKRL+RS+SD LQD+V
Sbjct: 421  NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480

Query: 3282 NGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNY 3103
            +GEELSLYGS  NN ESAQK+FSFAVRDS+INVGPLKDFSYGLR N D +A GIAKQSNY
Sbjct: 481  SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540

Query: 3102 ELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDD 2923
            +LVCCSGHGKNG LC+L+QSI PE+ITEV+L GCRGIWTVYHKN RGHN D SKM+A  D
Sbjct: 541  DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600

Query: 2922 EYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRI 2743
            EYHAYLIIS+E+RTMVLET D+L EVTESVDY+VQG TIAAGNLFGRRRV+QVF RG+RI
Sbjct: 601  EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660

Query: 2742 LDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVS 2563
            LDGSFMTQDLSIG+ N+ES  GSES+TVS  SIADPYVL++MTDGSI+LL+GD STC VS
Sbjct: 661  LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720

Query: 2562 INIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGA---DGAPHDQG 2392
            IN PS FE+S++ +S CTLYHDKGPEPWLRKASTDAWLSTGV EAIDGA   DG PHDQG
Sbjct: 721  INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780

Query: 2391 DIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEV 2212
            DIYC+VCYESG LEIFDVPNF+ VFSV  F+SGK +L D  +REP KD +  TN  SEEV
Sbjct: 781  DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840

Query: 2211 AGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEA 2032
            AG  RKEN  NMK VE+ MQRWSG HS PFLFG+LTDGTILCYHAYLFE P+ TSK E++
Sbjct: 841  AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900

Query: 2031 ISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSG 1852
            +S QN V L S SASRLRNLRFVRV LD+Y +EET T  + QR+T+F N+ G+QG FL G
Sbjct: 901  VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960

Query: 1851 SRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYD 1672
            SRPAWFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G LKICQLPS ++YD
Sbjct: 961  SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020

Query: 1671 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLS 1492
            NYWPVQKIPLKGTPHQVTYF EKNLYPLIVSV V KP+NQV+SSLVDQE  HQIEN NLS
Sbjct: 1021 NYWPVQKIPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNLS 1080

Query: 1491 SNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETL 1312
            S++L QTY V+EFEVRILE E  GGPWQT+ATIPMQSSENALTVRVVTL+NATTKENETL
Sbjct: 1081 SDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENETL 1140

Query: 1311 LAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVA 1132
            LAIGTAY+QGEDVA RGRVLLFSV ++T+N+Q LVSEV+SKE KGAISALASLQGHLL+A
Sbjct: 1141 LAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLVSEVYSKELKGAISALASLQGHLLIA 1200

Query: 1131 SGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLS 952
            SGPKI L+KW  TEL  VAF+D PPL+V S+NIVKNFILLGDIHKSIYFLSWKEQG QLS
Sbjct: 1201 SGPKIILHKWTGTELNGVAFYDAPPLYVASMNIVKNFILLGDIHKSIYFLSWKEQGAQLS 1260

Query: 951  LLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHV 772
            LLAKD+ SLDCFATEFLIDGSTLSL+VSD+QKN+QIFYYAPK  ESWKGQKLLSRAEFHV
Sbjct: 1261 LLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMLESWKGQKLLSRAEFHV 1320

Query: 771  GAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQ 592
            GAH+TKF RL ML               DKTNRFALLF TLDGSIGCIAPLD LTFRRLQ
Sbjct: 1321 GAHITKFIRLSML--STSSDRSGAAPGPDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1378

Query: 591  SLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAH 412
            SLQRKLVDAV HVAGLNPRSFRQF+S GK H+PGP++IVDCELL H+EMLPLEEQLEIA 
Sbjct: 1379 SLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRPGPESIVDCELLSHFEMLPLEEQLEIAQ 1438

Query: 411  QIGTTRSQIISNLNDLSLGTSFL 343
            Q+GTTR+QI+SNLNDLSLGTSFL
Sbjct: 1439 QVGTTRAQILSNLNDLSLGTSFL 1461


>gb|EXC20897.1| Cleavage and polyadenylation specificity factor subunit 1 [Morus
            notabilis]
          Length = 1479

 Score = 2302 bits (5965), Expect = 0.0
 Identities = 1148/1489 (77%), Positives = 1292/1489 (86%), Gaps = 33/1489 (2%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPA-RRGIGPVPNLI 4534
            MS+AAYKMMHWPTGIENCA+GF++H  ADF P+IP +Q+DDL+SDWPA RR  GPVPNL+
Sbjct: 1    MSFAAYKMMHWPTGIENCAAGFVSHSRADFVPRIPPIQSDDLDSDWPAGRRETGPVPNLV 60

Query: 4533 ITAGNVLEVYIIRVQEED-VRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357
            +TAGNVLEVY++R+QEED  R SR   E++RGG++ G+SGA+LELVCHYRLHGNV+T+AV
Sbjct: 61   VTAGNVLEVYVVRLQEEDDTRSSRAPAESRRGGLMDGLSGASLELVCHYRLHGNVQTIAV 120

Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177
            LS GGGDG RRRDSIIL+FQDAKISVLEFDDS+HGLRTSSMH FEGPEWL+LKRGRESFA
Sbjct: 121  LSSGGGDGSRRRDSIILSFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWLYLKRGRESFA 180

Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997
            RGPLVK DPQGRCAGVL Y +QMI+LK AQAGSGLVG++DAL SGGAVSAR+ESSYII+L
Sbjct: 181  RGPLVKVDPQGRCAGVLAYNIQMIMLKAAQAGSGLVGEEDALGSGGAVSARIESSYIINL 240

Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817
            RDL MKH+KDF+FVHGYIEPVMVILHERELTW+GR+ WKHHTC ISALSISTTLKQHPLI
Sbjct: 241  RDLDMKHIKDFVFVHGYIEPVMVILHERELTWAGRVLWKHHTCMISALSISTTLKQHPLI 300

Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637
            WSA+NLPHDAYKLLAVPSPIGGVLVI ANT+HY SQS SC LA+N++AV+ DSSQ+M R+
Sbjct: 301  WSAVNLPHDAYKLLAVPSPIGGVLVICANTLHYQSQSNSCTLALNSYAVSVDSSQEMRRA 360

Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457
              SVELD ANA WLSNDV +LSTK GELLLLTLVYDGRVV RLDL+KS+ASVLTSGITTI
Sbjct: 361  PFSVELDAANATWLSNDVVLLSTKAGELLLLTLVYDGRVVQRLDLSKSKASVLTSGITTI 420

Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286
            G+SLFFLGSRLGDSLLVQ+T G+G    SSG+K+EVGDIEGDA  AKRLRRSSSD LQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTYGLGTSMLSSGLKDEVGDIEGDAHLAKRLRRSSSDVLQDM 480

Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106
             +GEELSLY SAPNN+ES QK+FSF VRDS++NVGPLKDFSYGLRINADPNA G+AKQSN
Sbjct: 481  TSGEELSLYVSAPNNSESTQKSFSFTVRDSLVNVGPLKDFSYGLRINADPNATGVAKQSN 540

Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926
            YELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR H  DSSK+ A D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSTRSH--DSSKLVAAD 598

Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746
            DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRVVQV+ RG+R
Sbjct: 599  DEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVVQVYERGAR 658

Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566
            ILDGSFMTQDLS G   +ES  GSE++ V+  SIADPYV+LRM+DGSI+LLVGDP++CTV
Sbjct: 659  ILDGSFMTQDLSFGPAPSESSSGSENAVVTSVSIADPYVVLRMSDGSIRLLVGDPTSCTV 718

Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386
            S++ P+ FESSK +IS CTLY DKGPEPWLRK STDAWLSTGV EAIDGAD    DQGDI
Sbjct: 719  SVSTPADFESSKSIISACTLYRDKGPEPWLRKTSTDAWLSTGVDEAIDGADETLQDQGDI 778

Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206
            YCVVCYESG+L+I+DVP+F+ VFSV NF+SG+P+LVD  ++E  KD + ATN NSEE AG
Sbjct: 779  YCVVCYESGSLDIYDVPSFNYVFSVDNFISGRPHLVDAFVQEQPKDLQKATNKNSEESAG 838

Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
            Q RKEN +NMK+VE+ MQRWSG HS PFL GILTDG+ILCYHAYLFEGPE+TS+ E+++S
Sbjct: 839  QGRKENVQNMKIVELAMQRWSGKHSRPFLLGILTDGSILCYHAYLFEGPESTSRTEDSVS 898

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
             +NS      S SRLRNLRFVRV LD+Y REET  G+  QR++VFKN+ GYQGLFLSGSR
Sbjct: 899  SRNS------SGSRLRNLRFVRVPLDSYAREETSDGMPCQRISVFKNIAGYQGLFLSGSR 952

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            PAWFMV RERLRVHPQ+CDGSIVAFTVLHNVNCNHG IYVTSEG+LKICQLPS+TSYDNY
Sbjct: 953  PAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSEGILKICQLPSITSYDNY 1012

Query: 1665 WPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489
            WPVQK IPLKGTPHQVTYFAE+NLYPLIVSV V KPLNQV+SSL+DQE  HQ EN NLS 
Sbjct: 1013 WPVQKVIPLKGTPHQVTYFAERNLYPLIVSVPVPKPLNQVMSSLLDQEVGHQFENPNLSP 1072

Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309
            +DL++TY +DEFEVRILEPE+SGGPWQT+ TIPMQSSENALT+RVVTL+N TT ENETLL
Sbjct: 1073 DDLNRTYTIDEFEVRILEPERSGGPWQTKVTIPMQSSENALTIRVVTLFNTTTNENETLL 1132

Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGR---------------------------NTDNTQNL 1210
            AIGTAY+QGEDVA RGR++L ++                             ++ +    
Sbjct: 1133 AIGTAYVQGEDVAARGRIILRALAPWWERLHLHPGSRVQIPEMASPSGVFKIDSADFHLQ 1192

Query: 1209 VSEVFSKEYKGAISALASLQGHLLVASGPKITLYKWNATELTPVAFFDVPPLHVVSLNIV 1030
            VSE++SKE KGAISALASLQGHLL+ASGPKI L+KW  TEL  +AFFD PPL+VVSLNIV
Sbjct: 1193 VSEIYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFFDAPPLYVVSLNIV 1252

Query: 1029 KNFILLGDIHKSIYFLSWKEQGCQLSLLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNV 850
            KNFIL+GD+HKSIYFLSWKEQG QLSLLAKD+ SLDCFATEFLIDGSTLSL+VSDDQKN+
Sbjct: 1253 KNFILIGDVHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDDQKNI 1312

Query: 849  QIFYYAPKQSESWKGQKLLSRAEFHVGAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRF 670
            QIFYYAPK SESWKGQ+LLSRAEFHVGAHVTKF RLQML               DKTNRF
Sbjct: 1313 QIFYYAPKMSESWKGQRLLSRAEFHVGAHVTKFLRLQML--PTSTDRTGSTPGSDKTNRF 1370

Query: 669  ALLFATLDGSIGCIAPLDFLTFRRLQSLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPG 490
            ALLF  LDGSIGCIAPLD LTFRRLQSLQ+KLVDAV HVAGLNPRSFRQF S+GKAH+PG
Sbjct: 1371 ALLFGALDGSIGCIAPLDELTFRRLQSLQKKLVDAVPHVAGLNPRSFRQFCSNGKAHRPG 1430

Query: 489  PDNIVDCELLCHYEMLPLEEQLEIAHQIGTTRSQIISNLNDLSLGTSFL 343
            PD+IVDCELLCHYEMLPLEEQLEIAH IGTTRSQI+SNLNDL LGTSFL
Sbjct: 1431 PDSIVDCELLCHYEMLPLEEQLEIAHLIGTTRSQILSNLNDLFLGTSFL 1479


>ref|XP_004308159.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Fragaria vesca subsp. vesca]
          Length = 1439

 Score = 2302 bits (5965), Expect = 0.0
 Identities = 1147/1461 (78%), Positives = 1285/1461 (87%), Gaps = 5/1461 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPA-RRGIGPVPNLI 4534
            MSYAA+KMMHWPTGIENCA+GFITH  ADF P+IP +QTDDL+SDWPA RR IGPVPNL+
Sbjct: 1    MSYAAHKMMHWPTGIENCAAGFITHSRADFVPRIPQIQTDDLDSDWPAPRREIGPVPNLV 60

Query: 4533 ITAGNVLEVYIIRVQEEDV-RDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357
            +TA NVLEVY++RVQE+D  R SR SGE+KRGG++ G++GA+LELVCHYRLHGNV TMAV
Sbjct: 61   VTAANVLEVYVVRVQEQDTARGSRASGESKRGGLMDGVAGASLELVCHYRLHGNVMTMAV 120

Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177
            LS GGGDG +RRD+IIL F+DAKISVLEFDDS+HGLRTSSMH FEGPEWLHL+RGRESFA
Sbjct: 121  LSSGGGDGSKRRDAIILTFEDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLRRGRESFA 180

Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997
            RGP VK DPQGRC GVLVY LQ+IILK AQ G GLVGDDD   SG A+SARVESSYIISL
Sbjct: 181  RGPSVKVDPQGRCGGVLVYDLQLIILKAAQGGYGLVGDDDGFASGAAISARVESSYIISL 240

Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817
            RD+ MKHVKDF FVHGYIEPV+VILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI
Sbjct: 241  RDMDMKHVKDFTFVHGYIEPVLVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 300

Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637
            WSAINLPHDAYKLLAVPSPIGGVLVISAN+IHYHSQSASCALA+N++A + DSSQ+MPRS
Sbjct: 301  WSAINLPHDAYKLLAVPSPIGGVLVISANSIHYHSQSASCALALNSYAGSVDSSQEMPRS 360

Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457
            S +VELD ANA+WLSNDV +LSTKTGELLLLTLVYDGRVVHRLDL+KS+ASVLTSGI T+
Sbjct: 361  SFTVELDAANASWLSNDVILLSTKTGELLLLTLVYDGRVVHRLDLSKSKASVLTSGIATV 420

Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286
            G+SLFFLGSRLGDSLLVQ+T GVGAS   + +K+EVGDIEGDAPSAKRLR SSSDALQD+
Sbjct: 421  GNSLFFLGSRLGDSLLVQFTSGVGASMLSADLKDEVGDIEGDAPSAKRLRMSSSDALQDM 480

Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106
            ++GEELSLYGSA NNAESAQ++FSFAVRDS++NVGPLKDFSYGLRINAD NA GIAKQSN
Sbjct: 481  ISGEELSLYGSAQNNAESAQRSFSFAVRDSLVNVGPLKDFSYGLRINADANATGIAKQSN 540

Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926
            YELVCCSGHGKNGALCVL+QSI PE+ITEV L GC+GIWTVYHKN RGHNA+S      D
Sbjct: 541  YELVCCSGHGKNGALCVLRQSIRPEMITEVALPGCKGIWTVYHKNARGHNAES-----YD 595

Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746
            DEYHA+LIISLE+RTMVLET D L EVT+ VDY++QG TIAAGNLFGRRRVVQ++ RG+R
Sbjct: 596  DEYHAFLIISLEARTMVLETADHLSEVTDKVDYFLQGRTIAAGNLFGRRRVVQIYERGAR 655

Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566
            IL+G +MTQDLS G  N+ESG GSES+TV   SI DPYVLLRM+DG I+LLVGDPS+CTV
Sbjct: 656  ILEGYYMTQDLSFGASNSESGSGSESATVLSVSIVDPYVLLRMSDGGIRLLVGDPSSCTV 715

Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386
            S++ P+ FESSKKL+S CTLYHD+GPEPWLRK+STDAWLSTG+ EAIDG     HDQGD+
Sbjct: 716  SVSNPAAFESSKKLVSACTLYHDEGPEPWLRKSSTDAWLSTGIDEAIDGV---LHDQGDV 772

Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206
            YCV+CYESG+LEIFDVPNF+CVFSV  F+SGKP LVDT + +P K      + +SEEV+G
Sbjct: 773  YCVICYESGSLEIFDVPNFNCVFSVEKFVSGKPLLVDTFMGDPQK------SQSSEEVSG 826

Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
             +RKE  +NM+VVE+TMQRWSG HS PFLFGIL DG I CYHAYL+E  ++TSK E + S
Sbjct: 827  LSRKEKLQNMRVVELTMQRWSGQHSRPFLFGILNDGMIFCYHAYLYESMDSTSKTEVSAS 886

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
             QN      T+ASRLRNLRFVRV LDTY+R +   G + QRMTVFKN+ G QGLFL+GSR
Sbjct: 887  SQN------TTASRLRNLRFVRVPLDTYSRNDLSNGTSCQRMTVFKNIAGNQGLFLAGSR 940

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            PAW MV RER+RVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTSEG++KICQLPS+TSYDNY
Sbjct: 941  PAWLMVFRERIRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSEGIMKICQLPSITSYDNY 1000

Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486
            WPVQKIPLKGTPHQVTYFAEKNLYPLIVS+ V KPLNQV+SSLVDQE SHQ+EN NLS  
Sbjct: 1001 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSIPVQKPLNQVLSSLVDQEFSHQVENHNLSPE 1060

Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306
            +LH+TY VDEFEVRI+EPEKSGGPWQTRATIPMQ+SENALTVRVVTL+N TTKENETLLA
Sbjct: 1061 ELHRTYTVDEFEVRIMEPEKSGGPWQTRATIPMQTSENALTVRVVTLFNTTTKENETLLA 1120

Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126
            IGTAY+QGEDVAGRGRVLLFS   N DN QNLVSEVFSKE KGAISALASLQG+LL+ASG
Sbjct: 1121 IGTAYVQGEDVAGRGRVLLFSAENNVDNPQNLVSEVFSKELKGAISALASLQGNLLIASG 1180

Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946
            PKI L+KW  ++LT +AFFDVPPL+VVSLNIVKNFIL+GDIHKSIYFLSWKEQG QL+LL
Sbjct: 1181 PKIILHKWTGSDLTGIAFFDVPPLYVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLNLL 1240

Query: 945  AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766
            AKD+ +LDCFATEFLIDGSTLSL V+D QKN+QI YYAPK SESW+GQKLL+RAEFHVGA
Sbjct: 1241 AKDFGNLDCFATEFLIDGSTLSLAVADAQKNIQILYYAPKISESWRGQKLLTRAEFHVGA 1300

Query: 765  HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586
            HVTKF RLQML               DKT R+ALLF TLDG IG IAPL+ LTFRRLQSL
Sbjct: 1301 HVTKFLRLQML--STSSDRTGKNPGSDKTVRYALLFGTLDGGIGSIAPLEELTFRRLQSL 1358

Query: 585  QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406
            Q KLVDAV HVAGLNPRSFRQF+S+GKAH+PGPD+IVDCELL HYEML LEEQLEIA QI
Sbjct: 1359 QNKLVDAVPHVAGLNPRSFRQFRSNGKAHRPGPDSIVDCELLFHYEMLSLEEQLEIAQQI 1418

Query: 405  GTTRSQIISNLNDLSLGTSFL 343
            GTTR QI+SNL+DLSLGTSFL
Sbjct: 1419 GTTRLQILSNLDDLSLGTSFL 1439


>ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 2268 bits (5876), Expect = 0.0
 Identities = 1123/1461 (76%), Positives = 1277/1461 (87%), Gaps = 5/1461 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPAR--RGIGPVPNL 4537
            MS+AAYKMM  PTGI+NCA+GF+TH  +DF P    LQ DDL+++WP+R    +G +PNL
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56

Query: 4536 IITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAV 4357
            ++TA NVLEVY +R+QE+  +  + + +++RG ++ GI+GA+LELVCHYRLHGNVETMAV
Sbjct: 57   VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114

Query: 4356 LSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFA 4177
            LSIGGGD  RRRDSI+L F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE FA
Sbjct: 115  LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174

Query: 4176 RGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISL 3997
            RGP+VK DPQGRC GVL+Y LQMIILK  QAGSGLVG+DDAL S GAV+AR+ESSY+I+L
Sbjct: 175  RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234

Query: 3996 RDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLI 3817
            RDL M+HVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLI
Sbjct: 235  RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294

Query: 3816 WSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRS 3637
            WSA+NLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALA+N++AV  DSSQ++PRS
Sbjct: 295  WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354

Query: 3636 SISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTI 3457
            S +VELD ANA WL +DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVL+SGITTI
Sbjct: 355  SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414

Query: 3456 GSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDI 3286
            G+SLFFL SRLGDS+LVQ++CG G    SS +KEEVGDIE DAPS KRLRRS SDALQD+
Sbjct: 415  GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473

Query: 3285 VNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSN 3106
            V+GEELSLYGSAPN  ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSN
Sbjct: 474  VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533

Query: 3105 YELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVD 2926
            YELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HNADSSKM+  D
Sbjct: 534  YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593

Query: 2925 DEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSR 2746
            DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGR RV+QV+ RG+R
Sbjct: 594  DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653

Query: 2745 ILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTV 2566
            ILDGSFMTQD+S G  N ESG  S+S+     SIADP+VLLRM+DGSI+LL+GDPSTCT+
Sbjct: 654  ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713

Query: 2565 SINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDI 2386
            S+  P+ FESSK  +S CTLYHDKGPEPWLRK STDAWLSTGVGE IDG DGA  D GDI
Sbjct: 714  SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773

Query: 2385 YCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAG 2206
            YCVVC+++G LEIFDVPNF+CVFSV NFMSGK +LVD  ++E  KD K     + + V  
Sbjct: 774  YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830

Query: 2205 QARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
            Q RKEN  +MKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P++TSK+E++ S
Sbjct: 831  QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
               S+ L+ST+ SRLRNLRFVRV LD Y RE+T  G   Q++T+FKN+G Y+G FLSGSR
Sbjct: 891  AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            PAW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCN GLIYVTS+G+LKICQLPS ++YD+Y
Sbjct: 951  PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010

Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486
            WPVQKIPLK TPHQVTYFAEKNLYPLIVS  VLKPLNQV+ SLVDQ+ +HQ E+ N++ +
Sbjct: 1011 WPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDINHQNESQNMNPD 1069

Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306
            + ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLLA
Sbjct: 1070 EQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLLA 1129

Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126
            IGTAY+QGEDVA RGR+LLFS+G+NTDN Q LVSEV+SKE KGAISALASLQGHLL+ASG
Sbjct: 1130 IGTAYVQGEDVAARGRILLFSLGKNTDNPQTLVSEVYSKELKGAISALASLQGHLLIASG 1189

Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946
            PKI L+KWN TEL  +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSLL
Sbjct: 1190 PKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLL 1249

Query: 945  AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766
            AKD+ SLDCFATEFLIDGSTLSLMVSDD +N+QIFYYAPK SESWKGQKLLSRAEFHVGA
Sbjct: 1250 AKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGA 1309

Query: 765  HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586
            HVTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD +TFRRLQSL
Sbjct: 1310 HVTKFLRLQML---STSDRAGAVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSL 1366

Query: 585  QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406
            QRKLVDAV HVAGLNPR+FR F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIAHQ+
Sbjct: 1367 QRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQV 1426

Query: 405  GTTRSQIISNLNDLSLGTSFL 343
            GTTRSQI+SNL+DLSLGTSFL
Sbjct: 1427 GTTRSQILSNLSDLSLGTSFL 1447


>ref|XP_007152397.1| hypothetical protein PHAVU_004G126600g [Phaseolus vulgaris]
            gi|561025706|gb|ESW24391.1| hypothetical protein
            PHAVU_004G126600g [Phaseolus vulgaris]
          Length = 1445

 Score = 2264 bits (5867), Expect = 0.0
 Identities = 1117/1460 (76%), Positives = 1277/1460 (87%), Gaps = 4/1460 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPAR-RGIGPVPNLI 4534
            MS+AAYKMM   TGI+NCA+GF+TH  AD  P    LQ +DL+++WP+R R +GP+PNL+
Sbjct: 1    MSFAAYKMMQCSTGIDNCAAGFLTHSRADSVP----LQPEDLDAEWPSRPRRVGPLPNLV 56

Query: 4533 ITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVL 4354
            +TA NVLEVY +R+QE+    +    + +RG ++ GI GA+LELVCHYRLHGNVETMAVL
Sbjct: 57   VTAANVLEVYTVRIQEDQPPKA---ADPRRGTLLDGIDGASLELVCHYRLHGNVETMAVL 113

Query: 4353 SIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFAR 4174
            SIGGGD  R+RDSIIL F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE FAR
Sbjct: 114  SIGGGDASRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFAR 173

Query: 4173 GPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLR 3994
            GP+VK DPQGRC G L+Y LQMIILK  QAGSGLVGDDDAL   GAV+AR+ESSY+I+LR
Sbjct: 174  GPVVKVDPQGRCGGTLIYDLQMIILKATQAGSGLVGDDDALGFSGAVAARIESSYMINLR 233

Query: 3993 DLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIW 3814
            DL M+HVKDF FVHGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHPLIW
Sbjct: 234  DLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 293

Query: 3813 SAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSS 3634
            SA+NLPHDAYKLLAVPSPIGGVLVI ANT+HYHSQSASCALA+N++AV+ D+SQ++PRSS
Sbjct: 294  SAVNLPHDAYKLLAVPSPIGGVLVIGANTVHYHSQSASCALALNSYAVSLDNSQEIPRSS 353

Query: 3633 ISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIG 3454
             +VELD+ANA WL +DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVL+SGITTIG
Sbjct: 354  FNVELDSANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTIG 413

Query: 3453 SSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQDIV 3283
            +SLFFL SRLGDS+LVQ++CG G    SS +KEEVGDIE DAPS KRLRRS SD LQD+V
Sbjct: 414  NSLFFLASRLGDSMLVQFSCGSGGSMLSSNLKEEVGDIEADAPS-KRLRRSPSDTLQDVV 472

Query: 3282 NGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNY 3103
            +GEELSLYGSAPN  ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQSNY
Sbjct: 473  SGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNY 532

Query: 3102 ELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDD 2923
            ELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HN DSSK++  DD
Sbjct: 533  ELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNTDSSKLADDDD 592

Query: 2922 EYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRI 2743
            EYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG+RI
Sbjct: 593  EYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGARI 652

Query: 2742 LDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVS 2563
            LDGSFMTQD++ G  N+ES   SES+     SIADP+VLLRM+DGS++LL+GDP TCT+S
Sbjct: 653  LDGSFMTQDVTFGASNSESASASESAIALSVSIADPFVLLRMSDGSVRLLIGDPITCTIS 712

Query: 2562 INIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIY 2383
            +  P+ FES+K  +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA  D GDIY
Sbjct: 713  VTSPASFESTKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGDIY 772

Query: 2382 CVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQ 2203
            CVVC+++G LEIFDVPNF+CVFSVGNFMSGK +LVD  ++E  KD K     + + V  Q
Sbjct: 773  CVVCFDNGNLEIFDVPNFNCVFSVGNFMSGKSHLVDALMKEVLKDSK---KGDRDGVIIQ 829

Query: 2202 ARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAISG 2023
             RKEN  +MKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P+ TSK+E++ S 
Sbjct: 830  GRKENVPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSASA 889

Query: 2022 QNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSRP 1843
              S+ L +T+ SRLRNLRFVRVSLD Y REET  G   Q++T+FKN+G YQG FLSGSRP
Sbjct: 890  GGSIGLGTTNISRLRNLRFVRVSLDAYAREETSNGSLHQQITIFKNIGSYQGFFLSGSRP 949

Query: 1842 AWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNYW 1663
            AW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD+YW
Sbjct: 950  AWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDSYW 1009

Query: 1662 PVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSND 1483
            PVQKIPLK TPHQVTYFAEKNLYPLIVS  VLKPL+QV+ SLVDQ+ +HQ E+ N++S++
Sbjct: 1010 PVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLSQVI-SLVDQDVNHQNESQNMNSDE 1068

Query: 1482 LHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLAI 1303
             ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLLAI
Sbjct: 1069 QNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLLAI 1128

Query: 1302 GTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASGP 1123
            GTAY+QGEDVA RGR+LLFS+G+NTDN Q+LVSEV+SKE KGAISALASLQGHLL+ASGP
Sbjct: 1129 GTAYVQGEDVAARGRILLFSLGKNTDNPQSLVSEVYSKELKGAISALASLQGHLLIASGP 1188

Query: 1122 KITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLLA 943
            KI L+KWN TEL  +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSLLA
Sbjct: 1189 KIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSLLA 1248

Query: 942  KDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGAH 763
            KD++SLDCFATEFLIDGSTLSLMVSDD++N+QIFYYAPK SESWKGQKLLSRAEFHVGAH
Sbjct: 1249 KDFSSLDCFATEFLIDGSTLSLMVSDDKRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAH 1308

Query: 762  VTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSLQ 583
            VTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD +TFRRLQSLQ
Sbjct: 1309 VTKFLRLQML---PTSDRAGSAPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQ 1365

Query: 582  RKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQIG 403
            +KLVDAV HVAGLNPR+FR+F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIAHQ+G
Sbjct: 1366 KKLVDAVAHVAGLNPRAFRKFQSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIAHQVG 1425

Query: 402  TTRSQIISNLNDLSLGTSFL 343
            TTRSQI+SNL+DLSLGTSFL
Sbjct: 1426 TTRSQILSNLSDLSLGTSFL 1445


>ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform X1 [Glycine max]
          Length = 1449

 Score = 2256 bits (5846), Expect = 0.0
 Identities = 1123/1462 (76%), Positives = 1272/1462 (87%), Gaps = 6/1462 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLES-DWPAR--RGIGPVPN 4540
            MS+AAYKMM  PTGI+NCA+GF+TH  +DF P    LQ DDL++ +WP+R    +GP+PN
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56

Query: 4539 LIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360
            L++TA NVLEVY +R+QE D +    S +++RG ++ GI+GA+LEL CHYRLHGNVETMA
Sbjct: 57   LVVTAANVLEVYAVRLQE-DQQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115

Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180
            VLSIGGGD  R+RDSIIL F DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE F
Sbjct: 116  VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175

Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000
            ARGP+VK DPQGRC GVL+Y LQMIILK  Q GSGLVGDDDA  S GAV+AR+ESSY+I+
Sbjct: 176  ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235

Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820
            LRDL M+HVKDF FV+GYIEPVMVILHERELTW+GR+SW HHTC ISALSISTTLKQHPL
Sbjct: 236  LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295

Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640
            IWSA+NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQSASCALA+NN+AV  DSSQ++PR
Sbjct: 296  IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355

Query: 3639 SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460
            SS +VELD ANA WL +DVA+LSTKTGELLLL LVYDGRVV RLDL+KS+ASVL+SGITT
Sbjct: 356  SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415

Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289
            IG+SLFFL SRLGDS+LVQ++CG G    SS +KEEVGDIE DAPS KRLRRS SDALQD
Sbjct: 416  IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474

Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109
            +V+GEELSLYGSA N  ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQS
Sbjct: 475  MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534

Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929
            NYELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR HNADSSKM+  
Sbjct: 535  NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594

Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749
            DDEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG+
Sbjct: 595  DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654

Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569
            RILDGSFMTQD+S G  N+ESG  SES+     SIADP+VLLRM+DGSI+LL+GDPSTCT
Sbjct: 655  RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714

Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389
            +S+  P+ FESSK  +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA  D GD
Sbjct: 715  ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774

Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209
            IYCVVC+++G LEIFD+PNF+CVFSV NFMSGK +LVD  ++E  KD K     + + V 
Sbjct: 775  IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831

Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029
             Q RK+N  NMKVVE+ MQRWSG HS PFLFGIL+DGTILCYHAYL+E P+ TSK+E++ 
Sbjct: 832  NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891

Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849
            S   S+ L+ST+ SRLRNLRFVRV LD Y RE+T  G   Q++T+FKN+G YQG FLSGS
Sbjct: 892  SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951

Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669
            RPAW MV+RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD+
Sbjct: 952  RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011

Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489
            YWPVQKIPLK TPHQVTYFAEKNLYPLIVS  VLKPLNQV+ SLVDQ+ +HQ E+ N++ 
Sbjct: 1012 YWPVQKIPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVI-SLVDQDFNHQNESQNMNP 1070

Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309
            ++ ++ Y +DEFEVRI+EPEKSGGPWQT+ATIPMQSSENALTVR+VTL N T+KENETLL
Sbjct: 1071 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETLL 1130

Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129
            AIGTAY+QGEDVA RGR+LLFS+G+ TDN Q LVSEV+SKE KGAISALASLQGHLL+AS
Sbjct: 1131 AIGTAYVQGEDVAARGRILLFSLGKITDNPQTLVSEVYSKELKGAISALASLQGHLLIAS 1190

Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949
            GPKI L+KWN TEL  +AFFD PPLHVVSLNIVKNFIL+GDIHKSIYFLSWKEQG QLSL
Sbjct: 1191 GPKIILHKWNGTELNGIAFFDAPPLHVVSLNIVKNFILIGDIHKSIYFLSWKEQGAQLSL 1250

Query: 948  LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769
            LAKD+ SLDCFATEFLIDGSTLSLMVSDD +N+QIFYYAPK SESWKGQKLLSRAEFHVG
Sbjct: 1251 LAKDFGSLDCFATEFLIDGSTLSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVG 1310

Query: 768  AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589
            AHVTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD +TFRRLQS
Sbjct: 1311 AHVTKFLRLQML---STSDRAGSVPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQS 1367

Query: 588  LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409
            LQRKLVDAV HVAGLNPR+FR F+S+GKAH+PGPD+IVDCELLCHYEMLPLEEQLEIA+Q
Sbjct: 1368 LQRKLVDAVPHVAGLNPRAFRLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQ 1427

Query: 408  IGTTRSQIISNLNDLSLGTSFL 343
            IGTTRSQI+SNL+DLSLGTSFL
Sbjct: 1428 IGTTRSQILSNLSDLSLGTSFL 1449


>ref|XP_002318462.2| cleavage and polyadenylation specificity factor family protein
            [Populus trichocarpa] gi|550326263|gb|EEE96682.2|
            cleavage and polyadenylation specificity factor family
            protein [Populus trichocarpa]
          Length = 1455

 Score = 2255 bits (5844), Expect = 0.0
 Identities = 1135/1468 (77%), Positives = 1268/1468 (86%), Gaps = 12/1468 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARR----GIGPVP 4543
            MSYAAYKMMHWPT I+ C SGF+TH  ++ A  +P L TDDL+SDWP+RR    GIGP P
Sbjct: 1    MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59

Query: 4542 NLIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETM 4363
            NLI+ +GNVLE+Y++RVQEE    +R SGE KRGGV+ G++GA+LELVCHYRLHGNVE+M
Sbjct: 60   NLIVASGNVLELYVVRVQEEG---ARSSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116

Query: 4362 AVLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRES 4183
             VLS+ GGD  RRRDSIILAF+DAKISVLEFDDS+HGLRTSSMH FEGP+W HLKRGRES
Sbjct: 117  GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176

Query: 4182 FARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYII 4003
            FARGPLVK DPQGRC GVLVY LQMIILK AQAGS LV D+DA  SG A+SA + SSYII
Sbjct: 177  FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236

Query: 4002 SLRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHP 3823
            +LRDL MKHVKDFIFVH YIEPV+V+LHERELTW+GR+ WKHHTC ISALSISTTLKQ  
Sbjct: 237  NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296

Query: 3822 LIWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMP 3643
            LIWS  NLPHDAYKLLAVPSPIGGVLVI  NTIHYHS+SASCALA+N++A + DSSQ++P
Sbjct: 297  LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356

Query: 3642 RSSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGIT 3463
            R++ SVELD ANA WL  DVA+LSTKTGELLLLTLVYDGRVV RLDL+KS+ASVLTS IT
Sbjct: 357  RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416

Query: 3462 TIGSSLFFLGSRLGDSLLVQYTCGVGA---SSGVKEEVGDIEGDAPSAKRLRRSSSDALQ 3292
            T+G+S FFLGSRLGDSLLVQ+T G+G+   S G+KEEVGDIEGD PSAKRL+ SSSDALQ
Sbjct: 417  TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476

Query: 3291 DIVNGEELSLYGSAPNNAESAQ-----KNFSFAVRDSVINVGPLKDFSYGLRINADPNAA 3127
            D+V+GEELSLY SAPNNAES+Q     K FSF VRDS+INVGPLKDF+YGLRINAD NA 
Sbjct: 477  DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536

Query: 3126 GIAKQSNYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADS 2947
            GI+KQSNYELVCCSGHGKNGALCVLQQSI PE+ITEVEL GC+GIWTVYHKN R H+ DS
Sbjct: 537  GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARSHSVDS 596

Query: 2946 SKMSAVDDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQ 2767
             KM A DDEYHAYLIIS+E+RTMVLET D L EVTESVDY+VQG TIAAGNLFGRRRVVQ
Sbjct: 597  LKM-ASDDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655

Query: 2766 VFARGSRILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVG 2587
            VF RG+RILDGSFMTQDLS G  N+E+G  SESSTV   SI DPYVL+RM DGSIQ+LVG
Sbjct: 656  VFERGARILDGSFMTQDLSFGGSNSETG-RSESSTVMHVSIVDPYVLVRMADGSIQILVG 714

Query: 2586 DPSTCTVSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGA 2407
            DPS CTVS+N PS F+SS K +S CTLYHDKGPEPWLRK STDAWLSTG+ EAIDGAD  
Sbjct: 715  DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774

Query: 2406 PHDQGDIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNT 2227
             H+QGDIYCVVCYE+G LEIFDVPNF+ VF V  F+SGK +L+DT   EP+KD       
Sbjct: 775  AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDMMKGV-- 832

Query: 2226 NSEEVAGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTS 2047
              EEVAG  RKE+T+NMKVVE+TM RWSG HS PFLFGILTDGTILCYHAYLFEGP+ TS
Sbjct: 833  -KEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891

Query: 2046 KMEEAISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQG 1867
            K+E+++S QNSV  ++ SASRLRNLRFVRV LDTYTREET +  + QR+T FKN+ GYQG
Sbjct: 892  KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951

Query: 1866 LFLSGSRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPS 1687
             FLSGSRPAWFMV RERLRVHPQ+CDGSIVAFTVLH VNCNHGLIYVTS+G LKIC L S
Sbjct: 952  FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011

Query: 1686 VTSYDNYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIE 1507
            V+SYDNYWPVQKIPLKGTPHQVTYFAE+NLYPLIVSV V KP+NQV+SSLVDQE  HQIE
Sbjct: 1012 VSSYDNYWPVQKIPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQIE 1071

Query: 1506 NDNLSSNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTK 1327
            N NLSS ++H+TY VDEFEVRILEP  S GPWQ +ATIPMQ+SENALTVR+V+L+N +TK
Sbjct: 1072 NHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTSTK 1129

Query: 1326 ENETLLAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQG 1147
            ENETLLA+GTAY+QGEDVA RGR+LLFSV +N +N+Q LVSEV+SKE KGAISALASLQG
Sbjct: 1130 ENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILVSEVYSKELKGAISALASLQG 1189

Query: 1146 HLLVASGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQ 967
            HLL+ASGPKI L+KW  TELT VAF D PPL+VVSLNIVKNFILLGDIHKSIYFLSWKEQ
Sbjct: 1190 HLLIASGPKIILHKWTGTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQ 1249

Query: 966  GCQLSLLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSR 787
            G QLSLLAKD+ASLDCF+TEFLIDGSTLSL+VSD+QKNVQIFYYAPK SESWKGQKLLSR
Sbjct: 1250 GAQLSLLAKDFASLDCFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQKLLSR 1309

Query: 786  AEFHVGAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLT 607
            AEFHVGA VTKF RLQML               DKTNRFALLF TLDGSIGCIAPLD LT
Sbjct: 1310 AEFHVGALVTKFMRLQML--SPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELT 1367

Query: 606  FRRLQSLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQ 427
            FRRLQSLQ+KLVDAV HVAGLNP+SFRQF+S GKAH+PGP++IVDCE+L +YEM+PLEEQ
Sbjct: 1368 FRRLQSLQKKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQ 1427

Query: 426  LEIAHQIGTTRSQIISNLNDLSLGTSFL 343
            +EIA QIGTTR+QI+SNLNDL+LGTSFL
Sbjct: 1428 VEIAQQIGTTRAQILSNLNDLTLGTSFL 1455


>ref|XP_004514987.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cicer arietinum]
          Length = 1447

 Score = 2222 bits (5759), Expect = 0.0
 Identities = 1102/1463 (75%), Positives = 1261/1463 (86%), Gaps = 7/1463 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQ---TDDLESDW-PARRGIGPVP 4543
            MS+AAYKMM WPTGI+NCASGF+TH  +D  P+IP +Q    DD++SDW P  R + P+P
Sbjct: 1    MSFAAYKMMQWPTGIQNCASGFLTHSRSDSTPRIPPIQHNDDDDIDSDWVPQPRDLAPLP 60

Query: 4542 NLIITAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETM 4363
            NL+ITA N+LEVY +R+Q++  + S          V+ G++GA+LELVCHYRLHGNVE++
Sbjct: 61   NLVITAANILEVYTVRIQQDPPKSSADPR------VLDGLAGASLELVCHYRLHGNVESV 114

Query: 4362 AVLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRES 4183
            AVLS+GGGD  RRRDSIIL F+DAKISVLE+DDS+HGLRTSS+H FEGPEWLHLKRGRE 
Sbjct: 115  AVLSVGGGDASRRRDSIILTFKDAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREH 174

Query: 4182 FARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYII 4003
            FARGP+ K DPQGRC GVLVY LQMIILKT QAGSGLVG+DD L SGGAV+AR+ESSY+I
Sbjct: 175  FARGPVAKVDPQGRCGGVLVYDLQMIILKTTQAGSGLVGEDDVLGSGGAVAARIESSYMI 234

Query: 4002 SLRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHP 3823
            +LRDL M+HVKDF F+HGYIEPVMVILHERELTW+GR+SWKHHTC ISALSISTTLKQHP
Sbjct: 235  NLRDLDMRHVKDFTFLHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 294

Query: 3822 LIWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMP 3643
            LIWSA+NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQSASCALA+N++AV+ D+SQ+MP
Sbjct: 295  LIWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNSYAVSVDNSQEMP 354

Query: 3642 RSSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGIT 3463
            RSS +VELD ANA WL NDVA+LSTKTGELLLLTL+YDGRVV RLDL+KS+ASVL+SG+T
Sbjct: 355  RSSFNVELDAANATWLLNDVALLSTKTGELLLLTLIYDGRVVQRLDLSKSKASVLSSGVT 414

Query: 3462 TIGSSLFFLGSRLGDSLLVQYTCGVGAS---SGVKEEVGDIEGDAPSAKRLRRSSSDALQ 3292
            TIG+SLFFL SRLGDS+LVQ++ G G S   S +KEEVGD + DA SAKR+RRS SD LQ
Sbjct: 415  TIGNSLFFLASRLGDSMLVQFSSGSGVSMLSSNLKEEVGDFDVDASSAKRMRRSPSDTLQ 474

Query: 3291 DIVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQ 3112
            D+V+GEELSLYGSA N  ESAQK+FSFAVRDS+INVGPLKDFSYGLRINAD NA GIAKQ
Sbjct: 475  DMVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQ 534

Query: 3111 SNYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSA 2932
            SNYELVCCSGHGKNG+LCVL+QSI PE+ITEVEL GC+GIWTVYHK+TR  NADSSK++ 
Sbjct: 535  SNYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSLNADSSKLAD 594

Query: 2931 VDDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARG 2752
             +DEYHAYLIISLESRTMVLET D+L EVTESVDYYVQG T+AAGNLFGRRRV+QV+ RG
Sbjct: 595  DEDEYHAYLIISLESRTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERG 654

Query: 2751 SRILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTC 2572
            +RILDGSFMTQD+S G  N+E+  GSES+     SIADPYVLL+M+DGS++LLVGDPSTC
Sbjct: 655  ARILDGSFMTQDVSFGASNSEANYGSESALALSVSIADPYVLLKMSDGSVRLLVGDPSTC 714

Query: 2571 TVSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQG 2392
            T+S+  P+ FESSK  +S CTLYHDKGPEPWLRK STDAWLSTGVGEAIDG DGA  D G
Sbjct: 715  TISVTSPASFESSKGSVSTCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHG 774

Query: 2391 DIYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEV 2212
            DIYCVVCYE+ +LEIFDVPNFSCVFSV NF+SGK +LVD   +E  KD +      S+ V
Sbjct: 775  DIYCVVCYENDSLEIFDVPNFSCVFSVENFLSGKSHLVDALTKEVPKDSQKGDKV-SDGV 833

Query: 2211 AGQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEA 2032
              Q RK+   NMKVVE+ MQRWSG H  PFLFGIL+DGT LCYHAYL+E P+ TSK+E++
Sbjct: 834  VSQGRKD-ALNMKVVELAMQRWSGKHGRPFLFGILSDGTTLCYHAYLYESPDGTSKVEDS 892

Query: 2031 ISGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSG 1852
            +    S  L+++S SRLRNLRFVRV LD + REET  G   Q++ +FKN+G Y+G FLSG
Sbjct: 893  V----SAGLSNSSVSRLRNLRFVRVPLDVHAREETSNGPPCQQINIFKNIGSYEGFFLSG 948

Query: 1851 SRPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYD 1672
            SRPAW M++RERLRVHPQ+CDGSIVAFTVLHNVNCNHGLIYVTS+G+LKICQLPS ++YD
Sbjct: 949  SRPAWVMLLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYD 1008

Query: 1671 NYWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLS 1492
             YWPVQK+PLK TPHQVTYFAEKNLYPLIVS  V KPLNQV+ +LVDQ+A+   E+ NL+
Sbjct: 1009 CYWPVQKVPLKATPHQVTYFAEKNLYPLIVSYPVPKPLNQVI-ALVDQDANQLTESQNLN 1067

Query: 1491 SNDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETL 1312
            +++    Y ++EFEVRI+EPEKSGGPWQ +ATIPMQSSENALTVR+VTL N ++KENETL
Sbjct: 1068 NDEQSHLYTIEEFEVRIMEPEKSGGPWQLKATIPMQSSENALTVRMVTLMNTSSKENETL 1127

Query: 1311 LAIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVA 1132
            LAIGTAY+QGEDVA RGR+LLFS+G+NTDN QNLVSEV+SKE KGAISALA+LQGHLLVA
Sbjct: 1128 LAIGTAYVQGEDVAARGRILLFSLGKNTDNPQNLVSEVYSKELKGAISALAALQGHLLVA 1187

Query: 1131 SGPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLS 952
            SGPKI L+KW  TEL  VAFFDVPPLHVVSLNIVKNFIL+GD+HKSIYFLSWKEQG QLS
Sbjct: 1188 SGPKIILHKWTGTELNGVAFFDVPPLHVVSLNIVKNFILIGDVHKSIYFLSWKEQGAQLS 1247

Query: 951  LLAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHV 772
            LLAKD+ SLDCFATEFLIDGSTLSLMVSD+QKN+QIFYYAPK SESWKGQKLLSRAEFHV
Sbjct: 1248 LLAKDFGSLDCFATEFLIDGSTLSLMVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHV 1307

Query: 771  GAHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQ 592
            GAH+TKF RLQML               DKTNRFALLF TLDGSIGCIAPLD +TFRRLQ
Sbjct: 1308 GAHITKFLRLQML---STSDKTGSGPGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQ 1364

Query: 591  SLQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAH 412
            SLQ+KLVDAV HVAGLNPR+FR F S+GKAH+PGPD+IVDCELLCHYEML LEEQLEIAH
Sbjct: 1365 SLQKKLVDAVPHVAGLNPRAFRLFHSNGKAHRPGPDSIVDCELLCHYEMLQLEEQLEIAH 1424

Query: 411  QIGTTRSQIISNLNDLSLGTSFL 343
            Q+GTTRSQI+SNL+DLSLGTSFL
Sbjct: 1425 QVGTTRSQILSNLSDLSLGTSFL 1447


>ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata] gi|297309955|gb|EFH40379.1| hypothetical protein
            ARALYDRAFT_495232 [Arabidopsis lyrata subsp. lyrata]
          Length = 1444

 Score = 2181 bits (5652), Expect = 0.0
 Identities = 1073/1462 (73%), Positives = 1255/1462 (85%), Gaps = 6/1462 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQ-TDDLESDWPA-RRGIGPVPNL 4537
            MS+AA+KMMHWPTG+ENCASG+ITH  +D   QIP +   DD+E++WP  +RGIGP+PN+
Sbjct: 1    MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60

Query: 4536 IITAGNVLEVYIIRVQEE-DVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360
            +ITAGN+LEVYI+R QEE + ++ R     KRGGV+ G+SG +LELVCHYRLHGNVE++A
Sbjct: 61   VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120

Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180
            VL +GGG+  + RDSIIL F+DAKISVLEFDDS+H LR +SMH FEGP+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000
             RGPLVK DPQGRC GVLVYGLQMIILK +Q GSGLVGDDDA +SGG VSARVESSYII+
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820
            LRDL MKHVKDF+F+HGYIEPV+VIL E E TW+GR+SWKHHTC +SALSI+TTLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300

Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640
            IWSAINLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+A +ADSSQ++P 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 3639 SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460
            S+ SVELD A+  W+S+DVA+LSTK+GELLLLTL+YDGR V RLDL+KS+ASVL S IT+
Sbjct: 361  SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289
            +G+SLFFLGSRLGDSLLVQ++C  G ++   G+++E  DIEG+   AKRLR  SSD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLR-ISSDTFQD 479

Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109
             +  EELSL+GS PNN++SAQK+FSFAVRDS++NVGP+KDF+YGLRINAD NA G++KQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929
            NYELVCCSGHGKNGALCVL+QS+ PE+ITEVEL GC+GIWTVYHK++RGHNADSSKM+A 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749
            +DEYHAYLIIS+E+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRV+QVF  G+
Sbjct: 600  EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569
            RILDGSFM Q+LS G PN+ES  GSESSTVS  SIADPYVLLRMTD SI+LLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389
            VSI+ PSV E SKK IS CTL+HDKGPEPWLRKASTDAWLS+GVGEA+D ADG P DQGD
Sbjct: 720  VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779

Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209
            IYCV+CYESG LEIFDVP F+CVFSV  F SG+ +L D  + E   +     N NSE+ A
Sbjct: 780  IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHELEYE----LNKNSEDNA 835

Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029
              +R E  +N KVVE++MQRWSGPH+ PFLF +L DGTILCYHAYLFEG ++T K E ++
Sbjct: 836  S-SRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893

Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849
            S +N   LNS+ +S+LRNL+F+R+  DT TRE T  G+ SQR+T+FKN+ G+QG FLSGS
Sbjct: 894  SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953

Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669
            RP W M+ RERLR H Q+CDGSI AFTVLHNVNCNHG IYVTS+ +LKICQLPS + YDN
Sbjct: 954  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013

Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489
            YWPVQKIPLK TPHQVTY+AEKNLYPLIVS  V KP+NQV+SSLVDQEA  QI+N NLSS
Sbjct: 1014 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLSS 1073

Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309
            +DL +TY V+EFE++ILEPE+SGGPW+T+ATIPMQSSE+ALTVRVVTL NA+T ENETLL
Sbjct: 1074 DDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETLL 1133

Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129
            A+GTAY+QGEDVA RGRVLLFS G+N DN+QN+V+EV+S+E KGAISA+AS+QGHLL++S
Sbjct: 1134 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1193

Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949
            GPKI L+KWN TEL  VAFFD PPL+VVS+N+VK FILLGD+HKSIYFLSWKEQG QLSL
Sbjct: 1194 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSL 1253

Query: 948  LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769
            LAKD+ SLDCFATEFLIDG+TLSL VSD+QKN+Q+FYYAPK +ESWKGQKLLSRAEFHVG
Sbjct: 1254 LAKDFGSLDCFATEFLIDGNTLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVG 1313

Query: 768  AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589
            +HVTKF RLQM+               DKTNRFALLF TLDGS GCIAPLD +TFRRLQS
Sbjct: 1314 SHVTKFLRLQMV-----------TSGADKTNRFALLFGTLDGSFGCIAPLDEVTFRRLQS 1362

Query: 588  LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409
            LQ+KLVDAV HVAGLNP SFRQF++ GKA + GPD+I+DCELLCHYEMLPLEEQLE+AHQ
Sbjct: 1363 LQKKLVDAVPHVAGLNPHSFRQFRTSGKARRSGPDSIIDCELLCHYEMLPLEEQLELAHQ 1422

Query: 408  IGTTRSQIISNLNDLSLGTSFL 343
            IGTTRS I+ NL +LS+GTSFL
Sbjct: 1423 IGTTRSVILLNLVELSVGTSFL 1444


>ref|XP_004234158.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum lycopersicum]
          Length = 1447

 Score = 2170 bits (5624), Expect = 0.0
 Identities = 1082/1461 (74%), Positives = 1254/1461 (85%), Gaps = 5/1461 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AA K MH PTGIENCASGFITH +AD  PQI   QT D++SDWPA + IGPVPNL++
Sbjct: 1    MSFAACKTMHCPTGIENCASGFITHSAADITPQI---QTADVDSDWPATKPIGPVPNLVV 57

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            +AGNVL+VY+IRV++   RD+  +   KRGG+VAGIS A+LELVC YRLHGN+ +M V++
Sbjct: 58   SAGNVLDVYLIRVEQASSRDA--AEVVKRGGLVAGISAASLELVCTYRLHGNIYSMGVIT 115

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG DG +RRDSIIL+F+DAK+SVLEFDD+ HGLRTSSMH FEGP+W HLKRGRESF +G
Sbjct: 116  AGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWFHLKRGRESFDKG 175

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            P++K DPQGRCAGV  +  QMI+LK A+  S L G+D A ++GGA SAR+ESSYII+LRD
Sbjct: 176  PIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGA-SARIESSYIITLRD 234

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L ++HVKDF F+HGYIEPVMVILHERELTWSGR+SWKHHTC +SA SISTTLKQHPLIWS
Sbjct: 235  LDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWS 294

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQS+SC+LA+NNF    D+SQ+MPRSSI
Sbjct: 295  ATNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFVFFGDNSQEMPRSSI 354

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            +VELD ANA WL++DVAMLSTKTGELLLLT++YDGR+V +LDL+KSRASVLTSGITTIG 
Sbjct: 355  NVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTIGD 414

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ++ G+G S+   GV+EEVGDIE DAPSAKRLR SSSDALQD++N
Sbjct: 415  SLFFLGSRLGDSLLVQFSSGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDMIN 474

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYG+APNNA+SAQK FSFAVRDS+INVGPLKDFSYG+RINAD NA GIAKQSNYE
Sbjct: 475  GEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSNYE 534

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNG+L VLQQSI PE IT+V L GC+GIWTVYHKNTR H ++SS+M+  +DE
Sbjct: 535  LVCCSGHGKNGSLSVLQQSIRPETITQVSLPGCKGIWTVYHKNTRIHLSESSRMADEEDE 594

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVL+T + L EVTE+VDYYVQG+T+AAGNLFGRRRV+QVFA G+RIL
Sbjct: 595  YHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGARIL 654

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DG+FMTQ+LS    N ESG  S++S V+  SIADPYVLLRMT+GS+QLLVGDPS+C+VS+
Sbjct: 655  DGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSL 714

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
             +PSVFESSKK IS CTLYHDKGPEPWLRK STDAWLS+G+GEAIDGADG   DQGD+YC
Sbjct: 715  TVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVIQDQGDVYC 774

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYE+GTLEIFDVP+F+CVFSV  F+SG+  LVDT +++ S +   A + N+E+V    
Sbjct: 775  VVCYENGTLEIFDVPSFTCVFSVDKFISGRTYLVDTFMQD-SVNGLHAHSKNTEDVIRPG 833

Query: 2199 RKENTENMK--VVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
            +KEN++++K  VVE+ M RW G HS PFLFGIL DGTIL YHAY+FEG EN+SK++ ++S
Sbjct: 834  QKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVDGSVS 893

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
             QNS++L+ST+ASRLRNLRFVRV +D Y REE P+G   QRM V+KN+GG QG+FL+GSR
Sbjct: 894  SQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGSQLQRMNVYKNIGGSQGIFLTGSR 953

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            P+WFMV RERLR+HPQ+CDG IVAFTVLHNVNCNHGLIYVT+ G LKICQLPS  SYDNY
Sbjct: 954  PSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDNY 1013

Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486
            WPVQKIPLKGTPHQV YFAEKN+Y +IVSV VLKPLNQV+SS+ DQE   Q + DNL   
Sbjct: 1014 WPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSSIADQEVGQQFDPDNL--- 1070

Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306
            +   +Y ++EFEVRILEPEKSGGPW+TRA+IPMQSSENALTVR+VTL+N  TKENETLLA
Sbjct: 1071 NYEGSYPIEEFEVRILEPEKSGGPWKTRASIPMQSSENALTVRMVTLFNTKTKENETLLA 1130

Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126
            +GTAY+QGEDVA RGRVLLFS+ R  DN++ LVSEV+SKE KGAI ALASLQGHLL+ASG
Sbjct: 1131 VGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIASG 1190

Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946
            PKI L+KW  +EL  VAF D PPLH VSLNIVKNFILLGDIHKSI F+SWKE   QLSLL
Sbjct: 1191 PKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKEP--QLSLL 1248

Query: 945  AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766
            AKD++ LDC ATEFLIDGSTLSL+VSDDQKNVQIFYYAPK SESWKGQKLLSRAEFHVG+
Sbjct: 1249 AKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVGS 1308

Query: 765  HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586
             +TKF RLQ+L               DKTNRFA +F TL+GS+GCIAPLD LTFRRLQSL
Sbjct: 1309 RITKFLRLQLL--PTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSL 1366

Query: 585  QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406
            Q+KLV AV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELL HYEMLPLEEQLEIA QI
Sbjct: 1367 QKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQI 1426

Query: 405  GTTRSQIISNLNDLSLGTSFL 343
            GTTR QI+SNLND+ LGTSFL
Sbjct: 1427 GTTRMQIMSNLNDMILGTSFL 1447


>ref|XP_006348057.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Solanum tuberosum]
          Length = 1447

 Score = 2170 bits (5622), Expect = 0.0
 Identities = 1078/1461 (73%), Positives = 1255/1461 (85%), Gaps = 5/1461 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQTDDLESDWPARRGIGPVPNLII 4531
            MS+AA K MH PTGIENCASGFITH +A+  PQI   +T D++SDWPA + +GP+PNL++
Sbjct: 1    MSFAACKTMHCPTGIENCASGFITHSAAEITPQI---RTADVDSDWPATKPVGPMPNLVV 57

Query: 4530 TAGNVLEVYIIRVQEEDVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMAVLS 4351
            +AGNVLEVY+IR+++   RD+  +   KRGG++AGIS A+LELVC YRLHGN+ +M V++
Sbjct: 58   SAGNVLEVYLIRIEQASSRDA--AEVVKRGGLMAGISAASLELVCTYRLHGNIYSMGVIT 115

Query: 4350 IGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESFARG 4171
             GG DG +RRDSIIL+F+DAK+SVLEFDD+ HGLRTSSMH FEGP+WLHLKRGRESF +G
Sbjct: 116  AGGADGGKRRDSIILSFEDAKMSVLEFDDATHGLRTSSMHFFEGPDWLHLKRGRESFDKG 175

Query: 4170 PLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIISLRD 3991
            P++K DPQGRCAGV  +  QMI+LK A+  S L G+D A ++GGA SAR+ESSYII+LRD
Sbjct: 176  PIIKVDPQGRCAGVFAFEQQMIVLKAAEVNSSLAGEDSAFSAGGA-SARIESSYIITLRD 234

Query: 3990 LGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPLIWS 3811
            L ++HVKDF F+HGYIEPVMVILHERELTWSGR+SWKHHTC +SA SISTTLKQHPLIWS
Sbjct: 235  LDVRHVKDFTFLHGYIEPVMVILHERELTWSGRVSWKHHTCMVSAFSISTTLKQHPLIWS 294

Query: 3810 AINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPRSSI 3631
            A NLPHDAYKLLAVPSPIGGVLVI ANTIHYHSQS+SC+LA+NNFA   D+SQ+MPRSS 
Sbjct: 295  AANLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSSSCSLALNNFAFFGDNSQEMPRSSF 354

Query: 3630 SVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITTIGS 3451
            +VELD ANA WL++DVAMLSTKTGELLLLT++YDGR+V +LDL+KSRASVLTSGITTIG 
Sbjct: 355  NVELDAANATWLTSDVAMLSTKTGELLLLTIIYDGRIVQKLDLSKSRASVLTSGITTIGD 414

Query: 3450 SLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQDIVN 3280
            SLFFLGSRLGDSLLVQ++CG+G S+   GV+EEVGDIE DAPSAKRLR SSSDALQD++N
Sbjct: 415  SLFFLGSRLGDSLLVQFSCGLGGSNLPPGVQEEVGDIESDAPSAKRLRMSSSDALQDMIN 474

Query: 3279 GEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQSNYE 3100
            GEELSLYG+APNNA+SAQK FSFAVRDS+INVGPLKDFSYG+RINAD NA GIAKQSNYE
Sbjct: 475  GEELSLYGTAPNNAQSAQKTFSFAVRDSLINVGPLKDFSYGMRINADLNATGIAKQSNYE 534

Query: 3099 LVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAVDDE 2920
            LVCCSGHGKNG+LCVLQQSI PE IT+  L GC+GIWTVYHKNTR H ++SS+M+  +DE
Sbjct: 535  LVCCSGHGKNGSLCVLQQSIRPETITQEALPGCKGIWTVYHKNTRIHLSESSRMADEEDE 594

Query: 2919 YHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGSRIL 2740
            YHAYLIISLE+RTMVL+T + L EVTE+VDYYVQG+T+AAGNLFGRRRV+QVFA G+RIL
Sbjct: 595  YHAYLIISLEARTMVLQTANNLEEVTENVDYYVQGTTLAAGNLFGRRRVIQVFAHGARIL 654

Query: 2739 DGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCTVSI 2560
            DG+FMTQ+LS    N ESG  S++S V+  SIADPYVLLRMT+GS+QLLVGDPS+C+VS+
Sbjct: 655  DGAFMTQELSFKASNVESGSSSDTSIVASVSIADPYVLLRMTNGSLQLLVGDPSSCSVSL 714

Query: 2559 NIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGDIYC 2380
             +PSVFESSKK IS CTLYHDKGPEPWLRK STDAWLS+G+GEAIDGADG   DQGD+YC
Sbjct: 715  TVPSVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSSGMGEAIDGADGVTQDQGDVYC 774

Query: 2379 VVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVAGQA 2200
            VVCYE+GTLEIFDVPNF+CVFSV  F+SG+  LVDT +++ S +   A + N+E+V    
Sbjct: 775  VVCYENGTLEIFDVPNFTCVFSVDKFISGRTYLVDTFMQD-SVNGLHAHSKNTEDVIRPG 833

Query: 2199 RKENTENMK--VVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAIS 2026
            +KEN++++K  VVE+ M RW G HS PFLFGIL DGTIL YHAY+FEG EN+SK+E ++S
Sbjct: 834  QKENSKDVKINVVELMMHRWIGKHSRPFLFGILADGTILSYHAYVFEGSENSSKVEGSVS 893

Query: 2025 GQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGSR 1846
             QNS++L+ST+ASRLRNLRFVRV +D Y REE P+G   QRM V+KN+GG QG+FL+GSR
Sbjct: 894  SQNSISLSSTNASRLRNLRFVRVPVDNYAREEMPSGTQLQRMNVYKNIGGSQGIFLTGSR 953

Query: 1845 PAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDNY 1666
            P+WFMV RERLR+HPQ+CDG IVAFTVLHNVNCNHGLIYVT+ G LKICQLPS  SYDNY
Sbjct: 954  PSWFMVFRERLRIHPQLCDGPIVAFTVLHNVNCNHGLIYVTALGTLKICQLPSFLSYDNY 1013

Query: 1665 WPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSSN 1486
            WPVQKIPLKGTPHQV YFAEKN+Y +IVSV VLKPLNQV+S++ DQE   Q + DNL   
Sbjct: 1014 WPVQKIPLKGTPHQVAYFAEKNVYSVIVSVPVLKPLNQVLSTIADQEVGQQFDPDNL--- 1070

Query: 1485 DLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLLA 1306
            +   +Y ++EFEVRI+EPEKSGG W+TRA+IPMQSSENALTVR+VTL N TT+ENETLLA
Sbjct: 1071 NYEGSYPIEEFEVRIVEPEKSGGLWKTRASIPMQSSENALTVRMVTLLNTTTRENETLLA 1130

Query: 1305 IGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVASG 1126
            +GTAY+QGEDVA RGRVLLFS+ R  DN++ LVSEV+SKE KGAI ALASLQGHLL+ASG
Sbjct: 1131 VGTAYVQGEDVAARGRVLLFSIDRTADNSRTLVSEVYSKELKGAIPALASLQGHLLIASG 1190

Query: 1125 PKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSLL 946
            PKI L+KW  +EL  VAF D PPLH VSLNIVKNFILLGDIHKSI F+SWKE   QLSLL
Sbjct: 1191 PKIILHKWTGSELNGVAFCDYPPLHAVSLNIVKNFILLGDIHKSISFVSWKEP--QLSLL 1248

Query: 945  AKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVGA 766
            AKD++ LDC ATEFLIDGSTLSL+VSDDQKNVQIFYYAPK SESWKGQKLLSRAEFHVG+
Sbjct: 1249 AKDFSPLDCLATEFLIDGSTLSLVVSDDQKNVQIFYYAPKVSESWKGQKLLSRAEFHVGS 1308

Query: 765  HVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQSL 586
             +TKF RLQ+L               DKTNRFA +F TL+GS+GCIAPLD LTFRRLQSL
Sbjct: 1309 RITKFLRLQLL--PTTSERTATTPGSDKTNRFATVFGTLEGSLGCIAPLDELTFRRLQSL 1366

Query: 585  QRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQI 406
            Q+KLV AV HVAGLNPRSFRQF+S+GKAH+PGPDNIVDCELL HYEMLPLEEQLEIA QI
Sbjct: 1367 QKKLVTAVTHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLSHYEMLPLEEQLEIAQQI 1426

Query: 405  GTTRSQIISNLNDLSLGTSFL 343
            GTTR QI+SNLND+ LGTSFL
Sbjct: 1427 GTTRMQIMSNLNDMILGTSFL 1447


>ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
            gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName:
            Full=Cleavage and polyadenylation specificity factor
            subunit 1; AltName: Full=Cleavage and polyadenylation
            specificity factor 160 kDa subunit; Short=AtCPSF160;
            Short=CPSF 160 kDa subunit gi|332008729|gb|AED96112.1|
            cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score = 2163 bits (5605), Expect = 0.0
 Identities = 1066/1462 (72%), Positives = 1247/1462 (85%), Gaps = 6/1462 (0%)
 Frame = -2

Query: 4710 MSYAAYKMMHWPTGIENCASGFITHCSADFAPQIPTLQT-DDLESDWP-ARRGIGPVPNL 4537
            MS+AAYKMMHWPTG+ENCASG+ITH  +D   QIP +   DD+E++WP  +RGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 4536 IITAGNVLEVYIIRVQEE-DVRDSRGSGEAKRGGVVAGISGAALELVCHYRLHGNVETMA 4360
            +ITA N+LEVYI+R QEE + ++ R    AKRGGV+ G+ G +LELVCHYRLHGNVE++A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 4359 VLSIGGGDGCRRRDSIILAFQDAKISVLEFDDSVHGLRTSSMHSFEGPEWLHLKRGRESF 4180
            VL +GGG+  + RDSIIL F+DAKISVLEFDDS+H LR +SMH FEGP+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 4179 ARGPLVKADPQGRCAGVLVYGLQMIILKTAQAGSGLVGDDDALNSGGAVSARVESSYIIS 4000
             RGPLVK DPQGRC GVLVYGLQMIILKT+Q GSGLVGDDDA +SGG VSARVESSYII+
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 3999 LRDLGMKHVKDFIFVHGYIEPVMVILHERELTWSGRISWKHHTCSISALSISTTLKQHPL 3820
            LRDL MKHVKDF+F+HGYIEPV+VIL E E TW+GR+SWKHHTC +SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 3819 IWSAINLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALAVNNFAVAADSSQDMPR 3640
            IWSAINLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALA+NN+A +ADSSQ++P 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 3639 SSISVELDNANAAWLSNDVAMLSTKTGELLLLTLVYDGRVVHRLDLTKSRASVLTSGITT 3460
            S+ SVELD A+  W+SNDVA+LSTK+GELLLLTL+YDGR V RLDL+KS+ASVL S IT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 3459 IGSSLFFLGSRLGDSLLVQYTCGVGASS---GVKEEVGDIEGDAPSAKRLRRSSSDALQD 3289
            +G+SLFFLGSRLGDSLLVQ++C  G ++   G+++E  DIEG+   AKRLR +S D  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query: 3288 IVNGEELSLYGSAPNNAESAQKNFSFAVRDSVINVGPLKDFSYGLRINADPNAAGIAKQS 3109
             +  EELSL+GS PNN++SAQK+FSFAVRDS++NVGP+KDF+YGLRINAD NA G++KQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 3108 NYELVCCSGHGKNGALCVLQQSIHPELITEVELQGCRGIWTVYHKNTRGHNADSSKMSAV 2929
            NYELVCCSGHGKNGALCVL+QSI PE+ITEVEL GC+GIWTVYHK++RGHNADSSKM+A 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 2928 DDEYHAYLIISLESRTMVLETVDVLGEVTESVDYYVQGSTIAAGNLFGRRRVVQVFARGS 2749
            +DEYHAYLIISLE+RTMVLET D+L EVTESVDYYVQG TIAAGNLFGRRRV+QVF  G+
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 2748 RILDGSFMTQDLSIGTPNTESGLGSESSTVSFASIADPYVLLRMTDGSIQLLVGDPSTCT 2569
            RILDGSFM Q+LS G  N+ES  GSESSTVS  SIADPYVLLRMTD SI+LLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 2568 VSINIPSVFESSKKLISCCTLYHDKGPEPWLRKASTDAWLSTGVGEAIDGADGAPHDQGD 2389
            VSI+ PSV E SK+ IS CTLYHDKGPEPWLRKASTDAWLS+GVGEA+D  DG P DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 2388 IYCVVCYESGTLEIFDVPNFSCVFSVGNFMSGKPNLVDTSLREPSKDPKIATNTNSEEVA 2209
            IYCVVCYESG LEIFDVP+F+CVFSV  F SG+ +L D  + E   +     N NSE+  
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHELEYE----LNKNSEDNT 835

Query: 2208 GQARKENTENMKVVEVTMQRWSGPHSCPFLFGILTDGTILCYHAYLFEGPENTSKMEEAI 2029
                 +NT   +VVE+ MQRWSG H+ PFLF +L DGTILCYHAYLF+G ++T K E ++
Sbjct: 836  SSKEIKNT---RVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 2028 SGQNSVNLNSTSASRLRNLRFVRVSLDTYTREETPTGITSQRMTVFKNVGGYQGLFLSGS 1849
            S +N   LNS+ +S+LRNL+F+R+ LDT TRE T  G+ SQR+T+FKN+ G+QG FLSGS
Sbjct: 892  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 1848 RPAWFMVVRERLRVHPQICDGSIVAFTVLHNVNCNHGLIYVTSEGLLKICQLPSVTSYDN 1669
            RP W M+ RERLR H Q+CDGSI AFTVLHNVNCNHG IYVT++G+LKICQLPS + YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 1668 YWPVQKIPLKGTPHQVTYFAEKNLYPLIVSVSVLKPLNQVVSSLVDQEASHQIENDNLSS 1489
            YWPVQKIPLK TPHQVTY+AEKNLYPLIVS  V KPLNQV+SSLVDQEA  Q++N N+SS
Sbjct: 1012 YWPVQKIPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMSS 1071

Query: 1488 NDLHQTYVVDEFEVRILEPEKSGGPWQTRATIPMQSSENALTVRVVTLYNATTKENETLL 1309
            +DL +TY V+EFE++ILEPE+SGGPW+T+A IPMQ+SE+ALTVRVVTL NA+T ENETLL
Sbjct: 1072 DDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLL 1131

Query: 1308 AIGTAYLQGEDVAGRGRVLLFSVGRNTDNTQNLVSEVFSKEYKGAISALASLQGHLLVAS 1129
            A+GTAY+QGEDVA RGRVLLFS G+N DN+QN+V+EV+S+E KGAISA+AS+QGHLL++S
Sbjct: 1132 AVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISS 1191

Query: 1128 GPKITLYKWNATELTPVAFFDVPPLHVVSLNIVKNFILLGDIHKSIYFLSWKEQGCQLSL 949
            GPKI L+KWN TEL  VAFFD PPL+VVS+N+VK+FILLGD+HKSIYFLSWKEQG QLSL
Sbjct: 1192 GPKIILHKWNGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSL 1251

Query: 948  LAKDYASLDCFATEFLIDGSTLSLMVSDDQKNVQIFYYAPKQSESWKGQKLLSRAEFHVG 769
            LAKD+ SLDCFATEFLIDGSTLSL VSD+QKN+Q+FYYAPK  ESWKG KLLSRAEFHVG
Sbjct: 1252 LAKDFESLDCFATEFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVG 1311

Query: 768  AHVTKFQRLQMLXXXXXXXXXXXXXXXDKTNRFALLFATLDGSIGCIAPLDFLTFRRLQS 589
            AHV+KF RLQM+               DK NRFALLF TLDGS GCIAPLD +TFRRLQS
Sbjct: 1312 AHVSKFLRLQMV-----------SSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQS 1360

Query: 588  LQRKLVDAVHHVAGLNPRSFRQFKSHGKAHKPGPDNIVDCELLCHYEMLPLEEQLEIAHQ 409
            LQ+KLVDAV HVAGLNP +FRQF+S GKA + GPD+IVDCELLCHYEMLPLEEQLE+AHQ
Sbjct: 1361 LQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQ 1420

Query: 408  IGTTRSQIISNLNDLSLGTSFL 343
            IGTTR  I+ +L DLS+GTSFL
Sbjct: 1421 IGTTRYSILKDLVDLSVGTSFL 1442


Top