BLASTX nr result
ID: Mentha29_contig00028494
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00028494
         (2230 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU39703.1| hypothetical protein MIMGU_mgv1a002729mg [Mimulus...  887   0.0
ref|XP_004237193.1| PREDICTED: cleavage and polyadenylation spec...  884   0.0
ref|XP_007045271.1| Cleavage and polyadenylation specificity fac...  876   0.0
ref|XP_007225129.1| hypothetical protein PRUPE_ppa002557mg [Prun...  861   0.0
ref|XP_002314781.2| metallo-beta-lactamase family protein [Popul...  857   0.0
ref|XP_004298504.1| PREDICTED: cleavage and polyadenylation spec...  853   0.0
ref|XP_006469322.1| PREDICTED: cleavage and polyadenylation spec...  842   0.0
ref|XP_002526000.1| cleavage and polyadenylation specificity fac...  842   0.0
ref|XP_006355061.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and...  841   0.0
ref|XP_006448159.1| hypothetical protein CICLE_v10014563mg [Citr...  841   0.0
ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation spec...  833   0.0
ref|XP_006395858.1| hypothetical protein EUTSA_v10005303mg [Eutr...  803   0.0
ref|NP_178282.2| cleavage and polyadenylation specificity factor...  800   0.0
ref|XP_007153236.1| hypothetical protein PHAVU_003G018200g [Phas...  796   0.0
gb|AAN87883.1| FEG protein [Arabidopsis thaliana]                    792   0.0
gb|AAS80153.1| ACT11D09.9 [Cucumis melo]                             791   0.0
gb|AFK42005.1| unknown [Medicago truncatula]                         790   0.0
ref|XP_006574816.1| PREDICTED: cleavage and polyadenylation spec...  787   0.0
gb|EPS71695.1| hypothetical protein M569_03060, partial [Genlise...  784   0.0
ref|XP_004498247.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and...
782 0.0 >gb|EYU39703.1| hypothetical protein MIMGU_mgv1a002729mg [Mimulus guttatus] Length = 643 Score = 887 bits (2293), Expect = 0.0 Identities = 454/640 (70%), Positives = 515/640 (80%), Gaps = 27/640 (4%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLIS-DSASFTDA 343 MAIECLVLGAGQ+VGKSCV+V INGK IMFDCG+H GY DHRRYPDFSLIS +S FTD+ Sbjct: 1 MAIECLVLGAGQDVGKSCVIVKINGKSIMFDCGIHTGYQDHRRYPDFSLISANSGDFTDS 60 Query: 344 LSCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQ 523 LSC+IITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAP+MLEDYRK++ + R EE+Q Sbjct: 61 LSCIIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPIMLEDYRKLIFEGREEEKQ 120 Query: 524 FSSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYN 703 FSSE IAECMKKVTAVDLKQTI VDKDLQIRAYYAGHVIGAAMFYAKVGD+A+VYTGDYN Sbjct: 121 FSSEDIAECMKKVTAVDLKQTIQVDKDLQIRAYYAGHVIGAAMFYAKVGDSAMVYTGDYN 180 Query: 704 MVPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTF 883 M D HLGAAQIDRLQLDLVITESTYAT+YRDSKYVREREFLK VH CVA GGKVLIPTF Sbjct: 181 MTADRHLGAAQIDRLQLDLVITESTYATSYRDSKYVREREFLKVVHNCVAGGGKVLIPTF 240 Query: 884 ALGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAF 1063 ALGRAQELCM+LDDYWE+TNLKVPIYFSAGLT+QAN+YYKILINWTS+KVKD + N F Sbjct: 241 ALGRAQELCMILDDYWEKTNLKVPIYFSAGLTLQANMYYKILINWTSQKVKDTYTTRNPF 300 Query: 1064 DFKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIG 1243 DFKNV SFDR+LI+APGPCVLFATP MIS GFSL+VFK WAP E NLVTL GYC +GTIG Sbjct: 301 DFKNVCSFDRSLIHAPGPCVLFATPAMIS-GFSLDVFKLWAPDERNLVTLTGYCGSGTIG 359 Query: 1244 HKLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKP 1423 KLM AK PT+I LD++V++DVRCQI+QLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKP Sbjct: 360 RKLMVAKPPTRITLDDNVQLDVRCQIYQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKP 419 Query: 1424 KMGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSR 1603 KM LKE I+SEL I C +PAN+E +HIPS+ IEA+ASDAFLQSCL PNFKF + D R Sbjct: 420 KMVKLKEIIESELEIPCYHPANHETLHIPSTRRIEASASDAFLQSCLCPNFKFLKTD-PR 478 Query: 1604 PDTP-----------TSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEM----RVENHQV 1738 D+ 
LQV ++RVAQG+L NQ R VVT +E+ E H+ Sbjct: 479 QDSELDDDSNNKRKMPQLQVCENRVAQGLLITLQKNQS--RKVVTENELIDTIGTEIHEE 536 Query: 1739 KFVHCCSATHSDSLSEKDTLLHL-LYGKLSRDFPDCVMHDCQTRVQVGSSFFASLCSREE 1915 KF +CC +S S + HL L+GKLSR+FPD + DC+ +++ SF S CS+ + Sbjct: 537 KFAYCCPIYFPNSESRDISSFHLILFGKLSREFPDLNIQDCEDVIRI-QSFVGSFCSKAK 595 Query: 1916 CR----------IDALHFCCTWSSTEDEELAWKIISVVDN 2005 C +D +HFCC W S DEELAW++IS + N Sbjct: 596 CPYRTDVSFQSVLDTVHFCCAW-SMGDEELAWRVISTMKN 634 >ref|XP_004237193.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 3-II-like [Solanum lycopersicum] Length = 1043 Score = 884 bits (2285), Expect = 0.0 Identities = 447/641 (69%), Positives = 520/641 (81%), Gaps = 35/641 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 M I+CLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMG++DH+RYPDFSLIS+S F +AL Sbjct: 1 MTIDCLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGHSDHQRYPDFSLISESGDFDNAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFHLDHIGALPYFT+VCGYNGPIYMTYPTKALAPLMLEDYR+V+VDRRGE+EQF Sbjct: 61 SCIIITHFHLDHIGALPYFTEVCGYNGPIYMTYPTKALAPLMLEDYRRVLVDRRGEKEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 SSE+IA+CMKKVTAVDLKQT+ VD+DLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM Sbjct: 121 SSENIADCMKKVTAVDLKQTMLVDRDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 D HLGAAQIDRLQLDLVITESTYATT RDSKYVREREFL+A+HKCV +GGKVLIP FA Sbjct: 181 TADRHLGAAQIDRLQLDLVITESTYATTIRDSKYVREREFLEAIHKCVDSGGKVLIPAFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELCMLLDDYWER NLKVPIYFSAGLTIQAN+YYK+LINW S+KVK+ A NAFD Sbjct: 241 LGRAQELCMLLDDYWERMNLKVPIYFSAGLTIQANMYYKVLINWASQKVKNLSATRNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV SF+R++INAPGPCVLFATPGM+SGGFSLEVFKQWAP E+NL+ LPGYCLA T+GH Sbjct: 301 FKNVHSFERSMINAPGPCVLFATPGMLSGGFSLEVFKQWAPCEQNLIVLPGYCLAETVGH 360 Query: 1247 
KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLM AK P +I +D+ +IDVRCQIHQLSFSPHTD+KGIMDL++FLSPK+VILVHGEKPK Sbjct: 361 KLMRAKPPARIDVDKSTQIDVRCQIHQLSFSPHTDSKGIMDLIRFLSPKNVILVHGEKPK 420 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M +LKERI+S+L I C YPANNE I ++H+I+A AS +FLQS LSPNFKF + SR Sbjct: 421 MASLKERIESDLRIPCYYPANNESQRIETTHYIKAEASKSFLQSSLSPNFKFLKTI-SRA 479 Query: 1607 DT--------PTSLQVYDDRVAQGILTLQPDNQDHPRVV---VTRDEMRVENHQVKFVHC 1753 DT + +QV DDRVA+G + +Q D HP++V D + ENH+V+ +C Sbjct: 480 DTGFVLNERAESCVQVCDDRVAEGAVIMQKD--QHPKIVHQNELMDILEAENHKVQVAYC 537 Query: 1754 CSATHSDS--------------LSEKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFF 1891 C D + +K +LLHLLY KLS F D + + R+Q+ SF Sbjct: 538 CPVCVPDEPKNVALSPGEDMHPVLDKCSLLHLLYTKLSNGFQDVTILNDGDRLQI-QSFT 596 Query: 1892 ASLCSREEC--RI--------DALHFCCTWSSTEDEELAWK 1984 S C +E+C RI +A++FCCTW S EDE+LAW+ Sbjct: 597 VSPCLKEKCPHRIHVNPDSTSEAVNFCCTW-SMEDEKLAWR 636 >ref|XP_007045271.1| Cleavage and polyadenylation specificity factor 73 kDa subunit-II [Theobroma cacao] gi|508709206|gb|EOY01103.1| Cleavage and polyadenylation specificity factor 73 kDa subunit-II [Theobroma cacao] Length = 657 Score = 876 bits (2263), Expect = 0.0 Identities = 453/655 (69%), Positives = 520/655 (79%), Gaps = 38/655 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQEVGKSCVVV+INGKRIMFDCGMHMGYTD RRYPDFSLIS + F +AL Sbjct: 1 MAIDCLVLGAGQEVGKSCVVVSINGKRIMFDCGMHMGYTDSRRYPDFSLISKTGDFDNAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 +CVIITHFHLDHIGALPYFT+VCGY GP+YMTYPTKALAPLMLEDYRK M DRRGE+ QF Sbjct: 61 TCVIITHFHLDHIGALPYFTEVCGYRGPVYMTYPTKALAPLMLEDYRKNM-DRRGEDGQF 119 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S+HI ECMKKV VDLKQT+ VDKDLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM Sbjct: 120 TSDHITECMKKVIPVDLKQTVQVDKDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNM 179 Query: 707 
VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRLQLDL+ITESTYATT RDS+Y REREFLKAVH CVAAGGKVLIPTFA Sbjct: 180 TPDRHLGAAQIDRLQLDLLITESTYATTIRDSRYGREREFLKAVHNCVAAGGKVLIPTFA 239 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LL+DYWER NLKVPIYFS+GLTIQAN+YYK+LINWTS+K+K+ +A HNAFD Sbjct: 240 LGRAQELCILLEDYWERMNLKVPIYFSSGLTIQANMYYKMLINWTSQKIKETYATHNAFD 299 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV +FDR+LINAPGPCVLFATPGMISGGFSLEVF QWAP E NL+TLPGYC+AGTIGH Sbjct: 300 FKNVQNFDRSLINAPGPCVLFATPGMISGGFSLEVFMQWAPSEINLITLPGYCVAGTIGH 359 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K PT+I LD+D ++DVRCQIHQLSFSPHTDAKGIMDLVKFLSPKH ILVHGEKPK Sbjct: 360 KLMSGK-PTKIDLDKDTQVDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHAILVHGEKPK 418 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRP----- 1591 M LKERIQSEL I+C PANN+ V IP++H+++A ASDAF++SCL+PNFKFS+ Sbjct: 419 MATLKERIQSELGIQCYCPANNDTVTIPTTHYVKADASDAFIKSCLNPNFKFSKSSSVDK 478 Query: 1592 --DGSRPDTP-TSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDE---MRVEN-HQVKFVH 1750 GS LQV D+RVA+GIL ++ + V+ +DE M EN H+V+F + Sbjct: 479 SYSGSNDSKAIPGLQVSDERVAEGILVVEKGKK---AKVIHQDELLHMLGENKHEVQFAY 535 Query: 1751 C-------CSATHSDSLSEKD---------TLLHLLYGKLSRDFPDCVMHDCQTRVQVGS 1882 C T S+ L D TL+ LL KLS + D + D ++QV Sbjct: 536 CFPMRTERLEKTRSEDLPSADDLLCGLDKCTLISLLSTKLSNELSDGNIQDLGEQLQV-E 594 Query: 1883 SFFASLCSREEC--RI--------DALHFCCTWSSTEDEELAWKIISVVDNENNP 2017 SF S+C ++ C RI + + FCC+W S DE LAWKIIS++ N P Sbjct: 595 SFCLSICLKDNCPHRISDSLQNDSEVVFFCCSW-SVADEMLAWKIISIMKNYTLP 648 >ref|XP_007225129.1| hypothetical protein PRUPE_ppa002557mg [Prunus persica] gi|462422065|gb|EMJ26328.1| hypothetical protein PRUPE_ppa002557mg [Prunus persica] Length = 658 Score = 861 bits (2224), Expect = 0.0 Identities = 440/655 (67%), Positives = 514/655 (78%), Gaps = 40/655 (6%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLI---SDSASFT 337 
MAI+ LVLGAGQEVGKSCVVVTINGKRIMFDCGMHMG+ DH+RYPDFSLI +F Sbjct: 1 MAIDSLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGHLDHQRYPDFSLIPKPEPDPNFD 60 Query: 338 DALSCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEE 517 AL+C+IITHFHLDH+GALPYFT+VCGY GPIYMTYPTKALAP+MLEDYRKVMV+RRGEE Sbjct: 61 HALTCIIITHFHLDHVGALPYFTEVCGYRGPIYMTYPTKALAPIMLEDYRKVMVERRGEE 120 Query: 518 EQFSSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGD 697 EQFSS+HIAECMKKV VDLKQT+ VDKDLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGD Sbjct: 121 EQFSSDHIAECMKKVIPVDLKQTVQVDKDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGD 180 Query: 698 YNMVPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIP 877 YNM PD HLGAAQI+RL LDL+I+ESTY TT RDSKY REREFL+AVHKCVA GGKVLIP Sbjct: 181 YNMTPDRHLGAAQIERLNLDLLISESTYGTTIRDSKYAREREFLRAVHKCVAGGGKVLIP 240 Query: 878 TFALGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHN 1057 TFALGRAQELC+LL+DYWER NLKVPIYFSAGLT+QAN+YYK+LI+WTS+KVK+ ++ N Sbjct: 241 TFALGRAQELCILLEDYWERMNLKVPIYFSAGLTLQANMYYKMLISWTSQKVKETYSTRN 300 Query: 1058 AFDFKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGT 1237 AFDFKN FDR++INAPGPCVLFATPGMISGGFSLEVFK WAP E NLVTLPGYC+AGT Sbjct: 301 AFDFKNAHKFDRSMINAPGPCVLFATPGMISGGFSLEVFKHWAPSEMNLVTLPGYCVAGT 360 Query: 1238 IGHKLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGE 1417 IGHKLMS K PT+I LD+D +IDVRCQIH LSFSPHTDAKGIMDL+KFLSPK+VILVHGE Sbjct: 361 IGHKLMSGK-PTKIDLDKDTQIDVRCQIHHLSFSPHTDAKGIMDLIKFLSPKNVILVHGE 419 Query: 1418 KPKMGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRP-- 1591 KPKM LK +IQSEL I+C PANNE V I S+H+++A ASDAF++SC +PNFKFS+ Sbjct: 420 KPKMATLKGKIQSELGIQCHDPANNETVSISSTHYVKALASDAFIRSCSNPNFKFSKSSQ 479 Query: 1592 -----DGSRPDTPT-SLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQVK 1741 SR + T L+V D+RVA+G+L ++ ++ VV +DE+ + + HQV+ Sbjct: 480 EDEHGSNSRNNNFTPRLRVSDERVAEGVLVME---RNKKAKVVHQDELLLMLGEKKHQVQ 536 Query: 1742 FVHCC------------SATHSDSLSEKDT---LLHLLYGKLSRDFPDCVMHDCQTRVQV 1876 F +CC S T+ L + +T LL L KLS +F + D + +QV Sbjct: 537 
FAYCCPADIGHLGETKSSTTNDGQLCKSETCSRLLRQLSAKLSNEFSQGNIQDFEDHLQV 596 Query: 1877 GSSFFASLCSREECRI----------DALHFCCTWSSTEDEELAWKIISVVDNEN 2011 SF S+C + C +A FCC+W T DE+LAWKIIS+ N N Sbjct: 597 -ESFHVSICLKNNCPYRLMDVQNKSQEAAFFCCSW-GTADEKLAWKIISICQNFN 649 >ref|XP_002314781.2| metallo-beta-lactamase family protein [Populus trichocarpa] gi|550329586|gb|EEF00952.2| metallo-beta-lactamase family protein [Populus trichocarpa] Length = 625 Score = 857 bits (2214), Expect = 0.0 Identities = 437/627 (69%), Positives = 509/627 (81%), Gaps = 33/627 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGY DHRRYPDFSLIS S F +L Sbjct: 1 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYDDHRRYPDFSLISKSRDFDHSL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 CVIITHFHLDH+GALPYFT+VCGYNGPIYMTYPTKALAPLMLED+RKV+VDRRGEEEQF Sbjct: 61 DCVIITHFHLDHVGALPYFTEVCGYNGPIYMTYPTKALAPLMLEDFRKVLVDRRGEEEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S HI++CM+KV AVDLKQT+ VD DLQIRAYYAGHV+GAAMFYAKVGD+A+VYTGDYNM Sbjct: 121 TSLHISQCMEKVIAVDLKQTVQVDDDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRL+LDL+ITESTYATT RDSKY REREFLKAVH+CVA GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLELDLLITESTYATTIRDSKYAREREFLKAVHECVAGGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NLKVPIYFSAGLTIQANLYYKILI+WTS+KVK+ +A NAFD Sbjct: 241 LGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANLYYKILISWTSQKVKETYATRNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FK+V +FDR+LINAPGPCVLFATPGMISGGFSLEVFKQWAP E NL+TLPGYC+AGT+GH Sbjct: 301 FKHVHNFDRSLINAPGPCVLFATPGMISGGFSLEVFKQWAPCEMNLITLPGYCVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K PT+I LD+D +IDVRCQIHQLSFSPHTD+KGIMDL KFLSP++VILVHGEKPK Sbjct: 361 
KLMSGK-PTKINLDKDTQIDVRCQIHQLSFSPHTDSKGIMDLTKFLSPRNVILVHGEKPK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKF---SRPDG 1597 M +LKERIQ+EL I C PAN + VHIPS+ +++A AS+ F++SCL+PNF+F S+ D Sbjct: 420 MVSLKERIQTELRIPCYLPANCDAVHIPSTIYVKAHASNTFIRSCLNPNFRFLKRSKEDN 479 Query: 1598 S----RPDTPTS-LQVYDDRVAQGILTLQPDNQDHPRVVVTRDE----MRVENHQVKFVH 1750 S R PT+ LQV D+RVA+GIL ++ + VV +D+ +R + H V+F + Sbjct: 480 SDQVLRNTNPTAPLQVNDERVAEGILIMEKGKKAR---VVHQDDLLLMLRQKKHDVQFAY 536 Query: 1751 CCSATHSD-----------SLSEKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFFAS 1897 CC+A + LS+K + L LL+ +LS F + D +QV SF S Sbjct: 537 CCAAQLDNLEETRNRDDALGLSDKCSSLQLLFKELSNYFSGVNIEDLGEHLQV-ESFHVS 595 Query: 1898 LCSREECR---ID-------ALHFCCT 1948 +C ++ C ID ++FCC+ Sbjct: 596 VCLKDNCPYRIIDNSQKEAVTVYFCCS 622 >ref|XP_004298504.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 3-II-like [Fragaria vesca subsp. vesca] Length = 1009 Score = 853 bits (2205), Expect = 0.0 Identities = 431/639 (67%), Positives = 503/639 (78%), Gaps = 29/639 (4%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+ LVLGAGQEVGKSCV+VTINGKRIMFDCGMHMG+ DHRRYPDFSLI++ L Sbjct: 1 MAIDSLVLGAGQEVGKSCVIVTINGKRIMFDCGMHMGHLDHRRYPDFSLINNQT-----L 55 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 +C++ITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRK+MV+RRGEEEQF Sbjct: 56 TCIVITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKIMVERRGEEEQF 115 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +SEHIAECMKKV AV+LK+T+ VDKDLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM Sbjct: 116 TSEHIAECMKKVIAVNLKETVQVDKDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNM 175 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRL LDLVITESTYATT RDSKY REREFLKAVH CVA GGKVLIP+FA Sbjct: 176 TPDRHLGAAQIDRLSLDLVITESTYATTIRDSKYPREREFLKAVHTCVAGGGKVLIPSFA 235 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LL+DYWER NLKVPIYFS LT QAN+YY 
+LI+WTS+K+K+ H+ HNAFD Sbjct: 236 LGRAQELCILLEDYWERMNLKVPIYFSTALTRQANMYYMMLISWTSQKIKETHSTHNAFD 295 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV F+R++I+APGPCVLFA PGMISGGFSLEVFK WAP E+NLV +PGYC+AGTIGH Sbjct: 296 FKNVHKFERSMIDAPGPCVLFAGPGMISGGFSLEVFKHWAPSEKNLVIMPGYCVAGTIGH 355 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K PT+I LD+D +IDVRCQIHQL+FSPHTDAKGIMDLVKFLSPKHVILVHGEKPK Sbjct: 356 KLMSGK-PTKIDLDKDTRIDVRCQIHQLAFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 414 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPD---- 1594 M LK RI++EL I+C YPA NE V IPS+H+ A SDAF+QSC SPNFKFS Sbjct: 415 MATLKGRIETELGIQCYYPAINETVSIPSTHYATAVTSDAFVQSCSSPNFKFSTSSSEHS 474 Query: 1595 --GSRPDTPTSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQVKFVHCC 1756 S + L+V DDRVA+GIL ++ + + VV +DE+ + + HQVKF +CC Sbjct: 475 QGSSNSKSIPRLRVSDDRVAEGILVMEKNKK---AKVVHQDELPLMLGEKKHQVKFAYCC 531 Query: 1757 SATHSDS--------LSEKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFFASLCSRE 1912 A +S S + LL L KL+ + + + +Q+ S S+CS Sbjct: 532 PAHIGNSQENTSLPLSSTNNDLLQLFSAKLANELSEGNIQHFGDHLQL-ESIHVSICSNN 590 Query: 1913 EC-----------RIDALHFCCTWSSTEDEELAWKIISV 1996 +C + + + FCC+W EDE+LAWK+IS+ Sbjct: 591 DCPYRLSDNGAEEQREPVFFCCSW-EMEDEKLAWKVISI 628 >ref|XP_006469322.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 3-II-like [Citrus sinensis] Length = 1019 Score = 842 bits (2176), Expect = 0.0 Identities = 427/653 (65%), Positives = 508/653 (77%), Gaps = 38/653 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQEVGKSCVVVTINGKRIMFDCGMHM Y DHR+YPDFS IS S F +A+ Sbjct: 1 MAIDCLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMAYNDHRQYPDFSRISKSCDFNNAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 C++ITHFHLDHIGALP+FT++CGYNGPIYMTYPT+ALAP+MLEDYRKV+VDRRGE EQF Sbjct: 61 DCIVITHFHLDHIGALPFFTEICGYNGPIYMTYPTRALAPIMLEDYRKVLVDRRGEVEQF 120 Query: 527 
SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S+HIAECMKKV AVDLKQT+ VDKDLQIRAYYAGHV+GAAMFYAKVGD+A+VYTGDYNM Sbjct: 121 TSDHIAECMKKVIAVDLKQTVQVDKDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAA+IDRLQLDL+ITESTYATT RDSKY REREFLKAVHKCVA GGKVLIP FA Sbjct: 181 TPDRHLGAARIDRLQLDLLITESTYATTVRDSKYAREREFLKAVHKCVAGGGKVLIPAFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NL+VPIYFSAGLTIQAN+YYK+LI+WTS+KVK+ +NAFD Sbjct: 241 LGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLISWTSQKVKET---YNAFD 297 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV +FDR+LI+APGPCVLFATPGM++GGFSLEVFK WAP E NL+TLPGYCLAGTIG+ Sbjct: 298 FKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSEMNLITLPGYCLAGTIGN 357 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS ++ E KIDVRCQIHQL+FSPHTD KGIMDLVKFLSP+HVILVHGEKPK Sbjct: 358 KLMSGNPTIEL---EGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPK 414 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M LKERIQSEL I+C PANNE + IPS+H+++A ASDAF++SC++PNF++ + Sbjct: 415 MATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSCMNPNFQYLKSGSEEK 474 Query: 1607 DTPTS--------LQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQVKFVH 1750 S L + D+RVA+GIL L+ + VV +DE+ + + H+V+F + Sbjct: 475 SVSGSKCTEGTLPLWIKDERVAEGILVLE---KSEKAKVVHQDELLLMLGEKRHEVQFAY 531 Query: 1751 CCSATHSD-------SLS---------EKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGS 1882 CC + SL+ K +L+ LL KLSR + + D +QV Sbjct: 532 CCPVNVDELEKFTTTSLTPTARMLRDPNKSSLIRLLVAKLSRKLSEGNIQDFGEHLQV-E 590 Query: 1883 SFFASLCSREEC----------RIDALHFCCTWSSTEDEELAWKIISVVDNEN 2011 SF S+C ++ C + FCCTWS+ D++LA KIIS ++N + Sbjct: 591 SFHLSVCLKDTCPYRITNGLEDKPRTAFFCCTWSAA-DDKLARKIISAMENRD 642 >ref|XP_002526000.1| cleavage and polyadenylation specificity factor, putative [Ricinus communis] gi|223534732|gb|EEF36424.1| cleavage and polyadenylation specificity factor, putative 
[Ricinus communis] Length = 963 Score = 842 bits (2175), Expect = 0.0 Identities = 419/566 (74%), Positives = 476/566 (84%), Gaps = 19/566 (3%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGY DHRRYPDFSLIS S F AL Sbjct: 1 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYDDHRRYPDFSLISKSGDFDSAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 CVIITHFHLDH+GALPYFT+VCGYNGP+YMTYPTKAL+PLMLEDYRKVMVDRRGEEEQF Sbjct: 61 HCVIITHFHLDHVGALPYFTEVCGYNGPVYMTYPTKALSPLMLEDYRKVMVDRRGEEEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +++HI +C+ KV AVDLKQT+ VDKDLQIRAYYAGHV+GAAMFYAKVGD+A+VYTGDYNM Sbjct: 121 TADHIKQCLNKVIAVDLKQTVQVDKDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRLQLDL+ITESTYATT RDSKY REREFLK VHKCVA GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLQLDLLITESTYATTIRDSKYAREREFLKVVHKCVAGGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NLKVPIYFSAGLTIQAN+YYK+LI WTS+K+K+ + NAFD Sbjct: 241 LGRAQELCLLLDDYWERMNLKVPIYFSAGLTIQANMYYKMLIGWTSQKIKETYTSRNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV +FDR+L++APGPCVLFATPGMISGGFSLEVFK+WAP E NLVTLPGYC+AGTIGH Sbjct: 301 FKNVYTFDRSLLDAPGPCVLFATPGMISGGFSLEVFKRWAPCEMNLVTLPGYCVAGTIGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K P++I LD+D +IDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK Sbjct: 361 KLMSGK-PSKINLDKDTQIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKF---SRPDG 1597 M +LKERIQSEL I+C PAN E + IPS+ ++A AS+AF++SCLSPNF+F S D Sbjct: 420 MASLKERIQSELEIQCYVPANCETLCIPSTLFVKADASEAFIRSCLSPNFRFLNKSLKDT 479 Query: 1598 S-----RPDTPTSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQVKFVH 1750 S + + L+V D+RVA+GIL ++ + + VV +DE+ + + H+V+F + Sbjct: 480 
SDLVLHSTNATSRLEVSDERVAEGILVVEKNKKAR---VVHQDELLLMLGAKQHEVQFAY 536 Query: 1751 CC-------SATHSDSLSEKDTLLHL 1807 CC T D S D LL L Sbjct: 537 CCPVQVDNMDQTRRDPSSTHDELLTL 562 >ref|XP_006355061.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation specificity factor subunit 3-II-like [Solanum tuberosum] Length = 948 Score = 841 bits (2172), Expect = 0.0 Identities = 411/541 (75%), Positives = 468/541 (86%), Gaps = 11/541 (2%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 M I+CLVLGAGQ+VGKSCVVVTINGKRIMFDCGMHMG+ DHRRYPDFSLIS+S F +AL Sbjct: 1 MTIDCLVLGAGQDVGKSCVVVTINGKRIMFDCGMHMGHDDHRRYPDFSLISESGDFDNAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFHLDHIGALPYFT+VCGYNGPIYMTYPTKALAPLMLEDYR+V+VDRRGE+EQF Sbjct: 61 SCIIITHFHLDHIGALPYFTEVCGYNGPIYMTYPTKALAPLMLEDYRRVLVDRRGEKEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 SSE+I +CMKKVTAVDLKQT+ VD+DLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM Sbjct: 121 SSENITDCMKKVTAVDLKQTVLVDRDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 D HLGAAQIDRLQLDLVITESTYATT RDSKYVREREFL+A+HKCV +GGKVLIP FA Sbjct: 181 TADRHLGAAQIDRLQLDLVITESTYATTIRDSKYVREREFLEAIHKCVDSGGKVLIPAFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELCMLLDDYWER NLKVPIYFSAGLTIQAN+YYK+LINW S+KVK+ A NAFD Sbjct: 241 LGRAQELCMLLDDYWERMNLKVPIYFSAGLTIQANMYYKVLINWASQKVKNLSATRNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV SF+R++INAPGPCVLFATPGM+SGGFSLEVFKQWAPYE+NL+ LPGYCLA T+GH Sbjct: 301 FKNVHSFERSMINAPGPCVLFATPGMLSGGFSLEVFKQWAPYEQNLIALPGYCLAETVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLM AK P +I +D+ +IDVRCQIHQLSFSPHTD+KGIMDL++FLSPK+VILVHGEKPK Sbjct: 361 KLMRAKPPARIDVDKSTQIDVRCQIHQLSFSPHTDSKGIMDLIRFLSPKNVILVHGEKPK 420 Query: 1427 
MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M +LKERI+S+L I C YPANNE I S+ +I+A AS +FLQS LSPNFKF + SR Sbjct: 421 MASLKERIESDLRIPCYYPANNESQCIESTQYIKAEASKSFLQSSLSPNFKFLKTI-SRA 479 Query: 1607 DT--------PTSLQVYDDRVAQGILTLQPDNQDHPRVV---VTRDEMRVENHQVKFVHC 1753 DT + +QV DDRVA+G + +Q D HP++V D + ENH+V+ +C Sbjct: 480 DTGFILNERAESCIQVCDDRVAEGAVIMQKD--QHPKIVHQNELMDILEAENHKVQVAYC 537 Query: 1754 C 1756 C Sbjct: 538 C 538 >ref|XP_006448159.1| hypothetical protein CICLE_v10014563mg [Citrus clementina] gi|557550770|gb|ESR61399.1| hypothetical protein CICLE_v10014563mg [Citrus clementina] Length = 646 Score = 841 bits (2172), Expect = 0.0 Identities = 427/653 (65%), Positives = 507/653 (77%), Gaps = 38/653 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQEVGKSCVVVTINGKRIMFDCGMHM Y DHR+YPDFS IS S F +A+ Sbjct: 1 MAIDCLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMAYNDHRQYPDFSRISKSCDFNNAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 C++ITHFHLDHIGALP+FT++CGYNGPIYMTYPT+ALAP+MLEDYRKV+VDRRGE EQF Sbjct: 61 DCIVITHFHLDHIGALPFFTEICGYNGPIYMTYPTRALAPIMLEDYRKVLVDRRGEVEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S+HIAECMKKV AVDLKQT+ VDKDLQIRAYYAGHV+GAAMFYAKVGD+A+VYTGDYNM Sbjct: 121 TSDHIAECMKKVIAVDLKQTVQVDKDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAA+IDRLQLDL+ITESTYATT RDSKY REREFLKAVHKCVA GGKVLIP FA Sbjct: 181 TPDRHLGAARIDRLQLDLLITESTYATTVRDSKYAREREFLKAVHKCVAGGGKVLIPAFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NL+VPIYFSAGLTIQAN+YYK+LI+WTS+KVK+ +NAFD Sbjct: 241 LGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLISWTSQKVKET---YNAFD 297 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV +FDR+LI+APGPCVLFATPGM++GGFSLEVFK WAP E NL+TLPGYCLAGTIG+ Sbjct: 298 
FKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSEMNLITLPGYCLAGTIGN 357 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS ++ E KIDVRCQIHQL+FSPHTD KGIMDLVKFLSP+HVILVHGEKPK Sbjct: 358 KLMSGNPTIEL---EGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPK 414 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M LKERIQSEL I+C PANNE + IPS+H+++A ASDAF++SC++PNF+F + Sbjct: 415 MATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSCMNPNFQFLKSGSEEK 474 Query: 1607 DTPTS--------LQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQVKFVH 1750 S L + D+RVA+GIL L+ + VV +DE+ + + H+V+F + Sbjct: 475 SVSGSKCTEGTLPLWIKDERVAEGILVLE---KSEKAKVVHQDELLLMLGEKRHEVQFAY 531 Query: 1751 CCSATHSD-------SLS---------EKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGS 1882 CC + SL+ K +L+ LL KLSR + + D +QV Sbjct: 532 CCPVNVDELEKFTTTSLTPTARMLRDPNKSSLIRLLVAKLSRKLSEGNIQDFGEHLQV-E 590 Query: 1883 SFFASLCSREEC----------RIDALHFCCTWSSTEDEELAWKIISVVDNEN 2011 SF S+C ++ C + CCTWS+ D++LA KIIS ++N + Sbjct: 591 SFHLSVCLKDTCPYRITNGLEDKPRTAFVCCTWSAA-DDKLARKIISAMENRD 642 >ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 3-II-like [Cucumis sativus] Length = 649 Score = 833 bits (2151), Expect = 0.0 Identities = 428/650 (65%), Positives = 498/650 (76%), Gaps = 35/650 (5%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQEVGKSCVVVTINGKRIMFDCGMH+GY DHRRYPDFS IS S + + L Sbjct: 1 MAIDCLVLGAGQEVGKSCVVVTINGKRIMFDCGMHLGYVDHRRYPDFSRISASHDYNNVL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFHLDHIGALPYFT+VCGYNGPIYMTYPT ALAP+ LEDYRKVMVDRRGE EQF Sbjct: 61 SCIIITHFHLDHIGALPYFTEVCGYNGPIYMTYPTMALAPITLEDYRKVMVDRRGEAEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +++HI EC+KKV VDLKQTI VD+DLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM Sbjct: 121 TNDHIMECLKKVVPVDLKQTIQVDEDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 
886 PD HLGAAQIDR+QLDL+ITESTYATT RDSKY REREFLKAVH C+A+GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRMQLDLLITESTYATTIRDSKYAREREFLKAVHNCLASGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NLK PIY SAGLT+QAN+YYK+LI+WTS+KVK+ + NAFD Sbjct: 241 LGRAQELCVLLDDYWERMNLKFPIYVSAGLTVQANMYYKMLISWTSQKVKETYTTRNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV FDR++I+APGPCVLFATPGMIS GFSLEVFK+WAP + NL+TLPGYC+AGT+GH Sbjct: 301 FKNVQKFDRSMIDAPGPCVLFATPGMISSGFSLEVFKRWAPSKLNLITLPGYCVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K PT+I LD+ +IDV+CQ+HQL+FSPHTD+KGIMDLVKFLSPKHVILVHGEKPK Sbjct: 361 KLMSGK-PTKIDLDKVTQIDVQCQVHQLAFSPHTDSKGIMDLVKFLSPKHVILVHGEKPK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M LKERI SEL I C PANNE V I S+ ++A AS F+QSC +PNFKF + + P Sbjct: 420 MAVLKERIHSELGIPCHDPANNETVSISSTLSVKAEASSMFIQSCSTPNFKFLKRNLIDP 479 Query: 1607 DTP-------------TSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEMRV----ENHQ 1735 D LQV DDRV +GIL ++ + + +DE+ + + H+ Sbjct: 480 DLKDLSYKAERTSNMLIPLQVSDDRVNEGILVMENGKKTK---ALHQDELLLLLGQQEHE 536 Query: 1736 VKFVHC-------CSATH-SDSLSEKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFF 1891 V+F HC H DSLS K L L KLS + D + + +QV S Sbjct: 537 VRFAHCRPIYFGSLDEIHVMDSLSRKSLWLSQLSFKLSTELSDRNVQNLGEYLQV-ESIT 595 Query: 1892 ASLCSREEC---RID-------ALHFCCTWSSTEDEELAWKIISVVDNEN 2011 S+CS+E C ID A+ FCC DE LAWKIIS+++ + Sbjct: 596 LSICSKENCPYRTIDRIKNESTAMVFCCCSWLVADEILAWKIISILEKHD 645 >ref|XP_006395858.1| hypothetical protein EUTSA_v10005303mg [Eutrema salsugineum] gi|557092497|gb|ESQ33144.1| hypothetical protein EUTSA_v10005303mg [Eutrema salsugineum] Length = 617 Score = 803 bits (2075), Expect = 0.0 Identities = 403/627 (64%), Positives = 481/627 (76%), Gaps = 16/627 (2%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQE+GKSCVVVTINGKRIMFDCGMHMG DH RYPDFSL+S S F +A+ Sbjct: 1 
MAIDCLVLGAGQEIGKSCVVVTINGKRIMFDCGMHMGCDDHNRYPDFSLLSKSGDFDNAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFH+DH+GALPYFT+VCGYNGP+YM+YPTKAL+PLMLEDYR+VMVDRRGEEE F Sbjct: 61 SCIIITHFHMDHVGALPYFTEVCGYNGPVYMSYPTKALSPLMLEDYRRVMVDRRGEEELF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 ++ HIA CM+KV A+DLKQTI VD+DLQIRAYYAGHV+GA M YAKVGDAA+VYTGDYNM Sbjct: 121 TTAHIANCMEKVIALDLKQTIQVDQDLQIRAYYAGHVLGAVMVYAKVGDAAIVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 D HLGAA+IDRLQLDL+I+ESTYATT R SKY REREFL+AVHKCVA GGK LIP+FA Sbjct: 181 TTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELCMLLDDYWER N+KVPIYFS+GLTIQAN+YYK+LI+WTS+ VK+ HA HN FD Sbjct: 241 LGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHATHNPFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV FDR+LI+APGPCVLFATPGM+ GFSLEVFK WAP NLV LPGY +AGT+GH Sbjct: 301 FKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K PT + L K+DVRC+IHQ++FSPHTDAKGIMDL KFLSPK+V+LVHGEKP Sbjct: 361 KLMSGK-PTTVDLYNGTKVDVRCKIHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPS 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M +LK++I SEL + C PAN E V + S+ I+A ASD FL+SC SPNF+FS Sbjct: 420 MMSLKDKITSELDVPCFVPANGETVSVSSTTFIKANASDMFLKSCSSPNFRFS------- 472 Query: 1607 DTPTSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDE----MRVENHQVKFVHCCSA---T 1765 ++ T L+V D R A G+L ++ + +V DE + +NH V CC Sbjct: 473 NSSTELRVTDQRTADGVLVIEKSKK---AKIVHLDEVSEVLHEKNHVVSLACCCPVKVKK 529 Query: 1766 HSDSLSEKDTLLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFFASLCSREEC--------- 1918 SD + + D ++ L K+S +H+ +QVG SF SLC +E+C Sbjct: 530 ESDDV-DVDLIIKQLSEKISEKVSGAEIHETGNCLQVG-SFKGSLCLKEKCAHRREISSS 587 Query: 1919 RIDALHFCCTWSSTEDEELAWKIISVV 1999 +A+ CC W S D EL W+II+V+ 
Sbjct: 588 SSEAVFLCCNW-SVSDLELGWEIINVI 613 >ref|NP_178282.2| cleavage and polyadenylation specificity factor subunit 3-II [Arabidopsis thaliana] gi|332278175|sp|Q8GUU3.2|CPS3B_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit 3-II; AltName: Full=Cleavage and polyadenylation specificity factor 73 kDa subunit II; Short=AtCPSF73-II; Short=CPSF 73 kDa subunit II; AltName: Full=Protein EMBRYO SAC DEVELOPMENT ARREST 26 gi|62320470|dbj|BAD94982.1| putative cleavage and polyadenylation specifity factor [Arabidopsis thaliana] gi|330250395|gb|AEC05489.1| cleavage and polyadenylation specificity factor subunit 3-II [Arabidopsis thaliana] Length = 613 Score = 800 bits (2066), Expect = 0.0 Identities = 398/623 (63%), Positives = 476/623 (76%), Gaps = 12/623 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQE+GKSCVVVTINGK+IMFDCGMHMG DH RYP+FSLIS S F +A+ Sbjct: 1 MAIDCLVLGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFH+DH+GALPYFT+VCGYNGPIYM+YPTKAL+PLMLEDYR+VMVDRRGEEE F Sbjct: 61 SCIIITHFHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 ++ HIA CMKKV A+DLKQTI VD+DLQIRAYYAGHV+GA M YAK+GDAA+VYTGDYNM Sbjct: 121 TTTHIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 D HLGAA+IDRLQLDL+I+ESTYATT R SKY REREFL+AVHKCVA GGK LIP+FA Sbjct: 181 TTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELCMLLDDYWER N+KVPIYFS+GLTIQAN+YYK+LI+WTS+ VK+ H HN FD Sbjct: 241 LGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV FDR+LI+APGPCVLFATPGM+ GFSLEVFK WAP NLV LPGY +AGT+GH Sbjct: 301 
FKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLM+ K PT + L K+DVRC++HQ++FSPHTDAKGIMDL KFLSPK+V+LVHGEKP Sbjct: 361 KLMAGK-PTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPS 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M LKE+I SEL I C PAN E V S+ +I+A ASD FL+SC +PNFKFS Sbjct: 420 MMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFLKSCSNPNFKFS------- 472 Query: 1607 DTPTSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDE----MRVENHQVKFVHCCSATHSD 1774 T L+V D R A G+L ++ + +V +DE + +NH V HCC Sbjct: 473 -NSTQLRVTDHRTADGVLVIEKSKK---AKIVHQDEISEVLHEKNHVVSLAHCCPVKVKG 528 Query: 1775 SLSEKDT-LLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFFASLCSREEC-------RIDA 1930 + D L+ L K+ + +H+ + +QV +SF SLC +++C +A Sbjct: 529 ESEDDDVDLIKQLSAKILKTVSGAQIHESENCLQV-ASFKGSLCLKDKCMHRSSSSSSEA 587 Query: 1931 LHFCCTWSSTEDEELAWKIISVV 1999 + CC W S D EL W+II+ + Sbjct: 588 VFLCCNW-SIADLELGWEIINAI 609 >ref|XP_007153236.1| hypothetical protein PHAVU_003G018200g [Phaseolus vulgaris] gi|561026590|gb|ESW25230.1| hypothetical protein PHAVU_003G018200g [Phaseolus vulgaris] Length = 537 Score = 796 bits (2055), Expect = 0.0 Identities = 386/525 (73%), Positives = 448/525 (85%), Gaps = 8/525 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAIE LVLGAGQEVGKSCV+VTINGKRIMFDCGMHMG+ DHRRYPDF+L+ + F A+ Sbjct: 1 MAIETLVLGAGQEVGKSCVLVTINGKRIMFDCGMHMGFLDHRRYPDFTLVDPNQDFNSAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 +C+IITHFHLDH+GAL YFT+VCGY+GPIYMTYPTKALAPLMLEDYRKVMVDRRGEEE F Sbjct: 61 TCIIITHFHLDHVGALAYFTEVCGYSGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEELF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 SS IAECMKKVTAVDL+QT+ VD+DLQIRAYYAGHVIGAAMFYAKVGDA +VYTGDYNM Sbjct: 121 SSNQIAECMKKVTAVDLRQTVQVDEDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD 
HLGAAQIDRL+LDL+ITESTYATT RDS+Y REREFLKAVHKCV+ GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLRLDLLITESTYATTIRDSRYAREREFLKAVHKCVSCGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LL+DYWER NLKVPIYFSAGLTIQAN YYK+LI+WTS+K+KD +++HNAFD Sbjct: 241 LGRAQELCILLEDYWERMNLKVPIYFSAGLTIQANAYYKMLISWTSQKIKDTYSKHNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV F++++I+APGPCVLFATPGMISGGFSLEVFK WA E NLVTLPGYC+AGTIGH Sbjct: 301 FKNVQKFEKSMIDAPGPCVLFATPGMISGGFSLEVFKHWAVSENNLVTLPGYCVAGTIGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K ++ LD + +IDVRCQ+HQL+FSPHTD+KGIMDLV FL+PKHVILVHGEK K Sbjct: 361 KLMSDK-RGKVDLDANTRIDVRCQVHQLAFSPHTDSKGIMDLVNFLAPKHVILVHGEKHK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSR---PDG 1597 M +LKE+I SE I+C PANNE + IPS+H++ A ASD F++SCLSPNF F + D Sbjct: 420 MASLKEKIHSEFGIQCYDPANNETICIPSTHYVNAEASDTFIRSCLSPNFTFQKCNSADI 479 Query: 1598 SRPDTPTS-----LQVYDDRVAQGILTLQPDNQDHPRVVVTRDEM 1717 TP LQV D+RV++G+L ++ + +V +DE+ Sbjct: 480 YNSTTPDKNLMPMLQVEDERVSEGVLVVEKGKK---AKIVHQDEL 521 >gb|AAN87883.1| FEG protein [Arabidopsis thaliana] Length = 613 Score = 792 bits (2046), Expect = 0.0 Identities = 395/623 (63%), Positives = 473/623 (75%), Gaps = 12/623 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAI+CLVLGAGQE+GKSCVVVTINGK+IMFDCGMHMG DH RYP+FSLIS S F +A+ Sbjct: 1 MAIDCLVLGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAI 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFH+DH+GALPYFT+VCGYNGPIYM+YPTKAL+PLMLEDYR+VMVDRRGEEE F Sbjct: 61 SCIIITHFHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 ++ HIA CMKKV A+DLKQTI VD+DLQIRAYYAGHV+GA M YAK+GDAA+VYTGDYNM Sbjct: 121 TTTHIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNM 180 Query: 707 
VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 D HLGAA+IDRLQLDL+I+ESTYATT R SKY REREFL+AVHKCVA GGK LIP+FA Sbjct: 181 TTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELCMLLDDYWER N+KVPIYFS+GLTIQAN+YYK+LI+WTS+ VK+ H HN FD Sbjct: 241 LGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV FDR+LI+APGPCVLFA PGM+ G SLEVFK WAP NLV L GY +AGT+GH Sbjct: 301 FKNVKDFDRSLIHAPGPCVLFAIPGMLCAGLSLEVFKHWAPSPLNLVALLGYSVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLM+ K PT + L K+DVRC++HQ++FSPHTDAKGIMDL KFLSPK+V+LVHGEKP Sbjct: 361 KLMAGK-PTTVDLHNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPS 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M LKE+I SEL I C PAN E V S+ +I+A ASD FL+SC +PNFKFS Sbjct: 420 MMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFLKSCSNPNFKFS------- 472 Query: 1607 DTPTSLQVYDDRVAQGILTLQPDNQDHPRVVVTRDE----MRVENHQVKFVHCCSATHSD 1774 T L+V D R A G+L ++ + +V +DE + +NH V HCC Sbjct: 473 -NSTQLRVTDHRTADGVLVIEKSKK---AKIVHQDEISEVLHEKNHVVSLAHCCPVKVKG 528 Query: 1775 SLSEKDT-LLHLLYGKLSRDFPDCVMHDCQTRVQVGSSFFASLCSREEC-------RIDA 1930 + D L+ L K+ + +H+ + +QV +SF SLC +++C +A Sbjct: 529 ESEDDDVDLIKQLSAKILKTVSGAQIHESENCLQV-ASFKGSLCLKDKCMHRSSSSSSEA 587 Query: 1931 LHFCCTWSSTEDEELAWKIISVV 1999 + CC W S D EL W+II+ + Sbjct: 588 VFLCCNW-SIADLELGWEIINAI 609 >gb|AAS80153.1| ACT11D09.9 [Cucumis melo] Length = 708 Score = 791 bits (2044), Expect = 0.0 Identities = 422/673 (62%), Positives = 488/673 (72%), Gaps = 67/673 (9%) Frame = +2 Query: 194 AGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDALSCVIITHFH 373 AGQEVGKSCVVVTINGKRIMFDCGMH+GY DHRRYPDFS IS S + + LSC+IITHFH Sbjct: 42 AGQEVGKSCVVVTINGKRIMFDCGMHLGYVDHRRYPDFSRISASRDYNNTLSCIIITHFH 101 Query: 374 LDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQFSSEHIAECM 553 
LDHIGALPYFT++CGYNGPIYMTYPT ALAP+ LEDYRKVMVDRRGE EQF+++HI EC+ Sbjct: 102 LDHIGALPYFTEICGYNGPIYMTYPTMALAPITLEDYRKVMVDRRGEAEQFTNDHIMECL 161 Query: 554 KKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNMVPDTHLGAA 733 KKV VDLKQTI VD+DLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM PD HLGAA Sbjct: 162 KKVVPVDLKQTIQVDEDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNMTPDRHLGAA 221 Query: 734 QIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFALGRA-QELC 910 QIDR+QLDL+ITESTYATT RDSKY REREFLKAVH C+A+GGKVLIPTFALGRA QELC Sbjct: 222 QIDRMQLDLLITESTYATTIRDSKYAREREFLKAVHNCLASGGKVLIPTFALGRAQQELC 281 Query: 911 MLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFDFKNVISFD 1090 +LLDDYWER NLK PIY SAGLT+QAN+YYK+LI+WTS+KVK+ + NAFDFKNV FD Sbjct: 282 VLLDDYWERMNLKFPIYVSAGLTVQANMYYKMLISWTSQKVKETYTTRNAFDFKNVQKFD 341 Query: 1091 RALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGHKLMSAKIP 1270 R++I+APGPCVLFATPGMIS GFSLEVFK+WAP + NL+TLPGYC+AGT+GHKLMS K P Sbjct: 342 RSMIDAPGPCVLFATPGMISSGFSLEVFKRWAPSKLNLITLPGYCVAGTVGHKLMSGK-P 400 Query: 1271 TQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPKMGALKERI 1450 T+I LD+D +IDV HQL+FSPHTD+KGIMDLVKFLSPKHVILVHGEKPKM LKERI Sbjct: 401 TKIDLDKDTQIDV----HQLAFSPHTDSKGIMDLVKFLSPKHVILVHGEKPKMAVLKERI 456 Query: 1451 QSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKF---------------- 1582 SEL I C PANNE V I S+ I+A AS F+QSC +PNFKF Sbjct: 457 HSELGIPCHDPANNETVSISSTLSIKAEASSMFIQSCSTPNFKFLKRNLIDKIDPDLKDL 516 Query: 1583 ---------------SRP-------------DGSRPDTPTSLQVYDDRVAQGILTLQPDN 1678 S P D S P LQV DDRV +GIL ++ Sbjct: 517 SYKAVRTSNMLIRECSNPHFKHLNRNLDAKFDSSLSGGP-ELQVSDDRVNEGILVMENGK 575 Query: 1679 QDHPRVVVTRDEMRV----ENHQVKFVHC-------CSATH-SDSLSEKDTLLHLLYGKL 1822 + + +DE+ + + H+V+F HC H DSLS K L L KL Sbjct: 576 KTK---ALHQDELLLLLGEQEHEVRFAHCRPIYFGSLDEIHVMDSLSRKSLWLSQLSFKL 632 Query: 1823 SRDFPDCVMHDCQTRVQVGSSFFASLCSREEC------RID----ALHFCCTWSSTEDEE 1972 S + D + + +QV S S+CS+E C RI+ A+ FCC DE Sbjct: 633 STELSDRNVQNLGEYLQV-ESITLSICSKENCPYRTTNRIENESTAMVFCCCSWLVADEI 691 Query: 1973 LAWKIISVVDNEN 2011 
LAWKIIS+++ + Sbjct: 692 LAWKIISILEKHD 704 >gb|AFK42005.1| unknown [Medicago truncatula] Length = 534 Score = 790 bits (2040), Expect = 0.0 Identities = 382/520 (73%), Positives = 440/520 (84%), Gaps = 9/520 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 M IE LVLGAGQEVGKSCV+V INGKRIMFDCGM M +TDH RYPDF ISDS +F DAL Sbjct: 1 MTIEVLVLGAGQEVGKSCVIVKINGKRIMFDCGMRMRHTDHSRYPDFKKISDSGNFNDAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 C+IITHFHLDH+GAL YFT+VCGY+GP+YMTYP KAL+PLMLEDYRKVMVDRRGEEEQF Sbjct: 61 DCIIITHFHLDHVGALAYFTEVCGYSGPVYMTYPIKALSPLMLEDYRKVMVDRRGEEEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S+HIAECMKKV AVDLKQT+ VD+DLQIRAYYAGHVIGAAMFY KVGDA +VYTGDYNM Sbjct: 121 TSDHIAECMKKVIAVDLKQTVQVDEDLQIRAYYAGHVIGAAMFYVKVGDAEMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRL+LDL+ITESTYATT RDSKY REREFLKAVHKCV+ GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQEL +LLDDYWER NLKVPIYFS+GLTIQAN Y+K+LI WTS+K+KD ++ HNAFD Sbjct: 241 LGRAQELRILLDDYWERMNLKVPIYFSSGLTIQANTYHKMLIGWTSQKIKDTYSTHNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV F+R++++APGPCVLFATPGM+ GGFSLEVFK WAP E+NLV LPGYC+AGT+GH Sbjct: 301 FKNVHKFERSMLDAPGPCVLFATPGMLIGGFSLEVFKHWAPSEKNLVALPGYCMAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 +L S K PT++ D D +IDVRCQIHQL+FS HTD+KGIMDLVKFLSPKHV+LVHG+KPK Sbjct: 361 RLTSGK-PTKVDTDPDTQIDVRCQIHQLAFSAHTDSKGIMDLVKFLSPKHVMLVHGDKPK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDG--- 1597 M +LKERI SEL I C +PANNEIV I S+ ++ A ASD F ++CL+PNFKF + Sbjct: 420 MVSLKERIDSELGIPCSHPANNEIVTISSTQYVNAEASDTFTKNCLNPNFKFQKCSSMDT 479 Query: 1598 ------SRPDTPTSLQVYDDRVAQGILTLQPDNQDHPRVV 1699 R TP LQV D+RVA G+L ++ 
+N ++V Sbjct: 480 CNSTLIDRNLTP-ELQVEDERVADGVLVMENNNNKKAKIV 518 >ref|XP_006574816.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 3-II-like isoform X1 [Glycine max] Length = 532 Score = 787 bits (2032), Expect = 0.0 Identities = 387/525 (73%), Positives = 441/525 (84%), Gaps = 8/525 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAIE LVLGAGQEVGKSCVVVTIN KRIMFDCGMHMGY DHRRYPDF+ IS S AL Sbjct: 1 MAIETLVLGAGQEVGKSCVVVTINAKRIMFDCGMHMGYLDHRRYPDFTRISPSRDLNSAL 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 SC+IITHFHLDH+GAL YFT+V GYNGP+YMTYPTKALAPLMLEDYRKVMVDRRGEEE F Sbjct: 61 SCIIITHFHLDHVGALAYFTEVLGYNGPVYMTYPTKALAPLMLEDYRKVMVDRRGEEELF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 SS+ IAECMKKV AVDL+QT+ V+KDLQIRAYYAGHVIGAAMFYAKVGDA +VYTGDYNM Sbjct: 121 SSDQIAECMKKVIAVDLRQTVQVEKDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRL+LDL+ITESTYATT RDS+Y REREFLKAVHKCV+ GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLRLDLLITESTYATTIRDSRYAREREFLKAVHKCVSCGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LL+DYWER NLKVPIYFSAGLTIQAN YYK+LI WT +K+KD +++HNAFD Sbjct: 241 LGRAQELCILLEDYWERMNLKVPIYFSAGLTIQANAYYKMLIRWTRQKIKDTYSKHNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV F+R++I+APGPCVLFATPGM+SGGFS+EVFK WA E NLV+LPGYC+ GTIGH Sbjct: 301 FKNVQKFERSMIDAPGPCVLFATPGMLSGGFSVEVFKHWAVSENNLVSLPGYCVPGTIGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KLMS K ++ LD + KIDVRCQIHQL+FSPHTD+KGIMDLV FLSPKHVILVHGEK K Sbjct: 361 KLMSDK-HDKVDLDPNTKIDVRCQIHQLAFSPHTDSKGIMDLVNFLSPKHVILVHGEKHK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M +LKE+I SEL I+C PANNE V IPS++++ A SD F++SCLSPNF F + Sbjct: 420 
MASLKEKIHSELGIQCYDPANNETVTIPSANYVYAETSDTFIRSCLSPNFTFQKCSSVDL 479 Query: 1607 DTPTS--------LQVYDDRVAQGILTLQPDNQDHPRVVVTRDEM 1717 T+ LQV D+RVA+G+L L+ + +V +DE+ Sbjct: 480 CNSTTVDRNLMPELQVEDERVAEGVLVLEKGKK---AKIVHQDEL 521 >gb|EPS71695.1| hypothetical protein M569_03060, partial [Genlisea aurea] Length = 470 Score = 784 bits (2024), Expect = 0.0 Identities = 372/470 (79%), Positives = 419/470 (89%), Gaps = 3/470 (0%) Frame = +2 Query: 182 LVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDALSCVII 361 +V GAGQEVGKSCVVVT+NGKRIMFDCGMHMGY DHR+YPDFSLI +S FT+++SC+II Sbjct: 1 IVTGAGQEVGKSCVVVTLNGKRIMFDCGMHMGYLDHRQYPDFSLIPNSHDFTNSISCIII 60 Query: 362 THFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQFSSEHI 541 THFHLDHIGALPYFTQ+CGYNGPIYMTYPTKAL PLMLEDYRKVMVDRRGE+E F+SE I Sbjct: 61 THFHLDHIGALPYFTQICGYNGPIYMTYPTKALGPLMLEDYRKVMVDRRGEKELFTSEDI 120 Query: 542 AECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNMVPDTH 721 CMKKVTAVDLKQT+ VD DLQIRAYYAGHV+GAAMFYAKVGDAA+VYTGDYNM PD H Sbjct: 121 LHCMKKVTAVDLKQTVQVDNDLQIRAYYAGHVLGAAMFYAKVGDAAIVYTGDYNMTPDRH 180 Query: 722 LGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFALGRAQ 901 LGAAQIDRLQLDLVITESTYATT RDSKY REREFL+ VHKCVA GGKVLIPTF LGRAQ Sbjct: 181 LGAAQIDRLQLDLVITESTYATTRRDSKYFREREFLQVVHKCVAGGGKVLIPTFGLGRAQ 240 Query: 902 ELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFDFKNVI 1081 E+CMLLDDYWER NLKVPIY+SAGLT+QAN+YYK+ I WTS+KVKD++ N FDFK+V Sbjct: 241 EICMLLDDYWERMNLKVPIYYSAGLTMQANMYYKVFIGWTSQKVKDSYPTRNPFDFKHVC 300 Query: 1082 SFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGHKLMSA 1261 SFDR+LINAPGPCVLFA+PGMISGG SLEVFKQWAP+E+NL+TLPGYC+AGT+GHKLMS+ Sbjct: 301 SFDRSLINAPGPCVLFASPGMISGGLSLEVFKQWAPFEQNLITLPGYCVAGTVGHKLMSS 360 Query: 1262 K---IPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPKMG 1432 K +Q+++D V+IDVRCQIHQLSFS HTD KGIMDL++FLSPKHVILVHGEKPKM Sbjct: 361 KGKGSCSQMRVDGSVQIDVRCQIHQLSFSHHTDGKGIMDLLRFLSPKHVILVHGEKPKMA 420 Query: 1433 ALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKF 1582 L++ ++SEL I C +P N E V 
IPSS H + +ASDAFLQSCLSPN +F Sbjct: 421 MLRDSVESELGIPCYHPGNGEKVCIPSSRHAKCSASDAFLQSCLSPNLEF 470 >ref|XP_004498247.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation specificity factor subunit 3-II-like [Cicer arietinum] Length = 532 Score = 782 bits (2019), Expect = 0.0 Identities = 383/525 (72%), Positives = 433/525 (82%), Gaps = 8/525 (1%) Frame = +2 Query: 167 MAIECLVLGAGQEVGKSCVVVTINGKRIMFDCGMHMGYTDHRRYPDFSLISDSASFTDAL 346 MAIE LVLGAGQEVGKSCVVV INGKRIMFDCGMHMGYTDHRR+P+F Sbjct: 1 MAIETLVLGAGQEVGKSCVVVNINGKRIMFDCGMHMGYTDHRRFPNFFFFFXDXCCCCLF 60 Query: 347 SCVIITHFHLDHIGALPYFTQVCGYNGPIYMTYPTKALAPLMLEDYRKVMVDRRGEEEQF 526 C I HLDH+GAL YFT+VCGY GP+YMTYPTKAL+PLMLEDYRKVMVDRRGEEEQF Sbjct: 61 VCEIKLCSHLDHVGALVYFTEVCGYRGPVYMTYPTKALSPLMLEDYRKVMVDRRGEEEQF 120 Query: 527 SSEHIAECMKKVTAVDLKQTIHVDKDLQIRAYYAGHVIGAAMFYAKVGDAALVYTGDYNM 706 +S+HIAECMKKV AVDL+QT+ VD+DLQIRAYYAGHVIGAAMFY KVGDA +VYTGDYNM Sbjct: 121 TSDHIAECMKKVIAVDLRQTVQVDEDLQIRAYYAGHVIGAAMFYVKVGDAEMVYTGDYNM 180 Query: 707 VPDTHLGAAQIDRLQLDLVITESTYATTYRDSKYVREREFLKAVHKCVAAGGKVLIPTFA 886 PD HLGAAQIDRL+LDL+ITESTYATT RDSKY REREFLKAVHKCV+ GGKVLIPTFA Sbjct: 181 TPDRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFA 240 Query: 887 LGRAQELCMLLDDYWERTNLKVPIYFSAGLTIQANLYYKILINWTSKKVKDAHARHNAFD 1066 LGRAQELC+LLDDYWER NLKVPIYFSAGLTIQAN+YYK+LI WTS+K+KD ++ HNAFD Sbjct: 241 LGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANMYYKMLIGWTSQKIKDTYSTHNAFD 300 Query: 1067 FKNVISFDRALINAPGPCVLFATPGMISGGFSLEVFKQWAPYEENLVTLPGYCLAGTIGH 1246 FKNV F+R++I+A GPCVLFATPGMISGGFSLEVFK WAP E NLVTLPGYC+AGT+GH Sbjct: 301 FKNVHKFERSMIDATGPCVLFATPGMISGGFSLEVFKHWAPSENNLVTLPGYCVAGTVGH 360 Query: 1247 KLMSAKIPTQIKLDEDVKIDVRCQIHQLSFSPHTDAKGIMDLVKFLSPKHVILVHGEKPK 1426 KL S K PT+I D D +IDVRCQIHQL+FSPHTD+KGIMDLVKFLSPKHVILVHGEKPK Sbjct: 361 KLTSGK-PTKINTDPDTQIDVRCQIHQLAFSPHTDSKGIMDLVKFLSPKHVILVHGEKPK 419 Query: 1427 MGALKERIQSELSIECLYPANNEIVHIPSSHHIEAAASDAFLQSCLSPNFKFSRPDGSRP 1606 M +LKERI SEL I+C PANNE V IPS+ ++ A AS F+++CL+PNF+F + Sbjct: 420 
MASLKERIHSELGIQCYNPANNETVCIPSTQYVNAEASGTFIRNCLNPNFQFQKCSSEDT 479 Query: 1607 DTPT--------SLQVYDDRVAQGILTLQPDNQDHPRVVVTRDEM 1717 T LQV D+RVA G+L ++ + +V RDE+ Sbjct: 480 SNSTMIDKNLTPKLQVEDERVADGVLVMEKSKK---AKIVNRDEL 521