BLASTX nr result
ID: Papaver23_contig00015623
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver23_contig00015623 (3535 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-lik...   849    0.0
ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp....   791    0.0
ref|NP_187802.2| Cleavage and polyadenylation specificity factor...   786    0.0
ref|NP_850565.1| Cleavage and polyadenylation specificity factor...   784    0.0
gb|AAF23212.1|AC016795_25 hypothetical protein [Arabidopsis thal...   741    0.0

>ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
Length = 1376

Score = 849 bits (2194), Expect(2) = 0.0
Identities = 447/740 (60%), Positives = 551/740 (74%), Gaps = 20/740 (2%)
Frame = +3

Query: 1080 IYEMQHVRLQNEVSSISIPQR-ISKYKSSTSVVSLPNISKPCFGLPIGVEISDTFVIGTH 1256 IYE Q++RLQ E+S ISIP++ +K +S+ + S+ N L V VIGTH Sbjct: 647 IYEKQYLRLQYELSCISIPEKHFAKKESNFPMNSVENSIMST--LLNEVSCDTIIVIGTH 704 Query: 1257 RPSVEILSFVPEEGLRIVACGIISLSNTLGTAISGCVPQDVRLVLVDRFYVLSGLRNGML 1436 RPSVEILSFVP GL ++A G ISL N LG A+SGC+PQDVRLVLVDRFYVL+GLRNGML Sbjct: 705 RPSVEILSFVPSIGLTVLASGTISLMNILGNAVSGCIPQDVRLVLVDRFYVLTGLRNGML 764 Query: 1437 LRFEWPNMXXXXXXXXXXXXXXFMISNMAASFVSPSSSNEQCRDSNILEKAE-KTPIHLE 1613 LRFEWP+ F++S S +++ +++ILEK E + P L+ Sbjct: 765 LRFEWPHTATMNSSDMPHTVVPFLLS-------CSDSFSKEFHNADILEKHEDEIPSCLQ 817 Query: 1614 LIAIRRIGVTPVFLVPLCXXXXXXXXXXXXRPWLLQTARHSLSFTSISFQPATHVTPVCS 1793 LIAIRRIG+TPVFLVPL RPWLL +ARHSLS+TSISFQP+THVTPVCS Sbjct: 818 LIAIRRIGITPVFLVPLTDRLDSDIIALSDRPWLLHSARHSLSYTSISFQPSTHVTPVCS 877 Query: 1794
MDCPKGILFVAENRLHLVEMVHSKRLNVQKFSLGGTPRKVVYHSESRLLLVMRTELSGES 1973 DCP G+LFVAE+ LHLVEMVH+KRLNVQKF LGGTPRKV+YHSES+LLLVMRT+L ++ Sbjct: 878 ADCPSGLLFVAESSLHLVEMVHTKRLNVQKFHLGGTPRKVLYHSESKLLLVMRTQLINDT 937 Query: 1974 CSSDICCVDPLSGSLLSSFKLEPGETGKSMQLVKVGDERVLVVGTNRFGGRAIMHTGEAE 2153 SSDICCVDPLSGS+LSS KLE GETGKSM+LV+ G+E+VLVVGT+ G AIM +GEAE Sbjct: 938 SSSDICCVDPLSGSILSSHKLEIGETGKSMELVRNGNEQVLVVGTSLSSGPAIMASGEAE 997 Query: 2154 SSKGRLLVLCLEHTLNSVNSS----------------FREIVGYATEQXXXXXXXXXPED 2285 S+KGRL+VLCLEH NS S FREIVGYATEQ P+D Sbjct: 998 STKGRLIVLCLEHVQNSDTGSMTFCSKAGLSSLQASPFREIVGYATEQLSSSSLCSSPDD 1057 Query: 2286 NGFEGVKLEETEVWQLVLAYQTIIPGVVLAVCPYLDRYFLASAGNFFYLYGFVNENPQRV 2465 +G+KLEETE WQL + Y T +PG+VLA+CPYLDRYFLASAGN FY+ GF N++ QRV Sbjct: 1058 ASSDGIKLEETEAWQLRVVYSTSLPGMVLAICPYLDRYFLASAGNAFYVCGFPNDSFQRV 1117 Query: 2466 RRLAYARTRFTITSLASDFTRIVVGDCRDGVLFYSYDEEPRRLKQLYCDPVQRLVADCTL 2645 +R A RTRF ITSL + RI VGDCRDG+LF+SY E+ ++L+Q+Y DP QRLVADCTL Sbjct: 1118 KRFAVGRTRFMITSLTAHVNRIAVGDCRDGILFFSYQEDAKKLEQIYSDPSQRLVADCTL 1177 Query: 2646 MDMDTAVVSDRKGNLTVLSCPNRVEDNASPECNLTLSCSYYIGETAMSIRKGSYSYKLPV 2825 +D+DTAVVSDRKG++ +LSC +R+EDNASPECNLTL+C+YY+GE AM++RKGS+SYKLP Sbjct: 1178 LDVDTAVVSDRKGSIAILSCSDRLEDNASPECNLTLNCAYYMGEIAMTLRKGSFSYKLPA 1237 Query: 2826 DDTPNGCDIADTILNSSHNSIVASTMLGSVVVFIAISREEFELLKAVQARLIIHPLTAPI 3005 DD GC + + +SSHN+I+AST+LGS+V+F +SR+E+ELL+AVQA+L +HPLT+PI Sbjct: 1238 DDLLRGCAVPGSDFDSSHNTIIASTLLGSIVIFTPLSRDEYELLEAVQAKLAVHPLTSPI 1297 Query: 3006 LGNDHNEFRGRGSAAGVSKMLDGDMLTQFLELTSSQQESVLAVPLGLKET--GASISMPP 3179 LGNDH E+R R + GV K+LDGD+LTQFLELTS QQE VL+ +G +S SMP Sbjct: 1298 LGNDHYEYRSRENPIGVPKILDGDILTQFLELTSMQQELVLSSSVGSLSAVKPSSKSMP- 1356 Query: 3180 SHKSISVNQVVRLLERVHYA 3239 SI +NQVV+LLER+HYA Sbjct: 1357 --ASIPINQVVQLLERIHYA 1374 Score = 473 bits (1217), Expect(2) = 0.0 Identities = 238/375 (63%), Positives = 289/375 (77%), Gaps = 16/375 (4%) Frame = +2 Query: 2 EVPYSYGFAVLFRVGDALLMDLSDPHNPRCVHKICLGLLP------------IEDGVDD- 142 EVP SYGFA+LFRVGDALLMDL D H+P CV++I L P 
++D D+ Sbjct: 277 EVPQSYGFALLFRVGDALLMDLRDVHSPCCVYRIGLHFPPNVEQNFIEESYRVQDADDEG 336 Query: 143 --DVAVRALLELGMEMSKGDDPMIIDNENGQYNSLFKSMCSWSWEPGHNSNPTMFVSLDT 316 +VA ALLEL + DPM ID+++G N+ +CSWSWEPG+N N M +DT Sbjct: 337 LFNVAACALLEL-----RDYDPMCIDSDDGSLNTNQNHVCSWSWEPGNNRNRRMIFCMDT 391 Query: 317 GELLTLEISFESDGSKMNLSEPLYKCLPCKTLLWVKGDFVVALTEMGDGTVLKFEGGKLS 496 G+L +E++F+SDG K+N S LYK P K LLWV+G ++ AL EMGDG VLK E G+L Sbjct: 392 GDLFMIEMNFDSDGLKVNQSACLYKGQPYKALLWVEGGYLAALVEMGDGMVLKLENGRLI 451 Query: 497 YMSPVQNIAPVLD-SVVDYHDDKQDQMFACCGVAPEGSLRIIRSGISVENLLRTAPIYQG 673 Y +P+QNIAP+LD SVVD HD+KQDQMFACCG+APEGSLRIIR+GISVENLLRT+PIYQG Sbjct: 452 YANPIQNIAPILDMSVVDKHDEKQDQMFACCGMAPEGSLRIIRNGISVENLLRTSPIYQG 511 Query: 674 ITGTWTLRMKVLDSFDSFLVLSFVEETRVLSVGLSFSDVTDAVGFQPDACTLACGLVGDG 853 IT WT++MK D++ S+LVLSFVEETRVLSVGLSF DVTD+VGFQ D CTLACGL+ DG Sbjct: 512 ITSIWTIKMKRSDTYHSYLVLSFVEETRVLSVGLSFIDVTDSVGFQSDTCTLACGLLDDG 571 Query: 854 LLVQIHRNAVRLCLPTTIAHPEGIPLSAPICTSWFPENVNISLGAVGQNMIIVATSNPCF 1033 L++QIH+NAVRLCLPT IAH EGI LS+P CTSWFP+N+ ISLGAVG N+I+V+TSNPCF Sbjct: 572 LVIQIHQNAVRLCLPTKIAHSEGIELSSPACTSWFPDNIGISLGAVGHNVIVVSTSNPCF 631 Query: 1034 LFILGARSLSAYHYD 1078 LFILG R +S Y Y+ Sbjct: 632 LFILGVRKVSGYDYE 646 >ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297328597|gb|EFH59016.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 1384 Score = 791 bits (2044), Expect(2) = 0.0 Identities = 424/754 (56%), Positives = 536/754 (71%), Gaps = 34/754 (4%) Frame = +3 Query: 1080 IYEMQHVRLQNEVSSISIPQR-ISKYKSSTSVVSLPNISKPCFGLPIGVEISDTFVIGTH 1256 IYE+Q V LQ EVS IS+PQ+ I K +S S S N K +P G+E +F+IGTH Sbjct: 654 IYEIQRVTLQYEVSCISVPQKHIGKKRSCAS--SPDNSCKAA--IPSGMEQGYSFLIGTH 709 Query: 1257 RPSVEILSFVPEE-GLRIVACGIISLSNTLGTAISGCVPQDVRLVLVDRFYVLSGLRNGM 1433 +PSVE+LSF + G+R++A G++SL+NT+G ISGC+PQDVRLVLVD+ YVLSGLRNGM Sbjct: 710 KPSVEVLSFSEDGVGVRVLASGLVSLTNTMGAVISGCIPQDVRLVLVDQLYVLSGLRNGM 769 Query: 1434 LLRFEWPNMXXXXXXXXXXXXXXFMISNMAASFVSPSSSNEQCRDSN--ILEKAEKTPIH 1607 LLRFEWP + SN AS ++ C++ ++ K + PI+ Sbjct: 770 LLRFEWP-----------------LFSN--ASGLNCPDYFSYCKEEMDIVVGKKDNLPIN 810 Query: 1608 LELIAIRRIGVTPVFLVPLCXXXXXXXXXXXXRPWLLQTARHSLSFTSISFQPATHVTPV 1787 L LIA RRIG+TPVFLVP RPWLLQTAR SLS+TSISFQP+TH TPV Sbjct: 811 LLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPSTHATPV 870 Query: 1788 CSMDCPKGILFVAENRLHLVEMVHSKRLNVQKFSLGGTPRKVVYHSESRLLLVMRTELSG 1967 CS +CP+GILFV+EN LHLVEMVHSKR N QKF LGGTPRKV+YHSES+LL+VMRT+L Sbjct: 871 CSSECPQGILFVSENCLHLVEMVHSKRRNAQKFHLGGTPRKVIYHSESKLLIVMRTDLY- 929 Query: 1968 ESCSSDICCVDPLSGSLLSSFKLEPGETGKSMQLVKVGDERVLVVGTNRFGGRAIMHTGE 2147 ++C+SDICCVDPLSGS+LSS+KL+PGETGKSM+LV+VG+E VLVVGT+ G AI+ +GE Sbjct: 930 DTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGPAILPSGE 989 Query: 2148 AESSKGRLLVLCLEHTLNSVNSS----------------FREIVGYATEQXXXXXXXXXP 2279 AES+KGRL++LCLEHT NS + S FR++VGY TEQ P Sbjct: 990 AESTKGRLIILCLEHTQNSDSGSMTICSKAGSSSQRTSPFRDVVGYTTEQLSSSSHCSSP 1049 Query: 2280 EDNGFEGVKLEETEVWQLVLAYQTIIPGVVLAVCPYLDRYFLASAGNFFYLYGFVNENPQ 2459 +DN ++G+K +E E WQL LA T PG+VLA+CPYLD YFLASAGN FY+ GF N++P+ Sbjct: 1050 DDNSYDGIKFDEAETWQLRLASATTWPGMVLAICPYLDHYFLASAGNAFYVCGFPNDSPE 1109 Query: 2460 RVRRLAYARTRFTITSLASDFTRIVVGDCRDGVLFYSYDEEPRRLKQLYCDPVQRLVADC 2639 R++R A RTRF ITSL + FTRIVVGDCRDGVLFYSY EE ++L Q+YCDP QRLVADC Sbjct: 1110 
RMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPAQRLVADC 1169 Query: 2640 TLMDMDTAVVSDRKGNLTVLSCPNRVE--------------DNASPECNLTLSCSYYIGE 2777 LMD ++ VSDRKG++ +LSC + E + +SPE NL L+C+YY+GE Sbjct: 1170 FLMDANSVAVSDRKGSIAILSCQDHSEFGTKHLAFSPRDDPEYSSPESNLNLNCAYYMGE 1229 Query: 2778 TAMSIRKGSYSYKLPVDDTPNGCDIADTILNSSHNSIVASTMLGSVVVFIAISREEFELL 2957 AM+I+KG YKLP DD ++ +I +++ ++I+A T+LGS+ VF IS EE+ELL Sbjct: 1230 IAMAIKKGCNIYKLPADDVLRSYGLSKSI-DTADDTIIAGTLLGSIFVFAPISSEEYELL 1288 Query: 2958 KAVQARLIIHPLTAPILGNDHNEFRGRGSAAGVSKMLDGDMLTQFLELTSSQQESVLAVP 3137 +AVQA+L IHPLTAP+LGNDHNEFRGR + + +K+LDGDML QFLELT+ QQESVL P Sbjct: 1289 EAVQAKLGIHPLTAPVLGNDHNEFRGRENPSQATKILDGDMLAQFLELTNRQQESVLLTP 1348 Query: 3138 LGLKETGASISMPPSHKSISVNQVVRLLERVHYA 3239 T + S S + ++QVV+LLERVHYA Sbjct: 1349 QPSPSTSKASSKQRSSPPLMLHQVVQLLERVHYA 1382 Score = 440 bits (1132), Expect(2) = 0.0 Identities = 224/372 (60%), Positives = 276/372 (74%), Gaps = 17/372 (4%) Frame = +2 Query: 2 EVPYSYGFAVLFRVGDALLMDLSDPHNPRCVHKICLGLLP--------------IEDGVD 139 EVP+S GFA LFR+GDALLMDL DP NP C+ + L L+P ++DG D Sbjct: 283 EVPHSSGFAFLFRIGDALLMDLRDPQNPCCLFRTSLDLVPASLVEEHFVEESCRVQDGDD 342 Query: 140 D--DVAVRALLELGMEMSKGDDPMIIDNENGQYNSLFKSMCSWSWEPGHNSNPTMFVSLD 313 + +VA ALLEL DPM ID E+ K + SW+WEP +N NP M + LD Sbjct: 343 EGFNVAACALLELS-----DHDPMFIDTESDIGKLSSKHVSSWTWEPENNHNPRMIICLD 397 Query: 314 TGELLTLEISFESDGSKMNLSEPLYKCLPCKTLLWVKGDFVVALTEMGDGTVLKFEGGKL 493 GE E+ +E DG K+NLSE LYK LPCK +LWV+G F+ EM DGTV K KL Sbjct: 398 DGEFYMFELIYEDDGVKVNLSECLYKGLPCKEILWVEGGFLATFAEMADGTVFKLGSEKL 457 Query: 494 SYMSPVQNIAPVLD-SVVDYHDDKQDQMFACCGVAPEGSLRIIRSGISVENLLRTAPIYQ 670 +MS +QNIAP+LD SV+D ++K+DQ+FACCGV EGSLRIIRSGI+VE LL+TAP+YQ Sbjct: 458 HWMSSIQNIAPILDFSVMDDQNEKRDQIFACCGVTREGSLRIIRSGINVEKLLKTAPVYQ 517 Query: 671 GITGTWTLRMKVLDSFDSFLVLSFVEETRVLSVGLSFSDVTDAVGFQPDACTLACGLVGD 850 GITGTWT++MK+ D + SFLVLSFVEETRVLSVGLSF DVTD+VGFQ D CTLACGLV D Sbjct: 518 GITGTWTVKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSVGFQSDVCTLACGLVAD 577 Query: 851 
GLLVQIHRNAVRLCLPTTIAHPEGIPLSAPICTSWFPENVNISLGAVGQNMIIVATSNPC 1030 GLLVQIH++A+RLC+PT AH +GIP+S+P +SWFP+NV+ISLGAVGQN+I+V+TSNPC Sbjct: 578 GLLVQIHQDAIRLCMPTMDAHSDGIPVSSPFFSSWFPDNVSISLGAVGQNLIVVSTSNPC 637 Query: 1031 FLFILGARSLSA 1066 FL ILG +S+S+ Sbjct: 638 FLSILGVKSVSS 649 >ref|NP_187802.2| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] gi|29824376|gb|AAP04148.1| unknown protein [Arabidopsis thaliana] gi|110739103|dbj|BAF01468.1| hypothetical protein [Arabidopsis thaliana] gi|332641608|gb|AEE75129.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] Length = 1379 Score = 786 bits (2030), Expect(2) = 0.0 Identities = 419/751 (55%), Positives = 530/751 (70%), Gaps = 31/751 (4%) Frame = +3 Query: 1080 IYEMQHVRLQNEVSSISIPQR-ISKYKSSTSVVSLPNISKPCFGLPIGVEISDTFVIGTH 1256 IYE+Q V LQ EVS IS+PQ+ I K +S S S N K +P +E TF+IGTH Sbjct: 657 IYEIQRVTLQYEVSCISVPQKHIGKKRSRDS--SPDNFCKAA--IPSAMEQGYTFLIGTH 712 Query: 1257 RPSVEILSFVPEE-GLRIVACGIISLSNTLGTAISGCVPQDVRLVLVDRFYVLSGLRNGM 1433 +PSVE+LSF + G+R++A G++SL+NT+GT ISGC+PQDVRLVLVD+ YVLSGLRNGM Sbjct: 713 KPSVEVLSFTEDGVGVRVLASGLVSLTNTMGTVISGCIPQDVRLVLVDQLYVLSGLRNGM 772 Query: 1434 LLRFEWPNMXXXXXXXXXXXXXXFMISNMAASFVSPSSSN-----EQCRDS--NILEKAE 1592 LLRFEW A F + S N C++ ++ K + Sbjct: 773 LLRFEW------------------------APFSNSSGLNCPDYFSHCKEEMDTVVGKKD 808 Query: 1593 KTPIHLELIAIRRIGVTPVFLVPLCXXXXXXXXXXXXRPWLLQTARHSLSFTSISFQPAT 1772 P++L LIA RRIG+TPVFLVP RPWLLQTAR SLS+TSISFQP+T Sbjct: 809 NLPVNLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPST 868 Query: 1773 HVTPVCSMDCPKGILFVAENRLHLVEMVHSKRLNVQKFSLGGTPRKVVYHSESRLLLVMR 1952 H TPVCS +CP+GILFV+EN LHLVEMVHSKR N QKF LGGTPRKV+YHSES+LL+VMR Sbjct: 869 HATPVCSFECPQGILFVSENCLHLVEMVHSKRRNAQKFQLGGTPRKVIYHSESKLLIVMR 928 Query: 1953 TELSGESCSSDICCVDPLSGSLLSSFKLEPGETGKSMQLVKVGDERVLVVGTNRFGGRAI 2132 T+L ++C+SDICCVDPLSGS+LSS+KL+PGETGKSM+LV+VG+E VLVVGT+ G AI Sbjct: 929 
TDLY-DTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGPAI 987 Query: 2133 MHTGEAESSKGRLLVLCLEHTLNSVNSS----------------FREIVGYATEQXXXXX 2264 + +GEAES+KGR+++LCLEHT NS + S F ++VGY TE Sbjct: 988 LPSGEAESTKGRVIILCLEHTQNSDSGSMTICSKACSSSQRTSPFHDVVGYTTENLSSSS 1047 Query: 2265 XXXXPEDNGFEGVKLEETEVWQLVLAYQTIIPGVVLAVCPYLDRYFLASAGNFFYLYGFV 2444 P+D ++G+KL+E E WQL LA T PG+VLA+CPYLD YFLASAGN FY+ GF Sbjct: 1048 LCSSPDDYSYDGIKLDEAETWQLRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCGFP 1107 Query: 2445 NENPQRVRRLAYARTRFTITSLASDFTRIVVGDCRDGVLFYSYDEEPRRLKQLYCDPVQR 2624 N++P+R++R A RTRF ITSL + FTRIVVGDCRDGVLFYSY EE ++L Q+YCDP QR Sbjct: 1108 NDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPAQR 1167 Query: 2625 LVADCTLMDMDTAVVSDRKGNLTVLSCPNRVE------DNASPECNLTLSCSYYIGETAM 2786 LVADC LMD ++ VSDRKG++ +LSC + + + +SPE NL L+C+YY+GE AM Sbjct: 1168 LVADCFLMDANSVAVSDRKGSIAILSCKDHSDFGMKHLEYSSPESNLNLNCAYYMGEIAM 1227 Query: 2787 SIRKGSYSYKLPVDDTPNGCDIADTILNSSHNSIVASTMLGSVVVFIAISREEFELLKAV 2966 SI+KG YKLP DD ++ +I +++ ++I+A T+LGS+ VF IS EE+ELL+ V Sbjct: 1228 SIKKGCNIYKLPADDVLRSYGLSKSI-DTADDTIIAGTLLGSIFVFAPISSEEYELLEGV 1286 Query: 2967 QARLIIHPLTAPILGNDHNEFRGRGSAAGVSKMLDGDMLTQFLELTSSQQESVLAVPLGL 3146 QA+L IHPLTAP+LGNDHNEFRGR + + K+LDGDML QFLELT+ QQESVL+ P Sbjct: 1287 QAKLGIHPLTAPVLGNDHNEFRGRENPSQARKILDGDMLAQFLELTNRQQESVLSTPQPS 1346 Query: 3147 KETGASISMPPSHKSISVNQVVRLLERVHYA 3239 T + S S + ++QVV+LLERVHYA Sbjct: 1347 PSTSKASSKQRSFPPLMLHQVVQLLERVHYA 1377 Score = 443 bits (1139), Expect(2) = 0.0 Identities = 221/372 (59%), Positives = 276/372 (74%), Gaps = 17/372 (4%) Frame = +2 Query: 2 EVPYSYGFAVLFRVGDALLMDLSDPHNPRCVHKICLGLLP--------------IEDGVD 139 EVP+S GFA LFR+GD LLMDL DP NP C+ + L +P ++DG D Sbjct: 281 EVPHSSGFAFLFRIGDVLLMDLRDPQNPCCLFRTSLDFVPASLMEEHFVEESCRVQDGDD 340 Query: 140 D--DVAVRALLELGMEMSKGDDPMIIDNENGQYNSLFKSMCSWSWEPGHNSNPTMFVSLD 313 + +V V ALLEL + DPM ID E+ K++ SW+WEP +N NP M + LD Sbjct: 341 EGCNVVVCALLELRDHEVRDHDPMFIDTESDIGKLSSKNVSSWTWEPENNHNPRMIICLD 400 Query: 314 
TGELLTLEISFESDGSKMNLSEPLYKCLPCKTLLWVKGDFVVALTEMGDGTVLKFEGGKL 493 G+ E+ +E DG K+NLSE LYK LPCK +LW++G F+ EM DGTV K KL Sbjct: 401 NGDFFMFELIYEDDGVKVNLSECLYKGLPCKDILWIEGGFLATFAEMADGTVFKLGTEKL 460 Query: 494 SYMSPVQNIAPVLD-SVVDYHDDKQDQMFACCGVAPEGSLRIIRSGISVENLLRTAPIYQ 670 +MS +QNIAP+LD SV+D ++K+DQ+FACCGV PEGSLRIIRSGI+VE LL+TAP+YQ Sbjct: 461 HWMSSIQNIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQ 520 Query: 671 GITGTWTLRMKVLDSFDSFLVLSFVEETRVLSVGLSFSDVTDAVGFQPDACTLACGLVGD 850 GITGTWT++MK+ D + SFLVLSFVEETRVLSVGLSF DVTD+VGFQ D CT ACGLV D Sbjct: 521 GITGTWTVKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSVGFQSDVCTFACGLVAD 580 Query: 851 GLLVQIHRNAVRLCLPTTIAHPEGIPLSAPICTSWFPENVNISLGAVGQNMIIVATSNPC 1030 GLLVQIH++A+RLC+PT AH +GIP+S+P +SWFPENV+ISLGAVGQN+I+V+TSNPC Sbjct: 581 GLLVQIHQDAIRLCMPTMDAHSDGIPVSSPFFSSWFPENVSISLGAVGQNLIVVSTSNPC 640 Query: 1031 FLFILGARSLSA 1066 FL ILG +S+S+ Sbjct: 641 FLSILGVKSVSS 652 >ref|NP_850565.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] gi|332641609|gb|AEE75130.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit protein [Arabidopsis thaliana] Length = 1329 Score = 784 bits (2024), Expect(2) = 0.0 Identities = 420/759 (55%), Positives = 530/759 (69%), Gaps = 39/759 (5%) Frame = +3 Query: 1080 IYEMQHVRLQNEVSSISIPQR-ISKYKSSTSVVSLPNISKPCFGLPIGVEISDTFVIGTH 1256 IYE+Q V LQ EVS IS+PQ+ I K +S S S N K +P +E TF+IGTH Sbjct: 599 IYEIQRVTLQYEVSCISVPQKHIGKKRSRDS--SPDNFCKAA--IPSAMEQGYTFLIGTH 654 Query: 1257 RPSVEILSFVPEE-GLRIVACGIISLSNTLGTAISGCVPQDVRLVLVDRFYVLSGLRNGM 1433 +PSVE+LSF + G+R++A G++SL+NT+GT ISGC+PQDVRLVLVD+ YVLSGLRNGM Sbjct: 655 KPSVEVLSFTEDGVGVRVLASGLVSLTNTMGTVISGCIPQDVRLVLVDQLYVLSGLRNGM 714 Query: 1434 LLRFEWPNMXXXXXXXXXXXXXXFMISNMAASFVSPSSSN-----EQCRDS--NILEKAE 1592 LLRFEW A F + S N C++ ++ K + Sbjct: 715 LLRFEW------------------------APFSNSSGLNCPDYFSHCKEEMDTVVGKKD 750 Query: 1593 KTPIHLELIAIRRIGVTPVFLVPLCXXXXXXXXXXXXRPWLLQTARHSLSFTSISFQPAT 1772 P++L LIA RRIG+TPVFLVP RPWLLQTAR 
SLS+TSISFQP+T Sbjct: 751 NLPVNLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPST 810 Query: 1773 HVTPVCSMDCPKGILFVAENRLHLVEMVHSKRLNVQKFSLGGTPRKVVYHSESRLLLVMR 1952 H TPVCS +CP+GILFV+EN LHLVEMVHSKR N QKF LGGTPRKV+YHSES+LL+VMR Sbjct: 811 HATPVCSFECPQGILFVSENCLHLVEMVHSKRRNAQKFQLGGTPRKVIYHSESKLLIVMR 870 Query: 1953 TELSGESCSSDICCVDPLSGSLLSSFKLEPGETGKSMQLVKVGDERVLVVGTNRFGGRAI 2132 T+L ++C+SDICCVDPLSGS+LSS+KL+PGETGKSM+LV+VG+E VLVVGT+ G AI Sbjct: 871 TDLY-DTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGPAI 929 Query: 2133 MHTGEAESSKGRLLVLCLEHTLNSVNSS----------------FREIVGYATEQXXXXX 2264 + +GEAES+KGR+++LCLEHT NS + S F ++VGY TE Sbjct: 930 LPSGEAESTKGRVIILCLEHTQNSDSGSMTICSKACSSSQRTSPFHDVVGYTTENLSSSS 989 Query: 2265 XXXXPEDNGFEGVKLEETEVWQLVLAYQTIIPGVVLAVCPYLDRYFLASAGNFFYLYGFV 2444 P+D ++G+KL+E E WQL LA T PG+VLA+CPYLD YFLASAGN FY+ GF Sbjct: 990 LCSSPDDYSYDGIKLDEAETWQLRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCGFP 1049 Query: 2445 NENPQRVRRLAYARTRFTITSLASDFTRIVVGDCRDGVLFYSYDEEPRRLKQLYCDPVQR 2624 N++P+R++R A RTRF ITSL + FTRIVVGDCRDGVLFYSY EE ++L Q+YCDP QR Sbjct: 1050 NDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPAQR 1109 Query: 2625 LVADCTLMDMDTAVVSDRKGNLTVLSC--------------PNRVEDNASPECNLTLSCS 2762 LVADC LMD ++ VSDRKG++ +LSC P+ + +SPE NL L+C+ Sbjct: 1110 LVADCFLMDANSVAVSDRKGSIAILSCKDHSDFGMKHLVKIPHDNPEYSSPESNLNLNCA 1169 Query: 2763 YYIGETAMSIRKGSYSYKLPVDDTPNGCDIADTILNSSHNSIVASTMLGSVVVFIAISRE 2942 YY+GE AMSI+KG YKLP DD ++ +I +++ ++I+A T+LGS+ VF IS E Sbjct: 1170 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSI-DTADDTIIAGTLLGSIFVFAPISSE 1228 Query: 2943 EFELLKAVQARLIIHPLTAPILGNDHNEFRGRGSAAGVSKMLDGDMLTQFLELTSSQQES 3122 E+ELL+ VQA+L IHPLTAP+LGNDHNEFRGR + + K+LDGDML QFLELT+ QQES Sbjct: 1229 EYELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQARKILDGDMLAQFLELTNRQQES 1288 Query: 3123 VLAVPLGLKETGASISMPPSHKSISVNQVVRLLERVHYA 3239 VL+ P T + S S + ++QVV+LLERVHYA Sbjct: 1289 VLSTPQPSPSTSKASSKQRSFPPLMLHQVVQLLERVHYA 1327 Score = 443 bits (1139), Expect(2) = 0.0 Identities = 221/372 (59%), Positives = 276/372 
(74%), Gaps = 17/372 (4%) Frame = +2 Query: 2 EVPYSYGFAVLFRVGDALLMDLSDPHNPRCVHKICLGLLP--------------IEDGVD 139 EVP+S GFA LFR+GD LLMDL DP NP C+ + L +P ++DG D Sbjct: 223 EVPHSSGFAFLFRIGDVLLMDLRDPQNPCCLFRTSLDFVPASLMEEHFVEESCRVQDGDD 282 Query: 140 D--DVAVRALLELGMEMSKGDDPMIIDNENGQYNSLFKSMCSWSWEPGHNSNPTMFVSLD 313 + +V V ALLEL + DPM ID E+ K++ SW+WEP +N NP M + LD Sbjct: 283 EGCNVVVCALLELRDHEVRDHDPMFIDTESDIGKLSSKNVSSWTWEPENNHNPRMIICLD 342 Query: 314 TGELLTLEISFESDGSKMNLSEPLYKCLPCKTLLWVKGDFVVALTEMGDGTVLKFEGGKL 493 G+ E+ +E DG K+NLSE LYK LPCK +LW++G F+ EM DGTV K KL Sbjct: 343 NGDFFMFELIYEDDGVKVNLSECLYKGLPCKDILWIEGGFLATFAEMADGTVFKLGTEKL 402 Query: 494 SYMSPVQNIAPVLD-SVVDYHDDKQDQMFACCGVAPEGSLRIIRSGISVENLLRTAPIYQ 670 +MS +QNIAP+LD SV+D ++K+DQ+FACCGV PEGSLRIIRSGI+VE LL+TAP+YQ Sbjct: 403 HWMSSIQNIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQ 462 Query: 671 GITGTWTLRMKVLDSFDSFLVLSFVEETRVLSVGLSFSDVTDAVGFQPDACTLACGLVGD 850 GITGTWT++MK+ D + SFLVLSFVEETRVLSVGLSF DVTD+VGFQ D CT ACGLV D Sbjct: 463 GITGTWTVKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSVGFQSDVCTFACGLVAD 522 Query: 851 GLLVQIHRNAVRLCLPTTIAHPEGIPLSAPICTSWFPENVNISLGAVGQNMIIVATSNPC 1030 GLLVQIH++A+RLC+PT AH +GIP+S+P +SWFPENV+ISLGAVGQN+I+V+TSNPC Sbjct: 523 GLLVQIHQDAIRLCMPTMDAHSDGIPVSSPFFSSWFPENVSISLGAVGQNLIVVSTSNPC 582 Query: 1031 FLFILGARSLSA 1066 FL ILG +S+S+ Sbjct: 583 FLSILGVKSVSS 594 >gb|AAF23212.1|AC016795_25 hypothetical protein [Arabidopsis thaliana] gi|10998135|dbj|BAB03106.1| unnamed protein product [Arabidopsis thaliana] Length = 1331 Score = 741 bits (1914), Expect(2) = 0.0 Identities = 404/751 (53%), Positives = 512/751 (68%), Gaps = 31/751 (4%) Frame = +3 Query: 1080 IYEMQHVRLQNEVSSISIPQR-ISKYKSSTSVVSLPNISKPCFGLPIGVEISDTFVIGTH 1256 IYE+Q V LQ EVS IS+PQ+ I K +S S S N K +P +E TF+IGTH Sbjct: 629 IYEIQRVTLQYEVSCISVPQKHIGKKRSRDS--SPDNFCKAA--IPSAMEQGYTFLIGTH 684 Query: 1257 RPSVEILSFVPEE-GLRIVACGIISLSNTLGTAISGCVPQDVRLVLVDRFYVLSGLRNGM 1433 +PSVE+LSF + G+R++A G++SL+NT+GT ISGC+PQDVRLVLVD+ YVLSGLRNGM Sbjct: 685 
KPSVEVLSFTEDGVGVRVLASGLVSLTNTMGTVISGCIPQDVRLVLVDQLYVLSGLRNGM 744 Query: 1434 LLRFEWPNMXXXXXXXXXXXXXXFMISNMAASFVSPSSSN-----EQCRDS--NILEKAE 1592 LLRFEW A F + S N C++ ++ K + Sbjct: 745 LLRFEW------------------------APFSNSSGLNCPDYFSHCKEEMDTVVGKKD 780 Query: 1593 KTPIHLELIAIRRIGVTPVFLVPLCXXXXXXXXXXXXRPWLLQTARHSLSFTSISFQPAT 1772 P++L LIA RRIG+TPVFLVP RPWLLQTAR SLS+TSISFQP+T Sbjct: 781 NLPVNLLLIATRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPST 840 Query: 1773 HVTPVCSMDCPKGILFVAENRLHLVEMVHSKRLNVQKFSLGGTPRKVVYHSESRLLLVMR 1952 H TPV EMVHSKR N QKF LGGTPRKV+YHSES+LL+VMR Sbjct: 841 HATPV--------------------EMVHSKRRNAQKFQLGGTPRKVIYHSESKLLIVMR 880 Query: 1953 TELSGESCSSDICCVDPLSGSLLSSFKLEPGETGKSMQLVKVGDERVLVVGTNRFGGRAI 2132 T+L ++C+SDICCVDPLSGS+LSS+KL+PGETGKSM+LV+VG+E VLVVGT+ G AI Sbjct: 881 TDLY-DTCTSDICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLVVGTSLSSGPAI 939 Query: 2133 MHTGEAESSKGRLLVLCLEHTLNSVNSS----------------FREIVGYATEQXXXXX 2264 + +GEAES+KGR+++LCLEHT NS + S F ++VGY TE Sbjct: 940 LPSGEAESTKGRVIILCLEHTQNSDSGSMTICSKACSSSQRTSPFHDVVGYTTENLSSSS 999 Query: 2265 XXXXPEDNGFEGVKLEETEVWQLVLAYQTIIPGVVLAVCPYLDRYFLASAGNFFYLYGFV 2444 P+D ++G+KL+E E WQL LA T PG+VLA+CPYLD YFLASAGN FY+ GF Sbjct: 1000 LCSSPDDYSYDGIKLDEAETWQLRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCGFP 1059 Query: 2445 NENPQRVRRLAYARTRFTITSLASDFTRIVVGDCRDGVLFYSYDEEPRRLKQLYCDPVQR 2624 N++P+R++R A RTRF ITSL + FTRIVVGDCRDGVLFYSY EE ++L Q+YCDP QR Sbjct: 1060 NDSPERMKRFAVGRTRFMITSLRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPAQR 1119 Query: 2625 LVADCTLMDMDTAVVSDRKGNLTVLSCPNRVE------DNASPECNLTLSCSYYIGETAM 2786 LVADC LMD ++ VSDRKG++ +LSC + + + +SPE NL L+C+YY+GE AM Sbjct: 1120 LVADCFLMDANSVAVSDRKGSIAILSCKDHSDFGMKHLEYSSPESNLNLNCAYYMGEIAM 1179 Query: 2787 SIRKGSYSYKLPVDDTPNGCDIADTILNSSHNSIVASTMLGSVVVFIAISREEFELLKAV 2966 SI+KG YKLP DD ++ +I +++ ++I+A T+LGS+ VF IS EE+ELL+ V Sbjct: 1180 SIKKGCNIYKLPADDVLRSYGLSKSI-DTADDTIIAGTLLGSIFVFAPISSEEYELLEGV 1238 Query: 2967 QARLIIHPLTAPILGNDHNEFRGRGSAAGVSKMLDGDMLTQFLELTSSQQESVLAVPLGL 3146 QA+L IHPLTAP+LGNDHNEFRGR + + 
K+LDGDML QFLELT+ QQESVL+ P Sbjct: 1239 QAKLGIHPLTAPVLGNDHNEFRGRENPSQARKILDGDMLAQFLELTNRQQESVLSTPQPS 1298 Query: 3147 KETGASISMPPSHKSISVNQVVRLLERVHYA 3239 T + S S + ++QVV+LLERVHYA Sbjct: 1299 PSTSKASSKQRSFPPLMLHQVVQLLERVHYA 1329 Score = 443 bits (1139), Expect(2) = 0.0 Identities = 221/372 (59%), Positives = 276/372 (74%), Gaps = 17/372 (4%) Frame = +2 Query: 2 EVPYSYGFAVLFRVGDALLMDLSDPHNPRCVHKICLGLLP--------------IEDGVD 139 EVP+S GFA LFR+GD LLMDL DP NP C+ + L +P ++DG D Sbjct: 253 EVPHSSGFAFLFRIGDVLLMDLRDPQNPCCLFRTSLDFVPASLMEEHFVEESCRVQDGDD 312 Query: 140 D--DVAVRALLELGMEMSKGDDPMIIDNENGQYNSLFKSMCSWSWEPGHNSNPTMFVSLD 313 + +V V ALLEL + DPM ID E+ K++ SW+WEP +N NP M + LD Sbjct: 313 EGCNVVVCALLELRDHEVRDHDPMFIDTESDIGKLSSKNVSSWTWEPENNHNPRMIICLD 372 Query: 314 TGELLTLEISFESDGSKMNLSEPLYKCLPCKTLLWVKGDFVVALTEMGDGTVLKFEGGKL 493 G+ E+ +E DG K+NLSE LYK LPCK +LW++G F+ EM DGTV K KL Sbjct: 373 NGDFFMFELIYEDDGVKVNLSECLYKGLPCKDILWIEGGFLATFAEMADGTVFKLGTEKL 432 Query: 494 SYMSPVQNIAPVLD-SVVDYHDDKQDQMFACCGVAPEGSLRIIRSGISVENLLRTAPIYQ 670 +MS +QNIAP+LD SV+D ++K+DQ+FACCGV PEGSLRIIRSGI+VE LL+TAP+YQ Sbjct: 433 HWMSSIQNIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQ 492 Query: 671 GITGTWTLRMKVLDSFDSFLVLSFVEETRVLSVGLSFSDVTDAVGFQPDACTLACGLVGD 850 GITGTWT++MK+ D + SFLVLSFVEETRVLSVGLSF DVTD+VGFQ D CT ACGLV D Sbjct: 493 GITGTWTVKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSVGFQSDVCTFACGLVAD 552 Query: 851 GLLVQIHRNAVRLCLPTTIAHPEGIPLSAPICTSWFPENVNISLGAVGQNMIIVATSNPC 1030 GLLVQIH++A+RLC+PT AH +GIP+S+P +SWFPENV+ISLGAVGQN+I+V+TSNPC Sbjct: 553 GLLVQIHQDAIRLCMPTMDAHSDGIPVSSPFFSSWFPENVSISLGAVGQNLIVVSTSNPC 612 Query: 1031 FLFILGARSLSA 1066 FL ILG +S+S+ Sbjct: 613 FLSILGVKSVSS 624