BLASTX nr result
ID: Forsythia23_contig00032321
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00032321
         (1524 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011074351.1| PREDICTED: polyadenylation and cleavage fact...   506   e-140
ref|XP_011074350.1| PREDICTED: polyadenylation and cleavage fact...   506   e-140
ref|XP_011074352.1| PREDICTED: polyadenylation and cleavage fact...   496   e-137
ref|XP_010655357.1| PREDICTED: polyadenylation and cleavage fact...   428   e-117
ref|XP_012838214.1| PREDICTED: polyadenylation and cleavage fact...   409   e-111
gb|EYU36382.1| hypothetical protein MIMGU_mgv1a0020322mg, partia...   408   e-111
ref|XP_012091393.1| PREDICTED: polyadenylation and cleavage fact...   399   e-108
ref|XP_002518518.1| conserved hypothetical protein [Ricinus comm...   399   e-108
ref|XP_011000684.1| PREDICTED: polyadenylation and cleavage fact...   395   e-107
ref|XP_007026009.1| PCF11P-similar protein 4, putative isoform 2...   394   e-106
ref|XP_007026008.1| PCF11P-similar protein 4, putative isoform 1...   394   e-106
ref|XP_010275999.1| PREDICTED: uncharacterized protein LOC104610...   392   e-106
ref|XP_010275998.1| PREDICTED: uncharacterized protein LOC104610...   392   e-106
ref|XP_011037705.1| PREDICTED: polyadenylation and cleavage fact...   389   e-105
ref|XP_009601446.1| PREDICTED: uncharacterized protein LOC104096...   387   e-104
ref|XP_009601447.1| PREDICTED: uncharacterized protein LOC104096...   385   e-104
ref|XP_011037707.1| PREDICTED: polyadenylation and cleavage fact...   385   e-104
ref|XP_011037706.1| PREDICTED: polyadenylation and cleavage fact...
385 e-104 ref|XP_011037702.1| PREDICTED: polyadenylation and cleavage fact... 385 e-104 ref|XP_009601448.1| PREDICTED: uncharacterized protein LOC104096... 385 e-104 >ref|XP_011074351.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2 [Sesamum indicum] Length = 964 Score = 506 bits (1303), Expect = e-140 Identities = 283/516 (54%), Positives = 337/516 (65%), Gaps = 11/516 (2%) Frame = -1 Query: 1518 GFGVINKIT----GLRTPTS-QITASSSARESWKFP-----DHLNXXXXXXXXXXXXXXS 1369 G G INKI + P+ I S RES P HLN + Sbjct: 450 GRGSINKIVEVFPNVAGPSDLPIQIPPSFRESLILPHLQSQSHLNVKGGGSFSESRSSLT 509 Query: 1368 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSS 1189 E K +I NF + DGK ST SS +D+ +I +A A T++W PAK Q+ Sbjct: 510 GGEQKLPLIDNFSNTDGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTP 569 Query: 1188 YPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQ 1009 + +PSL+ALP QM +RGQ+ ++ N++ DQGLNK+I+S+Q G T +M Q LP P Q Sbjct: 570 H-MPSLSALPPQMHIRGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQ 628 Query: 1008 RPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGS 829 RP L+P+NLQ +AQ + Q M+QG S+ + P ++ Y A G Sbjct: 629 RPSLIPINLQGTAQPSLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGP 682 Query: 828 P-SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 652 P T+L NIV GVQSSLP+ NAPN S H GT+Q++P Q +G++A Sbjct: 683 PIGTTSLSNIVPGVQSSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVA 742 Query: 651 PSPPAGGALSGLISSLVAQGLISLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTT 472 P+PP GGALSGLISSLVAQGLISLTKQDS+GVEFDQD LKVRHES ITALYADLPRQC T Sbjct: 743 PNPPGGGALSGLISSLVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKT 802 Query: 471 CGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLP 292 CGLRFK QEEHSKHMDWHV KPSP WFVS+SMWL GAEALGTEAVPGFLP Sbjct: 803 CGLRFKSQEEHSKHMDWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLP 862 Query: 291 AENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGM 112 AEN VEK + E+MAVPADE+Q CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS GM Sbjct: 863 
AENTVEKPEDEEMAVPADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGM 922 Query: 111 NRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 +RS+LGPIVHAKCRS+SH IP E+ KDE E TEEG Sbjct: 923 DRSQLGPIVHAKCRSDSHGIPPEE--KDERESTEEG 956 >ref|XP_011074350.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1 [Sesamum indicum] Length = 967 Score = 506 bits (1303), Expect = e-140 Identities = 283/516 (54%), Positives = 337/516 (65%), Gaps = 11/516 (2%) Frame = -1 Query: 1518 GFGVINKIT----GLRTPTS-QITASSSARESWKFP-----DHLNXXXXXXXXXXXXXXS 1369 G G INKI + P+ I S RES P HLN + Sbjct: 453 GRGSINKIVEVFPNVAGPSDLPIQIPPSFRESLILPHLQSQSHLNVKGGGSFSESRSSLT 512 Query: 1368 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSS 1189 E K +I NF + DGK ST SS +D+ +I +A A T++W PAK Q+ Sbjct: 513 GGEQKLPLIDNFSNTDGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTP 572 Query: 1188 YPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQ 1009 + +PSL+ALP QM +RGQ+ ++ N++ DQGLNK+I+S+Q G T +M Q LP P Q Sbjct: 573 H-MPSLSALPPQMHIRGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQ 631 Query: 1008 RPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGS 829 RP L+P+NLQ +AQ + Q M+QG S+ + P ++ Y A G Sbjct: 632 RPSLIPINLQGTAQPSLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGP 685 Query: 828 P-SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 652 P T+L NIV GVQSSLP+ NAPN S H GT+Q++P Q +G++A Sbjct: 686 PIGTTSLSNIVPGVQSSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVA 745 Query: 651 PSPPAGGALSGLISSLVAQGLISLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTT 472 P+PP GGALSGLISSLVAQGLISLTKQDS+GVEFDQD LKVRHES ITALYADLPRQC T Sbjct: 746 PNPPGGGALSGLISSLVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKT 805 Query: 471 CGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLP 292 CGLRFK QEEHSKHMDWHV KPSP WFVS+SMWL GAEALGTEAVPGFLP Sbjct: 806 CGLRFKSQEEHSKHMDWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLP 865 Query: 291 AENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGM 112 AEN VEK + E+MAVPADE+Q CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS GM Sbjct: 866 
AENTVEKPEDEEMAVPADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGM 925 Query: 111 NRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 +RS+LGPIVHAKCRS+SH IP E+ KDE E TEEG Sbjct: 926 DRSQLGPIVHAKCRSDSHGIPPEE--KDERESTEEG 959 >ref|XP_011074352.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X3 [Sesamum indicum] Length = 940 Score = 496 bits (1276), Expect = e-137 Identities = 262/441 (59%), Positives = 311/441 (70%), Gaps = 1/441 (0%) Frame = -1 Query: 1323 DGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSLAALPSQMQM 1144 DGK ST SS +D+ +I +A A T++W PAK Q+ + +PSL+ALP QM + Sbjct: 501 DGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTPH-MPSLSALPPQMHI 559 Query: 1143 RGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQRPGLVPLNLQSSAQA 964 RGQ+ ++ N++ DQGLNK+I+S+Q G T +M Q LP P QRP L+P+NLQ +AQ Sbjct: 560 RGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQRPSLIPINLQGTAQP 619 Query: 963 ARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSP-SDTALHNIVLGVQ 787 + Q M+QG S+ + P ++ Y A G P T+L NIV GVQ Sbjct: 620 SLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGPPIGTTSLSNIVPGVQ 673 Query: 786 SSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALSGLISS 607 SSLP+ NAPN S H GT+Q++P Q +G++AP+PP GGALSGLISS Sbjct: 674 SSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVAPNPPGGGALSGLISS 733 Query: 606 LVAQGLISLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFKRQEEHSKHM 427 LVAQGLISLTKQDS+GVEFDQD LKVRHES ITALYADLPRQC TCGLRFK QEEHSKHM Sbjct: 734 LVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKTCGLRFKSQEEHSKHM 793 Query: 426 DWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVEKKKYEDMAV 247 DWHV KPSP WFVS+SMWL GAEALGTEAVPGFLPAEN VEK + E+MAV Sbjct: 794 DWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLPAENTVEKPEDEEMAV 853 Query: 246 PADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSELGPIVHAKCRS 67 PADE+Q CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS GM+RS+LGPIVHAKCRS Sbjct: 854 PADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGMDRSQLGPIVHAKCRS 913 Query: 66 ESHAIPTEDFTKDEVELTEEG 4 +SH IP E+ KDE E TEEG Sbjct: 914 DSHGIPPEE--KDERESTEEG 932 >ref|XP_010655357.1| PREDICTED: 
polyadenylation and cleavage factor homolog 4 [Vitis vinifera] Length = 1046 Score = 428 bits (1101), Expect = e-117 Identities = 235/460 (51%), Positives = 288/460 (62%), Gaps = 5/460 (1%) Frame = -1 Query: 1368 ACEVKPTIIGNFPSADGKFCRRPDVVSTI-SSMFDSLSPEIPSADAPASTESWLPAKLQS 1192 A E +I N P AD + R P V S + SS +S++ E+ SA APAST W P + Sbjct: 588 AAETISPLISNIPDADAQLRRLPTVASRMGSSSLNSMNVEVQSAAAPASTGMWPPVNVHK 647 Query: 1191 SYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPR 1012 ++ P L+ LP Q+R QFN M++ +V+Q NKS+ + SKLPQ Sbjct: 648 THLPPLLSNLPQTKQIRNQFNLMNATTAVVNQDPNKSLFLPE--------LDSKLPQMAN 699 Query: 1011 QRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHG 832 ++ G +PLN ++ Q R+QP L Q N S++ PLN Y Q H Sbjct: 700 RQAGSIPLNGKNQTQVTRLQPQFL-PQETHGNFVPSTTAPVSSYSVAPPLNPGYTPQGHA 758 Query: 831 SPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 652 + + T L N V GV SS+PI+N N+S+H T+Q I IPQN G + Sbjct: 759 AATSTILLNPVPGVHSSIPIHNISNSSVHFQGGALPPLPPGPPPATSQMINIPQNTGPIV 818 Query: 651 PSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYADLPR 484 + G ALSGLISSL+AQGLISL KQ DS+G+EF+ DLLKVRHESAI+ALY D+ R Sbjct: 819 SNQQPGSALSGLISSLMAQGLISLAKQPTVQDSVGIEFNVDLLKVRHESAISALYGDMSR 878 Query: 483 QCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVP 304 QCTTCGLRFK QEEHS HMDWHV KPS WFVS SMWL AEALGT+AVP Sbjct: 879 QCTTCGLRFKCQEEHSSHMDWHVTKNRISKNRKQKPSRKWFVSASMWLSSAEALGTDAVP 938 Query: 303 GFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGS 124 GFLP E I EKK E++AVPADE+Q CALCGEPFDDFYSDE +EWMYKGAVY+N+P GS Sbjct: 939 GFLPTETIAEKKDDEELAVPADEDQNVCALCGEPFDDFYSDETEEWMYKGAVYLNAPEGS 998 Query: 123 FEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 GM+RS+LGPIVHAKCRSES+ + EDF +DE EEG Sbjct: 999 AAGMDRSQLGPIVHAKCRSESNVVSPEDFGQDEGGNMEEG 1038 >ref|XP_012838214.1| PREDICTED: polyadenylation and cleavage factor homolog 4 [Erythranthe guttatus] Length = 865 Score = 409 bits (1052), Expect = e-111 Identities = 230/451 (50%), Positives = 291/451 (64%), Gaps = 1/451 (0%) Frame = -1 
Query: 1362 EVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPAS-TESWLPAKLQSSY 1186 E+ P + GNF + DGKF R P +DS +PEI SADA A T++W P+K Q+S+ Sbjct: 467 ELNPALTGNFSNTDGKF-RLP---------YDSTAPEIQSADAAAPLTKAWHPSKFQNSH 516 Query: 1185 PLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQR 1006 PSL+ALPSQMQ+RGQF N VDQ +HS+Q+ G +Q+ LP R Sbjct: 517 IRPSLSALPSQMQIRGQFGM----NNAVDQ-----LHSEQQLG----RSQANLPHISSIR 563 Query: 1005 PGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSP 826 PG VP NLQ +AQ P+L + + +A++ P+N+RY P Sbjct: 564 PGPVPANLQHTAQ-----PNLYLPSPYSEHIPS--------NASVPPMNYRYFG-----P 605 Query: 825 SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPS 646 S T N+V G S H GT Q +PI N Q+A + Sbjct: 606 SGTTSSNLVPGFPS-----------FHVPRPTLQSLPRGPFPGTAQPLPIGSNANQVAQN 654 Query: 645 PPAGGALSGLISSLVAQGLISLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCG 466 P AG ALSGLI+SL+AQGLISL+ QDS+GVEFD D+LKVRHESAIT+LYA+LPRQC TCG Sbjct: 655 PSAGPALSGLINSLMAQGLISLSNQDSVGVEFDPDILKVRHESAITSLYAELPRQCKTCG 714 Query: 465 LRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAE 286 LRFK QEEHS HMDWHV KPSP WFV+ +MWL G EA+GTEAVPGF+PAE Sbjct: 715 LRFKSQEEHSSHMDWHVNKNRTLRNRKAKPSPKWFVNAAMWLSGTEAMGTEAVPGFMPAE 774 Query: 285 NIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNR 106 N EK++ E+MAVPADE+Q +CALCGEPF+D+YSD+++EWMYKGAVYM++P G+ GM+R Sbjct: 775 NSAEKEEDEEMAVPADEDQNSCALCGEPFEDYYSDDLEEWMYKGAVYMHAPTGATVGMDR 834 Query: 105 SELGPIVHAKCRSESHAIPTEDFTKDEVELT 13 S+LGPIVHAKC S+SHA+ +E+ KDE + T Sbjct: 835 SQLGPIVHAKCMSDSHAVSSENNKKDEEDST 865 >gb|EYU36382.1| hypothetical protein MIMGU_mgv1a0020322mg, partial [Erythranthe guttata] Length = 571 Score = 408 bits (1049), Expect = e-111 Identities = 229/447 (51%), Positives = 289/447 (64%), Gaps = 1/447 (0%) Frame = -1 Query: 1362 EVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPAS-TESWLPAKLQSSY 1186 E+ P + GNF + DGKF R P +DS +PEI SADA A T++W P+K Q+S+ Sbjct: 160 ELNPALTGNFSNTDGKF-RLP---------YDSTAPEIQSADAAAPLTKAWHPSKFQNSH 209 Query: 1185 
PLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQR 1006 PSL+ALPSQMQ+RGQF N VDQ +HS+Q+ G +Q+ LP R Sbjct: 210 IRPSLSALPSQMQIRGQFGM----NNAVDQ-----LHSEQQLG----RSQANLPHISSIR 256 Query: 1005 PGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSP 826 PG VP NLQ +AQ P+L + + +A++ P+N+RY P Sbjct: 257 PGPVPANLQHTAQ-----PNLYLPSPYSEHIPS--------NASVPPMNYRYFG-----P 298 Query: 825 SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPS 646 S T N+V G S H GT Q +PI N Q+A + Sbjct: 299 SGTTSSNLVPGFPS-----------FHVPRPTLQSLPRGPFPGTAQPLPIGSNANQVAQN 347 Query: 645 PPAGGALSGLISSLVAQGLISLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCG 466 P AG ALSGLI+SL+AQGLISL+ QDS+GVEFD D+LKVRHESAIT+LYA+LPRQC TCG Sbjct: 348 PSAGPALSGLINSLMAQGLISLSNQDSVGVEFDPDILKVRHESAITSLYAELPRQCKTCG 407 Query: 465 LRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAE 286 LRFK QEEHS HMDWHV KPSP WFV+ +MWL G EA+GTEAVPGF+PAE Sbjct: 408 LRFKSQEEHSSHMDWHVNKNRTLRNRKAKPSPKWFVNAAMWLSGTEAMGTEAVPGFMPAE 467 Query: 285 NIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNR 106 N EK++ E+MAVPADE+Q +CALCGEPF+D+YSD+++EWMYKGAVYM++P G+ GM+R Sbjct: 468 NSAEKEEDEEMAVPADEDQNSCALCGEPFEDYYSDDLEEWMYKGAVYMHAPTGATVGMDR 527 Query: 105 SELGPIVHAKCRSESHAIPTEDFTKDE 25 S+LGPIVHAKC S+SHA+ +E+ KDE Sbjct: 528 SQLGPIVHAKCMSDSHAVSSENNKKDE 554 >ref|XP_012091393.1| PREDICTED: polyadenylation and cleavage factor homolog 4 [Jatropha curcas] gi|643703717|gb|KDP20781.1| hypothetical protein JCGZ_21252 [Jatropha curcas] Length = 1029 Score = 399 bits (1025), Expect = e-108 Identities = 240/524 (45%), Positives = 291/524 (55%), Gaps = 19/524 (3%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHL-----------NXXXXXXXXXXXXX 1375 SG G K+ G + +QI AS RE+WK +H N Sbjct: 517 SGRGSTAKLPGFQPERNQIMASHYPREAWKLLNHYPQSTDLNAKGRNREFRMPFSRSVIS 576 Query: 1374 XSACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1195 S + ++ P DG++ R P + S + S AP++ W + Sbjct: 577 SSVSDSLAPLVDKLPDTDGQYVRPPTLPSRVGSSI-----------APSTAGVWPLVNVH 625 Query: 1194 
SSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKS-IHSKQRFGGTNSMAQS--KLP 1024 S+P P P Q Q R QF+ ++ N +V+QGL +S S+Q+F G SM S K P Sbjct: 626 KSHPPPVHPIFPPQKQSRSQFDSTNARNTVVNQGLQQSTFSSEQQFNGFESMEPSLTKQP 685 Query: 1023 QFPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHAT-IQPLNHRYA 847 P + LN Q+ AQ QP L S + N H T + L+ +A Sbjct: 686 LLPSRH---ATLNQQNQAQVNHFQPQFLPSNEARENFPLSISSLP--HQTRVSTLDPVHA 740 Query: 846 AQRHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQN 667 Q HG+ N V LP+NN PNT Q I +PQN Sbjct: 741 TQGHGAAMSMVRSNPV-PFMLPLPVNNIPNT---LQPHAGTRPPLPPGPHPAQMIHVPQN 796 Query: 666 IGQLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALY 499 +G +AP+ P G A SGLI SL+AQGLISLTKQ DS+G+EF+ DL+KVRHESAI+ALY Sbjct: 797 VGPVAPNQPPGSAFSGLIGSLMAQGLISLTKQTPGQDSVGLEFNADLIKVRHESAISALY 856 Query: 498 ADLPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALG 319 ADLPRQCTTCGLRFK QEEHS HMDWHV KPS WFV SMWL GAEALG Sbjct: 857 ADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRMSKNRKHKPSRKWFVDTSMWLSGAEALG 916 Query: 318 TEAVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMN 139 T+AVPGFLP E++VEKK E+MAVPADE Q ACALCGEPFDDFYSDE +EWMYKGAVYMN Sbjct: 917 TDAVPGFLPTESVVEKKDDEEMAVPADEEQNACALCGEPFDDFYSDETEEWMYKGAVYMN 976 Query: 138 SPAGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEE 7 +P GS GM RS+LGPIVHAKCRSES P EDF D+ +EE Sbjct: 977 APNGSTAGMERSQLGPIVHAKCRSESSVAPPEDFRCDDGGDSEE 1020 >ref|XP_002518518.1| conserved hypothetical protein [Ricinus communis] gi|223542363|gb|EEF43905.1| conserved hypothetical protein [Ricinus communis] Length = 1023 Score = 399 bits (1024), Expect = e-108 Identities = 236/522 (45%), Positives = 285/522 (54%), Gaps = 18/522 (3%) Frame = -1 Query: 1518 GFGVINKITGLRTPTSQITASSSARESWKFPDHL------------NXXXXXXXXXXXXX 1375 G G K++G +T +Q S RE+WK P H N Sbjct: 517 GRGSGGKLSGFQTDRNQTMGSRYPREAWKSPHHFSQSADLINAKGRNRDLQMPFSGSGIS 576 Query: 1374 XSACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1195 S E+ +++ P AD + R P + S +SS + A +ST W + Sbjct: 577 
SSGSEILASLVDQLPDADAQIIRPPTLPSRMSS-----------STALSSTGVWPLVNVH 625 Query: 1194 SSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIH-SKQRFGGTNSMAQS--KLP 1024 S+ P P QMQ R +P ++ N V+QG KS S+Q+ G S S K P Sbjct: 626 KSHQPPLRPIFPPQMQSRSLLDPRNASNTAVNQGFQKSSFLSEQQLNGLESKEHSLTKQP 685 Query: 1023 QFPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAA 844 P Q + N Q+ Q QP Q H +HRY Sbjct: 686 LLPSQHAAM---NQQNQGQVNPFQP--------QRENFPPSVASLPPHPLAPTFDHRYVT 734 Query: 843 QRHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNI 664 Q HGS N+V + LP+NN PNT + IPIPQN Sbjct: 735 QAHGSAMSRIHSNLVSSMPLPLPVNNIPNTM--HLQVGVRPPLPPGPPPASHMIPIPQNA 792 Query: 663 GQLAPSPPAGGALSGLISSLVAQGLISLTK---QDSLGVEFDQDLLKVRHESAITALYAD 493 G +A + PAGGA SGLI+SLVAQGLISL + QDS+G+EF+ DLLKVRHESAI+ALYAD Sbjct: 793 GPVASNQPAGGAFSGLINSLVAQGLISLKQTPVQDSVGLEFNADLLKVRHESAISALYAD 852 Query: 492 LPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 313 LPRQCTTCGLRFK QE+HS HMDWHV KPS WFVS +MWL GAEALGT+ Sbjct: 853 LPRQCTTCGLRFKCQEDHSSHMDWHVTRNRMSKNRKQKPSRKWFVSATMWLRGAEALGTD 912 Query: 312 AVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 133 AVPGFLP E +VEKK E+MAVPADE Q ACALCGEPFDDFYSDE +EWMYKGAVY+N+P Sbjct: 913 AVPGFLPTEAVVEKKDDEEMAVPADEEQNACALCGEPFDDFYSDETEEWMYKGAVYLNAP 972 Query: 132 AGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEE 7 +GS M+RS+LGPIVHAKCRSES P ED +E TEE Sbjct: 973 SGSTASMDRSQLGPIVHAKCRSESSVAPPEDIRSNEGPDTEE 1014 >ref|XP_011000684.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like [Populus euphratica] Length = 980 Score = 395 bits (1016), Expect = e-107 Identities = 226/524 (43%), Positives = 292/524 (55%), Gaps = 19/524 (3%) Frame = -1 Query: 1518 GFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPT--- 1348 G G NK+ GL T + I+ S ++E+W FP H+ + + Sbjct: 464 GHGSTNKMPGLLTERNHISGSRYSQEAWNFPPHIRQPSHLLNAKGRGRDFQMPLSGSGVS 523 Query: 1347 ---------IIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1195 ++ P D + R P + S S DS S S+ P + +W P + Sbjct: 524 
SMGGENFNPLVDKLPDMDAQLVRPPAIASRFGSSIDSNSSGTWSSAVPPISGAWPPVNVH 583 Query: 1194 SSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSM--AQSKLP 1024 S P P ++ P + Q RGQF+P+++ + + +Q L K S+ +Q F S K Sbjct: 584 KSLPPPVHSSFPPEKQGRGQFDPVNTNSTVTNQALQKASVMPEQSFNSFESKDYVLMKPT 643 Query: 1023 QFPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAA 844 P Q GL N Q+ A QP L S + N +P+NH Y Sbjct: 644 PLPNQHAGL---NQQNQAHFNPFQPKFLPSHEARENFHPSGIALLPPRRLARPMNHGYTT 700 Query: 843 QRHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNI 664 H S N++ VQ L ++N PNT +H +Q+IP PQN Sbjct: 701 HGHSSS------NVLPAVQLPLAVSNVPNT-LHSQVGVRPTLPQGP----SQTIPFPQNA 749 Query: 663 GQLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYA 496 A + P+G A SGLI+SL+AQGLI++TKQ DS+G+EF+ DLLK+R+ESAI+ALY+ Sbjct: 750 SSGALAQPSGSAFSGLINSLMAQGLITMTKQTPLQDSVGLEFNADLLKLRYESAISALYS 809 Query: 495 DLPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGT 316 DLPRQCTTCGLR K QEEHS HMDWHV PS WFVS SMWL GAEALGT Sbjct: 810 DLPRQCTTCGLRLKCQEEHSSHMDWHVTKNRMSKNRKQNPSRKWFVSASMWLSGAEALGT 869 Query: 315 EAVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNS 136 +AVPGFLP E IVEKK ++MAVPADE Q+ CALCGEPFDDFYSDE +EWMYKGAVY+N+ Sbjct: 870 DAVPGFLPTETIVEKKDDDEMAVPADEEQSTCALCGEPFDDFYSDETEEWMYKGAVYLNA 929 Query: 135 PAGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 GS M+RS+LGPIVHAKCRS+S +P+EDF +E TEEG Sbjct: 930 SDGSTADMDRSQLGPIVHAKCRSDSSGVPSEDFGHEEGGNTEEG 973 >ref|XP_007026009.1| PCF11P-similar protein 4, putative isoform 2 [Theobroma cacao] gi|508781375|gb|EOY28631.1| PCF11P-similar protein 4, putative isoform 2 [Theobroma cacao] Length = 733 Score = 394 bits (1012), Expect = e-106 Identities = 218/446 (48%), Positives = 270/446 (60%), Gaps = 7/446 (1%) Frame = -1 Query: 1347 IIGNFPSADGKFCRRPDVVS-TISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSL 1171 +I P +F R P VV T SS DS++ A P++T W P + S P Sbjct: 275 LIDKLPDGGSQFLRPPAVVPRTGSSSLDSVTVGARPAIIPSTTGVWPPVNVHKSQPPAMH 334 Query: 1170 AALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--KLPQFPRQRPGL 997 
+ Q R QF+ ++ N+++++G NK + ++F S QS ++PQ P QR L Sbjct: 335 SNYSLQQHSRSQFDSINPINMVMNEGPNKRSYMAEQFDRFESKEQSLTRVPQLPDQRAAL 394 Query: 996 VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSPSDT 817 + ++ Q +QPH L SQ ++ N LNH Y Q HG+ Sbjct: 395 ---HQRNQMQVTSLQPHFLPSQDLRENFLSSATAPLPPRLLAPSLNHGYTPQMHGAVISM 451 Query: 816 ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 637 N + Q LPI N P S+ +Q IP QN G L P+ Sbjct: 452 VPSNPIHVAQPPLPIPNMPTVSLQLQGGALPPLPPGPPP-ASQMIPATQNAGPLLPNQAQ 510 Query: 636 GGALSGLISSLVAQGLISLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 469 G SGLISSL+AQGLISLTK QD +G+EF+ DLLKVRHES+I+ALYADLPRQCTTC Sbjct: 511 SGPYSGLISSLMAQGLISLTKPTPIQDPVGLEFNADLLKVRHESSISALYADLPRQCTTC 570 Query: 468 GLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 289 GLRFK QEEHS HMDWHV KPS WFVS SMWL GAEALGT+AVPGFLP Sbjct: 571 GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 630 Query: 288 ENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 109 EN+VEKK E++AVPADE+Q+ CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EGM+ Sbjct: 631 ENVVEKKDDEELAVPADEDQSVCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSIEGMD 690 Query: 108 RSELGPIVHAKCRSESHAIPTEDFTK 31 RS+LGPIVHAKCRSES +P+EDF + Sbjct: 691 RSQLGPIVHAKCRSESSVVPSEDFVR 716 >ref|XP_007026008.1| PCF11P-similar protein 4, putative isoform 1 [Theobroma cacao] gi|508781374|gb|EOY28630.1| PCF11P-similar protein 4, putative isoform 1 [Theobroma cacao] Length = 1004 Score = 394 bits (1012), Expect = e-106 Identities = 218/446 (48%), Positives = 270/446 (60%), Gaps = 7/446 (1%) Frame = -1 Query: 1347 IIGNFPSADGKFCRRPDVVS-TISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSL 1171 +I P +F R P VV T SS DS++ A P++T W P + S P Sbjct: 546 LIDKLPDGGSQFLRPPAVVPRTGSSSLDSVTVGARPAIIPSTTGVWPPVNVHKSQPPAMH 605 Query: 1170 AALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--KLPQFPRQRPGL 997 + Q R QF+ ++ N+++++G NK + ++F S QS ++PQ P QR L Sbjct: 606 SNYSLQQHSRSQFDSINPINMVMNEGPNKRSYMAEQFDRFESKEQSLTRVPQLPDQRAAL 665 Query: 996 
VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSPSDT 817 + ++ Q +QPH L SQ ++ N LNH Y Q HG+ Sbjct: 666 ---HQRNQMQVTSLQPHFLPSQDLRENFLSSATAPLPPRLLAPSLNHGYTPQMHGAVISM 722 Query: 816 ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 637 N + Q LPI N P S+ +Q IP QN G L P+ Sbjct: 723 VPSNPIHVAQPPLPIPNMPTVSLQLQGGALPPLPPGPPP-ASQMIPATQNAGPLLPNQAQ 781 Query: 636 GGALSGLISSLVAQGLISLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 469 G SGLISSL+AQGLISLTK QD +G+EF+ DLLKVRHES+I+ALYADLPRQCTTC Sbjct: 782 SGPYSGLISSLMAQGLISLTKPTPIQDPVGLEFNADLLKVRHESSISALYADLPRQCTTC 841 Query: 468 GLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 289 GLRFK QEEHS HMDWHV KPS WFVS SMWL GAEALGT+AVPGFLP Sbjct: 842 GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 901 Query: 288 ENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 109 EN+VEKK E++AVPADE+Q+ CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EGM+ Sbjct: 902 ENVVEKKDDEELAVPADEDQSVCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSIEGMD 961 Query: 108 RSELGPIVHAKCRSESHAIPTEDFTK 31 RS+LGPIVHAKCRSES +P+EDF + Sbjct: 962 RSQLGPIVHAKCRSESSVVPSEDFVR 987 >ref|XP_010275999.1| PREDICTED: uncharacterized protein LOC104610875 isoform X2 [Nelumbo nucifera] Length = 895 Score = 392 bits (1007), Expect = e-106 Identities = 229/478 (47%), Positives = 281/478 (58%), Gaps = 23/478 (4%) Frame = -1 Query: 1368 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSM------FDSLSPEIPSADAPASTES--- 1216 A + P+ + NF D +F R VVS + S ++LS +P A A Sbjct: 416 AIKKMPSQVDNFLDTDAQFQRFSGVVSRMGSSNRDTMNVEALSTMMPPASALQKHRGQRP 475 Query: 1215 ------WLPAKLQSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGG 1054 W P + S+P P L+ LP Q Q++ Q N M + NKS+ + G Sbjct: 476 SLAPLVWPPVNVPKSHPPPPLSVLPQQNQIKSQSNIMDISRIP-----NKSLTLPGQHLG 530 Query: 1053 T---NSMAQSKLPQFPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXS 883 N++ +KL QFP Q+ GL+ LN +S QA+ + LMSQ Q N + Sbjct: 531 VIERNTLTPTKLLQFPNQQAGLISLNQRSQGQASHLPAQPLMSQNAQENFVPSAVAQMST 590 Query: 882 HATIQPLNHRYAAQRHGSPSDTALHNIVLGV-QSSLPINNAPNTSIHXXXXXXXXXXXXX 706 H QPLNH + Q H 
S + + L N + G+ SS+ I+ NT H Sbjct: 591 HKMEQPLNHGHIPQGHLSVTSSILPNPIPGLASSSVTIHGLSNTPFHLPGRALPPLPPGP 650 Query: 705 XXGTTQSIPIPQNIGQLAPSPPAGGALSGLISSLVAQGLISLTK----QDSLGVEFDQDL 538 ++Q PI QN+G +A +G A SGLISSL+AQGLISLT QDS+GVEF+ DL Sbjct: 651 PPVSSQIEPISQNVGPIATHASSGSAFSGLISSLMAQGLISLTTPASVQDSIGVEFNLDL 710 Query: 537 LKVRHESAITALYADLPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFV 358 LKVRHESAI ALYADLPRQCTTCGLRFK QEEHS HMDWHV KPS WFV Sbjct: 711 LKVRHESAIKALYADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRISKSRKQKPSRKWFV 770 Query: 357 SISMWLHGAEALGTEAVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDE 178 S ++WL GAEALG +AVPGFLP E + EK ++MAVPADENQ CALCGEPFDDFYSDE Sbjct: 771 STNVWLSGAEALGVDAVPGFLPTEAVAEKDD-QEMAVPADENQNVCALCGEPFDDFYSDE 829 Query: 177 MDEWMYKGAVYMNSPAGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 +EWMYKGAVY+N+P G M+RS+LGPIVHAKCRSES +P EDF DE TEEG Sbjct: 830 TEEWMYKGAVYLNAPDGPPADMDRSQLGPIVHAKCRSESTVVPPEDFQLDEGGTTEEG 887 >ref|XP_010275998.1| PREDICTED: uncharacterized protein LOC104610875 isoform X1 [Nelumbo nucifera] Length = 1071 Score = 392 bits (1007), Expect = e-106 Identities = 229/478 (47%), Positives = 281/478 (58%), Gaps = 23/478 (4%) Frame = -1 Query: 1368 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSM------FDSLSPEIPSADAPASTES--- 1216 A + P+ + NF D +F R VVS + S ++LS +P A A Sbjct: 592 AIKKMPSQVDNFLDTDAQFQRFSGVVSRMGSSNRDTMNVEALSTMMPPASALQKHRGQRP 651 Query: 1215 ------WLPAKLQSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGG 1054 W P + S+P P L+ LP Q Q++ Q N M + NKS+ + G Sbjct: 652 SLAPLVWPPVNVPKSHPPPPLSVLPQQNQIKSQSNIMDISRIP-----NKSLTLPGQHLG 706 Query: 1053 T---NSMAQSKLPQFPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXS 883 N++ +KL QFP Q+ GL+ LN +S QA+ + LMSQ Q N + Sbjct: 707 VIERNTLTPTKLLQFPNQQAGLISLNQRSQGQASHLPAQPLMSQNAQENFVPSAVAQMST 766 Query: 882 HATIQPLNHRYAAQRHGSPSDTALHNIVLGV-QSSLPINNAPNTSIHXXXXXXXXXXXXX 706 H QPLNH + Q H S + + L N + G+ SS+ I+ NT H Sbjct: 767 HKMEQPLNHGHIPQGHLSVTSSILPNPIPGLASSSVTIHGLSNTPFHLPGRALPPLPPGP 826 Query: 705 XXGTTQSIPIPQNIGQLAPSPPAGGALSGLISSLVAQGLISLTK----QDSLGVEFDQDL 
538 ++Q PI QN+G +A +G A SGLISSL+AQGLISLT QDS+GVEF+ DL Sbjct: 827 PPVSSQIEPISQNVGPIATHASSGSAFSGLISSLMAQGLISLTTPASVQDSIGVEFNLDL 886 Query: 537 LKVRHESAITALYADLPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFV 358 LKVRHESAI ALYADLPRQCTTCGLRFK QEEHS HMDWHV KPS WFV Sbjct: 887 LKVRHESAIKALYADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRISKSRKQKPSRKWFV 946 Query: 357 SISMWLHGAEALGTEAVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDE 178 S ++WL GAEALG +AVPGFLP E + EK ++MAVPADENQ CALCGEPFDDFYSDE Sbjct: 947 STNVWLSGAEALGVDAVPGFLPTEAVAEKDD-QEMAVPADENQNVCALCGEPFDDFYSDE 1005 Query: 177 MDEWMYKGAVYMNSPAGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 +EWMYKGAVY+N+P G M+RS+LGPIVHAKCRSES +P EDF DE TEEG Sbjct: 1006 TEEWMYKGAVYLNAPDGPPADMDRSQLGPIVHAKCRSESTVVPPEDFQLDEGGTTEEG 1063 >ref|XP_011037705.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X4 [Populus euphratica] Length = 1051 Score = 389 bits (998), Expect = e-105 Identities = 226/523 (43%), Positives = 282/523 (53%), Gaps = 17/523 (3%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G +KI G RT +QI S +E+W FP H++ + + + Sbjct: 526 SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 585 Query: 1341 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1198 + P D + R P + S S DS S S+ P S+ W P Sbjct: 586 SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 645 Query: 1197 QSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSKLPQ 1021 + S P P P Q R QF+P+++ + +++Q L K S +Q F G + + + Sbjct: 646 RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 705 Query: 1020 FPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 841 P LN Q+ A QP L S + N QPLNH Y Sbjct: 706 TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 764 Query: 840 RHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 661 H + N + VQ LP+NN PN +H Q++P QN+ Sbjct: 765 GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 820 Query: 660 
QLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 493 P P+G A SGL +SL+AQGLISLTKQ DS+G+EF+ DLLK+R+ESAI+ALY D Sbjct: 821 SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 880 Query: 492 LPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 313 LPRQCTTCGLRFK QEEHS HMDWHV K S +WFVS SMWL GAEALGT+ Sbjct: 881 LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 940 Query: 312 AVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 133 A PGFLP E VEKK MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS Sbjct: 941 AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 1000 Query: 132 AGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDEVELTEEG 4 GS GM+RS+LGPIVHAKCRS+S +P EDF DE +EEG Sbjct: 1001 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDEGVNSEEG 1043 >ref|XP_009601446.1| PREDICTED: uncharacterized protein LOC104096744 isoform X1 [Nicotiana tomentosiformis] Length = 989 Score = 387 bits (995), Expect = e-104 Identities = 226/510 (44%), Positives = 286/510 (56%), Gaps = 5/510 (0%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G NKITG TS I+ S + K P+++ E K +I Sbjct: 470 SGRGARNKITGYCDETSLISGSPYLQ---KLPENVPLLHQRHLKVEGSGIVTGEPKHPLI 526 Query: 1341 GNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSLAAL 1162 N ADG R P + ++ F+S +I + A W P + + L S + Sbjct: 527 SNLV-ADGHTWRPPYIPPRMNPTFESSVQDIRAVTGRAPIVPWPPTDVHNPQSLTSKPFV 585 Query: 1161 PSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQRPGLVPLNL 982 +R F + N + + L+K + Q+ + S + K PQFP Q P +L Sbjct: 586 LPHQHIRSPFEVKNGSNSVANHNLDKPVLPGQQIDNSKSNSYIKFPQFPSQHPASFSASL 645 Query: 981 QSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSPSDTALHNI 802 Q+ Q A + LL SQ + +H + P+ + Y Q GS T L Sbjct: 646 QNPEQVASAESQLLFSQRMHQTTVPSASLPASNHFLLPPI-YGYNPQGPGSSVGTLLPLP 704 Query: 801 VLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALS 622 V G Q SLP+ N PNTS ++Q P QN+GQ+ P+PPAGG S Sbjct: 705 VSGPQVSLPLVNIPNTSSQFSSGALPPLPRGPLPMSSQFTPTSQNLGQVTPNPPAGG-FS 763 Query: 621 
GLISSLVAQGLISLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFK 454 LISSL+AQGLISLT QDS+G++F+ DLLKVRH+SA+TALYADLPRQCTTCGLRFK Sbjct: 764 SLISSLMAQGLISLTNEAPPQDSVGLDFNPDLLKVRHDSAVTALYADLPRQCTTCGLRFK 823 Query: 453 RQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVE 274 QE HS HMDWHV K S WFVS++MW G EALG++A PGFLPAE +VE Sbjct: 824 CQEAHSSHMDWHVTKNRVSKNRKQKSSRKWFVSVNMWFSGTEALGSDAAPGFLPAEQVVE 883 Query: 273 KKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSELG 94 KK E++AVPAD+ Q CALCGEPFDDFYSDE +EWMYKGAVYMN+P+GS GM +S+LG Sbjct: 884 KKDDEELAVPADDEQNVCALCGEPFDDFYSDETEEWMYKGAVYMNAPSGSTAGMEKSQLG 943 Query: 93 PIVHAKCRSESHAIPTEDFTK-DEVELTEE 7 PI+HAKCRSES A P ED + DE L EE Sbjct: 944 PIIHAKCRSESSATPQEDSRRVDEKFLQEE 973 >ref|XP_009601447.1| PREDICTED: uncharacterized protein LOC104096744 isoform X2 [Nicotiana tomentosiformis] Length = 985 Score = 385 bits (990), Expect = e-104 Identities = 222/505 (43%), Positives = 283/505 (56%), Gaps = 4/505 (0%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G NKITG TS I+ S + K P+++ E K +I Sbjct: 470 SGRGARNKITGYCDETSLISGSPYLQ---KLPENVPLLHQRHLKVEGSGIVTGEPKHPLI 526 Query: 1341 GNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSLAAL 1162 N ADG R P + ++ F+S +I + A W P + + L S + Sbjct: 527 SNLV-ADGHTWRPPYIPPRMNPTFESSVQDIRAVTGRAPIVPWPPTDVHNPQSLTSKPFV 585 Query: 1161 PSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQRPGLVPLNL 982 +R F + N + + L+K + Q+ + S + K PQFP Q P +L Sbjct: 586 LPHQHIRSPFEVKNGSNSVANHNLDKPVLPGQQIDNSKSNSYIKFPQFPSQHPASFSASL 645 Query: 981 QSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSPSDTALHNI 802 Q+ Q A + LL SQ + +H + P+ + Y Q GS T L Sbjct: 646 QNPEQVASAESQLLFSQRMHQTTVPSASLPASNHFLLPPI-YGYNPQGPGSSVGTLLPLP 704 Query: 801 VLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALS 622 V G Q SLP+ N PNTS ++Q P QN+GQ+ P+PPAGG S Sbjct: 705 VSGPQVSLPLVNIPNTSSQFSSGALPPLPRGPLPMSSQFTPTSQNLGQVTPNPPAGG-FS 763 Query: 621 
GLISSLVAQGLISLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFK 454 LISSL+AQGLISLT QDS+G++F+ DLLKVRH+SA+TALYADLPRQCTTCGLRFK Sbjct: 764 SLISSLMAQGLISLTNEAPPQDSVGLDFNPDLLKVRHDSAVTALYADLPRQCTTCGLRFK 823 Query: 453 RQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVE 274 QE HS HMDWHV K S WFVS++MW G EALG++A PGFLPAE +VE Sbjct: 824 CQEAHSSHMDWHVTKNRVSKNRKQKSSRKWFVSVNMWFSGTEALGSDAAPGFLPAEQVVE 883 Query: 273 KKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSELG 94 KK E++AVPAD+ Q CALCGEPFDDFYSDE +EWMYKGAVYMN+P+GS GM +S+LG Sbjct: 884 KKDDEELAVPADDEQNVCALCGEPFDDFYSDETEEWMYKGAVYMNAPSGSTAGMEKSQLG 943 Query: 93 PIVHAKCRSESHAIPTEDFTKDEVE 19 PI+HAKCRSES A P ED + + E Sbjct: 944 PIIHAKCRSESSATPQEDSRRVDEE 968 >ref|XP_011037707.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X6 [Populus euphratica] Length = 886 Score = 385 bits (989), Expect = e-104 Identities = 223/516 (43%), Positives = 278/516 (53%), Gaps = 17/516 (3%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G +KI G RT +QI S +E+W FP H++ + + + Sbjct: 359 SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 418 Query: 1341 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1198 + P D + R P + S S DS S S+ P S+ W P Sbjct: 419 SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 478 Query: 1197 QSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSKLPQ 1021 + S P P P Q R QF+P+++ + +++Q L K S +Q F G + + + Sbjct: 479 RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 538 Query: 1020 FPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 841 P LN Q+ A QP L S + N QPLNH Y Sbjct: 539 TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 597 Query: 840 RHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 661 H + N + VQ LP+NN PN +H Q++P QN+ Sbjct: 598 GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 653 Query: 660 QLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 493 P P+G A SGL 
+SL+AQGLISLTKQ DS+G+EF+ DLLK+R+ESAI+ALY D Sbjct: 654 SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 713 Query: 492 LPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 313 LPRQCTTCGLRFK QEEHS HMDWHV K S +WFVS SMWL GAEALGT+ Sbjct: 714 LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 773 Query: 312 AVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 133 A PGFLP E VEKK MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS Sbjct: 774 AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 833 Query: 132 AGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDE 25 GS GM+RS+LGPIVHAKCRS+S +P EDF DE Sbjct: 834 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDE 869 >ref|XP_011037706.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X5 [Populus euphratica] Length = 1035 Score = 385 bits (989), Expect = e-104 Identities = 223/516 (43%), Positives = 278/516 (53%), Gaps = 17/516 (3%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G +KI G RT +QI S +E+W FP H++ + + + Sbjct: 508 SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 567 Query: 1341 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1198 + P D + R P + S S DS S S+ P S+ W P Sbjct: 568 SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 627 Query: 1197 QSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSKLPQ 1021 + S P P P Q R QF+P+++ + +++Q L K S +Q F G + + + Sbjct: 628 RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 687 Query: 1020 FPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 841 P LN Q+ A QP L S + N QPLNH Y Sbjct: 688 TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 746 Query: 840 RHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 661 H + N + VQ LP+NN PN +H Q++P QN+ Sbjct: 747 GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 802 Query: 660 QLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 493 P P+G A SGL +SL+AQGLISLTKQ DS+G+EF+ DLLK+R+ESAI+ALY D 
Sbjct: 803 SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 862 Query: 492 LPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 313 LPRQCTTCGLRFK QEEHS HMDWHV K S +WFVS SMWL GAEALGT+ Sbjct: 863 LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 922 Query: 312 AVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 133 A PGFLP E VEKK MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS Sbjct: 923 AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 982 Query: 132 AGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDE 25 GS GM+RS+LGPIVHAKCRS+S +P EDF DE Sbjct: 983 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDE 1018 >ref|XP_011037702.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Populus euphratica] gi|743885952|ref|XP_011037703.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Populus euphratica] gi|743885954|ref|XP_011037704.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X3 [Populus euphratica] Length = 1053 Score = 385 bits (989), Expect = e-104 Identities = 223/516 (43%), Positives = 278/516 (53%), Gaps = 17/516 (3%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G +KI G RT +QI S +E+W FP H++ + + + Sbjct: 526 SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 585 Query: 1341 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1198 + P D + R P + S S DS S S+ P S+ W P Sbjct: 586 SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 645 Query: 1197 QSSYPLPSLAALPSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSKLPQ 1021 + S P P P Q R QF+P+++ + +++Q L K S +Q F G + + + Sbjct: 646 RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 705 Query: 1020 FPRQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 841 P LN Q+ A QP L S + N QPLNH Y Sbjct: 706 TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 764 Query: 840 RHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 661 H + N + VQ LP+NN PN +H Q++P 
QN+ Sbjct: 765 GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 820 Query: 660 QLAPSPPAGGALSGLISSLVAQGLISLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 493 P P+G A SGL +SL+AQGLISLTKQ DS+G+EF+ DLLK+R+ESAI+ALY D Sbjct: 821 SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 880 Query: 492 LPRQCTTCGLRFKRQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 313 LPRQCTTCGLRFK QEEHS HMDWHV K S +WFVS SMWL GAEALGT+ Sbjct: 881 LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 940 Query: 312 AVPGFLPAENIVEKKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 133 A PGFLP E VEKK MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS Sbjct: 941 AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 1000 Query: 132 AGSFEGMNRSELGPIVHAKCRSESHAIPTEDFTKDE 25 GS GM+RS+LGPIVHAKCRS+S +P EDF DE Sbjct: 1001 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDE 1036 >ref|XP_009601448.1| PREDICTED: uncharacterized protein LOC104096744 isoform X3 [Nicotiana tomentosiformis] Length = 980 Score = 385 bits (989), Expect = e-104 Identities = 221/498 (44%), Positives = 280/498 (56%), Gaps = 4/498 (0%) Frame = -1 Query: 1521 SGFGVINKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1342 SG G NKITG TS I+ S + K P+++ E K +I Sbjct: 470 SGRGARNKITGYCDETSLISGSPYLQ---KLPENVPLLHQRHLKVEGSGIVTGEPKHPLI 526 Query: 1341 GNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQSSYPLPSLAAL 1162 N ADG R P + ++ F+S +I + A W P + + L S + Sbjct: 527 SNLV-ADGHTWRPPYIPPRMNPTFESSVQDIRAVTGRAPIVPWPPTDVHNPQSLTSKPFV 585 Query: 1161 PSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSKLPQFPRQRPGLVPLNL 982 +R F + N + + L+K + Q+ + S + K PQFP Q P +L Sbjct: 586 LPHQHIRSPFEVKNGSNSVANHNLDKPVLPGQQIDNSKSNSYIKFPQFPSQHPASFSASL 645 Query: 981 QSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQRHGSPSDTALHNI 802 Q+ Q A + LL SQ + +H + P+ + Y Q GS T L Sbjct: 646 QNPEQVASAESQLLFSQRMHQTTVPSASLPASNHFLLPPI-YGYNPQGPGSSVGTLLPLP 704 Query: 801 VLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALS 622 V G Q SLP+ N PNTS ++Q P QN+GQ+ P+PPAGG S Sbjct: 705 
VSGPQVSLPLVNIPNTSSQFSSGALPPLPRGPLPMSSQFTPTSQNLGQVTPNPPAGG-FS 763 Query: 621 GLISSLVAQGLISLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFK 454 LISSL+AQGLISLT QDS+G++F+ DLLKVRH+SA+TALYADLPRQCTTCGLRFK Sbjct: 764 SLISSLMAQGLISLTNEAPPQDSVGLDFNPDLLKVRHDSAVTALYADLPRQCTTCGLRFK 823 Query: 453 RQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVE 274 QE HS HMDWHV K S WFVS++MW G EALG++A PGFLPAE +VE Sbjct: 824 CQEAHSSHMDWHVTKNRVSKNRKQKSSRKWFVSVNMWFSGTEALGSDAAPGFLPAEQVVE 883 Query: 273 KKKYEDMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSELG 94 KK E++AVPAD+ Q CALCGEPFDDFYSDE +EWMYKGAVYMN+P+GS GM +S+LG Sbjct: 884 KKDDEELAVPADDEQNVCALCGEPFDDFYSDETEEWMYKGAVYMNAPSGSTAGMEKSQLG 943 Query: 93 PIVHAKCRSESHAIPTED 40 PI+HAKCRSES A P ED Sbjct: 944 PIIHAKCRSESSATPQED 961