BLASTX nr result
ID: Paeonia22_contig00007463
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00007463 (2362 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]  770   0.0
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...  766   0.0
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...  656   0.0
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...  633   e-178
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...  625   e-176
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]    613   e-172
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...  612   e-172
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...  611   e-172
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...  605   e-170
gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...  590   e-166
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...  581   e-163
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...  577   e-162
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...  572   e-160
ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subuni...  550   e-153
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...  542   e-151
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...  535   e-149
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...  533   e-148
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...  501   e-139
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...
500 e-138 ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [A... 454 e-125 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 770 bits (1988), Expect = 0.0 Identities = 401/663 (60%), Positives = 479/663 (72%), Gaps = 17/663 (2%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M D PI VKDAV+KLQ LL+GI++ENQLFAAGSLMSRSDYEDVV ER++A +CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 +N+LPSER RKG YR+SLKEHKVYDL ETYMYC S CVVNSR+FA SLQEERCSV + + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 I+ IL+LFG+ SLES SELKI+E + KAGEV +E+WIGPSNAIEGYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 644 -KDRNSKPLLLKQRKG---------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 793 +DRN KP +K K D+GKN V ++MDF S I DEYSI+ Sbjct: 181 QRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDT 240 Query: 794 XXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS------ELSIPE 955 D Q ++LE A +QN E KL+ES +S E S E Sbjct: 241 TSHAKSKEPKEKASIGD---QLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAE 297 Query: 956 VPSIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSM 1135 VPS+P Q+GS++ +GK++ TE AQ+G + KSS+K SG KK+ RSVTWAD+K DS Sbjct: 298 VPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKMDSA 357 Query: 1136 NNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVT 1315 ++ + C +RELE KED LG +++GDDDNALRF V SG++ +T Sbjct: 358 DSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMT 417 Query: 1316 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDT 1492 DAV+EAGI+I+P P D DEG+SL+ D D+L P P+KWP KP ++ D F+S +SWYDT Sbjct: 418 DAVSEAGIIILPHPRDMDEGESLK-DADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDT 476 Query: 1493 PPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGR 1672 
PPEGF+LTLSPFATMWMALFAW+TSSS+AYIYGRDESFHEEYLSVNGREYP+KIVLTDGR Sbjct: 477 PPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGR 536 Query: 1673 SSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVI 1852 SSEIKQ LAGCL+RALPGLVA+L LP P+S LE+G+G LL+TMSF DALPSFR +QWQVI Sbjct: 537 SSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVI 596 Query: 1853 ALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQ 2032 LLF+DALSVCRIP LTPHMTSRR KV + AQV +EYEVMKD IIPLGR PQFSAQ Sbjct: 597 VLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQ 656 Query: 2033 RGG 2041 GG Sbjct: 657 SGG 659 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 766 bits (1978), Expect = 0.0 Identities = 399/663 (60%), Positives = 478/663 (72%), Gaps = 17/663 (2%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M D PI VKDAV+KLQ LL+GI++ENQLFAAGSLMSRSDYEDVV ER++A +CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 +N+LPSER RKG YR+SLKEHKVYDL ETYMYC S CVVNSR+FA SLQEERCSV + + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 I+ IL+LFG+ SLES SELKI+E + KAGEV +E+WIGPSNAIEGYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 644 -KDRNSKPLLLKQRKG---------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 793 +DRN KP +K RK D+GKN V ++MDF I DEYSI+ Sbjct: 181 QRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDT 240 Query: 794 XXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS------ELSIPE 955 D Q ++LE A +QN E KL+ES +S E S E Sbjct: 241 TSHAKSKEPKEKASIGD---QLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAE 297 Query: 956 VPSIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSM 1135 VPS+P Q+GS++ +GK++ TE AQ+G + LKS 
+K SG KK+TRSVTWAD+K DS Sbjct: 298 VPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADEKMDSA 357 Query: 1136 NNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVT 1315 ++ + C +RELE KED LG +++GDDDNALRF V SG++ +T Sbjct: 358 DSRDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMT 417 Query: 1316 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDT 1492 DAV+EA I+I+P P D DEG+SL+ D D+L P P+KWP KP ++ D F+S +SWYDT Sbjct: 418 DAVSEARIIILPHPRDMDEGESLK-DADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDT 476 Query: 1493 PPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGR 1672 PPEGF+LTLSPFATMWMALFAW+TSSS+AYIYGRDESFHEEYLSVNGREYP+KIVLTDGR Sbjct: 477 PPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGR 536 Query: 1673 SSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVI 1852 SSEIKQ LAGCLARALPGLVA+L LP P+S LE+G+G LL+TMSF DALPSFR +QWQVI Sbjct: 537 SSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVI 596 Query: 1853 ALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQ 2032 LLF+DALSVC+IP LTPHM S+R KV + AQV +EYEVMKD IIPLGR PQFSAQ Sbjct: 597 VLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQ 656 Query: 2033 RGG 2041 GG Sbjct: 657 SGG 659 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 656 bits (1692), Expect = 0.0 Identities = 344/649 (53%), Positives = 440/649 (67%), Gaps = 10/649 (1%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M ++ + VKD VYKLQ SLL+GI +E+QL AAGSLMSRSDYEDVV ERS++ +CGYPLC Sbjct: 1 MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 NN+LPS+RP KGRYR+SLKEH+VYDLQETYMYC SSC+VNSRAF+ SLQE+RCSV +P K Sbjct: 61 NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++ IL+ F DL+L+S+ S 
LKIQEKS++ G+V LEEWIGPSNAIEGYVP Sbjct: 121 LNEILRKFNDLTLDSEGLGRSGDLGL-SNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179 Query: 644 K-DRNSKPLLLKQRKG--------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXX 796 + DR+ P L ++G + ++ F+D DFTS I DEYSI+ Sbjct: 180 QGDRDPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTSTA 239 Query: 797 XXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQ 976 N ++ + + K +G+ KE K +L+ ++PS Sbjct: 240 SDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVI-KEQLNFQDLPS---- 294 Query: 977 NGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCA 1156 S E + + A + +S+LK S+KSSGAK+ RSVTWAD++ D+ + NLC Sbjct: 295 --SSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAGSRNLCE 352 Query: 1157 IRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAG 1336 ++E+E T E E S GDD + LRF V SG + V A++EAG Sbjct: 353 VQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAG 412 Query: 1337 IVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNL 1513 I+++P D +G ++E++ DM+ A +KWPTKP + D F+ ++SWYD PPEGF+L Sbjct: 413 IIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSL 471 Query: 1514 TLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQA 1693 TLSPFATMWMALFAWVTSSSLAYIYGRDES HE+YLSVNGREYP+KIVL DGRSSEI+ Sbjct: 472 TLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLT 531 Query: 1694 LAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDA 1873 CLAR PGLVA L LP P+S LE+G G LLETMSF DALP+FRT+QWQVIALLF++A Sbjct: 532 AESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEA 591 Query: 1874 LSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQ 2020 LSVCRIP LT +MTSRR LH+VL+GA + +EY++MKDF++PLGR PQ Sbjct: 592 LSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 633 bits (1632), Expect = e-178 Identities = 353/690 (51%), Positives = 445/690 (64%), Gaps = 44/690 (6%) Frame = +2 Query: 101 
TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 280 +M + I V +AV+K+Q LLDGIRDE QL A+GSL+SRSDYEDVV ER+++ CGYPL Sbjct: 54 SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113 Query: 281 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 460 C N LPSE RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A Sbjct: 114 CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173 Query: 461 KIDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 640 K++ IL LFGDL L+ S L+I+E + KA +V L GPSNAIEGYV Sbjct: 174 KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229 Query: 641 P-KDRNSKPLLLKQRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 781 P ++ SKP K K D+ + V N++DF I + DEY I+ Sbjct: 230 PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289 Query: 782 XXXXXXXXXXXXXXXXXXPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 919 + ++TI +M + S Q+ + LKE Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349 Query: 920 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1054 S+ + + SI E+PS Q+G D + E +K+ +K +++ Sbjct: 350 KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409 Query: 1055 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1231 LKSS+KS+GAKKL R VTWAD KKAD+ N NLC ++E+E K DSE GS E G DDN Sbjct: 410 LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469 Query: 1232 LRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 1411 LRF V SG S VTDAV E G++I+P + D+ + + ED DML Sbjct: 470 LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528 Query: 1412 GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 1588 AP+KWP KP + D F ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY Sbjct: 529 ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588 Query: 1589 GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 1768 GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L Sbjct: 589 GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648 Query: 
1769 ERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLE 1948 E+GMGHL++T+SF +ALP+FR +QWQVI LLF+DALSVCRIP LTPHMT+ R LHKVL+ Sbjct: 649 EQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKVLD 708 Query: 1949 GAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 2038 GAQ+ ++EYEVMKD IIPLGRAP FSAQ G Sbjct: 709 GAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 625 bits (1612), Expect = e-176 Identities = 336/667 (50%), Positives = 434/667 (65%), Gaps = 22/667 (3%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M D PI VKDAV+KLQ +LL+GI+ E+QLFAAGSL+SRSDYEDVV ERS+ +VC YPLC Sbjct: 1 MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 N LPSERPRKGRYR+SLKEHKVYDL ETYM+C SSCVVNS+AFA SL+++RC DP K Sbjct: 61 CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++ IL+LFG+ +LE S L+IQ+K+++ EV LE+W+GPSNAIEGYVP Sbjct: 121 LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTET-VTEVSLEQWVGPSNAIEGYVP 179 Query: 644 K--DRNSKPLLLKQRKGDTG--------KNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 793 K D SK +KG KNL+ ++ DF S I + DEYS++ Sbjct: 180 KKRDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDA 239 Query: 794 XXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPC 973 P +H+ + + + L S+ K + ++ Sbjct: 240 TVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLK 299 Query: 974 QNGSDIIATEGKK----DP-------RTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADK 1120 + + A + DP + EKE + KSS+KS+G KKL RSVTWADK Sbjct: 300 GKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWADK 359 Query: 1121 KADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSG 1300 K D +T+LCA +E + K++S+ ++++ DD++ LR V SG Sbjct: 360 KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419 Query: 1301 
QSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQN 1477 S DAV+EAGI+I+P + E +S +DVD+L +KWP KP ++D+D F S + Sbjct: 420 DSDAIDAVSEAGIIILPHTENAVE-ESTVDDVDILETDSVTLKWPRKPGISDFDLFASDD 478 Query: 1478 SWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIV 1657 SW+D PPEGF+LTLSPFAT+W A F+W+TSSSLAYIYGRD SF+EE+LSV+GREYP KIV Sbjct: 479 SWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIV 538 Query: 1658 LTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTR 1837 L+DGRSSEIKQ LA CLARALP +VAEL LP P+S LE+GM LL+TMSF D LP FR + Sbjct: 539 LSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRFK 598 Query: 1838 QWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAP 2017 QWQV+ALLF+DALSVCRIP L +MT RR HKVL G+Q+G++EY V+KD I+PLGRAP Sbjct: 599 QWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLGRAP 658 Query: 2018 QFSAQRG 2038 FS+Q G Sbjct: 659 HFSSQSG 665 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 613 bits (1581), Expect = e-172 Identities = 340/698 (48%), Positives = 428/698 (61%), Gaps = 58/698 (8%) Frame = +2 Query: 119 PIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLP 298 PI VKD VY+LQ SLL G+ E+QLFAAGS+MSRSDY DVV ERS+A +CGYPLC N LP Sbjct: 8 PISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLP 67 Query: 299 SERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRIL 478 S+RPRKGRYR+SLKEHKVYDL ETYMYC S CV+NSR FAASL++ERC+V D A+ID +L Sbjct: 68 SDRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVL 127 Query: 479 KLFGDLS-LESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV-PKDR 652 ++F D S LE + S+LKI+EK+++ G+V LE+W GPSNAIEGYV ++R Sbjct: 128 RMFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRER 187 Query: 653 NSKPLLLKQ-RKGDTGKNLVF-NDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXX 826 K L K ++G N V NDMDF S I DEY+++ Sbjct: 188 KPKELGSKSPKRGSKANNTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVREQEE 247 Query: 827 XXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSDIIATEG 1006 ++F +LE N + L +V S + GS + 
+ Sbjct: 248 ILAKKAMGNEFAVLETSYAPASN----------VSRVGLVFEDVTS-SLRAGSCLSSARA 296 Query: 1007 KKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELEDTKED 1186 +++ +K + ++ +KSS+K S KKL+R+VTWAD+K DS LC IRE+ED KED Sbjct: 297 EEESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIEDMKED 356 Query: 1187 ---------------------------------------------------SESLGSMEI 1213 ++ L + + Sbjct: 357 PSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCNADT 416 Query: 1214 GDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEED 1393 G++D+ RF V S + V DA++EAGI+I+PRP + DEG+ +EED Sbjct: 417 GENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPMEED 476 Query: 1394 VDMLGLGP--APIKWPTKPVTDY-DFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVT 1564 D P APIKWP KP + + D F+ ++SW+D PPE F+LTLSPFA MW ALF W T Sbjct: 477 DDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFTWTT 536 Query: 1565 SSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELG 1744 SS+LAYIYGRDES HEEY VNGREYP+KIV DGRSSEIKQ LAG LARALPGLVA+L Sbjct: 537 SSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVADLR 596 Query: 1745 LPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVCRIPGLTPHMTSRR 1924 L TP+S LE+GMG LL+TMSF DALP FR +QWQVI LLF++ALSV R+P LTPHM RR Sbjct: 597 LSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMMYRR 656 Query: 1925 TSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 2038 HKVL+ AQ+ +EYEVMKD +IPLGR P FSAQ G Sbjct: 657 VLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 612 bits (1579), Expect = e-172 Identities = 337/700 (48%), Positives = 433/700 (61%), Gaps = 55/700 (7%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M D VKD +YKLQ SLLDGI++E+QL AAGS+MS SDYEDVV ER++A +CGYPLC Sbjct: 1 MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60 Query: 284 
NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 N+LPS+RP+KGRYR+SLKEHKVYDL ETYMYC SSCV+NSR F+ SLQEERC V +PAK Sbjct: 61 GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++ +L LF + SL S+ S LKI+EK++ GEV E+WIGPSNAIEGYVP Sbjct: 121 LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180 Query: 644 K----------------------------------------DRNSKPLLLKQRKGDTGKN 703 + + KP KG G Sbjct: 181 QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240 Query: 704 L-----------VFNDMDFTSEIFIG-DEYSIAXXXXXXXXXXXXXXXXXXXXXXXPNDN 847 NDM+FTS I I DEYSI+ + Sbjct: 241 AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300 Query: 848 EHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEV--PSIPCQNGSDIIATEGKKDPR 1021 E+Q + +S ++ + + K ELS ++ P CQ S I E K+ Sbjct: 301 ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360 Query: 1022 TEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELEDTKEDSESLG 1201 +EK A+ +S LK S+K+SGAK+LTRSVTWAD+K S + +LC +R +EDTK E + Sbjct: 361 SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEIVD 420 Query: 1202 SMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDS 1381 +++ DD +F V SG + ++A++EAG+VI+P+PHD D+GD Sbjct: 421 NIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGDP 480 Query: 1382 LEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAW 1558 +E DVD+L + IKWP KP + + F+ +NSWYD PPEGF+L LS FAT+WMALFAW Sbjct: 481 ME-DVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAW 539 Query: 1559 VTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAE 1738 VTSSSLAY+YG+DES HEEYL VNGREYP+KIVL DGRS EI+Q + GCL RA P +VA+ Sbjct: 540 VTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVAD 599 Query: 1739 LGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVCRIPGLTPHMTS 1918 L LP P+S LE+G +LL TMSF DA+P+FR +QWQVIALLF++ALSVCRIP L +M + Sbjct: 600 LRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMDN 659 Query: 1919 
RRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 2038 RR V++G ++ +EYEVMKD +IPLGRAPQFS Q G Sbjct: 660 RR----MVVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 611 bits (1575), Expect = e-172 Identities = 348/670 (51%), Positives = 439/670 (65%), Gaps = 24/670 (3%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M + + VKDAV+KLQ LL+GI+DE+QL AAGSL+SRSDY+DVV ERS+A +CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 +N+LPSER RKG YR+SLKEHKVYDL ETYMYC ++CVVNS AFA SLQ+ER S +PAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++++L LF L L S S+LKIQEK K GEV LEEW+GPSNAIEGYVP Sbjct: 121 LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180 Query: 644 -KDRNSKPLLLKQ-RKGDTGK--------NLVFNDMDFTSEIFIGDEYSIAXXXXXXXXX 793 +DR+ P LLK KG K N++ N+ DF+S I DEYS++ Sbjct: 181 QRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAD 240 Query: 794 XXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIP-----EV 958 ++ + IL Q ++Q + + ++S L + EV Sbjct: 241 SNVKFKETQAKTRYKVRDDDVY-ILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEV 299 Query: 959 PSIPCQ----NGSDIIATEGKKDPRTEKEAQIGD-SMLKSSMKSSGAKKLTRSVTWADKK 1123 S P Q N S +I ++ + K A G+ LKSS+KSS +KK++RSVTWAD+ Sbjct: 300 SSGPSQHDVKNKSVLIMSDDGR-----KYASHGEHDKLKSSLKSSNSKKMSRSVTWADES 354 Query: 1124 ADS---MNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVG 1294 D + I E E + ME ++D++ RF V Sbjct: 355 IDGGIGKKTESSSKISEYESQAYGGSASTDME--ENDDSYRFESAEACAAALSQAAEAVA 412 Query: 1295 SGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFES 1471 SG S V DAV++AGIVI+P + DE L+E +ML L AP+KWP KP + +YD FES Sbjct: 413 SG-SDVPDAVSKAGIVILPPSQEVDEA-ILQETDEMLDLETAPLKWPRKPGMPNYDVFES 470 Query: 1472 
QNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQK 1651 ++SWYD+PPEGFN+TLSPF TM+ +LF W++SSSLA+IYG DES +EEYLS+NGREYP+K Sbjct: 471 EDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRK 530 Query: 1652 IVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFR 1831 IVL+DGRS+EIKQ LAGCLARALPGLVA+L LP P+S LE+GM LL TMSF D LP+FR Sbjct: 531 IVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFR 590 Query: 1832 TRQWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGR 2011 +QWQ+I LLF+DALSVCRIP LTP+MT RRTS KVL+GAQ+ EYE+MKD IIPLGR Sbjct: 591 MKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGR 650 Query: 2012 APQFSAQRGG 2041 PQFS Q GG Sbjct: 651 VPQFSMQSGG 660 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 605 bits (1560), Expect = e-170 Identities = 342/671 (50%), Positives = 438/671 (65%), Gaps = 25/671 (3%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M + + VKDAV+KLQ LL+GI+DENQL AAGSL+SRSDY+DVV ERS+A +CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 +N+LPSER RKG YR+SLKEHKVYDL ETYMYC ++CVVNS AFA SLQ+ER S +PAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAG-EVPLEEWIGPSNAIEGYV 640 ++++L LF L L S S+LKIQEK K G EV LEEW+GPSNAIEGYV Sbjct: 121 LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180 Query: 641 P-KDRNSKPLLLKQ-RKG--------DTGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXX 790 P +DR+ P LLK KG KN++ N+ DF+S I DEYS++ Sbjct: 181 PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNA 240 Query: 791 XXXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIP-----E 955 + +IL + ++Q + + ++S L + E Sbjct: 241 VSSEKFKEAQAKTRY-KVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 299 Query: 956 
VPSIPCQNGSD-----IIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADK 1120 V S P Q+ I++ +G+K + +LKSS+KSS +KK+++SVTWAD+ Sbjct: 300 VSSGPSQHDVKNKSVLIMSDDGRK---YASHGEHDKQLLKSSLKSSNSKKMSQSVTWADE 356 Query: 1121 KADS---MNNTNLCAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXV 1291 D + I E E+ + ME +DD++ RF V Sbjct: 357 IIDGGIGKKTESSSKISEYENQAYGGSASTDME--EDDDSYRFESAEACAAALSQAAEAV 414 Query: 1292 GSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFE 1468 SG S V DAV++AGIVI+P + DE ++ ++ +ML + PAP+KWP KP + +YD FE Sbjct: 415 ASG-SDVPDAVSKAGIVILPTSQEVDE--AILQETEMLDIEPAPLKWPRKPGMPNYDVFE 471 Query: 1469 SQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQ 1648 S++ WYD PPEGFN+TLSPFATM+ +LF W++SSSLA+IYG DE+ +EEYLS+NGREYP Sbjct: 472 SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531 Query: 1649 KIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSF 1828 KIVL+DG S+EIKQ LAGCLARALPGLVA+L LP P+S LE+GM LL TMSF D LP+F Sbjct: 532 KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAF 591 Query: 1829 RTRQWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLG 2008 R +QWQ+I LLF+DALSVCRIP LTP+MT RRTSL KVL+GAQ+ EYE+MKD IIPLG Sbjct: 592 RMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLG 651 Query: 2009 RAPQFSAQRGG 2041 R PQFS Q GG Sbjct: 652 RVPQFSMQSGG 662 >gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus] Length = 597 Score = 590 bits (1522), Expect = e-166 Identities = 325/652 (49%), Positives = 431/652 (66%), Gaps = 14/652 (2%) Frame = +2 Query: 128 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSER 307 VKDAV+KLQ SLL+GI+ E+QL AAGSL+S+SDY+DVV ER++A VCGYPLC N+LPSE Sbjct: 9 VKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPSEP 68 Query: 308 PRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILKLF 487 PRKG YR+SLKEHKVYDL ET+MYC + C++ SRAF ASL+EER S DPAKI+ +LK+F Sbjct: 69 PRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSVLKMF 128 Query: 488 GDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPK-DRNSK- 661 
LSL+S S LKI+EK + +GE+ LEEW+GPSNAI+GYVP+ D+NS+ Sbjct: 129 DGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQNSER 188 Query: 662 --PLLLKQRKGDTGKNLVFN---DMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXX 826 P K NL D++FTS I + DEYS++ Sbjct: 189 KQPSRKKTESNHAKPNLADTLPFDVNFTSTIIMQDEYSVSK------------------- 229 Query: 827 XXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS----ELSIPEVPSIPCQNGSDII 994 T+V + +GK+K KS ++S+ + + P QN + Sbjct: 230 ------------------TAVPREAKGKVKGKMIRKSVKAEKISVLDDTAGPSQNDT--- 268 Query: 995 ATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIRELED 1174 ++LKSS+K+ +KK TRSVTWAD+K+D + ++ RE+ D Sbjct: 269 ------------------TLLKSSLKTLDSKKETRSVTWADEKSDG-DGKSISECREIGD 309 Query: 1175 TKED--SESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIV 1348 K L ++GD+ + RF V SG++ +DAV+EAG++I+ Sbjct: 310 NKGAVVMPHLTDEDVGDE--SYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIIL 367 Query: 1349 PRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSP 1525 P PH+ DE E+ +++ + P +KWP KP + D F+S++SWYD+PPEGFNLTLSP Sbjct: 368 PPPHEVDEA-KYEQIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSP 426 Query: 1526 FATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGC 1705 F+TM+M+LFAW++SSSLAYIYG++E FHE+YLS+NGREYP KI++ DGRS+E+K LAGC Sbjct: 427 FSTMFMSLFAWISSSSLAYIYGKEERFHEDYLSINGREYPPKIII-DGRSAEVKHTLAGC 485 Query: 1706 LARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVC 1885 LARALPGLV+E+ +PTP+S +E+GMG LL+TMSF DALP FR +QWQVIALLF+DALSV Sbjct: 486 LARALPGLVSEIRIPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVS 545 Query: 1886 RIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRGG 2041 RIP L+P+MT RR L KVLEGAQ+ V+E+E+MKD IIPLGR PQFS Q GG Sbjct: 546 RIPALSPYMTGRRILLPKVLEGAQINVEEFEIMKDLIIPLGRVPQFSTQSGG 597 >ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 581 bits (1497), Expect = e-163 Identities = 329/666 (49%), Positives = 417/666 (62%), Gaps = 44/666 (6%) Frame = +2 Query: 
101 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 280 +M + I V +AV+K+Q LLDGIRDE QL A+GSL+SRSDYEDVV ER+++ CGYPL Sbjct: 54 SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113 Query: 281 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 460 C N LPSE RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A Sbjct: 114 CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173 Query: 461 KIDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 640 K++ IL LFGDL L+ S L+I+E + KA +V L GPSNAIEGYV Sbjct: 174 KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229 Query: 641 P-KDRNSKPLLLKQRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 781 P ++ SKP K K D+ + V N++DF I + DEY I+ Sbjct: 230 PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289 Query: 782 XXXXXXXXXXXXXXXXXXPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 919 + ++TI +M + S Q+ + LKE Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349 Query: 920 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1054 S+ + + SI E+PS Q+G D + E +K+ +K +++ Sbjct: 350 KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409 Query: 1055 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1231 LKSS+KS+GAKKL R VTWAD KKAD+ N NLC ++E+E K DSE GS E G DDN Sbjct: 410 LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469 Query: 1232 LRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 1411 LRF V SG S VTDAV E V + ++GD LE + Sbjct: 470 LRFVSAEACAMALSKAAEAVASGDSDVTDAVCE-----VDKEEPMEDGDMLEPET----- 519 Query: 1412 GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 1588 AP+KWP KP + D F ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY Sbjct: 520 --APVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 577 Query: 1589 GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 1768 GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L Sbjct: 578 GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 637 Query: 1769 
ERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLE 1948 E+GMGHL++T+SF +ALP+FR +QWQVI LLF+DALSVCRIP LTPHMT+ R LHKVL+ Sbjct: 638 EQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHKVLD 697 Query: 1949 GAQVGV 1966 GAQ+ + Sbjct: 698 GAQISM 703 >ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] gi|462404075|gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 577 bits (1487), Expect = e-162 Identities = 336/716 (46%), Positives = 433/716 (60%), Gaps = 77/716 (10%) Frame = +2 Query: 122 IPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPS 301 I VKD VYKLQ +LL+GI+ ++ L+ AGS++SRSDY DVV ER++A +CGYPLC+N LPS Sbjct: 13 ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72 Query: 302 E--RPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRI 475 + RP KG YR+SLKEHKVYDL ETYMYC S CV+ S+AFA SL EERC V D K++RI Sbjct: 73 DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132 Query: 476 LKLFGDLSLES-QXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEW--------------- 607 L+ FGD+ + + S+LKI+EK ++ G++ + Sbjct: 133 LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192 Query: 608 IGPSNAIEGYVP-KDRNSKPLLLKQRK-GDTGKN--------LVFNDMDFTSEIFIGDEY 757 +GPSNAIEGYVP K+R SKPL K+ K G GK+ ++FN+MDF S I DEY Sbjct: 193 VGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIITSDEY 252 Query: 758 SIAXXXXXXXXXXXXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKS 937 S++ N N+ S Q+KG K + K Sbjct: 253 SVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSV-------KKSRQSKGG---KNKNVKKD 302 Query: 938 ELSIPEVPSIP-----CQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRS 1102 ++ I EVPS NGS E K++ EK Q G+++L+SS+K SG KKL RS Sbjct: 303 DVCIREVPSTSDASQTVLNGS---TKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRS 359 Query: 1103 VTWADKKADSMNNTNLCAIRELEDTKE--------------------------------- 1183 VTWAD+ DS + NL +RE+E E Sbjct: 360 VTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKS 419 Query: 1184 ----------DSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEA 1333 D++ LGS+++ 
+++ V SG+S V+ AV+ A Sbjct: 420 KNICEVREVQDADVLGSLDLQENEI---LESAEACAMALNQAAEAVASGESDVSGAVSGA 476 Query: 1334 GIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFN 1510 GI+I+PRP DE + E DVDML AP+ WP KP + D F+ ++SW+D PPEGF+ Sbjct: 477 GIIILPRPDGLDEEEPTE-DVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFS 534 Query: 1511 LTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQ 1690 +TLSPFATMW +LF W+TSS+LAYIYGRDESFHEE+LSVNGREYP KIVL GRSSEIK+ Sbjct: 535 VTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKK 594 Query: 1691 ALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMD 1870 L ARALPG+V+EL LPTP+S LE+GMG +L TMSF DA+P+FR +QWQVI LLF++ Sbjct: 595 TLDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLE 654 Query: 1871 ALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 2038 LSVCRIP LTPHMT+RR +KVLE Q+ ++YE+MKD IIPLGRAPQFSAQ G Sbjct: 655 GLSVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 572 bits (1475), Expect = e-160 Identities = 324/670 (48%), Positives = 418/670 (62%), Gaps = 29/670 (4%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M + + +KD VYKLQ +L +GI++ENQLFAAGSLMSRSDYEDVV ERS+A +CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 ++ LPS+ R+GRYR+SLKEHKVYDL+ETY YC S+C++NSRAF+ LQ+ERCSV +P K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 + ILKLF ++SL+S+ S L+IQEK +S GEVP+EEW+GPSNAIEGYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 644 KDRNSKPLLLKQRKGDTGKNL-------------VFNDMDFTSEIFIGDEYSIAXXXXXX 784 R+ K + L + G K+ F+D TS I +EYS++ Sbjct: 178 H-RDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGL 236 Query: 785 
XXXXXXXXXXXXXXXXXPNDNEHQFTILEMQ------ATSVQNKGEG---KLKESSCGKS 937 ++ QF ILE SV K G + K S+ +S Sbjct: 237 KEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKES 296 Query: 938 ELSIPEVPSIPCQNGSDI-IATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWA 1114 ++ + PS ++ + TE +PR + + LKSS+K G K L RSVTWA Sbjct: 297 TDNLSDAPSTSKNRSTNFNLMTE---EPRGGFN-DLSGTELKSSLKKPGKKNLCRSVTWA 352 Query: 1115 DKKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA--LRFXXXXXXXXXXXXXXXX 1288 D+K D + NL + E+ TKE S + ++ D+DN LR Sbjct: 353 DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412 Query: 1289 VGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP----VTDY 1456 + SGQS V+DAV+EAGI+I+P P D +E E D + P + K V Sbjct: 413 ITSGQSEVSDAVSEAGIIILPHPSDANE----EASTDPVNASE-PHSFSEKSNKLGVLRS 467 Query: 1457 DFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGR 1636 D F+ +SWYD PPEGF+LTLS FATMWMA+FAWVTSSSLAYIYG+D+ FHEE+L ++G+ Sbjct: 468 DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527 Query: 1637 EYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDA 1816 EYP KIV DGRSSEIKQ LAGCL RA+PGL +EL L TP+S LE GM HLL+TM+F DA Sbjct: 528 EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587 Query: 1817 LPSFRTRQWQVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFI 1996 LP+FR +QWQVI LLF++ALSV RIP L HM+S R HKVL+ AQ+ DEYE+M+D I Sbjct: 588 LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647 Query: 1997 IPLGRAPQFS 2026 +PLGR Q S Sbjct: 648 LPLGRTAQLS 657 >ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Fragaria vesca subsp. 
vesca] Length = 692 Score = 550 bits (1416), Expect = e-153 Identities = 325/717 (45%), Positives = 421/717 (58%), Gaps = 80/717 (11%) Frame = +2 Query: 128 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSE- 304 V DAVYKLQ +LLD ++ ++L+ AGS++SRSDY DVV ERS+A +CGYPLC+N LP E Sbjct: 13 VNDAVYKLQLALLDSVKTLDRLYLAGSIISRSDYTDVVTERSIADLCGYPLCSNALPPEA 72 Query: 305 -RPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILK 481 R RKG YR+SLKEHKVYDL+ET +YC S CV++S+AFA L EERC V D K++R+L+ Sbjct: 73 SRTRKGHYRISLKEHKVYDLRETKLYCSSKCVIDSKAFAQGLSEERCDVLDLGKVERVLR 132 Query: 482 LFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPK-DRNS 658 FG+ E S LKI+EKS + +G+V E GPSNAIEGYVP+ DR S Sbjct: 133 EFGEEKKE-------IGDLGLSSLKIEEKSGTYSGKV---EEFGPSNAIEGYVPRRDRVS 182 Query: 659 KPLLLKQRKGDT----------GKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXX 808 K K+ K + GK L+ NDMDF S + DEYS++ Sbjct: 183 KASGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNVDTE 242 Query: 809 XXXXXXXXXPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSD 988 D E F++LE AT NK EG + G S L I Sbjct: 243 LKKSKG----KDLESGFSVLETSATP--NKSEGVMDVGDLGMSRLKI------------- 283 Query: 989 IIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNNTNLCAIREL 1168 E +++ + K + + L+SS+K SG KKL+RSVTWAD+K+DS NLC +R++ Sbjct: 284 ----EAEEESQVGKGEKSSEGTLRSSLKHSGTKKLSRSVTWADEKSDSTGRRNLCEVRDM 339 Query: 1169 E-----------------------------------------------DTKEDSESLGSM 1207 E D KE E +GS Sbjct: 340 EDGLENPGAFDSLYKPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEVPEVVGSS 399 Query: 1208 EIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPH--DE----- 1366 + ++ F V +G+ +DAV++AGI+I+PR DE Sbjct: 400 VVQGNE---WFESAEACAVALSEAAGAVETGEFDTSDAVSKAGIIILPRTDGVDEEEFIV 456 Query: 1367 ---DEGDSLE---------EDVDMLGLGPAPIKWPTKPVTD-YDFFESQNSWYDTPPEGF 1507 DE DS+E ED+DML A KWP KP + +D F ++SW+D PP+GF Sbjct: 457 DGADEEDSIEDSVDEEESTEDIDMLEPEQALSKWPKKPESSQFDLFNPEDSWFDAPPDGF 516 Query: 1508 NLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIK 1687 NLTLSPFATMW ALF W TSS+LAYIYG+D+SFHEE+L+VNGR 
YP KIVL DGRSSEIK Sbjct: 517 NLTLSPFATMWNALFTWTTSSTLAYIYGKDDSFHEEFLNVNGRSYPHKIVLADGRSSEIK 576 Query: 1688 QALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFM 1867 + L+RALP +VAELGL P LE+GMG +L TMSF +ALP+FR +QWQVIALLF+ Sbjct: 577 LTVGASLSRALPEIVAELGLAVP--NLEKGMGFMLNTMSFIEALPAFRMKQWQVIALLFI 634 Query: 1868 DALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQFSAQRG 2038 + LSVCR+P LTPHMT+RR + +VL+GA++ V+EYE+MKDF+IPLGRAPQF++Q G Sbjct: 635 EGLSVCRMPALTPHMTNRRVLIQRVLDGARISVEEYEIMKDFLIPLGRAPQFASQSG 691 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 542 bits (1396), Expect = e-151 Identities = 309/661 (46%), Positives = 402/661 (60%), Gaps = 20/661 (3%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M + + +KD VYKLQ +L +GI++ENQLFAAGSLMSRSDYEDVV ERS+A +CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 ++ LPS+ R+GRYR+SLKEHKVYDL+ETY YC S+C++NSRAF+ LQ+ERCSV +P K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 + ILKLF ++SL+S+ S L+IQEK +S GEVP+EEW+GPSNAIEGYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 644 KDRNSKPLLLKQRKGDTGKNL-------------VFNDMDFTSEIFIGDEYSIAXXXXXX 784 R+ K + L + G K+ F+D FTS I +EYS++ Sbjct: 178 H-RDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGL 236 Query: 785 XXXXXXXXXXXXXXXXXPNDNEHQFTILEM-QATSVQNKGEGKLKESSCGKSELSIPEVP 961 + QF ILE A + G+ S ++++S Sbjct: 237 KEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVS----- 291 Query: 962 SIPCQNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKKADSMNN 1141 AT+ D + D+ S+ +S+ +T D+K D + Sbjct: 292 -----------ATKESTD-------NLSDAPSTSNNRSTNFNLMTEEPR--DEKTDDASI 331 Query: 1142 TNLCAIRELEDTKEDSESLGSMEIGDDDNA--LRFXXXXXXXXXXXXXXXXVGSGQSGVT 
1315 NL + E+ TKE S + ++ D+DN LR + SGQS V+ Sbjct: 332 MNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSEVS 391 Query: 1316 DAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP----VTDYDFFESQNSW 1483 DAV+EAGI+I+P P D +E E D + P + K V D F+ +SW Sbjct: 392 DAVSEAGIIILPHPSDANE----EASTDPVNASE-PHSFSEKSNKLGVLRSDLFDPSDSW 446 Query: 1484 YDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLT 1663 YD PPEGF+LTLS FATMWMA+FAWVTSSSLAYIYG+D+ FHEE+L ++G+EYP KIV Sbjct: 447 YDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSA 506 Query: 1664 DGRSSEIKQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQW 1843 DGRSSEIKQ LAGCL RA+PGL +EL L TP+S LE GM HLL+TM+F DALP+FR +QW Sbjct: 507 DGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQW 566 Query: 1844 QVIALLFMDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLGRAPQF 2023 QVI LLF++ALSV RIP L HM+S R HKVL+ AQ+ DEYE+M+D I+PLGR Q Sbjct: 567 QVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQL 626 Query: 2024 S 2026 S Sbjct: 627 S 627 >ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao] gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 535 bits (1377), Expect = e-149 Identities = 303/627 (48%), Positives = 391/627 (62%), Gaps = 44/627 (7%) Frame = +2 Query: 101 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 280 +M + I V +AV+K+Q LLDGIRDE QL A+GSL+SRSDYEDVV ER+++ CGYPL Sbjct: 54 SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113 Query: 281 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 460 C N LPSE RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + A Sbjct: 114 CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173 Query: 461 KIDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 640 K++ IL LFGDL L+ S L+I+E + KA +V L GPSNAIEGYV Sbjct: 174 KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229 Query: 641 P-KDRNSKPLLLKQRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 781 P ++ SKP K K D+ + V N++DF I + 
DEY I+ Sbjct: 230 PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289 Query: 782 XXXXXXXXXXXXXXXXXXPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 919 + ++TI +M + S Q+ + LKE Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349 Query: 920 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1054 S+ + + SI E+PS Q+G D + E +K+ +K +++ Sbjct: 350 KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409 Query: 1055 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1231 LKSS+KS+GAKKL R VTWAD KKAD+ N NLC ++E+E K DSE GS E G DDN Sbjct: 410 LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469 Query: 1232 LRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 1411 LRF V SG S VTDAV E G++I+P + D+ + + ED DML Sbjct: 470 LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528 Query: 1412 GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 1588 AP+KWP KP + D F ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY Sbjct: 529 ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588 Query: 1589 GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 1768 GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L Sbjct: 589 GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648 Query: 1769 ERGMGHLLETMSFFDALPSFRTRQWQV 1849 E+GMGHL++T+SF +ALP+FR +QW++ Sbjct: 649 EQGMGHLIDTISFMEALPAFRMKQWEI 675 >ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao] gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 533 bits (1372), Expect = e-148 Identities = 303/625 (48%), Positives = 389/625 (62%), Gaps = 44/625 (7%) Frame = +2 Query: 101 TMTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPL 280 +M + I V +AV+K+Q LLDGIRDE QL A+GSL+SRSDYEDVV ER+++ CGYPL Sbjct: 54 SMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPL 113 Query: 281 CNNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPA 460 C N LPSE RKGRYR+SLKEHKVYDLQETYM+C 
++C++NSRAFA SLQEERCSV + A Sbjct: 114 CANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHA 173 Query: 461 KIDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYV 640 K++ IL LFGDL L+ S L+I+E + KA +V L GPSNAIEGYV Sbjct: 174 KLNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYV 229 Query: 641 P-KDRNSKPLLLKQRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXX 781 P ++ SKP K K D+ + V N++DF I + DEY I+ Sbjct: 230 PQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGS 289 Query: 782 XXXXXXXXXXXXXXXXXXPNDN-------EHQFTILEMQATSVQNKGEGKLKE------- 919 + ++TI +M + S Q+ + LKE Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGIC 349 Query: 920 -------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSM 1054 S+ + + SI E+PS Q+G D + E +K+ +K +++ Sbjct: 350 KDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETV 409 Query: 1055 LKSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNA 1231 LKSS+KS+GAKKL R VTWAD KKAD+ N NLC ++E+E K DSE GS E G DDN Sbjct: 410 LKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNM 469 Query: 1232 LRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGL 1411 LRF V SG S VTDAV E G++I+P + D+ + + ED DML Sbjct: 470 LRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEP 528 Query: 1412 GPAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIY 1588 AP+KWP KP + D F ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIY Sbjct: 529 ETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIY 588 Query: 1589 GRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSIL 1768 GRDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S L Sbjct: 589 GRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTL 648 Query: 1769 ERGMGHLLETMSFFDALPSFRTRQW 1843 E+GMGHL++T+SF +ALP+FR +QW Sbjct: 649 EQGMGHLIDTISFMEALPAFRMKQW 673 >gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea] Length = 597 Score = 501 bits (1289), Expect = e-139 Identities = 292/648 (45%), Positives = 395/648 (60%), Gaps = 13/648 (2%) 
Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M D+ + +K+AVY+LQ SLL+G ++ENQL AAGSLMSR DY+D+V ER +AK+CGYPLC Sbjct: 1 MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 +N L SERP KGRYR+SLKEHKVYD+QETY +C S C++NSRAF+ L +ER S DP K Sbjct: 61 SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++ +LK F S S+L+I EK +AGEV EWIGPS+AI+GYVP Sbjct: 121 LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180 Query: 644 K-DRNSKPLLLKQRKGDTGKNLVF--------NDMDFTSEIFIGDEYSIAXXXXXXXXXX 796 + DRNS L KQ+KG++ +L +DM FTS I +EYSIA Sbjct: 181 RRDRNSNTLSSKQKKGESRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTPSSSKQ 240 Query: 797 XXXXXXXXXXXXXPNDNEHQFTILEMQATSVQNKG-EGKLKESSCGKSELSIPEVPSIPC 973 P ++ + +++ G K + K + + Sbjct: 241 SGESNEKVI----PEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDKAS 296 Query: 974 QNGSDIIATEGKKDPRTEKEAQIGDSMLKSSMKSSGAKKLT-RSVTWADKKADSMNNTNL 1150 +NG + +G +K AQ G ++LKSS+K+S +K+ T R+V+WAD KA+ + NL Sbjct: 297 ENGGEPKLADG------DKSAQ-GAAVLKSSLKTSYSKETTTRTVSWADVKAE--DGQNL 347 Query: 1151 CAIRELEDTKEDSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAE 1330 + E+ D S + V S ++ T A + Sbjct: 348 ETVCEMNDPHGGGISRETSS--------------------------VESHKTASTKASKD 381 Query: 1331 A-GIVIVPRPHDEDEGDSLEEDVDMLGLGPAPIKWPTKP-VTDYDFFESQNSWYDTPPEG 1504 A G ++ D +EG+ E + +KWP KP ++ D ES ++ YD PP+G Sbjct: 382 APGKFLLT---DFNEGEIFTEAI---------LKWPPKPGFSEADLVESDDTLYDRPPDG 429 Query: 1505 FNLTLSPFATMWMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEI 1684 FNL+LSPF T++ +LF+W++SSSLAYIYG+D+SFHEEY++ NGREYP K+V DGRSSEI Sbjct: 430 FNLSLSPFCTLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEI 489 Query: 1685 KQALAGCLARALPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLF 1864 KQ L+ LARALPG+V+EL LPTP+SILE+GMG LL+TMSF D LPS RT+QWQ I LLF Sbjct: 490 KQTLSAALARALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLF 549 
Query: 1865 MDALSVCRIPGLTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIPLG 2008 ++ALSV RIP L+ ++ RR S+ KVLEGA +GV+E+EVMKD IIPLG Sbjct: 550 LNALSVSRIPALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597 >ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] Length = 607 Score = 500 bits (1287), Expect = e-138 Identities = 290/603 (48%), Positives = 369/603 (61%), Gaps = 44/603 (7%) Frame = +2 Query: 104 MTNDDPIPVKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLC 283 M + I V +AV+K+Q LLDGIRDE QL A+GSL+SRSDYEDVV ER+++ CGYPLC Sbjct: 1 MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60 Query: 284 NNTLPSERPRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAK 463 N LPSE RKGRYR+SLKEHKVYDLQETYM+C ++C++NSRAFA SLQEERCSV + AK Sbjct: 61 ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120 Query: 464 IDRILKLFGDLSLESQXXXXXXXXXXXSELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVP 643 ++ IL LFGDL L+ S L+I+E + KA +V L GPSNAIEGYVP Sbjct: 121 LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 176 Query: 644 -KDRNSKPLLLKQRKG---DTGKN---------LVFNDMDFTSEIFIGDEYSIAXXXXXX 784 ++ SKP K K D+ + V N++DF I + DEY I+ Sbjct: 177 QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 236 Query: 785 XXXXXXXXXXXXXXXXXPNDN-------EHQFTILEMQATSVQNKGEGKLKE-------- 919 + ++TI +M + S Q+ + LKE Sbjct: 237 KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 296 Query: 920 ------------SSCGKSELSIPEVPSIP--CQNGSDIIATEGKKDPRTEKEAQIGDSML 1057 S+ + + SI E+PS Q+G D + E +K+ +K +++L Sbjct: 297 DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 356 Query: 1058 KSSMKSSGAKKLTRSVTWAD-KKADSMNNTNLCAIRELEDTKEDSESLGSMEIGDDDNAL 1234 KSS+KS+GAKKL R VTWAD KKAD+ N NLC ++E+E K DSE GS E G DDN L Sbjct: 357 KSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGGDDNML 416 Query: 1235 RFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHDEDEGDSLEEDVDMLGLG 1414 RF V SG S VTDAV E G++I+P + D+ + + ED DML Sbjct: 417 
RFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPM-EDGDMLEPE 475 Query: 1415 PAPIKWPTKP-VTDYDFFESQNSWYDTPPEGFNLTLSPFATMWMALFAWVTSSSLAYIYG 1591 AP+KWP KP + D F ++SW+D PPEGF+LTLS FATMW ALF W+TSSSLAYIYG Sbjct: 476 TAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLAYIYG 535 Query: 1592 RDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARALPGLVAELGLPTPLSILE 1771 RDESFHEEYLS+NGREYP+KI L DGRSSEIK+ LA C++RALP +V +L LP P+S LE Sbjct: 536 RDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPISTLE 595 Query: 1772 RGM 1780 +GM Sbjct: 596 QGM 598 >ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [Amborella trichopoda] gi|548843599|gb|ERN03253.1| hypothetical protein AMTR_s00003p00194360 [Amborella trichopoda] Length = 591 Score = 454 bits (1169), Expect = e-125 Identities = 265/635 (41%), Positives = 367/635 (57%), Gaps = 10/635 (1%) Frame = +2 Query: 128 VKDAVYKLQHSLLDGIRDENQLFAAGSLMSRSDYEDVVKERSLAKVCGYPLCNNTLPSER 307 +KDA+YK+Q LLDGI ENQL AA +L+SRSDY+DVV ER++ +CGYPLCN LP +R Sbjct: 9 LKDAIYKIQTYLLDGISKENQLLAAANLISRSDYDDVVTERTITNLCGYPLCNKYLPCDR 68 Query: 308 PRKGRYRVSLKEHKVYDLQETYMYCCSSCVVNSRAFAASLQEERCSVFDPAKIDRILKLF 487 P+KGRYR+SLKEH VYDL+ET++YC CV+NS+AF+ L+ ERC DP KI IL LF Sbjct: 69 PKKGRYRISLKEHSVYDLKETWLYCSPECVINSQAFSKLLKPERCEFSDPGKIAEILNLF 128 Query: 488 GDLSLESQXXXXXXXXXXXS----ELKIQEKSQSKAGEVPLEEWIGPSNAIEGYVPKDRN 655 S+E S L I EK G++ +++GP NAIEGYVP+ Sbjct: 129 SSPSIEESNAGGAEKNEKISLAFSSLTIHEKEDVSVGDIQSMDFVGPYNAIEGYVPRQDQ 188 Query: 656 SKPLLLKQRKGD-TGKNLVFNDMDFTSEIFIGDEYSIAXXXXXXXXXXXXXXXXXXXXXX 832 P+ QRKG +GK+ D + F Sbjct: 189 VPPV---QRKGSKSGKSTTKKDPIYPETNFAS---------------------------- 217 Query: 833 XPNDNEHQFTILEMQATSVQNKGEGKLKESSCGKSELSIPEVPSIPCQNGSDIIATEGKK 1012 TI+ + +S G L+++S K V ++ ++ ++ Sbjct: 218 ---------TIIIGEPSS------GNLQKNSSSKFVNDHVHV---------NVEGSKREQ 253 Query: 1013 DPRTEKEAQIGDSMLKSSMKSSGAKKLTRSVTWADKK---ADSMNNTNLCAIRELEDTKE 1183 + + ++ ++ L+S++K+ GAK TR+V+WAD++ + + N L + +E + Sbjct: 254 
HAQEKSQSHPKETKLRSALKNLGAKASTRTVSWADEQQTIVEGIQNMTLNNCQGIESGSK 313 Query: 1184 DSESLGSMEIGDDDNALRFXXXXXXXXXXXXXXXXVGSGQSGVTDAVAEAGIVIVPRPHD 1363 ES S+ + D + R V SGQS DA +EAGI+I P P+ Sbjct: 314 CKESSDSLSVEDTMISSRRASAEACASALTEAAAAVASGQSNTLDAASEAGILIFPCPNS 373 Query: 1364 EDEGDSLEEDVDMLGLGPAPIKWPTKPVTDYD--FFESQNSWYDTPPEGFNLTLSPFATM 1537 +E +++++ D L KW +P + F ++SWYD PPEGF+LTLS FATM Sbjct: 374 VEE-ENIQKVADELKPEEGE-KWVKRPSLLHTGAFDTEEDSWYDAPPEGFSLTLSSFATM 431 Query: 1538 WMALFAWVTSSSLAYIYGRDESFHEEYLSVNGREYPQKIVLTDGRSSEIKQALAGCLARA 1717 WMALF WVT+SS+AYIYGR ES EE++ V+GREYP K VL DG SSEIK+ L+GCLARA Sbjct: 432 WMALFGWVTASSMAYIYGRAESAEEEFVVVDGREYPHKFVLGDGLSSEIKETLSGCLARA 491 Query: 1718 LPGLVAELGLPTPLSILERGMGHLLETMSFFDALPSFRTRQWQVIALLFMDALSVCRIPG 1897 LPG+VA + LPTP+S LE +G LL+TM+F +ALP FR +QW VI LLF+DALSV +P Sbjct: 492 LPGVVANIKLPTPISTLEVALGRLLDTMTFTEALPPFRMKQWHVIVLLFLDALSVHIVPA 551 Query: 1898 LTPHMTSRRTSLHKVLEGAQVGVDEYEVMKDFIIP 2002 L H+ SRRT +HK+LE AQV +EY +M+D +P Sbjct: 552 LEQHIASRRTLVHKMLEDAQVSNEEYNIMRDLFLP 586