BLASTX nr result
ID: Cocculus22_contig00008025
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00008025
         (2470 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   582   e-163
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   577   e-162
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   536   e-149
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   501   e-139
ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phas...   500   e-138
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   497   e-138
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   495   e-137
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   489   e-135
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   489   e-135
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     486   e-134
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   485   e-134
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   481   e-133
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...   477   e-131
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   476   e-131
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   476   e-131
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   464   e-128
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   447   e-122
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...
439 e-120 ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c... 438 e-120 gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus... 434 e-118 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 582 bits (1499), Expect = e-163 Identities = 333/657 (50%), Positives = 427/657 (64%), Gaps = 14/657 (2%) Frame = -2 Query: 2445 EQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNP 2266 +Q I++KDAVHK+QL L EGI +EN LFAA SLMSRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 2265 LPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXX 2086 LP+E RK HYR SLK K YD E Y++CS GCV NSR+FAGSL +RCSV NS Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 2085 XXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIPEVD 1921 E L+ N+ LG DL +ELKI+E V KAG+V +E+ P +IEGY+P+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1920 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1741 L G++S+NS++ GK +++EMD ++ D+++ K K+ + Sbjct: 184 RNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSH 243 Query: 1740 TKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAENTQFGELLLNG-S 1585 KS E K K S G +S P + SE+K +ES R +F + Sbjct: 244 AKSKEPKEKA-SIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVP 302 Query: 1584 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 1405 Q+ SE+ AQ KSSLKP G K + +SVTWAD+ K++ DS D Sbjct: 303 SQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE-KMDSADSRD 361 Query: 1404 LCTFQEINDIKDNGSSRNS-NVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 1228 C +E+ K++ + +V D D +LR ASAEAC +ALSQ+AEAV+SGE D++DA S Sbjct: 362 FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVS 421 Query: 1227 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 1048 ++GIIILP D D E VPLK P +PG+ +S +FD ++SWYD PP GF Sbjct: 422 EAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGF 481 Query: 1047 
SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 868 SLTLS FATMW ALF WITSSS+AYIYGRD S HEE++SVNG+EYP+K +DGRSSEIK Sbjct: 482 SLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIK 541 Query: 867 ESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFI 688 ++LAG L+R +P LVA+L+L P+S LE G+ LLDTMSF+D LPS +QW V+ LLFI Sbjct: 542 QTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFI 601 Query: 687 DALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSG 517 DALSVCRIP L PHMTS R L KV D+AQV+ EEYE MKD+IIPLGR PQFSAQSG Sbjct: 602 DALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSG 658 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 577 bits (1488), Expect = e-162 Identities = 331/657 (50%), Positives = 426/657 (64%), Gaps = 14/657 (2%) Frame = -2 Query: 2445 EQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNP 2266 +Q I++KDAVHK+QL L EGI +EN LFAA SLMSRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 2265 LPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXX 2086 LP+E RK HYR SLK K YD E Y++CS GCV NSR+FAGSL +RCSV NS Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 2085 XXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIPEVD 1921 E L+ N+ LG DL +ELKI+E V KAG+V +E+ P +IEGY+P+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1920 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1741 L G++S+NS++ GK +++EMD ++ D+++ K K+ + Sbjct: 184 RNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSH 243 Query: 1740 TKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAENTQFGELLLNG-S 1585 KS E K K S G +S P + SE+K +ES R +F + Sbjct: 244 AKSKEPKEKA-SIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVP 302 Query: 1584 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 1405 Q+ SE+ AQ LKS LKP 
G K +++SVTWAD+ K++ DS D Sbjct: 303 SQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADSRD 361 Query: 1404 LCTFQEINDIKDNGSSRNS-NVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 1228 C +E+ K++ + +V D D +LR ASAEAC +ALSQ+AEAV+SGE D++DA S Sbjct: 362 FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVS 421 Query: 1227 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 1048 ++ IIILP D D E VPLK P +PG+ +S +FD ++SWYD PP GF Sbjct: 422 EARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGF 481 Query: 1047 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 868 SLTLS FATMW ALF WITSSS+AYIYGRD S HEE++SVNG+EYP+K +DGRSSEIK Sbjct: 482 SLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIK 541 Query: 867 ESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFI 688 ++LAG LAR +P LVA+L+L P+S LE G+ LLDTMSF+D LPS +QW V+ LLFI Sbjct: 542 QTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLFI 601 Query: 687 DALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSG 517 DALSVC+IP L PHM S R L KV D+AQV+ EEYE MKD+IIPLGR PQFSAQSG Sbjct: 602 DALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSG 658 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 536 bits (1381), Expect = e-149 Identities = 314/703 (44%), Positives = 424/703 (60%), Gaps = 51/703 (7%) Frame = -2 Query: 2469 RTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVC 2290 ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRSDY+DV+T+R+I+N C Sbjct: 50 KSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109 Query: 2289 GYPLCNNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSV 2110 GYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ NSRAFAGSL +RCSV Sbjct: 110 GYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSV 169 Query: 2109 YNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKAGDVLLENSSPFFSIE 1942 N + L +N+ LG DL +K E+VKA DV L + P +IE Sbjct: 170 
LNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSL--AGPSNAIE 226 Query: 1941 GYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK------------GKIV---------- 1831 GY+P+ + + S++S+L K G I+ Sbjct: 227 GYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK 286 Query: 1830 ---------------------NEMDLTCNLVAADQFTGPKLFSIFKENGAETKSDEVKHK 1714 NEMD T ++ D++T K+ S K++ ++ EV+ K Sbjct: 287 PGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEK 346 Query: 1713 PFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXXXXXX 1540 + SGS + + K+S + + + +N Q+ + + Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNV----------YQSGLDTSSAEAEKE 396 Query: 1539 XXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK-DNG 1363 K S+E LKSSLK GAK L++ VTWADK K ++ +G+LC +E+ +K D+ Sbjct: 397 THADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSE 456 Query: 1362 SSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDX 1183 S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA ++G+IILP L + D Sbjct: 457 ISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDK 516 Query: 1182 XXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALF 1003 ET P+K P +PG+ +S +F+PE+SW+DAPP GFSLTLS+FATMW ALF Sbjct: 517 EEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALF 576 Query: 1002 GWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELV 823 WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIKE+LA ++R +P +V Sbjct: 577 EWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIV 636 Query: 822 AELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPGLAPHM 643 +L+L P+S LE GM L+DT+SFM+ LP+ +QW V+ LLFIDALSVCRIP L PHM Sbjct: 637 TDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHM 696 Query: 642 TSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 T+ R LLHKVLD AQ++ EEYE MKD+IIPLGR P FSAQSGA Sbjct: 697 TNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSGA 739 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 501 bits (1289), Expect = e-139 Identities = 300/676 
(44%), Positives = 397/676 (58%), Gaps = 31/676 (4%) Frame = -2 Query: 2448 KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 2269 K+Q IS+KDAV K+QL+L EGI E+ LFAA SL+SRSDY+DV+T+RSIT VC YPLC N Sbjct: 3 KDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCN 62 Query: 2268 PLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXX 2089 LP+E PRK YR SLK K YD E Y+FCS CV NS+AFAGSL KRC + Sbjct: 63 ALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLN 122 Query: 2088 XXXXXXXEQGLKENEYLGVTDD--LTELKIQEKVK-AGDVLLEN-SSPFFSIEGYIPEVD 1921 L+ E G + L+ L+IQ+K + +V LE P +IEGY+P+ Sbjct: 123 NILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVTEVSLEQWVGPSNAIEGYVPKKR 182 Query: 1920 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1741 G+++++ + K I +E D ++ D+ +S+ K + + Sbjct: 183 DNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDE------YSVSKVSSGQ 236 Query: 1740 TKSDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIA 1561 T + V H+ P E K+ D++ +S++ F L + + EIA Sbjct: 237 TDA-TVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSS----FASSLNLSASKKDKEIA 291 Query: 1560 XXXXXXXXXXXKTAQSNENAL--------------------------KSSLKPPGAKTLS 1459 +N+++ KSSLK G K L Sbjct: 292 KSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLG 351 Query: 1458 QSVTWADKIKVNDTDSGDLCTFQEINDI-KDNGSSRNSNVEDVDASLRLASAEACVLALS 1282 +SVTWADK K++ S DLC F+E +I K++ + N +V D + LR SAEAC +ALS Sbjct: 352 RSVTWADK-KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALS 410 Query: 1281 QSAEAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFN 1102 Q+AEAV+SG+ D DA S++GIIILP + ++V LK P +PG+ + Sbjct: 411 QAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISD 470 Query: 1101 SKLFDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNG 922 LF ++SW+DAPP GFSLTLS FAT+W A F WITSSSLAYIYGRDVS +EEF+SV+G Sbjct: 471 FDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDG 530 Query: 921 KEYPRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMD 742 +EYP K SDGRSSEIK++LA LAR +P +VAELKL P+S LE GM CLLDTMSF+D Sbjct: 531 
REYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVD 590 Query: 741 PLPSLGTEQWHVLTLLFIDALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDV 562 PLP +QW V+ LLF+DALSVCRIP L +MT R L HKVL +Q+ EEY +KD+ Sbjct: 591 PLPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDL 650 Query: 561 IIPLGRRPQFSAQSGA 514 I+PLGR P FS+QSGA Sbjct: 651 IVPLGRAPHFSSQSGA 666 >ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] gi|561018957|gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] Length = 706 Score = 500 bits (1288), Expect = e-138 Identities = 302/707 (42%), Positives = 406/707 (57%), Gaps = 60/707 (8%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K++++S+KDAV K+Q+ L EGI +E+ LFAA SLMSRSDY+D++T+RSITNVCGYPLC Sbjct: 1 MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 N LP+E PRK YR SLK K YD +E Y+FCS CV +S+AF+G L +RCS + Sbjct: 61 CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIP 1930 L++ E + DL + LKIQEK +G+V LE P +IEGY+P Sbjct: 121 LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180 Query: 1929 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKL------- 1771 + G+++ + + K I +EM+ ++ D+++ K Sbjct: 181 KPRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKASPGQTDT 240 Query: 1770 --FSIFKENGAETKSDE-----VKHKP----------FSDGAAVSPAFSGSETKS----- 1657 K + + +E V K F G +S + G E Sbjct: 241 TAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASEKGKEVSKSCEVV 300 Query: 1656 ---------KESDVRSISAAE--------NTQFGELLLNGSMQNV--------SEIAXXX 1552 K+ D S+S +E N+ + L G V S Sbjct: 301 VKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDASTSNFDPDN 360 Query: 1551 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI- 1375 K E LKSSLK G K LS++VTWAD+ K+N + DLC +E DI Sbjct: 361 
VKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADE-KINGAGNKDLCEVKEFGDII 419 Query: 1374 KDNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLN 1195 K++ S N +V + + LR ASAEAC +ALSQ++EAV+SG+ D +DA S++GIIILPQ + Sbjct: 420 KESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGIIILPQPH 479 Query: 1194 DYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMW 1015 D ++V LK P +PG+ + F+ ++SW+DAPP GFSLTLS FA MW Sbjct: 480 DAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTLSPFANMW 539 Query: 1014 TALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMV 835 A+F W+TS SLAYIYGRD S HEE++SVNG+EYP K SDGRSSEIK++ AG LAR Sbjct: 540 NAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFAGCLARAF 599 Query: 834 PELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPGL 655 P LVA L+L P+S LE GM CLL+TMSF+D LP+ T+QW V+ LLF+DALSVCRIP L Sbjct: 600 PALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALSVCRIPSL 659 Query: 654 APHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 +MT RAL HKVL +Q+ EEYE +KD+++PLGR P S QSGA Sbjct: 660 ISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSGA 706 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 497 bits (1280), Expect = e-138 Identities = 295/669 (44%), Positives = 404/669 (60%), Gaps = 23/669 (3%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K +++++KDAVHK+QL L EGI E+ L AA SL+SRSDY DV+T+RSI N+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 +N LP+E RK HYR SLK K YD E Y++CS CV NS AFAGSL +R S N Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDLTE--------LKIQEKV--KAGDVLLEN-SSPFFS 1948 L + +L DD+ E LKIQEKV K G+V LE P + Sbjct: 121 LNQVL------NLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNA 174 Query: 1947 
IEGYIPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLF 1768 IEGY+P+ D + G+++ ++RLQ K I+NE D + ++ D+++ K Sbjct: 175 IEGYVPQRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFP 234 Query: 1767 SI--------FKENGAETKSDEVKHKPFSDGAAVSPAF--SGSETKSKESDVRSISAAEN 1618 + FKE A+T+ + G V SG ET+ + + R + + Sbjct: 235 APVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKV-DK 293 Query: 1617 TQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWAD 1438 GE+ S +V + + + LKSSLK +K +S+SVTWAD Sbjct: 294 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKY-ASHGEHDKLKSSLKSSNSKKMSRSVTWAD 352 Query: 1437 KIKVNDTDSGDLCTFQEINDIKDN--GSSRNSNVEDVDASLRLASAEACVLALSQSAEAV 1264 + ++ + +I++ + G S ++++E+ D S R SAEAC ALSQ+AEAV Sbjct: 353 E-SIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAV 411 Query: 1263 SSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDP 1084 +SG D+ DA S +GI+ILP + D ET PLK P +PG+ N +F+ Sbjct: 412 ASGS-DVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFES 470 Query: 1083 ENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRK 904 E+SWYD+PP GF++TLS F TM+ +LF WI+SSSLA+IYG D S++EE++S+NG+EYPRK Sbjct: 471 EDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRK 530 Query: 903 NFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLG 724 SDGRS+EIK++LAG LAR +P LVA+L+L P+S LE GM LL+TMSF+DPLP+ Sbjct: 531 IVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFR 590 Query: 723 TEQWHVLTLLFIDALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGR 544 +QW ++ LLF+DALSVCRIP L P+MT R KVLD AQ++ EYE MKD+IIPLGR Sbjct: 591 MKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGR 650 Query: 543 RPQFSAQSG 517 PQFS QSG Sbjct: 651 VPQFSMQSG 659 >ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Glycine max] Length = 706 Score = 495 bits (1274), Expect = e-137 Identities = 297/709 (41%), Positives = 409/709 (57%), Gaps = 64/709 (9%) Frame = -2 Query: 2448 KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 2269 K++ +S+KDAV K+Q+SL 
EGI +E+ LFAA SLMSRSDY+D++T+RSITNVCGYPLC+N Sbjct: 3 KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLCSN 62 Query: 2268 PLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCS------VY 2107 LP++ PRK YR SLK K YD E Y+FC CV +S+AFAGSL +RCS + Sbjct: 63 ALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEKLN 122 Query: 2106 NSXXXXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVK--AGDVLLEN-SSPFFSIEGY 1936 N + L++NE G++D LKIQEK + +G+V LE + P +IEGY Sbjct: 123 NILSLFENLNLEPAENLQKNEDFGLSD----LKIQEKTETSSGEVSLEQWAGPSNAIEGY 178 Query: 1935 IPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLF---- 1768 +P+ G+++ + + I +EM ++ D ++ K+ Sbjct: 179 VPKPRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPGQR 238 Query: 1767 ---------------SIFKENGAETKSDEVKHKPFSDGAAVSPAFSGSETKS-------- 1657 + K + + D+ + S S SE + Sbjct: 239 DATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEA 298 Query: 1656 ----------KESDVRSISAAE--------NTQFGELLLNGSMQNV--------SEIAXX 1555 K+ DV S+S +E ++ + + G M V S + Sbjct: 299 ALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPA 358 Query: 1554 XXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI 1375 K S KSSLK G K LS++VTWADK K+N T S DLC F+ DI Sbjct: 359 NVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADK-KINSTGSKDLCGFKNFGDI 417 Query: 1374 KDNGSSRNSNVE--DVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQ 1201 ++ S ++++ + + +LR ASAEACV+ALS ++EAV+SG+ D+SDA S++GIIILP Sbjct: 418 RNESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIIILPP 477 Query: 1200 LNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFAT 1021 +D ++V +K P +PG+ + F+ ++SW+DA P GFSLTLS FAT Sbjct: 478 PHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLSPFAT 537 Query: 1020 MWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILAR 841 MW LF WITSSSLAYIYGRD S EE++SVNG+EYP K +DGRSSEIK++LA LAR Sbjct: 538 MWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLAR 597 Query: 840 MVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIP 661 +P LVA L+L P+S +E GM CLL+TMSF+D LP+ T+QW V+ 
LLFIDALSVCR+P Sbjct: 598 ALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVCRLP 657 Query: 660 GLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 L +MT RA H+VL +Q+ EEYE +KD+ +PLGR P SAQSGA Sbjct: 658 ALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSGA 706 >ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Glycine max] Length = 706 Score = 489 bits (1260), Expect = e-135 Identities = 293/708 (41%), Positives = 405/708 (57%), Gaps = 61/708 (8%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K++ +S+KDAV K+Q+SL EGI +E+ LFAA SLMSRSDY+D++T+RSITN+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 +N LP++ PRK YR SLK K YD +E Y+FCS C+ +S+ FAGSL +RCS + Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIP 1930 L+ E L DL ++LKIQEK + +G+V LE + P +IEGY+P Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1929 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPK-------- 1774 + G+++ + + I +EM ++ D+++ K Sbjct: 181 KPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDA 240 Query: 1773 --------------------------------LFSIFKEN---GAETKSDEVKHK----- 1714 L S FK + K +EV Sbjct: 241 TANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVL 300 Query: 1713 PFSDGAAV------SPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXX 1552 FS G A+ S + S + +++D S + ++ N + S + Sbjct: 301 KFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDA-STSNLDPAN 359 Query: 1551 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK 1372 K S + +SSLK G K S++VTWAD+ K+N T S DLC F+E DIK Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIK 418 Query: 1371 DNGSSRNSNVEDVDAS--LRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQL 1198 S +N++ + LR ASAEAC +ALS ++EAV+SG+ D+SDA 
S++GI ILP Sbjct: 419 KESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGITILPPP 478 Query: 1197 NDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATM 1018 +D ++V LK P + G+ + F+ ++SW+DAPP GFSLTLS FATM Sbjct: 479 HDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTLSPFATM 538 Query: 1017 WTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARM 838 W LF W TSSSLAYIYGRD S HEE++SVNG+EYP K +DGRSSEIK++LA LAR Sbjct: 539 WNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLARA 598 Query: 837 VPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPG 658 +P LVA L+L P+S +E GM CLL+TMSF+D LP+ T+QW V+ LLFIDALSVCR+P Sbjct: 599 LPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVCRLPA 658 Query: 657 LAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 L +MT RA H+VL +Q+ EEYE +KD+++PLGR P S+QSGA Sbjct: 659 LISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSGA 706 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 489 bits (1259), Expect = e-135 Identities = 292/670 (43%), Positives = 402/670 (60%), Gaps = 24/670 (3%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K +++++KDAVHK+QL L EGI EN L AA SL+SRSDY DV+T+RSI N+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 +N LP+E RK HYR SLK K YD E Y++CS CV NS AFAGSL +R S N Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVKA---GDVLLEN-SSPFFSIEGYI 1933 L E + DL ++LKIQEKV G+V LE P +IEGY+ Sbjct: 121 LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180 Query: 1932 PEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKL------ 1771 P+ D + G ++ ++RLQ K I+NE D + ++ D+++ K Sbjct: 181 PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNA 240 Query: 1770 
--FSIFKENGAETKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAEN 1618 FKE A+T+ +K D ++ SG ET+ + + R + + Sbjct: 241 VSSEKFKEAQAKTR-----YKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKV-DK 294 Query: 1617 TQFGELLLNGSMQNVSEIAXXXXXXXXXXXKT-AQSNENALKSSLKPPGAKTLSQSVTWA 1441 GE+ S +V + + + ++ LKSSLK +K +SQSVTWA Sbjct: 295 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354 Query: 1440 DKIKVNDTDSGDLCTFQEINDIKDN--GSSRNSNVEDVDASLRLASAEACVLALSQSAEA 1267 D+I ++ + +I++ ++ G S ++++E+ D S R SAEAC ALSQ+AEA Sbjct: 355 DEI-IDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEA 413 Query: 1266 VSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFD 1087 V+SG D+ DA S +GI+ILP + D PLK P +PG+ N +F+ Sbjct: 414 VASGS-DVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPA-PLKWPRKPGMPNYDVFE 471 Query: 1086 PENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPR 907 E+ WYD PP GF++TLS FATM+ +LF WI+SSSLA+IYG D +++EE++S+NG+EYP Sbjct: 472 SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531 Query: 906 KNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSL 727 K SDG S+EIK++LAG LAR +P LVA+L+L P+S LE GM LL+TMSF+DPLP+ Sbjct: 532 KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAF 591 Query: 726 GTEQWHVLTLLFIDALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLG 547 +QW ++ LLF+DALSVCRIP L P+MT R L KVLD AQ++ EYE MKD+IIPLG Sbjct: 592 RMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLG 651 Query: 546 RRPQFSAQSG 517 R PQFS QSG Sbjct: 652 RVPQFSMQSG 661 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 486 bits (1250), Expect = e-134 Identities = 299/702 (42%), Positives = 402/702 (57%), Gaps = 61/702 (8%) Frame = -2 Query: 2436 ISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNPLPA 2257 IS+KD V+++QLSL +G+H E+ LFAA S+MSRSDY+DV+T+RSI N+CGYPLC NPLP+ Sbjct: 9 ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68 Query: 2256 EWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXXXXX 2077 + PRK YR SLK K YD E Y++CS CV NSR FA SL +RC+V +S 
Sbjct: 69 DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128 Query: 2076 XXXE-QGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIPEVDGV 1915 + GL+ G DL ++LKI+EK + GDV LE + P +IEGY+ + + Sbjct: 129 MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188 Query: 1914 LXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAETK 1735 G+++NN+ L +N+MD ++ D++T K S K+ G ++K Sbjct: 189 PKELGSKSPKRGSKANNTVL-------INDMDFVSTIITEDEYTVSKTPSSLKKTGLDSK 241 Query: 1734 SDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXX 1555 E + ++ G+E E+ S + S++ S ++ Sbjct: 242 VREQEE-------ILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSA 294 Query: 1554 XXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI 1375 K + E ++KSSLKP K LS++VTWAD+ K + + LC +EI D+ Sbjct: 295 RAEEESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADE-KTDSSGGRKLCEIREIEDM 353 Query: 1374 KD--------NGSSRNSN-----------------------------VEDV--------- 1333 K+ NG S S+ +ED Sbjct: 354 KEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCN 413 Query: 1332 ------DASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDXXXXX 1171 D + R ASAEAC AL +++EAV+S EL+++DA S++GIIILP+ + D Sbjct: 414 ADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPM 473 Query: 1170 XXXXXXXXET---VPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALFG 1000 P+K P +PG +S LFDPE+SW+DAPP FSLTLS FA MW ALF Sbjct: 474 EEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFT 533 Query: 999 WITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELVA 820 W TSS+LAYIYGRD S HEE+ VNG+EYP K DGRSSEIK++LAG LAR +P LVA Sbjct: 534 WTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVA 593 Query: 819 ELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPGLAPHMT 640 +L+L P+S+LE GM LLDTMSF+D LP +QW V+ LLF++ALSV R+P L PHM Sbjct: 594 DLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMM 653 Query: 639 SMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 R L HKVLDSAQ++ EEYE MKD++IPLGR P FSAQSGA Sbjct: 654 YRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSGA 695 
>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 485 bits (1249), Expect = e-134 Identities = 289/656 (44%), Positives = 396/656 (60%), Gaps = 9/656 (1%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + KE+S+S+KD V+K+QLSL EGI +E+ L AA SLMSRSDY+DV+ +RSI+N+CGYPLC Sbjct: 1 MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 NN LP++ P K YR SLK + YD +E Y++CS C+ NSRAF+ SL KRCSV N Sbjct: 61 NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEK--VKAGDVLLEN-SSPFFSIEGYIP 1930 + L ++E LG + DL + LKIQEK G V LE P +IEGY+P Sbjct: 121 LNEILRKFNDLTL-DSEGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179 Query: 1929 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKEN 1750 + D + K + ++ D T ++ D+++ K + Sbjct: 180 QGDRDPNPSLKNHKEGLKAICKKPVSK-QDCFFSDTDFTSTIITNDEYSISK-----GPS 233 Query: 1749 GAETKSDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELL---LNGSMQ 1579 G + + ++K + G + + K+ +++ ++ + +++ LN Sbjct: 234 GLTSTASDIKLQA-QTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDL 292 Query: 1578 NVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLC 1399 S A NE+ LK SLK GAK ++SVTWAD+ +V++ S +LC Sbjct: 293 PSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADE-RVDNAGSRNLC 351 Query: 1398 TFQEINDIKDNGS-SRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDS 1222 QE+ ++ S ++N D LR SAEAC +ALSQ+AEAV+SG+ D++ A S++ Sbjct: 352 EVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEA 411 Query: 1221 GIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSL 1042 GII+LP D E+ LK P +PG+ S LFDPE+SWYDAPP GFSL Sbjct: 412 GIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSL 471 Query: 1041 TLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKES 862 TLS FATMW ALF W+TSSSLAYIYGRD S+HE+++SVNG+EYPRK 
DGRSSEI+ + Sbjct: 472 TLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLT 531 Query: 861 LAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDA 682 LAR P LVA L+L P+S LE G LL+TMSF+D LP+ T+QW V+ LLFI+A Sbjct: 532 AESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEA 591 Query: 681 LSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 LSVCRIP L +MTS R +LH+VLD A ++ EEY+ MKD ++PLGR PQ A+SGA Sbjct: 592 LSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ--ARSGA 645 >ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Glycine max] Length = 716 Score = 481 bits (1239), Expect = e-133 Identities = 293/718 (40%), Positives = 405/718 (56%), Gaps = 71/718 (9%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K++ +S+KDAV K+Q+SL EGI +E+ LFAA SLMSRSDY+D++T+RSITN+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 +N LP++ PRK YR SLK K YD +E Y+FCS C+ +S+ FAGSL +RCS + Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIP 1930 L+ E L DL ++LKIQEK + +G+V LE + P +IEGY+P Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1929 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPK-------- 1774 + G+++ + + I +EM ++ D+++ K Sbjct: 181 KPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDA 240 Query: 1773 --------------------------------LFSIFKEN---GAETKSDEVKHK----- 1714 L S FK + K +EV Sbjct: 241 TANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVL 300 Query: 1713 PFSDGAAV------SPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXX 1552 FS G A+ S + S + +++D S + ++ N + S + Sbjct: 301 KFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDA-STSNLDPAN 359 Query: 1551 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK 1372 K S + +SSLK G K 
S++VTWAD+ K+N T S DLC F+E DIK Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIK 418 Query: 1371 DNGSSRNSNVEDVDAS--LRLASAEACVLALSQSAEAVSSGELDISDAE----------S 1228 S +N++ + LR ASAEAC +ALS ++EAV+SG+ D+SDA S Sbjct: 419 KESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMNETCAVS 478 Query: 1227 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 1048 ++GI ILP +D ++V LK P + G+ + F+ ++SW+DAPP GF Sbjct: 479 EAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGF 538 Query: 1047 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 868 SLTLS FATMW LF W TSSSLAYIYGRD S HEE++SVNG+EYP K +DGRSSEIK Sbjct: 539 SLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIK 598 Query: 867 ESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFI 688 ++LA LAR +P LVA L+L P+S +E GM CLL+TMSF+D LP+ T+QW V+ LLFI Sbjct: 599 QTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFI 658 Query: 687 DALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 DALSVCR+P L +MT RA H+VL +Q+ EEYE +KD+++PLGR P S+QSGA Sbjct: 659 DALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSGA 716 >ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] gi|462404075|gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 477 bits (1227), Expect = e-131 Identities = 293/718 (40%), Positives = 408/718 (56%), Gaps = 73/718 (10%) Frame = -2 Query: 2448 KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 2269 ++ IS+KD V+K+QL+L EGI ++ L+ A S++SRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 9 QQPRISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSN 68 Query: 2268 PLPAEW--PRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 LP++ P K HYR SLK K YD E Y++CS CV S+AFA SL +RC V + Sbjct: 69 ALPSDSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGK 128 Query: 2094 XXXXXXXXXEQGLKENEY-LGVTDDL--TELKIQEKVKAG-------DVLLENSS----- 1960 + G + E G DL ++LKI+EKV+ G + +E S Sbjct: 129 
VERILRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIG 188 Query: 1959 ------PFFSIEGYIPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVA 1798 P +IEGY+P+ + + G++ ++++ G I NEMD ++ Sbjct: 189 DLGAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIIT 248 Query: 1797 ADQFTGPKLFSIFKENGAETKSDEVKHKPF---SDGAAVSPAFSGSETKS-KESDVRSIS 1630 +D+++ K+ E ETK + K K +D S G + K+ K+ DV Sbjct: 249 SDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSVKKSRQSKGGKNKNVKKDDVCIRE 308 Query: 1629 AAENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSV 1450 + + +LNGS + E K QS E L+SSLKP G K L++SV Sbjct: 309 VPSTSDASQTVLNGSTKEEKE--------EFIVEKAEQSGEALLRSSLKPSGTKKLNRSV 360 Query: 1449 TWADKI----------------------------------------------KVNDTDSG 1408 TWAD++ K++ T S Sbjct: 361 TWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKSK 420 Query: 1407 DLCTFQEINDIKDNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 1228 ++C +E+ D GS D+ + L SAEAC +AL+Q+AEAV+SGE D+S A S Sbjct: 421 NICEVREVQDADVLGSL------DLQENEILESAEACAMALNQAAEAVASGESDVSGAVS 474 Query: 1227 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 1048 +GIIILP+ + D E PL P +PG+ S LFDPE+SW+DAPP GF Sbjct: 475 GAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGF 533 Query: 1047 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 868 S+TLS FATMW +LF WITSS+LAYIYGRD S HEEF+SVNG+EYP K + GRSSEIK Sbjct: 534 SVTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIK 593 Query: 867 ESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFI 688 ++L AR +P +V+EL+L P+S+LE GM +L+TMSF+D +P+ +QW V+ LLF+ Sbjct: 594 KTLDESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFL 653 Query: 687 DALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 + LSVCRIP L PHMT+ R L +KVL++ Q++ E+YE MKD+IIPLGR PQFSAQSGA Sbjct: 654 EGLSVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSGA 711 >ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma 
cacao] Length = 703 Score = 476 bits (1225), Expect = e-131 Identities = 287/677 (42%), Positives = 394/677 (58%), Gaps = 51/677 (7%) Frame = -2 Query: 2469 RTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVC 2290 ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRSDY+DV+T+R+I+N C Sbjct: 50 KSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109 Query: 2289 GYPLCNNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSV 2110 GYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ NSRAFAGSL +RCSV Sbjct: 110 GYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSV 169 Query: 2109 YNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKAGDVLLENSSPFFSIE 1942 N + L +N+ LG DL +K E+VKA DV L + P +IE Sbjct: 170 LNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSL--AGPSNAIE 226 Query: 1941 GYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK------------GKIV---------- 1831 GY+P+ + + S++S+L K G I+ Sbjct: 227 GYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK 286 Query: 1830 ---------------------NEMDLTCNLVAADQFTGPKLFSIFKENGAETKSDEVKHK 1714 NEMD T ++ D++T K+ S K++ ++ EV+ K Sbjct: 287 PGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEK 346 Query: 1713 PFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXXXXXX 1540 + SGS + + K+S + + + +N Q+ + + Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNV----------YQSGLDTSSAEAEKE 396 Query: 1539 XXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK-DNG 1363 K S+E LKSSLK GAK L++ VTWADK K ++ +G+LC +E+ +K D+ Sbjct: 397 THADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSE 456 Query: 1362 SSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDX 1183 S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA + + D Sbjct: 457 ISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA-----------VCEVDK 505 Query: 1182 XXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALF 1003 ET P+K P +PG+ +S +F+PE+SW+DAPP GFSLTLS+FATMW ALF Sbjct: 506 EEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALF 565 Query: 1002 GWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELV 823 
WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIKE+LA ++R +P +V Sbjct: 566 EWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIV 625 Query: 822 AELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPGLAPHM 643 +L+L P+S LE GM L+DT+SFM+ LP+ +QW V+ LLFIDALSVCRIP L PHM Sbjct: 626 TDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHM 685 Query: 642 TSMRALLHKVLDSAQVT 592 T+ R LLHKVLD AQ++ Sbjct: 686 TNGRMLLHKVLDGAQIS 702 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 476 bits (1225), Expect = e-131 Identities = 284/673 (42%), Positives = 393/673 (58%), Gaps = 26/673 (3%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K QS+ +KD V+K+QL+L EGI +EN LFAA SLMSRSDY+DV+T+RSI ++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 ++ LP++ R+ YR SLK K YD E Y +CS C+ NSRAF+G L +RCSV N Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVKA--GDVLLEN-SSPFFSIEGYIPEV 1924 L E +G D + L+IQEK+++ G+V +E P +IEGY+P Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1923 DGVLXXXXXXXXXXGAQSNNSRLQK-GKGK-IVNEMDLTCNLVAADQFTGPKLFSIFKEN 1750 D + + ++++ G GK ++ +T ++ ++++ K+ S KE Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEM 239 Query: 1749 GAETKSD------------------EVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAA 1624 +T S E H P +V GS+ ++K S + + Sbjct: 240 ALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATK--EST 297 Query: 1623 ENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTW 1444 +N N + + T LKSSLK PG K L +SVTW Sbjct: 298 DNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGT------ELKSSLKKPGKKNLCRSVTW 351 Query: 1443 ADKIKVNDTDSGDLCTFQEINDIKDNGSSRNSNV---EDVDASLRLASAEACVLALSQSA 1273 AD+ K +D +L E+ K+ + ++ V D + LR+ SAEAC +ALSQ+A Sbjct: 352 
ADE-KTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAA 410 Query: 1272 EAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKL 1093 EA++SG+ ++SDA S++GIIILP +D + + K N+ G+ S L Sbjct: 411 EAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDL 469 Query: 1092 FDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEY 913 FDP +SWYDAPP GFSLTLSSFATMW A+F W+TSSSLAYIYG+D HEEF+ ++GKEY Sbjct: 470 FDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEY 529 Query: 912 PRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLP 733 P K S+DGRSSEIK++LAG L R +P L +EL L P+S LE+GM LLDTM+F+D LP Sbjct: 530 PSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALP 589 Query: 732 SLGTEQWHVLTLLFIDALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIP 553 + +QW V+ LLFI+ALSV RIP LA HM+S R L HKVLD AQ+ +EYE M+D I+P Sbjct: 590 AFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILP 649 Query: 552 LGRRPQFSAQSGA 514 LGR Q S ++ A Sbjct: 650 LGRTAQLSDENDA 662 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 464 bits (1194), Expect = e-128 Identities = 286/702 (40%), Positives = 397/702 (56%), Gaps = 55/702 (7%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K+QS +KD ++K+QLSL +GI +E+ L AA S+MS SDY+DV+T+R+I N+CGYPLC Sbjct: 1 MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 N LP++ P+K YR SLK K YD E Y++CS CV NSR F+GSL +RC V N Sbjct: 61 GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVKA--GDVLLEN-SSPFFSIEGYIP 1930 L LG DL + LKI+EK + G+V E P +IEGY+P Sbjct: 121 LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180 Query: 1929 EVDGVLXXXXXXXXXXGAQ-------------------SNNSRLQKGKGK---------- 1837 + D + + + + + QK K K Sbjct: 181 
QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240 Query: 1836 ------------IVNEMDLTCNLVAA-DQFTGPKLFSIFKENGAETK----SDEVKHKPF 1708 +N+M+ T ++ D+++ K S ++TK ++V K Sbjct: 241 AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300 Query: 1707 SDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLN--GSMQNVSEIAXXXXXXXXX 1534 + ++ + S+T K + RS A ++ + L + S Q S Sbjct: 301 ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360 Query: 1533 XXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIKDNGSSR 1354 K A+ E++LK SLK GAK L++SVTWAD+ KV + S DLC + + D K G Sbjct: 361 SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADE-KVGSSGSRDLCEVRGMEDTKA-GPEI 418 Query: 1353 NSNVEDVDASL--RLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDXX 1180 N++ D + SAEAC ALSQ+AEAV+SG+ D S+A S++G++ILPQ +D D Sbjct: 419 VDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQG 478 Query: 1179 XXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALFG 1000 E+ +K P +PG+ S+ FDPENSWYDAPP GFSL LSSFAT+W ALF Sbjct: 479 DPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFA 538 Query: 999 WITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELVA 820 W+TSSSLAY+YG+D SSHEE++ VNG+EYPRK DGRS EI++++ G L R P +VA Sbjct: 539 WVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVA 598 Query: 819 ELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIPGLAPHMT 640 +L+L P+S LE G LL TMSF+D +P+ +QW V+ LLFI+ALSVCRIP L +M Sbjct: 599 DLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMD 658 Query: 639 SMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSGA 514 + R V+D +++ EEYE MKD++IPLGR PQFS QSGA Sbjct: 659 NRR----MVVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSGA 696 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 447 bits (1150), Expect = e-122 Identities = 273/673 (40%), Positives = 386/673 (57%), Gaps = 26/673 (3%) Frame = -2 Query: 2454 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 2275 + K QS+ +KD V+K+QL+L EGI +EN LFAA 
SLMSRSDY+DV+T+RSI ++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 2274 NNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 2095 ++ LP++ R+ YR SLK K YD E Y +CS C+ NSRAF+G L +RCSV N Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 2094 XXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVKA--GDVLLEN-SSPFFSIEGYIPEV 1924 L E +G D + L+IQEK+++ G+V +E P +IEGY+P Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1923 DGVLXXXXXXXXXXGAQSNNSRLQK-GKGK-IVNEMDLTCNLVAADQFTGPKLFSIFKEN 1750 D + + ++++ G GK ++ T ++ ++++ K+ S KE Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEM 239 Query: 1749 GAETKSD------------------EVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAA 1624 +T S E H P +V GS+ ++K S + Sbjct: 240 ALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKE---- 295 Query: 1623 ENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSL---KPPGAKTLSQS 1453 S N+S+ + SN + +L +P KT S Sbjct: 296 ------------STDNLSDAP-------------STSNNRSTNFNLMTEEPRDEKTDDAS 330 Query: 1452 VTWADKIKVNDTDSGDLCTFQEINDIKDNGSSRNSNVEDVDASLRLASAEACVLALSQSA 1273 + +N + G++ +E + N + +++ ED+ LR+ SAEAC +ALSQ+A Sbjct: 331 I-------MNLPEVGEMGKTKECSRTTSNLVNFDNDNEDL---LRVESAEACAMALSQAA 380 Query: 1272 EAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKL 1093 +A++SG+ ++SDA S++GIIILP +D + + K N+ G+ S L Sbjct: 381 KAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDL 439 Query: 1092 FDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEY 913 FDP +SWYDAPP GFSLTLSSFATMW A+F W+TSSSLAYIYG+D HEEF+ ++GKEY Sbjct: 440 FDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEY 499 Query: 912 PRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLP 733 P K S+DGRSSEIK++LAG L R +P L +EL L P+S LE+GM LLDTM+F+D LP Sbjct: 500 PSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALP 559 Query: 732 SLGTEQWHVLTLLFIDALSVCRIPGLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIP 553 + +QW V+ LLFI+ALSV RIP LA HM+S R L HKVLD AQ+ +EYE M+D I+P 
Sbjct: 560 AFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILP 619 Query: 552 LGRRPQFSAQSGA 514 LGR Q S ++ A Sbjct: 620 LGRTAQLSDENDA 632 >ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao] gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 439 bits (1129), Expect = e-120 Identities = 264/639 (41%), Positives = 370/639 (57%), Gaps = 51/639 (7%) Frame = -2 Query: 2469 RTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVC 2290 ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRSDY+DV+T+R+I+N C Sbjct: 50 KSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109 Query: 2289 GYPLCNNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSV 2110 GYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ NSRAFAGSL +RCSV Sbjct: 110 GYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSV 169 Query: 2109 YNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKAGDVLLENSSPFFSIE 1942 N + L +N+ LG DL +K E+VKA DV L + P +IE Sbjct: 170 LNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSL--AGPSNAIE 226 Query: 1941 GYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK------------GKIV---------- 1831 GY+P+ + + S++S+L K G I+ Sbjct: 227 GYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK 286 Query: 1830 ---------------------NEMDLTCNLVAADQFTGPKLFSIFKENGAETKSDEVKHK 1714 NEMD T ++ D++T K+ S K++ ++ EV+ K Sbjct: 287 PGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEK 346 Query: 1713 PFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXXXXXX 1540 + SGS + + K+S + + + +N Q+ + + Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNV----------YQSGLDTSSAEAEKE 396 Query: 1539 XXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK-DNG 1363 K S+E LKSSLK GAK L++ VTWADK K ++ +G+LC +E+ +K D+ Sbjct: 397 THADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSE 456 Query: 1362 SSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDX 1183 S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA ++G+IILP L + D Sbjct: 457 
ISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDK 516 Query: 1182 XXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALF 1003 ET P+K P +PG+ +S +F+PE+SW+DAPP GFSLTLS+FATMW ALF Sbjct: 517 EEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALF 576 Query: 1002 GWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELV 823 WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIKE+LA ++R +P +V Sbjct: 577 EWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIV 636 Query: 822 AELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHV 706 +L+L P+S LE GM L+DT+SFM+ LP+ +QW + Sbjct: 637 TDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWEI 675 >ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao] gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 438 bits (1126), Expect = e-120 Identities = 264/637 (41%), Positives = 369/637 (57%), Gaps = 51/637 (8%) Frame = -2 Query: 2469 RTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVC 2290 ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRSDY+DV+T+R+I+N C Sbjct: 50 KSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109 Query: 2289 GYPLCNNPLPAEWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSV 2110 GYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ NSRAFAGSL +RCSV Sbjct: 110 GYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSV 169 Query: 2109 YNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKAGDVLLENSSPFFSIE 1942 N + L +N+ LG DL +K E+VKA DV L + P +IE Sbjct: 170 LNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSL--AGPSNAIE 226 Query: 1941 GYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK------------GKIV---------- 1831 GY+P+ + + S++S+L K G I+ Sbjct: 227 GYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK 286 Query: 1830 ---------------------NEMDLTCNLVAADQFTGPKLFSIFKENGAETKSDEVKHK 1714 NEMD T ++ D++T K+ S K++ ++ EV+ K Sbjct: 287 PGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEK 346 Query: 1713 PFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXXXXXX 1540 + SGS + + K+S + + + +N Q+ + + 
Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNV----------YQSGLDTSSAEAEKE 396 Query: 1539 XXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK-DNG 1363 K S+E LKSSLK GAK L++ VTWADK K ++ +G+LC +E+ +K D+ Sbjct: 397 THADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSE 456 Query: 1362 SSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDX 1183 S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA ++G+IILP L + D Sbjct: 457 ISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDK 516 Query: 1182 XXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALF 1003 ET P+K P +PG+ +S +F+PE+SW+DAPP GFSLTLS+FATMW ALF Sbjct: 517 EEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALF 576 Query: 1002 GWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELV 823 WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIKE+LA ++R +P +V Sbjct: 577 EWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIV 636 Query: 822 AELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQW 712 +L+L P+S LE GM L+DT+SFM+ LP+ +QW Sbjct: 637 TDLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673 >gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus] Length = 597 Score = 434 bits (1115), Expect = e-118 Identities = 268/648 (41%), Positives = 370/648 (57%), Gaps = 8/648 (1%) Frame = -2 Query: 2436 ISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNPLPA 2257 + +KDAVHK+QLSL EGI HE+ L AA SL+S+SDY DV+T+R+I +VCGYPLC N LP+ Sbjct: 7 LGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPS 66 Query: 2256 EWPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXXXXX 2077 E PRK HYR SLK K YD E +++CS C+ SRAF SL +R S + Sbjct: 67 EPPRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSS--SLDPAKINSV 124 Query: 2076 XXXEQGLKENEYLGVTDD----LTELKIQEKVKAGD---VLLENSSPFFSIEGYIPEVDG 1918 GL + +G+ L+ LKI+EK+ G L E P +I+GY+P D Sbjct: 125 LKMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRD- 183 Query: 1917 VLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAET 1738 +SN+++ + +++ T ++ D+++ K ++ +E + Sbjct: 184 
-QNSERKQPSRKKTESNHAKPNLA-DTLPFDVNFTSTIIMQDEYSVSKT-AVPREAKGKV 240 Query: 1737 KSDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAX 1558 K ++ K IS ++T G QN Sbjct: 241 KGKMIR---------------------KSVKAEKISVLDDTA-------GPSQN------ 266 Query: 1557 XXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEIND 1378 + LKSSLK +K ++SVTWAD + +D D + +EI D Sbjct: 267 ---------------DTTLLKSSLKTLDSKKETRSVTWAD--EKSDGDGKSISECREIGD 309 Query: 1377 IKDNGSSRNSNVEDV-DASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQ 1201 K + EDV D S R SAEAC ALSQ++EAV+SG+ D SDA S++G+IILP Sbjct: 310 NKGAVVMPHLTDEDVGDESYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIILPP 369 Query: 1200 LNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFAT 1021 ++ D + + LK P +PG + LFD E+SWYD+PP GF+LTLS F+T Sbjct: 370 PHEVDEAKYEQIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFST 429 Query: 1020 MWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILAR 841 M+ +LF WI+SSSLAYIYG++ HE+++S+NG+EYP K DGRS+E+K +LAG LAR Sbjct: 430 MFMSLFAWISSSSLAYIYGKEERFHEDYLSINGREYPPK-IIIDGRSAEVKHTLAGCLAR 488 Query: 840 MVPELVAELKLRAPLSALEHGMECLLDTMSFMDPLPSLGTEQWHVLTLLFIDALSVCRIP 661 +P LV+E+++ P+S +E GM LLDTMSF D LP +QW V+ LLF+DALSV RIP Sbjct: 489 ALPGLVSEIRIPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVSRIP 548 Query: 660 GLAPHMTSMRALLHKVLDSAQVTWEEYESMKDVIIPLGRRPQFSAQSG 517 L+P+MT R LL KVL+ AQ+ EE+E MKD+IIPLGR PQFS QSG Sbjct: 549 ALSPYMTGRRILLPKVLEGAQINVEEFEIMKDLIIPLGRVPQFSTQSG 596