BLASTX nr result
ID: Cocculus23_contig00010225
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00010225
         (2774 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]    467   e-129
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...    467   e-128
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...    422   e-116
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...    425   e-116
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...    422   e-115
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...    422   e-115
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...    402   e-110
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...    394   e-106
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...    392   e-106
ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phas...    392   e-106
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...    390   e-105
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...    389   e-105
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...    384   e-103
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...    384   e-103
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...    382   e-103
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]      379   e-102
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...    379   e-102
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...
376 e-101 ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun... 371 e-100 ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni... 353 2e-94 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 467 bits (1202), Expect(2) = e-129 Identities = 272/571 (47%), Positives = 360/571 (63%), Gaps = 14/571 (2%) Frame = -1 Query: 1721 EQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNP 1542 +Q I++KDAVHK+QL L EGI +EN LFAA SLMSRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 1541 LPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXX 1362 LP+ER RK HYR SLK K YD E Y++CS GCV NSR+FAGSL +RCSV NS Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 1361 XXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIPEVD 1197 E L+ N+ LG DL +ELKI+E V KAG+V +E+ P +IEGY+P+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1196 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1017 L G++S+NS++ GK +++EMD ++ D+++ K K+ + Sbjct: 184 RNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSH 243 Query: 1016 TKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAENTQFGELLLNG-S 861 KS E K K S G +S P + SE+K +ES R +F + Sbjct: 244 AKSKEPKEKA-SIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVP 302 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ SE+ AQ KSSLKP G K + +SVTWAD+ K++ DS D Sbjct: 303 SQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE-KMDSADSRD 361 Query: 680 LCTFQEINDIKDNGSSRNS-NVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 C +E+ K++ + +V D D +LR ASAEAC +ALSQ+AEAV+SGE D++DA S Sbjct: 362 FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVS 421 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++GIIILP D D E VPLK P +PG+ +S +FD ++SWYD PP GF Sbjct: 422 EAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGF 481 Query: 323 
SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS FATMW ALF WITSSS+AYIYGRD S HEE++SVNG+EYP+K +DGRSSEIK Sbjct: 482 SLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIK 541 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 ++LAG L+R +P LVA+L+L P+S LE G+ Sbjct: 542 QTLAGCLSRALPGLVADLRLPIPVSNLEQGV 572 Score = 25.0 bits (53), Expect(2) = e-129 Identities = 10/14 (71%), Positives = 11/14 (78%) Frame = -2 Query: 49 IPGLAPHMTSMRVL 8 IP L PHMTS R+L Sbjct: 609 IPALTPHMTSRRML 622 Score = 186 bits (473), Expect = 4e-44 Identities = 127/351 (36%), Positives = 187/351 (53%), Gaps = 9/351 (2%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 +L+LF E S G SELKI EN+ +AGEV +EDW+GPSNAIEGYVP D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTK 2412 + KP + KEG +S+ V++EMDF STII D++++ K S G + + + Sbjct: 184 RNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSH 243 Query: 2411 KRTEELKR-----NQFSIHEATSAPVQNGSEIQLKESKLEDGNAASKAQKAEHEKSHGPT 2247 +++E K +Q S+ E ++ P+QN SE +L+ESK K + + E P+ Sbjct: 244 AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303 Query: 2246 QSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNL 2067 QS S+ + + K++ +E A T KSSL SG ++ RSVTWADE K+D+ + Sbjct: 304 QSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE-KMDSADSRDF 362 Query: 2066 NSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTD--- 1896 V+E++ + N +V D AS A + SQ A+A+ E+D+TD Sbjct: 363 CKVRELE-VKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAVS 421 Query: 1895 DARPIIKLQP-DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 +A II P D EG ++ D+ E PL+ P KPG S+ FD+ DS Sbjct: 422 EAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDS 472 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 467 bits (1201), Expect = e-128 
Identities = 272/571 (47%), Positives = 360/571 (63%), Gaps = 14/571 (2%) Frame = -1 Query: 1721 EQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNP 1542 +Q I++KDAVHK+QL L EGI +EN LFAA SLMSRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 1541 LPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXX 1362 LP+ER RK HYR SLK K YD E Y++CS GCV NSR+FAGSL +RCSV NS Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 1361 XXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIPEVD 1197 E L+ N+ LG DL +ELKI+E V KAG+V +E+ P +IEGY+P+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1196 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1017 L G++S+NS++ GK +++EMD ++ D+++ K K+ + Sbjct: 184 RNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSH 243 Query: 1016 TKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAENTQFGELLLNG-S 861 KS E K K S G +S P + SE+K +ES R +F + Sbjct: 244 AKSKEPKEKA-SIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVP 302 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ SE+ AQ LKS LKP G K +++SVTWAD+ K++ DS D Sbjct: 303 SQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADSRD 361 Query: 680 LCTFQEINDIKDNGSSRNS-NVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 C +E+ K++ + +V D D +LR ASAEAC +ALSQ+AEAV+SGE D++DA S Sbjct: 362 FCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVS 421 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++ IIILP D D E VPLK P +PG+ +S +FD ++SWYD PP GF Sbjct: 422 EARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGF 481 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS FATMW ALF WITSSS+AYIYGRD S HEE++SVNG+EYP+K +DGRSSEIK Sbjct: 482 SLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIK 541 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 ++LAG LAR +P LVA+L+L P+S LE G+ Sbjct: 542 
QTLAGCLARALPGLVADLRLPIPVSNLEQGV 572 Score = 191 bits (486), Expect = 1e-45 Identities = 128/351 (36%), Positives = 189/351 (53%), Gaps = 9/351 (2%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 +L+LF E S G SELKI EN+ +AGEV +EDW+GPSNAIEGYVP D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTK 2412 + KP + RKEG +S+ V++EMDF TII D++++ K S G + + + Sbjct: 184 RNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSH 243 Query: 2411 KRTEELKR-----NQFSIHEATSAPVQNGSEIQLKESKLEDGNAASKAQKAEHEKSHGPT 2247 +++E K +Q S+ E ++ P+QN SE +L+ESK K + + E P+ Sbjct: 244 AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303 Query: 2246 QSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNL 2067 QS S+ + + K++ +E A T LKS L SG ++ +RSVTWADE K+D+ + Sbjct: 304 QSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADSRDF 362 Query: 2066 NSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTD--- 1896 V+E++ + N +V D AS A + SQ A+A+ E+D+TD Sbjct: 363 CKVRELE-VKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAVS 421 Query: 1895 DARPIIKLQP-DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 +AR II P D EG ++ D+ E PL+ P KPG S+ FD+ DS Sbjct: 422 EARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDS 472 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 422 bits (1085), Expect(2) = e-116 Identities = 258/631 (40%), Positives = 360/631 (57%), Gaps = 51/631 (8%) Frame = -1 Query: 1790 KPGALISESFDAKDSRTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRS 1611 KP +L S + ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRS Sbjct: 35 KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94 Query: 1610 DYDDVITKRSITNVCGYPLCNNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTN 1431 DY+DV+T+R+I+N CGYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ N Sbjct: 95 
DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154 Query: 1430 SRAFAGSLPVKRCSVYNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKA 1263 SRAFAGSL +RCSV N + L +N+ LG DL +K E+VKA Sbjct: 155 SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKA 213 Query: 1262 GDVLLENSSPFFSIEGYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK----------- 1119 DV L + P +IEGY+P+ + + S++S+L K Sbjct: 214 EDVSL--AGPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271 Query: 1118 -GKIV-------------------------------NEMDLTCNLVAADQFTGPKLFSIF 1035 G I+ NEMD T ++ D++T K+ S Sbjct: 272 AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331 Query: 1034 KENGAETKSDEVKHKPFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGS 861 K++ ++ EV+ K + SGS + + K+S + + + +N Sbjct: 332 KQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNV---------- 381 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ + + K S+E LKSSLK GAK L++ VTWADK K ++ +G+ Sbjct: 382 YQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGN 441 Query: 680 LCTFQEINDIK-DNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 LC +E+ +K D+ S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA Sbjct: 442 LCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVY 501 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++G+IILP L + D ET P+K P +PG+ +S +F+PE+SW+DAPP GF Sbjct: 502 ENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGF 561 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIK Sbjct: 562 SLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIK 621 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 E+LA ++R +P +V +L+L P+S LE GM Sbjct: 622 ETLASCISRALPAIVTDLRLPIPISTLEQGM 652 Score = 27.7 bits (60), Expect(2) = e-116 Identities = 11/16 (68%), Positives = 13/16 (81%) Frame = -2 Query: 49 IPGLAPHMTSMRVLLH 2 IP L PHMT+ R+LLH Sbjct: 689 IPALTPHMTNGRMLLH 704 Score = 135 bits (341), Expect = 8e-29 Identities = 109/357 (30%), 
Positives = 168/357 (47%), Gaps = 38/357 (10%) Frame = -1 Query: 2702 FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKE---GPESNXX 2532 FS L+I+EN V+A +V L GPSNAIEGYVP + SKP + K S+ Sbjct: 200 FSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKL 256 Query: 2531 XXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTKK---------------RTEE 2397 V NE+DF TII+ D++ + K +Q D K +E Sbjct: 257 GSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEI 316 Query: 2396 LKRNQFSIHEATSAPVQNGSEIQLKE-------SKLEDGNAASKAQKAEHEK-------- 2262 + ++++I + S Q+ + LKE ED S + A EK Sbjct: 317 IMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELP 376 Query: 2261 -SHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 + QS D+S ++++ ++K SSET LKSSL +G ++ +R VTWAD++K DN Sbjct: 377 STKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADN 436 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESD 1905 NL V+EM+++ + S S+ D S A + S+ A+A+ +SD Sbjct: 437 AGNGNLCEVKEMETMKGDSEIS-GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSD 495 Query: 1904 VTD----DARPIIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 VTD + I+ + + P + D+ E AP++ PKKPG S+ F+ +DS Sbjct: 496 VTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDS 552 >ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao] Length = 607 Score = 425 bits (1092), Expect = e-116 Identities = 258/620 (41%), Positives = 357/620 (57%), Gaps = 51/620 (8%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRSDY+DV+T+R+I+N CGYPLC Sbjct: 1 MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 NPLP+E RK YR SLK K YD +E Y+FCS C+ NSRAFAGSL +RCSV N Sbjct: 61 ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKAGDVLLENSSPFFSIEGYIPE 
1203 + L +N+ LG DL +K E+VKA DV L + P +IEGY+P+ Sbjct: 121 LNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKAEDVSL--AGPSNAIEGYVPQ 177 Query: 1202 VDGVLXXXXXXXXXXGA-QSNNSRLQKGK------------GKIV--------------- 1107 + + S++S+L K G I+ Sbjct: 178 RELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFK 237 Query: 1106 ----------------NEMDLTCNLVAADQFTGPKLFSIFKENGAETKSDEVKHKPFSDG 975 NEMD T ++ D++T K+ S K++ ++ EV+ K Sbjct: 238 QGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKD 297 Query: 974 AAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXK 801 + SGS + + K+S + + + +N Q+ + + K Sbjct: 298 SEDKCVISGSSSALREKDSSIVELPSTKNV----------YQSGLDTSSAEAEKETHADK 347 Query: 800 TAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK-DNGSSRNS 624 S+E LKSSLK GAK L++ VTWADK K ++ +G+LC +E+ +K D+ S ++ Sbjct: 348 AVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSA 407 Query: 623 NVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDXXXXXX 444 D LR SAEAC +ALS++AEAV+SG+ D++DA ++G+IILP L + D Sbjct: 408 EDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPME 467 Query: 443 XXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALFGWITS 264 ET P+K P +PG+ +S +F+PE+SW+DAPP GFSLTLS+FATMW ALF WITS Sbjct: 468 DGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITS 527 Query: 263 SSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELVAELKL 84 SSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIKE+LA ++R +P +V +L+L Sbjct: 528 SSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRL 587 Query: 83 RAPLSALEHGMNSWPCSTHD 24 P+S LE GMN+ P ST+D Sbjct: 588 PIPISTLEQGMNTCPHSTYD 607 Score = 135 bits (341), Expect = 8e-29 Identities = 109/357 (30%), Positives = 168/357 (47%), Gaps = 38/357 (10%) Frame = -1 Query: 2702 FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKE---GPESNXX 2532 FS L+I+EN V+A +V L GPSNAIEGYVP + SKP + K S+ Sbjct: 146 FSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKL 202 Query: 2531 XXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTKK---------------RTEE 2397 V NE+DF TII+ D++ + K +Q D K 
+E Sbjct: 203 GSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEI 262 Query: 2396 LKRNQFSIHEATSAPVQNGSEIQLKE-------SKLEDGNAASKAQKAEHEK-------- 2262 + ++++I + S Q+ + LKE ED S + A EK Sbjct: 263 IMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELP 322 Query: 2261 -SHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 + QS D+S ++++ ++K SSET LKSSL +G ++ +R VTWAD++K DN Sbjct: 323 STKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADN 382 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESD 1905 NL V+EM+++ + S S+ D S A + S+ A+A+ +SD Sbjct: 383 AGNGNLCEVKEMETMKGDSEIS-GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSD 441 Query: 1904 VTD----DARPIIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 VTD + I+ + + P + D+ E AP++ PKKPG S+ F+ +DS Sbjct: 442 VTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDS 498 >ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao] gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 422 bits (1085), Expect = e-115 Identities = 258/631 (40%), Positives = 360/631 (57%), Gaps = 51/631 (8%) Frame = -1 Query: 1790 KPGALISESFDAKDSRTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRS 1611 KP +L S + ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRS Sbjct: 35 KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94 Query: 1610 DYDDVITKRSITNVCGYPLCNNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTN 1431 DY+DV+T+R+I+N CGYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ N Sbjct: 95 DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154 Query: 1430 SRAFAGSLPVKRCSVYNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKA 1263 SRAFAGSL +RCSV N + L +N+ LG DL +K E+VKA Sbjct: 155 SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKA 213 Query: 1262 GDVLLENSSPFFSIEGYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK----------- 1119 DV L + P +IEGY+P+ + + S++S+L K Sbjct: 214 EDVSL--AGPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271 Query: 1118 
-GKIV-------------------------------NEMDLTCNLVAADQFTGPKLFSIF 1035 G I+ NEMD T ++ D++T K+ S Sbjct: 272 AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331 Query: 1034 KENGAETKSDEVKHKPFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGS 861 K++ ++ EV+ K + SGS + + K+S + + + +N Sbjct: 332 KQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNV---------- 381 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ + + K S+E LKSSLK GAK L++ VTWADK K ++ +G+ Sbjct: 382 YQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGN 441 Query: 680 LCTFQEINDIK-DNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 LC +E+ +K D+ S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA Sbjct: 442 LCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVY 501 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++G+IILP L + D ET P+K P +PG+ +S +F+PE+SW+DAPP GF Sbjct: 502 ENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGF 561 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIK Sbjct: 562 SLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIK 621 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 E+LA ++R +P +V +L+L P+S LE GM Sbjct: 622 ETLASCISRALPAIVTDLRLPIPISTLEQGM 652 Score = 135 bits (341), Expect = 8e-29 Identities = 109/357 (30%), Positives = 168/357 (47%), Gaps = 38/357 (10%) Frame = -1 Query: 2702 FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKE---GPESNXX 2532 FS L+I+EN V+A +V L GPSNAIEGYVP + SKP + K S+ Sbjct: 200 FSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKL 256 Query: 2531 XXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTKK---------------RTEE 2397 V NE+DF TII+ D++ + K +Q D K +E Sbjct: 257 GSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEI 316 Query: 2396 LKRNQFSIHEATSAPVQNGSEIQLKE-------SKLEDGNAASKAQKAEHEK-------- 2262 + ++++I + S Q+ + LKE ED S + A EK Sbjct: 317 IMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELP 376 Query: 2261 
-SHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 + QS D+S ++++ ++K SSET LKSSL +G ++ +R VTWAD++K DN Sbjct: 377 STKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADN 436 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESD 1905 NL V+EM+++ + S S+ D S A + S+ A+A+ +SD Sbjct: 437 AGNGNLCEVKEMETMKGDSEIS-GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSD 495 Query: 1904 VTD----DARPIIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 VTD + I+ + + P + D+ E AP++ PKKPG S+ F+ +DS Sbjct: 496 VTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDS 552 >ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao] gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 422 bits (1085), Expect = e-115 Identities = 258/631 (40%), Positives = 360/631 (57%), Gaps = 51/631 (8%) Frame = -1 Query: 1790 KPGALISESFDAKDSRTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRS 1611 KP +L S + ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRS Sbjct: 35 KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94 Query: 1610 DYDDVITKRSITNVCGYPLCNNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTN 1431 DY+DV+T+R+I+N CGYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ N Sbjct: 95 DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154 Query: 1430 SRAFAGSLPVKRCSVYNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKA 1263 SRAFAGSL +RCSV N + L +N+ LG DL +K E+VKA Sbjct: 155 SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKA 213 Query: 1262 GDVLLENSSPFFSIEGYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK----------- 1119 DV L + P +IEGY+P+ + + S++S+L K Sbjct: 214 EDVSL--AGPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271 Query: 1118 -GKIV-------------------------------NEMDLTCNLVAADQFTGPKLFSIF 1035 G I+ NEMD T ++ D++T K+ S Sbjct: 272 AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331 Query: 1034 KENGAETKSDEVKHKPFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGS 861 K++ ++ EV+ K + SGS + + K+S + + + +N Sbjct: 332 
KQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNV---------- 381 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ + + K S+E LKSSLK GAK L++ VTWADK K ++ +G+ Sbjct: 382 YQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGN 441 Query: 680 LCTFQEINDIK-DNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 LC +E+ +K D+ S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA Sbjct: 442 LCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVY 501 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++G+IILP L + D ET P+K P +PG+ +S +F+PE+SW+DAPP GF Sbjct: 502 ENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGF 561 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIK Sbjct: 562 SLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIK 621 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 E+LA ++R +P +V +L+L P+S LE GM Sbjct: 622 ETLASCISRALPAIVTDLRLPIPISTLEQGM 652 Score = 135 bits (341), Expect = 8e-29 Identities = 109/357 (30%), Positives = 168/357 (47%), Gaps = 38/357 (10%) Frame = -1 Query: 2702 FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKE---GPESNXX 2532 FS L+I+EN V+A +V L GPSNAIEGYVP + SKP + K S+ Sbjct: 200 FSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKL 256 Query: 2531 XXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTKK---------------RTEE 2397 V NE+DF TII+ D++ + K +Q D K +E Sbjct: 257 GSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEI 316 Query: 2396 LKRNQFSIHEATSAPVQNGSEIQLKE-------SKLEDGNAASKAQKAEHEK-------- 2262 + ++++I + S Q+ + LKE ED S + A EK Sbjct: 317 IMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELP 376 Query: 2261 -SHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 + QS D+S ++++ ++K SSET LKSSL +G ++ +R VTWAD++K DN Sbjct: 377 STKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADN 436 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESD 1905 NL V+EM+++ + S S+ D S A + 
S+ A+A+ +SD Sbjct: 437 AGNGNLCEVKEMETMKGDSEIS-GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSD 495 Query: 1904 VTD----DARPIIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 VTD + I+ + + P + D+ E AP++ PKKPG S+ F+ +DS Sbjct: 496 VTDAVYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDS 552 >ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 402 bits (1034), Expect(2) = e-110 Identities = 252/631 (39%), Positives = 352/631 (55%), Gaps = 51/631 (8%) Frame = -1 Query: 1790 KPGALISESFDAKDSRTSPTLVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRS 1611 KP +L S + ++S ++ KEQSIS+ +AVHKIQL L +GI E L A+ SL+SRS Sbjct: 35 KPPSLQQHSRERLSKKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRS 94 Query: 1610 DYDDVITKRSITNVCGYPLCNNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTN 1431 DY+DV+T+R+I+N CGYPLC NPLP+E RK YR SLK K YD +E Y+FCS C+ N Sbjct: 95 DYEDVVTERTISNTCGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLIN 154 Query: 1430 SRAFAGSLPVKRCSVYNSXXXXXXXXXXXEQGLKENEYLGVTDDL----TELKIQEKVKA 1263 SRAFAGSL +RCSV N + L +N+ LG DL +K E+VKA Sbjct: 155 SRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDND-LGKNGDLGFSNLRIKENEEVKA 213 Query: 1262 GDVLLENSSPFFSIEGYIPEVDGVLXXXXXXXXXXGA-QSNNSRLQKGK----------- 1119 DV L + P +IEGY+P+ + + S++S+L K Sbjct: 214 EDVSL--AGPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDF 271 Query: 1118 -GKIV-------------------------------NEMDLTCNLVAADQFTGPKLFSIF 1035 G I+ NEMD T ++ D++T K+ S Sbjct: 272 AGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGS 331 Query: 1034 KENGAETKSDEVKHKPFSDGAAVSPAFSGSET--KSKESDVRSISAAENTQFGELLLNGS 861 K++ ++ EV+ K + SGS + + K+S + + + +N Sbjct: 332 KQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNV---------- 381 Query: 860 MQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGD 681 Q+ + + K S+E LKSSLK GAK L++ VTWADK K ++ +G+ Sbjct: 382 YQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGN 441 Query: 680 
LCTFQEINDIK-DNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 LC +E+ +K D+ S ++ D LR SAEAC +ALS++AEAV+SG+ D++DA Sbjct: 442 LCEVKEMETMKGDSEISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA-- 499 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 + + D ET P+K P +PG+ +S +F+PE+SW+DAPP GF Sbjct: 500 ---------VCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGF 550 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS+FATMW ALF WITSSSLAYIYGRD S HEE++S+NG+EYPRK DGRSSEIK Sbjct: 551 SLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIK 610 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 E+LA ++R +P +V +L+L P+S LE GM Sbjct: 611 ETLASCISRALPAIVTDLRLPIPISTLEQGM 641 Score = 27.7 bits (60), Expect(2) = e-110 Identities = 11/16 (68%), Positives = 13/16 (81%) Frame = -2 Query: 49 IPGLAPHMTSMRVLLH 2 IP L PHMT+ R+LLH Sbjct: 678 IPALTPHMTNGRMLLH 693 Score = 136 bits (343), Expect = 5e-29 Identities = 109/353 (30%), Positives = 166/353 (47%), Gaps = 34/353 (9%) Frame = -1 Query: 2702 FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKE---GPESNXX 2532 FS L+I+EN V+A +V L GPSNAIEGYVP + SKP + K S+ Sbjct: 200 FSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKL 256 Query: 2531 XXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTKK---------------RTEE 2397 V NE+DF TII+ D++ + K +Q D K +E Sbjct: 257 GSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEI 316 Query: 2396 LKRNQFSIHEATSAPVQNGSEIQLKE-------SKLEDGNAASKAQKAEHEK-------- 2262 + ++++I + S Q+ + LKE ED S + A EK Sbjct: 317 IMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELP 376 Query: 2261 -SHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 + QS D+S ++++ ++K SSET LKSSL +G ++ +R VTWAD++K DN Sbjct: 377 STKNVYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADN 436 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESD 1905 NL V+EM+++ + S S+ D S A + S+ A+A+ +SD Sbjct: 437 AGNGNLCEVKEMETMKGDSEIS-GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSD 495 Query: 1904 
VTDDARPIIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 VTD + K + P + D+ E AP++ PKKPG S+ F+ +DS Sbjct: 496 VTDAVCEVDKEE-------PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDS 541 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 394 bits (1012), Expect = e-106 Identities = 242/583 (41%), Positives = 340/583 (58%), Gaps = 23/583 (3%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K +++++KDAVHK+QL L EGI E+ L AA SL+SRSDY DV+T+RSI N+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 +N LP+ER RK HYR SLK K YD E Y++CS CV NS AFAGSL +R S N Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDLTE--------LKIQEKV--KAGDVLLEN-SSPFFS 1224 L + +L DD+ E LKIQEKV K G+V LE P + Sbjct: 121 LNQVL------NLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNA 174 Query: 1223 IEGYIPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLF 1044 IEGY+P+ D + G+++ ++RLQ K I+NE D + ++ D+++ K Sbjct: 175 IEGYVPQRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFP 234 Query: 1043 SI--------FKENGAETKSDEVKHKPFSDGAAVSPAF--SGSETKSKESDVRSISAAEN 894 + FKE A+T+ + G V SG ET+ + + R + + Sbjct: 235 APVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKV-DK 293 Query: 893 TQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWAD 714 GE+ S +V + + + LKSSLK +K +S+SVTWAD Sbjct: 294 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKY-ASHGEHDKLKSSLKSSNSKKMSRSVTWAD 352 Query: 713 KIKVNDTDSGDLCTFQEINDIKDN--GSSRNSNVEDVDASLRLASAEACVLALSQSAEAV 540 + ++ + +I++ + G S ++++E+ D S R SAEAC ALSQ+AEAV Sbjct: 353 E-SIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAV 411 Query: 539 SSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDP 360 +SG D+ DA S +GI+ILP + D ET PLK P +PG+ N +F+ Sbjct: 412 ASGS-DVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFES 
470 Query: 359 ENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRK 180 E+SWYD+PP GF++TLS F TM+ +LF WI+SSSLA+IYG D S++EE++S+NG+EYPRK Sbjct: 471 EDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRK 530 Query: 179 NFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGM 51 SDGRS+EIK++LAG LAR +P LVA+L+L P+S LE GM Sbjct: 531 IVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGM 573 Score = 127 bits (319), Expect = 3e-26 Identities = 109/363 (30%), Positives = 165/363 (45%), Gaps = 20/363 (5%) Frame = -1 Query: 2774 EVLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNL 2595 +VL LF+ + G S+LKI+E ++++ GEV LE+W+GPSNAIEGYVP Sbjct: 123 QVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQR 182 Query: 2594 DFSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDT 2415 D S P + +G ++ ++NE DF+STII D+++V KF Sbjct: 183 DRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNADSN 242 Query: 2414 KKRTEELKRNQFSIHEATSAPVQNGSEIQLKESKLEDGNAASKAQKAEH----------E 2265 K E + ++ + + + Q+ +L G K+ K E Sbjct: 243 VKFKETQAKTRYKVRDDDVYILGK----QVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 298 Query: 2264 KSHGPTQ------SVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWAD 2103 S GP+Q SV SD+ K E LKSSL S ++ SRSVTWAD Sbjct: 299 VSSGPSQHDVKNKSVLIMSDDGRKYASHGE------HDKLKSSLKSSNSKKMSRSVTWAD 352 Query: 2102 ERKIDNISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAI 1923 E I +S + + S ++ S S+++E D S A SQ A+A+ Sbjct: 353 ESIDGGIGKKTESSSKISEYESQAYGGSA-STDMEENDDSYRFESAEACAAALSQAAEAV 411 Query: 1922 TFSESDVTD--DARPIIKLQP--DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDA 1755 S SDV D I+ L P + E Q +++ + APL+ P+KPG + F++ Sbjct: 412 A-SGSDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFES 470 Query: 1754 KDS 1746 +DS Sbjct: 471 EDS 473 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 392 bits (1008), Expect = e-106 Identities = 247/589 (41%), Positives = 333/589 (56%), Gaps = 31/589 (5%) Frame = -1 Query: 1724 
KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 1545 K+Q IS+KDAV K+QL+L EGI E+ LFAA SL+SRSDY+DV+T+RSIT VC YPLC N Sbjct: 3 KDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCN 62 Query: 1544 PLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXX 1365 LP+ERPRK YR SLK K YD E Y+FCS CV NS+AFAGSL KRC + Sbjct: 63 ALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLN 122 Query: 1364 XXXXXXXEQGLKENEYLGVTDD--LTELKIQEKVK-AGDVLLEN-SSPFFSIEGYIPEVD 1197 L+ E G + L+ L+IQ+K + +V LE P +IEGY+P+ Sbjct: 123 NILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVTEVSLEQWVGPSNAIEGYVPKKR 182 Query: 1196 GVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAE 1017 G+++++ + K I +E D ++ D+ +S+ K + + Sbjct: 183 DNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDE------YSVSKVSSGQ 236 Query: 1016 TKSDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIA 837 T + V H+ P E K+ D++ +S++ F L + + EIA Sbjct: 237 TDA-TVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSS----FASSLNLSASKKDKEIA 291 Query: 836 XXXXXXXXXXXKTAQSNENAL--------------------------KSSLKPPGAKTLS 735 +N+++ KSSLK G K L Sbjct: 292 KSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLG 351 Query: 734 QSVTWADKIKVNDTDSGDLCTFQEINDI-KDNGSSRNSNVEDVDASLRLASAEACVLALS 558 +SVTWADK K++ S DLC F+E +I K++ + N +V D + LR SAEAC +ALS Sbjct: 352 RSVTWADK-KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALS 410 Query: 557 QSAEAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFN 378 Q+AEAV+SG+ D DA S++GIIILP + ++V LK P +PG+ + Sbjct: 411 QAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISD 470 Query: 377 SKLFDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNG 198 LF ++SW+DAPP GFSLTLS FAT+W A F WITSSSLAYIYGRDVS +EEF+SV+G Sbjct: 471 FDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDG 530 Query: 197 KEYPRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGM 51 +EYP K SDGRSSEIK++LA LAR +P +VAELKL P+S LE GM Sbjct: 531 REYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGM 579 Score = 118 bits (296), Expect = 1e-23 Identities = 109/365 (29%), 
Positives = 167/365 (45%), Gaps = 23/365 (6%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 +L+LF + G S L+I++ EV LE W+GPSNAIEGYVP Sbjct: 124 ILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVPKKR 182 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDTK 2412 + +Q+ K+G +++ + +E DF STII+ D+++V K S G + Sbjct: 183 DNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDATVD 242 Query: 2411 ---KRTEELKRNQFSIHEATSAPVQNGSEIQ-LKESKLEDGNAASKAQKAEHEKS----- 2259 K T L++ + HE V+ +IQ L S N ++ + E KS Sbjct: 243 HQIKPTAILEQPKRVDHEL----VRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVL 298 Query: 2258 HGPTQSVSDSSDERS--------KQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWAD 2103 G T V+ + D + ++K+ EK S T KSSL +G ++ RSVTWAD Sbjct: 299 KGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWAD 358 Query: 2102 ERKIDNISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAAS--ELTPSQVAK 1929 +KID +L + +E +I K S + NV+VVD L S +A + SQ A+ Sbjct: 359 -KKIDGCGSTDLCAFKEFGNIK---KESDVADNVDVVDDEDILRSVSAEACAIALSQAAE 414 Query: 1928 AITFSESDVTDDARP----IIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESF 1761 A+ +SD D I+ +A E + D+ E L+ P+KPG + F Sbjct: 415 AVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDFDLF 474 Query: 1760 DAKDS 1746 + DS Sbjct: 475 ASDDS 479 >ref|XP_007145767.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] gi|561018957|gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] Length = 706 Score = 392 bits (1007), Expect = e-106 Identities = 250/620 (40%), Positives = 341/620 (55%), Gaps = 60/620 (9%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K++++S+KDAV K+Q+ L EGI +E+ LFAA SLMSRSDY+D++T+RSITNVCGYPLC Sbjct: 1 MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 N LP+ERPRK YR SLK K YD +E Y+FCS CV +S+AF+G L +RCS + Sbjct: 61 CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120 
Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKV--KAGDVLLEN-SSPFFSIEGYIP 1206 L++ E + DL + LKIQEK +G+V LE P +IEGY+P Sbjct: 121 LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180 Query: 1205 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKL------- 1047 + G+++ + + K I +EM+ ++ D+++ K Sbjct: 181 KPRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKASPGQTDT 240 Query: 1046 --FSIFKENGAETKSDE-----VKHKP----------FSDGAAVSPAFSGSETKS----- 933 K + + +E V K F G +S + G E Sbjct: 241 TAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASEKGKEVSKSCEVV 300 Query: 932 ---------KESDVRSISAAE--------NTQFGELLLNGSMQNV--------SEIAXXX 828 K+ D S+S +E N+ + L G V S Sbjct: 301 VKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDASTSNFDPDN 360 Query: 827 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI- 651 K E LKSSLK G K LS++VTWAD+ K+N + DLC +E DI Sbjct: 361 VKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADE-KINGAGNKDLCEVKEFGDII 419 Query: 650 KDNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLN 471 K++ S N +V + + LR ASAEAC +ALSQ++EAV+SG+ D +DA S++GIIILPQ + Sbjct: 420 KESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGIIILPQPH 479 Query: 470 DYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMW 291 D ++V LK P +PG+ + F+ ++SW+DAPP GFSLTLS FA MW Sbjct: 480 DAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTLSPFANMW 539 Query: 290 TALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMV 111 A+F W+TS SLAYIYGRD S HEE++SVNG+EYP K SDGRSSEIK++ AG LAR Sbjct: 540 NAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFAGCLARAF 599 Query: 110 PELVAELKLRAPLSALEHGM 51 P LVA L+L P+S LE GM Sbjct: 600 PALVAGLRLPIPISTLEQGM 619 Score = 130 bits (328), Expect = 3e-27 Identities = 116/400 (29%), Positives = 179/400 (44%), Gaps = 58/400 (14%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 VL LFE ++ G S LKI+E +GEV LE W+GPSNAIEGYVP Sbjct: 124 VLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVPKPR 183 Query: 2591 
FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYG------- 2433 ++ K+G ++ + +EM+F STII+ D+++V K S G Sbjct: 184 ERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKASPGQTDTTAH 243 Query: 2432 --------SEQSDTKKRTEELKRNQFSIHEATSA--------------PVQNGSEIQLKE 2319 Q + K + +++++ SI + +S+ V E+ +K Sbjct: 244 HQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASEKGKEVSKSCEVVVKS 303 Query: 2318 S--------------------KLEDGNAASKAQKAEHEKSH---GPTQSVSDSSDERSKQ 2208 + +E N+A K+ + + E S S S+ + K+ Sbjct: 304 TPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDASTSNFDPDNVKE 363 Query: 2207 KVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSISDSF 2028 K EK+ ET LKSSL +G ++ SR+VTWADE KI+ +L V+E D Sbjct: 364 KFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADE-KINGAGNKDLCEVKE---FGDII 419 Query: 2027 KNSRNSSNVEVVDGRPCL--ASEAASELTPSQVAKAITFSESDVTD---DARPIIKLQP- 1866 K S + N +V + L AS A + SQ ++A+ +SD TD +A II QP Sbjct: 420 KESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGIIILPQPH 479 Query: 1865 DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 DA E + D+ + L+ P+KPG + F++ DS Sbjct: 480 DAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDS 519 >ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Glycine max] Length = 706 Score = 390 bits (1003), Expect = e-105 Identities = 246/622 (39%), Positives = 345/622 (55%), Gaps = 64/622 (10%) Frame = -1 Query: 1724 KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 1545 K++ +S+KDAV K+Q+SL EGI +E+ LFAA SLMSRSDY+D++T+RSITNVCGYPLC+N Sbjct: 3 KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLCSN 62 Query: 1544 PLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCS------VY 1383 LP++RPRK YR SLK K YD E Y+FC CV +S+AFAGSL +RCS + Sbjct: 63 ALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEKLN 122 Query: 1382 NSXXXXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVK--AGDVLLEN-SSPFFSIEGY 1212 N + L++NE G++D LKIQEK + +G+V LE + P +IEGY Sbjct: 123 NILSLFENLNLEPAENLQKNEDFGLSD----LKIQEKTETSSGEVSLEQWAGPSNAIEGY 178 Query: 1211 
IPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLF---- 1044 +P+ G+++ + + I +EM ++ D ++ K+ Sbjct: 179 VPKPRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPGQR 238 Query: 1043 ---------------SIFKENGAETKSDEVKHKPFSDGAAVSPAFSGSETKS-------- 933 + K + + D+ + S S SE + Sbjct: 239 DATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEA 298 Query: 932 ----------KESDVRSISAAE--------NTQFGELLLNGSMQNV--------SEIAXX 831 K+ DV S+S +E ++ + + G M V S + Sbjct: 299 ALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPA 358 Query: 830 XXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI 651 K S KSSLK G K LS++VTWADK K+N T S DLC F+ DI Sbjct: 359 NVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADK-KINSTGSKDLCGFKNFGDI 417 Query: 650 KDNGSSRNSNVE--DVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQ 477 ++ S ++++ + + +LR ASAEACV+ALS ++EAV+SG+ D+SDA S++GIIILP Sbjct: 418 RNESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIIILPP 477 Query: 476 LNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFAT 297 +D ++V +K P +PG+ + F+ ++SW+DA P GFSLTLS FAT Sbjct: 478 PHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLSPFAT 537 Query: 296 MWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILAR 117 MW LF WITSSSLAYIYGRD S EE++SVNG+EYP K +DGRSSEIK++LA LAR Sbjct: 538 MWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLAR 597 Query: 116 MVPELVAELKLRAPLSALEHGM 51 +P LVA L+L P+S +E GM Sbjct: 598 ALPTLVAVLRLPIPVSTMEQGM 619 Score = 109 bits (273), Expect = 6e-21 Identities = 102/397 (25%), Positives = 168/397 (42%), Gaps = 55/397 (13%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 +L LFE ++ G S+LKI+E +GEV LE W GPSNAIEGYVP Sbjct: 124 ILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVPKPR 183 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSDT- 2415 ++ K+G ++ + +EM F STII+ D ++V K G + Sbjct: 184 DHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPGQRDATAH 243 Query: 2414 
-------------KKRTEELKRNQFSIHEATSA--------------PVQNGSEIQLKES 2316 K + ++++ SI + +S+ + E LK S Sbjct: 244 HQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEAALKSS 303 Query: 2315 --------------------KLEDGNAASKAQKAEHEKSH---GPTQSVSDSSDERSKQK 2205 +E ++A K+ + + + S S S+ ++K Sbjct: 304 PDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPANVEEK 363 Query: 2204 VSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSISDSFK 2025 EK S T KSSL +G ++ SR+VTWAD +KI++ +L + I + Sbjct: 364 FQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWAD-KKINSTGSKDLCGFKNFGDIRNESD 422 Query: 2024 NSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTD--DARPIIKLQP--DAS 1857 ++ NS +V + AS A + S ++A+ +SDV+D II L P DA Sbjct: 423 SAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIIILPPPHDAG 482 Query: 1856 EGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 E + D+ + ++ P+KPG ++ F++ DS Sbjct: 483 EEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDS 519 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 389 bits (999), Expect(2) = e-105 Identities = 238/568 (41%), Positives = 330/568 (58%), Gaps = 9/568 (1%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + KE+S+S+KD V+K+QLSL EGI +E+ L AA SLMSRSDY+DV+ +RSI+N+CGYPLC Sbjct: 1 MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 NN LP++RP K YR SLK + YD +E Y++CS C+ NSRAF+ SL KRCSV N Sbjct: 61 NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEK--VKAGDVLLEN-SSPFFSIEGYIP 1206 + L ++E LG + DL + LKIQEK G V LE P +IEGY+P Sbjct: 121 LNEILRKFNDLTL-DSEGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179 Query: 1205 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKEN 1026 + D + K + ++ D T ++ D+++ K + Sbjct: 180 QGDRDPNPSLKNHKEGLKAICKKPVSK-QDCFFSDTDFTSTIITNDEYSISK-----GPS 233 Query: 1025 
GAETKSDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELL---LNGSMQ 855 G + + ++K + G + + K+ +++ ++ + +++ LN Sbjct: 234 GLTSTASDIKLQA-QTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDL 292 Query: 854 NVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLC 675 S A NE+ LK SLK GAK ++SVTWAD+ +V++ S +LC Sbjct: 293 PSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADE-RVDNAGSRNLC 351 Query: 674 TFQEINDIKDNGS-SRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAESDS 498 QE+ ++ S ++N D LR SAEAC +ALSQ+AEAV+SG+ D++ A S++ Sbjct: 352 EVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEA 411 Query: 497 GIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSL 318 GII+LP D E+ LK P +PG+ S LFDPE+SWYDAPP GFSL Sbjct: 412 GIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSL 471 Query: 317 TLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKES 138 TLS FATMW ALF W+TSSSLAYIYGRD S+HE+++SVNG+EYPRK DGRSSEI+ + Sbjct: 472 TLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLT 531 Query: 137 LAGILARMVPELVAELKLRAPLSALEHG 54 LAR P LVA L+L P+S LE G Sbjct: 532 AESCLARTFPGLVANLRLPIPVSTLEQG 559 Score = 23.1 bits (48), Expect(2) = e-105 Identities = 9/16 (56%), Positives = 12/16 (75%) Frame = -2 Query: 49 IPGLAPHMTSMRVLLH 2 IP L +MTS R++LH Sbjct: 597 IPALTSYMTSRRMVLH 612 Score = 140 bits (353), Expect = 3e-30 Identities = 111/326 (34%), Positives = 160/326 (49%), Gaps = 8/326 (2%) Frame = -1 Query: 2699 SELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKEGPESNXXXXXX 2520 S LKI+E G+V LE+W+GPSNAIEGYVP D P+ + KEG ++ Sbjct: 147 SNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQGD-RDPNPSLKNHKEGLKAICKKPVS 205 Query: 2519 XXXKVVNEMDFTSTIIVGDQFAVPKFSYG--SEQSDTKKRTEELKRNQFSIHEATSAPVQ 2346 ++ DFTSTII D++++ K G S SD K + + K HE +A + Sbjct: 206 KQDCFFSDTDFTSTIITNDEYSISKGPSGLTSTASDIKLQAQTGKG-----HEGLNAQLS 260 Query: 2345 N-GSEIQLKESKLEDGNAASKAQKAEHEKSHGPTQSVSDSSDERSKQKVSSEKLALSSET 2169 + + +K S+ G K K + P+ S + E Q + L +E+ Sbjct: 261 SLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYYTAEAEDISQATGAANL---NES 317 Query: 2168 
TLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSISDSFKNSRNSSNVEVVD 1989 LK SL SG +R +RSVTWADER +DN NL VQEM+ ++S + S +++ + D Sbjct: 318 VLKPSLKSSGAKRSNRSVTWADER-VDNAGSRNLCEVQEMEQTNESHEISESANKGD--D 374 Query: 1988 GRPC-LASEAASELTPSQVAKAITFSESDVTD--DARPIIKLQP--DASEGYPQRNEDVY 1824 G S A + SQ A+A+ ++DV II L P D +G D+ Sbjct: 375 GHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKNDMI 434 Query: 1823 EQMVAPLERPKKPGALISESFDAKDS 1746 EQ A L+ P KPG S+ FD +DS Sbjct: 435 EQESASLKWPTKPGIPQSDLFDPEDS 460 >ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Glycine max] Length = 706 Score = 384 bits (987), Expect = e-103 Identities = 243/621 (39%), Positives = 340/621 (54%), Gaps = 61/621 (9%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K++ +S+KDAV K+Q+SL EGI +E+ LFAA SLMSRSDY+D++T+RSITN+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 +N LP++RPRK YR SLK K YD +E Y+FCS C+ +S+ FAGSL +RCS + Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIP 1206 L+ E L DL ++LKIQEK + +G+V LE + P +IEGY+P Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1205 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPK-------- 1050 + G+++ + + I +EM ++ D+++ K Sbjct: 181 KPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDA 240 Query: 1049 --------------------------------LFSIFKEN---GAETKSDEVKHK----- 990 L S FK + K +EV Sbjct: 241 TANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVL 300 Query: 989 PFSDGAAV------SPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXX 828 FS G A+ S + S + +++D S + ++ N + S + Sbjct: 301 KFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDA-STSNLDPAN 359 Query: 827 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK 648 K S + +SSLK G K 
S++VTWAD+ K+N T S DLC F+E DIK Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIK 418 Query: 647 DNGSSRNSNVEDVDAS--LRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQL 474 S +N++ + LR ASAEAC +ALS ++EAV+SG+ D+SDA S++GI ILP Sbjct: 419 KESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGITILPPP 478 Query: 473 NDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATM 294 +D ++V LK P + G+ + F+ ++SW+DAPP GFSLTLS FATM Sbjct: 479 HDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTLSPFATM 538 Query: 293 WTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARM 114 W LF W TSSSLAYIYGRD S HEE++SVNG+EYP K +DGRSSEIK++LA LAR Sbjct: 539 WNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLARA 598 Query: 113 VPELVAELKLRAPLSALEHGM 51 +P LVA L+L P+S +E GM Sbjct: 599 LPALVAVLRLPIPVSIMEQGM 619 Score = 112 bits (279), Expect = 1e-21 Identities = 107/400 (26%), Positives = 174/400 (43%), Gaps = 58/400 (14%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 VL LFE ++ G S+LKI+E +GEV LE W GPSNAIEGYVP Sbjct: 124 VLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVPKPR 183 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYG------- 2433 ++ K+G ++ + +EM F STII+ D+++V K G Sbjct: 184 NRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDATAN 243 Query: 2432 -------SEQSDTKKRTEELKRNQFSIHEATSA--------------PVQNGSEIQLK-- 2322 + + K E ++++ SI + +S+ V E LK Sbjct: 244 HQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVLKFS 303 Query: 2321 ------------------ESKLEDGNAASKAQKAEHEKSH---GPTQSVSDSSDERSKQK 2205 + +E ++A K+ + + + S S S+ ++K Sbjct: 304 PGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLDPANVEEK 363 Query: 2204 VSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSI---SD 2034 EK S +T +SSL +G ++FSR+VTWADE KI++ +L +E I SD Sbjct: 364 FQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIKKESD 422 Query: 2033 SFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTDDAR----PIIKLQP 1866 S N+ + +N E + R AS A + S ++A+ +SDV+D I+ 
Sbjct: 423 SVGNNIDVANDEDILRR---ASAEACAIALSSASEAVASGDSDVSDAVSEAGITILPPPH 479 Query: 1865 DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 DA+E + D+ + L+ P+K G ++ F++ DS Sbjct: 480 DAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDS 519 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 384 bits (986), Expect = e-103 Identities = 238/584 (40%), Positives = 337/584 (57%), Gaps = 24/584 (4%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K +++++KDAVHK+QL L EGI EN L AA SL+SRSDY DV+T+RSI N+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 +N LP+ER RK HYR SLK K YD E Y++CS CV NS AFAGSL +R S N Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVKA---GDVLLEN-SSPFFSIEGYI 1209 L E + DL ++LKIQEKV G+V LE P +IEGY+ Sbjct: 121 LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180 Query: 1208 PEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKL------ 1047 P+ D + G ++ ++RLQ K I+NE D + ++ D+++ K Sbjct: 181 PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNA 240 Query: 1046 --FSIFKENGAETKSDEVKHKPFSDGAAVS-------PAFSGSETKSKESDVRSISAAEN 894 FKE A+T+ +K D ++ SG ET+ + + R + + Sbjct: 241 VSSEKFKEAQAKTR-----YKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKV-DK 294 Query: 893 TQFGELLLNGSMQNVSEIAXXXXXXXXXXXKT-AQSNENALKSSLKPPGAKTLSQSVTWA 717 GE+ S +V + + + ++ LKSSLK +K +SQSVTWA Sbjct: 295 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354 Query: 716 DKIKVNDTDSGDLCTFQEINDIKDN--GSSRNSNVEDVDASLRLASAEACVLALSQSAEA 543 D+I ++ + +I++ ++ G S ++++E+ D S R SAEAC ALSQ+AEA Sbjct: 355 DEI-IDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEA 413 Query: 542 VSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFD 363 V+SG D+ DA S +GI+ILP + D PLK P +PG+ N +F+ Sbjct: 414 
VASGS-DVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPA-PLKWPRKPGMPNYDVFE 471 Query: 362 PENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPR 183 E+ WYD PP GF++TLS FATM+ +LF WI+SSSLA+IYG D +++EE++S+NG+EYP Sbjct: 472 SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531 Query: 182 KNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGM 51 K SDG S+EIK++LAG LAR +P LVA+L+L P+S LE GM Sbjct: 532 KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGM 575 Score = 115 bits (287), Expect = 1e-22 Identities = 109/359 (30%), Positives = 165/359 (45%), Gaps = 17/359 (4%) Frame = -1 Query: 2774 EVLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAG-EVLLEDWLGPSNAIEGYVPN 2598 +VL LF+ + G S+LKI+E ++V+ G EV LE+W+GPSNAIEGYVP Sbjct: 123 QVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVPQ 182 Query: 2597 LDFSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQSD 2418 D S P + +G ++ ++NE DF+STII D+++V KF Sbjct: 183 RDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVS 242 Query: 2417 TKKRTEELKRNQFSIH-EATSAPVQNGSEIQLK---ESKLEDGNAA-SKAQKAEH-EKSH 2256 ++K E + ++ + + S + +QL+ E++ D N K K E S Sbjct: 243 SEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS 302 Query: 2255 GPTQ------SVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERK 2094 GP+Q SV SD+ K E + LKSSL S ++ S+SVTWADE Sbjct: 303 GPSQHDVKNKSVLIMSDDGRKYASHGE----HDKQLLKSSLKSSNSKKMSQSVTWADE-I 357 Query: 2093 IDNISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFS 1914 ID S ++ + S+++E D S A SQ A+A+ S Sbjct: 358 IDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAVA-S 416 Query: 1913 ESDVTDDARP----IIKLQPDASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKD 1749 SDV D I+ + E Q E + + APL+ P+KPG + F+++D Sbjct: 417 GSDVPDAVSKAGIVILPTSQEVDEAILQETE-MLDIEPAPLKWPRKPGMPNYDVFESED 474 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 382 bits (982), Expect = e-103 Identities = 234/586 (39%), Positives = 330/586 (56%), Gaps = 26/586 (4%) Frame = -1 Query: 1730 
LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K QS+ +KD V+K+QL+L EGI +EN LFAA SLMSRSDY+DV+T+RSI ++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 ++ LP++ R+ YR SLK K YD E Y +CS C+ NSRAF+G L +RCSV N Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVKA--GDVLLEN-SSPFFSIEGYIPEV 1200 L E +G D + L+IQEK+++ G+V +E P +IEGY+P Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1199 DGVLXXXXXXXXXXGAQSNNSRLQK-GKGK-IVNEMDLTCNLVAADQFTGPKLFSIFKEN 1026 D + + ++++ G GK ++ +T ++ ++++ K+ S KE Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEM 239 Query: 1025 GAETKSD------------------EVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAA 900 +T S E H P +V GS+ ++K S + + Sbjct: 240 ALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATK--EST 297 Query: 899 ENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTW 720 +N N + + T LKSSLK PG K L +SVTW Sbjct: 298 DNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGT------ELKSSLKKPGKKNLCRSVTW 351 Query: 719 ADKIKVNDTDSGDLCTFQEINDIKDNGSSRNSNV---EDVDASLRLASAEACVLALSQSA 549 AD+ K +D +L E+ K+ + ++ V D + LR+ SAEAC +ALSQ+A Sbjct: 352 ADE-KTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAA 410 Query: 548 EAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKL 369 EA++SG+ ++SDA S++GIIILP +D + + K N+ G+ S L Sbjct: 411 EAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDL 469 Query: 368 FDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEY 189 FDP +SWYDAPP GFSLTLSSFATMW A+F W+TSSSLAYIYG+D HEEF+ ++GKEY Sbjct: 470 FDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEY 529 Query: 188 PRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGM 51 P K S+DGRSSEIK++LAG L R +P L +EL L P+S LE+GM Sbjct: 530 PSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGM 575 Score = 124 bits (310), Expect = 3e-25 Identities = 108/358 (30%), 
Positives = 165/358 (46%), Gaps = 15/358 (4%) Frame = -1 Query: 2774 EVLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNL 2595 E+LKLFE MS S L+I+E + GEV +E+W+GPSNAIEGYVP+ Sbjct: 123 EILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 2594 DFSSKPPTQERRKEGPESN--XXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQ- 2424 D + KE + + ++ TSTII ++++V K S G ++ Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEM 239 Query: 2423 ---SDTKKRTEEL----KRNQFSIHEATSAPVQNGSEIQLKESKLEDGNAASKAQKAEHE 2265 +++K +T E +QF+I E AP + + K ++ S +++ Sbjct: 240 ALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDN 299 Query: 2264 KSHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 S P+ S + S++ + S T LKSSL G + RSVTWADE K D+ Sbjct: 300 LSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADE-KTDD 358 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAAS--ELTPSQVAKAITFSE 1911 SI NL V EM + + + N N + D L E+A + SQ A+AIT + Sbjct: 359 ASIMNLPEVGEMGKTKECSRTTSNLVNFD-NDNEDILRVESAEACAMALSQAAEAITSGQ 417 Query: 1910 SDVTDDARPI-IKLQPDASEGYPQRNEDVY--EQMVAPLERPKKPGALISESFDAKDS 1746 S+V+D I + P S+ + + D + + E+ K G L S+ FD DS Sbjct: 418 SEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDS 475 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 379 bits (973), Expect(2) = e-102 Identities = 244/615 (39%), Positives = 337/615 (54%), Gaps = 61/615 (9%) Frame = -1 Query: 1712 ISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNNPLPA 1533 IS+KD V+++QLSL +G+H E+ LFAA S+MSRSDY+DV+T+RSI N+CGYPLC NPLP+ Sbjct: 9 ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68 Query: 1532 ERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXXXXXXXX 1353 +RPRK YR SLK K YD E Y++CS CV NSR FA SL +RC+V +S Sbjct: 69 DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128 Query: 1352 XXXE-QGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIPEVDGV 1191 + GL+ G DL ++LKI+EK + GDV LE + P +IEGY+ + + Sbjct: 129 
MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188 Query: 1190 LXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPKLFSIFKENGAETK 1011 G+++NN+ L +N+MD ++ D++T K S K+ G ++K Sbjct: 189 PKELGSKSPKRGSKANNTVL-------INDMDFVSTIITEDEYTVSKTPSSLKKTGLDSK 241 Query: 1010 SDEVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXX 831 E + ++ G+E E+ S + S++ S ++ Sbjct: 242 VREQEE-------ILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSA 294 Query: 830 XXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDI 651 K + E ++KSSLKP K LS++VTWAD+ K + + LC +EI D+ Sbjct: 295 RAEEESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADE-KTDSSGGRKLCEIREIEDM 353 Query: 650 KD--------NGSSRNSN-----------------------------VEDV--------- 609 K+ NG S S+ +ED Sbjct: 354 KEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCN 413 Query: 608 ------DASLRLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDXXXXX 447 D + R ASAEAC AL +++EAV+S EL+++DA S++GIIILP+ + D Sbjct: 414 ADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPM 473 Query: 446 XXXXXXXXET---VPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALFG 276 P+K P +PG +S LFDPE+SW+DAPP FSLTLS FA MW ALF Sbjct: 474 EEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFT 533 Query: 275 WITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELVA 96 W TSS+LAYIYGRD S HEE+ VNG+EYP K DGRSSEIK++LAG LAR +P LVA Sbjct: 534 WTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVA 593 Query: 95 ELKLRAPLSALEHGM 51 +L+L P+S+LE GM Sbjct: 594 DLRLSTPISSLEQGM 608 Score = 23.9 bits (50), Expect(2) = e-102 Identities = 9/16 (56%), Positives = 10/16 (62%) Frame = -2 Query: 49 IPGLAPHMTSMRVLLH 2 +P L PHM RVL H Sbjct: 645 LPALTPHMMYRRVLFH 660 Score = 118 bits (296), Expect = 1e-23 Identities = 114/394 (28%), Positives = 173/394 (43%), Gaps = 52/394 (13%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXG-FSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNL 2595 VL++FE+ S FS+LKIEE G+V LE W GPSNAIEGYV Sbjct: 126 VLRMFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQR 185 Query: 2594 
DFSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQS-- 2421 + K + K G ++N ++N+MDF STII D++ V K +++ Sbjct: 186 ERKPKELGSKSPKRGSKANNTV-------LINDMDFVSTIITEDEYTVSKTPSSLKKTGL 238 Query: 2420 DTKKRTEE------LKRNQFSIHEATSAPVQNGSEIQLK----ESKLEDGNAASKAQKAE 2271 D+K R +E N+F++ E + AP N S + L S L G+ S A+ E Sbjct: 239 DSKVREQEEILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSARAEE 298 Query: 2270 H---EKSHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKS------------------- 2157 +K+ T++ SS + S++K S + + E T S Sbjct: 299 ESHDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIEDMKEDPS 358 Query: 2156 --------SLNHSGPRRFSRSVTWADERKIDNISID--NLNSVQEMDSISDSFKNSRNSS 2007 S SG + +SV WADE+ + SID + +++ +D N+ Sbjct: 359 VVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCNADTGE 418 Query: 2006 NVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTD---DARPIIKLQPD-ASEGYPQR 1839 N D AS A + ++A+ E +V D +A II +P+ EG P Sbjct: 419 N----DDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPME 474 Query: 1838 NED---VYEQMVAPLERPKKPGALISESFDAKDS 1746 +D E AP++ PKKPG+ S+ FD +DS Sbjct: 475 EDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDS 508 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 379 bits (973), Expect = e-102 Identities = 239/614 (38%), Positives = 336/614 (54%), Gaps = 55/614 (8%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K+QS +KD ++K+QLSL +GI +E+ L AA S+MS SDY+DV+T+R+I N+CGYPLC Sbjct: 1 MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 N LP++RP+K YR SLK K YD E Y++CS CV NSR F+GSL +RC V N Sbjct: 61 GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVKA--GDVLLEN-SSPFFSIEGYIP 1206 L LG DL + LKI+EK + G+V E P +IEGY+P Sbjct: 121 LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180 Query: 1205 
EVDGVLXXXXXXXXXXGAQ-------------------SNNSRLQKGKGK---------- 1113 + D + + + + + QK K K Sbjct: 181 QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240 Query: 1112 ------------IVNEMDLTCNLVAA-DQFTGPKLFSIFKENGAETK----SDEVKHKPF 984 +N+M+ T ++ D+++ K S ++TK ++V K Sbjct: 241 AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300 Query: 983 SDGAAVSPAFSGSETKSKESDVRSISAAENTQFGELLLN--GSMQNVSEIAXXXXXXXXX 810 + ++ + S+T K + RS A ++ + L + S Q S Sbjct: 301 ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360 Query: 809 XXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIKDNGSSR 630 K A+ E++LK SLK GAK L++SVTWAD+ KV + S DLC + + D K G Sbjct: 361 SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADE-KVGSSGSRDLCEVRGMEDTKA-GPEI 418 Query: 629 NSNVEDVDASL--RLASAEACVLALSQSAEAVSSGELDISDAESDSGIIILPQLNDYDXX 456 N++ D + SAEAC ALSQ+AEAV+SG+ D S+A S++G++ILPQ +D D Sbjct: 419 VDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQG 478 Query: 455 XXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGFSLTLSSFATMWTALFG 276 E+ +K P +PG+ S+ FDPENSWYDAPP GFSL LSSFAT+W ALF Sbjct: 479 DPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFA 538 Query: 275 WITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIKESLAGILARMVPELVA 96 W+TSSSLAY+YG+D SSHEE++ VNG+EYPRK DGRS EI++++ G L R P +VA Sbjct: 539 WVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVA 598 Query: 95 ELKLRAPLSALEHG 54 +L+L P+S LE G Sbjct: 599 DLRLPIPISTLEQG 612 Score = 127 bits (318), Expect = 4e-26 Identities = 115/396 (29%), Positives = 178/396 (44%), Gaps = 53/396 (13%) Frame = -1 Query: 2774 EVLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVP-- 2601 EVL LF+ S GFS LKIEE GEV E W+GPSNAIEGYVP Sbjct: 123 EVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVPQR 182 Query: 2600 ----------NLDFSSKPPTQE-----------------------------RRKEGPESN 2538 ++DF+S TQ+ + +G ++ Sbjct: 183 DRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSKAK 242 Query: 2537 XXXXXXXXXKVVNEMDFTSTIIV-GDQFAVPKFSYG--SEQSDTKKRTEELKRNQFSIHE 2367 
+N+M+FTSTII+ D++++ K G S TK + ++ K +Q S Sbjct: 243 GTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSSEN 302 Query: 2366 ATSAPVQNGSEIQLKESKLEDGNAASKAQKAEHEKSHGPTQSVSDSS---DERSKQKVSS 2196 +SA + GS ++ K + A K + + + S P S SS +K+K S Sbjct: 303 QSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLS-SPFDSCQTSSITITAEAKEKSVS 361 Query: 2195 EKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSISDSFKNSR 2016 EK A E++LK SL SG ++ +RSVTWADE+ + + E+ + D+ Sbjct: 362 EKAAKPVESSLKPSLKTSGAKQLTRSVTWADEK----VGSSGSRDLCEVRGMEDTKAGPE 417 Query: 2015 NSSNVEVVDGRPCLASEAASELTP--SQVAKAITFSESDVTD---DARPIIKLQP-DASE 1854 N++ D E+A SQ A+A+ ++D ++ +A +I QP D + Sbjct: 418 IVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQ 477 Query: 1853 GYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 G P + DV ++ + ++ P KPG SE FD ++S Sbjct: 478 GDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENS 513 >ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Glycine max] Length = 716 Score = 376 bits (966), Expect = e-101 Identities = 243/631 (38%), Positives = 340/631 (53%), Gaps = 71/631 (11%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K++ +S+KDAV K+Q+SL EGI +E+ LFAA SLMSRSDY+D++T+RSITN+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 +N LP++RPRK YR SLK K YD +E Y+FCS C+ +S+ FAGSL +RCS + Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDL--TELKIQEKVK--AGDVLLEN-SSPFFSIEGYIP 1206 L+ E L DL ++LKIQEK + +G+V LE + P +IEGY+P Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1205 EVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVAADQFTGPK-------- 1050 + G+++ + + I +EM ++ D+++ K Sbjct: 181 KPRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDA 240 Query: 1049 --------------------------------LFSIFKEN---GAETKSDEVKHK----- 990 L S FK + K +EV Sbjct: 241 
TANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVL 300 Query: 989 PFSDGAAV------SPAFSGSETKSKESDVRSISAAENTQFGELLLNGSMQNVSEIAXXX 828 FS G A+ S + S + +++D S + ++ N + S + Sbjct: 301 KFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDA-STSNLDPAN 359 Query: 827 XXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSVTWADKIKVNDTDSGDLCTFQEINDIK 648 K S + +SSLK G K S++VTWAD+ K+N T S DLC F+E DIK Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIK 418 Query: 647 DNGSSRNSNVEDVDAS--LRLASAEACVLALSQSAEAVSSGELDISDAE----------S 504 S +N++ + LR ASAEAC +ALS ++EAV+SG+ D+SDA S Sbjct: 419 KESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMNETCAVS 478 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 ++GI ILP +D ++V LK P + G+ + F+ ++SW+DAPP GF Sbjct: 479 EAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGF 538 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 SLTLS FATMW LF W TSSSLAYIYGRD S HEE++SVNG+EYP K +DGRSSEIK Sbjct: 539 SLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIK 598 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 ++LA LAR +P LVA L+L P+S +E GM Sbjct: 599 QTLASCLARALPALVAVLRLPIPVSIMEQGM 629 Score = 108 bits (269), Expect = 2e-20 Identities = 109/410 (26%), Positives = 175/410 (42%), Gaps = 68/410 (16%) Frame = -1 Query: 2771 VLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLD 2592 VL LFE ++ G S+LKI+E +GEV LE W GPSNAIEGYVP Sbjct: 124 VLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVPKPR 183 Query: 2591 FSSKPPTQERRKEGPESNXXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYG------- 2433 ++ K+G ++ + +EM F STII+ D+++V K G Sbjct: 184 NRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKVPPGQMDATAN 243 Query: 2432 -------SEQSDTKKRTEELKRNQFSIHEATSA--------------PVQNGSEIQLK-- 2322 + + K E ++++ SI + +S+ V E LK Sbjct: 244 HQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAVLKFS 303 Query: 2321 ------------------ESKLEDGNAASKAQKAEHEKSH---GPTQSVSDSSDERSKQK 2205 + +E ++A K+ + + + S S S+ ++K Sbjct: 304 
PGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLDPANVEEK 363 Query: 2204 VSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSI---SD 2034 EK S +T +SSL +G ++FSR+VTWADE KI++ +L +E I SD Sbjct: 364 FQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADE-KINSTGSKDLCEFKEFGDIKKESD 422 Query: 2033 SFKNSRNSSNVEVVDGRPCLASEAASELTPSQVAKAITFSESDVTD------------DA 1890 S N+ + +N E + R AS A + S ++A+ +SDV+D Sbjct: 423 SVGNNIDVANDEDILRR---ASAEACAIALSSASEAVASGDSDVSDAVFSPMNETCAVSE 479 Query: 1889 RPIIKLQP--DASEGYPQRNEDVYEQMVAPLERPKKPGALISESFDAKDS 1746 I L P DA+E + D+ + L+ P+K G ++ F++ DS Sbjct: 480 AGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDS 529 >ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] gi|462404075|gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 371 bits (953), Expect = e-100 Identities = 242/631 (38%), Positives = 340/631 (53%), Gaps = 73/631 (11%) Frame = -1 Query: 1724 KEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLCNN 1545 ++ IS+KD V+K+QL+L EGI ++ L+ A S++SRSDY+DV+T+R+I N+CGYPLC+N Sbjct: 9 QQPRISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSN 68 Query: 1544 PLPAE--RPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 LP++ RP K HYR SLK K YD E Y++CS CV S+AFA SL +RC V + Sbjct: 69 ALPSDSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGK 128 Query: 1370 XXXXXXXXXEQGLKENEY-LGVTDDL--TELKIQEKVKAG-------DVLLENSS----- 1236 + G + E G DL ++LKI+EKV+ G + +E S Sbjct: 129 VERILRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIG 188 Query: 1235 ------PFFSIEGYIPEVDGVLXXXXXXXXXXGAQSNNSRLQKGKGKIVNEMDLTCNLVA 1074 P +IEGY+P+ + + G++ ++++ G I NEMD ++ Sbjct: 189 DLGAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTIIT 248 Query: 1073 ADQFTGPKLFSIFKENGAETKSDEVKHKPF---SDGAAVSPAFSGSETKS-KESDVRSIS 906 +D+++ K+ E ETK + K K +D S G + K+ K+ DV Sbjct: 249 SDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSVKKSRQSKGGKNKNVKKDDVCIRE 308 Query: 905 AAENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSLKPPGAKTLSQSV 726 + + +LNGS + E K 
QS E L+SSLKP G K L++SV Sbjct: 309 VPSTSDASQTVLNGSTKEEKE--------EFIVEKAEQSGEALLRSSLKPSGTKKLNRSV 360 Query: 725 TWADKI----------------------------------------------KVNDTDSG 684 TWAD++ K++ T S Sbjct: 361 TWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKSK 420 Query: 683 DLCTFQEINDIKDNGSSRNSNVEDVDASLRLASAEACVLALSQSAEAVSSGELDISDAES 504 ++C +E+ D GS D+ + L SAEAC +AL+Q+AEAV+SGE D+S A S Sbjct: 421 NICEVREVQDADVLGSL------DLQENEILESAEACAMALNQAAEAVASGESDVSGAVS 474 Query: 503 DSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKLFDPENSWYDAPPNGF 324 +GIIILP+ + D E PL P +PG+ S LFDPE+SW+DAPP GF Sbjct: 475 GAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGF 533 Query: 323 SLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEYPRKNFSSDGRSSEIK 144 S+TLS FATMW +LF WITSS+LAYIYGRD S HEEF+SVNG+EYP K + GRSSEIK Sbjct: 534 SVTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIK 593 Query: 143 ESLAGILARMVPELVAELKLRAPLSALEHGM 51 ++L AR +P +V+EL+L P+S+LE GM Sbjct: 594 KTLDESFARALPGVVSELRLPTPISSLEQGM 624 Score = 126 bits (316), Expect = 6e-26 Identities = 117/368 (31%), Positives = 163/368 (44%), Gaps = 50/368 (13%) Frame = -1 Query: 2699 SELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNLDFSSKPPTQERRKEGPESNXXXXXX 2520 S LKIEE G++ +GPSNAIEGYVP + SKP ++ KEG + Sbjct: 175 SRLKIEEKSETHIGDL---GAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSS 231 Query: 2519 XXXKVVNEMDFTSTIIVGDQFAVPKF--SYGSEQSDTKKRTEELKRNQFSIHEATSAPVQ 2346 + NEMDF STII D+++V K S G +TK + + K V Sbjct: 232 GMDIIFNEMDFMSTIITSDEYSVSKIPPSVGEPDFETKFKKSKGK-------------VG 278 Query: 2345 NGSEIQLKESKLEDGNAASKAQK-----AEHEKSHGPTQSVSDSSDERSKQKVSSEKLAL 2181 +K+S+ G +K E + +Q+V + S + K++ EK Sbjct: 279 LNKNDSVKKSRQSKGGKNKNVKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQ 338 Query: 2180 SSETTLKSSLNHSGPRRFSRSVTWADERKIDNISIDNLNSVQEMDSI---SDSFK----- 2025 S E L+SSL SG ++ +RSVTWADE ID+ NL V+EM+ I SD+F Sbjct: 339 SGEALLRSSLKPSGTKKLNRSVTWADE-MIDSTGSRNLYEVREMEQIMEYSDAFSSMHKP 397 Query: 2024 -----------------NSRNSSNV----EVVDG----------RPCLASEAASELTPSQ 1938 +S S N+ EV D L S A + +Q 
Sbjct: 398 SVENKVGCSNTWFDEKIDSTKSKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQ 457 Query: 1937 VAKAITFSESDVT---DDARPIIKLQPDA-SEGYPQRNEDVYEQMVAPLERPKKPGALIS 1770 A+A+ ESDV+ A II +PD E P + D+ E APL P+KPG S Sbjct: 458 AAEAVASGESDVSGAVSGAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCS 516 Query: 1769 ESFDAKDS 1746 + FD +DS Sbjct: 517 DLFDPEDS 524 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 353 bits (907), Expect = 2e-94 Identities = 223/586 (38%), Positives = 323/586 (55%), Gaps = 26/586 (4%) Frame = -1 Query: 1730 LVKEQSISLKDAVHKIQLSLTEGIHHENLLFAAQSLMSRSDYDDVITKRSITNVCGYPLC 1551 + K QS+ +KD V+K+QL+L EGI +EN LFAA SLMSRSDY+DV+T+RSI ++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 1550 NNPLPAERPRKNHYRPSLKHQKAYDPREIYIFCSPGCVTNSRAFAGSLPVKRCSVYNSXX 1371 ++ LP++ R+ YR SLK K YD E Y +CS C+ NSRAF+G L +RCSV N Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 1370 XXXXXXXXXEQGLKENEYLGVTDDLTELKIQEKVKA--GDVLLEN-SSPFFSIEGYIPEV 1200 L E +G D + L+IQEK+++ G+V +E P +IEGY+P Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD-SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1199 DGVLXXXXXXXXXXGAQSNNSRLQK-GKGK-IVNEMDLTCNLVAADQFTGPKLFSIFKEN 1026 D + + ++++ G GK ++ T ++ ++++ K+ S KE Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEM 239 Query: 1025 GAETKSD------------------EVKHKPFSDGAAVSPAFSGSETKSKESDVRSISAA 900 +T S E H P +V GS+ ++K S + Sbjct: 240 ALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKE---- 295 Query: 899 ENTQFGELLLNGSMQNVSEIAXXXXXXXXXXXKTAQSNENALKSSL---KPPGAKTLSQS 729 S N+S+ + SN + +L +P KT S Sbjct: 296 ------------STDNLSDAP-------------STSNNRSTNFNLMTEEPRDEKTDDAS 330 Query: 728 VTWADKIKVNDTDSGDLCTFQEINDIKDNGSSRNSNVEDVDASLRLASAEACVLALSQSA 549 + +N + G++ +E + N + +++ ED+ LR+ SAEAC +ALSQ+A Sbjct: 331 I-------MNLPEVGEMGKTKECSRTTSNLVNFDNDNEDL---LRVESAEACAMALSQAA 380 Query: 548 
EAVSSGELDISDAESDSGIIILPQLNDYDXXXXXXXXXXXXXETVPLKLPNQPGLFNSKL 369 +A++SG+ ++SDA S++GIIILP +D + + K N+ G+ S L Sbjct: 381 KAITSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDL 439 Query: 368 FDPENSWYDAPPNGFSLTLSSFATMWTALFGWITSSSLAYIYGRDVSSHEEFISVNGKEY 189 FDP +SWYDAPP GFSLTLSSFATMW A+F W+TSSSLAYIYG+D HEEF+ ++GKEY Sbjct: 440 FDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEY 499 Query: 188 PRKNFSSDGRSSEIKESLAGILARMVPELVAELKLRAPLSALEHGM 51 P K S+DGRSSEIK++LAG L R +P L +EL L P+S LE+GM Sbjct: 500 PSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGM 545 Score = 97.1 bits (240), Expect = 4e-17 Identities = 98/358 (27%), Positives = 153/358 (42%), Gaps = 15/358 (4%) Frame = -1 Query: 2774 EVLKLFEEMSXXXXXXXXXXXXXGFSELKIEENLNVEAGEVLLEDWLGPSNAIEGYVPNL 2595 E+LKLFE MS S L+I+E + GEV +E+W+GPSNAIEGYVP+ Sbjct: 123 EILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 2594 DFSSKPPTQERRKEGPESN--XXXXXXXXXKVVNEMDFTSTIIVGDQFAVPKFSYGSEQ- 2424 D + KE + + ++ FTSTII ++++V K S G ++ Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEM 239 Query: 2423 ---SDTKKRTEEL----KRNQFSIHEATSAPVQNGSEIQLKESKLEDGNAASKAQKAEHE 2265 +++K +T E +QF+I E AP + + +KA Sbjct: 240 ALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSV---------------GRKARGS 284 Query: 2264 KSHGPTQSVSDSSDERSKQKVSSEKLALSSETTLKSSLNHSGPRRFSRSVTWADERKIDN 2085 K + +S+D S+ + S+ + +L PR + K D+ Sbjct: 285 KERTKVSATKESTDN------LSDAPSTSNNRSTNFNLMTEEPR----------DEKTDD 328 Query: 2084 ISIDNLNSVQEMDSISDSFKNSRNSSNVEVVDGRPCLASEAAS--ELTPSQVAKAITFSE 1911 SI NL V EM + + + N N + D L E+A + SQ AKAIT + Sbjct: 329 ASIMNLPEVGEMGKTKECSRTTSNLVNFD-NDNEDLLRVESAEACAMALSQAAKAITSGQ 387 Query: 1910 SDVTDDARPI-IKLQPDASEGYPQRNEDVY--EQMVAPLERPKKPGALISESFDAKDS 1746 S+V+D I + P S+ + + D + + E+ K G L S+ FD DS Sbjct: 388 SEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDS 445