BLASTX nr result
ID: Cinnamomum24_contig00013487
seq
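The report that follows uses the legacy BLAST text layout, in which very small E-values are printed without a mantissa (`e-179` stands for 1e-179) and each alignment carries a statistics line such as `Identities = 363/660 (55%)`. A minimal sketch of reading those two fields is given here, assuming one hit per line; the function names `parse_evalue`, `parse_hit_line`, and `identity_fraction` are hypothetical helpers and not part of any BLAST tool.

```python
import re

# Legacy BLAST shorthand: "e-179" means 1e-179 (assumed convention for this report).
def parse_evalue(text: str) -> float:
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)

# One summary row: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_hit_line(line: str) -> dict:
    match = HIT_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "id": match["id"],
        "description": match["desc"],
        "bits": float(match["bits"]),
        "evalue": parse_evalue(match["evalue"]),
    }

# Alignment statistics: fraction of identical positions over the aligned length.
IDENTITIES = re.compile(r"Identities = (\d+)/(\d+)")

def identity_fraction(stats_line: str) -> float:
    match = IDENTITIES.search(stats_line)
    if match is None:
        raise ValueError("no Identities field found")
    return int(match[1]) / int(match[2])

hit = parse_hit_line(
    "emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] 635 e-179"
)
fraction = identity_fraction(
    "Identities = 363/660 (55%), Positives = 450/660 (68%), Gaps = 43/660 (6%)"
)
```

The regular expression relies on the bit score and E-value being the last two whitespace-separated tokens of a row, so truncated descriptions ending in `...` are still handled; rows split across lines would need to be rejoined first.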
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00013487 (2463 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)  Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]     635   e-179
ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subuni...     633   e-178
ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subuni...     604   e-170
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...     588   e-165
ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subuni...     586   e-164
ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subuni...     586   e-164
ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subuni...     586   e-164
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...     574   e-160
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...     570   e-159
ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subuni...     560   e-156
ref|XP_010271590.1| PREDICTED: putative RNA polymerase II subuni...     558   e-156
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...     557   e-155
ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subuni...     555   e-155
ref|XP_010042212.1| PREDICTED: putative RNA polymerase II subuni...     552   e-154
gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium arboreum]     551   e-154
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...     551   e-153
gb|KHG00855.1| hypothetical protein F383_23706 [Gossypium arboreum]     546   e-152
ref|XP_012859052.1| PREDICTED: putative RNA polymerase II subuni...
546 e-152 ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu... 545 e-152 ref|XP_009771014.1| PREDICTED: putative RNA polymerase II subuni... 543 e-151 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 635 bits (1638), Expect = e-179 Identities = 363/660 (55%), Positives = 450/660 (68%), Gaps = 43/660 (6%) Frame = -3 Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153 PIAVKDAVHKLQL LL+GI +E QLF+AGSL+SRSDYEDVVTER+I+NLCGYPLC NSLP Sbjct: 6 PIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNSLP 65 Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973 S+RL RKG ++IS +E + DL ET YCSS C S++F S +EER +V + +I + Sbjct: 66 SERL-RKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERINGI 124 Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 L LFGE SLE K+G +G+ ELKI + + KAGEVS+EDWIGP NAIEGYVP+ Sbjct: 125 LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDR 184 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVI------------------ 1673 +L P K+ ++GSK + + V+ +E FVST+I Sbjct: 185 NLK--PKNIKNHKEGSKSSNSKMDSGKNFVI-DEMDFVSTIITKDEYSISKSSKGLKDTT 241 Query: 1672 -------------IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQS 1553 IGDQLS E S P +NDS+ K R+SKG KD A S Sbjct: 242 SHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPS 300 Query: 1552 ETQSRSALSKGPQGEDSVAAAVVKQNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDA 1376 + G +G++ Q G ++ KS+LK SG K + RSVTWADEK ++ D+ Sbjct: 301 VPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKM-DSADS 359 Query: 1375 GNLFNGQKTEEKSESIKNXXXXXXXXXXS-LRFALAEACAIALSQAAEAVASGECDAEDA 1199 + ++ E K E + LRFA AEACA+ALSQAAEAVASGE D DA Sbjct: 360 RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDA 419 Query: 1198 ATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPE 1022 +EAGI+ILP P ++DEG S + D+ EP+ +KWP KP + +D+FD +DSW+DTPPE Sbjct: 420 VSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479 Query: 1021 
GFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSE 842 GFSLTLSPFATMWMALF W+++SS+AYIYGRDES + +L VNGREYP+K+ L+DGRSSE Sbjct: 480 GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539 Query: 841 IKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLL 662 IKQT AGC+SR+LPG+VADLRLP P+S LEQ +GRLLDTMS+VD LP FR QW+VIVLL Sbjct: 540 IKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599 Query: 661 FIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 FIDALSVCRIPALT +M+S+RML KV DAAQV EEYEVMKDL+IPLGR P+FSAQSGG Sbjct: 600 FIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659 >ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|731415977|ref|XP_010659731.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|731415979|ref|XP_010659732.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 633 bits (1633), Expect = e-178 Identities = 363/660 (55%), Positives = 449/660 (68%), Gaps = 43/660 (6%) Frame = -3 Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153 PIAVKDAVHKLQL LL+GI +E QLF+AGSL+SRSDYEDVVTER+I+NLCGYPLC NSLP Sbjct: 6 PIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNSLP 65 Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973 S+RL RKG ++IS +E + DL ET YCSS C S++F S +EER +V + +I + Sbjct: 66 SERL-RKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERINGI 124 Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 L LFGE SLE K+G +G+ ELKI + + KAGEVS+EDWIGP NAIEGYVP + Sbjct: 125 LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP--QR 182 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVI------------------ 1673 D + P K+R++GSK + + V+ +E FV T+I Sbjct: 183 
DRNLKPKNIKNRKEGSKSSNSKMDSGKNFVI-DEMDFVRTIITEDEYSISKSSKGLKDTT 241 Query: 1672 -------------IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQS 1553 IGDQLS E S P +NDS+ K R+SKG KD A S Sbjct: 242 SHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPS 300 Query: 1552 ETQSRSALSKGPQGEDSVAAAVVKQNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDA 1376 + G +G++ Q G ++LKS LK SG K ++RSVTWADE K ++ D+ Sbjct: 301 VPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADS 359 Query: 1375 GNLFNGQKTEEKSESIKN-XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDA 1199 + ++ E K E +LRFA AEACAIALSQAAEAVASGE D DA Sbjct: 360 RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDA 419 Query: 1198 ATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPE 1022 +EA I+ILP P ++DEG S + D+ EP+ +KWP KP + +D+FD +DSW+DTPPE Sbjct: 420 VSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479 Query: 1021 GFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSE 842 GFSLTLSPFATMWMALF W+++SS+AYIYGRDES + +L VNGREYP+K+ L+DGRSSE Sbjct: 480 GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539 Query: 841 IKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLL 662 IKQT AGC++R+LPG+VADLRLP P+S LEQ +GRLLDTMS+VD LP FR QW+VIVLL Sbjct: 540 IKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599 Query: 661 FIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 FIDALSVC+IPALT +M SKRML KV DAAQV EEYEVMKDL+IPLGR P+FSAQSGG Sbjct: 600 FIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659 >ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|802599693|ref|XP_012072544.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|802599695|ref|XP_012072546.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Jatropha curcas] gi|643730423|gb|KDP37902.1| hypothetical protein JCGZ_05341 [Jatropha curcas] Length = 654 Score = 604 bits (1558), Expect = 
e-170 Identities = 341/663 (51%), Positives = 438/663 (66%), Gaps = 39/663 (5%) Frame = -3 Query: 2356 MAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGY 2177 MAKD I+VKD VHKLQL+LL+GI +E QLF+AGSL+SRSDYEDVVTERSI+NLCGY Sbjct: 1 MAKDQS---ISVKDTVHKLQLSLLEGIKNEDQLFTAGSLMSRSDYEDVVTERSIANLCGY 57 Query: 2176 PLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVS 1997 PLC NSLP DR KGR++IS +E + DL ET YCSS+C S+ F S +EER +V Sbjct: 58 PLCNNSLPLDR-PYKGRYRISLKEHKVYDLHETYMYCSSSCIVNSRAFAGSLQEERCSVL 116 Query: 1996 DPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEG 1820 +P K+ E+L +F LSL+ K +NG +G+ LKI +K ++ GEVSLE+WIGP NAIEG Sbjct: 117 NPMKLDEILRMFNNLSLDSKNLVENGDLGLSNLKIQEKIESNVGEVSLEEWIGPSNAIEG 176 Query: 1819 YVPRDRSDLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEAS 1640 YVP+ D + K+ ++ SK V ++E N+ F+ST+I D+ S S+A Sbjct: 177 YVPQRDRDFKG--SSFKNPKEASKAISTKPVNKQECFF-NDMDFMSTIITKDEYSISKAP 233 Query: 1639 MGP---------------------QKNDSKP------KANRKSKG---KDIVEKAGKQSE 1550 G + S P K +RKSKG K I+++ + Sbjct: 234 SGSISTGSDMKLQEQRGKETHKGSEAQSSSPGKHAFVKTSRKSKGGRSKQIIKEELSDKD 293 Query: 1549 -------TQSRSALSKGPQGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEKKA 1391 +Q+ S+++ E S A + S LK +LK SG K SVTWADEK Sbjct: 294 LLSASNYSQTGSSMNNAEPEEKSGAKQAANLSESMLKPSLKPSGAKKSVHSVTWADEK-F 352 Query: 1390 ENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECD 1211 +N + NL ++ E+ ++ LRF AEACAIALSQAAEAVASG+ D Sbjct: 353 DNAKSRNLCEVREMEDTKSGLEILDSLENNNDNMLRFESAEACAIALSQAAEAVASGDAD 412 Query: 1210 AEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHD 1034 DA +EAG+++LP P + G+S+++ D+ E + S+KWP KP + +DLFD EDSW+D Sbjct: 413 VNDAMSEAGVIVLPQPHHLAPGDSTDIADMLERESASLKWPAKPAVEQSDLFDSEDSWYD 472 Query: 1033 TPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDG 854 PPEGFSL LSPFATMWMALF WV++SSLA+IYGRDE++ + +L VNGREYP+K+ L DG Sbjct: 473 APPEGFSLMLSPFATMWMALFAWVTSSSLAFIYGRDETAHEDYLSVNGREYPQKIVLRDG 532 Query: 853 RSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKV 674 RSSEIK T GC+SR+ 
PGVVADLRLP PIS LEQ GRLLDTMS+VD LPPFR QW+V Sbjct: 533 RSSEIKLTVEGCLSRAFPGVVADLRLPIPISTLEQGAGRLLDTMSFVDALPPFRMKQWQV 592 Query: 673 IVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSA 494 LFI+ALSVCRIPALT YM+++RM+LH+VLD AQ+ EEYEVMKDL+IPLGR P A Sbjct: 593 TAFLFIEALSVCRIPALTSYMTNRRMVLHQVLDGAQISAEEYEVMKDLMIPLGRDPR--A 650 Query: 493 QSG 485 +SG Sbjct: 651 RSG 653 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 588 bits (1517), Expect = e-165 Identities = 328/646 (50%), Positives = 436/646 (67%), Gaps = 31/646 (4%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 ++VKD V+KLQL+LL+GI +E QL +AGSL+SRSDYEDVV ERSISNLCGYPLC NSLPS Sbjct: 7 VSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLCNNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 DR KGR++IS +E + DL+ET YCSS+C S+ F+ S +E+R +V +P K+ E+L Sbjct: 67 DR-PYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIKLNEIL 125 Query: 1969 GLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSDL 1793 F +L+L+ +G ++G +G+ LKI +K++ G+VSLE+WIGP NAIEGYVP+ D Sbjct: 126 RKFNDLTLDSEGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQGDRDP 185 Query: 1792 SSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE----------- 1646 + P+ + H+E G K V +++ ++T F ST+I D+ S S+ Sbjct: 186 N--PSLKNHKE-GLKAICKKPVSKQDCFF-SDTDFTSTIITNDEYSISKGPSGLTSTASD 241 Query: 1645 ---------------ASMGPQKNDSKPKANRKSKG--KDIVEKAGKQSETQSRSALSKGP 1517 A + + KA+RKSKG K+ V K + S+ Sbjct: 242 IKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYYTAE 301 Query: 1516 QGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKS 1337 + S A N S LK +LKSSG K +RSVTWADE+ +N + NL Q+ E+ + Sbjct: 302 AEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADER-VDNAGSRNLCEVQEMEQTN 360 Query: 1336 ESIK-NXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPVE 1160 ES + + LRF AEACA+ALSQAAEAVASG+ D A +EAGI++LPP + Sbjct: 361 
ESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQ 420 Query: 1159 -VDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMW 983 + +G + E D+ E + S+KWP KP + +DLFD EDSW+D PPEGFSLTLSPFATMW Sbjct: 421 DLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFATMW 480 Query: 982 MALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSL 803 MALF WV++SSLAYIYGRDES+ + +L VNGREYPRK+ L DGRSSEI+ T C++R+ Sbjct: 481 MALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLARTF 540 Query: 802 PGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPAL 623 PG+VA+LRLP P+S LEQ GRLL+TMS+VD LP FRT QW+VI LLFI+ALSVCRIPAL Sbjct: 541 PGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEALSVCRIPAL 600 Query: 622 TQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 T YM+S+RM+LH+VLD A + EEY++MKD ++PLGR P+ A+SG Sbjct: 601 TSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ--ARSG 644 >ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X3 [Sesamum indicum] gi|747080559|ref|XP_011087533.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X3 [Sesamum indicum] Length = 655 Score = 586 bits (1510), Expect = e-164 Identities = 328/656 (50%), Positives = 439/656 (66%), Gaps = 40/656 (6%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS Sbjct: 7 LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R RKGR++IS +E + DL+ET YCSS+C S+ F AS +EER + +P + EVL Sbjct: 67 ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 125 Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 LF LSL+ D G NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+ Sbjct: 126 KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 183 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEV-----VVGNETAFVSTVIIGDQLSASEASMG 1634 +L P + + +KG++ ++ + 
+ + ++ ++ F ST+I D+ S S++ Sbjct: 184 NLK--PKQSSNLKKGARQEQVESEYKHDPPDVADILSSDLNFTSTIITQDEYSISKSVPL 241 Query: 1633 PQKNDSKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSK 1523 + +SK K + KSK D + K + S + Sbjct: 242 VKDKESKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAA 301 Query: 1522 GPQGEDSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLF 1364 GP D + G + LKS+LK+S K +RSVTWAD K + D NL Sbjct: 302 GPSQNDLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLC 359 Query: 1363 NGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEA 1187 ++ ++ K + + S R A AEACA ALSQAAEAVA+G+ D DA +EA Sbjct: 360 EFREVKDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEA 419 Query: 1186 GIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010 G++ILPP EVDE E+ DV++ D +KWP KP +ADLFD EDSW+D+PPEGFSL Sbjct: 420 GVIILPPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSL 479 Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830 TLSPF+TM+MALF W+++SSLAYIYG++ES + ++ VNGREYP KV + DGRSSEIKQT Sbjct: 480 TLSPFSTMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQT 539 Query: 829 FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650 AGC++R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR QW+VIVLLF+DA Sbjct: 540 LAGCLARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDA 599 Query: 649 LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 LSV RIPALT Y+ +R+LL KVL+ AQ+ EE+E+MKDL+IPLGR P+FS QSGG Sbjct: 600 LSVSRIPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 655 >ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Sesamum indicum] Length = 687 Score = 586 bits (1510), Expect = e-164 Identities = 328/651 (50%), Positives = 436/651 (66%), Gaps = 35/651 (5%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS Sbjct: 51 LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 110 Query: 2149 
DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R RKGR++IS +E + DL+ET YCSS+C S+ F AS +EER + +P + EVL Sbjct: 111 ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 169 Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 LF LSL+ D G NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+ Sbjct: 170 KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 227 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKND 1619 +L P + + +KG++ ++ ++ ++ F ST+I D+ S S++ + + Sbjct: 228 NLK--PKQSSNLKKGARQEQVD-------ILSSDLNFTSTIITQDEYSISKSVPLVKDKE 278 Query: 1618 SKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSKGPQGE 1508 SK K + KSK D + K + S + GP Sbjct: 279 SKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAAGPSQN 338 Query: 1507 DSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKT 1349 D + G + LKS+LK+S K +RSVTWAD K + D NL ++ Sbjct: 339 DLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLCEFREV 396 Query: 1348 EE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVIL 1172 ++ K + + S R A AEACA ALSQAAEAVA+G+ D DA +EAG++IL Sbjct: 397 KDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEAGVIIL 456 Query: 1171 PPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPF 995 PP EVDE E+ DV++ D +KWP KP +ADLFD EDSW+D+PPEGFSLTLSPF Sbjct: 457 PPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSLTLSPF 516 Query: 994 ATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCI 815 +TM+MALF W+++SSLAYIYG++ES + ++ VNGREYP KV + DGRSSEIKQT AGC+ Sbjct: 517 STMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQTLAGCL 576 Query: 814 SRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCR 635 +R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR QW+VIVLLF+DALSV R Sbjct: 577 ARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDALSVSR 636 Query: 634 IPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 IPALT Y+ +R+LL KVL+ AQ+ EE+E+MKDL+IPLGR P+FS QSGG Sbjct: 637 IPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 687 
>ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Sesamum indicum] Length = 699 Score = 586 bits (1510), Expect = e-164 Identities = 328/656 (50%), Positives = 439/656 (66%), Gaps = 40/656 (6%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS Sbjct: 51 LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 110 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R RKGR++IS +E + DL+ET YCSS+C S+ F AS +EER + +P + EVL Sbjct: 111 ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 169 Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 LF LSL+ D G NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+ Sbjct: 170 KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 227 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEV-----VVGNETAFVSTVIIGDQLSASEASMG 1634 +L P + + +KG++ ++ + + + ++ ++ F ST+I D+ S S++ Sbjct: 228 NLK--PKQSSNLKKGARQEQVESEYKHDPPDVADILSSDLNFTSTIITQDEYSISKSVPL 285 Query: 1633 PQKNDSKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSK 1523 + +SK K + KSK D + K + S + Sbjct: 286 VKDKESKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAA 345 Query: 1522 GPQGEDSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLF 1364 GP D + G + LKS+LK+S K +RSVTWAD K + D NL Sbjct: 346 GPSQNDLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLC 403 Query: 1363 NGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEA 1187 ++ ++ K + + S R A AEACA ALSQAAEAVA+G+ D DA +EA Sbjct: 404 EFREVKDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEA 463 Query: 1186 GIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010 G++ILPP EVDE E+ DV++ D +KWP KP +ADLFD EDSW+D+PPEGFSL Sbjct: 464 GVIILPPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSL 523 Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830 TLSPF+TM+MALF W+++SSLAYIYG++ES + ++ VNGREYP 
KV + DGRSSEIKQT Sbjct: 524 TLSPFSTMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQT 583 Query: 829 FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650 AGC++R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR QW+VIVLLF+DA Sbjct: 584 LAGCLARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDA 643 Query: 649 LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 LSV RIPALT Y+ +R+LL KVL+ AQ+ EE+E+MKDL+IPLGR P+FS QSGG Sbjct: 644 LSVSRIPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 699 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 574 bits (1479), Expect = e-160 Identities = 332/666 (49%), Positives = 435/666 (65%), Gaps = 50/666 (7%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 +AVKDAVHKLQL LL+GI E QL +AGSLLSRSDY+DVVTERSI+N+CGYPLC NSLPS Sbjct: 7 VAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R +RKG ++IS +E + DL ET YCS+ C S F S ++ER + +P K+ +VL Sbjct: 67 ER-SRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVL 125 Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSD 1796 LF L L D K+NG G +LKI +K D K GEVSLE+W+GP NAIEGYVP+ D Sbjct: 126 NLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQ--RD 183 Query: 1795 LSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE------ASMG 1634 S P K+ KGSK K A + + +++ NE F ST+I D+ S S+ A Sbjct: 184 RSVNPALLKNINKGSKNKHARLQDEKNMIL-NEFDFSSTIITQDEYSVSKFPAPVNADSN 242 Query: 1633 PQKNDSKPKANRKSKGKDIVE----------KAGKQSETQSRSA------------LSKG 1520 + +++ K K + D+ ++G+++E ++ +S G Sbjct: 243 VKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSSG 302 Query: 1519 PQGED--SVAAAVVKQNG---------SQLKSALKSSGVKPLSRSVTWADEKKAENIDAG 1373 P D + + ++ +G +LKS+LKSS K +SRSVTWADE +ID G Sbjct: 303 PSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSVTWADE----SIDGG 358 Query: 1372 
NLFNGQKTEEKSESIK--------NXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGE 1217 G+KTE S+ + + S RF AEACA ALSQAAEAVASG Sbjct: 359 I---GKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAVASGS 415 Query: 1216 CDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSW 1040 D DA ++AGIVILPP EVDE E ++ + + +KWPRKP + + D+F+ EDSW Sbjct: 416 -DVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESEDSW 474 Query: 1039 HDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALS 860 +D+PPEGF++TLSPF TM+ +LF W+S+SSLA+IYG DES+ + +L +NGREYPRK+ LS Sbjct: 475 YDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIVLS 534 Query: 859 DGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQW 680 DGRS+EIKQT AGC++R+LPG+VADLRLP PIS LEQ + LL+TMS+VDPLP FR QW Sbjct: 535 DGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMKQW 594 Query: 679 KVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEF 500 ++IVLLF+DALSVCRIP LT YM+ +R KVLD AQ+ EYE+MKDL+IPLGR P+F Sbjct: 595 QLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGRVPQF 654 Query: 499 SAQSGG 482 S QSGG Sbjct: 655 SMQSGG 660 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 570 bits (1468), Expect = e-159 Identities = 337/695 (48%), Positives = 438/695 (63%), Gaps = 69/695 (9%) Frame = -3 Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183 S MAK+ I+V +AVHK+QL LLDGI E QL ++GSL+SRSDYEDVVTER+ISN C Sbjct: 53 SSMAKEQS---ISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109 Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003 GYPLC N LPS+ RKGR++IS +E + DL+ET +CS+ C S+ F S +EER + Sbjct: 110 GYPLCANPLPSEP-RRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826 V + K+ ++L LFG+L L+D KNG +G L+I + + KA +VSL GP NAI Sbjct: 169 VLNHAKLNDILSLFGDLDLDDNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAI 225 Query: 1825 
EGYVPRDRSDLSSLPTREKHREK----------GSKPKE---------AGTV-------- 1727 EGYVP+ +L S PT K+ + GSK +E AGT+ Sbjct: 226 EGYVPQ--RELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYII 283 Query: 1726 ----------------EREEVVVGNETAFVSTVIIGDQLSASEASMGPQKN--------- 1622 ++E V NE F S +I+ D+ + S+ G +++ Sbjct: 284 SKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEV 343 Query: 1621 -------DSKPK-------ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVV 1484 DS+ K + + K IVE ++ QS S +++ A V Sbjct: 344 EEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAV 403 Query: 1483 KQNGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXX 1307 + + LKS+LKS+G K L+R VTWAD+KKA+N GNL ++ E K +S + Sbjct: 404 TSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAED 463 Query: 1306 XXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELM 1130 LRF AEACA+ALS+AAEAVASG+ D DA E G++ILP + EVD+ E Sbjct: 464 GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDG 523 Query: 1129 DVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASS 950 D+ EP+ VKWP+KP + +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SS Sbjct: 524 DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 583 Query: 949 LAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPT 770 LAYIYGRDES + +L +NGREYPRK+AL DGRSSEIK+T A CISR+LP +V DLRLP Sbjct: 584 LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 643 Query: 769 PISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLL 590 PIS LEQ +G L+DT+S+++ LP FR QW+VIVLLFIDALSVCRIPALT +M++ RMLL Sbjct: 644 PISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLL 703 Query: 589 HKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 HKVLD AQ+ EEYEVMKDL+IPLGR P FSAQSG Sbjct: 704 HKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738 >ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Gossypium raimondii] Length = 695 Score = 560 bits (1442), Expect = e-156 Identities = 336/702 (47%), Positives = 434/702 (61%), Gaps = 76/702 (10%) Frame = -3 Query: 
2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183 S MAKD I+V +AVHK+QL LLDGI E QL S+GSL+SRSDYEDVVTERSISN C Sbjct: 6 SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVVTERSISNTC 62 Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003 GYPLC N LPS+ R+GR++IS +E + DL+ET ++CS+ C S+ F S +EER + Sbjct: 63 GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCSADCLINSRAFAGSLQEERCS 121 Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826 V + K+ +L LF ++ L D+ KNG +G LKI + + KAGEVS +GP NAI Sbjct: 122 VLNHAKLNAILSLFDDVDLNDEDLGKNGDLGFSNLKIKENEEIKAGEVSS---VGPSNAI 178 Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709 EGYVP+ +L S P+ K+ + G S + G ++ + V Sbjct: 179 EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236 Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSK 1589 V NE F S +I+ D+ + S+ G ++ S K +K++ Sbjct: 237 SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKL-KKTE 295 Query: 1588 G----KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ----- 1478 G KD EK + ++S SAL+K G D++ A K+ Sbjct: 296 GQGVCKDFEEKCMR---SESSSALTKEDSGIVEMPSTKCVDQSGLDTINAEAEKETHSDK 352 Query: 1477 ----NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXX 1310 +G LKS+LKS+G K L+RSVTWAD+K + G+L ++ + + +N Sbjct: 353 AVASSGVVLKSSLKSAGAKKLNRSVTWADKKNVDGARKGSLCEVKEMDAQKGDSENLGRA 412 Query: 1309 XXXXXXS--LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSS 1139 LRFA AEACA+ALS+AA AVASG+ D DA +EAG++IL P+E D+ Sbjct: 413 EDGDDDDNMLRFASAEACAMALSEAAAAVASGDSDVNDAVSEAGLIILAHPLEADKEEKV 472 Query: 1138 ELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALF 971 E +D EP+ VKWP KP + +D FD EDSW D PPEGFSLTLS FATMW ALF Sbjct: 473 ENIDTLEAEPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNALF 532 Query: 970 GWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVV 791 W+++SSLAYIYGRDE+ + +L VNGREYP+K+ L DGRSSEIK+T AGCISR+ P +V Sbjct: 533 EWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETLAGCISRAFPAIV 592 Query: 
790 ADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYM 611 LRLP PIS LEQ +GRLLDTMS+V+ LP FR QW+VIVLL IDALSVCRIPALT +M Sbjct: 593 TALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVIVLLLIDALSVCRIPALTPHM 652 Query: 610 SSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 ++ RMLLHKVLD AQ+ EEYEVMKDL+IPLGR P FSAQSG Sbjct: 653 TNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 694 >ref|XP_010271590.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Nelumbo nucifera] gi|720049898|ref|XP_010271591.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Nelumbo nucifera] Length = 650 Score = 558 bits (1439), Expect = e-156 Identities = 328/658 (49%), Positives = 430/658 (65%), Gaps = 41/658 (6%) Frame = -3 Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153 P++VKDAVHKLQL+LL+GI +E QLF+AGSL+SRSDYEDVVTER I+ +CGYPLC N L Sbjct: 6 PLSVKDAVHKLQLSLLEGICNEDQLFAAGSLMSRSDYEDVVTERHITKVCGYPLCKNPLS 65 Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973 +R RKGR++IS +E + DL+ET YCSS C S+ F S ER +VSD KI EV Sbjct: 66 LER-PRKGRYRISVKEHKVYDLQETYMYCSSGCLVNSRAFAGSLATERCSVSDSSKINEV 124 Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKA-GEVSLEDWIGPPNAIEGYVPRDR 1802 L LF +LS +DK + G +G +LKI +K D G VSLEDWIGP NAIEGYVP++ Sbjct: 125 LRLFEDLSSKDKEILGEEGNLGFSKLKIQEKEDVNVTGNVSLEDWIGPSNAIEGYVPKNC 184 Query: 1801 SDLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLS---ASEASMGP 1631 KH E+GSK K A + + ++ V E F ST+IIGDQ A AS G Sbjct: 185 GS--------KHLEEGSKQKIAKSKKGKDKVA-KEMDFKSTIIIGDQFKIPKAPAASNGY 235 Query: 1630 QKNDSKPKAN----------------------------RKSKGK---DIVEKAGKQSETQ 1544 ++N K K+ ++S+G+ ++++ G +T Sbjct: 236 EQNLGKSKSGESSCVPEEWLSILNPSPAPEKSGSGITVKESEGEISGNVLKDHGIPGKTL 295 Query: 1543 SRSALSKGPQGEDSVAAAVVK--QNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDAG 1373 S +S E + V K Q+G + LKS++K G K L+R+VTWADE+++ + Sbjct: 296 SGQNVSDTSGQETKIKLDVGKTIQSGETALKSSIKPPGAKKLTRNVTWADERESGKVGND 355 Query: 1372 
NLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAAT 1193 NL +T+E + +++ +L FA AEACAIALSQAAEAVASGE D DA + Sbjct: 356 NLVKIAETQETA--VRSDGSNVEDEDCTLCFASAEACAIALSQAAEAVASGESDVFDAVS 413 Query: 1192 EAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGF 1016 +AGIVI+P P + DEG++ +DV E +R +WPR+ V LD F ED + PP+GF Sbjct: 414 DAGIVIMPHPPDADEGDTQGEVDVLESERIPFRWPRRRVDLDPQFFYFEDILSE-PPDGF 472 Query: 1015 SLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIK 836 S++LSPF T+WMALFGW+++S+LAYIYGRDE+S F VNG+EYP KV DGRS EIK Sbjct: 473 SMSLSPFGTIWMALFGWITSSTLAYIYGRDENSHLEFQLVNGKEYPCKVVFRDGRSYEIK 532 Query: 835 QTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFI 656 +T A C+SR+LPG+VAD+ LPTPIS LEQ +G LLDTM++V+ LP R QW VIV LF+ Sbjct: 533 ETLASCLSRALPGLVADVNLPTPISTLEQGMGCLLDTMTFVEALPSLRMKQWHVIVFLFV 592 Query: 655 DALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482 DALSVCR+PAL ++S+RMLL KVLD AQ+ GEEYE+MKD ++PLGR P+FS QSGG Sbjct: 593 DALSVCRMPALNPLVTSRRMLLQKVLDGAQISGEEYELMKDHILPLGRLPQFSTQSGG 650 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 557 bits (1435), Expect = e-155 Identities = 330/667 (49%), Positives = 427/667 (64%), Gaps = 51/667 (7%) Frame = -3 Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153 PI+VKDAV KLQLALL+GI SE QLF+AGSL+SRSDYEDVVTERSI+ +C YPLC N+LP Sbjct: 6 PISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCNALP 65 Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973 S+R RKGR++IS +E + DL ET +CSS+C SK F S K++R DP K+ + Sbjct: 66 SER-PRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLNNI 124 Query: 1972 LGLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799 L LFG +LE + K+G +G+ L+I DK + EVSLE W+GP NAIEGYVP+ R Sbjct: 125 LRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVPKKRD 183 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQK-- 1625 + S +K+ +KGSK G + ++ +E 
F+ST+I+ D+ S S+ S G Sbjct: 184 NGSK--GSQKNTKKGSKASH-GKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDAT 240 Query: 1624 --NDSKPKANRKS----------KGKDIVEKAGKQSETQSRSALSKGPQ----------G 1511 + KP A + K DI + + + + + SA K + G Sbjct: 241 VDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLKG 300 Query: 1510 EDSVAAA-------------------VVKQNGS---QLKSALKSSGVKPLSRSVTWADEK 1397 + + AA + K+ GS + KS+LKS+G K L RSVTWAD K Sbjct: 301 KTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWAD-K 359 Query: 1396 KAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASG 1220 K + + +L ++ K ES LR AEACAIALSQAAEAVASG Sbjct: 360 KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419 Query: 1219 ECDAEDAATEAGIVILPPVE--VDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCED 1046 + DA DA +EAGI+ILP E V+E ++ D+ E D ++KWPRKP + D DLF +D Sbjct: 420 DSDAIDAVSEAGIIILPHTENAVEESTVDDV-DILETDSVTLKWPRKPGISDFDLFASDD 478 Query: 1045 SWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVA 866 SW D PPEGFSLTLSPFAT+W A F W+++SSLAYIYGRD S + FL V+GREYP K+ Sbjct: 479 SWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIV 538 Query: 865 LSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTN 686 LSDGRSSEIKQT A C++R+LP VVA+L+LP P+S LEQ + LLDTMS+VDPLP FR Sbjct: 539 LSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRFK 598 Query: 685 QWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQP 506 QW+V+ LLF+DALSVCRIPAL YM+ +R L HKVL +Q+G EEY V+KDL++PLGR P Sbjct: 599 QWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLGRAP 658 Query: 505 EFSAQSG 485 FS+QSG Sbjct: 659 HFSSQSG 665 >ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159708|ref|XP_012479685.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159710|ref|XP_012479686.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] 
gi|823159712|ref|XP_012479687.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|823159714|ref|XP_012479688.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Gossypium raimondii] gi|763764410|gb|KJB31664.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764411|gb|KJB31665.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764412|gb|KJB31666.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764413|gb|KJB31667.1| hypothetical protein B456_005G200700 [Gossypium raimondii] gi|763764414|gb|KJB31668.1| hypothetical protein B456_005G200700 [Gossypium raimondii] Length = 708 Score = 555 bits (1429), Expect = e-155 Identities = 336/715 (46%), Positives = 434/715 (60%), Gaps = 89/715 (12%) Frame = -3 Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183 S MAKD I+V +AVHK+QL LLDGI E QL S+GSL+SRSDYEDVVTERSISN C Sbjct: 6 SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVVTERSISNTC 62 Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003 GYPLC N LPS+ R+GR++IS +E + DL+ET ++CS+ C S+ F S +EER + Sbjct: 63 GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCSADCLINSRAFAGSLQEERCS 121 Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826 V + K+ +L LF ++ L D+ KNG +G LKI + + KAGEVS +GP NAI Sbjct: 122 VLNHAKLNAILSLFDDVDLNDEDLGKNGDLGFSNLKIKENEEIKAGEVSS---VGPSNAI 178 Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709 EGYVP+ +L S P+ K+ + G S + G ++ + V Sbjct: 179 EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYLD 236 Query: 1708 ---------------------------------VGNETAFVSTVIIGDQLSASEASMGPQ 1628 V NE F S +I+ D+ + S+ G + Sbjct: 237 FTSAVIMNNEYTTSKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSR 296 Query: 1627 KNDSKPKANRKSKG----KDIVEKAGKQSETQSRSALSK--------------GPQGEDS 1502 + S K +K++G KD EK + ++S SAL+K G D+ Sbjct: 297 
QGSSGSKL-KKTEGQGVCKDFEEKCMR---SESSSALTKEDSGIVEMPSTKCVDQSGLDT 352 Query: 1501 VAAAVVKQ---------NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKT 1349 + A K+ +G LKS+LKS+G K L+RSVTWAD+K + G+L ++ Sbjct: 353 INAEAEKETHSDKAVASSGVVLKSSLKSAGAKKLNRSVTWADKKNVDGARKGSLCEVKEM 412 Query: 1348 EEKSESIKNXXXXXXXXXXS--LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVI 1175 + + +N LRFA AEACA+ALS+AA AVASG+ D DA +EAG++I Sbjct: 413 DAQKGDSENLGRAEDGDDDDNMLRFASAEACAMALSEAAAAVASGDSDVNDAVSEAGLII 472 Query: 1174 LP-PVEVDEGNSSELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010 L P+E D+ E +D EP+ VKWP KP + +D FD EDSW D PPEGFSL Sbjct: 473 LAHPLEADKEEKVENIDTLEAEPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSL 532 Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830 TLS FATMW ALF W+++SSLAYIYGRDE+ + +L VNGREYP+K+ L DGRSSEIK+T Sbjct: 533 TLSTFATMWNALFEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKET 592 Query: 829 FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650 AGCISR+ P +V LRLP PIS LEQ +GRLLDTMS+V+ LP FR QW+VIVLL IDA Sbjct: 593 LAGCISRAFPAIVTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVIVLLLIDA 652 Query: 649 LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 LSVCRIPALT +M++ RMLLHKVLD AQ+ EEYEVMKDL+IPLGR P FSAQSG Sbjct: 653 LSVCRIPALTPHMTNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 707 >ref|XP_010042212.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Eucalyptus grandis] gi|629120488|gb|KCW84978.1| hypothetical protein EUGRSUZ_B01798 [Eucalyptus grandis] Length = 672 Score = 552 bits (1422), Expect = e-154 Identities = 320/672 (47%), Positives = 423/672 (62%), Gaps = 56/672 (8%) Frame = -3 Query: 2332 PIAVKDAVHKLQLALLDGIHS-EYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSL 2156 P++VKDAV++LQ LLDG + E QL +AG++LSR DYEDVV ERSI+ LCGYPLC L Sbjct: 8 PVSVKDAVYRLQHLLLDGAAAGEAQLLAAGAILSRRDYEDVVAERSIAGLCGYPLCATPL 67 Query: 2155 PSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWE 1976 P+DR RKGR++IS +E + DL+ET YCS C S+ F S + ER AV D K+ E Sbjct: 68 
PADR-PRKGRYRISLKEHRVYDLQETYMYCSPGCVVDSRAFAGSLQPERCAVLDLVKVEE 126 Query: 1975 VLGLFGELSLEDKGKKNGA---MGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRD 1805 VL +FG+ L + + +G +G+ LKI + + +AGEV LE+W+GP NAIEGYVPR Sbjct: 127 VLRVFGDKGLGSQERGDGGVGELGMSGLKIKENEEVRAGEVPLEEWVGPSNAIEGYVPRK 186 Query: 1804 RSDLSSLPTREKHREK-----GSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEAS 1640 R D ++ R K GSK + + ++E ++ N+ F S +I D+ S S+ Sbjct: 187 RDDKAAAAAAAASRAKKEPREGSKSRNSKPSKKE--LIFNDMDFTSIIITQDEYSISKLP 244 Query: 1639 MGPQKNDSKPKANRKSKGKDI--------------------------------------- 1577 + + S KA ++SKGK + Sbjct: 245 VNSVEEVSATKA-KESKGKKVNGKDKQSRRAVIETSSAKPGTPNINQRELKGKSHDITED 303 Query: 1576 ---VEKAGKQSETQSRSALSK--GPQGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVT 1412 +K SE ++LS G +G D A ++LK +LKS+G K ++RSVT Sbjct: 304 EYSAQKVPSPSEVCQSNSLSHFTGAEGADDDGKADGTSTETRLKPSLKSTGTKKVTRSVT 363 Query: 1411 WADEKKAENIDAGNLFN-GQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAE 1235 WADEK D G+L + +EK + + +RF+ AEACA+ALSQAAE Sbjct: 364 WADEK-VNVADGGHLCEIREMVDEKEPPLTSAIENEHDDENLMRFSSAEACAMALSQAAE 422 Query: 1234 AVASGECDAEDAATEAGIVILP-PVEVDEGNSSE-LMDVSEPDRQSVKWPRKPVLLDADL 1061 A SGE D DAA G++ILP P EVDE E D E D SVKWP+KP + AD+ Sbjct: 423 AATSGESDVFDAA---GLIILPRPHEVDEKAPVEDNADPLEVDSASVKWPKKPGIPTADI 479 Query: 1060 FDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREY 881 FD +DSW+D PP+GF++TLSPFATMW ALF W ++S+LAYIYG+DES + ++ VNGREY Sbjct: 480 FDADDSWYDAPPDGFNMTLSPFATMWGALFAWTTSSTLAYIYGKDESFHEEYMSVNGREY 539 Query: 880 PRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLP 701 P+K+ L DGRS+EIKQT AGC+SR+LPG+++DLRLP P+S LEQ LGRLLDTM+++D LP Sbjct: 540 PQKLVLPDGRSTEIKQTLAGCLSRALPGLISDLRLPLPVSTLEQGLGRLLDTMTFMDALP 599 Query: 700 PFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIP 521 RT QW+VIVLLFIDALSVCR+P LT +MS++ L KVL AA++ EEYE+MKDLLIP Sbjct: 600 ALRTKQWQVIVLLFIDALSVCRVPVLTAHMSNRHPSLQKVLQAARMSVEEYEIMKDLLIP 659 Query: 520 LGRQPEFSAQSG 485 LGR P+FSAQSG Sbjct: 660 LGRAPQFSAQSG 671 >gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium 
arboreum] Length = 729 Score = 551 bits (1420), Expect = e-154 Identities = 332/701 (47%), Positives = 432/701 (61%), Gaps = 77/701 (10%) Frame = -3 Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183 S MAKD I+V +AVHK+QL LLDGI E QL S+GSL+SRSDYEDV+TERSISN C Sbjct: 6 SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVITERSISNTC 62 Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003 GYPLC N LPS+ R+GR++IS +E + DL+ET ++C + C S+ F S +EER + Sbjct: 63 GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCLADCLINSRAFAGSLQEERCS 121 Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826 V + K+ +L LF ++ L DK KNG +G LKI + + KAGE+S +GP NAI Sbjct: 122 VLNHAKLNAILSLFDDVDLNDKDLGKNGDLGFSNLKIKENEEIKAGEISS---VGPSNAI 178 Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709 EGYVP+ +L S P+ K+ + G S + G ++ + V Sbjct: 179 EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236 Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANR-KS 1592 V NE F S +I+ D+ + S+ G ++ S K + + Sbjct: 237 SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKLEKTEG 296 Query: 1591 KG--KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ------ 1478 KG KD EK + ++S SAL+K G D++ A K+ Sbjct: 297 KGVCKDFEEKCMR---SESSSALTKEDSGIVQMPSTKCVDQSGLDTINAEAEKETHSDKA 353 Query: 1477 ---NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXX 1307 +G LKS+LK +G K L+RSVTWAD+K ++ G+L ++ + + +N Sbjct: 354 MASSGVVLKSSLKPAGAKKLNRSVTWADKKNVDSARKGSLCEVKEMDAQKGDSENIGRAE 413 Query: 1306 XXXXXS--LRFALAEACAIALSQAAEA--VASGECDAEDAATEAGIVILP-PVEVDEGNS 1142 LRFA AEACA+ALS+AA A VASG+ D DA +EAG++ILP P+E D+ Sbjct: 414 DGDADDKMLRFASAEACAMALSKAAAAAAVASGDSDVNDAVSEAGLIILPHPLEADKEEK 473 Query: 1141 SELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMAL 974 E +D EP+ VKWP KP + +D FD EDSW D PPEGFSLTLS FATMW AL Sbjct: 474 VENIDTLEADPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNAL 533 Query: 973 
FGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGV 794 F W+++SSLAYIYGRDE+ + +L VNGREYP+K+ L DGRSSEIK+T AGCISR+LP + Sbjct: 534 FEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETLAGCISRALPAI 593 Query: 793 VADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQY 614 V LRLP PIS LEQ +GRLLDTMS+V+ LP FR QW+V+VLL IDALSVCRIPALT + Sbjct: 594 VTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVLVLLLIDALSVCRIPALTPH 653 Query: 613 MSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQ 491 M++ RMLLHKVLD AQ+ EEYEVMKDL+IPLGR P FSAQ Sbjct: 654 MTNGRMLLHKVLDGAQISLEEYEVMKDLIIPLGRAPHFSAQ 694 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 551 bits (1419), Expect = e-153 Identities = 330/671 (49%), Positives = 435/671 (64%), Gaps = 55/671 (8%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 +AVKDAVHKLQL LL+GI E QL +AGSLLSRSDY+DVVTERSI+N+CGYPLC NSLPS Sbjct: 7 VAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R +RKG ++IS +E + DL ET YCS+ C S F S ++ER + +P K+ +VL Sbjct: 67 ER-SRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVL 125 Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAK-AGEVSLEDWIGPPNAIEGYVPRDRS 1799 LF L L + K+NG +G +LKI +K D K GEVSLE+W+GP NAIEGYVP + Sbjct: 126 NLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVP--QR 183 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE------ASM 1637 D S P K+ KG K K A ++ E+ ++ NE F ST+I D+ S S+ A Sbjct: 184 DRSVNPALLKNINKGFKNKHA-RLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVS 242 Query: 1636 GPQKNDSKPKANRKSKGKDI----------VEKAGKQSETQSRSA------------LSK 1523 + +++ K K + D+ ++G+++E ++ +S Sbjct: 243 SEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS 302 Query: 1522 GPQGED--SVAAAVVKQNGSQ-----------LKSALKSSGVKPLSRSVTWADEKKAENI 1382 GP D + + ++ +G + LKS+LKSS K +S+SVTWAD E I Sbjct: 303 
GPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWAD----EII 358 Query: 1381 DAGNLFNGQKTEEKSESIK--------NXXXXXXXXXXSLRFALAEACAIALSQAAEAVA 1226 D G G+KTE S+ + + S RF AEACA ALSQAAEAVA Sbjct: 359 DGG---IGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAVA 415 Query: 1225 SGECDAEDAATEAGIVILP-PVEVDEG--NSSELMDVSEPDRQSVKWPRKPVLLDADLFD 1055 SG D DA ++AGIVILP EVDE +E++D+ EP +KWPRKP + + D+F+ Sbjct: 416 SGS-DVPDAVSKAGIVILPTSQEVDEAILQETEMLDI-EP--APLKWPRKPGMPNYDVFE 471 Query: 1054 CEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPR 875 ED W+D PPEGF++TLSPFATM+ +LF W+S+SSLA+IYG DE++ + +L +NGREYP Sbjct: 472 SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531 Query: 874 KVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPF 695 K+ LSDG S+EIKQT AGC++R+LPG+VADLRLP PIS LEQ + LL+TMS+VDPLP F Sbjct: 532 KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAF 591 Query: 694 RTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLG 515 R QW++IVLLF+DALSVCRIP LT YM+ +R L KVLD AQ+ EYE+MKDL+IPLG Sbjct: 592 RMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLG 651 Query: 514 RQPEFSAQSGG 482 R P+FS QSGG Sbjct: 652 RVPQFSMQSGG 662 >gb|KHG00855.1| hypothetical protein F383_23706 [Gossypium arboreum] Length = 708 Score = 546 bits (1408), Expect = e-152 Identities = 334/714 (46%), Positives = 434/714 (60%), Gaps = 88/714 (12%) Frame = -3 Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183 S MAKD I+V +AVHK+QL LLDGI E QL S+GSL+SRSDYEDV+TERSISN C Sbjct: 6 SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVITERSISNTC 62 Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003 GYPLC N LPS+ R+GR++IS +E + DL+ET ++C + C S+ F S +EER + Sbjct: 63 GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCLADCLINSRAFAGSLQEERCS 121 Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826 V + K+ +L LF ++ L DK KNG +G LKI + + KAGE+S +GP NAI Sbjct: 122 VLNHAKLNAILSLFDDVDLNDKDLGKNGDLGFSNLKIKENEEIKAGEISS---VGPSNAI 178 Query: 
1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709 EGYVP+ +L S P+ K+ + G S + G ++ + V Sbjct: 179 EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236 Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANR-KS 1592 V NE F S +I+ D+ + S+ G ++ S K + + Sbjct: 237 SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKLEKTEG 296 Query: 1591 KG--KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ------ 1478 KG KD EK + ++S SAL+K G D++ A K+ Sbjct: 297 KGVCKDFEEKCMR---SESSSALTKEDSGIVQMPSTKCVDQSGLDTINAEAEKETHSDKA 353 Query: 1477 ---NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXX 1307 +G LKS+LK +G K L+RSVTWAD+K ++ G+L ++ + + +N Sbjct: 354 MASSGVVLKSSLKPAGAKKLNRSVTWADKKNVDSARKGSLCEVKEMDAQKGDSENIGRAE 413 Query: 1306 XXXXXS--LRFALAEACAIALSQAAEA--VASGECDAEDAATEAGIVILP-PVEVDEGNS 1142 LRFA AEACA+ALS+AA A VASG+ D DA +EAG++ILP P+E D+ Sbjct: 414 DGDADDKMLRFASAEACAMALSKAAAAAAVASGDSDVNDAVSEAGLIILPHPLEADKEEK 473 Query: 1141 SELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLT----------- 1007 E +D EP+ VKWP KP + +D FD EDSW D PPEGFSLT Sbjct: 474 VENIDTLEADPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTVSLIDGQECHK 533 Query: 1006 LSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTF 827 LS FATMW ALF W+++SSLAYIYGRDE+ + +L VNGREYP+K+ L DGRSSEIK+T Sbjct: 534 LSTFATMWNALFEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETL 593 Query: 826 AGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDAL 647 AGCISR+LP +V LRLP PIS LEQ +GRLLDTMS+V+ LP FR QW+V+VLL IDAL Sbjct: 594 AGCISRALPAIVTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVLVLLLIDAL 653 Query: 646 SVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 SVCRIPALT +M++ RMLLHKVLD AQ+ EEYEVMKDL+IPLGR P FSAQSG Sbjct: 654 SVCRIPALTPHMTNGRMLLHKVLDGAQISLEEYEVMKDLIIPLGRAPHFSAQSG 707 >ref|XP_012859052.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Erythranthe guttatus] gi|604299511|gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Erythranthe guttata] Length = 597 
Score = 546 bits (1406), Expect = e-152 Identities = 305/623 (48%), Positives = 417/623 (66%), Gaps = 7/623 (1%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 + VKDAVHKLQL+LL+GI E QL +AGSL+S+SDY+DVVTER+I+++CGYPLC NSLPS Sbjct: 7 LGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 + RKG ++IS +E + DL ET YCS+ C S+ F AS +EER + DP KI VL Sbjct: 67 EP-PRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSVL 125 Query: 1969 GLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPR-DRS 1799 +F LSL+ K+G +G+ LKI +K +GE+SLE+W+GP NAI+GYVPR D++ Sbjct: 126 KMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQN 185 Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKND 1619 P+R+K +KP A T+ + F ST+I+ D+ S S+ ++ Sbjct: 186 SERKQPSRKKTESNHAKPNLADTLPFD-------VNFTSTIIMQDEYSVSKTAVP----- 233 Query: 1618 SKPKANRKSKGKDIVEKAGKQSETQSRSAL--SKGPQGEDSVAAAVVKQNGSQLKSALKS 1445 R++KGK + K + + S L + GP D+ LKS+LK+ Sbjct: 234 ------REAKGKVKGKMIRKSVKAEKISVLDDTAGPSQNDTTL----------LKSSLKT 277 Query: 1444 SGVKPLSRSVTWADEKKAENIDAGNLFNGQKT-EEKSESIKNXXXXXXXXXXSLRFALAE 1268 K +RSVTWADEK + D ++ ++ + K + S RF AE Sbjct: 278 LDSKKETRSVTWADEKS--DGDGKSISECREIGDNKGAVVMPHLTDEDVGDESYRFTSAE 335 Query: 1267 ACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWP 1091 ACA ALSQA+EAVASG+ DA DA +EAG++ILPP EVDE ++ +V + D +KWP Sbjct: 336 ACARALSQASEAVASGKTDASDAVSEAGVIILPPPHEVDEAKYEQIGEVVDVDPIELKWP 395 Query: 1090 RKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQD 911 KP DLFD EDSW+D+PPEGF+LTLSPF+TM+M+LF W+S+SSLAYIYG++E + Sbjct: 396 PKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMFMSLFAWISSSSLAYIYGKEERFHE 455 Query: 910 YFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLL 731 +L +NGREYP K+ + DGRS+E+K T AGC++R+LPG+V+++R+PTP+S +EQ +GRLL Sbjct: 456 DYLSINGREYPPKIII-DGRSAEVKHTLAGCLARALPGLVSEIRIPTPVSTIEQGMGRLL 514 Query: 730 
DTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEE 551 DTMS+ D LP FR QW+VI LLF+DALSV RIPAL+ YM+ +R+LL KVL+ AQ+ EE Sbjct: 515 DTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPALSPYMTGRRILLPKVLEGAQINVEE 574 Query: 550 YEVMKDLLIPLGRQPEFSAQSGG 482 +E+MKDL+IPLGR P+FS QSGG Sbjct: 575 FEIMKDLIIPLGRVPQFSTQSGG 597 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 545 bits (1405), Expect = e-152 Identities = 330/706 (46%), Positives = 426/706 (60%), Gaps = 82/706 (11%) Frame = -3 Query: 2356 MAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGY 2177 MAKD VKD ++KLQL+LLDGI +E QL +AGS++S SDYEDVVTER+I+NLCGY Sbjct: 1 MAKDQST---VVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGY 57 Query: 2176 PLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVS 1997 PLC NSLPSDR +KGR++IS +E + DL ET YCSS+C S+TF+ S +EER V Sbjct: 58 PLCGNSLPSDR-PQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVL 116 Query: 1996 DPGKIWEVLGLFGELSLEDKGK--KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIE 1823 +P K+ EVL LF SL +G KNG +G LKI +K + GEVS E WIGP NAIE Sbjct: 117 NPAKLNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIE 176 Query: 1822 GYVP-RDRSD----------LSSLPTREKHR----------------------------E 1760 GYVP RDR + SS+ T++++ Sbjct: 177 GYVPQRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGS 236 Query: 1759 KGSKPKEAGTVEREEVVVGNETAFVSTVIIG-DQLSASEASMGPQKNDSKPKANRKSKG- 1586 KGSK K ++E + N+ F ST+II D+ S S++ G SK K ++ + Sbjct: 237 KGSKAKGTKQSSKQESFI-NDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKV 295 Query: 1585 --------KDIVEKAG------KQSETQSRSALSKGPQGED------------------- 1505 K G K E +S+ A+ +D Sbjct: 296 SQKSSENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEA 355 Query: 1504 ---SVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEK--KAENIDAGNLFNGQKTEEK 1340 SV+ K S LK +LK+SG K L+RSVTWADEK + + D + + T+ Sbjct: 356 KEKSVSEKAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAG 415 Query: 1339 
SESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PV 1163 E + N +F AEACA ALSQAAEAVASG+ DA +A +EAG+VILP P Sbjct: 416 PEIVDNIDKRDDGYVS--KFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPH 473 Query: 1162 EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMW 983 ++D+G+ E +DV + + ++KWP KP + ++ FD E+SW+D PPEGFSL LS FAT+W Sbjct: 474 DLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIW 533 Query: 982 MALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSL 803 MALF WV++SSLAY+YG+DESS + +L VNGREYPRK+ L DGRS EI+QT GC+ R+ Sbjct: 534 MALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAF 593 Query: 802 PGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPAL 623 P VVADLRLP PIS LEQ LL TMS+VD +P FR QW+VI LLFI+ALSVCRIPAL Sbjct: 594 PVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPAL 653 Query: 622 TQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485 YM ++RM V+D ++ EEYEVMKDL+IPLGR P+FS QSG Sbjct: 654 ISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695 >ref|XP_009771014.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Nicotiana sylvestris] gi|698557405|ref|XP_009771015.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Nicotiana sylvestris] Length = 664 Score = 543 bits (1400), Expect = e-151 Identities = 324/670 (48%), Positives = 427/670 (63%), Gaps = 54/670 (8%) Frame = -3 Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150 IAVKDA+HKLQL LL+GI E QLF+AGSLLSR DY+DVVTERSI+N+CGYPLC NSLPS Sbjct: 7 IAVKDAIHKLQLYLLEGIKDENQLFAAGSLLSRRDYQDVVTERSIANMCGYPLCSNSLPS 66 Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970 +R R G ++IS +E + DL ET YCS+ C S F S ++ER + + K+ EVL Sbjct: 67 ER-PRNGHYRISLKEHKVYDLHETYMYCSTNCAVNSGAFARSLQDERSSTLNTAKLNEVL 125 Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSD 1796 LF L L + K++G +G+ +LKI +K D K GEVS+E+W+GP +AIEGYVP+ + Sbjct: 126 
KLFVGLHLHSTEDVKESGDLGLSKLKIQEKVDVKGGEVSMEEWMGPSDAIEGYVPQRERN 185 Query: 1795 LSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKNDS 1616 L P + +K SK K+A ++ E+ ++ +E F ST+I D S S+ S Sbjct: 186 LK--PALLNNIKKSSKNKQA-KLQNEKNMILHEMDFSSTIITQDGYSISKLPAPVNAVSS 242 Query: 1615 KPKANRKSKG----KDI-VEKAGKQ--------------SETQSRS---------ALSKG 1520 K +++ +D+ V GKQ +++ +RS +S G Sbjct: 243 KKVKEAQTRTSYEVRDVDVSILGKQVDALQLHSGEETEKTDSNNRSYKVDKFNTGEVSSG 302 Query: 1519 PQGEDSVAAAVVKQNGSQ---------------LKSALKSSGVKPLSRSVTWADEKKAEN 1385 P D ++ N S L+S+LKSS K ++RSVTWADE N Sbjct: 303 PCQHDVKNKSLEVLNMSDAGREYASDDAREKQSLRSSLKSSKYKKMARSVTWADE----N 358 Query: 1384 IDAGNLFNGQKTEEKSE--------SIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAV 1229 +D G G+ TE SE + ++ S RF AEACA AL QAAEAV Sbjct: 359 VDNGT---GKLTESSSEISEKGDQANRESGPTNMEEDDDSYRFESAEACAAALKQAAEAV 415 Query: 1228 ASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDC 1052 ASG D DA + AGI+ILPP EVDE E +V + + +KWPRKP + + D+F+ Sbjct: 416 ASGS-DVPDAVSTAGIIILPPPKEVDEAVLKENDEVLDIEPAPLKWPRKPGVPNYDVFES 474 Query: 1051 EDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRK 872 EDSW+D+PPEGF+L LSPF+TM+ +LF W+S+SSL++IYG DES + +L VNG EYPRK Sbjct: 475 EDSWYDSPPEGFNLNLSPFSTMFNSLFTWISSSSLSFIYGNDESFNEEYLSVNGSEYPRK 534 Query: 871 VALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFR 692 + LSDGRS+EIKQT A C++R+LPG+VADLRLP PIS LEQ L L+DTMS+VDPLP FR Sbjct: 535 IVLSDGRSTEIKQTLARCLARALPGLVADLRLPVPISVLEQGLVLLIDTMSFVDPLPAFR 594 Query: 691 TNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGR 512 QW++IVLLF+DALS+CRIP LT YM+ +R LL KVLD AQ+ EYE++KDL+IPLGR Sbjct: 595 MKQWQLIVLLFLDALSICRIPVLTPYMTGRRTLLPKVLDGAQISAAEYEILKDLIIPLGR 654 Query: 511 QPEFSAQSGG 482 P+FS QSGG Sbjct: 655 VPQFSMQSGG 664