BLASTX nr result
ID: Rehmannia26_contig00009616
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia26_contig00009616 (2326 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] 714 0.0 ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258... 711 0.0 ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni... 686 0.0 ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni... 674 0.0 ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm... 654 0.0 ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu... 651 0.0 gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus... 642 0.0 ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni... 630 e-178 ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni... 625 e-176 ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni... 623 e-175 ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni... 622 e-175 gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c... 619 e-174 gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] 611 e-172 gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise... 595 e-167 ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni... 590 e-166 gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe... 585 e-164 ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni... 574 e-161 gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro... 569 e-159 gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] 528 e-147 gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] 528 e-147 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 714 bits (1843), Expect = 0.0 Identities = 390/669 (58%), Positives = 485/669 (72%), Gaps = 25/669 (3%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 NSL +ER KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 +N +L+LF SL+S+ +GK+GDLGLS LKI+E + AG+V++E+WIGPSNAI+GYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756 +RDR+LK N+K + SK + + + +M+F STIIT+DEYSISK Sbjct: 181 QRDRNLKPKNIKNHKEGSKSSNSK----MDSGKNFVIDEMDFVSTIITKDEYSISKSSKG 236 Query: 757 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 918 T K+KEPK KAS + Q + ++K P+ N E++ SK + VI KD+ Sbjct: 237 LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292 Query: 919 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADE 1071 S E + PSQ+ S K +E A + RSVTWADE Sbjct: 293 FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE 352 Query: 1072 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEEP--YRFASAEACAMALTQAAEEVA 1242 K D D ++ + REL+ KK + D +VG++ RFASAEACA+AL+QAAE VA Sbjct: 353 KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVA 410 Query: 1243 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 1413 SG+++ +DAVSEAG+IILP P DE E+ D++E +P+ LKWP KPG Sbjct: 411 SGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470 Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593 WYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI Sbjct: 471 DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530 Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773 V+ DGRSSEIKQTLAGCL+RALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM Sbjct: 531 VLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590 Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953 KQW IVLLF+DALSV RIPALTP+M RR+L PKV + AQ+SAEE+E+MKDLIIPLGRV Sbjct: 591 KQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650 Query: 1954 PQFSTQSGG 1980 PQFS QSGG Sbjct: 651 PQFSAQSGG 659 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 711 bits (1834), Expect = 0.0 Identities = 388/669 (57%), Positives = 484/669 (72%), Gaps = 25/669 (3%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 NSL +ER KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 +N +L+LF SL+S+ +GK+GDLGLS LKI+E + AG+V++E+WIGPSNAI+GYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756 +RDR+LK N K + SK + + + +M+F TIIT+DEYSISK Sbjct: 181 QRDRNLKPKNIKNRKEGSKSSNSK----MDSGKNFVIDEMDFVRTIITEDEYSISKSSKG 236 Query: 757 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 918 T K+KEPK KAS + Q + ++K P+ N E++ SK + VI KD+ Sbjct: 237 LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292 Query: 919 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADE 1071 S E + PSQ+ S K +E A + TRSVTWADE Sbjct: 293 FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE 352 Query: 1072 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEEP--YRFASAEACAMALTQAAEEVA 1242 K D D ++ + REL+ KK + D +VG++ RFASAEACA+AL+QAAE VA Sbjct: 353 KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVA 410 Query: 1243 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 1413 SG+++ +DAVSEA +IILP P DE E+ D++E +P+ LKWP KPG Sbjct: 411 SGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470 Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593 WYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI Sbjct: 471 DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530 Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773 V+ DGRSSEIKQTLAGCLARALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM Sbjct: 531 VLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590 Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953 KQW IVLLF+DALSV +IPALTP+M+ +R+L PKV + AQ+SAEE+E+MKDLIIPLGRV Sbjct: 591 KQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650 Query: 1954 PQFSTQSGG 1980 PQFS QSGG Sbjct: 651 PQFSAQSGG 659 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 686 bits (1769), Expect = 0.0 Identities = 386/674 (57%), Positives = 469/674 (69%), Gaps = 30/674 (4%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M K E + VKDAVHKLQL LL+GIK E+QL AAGSL+SRSDYQDVVTER+IAN+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 NSL +ER KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN+VL LF GL L S ++ +NGD G S LKIQEK D G+V+LEEW+GPSNAI+GYVP Sbjct: 121 LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNN-NKGERREVGSKHRHVR-PNAADILSYDMNFTSTIITQDEYSISKTV 762 +RDR + N NKG SK++H R + +++ + +F+STIITQDEYS+SK Sbjct: 181 QRDRSVNPALLKNINKG------SKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSK-F 233 Query: 763 PA-------VKAKEPKGKASSKE-------VNRQSNPVQKPTAPLTNIQETRSKNKSKNV 900 PA VK KE + K K + +Q + +Q L + +ET +K+ Sbjct: 234 PAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQ-----LRSGEETEKSDKNTRF 288 Query: 901 ITKDDKLSLLENIAGPSQND---------STKAVKELQESTAGAXXXXXXXXXXXXATRS 1053 + K DK + E +GPSQ+D S K +RS Sbjct: 289 L-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRS 347 Query: 1054 VTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEEPYRFASAEACAMALTQA 1227 VTWADE DG G+ ++ + + A S S D E ++ YRF SAEACA AL+QA Sbjct: 348 VTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQA 407 Query: 1228 AEEVASGKSEASDAVSEAGVIILPPPHGEDE---EENGDVMETDPLQLKWPPKPGXXXXX 1398 AE VASG S+ DAVS+AG++ILPP DE +E ++++ + LKWP KPG Sbjct: 408 AEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYD 466 Query: 1399 XXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGRE 1578 WYDSPPEGFN+TLSPF TMF +LF+W+SSSSLA+IYG +ES +EEYLS+NGRE Sbjct: 467 VFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGRE 526 Query: 1579 YPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPL 1758 YP+KIV+ DGRS+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM LL+TMSF+DPL Sbjct: 527 YPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPL 586 Query: 1759 PAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLII 1938 PAFRMKQW IVLLFLDALSV RIP LTPYM RR PKV++GAQISA E+EIMKDLII Sbjct: 587 PAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLII 646 Query: 1939 PLGRVPQFSTQSGG 1980 PLGRVPQFS QSGG Sbjct: 647 PLGRVPQFSMQSGG 660 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 674 bits (1740), Expect = 0.0 Identities = 381/669 (56%), Positives = 466/669 (69%), Gaps = 25/669 (3%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M K E + VKDAVHKLQL LL+GIK ENQL AAGSL+SRSDYQDVVTER+IAN+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 NSL +ER KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAG-QVALEEWIGPSNAIDGYV 585 LN+VL LF GL L S ++ +NGDLG S LKIQEK D G +V+LEEW+GPSNAI+GYV Sbjct: 121 LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180 Query: 586 PRRDRDLKHPQSNN-NKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK-- 756 P+RDR + N NKG + +KH ++ IL+ + +F+STIITQDEYS+SK Sbjct: 181 PQRDRSVNPALLKNINKGFK----NKHARLQDEKNMILN-EFDFSSTIITQDEYSVSKFP 235 Query: 757 ----TVPAVKAKEPKGKASSKEVNRQSNPVQK--PTAPLTNIQETRSKNKSKNVITKDDK 918 V + K KE + K K + + + K L + +ET +K+ + K DK Sbjct: 236 APVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFL-KVDK 294 Query: 919 LSLLENIAGPSQND-STKAVKELQ----------ESTAGAXXXXXXXXXXXXATRSVTWA 1065 + E +GPSQ+D K+V + E ++SVTWA Sbjct: 295 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354 Query: 1066 DEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEV 1239 DE DG G+ ++ + + A S S D E ++ YRF SAEACA AL+QAAE V Sbjct: 355 DEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAV 414 Query: 1240 ASGKSEASDAVSEAGVIILPPPHGEDEE--ENGDVMETDPLQLKWPPKPGXXXXXXXXXX 1413 ASG S+ DAVS+AG++ILP DE + ++++ +P LKWP KPG Sbjct: 415 ASG-SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPAPLKWPRKPGMPNYDVFESE 473 Query: 1414 XXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 1593 WYD PPEGFN+TLSPF+TMF +LF+W+SSSSLA+IYG +E+ +EEYLS+NGREYP KI Sbjct: 474 DCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPHKI 533 Query: 1594 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 1773 V+ DG S+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM LL+TMSF+DPLPAFRM Sbjct: 534 VLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRM 593 Query: 1774 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 1953 KQW IVLLFLDALSV RIP LTPYM RR LPKV++GAQIS E+EIMKDLIIPLGRV Sbjct: 594 KQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLGRV 653 Query: 1954 PQFSTQSGG 1980 PQFS QSGG Sbjct: 654 PQFSMQSGG 662 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 654 bits (1686), Expect = 0.0 Identities = 357/655 (54%), Positives = 464/655 (70%), Gaps = 18/655 (2%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M K+E ++VKD V+KLQLSLL+GI++E+QL AAGSL+SRSDY+DVV ER+I+N+CGYPLC Sbjct: 1 MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 NSL ++RP+KGRYRISLKEH+VYDLQETYMYCSSSCL+NSRAF+ SLQE+R S LNP K Sbjct: 61 NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LNE+L+ F+ L+LDS+ +G++GDLGLS LKIQEK++T G+V+LEEWIGPSNAI+GYVP Sbjct: 121 LNEILRKFNDLTLDSE-GLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 756 + DRD +P N+K + + K + D D +FTSTIIT DEYSISK Sbjct: 180 QGDRD-PNPSLKNHKEGLKAICKKP----VSKQDCFFSDTDFTSTIITNDEYSISKGPSG 234 Query: 757 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSL 927 T +K + GK + +N Q + ++K + + +SK + K + K+ Sbjct: 235 LTSTASDIKLQAQTGKGH-EGLNAQLSSLRKQDSIKAS---RKSKGRRKEKVIKEQ---- 286 Query: 928 LENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXAT------RSVTWADEKTDGDG 1089 L PS + T +++ ++T A ++ RSVTWADE+ D G Sbjct: 287 LNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAG 346 Query: 1090 -QNLNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASD 1266 +NL E +E++ + S SA++ RF SAEACA+AL+QAAE VASG ++ + Sbjct: 347 SRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNK 406 Query: 1267 AVSEAGVIILPPPH----GEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSP 1434 A+SEAG+I+LPP G + E+N D++E + LKWP KPG WYD+P Sbjct: 407 AMSEAGIIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAP 465 Query: 1435 PEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRS 1614 PEGF+LTLSPF+TM+MALF+WV+SSSLAYIYG++ES HE+YLSVNGREYP+KIV+ DGRS Sbjct: 466 PEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRS 525 Query: 1615 SEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIV 1794 SEI+ T CLAR PGLVA LRLPIPVSTLEQG GRLL+TMSF+D LPAFR KQW I Sbjct: 526 SEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIA 585 Query: 1795 LLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQ 1959 LLF++ALSV RIPALT YM RR++L +V++GA ISAEE++IMKD ++PLGR PQ Sbjct: 586 LLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 651 bits (1680), Expect = 0.0 Identities = 371/709 (52%), Positives = 469/709 (66%), Gaps = 66/709 (9%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ VKD ++KLQLSLLDGI++E+QL AAGS++S SDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 GNSL ++RP KGRYRISLKEHKVYDL ETYMYCSSSC+INSR F+ SLQEER LNPAK Sbjct: 61 GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LNEVL LFD SL S+ ++GKNGDLG S LKI+EKT+ V G+V+ E+WIGPSNAI+GYVP Sbjct: 121 LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180 Query: 589 RRDR--------DL---------------------------KHPQSNNNKGERR-EVGSK 660 +RDR D+ K Q KG + GSK Sbjct: 181 QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240 Query: 661 HRHVRPNAA-DILSYDMNFTSTII-TQDEYSISK-------TVPAVKAKEPKGKASSKEV 813 + + ++ + DMNFTSTII TQDEYSISK T K ++ K K S K Sbjct: 241 AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300 Query: 814 NRQSNPVQKPTAPLTN--IQETRSKNKSKNVITKDDKLSLLENIAGPS---------QND 960 QS+ +K + T+ ++E RSK K+ ++ D S ++ S ++ Sbjct: 301 ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360 Query: 961 STKAVKELQES------TAGAXXXXXXXXXXXXATRSVTWADEKTDGDG-QNLNECRELK 1119 S KA K ++ S T+GA TRSVTWADEK G ++L E R ++ Sbjct: 361 SEKAAKPVESSLKPSLKTSGAKQL----------TRSVTWADEKVGSSGSRDLCEVRGME 410 Query: 1120 DKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILP 1299 D K + D+ +F SAEACA AL+QAAE VASG ++AS+A+SEAG++ILP Sbjct: 411 DTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILP 470 Query: 1300 PPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFS 1470 PH D+ E+ DV++ + +KWP KPG WYD+PPEGF+L LS F+ Sbjct: 471 QPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFA 530 Query: 1471 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 1650 T++MALF+WV+SSSLAY+YGK+ES HEEYL VNGREYP+KIV+ DGRS EI+QT+ GCL Sbjct: 531 TIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLG 590 Query: 1651 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 1830 RA P +VA+LRLPIP+STLEQG LL TMSF+D +PAFRMKQW I LLF++ALSV RI Sbjct: 591 RAFPVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRI 650 Query: 1831 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 PAL YM +RR+ V++G ++SAEE+E+MKDL+IPLGR PQFS QSG Sbjct: 651 PALISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695 >gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] Length = 706 Score = 642 bits (1656), Expect = 0.0 Identities = 368/713 (51%), Positives = 464/713 (65%), Gaps = 70/713 (9%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ ++VKDAV KLQ+ LL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC Sbjct: 1 MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L +ERP KG+YRISLKEHKVYDLQETYM+CSS+C+++S+AF+ LQ ER S L+P K Sbjct: 61 CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN VL LF+ L+L+ N+ K+GDLGLS LKIQEKT T +G+V LE+W+GPSNAI+GYVP Sbjct: 121 LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREV--GSKHRHVRPNA-ADILSYDMNFTSTIITQDEYSISKT 759 + P+ +KG R+ V GSK H + N D+++ +MNF STII QDEYS+SK Sbjct: 181 K-------PRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKA 233 Query: 760 VPA-----------------------------------------------VKAKEPKGKA 798 P + A E KGK Sbjct: 234 SPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASE-KGKE 292 Query: 799 SSK--EVNRQSNP---VQKPTAPLTNIQETR---SKNKS--KNVITKDDKLSLLENIAGP 948 SK EV +S P ++K A +I E KN S K+V K + + N Sbjct: 293 VSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDAS 352 Query: 949 SQNDSTKAVKE-LQESTAGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNEC 1107 + N VKE Q G A +R+VTWADEK +G G ++L E Sbjct: 353 TSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEV 412 Query: 1108 RELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287 +E D + + D E+ R ASAEACA+AL+QA+E VASG S+A+DAVSEAG+ Sbjct: 413 KEFGDIIKESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGI 472 Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458 IILP PH EE E+ D+++ D + LKWP KPG W+D+PPEGF+LTL Sbjct: 473 IILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTL 532 Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638 SPF+ M+ A+FSW++S SLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQT A Sbjct: 533 SPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFA 592 Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818 GCLARA P LVA LRLPIP+STLEQGM LL+TMSF+D LPAFR KQW + LLF+DALS Sbjct: 593 GCLARAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALS 652 Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 V RIP+L YM DRR L KV+ G+QI EE+EI+KDL++PLGR P S QSG Sbjct: 653 VCRIPSLISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSG 705 >ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Glycine max] Length = 706 Score = 630 bits (1626), Expect = e-178 Identities = 354/713 (49%), Positives = 463/713 (64%), Gaps = 70/713 (9%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN VL LF+ L+L+ + KNGDLGLS LKIQEKT+ +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759 + P++ ++KG R+ V GSK H + + ++++ +M F STII QDEYS+SK Sbjct: 181 K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233 Query: 760 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 882 P K+P+ K ++ V + + +Q K + L+ ++ Sbjct: 234 PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292 Query: 883 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 996 KS + K +S+ E QNDS + +++ T Sbjct: 293 TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352 Query: 997 ----------------AGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNECR 1110 AG A +R+VTWADEK + G ++L E + Sbjct: 353 SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412 Query: 1111 ELKD-KKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287 E D KK + ++ D E+ R ASAEACA+AL+ A+E VASG S+ SDAVSEAG+ Sbjct: 413 EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGI 472 Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458 ILPPPH EE E+ D+++ D + LKWP K G W+D+PPEGF+LTL Sbjct: 473 TILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTL 532 Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638 SPF+TM+ LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQTLA Sbjct: 533 SPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLA 592 Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818 CLARALP LVA LRLPIPVS +EQGM LL+TMSF+D LPAFR KQW + LLF+DALS Sbjct: 593 SCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALS 652 Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 V R+PAL YM DRR +V+ G+QI EE+E++KDL++PLGR P S+QSG Sbjct: 653 VCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705 >ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Glycine max] Length = 706 Score = 625 bits (1611), Expect = e-176 Identities = 352/717 (49%), Positives = 454/717 (63%), Gaps = 74/717 (10%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC Sbjct: 1 MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L ++RP KGRYRISLKEHKVYDL ETYM+C S+C+++S+AFA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN +L LF+ L+L+ N+ KN D GLS LKIQEKT+T +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759 + P+ +++KG R+ V GSK H +P + +++S +M F STII QD YS+SK Sbjct: 181 K-------PRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKV 233 Query: 760 VPAVKAK------------EPKGKASSKEVNRQSNPVQKPTAP---------------LT 858 +P + + GK +K V + +Q ++ L Sbjct: 234 LPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELA 293 Query: 859 NIQETRSKNKSKNVITKDD--KLSLLENIAGPSQNDSTK--------------------- 969 E K+ I K D +S+ E QNDS K Sbjct: 294 QSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTS 353 Query: 970 ------AVKELQESTAGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-------QN 1095 ++ Q AG A +R+VTWAD+K + G +N Sbjct: 354 NLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKN 413 Query: 1096 LNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVS 1275 + R D G +S D E+ R ASAEAC +AL+ A+E VASG S+ SDAVS Sbjct: 414 FGDIRNESDSAG-----NSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVS 468 Query: 1276 EAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGF 1446 EAG+IILPPPH EE E+ D+++ D + +KWP KPG W+D+ PEGF Sbjct: 469 EAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGF 528 Query: 1447 NLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIK 1626 +LTLSPF+TM+ LFSW++SSSLAYIYG++ESF EEYLSVNGREYP K+V+ DGRSSEIK Sbjct: 529 SLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIK 588 Query: 1627 QTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFL 1806 QTLA CLARALP LVA LRLPIPVST+EQGM LL+TMSF+D LPAFR KQW + LLF+ Sbjct: 589 QTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFI 648 Query: 1807 DALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 DALSV R+PAL YM DRR +V+ G+QI EE+E++KDL +PLGR P S QSG Sbjct: 649 DALSVCRLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 623 bits (1607), Expect = e-175 Identities = 348/674 (51%), Positives = 451/674 (66%), Gaps = 31/674 (4%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ ++VKDAV KLQL+LL+GI+ E+QL AAGSLISRSDY+DVVTER+I VC YPLC Sbjct: 1 MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L +ERP KGRYRISLKEHKVYDL ETYM+CSSSC++NS+AFA SL+++R L+P K Sbjct: 61 CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN +L+LF +L+ N GK+G+LGLS L+IQ+KT+TV +V+LE+W+GPSNAI+GYVP Sbjct: 121 LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179 Query: 589 R-RDRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTIITQDEYSISKTV 762 + RD K Q N KG SK H + N ++++ + +F STII QDEYS+SK Sbjct: 180 KKRDNGSKGSQKNTKKG------SKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVS 233 Query: 763 -------------PAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSK--- 894 P ++PK E+ R+ + +Q ++ + + K K Sbjct: 234 SGQTDATVDHQIKPTAILEQPK--RVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIA 291 Query: 895 ----NVITKDDKLSLLENIAGPSQNDSTKAVKELQ-ESTAGAXXXXXXXXXXXXAT---- 1047 NV+ + + S D + +++Q E G+ Sbjct: 292 KSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLG 351 Query: 1048 RSVTWADEKTDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQ 1224 RSVTWAD+K DG G +L +E + K + + D E+ R SAEACA+AL+Q Sbjct: 352 RSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQ 411 Query: 1225 AAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXX 1395 AAE VASG S+A DAVSEAG+IILP EE ++ D++ETD + LKWP KPG Sbjct: 412 AAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDF 471 Query: 1396 XXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 1575 W+D+PPEGF+LTLSPF+T++ A FSW++SSSLAYIYG++ SF+EE+LSV+GR Sbjct: 472 DLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGR 531 Query: 1576 EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 1755 EYP KIV+ DGRSSEIKQTLA CLARALP +VAEL+LP+PVSTLEQGM LLDTMSF+DP Sbjct: 532 EYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDP 591 Query: 1756 LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 1935 LP FR KQW + LLF+DALSV RIPAL YM DRR L KV+ G+QI EE+ ++KDLI Sbjct: 592 LPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLI 651 Query: 1936 IPLGRVPQFSTQSG 1977 +PLGR P FS+QSG Sbjct: 652 VPLGRAPHFSSQSG 665 >ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Glycine max] Length = 716 Score = 622 bits (1605), Expect = e-175 Identities = 354/723 (48%), Positives = 463/723 (64%), Gaps = 80/723 (11%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LN VL LF+ L+L+ + KNGDLGLS LKIQEKT+ +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 759 + P++ ++KG R+ V GSK H + + ++++ +M F STII QDEYS+SK Sbjct: 181 K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233 Query: 760 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 882 P K+P+ K ++ V + + +Q K + L+ ++ Sbjct: 234 PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292 Query: 883 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 996 KS + K +S+ E QNDS + +++ T Sbjct: 293 TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352 Query: 997 ----------------AGAXXXXXXXXXXXXA-----TRSVTWADEKTDGDG-QNLNECR 1110 AG A +R+VTWADEK + G ++L E + Sbjct: 353 SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412 Query: 1111 ELKD-KKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAV----- 1272 E D KK + ++ D E+ R ASAEACA+AL+ A+E VASG S+ SDAV Sbjct: 413 EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMN 472 Query: 1273 -----SEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYD 1428 SEAG+ ILPPPH EE E+ D+++ D + LKWP K G W+D Sbjct: 473 ETCAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFD 532 Query: 1429 SPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDG 1608 +PPEGF+LTLSPF+TM+ LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DG Sbjct: 533 APPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADG 592 Query: 1609 RSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHA 1788 RSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM LL+TMSF+D LPAFR KQW Sbjct: 593 RSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQV 652 Query: 1789 IVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFST 1968 + LLF+DALSV R+PAL YM DRR +V+ G+QI EE+E++KDL++PLGR P S+ Sbjct: 653 VALLFIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISS 712 Query: 1969 QSG 1977 QSG Sbjct: 713 QSG 715 >gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 619 bits (1597), Expect = e-174 Identities = 354/700 (50%), Positives = 454/700 (64%), Gaps = 51/700 (7%) Frame = +1 Query: 31 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 211 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 391 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 571 IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735 I+GYVP+R+ K NNK + ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 736 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 874 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 988 ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497 +GD++E + +KWP KPG W+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMD 1857 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW IVLLF+DALSV RIPALTP+M + Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTN 698 Query: 1858 RRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 R+LL KV++GAQIS EE+E+MKDLIIPLGR P FS QSG Sbjct: 699 GRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 611 bits (1575), Expect = e-172 Identities = 347/703 (49%), Positives = 453/703 (64%), Gaps = 66/703 (9%) Frame = +1 Query: 67 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 246 ++VKD V++LQLSLL G+ E+QL AAGS++SRSDY DVVTER+IAN+CGYPLC N L + Sbjct: 9 ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68 Query: 247 ERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEVLK 426 +RP KGRYRISLKEHKVYDL ETYMYCSS C+INSR FAASL++ER + L+ A+++ VL+ Sbjct: 69 DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128 Query: 427 LFDGLS-LDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVPRRDRD 603 +F+ S L+ ++ GK+ DLG S LKI+EKT+ G V+LE+W GPSNAI+GYV +R+R Sbjct: 129 MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188 Query: 604 LKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPAVK--- 774 P+ +K +R GSK + +L DM+F STIIT+DEY++SKT ++K Sbjct: 189 ---PKELGSKSPKR--GSKANNT------VLINDMDFVSTIITEDEYTVSKTPSSLKKTG 237 Query: 775 ----AKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSLLENIA 942 +E + + K + + ++ AP +N+ +R ++V + S L + Sbjct: 238 LDSKVREQEEILAKKAMGNEFAVLETSYAPASNV--SRVGLVFEDVTSSLRAGSCLSSAR 295 Query: 943 GPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXATRSVTWADEKTDGDG----------- 1089 ++ KA K T + +R+VTWADEKTD G Sbjct: 296 AEEESHDDKAEK----CTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIE 351 Query: 1090 ---------QNLN--------------------------------ECRELKDKKGAVVTS 1146 +N N E RE++D K A Sbjct: 352 DMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADML 411 Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDE-- 1320 +AD ++ +RFASAEACA AL +A+E VAS + E +DA+SEAG+IILP P DE Sbjct: 412 CNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGE 471 Query: 1321 --EENGDVMETDPLQ--LKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMAL 1488 EE+ D ++P Q +KWP KPG W+D+PPE F+LTLSPF+ M+ AL Sbjct: 472 PMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNAL 531 Query: 1489 FSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGL 1668 F+W +SS+LAYIYG++ES HEEY VNGREYP+KIV DGRSSEIKQTLAG LARALPGL Sbjct: 532 FTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGL 591 Query: 1669 VAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPY 1848 VA+LRL P+S+LEQGMGRLLDTMSF+D LP FRMKQW I+LLFL+ALSV R+PALTP+ Sbjct: 592 VADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPH 651 Query: 1849 MMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 MM RR+L KV++ AQISAEE+E+MKDL+IPLGR P FS QSG Sbjct: 652 MMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694 >gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea] Length = 597 Score = 595 bits (1534), Expect = e-167 Identities = 338/639 (52%), Positives = 427/639 (66%), Gaps = 6/639 (0%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M KDE+LT+K+AV++LQ SLL+G K+ENQL+AAGSL+SR DYQD+VTER IA +CGYPLC Sbjct: 1 MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 N+L++ERP KGRYRISLKEHKVYD+QETY +CSS CLINSRAF+ L +ER+S L+P K Sbjct: 61 SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 LNEVLK FDG +S NMG+N DLGLS L+I EK + AG+V+ EWIGPS+AIDGYVP Sbjct: 121 LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPA 768 RRDR+ S KGE S++ I DM+FTS II Q+EYSI+KT Sbjct: 181 RRDRNSNTLSSKQKKGE-----SRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTP 235 Query: 769 VKAKEPKGKASSKEVNRQS-NPVQKPTAPLTNIQETRSKNKSK-NVITK-DDKLSLLENI 939 +K+ G+++ K + + P Q P + + NI+ + +N SK N K D KLS E+ Sbjct: 236 SSSKQ-SGESNEKVIPEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDK 294 Query: 940 AGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXX---ATRSVTWADEKTDGDGQNLNECR 1110 A S+N + + +S GA TR+V+WAD K + DGQNL Sbjct: 295 A--SENGGEPKLADGDKSAQGAAVLKSSLKTSYSKETTTRTVSWADVKAE-DGQNLETVC 351 Query: 1111 ELKDKKGAVVTSHSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVI 1290 E+ D G ++ ++ S E+ A T+A+++ A GK +D Sbjct: 352 EMNDPHGGGISRETS------------SVESHKTASTKASKD-APGKFLLTDF------- 391 Query: 1291 ILPPPHGEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFS 1470 G++ T+ + LKWPPKPG YD PP+GFNL+LSPF Sbjct: 392 -----------NEGEIF-TEAI-LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPFC 438 Query: 1471 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 1650 T+F +LFSW+SSSSLAYIYGK++SFHEEY++ NGREYP K+V DGRSSEIKQTL+ LA Sbjct: 439 TLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAALA 498 Query: 1651 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 1830 RALPG+V+ELRLP P+S LEQGMGRLLDTMSFIDPLP+ R KQW AIVLLFL+ALSVSRI Sbjct: 499 RALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLFLNALSVSRI 558 Query: 1831 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLG 1947 PAL+ Y+ DRR + KV+EGA I EEFE+MKDLIIPLG Sbjct: 559 PALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 590 bits (1522), Expect = e-166 Identities = 341/673 (50%), Positives = 442/673 (65%), Gaps = 31/673 (4%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 ++L ++ +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+ LQ+ER S +NP K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 L E+LKLF+ +SLDS NMG N D SGL+IQEK ++ G+V +EEW+GPSNAI+GYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 762 RD + S + G+ + GSK + ++P D S D + TSTIIT +EYS+SK Sbjct: 178 HRDHKVMTLHSKD--GKESKDGSKAK-IKPLGGGKDFFS-DFSITSTIITDEEYSVSKIS 233 Query: 763 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 906 +K +K G+ KE N Q ++ P AP + SK ++K T Sbjct: 234 SGLKEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293 Query: 907 KDDKLSLLENIAGPSQNDSTKAVKELQESTAG-------AXXXXXXXXXXXXATRSVTWA 1065 K+ +L + S+N ST +E G RSVTWA Sbjct: 294 KESTDNL-SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWA 352 Query: 1066 DEKTDGDG-QNLNECREL-KDKKGAVVTSHSAD-EEVGEEPYRFASAEACAMALTQAAEE 1236 DEKTD NL E E+ K K+ + TS+ + + E+ R SAEACAMAL+QAAE Sbjct: 353 DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412 Query: 1237 VASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWPP-------KPGXXXX 1395 + SG+SE SDAVSEAG+IILP P +EE + TDP+ P K G Sbjct: 413 ITSGQSEVSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRS 467 Query: 1396 XXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 1575 WYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+ Sbjct: 468 DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527 Query: 1576 EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 1755 EYP KIV DGRSSEIKQTLAGCL RA+PGL +EL L P+S LE GM LLDTM+F+D Sbjct: 528 EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587 Query: 1756 LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 1935 LPAFRMKQW IVLLF++ALSVSRIP+L +M R L KV++ AQI ++E+EIM+D I Sbjct: 588 LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647 Query: 1936 IPLGRVPQFSTQS 1974 +PLGR Q S ++ Sbjct: 648 LPLGRTAQLSDEN 660 >gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 585 bits (1508), Expect = e-164 Identities = 349/713 (48%), Positives = 448/713 (62%), Gaps = 76/713 (10%) Frame = +1 Query: 67 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 246 ++VKD V+KLQL+LL+GIK ++ L AGS+ISRSDY DVVTERTIAN+CGYPLC N+L + Sbjct: 13 ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72 Query: 247 E--RPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEV 420 + RP KG YRISLKEHKVYDL ETYMYCSS C+I S+AFA SL EER L+ K+ + Sbjct: 73 DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132 Query: 421 LKLFDGLSLD-SDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEW--------------- 552 L+ F + D +V G+ GDLG+S LKI+EK +T G + + Sbjct: 133 LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192 Query: 553 IGPSNAIDGYVPRRDRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTII 729 +GPSNAI+GYVP+++R K S NK GSK + + ++ DI+ +M+F STII Sbjct: 193 VGPSNAIEGYVPQKERISKPLGSKKNK-----EGSKGKDAKMSSGMDIIFNEMDFMSTII 247 Query: 730 TQDEYSISKTVPAVKAK--EPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVI 903 T DEYS+SK P+V E K K S +V N +++++R KN Sbjct: 248 TSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKN---------DSVKKSRQSKGGKNKN 298 Query: 904 TKDDKLSLLE--NIAGPSQ---NDSTKAVKE------LQESTAGAXXXXXXXXXXXXATR 1050 K D + + E + + SQ N STK KE ++S R Sbjct: 299 VKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNR 358 Query: 1051 SVTWADEKTDGDG-QNLNECRELK---DKKGAVVTSH--SADEEVG-------------- 1170 SVTWADE D G +NL E RE++ + A + H S + +VG Sbjct: 359 SVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTK 418 Query: 1171 ---------------------EEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1287 +E SAEACAMAL QAAE VASG+S+ S AVS AG+ Sbjct: 419 SKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGI 478 Query: 1288 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTL 1458 IILP P G DEE E+ D++E++ L WP KPG W+D+PPEGF++TL Sbjct: 479 IILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTL 537 Query: 1459 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 1638 SPF+TM+ +LF+W++SS+LAYIYG++ESFHEE+LSVNGREYP KIV+ GRSSEIK+TL Sbjct: 538 SPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLD 597 Query: 1639 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 1818 ARALPG+V+ELRLP P+S+LEQGMGR+L+TMSFID +PAFRMKQW IVLLFL+ LS Sbjct: 598 ESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLS 657 Query: 1819 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 1977 V RIPALTP+M +RR+L KV+E QISAE++E+MKDLIIPLGR PQFS QSG Sbjct: 658 VCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 574 bits (1479), Expect = e-161 Identities = 333/666 (50%), Positives = 434/666 (65%), Gaps = 24/666 (3%) Frame = +1 Query: 49 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 228 M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 229 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 408 ++L ++ +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+ LQ+ER S +NP K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 409 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 588 L E+LKLF+ +SLDS NMG N D SGL+IQEK ++ G+V +EEW+GPSNAI+GYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 589 RRDRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 762 RD + S + G+ + GSK + ++P D S D +FTSTIIT +EYS+SK Sbjct: 178 HRDHKVMTLHSKD--GKESKDGSKAK-IKPLGGGKDFFS-DFSFTSTIITDEEYSVSKIS 233 Query: 763 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 906 +K +K G+ K+ N Q ++ P AP + SK ++K T Sbjct: 234 SGLKEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293 Query: 907 KDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXXATRSVTWADEKTDGD 1086 K + L + S N ST +E DEKTD Sbjct: 294 K-ESTDNLSDAPSTSNNRSTNFNLMTEEP-----------------------RDEKTDDA 329 Query: 1087 G-QNLNECREL-KDKKGAVVTSHSAD-EEVGEEPYRFASAEACAMALTQAAEEVASGKSE 1257 NL E E+ K K+ + TS+ + + E+ R SAEACAMAL+QAA+ + SG+SE Sbjct: 330 SIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSE 389 Query: 1258 ASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWP-------PKPGXXXXXXXXXXX 1416 SDAVSEAG+IILP P +EE + TDP+ P K G Sbjct: 390 VSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRSDLFDPSD 444 Query: 1417 XWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIV 1596 WYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+EYP KIV Sbjct: 445 SWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIV 504 Query: 1597 MPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMK 1776 DGRSSEIKQTLAGCL RA+PGL +EL L P+S LE GM LLDTM+F+D LPAFRMK Sbjct: 505 SADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMK 564 Query: 1777 QWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVP 1956 QW IVLLF++ALSVSRIP+L +M R L KV++ AQI ++E+EIM+D I+PLGR Sbjct: 565 QWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTA 624 Query: 1957 QFSTQS 1974 Q S ++ Sbjct: 625 QLSDEN 630 >gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 569 bits (1467), Expect = e-159 Identities = 329/672 (48%), Positives = 426/672 (63%), Gaps = 48/672 (7%) Frame = +1 Query: 31 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 211 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 391 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 571 IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735 I+GYVP+R+ K NNK + ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 736 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 874 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 988 ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E E+ E Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEVDK--------EEPME 510 Query: 1327 NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSWVSS 1506 +GD++E + +KWP KPG W+D+PPEGF+LTLS F+TM+ ALF W++S Sbjct: 511 DGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITS 570 Query: 1507 SSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRL 1686 SSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRL Sbjct: 571 SSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRL 630 Query: 1687 PIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRI 1866 PIP+STLEQGMG L+DT+SF++ LPAFRMKQW IVLLF+DALSV RIPALTP+M + R+ Sbjct: 631 PIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRM 690 Query: 1867 LLPKVIEGAQIS 1902 LL KV++GAQIS Sbjct: 691 LLHKVLDGAQIS 702 >gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 528 bits (1361), Expect = e-147 Identities = 307/635 (48%), Positives = 399/635 (62%), Gaps = 51/635 (8%) Frame = +1 Query: 31 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 211 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 391 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 571 IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735 I+GYVP+R+ K NNK + ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 736 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 874 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 988 ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497 +GD++E + +KWP KPG W+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 1782 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673 >gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 528 bits (1361), Expect = e-147 Identities = 307/635 (48%), Positives = 399/635 (62%), Gaps = 51/635 (8%) Frame = +1 Query: 31 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 210 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 211 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 390 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 391 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 570 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 571 IDGYVPRRDRDLKHPQSNNNKGE-----RREVGSKHRHVRPNAADILSYDMNFTSTIITQ 735 I+GYVP+R+ K NNK + ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 736 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 873 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 874 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 987 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 988 ESTAGAXXXXXXXXXXXXA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1146 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1147 HSADEEVGEEPYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 1326 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 1327 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXXWYDSPPEGFNLTLSPFSTMFMALFSW 1497 +GD++E + +KWP KPG W+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 1498 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 1677 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 1678 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 1782 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673