BLASTX nr result
ID: Rehmannia25_contig00006343
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00006343
         (2511 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]    711   0.0
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...    708   0.0
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...    684   0.0
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...    673   0.0
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...    650   0.0
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...    648   0.0
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...    642   0.0
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...    631   e-178
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...    625   e-176
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...    623   e-175
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...    622   e-175
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...    619   e-174
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]      610   e-172
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...    592   e-166
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...    589   e-165
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...    585   e-164
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...    573   e-160
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...
568 e-159 gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] 528 e-147 gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] 528 e-147 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 711 bits (1836), Expect = 0.0 Identities = 390/669 (58%), Positives = 486/669 (72%), Gaps = 25/669 (3%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 NSL +ER KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 +N +L+LF SL+S+ +GK+GDLGLS LKI+E + AG+V++E+WIGPSNAI+GYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533 +R R+LK N+K + SK + + + +M+F STIIT+DEYSISK Sbjct: 181 QRDRNLKPKNIKNHKEGSKSSNSK----MDSGKNFVIDEMDFVSTIITKDEYSISKSSKG 236 Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 1371 T K+KEPK KAS + Q + ++K P+ N E++ SK + VI KD+ Sbjct: 237 LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 292 Query: 1370 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADE 1218 S E + PSQ+ S K +E A + RSVTWADE Sbjct: 293 FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE 352 Query: 1217 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE--SYRFASAEACAMALTQAAEEVA 1047 K D D ++ + REL+ KK + D +VG++ + RFASAEACA+AL+QAAE VA Sbjct: 353 KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVA 410 Query: 1046 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 876 SG+++ +DAVSEAG+IILP P DE E+ D++E +P+ LKWP KPG Sbjct: 411 SGETDMTDAVSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470 Query: 875 
XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696 SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI Sbjct: 471 DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530 Query: 695 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516 V+ DGRSSEIKQTLAGCL+RALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM Sbjct: 531 VLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590 Query: 515 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336 KQW IVLLF+DALSV RIPALTP+M RR+L PKV + AQ+SAEE+E+MKDLIIPLGRV Sbjct: 591 KQWQVIVLLFIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650 Query: 335 PQFSTQSGG 309 PQFS QSGG Sbjct: 651 PQFSAQSGG 659 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 708 bits (1827), Expect = 0.0 Identities = 388/669 (57%), Positives = 485/669 (72%), Gaps = 25/669 (3%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M D+ + VKDAVHKLQL LL+GI++ENQL AAGSL+SRSDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 NSL +ER KG YRISLKEHKVYDL ETYMYCSS C++NSR+FA SLQEER S LN + Sbjct: 61 SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 +N +L+LF SL+S+ +GK+GDLGLS LKI+E + AG+V++E+WIGPSNAI+GYVP Sbjct: 121 INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533 +R R+LK N K + SK + + + +M+F TIIT+DEYSISK Sbjct: 181 QRDRNLKPKNIKNRKEGSKSSNSK----MDSGKNFVIDEMDFVRTIITEDEYSISKSSKG 236 Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETR---SKNKSKNVITKDDK 1371 T K+KEPK KAS + Q + ++K P+ N E++ SK + VI KD+ Sbjct: 237 LKDTTSHAKSKEPKEKAS---IGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDE- 
292 Query: 1370 LSLLENIAGPSQNDS----TKAVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADE 1218 S E + PSQ+ S K +E A + TRSVTWADE Sbjct: 293 FSTAEVPSVPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE 352 Query: 1217 KTDG-DGQNLNECRELKDKKGAVVTSHSADEEVGEE--SYRFASAEACAMALTQAAEEVA 1047 K D D ++ + REL+ KK + D +VG++ + RFASAEACA+AL+QAAE VA Sbjct: 353 KMDSADSRDFCKVRELEVKKED--PNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVA 410 Query: 1046 SGKSEASDAVSEAGVIILPPPHGEDEEEN---GDVMETDPLQLKWPPKPGXXXXXXXXXX 876 SG+++ +DAVSEA +IILP P DE E+ D++E +P+ LKWP KPG Sbjct: 411 SGETDMTDAVSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSD 470 Query: 875 XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696 SWYD+PPEGF+LTLSPF+TM+MALF+W++SSS+AYIYG++ESFHEEYLSVNGREYP+KI Sbjct: 471 DSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKI 530 Query: 695 VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516 V+ DGRSSEIKQTLAGCLARALPGLVA+LRLPIPVS LEQG+GRLLDTMSF+D LP+FRM Sbjct: 531 VLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRM 590 Query: 515 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336 KQW IVLLF+DALSV +IPALTP+M+ +R+L PKV + AQ+SAEE+E+MKDLIIPLGRV Sbjct: 591 KQWQVIVLLFIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRV 650 Query: 335 PQFSTQSGG 309 PQFS QSGG Sbjct: 651 PQFSAQSGG 659 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 684 bits (1765), Expect = 0.0 Identities = 388/674 (57%), Positives = 471/674 (69%), Gaps = 30/674 (4%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M K E + VKDAVHKLQL LL+GIK E+QL AAGSL+SRSDYQDVVTER+IAN+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 NSL +ER KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 1880 
LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN+VL LF GL L S ++ +NGD G S LKIQEK D G+V+LEEW+GPSNAI+GYVP Sbjct: 121 LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNN-NKGERREVGSKHRHVR-PNAADILSYDMNFTSTIITQDEYSISKTV 1527 +R R + N NKG SK++H R + +++ + +F+STIITQDEYS+SK Sbjct: 181 QRDRSVNPALLKNINKG------SKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSK-F 233 Query: 1526 PA-------VKAKEPKGKASSKE-------VNRQSNPVQKPTAPLTNIQETRSKNKSKNV 1389 PA VK KE + K K + +Q + +Q L + +ET +K+ Sbjct: 234 PAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDALQ-----LRSGEETEKSDKNTRF 288 Query: 1388 ITKDDKLSLLENIAGPSQND---------STKAVKELQESTAGAXXXXXXXXXXXKATRS 1236 + K DK + E +GPSQ+D S K K +RS Sbjct: 289 L-KVDKFNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRS 347 Query: 1235 VTWADEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEESYRFASAEACAMALTQA 1062 VTWADE DG G+ ++ + + A S S D E ++SYRF SAEACA AL+QA Sbjct: 348 VTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQA 407 Query: 1061 AEEVASGKSEASDAVSEAGVIILPPPHGEDE---EENGDVMETDPLQLKWPPKPGXXXXX 891 AE VASG S+ DAVS+AG++ILPP DE +E ++++ + LKWP KPG Sbjct: 408 AEAVASG-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYD 466 Query: 890 XXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGRE 711 SWYDSPPEGFN+TLSPF TMF +LF+W+SSSSLA+IYG +ES +EEYLS+NGRE Sbjct: 467 VFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGRE 526 Query: 710 YPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPL 531 YP+KIV+ DGRS+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM LL+TMSF+DPL Sbjct: 527 YPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPL 586 Query: 530 PAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLII 351 PAFRMKQW IVLLFLDALSV RIP LTPYM RR PKV++GAQISA E+EIMKDLII Sbjct: 587 PAFRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLII 646 Query: 350 PLGRVPQFSTQSGG 309 PLGRVPQFS QSGG Sbjct: 647 PLGRVPQFSMQSGG 660 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 
[Solanum tuberosum] Length = 662 Score = 673 bits (1736), Expect = 0.0 Identities = 382/669 (57%), Positives = 467/669 (69%), Gaps = 25/669 (3%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M K E + VKDAVHKLQL LL+GIK ENQL AAGSL+SRSDYQDVVTER+IAN+CGYPLC Sbjct: 1 MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 NSL +ER KG YRISLKEHKVYDL ETYMYCS++C++NS AFA SLQ+ERSSTLNPAK Sbjct: 61 SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAG-QVALEEWIGPSNAIDGYV 1704 LN+VL LF GL L S ++ +NGDLG S LKIQEK D G +V+LEEW+GPSNAI+GYV Sbjct: 121 LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180 Query: 1703 PRRVRDLKHPQSNN-NKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK-- 1533 P+R R + N NKG + +KH ++ IL+ + +F+STIITQDEYS+SK Sbjct: 181 PQRDRSVNPALLKNINKGFK----NKHARLQDEKNMILN-EFDFSSTIITQDEYSVSKFP 235 Query: 1532 ----TVPAVKAKEPKGKASSKEVNRQSNPVQK--PTAPLTNIQETRSKNKSKNVITKDDK 1371 V + K KE + K K + + + K L + +ET +K+ + K DK Sbjct: 236 APVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFL-KVDK 294 Query: 1370 LSLLENIAGPSQND-STKAVKELQ----------ESTAGAXXXXXXXXXXXKATRSVTWA 1224 + E +GPSQ+D K+V + E K ++SVTWA Sbjct: 295 FNSGEVSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWA 354 Query: 1223 DEKTDGD-GQNLNECRELKDKKG-AVVTSHSADEEVGEESYRFASAEACAMALTQAAEEV 1050 DE DG G+ ++ + + A S S D E ++SYRF SAEACA AL+QAAE V Sbjct: 355 DEIIDGGIGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAV 414 Query: 1049 ASGKSEASDAVSEAGVIILPPPHGEDEE--ENGDVMETDPLQLKWPPKPGXXXXXXXXXX 876 ASG S+ DAVS+AG++ILP DE + ++++ +P LKWP KPG Sbjct: 415 ASG-SDVPDAVSKAGIVILPTSQEVDEAILQETEMLDIEPAPLKWPRKPGMPNYDVFESE 473 Query: 875 XSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKI 696 WYD PPEGFN+TLSPF+TMF +LF+W+SSSSLA+IYG +E+ +EEYLS+NGREYP KI Sbjct: 474 DCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPHKI 533 Query: 695 
VMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRM 516 V+ DG S+EIKQTLAGCLARALPGLVA+LRLP+P+STLEQGM LL+TMSF+DPLPAFRM Sbjct: 534 VLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRM 593 Query: 515 KQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRV 336 KQW IVLLFLDALSV RIP LTPYM RR LPKV++GAQIS E+EIMKDLIIPLGRV Sbjct: 594 KQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLGRV 653 Query: 335 PQFSTQSGG 309 PQFS QSGG Sbjct: 654 PQFSMQSGG 662 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 650 bits (1678), Expect = 0.0 Identities = 358/655 (54%), Positives = 465/655 (70%), Gaps = 18/655 (2%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M K+E ++VKD V+KLQLSLL+GI++E+QL AAGSL+SRSDY+DVV ER+I+N+CGYPLC Sbjct: 1 MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 NSL ++RP+KGRYRISLKEH+VYDLQETYMYCSSSCL+NSRAF+ SLQE+R S LNP K Sbjct: 61 NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LNE+L+ F+ L+LDS+ +G++GDLGLS LKIQEK++T G+V+LEEWIGPSNAI+GYVP Sbjct: 121 LNEILRKFNDLTLDSE-GLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISK---- 1533 + RD +P N+K + + K + D D +FTSTIIT DEYSISK Sbjct: 180 QGDRD-PNPSLKNHKEGLKAICKKP----VSKQDCFFSDTDFTSTIITNDEYSISKGPSG 234 Query: 1532 ---TVPAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSL 1362 T +K + GK + +N Q + ++K + + +SK + K + K+ Sbjct: 235 LTSTASDIKLQAQTGKGH-EGLNAQLSSLRKQDSIKAS---RKSKGRRKEKVIKEQ---- 286 Query: 1361 LENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKAT------RSVTWADEKTDGDG 1200 L PS + T +++ ++T A K++ RSVTWADE+ D G Sbjct: 287 LNFQDLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADERVDNAG 346 Query: 1199 
-QNLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASD 1023 +NL E +E++ + S SA++ RF SAEACA+AL+QAAE VASG ++ + Sbjct: 347 SRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNK 406 Query: 1022 AVSEAGVIILPPPH----GEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSP 855 A+SEAG+I+LPP G + E+N D++E + LKWP KPG SWYD+P Sbjct: 407 AMSEAGIIVLPPSQDLGQGGNVEKN-DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAP 465 Query: 854 PEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRS 675 PEGF+LTLSPF+TM+MALF+WV+SSSLAYIYG++ES HE+YLSVNGREYP+KIV+ DGRS Sbjct: 466 PEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRS 525 Query: 674 SEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIV 495 SEI+ T CLAR PGLVA LRLPIPVSTLEQG GRLL+TMSF+D LPAFR KQW I Sbjct: 526 SEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIA 585 Query: 494 LLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQ 330 LLF++ALSV RIPALT YM RR++L +V++GA ISAEE++IMKD ++PLGR PQ Sbjct: 586 LLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 648 bits (1671), Expect = 0.0 Identities = 371/709 (52%), Positives = 469/709 (66%), Gaps = 66/709 (9%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ VKD ++KLQLSLLDGI++E+QL AAGS++S SDY+DVVTERTIAN+CGYPLC Sbjct: 1 MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 GNSL ++RP KGRYRISLKEHKVYDL ETYMYCSSSC+INSR F+ SLQEER LNPAK Sbjct: 61 GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LNEVL LFD SL S+ ++GKNGDLG S LKI+EKT+ V G+V+ E+WIGPSNAI+GYVP Sbjct: 121 LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180 Query: 1700 
RRVR--------DL---------------------------KHPQSNNNKGERR-EVGSK 1629 +R R D+ K Q KG + GSK Sbjct: 181 QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240 Query: 1628 HRHVRPNAA-DILSYDMNFTSTII-TQDEYSISK-------TVPAVKAKEPKGKASSKEV 1476 + + ++ + DMNFTSTII TQDEYSISK T K ++ K K S K Sbjct: 241 AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300 Query: 1475 NRQSNPVQKPTAPLTN--IQETRSKNKSKNVITKDDKLSLLENIAGPS---------QND 1329 QS+ +K + T+ ++E RSK K+ ++ D S ++ S ++ Sbjct: 301 ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360 Query: 1328 STKAVKELQES------TAGAXXXXXXXXXXXKATRSVTWADEKTDGDG-QNLNECRELK 1170 S KA K ++ S T+GA TRSVTWADEK G ++L E R ++ Sbjct: 361 SEKAAKPVESSLKPSLKTSGAKQL----------TRSVTWADEKVGSSGSRDLCEVRGME 410 Query: 1169 DKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILP 990 D K + D+ +F SAEACA AL+QAAE VASG ++AS+A+SEAG++ILP Sbjct: 411 DTKAGPEIVDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILP 470 Query: 989 PPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFS 819 PH D+ E+ DV++ + +KWP KPG SWYD+PPEGF+L LS F+ Sbjct: 471 QPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFA 530 Query: 818 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 639 T++MALF+WV+SSSLAY+YGK+ES HEEYL VNGREYP+KIV+ DGRS EI+QT+ GCL Sbjct: 531 TIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLG 590 Query: 638 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 459 RA P +VA+LRLPIP+STLEQG LL TMSF+D +PAFRMKQW I LLF++ALSV RI Sbjct: 591 RAFPVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRI 650 Query: 458 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 PAL YM +RR+ V++G ++SAEE+E+MKDL+IPLGR PQFS QSG Sbjct: 651 PALISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695 >gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris] Length = 706 Score = 642 bits (1657), Expect = 0.0 Identities = 369/713 (51%), Positives = 465/713 (65%), Gaps = 70/713 (9%) Frame = -2 Query: 2240 
MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ ++VKDAV KLQ+ LL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC Sbjct: 1 MAKDKAVSVKDAVFKLQMLLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L +ERP KG+YRISLKEHKVYDLQETYM+CSS+C+++S+AF+ LQ ER S L+P K Sbjct: 61 CNALPSERPRKGKYRISLKEHKVYDLQETYMFCSSNCVVSSKAFSGILQAERCSALDPEK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN VL LF+ L+L+ N+ K+GDLGLS LKIQEKT T +G+V LE+W+GPSNAI+GYVP Sbjct: 121 LNNVLGLFENLNLEQTENVPKDGDLGLSNLKIQEKTVTTSGEVPLEQWVGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRPNA-ADILSYDMNFTSTIITQDEYSISKT 1530 + P+ +KG R+ V GSK H + N D+++ +MNF STII QDEYS+SK Sbjct: 181 K-------PRERESKGLRKNVKKGSKAGHGKSNNDKDLINSEMNFVSTIIMQDEYSVSKA 233 Query: 1529 VPA-----------------------------------------------VKAKEPKGKA 1491 P + A E KGK Sbjct: 234 SPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDSIQDLSSSFESGLHLSASE-KGKE 292 Query: 1490 SSK--EVNRQSNP---VQKPTAPLTNIQETR---SKNKS--KNVITKDDKLSLLENIAGP 1341 SK EV +S P ++K A +I E KN S K+V K + + N Sbjct: 293 VSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNNSARKSVQLKGETSRVTVNGDAS 352 Query: 1340 SQNDSTKAVKE-LQESTAGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNEC 1182 + N VKE Q G A +R+VTWADEK +G G ++L E Sbjct: 353 TSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKLSRTVTWADEKINGAGNKDLCEV 412 Query: 1181 RELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002 +E D + + D E+ R ASAEACA+AL+QA+E VASG S+A+DAVSEAG+ Sbjct: 413 KEFGDIIKESESVGNEDVANNEDMLRQASAEACAIALSQASEAVASGDSDATDAVSEAGI 472 Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831 IILP PH EE E+ D+++ D + LKWP KPG SW+D+PPEGF+LTL Sbjct: 473 IILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGISDIDFFESDDSWFDAPPEGFSLTL 532 Query: 830 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651 SPF+ M+ A+FSW++S SLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQT A Sbjct: 533 SPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSVNGREYPCKVVLSDGRSSEIKQTFA 592 Query: 
650 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471 GCLARA P LVA LRLPIP+STLEQGM LL+TMSF+D LPAFR KQW + LLF+DALS Sbjct: 593 GCLARAFPALVAGLRLPIPISTLEQGMACLLETMSFVDALPAFRTKQWQVVALLFVDALS 652 Query: 470 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 V RIP+L YM DRR L KV+ G+QI EE+EI+KDL++PLGR P S QSG Sbjct: 653 VCRIPSLISYMTDRRALFHKVLSGSQIGMEEYEILKDLVVPLGRAPHISVQSG 705 >ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Glycine max] Length = 706 Score = 631 bits (1627), Expect = e-178 Identities = 355/713 (49%), Positives = 464/713 (65%), Gaps = 70/713 (9%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN VL LF+ L+L+ + KNGDLGLS LKIQEKT+ +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530 + P++ ++KG R+ V GSK H + + ++++ +M F STII QDEYS+SK Sbjct: 181 K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233 Query: 1529 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 1407 P K+P+ K ++ V + + +Q K + L+ ++ Sbjct: 234 PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292 Query: 1406 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 1293 KS + K +S+ E QNDS + +++ T Sbjct: 293 TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352 Query: 1292 ----------------AGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNECR 1179 AG A +R+VTWADEK + G ++L E + Sbjct: 353 
SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412 Query: 1178 ELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002 E D KK + ++ D E+ R ASAEACA+AL+ A+E VASG S+ SDAVSEAG+ Sbjct: 413 EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGI 472 Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831 ILPPPH EE E+ D+++ D + LKWP K G SW+D+PPEGF+LTL Sbjct: 473 TILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTL 532 Query: 830 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651 SPF+TM+ LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DGRSSEIKQTLA Sbjct: 533 SPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLA 592 Query: 650 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471 CLARALP LVA LRLPIPVS +EQGM LL+TMSF+D LPAFR KQW + LLF+DALS Sbjct: 593 SCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALS 652 Query: 470 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 V R+PAL YM DRR +V+ G+QI EE+E++KDL++PLGR P S+QSG Sbjct: 653 VCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705 >ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Glycine max] Length = 706 Score = 625 bits (1613), Expect = e-176 Identities = 353/717 (49%), Positives = 456/717 (63%), Gaps = 74/717 (10%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I NVCGYPLC Sbjct: 1 MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L ++RP KGRYRISLKEHKVYDL ETYM+C S+C+++S+AFA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN +L LF+ L+L+ N+ KN D GLS LKIQEKT+T +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1700 
RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530 + P+ +++KG R+ V GSK H +P + +++S +M F STII QD YS+SK Sbjct: 181 K-------PRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKV 233 Query: 1529 VPAVKAK------------EPKGKASSKEVNRQSNPVQKPTAP---------------LT 1431 +P + + GK +K V + +Q ++ L Sbjct: 234 LPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELA 293 Query: 1430 NIQETRSKNKSKNVITKDD--KLSLLENIAGPSQNDSTK--------------------- 1320 E K+ I K D +S+ E QNDS K Sbjct: 294 QSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTS 353 Query: 1319 ------AVKELQESTAGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-------QN 1194 ++ Q AG A +R+VTWAD+K + G +N Sbjct: 354 NLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKN 413 Query: 1193 LNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVS 1014 + R D G +S D E++ R ASAEAC +AL+ A+E VASG S+ SDAVS Sbjct: 414 FGDIRNESDSAG-----NSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVS 468 Query: 1013 EAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGF 843 EAG+IILPPPH EE E+ D+++ D + +KWP KPG SW+D+ PEGF Sbjct: 469 EAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGF 528 Query: 842 NLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIK 663 +LTLSPF+TM+ LFSW++SSSLAYIYG++ESF EEYLSVNGREYP K+V+ DGRSSEIK Sbjct: 529 SLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIK 588 Query: 662 QTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFL 483 QTLA CLARALP LVA LRLPIPVST+EQGM LL+TMSF+D LPAFR KQW + LLF+ Sbjct: 589 QTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFI 648 Query: 482 DALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 DALSV R+PAL YM DRR +V+ G+QI EE+E++KDL +PLGR P S QSG Sbjct: 649 DALSVCRLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705 >ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Glycine max] Length = 716 Score = 623 bits (1606), Expect = e-175 Identities = 355/723 (49%), Positives = 464/723 (64%), Gaps = 
80/723 (11%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ ++VKDAV KLQ+SLL+GI++E+QL AAGSL+SRSDY+D+VTER+I N+CGYPLC Sbjct: 1 MAKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L ++RP KGRYRISLKEHKVYDLQETYM+CSS+CL++S+ FA SLQ ER S L+ K Sbjct: 61 SNALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN VL LF+ L+L+ + KNGDLGLS LKIQEKT+ +G+V+LE+W GPSNAI+GYVP Sbjct: 121 LNNVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRP-NAADILSYDMNFTSTIITQDEYSISKT 1530 + P++ ++KG R+ V GSK H + + ++++ +M F STII QDEYS+SK Sbjct: 181 K-------PRNRDSKGLRKNVKKGSKTGHGKSISDINLINSEMGFVSTIIMQDEYSVSKV 233 Query: 1529 VPA-------------VKAKEPKGKASSKEVNRQSNPVQ------KPTAPLTNIQETRSK 1407 P K+P+ K ++ V + + +Q K + L+ ++ Sbjct: 234 PPGQMDATANHQIKPTATVKQPE-KVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEV 292 Query: 1406 NKSKNVITKDD-----------KLSLLENIAGPSQNDSTKAVKELQEST----------- 1293 KS + K +S+ E QNDS + +++ T Sbjct: 293 TKSCEAVLKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDAST 352 Query: 1292 ----------------AGAXXXXXXXXXXXKA-----TRSVTWADEKTDGDG-QNLNECR 1179 AG A +R+VTWADEK + G ++L E + Sbjct: 353 SNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFK 412 Query: 1178 ELKD-KKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAV----- 1017 E D KK + ++ D E+ R ASAEACA+AL+ A+E VASG S+ SDAV Sbjct: 413 EFGDIKKESDSVGNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMN 472 Query: 1016 -----SEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYD 861 SEAG+ ILPPPH EE E+ D+++ D + LKWP K G SW+D Sbjct: 473 ETCAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFD 532 Query: 860 SPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDG 681 +PPEGF+LTLSPF+TM+ LFSW +SSSLAYIYG++ESFHEEYLSVNGREYP K+V+ DG Sbjct: 533 
APPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADG 592 Query: 680 RSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHA 501 RSSEIKQTLA CLARALP LVA LRLPIPVS +EQGM LL+TMSF+D LPAFR KQW Sbjct: 593 RSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQV 652 Query: 500 IVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFST 321 + LLF+DALSV R+PAL YM DRR +V+ G+QI EE+E++KDL++PLGR P S+ Sbjct: 653 VALLFIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISS 712 Query: 320 QSG 312 QSG Sbjct: 713 QSG 715 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 622 bits (1604), Expect = e-175 Identities = 346/675 (51%), Positives = 454/675 (67%), Gaps = 32/675 (4%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KD+ ++VKDAV KLQL+LL+GI+ E+QL AAGSLISRSDY+DVVTER+I VC YPLC Sbjct: 1 MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L +ERP KGRYRISLKEHKVYDL ETYM+CSSSC++NS+AFA SL+++R L+P K Sbjct: 61 CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LN +L+LF +L+ N GK+G+LGLS L+IQ+KT+TV +V+LE+W+GPSNAI+GYVP Sbjct: 121 LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179 Query: 1700 RRVRDLKHPQSNNNKGERREV--GSKHRHVRPNAA-DILSYDMNFTSTIITQDEYSISKT 1530 ++ + N +KG ++ GSK H + N ++++ + +F STII QDEYS+SK Sbjct: 180 KK-------RDNGSKGSQKNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKV 232 Query: 1529 V-------------PAVKAKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSK-- 1395 P ++PK E+ R+ + +Q ++ + + K K Sbjct: 233 SSGQTDATVDHQIKPTAILEQPK--RVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEI 290 Query: 1394 -----NVITKDDKLSLLENIAGPSQNDSTKAVKELQ-ESTAGAXXXXXXXXXXXKAT--- 1242 NV+ + + S D + +++Q E G+ Sbjct: 291 AKSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKL 350 Query: 1241 
-RSVTWADEKTDGDGQ-NLNECRELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALT 1068 RSVTWAD+K DG G +L +E + K + + D E+ R SAEACA+AL+ Sbjct: 351 GRSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALS 410 Query: 1067 QAAEEVASGKSEASDAVSEAGVIILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXX 897 QAAE VASG S+A DAVSEAG+IILP EE ++ D++ETD + LKWP KPG Sbjct: 411 QAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISD 470 Query: 896 XXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNG 717 SW+D+PPEGF+LTLSPF+T++ A FSW++SSSLAYIYG++ SF+EE+LSV+G Sbjct: 471 FDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDG 530 Query: 716 REYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFID 537 REYP KIV+ DGRSSEIKQTLA CLARALP +VAEL+LP+PVSTLEQGM LLDTMSF+D Sbjct: 531 REYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVD 590 Query: 536 PLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDL 357 PLP FR KQW + LLF+DALSV RIPAL YM DRR L KV+ G+QI EE+ ++KDL Sbjct: 591 PLPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDL 650 Query: 356 IIPLGRVPQFSTQSG 312 I+PLGR P FS+QSG Sbjct: 651 IVPLGRAPHFSSQSG 665 >gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 619 bits (1595), Expect = e-174 Identities = 355/700 (50%), Positives = 456/700 (65%), Gaps = 51/700 (7%) Frame = -2 Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 1718 
IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554 I+GYVP+R + P++N NK ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 962 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792 +GD++E + +KWP KPG SW+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 791 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 611 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMD 432 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW IVLLF+DALSV RIPALTP+M + Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTN 698 Query: 431 RRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 R+LL KV++GAQIS EE+E+MKDLIIPLGR P FS QSG Sbjct: 699 GRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 610 bits (1573), Expect = e-172 Identities = 349/703 (49%), Positives = 455/703 (64%), Gaps = 66/703 (9%) Frame = -2 Query: 2222 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 
2043 ++VKD V++LQLSLL G+ E+QL AAGS++SRSDY DVVTER+IAN+CGYPLC N L + Sbjct: 9 ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68 Query: 2042 ERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEVLK 1863 +RP KGRYRISLKEHKVYDL ETYMYCSS C+INSR FAASL++ER + L+ A+++ VL+ Sbjct: 69 DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128 Query: 1862 LFDGLS-LDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVPRRVRD 1686 +F+ S L+ ++ GK+ DLG S LKI+EKT+ G V+LE+W GPSNAI+GYV +R R Sbjct: 129 MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188 Query: 1685 LKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPAVK--- 1515 P+ +K +R GSK + +L DM+F STIIT+DEY++SKT ++K Sbjct: 189 ---PKELGSKSPKR--GSKANNT------VLINDMDFVSTIITEDEYTVSKTPSSLKKTG 237 Query: 1514 ----AKEPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVITKDDKLSLLENIA 1347 +E + + K + + ++ AP +N+ +R ++V + S L + Sbjct: 238 LDSKVREQEEILAKKAMGNEFAVLETSYAPASNV--SRVGLVFEDVTSSLRAGSCLSSAR 295 Query: 1346 GPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKATRSVTWADEKTDGDG----------- 1200 ++ KA K T + K +R+VTWADEKTD G Sbjct: 296 AEEESHDDKAEK----CTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIE 351 Query: 1199 ---------QNLN--------------------------------ECRELKDKKGAVVTS 1143 +N N E RE++D K A Sbjct: 352 DMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADML 411 Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDE-- 969 +AD ++++RFASAEACA AL +A+E VAS + E +DA+SEAG+IILP P DE Sbjct: 412 CNADTGENDDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGE 471 Query: 968 --EENGDVMETDPLQ--LKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMAL 801 EE+ D ++P Q +KWP KPG SW+D+PPE F+LTLSPF+ M+ AL Sbjct: 472 PMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNAL 531 Query: 800 FSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGL 621 F+W +SS+LAYIYG++ES HEEY VNGREYP+KIV DGRSSEIKQTLAG LARALPGL Sbjct: 532 FTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGL 591 Query: 620 VAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPY 441 
VA+LRL P+S+LEQGMGRLLDTMSF+D LP FRMKQW I+LLFL+ALSV R+PALTP+ Sbjct: 592 VADLRLSTPISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPH 651 Query: 440 MMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 MM RR+L KV++ AQISAEE+E+MKDL+IPLGR P FS QSG Sbjct: 652 MMYRRVLFHKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694 >gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea] Length = 597 Score = 592 bits (1525), Expect = e-166 Identities = 337/639 (52%), Positives = 427/639 (66%), Gaps = 6/639 (0%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M KDE+LT+K+AV++LQ SLL+G K+ENQL+AAGSL+SR DYQD+VTER IA +CGYPLC Sbjct: 1 MAKDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 N+L++ERP KGRYRISLKEHKVYD+QETY +CSS CLINSRAF+ L +ER+S L+P K Sbjct: 61 SNNLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 LNEVLK FDG +S NMG+N DLGLS L+I EK + AG+V+ EWIGPS+AIDGYVP Sbjct: 121 LNEVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVP 180 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRPNAADILSYDMNFTSTIITQDEYSISKTVPA 1521 RR R+ S KGE S++ I DM+FTS II Q+EYSI+KT Sbjct: 181 RRDRNSNTLSSKQKKGE-----SRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTP 235 Query: 1520 VKAKEPKGKASSKEVNRQS-NPVQKPTAPLTNIQETRSKNKSK-NVITK-DDKLSLLENI 1350 +K+ G+++ K + + P Q P + + NI+ + +N SK N K D KLS E+ Sbjct: 236 SSSKQ-SGESNEKVIPEEDVRPKQSPDSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDK 294 Query: 1349 AGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXK---ATRSVTWADEKTDGDGQNLNECR 1179 A S+N + + +S GA TR+V+WAD K + DGQNL Sbjct: 295 A--SENGGEPKLADGDKSAQGAAVLKSSLKTSYSKETTTRTVSWADVKAE-DGQNLETVC 351 Query: 1178 ELKDKKGAVVTSHSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVI 999 E+ D G ++ ++ S E+ A T+A+++ A GK +D Sbjct: 352 EMNDPHGGGISRETS------------SVESHKTASTKASKD-APGKFLLTDF------- 391 Query: 998 ILPPPHGEDEEENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFS 819 G++ T+ + 
LKWPPKPG + YD PP+GFNL+LSPF Sbjct: 392 -----------NEGEIF-TEAI-LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPFC 438 Query: 818 TMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLA 639 T+F +LFSW+SSSSLAYIYGK++SFHEEY++ NGREYP K+V DGRSSEIKQTL+ LA Sbjct: 439 TLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAALA 498 Query: 638 RALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRI 459 RALPG+V+ELRLP P+S LEQGMGRLLDTMSFIDPLP+ R KQW AIVLLFL+ALSVSRI Sbjct: 499 RALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLFLNALSVSRI 558 Query: 458 PALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLG 342 PAL+ Y+ DRR + KV+EGA I EEFE+MKDLIIPLG Sbjct: 559 PALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 589 bits (1519), Expect = e-165 Identities = 342/673 (50%), Positives = 443/673 (65%), Gaps = 31/673 (4%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 ++L ++ +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+ LQ+ER S +NP K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 L E+LKLF+ +SLDS NMG N D SGL+IQEK ++ G+V +EEW+GPSNAI+GYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 1527 R D K ++ G+ + GSK + ++P D S D + TSTIIT +EYS+SK Sbjct: 178 HR--DHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFS-DFSITSTIITDEEYSVSKIS 233 Query: 1526 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 1383 +K +K G+ KE N Q ++ P AP + SK ++K T Sbjct: 234 SGLKEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293 Query: 1382 
KDDKLSLLENIAGPSQNDSTKAVKELQESTAG-------AXXXXXXXXXXXKATRSVTWA 1224 K+ +L + S+N ST +E G RSVTWA Sbjct: 294 KESTDNL-SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWA 352 Query: 1223 DEKTDGDG-QNLNECREL-KDKKGAVVTSHSAD-EEVGEESYRFASAEACAMALTQAAEE 1053 DEKTD NL E E+ K K+ + TS+ + + E+ R SAEACAMAL+QAAE Sbjct: 353 DEKTDDASIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEA 412 Query: 1052 VASGKSEASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWPP-------KPGXXXX 894 + SG+SE SDAVSEAG+IILP P +EE + TDP+ P K G Sbjct: 413 ITSGQSEVSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRS 467 Query: 893 XXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGR 714 SWYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+ Sbjct: 468 DLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGK 527 Query: 713 EYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDP 534 EYP KIV DGRSSEIKQTLAGCL RA+PGL +EL L P+S LE GM LLDTM+F+D Sbjct: 528 EYPSKIVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDA 587 Query: 533 LPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLI 354 LPAFRMKQW IVLLF++ALSVSRIP+L +M R L KV++ AQI ++E+EIM+D I Sbjct: 588 LPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHI 647 Query: 353 IPLGRVPQFSTQS 315 +PLGR Q S ++ Sbjct: 648 LPLGRTAQLSDEN 660 >gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 585 bits (1507), Expect = e-164 Identities = 351/713 (49%), Positives = 450/713 (63%), Gaps = 76/713 (10%) Frame = -2 Query: 2222 LTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLCGNSLSA 2043 ++VKD V+KLQL+LL+GIK ++ L AGS+ISRSDY DVVTERTIAN+CGYPLC N+L + Sbjct: 13 ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72 Query: 2042 E--RPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAKLNEV 1869 + RP KG YRISLKEHKVYDL ETYMYCSS C+I S+AFA SL EER L+ K+ + Sbjct: 73 DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132 Query: 1868 LKLFDGLSLD-SDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEW--------------- 1737 L+ F + D +V G+ GDLG+S LKI+EK 
+T G + + Sbjct: 133 LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192 Query: 1736 IGPSNAIDGYVPRRVRDLKHPQSNNNKGERREVGSKHRHVRPNAA-DILSYDMNFTSTII 1560 +GPSNAI+GYVP++ R K S NK GSK + + ++ DI+ +M+F STII Sbjct: 193 VGPSNAIEGYVPQKERISKPLGSKKNK-----EGSKGKDAKMSSGMDIIFNEMDFMSTII 247 Query: 1559 TQDEYSISKTVPAVKAK--EPKGKASSKEVNRQSNPVQKPTAPLTNIQETRSKNKSKNVI 1386 T DEYS+SK P+V E K K S +V N +++++R KN Sbjct: 248 TSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKN---------DSVKKSRQSKGGKNKN 298 Query: 1385 TKDDKLSLLE--NIAGPSQ---NDSTKAVKE------LQESTAGAXXXXXXXXXXXKATR 1239 K D + + E + + SQ N STK KE ++S K R Sbjct: 299 VKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNR 358 Query: 1238 SVTWADEKTDGDG-QNLNECRELK---DKKGAVVTSH--SADEEVG-------------- 1119 SVTWADE D G +NL E RE++ + A + H S + +VG Sbjct: 359 SVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTK 418 Query: 1118 ---------------------EESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGV 1002 +E+ SAEACAMAL QAAE VASG+S+ S AVS AG+ Sbjct: 419 SKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGI 478 Query: 1001 IILPPPHGEDEE---ENGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTL 831 IILP P G DEE E+ D++E++ L WP KPG SW+D+PPEGF++TL Sbjct: 479 IILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTL 537 Query: 830 SPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLA 651 SPF+TM+ +LF+W++SS+LAYIYG++ESFHEE+LSVNGREYP KIV+ GRSSEIK+TL Sbjct: 538 SPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLD 597 Query: 650 GCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALS 471 ARALPG+V+ELRLP P+S+LEQGMGR+L+TMSFID +PAFRMKQW IVLLFL+ LS Sbjct: 598 ESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLS 657 Query: 470 VSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVPQFSTQSG 312 V RIPALTP+M +RR+L KV+E QISAE++E+MKDLIIPLGR PQFS QSG Sbjct: 658 VCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 
573 bits (1476), Expect = e-160 Identities = 334/666 (50%), Positives = 435/666 (65%), Gaps = 24/666 (3%) Frame = -2 Query: 2240 MTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANVCGYPLC 2061 M K++ + +KD V+KLQL+L +GIK+ENQL AAGSL+SRSDY+DVVTER+IA++CGYPLC Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60 Query: 2060 GNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSSTLNPAK 1881 ++L ++ +GRYRISLKEHKVYDL+ETY YCSS+CLINSRAF+ LQ+ER S +NP K Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120 Query: 1880 LNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNAIDGYVP 1701 L E+LKLF+ +SLDS NMG N D SGL+IQEK ++ G+V +EEW+GPSNAI+GYVP Sbjct: 121 LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177 Query: 1700 RRVRDLKHPQSNNNKGERREVGSKHRHVRP--NAADILSYDMNFTSTIITQDEYSISKTV 1527 RD K ++ G+ + GSK + ++P D S D +FTSTIIT +EYS+SK Sbjct: 178 H--RDHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFS-DFSFTSTIITDEEYSVSKIS 233 Query: 1526 PAVK-------AKEPKGKASSKEVNRQSNPVQKPTAPL-----TNIQETRSKNKSKNVIT 1383 +K +K G+ K+ N Q ++ P AP + SK ++K T Sbjct: 234 SGLKEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSAT 293 Query: 1382 KDDKLSLLENIAGPSQNDSTKAVKELQESTAGAXXXXXXXXXXXKATRSVTWADEKTDGD 1203 K + L + S N ST +E DEKTD Sbjct: 294 K-ESTDNLSDAPSTSNNRSTNFNLMTEEP-----------------------RDEKTDDA 329 Query: 1202 G-QNLNECREL-KDKKGAVVTSHSAD-EEVGEESYRFASAEACAMALTQAAEEVASGKSE 1032 NL E E+ K K+ + TS+ + + E+ R SAEACAMAL+QAA+ + SG+SE Sbjct: 330 SIMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSE 389 Query: 1031 ASDAVSEAGVIILPPPHGEDEEENGDVMETDPLQLKWP-------PKPGXXXXXXXXXXX 873 SDAVSEAG+IILP P +EE + TDP+ P K G Sbjct: 390 VSDAVSEAGIIILPHPSDANEEAS-----TDPVNASEPHSFSEKSNKLGVLRSDLFDPSD 444 Query: 872 SWYDSPPEGFNLTLSPFSTMFMALFSWVSSSSLAYIYGKEESFHEEYLSVNGREYPQKIV 693 SWYD+PPEGF+LTLS F+TM+MA+F+WV+SSSLAYIYGK++ FHEE+L ++G+EYP KIV Sbjct: 445 SWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIV 504 Query: 692 MPDGRSSEIKQTLAGCLARALPGLVAELRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMK 513 
DGRSSEIKQTLAGCL RA+PGL +EL L P+S LE GM LLDTM+F+D LPAFRMK Sbjct: 505 SADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMK 564 Query: 512 QWHAIVLLFLDALSVSRIPALTPYMMDRRILLPKVIEGAQISAEEFEIMKDLIIPLGRVP 333 QW IVLLF++ALSVSRIP+L +M R L KV++ AQI ++E+EIM+D I+PLGR Sbjct: 565 QWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTA 624 Query: 332 QFSTQS 315 Q S ++ Sbjct: 625 QLSDEN 630 >gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 568 bits (1465), Expect = e-159 Identities = 330/672 (49%), Positives = 428/672 (63%), Gaps = 48/672 (7%) Frame = -2 Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554 I+GYVP+R + P++N NK ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1142 
HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E E+ E Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEVDK--------EEPME 510 Query: 962 NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSWVSS 783 +GD++E + +KWP KPG SW+D+PPEGF+LTLS F+TM+ ALF W++S Sbjct: 511 DGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITS 570 Query: 782 SSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAELRL 603 SSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V +LRL Sbjct: 571 SSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRL 630 Query: 602 PIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQWHAIVLLFLDALSVSRIPALTPYMMDRRI 423 PIP+STLEQGMG L+DT+SF++ LPAFRMKQW IVLLF+DALSV RIPALTP+M + R+ Sbjct: 631 PIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRM 690 Query: 422 LLPKVIEGAQIS 387 LL KV++GAQIS Sbjct: 691 LLHKVLDGAQIS 702 >gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 528 bits (1359), Expect = e-147 Identities = 308/635 (48%), Positives = 401/635 (63%), Gaps = 51/635 (8%) Frame = -2 Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554 I+GYVP+R + P++N NK ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416 DEY ISK + K + 
+S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 962 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792 +GD++E + +KWP KPG SW+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 791 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 611 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 507 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673 >gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 528 bits (1359), Expect = e-147 Identities = 308/635 (48%), Positives = 401/635 (63%), Gaps = 51/635 (8%) Frame = -2 Query: 2258 KRNSKKMTKDEVLTVKDAVHKLQLSLLDGIKHENQLTAAGSLISRSDYQDVVTERTIANV 2079 K++S M K++ ++V +AVHK+QL LLDGI+ E QL A+GSLISRSDY+DVVTERTI+N Sbjct: 49 KKSSSSMAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNT 108 Query: 2078 CGYPLCGNSLSAERPWKGRYRISLKEHKVYDLQETYMYCSSSCLINSRAFAASLQEERSS 1899 CGYPLC N L +E KGRYRISLKEHKVYDLQETYM+CS++CLINSRAFA SLQEER S Sbjct: 109 CGYPLCANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168 Query: 1898 TLNPAKLNEVLKLFDGLSLDSDVNMGKNGDLGLSGLKIQEKTDTVAGQVALEEWIGPSNA 1719 LN AKLN++L LF L LD D ++GKNGDLG S L+I+E + A V+L GPSNA Sbjct: 169 
VLNHAKLNDILSLFGDLDLD-DNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNA 224 Query: 1718 IDGYVPRR--VRDLKHPQSNNNK---GERREVGSKHRHVRPNAADILSYDMNFTSTIITQ 1554 I+GYVP+R + P++N NK ++GSK ++ +++F TII Sbjct: 225 IEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKKEEY------FVNNELDFAGTIIMN 278 Query: 1553 DEYSISKTVPAVKAKEPKGKASSKE--------------VNRQSNPVQKPTAPLTNIQET 1416 DEY ISK + K + +S KE +N + + P+ + ++ Sbjct: 279 DEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDS 338 Query: 1415 RSKNKSKNVITKD--DKLSLLENIAGPSQNDST--------------------KAVKELQ 1302 K + I KD DK + + + + DS+ +A KE Sbjct: 339 NLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETH 398 Query: 1301 ESTAGAXXXXXXXXXXXKA-----TRSVTWADEK-TDGDGQ-NLNECRELKDKKGAVVTS 1143 A A R VTWAD+K D G NL E +E++ KG S Sbjct: 399 ADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEIS 458 Query: 1142 HSADEEVGEESYRFASAEACAMALTQAAEEVASGKSEASDAVSEAGVIILPPPHGEDEEE 963 SA++ + RF SAEACAMAL++AAE VASG S+ +DAV E G+IILP D+EE Sbjct: 459 GSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEE 518 Query: 962 ---NGDVMETDPLQLKWPPKPGXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFSW 792 +GD++E + +KWP KPG SW+D+PPEGF+LTLS F+TM+ ALF W Sbjct: 519 PMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEW 578 Query: 791 VSSSSLAYIYGKEESFHEEYLSVNGREYPQKIVMPDGRSSEIKQTLAGCLARALPGLVAE 612 ++SSSLAYIYG++ESFHEEYLS+NGREYP+KI + DGRSSEIK+TLA C++RALP +V + Sbjct: 579 ITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTD 638 Query: 611 LRLPIPVSTLEQGMGRLLDTMSFIDPLPAFRMKQW 507 LRLPIP+STLEQGMG L+DT+SF++ LPAFRMKQW Sbjct: 639 LRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673