BLASTX nr result
ID: Mentha29_contig00017072
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00017072
         (2413 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...  687   0.0
emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]  614   e-173
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...  608   e-171
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...  597   e-167
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...  582   e-163
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...  558   e-156
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...  553   e-154
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...  548   e-153
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...  545   e-152
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...  545   e-152
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...  536   e-149
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...  534   e-149
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...  532   e-148
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]    525   e-146
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...  503   e-139
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...  497   e-137
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...  485   e-134
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...  482   e-133
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...
453 e-124 ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c... 452 e-124 >gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus] Length = 597 Score = 687 bits (1774), Expect = 0.0 Identities = 372/647 (57%), Positives = 450/647 (69%), Gaps = 2/647 (0%) Frame = -3 Query: 2162 EILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCANSL 1983 +IL VKDA+HKLQL LLEGI E QL+AAGSLIS DY DVVTERTIA +CGYPLC NSL Sbjct: 5 KILGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSL 64 Query: 1982 PSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLNQI 1803 PSE PRKG YRISLKEHKVYDL ET+MYCS+ CLI SRAF A+LEEERS++ +PAK+N + Sbjct: 65 PSEPPRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSV 124 Query: 1802 LKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRSDQ 1623 LK+F+GL DSV+ + K+GDLGLS LKI+EK +GE+S+EEW+GP NAIDGYVPR DQ Sbjct: 125 LKMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQ 184 Query: 1622 KINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKT-VPSVSAKEARG 1446 + H+ PN D L FD+NFTS II QDEY+VSKT VP +EA+G Sbjct: 185 NSERKQPSRKKTESNHA--KPNLADTLPFDVNFTSTIIMQDEYSVSKTAVP----REAKG 238 Query: 1445 KLTGKNVNCEIKPVKKPAAKKEIRPKKSDECLNATERDGDLNVSEDISSGSQSDNTR-KG 1269 K+ GK + +K K ++V +D + SQ+D T K Sbjct: 239 KVKGKMIRKSVKAEK-------------------------ISVLDDTAGPSQNDTTLLKS 273 Query: 1268 KTKLREGKESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDFDGQNLEEFRELEGEKAAI 1089 K + K+ + SVTWAD K+D DG+++ E RE+ K A+ Sbjct: 274 SLKTLDSKKETR-------------------SVTWADEKSDGDGKSISECREIGDNKGAV 314 Query: 1088 VIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPD 909 V+P H T E+V G++SYR SG+ DASDAVSEAGVIILPPP Sbjct: 315 VMP--HLTDEDV-GDESYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIILPPPH 371 Query: 908 GVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMF 729 VD AK ++ E+V+ DP++LKWP KP SWYDSPPEGFNLTLSPFSTMF Sbjct: 372 EVDEAKYEQIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMF 431 Query: 728 MALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARAL 549 M+LFAWISSS+LAY+YGKEE 
F+E+Y+S+NGREYP KI + DGRS+E+K TLAGCLARAL Sbjct: 432 MSLFAWISSSSLAYIYGKEERFHEDYLSINGREYPPKIII-DGRSAEVKHTLAGCLARAL 490 Query: 548 PTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSRITSL 369 P LV+E+R+P PVS +EQG+GRLLDTMSFT+ +P RMKQW VI LFLDALSVSRI +L Sbjct: 491 PGLVSEIRIPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPAL 550 Query: 368 TQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSGG 228 + YM RR LLPK+LEGAQI+ EEFEIMKDLIIPLGR P+FSTQSGG Sbjct: 551 SPYMTGRRILLPKVLEGAQINVEEFEIMKDLIIPLGRVPQFSTQSGG 597 >emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera] Length = 659 Score = 614 bits (1584), Expect = e-173 Identities = 337/659 (51%), Positives = 441/659 (66%), Gaps = 13/659 (1%) Frame = -3 Query: 2165 DEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCANS 1986 D+ + VKDA+HKLQLFLLEGI +E QL AAGSL+S DY DVVTERTIA +CGYPLC+NS Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 1985 LPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLNQ 1806 LPSER RKG YRISLKEHKVYDL ETYMYCSS C++NSR+FA +L+EER + N ++N Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 1805 ILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRSD 1626 IL+LF +S +GK+GDLGLSELKI+E E++AGEVSME+WIGP NAI+GYVP+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1625 QKINNRXXXXXXXXXKHSLSDPNA-QDMLSFDMNFTSAIITQDEYTVSK-------TVPS 1470 + + + K S S ++ ++ + +M+F S IIT+DEY++SK T Sbjct: 184 RNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDTTSH 243 Query: 1469 VSAKEARGKLT-GKNVNCEIKPVKKPAAKKEIRPKKSDECLNATERDGDLNVSEDISSGS 1293 +KE + K + G ++ K E + ++S + + + +E S S Sbjct: 244 AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303 Query: 1292 QSD---NTRKGKTKLREGKESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDF-DGQNLE 1125 QS N KGK + + G K + SVTWAD K D D ++ Sbjct: 304 QSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKMDSADSRDFC 363 Query: 1124 EFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAV 945 + RELE +K 
P ++ +++ R SGE D +DAV Sbjct: 364 KVRELEVKKED---PNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDAV 420 Query: 944 SEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEG 765 SEAG+IILP P +D +S ++A+++E +P+ LKWP+KP SWYD+PPEG Sbjct: 421 SEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEG 480 Query: 764 FNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEI 585 F+LTLSPF+TM+MALFAWI+SS++AY+YG++ESF+EEY+SVNGREYP+KI L DGRSSEI Sbjct: 481 FSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEI 540 Query: 584 KQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLF 405 KQTLAGCL+RALP LV +LRLP+PVS LEQG+GRLLDTMSF + +P+ RMKQW VIV LF Sbjct: 541 KQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLF 600 Query: 404 LDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSGG 228 +DALSV RI +LT +M SRR L PK+ + AQ+S+EE+E+MKDLIIPLGR P+FS QSGG Sbjct: 601 IDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659 >ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera] gi|296089830|emb|CBI39649.3| unnamed protein product [Vitis vinifera] Length = 659 Score = 608 bits (1568), Expect = e-171 Identities = 334/659 (50%), Positives = 439/659 (66%), Gaps = 13/659 (1%) Frame = -3 Query: 2165 DEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCANS 1986 D+ + VKDA+HKLQLFLLEGI +E QL AAGSL+S DY DVVTERTIA +CGYPLC+NS Sbjct: 4 DQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNS 63 Query: 1985 LPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLNQ 1806 LPSER RKG YRISLKEHKVYDL ETYMYCSS C++NSR+FA +L+EER + N ++N Sbjct: 64 LPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERING 123 Query: 1805 ILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRSD 1626 IL+LF +S +GK+GDLGLSELKI+E E++AGEVSME+WIGP NAI+GYVP+ D Sbjct: 124 ILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRD 183 Query: 1625 QKINNRXXXXXXXXXKHSLSDPNA-QDMLSFDMNFTSAIITQDEYTVSK-------TVPS 1470 + + + K S S ++ ++ + +M+F IIT+DEY++SK T Sbjct: 184 
RNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDTTSH 243 Query: 1469 VSAKEARGKLT-GKNVNCEIKPVKKPAAKKEIRPKKSDECLNATERDGDLNVSEDISSGS 1293 +KE + K + G ++ K E + ++S + + + +E S S Sbjct: 244 AKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPSVPS 303 Query: 1292 QSD---NTRKGKTKLREGKESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDF-DGQNLE 1125 QS N KGK + + G K SVTWAD K D D ++ Sbjct: 304 QSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADEKMDSADSRDFC 363 Query: 1124 EFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAV 945 + RELE +K P ++ +++ R SGE D +DAV Sbjct: 364 KVRELEVKKED---PNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDAV 420 Query: 944 SEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEG 765 SEA +IILP P +D +S ++A+++E +P+ LKWP+KP SWYD+PPEG Sbjct: 421 SEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEG 480 Query: 764 FNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEI 585 F+LTLSPF+TM+MALFAWI+SS++AY+YG++ESF+EEY+SVNGREYP+KI L DGRSSEI Sbjct: 481 FSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEI 540 Query: 584 KQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLF 405 KQTLAGCLARALP LV +LRLP+PVS LEQG+GRLLDTMSF + +P+ RMKQW VIV LF Sbjct: 541 KQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLLF 600 Query: 404 LDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSGG 228 +DALSV +I +LT +M+S+R L PK+ + AQ+S+EE+E+MKDLIIPLGR P+FS QSGG Sbjct: 601 IDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659 >ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Solanum lycopersicum] Length = 660 Score = 597 bits (1538), Expect = e-167 Identities = 340/661 (51%), Positives = 433/661 (65%), Gaps = 14/661 (2%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K E + VKDA+HKLQL LLEGI DE QL+AAGSL+S DY DVVTER+IA MCGYPLC+N Sbjct: 3 KGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSN 62 Query: 1988 
SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 SLPSER RKG YRISLKEHKVYDL ETYMYCS+NC++NS AFA +L++ERS+ NPAKLN Sbjct: 63 SLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 Q+L LF+GL S+ D+ +NGD G S+LKI+EK + + GEVS+EEW+GP NAI+GYVP+ Sbjct: 123 QVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQR 182 Query: 1628 DQKINNRXXXXXXXXXKHSLSD-PNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSA--- 1461 D+ +N K+ + + ++M+ + +F+S IITQDEY+VSK V+A Sbjct: 183 DRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNADSN 242 Query: 1460 ---KEARGKLTGKNVNCEIKPVKKPAAKKEIRPKKSDECLNATER--DGDLNVSEDISSG 1296 KE + K K + ++ + K ++R + E + R D S ++SSG Sbjct: 243 VKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSSG 302 Query: 1295 -SQSDNTRKGKTKLREG--KESSSGANGXXXXXXXXXXXKAVC-SVTWADAKTDFD-GQN 1131 SQ D K + + K +S G + K + SVTWAD D G+ Sbjct: 303 PSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSVTWADESIDGGIGKK 362 Query: 1130 LEEFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASD 951 E ++ ++ + +EE +DSYR SG D D Sbjct: 363 TESSSKISEYESQAYGGSASTDMEE--NDDSYRFESAEACAAALSQAAEAVASGS-DVPD 419 Query: 950 AVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPP 771 AVS+AG++ILPP VD A +E E+++ + LKWP KP SWYDSPP Sbjct: 420 AVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESEDSWYDSPP 479 Query: 770 EGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSS 591 EGFN+TLSPF TMF +LF WISSS+LA++YG +ES EEY+S+NGREYPRKI L DGRS+ Sbjct: 480 EGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIVLSDGRST 539 Query: 590 EIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVF 411 EIKQTLAGCLARALP LV +LRLPVP+S LEQG+ LL+TMSF +P+PA RMKQW +IV Sbjct: 540 EIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMKQWQLIVL 599 Query: 410 LFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 LFLDALSV RI +LT YM RR PK+L+GAQIS+ E+EIMKDLIIPLGR P+FS QSG Sbjct: 600 LFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGRVPQFSMQSG 
659 Query: 230 G 228 G Sbjct: 660 G 660 >ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Solanum tuberosum] Length = 662 Score = 582 bits (1501), Expect = e-163 Identities = 337/664 (50%), Positives = 430/664 (64%), Gaps = 17/664 (2%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K E + VKDA+HKLQL LLEGI DE QL+AAGSL+S DY DVVTER+IA MCGYPLC+N Sbjct: 3 KGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 SLPSER RKG YRISLKEHKVYDL ETYMYCS+NC++NS AFA +L++ERS+ NPAKLN Sbjct: 63 SLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTE-REAGEVSMEEWIGPPNAIDGYVPR 1632 Q+L LF+GL S D+ +NGDLG S+LKI+EK + + GEVS+EEW+GP NAI+GYVP+ Sbjct: 123 QVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVPQ 182 Query: 1631 SDQKINNRXXXXXXXXXKHSLSD-PNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSA-- 1461 D+ +N K+ + + ++M+ + +F+S IITQDEY+VSK V+A Sbjct: 183 RDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVS 242 Query: 1460 ----KEARGKLTGKNVNCEIKPVKKPAAKKEIRPKKSDECLNATER--DGDLNVSEDISS 1299 KEA+ K K + ++ + K ++R + E + R D S ++SS Sbjct: 243 SEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS 302 Query: 1298 G-SQSDNTRKGKTKLREG--KESSSGANGXXXXXXXXXXXKAVC---SVTWADAKTDFD- 1140 G SQ D K + + K +S G + + SVTWAD D Sbjct: 303 GPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWADEIIDGGI 362 Query: 1139 GQNLEEFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYD 960 G+ E ++ + + +EE +DSYR SG D Sbjct: 363 GKKTESSSKISEYENQAYGGSASTDMEE--DDDSYRFESAEACAAALSQAAEAVASGS-D 419 Query: 959 ASDAVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYD 780 DAVS+AG++ILP VD A +E E+++ +P LKWP KP WYD Sbjct: 420 VPDAVSKAGIVILPTSQEVDEAILQET-EMLDIEPAPLKWPRKPGMPNYDVFESEDCWYD 478 Query: 779 SPPEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDG 600 PPEGFN+TLSPF+TMF +LF 
WISSS+LA++YG +E+ EEY+S+NGREYP KI L DG Sbjct: 479 GPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPHKIVLSDG 538 Query: 599 RSSEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHV 420 S+EIKQTLAGCLARALP LV +LRLPVP+S LEQG+ LL+TMSF +P+PA RMKQW + Sbjct: 539 LSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMKQWQL 598 Query: 419 IVFLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFST 240 IV LFLDALSV RI +LT YM RR LPK+L+GAQIS+ E+EIMKDLIIPLGR P+FS Sbjct: 599 IVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLGRVPQFSM 658 Query: 239 QSGG 228 QSGG Sbjct: 659 QSGG 662 >ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis] gi|223538861|gb|EEF40460.1| conserved hypothetical protein [Ricinus communis] Length = 645 Score = 558 bits (1439), Expect = e-156 Identities = 311/657 (47%), Positives = 419/657 (63%), Gaps = 17/657 (2%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K+E ++VKD ++KLQL LLEGI +E QLLAAGSL+S DY DVV ER+I+ +CGYPLC N Sbjct: 3 KEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLCNN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 SLPS+RP KGRYRISLKEH+VYDLQETYMYCSS+CL+NSRAF+ +L+E+R + NP KLN Sbjct: 63 SLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 +IL+ F L DS +G++GDLGLS LKI+EK+E G+VS+EEWIGP NAI+GYVP+ Sbjct: 123 EILRKFNDLTLDSE-GLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQG 181 Query: 1628 DQKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSAKEAR 1449 D+ N + QD D +FTS IIT DEY++SK +++ + Sbjct: 182 DRDPNPSLKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTSTASD 241 Query: 1448 GKL---TGKN---VNCEIKPVKKP---AAKKEIRPKKSDECLNATERDGDLNVSEDISSG 1296 KL TGK +N ++ ++K A ++ + ++ ++ + DL S ++ Sbjct: 242 IKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYYTAE 301 Query: 1295 SQSDNTRKGKTKLREG----KESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDFDG-QN 1131 ++ + G L E SSGA SVTWAD + D G +N Sbjct: 
302 AEDISQATGAANLNESVLKPSLKSSGAKRSNR------------SVTWADERVDNAGSRN 349 Query: 1130 LEEFRELEGEKAAIVIPTPHPTVEEVS-GEDSY--RXXXXXXXXXXXXXXXXXXXSGEYD 960 L E +E+E + H E + G+D + R SG+ D Sbjct: 350 LCEVQEMEQTNES------HEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDAD 403 Query: 959 ASDAVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYD 780 + A+SEAG+I+LPP + + E +++E + LKWP KP SWYD Sbjct: 404 VNKAMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYD 463 Query: 779 SPPEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDG 600 +PPEGF+LTLSPF+TM+MALFAW++SS+LAY+YG++ES +E+Y+SVNGREYPRKI L+DG Sbjct: 464 APPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDG 523 Query: 599 RSSEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHV 420 RSSEI+ T CLAR P LV LRLP+PVS LEQG GRLL+TMSF + +PA R KQW V Sbjct: 524 RSSEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQV 583 Query: 419 IVFLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPE 249 I LF++ALSV RI +LT YM SRR +L ++L+GA IS+EE++IMKD ++PLGR P+ Sbjct: 584 IALLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ 640 >ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Glycine max] Length = 706 Score = 553 bits (1426), Expect = e-154 Identities = 315/711 (44%), Positives = 424/711 (59%), Gaps = 65/711 (9%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KD+ ++VKDA+ KLQ+ LLEGI +E QL AAGSL+S DY D+VTER+I MCGYPLC+N Sbjct: 3 KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLCSN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPS+RPRKGRYRISLKEHKVYDLQETYM+CSSNCL++S+ FA +L+ ER + + KLN Sbjct: 63 ALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVP-- 1635 +L LFE L + V + KNGDLGLS+LKI+EKTER +GEVS+E+W GP NAI+GYVP Sbjct: 123 NVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVPKP 182 Query: 1634 
--RSDQKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSA 1461 R + + S+SD N +++ +M F S II QDEY+VSK P Sbjct: 183 RNRDSKGLRKNVKKGSKTGHGKSISDIN---LINSEMGFVSTIIMQDEYSVSKVPPGQMD 239 Query: 1460 KEARGKLTG-------KNVNCEIKPVKKPAAKKEIRPKKSDECLNATERDGD-------- 1326 A ++ + V+ E+ + + KS L+ +E++ + Sbjct: 240 ATANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAV 299 Query: 1325 ----------------LNVSEDISSGSQSDNTRK-----GKTKLREGKESSSGAN----- 1224 +++SE Q+D+ RK GKT + +S +N Sbjct: 300 LKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLDPAN 359 Query: 1223 -----------GXXXXXXXXXXXKA-----VCSVTWADAKTDFDGQN----LEEFRELEG 1104 G A +VTWAD K + G +EF +++ Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFKEFGDIKK 419 Query: 1103 EKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVII 924 E ++ ++ + ED R SG+ D SDAVSEAG+ I Sbjct: 420 ESDSV-----GNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVSEAGITI 474 Query: 923 LPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSP 744 LPPP + E+A+I++ D + LKWP K SW+D+PPEGF+LTLSP Sbjct: 475 LPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTLSP 534 Query: 743 FSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGC 564 F+TM+ LF+W +SS+LAY+YG++ESF+EEY+SVNGREYP K+ L DGRSSEIKQTLA C Sbjct: 535 FATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLASC 594 Query: 563 LARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVS 384 LARALP LV LRLP+PVSI+EQG+ LL+TMSF + +PA R KQW V+ LF+DALSV Sbjct: 595 LARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVC 654 Query: 383 RITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 R+ +L YM RRA ++L G+QI EE+E++KDL++PLGRAP S+QSG Sbjct: 655 RLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQSG 705 >ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cicer arietinum] Length = 666 Score = 548 bits (1413), Expect = e-153 Identities = 317/677 (46%), Positives = 418/677 (61%), Gaps = 31/677 (4%) Frame = -3 Query: 2168 
KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KD+ ++VKDA+ KLQL LLEGI E QL AAGSLIS DY DVVTER+I E+C YPLC N Sbjct: 3 KDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPSERPRKGRYRISLKEHKVYDL ETYM+CSS+C++NS+AFA +L+++R A +P KLN Sbjct: 63 ALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVP-- 1635 IL+LF + + + GK+G+LGLS L+I++KTE EVS+E+W+GP NAI+GYVP Sbjct: 123 NILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVPKK 181 Query: 1634 RSDQKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSAKE 1455 R + ++ H S+ +++++ + +F S II QDEY+VSK VS+ + Sbjct: 182 RDNGSKGSQKNTKKGSKASHGKSN-GVKNLINSEFDFMSTIIMQDEYSVSK----VSSGQ 236 Query: 1454 ARGKLTGKNVNCEIKP---VKKPAAKKEIRPKKSDECLNATER-DGDLNVS---EDISSG 1296 T V+ +IKP +++P +K D+ + + LN+S +D Sbjct: 237 -----TDATVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIA 291 Query: 1295 SQSDNTRKGKTKLREGKESSSGAN---GXXXXXXXXXXXKAVC----------------- 1176 N KGKT + SS +N C Sbjct: 292 KSCKNVLKGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLG 351 Query: 1175 -SVTWADAKTDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXX 1002 SVTWAD K D G +L F+E K + V+ V ED R Sbjct: 352 RSVTWADKKIDGCGSTDLCAFKEFGNIKKESDVAD---NVDVVDDEDILRSVSAEACAIA 408 Query: 1001 XXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXX 822 SG+ DA DAVSEAG+IILP + + ++ +I+ETD + LKWP KP Sbjct: 409 LSQAAEAVASGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGI 468 Query: 821 XXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSV 642 SW+D+PPEGF+LTLSPF+T++ A F+WI+SS+LAY+YG++ SFYEE++SV Sbjct: 469 SDFDLFASDDSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSV 528 Query: 641 NGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSF 462 +GREYP KI L DGRSSEIKQTLA CLARALP +V EL+LP+PVS LEQG+ LLDTMSF Sbjct: 529 DGREYPCKIVLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSF 588 Query: 461 
TNPIPALRMKQWHVIVFLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMK 282 +P+P R KQW V+ LF+DALSV RI +L YM RR L K+L G+QI EE+ ++K Sbjct: 589 VDPLPGFRFKQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLK 648 Query: 281 DLIIPLGRAPEFSTQSG 231 DLI+PLGRAP FS+QSG Sbjct: 649 DLIVPLGRAPHFSSQSG 665 >ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X2 [Glycine max] Length = 716 Score = 545 bits (1405), Expect = e-152 Identities = 315/721 (43%), Positives = 424/721 (58%), Gaps = 75/721 (10%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KD+ ++VKDA+ KLQ+ LLEGI +E QL AAGSL+S DY D+VTER+I MCGYPLC+N Sbjct: 3 KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNMCGYPLCSN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPS+RPRKGRYRISLKEHKVYDLQETYM+CSSNCL++S+ FA +L+ ER + + KLN Sbjct: 63 ALPSDRPRKGRYRISLKEHKVYDLQETYMFCSSNCLVSSKTFAGSLQAERCSGLDLEKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVP-- 1635 +L LFE L + V + KNGDLGLS+LKI+EKTER +GEVS+E+W GP NAI+GYVP Sbjct: 123 NVLSLFENLNLEPVETLQKNGDLGLSDLKIQEKTERSSGEVSLEQWAGPSNAIEGYVPKP 182 Query: 1634 --RSDQKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSA 1461 R + + S+SD N +++ +M F S II QDEY+VSK P Sbjct: 183 RNRDSKGLRKNVKKGSKTGHGKSISDIN---LINSEMGFVSTIIMQDEYSVSKVPPGQMD 239 Query: 1460 KEARGKLTG-------KNVNCEIKPVKKPAAKKEIRPKKSDECLNATERDGD-------- 1326 A ++ + V+ E+ + + KS L+ +E++ + Sbjct: 240 ATANHQIKPTATVKQPEKVDAEVVRKDDDSIQDLSSSFKSSLILSTSEKEEEVTKSCEAV 299 Query: 1325 ----------------LNVSEDISSGSQSDNTRK-----GKTKLREGKESSSGAN----- 1224 +++SE Q+D+ RK GKT + +S +N Sbjct: 300 LKFSPGCAIQKKDVHSISISERQCDVEQNDSARKSVQVKGKTSRVIANDDASTSNLDPAN 359 Query: 1223 -----------GXXXXXXXXXXXKA-----VCSVTWADAKTDFDGQN----LEEFRELEG 1104 G A +VTWAD K + G +EF +++ Sbjct: 360 VEEKFQVEKAGGSLKTKPRSSLKSAGEKKFSRTVTWADEKINSTGSKDLCEFKEFGDIKK 419 Query: 1103 EKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAV------- 945 E ++ ++ + ED 
R SG+ D SDAV Sbjct: 420 ESDSV-----GNNIDVANDEDILRRASAEACAIALSSASEAVASGDSDVSDAVFSPMNET 474 Query: 944 ---SEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSP 774 SEAG+ ILPPP + E+A+I++ D + LKWP K SW+D+P Sbjct: 475 CAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTGISEADFFESDDSWFDAP 534 Query: 773 PEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRS 594 PEGF+LTLSPF+TM+ LF+W +SS+LAY+YG++ESF+EEY+SVNGREYP K+ L DGRS Sbjct: 535 PEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLSVNGREYPCKVVLADGRS 594 Query: 593 SEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIV 414 SEIKQTLA CLARALP LV LRLP+PVSI+EQG+ LL+TMSF + +PA R KQW V+ Sbjct: 595 SEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETMSFVDALPAFRTKQWQVVA 654 Query: 413 FLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQS 234 LF+DALSV R+ +L YM RRA ++L G+QI EE+E++KDL++PLGRAP S+QS Sbjct: 655 LLFIDALSVCRLPALISYMTDRRASFHRVLSGSQIRMEEYEVLKDLVVPLGRAPHISSQS 714 Query: 233 G 231 G Sbjct: 715 G 715 >ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] gi|550321730|gb|EEF05523.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa] Length = 696 Score = 545 bits (1405), Expect = e-152 Identities = 316/703 (44%), Positives = 412/703 (58%), Gaps = 57/703 (8%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KD+ VKD I+KLQL LL+GI +E QLLAAGS++S DY DVVTERTIA +CGYPLC N Sbjct: 3 KDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLCGN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 SLPS+RP+KGRYRISLKEHKVYDL ETYMYCSS+C+INSR F+ +L+EER NPAKLN Sbjct: 63 SLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 ++L LF+ S +GKNGDLG S LKI+EKTE+ GEVS E+WIGP NAI+GYVP+ Sbjct: 123 EVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVPQR 182 Query: 1628 DQ----------------------KINNRXXXXXXXXXKHSLSDPNA------------- 1554 D+ I+ P A Sbjct: 183 
DRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSKAK 242 Query: 1553 -------QDMLSFDMNFTSAII-TQDEYTVSK-------TVPSVSAKEARGKLTGKNVNC 1419 Q+ DMNFTS II TQDEY++SK T ++ + K++ K+ Sbjct: 243 GTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSSEN 302 Query: 1418 EIKPVKKPAAKKEIRPKKSDECLNATERD---GDLNVSEDISSGSQSDNTRKGKTKLREG 1248 + +K + K R K D A + + DL+ D S T + K K Sbjct: 303 QSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSVSE 362 Query: 1247 KESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDFDG-QNLEEFRELEGEKAAIVIPTPH 1071 K + + + SVTWAD K G ++L E R +E KA I Sbjct: 363 KAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEI---- 418 Query: 1070 PTVEEVSGEDS---YRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVD 900 V+ + D + SG+ DAS+A+SEAG++ILP P +D Sbjct: 419 --VDNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLD 476 Query: 899 TAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMAL 720 E+ ++++ + +KWP KP SWYD+PPEGF+L LS F+T++MAL Sbjct: 477 QGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMAL 536 Query: 719 FAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTL 540 FAW++SS+LAYVYGK+ES +EEY+ VNGREYPRKI L DGRS EI+QT+ GCL RA P + Sbjct: 537 FAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVV 596 Query: 539 VTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSRITSLTQY 360 V +LRLP+P+S LEQG LL TMSF + +PA RMKQW VI LF++ALSV RI +L Y Sbjct: 597 VADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISY 656 Query: 359 MLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 M +RR +++G ++S+EE+E+MKDL+IPLGRAP+FS QSG Sbjct: 657 MDNRR----MVVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695 >ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao] Length = 739 Score = 536 bits (1382), Expect = e-149 Identities = 326/695 (46%), Positives = 417/695 (60%), Gaps = 49/695 (7%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ ++V +A+HK+QL LL+GI 
DE+QLLA+GSLIS DY DVVTERTI+ CGYPLCAN Sbjct: 57 KEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLCAN 116 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 LPSE RKGRYRISLKEHKVYDLQETYM+CS+NCLINSRAFA +L+EER + N AKLN Sbjct: 117 PLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAKLN 176 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 IL LF L D D+GKNGDLG S L+IKE E +A +VS+ GP NAI+GYVP+ Sbjct: 177 DILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQR 232 Query: 1628 D------QKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV 1467 + NN+ L + ++ +++F II DEY +SK P Sbjct: 233 ELISKPTPPKNNKNKVFDSSSS--KLGSKKEEYFVNNELDFAGTIIMNDEYIISKK-PGS 289 Query: 1466 SAKEARGKLTGKN---------------VNCEIKPVKKPAAKKEIRPKKSDECLNATERD 1332 + R KL+ K +N E K P+ K+ D L E Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQ---SCFDSNLKEVEEK 346 Query: 1331 GDLNVSED--ISSGSQSDNTRKG---------KTKLREGKESSSG-ANGXXXXXXXXXXX 1188 G SED + SGS S K K + G ++SS A Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406 Query: 1187 KAVCS--------------VTWADAK-TDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEE 1056 + V VTWAD K D G NL E +E+E K I + E+ Sbjct: 407 ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISG---SAED 463 Query: 1055 VSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENA 876 ++ R SG+ D +DAV E G+IILP VD + E+ Sbjct: 464 GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDG 523 Query: 875 EIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSST 696 +++E + +KWP KP SW+D+PPEGF+LTLS F+TM+ ALF WI+SS+ Sbjct: 524 DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 583 Query: 695 LAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPV 516 LAY+YG++ESF+EEY+S+NGREYPRKI L+DGRSSEIK+TLA C++RALP +VT+LRLP+ Sbjct: 584 LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 643 Query: 515 PVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSRITSLTQYMLSRRALL 336 P+S LEQG+G L+DT+SF +PA RMKQW VIV LF+DALSV RI +LT +M + R 
LL Sbjct: 644 PISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLL 703 Query: 335 PKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 K+L+GAQIS EE+E+MKDLIIPLGRAP FS QSG Sbjct: 704 HKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738 >ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Glycine max] Length = 706 Score = 534 bits (1376), Expect = e-149 Identities = 307/711 (43%), Positives = 415/711 (58%), Gaps = 65/711 (9%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KD+ ++VKDA+ KLQ+ LLEGI +E QL AAGSL+S DY D+VTER+I +CGYPLC+N Sbjct: 3 KDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLCSN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPS+RPRKGRYRISLKEHKVYDL ETYM+C SNC+++S+AFA +L+ ER + + KLN Sbjct: 63 ALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 IL LFE L + ++ KN D GLS+LKI+EKTE +GEVS+E+W GP NAI+GYVP+ Sbjct: 123 NILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVPKP 182 Query: 1628 ---DQK-INNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSA 1461 D K + +SD N ++S +M F S II QD Y+VSK +P Sbjct: 183 RDHDSKGLRKNVKKGSKAGHGKPISDIN---LISSEMGFVSTIIMQDGYSVSKVLPGQRD 239 Query: 1460 KEARGKLTGKNVNCEIKPVKKPAAKKE-------IRPKKSDECLNATERDGDL------- 1323 A ++ + ++ V +K+ KS L +E++ +L Sbjct: 240 ATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCEAA 299 Query: 1322 -----------------NVSEDISSGSQSDNTRK-----GKTKLREGKESSSGAN----- 1224 ++SE Q+D+ +K GK + +S +N Sbjct: 300 LKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDPAN 359 Query: 1223 -----------GXXXXXXXXXXXKA-----VCSVTWADAKTDFDGQN----LEEFRELEG 1104 G A +VTWAD K + G + F ++ Sbjct: 360 VEEKFQVEKAGGSLNTKPKSSLKSAGEKKLSRTVTWADKKINSTGSKDLCGFKNFGDIRN 419 Query: 1103 EKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVII 924 E + +++ + ED+ R SG+ D SDAVSEAG+II Sbjct: 420 
ES-----DSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGIII 474 Query: 923 LPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSP 744 LPPP + E+ +I++ D + +KWP KP SW+D+ PEGF+LTLSP Sbjct: 475 LPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLSP 534 Query: 743 FSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGC 564 F+TM+ LF+WI+SS+LAY+YG++ESF EEY+SVNGREYP K+ L DGRSSEIKQTLA C Sbjct: 535 FATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLASC 594 Query: 563 LARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVS 384 LARALPTLV LRLP+PVS +EQG+ LL+TMSF + +PA R KQW V+ LF+DALSV Sbjct: 595 LARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSVC 654 Query: 383 RITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 R+ +L YM RRA ++L G+QI EE+E++KDL +PLGRAP S QSG Sbjct: 655 RLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLGRAPHISAQSG 705 >gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea] Length = 597 Score = 532 bits (1371), Expect = e-148 Identities = 303/640 (47%), Positives = 404/640 (63%), Gaps = 4/640 (0%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 KDEILT+K+A+++LQ LLEG +E QL AAGSL+S DY D+VTER IA++CGYPLC+N Sbjct: 3 KDEILTMKEAVYRLQTSLLEGAKNENQLSAAGSLMSRGDYQDLVTERVIAKICGYPLCSN 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +L SERP KGRYRISLKEHKVYD+QETY +CSS CLINSRAF+ L +ER++ +P KLN Sbjct: 63 NLNSERPSKGRYRISLKEHKVYDVQETYSFCSSGCLINSRAFSIGLPDERTSDLDPIKLN 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 ++LK F+G G +S +MG+N DLGLS+L+I EK EAGEVS EWIGP +AIDGYVPR Sbjct: 123 EVLKRFDGFGANSTPNMGRNEDLGLSQLRIMEKENIEAGEVSSNEWIGPSDAIDGYVPRR 182 Query: 1628 DQKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVSAKEAR 1449 D+ N ++ LS + DM+FTS II Q+EY+++KT S+K++ Sbjct: 183 DRNSNTLSSKQKKGESRYHLSLQVLTSIFPSDMSFTSVIIDQNEYSIAKTTTPSSSKQS- 241 Query: 1448 GKLTGKNVNCE-IKPVKKPAAKKEIRPKKSDECLNATERDGDLNVSEDISSGSQSDNTRK 1272 G+ K + E ++P + P + K N 
++R+G + +S+ + Sbjct: 242 GESNEKVIPEEDVRPKQSP--DSSVANIKGSGFRNPSKRNGRAKIDAKLSASEDKASENG 299 Query: 1271 GKTKLREGKESSSGANGXXXXXXXXXXXKAVC-SVTWADAKTDFDGQNLEEFRELEGEKA 1095 G+ KL +G +S+ GA + +V+WAD K + DGQNLE E+ Sbjct: 300 GEPKLADGDKSAQGAAVLKSSLKTSYSKETTTRTVSWADVKAE-DGQNLETVCEMND--- 355 Query: 1094 AIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSE--AGVIIL 921 P E S +S++ AS S+ G +L Sbjct: 356 ----PHGGGISRETSSVESHK-----------------------TASTKASKDAPGKFLL 388 Query: 920 PPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPF 741 D + + E + LKWP KP + YD PP+GFNL+LSPF Sbjct: 389 -----TDFNEGEIFTEAI------LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPF 437 Query: 740 STMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCL 561 T+F +LF+WISSS+LAY+YGK++SF+EEY++ NGREYP K+ +DGRSSEIKQTL+ L Sbjct: 438 CTLFNSLFSWISSSSLAYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAAL 497 Query: 560 ARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSR 381 ARALP +V+ELRLP P+SILEQG+GRLLDTMSF +P+P+LR KQW IV LFL+ALSVSR Sbjct: 498 ARALPGVVSELRLPTPISILEQGMGRLLDTMSFIDPLPSLRTKQWQAIVLLFLNALSVSR 557 Query: 380 ITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLG 261 I +L++Y+ RRA + K+LEGA I EEFE+MKDLIIPLG Sbjct: 558 IPALSKYLEDRRASIQKVLEGAGIGVEEFEVMKDLIIPLG 597 >gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis] Length = 695 Score = 525 bits (1352), Expect = e-146 Identities = 312/695 (44%), Positives = 420/695 (60%), Gaps = 53/695 (7%) Frame = -3 Query: 2156 LTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCANSLPS 1977 ++VKD +++LQL LL+G++ E QL AAGS++S DYNDVVTER+IA +CGYPLC N LPS Sbjct: 9 ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68 Query: 1976 ERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLNQILK 1797 +RPRKGRYRISLKEHKVYDL ETYMYCSS+C+INSR FAA+L++ER A + A+++ +L+ Sbjct: 69 DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128 Query: 1796 LFEGL-GTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRSDQK 1620 +FE G + + GK+ DLG S+LKI+EKTE G+VS+E+W GP NAI+GYV + ++K Sbjct: 129 
MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188 Query: 1619 INNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSVS-------- 1464 K + +L DM+F S IIT+DEYTVSKT S+ Sbjct: 189 PKELGSKSPKRGSKAN------NTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKV 242 Query: 1463 -------AKEARGK-------------------LTGKNVNCEIKP---VKKPAAKKEIRP 1371 AK+A G L ++V ++ + A++E Sbjct: 243 REQEEILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSARAEEESHD 302 Query: 1370 KKSDECLNATERDG-----DLNVSEDISSGSQSDNTRKGKT-----KLREGKESSSGA-N 1224 K+++C A+ + +S ++ + ++ G+ ++ + KE S N Sbjct: 303 DKAEKCTEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIEDMKEDPSVVEN 362 Query: 1223 GXXXXXXXXXXXKAVCSVTWADAKTDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEEVSG 1047 KA SV WAD K D ++ E RE+E K A + T E Sbjct: 363 KNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCNADTGEN--- 419 Query: 1046 EDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENAEIV 867 +D++R S E + +DA+SEAG+IILP P+ D + E + Sbjct: 420 DDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPMEEDDDD 479 Query: 866 ET-DPMQ--LKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSST 696 ET +P Q +KWP KP SW+D+PPE F+LTLSPF+ M+ ALF W +SST Sbjct: 480 ETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFTWTTSST 539 Query: 695 LAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPV 516 LAY+YG++ES +EEY VNGREYP KI DGRSSEIKQTLAG LARALP LV +LRL Sbjct: 540 LAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVADLRLST 599 Query: 515 PVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSRITSLTQYMLSRRALL 336 P+S LEQG+GRLLDTMSF + +P RMKQW VI+ LFL+ALSV R+ +LT +M+ RR L Sbjct: 600 PISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMMYRRVLF 659 Query: 335 PKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 K+L+ AQIS+EE+E+MKDL+IPLGR P FS QSG Sbjct: 660 HKVLDSAQISAEEYEVMKDLVIPLGRTPHFSAQSG 694 >ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] gi|462404075|gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica] Length = 711 Score = 503 bits (1294), Expect = e-139 Identities = 306/714 (42%), 
Positives = 411/714 (57%), Gaps = 72/714 (10%) Frame = -3 Query: 2156 LTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCANSLPS 1977 ++VKD ++KLQL LLEGI + L AGS+IS DYNDVVTERTIA +CGYPLC+N+LPS Sbjct: 13 ISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCSNALPS 72 Query: 1976 E--RPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLNQI 1803 + RP KG YRISLKEHKVYDL ETYMYCSS C+I S+AFA +L EER + K+ +I Sbjct: 73 DSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFGKVERI 132 Query: 1802 LKLFEGLGTDS-VVDMGKNGDLGLSELKIKEKTEREAGEVSMEEW--------------- 1671 L+ F +G D V G+ GDLG+S+LKI+EK E G++ + Sbjct: 133 LRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHIGDLGA 192 Query: 1670 IGPPNAIDGYVPRSDQ---KINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQD 1500 +GP NAI+GYVP+ ++ + ++ +S + D++ +M+F S IIT D Sbjct: 193 VGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMS--SGMDIIFNEMDFMSTIITSD 250 Query: 1499 EYTVSKTVPSVSA-------KEARGKLTGKNVNCEIKPVKKPAAKKEIRPKKSDECLNAT 1341 EY+VSK PSV K+++GK+ G N N +K ++ K KK D C+ Sbjct: 251 EYSVSKIPPSVGEPDFETKFKKSKGKV-GLNKNDSVKKSRQSKGGKNKNVKKDDVCIREV 309 Query: 1340 ERDGDLNVSEDISSGSQSDNTRK---------------------GKTKL----------- 1257 D S+ + +GS + + G KL Sbjct: 310 PSTSD--ASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRSVTWADEMI 367 Query: 1256 -----------REGKESSSGANGXXXXXXXXXXXKAVCSVTWADAKTDFD-GQNLEEFRE 1113 RE ++ ++ K CS TW D K D +N+ E RE Sbjct: 368 DSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKSKNICEVRE 427 Query: 1112 LEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAG 933 ++ + + E + SGE D S AVS AG Sbjct: 428 VQDADVLGSLDLQENEILESA----------EACAMALNQAAEAVASGESDVSGAVSGAG 477 Query: 932 VIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLT 753 +IILP PDG+D + E+ +++E++ L WP KP SW+D+PPEGF++T Sbjct: 478 IIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVT 536 Query: 752 LSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTL 573 LSPF+TM+ +LF WI+SSTLAY+YG++ESF+EE++SVNGREYP KI L GRSSEIK+TL Sbjct: 537 
LSPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTL 596 Query: 572 AGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDAL 393 ARALP +V+ELRLP P+S LEQG+GR+L+TMSF + IPA RMKQW VIV LFL+ L Sbjct: 597 DESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGL 656 Query: 392 SVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEFSTQSG 231 SV RI +LT +M +RR L K+LE QIS+E++E+MKDLIIPLGRAP+FS QSG Sbjct: 657 SVCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLGRAPQFSAQSG 710 >ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 662 Score = 497 bits (1279), Expect = e-137 Identities = 297/666 (44%), Positives = 393/666 (59%), Gaps = 21/666 (3%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ + +KD ++KLQL L EGI +E QL AAGSL+S DY DVVTER+IA++CGYPLC + Sbjct: 3 KNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLCHS 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPS+ R+GRYRISLKEHKVYDL+ETY YCSS CLINSRAF+ L++ER + NP KL Sbjct: 63 NLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDKLK 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 +ILKLFE + DS +MG N D GL +I+EK E GEV +EEW+GP NAI+GYVP Sbjct: 123 EILKLFENMSLDSKENMGNNCDSGL---EIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1628 DQKINNRXXXXXXXXXKHSLSD----PNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV-- 1467 D K+ S + +D S D + TS IIT +EY+VSK + Sbjct: 180 DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFS-DFSITSTIITDEEYSVSKISSGLKE 238 Query: 1466 -----SAKEARGKLTGKNVNCEIKPVKKPAAK--------KEIRPKKSDECLNATERDGD 1326 ++K G+ GK N + ++ P A ++ R K ++AT+ D Sbjct: 239 MALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTD 298 Query: 1325 LNVSEDISSGSQSDNTRKGKTKLREGKESSSGANGXXXXXXXXXXXKAVC-SVTWADAKT 1149 +S ++S N + R G SG K +C SVTWAD KT Sbjct: 299 NLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTE--LKSSLKKPGKKNLCRSVTWADEKT 356 Query: 1148 DFDG-QNLEEFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXS 972 D NL 
E E+ G+ T + + ED R S Sbjct: 357 DDASIMNLPEVGEM-GKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITS 415 Query: 971 GEYDASDAVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXX 792 G+ + SDAVSEAG+IILP P + S + E K K Sbjct: 416 GQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDPSD 474 Query: 791 SWYDSPPEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIF 612 SWYD+PPEGF+LTLS F+TM+MA+FAW++SS+LAY+YGK++ F+EE++ ++G+EYP KI Sbjct: 475 SWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIV 534 Query: 611 LQDGRSSEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMK 432 DGRSSEIKQTLAGCL RA+P L +EL L P+S LE G+ LLDTM+F + +PA RMK Sbjct: 535 SADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMK 594 Query: 431 QWHVIVFLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAP 252 QW VIV LF++ALSVSRI SL +M S R L K+L+ AQI S+E+EIM+D I+PLGR Sbjct: 595 QWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTA 654 Query: 251 EFSTQS 234 + S ++ Sbjct: 655 QLSDEN 660 >ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] Length = 632 Score = 485 bits (1249), Expect = e-134 Identities = 287/664 (43%), Positives = 386/664 (58%), Gaps = 19/664 (2%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ + +KD ++KLQL L EGI +E QL AAGSL+S DY DVVTER+IA++CGYPLC + Sbjct: 3 KNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLCHS 62 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 +LPS+ R+GRYRISLKEHKVYDL+ETY YCSS CLINSRAF+ L++ER + NP KL Sbjct: 63 NLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDKLK 122 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 +ILKLFE + DS +MG N D GL +I+EK E GEV +EEW+GP NAI+GYVP Sbjct: 123 EILKLFENMSLDSKENMGNNCDSGL---EIQEKIESNIGEVPIEEWMGPSNAIEGYVPHR 179 Query: 1628 DQKINNRXXXXXXXXXKHSLSD----PNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV-- 1467 D K+ S + +D S D +FTS IIT +EY+VSK + Sbjct: 180 
DHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFS-DFSFTSTIITDEEYSVSKISSGLKE 238 Query: 1466 -----SAKEARGKLTGKNVNCEIKPVKKPAAK--------KEIRPKKSDECLNATERDGD 1326 ++K G+ GK N + ++ P A ++ R K ++AT+ D Sbjct: 239 MALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTD 298 Query: 1325 LNVSEDISSGSQSDNTRKGKTKLREGKESSSGANGXXXXXXXXXXXKAVCSVTWADAKTD 1146 N+S D S S + +T E + A+ CS T ++ + Sbjct: 299 -NLS-DAPSTSNNRSTNFNLMTEEPRDEKTDDASIMNLPEVGEMGKTKECSRTTSNL-VN 355 Query: 1145 FDGQNLEEFRELEGEKAAIVIPTPHPTVEEVSGEDSYRXXXXXXXXXXXXXXXXXXXSGE 966 FD N + R E A+ + + G+ Sbjct: 356 FDNDNEDLLRVESAEACAMALSQAAKAITS----------------------------GQ 387 Query: 965 YDASDAVSEAGVIILPPPDGVDTAKSKENAEIVETDPMQLKWPLKPXXXXXXXXXXXXSW 786 + SDAVSEAG+IILP P + S + E K K SW Sbjct: 388 SEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDPSDSW 446 Query: 785 YDSPPEGFNLTLSPFSTMFMALFAWISSSTLAYVYGKEESFYEEYMSVNGREYPRKIFLQ 606 YD+PPEGF+LTLS F+TM+MA+FAW++SS+LAY+YGK++ F+EE++ ++G+EYP KI Sbjct: 447 YDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSA 506 Query: 605 DGRSSEIKQTLAGCLARALPTLVTELRLPVPVSILEQGLGRLLDTMSFTNPIPALRMKQW 426 DGRSSEIKQTLAGCL RA+P L +EL L P+S LE G+ LLDTM+F + +PA RMKQW Sbjct: 507 DGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQW 566 Query: 425 HVIVFLFLDALSVSRITSLTQYMLSRRALLPKILEGAQISSEEFEIMKDLIIPLGRAPEF 246 VIV LF++ALSVSRI SL +M S R L K+L+ AQI S+E+EIM+D I+PLGR + Sbjct: 567 QVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQL 626 Query: 245 STQS 234 S ++ Sbjct: 627 SDEN 630 >ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao] Length = 703 Score = 482 bits (1240), Expect = e-133 Identities = 301/670 (44%), Positives = 389/670 (58%), Gaps = 49/670 (7%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ ++V +A+HK+QL LL+GI DE+QLLA+GSLIS DY DVVTERTI+ CGYPLCAN Sbjct: 57 KEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLCAN 116 Query: 
1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 LPSE RKGRYRISLKEHKVYDLQETYM+CS+NCLINSRAFA +L+EER + N AKLN Sbjct: 117 PLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAKLN 176 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 IL LF L D D+GKNGDLG S L+IKE E +A +VS+ GP NAI+GYVP+ Sbjct: 177 DILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAIEGYVPQR 232 Query: 1628 D------QKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV 1467 + NN+ L + ++ +++F II DEY +SK P Sbjct: 233 ELISKPTPPKNNKNKVFDSSSS--KLGSKKEEYFVNNELDFAGTIIMNDEYIISKK-PGS 289 Query: 1466 SAKEARGKLTGKN---------------VNCEIKPVKKPAAKKEIRPKKSDECLNATERD 1332 + R KL+ K +N E K P+ K+ D L E Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQ---SCFDSNLKEVEEK 346 Query: 1331 GDLNVSED--ISSGSQSDNTRKG---------KTKLREGKESSSG-ANGXXXXXXXXXXX 1188 G SED + SGS S K K + G ++SS A Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406 Query: 1187 KAVCS--------------VTWADAK-TDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEE 1056 + V VTWAD K D G NL E +E+E K I + E+ Sbjct: 407 ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISG---SAED 463 Query: 1055 VSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENA 876 ++ R SG+ D +DAV E VD + E+ Sbjct: 464 GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCE-----------VDKEEPMEDG 512 Query: 875 EIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSST 696 +++E + +KWP KP SW+D+PPEGF+LTLS F+TM+ ALF WI+SS+ Sbjct: 513 DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 572 Query: 695 LAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPV 516 LAY+YG++ESF+EEY+S+NGREYPRKI L+DGRSSEIK+TLA C++RALP +VT+LRLP+ Sbjct: 573 LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 632 Query: 515 PVSILEQGLGRLLDTMSFTNPIPALRMKQWHVIVFLFLDALSVSRITSLTQYMLSRRALL 336 P+S LEQG+G L+DT+SF +PA RMKQW VIV LF+DALSV RI +LT +M + R LL Sbjct: 633 PISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLL 692 Query: 335 PKILEGAQIS 306 K+L+GAQIS Sbjct: 693 
HKVLDGAQIS 702 >ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao] gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao] Length = 708 Score = 453 bits (1165), Expect = e-124 Identities = 281/632 (44%), Positives = 365/632 (57%), Gaps = 49/632 (7%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ ++V +A+HK+QL LL+GI DE+QLLA+GSLIS DY DVVTERTI+ CGYPLCAN Sbjct: 57 KEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLCAN 116 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 LPSE RKGRYRISLKEHKVYDLQETYM+CS+NCLINSRAFA +L+EER + N AKLN Sbjct: 117 PLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAKLN 176 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 IL LF L D D+GKNGDLG S L+IKE E +A +VS+ GP NAI+GYVP+ Sbjct: 177 DILSLFGDLDLDD-NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVPQR 232 Query: 1628 D------QKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV 1467 + NN+ L + ++ +++F II DEY +SK P Sbjct: 233 ELISKPTPPKNNK--NKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK-PGS 289 Query: 1466 SAKEARGKLTGKN---------------VNCEIKPVKKPAAKKEIRPKKSDECLNATERD 1332 + R KL+ K +N E K P+ K+ D L E Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQ---SCFDSNLKEVEEK 346 Query: 1331 GDLNVSED--ISSGSQSDNTRKG---------KTKLREGKESSSG-ANGXXXXXXXXXXX 1188 G SED + SGS S K K + G ++SS A Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406 Query: 1187 KAVCS--------------VTWAD-AKTDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEE 1056 + V VTWAD K D G NL E +E+E K I + E+ Sbjct: 407 ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI---SGSAED 463 Query: 1055 VSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENA 876 ++ R SG+ D +DAV E G+IILP VD + E+ Sbjct: 464 GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDG 523 Query: 875 EIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSST 696 +++E + +KWP KP SW+D+PPEGF+LTLS F+TM+ ALF WI+SS+ Sbjct: 524 
DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 583 Query: 695 LAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPV 516 LAY+YG++ESF+EEY+S+NGREYPRKI L+DGRSSEIK+TLA C++RALP +VT+LRLP+ Sbjct: 584 LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 643 Query: 515 PVSILEQGLGRLLDTMSFTNPIPALRMKQWHV 420 P+S LEQG+G L+DT+SF +PA RMKQW + Sbjct: 644 PISTLEQGMGHLIDTISFMEALPAFRMKQWEI 675 >ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao] gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao] Length = 679 Score = 452 bits (1162), Expect = e-124 Identities = 281/630 (44%), Positives = 364/630 (57%), Gaps = 49/630 (7%) Frame = -3 Query: 2168 KDEILTVKDAIHKLQLFLLEGINDERQLLAAGSLISLRDYNDVVTERTIAEMCGYPLCAN 1989 K++ ++V +A+HK+QL LL+GI DE+QLLA+GSLIS DY DVVTERTI+ CGYPLCAN Sbjct: 57 KEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLCAN 116 Query: 1988 SLPSERPRKGRYRISLKEHKVYDLQETYMYCSSNCLINSRAFAANLEEERSAATNPAKLN 1809 LPSE RKGRYRISLKEHKVYDLQETYM+CS+NCLINSRAFA +L+EER + N AKLN Sbjct: 117 PLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAKLN 176 Query: 1808 QILKLFEGLGTDSVVDMGKNGDLGLSELKIKEKTEREAGEVSMEEWIGPPNAIDGYVPRS 1629 IL LF L D D+GKNGDLG S L+IKE E +A +VS+ GP NAI+GYVP+ Sbjct: 177 DILSLFGDLDLDD-NDLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVPQR 232 Query: 1628 D------QKINNRXXXXXXXXXKHSLSDPNAQDMLSFDMNFTSAIITQDEYTVSKTVPSV 1467 + NN+ L + ++ +++F II DEY +SK P Sbjct: 233 ELISKPTPPKNNK--NKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKK-PGS 289 Query: 1466 SAKEARGKLTGKN---------------VNCEIKPVKKPAAKKEIRPKKSDECLNATERD 1332 + R KL+ K +N E K P+ K+ D L E Sbjct: 290 FKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQ---SCFDSNLKEVEEK 346 Query: 1331 GDLNVSED--ISSGSQSDNTRKG---------KTKLREGKESSSG-ANGXXXXXXXXXXX 1188 G SED + SGS S K K + G ++SS A Sbjct: 347 GICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSS 406 Query: 1187 KAVCS--------------VTWAD-AKTDFDGQ-NLEEFRELEGEKAAIVIPTPHPTVEE 1056 + V VTWAD K D G NL E +E+E K I + E+ Sbjct: 407 
ETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI---SGSAED 463 Query: 1055 VSGEDSYRXXXXXXXXXXXXXXXXXXXSGEYDASDAVSEAGVIILPPPDGVDTAKSKENA 876 ++ R SG+ D +DAV E G+IILP VD + E+ Sbjct: 464 GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDG 523 Query: 875 EIVETDPMQLKWPLKPXXXXXXXXXXXXSWYDSPPEGFNLTLSPFSTMFMALFAWISSST 696 +++E + +KWP KP SW+D+PPEGF+LTLS F+TM+ ALF WI+SS+ Sbjct: 524 DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 583 Query: 695 LAYVYGKEESFYEEYMSVNGREYPRKIFLQDGRSSEIKQTLAGCLARALPTLVTELRLPV 516 LAY+YG++ESF+EEY+S+NGREYPRKI L+DGRSSEIK+TLA C++RALP +VT+LRLP+ Sbjct: 584 LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 643 Query: 515 PVSILEQGLGRLLDTMSFTNPIPALRMKQW 426 P+S LEQG+G L+DT+SF +PA RMKQW Sbjct: 644 PISTLEQGMGHLIDTISFMEALPAFRMKQW 673