BLASTX nr result
ID: Catharanthus22_contig00007276
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007276
         (1547 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_004241043.1| PREDICTED: parafibromin-like isoform 1 [Sola...  574   e-161
ref|XP_006350645.1| PREDICTED: parafibromin-like [Solanum tubero...  570   e-160
gb|AGH32907.1| RNA polymerase II accessory factor [Camellia olei...  533   e-149
ref|XP_004297112.1| PREDICTED: parafibromin-like [Fragaria vesca...  531   e-148
gb|EMJ23945.1| hypothetical protein PRUPE_ppa006499mg [Prunus pe...  521   e-145
ref|XP_002282888.1| PREDICTED: parafibromin-like [Vitis vinifera]    520   e-145
ref|XP_004134132.1| PREDICTED: parafibromin-like [Cucumis sativu...  518   e-144
ref|XP_002517109.1| conserved hypothetical protein [Ricinus comm...  513   e-143
ref|XP_004511395.1| PREDICTED: parafibromin-like [Cicer arietinum]   512   e-142
gb|EOY05726.1| PAF1 complex component isoform 1 [Theobroma cacao...  511   e-142
ref|XP_003610782.1| Parafibromin [Medicago truncatula] gi|217073...  508   e-141
gb|ESW29088.1| hypothetical protein PHAVU_002G042300g [Phaseolus...  505   e-140
gb|EPS68631.1| hypothetical protein M569_06137 [Genlisea aurea]      504   e-140
ref|XP_003537641.1| PREDICTED: parafibromin-like isoform X1 [Gly...  503   e-140
ref|XP_002315762.1| RNA pol 2 accessory factor Cdc73 family prot...  498   e-138
ref|XP_006406146.1| hypothetical protein EUTSA_v10020820mg [Eutr...  496   e-137
ref|NP_188898.1| protein PLANT HOMOLOGOUS TO PARAFIBROMIN [Arabi...  495   e-137
ref|XP_002883372.1| predicted protein [Arabidopsis lyrata subsp....  494   e-137
gb|EXB63474.1| hypothetical protein L484_005437 [Morus notabilis]    484   e-134
ref|XP_006297782.1| hypothetical protein CARUB_v10013819mg [Caps...  484   e-134

>ref|XP_004241043.1| PREDICTED: parafibromin-like isoform 1 [Solanum lycopersicum]
 gi|460390863|ref|XP_004241044.1| PREDICTED: parafibromin-like isoform 2 [Solanum lycopersicum]
          Length = 393

 Score = 574 bits (1479), Expect = e-161
 Identities = 299/402 (74%), Positives = 333/402 (82%), Gaps = 12/402 (2%)
 Frame = +3

Query: 63   MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKH--DTRYTLETLVHF 236
            MDPLT LREYTIRNDL KI+RIGD++RFGNDYTFP TIETAY SKH    RYTLETL++F
Sbjct: 1    MDPLTLLREYTIRNDLHKIVRIGDDYRFGNDYTFPCTIETAYRSKHVQANRYTLETLINF 60

Query: 237  ITNQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVG-----P 401
            ITN HLKH +YI+ + SLRIPAVT PDRK LLDYLTGK +SSDSI+F  K P       P
Sbjct: 61   ITNHHLKHTEYIQQSRSLRIPAVTLPDRKPLLDYLTGKTASSDSIEFL-KFPQSNDTSVP 119

Query: 402  IQNDLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRDFYA 581
            +     V    E  + D   L N N       PIE IKA E+PLKDRE IL CKNRDFY+
Sbjct: 120  VSVSAGVTGNEENVMSDVRVLENQN-------PIELIKAAEKPLKDREAILFCKNRDFYS 172

Query: 582  LLTAATRRDEERQKAEALQRKDNLVAKNRIERG---GEELGSGFDGA-KAKLHLKGSKIG 749
            + TAA RRDEER +AE+LQRKD LVAKNRI+RG   G+E+G  +DG  KAK+HLKGSKIG
Sbjct: 173  VFTAALRRDEERHRAESLQRKDGLVAKNRIDRGYGGGDEIG--YDGGPKAKMHLKGSKIG 230

Query: 750  EGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRDRVV 926
            EGVPIILVPSAFSTLITIYNVK+FLEDG+FIPTDVK+KQMK  KPDC+TVQKKFSRDRVV
Sbjct: 231  EGVPIILVPSAFSTLITIYNVKDFLEDGVFIPTDVKLKQMKGSKPDCITVQKKFSRDRVV 290

Query: 927  TAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSVE 1106
            TAYEVRDKPSALK +DWDRVVAVFVLGKDWQFKDWPFKDHVE FN+++GF+LRFEDDSVE
Sbjct: 291  TAYEVRDKPSALKPEDWDRVVAVFVLGKDWQFKDWPFKDHVETFNRVVGFFLRFEDDSVE 350

Query: 1107 SARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232
            SA+ VKQWNVKIISISKNKRHQDRAAALEVW++LEEFMRSRS
Sbjct: 351  SAKTVKQWNVKIISISKNKRHQDRAAALEVWEKLEEFMRSRS 392

>ref|XP_006350645.1| PREDICTED: parafibromin-like [Solanum tuberosum]
          Length = 393

 Score = 570 bits (1470), Expect
= e-160 Identities = 297/402 (73%), Positives = 333/402 (82%), Gaps = 12/402 (2%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKH--DTRYTLETLVHF 236 MDPLT LREYTIRN+L KI+RIGD++RFGNDYTFP TIETAY SKH +YTLETL++F Sbjct: 1 MDPLTLLREYTIRNELHKIVRIGDDYRFGNDYTFPCTIETAYRSKHVQANQYTLETLINF 60 Query: 237 ITNQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVG-----P 401 ITN HLKH +YI+ + SLRIPAVT PDRK LLDYLTGK +SSDSI+F K P P Sbjct: 61 ITNHHLKHTEYIQQSRSLRIPAVTLPDRKPLLDYLTGKTASSDSIEFL-KFPQSNDTTVP 119 Query: 402 IQNDLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRDFYA 581 + V E + D L N N PIE IKA E+PLKDRE IL CKNRDFY+ Sbjct: 120 VSVSAGVTGNEENVLGDVRVLENQN-------PIELIKAAEKPLKDREAILFCKNRDFYS 172 Query: 582 LLTAATRRDEERQKAEALQRKDNLVAKNRIERG---GEELGSGFDGA-KAKLHLKGSKIG 749 + TAA RRDEER +AE+LQRKD LVAKNRI+RG G+E+G +DG KAK+HLKGSKIG Sbjct: 173 VFTAALRRDEERHRAESLQRKDGLVAKNRIDRGYGGGDEIG--YDGGPKAKMHLKGSKIG 230 Query: 750 EGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRDRVV 926 EGVPIILVPSAFSTLITIYNVK+FLEDG+FIPTDVK+KQMK KPDC+TVQKKFSRDRVV Sbjct: 231 EGVPIILVPSAFSTLITIYNVKDFLEDGVFIPTDVKLKQMKGSKPDCITVQKKFSRDRVV 290 Query: 927 TAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSVE 1106 TAYEVRDKPSALK +DWDRVVAVFVLGKDWQFKDWPFKDHVE FN+++GF+LRFEDDSVE Sbjct: 291 TAYEVRDKPSALKPEDWDRVVAVFVLGKDWQFKDWPFKDHVETFNRVVGFFLRFEDDSVE 350 Query: 1107 SARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 SA+ VKQWNVKIISISKNKRHQDRAAALEVW++LEEFMRSRS Sbjct: 351 SAKTVKQWNVKIISISKNKRHQDRAAALEVWEKLEEFMRSRS 392 >gb|AGH32907.1| RNA polymerase II accessory factor [Camellia oleifera] Length = 401 Score = 533 bits (1374), Expect = e-149 Identities = 282/410 (68%), Positives = 324/410 (79%), Gaps = 19/410 (4%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIRNDL KI+RIGDEFRFG DY+FP + TAY SK + Y+LETL+ F+ Sbjct: 1 MDPLSALRDFTIRNDLDKIVRIGDEFRFGGDYSFPCGVATAYRSKQGSLYSLETLISFVK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH DY+ 
NA S +PAVTF DRK LLDYL GK+SSSDSI F P P Sbjct: 61 NHHLKHTDYMHNARSHNLPAVTFIDRKPLLDYLQGKVSSSDSIQFLA--PQNPKFTSDEY 118 Query: 423 LPPREGEIEDKSFL-------NNLNVPEEVIVPIE-----WIKATERPLKDREQILLCKN 566 P ED S + N+ +V +E+ + I+A ERPLKDRE +L C+N Sbjct: 119 RP------EDPSLIQITPNDDNDFDVNDEIGARVSDNYMAMIRAMERPLKDRETMLECRN 172 Query: 567 RDFYALLTAATRRDEERQKAEALQRKDNLVAKNRIERG-----GEELGSGFDGA-KAKLH 728 R+FY +LTAAT+RDEERQ+ E+ QRKD LVAKNR+ RG G+E+G +D K K+ Sbjct: 173 RNFYVVLTAATKRDEERQRLESQQRKDGLVAKNRLMRGDERGFGDEMG--YDSTPKPKML 230 Query: 729 LKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKK 905 +KGSKIGEGVPIILVPSAF TLITIYNVKEFLEDG+FIPTDVKVKQMK KP+CVTVQKK Sbjct: 231 MKGSKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVFIPTDVKVKQMKGPKPECVTVQKK 290 Query: 906 FSRDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLR 1085 FSRDR+VTAYEVRDKPS LKA+DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKILGF++R Sbjct: 291 FSRDRLVTAYEVRDKPSVLKAEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKILGFFMR 350 Query: 1086 FEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRSR 1235 FEDDSVESA++VKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRSR Sbjct: 351 FEDDSVESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRSR 400 >ref|XP_004297112.1| PREDICTED: parafibromin-like [Fragaria vesca subsp. 
vesca] Length = 414 Score = 531 bits (1368), Expect = e-148 Identities = 280/416 (67%), Positives = 332/416 (79%), Gaps = 26/416 (6%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR +L KI+R+ DE R G+DY+FP + ETAY SK YTLETL+H++ Sbjct: 1 MDPLSALRDFTIRGELDKIVRVNDELRLGSDYSFPCSAETAYRSKQGNLYTLETLLHYVN 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTT----KVPVGPIQN 410 N HLKH +Y+ NA + IP VTFPDRK LLDYLTGKISSSDSI+F KVP P+ N Sbjct: 61 NHHLKHTEYLINARAQMIPCVTFPDRKPLLDYLTGKISSSDSIEFVLPQNPKVPDLPLHN 120 Query: 411 DLNVLPPREGEI-------EDKSFLNNLNVPEEVIVPIEW---IKATERPLKDREQILLC 560 N P E ++ ++ + +N V +EV P+++ I +ERPLKDRE++L C Sbjct: 121 --NDFPFSENDVARHHTPDQNHNNINGFTVLKEVEAPVDYMSLIYGSERPLKDREELLEC 178 Query: 561 KNRDFYALLTAATRRDEERQKAEALQRKDNLVAKNRI----ERG----GEELGSGFDGA- 713 K R+FY +LTAAT+R+EERQ+ E+ QRKD LVAK+R+ +RG G+E+G +D A Sbjct: 179 KGRNFYGVLTAATKREEERQRIESQQRKDGLVAKSRLMGSDDRGMAGYGDEMG--YDQAP 236 Query: 714 KAKLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCV 890 K K+HLKG KIGEGVPIILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK KPDCV Sbjct: 237 KPKMHLKGGKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGAKPDCV 296 Query: 891 TVQKKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNK 1064 TVQKKFSRDR VVTAYEVRDKPSALK +DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNK Sbjct: 297 TVQKKFSRDRDRVVTAYEVRDKPSALKTEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNK 356 Query: 1065 ILGFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 I+GF++RFEDDSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 357 IMGFFMRFEDDSVESAKIVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 412 >gb|EMJ23945.1| hypothetical protein PRUPE_ppa006499mg [Prunus persica] Length = 409 Score = 521 bits (1342), Expect = e-145 Identities = 276/407 (67%), Positives = 322/407 (79%), Gaps = 17/407 (4%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR +L KI+R+ DEFRF DY+FP ETAY SK YTLETL++++T Sbjct: 1 MDPLSALRDFTIRGELEKIVRVNDEFRFDTDYSFPCHAETAYRSKQGNLYTLETLLYYVT 60 
Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH DYI++A + IP+VTFPDRK LLDYLTGKISSSDSI+F + L Sbjct: 61 NHHLKHTDYIQSARTQGIPSVTFPDRKPLLDYLTGKISSSDSIEFLLPPQNDAVHPKLPS 120 Query: 423 LPPR--EGEIEDKSFLNNLN--VPEEVIVPIEWIK---ATERPLKDREQILLCKNRDFYA 581 L P G D + + V ++ P++++ + ERPLKDRE +L CK R+FY Sbjct: 121 LDPNVNSGINNDSNDYGTTDSRVFSQIETPVDYMSLICSGERPLKDREGLLECKGRNFYG 180 Query: 582 LLTAATRRDEERQKAEALQRKDNLVAKNRI----ERGGEELG--SGFD-GAKAKLHLKGS 740 +LT+AT+R+EERQ+ E+ QRKD LVAK+R+ ERG G SG+D K KLHLKG Sbjct: 181 VLTSATKREEERQRIESQQRKDGLVAKSRLMGSDERGLTGFGDESGYDPNPKPKLHLKGG 240 Query: 741 KIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRD 917 KIGEGVPIILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK KPDCVTVQKKFSRD Sbjct: 241 KIGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGAKPDCVTVQKKFSRD 300 Query: 918 R--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFE 1091 R VVTAYEVRDKPSALKA+DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+GF++RFE Sbjct: 301 RDRVVTAYEVRDKPSALKAEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIVGFFMRFE 360 Query: 1092 DDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 DDSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 361 DDSVESAKIVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 407 >ref|XP_002282888.1| PREDICTED: parafibromin-like [Vitis vinifera] Length = 413 Score = 520 bits (1339), Expect = e-145 Identities = 274/415 (66%), Positives = 324/415 (78%), Gaps = 25/415 (6%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++T+R +L KI+R+GDEFRFG+DYTFP + ETAY SK YTLETLV+++ Sbjct: 1 MDPLSALRDFTVRGELDKIVRVGDEFRFGSDYTFPCSAETAYRSKQGNLYTLETLVYYVK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N H+KH +Y+++A + RIPAVT PDRK LL+YL GK++S+D+I+F VP P D+ V Sbjct: 61 NHHIKHTEYLQSARTQRIPAVTLPDRKPLLEYLQGKVASTDAIEFV--VPQNPKIPDIGV 118 Query: 423 LPPREGEIEDKSFLNNLNVPE-------------EVIVPIEWIKATERPLKDREQILLCK 563 E ED + L + P + + I I+A+ERPLKDRE +L CK Sbjct: 119 
DAVDEYRPEDPTLLAIRDPPGSEDALDNSRVRGFDNVDYISMIRASERPLKDRESLLECK 178 Query: 564 NRDFYALLTAATRRDEERQKAEALQRKDNLVAKNRI----ERG------GEELGSGFDGA 713 RDFY++L A+TRR+EER + E+ QRKD LVAK+R+ ERG G+ELG +DG Sbjct: 179 QRDFYSVLMASTRREEERHRLESHQRKDGLVAKSRLMGADERGLGFWKDGDELG--YDGT 236 Query: 714 -KAKLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDC 887 K K+ L SKIGEGVPIILVPSAF TLITIYNVKEFLEDG+FIPTDVK KQMK KPDC Sbjct: 237 PKPKMLLNRSKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVFIPTDVKAKQMKGAKPDC 296 Query: 888 VTVQKKFSRDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI 1067 VTVQKKFSRDRVV AYEVRDKPSALK +DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI Sbjct: 297 VTVQKKFSRDRVVMAYEVRDKPSALKTEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKI 356 Query: 1068 LGFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 +GFY+RFEDDSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 357 IGFYMRFEDDSVESAKIVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 411 >ref|XP_004134132.1| PREDICTED: parafibromin-like [Cucumis sativus] gi|449513423|ref|XP_004164322.1| PREDICTED: parafibromin-like [Cucumis sativus] Length = 407 Score = 518 bits (1333), Expect = e-144 Identities = 274/413 (66%), Positives = 327/413 (79%), Gaps = 23/413 (5%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR +L KI+R+ DEFRF +DY+FP ++ETAY SK YTLETLV++I Sbjct: 1 MDPLSALRDFTIRGELDKIVRVNDEFRFASDYSFPCSVETAYRSKQGNLYTLETLVYYIK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N H+KH +Y++NA + I +VTFPDRK LLDYLTGK+SSSD+I+F VP P DL Sbjct: 61 NHHVKHTEYLQNARTQGITSVTFPDRKPLLDYLTGKVSSSDAIEFL--VPQNPKFPDLPS 118 Query: 423 LP---PREGEI---------EDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKN 566 + P + I ED F ++ NV + I+A ERPLKDRE +L CKN Sbjct: 119 VDEYRPEDPVIVGAAMDAVDEDDGFKDSTNVDYMTM-----IRAIERPLKDRESLLECKN 173 Query: 567 RDFYALLTAATRRDEERQKAEALQRKDNLVAKNRI----ERG----GEELGSGFDGAKAK 722 R+FY +L +T+R+EERQ+ E+ QRKD LVAK+R+ +RG G++LG + K K Sbjct: 174 RNFYNVLVMSTKREEERQRLESQQRKDGLVAKSRLMGSDDRGLVGYGDDLGYDAN-PKPK 232 Query: 723 
LHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQ 899 +HLKG KIGEGVPIILVPSAF TLITIYNVKEFLEDG+FIPTDVKVKQMK +PDCVTVQ Sbjct: 233 MHLKGGKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVFIPTDVKVKQMKGARPDCVTVQ 292 Query: 900 KKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILG 1073 KKFSRDR VVTAYEVRDKPSALK++DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+G Sbjct: 293 KKFSRDRDRVVTAYEVRDKPSALKSEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIIG 352 Query: 1074 FYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 FY+RFEDDS+ESA++VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 353 FYMRFEDDSLESAKNVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 405 >ref|XP_002517109.1| conserved hypothetical protein [Ricinus communis] gi|223543744|gb|EEF45272.1| conserved hypothetical protein [Ricinus communis] Length = 409 Score = 513 bits (1322), Expect = e-143 Identities = 268/409 (65%), Positives = 320/409 (78%), Gaps = 19/409 (4%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++T+RND+ KI+RI DEFRF N+YTFP I+TAY SK YTLETLV++I Sbjct: 1 MDPLSALRDFTMRNDVDKIVRINDEFRFSNEYTFPCNIKTAYRSKQGNLYTLETLVYYIQ 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGP---IQND 413 N HLK DY+++A + +PA+TF DRK L DYLTGK+SS+DSI F P + ND Sbjct: 61 NSHLKFTDYLQHARAAGLPAITFIDRKPLYDYLTGKVSSTDSIVFPLPQNPNPNLDLDND 120 Query: 414 LNVLPPREGEIEDKSFLNNL-------NVPEEVIVPIEWIKATERPLKDREQILLCKNRD 572 LN + I + S ++ NV E+ ++ I I + ERP+KDRE +L CK +D Sbjct: 121 LNSNAVLDSTINNNSADADVASGGGGNNVKEDNLISI--IYSMERPIKDREALLECKTKD 178 Query: 573 FYALLTAATRRDEERQKAEALQRKDNLVAKNRIER------GGEELGSGFDGAKAKLHLK 734 FY++L A+TRR+EERQ+ E+ QRKD LVAK+R+ GG+E+G + LHLK Sbjct: 179 FYSVLVASTRREEERQRIESQQRKDGLVAKSRLMGSEDRGYGGDEMGYDANSKPKMLHLK 238 Query: 735 GSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFS 911 G K GEGVPIILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK KPDCVTVQKKFS Sbjct: 239 GGKFGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGAKPDCVTVQKKFS 298 Query: 912 --RDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLR 1085 
R+RV+TAYEVRDKPSALKA+DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+GF++R Sbjct: 299 TDRNRVMTAYEVRDKPSALKAEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIIGFFMR 358 Query: 1086 FEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 FEDDSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 359 FEDDSVESAKTVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 407 >ref|XP_004511395.1| PREDICTED: parafibromin-like [Cicer arietinum] Length = 399 Score = 512 bits (1318), Expect = e-142 Identities = 263/409 (64%), Positives = 312/409 (76%), Gaps = 19/409 (4%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ LR++T+R DL KI+RI +FRFG++YTFP+++ETAY S RYTLETLVH+I Sbjct: 8 MDPLSLLRDFTMRGDLDKIVRINGDFRFGDEYTFPSSLETAYRSTKGNRYTLETLVHYIK 67 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH +Y +N +L IP+VT PDRK +L+YL G +S++DSI++ Sbjct: 68 NHHLKHTEYFQNTLALSIPSVTLPDRKPILNYLQGILSTTDSIEYL-------------- 113 Query: 423 LPPREGEIEDKSFLNNLNVPEEVIVP---------------IEWIKATERPLKDREQILL 557 P E +ED S L N + ++P I I+ E+PLKDRE +L Sbjct: 114 --PEEPSLEDPSSLYNQQHQQSSLIPQSNEAVVVEDPPLDFISMIRTVEKPLKDRESLLE 171 Query: 558 CKNRDFYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGS--GFDGA-KAKLH 728 CKNRDFY +L AAT+R+ ERQ+ E+ QRKD LVAK+RI G ++ G G+D K K+H Sbjct: 172 CKNRDFYGVLVAATKREVERQRMESHQRKDGLVAKSRIMGGSDDFGDELGYDATPKPKMH 231 Query: 729 LKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKK 905 LK IGEGVPIILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK +PDCVTVQKK Sbjct: 232 LK---IGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGARPDCVTVQKK 288 Query: 906 FSRDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLR 1085 SRDRVVTAYEVRDKPSALK +DWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI GF++R Sbjct: 289 LSRDRVVTAYEVRDKPSALKPEDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKITGFFMR 348 Query: 1086 FEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 FEDDS+ESA+HVKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 349 FEDDSIESAKHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 397 >gb|EOY05726.1| PAF1 complex component isoform 1 [Theobroma cacao] 
gi|508713830|gb|EOY05727.1| PAF1 complex component isoform 1 [Theobroma cacao] Length = 413 Score = 511 bits (1317), Expect = e-142 Identities = 275/413 (66%), Positives = 320/413 (77%), Gaps = 23/413 (5%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR +L KI+R+ DEFRFG DY+FP + ETAY SK YTLETLV +I Sbjct: 1 MDPLSALRDFTIRGELDKIVRVNDEFRFGTDYSFPCSGETAYRSKQGNLYTLETLVFYIQ 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSI-----DFTTKVPVGPIQ 407 N HLKH DY+ N+ SLRIPAVTF DRK LLDYLTGK+S+SDSI F + P Sbjct: 61 NHHLKHTDYMHNSLSLRIPAVTFTDRKPLLDYLTGKVSTSDSIVWNPPKFPDEFRPDPSG 120 Query: 408 NDLNVLPPREG-------EIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKN 566 D + P+ EI D F + + E+ + I++ E+PLKDRE IL CKN Sbjct: 121 FDPDSSKPKGNTNDVVLDEIGDIHF-DIKDKETELADYMGIIRSIEKPLKDREGILECKN 179 Query: 567 RDFYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGSGFD--------GAKAK 722 RDFY++L A+T+R+EERQ+ E+ QRKD LVAK+R+ G EE G +K K Sbjct: 180 RDFYSVLVASTKREEERQRLESQQRKDGLVAKSRL-MGAEERRLGLSYGDEMVGYDSKPK 238 Query: 723 LHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQ 899 +HLKGSKIGEGVPIILVPSAF TLITIYNVKEFLEDG+F+PTDVKVKQMK +P+CVTVQ Sbjct: 239 MHLKGSKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVFVPTDVKVKQMKGARPECVTVQ 298 Query: 900 KKFS--RDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILG 1073 KKFS RDRVVTAYEVRDKPSALK +DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+G Sbjct: 299 KKFSRDRDRVVTAYEVRDKPSALKPEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIIG 358 Query: 1074 FYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 F++RFEDDSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 359 FFMRFEDDSVESAKIVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 411 >ref|XP_003610782.1| Parafibromin [Medicago truncatula] gi|217073460|gb|ACJ85089.1| unknown [Medicago truncatula] gi|355512117|gb|AES93740.1| Parafibromin [Medicago truncatula] gi|388521181|gb|AFK48652.1| unknown [Medicago truncatula] Length = 398 Score = 508 bits (1308), Expect = e-141 Identities = 260/400 (65%), Positives = 310/400 (77%), Gaps = 10/400 (2%) 
Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPLT LR++TIR DL KI+R+ FRFG DYTFP ++ETAY S RYTLETLVH+I Sbjct: 8 MDPLTLLRDFTIRGDLDKIVRLNGNFRFGEDYTFPCSLETAYRSTKGNRYTLETLVHYIK 67 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH +Y +N +L IP+VT PDRK +L+YL G +S++DSI++ + P P + + Sbjct: 68 NHHLKHTEYFQNTLALGIPSVTLPDRKPILNYLQGILSTTDSIEYLPEQPSIPDEPSSHQ 127 Query: 423 LPPREGEIEDKSFLNNLNVPEEVIVP----IEWIKATERPLKDREQILLCKNRDFYALLT 590 + F N+ + E+ P I I+ E+PLKDRE +L CKNRDFY++L Sbjct: 128 --------QHSQFPNSDEIITELESPPLDFISMIRTAEKPLKDRESLLECKNRDFYSVLV 179 Query: 591 AATRRDEERQKAEALQRKDNLVAKNRI-----ERGGEELGSGFDGAKAKLHLKGSKIGEG 755 AAT+R+EERQ+AE+ QRKD LVAK+R+ + GG+E+G K K+HLK IGEG Sbjct: 180 AATKREEERQRAESHQRKDGLVAKSRLLGSADDFGGDEMGYDHQTPKPKMHLK---IGEG 236 Query: 756 VPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRDRVVTA 932 VPIILVPSAF TLITIYNVK+FLEDG+++PTDVKVK MK KPDCVTVQKK SRDR VTA Sbjct: 237 VPIILVPSAFQTLITIYNVKDFLEDGVYVPTDVKVKAMKGAKPDCVTVQKKLSRDRAVTA 296 Query: 933 YEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSVESA 1112 YEVRDKPSALK +DWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI GF++RFEDDS+ESA Sbjct: 297 YEVRDKPSALKPEDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKITGFFMRFEDDSIESA 356 Query: 1113 RHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 + VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 357 KTVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 396 >gb|ESW29088.1| hypothetical protein PHAVU_002G042300g [Phaseolus vulgaris] Length = 392 Score = 505 bits (1300), Expect = e-140 Identities = 255/399 (63%), Positives = 320/399 (80%), Gaps = 8/399 (2%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALRE+T+R ++ KI+R+ +EFRFG +YTFP +ETA+ S RYTLETLVH+I Sbjct: 1 MDPLSALREFTMRGEVEKIVRLNNEFRFGEEYTFPCWVETAFRSTKGNRYTLETLVHYIK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH +YI+N ++ IP+VT PDRK LL YL G ++S+DSI++ P P + Sbjct: 61 
NHHLKHTEYIQNTFAVGIPSVTLPDRKPLLHYLQGTLASTDSIEYR---PEDPSFAPKST 117 Query: 423 LPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRDFYALLTAATR 602 L P + + + + + P+++ + I I + ERPLKDR+ +L CKNRDFY++L AAT+ Sbjct: 118 LLPSQAQAQAQP----QDQPDKLDL-ISLITSVERPLKDRQALLECKNRDFYSVLVAATK 172 Query: 603 RDEERQKAEALQRKDNLVAKNRI----ERG---GEELGSGFDGAKAKLHLKGSKIGEGVP 761 R+E+RQ+ E+ QRKD LVAK+R+ +RG +++G K K+HLKG+KIGEGVP Sbjct: 173 REEDRQRMESQQRKDGLVAKSRLMAADDRGLGFSDDMGGYDPTPKPKMHLKGTKIGEGVP 232 Query: 762 IILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRDRVVTAYE 938 IILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK +PDCVTVQKK SRDRVVTAYE Sbjct: 233 IILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGARPDCVTVQKKLSRDRVVTAYE 292 Query: 939 VRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSVESARH 1118 VRDKPS LK DDWDRVVAVFVLGK+WQFK+WPFKDHVEIFNKI+GF++RFEDDS+ESA++ Sbjct: 293 VRDKPSTLKPDDWDRVVAVFVLGKEWQFKEWPFKDHVEIFNKIIGFFMRFEDDSLESAKN 352 Query: 1119 VKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRSR 1235 VKQWNVKIISISKNKRHQDRAAAL+VW+RLEEF+R+RSR Sbjct: 353 VKQWNVKIISISKNKRHQDRAAALDVWERLEEFVRARSR 391 >gb|EPS68631.1| hypothetical protein M569_06137 [Genlisea aurea] Length = 384 Score = 504 bits (1299), Expect = e-140 Identities = 278/404 (68%), Positives = 318/404 (78%), Gaps = 13/404 (3%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKH---DTRYTLETLVH 233 MDPLT LREYT +N L KI+R+GDEFRFG DY+FPA IETAY SKH + RYTLETLVH Sbjct: 1 MDPLTLLREYTTKNSLNKIVRVGDEFRFGTDYSFPAAIETAYCSKHASQNRRYTLETLVH 60 Query: 234 FITNQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQND 413 FIT+Q LKH DYI+NA +LRIPAVT PDRK LL+YLTGKI +SDSI VP+ D Sbjct: 61 FITSQDLKHTDYIQNARALRIPAVTLPDRKSLLEYLTGKIQTSDSI-----VPL-----D 110 Query: 414 LNVLPPREGEIEDKSFLNNL---NVPEEVIVPIEWIKATERPLKDREQILLCKNRDFYAL 584 L + P D++ L+ +V P+E I A ERPLKDR+ +L K RDF+++ Sbjct: 111 LPAVNPNL----DRAALHEPTAESVSSGEANPMELIMAKERPLKDRKAMLSFKKRDFFSV 166 Query: 585 LTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGSGFDGAKAKLHLKGSKIGEGVPI 764 LTAA +RDEERQK EALQRKDNLVAKNRIE G GF G + + K KIGEGVPI Sbjct: 
167 LTAALKRDEERQKMEALQRKDNLVAKNRIESRG-----GFPGGE-EAATKVRKIGEGVPI 220 Query: 765 ILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFS-RDRVVTA-- 932 ILVPSAF+TLITIYNVKEFLEDG+FIPT+VK+KQM+ KPDCVTVQKKFS RDRV A Sbjct: 221 ILVPSAFTTLITIYNVKEFLEDGVFIPTEVKLKQMQGQKPDCVTVQKKFSSRDRVAAAAA 280 Query: 933 YEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSV--- 1103 YEVRDKPS+LK+DDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFY+RF DDS+ Sbjct: 281 YEVRDKPSSLKSDDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYMRFGDDSMESS 340 Query: 1104 ESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRSR 1235 ES++ +KQWNVKIIS+SKNKRHQDR AALEVW++LEEFMRSR R Sbjct: 341 ESSKAIKQWNVKIISLSKNKRHQDRTAALEVWEKLEEFMRSRLR 384 >ref|XP_003537641.1| PREDICTED: parafibromin-like isoform X1 [Glycine max] gi|571486641|ref|XP_006590411.1| PREDICTED: parafibromin-like isoform X2 [Glycine max] gi|571486643|ref|XP_006590412.1| PREDICTED: parafibromin-like isoform X3 [Glycine max] Length = 389 Score = 503 bits (1296), Expect = e-140 Identities = 258/401 (64%), Positives = 313/401 (78%), Gaps = 11/401 (2%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALRE+T+R ++ KI+R+ EFRFG +YTFP +ETAY S RYTLETLVH+I Sbjct: 1 MDPLSALREFTMRGEVEKIVRVNAEFRFGEEYTFPCWVETAYRSTKGNRYTLETLVHYIQ 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N HLKH +YI+N ++ IP+VT PDRK LL YL G +SSSDSI++ +D + Sbjct: 61 NHHLKHTEYIQNTFAVGIPSVTLPDRKPLLQYLQGTLSSSDSIEYRP-------HDDPSS 113 Query: 423 LPPREGEIEDKSFLNNLNVPEEVIVP--IEWIKATERPLKDREQILLCKNRDFYALLTAA 596 P KS N ++P E + I I++ E+PLKDR+ +L CKNRDFY++L +A Sbjct: 114 FPA------PKSTPNPPSLPPEDLNLDFISMIRSAEKPLKDRQSLLECKNRDFYSVLVSA 167 Query: 597 TRRDEERQKAEALQRKDNLVAKNRIERGGEELGSGFDG--------AKAKLHLKGSKIGE 752 T+R+EERQ+ E+ QRKD LVAK+R+ G ++ G GF K K+HLKG+KIGE Sbjct: 168 TKREEERQRMESHQRKDGLVAKSRL-MGSDDRGLGFSDDMGGYDPTPKPKMHLKGTKIGE 226 Query: 753 GVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFSRDRVVT 929 GVPIILVPSAF TLITIYNVKEFLEDG++IPTDVKVKQMK +PDCVTVQKK SRDRVVT Sbjct: 227 
GVPIILVPSAFQTLITIYNVKEFLEDGVYIPTDVKVKQMKGARPDCVTVQKKLSRDRVVT 286 Query: 930 AYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFEDDSVES 1109 AYEVRDKPS LK DDWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+GF++RFEDDS+ES Sbjct: 287 AYEVRDKPSTLKPDDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIIGFFMRFEDDSLES 346 Query: 1110 ARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 + VKQWNVKIISISKNKRHQDRAAAL+VW+RLE+F+R+RS Sbjct: 347 CKTVKQWNVKIISISKNKRHQDRAAALDVWERLEDFVRARS 387 >ref|XP_002315762.1| RNA pol 2 accessory factor Cdc73 family protein [Populus trichocarpa] gi|222864802|gb|EEF01933.1| RNA pol 2 accessory factor Cdc73 family protein [Populus trichocarpa] Length = 405 Score = 498 bits (1282), Expect = e-138 Identities = 266/406 (65%), Positives = 315/406 (77%), Gaps = 16/406 (3%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR DL KI+RI DEFRFGN+YTFP + +TAY SK YTLETLV+ I Sbjct: 1 MDPLSALRDFTIRGDLDKIIRINDEFRFGNEYTFPCSTKTAYRSKQGNLYTLETLVYCIQ 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTTKVPVGPIQNDLNV 422 N +K +Y+++A +L IP VT+ D K + +YL+GK+SS+DSI F +P +LN Sbjct: 61 NTKIKFTNYLQDALALGIPPVTYIDWKPVKEYLSGKLSSTDSIVFP--LPQESQNPNLNY 118 Query: 423 LPPR----EGEIEDKSFLNNLNVPEEVIVP-IEWIKATERPLKDREQILLCKNRDFYALL 587 P + I+D + + +N E + + I A ERPLKDRE +L CKNRDFY +L Sbjct: 119 RPDDPMLLDSRIDDSAAADKVNNGNEGVENHVSLIYANERPLKDRESLLECKNRDFYGVL 178 Query: 588 TAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGSGFDG--------AKAKLHLKGSK 743 A+TRR+EER K E+ QRKD LVAK+R+ G +E G G+ G AK K+H KG K Sbjct: 179 VASTRREEERHKFESQQRKDGLVAKSRL-MGTDERGIGYGGDELGYDSAAKPKMHSKGGK 237 Query: 744 IGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQKKFS--R 914 IGEGVPIILVPSAF TLITIYNVKEFLEDGIFIPTDVK KQMK KP+CVTVQKKFS R Sbjct: 238 IGEGVPIILVPSAFQTLITIYNVKEFLEDGIFIPTDVKAKQMKGPKPECVTVQKKFSTDR 297 Query: 915 DRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKILGFYLRFED 1094 +RV+TAYEVRDKPSALK DDWDRVVAVFVLGK+WQFKDWPFKDHVEIFNKI+GF++RFED Sbjct: 298 NRVMTAYEVRDKPSALKGDDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKIIGFFMRFED 357 
Query: 1095 DSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 DSVESA+ VKQWNVKIISISKNKRHQDRAAALEVWDRLEEF+RS+S Sbjct: 358 DSVESAKIVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFVRSQS 403 >ref|XP_006406146.1| hypothetical protein EUTSA_v10020820mg [Eutrema salsugineum] gi|557107292|gb|ESQ47599.1| hypothetical protein EUTSA_v10020820mg [Eutrema salsugineum] Length = 414 Score = 496 bits (1277), Expect = e-137 Identities = 263/414 (63%), Positives = 312/414 (75%), Gaps = 24/414 (5%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ L+ +T R DL KI R+G +RFG++Y+FP ETAY SK T YTLE LVH++ Sbjct: 1 MDPLSVLKNFTTRGDLDKIERVGANYRFGSEYSFPCATETAYRSKGGTLYTLEALVHYVK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTT---KVPVGPIQN- 410 NQHLK +Y+++ +PAVT PDRK LLDYLTG+++SSDSIDF + QN Sbjct: 61 NQHLKPGEYMQSTVKNAVPAVTLPDRKPLLDYLTGRVASSDSIDFLLLQQQNAQSQKQNE 120 Query: 411 ------DLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRD 572 D + RE I+D + + E + I I++ ERPLK R+ IL CKNRD Sbjct: 121 EYRPDQDNSTFVSRESAIDDME-VEDFGKSGEDVDYIMLIRSNERPLKSRDAILQCKNRD 179 Query: 573 FYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELG----------SGFDG-AKA 719 FY++L +T+R+EERQ+ E+ QRKD LVAK+R+ G EE G SG+D K+ Sbjct: 180 FYSVLVNSTKREEERQRIESHQRKDGLVAKSRL-MGAEERGIVGFSGGGDDSGYDANPKS 238 Query: 720 KLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTV 896 KLH K KIGEGVPIILVPSAF TLITIYNVKEFLEDG++IP DVK KQMK +KPDC+TV Sbjct: 239 KLHFKAGKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPNDVKAKQMKGLKPDCITV 298 Query: 897 QKKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKIL 1070 QKKFSRDR VVTAYEVRDKPSALK DDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI+ Sbjct: 299 QKKFSRDRERVVTAYEVRDKPSALKPDDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKII 358 Query: 1071 GFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 GF+LRFEDDS+ESA+ VKQWNVKIISISKNKRHQDRAAALEVW++LEEF+RSRS Sbjct: 359 GFFLRFEDDSIESAKTVKQWNVKIISISKNKRHQDRAAALEVWEKLEEFVRSRS 412 >ref|NP_188898.1| protein PLANT HOMOLOGOUS TO PARAFIBROMIN [Arabidopsis thaliana] 
gi|11994291|dbj|BAB01474.1| unnamed protein product [Arabidopsis thaliana] gi|17529302|gb|AAL38878.1| unknown protein [Arabidopsis thaliana] gi|23296828|gb|AAN13180.1| unknown protein [Arabidopsis thaliana] gi|332643135|gb|AEE76656.1| Paf1 complex subunit parafibromin-like protein [Arabidopsis thaliana] Length = 415 Score = 495 bits (1274), Expect = e-137 Identities = 261/415 (62%), Positives = 313/415 (75%), Gaps = 25/415 (6%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ L+E+TIR D+ KI R+G +RFG++Y+FP ETAY SK + YTLE LVH++ Sbjct: 1 MDPLSVLKEFTIRGDIDKIERVGANYRFGSEYSFPCATETAYRSKSGSLYTLEALVHYVK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTT---KVPVGPIQN- 410 NQ LKH +Y+++ +PAVT PDRK LLDYLTG+++SSDSIDF + QN Sbjct: 61 NQQLKHGEYMQSTVKNSVPAVTLPDRKPLLDYLTGRVASSDSIDFLLLQQQNAQSQKQNE 120 Query: 411 ------DLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRD 572 D + RE I D + + E + I I++ ERPLK R+ IL CKNRD Sbjct: 121 EYRPDQDNSAFVSRENAIADME-VEDFGKSGEDVDYIMLIRSNERPLKSRDAILQCKNRD 179 Query: 573 FYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELG-----------SGFDG-AK 716 FY++L +T+R+EERQ+ E+ QRKD LVAK+R+ G EE G +G+D K Sbjct: 180 FYSVLVNSTKREEERQRIESHQRKDGLVAKSRL-MGAEERGIVGFSSGGGDDNGYDANPK 238 Query: 717 AKLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVT 893 +KLH K KIGEGVPIILVPSAF TLITIYNVKEFLEDG++IP DVK K+MK +KPDC+T Sbjct: 239 SKLHFKAGKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVYIPNDVKAKEMKGLKPDCIT 298 Query: 894 VQKKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI 1067 VQKKFSRDR VVTAYEVRDKPSALK DDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI Sbjct: 299 VQKKFSRDRERVVTAYEVRDKPSALKPDDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI 358 Query: 1068 LGFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 +GF+LRFEDDS+ESA+ VKQWNVKIISISKNKRHQDRAAALEVW++LEEF+RSRS Sbjct: 359 IGFFLRFEDDSIESAKTVKQWNVKIISISKNKRHQDRAAALEVWEKLEEFVRSRS 413 >ref|XP_002883372.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] gi|297329212|gb|EFH59631.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 414 Score = 494 bits (1271), Expect = e-137 Identities = 260/414 (62%), Positives = 313/414 (75%), Gaps = 24/414 (5%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ L+++TIR D+ KI R+G +RFG++Y+FP ETAY SK + YTLE LVH++ Sbjct: 1 MDPLSVLKDFTIRGDVDKIERVGVNYRFGSEYSFPCATETAYRSKSGSLYTLEALVHYVK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTT---KVPVGPIQN- 410 NQHLKH +Y+++ +PAVT PDRK LLDYLTG+++SSDSID+ + QN Sbjct: 61 NQHLKHGEYMQSTVKNSVPAVTLPDRKPLLDYLTGRVASSDSIDYLLLQQQNAQSQKQNE 120 Query: 411 ------DLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRD 572 D + RE IED + + E + I I++ ERPLK R+ IL CKNRD Sbjct: 121 EYRPDQDNSAFVSRENAIEDME-VEDFGKSGEDVDYIMLIRSNERPLKSRDAILQCKNRD 179 Query: 573 FYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGS-GFDGA----------KA 719 FY++L +T+R+EERQ+ E+ QRKD LVAK+R+ G EE G GF G K+ Sbjct: 180 FYSVLVNSTKREEERQRIESHQRKDGLVAKSRL-MGAEERGIVGFSGGGDDNGYDANPKS 238 Query: 720 KLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTV 896 KLH + KIGEGVPIILVPSA TLITIYNVKEFLEDG++IP DVK K+MK +KPDC+TV Sbjct: 239 KLHFRAGKIGEGVPIILVPSASQTLITIYNVKEFLEDGVYIPNDVKAKEMKGLKPDCITV 298 Query: 897 QKKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKIL 1070 QKKFSRDR VVTAYEVRDKPSALK DDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKI+ Sbjct: 299 QKKFSRDRERVVTAYEVRDKPSALKPDDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKII 358 Query: 1071 GFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 GF+LRFEDDS+ESA+ VKQWNVKIISISKNKRHQDRAAALEVW++LEEF+RSRS Sbjct: 359 GFFLRFEDDSIESAKTVKQWNVKIISISKNKRHQDRAAALEVWEKLEEFVRSRS 412 >gb|EXB63474.1| hypothetical protein L484_005437 [Morus notabilis] Length = 452 Score = 484 bits (1246), Expect = e-134 Identities = 268/453 (59%), Positives = 323/453 (71%), Gaps = 63/453 (13%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ALR++TIR +L KI R DEFRFG+D++FP + TA+ SK YTLETLV++I Sbjct: 1 
MDPLSALRDFTIRGELDKISRFNDEFRFGSDFSFPCSTPTAFRSKQGNLYTLETLVYYIK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDF----TTKVPVGPIQN 410 N KH +Y++NA + PAVTF DRK LLDYLTGK+S+SDSI+F + P PI + Sbjct: 61 NHQAKHTEYLQNARTQGFPAVTFIDRKPLLDYLTGKVSTSDSIEFLVPQNPRFPDPPIPS 120 Query: 411 DLNVLPP---------REGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCK 563 ++ P G +++++ + + + E + + I+A+ERPLKDRE +L CK Sbjct: 121 SVDEYRPDDVVLGDAVEHGAVDERARVGDGEL--EKVDFMAMIRASERPLKDREALLECK 178 Query: 564 NRDFYALLTAATRRDEERQKAEALQRKDNLVAKNRI----ER--GGEELGSGFDGAKAKL 725 R+F+A+LTA+ RR+EERQ+AE+ QRKD LVAKNR+ ER GG SG+D A K Sbjct: 179 GRNFHAVLTASVRREEERQRAESQQRKDGLVAKNRLMSADERGIGGYGDDSGYDPA-PKP 237 Query: 726 HLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTVQK 902 +KG KIGEGVPIILVPSAF TLITIYNVKEFLEDG+FIPTDVKVKQMK KPDCVTVQK Sbjct: 238 KMKGGKIGEGVPIILVPSAFQTLITIYNVKEFLEDGVFIPTDVKVKQMKGPKPDCVTVQK 297 Query: 903 KFS--RDRVVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNK---- 1064 KFS RDRVVTAYEVRDKPSALKA+DWDRVVAVFVLGK+WQFKDWPFKDHVEIFNK Sbjct: 298 KFSRDRDRVVTAYEVRDKPSALKAEDWDRVVAVFVLGKEWQFKDWPFKDHVEIFNKNNLE 357 Query: 1065 -------------------------------------ILGFYLRFEDDSVESARHVKQWN 1133 + GF++RFEDDS+ESA++VKQWN Sbjct: 358 TDISRIIMMRFVDRSFGVLGTGFLAGILILVFRIGCFVKGFFMRFEDDSIESAKNVKQWN 417 Query: 1134 VKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 VKIISISKNKRHQDRAAALEVWDRLEEF+RSRS Sbjct: 418 VKIISISKNKRHQDRAAALEVWDRLEEFVRSRS 450 >ref|XP_006297782.1| hypothetical protein CARUB_v10013819mg [Capsella rubella] gi|482566491|gb|EOA30680.1| hypothetical protein CARUB_v10013819mg [Capsella rubella] Length = 414 Score = 484 bits (1246), Expect = e-134 Identities = 256/414 (61%), Positives = 308/414 (74%), Gaps = 24/414 (5%) Frame = +3 Query: 63 MDPLTALREYTIRNDLAKILRIGDEFRFGNDYTFPATIETAYLSKHDTRYTLETLVHFIT 242 MDPL+ L+++T+R D+ KI R+G +RFG++Y+FP ETAY SK T YTLE LVH+ Sbjct: 1 MDPLSVLKDFTVRGDVDKIERVGANYRFGSEYSFPCATETAYRSKGGTLYTLEALVHYAK 60 Query: 243 NQHLKHADYIKNASSLRIPAVTFPDRKVLLDYLTGKISSSDSIDFTT---KVPVGPIQN- 410 NQHLKH +Y+++ +PAVT 
PDRK LLDYLTG+++SSDSID+ + QN Sbjct: 61 NQHLKHGEYMQSTVKSSVPAVTLPDRKPLLDYLTGRVASSDSIDYLLLQQQNAQSQKQNE 120 Query: 411 ------DLNVLPPREGEIEDKSFLNNLNVPEEVIVPIEWIKATERPLKDREQILLCKNRD 572 D + RE IED + + E + I I++ ERPLK R+ IL CKNRD Sbjct: 121 EYRPDQDNSAFVSRESAIEDME-VEDFGKSGEDVDYIMLIRSNERPLKSRDAILQCKNRD 179 Query: 573 FYALLTAATRRDEERQKAEALQRKDNLVAKNRIERGGEELGS-GFDGA----------KA 719 FY++L +T+R+EERQ+ E+ QRKD LVAK+R+ G EE G GF G K+ Sbjct: 180 FYSVLVNSTKREEERQRIESHQRKDGLVAKSRL-MGAEERGIVGFSGGGDDNGYDANPKS 238 Query: 720 KLHLKGSKIGEGVPIILVPSAFSTLITIYNVKEFLEDGIFIPTDVKVKQMK-VKPDCVTV 896 KLH K KIGEGVPIILVPSA TLITIYNVKEFLEDG+FI +DVK K+MK +KPDC+TV Sbjct: 239 KLHFKAGKIGEGVPIILVPSASQTLITIYNVKEFLEDGVFIESDVKAKEMKGLKPDCITV 298 Query: 897 QKKFSRDR--VVTAYEVRDKPSALKADDWDRVVAVFVLGKDWQFKDWPFKDHVEIFNKIL 1070 QKKFSRDR VVTAYEVRDKPSALK DDWDRVVAVFVLGKDWQFK WPFKDHVEIFNKI+ Sbjct: 299 QKKFSRDRERVVTAYEVRDKPSALKPDDWDRVVAVFVLGKDWQFKGWPFKDHVEIFNKII 358 Query: 1071 GFYLRFEDDSVESARHVKQWNVKIISISKNKRHQDRAAALEVWDRLEEFMRSRS 1232 GF++RF DDS+ESA+ VKQWNVKIISISKNKRH DR AALEVW++LEEF+RSRS Sbjct: 359 GFFMRFADDSIESAKTVKQWNVKIISISKNKRHHDRTAALEVWEKLEEFVRSRS 412