BLASTX nr result
ID: Akebia23_contig00011985
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00011985 (899 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256...  300   7e-79
emb|CBI27823.3| unnamed protein product [Vitis vinifera]             296   1e-77
ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4...  271   3e-70
ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citr...  268   3e-69
ref|XP_002516293.1| DNA binding protein, putative [Ricinus commu...  258   2e-66
gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Mor...  254   3e-65
ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prun...  254   4e-65
ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600...  252   1e-64
ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209...  247   5e-63
ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782...  239   1e-60
ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802...  238   2e-60
ref|XP_007038340.1| DNA binding protein, putative isoform 2 [The...  237   5e-60
ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family pr...  231   2e-58
ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phas...  230   5e-58
ref|XP_002510979.1| DNA binding protein, putative [Ricinus commu...  226   9e-57
ref|XP_004239580.1| PREDICTED: DNA-directed RNA polymerase III s...  224   4e-56
ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [A...  223   1e-55
ref|XP_007038339.1| DNA binding protein, putative isoform 1 [The...  223   1e-55
ref|XP_006383244.1| hypothetical protein POPTR_0005s12820g, part...
219 1e-54 ref|XP_004287620.1| PREDICTED: uncharacterized protein LOC101290... 218 3e-54 >ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256088 [Vitis vinifera] Length = 289 Score = 300 bits (767), Expect = 7e-79 Identities = 154/234 (65%), Positives = 177/234 (75%), Gaps = 7/234 (2%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEPWNYYTY 739 AP QVAFG+G AS+SIRSYGTP+ +++SR QD G G K YKEPW+YYTY Sbjct: 69 APTQVAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTY 128 Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559 YPVTLP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA MLFLQLP Sbjct: 129 YPVTLPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLP 188 Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379 A++P++K + + ++ K C LEELP GFMGK+LVYKSG IKLKLG Sbjct: 189 ATMPMIK--------------QAATAEVKENKTCRLEELPSGFMGKMLVYKSGAIKLKLG 234 Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 DTLYDVSPG DCVFAQDVVAINTE+K CV+GELKKRA+VTPDVDS +S M DL Sbjct: 235 DTLYDVSPGLDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 288 >emb|CBI27823.3| unnamed protein product [Vitis vinifera] Length = 294 Score = 296 bits (757), Expect = 1e-77 Identities = 153/234 (65%), Positives = 174/234 (74%), Gaps = 7/234 (2%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEPWNYYTY 739 AP QVAFG+G AS+SIRSYGTP+ +++SR QD G G K YKEPW+YYTY Sbjct: 79 APTQVAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTY 138 Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559 YPVTLP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA MLFLQLP Sbjct: 139 YPVTLPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLP 198 Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379 A++P++K+ + C LEELP GFMGK+LVYKSG IKLKLG Sbjct: 199 ATMPMIKQ-------------------AATAETCRLEELPSGFMGKMLVYKSGAIKLKLG 239 Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 DTLYDVSPG 
DCVFAQDVVAINTE+K CV+GELKKRA+VTPDVDS +S M DL Sbjct: 240 DTLYDVSPGLDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 293 >ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] gi|508783039|gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] Length = 294 Score = 271 bits (692), Expect = 3e-70 Identities = 138/231 (59%), Positives = 169/231 (73%), Gaps = 4/231 (1%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQD--NG--KASGQNGEKYYKEPWNYYTYYPV 730 A QVAFGHG AS+S++ +G K ++ S++ NG G EK Y+EPW+YY+YYPV Sbjct: 66 ASSQVAFGHGGASASMKLFGVSKGASRTSRETLNGVVHTPGLREEKEYREPWDYYSYYPV 125 Query: 729 TLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASL 550 TLP RRPYSGNP LDE+EF S N T+DENSV PA ELGLMDEN E M FLQLP +L Sbjct: 126 TLPMRRPYSGNPEFLDEEEFA--SENITFDENSVEPAVELGLMDENLEPSMFFLQLPPTL 183 Query: 549 PLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTL 370 P++K+ SKP+ +G +K C LEELP G MGK+LV+KSG +KLKLGDTL Sbjct: 184 PMIKQSGTTAGLEVDSSSKPAARVGSVKKTCGLEELPAGLMGKMLVHKSGAVKLKLGDTL 243 Query: 369 YDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 YDV+PG +CVFAQDVVA+NT EK CVVGEL KRA++TPDVDS+++ MADL Sbjct: 244 YDVTPGLNCVFAQDVVAVNTAEKQCCVVGELDKRAVLTPDVDSVLNSMADL 294 >ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citrus clementina] gi|568853572|ref|XP_006480425.1| PREDICTED: uncharacterized protein LOC102622464 [Citrus sinensis] gi|557530644|gb|ESR41827.1| hypothetical protein CICLE_v10012311mg [Citrus clementina] Length = 303 Score = 268 bits (684), Expect = 3e-69 Identities = 135/234 (57%), Positives = 165/234 (70%), Gaps = 7/234 (2%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKA-------SGQNGEKYYKEPWNYYTY 739 AP Q+AFG G AS+ I+SYG PK +S S+ G A SG K Y+EPW+YY+Y Sbjct: 70 APSQIAFGQGGASTFIKSYGIPKGGSSSSRGQGSAVNGGAHASGTRLGKEYQEPWDYYSY 129 Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559 YPV+LP RRPYSG+P LLDE+EFGE S YDE+S+NPAEELGLM+EN E M+FLQLP Sbjct: 130 
YPVSLPLRRPYSGSPELLDEEEFGEASETINYDESSMNPAEELGLMEENLEPNMIFLQLP 189 Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379 +LPL K+ S +EK SL ELP FMGKLLVY+SG +KLKLG Sbjct: 190 PTLPLKKQPATGNERQVTESSSKHEGATAKEKTSSLSELPGAFMGKLLVYRSGAVKLKLG 249 Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 +T+Y+V+PG DC+FAQDVV INT EKHFCV GEL KRAI++PDVD +++ ADL Sbjct: 250 ETVYNVTPGMDCMFAQDVVVINTAEKHFCVAGELNKRAILSPDVDFILNNFADL 303 >ref|XP_002516293.1| DNA binding protein, putative [Ricinus communis] gi|223544779|gb|EEF46295.1| DNA binding protein, putative [Ricinus communis] Length = 286 Score = 258 bits (660), Expect = 2e-66 Identities = 131/230 (56%), Positives = 158/230 (68%), Gaps = 6/230 (2%) Frame = -3 Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKA------SGQNGEKYYKEPWNYYTYYPVT 727 Q+AFG G AS SI+SY PK A+ + + G + S + GEK Y EPWNYY+YYPVT Sbjct: 68 QIAFGFGAASPSIKSYAAPKVGAAVNHNQGSSVNGGAYSSELGEKEYIEPWNYYSYYPVT 127 Query: 726 LPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLP 547 LP RRPYSGNP L+ +EFGE S + YDENS N A LGLM+EN EA M FLQLP ++P Sbjct: 128 LPLRRPYSGNPATLNAEEFGEASDTSEYDENSTNSAINLGLMEENVEANMFFLQLPPTVP 187 Query: 546 LVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLY 367 ++KR ++EK C L+ELP G MGK+LVY+SG +KLKLGDTLY Sbjct: 188 MIKRLATADGHKV-----------KEEKTCKLDELPAGHMGKMLVYRSGAVKLKLGDTLY 236 Query: 366 DVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 DVSPG D FAQD+ AINT EKH CVV E+ K AIVTPDVD++I+ MADL Sbjct: 237 DVSPGLDFAFAQDIAAINTAEKHCCVVAEIDKHAIVTPDVDAIINSMADL 286 >gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Morus notabilis] Length = 328 Score = 254 bits (649), Expect = 3e-65 Identities = 135/261 (51%), Positives = 159/261 (60%), Gaps = 34/261 (13%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQ------------------------------ 808 A QVAFG+G AS++IRSYG PK SQ Sbjct: 68 AAAQVAFGYGGASNTIRSYGVPKGGYRNSQGPPATRMLFTSAAFLSTVNKSFPMHDIKNH 127 Query: 807 ---DNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDE 637 D SG 
EK YKEPW+YY+YYP TLPFRRP+SGNP LDE+EFG + YDE Sbjct: 128 VLTDGAFPSGTRQEKEYKEPWDYYSYYPSTLPFRRPHSGNPEFLDEEEFGADTETINYDE 187 Query: 636 NSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQ-EKG 460 S A ELGL++EN E M+ LQLP +PL+KR S P+ + + K Sbjct: 188 TSAKAATELGLVEENPETSMILLQLPPIMPLMKRSANTAAGQEATKSSPAPVVAQATHKA 247 Query: 459 CSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGE 280 C+L ELP GFMGK+LVY+SG IKLK+GDTLYDVS G DCVF+QDVVAINT EKH C VGE Sbjct: 248 CALHELPAGFMGKMLVYRSGAIKLKIGDTLYDVSSGMDCVFSQDVVAINTVEKHCCAVGE 307 Query: 279 LKKRAIVTPDVDSLISGMADL 217 LKKRA +TPDVD ++ MADL Sbjct: 308 LKKRAAITPDVDFILQSMADL 328 >ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] gi|462400055|gb|EMJ05723.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] Length = 281 Score = 254 bits (648), Expect = 4e-65 Identities = 119/226 (52%), Positives = 157/226 (69%) Frame = -3 Query: 894 PVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFR 715 P Q+ FG+G AS++++SYG PK ++ S N ASG EK Y PW+ Y+YYPVTLP R Sbjct: 56 PTQIVFGYGGASTTMKSYGAPKGGSASSATNAGASGVKEEKEYSSPWDQYSYYPVTLPLR 115 Query: 714 RPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKR 535 PYSGNP + +E+EFGEGS +TYDENS PA +LGL++EN+ M FLQLP ++P +KR Sbjct: 116 PPYSGNPEIRNEEEFGEGSEESTYDENSTTPANDLGLLEENKATSMFFLQLPPNMPTIKR 175 Query: 534 XXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 355 S P +K CSL ELP GFMGK+LVY+SG +K+K+GD+L+DVSP Sbjct: 176 SATADSQEVTKSSGPPGGARNMQKPCSLSELPAGFMGKMLVYRSGAVKMKIGDSLFDVSP 235 Query: 354 GSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 G +C FAQDVV +N EK ++GEL KRAI+TPDVDS+++ + L Sbjct: 236 GMNCDFAQDVVVVNKAEKGCGIIGELNKRAIITPDVDSILASIDGL 281 >ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600766 [Solanum tuberosum] Length = 283 Score = 252 bits (644), Expect = 1e-64 Identities = 127/223 (56%), Positives = 162/223 (72%) Frame = -3 Query: 894 PVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFR 715 P QVAFG+G +SSS++SYG + S S +G G+ +K Y 
EPW+YYT YPVTLP R Sbjct: 68 PTQVAFGYGGSSSSLKSYGH-YNKVSGSMSDGGIGGERVQKEYTEPWDYYTNYPVTLPVR 126 Query: 714 RPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKR 535 RPYSGNP LLDE+EFGE S + TYDENS+ PA +LGLM+E+ E +M +QLP ++P++K+ Sbjct: 127 RPYSGNPELLDEEEFGEASRSLTYDENSIKPAMDLGLMEESLEEKMFLVQLP-TMPMLKQ 185 Query: 534 XXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 355 SKPS+ K CSL ELP GFMGK+LVYKSG +KLKLG+TL+++SP Sbjct: 186 SIKTEGSEMANSSKPSKA-----KACSLNELPAGFMGKMLVYKSGAVKLKLGETLFNLSP 240 Query: 354 GSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGM 226 G DC FAQDVVA+NTEEK+ +GEL KR I+TPDVDSL+ + Sbjct: 241 GMDCSFAQDVVAVNTEEKYCSNIGELTKRIIITPDVDSLLDSI 283 >ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209454 [Cucumis sativus] gi|449500539|ref|XP_004161125.1| PREDICTED: uncharacterized LOC101209454 [Cucumis sativus] Length = 293 Score = 247 bits (630), Expect = 5e-63 Identities = 121/227 (53%), Positives = 157/227 (69%) Frame = -3 Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718 AP QVAFG G +SS++RSYG K+ ++G ++Y EPW+YY+YYPVTLP Sbjct: 68 APTQVAFGSGGSSSTLRSYGVSKAGNRPRNEDGTLPASTSKEYV-EPWDYYSYYPVTLPL 126 Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538 RRPYSGNP L+E+EFGE S N TYDEN+ A LGL++EN EA +LFLQLP +P++K Sbjct: 127 RRPYSGNPDSLNEEEFGEASENLTYDENTTTAAMNLGLLEENPEADVLFLQLPPMVPMIK 186 Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358 + S+ ++ ++K CS+ ELP G +GKLLVY+SG +KLKLGD +YDVS Sbjct: 187 QSSSVEDMGSGNSSEQNKASQPRQKTCSMNELPSGSIGKLLVYRSGAVKLKLGDIIYDVS 246 Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 G DC FAQ+V AIN E K C+VGEL KRAI+TPDVDS++ + DL Sbjct: 247 SGMDCGFAQEVAAINVEGKRCCIVGELSKRAILTPDVDSMLKNIEDL 293 >ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782982 [Glycine max] Length = 318 Score = 239 bits (609), Expect = 1e-60 Identities = 116/224 (51%), Positives = 149/224 (66%) Frame = -3 Query: 888 
QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709 Q+AFG+G S+S++SYG P+ +S + + AS EK Y+EPW+Y + YPVTLP RRP Sbjct: 79 QIAFGYGGESTSMKSYGIPRGGSSININQSSASNGAKEKEYQEPWDYDSNYPVTLPLRRP 138 Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529 YSGNP LLD+ EFGE + YDEN+ N A EL L++ N EA M F+ LP LP++K+ Sbjct: 139 YSGNPALLDDQEFGEAAEPRAYDENASNSAMELDLLEHNPEASMFFINLPTKLPMIKQSA 198 Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349 S+P E+ C L EL GFMGK+LVYKSG IKLKLGDTLYDVS G Sbjct: 199 TAGSSDVNVKSRPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGDTLYDVSSGM 258 Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 C AQD+VAINT +KH C +GE+ KR +TPD+D++I + DL Sbjct: 259 KCACAQDLVAINTAQKHCCTIGEISKRVSITPDIDAIIDNLPDL 302 >ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802173 [Glycine max] Length = 298 Score = 238 bits (607), Expect = 2e-60 Identities = 113/224 (50%), Positives = 152/224 (67%) Frame = -3 Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709 Q+AFG+G S+S++SYG P+ +S + + AS EK Y+EPW+YY+ YPVTLP RRP Sbjct: 75 QIAFGYGGESTSMKSYGIPRGGSSININLSSASSGGKEKEYQEPWDYYSNYPVTLPLRRP 134 Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529 YSGNP LLD++EF E + + TY+EN+ N +LGL++EN EA M + LP LP++K+ Sbjct: 135 YSGNPALLDDEEFAEAAQSRTYEENASNSTMDLGLLEENPEASMFLINLPTKLPMIKQSA 194 Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349 S P E+ C L EL GFMGK+LVYKSG IKLKLG+TLYDVS G Sbjct: 195 TAGDKDVNEKSIPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGNTLYDVSSGM 254 Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 +C AQD+VA+NT +KH C +GE+ K +TPDVD++I ++DL Sbjct: 255 NCACAQDLVAVNTAQKHCCTIGEISKHVTITPDVDAIIDNLSDL 298 >ref|XP_007038340.1| DNA binding protein, putative isoform 2 [Theobroma cacao] gi|508775585|gb|EOY22841.1| DNA binding protein, putative isoform 2 [Theobroma cacao] Length = 328 Score = 237 bits (604), Expect = 5e-60 Identities = 124/259 (47%), Positives = 165/259 (63%), 
Gaps = 35/259 (13%) Frame = -3 Query: 888 QVAFGHGNASSSI-RSYGTPKSSAS--------RSQDNG--------------------- 799 Q++FG G SS++ R+YG+ + S RS D+ Sbjct: 70 QISFGPGAPSSNLLRAYGSQRGGTSGKSTDSRQRSPDDNDGQIIGSFPSASKEDRTDICS 129 Query: 798 ----KASGQNGEKYYKEPWNYY-TYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDEN 634 +AS ++ Y+EPW+Y+ TYYP+TLP RRPYSG+P LLD+ EF E +A YDE Sbjct: 130 SDAIEASAPKIKREYREPWDYHHTYYPITLPLRRPYSGDPELLDQAEFVE-AARKEYDEK 188 Query: 633 SVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCS 454 ++NPA +LGL++E E+ +M F QLPA+LP++KR S G +KGC Sbjct: 189 TINPASDLGLLEEGEKGKMFFFQLPANLPVIKRLASTKGKEKAENLGSSERFGALKKGCQ 248 Query: 453 LEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELK 274 LEELP GFMGK+LVYKSG +KLKLG+TLYDVSPGSDC+FAQDV A+NT EKH CV+GEL Sbjct: 249 LEELPGGFMGKMLVYKSGAVKLKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELG 308 Query: 273 KRAIVTPDVDSLISGMADL 217 KR +VTPD+ S+++ + DL Sbjct: 309 KRVVVTPDISSVLNSVIDL 327 >ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family protein [Populus trichocarpa] gi|550336350|gb|ERP59439.1| DNA-directed RNA polymerase 3 RPC4 family protein [Populus trichocarpa] Length = 292 Score = 231 bits (590), Expect = 2e-58 Identities = 122/227 (53%), Positives = 156/227 (68%), Gaps = 2/227 (0%) Frame = -3 Query: 891 VQVAFGHGNASSS-IRSYGT-PKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718 + +AFG G A++ S+ T + S S N A G EK Y EPW+YY+ YPV+LP Sbjct: 67 LDIAFGPGAAATKPFPSWSTINRDQGSSSNGNADAPGPR-EKEYIEPWDYYSNYPVSLPM 125 Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538 RRPYSGN +LDE+EFGE S TYDENS N A ELGLM+EN EA MLF+QLP ++P++K Sbjct: 126 RRPYSGNSAILDEEEFGEVSEAATYDENSTNSAVELGLMEENVEASMLFVQLPPTMPMIK 185 Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358 R S+PS EK C L+ELP G+MGK+LVY+SG +KLKLGDTLYDVS Sbjct: 186 RSATAVGPEVKESSRPSGGARAIEKTCRLDELPAGYMGKVLVYRSGAVKLKLGDTLYDVS 245 Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 PG + +FAQDVVAIN E+ CVV E++KR + PDVD++IS +A++ Sbjct: 246 
PGMNSIFAQDVVAINRGEETCCVVAEIEKRVTLIPDVDAIISRVAEM 292 >ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|593783109|ref|XP_007154595.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027948|gb|ESW26588.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027949|gb|ESW26589.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] Length = 291 Score = 230 bits (587), Expect = 5e-58 Identities = 113/224 (50%), Positives = 150/224 (66%) Frame = -3 Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709 Q+AFG+G S+S++SYG + + + + S EK Y EPW+YY+ YPVTLP RRP Sbjct: 70 QIAFGYGGESTSLKSYGIGRGGRNVNINPNSTSSAVAEKEYTEPWDYYSNYPVTLPLRRP 129 Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529 YSGNP LLDE+EFGE + TYDE + N A ELGL++EN EA M ++LP+ LP++ Sbjct: 130 YSGNPELLDEEEFGEAAEARTYDEEATNSAMELGLLEENLEANMFLIKLPSKLPIIS--T 187 Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349 SKP + E+ C L++LP GFMGK+LVYKSG IKLKLG+TLYDVS G Sbjct: 188 ADGGKDVNAKSKPPVGTKKGERLCELKDLPSGFMGKMLVYKSGKIKLKLGNTLYDVSSGM 247 Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 +C F+QDVVAIN EK C +GE+ K +TPD+D ++ ++DL Sbjct: 248 NCSFSQDVVAINKAEKTLCSIGEISKHVTITPDIDDILDNLSDL 291 >ref|XP_002510979.1| DNA binding protein, putative [Ricinus communis] gi|223550094|gb|EEF51581.1| DNA binding protein, putative [Ricinus communis] Length = 328 Score = 226 bits (576), Expect = 9e-57 Identities = 127/257 (49%), Positives = 160/257 (62%), Gaps = 32/257 (12%) Frame = -3 Query: 891 VQVAFGHGNASS-SIRSYGTPKSS-------ASRSQDNGK-------------------- 796 VQVAFG G SS SIR++G K + D+GK Sbjct: 71 VQVAFGPGATSSTSIRTFGVSKGENPVSSGIKDSTDDDGKIVISSLSTDKEDEIINCASE 130 Query: 795 ---ASGQNGEKYYKEPWNY-YTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSV 628 A +K Y+EPW+Y TYYP TLP RRPYSG+PVLLDE EFGE + YDE+++ Sbjct: 131 DIDALPLKIKKDYREPWDYDRTYYPTTLPLRRPYSGDPVLLDEAEFGEAARKLEYDESTM 190 Query: 627 
NPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLE 448 NPA +L L++E + +M+F QLPA LPLVKR S PS+ +K SL+ Sbjct: 191 NPASDLELLEECDTEKMIFFQLPAKLPLVKRSASAKGKEKAEGSIPSQGKNAAKKESSLD 250 Query: 447 ELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKR 268 L G+MGK+LVY+SG +KLKLGDTLYDVS GSDC+FAQDV+AINT KH C +GEL+KR Sbjct: 251 GLSAGYMGKMLVYRSGAVKLKLGDTLYDVSQGSDCMFAQDVMAINTAAKHCCTIGELEKR 310 Query: 267 AIVTPDVDSLISGMADL 217 A+VTPDVDSL+ + +L Sbjct: 311 AVVTPDVDSLLDSVVNL 327 >ref|XP_004239580.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Solanum lycopersicum] Length = 188 Score = 224 bits (571), Expect = 4e-56 Identities = 112/186 (60%), Positives = 138/186 (74%) Frame = -3 Query: 792 SGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEE 613 +G+ +K Y EPW+YYT YPVTLP RRPYSGNP LLDE+EFGE S + TYDENS+ PA + Sbjct: 6 NGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEASQSLTYDENSIKPAMD 65 Query: 612 LGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPG 433 LGLM+EN E +M +QLP ++P++K+ SK S+ K CSL ELP G Sbjct: 66 LGLMEENLEEKMFLVQLP-TMPMLKQSIKTEGSEMANSSKTSKA-----KACSLNELPAG 119 Query: 432 FMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTP 253 MGKLLVYKSG +KLKLG+TL++VSPG DC FAQDVVA+NTEEK+ +GEL KR I+TP Sbjct: 120 LMGKLLVYKSGAVKLKLGETLFNVSPGMDCSFAQDVVAVNTEEKYCSNIGELTKRIIITP 179 Query: 252 DVDSLI 235 DVDSL+ Sbjct: 180 DVDSLL 185 >ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [Amborella trichopoda] gi|548851242|gb|ERN09518.1| hypothetical protein AMTR_s00029p00131600 [Amborella trichopoda] Length = 305 Score = 223 bits (567), Expect = 1e-55 Identities = 122/236 (51%), Positives = 158/236 (66%), Gaps = 15/236 (6%) Frame = -3 Query: 897 APVQVAFGHGNAS--SSIRSYGTPKSSASRSQ-----DNG-------KASGQNGEKYYKE 760 APVQVAFG+GNA+ SS SY SS+ + D+G + + EK Y E Sbjct: 67 APVQVAFGYGNAANFSSSSSYSKGGSSSKPKEIGHAFDDGSQLVDVKRDVDEKREKEYVE 126 Query: 759 PWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTY-DENSVNPAEELGLMDENEEA 583 PW+YY+ YPVTLP RRPYSG+P LDE EFGE +A+ + +E+S N 
AEELGL +E EE Sbjct: 127 PWDYYSKYPVTLPLRRPYSGDPETLDEKEFGESAASKSVCNEDSTNAAEELGLKEEREER 186 Query: 582 RMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKS 403 +++F QLP SLP+ KR S RT G+ E LE+L GFMGKLL+Y+S Sbjct: 187 QLVFFQLPESLPIPKRSATADGKEVQDDSGQKRT-GKSEMPSRLEDLQAGFMGKLLIYES 245 Query: 402 GVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLI 235 G +KLK+GDTL++VSPGS C FAQ+V AINT ++ +CV+GE+ KRAIVTPD+D L+ Sbjct: 246 GAVKLKIGDTLFNVSPGSKCEFAQEVAAINTRDRQYCVLGEINKRAIVTPDIDDLL 301 >ref|XP_007038339.1| DNA binding protein, putative isoform 1 [Theobroma cacao] gi|508775584|gb|EOY22840.1| DNA binding protein, putative isoform 1 [Theobroma cacao] Length = 359 Score = 223 bits (567), Expect = 1e-55 Identities = 104/178 (58%), Positives = 133/178 (74%) Frame = -3 Query: 750 YYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLF 571 ++TYYP+TLP RRPYSG+P LLD+ EF E +A YDE ++NPA +LGL++E E+ +M F Sbjct: 182 HHTYYPITLPLRRPYSGDPELLDQAEFVE-AARKEYDEKTINPASDLGLLEEGEKGKMFF 240 Query: 570 LQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIK 391 QLPA+LP++KR S G +KGC LEELP GFMGK+LVYKSG +K Sbjct: 241 FQLPANLPVIKRLASTKGKEKAENLGSSERFGALKKGCQLEELPGGFMGKMLVYKSGAVK 300 Query: 390 LKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 LKLG+TLYDVSPGSDC+FAQDV A+NT EKH CV+GEL KR +VTPD+ S+++ + DL Sbjct: 301 LKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELGKRVVVTPDISSVLNSVIDL 358 >ref|XP_006383244.1| hypothetical protein POPTR_0005s12820g, partial [Populus trichocarpa] gi|550338826|gb|ERP61041.1| hypothetical protein POPTR_0005s12820g, partial [Populus trichocarpa] Length = 194 Score = 219 bits (558), Expect = 1e-54 Identities = 108/181 (59%), Positives = 132/181 (72%) Frame = -3 Query: 777 EKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMD 598 EK Y EPW+YY+ YPV+LP RRPYSGN +LDE+EFGE S TYDENS N A ELGLM+ Sbjct: 9 EKEYIEPWDYYSNYPVSLPMRRPYSGNSAILDEEEFGEVSEAATYDENSTNSAVELGLME 68 Query: 597 ENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKL 418 EN EA MLF+QLP ++P++KR S+PS EK 
C L+ELP G+MGK+ Sbjct: 69 ENVEASMLFVQLPPTMPMIKRSATAVGPEVKESSRPSGGARAIEKTCRLDELPAGYMGKV 128 Query: 417 LVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSL 238 LVY+SG +KLKLGDTLYDVSPG + +FAQDVVAIN E+ CVV E++KR + PDVD L Sbjct: 129 LVYRSGAVKLKLGDTLYDVSPGMNSIFAQDVVAINRGEETCCVVAEIEKRVTLIPDVDKL 188 Query: 237 I 235 + Sbjct: 189 L 189 >ref|XP_004287620.1| PREDICTED: uncharacterized protein LOC101290984 [Fragaria vesca subsp. vesca] Length = 286 Score = 218 bits (554), Expect = 3e-54 Identities = 114/227 (50%), Positives = 149/227 (65%), Gaps = 1/227 (0%) Frame = -3 Query: 894 PVQVAFGHGNASSS-IRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718 PV+VAFG G SSS +RSYG P+ G EK YK P++ Y +YPV+LP Sbjct: 67 PVEVAFGSGGQSSSTLRSYGAPRGV----NGGGLNPVIQEEKEYKSPFDIYGHYPVSLPL 122 Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538 R+P S +P +L++ EFG+GS +TYDEN+ A++L L +EN M FL LP +LP++K Sbjct: 123 RQPSSEDPAILNQQEFGDGSEESTYDENATPAADDLDLREENRATSMFFLHLPPTLPMLK 182 Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358 + +R EK CSL +LP GFMGK+LVY+SG IK+KLGDTLYDVS Sbjct: 183 QPAGQQVNNSSGAPGGARNT---EKPCSLGDLPAGFMGKMLVYRSGAIKMKLGDTLYDVS 239 Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217 G +C FAQDVVAINT EK C +GEL KRA+VTPD+DS+++ + DL Sbjct: 240 TGMNCDFAQDVVAINTTEKKCCTIGELNKRAVVTPDIDSVLNSLEDL 286