BLASTX nr result
ID: Akebia27_contig00035041
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00035041 (937 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256...   280   6e-73
ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citr...   254   3e-65
ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4...   251   4e-64
emb|CBI27823.3| unnamed protein product [Vitis vinifera]              247   4e-63
ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prun...   239   9e-61
ref|XP_002516293.1| DNA binding protein, putative [Ricinus commu...   239   1e-60
ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209...   237   4e-60
ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600...   235   2e-59
gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Mor...   232   1e-58
ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family pr...   222   1e-55
ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phas...   222   2e-55
ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802...   216   8e-54
ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782...   209   1e-51
ref|XP_007038340.1| DNA binding protein, putative isoform 2 [The...   209   1e-51
ref|XP_004495578.1| PREDICTED: DNA-directed RNA polymerase III s...   199   1e-48
ref|XP_002303490.2| hypothetical protein POPTR_0003s10650g [Popu...   193   9e-47
ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [A...   192   1e-46
ref|XP_006490148.1| PREDICTED: DNA-directed RNA polymerase III s...   192   2e-46
ref|XP_006421620.1| hypothetical protein CICLE_v10005426mg [Citr...
192 2e-46 ref|XP_004167167.1| PREDICTED: uncharacterized protein LOC101227... 191 4e-46 >ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256088 [Vitis vinifera] Length = 289 Score = 280 bits (716), Expect = 6e-73 Identities = 147/244 (60%), Positives = 173/244 (70%), Gaps = 7/244 (2%) Frame = -1 Query: 712 TRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKKSAPVQ 533 TRKVRFAPK P R+ K PK+E+ ED +A Q L+R NE +GK KA+KK AP Q Sbjct: 14 TRKVRFAPKAP-RRVPKSVVPKSEVAEDDDAAQANELMRHFNEASMKGKPKAEKKLAPTQ 72 Query: 532 VAFGHGNASSFIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEPWNYYTYYPVT 374 VAFG+G AS+ IRSYGTP+ +++SR QD G G K YKEPW+YYTYYPVT Sbjct: 73 VAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTYYPVT 132 Query: 373 LPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLP 194 LP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA MLFLQLPA++P Sbjct: 133 LPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLPATMP 192 Query: 193 LVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLY 14 ++K + + A ++ K C LEELP GFMGK+LVYKSG IKLKLGDTLY Sbjct: 193 MIK--------------QAATAEVKENKTCRLEELPSGFMGKMLVYKSGAIKLKLGDTLY 238 Query: 13 DVSP 2 DVSP Sbjct: 239 DVSP 242 >ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citrus clementina] gi|568853572|ref|XP_006480425.1| PREDICTED: uncharacterized protein LOC102622464 [Citrus sinensis] gi|557530644|gb|ESR41827.1| hypothetical protein CICLE_v10012311mg [Citrus clementina] Length = 303 Score = 254 bits (649), Expect = 3e-65 Identities = 136/258 (52%), Positives = 172/258 (66%), Gaps = 15/258 (5%) Frame = -1 Query: 730 MEPSTP------TRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPG-- 575 MEP P TRK+++APK P R+ K AE KTE+VE+A+A Q +LL+R N G Sbjct: 1 MEPEPPKSTSNATRKIKYAPKAPPRRVPK-AEVKTEMVENADAAQAMDLLQRFNANQGAL 59 Query: 574 RGKLKADKKSAPVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKA-------SGQNGEKY 416 +G+ K +KK AP Q+AFG G AS+FI+SYG PK +S S+ G A SG K Sbjct: 60 KGRPKVEKKVAPSQIAFGQGGASTFIKSYGIPKGGSSSSRGQGSAVNGGAHASGTRLGKE 119 Query: 415 
YKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENE 236 Y+EPW+YY+YYPV+LP RRPYSG+P LLDE+EFGE S YDE+S+NPAEELGLM+EN Sbjct: 120 YQEPWDYYSYYPVSLPLRRPYSGSPELLDEEEFGEASETINYDESSMNPAEELGLMEENL 179 Query: 235 EARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVY 56 E M+FLQLP +LPL K+ S +EK SL ELP FMGKLLVY Sbjct: 180 EPNMIFLQLPPTLPLKKQPATGNERQVTESSSKHEGATAKEKTSSLSELPGAFMGKLLVY 239 Query: 55 KSGVIKLKLGDTLYDVSP 2 +SG +KLKLG+T+Y+V+P Sbjct: 240 RSGAVKLKLGETVYNVTP 257 >ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] gi|508783039|gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] Length = 294 Score = 251 bits (640), Expect = 4e-64 Identities = 132/240 (55%), Positives = 167/240 (69%), Gaps = 4/240 (1%) Frame = -1 Query: 709 RKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKKSAPVQV 530 RK+RFAPK P R+A K E KTE+VED +A Q ++LL+R+N+ + K K +KK A QV Sbjct: 12 RKMRFAPKAPPRQAPK-LEVKTEVVEDTDAVQARDLLQRLNQTSAKTKPKVEKKVASSQV 70 Query: 529 AFGHGNASSFIRSYGTPKSSASRSQD--NG--KASGQNGEKYYKEPWNYYTYYPVTLPFR 362 AFGHG AS+ ++ +G K ++ S++ NG G EK Y+EPW+YY+YYPVTLP R Sbjct: 71 AFGHGGASASMKLFGVSKGASRTSRETLNGVVHTPGLREEKEYREPWDYYSYYPVTLPMR 130 Query: 361 RPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKR 182 RPYSGNP LDE+EF S N T+DENSV PA ELGLMDEN E M FLQLP +LP++K+ Sbjct: 131 RPYSGNPEFLDEEEFA--SENITFDENSVEPAVELGLMDENLEPSMFFLQLPPTLPMIKQ 188 Query: 181 XXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 2 SKP+ +G +K C LEELP G MGK+LV+KSG +KLKLGDTLYDV+P Sbjct: 189 SGTTAGLEVDSSSKPAARVGSVKKTCGLEELPAGLMGKMLVHKSGAVKLKLGDTLYDVTP 248 >emb|CBI27823.3| unnamed protein product [Vitis vinifera] Length = 294 Score = 247 bits (631), Expect = 4e-63 Identities = 138/254 (54%), Positives = 163/254 (64%), Gaps = 17/254 (6%) Frame = -1 Query: 712 TRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLK--------- 560 TRKVRFAPK P R+ K PK+E+ ED +A Q L+R N ++ Sbjct: 14 
TRKVRFAPKAP-RRVPKSVVPKSEVAEDDDAAQANELMRHFNVFILWRRIIFYFFIFFLF 72 Query: 559 -ADKKSAPVQVAFGHGNASSFIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEP 404 + AP QVAFG+G AS+ IRSYGTP+ +++SR QD G G K YKEP Sbjct: 73 CSHNCMAPTQVAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEP 132 Query: 403 WNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARM 224 W+YYTYYPVTLP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA M Sbjct: 133 WDYYTYYPVTLPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASM 192 Query: 223 LFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGV 44 LFLQLPA++P++K+ + C LEELP GFMGK+LVYKSG Sbjct: 193 LFLQLPATMPMIKQ-------------------AATAETCRLEELPSGFMGKMLVYKSGA 233 Query: 43 IKLKLGDTLYDVSP 2 IKLKLGDTLYDVSP Sbjct: 234 IKLKLGDTLYDVSP 247 >ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] gi|462400055|gb|EMJ05723.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] Length = 281 Score = 239 bits (611), Expect = 9e-61 Identities = 119/237 (50%), Positives = 157/237 (66%), Gaps = 3/237 (1%) Frame = -1 Query: 703 VRFAPKIPSRKATKPAEPKTEL---VEDAEATQTKNLLRRINEGPGRGKLKADKKSAPVQ 533 +RF PK P R+ KP E KTE+ E+++A + K LL+R NE R +LK +KK P Q Sbjct: 1 MRFIPKAP-RRVPKP-EVKTEVDHGAEESDAEKAKELLKRFNEQSSRARLKVEKKVVPTQ 58 Query: 532 VAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPY 353 + FG+G AS+ ++SYG PK ++ S N ASG EK Y PW+ Y+YYPVTLP R PY Sbjct: 59 IVFGYGGASTTMKSYGAPKGGSASSATNAGASGVKEEKEYSSPWDQYSYYPVTLPLRPPY 118 Query: 352 SGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXX 173 SGNP + +E+EFGEGS +TYDENS PA +LGL++EN+ M FLQLP ++P +KR Sbjct: 119 SGNPEIRNEEEFGEGSEESTYDENSTTPANDLGLLEENKATSMFFLQLPPNMPTIKRSAT 178 Query: 172 XXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 2 S P +K CSL ELP GFMGK+LVY+SG +K+K+GD+L+DVSP Sbjct: 179 ADSQEVTKSSGPPGGARNMQKPCSLSELPAGFMGKMLVYRSGAVKMKIGDSLFDVSP 235 >ref|XP_002516293.1| DNA binding protein, putative [Ricinus communis] gi|223544779|gb|EEF46295.1| DNA binding protein, putative [Ricinus communis] Length = 286 Score 
= 239 bits (609), Expect = 1e-60 Identities = 125/247 (50%), Positives = 157/247 (63%), Gaps = 6/247 (2%) Frame = -1 Query: 724 PSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKKS 545 P P RK+++ PK P R+ KP E K+E ED +ATQ L+++ E R K KA+KK Sbjct: 6 PQDPPRKLKYMPKAPPRRPPKP-EVKSEKAEDEDATQAMKLMKQFQERSMRAKPKAEKKV 64 Query: 544 APVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKA------SGQNGEKYYKEPWNYYTYY 383 Q+AFG G AS I+SY PK A+ + + G + S + GEK Y EPWNYY+YY Sbjct: 65 QASQIAFGFGAASPSIKSYAAPKVGAAVNHNQGSSVNGGAYSSELGEKEYIEPWNYYSYY 124 Query: 382 PVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPA 203 PVTLP RRPYSGNP L+ +EFGE S + YDENS N A LGLM+EN EA M FLQLP Sbjct: 125 PVTLPLRRPYSGNPATLNAEEFGEASDTSEYDENSTNSAINLGLMEENVEANMFFLQLPP 184 Query: 202 SLPLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGD 23 ++P++KR ++EK C L+ELP G MGK+LVY+SG +KLKLGD Sbjct: 185 TVPMIKRLATADGHKV-----------KEEKTCKLDELPAGHMGKMLVYRSGAVKLKLGD 233 Query: 22 TLYDVSP 2 TLYDVSP Sbjct: 234 TLYDVSP 240 >ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209454 [Cucumis sativus] gi|449500539|ref|XP_004161125.1| PREDICTED: uncharacterized LOC101209454 [Cucumis sativus] Length = 293 Score = 237 bits (605), Expect = 4e-60 Identities = 119/235 (50%), Positives = 159/235 (67%) Frame = -1 Query: 709 RKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKKSAPVQV 530 RK++FAPK P R+ KP E K E+ EDA+A Q ++LL+R NE R K + +K+AP QV Sbjct: 14 RKLKFAPKAPVRRIPKP-EVKAEVAEDADAAQARDLLKRFNESTQRAKQRVGRKAAPTQV 72 Query: 529 AFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYS 350 AFG G +SS +RSYG K+ ++G ++Y EPW+YY+YYPVTLP RRPYS Sbjct: 73 AFGSGGSSSTLRSYGVSKAGNRPRNEDGTLPASTSKEYV-EPWDYYSYYPVTLPLRRPYS 131 Query: 349 GNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXX 170 GNP L+E+EFGE S N TYDEN+ A LGL++EN EA +LFLQLP +P++K+ Sbjct: 132 GNPDSLNEEEFGEASENLTYDENTTTAAMNLGLLEENPEADVLFLQLPPMVPMIKQSSSV 191 Query: 169 XXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 5 S+ ++A ++K CS+ ELP G +GKLLVY+SG +KLKLGD +YDVS Sbjct: 
192 EDMGSGNSSEQNKASQPRQKTCSMNELPSGSIGKLLVYRSGAVKLKLGDIIYDVS 246 >ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600766 [Solanum tuberosum] Length = 283 Score = 235 bits (600), Expect = 2e-59 Identities = 126/237 (53%), Positives = 163/237 (68%), Gaps = 1/237 (0%) Frame = -1 Query: 709 RKVRFAPKIPSRKATKPAEPKTELVE-DAEATQTKNLLRRINEGPGRGKLKADKKSAPVQ 533 RKVRFAPK P R+A K PK E VE D +A + + L++R NE + K K +KK P Q Sbjct: 12 RKVRFAPKGPPRRAQKTVLPKPENVEADGDAAKAEELMQRFNEASAKVKHKVEKKG-PTQ 70 Query: 532 VAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPY 353 VAFG+G +SS ++SYG + S S +G G+ +K Y EPW+YYT YPVTLP RRPY Sbjct: 71 VAFGYGGSSSSLKSYGH-YNKVSGSMSDGGIGGERVQKEYTEPWDYYTNYPVTLPVRRPY 129 Query: 352 SGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXX 173 SGNP LLDE+EFGE S + TYDENS+ PA +LGLM+E+ E +M +QLP ++P++K+ Sbjct: 130 SGNPELLDEEEFGEASRSLTYDENSIKPAMDLGLMEESLEEKMFLVQLP-TMPMLKQSIK 188 Query: 172 XXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 2 SKPS+A K CSL ELP GFMGK+LVYKSG +KLKLG+TL+++SP Sbjct: 189 TEGSEMANSSKPSKA-----KACSLNELPAGFMGKMLVYKSGAVKLKLGETLFNLSP 240 >gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Morus notabilis] Length = 328 Score = 232 bits (592), Expect = 1e-58 Identities = 132/275 (48%), Positives = 157/275 (57%), Gaps = 34/275 (12%) Frame = -1 Query: 727 EPSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKK 548 EP P RK RF PK P + K AE K E+VE+ +A Q + LLRR NEG R K K +KK Sbjct: 9 EPDAP-RKRRFMPKAPPSRVPK-AEVKAEVVEETDADQARVLLRRFNEGSTRAKPKVEKK 66 Query: 547 SAPVQVAFGHGNASSFIRSYGTPKSSASRSQ----------------------------- 455 A QVAFG+G AS+ IRSYG PK SQ Sbjct: 67 VAAAQVAFGYGGASNTIRSYGVPKGGYRNSQGPPATRMLFTSAAFLSTVNKSFPMHDIKN 126 Query: 454 ----DNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYD 287 D SG EK YKEPW+YY+YYP TLPFRRP+SGNP LDE+EFG + YD Sbjct: 127 HVLTDGAFPSGTRQEKEYKEPWDYYSYYPSTLPFRRPHSGNPEFLDEEEFGADTETINYD 186 Query: 286 ENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGR-QEK 110 E S A ELGL++EN E M+ LQLP 
+PL+KR S P+ + + K Sbjct: 187 ETSAKAATELGLVEENPETSMILLQLPPIMPLMKRSANTAAGQEATKSSPAPVVAQATHK 246 Query: 109 GCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 5 C+L ELP GFMGK+LVY+SG IKLK+GDTLYDVS Sbjct: 247 ACALHELPAGFMGKMLVYRSGAIKLKIGDTLYDVS 281 >ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family protein [Populus trichocarpa] gi|550336350|gb|ERP59439.1| DNA-directed RNA polymerase 3 RPC4 family protein [Populus trichocarpa] Length = 292 Score = 222 bits (566), Expect = 1e-55 Identities = 122/243 (50%), Positives = 154/243 (63%), Gaps = 2/243 (0%) Frame = -1 Query: 724 PSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEGPGRGKLKADKKS 545 P RK RF PK P R+ KP E KTE VE+ + Q NL+++ E + K+ +KK Sbjct: 6 PQDAQRKYRFMPKAPPRRVPKP-EVKTEKVENVDTLQAMNLMKQFQERSLKQKITNEKKV 64 Query: 544 APVQVAFGHGNASSF-IRSYGT-PKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTL 371 + +AFG G A++ S+ T + S S N A G EK Y EPW+YY+ YPV+L Sbjct: 65 QKLDIAFGPGAAATKPFPSWSTINRDQGSSSNGNADAPGPR-EKEYIEPWDYYSNYPVSL 123 Query: 370 PFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPL 191 P RRPYSGN +LDE+EFGE S TYDENS N A ELGLM+EN EA MLF+QLP ++P+ Sbjct: 124 PMRRPYSGNSAILDEEEFGEVSEAATYDENSTNSAVELGLMEENVEASMLFVQLPPTMPM 183 Query: 190 VKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYD 11 +KR S+PS EK C L+ELP G+MGK+LVY+SG +KLKLGDTLYD Sbjct: 184 IKRSATAVGPEVKESSRPSGGARAIEKTCRLDELPAGYMGKVLVYRSGAVKLKLGDTLYD 243 Query: 10 VSP 2 VSP Sbjct: 244 VSP 246 >ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|593783109|ref|XP_007154595.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027948|gb|ESW26588.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027949|gb|ESW26589.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] Length = 291 Score = 222 bits (565), Expect = 2e-55 Identities = 121/244 (49%), Positives = 157/244 (64%), Gaps = 3/244 (1%) Frame = -1 Query: 727 EPSTPTR-KVRFAPKIPSRKATKPAEPKTELVEDAEA--TQTKNLLRRINEGPGRGKLKA 557 EPS P R K +FAP+ P R K E K E+VEDA+A 
Q K+LLRR NE + + K Sbjct: 4 EPSAPVRRKHKFAPRAPPRVVPKK-EVKAEVVEDAQADANQAKDLLRRFNESAMKARNKV 62 Query: 556 DKKSAPVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPV 377 +KK + Q+AFG+G S+ ++SYG + + + + S EK Y EPW+YY+ YPV Sbjct: 63 EKKVSASQIAFGYGGESTSLKSYGIGRGGRNVNINPNSTSSAVAEKEYTEPWDYYSNYPV 122 Query: 376 TLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASL 197 TLP RRPYSGNP LLDE+EFGE + TYDE + N A ELGL++EN EA M ++LP+ L Sbjct: 123 TLPLRRPYSGNPELLDEEEFGEAAEARTYDEEATNSAMELGLLEENLEANMFLIKLPSKL 182 Query: 196 PLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTL 17 P++ SKP + E+ C L++LP GFMGK+LVYKSG IKLKLG+TL Sbjct: 183 PIIS--TADGGKDVNAKSKPPVGTKKGERLCELKDLPSGFMGKMLVYKSGKIKLKLGNTL 240 Query: 16 YDVS 5 YDVS Sbjct: 241 YDVS 244 >ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802173 [Glycine max] Length = 298 Score = 216 bits (551), Expect = 8e-54 Identities = 116/244 (47%), Positives = 154/244 (63%), Gaps = 4/244 (1%) Frame = -1 Query: 724 PSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEATQT--KNLLRRINEGPG--RGKLKA 557 P P RK +F P+ P R+ K E K E+V+DA+A Q +NLL+R NE + K K Sbjct: 10 PGVP-RKPKFKPRAPPRRVIKQ-EVKAEVVDDADAEQAAKENLLKRFNERESAMKAKYKV 67 Query: 556 DKKSAPVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPV 377 +KK Q+AFG+G S+ ++SYG P+ +S + + AS EK Y+EPW+YY+ YPV Sbjct: 68 EKKVLASQIAFGYGGESTSMKSYGIPRGGSSININLSSASSGGKEKEYQEPWDYYSNYPV 127 Query: 376 TLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASL 197 TLP RRPYSGNP LLD++EF E + + TY+EN+ N +LGL++EN EA M + LP L Sbjct: 128 TLPLRRPYSGNPALLDDEEFAEAAQSRTYEENASNSTMDLGLLEENPEASMFLINLPTKL 187 Query: 196 PLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTL 17 P++K+ S P E+ C L EL GFMGK+LVYKSG IKLKLG+TL Sbjct: 188 PMIKQSATAGDKDVNEKSIPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGNTL 247 Query: 16 YDVS 5 YDVS Sbjct: 248 YDVS 251 >ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782982 [Glycine max] Length = 318 Score = 209 bits (532), Expect = 1e-51 Identities = 113/243 (46%), Positives = 148/243 (60%), 
Gaps = 8/243 (3%) Frame = -1 Query: 709 RKVRFAPKIPSRKATKPAEPKTELVED------AEATQTKNLLRRINEGPG--RGKLKAD 554 RK++F P+ P R+ K E K E+ +D AE +NLL+R +E + K K + Sbjct: 14 RKLKFKPRAPPRRVIKQ-EVKAEVADDVDVDADAEHAAKENLLKRFHERESAVKAKYKVE 72 Query: 553 KKSAPVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVT 374 KK Q+AFG+G S+ ++SYG P+ +S + + AS EK Y+EPW+Y + YPVT Sbjct: 73 KKVLASQIAFGYGGESTSMKSYGIPRGGSSININQSSASNGAKEKEYQEPWDYDSNYPVT 132 Query: 373 LPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLP 194 LP RRPYSGNP LLD+ EFGE + YDEN+ N A EL L++ N EA M F+ LP LP Sbjct: 133 LPLRRPYSGNPALLDDQEFGEAAEPRAYDENASNSAMELDLLEHNPEASMFFINLPTKLP 192 Query: 193 LVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLY 14 ++K+ S+P E+ C L EL GFMGK+LVYKSG IKLKLGDTLY Sbjct: 193 MIKQSATAGSSDVNVKSRPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGDTLY 252 Query: 13 DVS 5 DVS Sbjct: 253 DVS 255 >ref|XP_007038340.1| DNA binding protein, putative isoform 2 [Theobroma cacao] gi|508775585|gb|EOY22841.1| DNA binding protein, putative isoform 2 [Theobroma cacao] Length = 328 Score = 209 bits (532), Expect = 1e-51 Identities = 125/278 (44%), Positives = 164/278 (58%), Gaps = 37/278 (13%) Frame = -1 Query: 724 PSTPTRKVRFAPKIP-SRKATKPAEPKTELV-EDAEATQTKNLLRRINEGPGRGKLKADK 551 PS+ RKVRFAPK P S + K K+E+ ED EA Q + LL R NE R + K +K Sbjct: 6 PSSGRRKVRFAPKAPQSSRRLKTTVSKSEVNDEDGEAAQAQYLLGRFNENQTRQRPKVEK 65 Query: 550 KSAPVQVAFGHGNASS-FIRSYGTPKSSAS--------RSQDNG---------------- 446 KS+ Q++FG G SS +R+YG+ + S RS D+ Sbjct: 66 KSS-AQISFGPGAPSSNLLRAYGSQRGGTSGKSTDSRQRSPDDNDGQIIGSFPSASKEDR 124 Query: 445 ---------KASGQNGEKYYKEPWNYY-TYYPVTLPFRRPYSGNPVLLDEDEFGEGSANT 296 +AS ++ Y+EPW+Y+ TYYP+TLP RRPYSG+P LLD+ EF E +A Sbjct: 125 TDICSSDAIEASAPKIKREYREPWDYHHTYYPITLPLRRPYSGDPELLDQAEFVE-AARK 183 Query: 295 TYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGRQ 116 YDE ++NPA +LGL++E E+ +M F QLPA+LP++KR S G Sbjct: 184 EYDEKTINPASDLGLLEEGEKGKMFFFQLPANLPVIKRLASTKGKEKAENLGSSERFGAL 243 Query: 115 
EKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 2 +KGC LEELP GFMGK+LVYKSG +KLKLG+TLYDVSP Sbjct: 244 KKGCQLEELPGGFMGKMLVYKSGAVKLKLGETLYDVSP 281 >ref|XP_004495578.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Cicer arietinum] Length = 298 Score = 199 bits (506), Expect = 1e-48 Identities = 112/244 (45%), Positives = 145/244 (59%), Gaps = 3/244 (1%) Frame = -1 Query: 727 EPSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEAT---QTKNLLRRINEGPGRGKLKA 557 +P P RKV+FAPK RK K E K+E+ E+ AT K LLRR NE + ++K Sbjct: 8 KPPAP-RKVKFAPKALPRKVPK-IEVKSEVAEEDNATAAADAKELLRRFNENAMKARMKV 65 Query: 556 DKKSAPVQVAFGHGNASSFIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPV 377 +KK + Q+AFG G S +SY PKS + S A EK Y+EPW+ + YP+ Sbjct: 66 EKKVSASQIAFGFGGESVSRKSYNIPKSESKTSFGENLAFNGVKEKEYQEPWDMNSNYPI 125 Query: 376 TLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASL 197 LP R+PYSG+P L+E+EFGE + TYDE+ N A ELGL++EN EA F++LP + Sbjct: 126 ALPLRKPYSGDPEYLNEEEFGEAAITRTYDESKSNSAMELGLLEENPEASAFFIKLPPVV 185 Query: 196 PLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTL 17 P++K SK R K L ELPPG MGK+LVYKSG +KLKLG+TL Sbjct: 186 PMIKNPAKAGSQDVKENSKRPRVSKGVGKLYKLNELPPGLMGKMLVYKSGAVKLKLGNTL 245 Query: 16 YDVS 5 YDVS Sbjct: 246 YDVS 249 >ref|XP_002303490.2| hypothetical protein POPTR_0003s10650g [Populus trichocarpa] gi|550342916|gb|EEE78469.2| hypothetical protein POPTR_0003s10650g [Populus trichocarpa] Length = 368 Score = 193 bits (490), Expect = 9e-47 Identities = 121/280 (43%), Positives = 158/280 (56%), Gaps = 35/280 (12%) Frame = -1 Query: 736 DLMEPSTPTR-KVRFAPKIPSRKATKPAEPKTELV-------EDAEATQTKNLLRRINEG 581 D ++PS+P+R K++F PK+P R+ +P+ PKTE + ED EA Q + L+ + NE Sbjct: 46 DQVDPSSPSRTKLKFKPKLP-RRQRRPSVPKTEEINDDRRSNEDEEAAQAQMLIHKFNEN 104 Query: 580 PGRGKLKADKKSAPVQVAFGHGNASS--FIRSYGTPK-----SSASRSQDNGKASGQ--- 431 R K +K VQVAFG G S IR Y P SS S ++D G+ Sbjct: 105 LRRQVPK--EKKPQVQVAFGPGAPSPPLLIRKYNVPVHENTGSSWSGTEDTRDDDGKIFV 162 Query: 430 ----------------NGEKYYKEPWNYY-TYYPVTLPFRRPYSGNPVLLDEDEFGEGSA 302 G++ YKEPW+Y+ YYP 
TLP R PYSG+P LLDE EFGE + Sbjct: 163 PPSAARVDGAINPLSLKGKRRYKEPWDYHHIYYPNTLPLRPPYSGDPKLLDEAEFGEEAR 222 Query: 301 NTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMG 122 N YDE ++NPA +LGL++E + R+ F Q+P LP +KR S PS + Sbjct: 223 NLEYDETTINPASDLGLLEECDNERLFFFQVPEKLPFLKRSASAKGKERADMSMPSESKS 282 Query: 121 RQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 2 K S EELP G+MGK+LVY+SG IKLKLGD LYDVSP Sbjct: 283 AARK-TSFEELPKGYMGKMLVYRSGAIKLKLGDALYDVSP 321 >ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [Amborella trichopoda] gi|548851242|gb|ERN09518.1| hypothetical protein AMTR_s00029p00131600 [Amborella trichopoda] Length = 305 Score = 192 bits (489), Expect = 1e-46 Identities = 112/257 (43%), Positives = 155/257 (60%), Gaps = 17/257 (6%) Frame = -1 Query: 721 STPTRKVRFAPKIPSRKATKPAEPKTELVEDAEATQTKNLLRRINEG--PGRGKLKADKK 548 + P + +F PK+ ++ + PA K+E +E + + LL+ I + G G K +KK Sbjct: 6 NNPKKPRKFMPKVRPKRVSNPAGVKSESIETPDEKLSNELLKLIKQRREDGGGWGKNEKK 65 Query: 547 SAPVQVAFGHGNASSFIRSYGTPKS-SASRSQDNGKASGQNG-------------EKYYK 410 +APVQVAFG+GNA++F S K S+S+ ++ G A EK Y Sbjct: 66 AAPVQVAFGYGNAANFSSSSSYSKGGSSSKPKEIGHAFDDGSQLVDVKRDVDEKREKEYV 125 Query: 409 EPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSAN-TTYDENSVNPAEELGLMDENEE 233 EPW+YY+ YPVTLP RRPYSG+P LDE EFGE +A+ + +E+S N AEELGL +E EE Sbjct: 126 EPWDYYSKYPVTLPLRRPYSGDPETLDEKEFGESAASKSVCNEDSTNAAEELGLKEEREE 185 Query: 232 ARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGRQEKGCSLEELPPGFMGKLLVYK 53 +++F QLP SLP+ KR S R G+ E LE+L GFMGKLL+Y+ Sbjct: 186 RQLVFFQLPESLPIPKRSATADGKEVQDDSGQKRT-GKSEMPSRLEDLQAGFMGKLLIYE 244 Query: 52 SGVIKLKLGDTLYDVSP 2 SG +KLK+GDTL++VSP Sbjct: 245 SGAVKLKIGDTLFNVSP 261 >ref|XP_006490148.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like [Citrus sinensis] Length = 324 Score = 192 bits (488), Expect = 2e-46 Identities = 120/284 (42%), Positives = 159/284 (55%), Gaps = 39/284 (13%) Frame = -1 Query: 739 RDLMEPSTPTRKVRFAPKIP----SRKATKPAE-PKTELVEDAEATQTKNLLRRINEGPG 575 +D +PS RKVRFAPK P K T P P+ E + + + LLR+ NE Sbjct: 3 
QDPDKPSGSGRKVRFAPKAPPPSRQPKVTAPTPVPRPESKHEDPEAEAQRLLRQFNEANA 62 Query: 574 RGKLKADKKSAPVQVAFGHGNASS-FIRSYG----------------------------- 485 R + K +KKS+ QVAFG G++SS I+S+G Sbjct: 63 RRRPKVEKKSS--QVAFGAGDSSSPSIKSFGPRREVSSAKGTESEIIDSTSDERQIVNFS 120 Query: 484 --TPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGE 311 T + S + +S Q ++ YKEPWNY TYYP TLP+R+P SG+P +LD++EFGE Sbjct: 121 PVTAREDRSAPISSDASSTQKIKEDYKEPWNYDTYYPTTLPWRKPNSGDPEVLDQEEFGE 180 Query: 310 GSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSR 131 + N+ YDENSVN A +LGL+DE+E ++ F QLP LPL KR SKP Sbjct: 181 NTRNSEYDENSVNSAADLGLLDESENRKLFFFQLPKKLPLDKRPASTKGKEKAESSKP-- 238 Query: 130 AMGRQE--KGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 5 +GR + K L +LP G+MGK+LVYKSG +K KLGDTL+DVS Sbjct: 239 -LGRTDAPKDLDLSKLPGGYMGKMLVYKSGAVKFKLGDTLFDVS 281 >ref|XP_006421620.1| hypothetical protein CICLE_v10005426mg [Citrus clementina] gi|557523493|gb|ESR34860.1| hypothetical protein CICLE_v10005426mg [Citrus clementina] Length = 324 Score = 192 bits (488), Expect = 2e-46 Identities = 120/284 (42%), Positives = 159/284 (55%), Gaps = 39/284 (13%) Frame = -1 Query: 739 RDLMEPSTPTRKVRFAPKIP----SRKATKPAE-PKTELVEDAEATQTKNLLRRINEGPG 575 +D +PS RKVRFAPK P K T P P+ E + + + LLR+ NE Sbjct: 3 QDPDKPSGSGRKVRFAPKAPPPSRQPKVTAPTPVPRPESKHEDPEAEAQRLLRQFNEANA 62 Query: 574 RGKLKADKKSAPVQVAFGHGNASS-FIRSYG----------------------------- 485 R + K +KKS+ QVAFG G++SS I+S+G Sbjct: 63 RRRPKVEKKSS--QVAFGAGDSSSPSIKSFGPRREVSSAKGTESEIIDSTSDERQIVNFS 120 Query: 484 --TPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGE 311 T + S + +S Q ++ YKEPWNY TYYP TLP+R+P SG+P +LD++EFGE Sbjct: 121 PATAREDRSAPISSDASSTQKIKEDYKEPWNYDTYYPTTLPWRKPNSGDPEVLDQEEFGE 180 Query: 310 GSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSR 131 + N+ YDENSVN A +LGL+DE+E ++ F QLP LPL KR SKP Sbjct: 181 NTRNSEYDENSVNSAADLGLLDESENRKLFFFQLPKKLPLDKRPASTKGKEKAESSKP-- 238 Query: 130 AMGRQE--KGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 5 +GR + K L +LP G+MGK+LVYKSG +K KLGDTL+DVS Sbjct: 239 
-LGRTDAPKDLDLSKLPGGYMGKMLVYKSGAVKFKLGDTLFDVS 281 >ref|XP_004167167.1| PREDICTED: uncharacterized protein LOC101227599 [Cucumis sativus] Length = 322 Score = 191 bits (485), Expect = 4e-46 Identities = 123/274 (44%), Positives = 156/274 (56%), Gaps = 33/274 (12%) Frame = -1 Query: 727 EPSTPTRKVRFAPKIPSRKATKPAEPKTELVEDAEA----TQTKNLLRRINEGPGRGKLK 560 +PS P RKV+FAPK RK +P P + ED + QT+ LLRR NE G+ K Sbjct: 5 DPSPPRRKVKFAPKSSQRK--RPPPPPVQKTEDEDGEGYVAQTRYLLRRANENLGKRANK 62 Query: 559 ADKKSAPVQVAFGHG--NASSFIRSYGTPK-SSASRSQDN--------------GKASGQ 431 +KKS+ VQVAFG G + SS IR+YG PK + SR D + + + Sbjct: 63 VEKKSS-VQVAFGPGAESTSSSIRTYGVPKVENGSRKNDIEPEVDEDEEFVLPVARDANE 121 Query: 430 NGEKY-----------YKEPWNYY-TYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYD 287 +G+ + YKEPW+Y +YYP TLP R PYSG+P LLDE EFG+ N YD Sbjct: 122 DGKYFDKKPKMETKRDYKEPWDYQNSYYPTTLPLRMPYSGDPELLDEAEFGQDVMNREYD 181 Query: 286 ENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRAMGRQEKG 107 ENSV PA +LGL+DEN E+ F QLPA LPL K+ S+ S + + Sbjct: 182 ENSVIPALDLGLLDENTESTKYFFQLPARLPLPKQSSTATGKEKVGNSRSSNSTSSSDLD 241 Query: 106 CSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 5 L++L G MGKLL+YKSG IKL+LGD LYDVS Sbjct: 242 -DLKKLSAGCMGKLLIYKSGAIKLRLGDILYDVS 274