BLASTX nr result

ID: Akebia23_contig00011985 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00011985
         (899 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256...   300   7e-79
emb|CBI27823.3| unnamed protein product [Vitis vinifera]              296   1e-77
ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4...   271   3e-70
ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citr...   268   3e-69
ref|XP_002516293.1| DNA binding protein, putative [Ricinus commu...   258   2e-66
gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Mor...   254   3e-65
ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prun...   254   4e-65
ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600...   252   1e-64
ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209...   247   5e-63
ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782...   239   1e-60
ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802...   238   2e-60
ref|XP_007038340.1| DNA binding protein, putative isoform 2 [The...   237   5e-60
ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family pr...   231   2e-58
ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phas...   230   5e-58
ref|XP_002510979.1| DNA binding protein, putative [Ricinus commu...   226   9e-57
ref|XP_004239580.1| PREDICTED: DNA-directed RNA polymerase III s...   224   4e-56
ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [A...   223   1e-55
ref|XP_007038339.1| DNA binding protein, putative isoform 1 [The...   223   1e-55
ref|XP_006383244.1| hypothetical protein POPTR_0005s12820g, part...   219   1e-54
ref|XP_004287620.1| PREDICTED: uncharacterized protein LOC101290...   218   3e-54

>ref|XP_002278210.2| PREDICTED: uncharacterized protein LOC100256088 [Vitis vinifera]
          Length = 289

 Score =  300 bits (767), Expect = 7e-79
 Identities = 154/234 (65%), Positives = 177/234 (75%), Gaps = 7/234 (2%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEPWNYYTY 739
           AP QVAFG+G AS+SIRSYGTP+  +++SR QD     G  G      K YKEPW+YYTY
Sbjct: 69  APTQVAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTY 128

Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559
           YPVTLP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA MLFLQLP
Sbjct: 129 YPVTLPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLP 188

Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379
           A++P++K              + +    ++ K C LEELP GFMGK+LVYKSG IKLKLG
Sbjct: 189 ATMPMIK--------------QAATAEVKENKTCRLEELPSGFMGKMLVYKSGAIKLKLG 234

Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           DTLYDVSPG DCVFAQDVVAINTE+K  CV+GELKKRA+VTPDVDS +S M DL
Sbjct: 235 DTLYDVSPGLDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 288


>emb|CBI27823.3| unnamed protein product [Vitis vinifera]
          Length = 294

 Score =  296 bits (757), Expect = 1e-77
 Identities = 153/234 (65%), Positives = 174/234 (74%), Gaps = 7/234 (2%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPK--SSASRSQDNGKASGQNGE-----KYYKEPWNYYTY 739
           AP QVAFG+G AS+SIRSYGTP+  +++SR QD     G  G      K YKEPW+YYTY
Sbjct: 79  APTQVAFGYGGASASIRSYGTPRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTY 138

Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559
           YPVTLP RRPYSGNP LLDE+EFGE S +T YDENS NPA ELGLMDEN+EA MLFLQLP
Sbjct: 139 YPVTLPLRRPYSGNPELLDEEEFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLP 198

Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379
           A++P++K+                       + C LEELP GFMGK+LVYKSG IKLKLG
Sbjct: 199 ATMPMIKQ-------------------AATAETCRLEELPSGFMGKMLVYKSGAIKLKLG 239

Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           DTLYDVSPG DCVFAQDVVAINTE+K  CV+GELKKRA+VTPDVDS +S M DL
Sbjct: 240 DTLYDVSPGLDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 293


>ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma
           cacao] gi|508783039|gb|EOY30295.1| DNA-directed RNA
           polymerase III subunit RPC4, putative [Theobroma cacao]
          Length = 294

 Score =  271 bits (692), Expect = 3e-70
 Identities = 138/231 (59%), Positives = 169/231 (73%), Gaps = 4/231 (1%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQD--NG--KASGQNGEKYYKEPWNYYTYYPV 730
           A  QVAFGHG AS+S++ +G  K ++  S++  NG     G   EK Y+EPW+YY+YYPV
Sbjct: 66  ASSQVAFGHGGASASMKLFGVSKGASRTSRETLNGVVHTPGLREEKEYREPWDYYSYYPV 125

Query: 729 TLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASL 550
           TLP RRPYSGNP  LDE+EF   S N T+DENSV PA ELGLMDEN E  M FLQLP +L
Sbjct: 126 TLPMRRPYSGNPEFLDEEEFA--SENITFDENSVEPAVELGLMDENLEPSMFFLQLPPTL 183

Query: 549 PLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTL 370
           P++K+            SKP+  +G  +K C LEELP G MGK+LV+KSG +KLKLGDTL
Sbjct: 184 PMIKQSGTTAGLEVDSSSKPAARVGSVKKTCGLEELPAGLMGKMLVHKSGAVKLKLGDTL 243

Query: 369 YDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           YDV+PG +CVFAQDVVA+NT EK  CVVGEL KRA++TPDVDS+++ MADL
Sbjct: 244 YDVTPGLNCVFAQDVVAVNTAEKQCCVVGELDKRAVLTPDVDSVLNSMADL 294


>ref|XP_006428587.1| hypothetical protein CICLE_v10012311mg [Citrus clementina]
           gi|568853572|ref|XP_006480425.1| PREDICTED:
           uncharacterized protein LOC102622464 [Citrus sinensis]
           gi|557530644|gb|ESR41827.1| hypothetical protein
           CICLE_v10012311mg [Citrus clementina]
          Length = 303

 Score =  268 bits (684), Expect = 3e-69
 Identities = 135/234 (57%), Positives = 165/234 (70%), Gaps = 7/234 (2%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKA-------SGQNGEKYYKEPWNYYTY 739
           AP Q+AFG G AS+ I+SYG PK  +S S+  G A       SG    K Y+EPW+YY+Y
Sbjct: 70  APSQIAFGQGGASTFIKSYGIPKGGSSSSRGQGSAVNGGAHASGTRLGKEYQEPWDYYSY 129

Query: 738 YPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLP 559
           YPV+LP RRPYSG+P LLDE+EFGE S    YDE+S+NPAEELGLM+EN E  M+FLQLP
Sbjct: 130 YPVSLPLRRPYSGSPELLDEEEFGEASETINYDESSMNPAEELGLMEENLEPNMIFLQLP 189

Query: 558 ASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLG 379
            +LPL K+            S        +EK  SL ELP  FMGKLLVY+SG +KLKLG
Sbjct: 190 PTLPLKKQPATGNERQVTESSSKHEGATAKEKTSSLSELPGAFMGKLLVYRSGAVKLKLG 249

Query: 378 DTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           +T+Y+V+PG DC+FAQDVV INT EKHFCV GEL KRAI++PDVD +++  ADL
Sbjct: 250 ETVYNVTPGMDCMFAQDVVVINTAEKHFCVAGELNKRAILSPDVDFILNNFADL 303


>ref|XP_002516293.1| DNA binding protein, putative [Ricinus communis]
           gi|223544779|gb|EEF46295.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 286

 Score =  258 bits (660), Expect = 2e-66
 Identities = 131/230 (56%), Positives = 158/230 (68%), Gaps = 6/230 (2%)
 Frame = -3

Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKA------SGQNGEKYYKEPWNYYTYYPVT 727
           Q+AFG G AS SI+SY  PK  A+ + + G +      S + GEK Y EPWNYY+YYPVT
Sbjct: 68  QIAFGFGAASPSIKSYAAPKVGAAVNHNQGSSVNGGAYSSELGEKEYIEPWNYYSYYPVT 127

Query: 726 LPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLP 547
           LP RRPYSGNP  L+ +EFGE S  + YDENS N A  LGLM+EN EA M FLQLP ++P
Sbjct: 128 LPLRRPYSGNPATLNAEEFGEASDTSEYDENSTNSAINLGLMEENVEANMFFLQLPPTVP 187

Query: 546 LVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLY 367
           ++KR                    ++EK C L+ELP G MGK+LVY+SG +KLKLGDTLY
Sbjct: 188 MIKRLATADGHKV-----------KEEKTCKLDELPAGHMGKMLVYRSGAVKLKLGDTLY 236

Query: 366 DVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           DVSPG D  FAQD+ AINT EKH CVV E+ K AIVTPDVD++I+ MADL
Sbjct: 237 DVSPGLDFAFAQDIAAINTAEKHCCVVAEIDKHAIVTPDVDAIINSMADL 286


>gb|EXB38927.1| DNA-directed RNA polymerase III subunit RPC4 [Morus notabilis]
          Length = 328

 Score =  254 bits (649), Expect = 3e-65
 Identities = 135/261 (51%), Positives = 159/261 (60%), Gaps = 34/261 (13%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQ------------------------------ 808
           A  QVAFG+G AS++IRSYG PK     SQ                              
Sbjct: 68  AAAQVAFGYGGASNTIRSYGVPKGGYRNSQGPPATRMLFTSAAFLSTVNKSFPMHDIKNH 127

Query: 807 ---DNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDE 637
              D    SG   EK YKEPW+YY+YYP TLPFRRP+SGNP  LDE+EFG  +    YDE
Sbjct: 128 VLTDGAFPSGTRQEKEYKEPWDYYSYYPSTLPFRRPHSGNPEFLDEEEFGADTETINYDE 187

Query: 636 NSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQ-EKG 460
            S   A ELGL++EN E  M+ LQLP  +PL+KR            S P+  + +   K 
Sbjct: 188 TSAKAATELGLVEENPETSMILLQLPPIMPLMKRSANTAAGQEATKSSPAPVVAQATHKA 247

Query: 459 CSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGE 280
           C+L ELP GFMGK+LVY+SG IKLK+GDTLYDVS G DCVF+QDVVAINT EKH C VGE
Sbjct: 248 CALHELPAGFMGKMLVYRSGAIKLKIGDTLYDVSSGMDCVFSQDVVAINTVEKHCCAVGE 307

Query: 279 LKKRAIVTPDVDSLISGMADL 217
           LKKRA +TPDVD ++  MADL
Sbjct: 308 LKKRAAITPDVDFILQSMADL 328


>ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica]
           gi|462400055|gb|EMJ05723.1| hypothetical protein
           PRUPE_ppa017748mg [Prunus persica]
          Length = 281

 Score =  254 bits (648), Expect = 4e-65
 Identities = 119/226 (52%), Positives = 157/226 (69%)
 Frame = -3

Query: 894 PVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFR 715
           P Q+ FG+G AS++++SYG PK  ++ S  N  ASG   EK Y  PW+ Y+YYPVTLP R
Sbjct: 56  PTQIVFGYGGASTTMKSYGAPKGGSASSATNAGASGVKEEKEYSSPWDQYSYYPVTLPLR 115

Query: 714 RPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKR 535
            PYSGNP + +E+EFGEGS  +TYDENS  PA +LGL++EN+   M FLQLP ++P +KR
Sbjct: 116 PPYSGNPEIRNEEEFGEGSEESTYDENSTTPANDLGLLEENKATSMFFLQLPPNMPTIKR 175

Query: 534 XXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 355
                       S P       +K CSL ELP GFMGK+LVY+SG +K+K+GD+L+DVSP
Sbjct: 176 SATADSQEVTKSSGPPGGARNMQKPCSLSELPAGFMGKMLVYRSGAVKMKIGDSLFDVSP 235

Query: 354 GSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           G +C FAQDVV +N  EK   ++GEL KRAI+TPDVDS+++ +  L
Sbjct: 236 GMNCDFAQDVVVVNKAEKGCGIIGELNKRAIITPDVDSILASIDGL 281


>ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600766 [Solanum tuberosum]
          Length = 283

 Score =  252 bits (644), Expect = 1e-64
 Identities = 127/223 (56%), Positives = 162/223 (72%)
 Frame = -3

Query: 894 PVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFR 715
           P QVAFG+G +SSS++SYG   +  S S  +G   G+  +K Y EPW+YYT YPVTLP R
Sbjct: 68  PTQVAFGYGGSSSSLKSYGH-YNKVSGSMSDGGIGGERVQKEYTEPWDYYTNYPVTLPVR 126

Query: 714 RPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKR 535
           RPYSGNP LLDE+EFGE S + TYDENS+ PA +LGLM+E+ E +M  +QLP ++P++K+
Sbjct: 127 RPYSGNPELLDEEEFGEASRSLTYDENSIKPAMDLGLMEESLEEKMFLVQLP-TMPMLKQ 185

Query: 534 XXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSP 355
                       SKPS+      K CSL ELP GFMGK+LVYKSG +KLKLG+TL+++SP
Sbjct: 186 SIKTEGSEMANSSKPSKA-----KACSLNELPAGFMGKMLVYKSGAVKLKLGETLFNLSP 240

Query: 354 GSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGM 226
           G DC FAQDVVA+NTEEK+   +GEL KR I+TPDVDSL+  +
Sbjct: 241 GMDCSFAQDVVAVNTEEKYCSNIGELTKRIIITPDVDSLLDSI 283


>ref|XP_004144123.1| PREDICTED: uncharacterized protein LOC101209454 [Cucumis sativus]
           gi|449500539|ref|XP_004161125.1| PREDICTED:
           uncharacterized LOC101209454 [Cucumis sativus]
          Length = 293

 Score =  247 bits (630), Expect = 5e-63
 Identities = 121/227 (53%), Positives = 157/227 (69%)
 Frame = -3

Query: 897 APVQVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718
           AP QVAFG G +SS++RSYG  K+      ++G       ++Y  EPW+YY+YYPVTLP 
Sbjct: 68  APTQVAFGSGGSSSTLRSYGVSKAGNRPRNEDGTLPASTSKEYV-EPWDYYSYYPVTLPL 126

Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538
           RRPYSGNP  L+E+EFGE S N TYDEN+   A  LGL++EN EA +LFLQLP  +P++K
Sbjct: 127 RRPYSGNPDSLNEEEFGEASENLTYDENTTTAAMNLGLLEENPEADVLFLQLPPMVPMIK 186

Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358
           +            S+ ++    ++K CS+ ELP G +GKLLVY+SG +KLKLGD +YDVS
Sbjct: 187 QSSSVEDMGSGNSSEQNKASQPRQKTCSMNELPSGSIGKLLVYRSGAVKLKLGDIIYDVS 246

Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
            G DC FAQ+V AIN E K  C+VGEL KRAI+TPDVDS++  + DL
Sbjct: 247 SGMDCGFAQEVAAINVEGKRCCIVGELSKRAILTPDVDSMLKNIEDL 293


>ref|XP_003541303.2| PREDICTED: uncharacterized protein LOC100782982 [Glycine max]
          Length = 318

 Score =  239 bits (609), Expect = 1e-60
 Identities = 116/224 (51%), Positives = 149/224 (66%)
 Frame = -3

Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709
           Q+AFG+G  S+S++SYG P+  +S + +   AS    EK Y+EPW+Y + YPVTLP RRP
Sbjct: 79  QIAFGYGGESTSMKSYGIPRGGSSININQSSASNGAKEKEYQEPWDYDSNYPVTLPLRRP 138

Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529
           YSGNP LLD+ EFGE +    YDEN+ N A EL L++ N EA M F+ LP  LP++K+  
Sbjct: 139 YSGNPALLDDQEFGEAAEPRAYDENASNSAMELDLLEHNPEASMFFINLPTKLPMIKQSA 198

Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349
                     S+P       E+ C L EL  GFMGK+LVYKSG IKLKLGDTLYDVS G 
Sbjct: 199 TAGSSDVNVKSRPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGDTLYDVSSGM 258

Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
            C  AQD+VAINT +KH C +GE+ KR  +TPD+D++I  + DL
Sbjct: 259 KCACAQDLVAINTAQKHCCTIGEISKRVSITPDIDAIIDNLPDL 302


>ref|XP_003550619.1| PREDICTED: uncharacterized protein LOC100802173 [Glycine max]
          Length = 298

 Score =  238 bits (607), Expect = 2e-60
 Identities = 113/224 (50%), Positives = 152/224 (67%)
 Frame = -3

Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709
           Q+AFG+G  S+S++SYG P+  +S + +   AS    EK Y+EPW+YY+ YPVTLP RRP
Sbjct: 75  QIAFGYGGESTSMKSYGIPRGGSSININLSSASSGGKEKEYQEPWDYYSNYPVTLPLRRP 134

Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529
           YSGNP LLD++EF E + + TY+EN+ N   +LGL++EN EA M  + LP  LP++K+  
Sbjct: 135 YSGNPALLDDEEFAEAAQSRTYEENASNSTMDLGLLEENPEASMFLINLPTKLPMIKQSA 194

Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349
                     S P       E+ C L EL  GFMGK+LVYKSG IKLKLG+TLYDVS G 
Sbjct: 195 TAGDKDVNEKSIPHGGSKNVEELCELNELSSGFMGKMLVYKSGAIKLKLGNTLYDVSSGM 254

Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           +C  AQD+VA+NT +KH C +GE+ K   +TPDVD++I  ++DL
Sbjct: 255 NCACAQDLVAVNTAQKHCCTIGEISKHVTITPDVDAIIDNLSDL 298


>ref|XP_007038340.1| DNA binding protein, putative isoform 2 [Theobroma cacao]
           gi|508775585|gb|EOY22841.1| DNA binding protein,
           putative isoform 2 [Theobroma cacao]
          Length = 328

 Score =  237 bits (604), Expect = 5e-60
 Identities = 124/259 (47%), Positives = 165/259 (63%), Gaps = 35/259 (13%)
 Frame = -3

Query: 888 QVAFGHGNASSSI-RSYGTPKSSAS--------RSQDNG--------------------- 799
           Q++FG G  SS++ R+YG+ +   S        RS D+                      
Sbjct: 70  QISFGPGAPSSNLLRAYGSQRGGTSGKSTDSRQRSPDDNDGQIIGSFPSASKEDRTDICS 129

Query: 798 ----KASGQNGEKYYKEPWNYY-TYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDEN 634
               +AS    ++ Y+EPW+Y+ TYYP+TLP RRPYSG+P LLD+ EF E +A   YDE 
Sbjct: 130 SDAIEASAPKIKREYREPWDYHHTYYPITLPLRRPYSGDPELLDQAEFVE-AARKEYDEK 188

Query: 633 SVNPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCS 454
           ++NPA +LGL++E E+ +M F QLPA+LP++KR               S   G  +KGC 
Sbjct: 189 TINPASDLGLLEEGEKGKMFFFQLPANLPVIKRLASTKGKEKAENLGSSERFGALKKGCQ 248

Query: 453 LEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELK 274
           LEELP GFMGK+LVYKSG +KLKLG+TLYDVSPGSDC+FAQDV A+NT EKH CV+GEL 
Sbjct: 249 LEELPGGFMGKMLVYKSGAVKLKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELG 308

Query: 273 KRAIVTPDVDSLISGMADL 217
           KR +VTPD+ S+++ + DL
Sbjct: 309 KRVVVTPDISSVLNSVIDL 327


>ref|XP_006381642.1| DNA-directed RNA polymerase 3 RPC4 family protein [Populus
           trichocarpa] gi|550336350|gb|ERP59439.1| DNA-directed
           RNA polymerase 3 RPC4 family protein [Populus
           trichocarpa]
          Length = 292

 Score =  231 bits (590), Expect = 2e-58
 Identities = 122/227 (53%), Positives = 156/227 (68%), Gaps = 2/227 (0%)
 Frame = -3

Query: 891 VQVAFGHGNASSS-IRSYGT-PKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718
           + +AFG G A++    S+ T  +   S S  N  A G   EK Y EPW+YY+ YPV+LP 
Sbjct: 67  LDIAFGPGAAATKPFPSWSTINRDQGSSSNGNADAPGPR-EKEYIEPWDYYSNYPVSLPM 125

Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538
           RRPYSGN  +LDE+EFGE S   TYDENS N A ELGLM+EN EA MLF+QLP ++P++K
Sbjct: 126 RRPYSGNSAILDEEEFGEVSEAATYDENSTNSAVELGLMEENVEASMLFVQLPPTMPMIK 185

Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358
           R            S+PS      EK C L+ELP G+MGK+LVY+SG +KLKLGDTLYDVS
Sbjct: 186 RSATAVGPEVKESSRPSGGARAIEKTCRLDELPAGYMGKVLVYRSGAVKLKLGDTLYDVS 245

Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           PG + +FAQDVVAIN  E+  CVV E++KR  + PDVD++IS +A++
Sbjct: 246 PGMNSIFAQDVVAINRGEETCCVVAEIEKRVTLIPDVDAIISRVAEM 292


>ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris]
           gi|593783109|ref|XP_007154595.1| hypothetical protein
           PHAVU_003G132000g [Phaseolus vulgaris]
           gi|561027948|gb|ESW26588.1| hypothetical protein
           PHAVU_003G132000g [Phaseolus vulgaris]
           gi|561027949|gb|ESW26589.1| hypothetical protein
           PHAVU_003G132000g [Phaseolus vulgaris]
          Length = 291

 Score =  230 bits (587), Expect = 5e-58
 Identities = 113/224 (50%), Positives = 150/224 (66%)
 Frame = -3

Query: 888 QVAFGHGNASSSIRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPFRRP 709
           Q+AFG+G  S+S++SYG  +   + + +    S    EK Y EPW+YY+ YPVTLP RRP
Sbjct: 70  QIAFGYGGESTSLKSYGIGRGGRNVNINPNSTSSAVAEKEYTEPWDYYSNYPVTLPLRRP 129

Query: 708 YSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVKRXX 529
           YSGNP LLDE+EFGE +   TYDE + N A ELGL++EN EA M  ++LP+ LP++    
Sbjct: 130 YSGNPELLDEEEFGEAAEARTYDEEATNSAMELGLLEENLEANMFLIKLPSKLPIIS--T 187

Query: 528 XXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGS 349
                     SKP     + E+ C L++LP GFMGK+LVYKSG IKLKLG+TLYDVS G 
Sbjct: 188 ADGGKDVNAKSKPPVGTKKGERLCELKDLPSGFMGKMLVYKSGKIKLKLGNTLYDVSSGM 247

Query: 348 DCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           +C F+QDVVAIN  EK  C +GE+ K   +TPD+D ++  ++DL
Sbjct: 248 NCSFSQDVVAINKAEKTLCSIGEISKHVTITPDIDDILDNLSDL 291


>ref|XP_002510979.1| DNA binding protein, putative [Ricinus communis]
           gi|223550094|gb|EEF51581.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 328

 Score =  226 bits (576), Expect = 9e-57
 Identities = 127/257 (49%), Positives = 160/257 (62%), Gaps = 32/257 (12%)
 Frame = -3

Query: 891 VQVAFGHGNASS-SIRSYGTPKSS-------ASRSQDNGK-------------------- 796
           VQVAFG G  SS SIR++G  K            + D+GK                    
Sbjct: 71  VQVAFGPGATSSTSIRTFGVSKGENPVSSGIKDSTDDDGKIVISSLSTDKEDEIINCASE 130

Query: 795 ---ASGQNGEKYYKEPWNY-YTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSV 628
              A     +K Y+EPW+Y  TYYP TLP RRPYSG+PVLLDE EFGE +    YDE+++
Sbjct: 131 DIDALPLKIKKDYREPWDYDRTYYPTTLPLRRPYSGDPVLLDEAEFGEAARKLEYDESTM 190

Query: 627 NPAEELGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLE 448
           NPA +L L++E +  +M+F QLPA LPLVKR            S PS+     +K  SL+
Sbjct: 191 NPASDLELLEECDTEKMIFFQLPAKLPLVKRSASAKGKEKAEGSIPSQGKNAAKKESSLD 250

Query: 447 ELPPGFMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKR 268
            L  G+MGK+LVY+SG +KLKLGDTLYDVS GSDC+FAQDV+AINT  KH C +GEL+KR
Sbjct: 251 GLSAGYMGKMLVYRSGAVKLKLGDTLYDVSQGSDCMFAQDVMAINTAAKHCCTIGELEKR 310

Query: 267 AIVTPDVDSLISGMADL 217
           A+VTPDVDSL+  + +L
Sbjct: 311 AVVTPDVDSLLDSVVNL 327


>ref|XP_004239580.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4-like
           [Solanum lycopersicum]
          Length = 188

 Score =  224 bits (571), Expect = 4e-56
 Identities = 112/186 (60%), Positives = 138/186 (74%)
 Frame = -3

Query: 792 SGQNGEKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEE 613
           +G+  +K Y EPW+YYT YPVTLP RRPYSGNP LLDE+EFGE S + TYDENS+ PA +
Sbjct: 6   NGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEASQSLTYDENSIKPAMD 65

Query: 612 LGLMDENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPG 433
           LGLM+EN E +M  +QLP ++P++K+            SK S+      K CSL ELP G
Sbjct: 66  LGLMEENLEEKMFLVQLP-TMPMLKQSIKTEGSEMANSSKTSKA-----KACSLNELPAG 119

Query: 432 FMGKLLVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTP 253
            MGKLLVYKSG +KLKLG+TL++VSPG DC FAQDVVA+NTEEK+   +GEL KR I+TP
Sbjct: 120 LMGKLLVYKSGAVKLKLGETLFNVSPGMDCSFAQDVVAVNTEEKYCSNIGELTKRIIITP 179

Query: 252 DVDSLI 235
           DVDSL+
Sbjct: 180 DVDSLL 185


>ref|XP_006847937.1| hypothetical protein AMTR_s00029p00131600 [Amborella trichopoda]
           gi|548851242|gb|ERN09518.1| hypothetical protein
           AMTR_s00029p00131600 [Amborella trichopoda]
          Length = 305

 Score =  223 bits (567), Expect = 1e-55
 Identities = 122/236 (51%), Positives = 158/236 (66%), Gaps = 15/236 (6%)
 Frame = -3

Query: 897 APVQVAFGHGNAS--SSIRSYGTPKSSASRSQ-----DNG-------KASGQNGEKYYKE 760
           APVQVAFG+GNA+  SS  SY    SS+   +     D+G       +   +  EK Y E
Sbjct: 67  APVQVAFGYGNAANFSSSSSYSKGGSSSKPKEIGHAFDDGSQLVDVKRDVDEKREKEYVE 126

Query: 759 PWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTY-DENSVNPAEELGLMDENEEA 583
           PW+YY+ YPVTLP RRPYSG+P  LDE EFGE +A+ +  +E+S N AEELGL +E EE 
Sbjct: 127 PWDYYSKYPVTLPLRRPYSGDPETLDEKEFGESAASKSVCNEDSTNAAEELGLKEEREER 186

Query: 582 RMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKS 403
           +++F QLP SLP+ KR            S   RT G+ E    LE+L  GFMGKLL+Y+S
Sbjct: 187 QLVFFQLPESLPIPKRSATADGKEVQDDSGQKRT-GKSEMPSRLEDLQAGFMGKLLIYES 245

Query: 402 GVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLI 235
           G +KLK+GDTL++VSPGS C FAQ+V AINT ++ +CV+GE+ KRAIVTPD+D L+
Sbjct: 246 GAVKLKIGDTLFNVSPGSKCEFAQEVAAINTRDRQYCVLGEINKRAIVTPDIDDLL 301


>ref|XP_007038339.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
           gi|508775584|gb|EOY22840.1| DNA binding protein,
           putative isoform 1 [Theobroma cacao]
          Length = 359

 Score =  223 bits (567), Expect = 1e-55
 Identities = 104/178 (58%), Positives = 133/178 (74%)
 Frame = -3

Query: 750 YYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLF 571
           ++TYYP+TLP RRPYSG+P LLD+ EF E +A   YDE ++NPA +LGL++E E+ +M F
Sbjct: 182 HHTYYPITLPLRRPYSGDPELLDQAEFVE-AARKEYDEKTINPASDLGLLEEGEKGKMFF 240

Query: 570 LQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIK 391
            QLPA+LP++KR               S   G  +KGC LEELP GFMGK+LVYKSG +K
Sbjct: 241 FQLPANLPVIKRLASTKGKEKAENLGSSERFGALKKGCQLEELPGGFMGKMLVYKSGAVK 300

Query: 390 LKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
           LKLG+TLYDVSPGSDC+FAQDV A+NT EKH CV+GEL KR +VTPD+ S+++ + DL
Sbjct: 301 LKLGETLYDVSPGSDCIFAQDVAAVNTTEKHCCVIGELGKRVVVTPDISSVLNSVIDL 358


>ref|XP_006383244.1| hypothetical protein POPTR_0005s12820g, partial [Populus
           trichocarpa] gi|550338826|gb|ERP61041.1| hypothetical
           protein POPTR_0005s12820g, partial [Populus trichocarpa]
          Length = 194

 Score =  219 bits (558), Expect = 1e-54
 Identities = 108/181 (59%), Positives = 132/181 (72%)
 Frame = -3

Query: 777 EKYYKEPWNYYTYYPVTLPFRRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMD 598
           EK Y EPW+YY+ YPV+LP RRPYSGN  +LDE+EFGE S   TYDENS N A ELGLM+
Sbjct: 9   EKEYIEPWDYYSNYPVSLPMRRPYSGNSAILDEEEFGEVSEAATYDENSTNSAVELGLME 68

Query: 597 ENEEARMLFLQLPASLPLVKRXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKL 418
           EN EA MLF+QLP ++P++KR            S+PS      EK C L+ELP G+MGK+
Sbjct: 69  ENVEASMLFVQLPPTMPMIKRSATAVGPEVKESSRPSGGARAIEKTCRLDELPAGYMGKV 128

Query: 417 LVYKSGVIKLKLGDTLYDVSPGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSL 238
           LVY+SG +KLKLGDTLYDVSPG + +FAQDVVAIN  E+  CVV E++KR  + PDVD L
Sbjct: 129 LVYRSGAVKLKLGDTLYDVSPGMNSIFAQDVVAINRGEETCCVVAEIEKRVTLIPDVDKL 188

Query: 237 I 235
           +
Sbjct: 189 L 189


>ref|XP_004287620.1| PREDICTED: uncharacterized protein LOC101290984 [Fragaria vesca
           subsp. vesca]
          Length = 286

 Score =  218 bits (554), Expect = 3e-54
 Identities = 114/227 (50%), Positives = 149/227 (65%), Gaps = 1/227 (0%)
 Frame = -3

Query: 894 PVQVAFGHGNASSS-IRSYGTPKSSASRSQDNGKASGQNGEKYYKEPWNYYTYYPVTLPF 718
           PV+VAFG G  SSS +RSYG P+         G       EK YK P++ Y +YPV+LP 
Sbjct: 67  PVEVAFGSGGQSSSTLRSYGAPRGV----NGGGLNPVIQEEKEYKSPFDIYGHYPVSLPL 122

Query: 717 RRPYSGNPVLLDEDEFGEGSANTTYDENSVNPAEELGLMDENEEARMLFLQLPASLPLVK 538
           R+P S +P +L++ EFG+GS  +TYDEN+   A++L L +EN    M FL LP +LP++K
Sbjct: 123 RQPSSEDPAILNQQEFGDGSEESTYDENATPAADDLDLREENRATSMFFLHLPPTLPMLK 182

Query: 537 RXXXXXXXXXXXXSKPSRTMGRQEKGCSLEELPPGFMGKLLVYKSGVIKLKLGDTLYDVS 358
           +               +R     EK CSL +LP GFMGK+LVY+SG IK+KLGDTLYDVS
Sbjct: 183 QPAGQQVNNSSGAPGGARNT---EKPCSLGDLPAGFMGKMLVYRSGAIKMKLGDTLYDVS 239

Query: 357 PGSDCVFAQDVVAINTEEKHFCVVGELKKRAIVTPDVDSLISGMADL 217
            G +C FAQDVVAINT EK  C +GEL KRA+VTPD+DS+++ + DL
Sbjct: 240 TGMNCDFAQDVVAINTTEKKCCTIGELNKRAVVTPDIDSVLNSLEDL 286


Top