BLASTX nr result

ID: Akebia27_contig00007018

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00007018
         (843 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007211940.1| hypothetical protein PRUPE_ppa010773mg [Prun...   218   3e-54
ref|XP_007211941.1| hypothetical protein PRUPE_ppa010773mg [Prun...   207   3e-51
ref|XP_002274855.1| PREDICTED: uncharacterized protein LOC100246...   206   7e-51
ref|XP_004134991.1| PREDICTED: uncharacterized protein LOC101213...   204   3e-50
ref|XP_004158834.1| PREDICTED: DNA-directed RNA polymerase II su...   204   4e-50
ref|XP_006373571.1| hypothetical protein POPTR_0016s00490g [Popu...   192   2e-46
ref|XP_007026413.1| DNA-directed RNA polymerase II subunit rpb4,...   189   1e-45
gb|EXB64613.1| Calcineurin B-like protein 3 [Morus notabilis]         187   4e-45
ref|XP_006350243.1| PREDICTED: DNA-directed RNA polymerases IV a...   186   1e-44
ref|XP_002525033.1| conserved hypothetical protein [Ricinus comm...   185   2e-44
ref|XP_006467223.1| PREDICTED: DNA-directed RNA polymerases IV a...   179   1e-42
ref|XP_006449991.1| hypothetical protein CICLE_v10016499mg [Citr...   178   3e-42
ref|XP_004236644.1| PREDICTED: uncharacterized protein LOC101254...   175   2e-41
ref|XP_007026415.1| DNA-directed RNA polymerase II subunit rpb4,...   174   4e-41
ref|XP_006467222.1| PREDICTED: DNA-directed RNA polymerases IV a...   170   5e-40
ref|XP_004486377.1| PREDICTED: uncharacterized protein LOC101499...   170   5e-40
ref|XP_006449990.1| hypothetical protein CICLE_v10016499mg [Citr...   169   1e-39
ref|XP_007147572.1| hypothetical protein PHAVU_006G135800g, part...   169   2e-39
ref|XP_006398486.1| hypothetical protein EUTSA_v10001026mg [Eutr...   167   3e-39
ref|XP_003594343.1| DNA-directed RNA polymerase II subunit rpb4 ...   167   3e-39

>ref|XP_007211940.1| hypothetical protein PRUPE_ppa010773mg [Prunus persica]
           gi|462407805|gb|EMJ13139.1| hypothetical protein
           PRUPE_ppa010773mg [Prunus persica]
          Length = 225

 Score =  218 bits (554), Expect = 3e-54
 Identities = 130/235 (55%), Positives = 156/235 (66%), Gaps = 6/235 (2%)
 Frame = +2

Query: 95  MAEKGGKGFSL---KARKSSVK-TPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSP 256
           M+EKGGKGFSL   KA KSS+K TPS K ASLKGKD SS  +KKGR V F    DSEG  
Sbjct: 1   MSEKGGKGFSLPTGKAVKSSLKSTPSTKDASLKGKDDSSTKSKKGRKVQF----DSEGLH 56

Query: 257 RAKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAK 436
             K + +SK    F +    S                   +   K P   ELKIE+EL +
Sbjct: 57  EPKSNFSSK----FDNPAAASGKDWGKGGKGDKV-----GNGRKKEPQPLELKIEQELPQ 107

Query: 437 NATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLE 616
           +A CLMDCEAA +LQGIQEQ+++LS+D TIK+P SFDKGLQYAK + HYTNPQSVR+VLE
Sbjct: 108 SAKCLMDCEAADILQGIQEQMILLSKDPTIKIPVSFDKGLQYAKRTSHYTNPQSVRKVLE 167

Query: 617 TLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
           TL  + +SDGEI +IAN+CPET DEVFAL+PSLK  R  + + +  VL EL KLK
Sbjct: 168 TLTKYGVSDGEISVIANVCPETTDEVFALVPSLKTTRSVLSQPLKEVLSELTKLK 222


>ref|XP_007211941.1| hypothetical protein PRUPE_ppa010773mg [Prunus persica]
           gi|462407806|gb|EMJ13140.1| hypothetical protein
           PRUPE_ppa010773mg [Prunus persica]
          Length = 237

 Score =  207 bits (528), Expect = 3e-51
 Identities = 129/247 (52%), Positives = 156/247 (63%), Gaps = 18/247 (7%)
 Frame = +2

Query: 95  MAEKGGKGFSL---KARKSSVK-TPSAKG------------ASLKGKDGSS--NKKGRSV 220
           M+EKGGKGFSL   KA KSS+K TPS K             +SLKGKD SS  +KKGR V
Sbjct: 1   MSEKGGKGFSLPTGKAVKSSLKSTPSTKDGLLFLRSLLFLTSSLKGKDDSSTKSKKGRKV 60

Query: 221 HFENSSDSEGSPRAKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPT 400
            F    DSEG    K + +SK    F +    S                   +   K P 
Sbjct: 61  QF----DSEGLHEPKSNFSSK----FDNPAAASGKDWGKGGKGDKV-----GNGRKKEPQ 107

Query: 401 IPELKIEEELAKNATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDH 580
             ELKIE+EL ++A CLMDCEAA +LQGIQEQ+++LS+D TIK+P SFDKGLQYAK + H
Sbjct: 108 PLELKIEQELPQSAKCLMDCEAADILQGIQEQMILLSKDPTIKIPVSFDKGLQYAKRTSH 167

Query: 581 YTNPQSVRRVLETLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVL 760
           YTNPQSVR+VLETL  + +SDGEI +IAN+CPET DEVFAL+PSLK  R  + + +  VL
Sbjct: 168 YTNPQSVRKVLETLTKYGVSDGEISVIANVCPETTDEVFALVPSLKTTRSVLSQPLKEVL 227

Query: 761 HELAKLK 781
            EL KLK
Sbjct: 228 SELTKLK 234


>ref|XP_002274855.1| PREDICTED: uncharacterized protein LOC100246461 isoform 1 [Vitis
           vinifera] gi|359495604|ref|XP_003635035.1| PREDICTED:
           uncharacterized protein LOC100246461 isoform 2 [Vitis
           vinifera] gi|297736694|emb|CBI25730.3| unnamed protein
           product [Vitis vinifera]
          Length = 239

 Score =  206 bits (525), Expect = 7e-51
 Identities = 117/231 (50%), Positives = 150/231 (64%), Gaps = 2/231 (0%)
 Frame = +2

Query: 95  MAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKI 268
           M EKGGKGFSL +      T S   ASL GKD S+  +K+GR V F N     G P A++
Sbjct: 1   MGEKGGKGFSLNSNLGKSSTKSPLEASLTGKDDSAAKSKRGRKVQFNNG----GLPEARL 56

Query: 269 SVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATC 448
           + +  SGGK  ++ P++                   S +PK P   EL+IE+EL KNA C
Sbjct: 57  TSSLMSGGK--TDIPIAKGDLSKGGKGGRVLNGE-KSAVPKPPAPLELRIEQELPKNAKC 113

Query: 449 LMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKS 628
           +MDCEAA++L+GIQEQ+V+LSED TIK+P SF+KGLQYA+SS+ Y +PQSVR VLE L  
Sbjct: 114 MMDCEAALILKGIQEQMVVLSEDPTIKIPLSFNKGLQYAQSSNCYASPQSVRLVLEPLSK 173

Query: 629 HNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
           + +SDGEIC+IAN CPET DEVFAL+PSLK     +   +  VL  LA+LK
Sbjct: 174 YGVSDGEICVIANTCPETIDEVFALVPSLKPKWSTLREPLKDVLRGLAELK 224


>ref|XP_004134991.1| PREDICTED: uncharacterized protein LOC101213971 [Cucumis sativus]
          Length = 242

 Score =  204 bits (520), Expect = 3e-50
 Identities = 120/236 (50%), Positives = 154/236 (65%), Gaps = 6/236 (2%)
 Frame = +2

Query: 95  MAEKGGKGFSLKAR--KSSVKTPSAKGASLKGKDGSSNK--KGRSVHFENSSDSEGSPRA 262
           M+EKG KGFS++ R  KSS+K+ + K ASLKGKD S +K  KGR V F    D++GS  A
Sbjct: 1   MSEKGEKGFSVQKRPAKSSLKSSALKDASLKGKDDSLSKLKKGRKVQF----DAQGSVDA 56

Query: 263 KISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNA 442
             + + K  GK                           +++ K P   ELKIE+EL KN 
Sbjct: 57  TNTFSMKYSGKNGD-------------LGKGGKGANTKASIAKEPQALELKIEQELPKNV 103

Query: 443 TC--LMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLE 616
            C  LMDCEAA +LQGIQ+Q+V LS D TIK+P SFD+GLQYAK ++HY N +SVR VLE
Sbjct: 104 KCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVNAESVRPVLE 163

Query: 617 TLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLKS 784
           TLK + ++D EIC+IAN+CP+T DEVFAL+PSLK  R ++   IN+VL ELAK+KS
Sbjct: 164 TLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKRKRSKLSEPINNVLSELAKVKS 219


>ref|XP_004158834.1| PREDICTED: DNA-directed RNA polymerase II subunit rpb4-like
           [Cucumis sativus]
          Length = 220

 Score =  204 bits (518), Expect = 4e-50
 Identities = 120/236 (50%), Positives = 153/236 (64%), Gaps = 6/236 (2%)
 Frame = +2

Query: 95  MAEKGGKGFSL--KARKSSVKTPSAKGASLKGKDGSSNK--KGRSVHFENSSDSEGSPRA 262
           M+EKG KGFS+  K  KSS+K+ + K ASLKGKD S +K  KGR V F    D++GS  A
Sbjct: 1   MSEKGEKGFSVQKKPAKSSLKSSALKDASLKGKDDSLSKLKKGRKVQF----DAQGSVDA 56

Query: 263 KISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNA 442
             + + K  GK                           +++ K P   ELKIE+EL KN 
Sbjct: 57  TNTFSMKYSGKNGD-------------LGKGGKGANTKASIAKEPQALELKIEQELPKNV 103

Query: 443 TC--LMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLE 616
            C  LMDCEAA +LQGIQ+Q+V LS D TIK+P SFD+GLQYAK ++HY N +SVR VLE
Sbjct: 104 KCQCLMDCEAAQLLQGIQDQMVFLSADPTIKIPTSFDRGLQYAKRANHYVNAESVRPVLE 163

Query: 617 TLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLKS 784
           TLK + ++D EIC+IAN+CP+T DEVFAL+PSLK  R ++   IN+VL ELAK+KS
Sbjct: 164 TLKKYGVTDSEICVIANVCPDTTDEVFALLPSLKRKRSKLSEPINNVLSELAKVKS 219


>ref|XP_006373571.1| hypothetical protein POPTR_0016s00490g [Populus trichocarpa]
           gi|550320481|gb|ERP51368.1| hypothetical protein
           POPTR_0016s00490g [Populus trichocarpa]
          Length = 197

 Score =  192 bits (487), Expect = 2e-46
 Identities = 115/230 (50%), Positives = 143/230 (62%), Gaps = 1/230 (0%)
 Frame = +2

Query: 95  MAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSSNKKGRSVHFENSSD-SEGSPRAKIS 271
           M + GGKGFSL +++      S K +S KGKD S+N  GR +HFE+  + S+G    KI+
Sbjct: 1   MEKGGGKGFSLPSKEPK---SSLKSSSTKGKDDSNN--GRKIHFESEGNLSKGGKGGKIA 55

Query: 272 VTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCL 451
               +GGK                           S + K P   ELKIE EL +NA  L
Sbjct: 56  ----NGGK---------------------------SPMTKEPPPLELKIESELPQNAKPL 84

Query: 452 MDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSH 631
           MDCEAA +LQGIQ+Q+V+LS+D TIKLP SFDKGLQYAK+  HYTNPQSVRRVLE L+ +
Sbjct: 85  MDCEAAQILQGIQDQMVLLSQDPTIKLPVSFDKGLQYAKNGAHYTNPQSVRRVLEALRKY 144

Query: 632 NISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
            +SDGEI LIAN+ PETADE FAL+PSLK     +   +  +L ELAK K
Sbjct: 145 GVSDGEISLIANVFPETADEAFALVPSLKSKASTLREPLKDILGELAKFK 194


>ref|XP_007026413.1| DNA-directed RNA polymerase II subunit rpb4, putative isoform 1
           [Theobroma cacao] gi|590627315|ref|XP_007026414.1|
           DNA-directed RNA polymerase II subunit rpb4, putative
           isoform 1 [Theobroma cacao] gi|508781779|gb|EOY29035.1|
           DNA-directed RNA polymerase II subunit rpb4, putative
           isoform 1 [Theobroma cacao] gi|508781780|gb|EOY29036.1|
           DNA-directed RNA polymerase II subunit rpb4, putative
           isoform 1 [Theobroma cacao]
          Length = 231

 Score =  189 bits (479), Expect = 1e-45
 Identities = 113/235 (48%), Positives = 146/235 (62%), Gaps = 6/235 (2%)
 Frame = +2

Query: 95  MAEKGGKGFSLKAR---KSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPR 259
           M+EKGGKGFSL  +   KS++K+  A   +  GKD +S  +K+GR V F      EG P 
Sbjct: 1   MSEKGGKGFSLPTKTTPKSALKSTPASATARHGKDDNSAKSKRGRKVQF----GMEGLPN 56

Query: 260 AKISVTS-KSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAK 436
              + +S KS GKF+   PV                    + + K     EL++E+EL +
Sbjct: 57  LGFNFSSPKSDGKFA--IPVGKGDWAKGGKGEKVVNGG-KAPVAKEAKSLELRVEQELPE 113

Query: 437 NATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLE 616
           N  CLMDCEAA +L+GIQEQ+V+LS+DSTIKLP SF  GLQYAK+  +YTNPQSVRRVLE
Sbjct: 114 NVKCLMDCEAANILEGIQEQMVMLSQDSTIKLPESFHLGLQYAKTRSYYTNPQSVRRVLE 173

Query: 617 TLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
            L  + +S  EIC+IAN CPET DEVFAL+ SL+  + R+   +  VL EL KLK
Sbjct: 174 ALSKYGVSYSEICVIANTCPETVDEVFALVRSLEAKKSRLSEPLKDVLDELGKLK 228


>gb|EXB64613.1| Calcineurin B-like protein 3 [Morus notabilis]
          Length = 500

 Score =  187 bits (475), Expect = 4e-45
 Identities = 117/249 (46%), Positives = 150/249 (60%), Gaps = 7/249 (2%)
 Frame = +2

Query: 56  HFLVQL--KLL*IAAMAEKGGKGFSL---KARKSSVKTPSAKGASLKGKDGSS--NKKGR 214
           H++ +L   L  I AM +K GK F++   K  KSS+K+ S K  +LKGKD SS  +KKGR
Sbjct: 218 HYIPKLCIPLTVIVAMGDKAGKNFTMPQKKGVKSSLKSSSGKEGTLKGKDDSSAKSKKGR 277

Query: 215 SVHFENSSDSEGSPRAKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKV 394
            V F+      G    K + +SK GGK   +TP +                   S++PK 
Sbjct: 278 KVQFD-----AGVAEPKSNFSSKYGGK--GDTPTAFPKGKGDRLFNSG-----KSSVPKT 325

Query: 395 PTIPELKIEEELAKNATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSS 574
           P   EL+IE+EL KN  CL+ CEAA +LQGIQE +V LS+D TIK P SFD+GLQYAK  
Sbjct: 326 PQPLELRIEDELPKNVKCLLACEAAEILQGIQEHMVFLSKDPTIKTPVSFDRGLQYAKRG 385

Query: 575 DHYTNPQSVRRVLETLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINH 754
            HYTN  S+R+VLE           IC+IA++CPETADEVFAL+PSLK N+  +   +  
Sbjct: 386 SHYTNALSIRKVLE------YPQFIICIIADVCPETADEVFALVPSLKANKSLLREPVKK 439

Query: 755 VLHELAKLK 781
           VL ELAKLK
Sbjct: 440 VLLELAKLK 448


>ref|XP_006350243.1| PREDICTED: DNA-directed RNA polymerases IV and V subunit 4-like
           [Solanum tuberosum]
          Length = 220

 Score =  186 bits (471), Expect = 1e-44
 Identities = 112/232 (48%), Positives = 150/232 (64%), Gaps = 3/232 (1%)
 Frame = +2

Query: 95  MAEKGGKGFSL-KARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAK 265
           MAEKGGKGFSL K+ KS++K+P++KG     KD SS  +K+GR V F    DSEGS    
Sbjct: 1   MAEKGGKGFSLPKSGKSALKSPASKG-----KDDSSVKSKRGRKVQF----DSEGSLDTN 51

Query: 266 ISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNAT 445
              ++KS GK  ++ P S                   S   K P   EL++E+EL  N+T
Sbjct: 52  ---STKSNGK--ADIP-SLKGDLGKAGKGEKAGSAGKSQKAKAPDPLELRVEQELPANST 105

Query: 446 CLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLK 625
           CLMDCEAA +LQGIQE +V+LS+D  IKLP SFD+GL YA+ +  Y NPQ+V+++LE LK
Sbjct: 106 CLMDCEAADILQGIQENMVVLSDDPAIKLPVSFDRGLAYAQRNRLYDNPQAVKQILEPLK 165

Query: 626 SHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
            H +SDGE+C+IAN   E+ DEVFAL+PS K  + ++   + +VL ELAKL+
Sbjct: 166 QHGVSDGELCMIANFPLESVDEVFALVPSFKNKKSKLRVPLENVLAELAKLR 217


>ref|XP_002525033.1| conserved hypothetical protein [Ricinus communis]
           gi|223535695|gb|EEF37360.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 203

 Score =  185 bits (470), Expect = 2e-44
 Identities = 113/232 (48%), Positives = 145/232 (62%), Gaps = 5/232 (2%)
 Frame = +2

Query: 101 EKGGKGFSL--KARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSD-SEGSPRAK 265
           EKGGKGFSL  K  KSS+K  S   AS KGKD +S  +K+G+ V F +  + S+G    K
Sbjct: 2   EKGGKGFSLPGKGLKSSLK--SITPASTKGKDDTSAKSKRGKKVQFNSQGNMSKGGKGDK 59

Query: 266 ISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNAT 445
           +S    +G K SS                            K P   ELKIE++L KNA 
Sbjct: 60  VS----NGVKISST---------------------------KEPQPLELKIEQDLPKNAK 88

Query: 446 CLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLK 625
           CLMDCEAA VLQGIQEQ+V+LS D TIKLP SFD+ LQ+A++   +TNPQSVRR+LE LK
Sbjct: 89  CLMDCEAAQVLQGIQEQMVLLSRDPTIKLPVSFDRALQHARTGARFTNPQSVRRILEGLK 148

Query: 626 SHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
            H +S+GEIC IAN+CP+  DEVFAL+PSLK  +  +   +  +L +L++LK
Sbjct: 149 KHGVSEGEICTIANVCPDGVDEVFALVPSLKSKKNVLREPLKDILGQLSELK 200


>ref|XP_006467223.1| PREDICTED: DNA-directed RNA polymerases IV and V subunit 4-like
           isoform X2 [Citrus sinensis]
          Length = 225

 Score =  179 bits (454), Expect = 1e-42
 Identities = 105/229 (45%), Positives = 143/229 (62%), Gaps = 3/229 (1%)
 Frame = +2

Query: 107 GGKGFSLKAR-KSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKISVT 277
           GGKG  +    K+S+K+  A   SL GKD +S  +K+GR V F     SEG    K + +
Sbjct: 6   GGKGGGVGGGGKTSLKSIPA---SLGGKDDNSAKSKRGRKVQFNTEGLSEG----KFTFS 58

Query: 278 SKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCLMD 457
           SKS GKF  ET                      S + +   + EL++E+EL KNA CLMD
Sbjct: 59  SKSDGKF--ETTYGKGGLTKGGKGDKVANGAKVSVVKEALPL-ELRVEQELPKNAKCLMD 115

Query: 458 CEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSHNI 637
           CEAA +L+GIQEQ+ +LS D TIK+P SFDKGL YAK+  H+TNPQ+V+ + ++L  H +
Sbjct: 116 CEAAHILEGIQEQMALLSADPTIKIPVSFDKGLLYAKTHSHFTNPQAVKGLFQSLSEHGV 175

Query: 638 SDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLKS 784
           +DGEIC+IAN+CPET +E +A++PSLK  R R+   +  VL +LAK KS
Sbjct: 176 TDGEICVIANICPETVEEAYAIVPSLKAKRSRLNDLLKEVLIQLAKFKS 224


>ref|XP_006449991.1| hypothetical protein CICLE_v10016499mg [Citrus clementina]
           gi|557552602|gb|ESR63231.1| hypothetical protein
           CICLE_v10016499mg [Citrus clementina]
          Length = 225

 Score =  178 bits (451), Expect = 3e-42
 Identities = 103/228 (45%), Positives = 138/228 (60%), Gaps = 2/228 (0%)
 Frame = +2

Query: 107 GGKGFSLKARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKISVTS 280
           GG G   K    S+ T      SL GKD +S  +K+GR V F     SEG    K + +S
Sbjct: 10  GGVGGGGKTSLKSIPT------SLGGKDDNSAKSKRGRKVQFNTEGLSEG----KFTFSS 59

Query: 281 KSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCLMDC 460
           KS GKF  ET                      S + +   + EL++E+EL KNA CLMDC
Sbjct: 60  KSDGKF--ETTCGKGGLTKGGKGDKVANGAKVSVVKEALPL-ELRVEQELPKNAKCLMDC 116

Query: 461 EAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSHNIS 640
           EAA +L+GIQEQ+ +LS D TIK+P SFDKGL YAK+  H+TNPQ+V+ + ++L  H ++
Sbjct: 117 EAAHILEGIQEQMALLSADPTIKIPVSFDKGLLYAKTHSHFTNPQAVKGLFQSLSEHGVT 176

Query: 641 DGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLKS 784
           DGEIC+IAN+CPET +E +A++PSLK  R R+   +  VL +LAK KS
Sbjct: 177 DGEICVIANICPETVEEAYAIVPSLKAKRSRLNDLLKEVLIQLAKFKS 224


>ref|XP_004236644.1| PREDICTED: uncharacterized protein LOC101254374 [Solanum
           lycopersicum]
          Length = 220

 Score =  175 bits (444), Expect = 2e-41
 Identities = 104/230 (45%), Positives = 141/230 (61%), Gaps = 1/230 (0%)
 Frame = +2

Query: 95  MAEKGGKGFSL-KARKSSVKTPSAKGASLKGKDGSSNKKGRSVHFENSSDSEGSPRAKIS 271
           MAEKGGKGFSL K+ KSS+K+P++KG   K    + +K+GR V F    DSEGS      
Sbjct: 1   MAEKGGKGFSLPKSGKSSLKSPASKG---KDDSSAKSKRGRKVQF----DSEGSLDTN-- 51

Query: 272 VTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCL 451
            ++KS GK  ++ P S                   S   K P   EL++E+EL+   TC+
Sbjct: 52  -STKSNGK--ADIP-SFKGDLGKAGKGEKAGSAGKSQKAKAPDPLELRVEQELSTKTTCM 107

Query: 452 MDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSH 631
           MDCEAA +LQGIQE +V+LS+D  IKLP SFD+GL Y +    Y NPQ+V ++L  LK H
Sbjct: 108 MDCEAADILQGIQENMVVLSDDPAIKLPVSFDRGLAYGQRIRLYDNPQAVEQILGPLKQH 167

Query: 632 NISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
            +SDGE+C+IAN   E+ DEVFA +PS K  + ++   + +VL EL KL+
Sbjct: 168 GVSDGELCMIANFPLESVDEVFAFVPSFKNRKSKLRVPLENVLAELTKLR 217


>ref|XP_007026415.1| DNA-directed RNA polymerase II subunit rpb4, putative isoform 3,
           partial [Theobroma cacao] gi|508781781|gb|EOY29037.1|
           DNA-directed RNA polymerase II subunit rpb4, putative
           isoform 3, partial [Theobroma cacao]
          Length = 214

 Score =  174 bits (441), Expect = 4e-41
 Identities = 102/218 (46%), Positives = 135/218 (61%), Gaps = 4/218 (1%)
 Frame = +2

Query: 95  MAEKGGKGFSLKAR---KSSVKTPSAKGASLKGKDGSSNKKGRSVHFENSSDSEGSPRAK 265
           M+EKGGKGFSL  +   KS++K+  A   +    + + +K+GR V F      EG P   
Sbjct: 1   MSEKGGKGFSLPTKTTPKSALKSTPASATARHDDNSAKSKRGRKVQF----GMEGLPNLG 56

Query: 266 ISVTS-KSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNA 442
            + +S KS GKF+   PV                    + + K     EL++E+EL +N 
Sbjct: 57  FNFSSPKSDGKFA--IPVGKGDWAKGGKGEKVVNGG-KAPVAKEAKSLELRVEQELPENV 113

Query: 443 TCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETL 622
            CLMDCEAA +L+GIQEQ+V+LS+DSTIKLP SF  GLQYAK+  +YTNPQSVRRVLE L
Sbjct: 114 KCLMDCEAANILEGIQEQMVMLSQDSTIKLPESFHLGLQYAKTRSYYTNPQSVRRVLEAL 173

Query: 623 KSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRI 736
             + +S  EIC+IAN CPET DEVFAL+ SL+  + R+
Sbjct: 174 SKYGVSYSEICVIANTCPETVDEVFALVRSLEAKKSRL 211


>ref|XP_006467222.1| PREDICTED: DNA-directed RNA polymerases IV and V subunit 4-like
           isoform X1 [Citrus sinensis]
          Length = 237

 Score =  170 bits (431), Expect = 5e-40
 Identities = 105/241 (43%), Positives = 143/241 (59%), Gaps = 15/241 (6%)
 Frame = +2

Query: 107 GGKGFSLKAR-KSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKISVT 277
           GGKG  +    K+S+K+  A   SL GKD +S  +K+GR V F     SEG    K + +
Sbjct: 6   GGKGGGVGGGGKTSLKSIPA---SLGGKDDNSAKSKRGRKVQFNTEGLSEG----KFTFS 58

Query: 278 SKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCLMD 457
           SKS GKF  ET                      S + +   + EL++E+EL KNA CLMD
Sbjct: 59  SKSDGKF--ETTYGKGGLTKGGKGDKVANGAKVSVVKEALPL-ELRVEQELPKNAKCLMD 115

Query: 458 CEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSHNI 637
           CEAA +L+GIQEQ+ +LS D TIK+P SFDKGL YAK+  H+TNPQ+V+ + ++L  H +
Sbjct: 116 CEAAHILEGIQEQMALLSADPTIKIPVSFDKGLLYAKTHSHFTNPQAVKGLFQSLSEHGV 175

Query: 638 SDGEICLIANLCPETADEVFALIPSLK------------VNRLRIERAINHVLHELAKLK 781
           +DGEIC+IAN+CPET +E +A++PSLK              R R+   +  VL +LAK K
Sbjct: 176 TDGEICVIANICPETVEEAYAIVPSLKRMVDERICELFQAKRSRLNDLLKEVLIQLAKFK 235

Query: 782 S 784
           S
Sbjct: 236 S 236


>ref|XP_004486377.1| PREDICTED: uncharacterized protein LOC101499719 isoform X1 [Cicer
           arietinum]
          Length = 240

 Score =  170 bits (431), Expect = 5e-40
 Identities = 114/255 (44%), Positives = 152/255 (59%), Gaps = 11/255 (4%)
 Frame = +2

Query: 50  GLHFLV----QLKLL*IAAMAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSSNK--KG 211
           G+HFL     +LK + +AAM++KGGKG SL ++            SLKGKD S+ K  KG
Sbjct: 30  GVHFLCFKFCRLKQV-LAAMSDKGGKGGSLLSK-----------GSLKGKDDSATKSAKG 77

Query: 212 RSVHFENSSD-SEGSPRAKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLP 388
           R V F   +D ++G    K++    +GGK S+                            
Sbjct: 78  RKVQFSKEADFTKGGKGDKVA----NGGKSSAS--------------------------- 106

Query: 389 KVPTIPELKIEEELAKNATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAK 568
           K P   E ++++ L +N  CLMDCEAAV+LQGIQ+ +V+LS D +IK+P SFDKGL YAK
Sbjct: 107 KDPHQFEHRVDQGLPENFKCLMDCEAAVILQGIQDHMVMLSRDPSIKIPVSFDKGLSYAK 166

Query: 569 SSDHYTNPQSVRRVLETLKSHNISDGEICLIANLCPETADEVFALIPSLK----VNRLRI 736
           SS  Y+N +SVRR LE L  H +++ EI +IAN+CPETADEVFAL+PSLK    +N L I
Sbjct: 167 SSSKYSNHESVRRTLEPLMDHGLTESEISVIANVCPETADEVFALLPSLKGKRGINSLPI 226

Query: 737 ERAINHVLHELAKLK 781
           E++    L ELAKLK
Sbjct: 227 EKS----LSELAKLK 237


>ref|XP_006449990.1| hypothetical protein CICLE_v10016499mg [Citrus clementina]
           gi|557552601|gb|ESR63230.1| hypothetical protein
           CICLE_v10016499mg [Citrus clementina]
          Length = 237

 Score =  169 bits (428), Expect = 1e-39
 Identities = 103/240 (42%), Positives = 138/240 (57%), Gaps = 14/240 (5%)
 Frame = +2

Query: 107 GGKGFSLKARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKISVTS 280
           GG G   K    S+ T      SL GKD +S  +K+GR V F     SEG    K + +S
Sbjct: 10  GGVGGGGKTSLKSIPT------SLGGKDDNSAKSKRGRKVQFNTEGLSEG----KFTFSS 59

Query: 281 KSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKNATCLMDC 460
           KS GKF  ET                      S + +   + EL++E+EL KNA CLMDC
Sbjct: 60  KSDGKF--ETTCGKGGLTKGGKGDKVANGAKVSVVKEALPL-ELRVEQELPKNAKCLMDC 116

Query: 461 EAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLETLKSHNIS 640
           EAA +L+GIQEQ+ +LS D TIK+P SFDKGL YAK+  H+TNPQ+V+ + ++L  H ++
Sbjct: 117 EAAHILEGIQEQMALLSADPTIKIPVSFDKGLLYAKTHSHFTNPQAVKGLFQSLSEHGVT 176

Query: 641 DGEICLIANLCPETADEVFALIPSLK------------VNRLRIERAINHVLHELAKLKS 784
           DGEIC+IAN+CPET +E +A++PSLK              R R+   +  VL +LAK KS
Sbjct: 177 DGEICVIANICPETVEEAYAIVPSLKRMVDERICELFQAKRSRLNDLLKEVLIQLAKFKS 236


>ref|XP_007147572.1| hypothetical protein PHAVU_006G135800g, partial [Phaseolus
           vulgaris] gi|561020795|gb|ESW19566.1| hypothetical
           protein PHAVU_006G135800g, partial [Phaseolus vulgaris]
          Length = 242

 Score =  169 bits (427), Expect = 2e-39
 Identities = 101/234 (43%), Positives = 134/234 (57%), Gaps = 2/234 (0%)
 Frame = +2

Query: 86  IAAMAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSSNK--KGRSVHFENSSDSEGSPR 259
           + AM+EKGGKG SL ++            S+KGKD S+ K  KGR V F      E    
Sbjct: 29  VFAMSEKGGKGGSLLSKGGL--------GSMKGKDDSATKSAKGRRVQFSKDGPYESGIS 80

Query: 260 AKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKN 439
           +    + KSGGK      V+                   S+  K     E +++++L +N
Sbjct: 81  SSSHSSLKSGGKGGKGDKVANGG---------------KSSQSKDSQSSEQRVDQKLPEN 125

Query: 440 ATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVLET 619
             CLMDCEAA VLQGIQ+Q+++LS DS IK+P  F+KGLQYAK++  YTN QSVR VLE 
Sbjct: 126 IKCLMDCEAADVLQGIQDQMIMLSRDSNIKMPTPFEKGLQYAKNNSKYTNAQSVRHVLEP 185

Query: 620 LKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
           L ++ ++D EIC+I N+CPET DEVFAL+P LK  R      +   L ELAKL+
Sbjct: 186 LANNGLTDSEICVIGNVCPETIDEVFALLPPLKGKRNVQREVLEDSLSELAKLR 239


>ref|XP_006398486.1| hypothetical protein EUTSA_v10001026mg [Eutrema salsugineum]
           gi|557099575|gb|ESQ39939.1| hypothetical protein
           EUTSA_v10001026mg [Eutrema salsugineum]
          Length = 208

 Score =  167 bits (424), Expect = 3e-39
 Identities = 99/236 (41%), Positives = 140/236 (59%), Gaps = 7/236 (2%)
 Frame = +2

Query: 95  MAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSS--NKKGRSVHFENSSDSEGSPRAKI 268
           M+EKGGKG      KSS+K+PS  G+   GKD +S  +KK R V F    D  G+  +K 
Sbjct: 1   MSEKGGKGI-----KSSLKSPSKYGS---GKDDNSTKSKKTRKVQF----DPLGTSDSKY 48

Query: 269 SVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIP-----ELKIEEELA 433
            +   S  +F   +                      +N  K+         ELK E+EL 
Sbjct: 49  KIVQDSDYQFQGSS----------------AKGGKGTNAKKITRSKESQPLELKTEKELP 92

Query: 434 KNATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDHYTNPQSVRRVL 613
           +NA CLMDCEA  +L+GI+EQL +LS+D +IK+P SF++GL+Y K    YTNPQS R++L
Sbjct: 93  ENAKCLMDCEAFQILEGIKEQLAVLSDDPSIKIPVSFNRGLEYLKLGSCYTNPQSARQIL 152

Query: 614 ETLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLK 781
           E LK H +S+GE+C+IAN+CPE+ DEVFA +PS+K  + +I + +   L +L+KLK
Sbjct: 153 EPLKKHGVSEGELCMIANVCPESIDEVFAFVPSMKGKKDKISQPLEEALTKLSKLK 208


>ref|XP_003594343.1| DNA-directed RNA polymerase II subunit rpb4 [Medicago truncatula]
           gi|124360902|gb|ABN08874.1| RNA polymerase Rpb4
           [Medicago truncatula] gi|355483391|gb|AES64594.1|
           DNA-directed RNA polymerase II subunit rpb4 [Medicago
           truncatula]
          Length = 223

 Score =  167 bits (424), Expect = 3e-39
 Identities = 107/238 (44%), Positives = 138/238 (57%), Gaps = 4/238 (1%)
 Frame = +2

Query: 86  IAAMAEKGGKGFSLKARKSSVKTPSAKGASLKGKDGSSNK--KGRSVHFENSSDSEGSPR 259
           +  M++KGGKG SL +          KG  LKGKD S+ K  K R V F     SEG   
Sbjct: 19  VLTMSDKGGKGGSLLS----------KGGGLKGKDDSATKSAKARKVQF-----SEGLFE 63

Query: 260 AKISVTSKSGGKFSSETPVSXXXXXXXXXXXXXXXXXXNSNLPKVPTIPELKIEEELAKN 439
           ++ S    SGGK                           S+  K P   E ++++EL +N
Sbjct: 64  SR-SNGPTSGGKGDKVA------------------NGGKSSAAKDPHQFEHRVDQELPEN 104

Query: 440 ATCLMDCEAAVVLQGIQEQLVILSEDSTIKLPGSFDKGLQYAKSSDH--YTNPQSVRRVL 613
             CLMDCEAAV+LQGIQ+Q+V LS D +IK+P SFDKGL YAKSS    Y+NP+SVR  L
Sbjct: 105 FKCLMDCEAAVMLQGIQDQMVALSRDPSIKMPASFDKGLYYAKSSSSSKYSNPESVRHTL 164

Query: 614 ETLKSHNISDGEICLIANLCPETADEVFALIPSLKVNRLRIERAINHVLHELAKLKSI 787
           E L +H++++ EIC+IAN+CPETADEVFAL+PSLK  R    + +   L ELAK K +
Sbjct: 165 EPLMNHDLTESEICVIANVCPETADEVFALLPSLKSKRGINSQPVEEALSELAKFKQM 222

