BLASTX nr result

ID: Akebia27_contig00002868 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00002868
         (1029 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15766.3| unnamed protein product [Vitis vinifera]              356   8e-96
ref|XP_002284683.1| PREDICTED: pre-rRNA-processing protein ESF2 ...   356   8e-96
emb|CAN83161.1| hypothetical protein VITISV_022556 [Vitis vinifera]   350   5e-94
ref|XP_006470204.1| PREDICTED: pre-rRNA-processing protein esf2-...   330   4e-88
ref|XP_006446656.1| hypothetical protein CICLE_v10015979mg [Citr...   329   1e-87
ref|XP_004172361.1| PREDICTED: pre-rRNA-processing protein esf2-...   326   1e-86
ref|XP_004142778.1| PREDICTED: pre-rRNA-processing protein ESF2-...   326   1e-86
ref|XP_006858558.1| hypothetical protein AMTR_s00071p00171330 [A...   325   2e-86
gb|EYU44840.1| hypothetical protein MIMGU_mgv1a012651mg [Mimulus...   324   4e-86
gb|EXB37441.1| Pre-rRNA-processing protein esf2 [Morus notabilis]     309   1e-81
ref|XP_006402979.1| hypothetical protein EUTSA_v10006186mg [Eutr...   304   4e-80
ref|XP_006366296.1| PREDICTED: pre-rRNA-processing protein ESF2-...   301   3e-79
ref|XP_004248182.1| PREDICTED: pre-rRNA-processing protein ESF2-...   301   3e-79
ref|XP_006291598.1| hypothetical protein CARUB_v10017753mg, part...   298   2e-78
ref|XP_002526047.1| Activator of basal transcription, putative [...   298   3e-78
ref|XP_006364111.1| PREDICTED: pre-rRNA-processing protein ESF2-...   296   7e-78
ref|XP_007031651.1| RNA-binding family protein isoform 1 [Theobr...   296   9e-78
ref|XP_007215875.1| hypothetical protein PRUPE_ppa010533mg [Prun...   295   2e-77
ref|NP_191210.2| RNA recognition motif-containing protein [Arabi...   293   1e-76
ref|XP_002878090.1| hypothetical protein ARALYDRAFT_486092 [Arab...   292   1e-76

>emb|CBI15766.3| unnamed protein product [Vitis vinifera]
          Length = 344

 Score =  356 bits (914), Expect = 8e-96
 Identities = 182/253 (71%), Positives = 202/253 (79%), Gaps = 4/253 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           E++   TSHSEEG +   E  D+    RK+K KKRL+KE   ADKRGVCYLSRIPPHMD 
Sbjct: 92  EQEIKRTSHSEEGES---EGNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPHMDH 148

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANML 360
           +KLR I+SQYGEIQRIYL PEDPA QVHRKRAGGFRGQ FSEGWVEFTKK VAKRVA ML
Sbjct: 149 VKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVAKML 208

Query: 361 NGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFY 540
           NGEQ+GGRK+SSFYYD+WNIKYLSKFKWDDLT EIAYKNAIREQKL LE+SAAKRERDFY
Sbjct: 209 NGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRERDFY 268

Query: 541 LKKVDQSHALTSIQERL----XXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSK 708
           L KVD+S AL+SI+ERL            +   P + Q PKVIRQFPQ  P+AD + +SK
Sbjct: 269 LSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAAESK 328

Query: 709 SRLSKDILAGVFG 747
            RLSKDILAGVFG
Sbjct: 329 PRLSKDILAGVFG 341


>ref|XP_002284683.1| PREDICTED: pre-rRNA-processing protein ESF2 [Vitis vinifera]
          Length = 257

 Score =  356 bits (914), Expect = 8e-96
 Identities = 182/253 (71%), Positives = 202/253 (79%), Gaps = 4/253 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           E++   TSHSEEG +   E  D+    RK+K KKRL+KE   ADKRGVCYLSRIPPHMD 
Sbjct: 5   EQEIKRTSHSEEGES---EGNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPHMDH 61

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANML 360
           +KLR I+SQYGEIQRIYL PEDPA QVHRKRAGGFRGQ FSEGWVEFTKK VAKRVA ML
Sbjct: 62  VKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVAKML 121

Query: 361 NGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFY 540
           NGEQ+GGRK+SSFYYD+WNIKYLSKFKWDDLT EIAYKNAIREQKL LE+SAAKRERDFY
Sbjct: 122 NGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRERDFY 181

Query: 541 LKKVDQSHALTSIQERL----XXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSK 708
           L KVD+S AL+SI+ERL            +   P + Q PKVIRQFPQ  P+AD + +SK
Sbjct: 182 LSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAAESK 241

Query: 709 SRLSKDILAGVFG 747
            RLSKDILAGVFG
Sbjct: 242 PRLSKDILAGVFG 254


>emb|CAN83161.1| hypothetical protein VITISV_022556 [Vitis vinifera]
          Length = 486

 Score =  350 bits (898), Expect = 5e-94
 Identities = 179/250 (71%), Positives = 199/250 (79%), Gaps = 4/250 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           E++   TSHSEEG +   E  D+    RK+K KKRL+KE   ADKRGVCYLSRIPPHMD 
Sbjct: 5   EQEIKRTSHSEEGES---EGNDEKGRIRKNKLKKRLLKEASKADKRGVCYLSRIPPHMDH 61

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANML 360
           +KLR I+SQYGEIQRIYL PEDPA QVHRKRAGGFRGQ FSEGWVEFTKK VAKRVA ML
Sbjct: 62  VKLRHILSQYGEIQRIYLAPEDPATQVHRKRAGGFRGQVFSEGWVEFTKKTVAKRVAKML 121

Query: 361 NGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFY 540
           NGEQ+GGRK+SSFYYD+WNIKYLSKFKWDDLT EIAYKNAIREQKL LE+SAAKRERDFY
Sbjct: 122 NGEQIGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKNAIREQKLALELSAAKRERDFY 181

Query: 541 LKKVDQSHALTSIQERL----XXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSK 708
           L KVD+S AL+SI+ERL            +   P + Q PKVIRQFPQ  P+AD + +SK
Sbjct: 182 LSKVDKSRALSSIEERLKKKQKVQQDAGTNTEAPANDQGPKVIRQFPQKPPLADKAAESK 241

Query: 709 SRLSKDILAG 738
            RLSKDILAG
Sbjct: 242 PRLSKDILAG 251


>ref|XP_006470204.1| PREDICTED: pre-rRNA-processing protein esf2-like [Citrus sinensis]
          Length = 315

 Score =  330 bits (847), Expect = 4e-88
 Identities = 168/248 (67%), Positives = 197/248 (79%), Gaps = 9/248 (3%)
 Frame = +1

Query: 31  EEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQY 210
           EEG     E  +KN++  KSK+K+RL++E   AD+RG+CYLSRIPPHMDP+KLRQI+SQY
Sbjct: 68  EEGKQ--EELNEKNSKHSKSKKKQRLLEEAAKADQRGICYLSRIPPHMDPVKLRQILSQY 125

Query: 211 GEIQRIYLTPEDPAAQV-----HRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQV 375
           GEIQRIYL PEDP+ +V     +RKR GGF+ Q FSEGWVEFTKK VAKRVANMLNGEQ+
Sbjct: 126 GEIQRIYLAPEDPSTRVLRKRENRKRDGGFQDQGFSEGWVEFTKKGVAKRVANMLNGEQI 185

Query: 376 GGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVD 555
           GG+K+SSFYYD+WNIKYLSKFKWDDLT EIAYKNA+REQ+L LE+SAAKRERDFYL KVD
Sbjct: 186 GGKKRSSFYYDLWNIKYLSKFKWDDLTAEIAYKNAVREQRLALEISAAKRERDFYLSKVD 245

Query: 556 QSHALTSIQERL----XXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSKSRLSK 723
           +S AL+SI+ERL            H   P  +Q  KVIR FPQ +PV D++  SKSRLSK
Sbjct: 246 KSRALSSIEERLKKKQKVQQESQTHPELPGSEQVTKVIRHFPQKQPVTDNAAPSKSRLSK 305

Query: 724 DILAGVFG 747
           DILAGVFG
Sbjct: 306 DILAGVFG 313


>ref|XP_006446656.1| hypothetical protein CICLE_v10015979mg [Citrus clementina]
           gi|567908687|ref|XP_006446657.1| hypothetical protein
           CICLE_v10015979mg [Citrus clementina]
           gi|567908689|ref|XP_006446658.1| hypothetical protein
           CICLE_v10015979mg [Citrus clementina]
           gi|557549267|gb|ESR59896.1| hypothetical protein
           CICLE_v10015979mg [Citrus clementina]
           gi|557549268|gb|ESR59897.1| hypothetical protein
           CICLE_v10015979mg [Citrus clementina]
           gi|557549269|gb|ESR59898.1| hypothetical protein
           CICLE_v10015979mg [Citrus clementina]
          Length = 317

 Score =  329 bits (843), Expect = 1e-87
 Identities = 167/248 (67%), Positives = 197/248 (79%), Gaps = 9/248 (3%)
 Frame = +1

Query: 31  EEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQY 210
           EEG     E  +KN++  KSK+K+RL++E   AD+RG+CYLSRIPPHMDP+KLRQI+SQY
Sbjct: 69  EEGKQ--EELNEKNSKHSKSKKKQRLLEEAAKADQRGICYLSRIPPHMDPVKLRQILSQY 126

Query: 211 GEIQRIYLTPEDPAAQV-----HRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQV 375
           GEIQRIYL PEDP+ +V     +RKR GGF+ Q FSEGWVEFTKK VAKRVANMLNGEQ+
Sbjct: 127 GEIQRIYLAPEDPSTRVLRKRENRKRDGGFQDQGFSEGWVEFTKKGVAKRVANMLNGEQI 186

Query: 376 GGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVD 555
           GG+K+SSFYYD+WNIKYLSKFKWDDLT EIAYKNA+REQ+L LE+SAAKRERDFYL KVD
Sbjct: 187 GGKKRSSFYYDLWNIKYLSKFKWDDLTAEIAYKNAVREQRLALEISAAKRERDFYLSKVD 246

Query: 556 QSHALTSIQERL----XXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSKSRLSK 723
           +S AL+SI+ERL            H   P  +Q  KVIR FPQ +PVA+++  SK RLSK
Sbjct: 247 KSRALSSIEERLKKKQKVQQESQTHPELPGSEQVTKVIRHFPQKKPVAENAAPSKPRLSK 306

Query: 724 DILAGVFG 747
           DILAGVFG
Sbjct: 307 DILAGVFG 314


>ref|XP_004172361.1| PREDICTED: pre-rRNA-processing protein esf2-like, partial [Cucumis
           sativus]
          Length = 326

 Score =  326 bits (835), Expect = 1e-86
 Identities = 169/255 (66%), Positives = 197/255 (77%), Gaps = 5/255 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           ER  S    + +G  + +   D     RK KRKK+L+KE  +AD RG+CYLSR+PPHMDP
Sbjct: 75  ERTDSILRENSDGKILES---DNGKNQRKIKRKKQLLKEAANADMRGICYLSRVPPHMDP 131

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANML 360
           LKLRQI+SQ+GEIQRIYL PED A+QV RKRAGGFRGQ FSEGWVEFT K VAKRVANML
Sbjct: 132 LKLRQILSQHGEIQRIYLAPEDAASQVQRKRAGGFRGQFFSEGWVEFTDKRVAKRVANML 191

Query: 361 NGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFY 540
           NGE +GGRK+SSFYYD+WNIKYLSKFKWDDLT E AYK+AIREQKL LE+SAAKRERDFY
Sbjct: 192 NGEPIGGRKRSSFYYDLWNIKYLSKFKWDDLTEETAYKHAIREQKLALEISAAKRERDFY 251

Query: 541 LKKVDQSHALTSIQERLXXXXXXXXHDSD-----PVDQQEPKVIRQFPQTRPVADSSFQS 705
           L KVD+S AL SI+ERL         DS+        Q+ PK+IR FPQT+PVAD + Q+
Sbjct: 252 LAKVDKSRALNSIEERL-KKKQKMREDSEMNSTLDDSQKLPKLIRSFPQTQPVADFAVQN 310

Query: 706 KSRLSKDILAGVFGA 750
           K RLS ++LAGVFG+
Sbjct: 311 KPRLSTNVLAGVFGS 325


>ref|XP_004142778.1| PREDICTED: pre-rRNA-processing protein ESF2-like [Cucumis sativus]
          Length = 380

 Score =  326 bits (835), Expect = 1e-86
 Identities = 169/255 (66%), Positives = 197/255 (77%), Gaps = 5/255 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           ER  S    + +G  + +   D     RK KRKK+L+KE  +AD RG+CYLSR+PPHMDP
Sbjct: 129 ERTDSILRENSDGKILES---DNGKNQRKIKRKKQLLKEAANADMRGICYLSRVPPHMDP 185

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANML 360
           LKLRQI+SQ+GEIQRIYL PED A+QV RKRAGGFRGQ FSEGWVEFT K VAKRVANML
Sbjct: 186 LKLRQILSQHGEIQRIYLAPEDAASQVQRKRAGGFRGQFFSEGWVEFTDKRVAKRVANML 245

Query: 361 NGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFY 540
           NGE +GGRK+SSFYYD+WNIKYLSKFKWDDLT E AYK+AIREQKL LE+SAAKRERDFY
Sbjct: 246 NGEPIGGRKRSSFYYDLWNIKYLSKFKWDDLTEETAYKHAIREQKLALEISAAKRERDFY 305

Query: 541 LKKVDQSHALTSIQERLXXXXXXXXHDSD-----PVDQQEPKVIRQFPQTRPVADSSFQS 705
           L KVD+S AL SI+ERL         DS+        Q+ PK+IR FPQT+PVAD + Q+
Sbjct: 306 LAKVDKSRALNSIEERL-KKKQKMREDSEMNSTLDDSQKLPKLIRSFPQTQPVADFAVQN 364

Query: 706 KSRLSKDILAGVFGA 750
           K RLS ++LAGVFG+
Sbjct: 365 KPRLSTNVLAGVFGS 379


>ref|XP_006858558.1| hypothetical protein AMTR_s00071p00171330 [Amborella trichopoda]
           gi|548862667|gb|ERN20025.1| hypothetical protein
           AMTR_s00071p00171330 [Amborella trichopoda]
          Length = 264

 Score =  325 bits (832), Expect = 2e-86
 Identities = 159/244 (65%), Positives = 193/244 (79%), Gaps = 6/244 (2%)
 Frame = +1

Query: 28  SEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQ 207
           S + + +NT  +D   +  K KRKKRL+KE  ++ KRGVCYLSR+PPHMD +KLR I+SQ
Sbjct: 14  STDASELNTNVEDDEEKIAKEKRKKRLLKEKEASGKRGVCYLSRVPPHMDHVKLRHILSQ 73

Query: 208 YGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRK 387
           YGEI RIYL PEDP A+VHRKR GG RG E+SEGWVEF KK+VAKRVANMLNGEQ+GG++
Sbjct: 74  YGEILRIYLAPEDPTAKVHRKRIGGNRGHEYSEGWVEFAKKSVAKRVANMLNGEQIGGKR 133

Query: 388 KSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHA 567
           +SSFYYD+WNIKYL KFKWD+LT EIAYKNA+REQKL LE+SAAKRERDFYL KVDQS A
Sbjct: 134 RSSFYYDLWNIKYLRKFKWDNLTEEIAYKNAVREQKLGLEISAAKRERDFYLSKVDQSRA 193

Query: 568 LTSIQERLXXXXXXXXHDSD------PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDI 729
           L SI+ER          DSD        +Q+E KV+R+FPQTRP+AD++ + + RLSK++
Sbjct: 194 LASIREREKKKKKVAEQDSDCKGGGEVENQEEVKVVRRFPQTRPIADNTNRKEPRLSKEV 253

Query: 730 LAGV 741
           LAGV
Sbjct: 254 LAGV 257


>gb|EYU44840.1| hypothetical protein MIMGU_mgv1a012651mg [Mimulus guttatus]
          Length = 244

 Score =  324 bits (830), Expect = 4e-86
 Identities = 157/224 (70%), Positives = 186/224 (83%)
 Frame = +1

Query: 76  ETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTPEDPAA 255
           E RK KRK+ L+KE   A++RGVCYLSR+PPHMDPLKLRQI+SQYG++QR+YLTPEDPAA
Sbjct: 28  EKRKEKRKRLLLKEAEKAERRGVCYLSRVPPHMDPLKLRQILSQYGDLQRLYLTPEDPAA 87

Query: 256 QVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNIKYLSK 435
           QV RK++GGFRGQEFSEGWVEFT K VAKRVANMLNGEQ+GG+K+SSFYYD+WNIKYLSK
Sbjct: 88  QVRRKKSGGFRGQEFSEGWVEFTDKKVAKRVANMLNGEQIGGKKRSSFYYDLWNIKYLSK 147

Query: 436 FKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXXXXXXX 615
           FKWDDLT EIA KNA REQKL +E+SAAKRERDFYL KVDQS  L+ I ERL        
Sbjct: 148 FKWDDLTEEIAMKNATREQKLAMELSAAKRERDFYLSKVDQSKTLSKIGERLKKKKKI-- 205

Query: 616 HDSDPVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                  +  PKV+RQFPQ +PV++ + +++++LSKDILAGVFG
Sbjct: 206 -------EMVPKVVRQFPQKKPVSNDNGENRAQLSKDILAGVFG 242


>gb|EXB37441.1| Pre-rRNA-processing protein esf2 [Morus notabilis]
          Length = 483

 Score =  309 bits (791), Expect = 1e-81
 Identities = 154/238 (64%), Positives = 185/238 (77%), Gaps = 5/238 (2%)
 Frame = +1

Query: 43  NVNTEAKDKNAETRK-SKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEI 219
           ++  EA D+  + +K +KRK+RL+KE   A+KRG+CYLSR+PPHMDP KLRQ++SQYGEI
Sbjct: 97  SLEEEADDEKTDMKKINKRKRRLLKEAAMANKRGICYLSRVPPHMDPFKLRQLLSQYGEI 156

Query: 220 QRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSF 399
           QRIYL PE  A +  RKRAG F+ Q FSEGW EF+ K +AKR ANMLNGEQ+GGRK+SSF
Sbjct: 157 QRIYLVPEKSAGKAPRKRAGRFQEQGFSEGWAEFSDKRIAKRAANMLNGEQIGGRKRSSF 216

Query: 400 YYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSI 579
           YYD+WNIKYLSKFKWDDLT EIAY NA REQKL LE+SAAKRERDFYL KVDQ+ AL+SI
Sbjct: 217 YYDLWNIKYLSKFKWDDLTEEIAYNNAAREQKLALEISAAKRERDFYLSKVDQARALSSI 276

Query: 580 QERLXXXXXXXXHDSD----PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGV 741
           ++RL                PV QQ PKV+R+F Q +PVAD++  SK RLSKDILAG+
Sbjct: 277 EKRLKKKQKLQEEAETNADLPVSQQSPKVVRKFQQKQPVADNTTVSKRRLSKDILAGL 334


>ref|XP_006402979.1| hypothetical protein EUTSA_v10006186mg [Eutrema salsugineum]
           gi|567184429|ref|XP_006402980.1| hypothetical protein
           EUTSA_v10006186mg [Eutrema salsugineum]
           gi|557104078|gb|ESQ44432.1| hypothetical protein
           EUTSA_v10006186mg [Eutrema salsugineum]
           gi|557104079|gb|ESQ44433.1| hypothetical protein
           EUTSA_v10006186mg [Eutrema salsugineum]
          Length = 257

 Score =  304 bits (778), Expect = 4e-80
 Identities = 153/248 (61%), Positives = 182/248 (73%), Gaps = 3/248 (1%)
 Frame = +1

Query: 13  SGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLR 192
           +G S  EE        K + A+ +K K K+RL+KE   AD RGVCYLSRIPPHMD ++LR
Sbjct: 11  NGISEEEESKE---RLKSQKADRKKKKLKERLLKEAAKADNRGVCYLSRIPPHMDHVRLR 67

Query: 193 QIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQ 372
           QI+SQ+GEI RIYL PEDP AQVHRKRAGGFRGQ FSEGWVEF KK VAKRVA MLNGEQ
Sbjct: 68  QILSQFGEIGRIYLAPEDPEAQVHRKRAGGFRGQLFSEGWVEFGKKRVAKRVAEMLNGEQ 127

Query: 373 VGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKV 552
           +GG+KKS+ YYDIWNIKYL+KFKWDDLT EIAYK+AIREQKLN+ +SAAKRE+DFYL KV
Sbjct: 128 IGGKKKSAIYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREKDFYLSKV 187

Query: 553 DQSHALTSIQERLXXXXXXXXHDSDPVDQQ---EPKVIRQFPQTRPVADSSFQSKSRLSK 723
           ++S A+T I ER+              +      P+ IRQF Q + + + + QSK  LS 
Sbjct: 188 EKSRAMTEIDERMKKKRKIQEESGSNAEAAPVFPPRAIRQFRQKKSIKNETSQSKPGLST 247

Query: 724 DILAGVFG 747
           D+LA VFG
Sbjct: 248 DVLASVFG 255


>ref|XP_006366296.1| PREDICTED: pre-rRNA-processing protein ESF2-like isoform X1
           [Solanum tuberosum] gi|565401624|ref|XP_006366297.1|
           PREDICTED: pre-rRNA-processing protein ESF2-like isoform
           X2 [Solanum tuberosum]
          Length = 247

 Score =  301 bits (771), Expect = 3e-79
 Identities = 153/233 (65%), Positives = 187/233 (80%), Gaps = 4/233 (1%)
 Frame = +1

Query: 61  KDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTP 240
           ++   +T+  K KK   K+ V A+KRGVC++SR+PP MD +KLRQ++SQ+GEIQRIYL P
Sbjct: 16  REDETKTQVGKVKK---KKKVKAEKRGVCHVSRVPPRMDHVKLRQVLSQFGEIQRIYLVP 72

Query: 241 EDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNI 420
           E  AAQ++RKRAGGFRGQ FSEGWVEFTKK+VAKRVANMLNG+Q+GGRK+SSFYYDIWN+
Sbjct: 73  EAAAAQMNRKRAGGFRGQAFSEGWVEFTKKSVAKRVANMLNGQQMGGRKRSSFYYDIWNV 132

Query: 421 KYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXX 600
           KYLSK KWDD+T EIA ++A+REQKL LE+SAAKRERDFYL +VD+S AL+SI+ER+   
Sbjct: 133 KYLSKIKWDDVTDEIAQRHAVREQKLALELSAAKRERDFYLTQVDKSRALSSIEERMKKK 192

Query: 601 XXXXXHD---SD-PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                     SD P DQ  PKVIRQFPQ +PVAD + + K  LSKD+LAGVFG
Sbjct: 193 QKVQQESGVISDFPSDQFAPKVIRQFPQKKPVADQAGKLKPSLSKDVLAGVFG 245


>ref|XP_004248182.1| PREDICTED: pre-rRNA-processing protein ESF2-like [Solanum
           lycopersicum]
          Length = 247

 Score =  301 bits (771), Expect = 3e-79
 Identities = 153/233 (65%), Positives = 187/233 (80%), Gaps = 4/233 (1%)
 Frame = +1

Query: 61  KDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTP 240
           ++   +T+  K KK   K+ V A+KRGVC++SR+PP MD +KLRQ++SQ+GEIQRIYL P
Sbjct: 16  REDETKTQVGKVKK---KKKVKAEKRGVCHVSRVPPRMDHVKLRQVLSQFGEIQRIYLVP 72

Query: 241 EDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNI 420
           E  AAQ++RKRAGGFRGQ FSEGWVEFTKK+VAKRVANMLNG+Q+GGRK+SSFYYDIWN+
Sbjct: 73  EAAAAQMNRKRAGGFRGQAFSEGWVEFTKKSVAKRVANMLNGQQMGGRKRSSFYYDIWNV 132

Query: 421 KYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXX 600
           KYLSK KWDD+T EIA ++A+REQKL LE+SAAKRERDFYL +VD+S AL+SI+ER+   
Sbjct: 133 KYLSKIKWDDVTDEIAQRHAVREQKLALELSAAKRERDFYLTQVDKSRALSSIEERMKKK 192

Query: 601 XXXXXHD---SD-PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                     SD P DQ  PKVIRQFPQ +PVAD + + K  LSKD+LAGVFG
Sbjct: 193 QKVQQESGVISDFPSDQFAPKVIRQFPQKKPVADQAGKLKPSLSKDVLAGVFG 245


>ref|XP_006291598.1| hypothetical protein CARUB_v10017753mg, partial [Capsella rubella]
           gi|482560305|gb|EOA24496.1| hypothetical protein
           CARUB_v10017753mg, partial [Capsella rubella]
          Length = 292

 Score =  298 bits (763), Expect = 2e-78
 Identities = 155/257 (60%), Positives = 191/257 (74%), Gaps = 8/257 (3%)
 Frame = +1

Query: 1   ERDQS---GTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPH 171
           + DQS    T  SEE +    + K++ A+ +K K K++L+KE   AD RGVCYLSRIPPH
Sbjct: 38  QSDQSHELATGMSEEDSK--EKMKNQKADRKKKKLKEKLLKEASKADNRGVCYLSRIPPH 95

Query: 172 MDPLKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVA 351
           MD ++LRQI+ Q+GE+ RIYL PEDP AQVHRK+AGGFRGQ FSEGWVEF KK VAKRVA
Sbjct: 96  MDHVRLRQILCQFGELGRIYLAPEDPEAQVHRKKAGGFRGQLFSEGWVEFAKKRVAKRVA 155

Query: 352 NMLNGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRER 531
           +MLNGEQ+GG+KKSS YYDIWNIKYL+KFKWDDLT EIAYK+AIREQKLN+ +SAAKRE+
Sbjct: 156 DMLNGEQIGGKKKSSIYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREK 215

Query: 532 DFYLKKVDQSHALTSIQERLXXXXXXXXH-----DSDPVDQQEPKVIRQFPQTRPVADSS 696
           DFYL KV++S A+T I  R+              ++ PV  Q  +VIRQF Q + + + +
Sbjct: 216 DFYLSKVEKSRAMTEIDARMEKKRKIQEESGSNAEAGPVFPQ--RVIRQFRQKKSIKNET 273

Query: 697 FQSKSRLSKDILAGVFG 747
            QSK  LS D+LA VFG
Sbjct: 274 SQSKPGLSTDVLASVFG 290


>ref|XP_002526047.1| Activator of basal transcription, putative [Ricinus communis]
           gi|223534628|gb|EEF36324.1| Activator of basal
           transcription, putative [Ricinus communis]
          Length = 406

 Score =  298 bits (762), Expect = 3e-78
 Identities = 153/253 (60%), Positives = 187/253 (73%), Gaps = 4/253 (1%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           +++ SG +  EE N   T   ++  +  K K+KKRL+KE   AD+RGVCYLSRIPPHMD 
Sbjct: 155 KQEASGINLVEEENQTLT---NEMVDRLKKKKKKRLLKEAAQADRRGVCYLSRIPPHMDH 211

Query: 181 LKLRQIISQYGEIQRIYLTPE----DPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRV 348
           +KLR I+ +YGEIQRIYL PE        +V R++A G     FSEGWVEFT K++AKRV
Sbjct: 212 VKLRHILCRYGEIQRIYLAPEVNKHRVQYRVQRRKADGLEDLGFSEGWVEFTNKSIAKRV 271

Query: 349 ANMLNGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRE 528
           ANMLNGEQ+GGRK+S FYYD+WNIKYLSKFKWDDLT EIAYK+AIREQKL LE+SAAKRE
Sbjct: 272 ANMLNGEQMGGRKRSQFYYDLWNIKYLSKFKWDDLTEEIAYKSAIREQKLALELSAAKRE 331

Query: 529 RDFYLKKVDQSHALTSIQERLXXXXXXXXHDSDPVDQQEPKVIRQFPQTRPVADSSFQSK 708
           RDFYL KV++S AL+SI+ERL                  PKVIRQF QT+P+AD + +++
Sbjct: 332 RDFYLSKVEKSRALSSIEERLKKKQKVQLETGGEFSVSIPKVIRQFAQTKPIADRAEENR 391

Query: 709 SRLSKDILAGVFG 747
            RLSKD+LAGVFG
Sbjct: 392 PRLSKDVLAGVFG 404


>ref|XP_006364111.1| PREDICTED: pre-rRNA-processing protein ESF2-like, partial [Solanum
           tuberosum]
          Length = 279

 Score =  296 bits (759), Expect = 7e-78
 Identities = 151/233 (64%), Positives = 185/233 (79%), Gaps = 4/233 (1%)
 Frame = +1

Query: 61  KDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTP 240
           ++   +T+  K KK   K+ V A+K GVC++SR+PP MD +KLRQ++SQ+GEIQRIYL P
Sbjct: 48  REDETKTQVGKVKK---KKKVKAEKHGVCHVSRVPPRMDHVKLRQVLSQFGEIQRIYLVP 104

Query: 241 EDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNI 420
           E  AAQ++RKRA GFRGQ FSEGWVEFTKK+VAKRVANMLNG+Q+GGRK+SSFYYDIWN+
Sbjct: 105 EAAAAQMNRKRASGFRGQAFSEGWVEFTKKSVAKRVANMLNGQQMGGRKRSSFYYDIWNV 164

Query: 421 KYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXX 600
           KYLSK KWDD+T EIA ++A+REQKL LE+SAAKRERDFYL +VD+S AL+SI+ER+   
Sbjct: 165 KYLSKIKWDDVTDEIAQRHAVREQKLALELSAAKRERDFYLTQVDKSRALSSIEERMKKK 224

Query: 601 XXXXXHD---SD-PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                     SD P DQ  PKVIRQFPQ +PVAD + + K  LSKD+LAGVFG
Sbjct: 225 QKVQQKSGVVSDFPSDQFAPKVIRQFPQKKPVADQAGKLKPSLSKDVLAGVFG 277


>ref|XP_007031651.1| RNA-binding family protein isoform 1 [Theobroma cacao]
           gi|508710680|gb|EOY02577.1| RNA-binding family protein
           isoform 1 [Theobroma cacao]
          Length = 305

 Score =  296 bits (758), Expect = 9e-78
 Identities = 158/254 (62%), Positives = 182/254 (71%), Gaps = 6/254 (2%)
 Frame = +1

Query: 1   ERDQSGTSHSEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDP 180
           E D       E+GN V  +        +K K+K++L+KE   AD RGVCYLSRIPPHMD 
Sbjct: 59  EADDDEADGLEDGNGVPNK--------KKKKKKEKLLKEAAEADNRGVCYLSRIPPHMDH 110

Query: 181 LKLRQIISQYGEIQRIYLTPEDPAAQVHRKRA--GGFRGQEFSEGWVEFTKKNVAKRVAN 354
           +KLRQ++SQYGEI RIYLTP     QV  KR      + QEFSEGWVEF +K +AKRVAN
Sbjct: 111 VKLRQLLSQYGEILRIYLTPSGHLPQVKGKRTRPSKVQEQEFSEGWVEFARKGIAKRVAN 170

Query: 355 MLNGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERD 534
           MLNGEQVGGRK+SSFYYD+WNIKYLSKFKWDDLT EIAYK+AIREQKL LE+SAA+RERD
Sbjct: 171 MLNGEQVGGRKRSSFYYDLWNIKYLSKFKWDDLTEEIAYKSAIREQKLALEISAARRERD 230

Query: 535 FYLKKVDQSHALTSIQERLXXXXXXXXHDSD----PVDQQEPKVIRQFPQTRPVADSSFQ 702
           FYL KVDQSHAL SI+ER+                PV Q+  KVIRQFPQ +PV     Q
Sbjct: 231 FYLSKVDQSHALNSIEERMKKKQKVQQESETNSELPVSQK--KVIRQFPQKKPVTVDKSQ 288

Query: 703 SKSRLSKDILAGVF 744
           SK +LSKDILAG+F
Sbjct: 289 SKPQLSKDILAGIF 302


>ref|XP_007215875.1| hypothetical protein PRUPE_ppa010533mg [Prunus persica]
           gi|462412025|gb|EMJ17074.1| hypothetical protein
           PRUPE_ppa010533mg [Prunus persica]
          Length = 246

 Score =  295 bits (756), Expect = 2e-77
 Identities = 155/237 (65%), Positives = 181/237 (76%), Gaps = 14/237 (5%)
 Frame = +1

Query: 79  TRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTPEDPAAQ 258
           TRKSK+KK L+KE    +KRGVCYL RIPP MDP  LRQ++SQ+GEIQR+YLTP+DP+AQ
Sbjct: 4   TRKSKKKK-LVKEGGKDEKRGVCYLGRIPPRMDPSTLRQMLSQFGEIQRVYLTPQDPSAQ 62

Query: 259 VHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNIKYLSKF 438
           VH  RAG F+ Q FSEGWVEF+ K VAKRVANMLNGEQ+GGRK+SSFYYD+WNIKYLSKF
Sbjct: 63  VHNIRAGKFQRQNFSEGWVEFSDKRVAKRVANMLNGEQIGGRKRSSFYYDLWNIKYLSKF 122

Query: 439 KWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXXXXXXXH 618
           KWDDLT EIAYK A REQKL LE+SAAKRERDFYL KVD+S AL+ I+ERL         
Sbjct: 123 KWDDLTEEIAYKKATREQKLALEISAAKRERDFYLSKVDKSRALSCIEERLKKKQKVEED 182

Query: 619 -------DSD-------PVDQQEPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                  + D       PV Q + +VIR+F Q  PVAD++ + + RLSKDILAGVFG
Sbjct: 183 PGLKQKAEEDPENKPDLPVSQPKREVIRRFRQKTPVADNAAEIRPRLSKDILAGVFG 239


>ref|NP_191210.2| RNA recognition motif-containing protein [Arabidopsis thaliana]
           gi|79315331|ref|NP_001030873.1| RNA recognition
           motif-containing protein [Arabidopsis thaliana]
           gi|38564304|gb|AAR23731.1| At3g56510 [Arabidopsis
           thaliana] gi|45592922|gb|AAS68115.1| At3g56510
           [Arabidopsis thaliana] gi|110736352|dbj|BAF00145.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332646008|gb|AEE79529.1| RNA recognition
           motif-containing protein [Arabidopsis thaliana]
           gi|332646009|gb|AEE79530.1| RNA recognition
           motif-containing protein [Arabidopsis thaliana]
          Length = 257

 Score =  293 bits (749), Expect = 1e-76
 Identities = 151/255 (59%), Positives = 183/255 (71%), Gaps = 9/255 (3%)
 Frame = +1

Query: 10  QSGTSH------SEEGNNVNTEAKDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPH 171
           QS  SH      SEE  +  T  K + A+ +K K K++L+KE   AD RGVCYLSRIPPH
Sbjct: 2   QSEESHELTDGISEEKESKET-MKSQKADRKKKKLKEKLLKEASKADNRGVCYLSRIPPH 60

Query: 172 MDPLKLRQIISQYGEIQRIYLTPEDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVA 351
           MD ++LR I++QYGE+ RIYL PED  AQVHRKRAGGFRGQ FSEGWVEF KK+VAKRVA
Sbjct: 61  MDHVRLRHILAQYGELGRIYLAPEDSEAQVHRKRAGGFRGQRFSEGWVEFAKKSVAKRVA 120

Query: 352 NMLNGEQVGGRKKSSFYYDIWNIKYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRER 531
           +MLNGEQ+GG+KKSS YYDIWNIKYL+KFKWDDLT EIAYK+AIREQKLN+ +SAAKRE+
Sbjct: 121 DMLNGEQIGGKKKSSVYYDIWNIKYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREK 180

Query: 532 DFYLKKVDQSHALTSIQERLXXXXXXXXHDSDPVDQQ---EPKVIRQFPQTRPVADSSFQ 702
           DFYL K+++S A+T I  R+              +      P+ I QF Q + + + + Q
Sbjct: 181 DFYLSKIEKSRAMTEIDARMEKKRKIQEESGSNAEAAPVFPPRAIFQFRQKKSIENETSQ 240

Query: 703 SKSRLSKDILAGVFG 747
           SK  LS D LA VFG
Sbjct: 241 SKPGLSTDFLASVFG 255


>ref|XP_002878090.1| hypothetical protein ARALYDRAFT_486092 [Arabidopsis lyrata subsp.
           lyrata] gi|297323928|gb|EFH54349.1| hypothetical protein
           ARALYDRAFT_486092 [Arabidopsis lyrata subsp. lyrata]
          Length = 256

 Score =  292 bits (748), Expect = 1e-76
 Identities = 143/232 (61%), Positives = 175/232 (75%), Gaps = 3/232 (1%)
 Frame = +1

Query: 61  KDKNAETRKSKRKKRLMKETVSADKRGVCYLSRIPPHMDPLKLRQIISQYGEIQRIYLTP 240
           K + A+ +K K K++L+KE   AD RGVCYLSRIPPHMD ++LR I++Q+GE+ RIYL P
Sbjct: 23  KSQKADRKKKKLKEKLLKEASKADNRGVCYLSRIPPHMDHVRLRHILAQFGELGRIYLAP 82

Query: 241 EDPAAQVHRKRAGGFRGQEFSEGWVEFTKKNVAKRVANMLNGEQVGGRKKSSFYYDIWNI 420
           ED  AQVHRKRAGGFRGQ FSEGWVEF KK+VAKRVA+MLNGEQ+GG+KKSS YYDIWNI
Sbjct: 83  EDSEAQVHRKRAGGFRGQRFSEGWVEFAKKSVAKRVADMLNGEQIGGKKKSSVYYDIWNI 142

Query: 421 KYLSKFKWDDLTGEIAYKNAIREQKLNLEMSAAKRERDFYLKKVDQSHALTSIQERLXXX 600
           KYL+KFKWDDLT EIAYK+AIREQKLN+ +SAAKRE+DFYL K+++S A+T I  R+   
Sbjct: 143 KYLTKFKWDDLTEEIAYKSAIREQKLNMVLSAAKREKDFYLSKIEKSRAMTEIDARMKKK 202

Query: 601 XXXXXHDSDPVDQQ---EPKVIRQFPQTRPVADSSFQSKSRLSKDILAGVFG 747
                      +      P+VIR F Q + + + + QSK  LS D LA VFG
Sbjct: 203 RKIQEESGSNAEAAPVFPPRVIRHFRQKKSIENETSQSKPGLSTDFLASVFG 254

