BLASTX nr result

ID: Chrysanthemum21_contig00023831 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00023831
         (476 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023751305.1| protein THYLAKOID ASSEMBLY 8, chloroplastic ...   149   9e-42
gb|KVH95768.1| hypothetical protein Ccrd_002146 [Cynara carduncu...   142   8e-39
ref|XP_022030155.1| pentatricopeptide repeat-containing protein ...   139   2e-37
gb|OVA00679.1| Pentatricopeptide repeat [Macleaya cordata]            113   1e-27
ref|XP_010491549.1| PREDICTED: pentatricopeptide repeat-containi...   108   4e-26
ref|XP_010270549.1| PREDICTED: pentatricopeptide repeat-containi...   106   3e-25
ref|XP_019084443.1| PREDICTED: pentatricopeptide repeat-containi...   104   1e-24
dbj|GAV81821.1| hypothetical protein CFOL_v3_25274 [Cephalotus f...   102   1e-23
ref|XP_007201145.2| pentatricopeptide repeat-containing protein ...   102   1e-23
ref|XP_020426449.1| pentatricopeptide repeat-containing protein ...   102   2e-23
ref|XP_021809868.1| pentatricopeptide repeat-containing protein ...   101   2e-23
ref|XP_020877308.1| pentatricopeptide repeat-containing protein ...   100   9e-23
ref|XP_010090721.1| protein THYLAKOID ASSEMBLY 8-like, chloropla...   100   1e-22
gb|KZM82342.1| hypothetical protein DCAR_029911 [Daucus carota s...    99   1e-22
ref|XP_007043105.2| PREDICTED: pentatricopeptide repeat-containi...    99   2e-22
ref|XP_008382506.1| PREDICTED: pentatricopeptide repeat-containi...    99   3e-22
ref|XP_017226437.1| PREDICTED: pentatricopeptide repeat-containi...    99   3e-22
emb|CBI37720.3| unnamed protein product, partial [Vitis vinifera]      97   4e-22
gb|AAM67277.1| unknown [Arabidopsis thaliana]                          99   4e-22
ref|NP_001330185.1| vacuolar sorting protein 9 domain protein [A...    99   4e-22

>ref|XP_023751305.1| protein THYLAKOID ASSEMBLY 8, chloroplastic [Lactuca sativa]
 gb|PLY94948.1| hypothetical protein LSAT_4X71240 [Lactuca sativa]
          Length = 253

 Score =  149 bits (375), Expect = 9e-42
 Identities = 75/98 (76%), Positives = 84/98 (85%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           IFLELK+EKGRLE K TEGFN  L+TLM YN+  L MDCFELMKEVGCEPDRSTFKLLV 
Sbjct: 157 IFLELKTEKGRLEGK-TEGFNLFLETLMSYNITRLAMDCFELMKEVGCEPDRSTFKLLVS 215

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMAMS 183
            LESKGE SLSE I++EA KYYGDS+E++DEQ+EMA S
Sbjct: 216 YLESKGERSLSESIRQEAWKYYGDSIEYVDEQDEMATS 253


>gb|KVH95768.1| hypothetical protein Ccrd_002146 [Cynara cardunculus var. scolymus]
          Length = 272

 Score =  142 bits (357), Expect = 8e-39
 Identities = 71/94 (75%), Positives = 79/94 (84%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           IF+ELKSEKGRLE K TEGFN LL+ LM YN+  L MDCFELMKE+ CEPDRSTFKLLV 
Sbjct: 160 IFVELKSEKGRLEGK-TEGFNALLENLMSYNMTGLAMDCFELMKEIDCEPDRSTFKLLVA 218

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEE 195
            LESKGE  LSE IK+EA KYYGDSL+F+DEQE+
Sbjct: 219 HLESKGETGLSEGIKQEARKYYGDSLDFLDEQED 252


>ref|XP_022030155.1| pentatricopeptide repeat-containing protein At3g46870-like
           [Helianthus annuus]
 gb|OTG33076.1| putative pentatricopeptide repeat protein [Helianthus annuus]
          Length = 301

 Score =  139 bits (350), Expect = 2e-37
 Identities = 68/92 (73%), Positives = 81/92 (88%)
 Frame = -1

Query: 467 ELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLE 288
           E+K+EKGRLEAK TEGFN LL++LM YNL+E  MDCFELMKEV CEPDRSTFKLLV QL+
Sbjct: 208 EMKAEKGRLEAK-TEGFNLLLESLMSYNLIEAAMDCFELMKEVDCEPDRSTFKLLVAQLD 266

Query: 287 SKGEISLSELIKEEAHKYYGDSLEFIDEQEEM 192
           SKGE  LSE I++EA +YYGDS+EF++EQEE+
Sbjct: 267 SKGETGLSESIRKEAFRYYGDSIEFVNEQEEV 298


>gb|OVA00679.1| Pentatricopeptide repeat [Macleaya cordata]
          Length = 265

 Score =  113 bits (282), Expect = 1e-27
 Identities = 54/98 (55%), Positives = 79/98 (80%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           +FL+LK E  RLEA  T+GFN LL+TLM + +  L+M+CF+LMK+VGCEPD STF++L+ 
Sbjct: 169 VFLDLKMEN-RLEAD-TDGFNALLRTLMEFGIYRLSMECFQLMKKVGCEPDESTFRILIN 226

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMAMS 183
            L+SKGE  LS + ++EA KY+G+ LEF++E+EE+++S
Sbjct: 227 GLDSKGETGLSAIFRQEAEKYFGEHLEFLEEKEEISLS 264


>ref|XP_010491549.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Camelina sativa]
          Length = 260

 Score =  108 bits (271), Expect = 4e-26
 Identities = 49/96 (51%), Positives = 73/96 (76%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           ++ E+KSEKG +     EGFN LL TL+ + L +L MDC+  M+ +G EPDRS+F++LV 
Sbjct: 159 LYYEMKSEKGLMA--DVEGFNTLLTTLLNHKLFDLVMDCYAFMQSIGYEPDRSSFRILVL 216

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
            LES GE+ LS +++++AH+YYGDSLEF++E EE++
Sbjct: 217 GLESNGEMGLSAIVRQDAHEYYGDSLEFVEEDEEVS 252


>ref|XP_010270549.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Nelumbo nucifera]
 ref|XP_010270550.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g62350-like [Nelumbo nucifera]
          Length = 262

 Score =  106 bits (265), Expect = 3e-25
 Identities = 53/97 (54%), Positives = 73/97 (75%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           +FL +K E   LEA   E FN LLQT M ++++EL M+CF LMK V CEPD+STFK+L+ 
Sbjct: 165 VFLYMKMESN-LEAD-LEAFNALLQTFMDFSIIELVMECFHLMKVVECEPDKSTFKILIN 222

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMAM 186
            LESKGE  LS ++++EA KYYG SLEF+ E+E++++
Sbjct: 223 GLESKGETGLSTIVRQEAKKYYGGSLEFLQEKEDISL 259


>ref|XP_019084443.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Camelina sativa]
          Length = 250

 Score =  104 bits (260), Expect = 1e-24
 Identities = 47/96 (48%), Positives = 72/96 (75%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           ++  +KSEKG +     EGFN LL TL+ + L +L MDC+  M+ +G EPDRS+F++LV 
Sbjct: 149 LYYAMKSEKGLMA--DIEGFNTLLTTLLNHKLFDLVMDCYAFMQSIGYEPDRSSFRILVL 206

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
            LES GE+ LS +++++AH+YYG+SLEF++E EE++
Sbjct: 207 GLESNGEMGLSAIVRQDAHEYYGESLEFVEEDEEVS 242


>dbj|GAV81821.1| hypothetical protein CFOL_v3_25274 [Cephalotus follicularis]
          Length = 248

 Score =  102 bits (254), Expect = 1e-23
 Identities = 45/81 (55%), Positives = 64/81 (79%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           E F +LL+TLM YNLVEL MDC++LMK + CEPDRS+F++L+  LES GE  LS +++ +
Sbjct: 160 EAFESLLRTLMSYNLVELVMDCYDLMKAIDCEPDRSSFRILINGLESMGETVLSAIVRHD 219

Query: 245 AHKYYGDSLEFIDEQEEMAMS 183
           A  YYG+SLEF++E++E  +S
Sbjct: 220 AQNYYGESLEFLNEEQETILS 240


>ref|XP_007201145.2| pentatricopeptide repeat-containing protein At1g62350 isoform X2
           [Prunus persica]
 gb|ONH92579.1| hypothetical protein PRUPE_8G182100 [Prunus persica]
          Length = 248

 Score =  102 bits (253), Expect = 1e-23
 Identities = 46/80 (57%), Positives = 64/80 (80%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           E FN LL TL+ + + +L M+CF LMKEVGCEPDRS+F++L+  LES GE  LS +++++
Sbjct: 168 EAFNALLTTLISFKIPKLAMECFYLMKEVGCEPDRSSFRILINGLESMGETGLSGILRQD 227

Query: 245 AHKYYGDSLEFIDEQEEMAM 186
           A KYYG+SLEF++E EEMA+
Sbjct: 228 AQKYYGESLEFLEENEEMAV 247


>ref|XP_020426449.1| pentatricopeptide repeat-containing protein At3g46870 isoform X1
           [Prunus persica]
 gb|ONH92580.1| hypothetical protein PRUPE_8G182100 [Prunus persica]
          Length = 262

 Score =  102 bits (253), Expect = 2e-23
 Identities = 46/80 (57%), Positives = 64/80 (80%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           E FN LL TL+ + + +L M+CF LMKEVGCEPDRS+F++L+  LES GE  LS +++++
Sbjct: 182 EAFNALLTTLISFKIPKLAMECFYLMKEVGCEPDRSSFRILINGLESMGETGLSGILRQD 241

Query: 245 AHKYYGDSLEFIDEQEEMAM 186
           A KYYG+SLEF++E EEMA+
Sbjct: 242 AQKYYGESLEFLEENEEMAV 261


>ref|XP_021809868.1| pentatricopeptide repeat-containing protein At1g62350-like [Prunus
           avium]
          Length = 248

 Score =  101 bits (252), Expect = 2e-23
 Identities = 45/80 (56%), Positives = 65/80 (81%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           E FN LL TL+ +N+ +L M+C+ LMKEVGCEPDRS+F++L+  LES GE  LS +++++
Sbjct: 168 EAFNALLTTLISFNIPKLAMECYYLMKEVGCEPDRSSFRILINGLESMGETGLSGILRQD 227

Query: 245 AHKYYGDSLEFIDEQEEMAM 186
           A +YYG+SLEF++E EEMA+
Sbjct: 228 AQQYYGESLEFLEENEEMAV 247


>ref|XP_020877308.1| pentatricopeptide repeat-containing protein At1g62350 [Arabidopsis
           lyrata subsp. lyrata]
          Length = 256

 Score =  100 bits (248), Expect = 9e-23
 Identities = 46/96 (47%), Positives = 72/96 (75%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           ++  +KSEKG +     E FN LL  L+ + L +L MDC+  M+ +G EPDR++F++LV 
Sbjct: 156 LYSAMKSEKGLMA--DIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPDRASFRILVL 213

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
            LES GE+SLS +++++AH+YYG+SLEFI+E+EE++
Sbjct: 214 GLESNGEMSLSAIVRKDAHEYYGESLEFIEEEEEIS 249


>ref|XP_010090721.1| protein THYLAKOID ASSEMBLY 8-like, chloroplastic [Morus notabilis]
 gb|EXB40440.1| hypothetical protein L484_013743 [Morus notabilis]
          Length = 242

 Score = 99.8 bits (247), Expect = 1e-22
 Identities = 48/94 (51%), Positives = 69/94 (73%)
 Frame = -1

Query: 470 LELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQL 291
           L LK E+  L  +  EGFN LL+ L+  N+ EL M+C+ LMK+VGC+PDRSTF++L+  L
Sbjct: 144 LYLKKEEANLRPE-IEGFNALLRALVSLNIAELAMECYCLMKQVGCDPDRSTFRILINGL 202

Query: 290 ESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
           ES GE   S +++ +A K+YG+SLEF+DE E++A
Sbjct: 203 ESMGETGASAIVRLDAQKFYGESLEFLDEIEDLA 236


>gb|KZM82342.1| hypothetical protein DCAR_029911 [Daucus carota subsp. sativus]
          Length = 214

 Score = 98.6 bits (244), Expect = 1e-22
 Identities = 47/81 (58%), Positives = 60/81 (74%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           EGFN +L+TL+ + ++ LTM+CF LMK  GCEPDRSTFK+L+  LESK E SLS  I+EE
Sbjct: 128 EGFNAILETLLSFGIIGLTMECFYLMKSKGCEPDRSTFKILISGLESKKETSLSVTIREE 187

Query: 245 AHKYYGDSLEFIDEQEEMAMS 183
           A K YG   E ++E E+ AMS
Sbjct: 188 AEKAYGSPFEILEENEDGAMS 208


>ref|XP_007043105.2| PREDICTED: pentatricopeptide repeat-containing protein At3g46870
           [Theobroma cacao]
          Length = 245

 Score = 99.0 bits (245), Expect = 2e-22
 Identities = 44/81 (54%), Positives = 61/81 (75%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           EGFN L   L+ + L +L MDC+ LMK +GCEPDRS+F++L+  LESKGE   S L++++
Sbjct: 164 EGFNALFNALINFKLTQLVMDCYGLMKAIGCEPDRSSFRILINGLESKGETGSSALLRQD 223

Query: 245 AHKYYGDSLEFIDEQEEMAMS 183
           A KYYG+SLEF+ E+EE+  S
Sbjct: 224 AQKYYGESLEFLKEEEEVTAS 244


>ref|XP_008382506.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like [Malus domestica]
          Length = 250

 Score = 98.6 bits (244), Expect = 3e-22
 Identities = 49/98 (50%), Positives = 71/98 (72%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           +FL LK E       + E FN L+ TL+ +NL +L ++C+ LMKEVGCEPDRS+F++LV 
Sbjct: 153 LFLCLKKETNL--QPEIEAFNALMTTLISFNLPKLAIECYYLMKEVGCEPDRSSFRILVN 210

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMAMS 183
            LES GE   S +++++A + YG+SLEF++E EEMA+S
Sbjct: 211 GLESMGETGSSGIVRQDAQQIYGESLEFLEENEEMAVS 248


>ref|XP_017226437.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46870-like isoform X2 [Daucus carota subsp. sativus]
          Length = 252

 Score = 98.6 bits (244), Expect = 3e-22
 Identities = 47/81 (58%), Positives = 60/81 (74%)
 Frame = -1

Query: 425 EGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLESKGEISLSELIKEE 246
           EGFN +L+TL+ + ++ LTM+CF LMK  GCEPDRSTFK+L+  LESK E SLS  I+EE
Sbjct: 166 EGFNAILETLLSFGIIGLTMECFYLMKSKGCEPDRSTFKILISGLESKKETSLSVTIREE 225

Query: 245 AHKYYGDSLEFIDEQEEMAMS 183
           A K YG   E ++E E+ AMS
Sbjct: 226 AEKAYGSPFEILEENEDGAMS 246


>emb|CBI37720.3| unnamed protein product, partial [Vitis vinifera]
          Length = 208

 Score = 97.4 bits (241), Expect = 4e-22
 Identities = 47/93 (50%), Positives = 72/93 (77%)
 Frame = -1

Query: 464 LKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVCQLES 285
           LK E G LE + TEGFN LL+TL+ +++    M+CF+LMK  GCEP++S+F++L+  LES
Sbjct: 113 LKKETG-LELE-TEGFNALLRTLIDFDMTGPAMECFQLMKTSGCEPNKSSFRILIKGLES 170

Query: 284 KGEISLSELIKEEAHKYYGDSLEFIDEQEEMAM 186
           KGE+ +S  +K +A KY+G+SLEF++E+E+M +
Sbjct: 171 KGELDISATVKLDAQKYFGESLEFLEEEEDMTV 203


>gb|AAM67277.1| unknown [Arabidopsis thaliana]
          Length = 262

 Score = 98.6 bits (244), Expect = 4e-22
 Identities = 45/96 (46%), Positives = 71/96 (73%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           ++  +KSEKG +   + E FN LL  L+ + L +L MDC+  M+ +G EPDR++F++LV 
Sbjct: 162 LYSAMKSEKGLMA--EIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPDRASFRVLVL 219

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
            LES GE+ LS +++++AH+YYG+SLEFI+E EE++
Sbjct: 220 GLESNGEMGLSAIVRQDAHEYYGESLEFIEEDEEIS 255


>ref|NP_001330185.1| vacuolar sorting protein 9 domain protein [Arabidopsis thaliana]
 gb|AAM97056.1| putative protein [Arabidopsis thaliana]
 gb|AAP13403.1| At5g09320 [Arabidopsis thaliana]
 gb|OAO96176.1| hypothetical protein AXX17_AT5G08840 [Arabidopsis thaliana]
 gb|ANM68427.1| vacuolar sorting protein 9 domain protein [Arabidopsis thaliana]
          Length = 262

 Score = 98.6 bits (244), Expect = 4e-22
 Identities = 45/96 (46%), Positives = 71/96 (73%)
 Frame = -1

Query: 476 IFLELKSEKGRLEAKKTEGFNNLLQTLMIYNLVELTMDCFELMKEVGCEPDRSTFKLLVC 297
           ++  +KSEKG +   + E FN LL  L+ + L +L MDC+  M+ +G EPDR++F++LV 
Sbjct: 162 LYSAMKSEKGLMA--EIEWFNTLLTILLNHKLFDLVMDCYAFMQSIGYEPDRASFRVLVL 219

Query: 296 QLESKGEISLSELIKEEAHKYYGDSLEFIDEQEEMA 189
            LES GE+ LS +++++AH+YYG+SLEFI+E EE++
Sbjct: 220 GLESNGEMGLSAIVRQDAHEYYGESLEFIEEDEEIS 255

