BLASTX nr result
ID: Chrysanthemum21_contig00008289
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00008289
         (445 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|PLY84559.1| hypothetical protein LSAT_1X25461 [Lactuca sativa]    101   3e-22
ref|XP_023764965.1| protein TIC 40, chloroplastic [Lactuca sativa]   101   3e-22
ref|XP_022013419.1| protein TIC 40, chloroplastic-like [Helianth...  100   1e-21
gb|KVH98232.1| hypothetical protein Ccrd_023546 [Cynara carduncu...   94   1e-19
ref|XP_022728171.1| protein TIC 40, chloroplastic-like [Durio zi...   82   2e-15
ref|XP_022034055.1| protein TIC 40, chloroplastic-like [Helianth...   82   2e-15
emb|CDP16507.1| unnamed protein product [Coffea canephora]            81   4e-15
ref|XP_022876697.1| protein TIC 40, chloroplastic [Olea europaea...   81   5e-15
gb|PON78447.1| Protein TIC [Trema orientalis]                         79   9e-15
ref|XP_018851016.1| PREDICTED: protein TIC 40, chloroplastic-lik...   79   2e-14
gb|KZM98246.1| hypothetical protein DCAR_014392 [Daucus carota s...   79   2e-14
ref|XP_017246794.1| PREDICTED: protein TIC 40, chloroplastic [Da...   79   3e-14
gb|EOY03910.1| Hydroxyproline-rich glycoprotein family protein i...   79   3e-14
gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein i...   79   3e-14
ref|XP_017614300.1| PREDICTED: protein TIC 40, chloroplastic [Go...   79   4e-14
ref|XP_016725066.1| PREDICTED: protein TIC 40, chloroplastic [Go...   79   4e-14
ref|XP_012471943.1| PREDICTED: protein TIC 40, chloroplastic [Go...   79   4e-14
ref|XP_022771913.1| protein TIC 40, chloroplastic-like [Durio zi...   79   4e-14
ref|XP_007032983.2| PREDICTED: protein TIC 40, chloroplastic [Th...   79   4e-14
gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein i...   79   4e-14

>gb|PLY84559.1| hypothetical protein LSAT_1X25461 [Lactuca sativa]
          Length = 433

 Score = 101 bits (251), Expect = 3e-22
 Identities = 64/143 (44%), Positives = 66/143 (46%)
 Frame = -3

Query: 434 ERLSKDCFARIXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLFWVGVGVAFSAAFSWTASY 255
           ERL KDCFARI                            PLFWVGVGVAFSA FSW ASY
Sbjct: 73  ERLGKDCFARISSSSNQHTSSVGATPQIAVPPPSSQVGSPLFWVGVGVAFSAVFSWAASY 132

Query: 254 LKKAAMQQAFKTMMGQMDTQNNQFGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75
           LKK AMQQAFKTMMGQMDTQNNQF N
Sbjct: 133 LKKYAMQQAFKTMMGQMDTQNNQFAN-SGFSPASPFPFPTPPMSGSTASSPGSPFPFPTP 191

Query: 74  XXXXXXXXXGRTLTVDVPPTKTE 6
                     RT+TVD+PPTKTE
Sbjct: 192 SAASSGPASQRTVTVDMPPTKTE 214

>ref|XP_023764965.1| protein TIC 40, chloroplastic [Lactuca sativa]
          Length = 449

 Score = 101 bits (251), Expect = 3e-22
 Identities = 64/143 (44%), Positives = 66/143 (46%)
 Frame = -3

Query: 434 ERLSKDCFARIXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLFWVGVGVAFSAAFSWTASY 255
           ERL KDCFARI                            PLFWVGVGVAFSA FSW ASY
Sbjct: 73  ERLGKDCFARISSSSNQHTSSVGATPQIAVPPPSSQVGSPLFWVGVGVAFSAVFSWAASY 132

Query: 254 LKKAAMQQAFKTMMGQMDTQNNQFGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75
           LKK AMQQAFKTMMGQMDTQNNQF N
Sbjct: 133 LKKYAMQQAFKTMMGQMDTQNNQFAN-SGFSPASPFPFPTPPMSGSTASSPGSPFPFPTP 191

Query: 74  XXXXXXXXXGRTLTVDVPPTKTE 6
                     RT+TVD+PPTKTE
Sbjct: 192 SAASSGPASQRTVTVDMPPTKTE 214

>ref|XP_022013419.1| protein TIC 40, chloroplastic-like [Helianthus annuus]
 gb|OTG33625.1| putative protein TIC 40 protein [Helianthus annuus]
          Length = 452

 Score = 99.8 bits (247), Expect = 1e-21
 Identities = 62/143 (43%), Positives = 64/143 (44%)
 Frame = -3

Query: 434 ERLSKDCFARIXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLFWVGVGVAFSAAFSWTASY 255
           + + KDCFARI                            PLFWVGVGVAFSAAFSWTASY
Sbjct: 69  DSMGKDCFARISSSSDQHTSSVGATPQIAVPPPYSQVGSPLFWVGVGVAFSAAFSWTASY 128

Query: 254 LKKAAMQQAFKTMMGQMDTQNNQFGNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 75
           LKK AMQQAFKTMMGQMD QNNQ  N
Sbjct: 129 LKKYAMQQAFKTMMGQMDAQNNQSANAGFSPGSPFPFPPPASPGSPPGSPFPFPTPTSQS 188

Query: 74  XXXXXXXXXGRTLTVDVPPTKTE 6
                     RTLTVDVPPTK E
Sbjct: 189 SAATSAPASQRTLTVDVPPTKIE 211

>gb|KVH98232.1| hypothetical protein Ccrd_023546 [Cynara cardunculus var. scolymus]
          Length = 382

 Score = 93.6 bits (231), Expect = 1e-19
 Identities = 50/86 (58%), Positives = 52/86 (60%)
 Frame = -3

Query: 434 ERLSKDCFARIXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLFWVGVGVAFSAAFSWTASY 255
           ERL KDCFAR+                            PLFWVGVGVA SA FSW ASY
Sbjct: 73  ERLGKDCFARLGSSSNQHTSSVGASPQIAVPPPSSQVGSPLFWVGVGVALSAVFSWAASY 132

Query: 254 LKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LKK AMQQAFKTMMGQMD+QNNQF N
Sbjct: 133 LKKYAMQQAFKTMMGQMDSQNNQFTN 158

>ref|XP_022728171.1| protein TIC 40, chloroplastic-like [Durio zibethinus]
          Length = 426

 Score = 82.4 bits (202), Expect = 2e-15
 Identities = 39/46 (84%), Positives = 40/46 (86%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFWVGVGV  SA FSW AS LKK AMQQAFKTMMGQM+TQNNQFGN
Sbjct: 108 LFWVGVGVGLSALFSWVASSLKKYAMQQAFKTMMGQMNTQNNQFGN 153

>ref|XP_022034055.1| protein TIC 40, chloroplastic-like [Helianthus annuus]
 gb|OTG27625.1| putative heat shock chaperonin-binding protein [Helianthus annuus]
          Length = 437

 Score = 82.0 bits (201), Expect = 2e-15
 Identities = 52/108 (48%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN-----XXXXXXXXX 150
           LFWVGVGVAFSAAFSWTASYLK+ AMQQAFKTMMG   TQNNQF N
Sbjct: 99  LFWVGVGVAFSAAFSWTASYLKQKAMQQAFKTMMG---TQNNQFANAGFSPGSPFPFPPP 155

Query: 149 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGRTLTVDVPPTKTE 6
                                              RT+TVDVPPTKTE
Sbjct: 156 AAPGTTPGSPFPFPPSPAPATSAPRPAATSAPASQRTVTVDVPPTKTE 203

>emb|CDP16507.1| unnamed protein product [Coffea canephora]
          Length = 431

 Score = 81.3 bits (199), Expect = 4e-15
 Identities = 37/46 (80%), Positives = 41/46 (89%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV FSA FSW A+ LKK AMQQAFKTMMGQM+TQ+NQFGN
Sbjct: 103 LFWIGVGVGFSALFSWVATNLKKYAMQQAFKTMMGQMNTQSNQFGN 148

>ref|XP_022876697.1| protein TIC 40, chloroplastic [Olea europaea var. sylvestris]
          Length = 407

 Score = 80.9 bits (198), Expect = 5e-15
 Identities = 36/46 (78%), Positives = 40/46 (86%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA FSW A+YLKK AMQQAFKT+MGQM+TQNNQF N
Sbjct: 87  LFWIGVGVGLSALFSWVATYLKKYAMQQAFKTLMGQMNTQNNQFSN 132

>gb|PON78447.1| Protein TIC [Trema orientalis]
          Length = 298

 Score = 79.3 bits (194), Expect = 9e-15
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA FSW A+ LKK AMQQAFKT+MGQMDTQNNQF N
Sbjct: 104 LFWIGVGVGLSALFSWVAATLKKYAMQQAFKTLMGQMDTQNNQFSN 149

>ref|XP_018851016.1| PREDICTED: protein TIC 40, chloroplastic-like [Juglans regia]
          Length = 434

 Score = 79.3 bits (194), Expect = 2e-14
 Identities = 35/46 (76%), Positives = 40/46 (86%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA FSW A+YLKK AMQQAFKT+MGQM++QNNQF N
Sbjct: 110 LFWIGVGVGLSALFSWVATYLKKYAMQQAFKTLMGQMNSQNNQFNN 155

>gb|KZM98246.1| hypothetical protein DCAR_014392 [Daucus carota subsp. sativus]
          Length = 389

 Score = 79.0 bits (193), Expect = 2e-14
 Identities = 35/46 (76%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV FSA FSW A  +KK AMQQAFKT+MGQMD+QNNQF N
Sbjct: 105 LFWIGVGVGFSALFSWVAGRMKKYAMQQAFKTLMGQMDSQNNQFSN 150

>ref|XP_017246794.1| PREDICTED: protein TIC 40, chloroplastic [Daucus carota subsp. sativus]
          Length = 440

 Score = 79.0 bits (193), Expect = 3e-14
 Identities = 35/46 (76%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV FSA FSW A  +KK AMQQAFKT+MGQMD+QNNQF N
Sbjct: 116 LFWIGVGVGFSALFSWVAGRMKKYAMQQAFKTLMGQMDSQNNQFSN 161

>gb|EOY03910.1| Hydroxyproline-rich glycoprotein family protein isoform 3 [Theobroma cacao]
          Length = 368

 Score = 78.6 bits (192), Expect = 3e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 113 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSN 158

>gb|EOY03911.1| Hydroxyproline-rich glycoprotein family protein isoform 4, partial [Theobroma cacao]
          Length = 412

 Score = 78.6 bits (192), Expect = 3e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 113 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSN 158

>ref|XP_017614300.1| PREDICTED: protein TIC 40, chloroplastic [Gossypium arboreum]
          Length = 423

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 101 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFAN 146

>ref|XP_016725066.1| PREDICTED: protein TIC 40, chloroplastic [Gossypium hirsutum]
          Length = 428

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 106 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFAN 151

>ref|XP_012471943.1| PREDICTED: protein TIC 40, chloroplastic [Gossypium raimondii]
 gb|KJB08546.1| hypothetical protein B456_001G088300 [Gossypium raimondii]
          Length = 428

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 106 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFAN 151

>ref|XP_022771913.1| protein TIC 40, chloroplastic-like [Durio zibethinus]
          Length = 430

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQ +TQNNQFGN
Sbjct: 109 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQTNTQNNQFGN 154

>ref|XP_007032983.2| PREDICTED: protein TIC 40, chloroplastic [Theobroma cacao]
          Length = 433

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 113 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSN 158

>gb|EOY03909.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao]
          Length = 433

 Score = 78.6 bits (192), Expect = 4e-14
 Identities = 36/46 (78%), Positives = 39/46 (84%)
 Frame = -3

Query: 314 LFWVGVGVAFSAAFSWTASYLKKAAMQQAFKTMMGQMDTQNNQFGN 177
           LFW+GVGV  SA F+W AS LKK AMQQAFKTMMGQM+TQNNQF N
Sbjct: 113 LFWIGVGVGLSALFTWVASSLKKYAMQQAFKTMMGQMNTQNNQFSN 158