BLASTX nr result
ID: Chrysanthemum21_contig00047031
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum21_contig00047031 (422 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_022036646.1| keratin, type I cytoskeletal 9-like [Heliant... 137 4e-38 ref|XP_023765584.1| uncharacterized protein LOC111914074 [Lactuc... 115 1e-29 gb|KVH90204.1| hypothetical protein Ccrd_007763 [Cynara carduncu... 108 2e-26 gb|KVH96436.1| hypothetical protein Ccrd_001478 [Cynara carduncu... 104 2e-25 gb|PLY84071.1| hypothetical protein LSAT_6X114241 [Lactuca sativa] 99 1e-24 ref|XP_023747052.1| uncharacterized protein LOC111895184 [Lactuc... 100 5e-24 ref|XP_021637067.1| uncharacterized protein LOC110632967 [Hevea ... 96 1e-21 dbj|GAV91410.1| hypothetical protein CFOL_v3_34805 [Cephalotus f... 92 7e-21 ref|XP_019180176.1| PREDICTED: uncharacterized protein LOC109175... 91 5e-20 ref|XP_007035739.2| PREDICTED: uncharacterized protein LOC186036... 90 6e-20 gb|EOY06665.1| Glycine-rich family protein, putative isoform 1 [... 90 6e-20 ref|XP_019180175.1| PREDICTED: uncharacterized protein LOC109175... 91 7e-20 ref|XP_019180173.1| PREDICTED: uncharacterized protein LOC109175... 91 8e-20 ref|XP_017974125.1| PREDICTED: uncharacterized protein LOC186036... 90 8e-20 gb|EOY06666.1| Glycine-rich family protein, putative isoform 2 [... 90 9e-20 ref|XP_017222107.1| PREDICTED: uncharacterized protein LOC108198... 90 1e-19 ref|XP_022871706.1| uncharacterized protein LOC111390822, partia... 89 1e-19 ref|XP_015086485.1| PREDICTED: uncharacterized protein LOC107029... 90 2e-19 gb|PON61938.1| hypothetical protein PanWU01x14_141400 [Parasponi... 89 2e-19 ref|XP_021608748.1| uncharacterized protein LOC110612305 [Maniho... 89 3e-19 >ref|XP_022036646.1| keratin, type I cytoskeletal 9-like [Helianthus annuus] gb|OTG25486.1| hypothetical protein HannXRQ_Chr05g0148181 [Helianthus annuus] Length = 206 Score = 137 bits (345), Expect = 4e-38 Identities = 63/81 (77%), Positives = 73/81 (90%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKETDLVLNCINDV SNF+F NRAT+ VKDTIK GCS+GPSRGDFNVAEH+Q+YESNS+ Sbjct: 126 LKETDLVLNCINDVFSNFIFNNRATILIVKDTIKAGCSYGPSRGDFNVAEHVQAYESNSY 185 Query: 242 MFSYPILFGLVPLVSICLLLF 180 FSYPILF L+PL+S+ +LLF Sbjct: 186 KFSYPILFWLIPLISVFILLF 206 >ref|XP_023765584.1| uncharacterized protein LOC111914074 [Lactuca sativa] ref|XP_023736652.1| uncharacterized protein LOC111884576 [Lactuca sativa] gb|PLY71612.1| hypothetical protein LSAT_2X45680 [Lactuca sativa] Length = 195 Score = 115 bits (288), Expect = 1e-29 Identities = 55/79 (69%), Positives = 65/79 (82%) Frame = -1 Query: 419 KETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSFM 240 KET+LVLNC+ND LS+F+F+NRATVR VKDTI GCS GPSRGDFNVAEHIQ+Y SNS+ Sbjct: 116 KETNLVLNCMNDALSSFLFYNRATVRDVKDTILSGCSSGPSRGDFNVAEHIQAYGSNSYK 175 Query: 239 FSYPILFGLVPLVSICLLL 183 FSY L LV +SI ++L Sbjct: 176 FSYSFLLWLVSFISISIIL 194 >gb|KVH90204.1| hypothetical protein Ccrd_007763 [Cynara cardunculus var. scolymus] Length = 230 Score = 108 bits (269), Expect = 2e-26 Identities = 51/80 (63%), Positives = 64/80 (80%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKET LVLNCIND+L++FVF+N AT++ VK+TIK GC +GP RGDFNVAEHI ESNS+ Sbjct: 152 LKETHLVLNCINDILTHFVFYNHATIKDVKETIKVGCGYGPHRGDFNVAEHI---ESNSY 208 Query: 242 MFSYPILFGLVPLVSICLLL 183 +PILFG+ L+ +C LL Sbjct: 209 QHLHPILFGVTSLILLCSLL 228 >gb|KVH96436.1| hypothetical protein Ccrd_001478 [Cynara cardunculus var. scolymus] Length = 195 Score = 104 bits (260), Expect = 2e-25 Identities = 48/67 (71%), Positives = 59/67 (88%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L+ET LVL+CINDVLSNFVF+NRATVR V+DTI GCS+GP+RGDFNVAE++++Y SNS+ Sbjct: 124 LRETKLVLSCINDVLSNFVFYNRATVRDVRDTITAGCSYGPTRGDFNVAEYVRAYGSNSY 183 Query: 242 MFSYPIL 222 SYP L Sbjct: 184 KLSYPNL 190 >gb|PLY84071.1| hypothetical protein LSAT_6X114241 [Lactuca sativa] Length = 71 Score = 99.0 bits (245), Expect = 1e-24 Identities = 47/70 (67%), Positives = 56/70 (80%) Frame = -1 Query: 392 INDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSFMFSYPILFGL 213 +ND LS+F+F+NRATVR VKDTI GCS GPSRGDFNVAEHIQ+Y SNS+ FSY L L Sbjct: 1 MNDALSSFLFYNRATVRDVKDTILSGCSSGPSRGDFNVAEHIQAYGSNSYKFSYSFLLWL 60 Query: 212 VPLVSICLLL 183 V +SI ++L Sbjct: 61 VSFISISIIL 70 >ref|XP_023747052.1| uncharacterized protein LOC111895184 [Lactuca sativa] Length = 184 Score = 100 bits (250), Expect = 5e-24 Identities = 51/83 (61%), Positives = 62/83 (74%), Gaps = 2/83 (2%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKET LVLNCI+D+LS++VF+N ATV VK+TIK GCS+GP RGDFNVAEHI+ ESN++ Sbjct: 102 LKETHLVLNCIDDMLSHYVFYNHATVNDVKETIKAGCSYGPHRGDFNVAEHIEDDESNAY 161 Query: 242 MFSY--PILFGLVPLVSICLLLF 180 Y ILFG L + LLF Sbjct: 162 KIYYTNSILFGFQSLFVVFTLLF 184 >ref|XP_021637067.1| uncharacterized protein LOC110632967 [Hevea brasiliensis] Length = 222 Score = 95.5 bits (236), Expect = 1e-21 Identities = 40/70 (57%), Positives = 58/70 (82%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L E +LVLNCI +++++FVF+N+AT++ +KDTIK GCS+GP RGDFNVAEH+Q+ E+ ++ Sbjct: 142 LSEVNLVLNCIENIMTHFVFYNKATIQDIKDTIKAGCSYGPERGDFNVAEHLQAQENRAY 201 Query: 242 MFSYPILFGL 213 M I+FGL Sbjct: 202 MNQIQIVFGL 211 >dbj|GAV91410.1| hypothetical protein CFOL_v3_34805 [Cephalotus follicularis] Length = 159 Score = 92.0 bits (227), Expect = 7e-21 Identities = 40/70 (57%), Positives = 56/70 (80%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L+ET LVLNCI ++L+NFVF+N AT++ V+DT+K GC +GP RG+FNVAEH+Q+ ++S Sbjct: 79 LQETQLVLNCIENILTNFVFYNEATIQDVRDTVKAGCGYGPERGNFNVAEHLQAERNSSH 138 Query: 242 MFSYPILFGL 213 + ILFGL Sbjct: 139 KVAAQILFGL 148 >ref|XP_019180176.1| PREDICTED: uncharacterized protein LOC109175367 isoform X3 [Ipomoea nil] Length = 199 Score = 90.9 bits (224), Expect = 5e-20 Identities = 40/70 (57%), Positives = 55/70 (78%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKETDL+L+CI +LS+F F+N+AT+ V+ TIK+GCS+GPSRG+FNVAEH+Q E+N+F Sbjct: 119 LKETDLILHCIEAILSDFEFYNKATINDVRHTIKEGCSYGPSRGNFNVAEHVQYEENNAF 178 Query: 242 MFSYPILFGL 213 S + L Sbjct: 179 QASKAVFHNL 188 >ref|XP_007035739.2| PREDICTED: uncharacterized protein LOC18603607 isoform X2 [Theobroma cacao] Length = 171 Score = 90.1 bits (222), Expect = 6e-20 Identities = 39/80 (48%), Positives = 59/80 (73%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNCI ++++NF+F+NRAT++ ++DTI+ GC +GP RGDFNV EHI++ S++ Sbjct: 91 LSETHLVLNCIENIMTNFLFYNRATIQDIRDTIQAGCGYGPERGDFNVEEHIEAEGSSAT 150 Query: 242 MFSYPILFGLVPLVSICLLL 183 + IL G+ ++ C LL Sbjct: 151 KAATQILIGIGSMIIACTLL 170 >gb|EOY06665.1| Glycine-rich family protein, putative isoform 1 [Theobroma cacao] Length = 171 Score = 90.1 bits (222), Expect = 6e-20 Identities = 39/80 (48%), Positives = 59/80 (73%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNCI ++++NF+F+NRAT++ ++DTI+ GC +GP RGDFNV EHI++ S++ Sbjct: 91 LSETHLVLNCIENIMTNFLFYNRATIQDIRDTIQAGCGYGPERGDFNVEEHIEAEGSSAS 150 Query: 242 MFSYPILFGLVPLVSICLLL 183 + IL G+ ++ C LL Sbjct: 151 KAATQILIGIGSMIIACTLL 170 >ref|XP_019180175.1| PREDICTED: uncharacterized protein LOC109175367 isoform X2 [Ipomoea nil] Length = 208 Score = 90.9 bits (224), Expect = 7e-20 Identities = 40/70 (57%), Positives = 55/70 (78%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKETDL+L+CI +LS+F F+N+AT+ V+ TIK+GCS+GPSRG+FNVAEH+Q E+N+F Sbjct: 128 LKETDLILHCIEAILSDFEFYNKATINDVRHTIKEGCSYGPSRGNFNVAEHVQYEENNAF 187 Query: 242 MFSYPILFGL 213 S + L Sbjct: 188 QASKAVFHNL 197 >ref|XP_019180173.1| PREDICTED: uncharacterized protein LOC109175367 isoform X1 [Ipomoea nil] ref|XP_019180174.1| PREDICTED: uncharacterized protein LOC109175367 isoform X1 [Ipomoea nil] Length = 215 Score = 90.9 bits (224), Expect = 8e-20 Identities = 40/70 (57%), Positives = 55/70 (78%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKETDL+L+CI +LS+F F+N+AT+ V+ TIK+GCS+GPSRG+FNVAEH+Q E+N+F Sbjct: 135 LKETDLILHCIEAILSDFEFYNKATINDVRHTIKEGCSYGPSRGNFNVAEHVQYEENNAF 194 Query: 242 MFSYPILFGL 213 S + L Sbjct: 195 QASKAVFHNL 204 >ref|XP_017974125.1| PREDICTED: uncharacterized protein LOC18603607 isoform X1 [Theobroma cacao] ref|XP_017974126.1| PREDICTED: uncharacterized protein LOC18603607 isoform X1 [Theobroma cacao] Length = 187 Score = 90.1 bits (222), Expect = 8e-20 Identities = 39/80 (48%), Positives = 59/80 (73%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNCI ++++NF+F+NRAT++ ++DTI+ GC +GP RGDFNV EHI++ S++ Sbjct: 107 LSETHLVLNCIENIMTNFLFYNRATIQDIRDTIQAGCGYGPERGDFNVEEHIEAEGSSAT 166 Query: 242 MFSYPILFGLVPLVSICLLL 183 + IL G+ ++ C LL Sbjct: 167 KAATQILIGIGSMIIACTLL 186 >gb|EOY06666.1| Glycine-rich family protein, putative isoform 2 [Theobroma cacao] Length = 190 Score = 90.1 bits (222), Expect = 9e-20 Identities = 39/80 (48%), Positives = 59/80 (73%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNCI ++++NF+F+NRAT++ ++DTI+ GC +GP RGDFNV EHI++ S++ Sbjct: 110 LSETHLVLNCIENIMTNFLFYNRATIQDIRDTIQAGCGYGPERGDFNVEEHIEAEGSSAS 169 Query: 242 MFSYPILFGLVPLVSICLLL 183 + IL G+ ++ C LL Sbjct: 170 KAATQILIGIGSMIIACTLL 189 >ref|XP_017222107.1| PREDICTED: uncharacterized protein LOC108198839 [Daucus carota subsp. sativus] gb|KZM85561.1| hypothetical protein DCAR_027017 [Daucus carota subsp. sativus] Length = 200 Score = 90.1 bits (222), Expect = 1e-19 Identities = 41/72 (56%), Positives = 59/72 (81%) Frame = -1 Query: 416 ETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSFMF 237 ET+LVLNCI+++LS+F+F+NRAT+ VK TIK GCS+GP RG+FNV EHIQ+ E+++ Sbjct: 124 ETNLVLNCIDNILSHFLFYNRATIYDVKATIKAGCSYGPERGNFNVEEHIQARENSAQRD 183 Query: 236 SYPILFGLVPLV 201 S P+L GL+ ++ Sbjct: 184 SKPLLLGLLLMI 195 >ref|XP_022871706.1| uncharacterized protein LOC111390822, partial [Olea europaea var. sylvestris] Length = 151 Score = 88.6 bits (218), Expect = 1e-19 Identities = 39/81 (48%), Positives = 61/81 (75%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 LKET VL+CI+ ++ +FVF+N+AT+ A+++TIK GCS+GP RGDFNVAEH+++ E ++ Sbjct: 71 LKETHHVLDCIDGIMKHFVFYNKATLSAIRETIKIGCSYGPKRGDFNVAEHLEAEEDSAR 130 Query: 242 MFSYPILFGLVPLVSICLLLF 180 S P+++GL+ + LF Sbjct: 131 PASKPVIYGLLLMFMAWRALF 151 >ref|XP_015086485.1| PREDICTED: uncharacterized protein LOC107029559 [Solanum pennellii] Length = 199 Score = 89.7 bits (221), Expect = 2e-19 Identities = 42/81 (51%), Positives = 60/81 (74%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L+ET VLNC+ +LS+F F+N+ATVRAV++TIK+GCS+GP RG FNVAEHI +Y++ +F Sbjct: 118 LEETHHVLNCLESILSHFRFYNKATVRAVEETIKEGCSYGPERGLFNVAEHILAYDNTAF 177 Query: 242 MFSYPILFGLVPLVSICLLLF 180 S L L+++ L+ F Sbjct: 178 RASKSSLLHSFVLMTLALIFF 198 >gb|PON61938.1| hypothetical protein PanWU01x14_141400 [Parasponia andersonii] Length = 175 Score = 88.6 bits (218), Expect = 2e-19 Identities = 42/80 (52%), Positives = 58/80 (72%), Gaps = 1/80 (1%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNCI ++L+NFVF+NRAT++ ++DTI GC +GP RG+ NVAEHI++ E+N+ Sbjct: 94 LTETHLVLNCIENILANFVFYNRATIQDIRDTIHAGCGYGPERGNLNVAEHIEAEENNAE 153 Query: 242 MFSYPILFGL-VPLVSICLL 186 IL G + +V CLL Sbjct: 154 RIGNQILVGFGLMVVGYCLL 173 >ref|XP_021608748.1| uncharacterized protein LOC110612305 [Manihot esculenta] gb|OAY56093.1| hypothetical protein MANES_03G202000 [Manihot esculenta] Length = 198 Score = 89.0 bits (219), Expect = 3e-19 Identities = 39/70 (55%), Positives = 54/70 (77%) Frame = -1 Query: 422 LKETDLVLNCINDVLSNFVFFNRATVRAVKDTIKDGCSFGPSRGDFNVAEHIQSYESNSF 243 L ET LVLNC+ +V+ +FVF+N+AT+ A+++TIK GCS+GP RGDFNVAEH+Q+ E+ + Sbjct: 118 LSETHLVLNCVENVMKHFVFYNKATIEAIRETIKAGCSYGPERGDFNVAEHLQAEENRAD 177 Query: 242 MFSYPILFGL 213 IL GL Sbjct: 178 KTQIKILSGL 187