BLASTX nr result
ID: Chrysanthemum22_contig00023605
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum22_contig00023605 (463 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_023758686.1| uncharacterized protein LOC111907122 [Lactuc... 50 4e-09 ref|XP_021975266.1| uncharacterized protein LOC110870390 [Helian... 64 5e-09 ref|XP_021971650.1| uncharacterized protein LOC110866811 [Helian... 64 7e-09 dbj|GAU43034.1| hypothetical protein TSUD_12890 [Trifolium subte... 64 8e-09 gb|KYP71360.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu... 60 2e-08 ref|XP_019465331.1| PREDICTED: uncharacterized protein LOC109363... 63 2e-08 ref|XP_022028823.1| uncharacterized protein LOC110929940 [Helian... 62 3e-08 gb|KHN27546.1| LINE-1 reverse transcriptase like, partial [Glyci... 62 5e-08 gb|KHN06639.1| hypothetical protein glysoja_048361, partial [Gly... 57 5e-08 dbj|GAU18134.1| hypothetical protein TSUD_248350 [Trifolium subt... 61 6e-08 dbj|GAU34591.1| hypothetical protein TSUD_15060 [Trifolium subte... 61 6e-08 ref|XP_023758688.1| uncharacterized protein LOC111907126 [Lactuc... 46 6e-08 ref|XP_023771941.1| uncharacterized protein LOC111920591 [Lactuc... 61 6e-08 gb|KHN25569.1| hypothetical protein glysoja_035697, partial [Gly... 58 6e-08 gb|KHN46635.1| hypothetical protein glysoja_043348, partial [Gly... 57 7e-08 ref|XP_021996168.1| uncharacterized protein LOC110893365 [Helian... 61 8e-08 gb|PNY12727.1| ribonuclease H [Trifolium pratense] 61 8e-08 ref|XP_021994198.1| uncharacterized protein LOC110890853 [Helian... 42 8e-08 ref|XP_022003153.1| uncharacterized protein LOC110900576 [Helian... 45 8e-08 gb|OTG21358.1| putative RNA-directed DNA polymerase, eukaryota [... 61 8e-08 >ref|XP_023758686.1| uncharacterized protein LOC111907122 [Lactuca sativa] Length = 531 Score = 50.4 bits (119), Expect(2) = 4e-09 Identities = 27/57 (47%), Positives = 34/57 (59%), Gaps = 4/57 (7%) Frame = -3 Query: 290 IFLLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW----*IKVYGRLLIFNVDSRRD 132 IF +P IKEIK+L + LW NGE+ KGKAK+ W KVYG L + N+ D Sbjct: 217 IFKIPIATIKEIKKLCRTFLWANGEIVKGKAKVKWNDICKSKVYGGLGVKNLRKWND 273 Score = 38.1 bits (87), Expect(2) = 4e-09 Identities = 15/28 (53%), Positives = 23/28 (82%) Frame = -1 Query: 382 IYDWRNKSLSFAQRLQLTTAVLASVEVY 299 I++W++K+LSFA RLQL ++L S+ VY Sbjct: 186 IFNWKSKALSFAGRLQLINSILTSIHVY 213 >ref|XP_021975266.1| uncharacterized protein LOC110870390 [Helianthus annuus] Length = 1652 Score = 64.3 bits (155), Expect = 5e-09 Identities = 31/64 (48%), Positives = 41/64 (64%) Frame = -3 Query: 194 IAW*IKVYGRLLIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSIL 15 + W K ++LIF VD + YD +NW FLF VM MGF +K WI+ CL+S+ S+L Sbjct: 996 VTWAKKRKVKMLIFKVDFEK-AYDSLNWKFLFKVMEYMGFPDKWVNWIKGCLRSAKGSVL 1054 Query: 14 VNGS 3 VNGS Sbjct: 1055 VNGS 1058 >ref|XP_021971650.1| uncharacterized protein LOC110866811 [Helianthus annuus] Length = 1031 Score = 63.9 bits (154), Expect = 7e-09 Identities = 33/64 (51%), Positives = 41/64 (64%) Frame = -3 Query: 194 IAW*IKVYGRLLIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSIL 15 +AW K +LLIF VD + YD INW FLF++M+ MGF K WI+ CL S S+L Sbjct: 456 VAWAKKKKEKLLIFKVDFEK-AYDSINWKFLFHLMDLMGFPEKWICWIKGCLVSGMGSVL 514 Query: 14 VNGS 3 VNGS Sbjct: 515 VNGS 518 >dbj|GAU43034.1| hypothetical protein TSUD_12890 [Trifolium subterraneum] Length = 389 Score = 63.5 bits (153), Expect = 8e-09 Identities = 29/55 (52%), Positives = 40/55 (72%) Frame = -3 Query: 167 RLLIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 + L+F VD R +YD +NW+FL +M+RMGF + C+WI AC+ SS S+LVNGS Sbjct: 32 KCLLFKVDFER-VYDTVNWNFLDYMMSRMGFADGWCRWIRACVFQSSMSVLVNGS 85 >gb|KYP71360.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 158 Score = 60.5 bits (145), Expect = 2e-08 Identities = 28/53 (52%), Positives = 37/53 (69%) Frame = -3 Query: 161 LIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 L F VD + +YD +NW+FL ++ R+GF NK WI CL+SSS S+LVNGS Sbjct: 56 LFFKVDYEK-VYDSVNWNFLKYMLRRLGFCNKWIAWISVCLESSSISVLVNGS 107 >ref|XP_019465331.1| PREDICTED: uncharacterized protein LOC109363522 [Lupinus angustifolius] Length = 765 Score = 62.8 bits (151), Expect = 2e-08 Identities = 36/82 (43%), Positives = 49/82 (59%) Frame = -3 Query: 248 LLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSFLFNVMNRMGFGN 69 ++ G + N + +GK K YG IF VD + YD +NWSFL +M RMGF Sbjct: 454 IMDGVVIVNEVIDQGKKKN------YGDCFIFKVDFEK-AYDCVNWSFLLYMMERMGFCL 506 Query: 68 K*CKWIEACLKSSSTSILVNGS 3 K WI++CL+S+ TSILVNG+ Sbjct: 507 KWRTWIKSCLQSNFTSILVNGN 528 >ref|XP_022028823.1| uncharacterized protein LOC110929940 [Helianthus annuus] Length = 914 Score = 62.0 bits (149), Expect = 3e-08 Identities = 35/94 (37%), Positives = 54/94 (57%) Frame = -3 Query: 284 LLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSF 105 ++PS + ++G +G L + I+W K ++LIF VD + YD +NW F Sbjct: 116 VIPSLVNPVQTAFVEGRSIFDGPLITSEI-ISWAKKSKKKMLIFKVDFEK-AYDSVNWKF 173 Query: 104 LFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 L + + MGF ++ KW+ ACLKSS S+LV+GS Sbjct: 174 LLSNLKAMGFPSRWTKWVGACLKSSWASVLVSGS 207 >gb|KHN27546.1| LINE-1 reverse transcriptase like, partial [Glycine soja] Length = 1371 Score = 61.6 bits (148), Expect = 5e-08 Identities = 30/53 (56%), Positives = 37/53 (69%) Frame = -3 Query: 161 LIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 L+F VD R YD I+W FL +M R+GF +K WIE CLKS+S S+LVNGS Sbjct: 724 LVFKVDYER-AYDSISWEFLSYMMKRLGFCHKWISWIEGCLKSASISVLVNGS 775 >gb|KHN06639.1| hypothetical protein glysoja_048361, partial [Glycine soja] Length = 85 Score = 57.4 bits (137), Expect = 5e-08 Identities = 26/53 (49%), Positives = 37/53 (69%) Frame = -3 Query: 161 LIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 ++F VD + YD ++W FLF +M RMGF ++ WI+ CL S+S SIL+NGS Sbjct: 22 MVFKVDFEK-AYDSVSWQFLFYMMGRMGFHDRWIGWIKGCLTSASISILMNGS 73 >dbj|GAU18134.1| hypothetical protein TSUD_248350 [Trifolium subterraneum] Length = 694 Score = 61.2 bits (147), Expect = 6e-08 Identities = 36/91 (39%), Positives = 49/91 (53%) Frame = -3 Query: 275 STIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSFLFN 96 S I K LKG L +G L + + W K L+F VD + YD ++WSFL Sbjct: 3 SLISKNQSAFLKGRLLVDGVLAINEV-VDWVKKAKKECLVFKVDFEK-AYDSVSWSFLEY 60 Query: 95 VMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 ++ R GF K WI+AC+ + S S+LVNGS Sbjct: 61 MLRRFGFDEKWRSWIKACVFAGSLSVLVNGS 91 >dbj|GAU34591.1| hypothetical protein TSUD_15060 [Trifolium subterraneum] Length = 776 Score = 61.2 bits (147), Expect = 6e-08 Identities = 36/94 (38%), Positives = 50/94 (53%) Frame = -3 Query: 284 LLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSF 105 ++ S K LKG L +G L + + W K L+F VD + YD ++WSF Sbjct: 240 VMDSLTSKNQSAFLKGRLLVDGVLAINEV-VDWVKKTKKECLVFKVDFEK-AYDSVSWSF 297 Query: 104 LFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 L ++ R GF K WI+AC+ S S S+LVNGS Sbjct: 298 LEYMLRRFGFDGKWRSWIKACVFSGSLSVLVNGS 331 >ref|XP_023758688.1| uncharacterized protein LOC111907126 [Lactuca sativa] Length = 1144 Score = 45.8 bits (107), Expect(2) = 6e-08 Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 4/85 (4%) Frame = -3 Query: 290 IFLLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW----*IKVYGRLLIFNVDSRRDIYD 123 IF +P I EI+++ + LW NGE+ KGKA++ W K YG L I N+ R D Sbjct: 813 IFKIPIATINEIEKMCRSFLWANGEIVKGKAEVKWQDICKPKEYGGLGIKNL---RRWND 869 Query: 122 GINWSFLFNVMNRMGFGNK*CKWIE 48 + ++NV+N NK W++ Sbjct: 870 ALLAKHVWNVIN-----NKNSLWVQ 889 Score = 38.5 bits (88), Expect(2) = 6e-08 Identities = 16/28 (57%), Positives = 23/28 (82%) Frame = -1 Query: 382 IYDWRNKSLSFAQRLQLTTAVLASVEVY 299 I++W++K+LSFA RLQL +VL S+ VY Sbjct: 782 IFNWKSKTLSFAGRLQLINSVLTSIHVY 809 >ref|XP_023771941.1| uncharacterized protein LOC111920591 [Lactuca sativa] Length = 343 Score = 60.8 bits (146), Expect = 6e-08 Identities = 36/97 (37%), Positives = 53/97 (54%) Frame = -3 Query: 293 IIFLLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGIN 114 ++ +LP+ I E LKG +G L + I+W + LIF V+ + YD ++ Sbjct: 14 LVLVLPNVISVEQSTFLKGRKVLDGPLMVSEL-ISWGKRKKKEFLIFKVEFEK-AYDSVS 71 Query: 113 WSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 W +L VM MGFG K WI+ LK++ S+LVNGS Sbjct: 72 WDYLDKVMEFMGFGVKWRSWIQGLLKNARLSVLVNGS 108 >gb|KHN25569.1| hypothetical protein glysoja_035697, partial [Glycine soja] Length = 111 Score = 57.8 bits (138), Expect = 6e-08 Identities = 27/53 (50%), Positives = 36/53 (67%) Frame = -3 Query: 161 LIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 L+F VD R YD ++W+FL ++ R+GF K WIE CL S+S S+LVNGS Sbjct: 53 LVFKVDYER-AYDSVSWAFLSYMVRRLGFYTKWISWIEGCLHSASVSVLVNGS 104 >gb|KHN46635.1| hypothetical protein glysoja_043348, partial [Glycine soja] Length = 85 Score = 57.0 bits (136), Expect = 7e-08 Identities = 25/53 (47%), Positives = 36/53 (67%) Frame = -3 Query: 161 LIFNVDSRRDIYDGINWSFLFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 ++F VD + YD ++W FLF +M+RMGF + +W CL S++ SILVNGS Sbjct: 22 MVFKVDFEK-AYDSVSWQFLFYMMSRMGFHERWIRWFRGCLTSATMSILVNGS 73 >ref|XP_021996168.1| uncharacterized protein LOC110893365 [Helianthus annuus] Length = 605 Score = 60.8 bits (146), Expect = 8e-08 Identities = 33/94 (35%), Positives = 52/94 (55%) Frame = -3 Query: 284 LLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSF 105 +L I + L+G +G L + I+W K R I +D + YD +NW+F Sbjct: 299 VLEEVISENQSAFLEGKFILDGPLIVNEV-ISWLKKEKSRAFIMKIDFEK-AYDNVNWNF 356 Query: 104 LFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 + +V+N+MGF + C W+ LKS+ +S+LVNGS Sbjct: 357 VISVLNQMGFPPRWCCWVLGILKSARSSVLVNGS 390 >gb|PNY12727.1| ribonuclease H [Trifolium pratense] Length = 698 Score = 60.8 bits (146), Expect = 8e-08 Identities = 37/93 (39%), Positives = 49/93 (52%) Frame = -3 Query: 281 LPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSFL 102 + S I K LKG L +G L + + W K LIF VD + YD ++WSFL Sbjct: 1 MDSLISKNQSAFLKGRLLVDGVLAINEV-VDWVKKKKKECLIFKVDFEK-AYDSVSWSFL 58 Query: 101 FNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 ++ GFG K WI AC+ + S S+LVNGS Sbjct: 59 DYMLRGFGFGEKWRSWIRACVFAGSLSVLVNGS 91 >ref|XP_021994198.1| uncharacterized protein LOC110890853 [Helianthus annuus] Length = 1146 Score = 42.0 bits (97), Expect(2) = 8e-08 Identities = 20/28 (71%), Positives = 23/28 (82%) Frame = -1 Query: 382 IYDWRNKSLSFAQRLQLTTAVLASVEVY 299 I DWRNKSLSFA RLQL +VL+S+ VY Sbjct: 715 ITDWRNKSLSFAGRLQLIRSVLSSLHVY 742 Score = 42.0 bits (97), Expect(2) = 8e-08 Identities = 17/35 (48%), Positives = 24/35 (68%) Frame = -3 Query: 290 IFLLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW 186 +F+LP IIKE++ +K LW G+ KGKAK+ W Sbjct: 746 VFILPKRIIKELEDRMKRFLWAQGDDIKGKAKVKW 780 >ref|XP_022003153.1| uncharacterized protein LOC110900576 [Helianthus annuus] Length = 385 Score = 45.1 bits (105), Expect(2) = 8e-08 Identities = 17/35 (48%), Positives = 29/35 (82%) Frame = -3 Query: 290 IFLLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW 186 +F+LP++IIKE++ +K LW +G ++KG+AK+AW Sbjct: 161 VFILPASIIKELESKMKWFLWGHGTVSKGRAKVAW 195 Score = 38.9 bits (89), Expect(2) = 8e-08 Identities = 17/28 (60%), Positives = 22/28 (78%) Frame = -1 Query: 382 IYDWRNKSLSFAQRLQLTTAVLASVEVY 299 I DW+N+ LSFA RLQL +VL+S+ VY Sbjct: 130 ILDWKNRFLSFAGRLQLVISVLSSIHVY 157 >gb|OTG21358.1| putative RNA-directed DNA polymerase, eukaryota [Helianthus annuus] Length = 1780 Score = 60.8 bits (146), Expect = 8e-08 Identities = 32/94 (34%), Positives = 52/94 (55%) Frame = -3 Query: 284 LLPSTIIKEIKRLLKGALWCNGELTKGKAKIAW*IKVYGRLLIFNVDSRRDIYDGINWSF 105 +L I LKG +G L + I+W K + +D + YD +NW+F Sbjct: 1099 VLEGVISDSQSAFLKGRYILDGPLIINEI-ISWIKKSKKKAFFLKIDFEK-AYDNVNWNF 1156 Query: 104 LFNVMNRMGFGNK*CKWIEACLKSSSTSILVNGS 3 + +++++MGF + CKWI LKS+S+S+LVNG+ Sbjct: 1157 VLSILSQMGFPERWCKWIRGILKSASSSVLVNGA 1190