BLASTX nr result
ID: Chrysanthemum21_contig00025810
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00025810
         (1851 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_021993298.1| protein ROS1-like [Helianthus annuus] >gi|11...   388   e-116
gb|KVI10426.1| DNA glycosylase, partial [Cynara cardunculus var....   365   e-108
gb|PLY70678.1| hypothetical protein LSAT_3X76420 [Lactuca sativa]     263   4e-72
ref|XP_017247046.1| PREDICTED: uncharacterized protein LOC108218...    95   3e-16
ref|XP_017247043.1| PREDICTED: uncharacterized protein LOC108218...    95   3e-16
gb|KZM97904.1| hypothetical protein DCAR_014734 [Daucus carota s...
95 3e-16 >ref|XP_021993298.1| protein ROS1-like [Helianthus annuus] gb|OTG07760.1| putative DNA glycosylase [Helianthus annuus] Length = 1420 Score = 388 bits (996), Expect = e-116 Identities = 260/598 (43%), Positives = 325/598 (54%), Gaps = 26/598 (4%) Frame = +1 Query: 136 MERVGNWIPLTPGKPISGMSEDNCESVLTSKQGTEKVNINGKFPFTGAFDTMVNTETIPV 315 MER G WIPLTPGKP L+S+QG K ++N +F T F++M Sbjct: 1 MERDGVWIPLTPGKP-----------ALSSEQGMGKESLNEEFSCTDCFESM-------- 41 Query: 316 TPAKPNPGRSGQSCETVLISEQVTEKEIITRELPCMLNETCELTAEINDGHELVFGATGK 495 + EK C++NE CE A+ D E G GK Sbjct: 42 ---------------------RCMEK--------CVMNEMCEFPADTLDEFEAGLGVAGK 72 Query: 496 DFISTTASRDDASCVQEEAVKIPTSECDGKKAHSGDESPVDKGSIPTPSKTKESRKRRND 675 + TTA +DD SCVQEE VK SE D KKAH DES GSIPTPSKTKE+RKRRND Sbjct: 73 EAEGTTAVQDDTSCVQEETVKSSPSEHDSKKAHGVDESQDGIGSIPTPSKTKETRKRRND 132 Query: 676 GIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXXAQXXXXXXXXXXXXXXXXXXXXXXVQEK 855 G DMNKK Q+ R+KKHRPK++D+S AQ VQE+ Sbjct: 133 GNDMNKKPSQRPRVKKHRPKIFDNSKPKKVPKAQ----TPRSTLRPKTPKPATPNRVQER 188 Query: 856 RKNSNKNE----SSCMQSSTTCGFGDVVRDNEEASTASIITADSCKRSLNFDDH-IAKES 1020 R S KN+ S+C Q ST G DV +D +EAS S I SCKR L+FD H + KE Sbjct: 189 RMQSKKNKFTDSSNCTQKSTCSGIEDVGQDVQEASRTS-IAVSSCKRFLDFDQHPVGKE- 246 Query: 1021 NPVSKSHEEPPRLEIFVGDFYNF----GKIVTSKRNTPRRSRFLKKSLEASEDLLGDIS- 1185 SKSHEEPP L DF F GKIVTSKRNTPRRSRF KSL+AS++LLGD + Sbjct: 247 ---SKSHEEPPILGF---DFEKFDCFRGKIVTSKRNTPRRSRFQNKSLKASDNLLGDGNT 300 Query: 1186 --------VTRNQDQADGNGRKYIHVYQRRKKINLNATSFTPTVKVYRRMIKENKCLQFS 1341 + RNQ+Q D G+ Y+H YQRRKKI ++TS +PT+ VY R +++ CL S Sbjct: 301 PQHGQDSFIIRNQEQVDKYGKTYVHFYQRRKKICSSSTSRSPTLLVYHRSCRKDLCLYHS 360 Query: 1342 KRCGPVFPKLFKKQRSLRKR--MKIHRWCDKADEVGKSSVXXXXXXXXXXXXXXXQXXXX 1515 K+CGPVFPKLFKKQR++RK+ +K++ W + E+ KS + + Sbjct: 361 KKCGPVFPKLFKKQRTMRKKVNIKVNHWYIISGELHKSLMNRSHRKLTQTTRKNVEIRVN 420 Query: 1516 XXXXXXXXXXPNIYTEEWLRRVFSPPRRTRSV--IRYKEVIRDLNSNQTMPYDDHDFLYC 1689 P+IYT+E LRRVF RR RS+ R +E IR PYD ++L C Sbjct: 421 
KSKDKKGVVKPHIYTDE-LRRVFLHQRRKRSIRHTRLRESIR------MPPYDPENYLLC 473 Query: 1690 HEENFLQIPECLPLQEVPALRIESFTSIYSDFLRVD----NLKWFGLREVPVIQSLSL 1851 E+ F I ECLPL EVP RI+SFTS + D R+D N W L+EV V++S SL Sbjct: 474 QEKIFSPITECLPLHEVPVQRIKSFTSPHWDSPRLDNSLGNQNWSQLQEVVVLESQSL 531 >gb|KVI10426.1| DNA glycosylase, partial [Cynara cardunculus var. scolymus] Length = 1405 Score = 365 bits (938), Expect = e-108 Identities = 238/582 (40%), Positives = 309/582 (53%), Gaps = 67/582 (11%) Frame = +1 Query: 307 IPVTPAKPNPGRSG------------QSCETVLISEQVTEKEIITRELPCM--------- 423 +P+TP KP P R G +C+ L S Q T E + P M Sbjct: 14 MPLTPGKPIPARLGLGLHSVQPMGGRHNCKATLTSGQGTGIENLNEAFPFMASFNATGYL 73 Query: 424 ----LNETCELTAEINDGHELVFGATGKDFISTTASRDDASCVQEEAVKIPTSECDGKKA 591 +NE L A + E G GKD S A + D SC +EE VK+P SECD K Sbjct: 74 EHNGINEMYGLKAGFGE-READLGVAGKDPKSNAADQHDTSCAREEEVKVPHSECDSKNV 132 Query: 592 HSGDESPVDKGSIPTPSKTKESRKRRNDGIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXX 771 H ES GS+PT S+ K+SRKRRNDGID+NKK Q+ RMKKHRPK+ DDS Sbjct: 133 HRVHESRAGVGSVPTTSEKKDSRKRRNDGIDLNKKPSQRPRMKKHRPKILDDSKPKKVPK 192 Query: 772 AQXXXXXXXXXXXXXXXXXXXXXXVQEKRKNSNKN----ESSCMQSSTTCGFGDVVRDNE 939 AQ VQE+RK + KN +SCMQSST+ G+ DVV+D + Sbjct: 193 AQ---TPRASTPRPKTPKPVTPNRVQERRKPARKNTFTGSTSCMQSSTSYGYKDVVQDVQ 249 Query: 940 EASTASIITADSCKRSLNFDDHIAK-ESNPVSKSHEEPPRLEIFVGDFYNFGKIVTSKRN 1116 AS +SII SCKRSL+F+ H+ +S+ VSKSH +P + +G+F FGK+VTSKRN Sbjct: 250 SASMSSIIVFKSCKRSLDFNHHLVDGKSHYVSKSHVQPRSVNYDLGEFSYFGKLVTSKRN 309 Query: 1117 TPRRSRFLKKSLEASEDLLG---------DISVTRNQDQADGNGRKYIHVYQRRKKINLN 1269 TPRRSRF KK L+ASEDLL D SV RNQ+ A+ +GR++++ YQRRKK + N Sbjct: 310 TPRRSRFQKKCLKASEDLLADNNKQQCHQDTSVIRNQELAETHGRRFVYFYQRRKKRSSN 369 Query: 1270 ATSFTPTVKVYRRMIKENKCLQFSKRCGPVFPKLFKKQRSLRKR--MKIHRWCDKADEVG 1443 ATS PT++VYRR + N+CLQ SK+ GP FP +FKKQR+ R++ M ++ W KA E G Sbjct: 370 ATSVIPTLQVYRRKFRANQCLQNSKKSGPNFPSIFKKQRAKRRKATMNVNWWYIKAFEDG 429 Query: 1444 KSSVXXXXXXXXXXXXXXXQXXXXXXXXXXXXXXPNIYTEEWLRRVFSPPRRTRSV---- 1611 K V PN++T E VF +R RS+ Sbjct: 430 
KKRVKRSHRKHIQTTGKSVHNGVNKSKDHKGVVKPNLHTAERFLHVFLTKKRKRSIRHTR 489 Query: 1612 ------------------IRYKEVIRDLNSNQTMPYDDHDFLYCHEENFLQIPECLPLQE 1737 + +E I +L+ + MPY+ L EE+FL++ EC PLQE Sbjct: 490 RRENILDIPIFKTTPYESEKRRENIMELSIFKAMPYETEICLPQQEESFLKVTECFPLQE 549 Query: 1738 VPALRIESFTSIYSDFLRVDN----LKWFGLREVPVIQSLSL 1851 VP IESFTS + + VDN LK L+EVPV+ S SL Sbjct: 550 VPIQTIESFTSFHRNVQLVDNSVDALKSLQLQEVPVLGSQSL 591 >gb|PLY70678.1| hypothetical protein LSAT_3X76420 [Lactuca sativa] Length = 1581 Score = 263 bits (673), Expect = 4e-72 Identities = 202/562 (35%), Positives = 277/562 (49%), Gaps = 48/562 (8%) Frame = +1 Query: 310 PVTPAKPNPGRSGQSCETVLISEQVTEKEIITRELPCMLN-ETCELTAEINDGHELVFGA 486 P+TPAKP P RSG + + S + TEKE I E C + T E + + V G Sbjct: 22 PLTPAKPVPARSGHNTQVASTSGRGTEKENINDEFLCTSSLGTTEYLEDNGNKGSSVLGV 81 Query: 487 TGKDFISTTASRDDASCVQEEAVKIPTSECDGKKAHSGDESPVDKGSIPTPSKTKESRKR 666 GKD ISTT +++ SC+QEE ++ G ++ + S+P+PS+TK+SRKR Sbjct: 82 AGKDPISTTTDQNNTSCIQEET--------KSNESQYGIDNSIP--SVPSPSETKDSRKR 131 Query: 667 RNDGIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXXAQXXXXXXXXXXXXXXXXXXXXXXV 846 RN+GID+NKK ++ RMKKHRPKVYDDS Q V Sbjct: 132 RNNGIDLNKKPNKRTRMKKHRPKVYDDS---KPKKVQKPKTPKPKTPKPKTPKPVTPNRV 188 Query: 847 QEK----RKNSNKNESSCMQSSTT------------------CGFGDV---VRDNEEAST 951 EK RK K +SCMQ ST+ C + + D E+AS Sbjct: 189 HEKSVRSRKEKFKEPTSCMQKSTSYVDQIDSHHMSKLHEEAICMQNTMNYNIEDVEQASR 248 Query: 952 AS---IITADSCKRSLNFDDHIAKESNPVSKSHEEPPRLEIFVGDFYNFGKIVTSKRNTP 1122 S +I CKR L+ + SKSH++ GD+ FG+IVTSKRNT Sbjct: 249 VSRNALIPMIPCKRRLDLN------YEGESKSHDKHLSFNFDNGDYDFFGRIVTSKRNTK 302 Query: 1123 RRSRFLKKS--LEASEDLLG----------DISVTRNQDQADGNGRKYIHVYQRRKKINL 1266 RRSRF KKS LE SEDLLG D SV N Q + R++++VY+ +KK N Sbjct: 303 RRSRFQKKSVELEVSEDLLGDNNNKQHCGLDFSVIENLKQTKKHVRRFVYVYKCQKKRN- 361 Query: 1267 NATSFTPTVKVYRRMIKENKCLQFSKRCGPVFPKLFKKQRSLRKRMKIH-RWCDKADEVG 1443 S T T++V +R + ++CLQ S++ GP FPKLFKKQR +RK++ I+ W K + Sbjct: 362 ---SKTSTLQVNQRKCRLDQCLQSSRKSGPNFPKLFKKQRKMRKKVTINPNWLLKFLDNN 418 Query: 1444 
KSSVXXXXXXXXXXXXXXXQXXXXXXXXXXXXXXPNIYTEEWLRRVFSPPRRTRSVI--R 1617 K L +FSP ++ RS++ R Sbjct: 419 KKKKEPHKKLKKKVAK----------------------NNLLLLPIFSPMKKKRSILQTR 456 Query: 1618 YKEVIRDLNSNQTMPYDDHDFLYCHEENFLQIPECLPLQEVPALRIESFTSIYSDFLRVD 1797 +E + D ++ + D FL C EENFLQ+ ECLPLQEVP +I+SFTS+ V+ Sbjct: 457 RRENLVDFPISKAISLYDERFLLCQEENFLQMTECLPLQEVPIHQIDSFTSLPLHVQGVE 516 Query: 1798 N----LKWFGLREVPVIQSLSL 1851 N L W +EVP+++S SL Sbjct: 517 NTLAALDWLQAQEVPLLESQSL 538 >ref|XP_017247046.1| PREDICTED: uncharacterized protein LOC108218564 isoform X2 [Daucus carota subsp. sativus] Length = 1453 Score = 94.7 bits (234), Expect = 3e-16 Identities = 114/457 (24%), Positives = 188/457 (41%), Gaps = 43/457 (9%) Frame = +1 Query: 154 WIPLTPGK----PISGMSEDNCESVLTSKQGTEKVNINGKFPFTGAFDTMVNTETIPVTP 321 W+PLTP K ISG+ NC+ TE V++N F +V++ + +T Sbjct: 9 WVPLTPQKVCLESISGVK--NCKD----PNFTESVDVNSDF----CDGKVVSSNGVEITL 58 Query: 322 AKPNPGRSGQSCETVLISEQVTEKEIITRELPCMLNETCELTAEINDGHELVFGATGKDF 501 +S T ++VTE E R L C +L ++ G K Sbjct: 59 GV----EKDESRCTEKEGQRVTELEDSERNLYC----AGKLMEHMDSPSVSTPGLGEKQN 110 Query: 502 ISTTASRDDASCVQEEAVKIPTSECDGKKAHSGDESP--VDKGSI--PTPSKTKESRKRR 669 + D++ ++ ++ E + +K H + P +D S+ P P++ ++SRKR Sbjct: 111 TRPRHNDDESRHTDKKENQVGKFEDEERKLHCAENLPRDIDSPSVLTPCPAEKQDSRKRN 170 Query: 670 NDGIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXXAQXXXXXXXXXXXXXXXXXXXXXXVQ 849 NDGID+NKK KQ+ ++KKHRPK+ D + Sbjct: 171 NDGIDLNKKPKQRPKVKKHRPKIAVDRWMPRKIPKSQTPRNSTP---------------K 215 Query: 850 EKRKNSNKN-----ESSCMQSSTTCGFGDVVRD--NEEASTASIITADSCKRSLNFDDHI 1008 + R ++ KN + + + GF D + S S A CKRSL F+ Sbjct: 216 DNRPSTTKNVNLRGKRYLGKLTNRKGFTSTSEDVRGDANSETSAGVAILCKRSLKFECEA 275 Query: 1009 AKESNPV----SKSHEEPPRLEIFVG-------DFYNFGKIVTSKRNTPRRSRFL----- 1140 + + V S+ H++ L G D + + + +R SRFL Sbjct: 276 VDKCDCVVESCSRPHQDQFHLRSVEGLQMSTDLDMCSDFNVQSKERGENCSSRFLNQYQR 335 Query: 1141 ------KKSLEASEDL-LGDISVTRNQDQADGNGRKYIHVYQRRKKI-----NLNATSFT 1284 K S EA+ + +G T D + + + ++ K+ NL+ S Sbjct: 336 
RMRGVDKPSSEANTGINIGTKECTNLSFSTDESQLMFSNPHELEKQSVFHHNNLHDNSKA 395 Query: 1285 PTVKVYRRMIKENKCLQFSKRCGPVFPKLFKKQRSLR 1395 ++ YRR + N+C Q S++ GP FPK+FKK R++R Sbjct: 396 GCLQFYRRTFRVNQCRQNSRKSGPNFPKIFKKSRTMR 432 >ref|XP_017247043.1| PREDICTED: uncharacterized protein LOC108218564 isoform X1 [Daucus carota subsp. sativus] ref|XP_017247045.1| PREDICTED: uncharacterized protein LOC108218564 isoform X1 [Daucus carota subsp. sativus] Length = 1816 Score = 94.7 bits (234), Expect = 3e-16 Identities = 114/457 (24%), Positives = 188/457 (41%), Gaps = 43/457 (9%) Frame = +1 Query: 154 WIPLTPGK----PISGMSEDNCESVLTSKQGTEKVNINGKFPFTGAFDTMVNTETIPVTP 321 W+PLTP K ISG+ NC+ TE V++N F +V++ + +T Sbjct: 9 WVPLTPQKVCLESISGVK--NCKD----PNFTESVDVNSDF----CDGKVVSSNGVEITL 58 Query: 322 AKPNPGRSGQSCETVLISEQVTEKEIITRELPCMLNETCELTAEINDGHELVFGATGKDF 501 +S T ++VTE E R L C +L ++ G K Sbjct: 59 GV----EKDESRCTEKEGQRVTELEDSERNLYC----AGKLMEHMDSPSVSTPGLGEKQN 110 Query: 502 ISTTASRDDASCVQEEAVKIPTSECDGKKAHSGDESP--VDKGSI--PTPSKTKESRKRR 669 + D++ ++ ++ E + +K H + P +D S+ P P++ ++SRKR Sbjct: 111 TRPRHNDDESRHTDKKENQVGKFEDEERKLHCAENLPRDIDSPSVLTPCPAEKQDSRKRN 170 Query: 670 NDGIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXXAQXXXXXXXXXXXXXXXXXXXXXXVQ 849 NDGID+NKK KQ+ ++KKHRPK+ D + Sbjct: 171 NDGIDLNKKPKQRPKVKKHRPKIAVDRWMPRKIPKSQTPRNSTP---------------K 215 Query: 850 EKRKNSNKN-----ESSCMQSSTTCGFGDVVRD--NEEASTASIITADSCKRSLNFDDHI 1008 + R ++ KN + + + GF D + S S A CKRSL F+ Sbjct: 216 DNRPSTTKNVNLRGKRYLGKLTNRKGFTSTSEDVRGDANSETSAGVAILCKRSLKFECEA 275 Query: 1009 AKESNPV----SKSHEEPPRLEIFVG-------DFYNFGKIVTSKRNTPRRSRFL----- 1140 + + V S+ H++ L G D + + + +R SRFL Sbjct: 276 VDKCDCVVESCSRPHQDQFHLRSVEGLQMSTDLDMCSDFNVQSKERGENCSSRFLNQYQR 335 Query: 1141 ------KKSLEASEDL-LGDISVTRNQDQADGNGRKYIHVYQRRKKI-----NLNATSFT 1284 K S EA+ + +G T D + + + ++ K+ NL+ S Sbjct: 336 RMRGVDKPSSEANTGINIGTKECTNLSFSTDESQLMFSNPHELEKQSVFHHNNLHDNSKA 395 Query: 1285 PTVKVYRRMIKENKCLQFSKRCGPVFPKLFKKQRSLR 1395 ++ YRR + N+C Q S++ GP FPK+FKK R++R Sbjct: 396 
GCLQFYRRTFRVNQCRQNSRKSGPNFPKIFKKSRTMR 432 >gb|KZM97904.1| hypothetical protein DCAR_014734 [Daucus carota subsp. sativus] Length = 1917 Score = 94.7 bits (234), Expect = 3e-16 Identities = 114/457 (24%), Positives = 188/457 (41%), Gaps = 43/457 (9%) Frame = +1 Query: 154 WIPLTPGK----PISGMSEDNCESVLTSKQGTEKVNINGKFPFTGAFDTMVNTETIPVTP 321 W+PLTP K ISG+ NC+ TE V++N F +V++ + +T Sbjct: 9 WVPLTPQKVCLESISGVK--NCKD----PNFTESVDVNSDF----CDGKVVSSNGVEITL 58 Query: 322 AKPNPGRSGQSCETVLISEQVTEKEIITRELPCMLNETCELTAEINDGHELVFGATGKDF 501 +S T ++VTE E R L C +L ++ G K Sbjct: 59 GV----EKDESRCTEKEGQRVTELEDSERNLYC----AGKLMEHMDSPSVSTPGLGEKQN 110 Query: 502 ISTTASRDDASCVQEEAVKIPTSECDGKKAHSGDESP--VDKGSI--PTPSKTKESRKRR 669 + D++ ++ ++ E + +K H + P +D S+ P P++ ++SRKR Sbjct: 111 TRPRHNDDESRHTDKKENQVGKFEDEERKLHCAENLPRDIDSPSVLTPCPAEKQDSRKRN 170 Query: 670 NDGIDMNKKAKQKFRMKKHRPKVYDDSXXXXXXXAQXXXXXXXXXXXXXXXXXXXXXXVQ 849 NDGID+NKK KQ+ ++KKHRPK+ D + Sbjct: 171 NDGIDLNKKPKQRPKVKKHRPKIAVDRWMPRKIPKSQTPRNSTP---------------K 215 Query: 850 EKRKNSNKN-----ESSCMQSSTTCGFGDVVRD--NEEASTASIITADSCKRSLNFDDHI 1008 + R ++ KN + + + GF D + S S A CKRSL F+ Sbjct: 216 DNRPSTTKNVNLRGKRYLGKLTNRKGFTSTSEDVRGDANSETSAGVAILCKRSLKFECEA 275 Query: 1009 AKESNPV----SKSHEEPPRLEIFVG-------DFYNFGKIVTSKRNTPRRSRFL----- 1140 + + V S+ H++ L G D + + + +R SRFL Sbjct: 276 VDKCDCVVESCSRPHQDQFHLRSVEGLQMSTDLDMCSDFNVQSKERGENCSSRFLNQYQR 335 Query: 1141 ------KKSLEASEDL-LGDISVTRNQDQADGNGRKYIHVYQRRKKI-----NLNATSFT 1284 K S EA+ + +G T D + + + ++ K+ NL+ S Sbjct: 336 RMRGVDKPSSEANTGINIGTKECTNLSFSTDESQLMFSNPHELEKQSVFHHNNLHDNSKA 395 Query: 1285 PTVKVYRRMIKENKCLQFSKRCGPVFPKLFKKQRSLR 1395 ++ YRR + N+C Q S++ GP FPK+FKK R++R Sbjct: 396 GCLQFYRRTFRVNQCRQNSRKSGPNFPKIFKKSRTMR 432