BLASTX nr result
ID: Chrysanthemum22_contig00037651
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00037651
         (1216 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus]   211   4e-70
gb|OTG33639.1| putative zinc finger, CCHC-type [Helianthus annuus]   207   6e-60
ref|XP_023768909.1| protein FREE1-like [Lactuca sativa]              205   2e-56
ref|XP_017639741.1| PREDICTED: uncharacterized protein LOC108481...  195   7e-56
ref|XP_017609417.1| PREDICTED: uncharacterized protein LOC108455...  194   2e-55
ref|XP_017604411.1| PREDICTED: uncharacterized protein LOC108451...  193   4e-55
ref|XP_022032409.1| uncharacterized protein LOC110933498 [Helian...  192   5e-55
dbj|BAA22288.1| polyprotein [Oryza australiensis]                    178   2e-54
gb|OTG21781.1| putative zinc finger, CCHC-type [Helianthus annuus]   187   2e-53
gb|OTG05603.1| putative zinc finger, CCHC-type [Helianthus annuus]   185   2e-52
gb|PNX82340.1| retrotransposon protein putative Ty1-copia subcla...  184   4e-50
ref|XP_013639199.1| PREDICTED: uncharacterized protein LOC106344...  176   9e-49
ref|XP_022008408.1| uncharacterized protein LOC110907786 [Helian...  172   5e-48
ref|XP_012477568.1| PREDICTED: uncharacterized protein LOC105793...  169   6e-47
gb|PNY09707.1| nuclear matrix protein, partial [Trifolium pratense]  177   1e-46
ref|XP_022004366.1| uncharacterized protein LOC110901916 [Helian...  177   1e-46
ref|XP_021975761.1| uncharacterized protein LOC110871136 [Helian...
168 4e-46 ref|XP_017613650.1| PREDICTED: uncharacterized protein LOC108458... 165 1e-45 gb|OTG36157.1| hypothetical protein HannXRQ_Chr01g0004521 [Helia... 166 1e-45 ref|XP_013700463.1| uncharacterized protein LOC106404280 [Brassi... 166 3e-45 >gb|OTG07182.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 1325 Score = 211 bits (536), Expect(2) = 4e-70 Identities = 112/287 (39%), Positives = 159/287 (55%), Gaps = 2/287 (0%) Frame = +3 Query: 54 NNSS--IRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNA 227 NNSS ++SILEK+KLN NFL+W+RNLRIVL+ ++L + + A Sbjct: 5 NNSSFALKSILEKDKLNNTNFLEWHRNLRIVLKMAKRL-YVLETPIPTAPENNTVVAKKA 63 Query: 228 YNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHA 407 ++K + EVACLM A+M P++QKN+ED+ AFD++ +LK MFQ+QA QE ++T+K + Sbjct: 64 FDKHKEDAMEVACLMQATMSPDLQKNMEDMNAFDMIEQLKGMFQKQARQERYDTMKQLIS 123 Query: 408 CKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMG 587 CK +EG +VS ++LKMKGY+DQ+ +LGYP+ + V L +LS Y+QFV N+NM+ Sbjct: 124 CKMQEGSSVSAHVLKMKGYIDQLNKLGYPLQDEMAVDFILKSLSSHYDQFVMNFNMNDWV 183 Query: 588 KTIPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXX 767 KT+PE H MLK E + K VL + Sbjct: 184 KTVPELHGMLKTAEMNISNKVSQVL--MVRSGGIKKPKTKKKSYNSKNKGKAPVAAKPAT 241 Query: 768 XXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGAST 908 C+ C +GHW+RNCP YL +LK KA+G T Sbjct: 242 NKSKQAVKAPLPEERQCYECNKMGHWKRNCPEYLTKLKVKKANGEGT 288 Score = 84.7 bits (208), Expect(2) = 4e-70 Identities = 43/92 (46%), Positives = 60/92 (65%) Frame = +1 Query: 928 RKLNKGA*DL*VGNGNRASAEAIGSFDLILPNGLIIVLDNCHYAPTITRGVVSLSRLKDN 1107 +KL G +L VG+G R A G+F+L P+GLI+VL+NC YAP +TR ++S+S L + Sbjct: 328 KKLKPGDMELIVGDGKRVPVLATGTFNLTCPSGLIVVLNNCLYAPGLTRNIISVSLLYEQ 387 Query: 1108 GFIHVFTDYGISVSKDNIVYFNAIPRDGIFEI 1203 GF +VF IS + I YF A R+GI+EI Sbjct: 388 GFRYVFNGIAISAYLNGIYYFEAKSRNGIYEI 419 >gb|OTG33639.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 348 Score = 207 bits (527), Expect = 6e-60 Identities = 113/296 (38%), Positives = 154/296 (52%) Frame 
= +3 Query: 30 MTTPNQTINNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXX 209 M+T N S++SILE KL+ +NF+DWYRNL+I+LR+E+K + Sbjct: 60 MSTNNSDNTKFSLKSILENAKLDNSNFMDWYRNLKIILRAEKKA-YVLEGPIPEAPAANT 118 Query: 210 XXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFET 389 R + K ++ Q+V CLMLA MIPE+QKNLE+L A+++ +LK MFQQQA E FE Sbjct: 119 GPARRTWEKHVDDSQDVTCLMLAMMIPELQKNLENLGAYEMSEQLKNMFQQQARHERFEV 178 Query: 390 VKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNY 569 +++ C +EG +VS ++LKMK ++DQ+ERL P+ L + L +L Y QFV NY Sbjct: 179 MRSLITCHMQEGTSVSAHVLKMKSFVDQLERLNAPLSNELATDVILNSLPNSYHQFVMNY 238 Query: 570 NMHSMGKTIPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXX 749 NM+ +TIPE H MLK E +P K VLA + Sbjct: 239 NMNGWERTIPELHQMLKTAETNIPTKGNPVLAIREGRITKKKQSKGKG---------KAS 289 Query: 750 XXXXXXXXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGASTLGT 917 C HC +GHW+RNCP YLAELK K + GT Sbjct: 290 KQDKGKGKKVATPKAKPHKDTKCFHCDEIGHWKRNCPKYLAELKLKKLQIGESSGT 345 >ref|XP_023768909.1| protein FREE1-like [Lactuca sativa] Length = 622 Score = 205 bits (522), Expect = 2e-56 Identities = 114/299 (38%), Positives = 160/299 (53%), Gaps = 2/299 (0%) Frame = +3 Query: 24 SKMTTPNQTINNS-SIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXX 200 ++MT N + + + S+RSIL+KEKL+G NFLDW+RN+RIVLRSE+K Sbjct: 319 TQMTNSNVSTHQTLSLRSILDKEKLSGLNFLDWFRNMRIVLRSEKKDYILDGPIPEEPPP 378 Query: 201 XXXXXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQEL 380 +++ + K ++ EVAC+MLA M+PE+QK E AFD++ +LK MFQ QA+QE Sbjct: 379 NASKAIKDTWAKHNDDSNEVACIMLACMVPELQKTFEFHTAFDMIEQLKLMFQVQAKQER 438 Query: 381 FETVKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFV 560 F T+ F CK EG +VS ++LKMK Y++Q+ RLG+ + L + + L +L K + QFV Sbjct: 439 FATMNEFIGCKMTEGSSVSAHVLKMKAYVEQLSRLGFSIRDELAIDVILGSLPKSFSQFV 498 Query: 561 QNYNMHSMGKTIPEQHAMLKLTEKGLPKKAPAVL-AXXXXXXXXXXXXXPQAXXXXXXXX 737 NYN ++ K+I E H+MLK E + K P VL Sbjct: 499 LNYNKNNWEKSISELHSMLKTVEANMKKSRPQVLMVREGKITKNVKTGNVIGKGKAFEKG 558 Query: 738 
XXXXXXXXXXXXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGASTLG 914 C HC GHW+RNCP+YL E+KKNKA+ T G Sbjct: 559 KEKFKVKGKAPTSKPIAKTKPHADADCFHCNQKGHWKRNCPLYLEEIKKNKANAIGTSG 617 >ref|XP_017639741.1| PREDICTED: uncharacterized protein LOC108481080 [Gossypium arboreum] Length = 289 Score = 195 bits (495), Expect = 7e-56 Identities = 105/288 (36%), Positives = 148/288 (51%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYN 233 N S+R +LE +KLN NFLDW+ NLRIVL+ E KL ++AY Sbjct: 6 NTLSLRLVLENDKLNVLNFLDWFCNLRIVLKQEWKLYVIEKSLPDEPPANASRVDKDAYK 65 Query: 234 KRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACK 413 K ++ +V CLMLA+M PE+QK ED++A+D++ LK ++Q Q QE F+ KA C Sbjct: 66 KHLDDMVDVGCLMLATMNPELQKQHEDMVAYDMIEHLKELYQGQTRQERFDISKALFQCM 125 Query: 414 QEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKT 593 EG V ++LKM GY++ + +LG+P+ Q L + L +L Y QFV N+NM+ + KT Sbjct: 126 LAEGSPVGPHVLKMIGYIESLSKLGFPLGQELATDVILQSLPDSYSQFVLNFNMNEIDKT 185 Query: 594 IPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXXX 773 +P+ +ML++ E + K P + P+ Sbjct: 186 LPQLLSMLRIAEGNMKKVGPKPMLMVRNNKGKGKAKVPK-------KPKGKGKPNLGKGK 238 Query: 774 XXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGASTLGT 917 C HCGV GHW+ NCPVYL E+KK KASG S GT Sbjct: 239 AAMKPKGGVSKEGNCFHCGVTGHWKWNCPVYLEEIKKAKASGTSASGT 286 >ref|XP_017609417.1| PREDICTED: uncharacterized protein LOC108455356 [Gossypium arboreum] Length = 289 Score = 194 bits (492), Expect = 2e-55 Identities = 106/288 (36%), Positives = 149/288 (51%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYN 233 N S+RS+LE +KLNG NFLDW+RNLRIVL+ E+KL R+ Y Sbjct: 6 NTLSLRSVLENDKLNGLNFLDWFRNLRIVLKQERKLYVNEQPLPNEPPANASRADRDVYK 65 Query: 234 KRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACK 413 K N+ +V CLMLA+M PE+QK ED++A+D++ LK ++Q QA Q+ F+ KA CK Sbjct: 66 KHLNDMVDVRCLMLATMNPELQKQHEDMVAYDMIDHLKELYQGQARQKRFDISKALFQCK 125 Query: 414 QEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKT 593 EG V ++LKM 
Y++ + +LG+P+ Q L + L +L Y QFV N+NM+ + KT Sbjct: 126 MAEGSPVGPHVLKMIRYIESLFKLGFPLSQELATDVILQSLPDSYSQFVLNFNMNEIDKT 185 Query: 594 IPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXXX 773 +P+ +ML+ E + + P + + Sbjct: 186 LPQLLSMLRTVEGNMKRVRPKPILMVRNNKG-------KGKAKVQKKPKGKGRPNSGKGK 238 Query: 774 XXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGASTLGT 917 C H GV GHW+RN PVYL E+KK KASG S GT Sbjct: 239 AALKPKSGVSKEGNCFHYGVTGHWKRNYPVYLEEIKKAKASGTSASGT 286 >ref|XP_017604411.1| PREDICTED: uncharacterized protein LOC108451194 [Gossypium arboreum] Length = 303 Score = 193 bits (491), Expect = 4e-55 Identities = 105/284 (36%), Positives = 148/284 (52%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYN 233 N S+R +LEK+KLNG NFLDW+RNLRIVL+ E+KL R+AY Sbjct: 6 NTLSLRLVLEKDKLNGLNFLDWFRNLRIVLKQERKLYFIEQLLPNEPPTNASRADRDAYK 65 Query: 234 KRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACK 413 K + +V CLMLA+M PE++K ED++A+D++ LK ++Q+QA QE F+ KA CK Sbjct: 66 KHLDNMVDVGCLMLATMNPELRKQHEDMVAYDMIEHLKELYQRQARQERFDIFKALFQCK 125 Query: 414 QEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKT 593 EG V ++LKM GY++ + +LG+ + Q L + L +L Y QFV N+NM+ + KT Sbjct: 126 LAEGSPVGPHVLKMIGYIESLSKLGFLLSQELATDVILQSLLDSYRQFVLNFNMNEIDKT 185 Query: 594 IPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXXX 773 +P+ ++L+ E + K P + + Sbjct: 186 LPQLLSILRTAEGNIKKVGPKPILIVRNNNG-------KGNAKAQTKPKGKGRPNLGKGK 238 Query: 774 XXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGAS 905 C HCGV GHWR N PVYL E+KK KASG S Sbjct: 239 AALKPKGGVSKEGNCFHCGVTGHWRWNYPVYLEEIKKAKASGMS 282 >ref|XP_022032409.1| uncharacterized protein LOC110933498 [Helianthus annuus] Length = 266 Score = 192 bits (487), Expect = 5e-55 Identities = 95/205 (46%), Positives = 136/205 (66%), Gaps = 2/205 (0%) Frame = +3 Query: 54 NNSS--IRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNA 227 NNSS ++SILEK+KLN NFL+W+RNLRIVL+ ++L + + A Sbjct: 5 NNSSFALKSILEKDKLNNTNFLEWHRNLRIVLKMAKRL-YVLETPIPTAPENNTVVAKKA 63 Query: 228 
YNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHA 407 ++K + EVACLM A+M P++QKN+ED+ AFD++ +LK MFQ+QA QE ++T+K + Sbjct: 64 FDKHKEDAMEVACLMQATMSPDLQKNMEDMNAFDMIEQLKGMFQKQARQERYDTMKQLIS 123 Query: 408 CKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMG 587 CK +EG +VS ++LKMKGY+DQ+ +LGYP+ + V L +LS Y+QFV N+NM+ Sbjct: 124 CKMQEGSSVSAHVLKMKGYIDQLNKLGYPLQDEMAVDFILNSLSSHYDQFVMNFNMNDWV 183 Query: 588 KTIPEQHAMLKLTEKGLPKKAPAVL 662 KT+PE H MLK E + K VL Sbjct: 184 KTVPELHGMLKTAEMNISNKVSQVL 208 >dbj|BAA22288.1| polyprotein [Oryza australiensis] Length = 1317 Score = 178 bits (451), Expect(2) = 2e-54 Identities = 96/289 (33%), Positives = 146/289 (50%) Frame = +3 Query: 42 NQTINNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVR 221 N T + ++RSILEKEKLNG NF+DWYRNLRIVL+ E+K R Sbjct: 4 NTTPSTFNLRSILEKEKLNGTNFMDWYRNLRIVLKQERKEYVLEVPYPEELPNNATATAR 63 Query: 222 NAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAF 401 + K N+ +++CLMLA+M PE+QK E A ++ L+ MF+ QA E F T K+ Sbjct: 64 RGFEKHTNDALDISCLMLATMSPELQKQYESSDAHTTIQGLRGMFENQARDERFNTSKSL 123 Query: 402 HACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHS 581 AC+ EG VS +++KM GY++ +E+LG+P+ Q L + L +L +E F+ NY+M++ Sbjct: 124 FACRLVEGNPVSPHVIKMIGYIESLEKLGFPLSQELATDVILQSLPPSFEPFILNYHMNN 183 Query: 582 MGKTIPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXX 761 M +T+ E H MLK E+ + K V+ Sbjct: 184 MDRTLAELHGMLKTVEESIQKNGHHVM-------MMQNAKRKPPVKKLCTKRKLTPDEIA 236 Query: 762 XXXXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGAST 908 C +C GHW+RNC Y+ +LKK +++ +++ Sbjct: 237 SASNAKKGKKGSAASDAVCFYCKETGHWKRNCKKYMEDLKKKQSTTSAS 285 Score = 64.7 bits (156), Expect(2) = 2e-54 Identities = 34/95 (35%), Positives = 58/95 (61%) Frame = +1 Query: 913 GLRGSRKLNKGA*DL*VGNGNRASAEAIGSFDLILPNGLIIVLDNCHYAPTITRGVVSLS 1092 G+R SR L +G +L VGNG + A+G+ L LP+GL++ L+NC+ PT+ + V+S S Sbjct: 318 GMRRSRGLRRGEVNLRVGNGASVATVAVGTVPLHLPSGLVLELNNCYCVPTLCQNVISAS 377 Query: 1093 RLKDNGFIHVFTDYGISVSKDNIVYFNAIPRDGIF 1197 L+ G+ + G S+ ++ YF+A +G++ 
Sbjct: 378 CLQAEGYDFRSMNNGCSIYLRDMFYFHAPLVNGLY 412 >gb|OTG21781.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 262 Score = 187 bits (476), Expect = 2e-53 Identities = 102/269 (37%), Positives = 138/269 (51%) Frame = +3 Query: 111 LDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYNKRFNEQQEVACLMLASMIP 290 +DWYRNL+IVLR+E+K + R + K ++ Q+V CLMLA MIP Sbjct: 1 MDWYRNLKIVLRAEKKA-YVLEGPIPEAPAANTGAARRTWEKHVDDSQDVTCLMLAMMIP 59 Query: 291 EIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACKQEEGQTVSTYILKMKGYLD 470 E+QKNLE+L A+++ +LK MFQQQA E FE +++ C+ +EG +VS ++LKMK ++D Sbjct: 60 ELQKNLENLGAYEMSEQLKNMFQQQARHERFEVMRSLITCRMQEGTSVSAHVLKMKSFID 119 Query: 471 QIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKTIPEQHAMLKLTEKGLPKKA 650 Q+ERL P+ L + L +L Y QFV NYNM+ +TIPE H MLK E +P K Sbjct: 120 QLERLNAPLSNELATDVILNSLPNSYHQFVMNYNMNGWERTIPELHQMLKTAETNIPTKG 179 Query: 651 PAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCHHCG 830 VLA + C HC Sbjct: 180 NPVLAIREGRITKKKQSKGKG---------KASKQDKGKGKKVATPKAKPHKDAKCFHCD 230 Query: 831 VVGHWRRNCPVYLAELKKNKASGASTLGT 917 +GHW+RNCP YLAELK K + GT Sbjct: 231 EIGHWKRNCPKYLAELKLKKLQIGESSGT 259 >gb|OTG05603.1| putative zinc finger, CCHC-type [Helianthus annuus] Length = 274 Score = 185 bits (470), Expect = 2e-52 Identities = 100/260 (38%), Positives = 135/260 (51%) Frame = +3 Query: 111 LDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYNKRFNEQQEVACLMLASMIP 290 +DWYRNL+IVLR+E+K + R + K ++ Q+V CLMLA MIP Sbjct: 1 MDWYRNLKIVLRAEKKA-YVLEGPIPEAPAANTGPARRTWEKHVDDSQDVTCLMLAMMIP 59 Query: 291 EIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACKQEEGQTVSTYILKMKGYLD 470 E+QKNLE+L A+++ +LK MFQQQA E FE +++ C+ +EG +VS ++LKMK ++D Sbjct: 60 ELQKNLENLGAYEMSEQLKNMFQQQARHERFEVMRSLITCRMQEGTSVSAHVLKMKSFVD 119 Query: 471 QIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKTIPEQHAMLKLTEKGLPKKA 650 Q+ERL P+ L + L +L Y QFV NYNM+ +TIPE H MLK E +P K Sbjct: 120 QLERLNAPLSNELATDVILNSLPNSYHQFVMNYNMNGWERTIPELHQMLKTAETNIPTKG 179 Query: 651 PAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCHHCG 830 VLA + C HC Sbjct: 180 
NPVLAIREGRITKKKQSKGKG---------KASKQDKGKGKKVATPKAKPHKDAKCFHCD 230 Query: 831 VVGHWRRNCPVYLAELKKNK 890 +GHW+RNCP YLAELK K Sbjct: 231 EIGHWKRNCPKYLAELKLKK 250 >gb|PNX82340.1| retrotransposon protein putative Ty1-copia subclass, partial [Trifolium pratense] Length = 437 Score = 184 bits (467), Expect = 4e-50 Identities = 104/288 (36%), Positives = 151/288 (52%), Gaps = 1/288 (0%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXV-RNAY 230 +N+ +RSILEKEKL+GNNFLDW+RNLRIVL++++KL ++AY Sbjct: 6 SNNILRSILEKEKLSGNNFLDWHRNLRIVLKNKKKLYVLEEPVPEEAPASSATRAEKDAY 65 Query: 231 NKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHAC 410 K ++ EV+C+MLA+M E++K E++ AFD++ LK ++Q+QA E FE K C Sbjct: 66 KKHVDDALEVSCIMLATMNSELKKQHENMNAFDMIEHLKMLYQEQARHERFEVSKTLFQC 125 Query: 411 KQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGK 590 + EG V ++LKM GY++ +ERLG+ + Q L + L L +L + Y QFV N+ M+ M K Sbjct: 126 RLAEGSPVGPHVLKMIGYVENLERLGFALEQDLAIDLILQSLPESYNQFVMNFIMNDMDK 185 Query: 591 TIPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXXXXXXXXXX 770 T+P+ AML+ EK + K + P Sbjct: 186 TLPQLSAMLRTAEKNINKGKGKAIMLVNDGKFKKQNKKPN---KWIGKGNGKEVAKPKPV 242 Query: 771 XXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGASTLG 914 C HCG GHW+RNCP YL E KKN +++ G Sbjct: 243 THALKPKGGIAKEGNCFHCGRTGHWKRNCPKYL-EDKKNGIESSNSAG 289 >ref|XP_013639199.1| PREDICTED: uncharacterized protein LOC106344346 [Brassica oleracea var. 
oleracea] Length = 290 Score = 176 bits (447), Expect = 9e-49 Identities = 104/294 (35%), Positives = 149/294 (50%), Gaps = 1/294 (0%) Frame = +3 Query: 30 MTTPNQTINNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXX 209 M+TP+ + S+R +LEKEKLNG+NFL+WYRNLRIVL+ E+K Sbjct: 1 MSTPHSSF---SLRPVLEKEKLNGSNFLEWYRNLRIVLKQEKKDYVLEKVLPEKPKTNVQ 57 Query: 210 XXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIA-FDILRELKTMFQQQAEQELFE 386 RNAY+ +++ +V CL+LA+M ++QK E++ + DI+ LK MFQ+QA E ++ Sbjct: 58 HAERNAYDTHISDRVDVCCLILATMNSDLQKQYENVDSPIDIITSLKGMFQEQARTERYQ 117 Query: 387 TVKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQN 566 TVK+ CK VS +++KM GY+D + +L P+ Q + L L +LS Y+QFV N Sbjct: 118 TVKSLIECKLPIDGPVSPHVIKMMGYIDNLAKLDCPISQEMATDLILQSLSSSYDQFVMN 177 Query: 567 YNMHSMGKTIPEQHAMLKLTEKGLPKKAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXXXX 746 YNM+++ KT+ E H MLK E + K P VL + Sbjct: 178 YNMNNLTKTLTELHGMLKTVEPNIKKDTPNVLMVQGGNKF-------KKQGKNKGKGKSG 230 Query: 747 XXXXXXXXXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGAST 908 H+C GHW+RNC YL LK K S S+ Sbjct: 231 WIANPSKLNPSPKFKPGPSNVDKFHYCNGSGHWKRNCEKYLENLKNKKISETSS 284 >ref|XP_022008408.1| uncharacterized protein LOC110907786 [Helianthus annuus] Length = 221 Score = 172 bits (436), Expect = 5e-48 Identities = 90/208 (43%), Positives = 130/208 (62%), Gaps = 3/208 (1%) Frame = +3 Query: 48 TINNSS---IRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXV 218 T NNSS ++SILEK+KLN +NF+DW+RNL+IVL++E KL Sbjct: 3 TENNSSQFSLKSILEKDKLNHSNFMDWHRNLKIVLKAESKLYVLETPVPDEPINQQTVAY 62 Query: 219 RNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKA 398 RN Y K F++ +V+CLMLASMIPE+Q+ ++D +A+++ L MFQQ+A E F+ +++ Sbjct: 63 RN-YAKHFDDSMKVSCLMLASMIPELQRTMDDKMAYEMNEHLVEMFQQKARHERFDVMRS 121 Query: 399 FHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMH 578 + +EG +VS ++LKMK Y+D + RL P+ L + L +L K Y+QF NYNM+ Sbjct: 122 LITTRMQEGTSVSAHVLKMKAYVDHLARLESPLSDELAGDIILNSLPKSYDQFTMNYNMN 181 Query: 579 SMGKTIPEQHAMLKLTEKGLPKKAPAVL 662 M KT+ E H MLK E +P K VL Sbjct: 182 
GMEKTLAELHQMLKTAEVNIPSKTAPVL 209 >ref|XP_012477568.1| PREDICTED: uncharacterized protein LOC105793188 [Gossypium raimondii] Length = 203 Score = 169 bits (427), Expect = 6e-47 Identities = 84/197 (42%), Positives = 124/197 (62%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYN 233 N S+RS+LEK+KLNG NFLDW+RNLRIVL+ E+KL R+AY Sbjct: 6 NTLSLRSVLEKDKLNGLNFLDWFRNLRIVLKQERKLYVIEQPVPNEPPANASRADRDAYK 65 Query: 234 KRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACK 413 K ++ +V CLMLA+M PE+QK ED++A++++ LK ++Q QA QE F+ KA CK Sbjct: 66 KHLDDMVDVGCLMLATMNPELQKQHEDMVAYEMIEHLKELYQGQAWQERFDISKALFQCK 125 Query: 414 QEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKT 593 EG V ++ KM GY++ + +LG+ + Q L ++ L +L Y QFV N+NM+ + KT Sbjct: 126 LAEGSPVGPHVHKMIGYIESLSKLGFQLSQELATNVILQSLPDSYSQFVLNFNMNEIDKT 185 Query: 594 IPEQHAMLKLTEKGLPK 644 +P+ +ML+ TE + K Sbjct: 186 LPQLLSMLRTTEGNMKK 202 >gb|PNY09707.1| nuclear matrix protein, partial [Trifolium pratense] Length = 511 Score = 177 bits (448), Expect = 1e-46 Identities = 102/295 (34%), Positives = 148/295 (50%), Gaps = 1/295 (0%) Frame = +3 Query: 24 SKMTTPNQTINNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXX 203 +K+T T N S++S+LE +KL+ NF +WY NLRIVL+ E+KL Sbjct: 232 TKITMATNTSGNLSLQSLLENDKLSITNFPEWYSNLRIVLKHEKKLYVLEQKLPELPAAT 291 Query: 204 XXXXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELF 383 ++AY K F++ ++ CLMLA+M E+QK E++ A+D++ LK ++++QA E Sbjct: 292 APKADKDAYKKHFDDAHDIGCLMLATMNSELQKQHENMEAYDMIMHLKMLYKEQARYERI 351 Query: 384 ETVKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQ 563 E +A CK EG V ++L+M GY++ +ERLGYP+ VL L L +L Y Q V Sbjct: 352 EVSRALFRCKLAEGNPVGPHVLQMIGYIENLERLGYPLGLVLATDLILQSLPDSYSQLVM 411 Query: 564 NYNMHSMGKTIPEQHAMLKLTEKGLPK-KAPAVLAXXXXXXXXXXXXXPQAXXXXXXXXX 740 NY M+++ KT+PE +ML+ E+ L K K +L Sbjct: 412 NYIMNNIYKTLPELASMLRTAEQILKKGKGKTIL-----------------MVQNGNEHK 454 Query: 741 XXXXXXXXXXXXXXXXXXXXXXXXXCHHCGVVGHWRRNCPVYLAELKKNKASGAS 905 C HCG +GHW+RNC YL E KK 
K S AS Sbjct: 455 DMGKKKFKPSTSSLKPVGGVTKKGTCFHCGQIGHWKRNCTRYLEECKKRKVSDAS 509 >ref|XP_022004366.1| uncharacterized protein LOC110901916 [Helianthus annuus] Length = 550 Score = 177 bits (450), Expect = 1e-46 Identities = 89/218 (40%), Positives = 134/218 (61%), Gaps = 3/218 (1%) Frame = +3 Query: 18 HASKMTTPNQTINNSS---IRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXX 188 HA K + N+S ++SILEK+KLN NFL W+ NLRIVL+ ++L + Sbjct: 286 HAQKKQQQQMSSENNSSFALKSILEKDKLNNTNFLKWHHNLRIVLKMAKRL-YVLETPIP 344 Query: 189 XXXXXXXXXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQA 368 + A++K + EVACLM A+M P++QKN+ED+ +FD++ +LK MF +QA Sbjct: 345 TAPENNTVVAKKAFDKHKEDAMEVACLMQATMSPDLQKNMEDMNSFDMIEQLKGMFHKQA 404 Query: 369 EQELFETVKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDY 548 QE ++T+K +C+ ++G +VS ++LKMKGY+ Q+ +LGYP+ + V +LS Y Sbjct: 405 RQERYDTMKQLISCRMQDGSSVSAHVLKMKGYIYQLNKLGYPLQDEMAVDFIRNSLSSHY 464 Query: 549 EQFVQNYNMHSMGKTIPEQHAMLKLTEKGLPKKAPAVL 662 +QFV N+NM+ KT+PE H MLK E + K VL Sbjct: 465 DQFVMNFNMNDWVKTMPELHGMLKTAEMNISNKVSQVL 502 >ref|XP_021975761.1| uncharacterized protein LOC110871136 [Helianthus annuus] Length = 242 Score = 168 bits (425), Expect = 4e-46 Identities = 90/208 (43%), Positives = 129/208 (62%), Gaps = 3/208 (1%) Frame = +3 Query: 48 TINNSS---IRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXV 218 T NNSS ++SILEK+KLN +NF+DW+RNL+IVL++E KL Sbjct: 3 TENNSSQFSLKSILEKDKLNHSNFMDWHRNLKIVLKAENKLYVLETPVPDEPINQQTVAY 62 Query: 219 RNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKA 398 RN Y K F++ +V+CLMLASMI E+Q+ ++D +A+++ L MFQQ+A E F+ +++ Sbjct: 63 RN-YAKHFDDSLKVSCLMLASMIVELQRTMDDKMAYEMNEHLVEMFQQKARHERFDVMRS 121 Query: 399 FHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMH 578 +EG +VST++LKMK Y+D + RL P+ L + L +L K Y+QF NYNM+ Sbjct: 122 LITTCMQEGTSVSTHVLKMKAYVDHLARLESPLSDELAGDIILNSLPKSYDQFTMNYNMN 181 Query: 579 SMGKTIPEQHAMLKLTEKGLPKKAPAVL 662 M KT+ E H MLK E +P K VL Sbjct: 182 GMEKTLAELHQMLKTAEVNIPSKTAPVL 209 >ref|XP_017613650.1| PREDICTED: uncharacterized 
protein LOC108458763 [Gossypium arboreum] Length = 187 Score = 165 bits (417), Expect = 1e-45 Identities = 81/182 (44%), Positives = 114/182 (62%) Frame = +3 Query: 54 NNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYN 233 N S+RS+LEK+KLNG NFLDW+RNLRIVL+ E+KL ++AY Sbjct: 6 NTLSLRSVLEKDKLNGLNFLDWFRNLRIVLKQERKLYVIEKPLPDEPPANASRADKDAYK 65 Query: 234 KRFNEQQEVACLMLASMIPEIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACK 413 K +E V CLMLA+M PE QK ED++A+D++ LK ++Q QA QE F+ KA CK Sbjct: 66 KHLDEMVNVGCLMLATMNPEFQKQHEDMVAYDMIEHLKELYQGQARQERFDISKALFQCK 125 Query: 414 QEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKT 593 +G V ++LKM GY++ + LG+P+ Q L ++ L +L Y QFV N+NM+ + KT Sbjct: 126 LAKGSPVGPHVLKMIGYIESLSELGFPLSQELATNVILQSLPDSYSQFVLNFNMNEIDKT 185 Query: 594 IP 599 +P Sbjct: 186 LP 187 >gb|OTG36157.1| hypothetical protein HannXRQ_Chr01g0004521 [Helianthus annuus] Length = 226 Score = 166 bits (420), Expect = 1e-45 Identities = 83/185 (44%), Positives = 115/185 (62%) Frame = +3 Query: 111 LDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXXXXVRNAYNKRFNEQQEVACLMLASMIP 290 +DWYRNL+IVLR+E+K + R + K ++ Q+V CLMLA MIP Sbjct: 1 MDWYRNLKIVLRAEKKA-YVLEGPIPEAPAANTGAARRTWEKHVDDSQDVTCLMLAMMIP 59 Query: 291 EIQKNLEDLIAFDILRELKTMFQQQAEQELFETVKAFHACKQEEGQTVSTYILKMKGYLD 470 E+QKNLE+L A+++ +LK MFQQQA E FE +++ C+ +EG +VS ++LKMK ++D Sbjct: 60 ELQKNLENLGAYEMSEQLKNMFQQQARHERFEVMRSLITCRMQEGTSVSAHVLKMKSFVD 119 Query: 471 QIERLGYPMPQVLGVSLNLTTLSKDYEQFVQNYNMHSMGKTIPEQHAMLKLTEKGLPKKA 650 Q+ERL P+ L + L +L Y QFV NYNM+ +TIPE H MLK E +P K Sbjct: 120 QLERLNAPLSNELATDVILNSLPNSYHQFVMNYNMNGWERTIPELHQMLKTAETNIPTKG 179 Query: 651 PAVLA 665 VLA Sbjct: 180 NPVLA 184 >ref|XP_013700463.1| uncharacterized protein LOC106404280 [Brassica napus] Length = 262 Score = 166 bits (421), Expect = 3e-45 Identities = 90/212 (42%), Positives = 131/212 (61%), Gaps = 1/212 (0%) Frame = +3 Query: 30 MTTPNQTINNSSIRSILEKEKLNGNNFLDWYRNLRIVLRSEQKLNHXXXXXXXXXXXXXX 209 M+TP+ N S+RS+LEK+KLNG+NFL+W+RNLRIVL+ E+K Sbjct: 1 
MSTPH---NLFSLRSVLEKDKLNGSNFLEWHRNLRIVLKQEKKDYVLENVLPEKSKTNVQ 57 Query: 210 XXVRNAYNKRFNEQQEVACLMLASMIPEIQKNLEDLIA-FDILRELKTMFQQQAEQELFE 386 RNAY+K +++ +V CLMLA+M ++QK E++ + D++ LK MFQ+QA E ++ Sbjct: 58 HAERNAYDKHVSDRVDVCCLMLATMNSDLQKQYENVDSPIDMITSLKGMFQEQARTERYQ 117 Query: 387 TVKAFHACKQEEGQTVSTYILKMKGYLDQIERLGYPMPQVLGVSLNLTTLSKDYEQFVQN 566 TVK+ CK VS +++KM GY+D + +L P+ Q L L L +L Y+QFV N Sbjct: 118 TVKSLIECKLPIDGPVSPHVIKMMGYIDNLAKLDCPISQELATDLILQSLPSSYDQFVMN 177 Query: 567 YNMHSMGKTIPEQHAMLKLTEKGLPKKAPAVL 662 YNM+++ KT+ E H MLK E + K P VL Sbjct: 178 YNMNNLTKTLTELHGMLKTAEPNMKKDTPNVL 209