BLASTX nr result
ID: Chrysanthemum22_contig00020938
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum22_contig00020938 (1554 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU13030.1| hypothetical protein TSUD_173300 [Trifolium subt... 412 e-137 ref|XP_015576947.1| PREDICTED: uncharacterized protein LOC107261... 402 e-133 gb|PNY06260.1| hypothetical protein L195_g002723 [Trifolium prat... 399 e-127 gb|PNX97498.1| retrotransposon-realted protein, partial [Trifoli... 399 e-125 gb|KYP47345.1| Retrovirus-related Pol polyprotein from transposo... 362 e-118 gb|PNX93619.1| retrovirus-related Pol polyprotein from transposo... 349 e-112 ref|XP_022018581.1| uncharacterized protein LOC110918596 [Helian... 334 e-107 ref|XP_022027294.1| uncharacterized protein LOC110928582 [Helian... 333 e-106 ref|XP_022025760.1| uncharacterized protein LOC110926319 [Helian... 331 e-106 ref|XP_021978925.1| uncharacterized protein LOC110874848 [Helian... 330 e-105 ref|XP_022002369.1| uncharacterized protein LOC110899785 [Helian... 334 e-105 ref|XP_021991106.1| uncharacterized protein LOC110887848 [Helian... 333 e-105 ref|XP_021974337.1| uncharacterized protein LOC110869343 [Helian... 332 e-104 dbj|GAU35084.1| hypothetical protein TSUD_70070 [Trifolium subte... 343 e-104 ref|XP_022041201.1| uncharacterized protein LOC110943776 [Helian... 331 e-104 ref|XP_022039984.1| uncharacterized protein LOC110942516 [Helian... 328 e-103 ref|XP_014489620.1| uncharacterized protein LOC106752450 [Vigna ... 313 3e-99 ref|XP_021974932.1| uncharacterized protein LOC110870042 [Helian... 310 6e-97 dbj|GAU39490.1| hypothetical protein TSUD_279110 [Trifolium subt... 310 8e-97 gb|PNY04738.1| retrovirus-related Pol polyprotein from transposo... 303 1e-94 >dbj|GAU13030.1| hypothetical protein TSUD_173300 [Trifolium subterraneum] Length = 411 Score = 412 bits (1058), Expect = e-137 Identities = 231/440 (52%), Positives = 272/440 (61%), Gaps = 30/440 (6%) Frame = +3 Query: 81 LSQMADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAAS 260 ++ D +HPAL VTNIKN I ITLEMEKS Y SWAELFKIHCRAY+VIDHIIP P + Sbjct: 1 MANKTDKPFHPALAVTNIKNFISITLEMEKSHYPSWAELFKIHCRAYEVIDHIIPPPVSK 60 Query: 261 ----------SSTNKDKETSDPIT--PETWSRLDAIVLQWIYTTISNDLLHTILAPDSTA 404 +++ K+KE P+ P+ WSRLDAIVLQWIY TIS DLLHTIL P+STA Sbjct: 61 PNDTAVETSKTASEKEKEKEQPVESDPKLWSRLDAIVLQWIYGTISTDLLHTILKPNSTA 120 Query: 405 RQAWDRLQSIFQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSN 584 +QAW+RL++IFQDNK+SRA+YLE QF NL +D F N+SAYCQELK+LADQL++VGSPVS Sbjct: 121 QQAWERLETIFQDNKNSRAVYLENQFANLSMDEFPNISAYCQELKMLADQLASVGSPVSE 180 Query: 585 DRLVLQLVAGLNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXX 764 R VLQL+AGLNE+YD VA+ +Q +DPLP FYEARS+LILEETRK KQ Sbjct: 181 HRQVLQLIAGLNENYDGVATLIQQSDPLPLFYEARSKLILEETRKTKQAAI--------- 231 Query: 765 XXXXXXXYSKANHAGGSNNSRNTTGSGYFH---XXXXXXXXXXXXXXXXXXXXXXXXXXX 935 AN AG + + NT S Y + Sbjct: 232 ---------AANAAGTTLLTTNTDASSYGNNSISANRNNNGQKRNQNRNNNRGKNGGNRG 282 Query: 936 XXXXXXSHYPGP----------QQQQP--WAPYGPWA-WQQQSWTPPPCPYPTSNWVR-- 1070 +H GP QQQ+P WAPY PW W QQ W PPCPYPTS W R Sbjct: 283 RGRGGNNHGRGPQYGGQNRLWQQQQKPYQWAPYPPWEYWGQQPWAAPPCPYPTSPWTRPP 342 Query: 1071 PTTQPGQSGILGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTG 1250 PT QPG P QAY+ AP YAPTDIDTAM ++LN PDENWY+DTG Sbjct: 343 PTRQPGNPSNRSP---QAYS-AQAP-------NYAPTDIDTAMQNLSLNVPDENWYMDTG 391 Query: 1251 ATSHMTTSPGIPEGDTTHEM 1310 TSHMT+S G P+ D T+EM Sbjct: 392 TTSHMTSSQGYPDRDATNEM 411 >ref|XP_015576947.1| PREDICTED: uncharacterized protein LOC107261521 [Ricinus communis] Length = 421 Score = 402 bits (1032), Expect = e-133 Identities = 219/398 (55%), Positives = 250/398 (62%), Gaps = 2/398 (0%) Frame = +3 Query: 90 MADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSST 269 M + K+HPAL V N+KN I ITLEMEK SSWAELFKIHCRAYQVIDHIIP +S Sbjct: 1 MTETKFHPALAVNNVKNFISITLEMEKGHSSSWAELFKIHCRAYQVIDHIIPPTQRTSEP 60 Query: 270 NKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNK 449 K+K + E WS LDAIVLQWIY TISNDLLHTIL PDSTA+QAW+RL +I QDNK Sbjct: 61 AKEKAVARD--DELWSHLDAIVLQWIYGTISNDLLHTILEPDSTAQQAWERLHNILQDNK 118 Query: 450 HSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESY 629 +SRA+YLE QFT++ LD F VSAYCQELK+L DQLSNVG+PVSN RLVLQL+ GLNE+Y Sbjct: 119 NSRAVYLENQFTHVHLDDFQTVSAYCQELKMLVDQLSNVGAPVSNKRLVLQLITGLNENY 178 Query: 630 DNVASHLQHTDPLPQFYEARSRLILEETRKQKQ-XXXXXXXXXXXXXXXXXXXYSKANHA 806 D VA +Q ++PLP FYEARSRLILEETRK KQ SK +++ Sbjct: 179 DGVAIFIQQSNPLPLFYEARSRLILEETRKAKQTAASASAAGIALFIANNSTNESKGSYS 238 Query: 807 GGSNN-SRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQ 983 G N +R+ T S QQ Q Sbjct: 239 GSVNALNRHNTTSKRGQNRNNNRGKDNGNRGRGRSGNNHANGGSNHTQQSQQRTWQQQPQ 298 Query: 984 PWAPYGPWAWQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQAYTMTYAPTVATST 1163 W Y W W Q W PPCPYPTS+W RP Q ILG RPQQA YA +V TS+ Sbjct: 299 QWGQYPLWGWAPQPWADPPCPYPTSSWTRP-PPARQPSILGNRPQQA----YAASVPTSS 353 Query: 1164 MGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSP 1277 Y PT+I+ AMHTMTLN PD NWY+DTGATSHMT SP Sbjct: 354 -SYDPTEIEQAMHTMTLNTPDTNWYMDTGATSHMTASP 390 >gb|PNY06260.1| hypothetical protein L195_g002723 [Trifolium pratense] Length = 805 Score = 399 bits (1024), Expect = e-127 Identities = 233/466 (50%), Positives = 275/466 (59%), Gaps = 38/466 (8%) Frame = +3 Query: 81 LSQMADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAAS 260 ++ D +HPAL VTNIKN I ITLEMEKS Y SWAELFKIHCRAY+VIDHIIP P + Sbjct: 1 MANKTDKPFHPALAVTNIKNFISITLEMEKSHYPSWAELFKIHCRAYEVIDHIIPPPVSK 60 Query: 261 ----------SSTNKDKETSDPIT--PETWSRLDAIVLQWIYTTISNDLLHTILAPDSTA 404 +++ K+KE P+ P+ WSRLDAIVLQWIY TIS DLLHTIL P+STA Sbjct: 61 PNDTAVETSKTASEKEKEKEQPVESDPKLWSRLDAIVLQWIYGTISTDLLHTILEPNSTA 120 Query: 405 RQAWDRLQSIFQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSN 584 +QAW+RL++IFQDNK+SRA+YLE QF N+ ++ F N+SAYCQE+K+LADQL++VGSPVS Sbjct: 121 QQAWERLENIFQDNKNSRAVYLENQFANVNMNDFPNISAYCQEVKMLADQLASVGSPVSE 180 Query: 585 DRLVLQLVAGLNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXX 764 RLVLQL+AGLNE+YD VA+ +Q D LP FYEARS+LILEETRK KQ Sbjct: 181 HRLVLQLIAGLNENYDGVATLIQQHDSLPHFYEARSKLILEETRKTKQAAI--------- 231 Query: 765 XXXXXXXYSKANHAGGSNNSRNTTGSGYFH---XXXXXXXXXXXXXXXXXXXXXXXXXXX 935 AN AG + + NT S Y + Sbjct: 232 ---------AANAAGTTLLTTNTDASSYGNNSISANRNNNGQKRNQNRNNNRGKNGGNRG 282 Query: 936 XXXXXXSHYPGP----------QQQQP--WAPYGPWA-WQQQSWTPPPCPYPTSNWVR-- 1070 +H GP QQQ+P WAPY PW W QQ W PPCPYPTS W R Sbjct: 283 RGRGGNNHGRGPQYGGQNRLWQQQQKPYQWAPYPPWEYWGQQPWAAPPCPYPTSPWTRPP 342 Query: 1071 PTTQPGQSGILGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTG 1250 PT QPG P QAY+ AP YAPTDIDTAM ++LN PDENWY+DTG Sbjct: 343 PTRQPGNPSNRSP---QAYS-AQAP-------NYAPTDIDTAMQNLSLNVPDENWYMDTG 391 Query: 1251 ATSHMTTSPGIPEG----DTTHEMQ*FGGSLSSIHHYA----SPPH 1364 ATSHMT+S G T + G IH Y SPPH Sbjct: 392 ATSHMTSSQGTLSPYFNMSTNKNIVVGSGQEIPIHGYGQACLSPPH 437 Score = 80.9 bits (198), Expect = 3e-12 Identities = 35/57 (61%), Positives = 44/57 (77%) Frame = +1 Query: 1273 LQEFPKGTPLMRCNSSGDLYPLSTTMLRHLTSPSTFAALSQDLWHHRLGHPGARILN 1443 +++ G PLMRCNS+GDLYP++ T TSPSTFAA+S LWH+RLGHPGA +LN Sbjct: 478 VKDIQTGMPLMRCNSTGDLYPITKTTRHSSTSPSTFAAISSVLWHNRLGHPGAHVLN 534 >gb|PNX97498.1| retrotransposon-realted protein, partial [Trifolium pratense] Length = 978 Score = 399 bits (1024), Expect = e-125 Identities = 226/447 (50%), Positives = 269/447 (60%), Gaps = 19/447 (4%) Frame = +3 Query: 81 LSQMADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIP-KPAA 257 ++ +D +HPAL VTN+KN I ITLEMEKS Y SWAELFKIHCRAY+VIDHIIP +PA Sbjct: 1 MANKSDKPFHPALAVTNVKNFISITLEMEKSHYPSWAELFKIHCRAYEVIDHIIPPEPAV 60 Query: 258 SSS---TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQ 428 +S + K+KE ++ P+ WSRLDAIVLQWIY TIS DLLHTIL P+STA+QAW+RL+ Sbjct: 61 ETSKTASEKEKEVAES-DPKLWSRLDAIVLQWIYGTISTDLLHTILEPNSTAQQAWERLE 119 Query: 429 SIFQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLV 608 +IFQDNK+SRA+YLE QF N+ ++ F N+SAYCQE+K+LADQL++VGSPVS RLVLQL+ Sbjct: 120 NIFQDNKNSRAVYLENQFANVNMNDFPNISAYCQEVKMLADQLASVGSPVSEHRLVLQLI 179 Query: 609 AGLNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXY 788 AGLNE+YD VA+ +Q D LP FYEARS+LILEETRK KQ Sbjct: 180 AGLNENYDGVATLIQQHDSLPHFYEARSKLILEETRKTKQAAIAANAAGTALLTTNTDAS 239 Query: 789 SKANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPG 968 S N++ +N+S Sbjct: 240 SYGNNSISANHSNTGQKRNQNRNSNRGKNGGNRGRGRGGNNHGRGSQHGGQHRPWQQQQN 299 Query: 969 PQQQQ----PWAPYGPWA-WQQQSWTPPPCPYPTSNWVR--PTTQPGQSGILGPRPQQAY 1127 P QQQ PWA Y PW W Q W PPCPYPTS W R PT QPG RP QAY Sbjct: 300 PWQQQQKPYPWASYPPWEYWGQHPWAAPPCPYPTSPWARPPPTRQPGNP---SNRPPQAY 356 Query: 1128 TMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPGIPEG----D 1295 + AP YAPTDIDTAM M+LN PD+NWY+DTGATSHMT+S G Sbjct: 357 S-AQAP-------NYAPTDIDTAMQNMSLNVPDDNWYMDTGATSHMTSSHGTLSSYFNMS 408 Query: 1296 TTHEMQ*FGGSLSSIHHYA----SPPH 1364 T + G IH Y SPPH Sbjct: 409 TNKNIVVGSGQEIPIHGYGQTCLSPPH 435 Score = 79.0 bits (193), Expect = 1e-11 Identities = 35/57 (61%), Positives = 43/57 (75%) Frame = +1 Query: 1273 LQEFPKGTPLMRCNSSGDLYPLSTTMLRHLTSPSTFAALSQDLWHHRLGHPGARILN 1443 +++ G PLMRCNS+GDLYP++ T TSPSTFAA+S LWH RLGHPGA +LN Sbjct: 476 VKDIQTGMPLMRCNSTGDLYPITKTTGHPSTSPSTFAAISSVLWHDRLGHPGAHVLN 532 >gb|KYP47345.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 377 Score = 362 bits (930), Expect = e-118 Identities = 202/404 (50%), Positives = 239/404 (59%), Gaps = 9/404 (2%) Frame = +3 Query: 90 MADN-KYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS 266 MAD K HPAL V N+KN IPITLEME YSSW ELFKIHCRAYQVIDHI+ P+ S+ Sbjct: 1 MADQTKIHPALVVNNVKNFIPITLEMENGHYSSWVELFKIHCRAYQVIDHILQPPSKSTD 60 Query: 267 TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDN 446 T +K P WSRLDAIVLQWIY TISNDLLHTIL PDST +QAW++LQ+IF DN Sbjct: 61 TATEKVAEGD--PALWSRLDAIVLQWIYGTISNDLLHTILQPDSTTQQAWEQLQNIFLDN 118 Query: 447 KHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNES 626 K+SRA+YLE QFT++ +D + N+ AYCQELK+LADQLSNV GLNE+ Sbjct: 119 KNSRAVYLENQFTHVHMDDYPNILAYCQELKMLADQLSNVN--------------GLNEN 164 Query: 627 YDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANHA 806 YD VA+ +Q T+PLP FYEARSRLILEETRK KQ ++ Sbjct: 165 YDGVATFIQQTNPLPPFYEARSRLILEETRKAKQVAFAATNAGTALLTTNHSIDDASSVG 224 Query: 807 GGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQP 986 G N+S + + F+ + GP QQ+P Sbjct: 225 RGPNSSPHPS----FNHGNNGSKRGQNCGRGRNGGGRGRGRNNNHHNNGQQHGGPSQQRP 280 Query: 987 -------WAPYGPWA-WQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQAYTMTYA 1142 W Y PW W QQSW PPCPYPTS+W P Q GILG RPQQA+ Sbjct: 281 WQHQQSQWISYPPWGHWGQQSWAAPPCPYPTSSWSSPPL-ARQPGILGNRPQQAFNAHGP 339 Query: 1143 PTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTS 1274 P+ T PTDI+ AM T++LN PDENWY+DT ATSHM+ S Sbjct: 340 PSQHT------PTDINAAMQTLSLNVPDENWYMDTSATSHMSAS 377 >gb|PNX93619.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 438 Score = 349 bits (895), Expect = e-112 Identities = 188/401 (46%), Positives = 248/401 (61%), Gaps = 2/401 (0%) Frame = +3 Query: 84 SQMADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASS 263 S+ ++ HPALTVTNI N I ITL++EKSQY++W+ELFKIH +AY+V+DHIIP P ++ Sbjct: 21 SKTINHTIHPALTVTNITNFIKITLDIEKSQYNTWSELFKIHAQAYEVLDHIIP-PTETN 79 Query: 264 STNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQD 443 +T+ + P W RLDAIVLQWIY TIS DLLHTI+ DSTA+QAW+RL IF D Sbjct: 80 TTSSSPSLKET-DPTLWKRLDAIVLQWIYGTISTDLLHTIIERDSTAQQAWNRLFDIFFD 138 Query: 444 NKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNE 623 NK+SRALYLEQ+F+ + ++ FS+ S+YCQ +K L+DQL NVG+PVS +R+VLQLV+GL++ Sbjct: 139 NKNSRALYLEQEFSRVNMEQFSDASSYCQHIKTLSDQLYNVGAPVSEERMVLQLVSGLSD 198 Query: 624 SYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANH 803 +Y NV + ++H+ LP+FY+ARS ++LEET K+ + N Sbjct: 199 AYANVGTQIRHSATLPRFYKARSMVVLEETALAKRTTPNTDNSAFLTSQENNSNFPNNNR 258 Query: 804 AG-GSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQ 980 G G+NN+ GS H + PQQQ Sbjct: 259 GGRGNNNNSGGRGSRGGHHNNRGGRNRGRGGGRGGRSHHNQQGAW-------NQQTPQQQ 311 Query: 981 QPWAPYGPWAWQQQSWTPPPCPYPTS-NWVRPTTQPGQSGILGPRPQQAYTMTYAPTVAT 1157 WA Y PW Q Q W PPCPYPT+ NW + Q GILG +PQQA+ VA+ Sbjct: 312 --WA-YPPWVTQWQPWATPPCPYPTTGNWQQSFAPNRQPGILGQKPQQAH-------VAS 361 Query: 1158 STMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPG 1280 + Y PTDI AMHT+++ PPDE WY+DTGATSHMT + G Sbjct: 362 AQPSYTPTDIQAAMHTLSMAPPDEQWYMDTGATSHMTANKG 402 >ref|XP_022018581.1| uncharacterized protein LOC110918596 [Helianthus annuus] Length = 400 Score = 334 bits (857), Expect = e-107 Identities = 189/420 (45%), Positives = 237/420 (56%), Gaps = 14/420 (3%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSSTNK 275 ++K HPA TV++IKN IP+TLE+E SQY+SWA LFK+HC+A+ V DH+ PKPAAS +T + Sbjct: 12 ESKLHPATTVSHIKNYIPVTLEIESSQYNSWATLFKLHCKAFLVFDHLSPKPAASETTTE 71 Query: 276 DKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNKHS 455 ++ T E W RLDAIVLQWIY+TISNDLLHTI+ +TA AW ++ +FQDNK S Sbjct: 72 TSSSTTKPTAE-WERLDAIVLQWIYSTISNDLLHTIINKTATAHDAWKAIEDLFQDNKSS 130 Query: 456 RALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESYDN 635 RA++L Q+F+N RLD F N+SAYCQELKVLADQL+NV +PV NDRLVLQL+AGLNE Y+ Sbjct: 131 RAIHLMQKFSNTRLDGFPNISAYCQELKVLADQLANVNAPVDNDRLVLQLIAGLNEQYEG 190 Query: 636 VASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANHAGGS 815 +A+ LQ DPLP FY ARS LI ETRK +Q + N + + Sbjct: 191 IATILQQQDPLPSFYSARSSLIQVETRKAEQALNASKSASTALT-------ASTNRSPAA 243 Query: 816 NNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQPWAP 995 + SRN + H Q PW P Sbjct: 244 DQSRNHDRNYGTHDRGYGSGRGRSGRRGRGRSSSARGRF-------------SSQYPWQP 290 Query: 996 --YGPW--AWQQ---------QSWTPPPCPYPTSNWVRPT-TQPGQSGILGPRPQQAYTM 1133 Y PW W W PPCPYP++N RP T GILGPRP Sbjct: 291 PSYYPWMNTWPTPPSNNQTPFYPWPVPPCPYPSTN--RPNHTHTNSPGILGPRPNH---- 344 Query: 1134 TYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPGIPEGDTTHEMQ 1313 S + Y PTDI+ AM+TM+L PD Y+DTGAT +MT G+ + D MQ Sbjct: 345 -------QSNVAYTPTDIEQAMYTMSLQQPDPTQYMDTGATGNMTHEQGLQDSDPAPAMQ 397 >ref|XP_022027294.1| uncharacterized protein LOC110928582 [Helianthus annuus] gb|OTG30192.1| putative retrotransposon gag protein [Helianthus annuus] Length = 394 Score = 333 bits (854), Expect = e-106 Identities = 183/404 (45%), Positives = 244/404 (60%), Gaps = 17/404 (4%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPK----PAASS 263 D K HPA+TV+NIKN IPITLEME SQY+SWAELFKIHCRA+QV++H+ P+ PAA++ Sbjct: 2 DPKNHPAITVSNIKNFIPITLEMETSQYASWAELFKIHCRAFQVLEHLSPRQVDPPAAAA 61 Query: 264 -------STNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDR 422 S++ DK+ S + W+RLDAIVLQWIY TIS+DLL TI+ PDSTA AW Sbjct: 62 KDADKEKSSDSDKQKSSAKDDDIWNRLDAIVLQWIYGTISHDLLSTIIRPDSTASYAWTA 121 Query: 423 LQSIFQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQ 602 L+SIF DN+ +RA+ L+++F N +LDSF +++AYCQE+K+L+DQL+NVGSPV+ LV+Q Sbjct: 122 LKSIFHDNQTTRAVLLQKKFANTKLDSFPSMTAYCQEVKMLSDQLANVGSPVNEQTLVIQ 181 Query: 603 LVAGLNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXX 782 L++GLNE+Y + A+ +Q+ DP P FYEARS+LILEET K + Sbjct: 182 LLSGLNEAYGSAATIIQNKDPFPTFYEARSQLILEETTKANRVANDTAF----------- 230 Query: 783 XYSKANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHY 962 +H ++N+ +T + ++Y Sbjct: 231 ---HTSHQPPNSNTNTSTQPTFNRNGNRGRGRHRGRGRGRTSTSTFPSQYPHSQQPPTYY 287 Query: 963 P--GPQQQQPWA-PYGPWAWQQQS---WTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQA 1124 P G QQP A + P W Q + W PPCPYPT+ RP GILGPRPQQA Sbjct: 288 PPFGAHSQQPTAQTHSPSYWTQPNNNYWPAPPCPYPTTLPSRPNNPNPSQGILGPRPQQA 347 Query: 1125 YTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGAT 1256 YT T + PT++ A ++ TLNPPD+NWY+DTGAT Sbjct: 348 YTAT--------EPAHIPTELGQAFNSFTLNPPDQNWYMDTGAT 383 >ref|XP_022025760.1| uncharacterized protein LOC110926319 [Helianthus annuus] ref|XP_021977462.1| uncharacterized protein LOC110872874 [Helianthus annuus] gb|OTG35205.1| hypothetical protein HannXRQ_Chr02g0054271 [Helianthus annuus] Length = 380 Score = 331 bits (848), Expect = e-106 Identities = 190/409 (46%), Positives = 234/409 (57%), Gaps = 22/409 (5%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPK---PAASSS 266 D HPA+TV+NIK IPITLEME SQYSSWAELFKIHCRAYQVIDH+ P+ P+ ++S Sbjct: 2 DPAKHPAITVSNIKTFIPITLEMETSQYSSWAELFKIHCRAYQVIDHLSPRKKDPSTAAS 61 Query: 267 TNKDKETSDPITPET-WSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQD 443 DK+ P+ W+RLDAIVLQWIY TISNDLL TI+ PDS A AW L++IF D Sbjct: 62 KESDKDKQSVAQPDDEWNRLDAIVLQWIYGTISNDLLCTIIRPDSNACYAWTALKNIFHD 121 Query: 444 NKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNE 623 N+ +RA+ L+++F N+RL+SF ++SAYCQE+KV++DQL+NVGSPV+ LV+QL++GLNE Sbjct: 122 NQATRAVLLQKKFANIRLESFQSMSAYCQEVKVISDQLANVGSPVNEHSLVIQLLSGLNE 181 Query: 624 SYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQ---XXXXXXXXXXXXXXXXXXXYSK 794 +Y VAS LQ+T PLP FYEARS+LILEET K + Sbjct: 182 AYGGVASILQNTKPLPTFYEARSQLILEETTKANRVANETALHTSHQPSNPPPTPTPTQT 241 Query: 795 ANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQ 974 NHA N+ G G SH P Sbjct: 242 TNHAPNRNSRGRGRGRG-------------------------------RGRGRSHTPTWS 270 Query: 975 QQQPWAPYGPWAWQQQS---------------WTPPPCPYPTSNWVRPTTQPGQSGILGP 1109 QQ W Y PW Q + W PPCPYPT RP Q GILGP Sbjct: 271 GQQ-WNGY-PWTAQPSAHTNRPPSWANPNAYYWATPPCPYPTVPPSRPVNSNPQQGILGP 328 Query: 1110 RPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGAT 1256 RPQQAYT S Y PT++D A + +L+PPD +WY+DTGAT Sbjct: 329 RPQQAYT--------ASEASYTPTELDQAFNAFSLHPPDHSWYMDTGAT 369 >ref|XP_021978925.1| uncharacterized protein LOC110874848 [Helianthus annuus] Length = 400 Score = 330 bits (847), Expect = e-105 Identities = 186/420 (44%), Positives = 236/420 (56%), Gaps = 14/420 (3%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSSTNK 275 ++K HPA TV++IKN IP+TLE+E SQY+SWA FK+HC+A+ V DH+ PKPAAS +T + Sbjct: 12 ESKLHPATTVSHIKNYIPVTLEIESSQYNSWATFFKLHCKAFLVFDHLSPKPAASETTTE 71 Query: 276 DKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNKHS 455 ++ T E W RLDAIVLQWIY+TISNDLLHTI+ +TA AW ++ +FQDNK S Sbjct: 72 TSSSTTKPTAE-WERLDAIVLQWIYSTISNDLLHTIINKTATAHDAWKAIEDLFQDNKSS 130 Query: 456 RALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESYDN 635 RA++L Q+F+N RLD F N+SAYCQELKVLADQL+NV +P+ NDRLVLQL+AGLNE Y+ Sbjct: 131 RAIHLMQKFSNTRLDGFPNISAYCQELKVLADQLANVNAPIDNDRLVLQLIAGLNEQYEG 190 Query: 636 VASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANHAGGS 815 +A+ LQ DPLP FY ARS +I ETRK +Q + N + + Sbjct: 191 IATILQQQDPLPSFYSARSSVIQVETRKAEQALNASKSASTALT-------ASTNRSPAA 243 Query: 816 NNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQPWAP 995 + SRN + H Q PW P Sbjct: 244 DQSRNHDRNYGTHDRGYGSGRGRSGHWGRGRSSSGRGRF-------------SSQYPWQP 290 Query: 996 --YGPW--AWQQ---------QSWTPPPCPYPTSNWVRPT-TQPGQSGILGPRPQQAYTM 1133 Y PW W W PPCPYP++N RP T GILGPRP Sbjct: 291 PSYYPWMNTWPTPPSNNQTPFYPWPVPPCPYPSTN--RPNQTHTKSPGILGPRPNH---- 344 Query: 1134 TYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPGIPEGDTTHEMQ 1313 S + Y PTDI+ AM+TM+L PD Y+DTGAT +MT G+ + D MQ Sbjct: 345 -------QSNVAYTPTDIEQAMYTMSLQQPDPTQYMDTGATGNMTHEQGLQDSDPAPAMQ 397 >ref|XP_022002369.1| uncharacterized protein LOC110899785 [Helianthus annuus] Length = 523 Score = 334 bits (856), Expect = e-105 Identities = 189/424 (44%), Positives = 239/424 (56%), Gaps = 29/424 (6%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS--- 266 D K HPA TV NIK+L+P+TLEM+ Y+SW+ELF++HCRA+QVI H+ PKP A SS Sbjct: 2 DAKLHPASTVNNIKSLVPVTLEMDSGLYASWSELFRLHCRAFQVIQHLSPKPEAESSSSK 61 Query: 267 -TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQD 443 T KDK+ P ++W RLDAIVLQWIY TIS+DLLHTIL P++TA +AW L++IF D Sbjct: 62 TTEKDKDKITPPADDSWDRLDAIVLQWIYATISSDLLHTILKPNATAHEAWVALENIFHD 121 Query: 444 NKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNE 623 NK SRA++L +F+N RL F NVSAYCQ+LKVL+DQL++VGSPV N LVLQL++GLNE Sbjct: 122 NKSSRAIHLRHKFSNTRLSGFPNVSAYCQQLKVLSDQLASVGSPVDNQSLVLQLISGLNE 181 Query: 624 SYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQ----------XXXXXXXXXXXXXXX 773 Y+ +A+ LQ+ DPLP FY+ARS+LIL E+RK +Q Sbjct: 182 QYEGIATILQNQDPLPSFYDARSKLILVESRKAEQALQASKAAGTALTANSTRSTGNNNN 241 Query: 774 XXXXYSKANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 953 Y N+ GG R + G G Sbjct: 242 GPADYRTENNYGGRGRGRGSRGRG--------------RGRNTWGRGRGNHSNHAWQQQQ 287 Query: 954 SHYPGPQQQQPWAPYGPWAWQ--------------QQSWTPPPCPYPTSNWVRPTTQPGQ 1091 H P P Q W+ PWA Q QQ W PPCPYPT N P G Sbjct: 288 WHTPSPWAPQSWSTPPPWAAQQWAAQAQQPWAATSQQQWAMPPCPYPTMN--PPAGNLGS 345 Query: 1092 S-GILGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMT 1268 + GILGPRP QA + Y PTDI+ AM++M+LN D +DTGAT++MT Sbjct: 346 APGILGPRPNQA-----------NHAAYTPTDIEQAMYSMSLNQHDPVPVMDTGATTNMT 394 Query: 1269 TSPG 1280 + G Sbjct: 395 NTQG 398 >ref|XP_021991106.1| uncharacterized protein LOC110887848 [Helianthus annuus] Length = 523 Score = 333 bits (853), Expect = e-105 Identities = 189/420 (45%), Positives = 241/420 (57%), Gaps = 31/420 (7%) Frame = +3 Query: 108 HPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIP--KPAASSSTNK-- 275 HPA+TV+NIK +P+TLE+E +++W+ELFK+HC+A+QV DH++P KPAASSS + Sbjct: 7 HPAVTVSNIKTFVPVTLEIESGHFTAWSELFKLHCKAFQVYDHLLPRQKPAASSSDKEKA 66 Query: 276 -DKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNKH 452 DK+T+ P T W RLDAIVLQWIY TIS DL+HTIL P++TA AW L+S+FQDNK Sbjct: 67 TDKDTTTP-TNALWERLDAIVLQWIYGTISKDLVHTILVPNTTAHTAWTTLESLFQDNKS 125 Query: 453 SRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESYD 632 +RA+ L+ +F+N RL+SF N+SAYCQELKVLADQLSNVG+P+ ND LVLQL+ LNE Y+ Sbjct: 126 ARAIQLKHRFSNTRLESFPNMSAYCQELKVLADQLSNVGAPLDNDGLVLQLLTSLNEQYE 185 Query: 633 NVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANH-AG 809 +A+ LQ DPLP FY ARS+LI+ E+RK + +ANH Sbjct: 186 GIATILQGQDPLPNFYTARSKLIMVESRKAENALHSAKTAGTALQTSSSSRPDQANHNFS 245 Query: 810 GSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQPW 989 G R++ G G H QQ PW Sbjct: 246 GRGRGRSSRGRG-------------------------RGRNHNSWGRGRHQQQSQQFSPW 280 Query: 990 A------PYGPWA------W-------------QQQSWTPPPCPYPTSNWVRPTTQPGQS 1094 A PYGPWA W QQ+W PPCPYPT+ T P + Sbjct: 281 AWWASPPPYGPWAGAPQPHWVGSSSQWAAPSTSSQQNWAAPPCPYPTTQPNSTTPNPA-A 339 Query: 1095 GILGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTS 1274 GILG +P QA+ T Y PTDID A+HTMTLN D Y+DTGAT +M+ + Sbjct: 340 GILGSKPSQAH-----------TAAYTPTDIDQALHTMTLNHSDLTSYMDTGATGNMSNT 388 Score = 62.0 bits (149), Expect = 2e-06 Identities = 31/56 (55%), Positives = 38/56 (67%), Gaps = 4/56 (7%) Frame = +1 Query: 1270 LLQEFPKGTPLMRCNSSGDLYPL----STTMLRHLTSPSTFAALSQDLWHHRLGHP 1425 L++++ TP++ CNSSGDLYPL TT SPSTFAA+S LWH RLGHP Sbjct: 466 LVKDYKTRTPILWCNSSGDLYPLVLGSGTT-----NSPSTFAAISSSLWHQRLGHP 516 >ref|XP_021974337.1| uncharacterized protein LOC110869343 [Helianthus annuus] Length = 523 Score = 332 bits (850), Expect = e-104 Identities = 188/424 (44%), Positives = 238/424 (56%), Gaps = 29/424 (6%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS--- 266 D K HPA TV NIK+L+P+TLEM+ Y+SW+ELF++HCRA+QVI H+ PKP A SS Sbjct: 2 DAKLHPASTVNNIKSLVPVTLEMDSGLYASWSELFRLHCRAFQVIQHLSPKPEAESSSSK 61 Query: 267 -TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQD 443 T KDK+ ++W RLDAIVLQWIY TIS+DLLHTIL P++TA +AW L++IF D Sbjct: 62 TTEKDKDKITTPADDSWDRLDAIVLQWIYATISSDLLHTILKPNATAHEAWVALENIFHD 121 Query: 444 NKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNE 623 NK SRA++L +F+N RL F NVSAYCQ+LKVL+DQL++VGSPV N LVLQL++GLNE Sbjct: 122 NKSSRAIHLRHKFSNTRLSGFPNVSAYCQQLKVLSDQLASVGSPVDNQSLVLQLISGLNE 181 Query: 624 SYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQ----------XXXXXXXXXXXXXXX 773 Y+ +A+ LQ+ DPLP FY+ARS+LIL E+RK +Q Sbjct: 182 QYEGIATILQNQDPLPSFYDARSKLILVESRKAEQALQASQAAGTALTANSTRSTGNNNN 241 Query: 774 XXXXYSKANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 953 Y N+ GG R + G G Sbjct: 242 GPADYRTENNYGGRGRGRGSRGRG--------------RGRNTWGRSRGNPSNHAWQQQQ 287 Query: 954 SHYPGPQQQQPWAPYGPWAWQ--------------QQSWTPPPCPYPTSNWVRPTTQPGQ 1091 H P P Q W+ PWA Q QQ W PPCPYPT N P G Sbjct: 288 WHTPSPWAPQSWSTPPPWAAQQWAAQAQQPWAATSQQQWAMPPCPYPTMN--PPAGNSGS 345 Query: 1092 S-GILGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMT 1268 + GILGPRP QA + Y PTDI+ AM++M+LN D +DTGAT++MT Sbjct: 346 APGILGPRPNQA-----------NHAAYTPTDIEQAMYSMSLNQHDPVPVMDTGATTNMT 394 Query: 1269 TSPG 1280 + G Sbjct: 395 NTQG 398 >dbj|GAU35084.1| hypothetical protein TSUD_70070 [Trifolium subterraneum] Length = 980 Score = 343 bits (881), Expect = e-104 Identities = 198/415 (47%), Positives = 237/415 (57%), Gaps = 13/415 (3%) Frame = +3 Query: 105 YHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS------ 266 +HPAL VTNIKN IPITLEME S YSSW ELFKI C QVIDHI+P + + S Sbjct: 7 FHPALAVTNIKNFIPITLEMETSHYSSWVELFKIQCTVNQVIDHIVPTQSEAISDGKTPE 66 Query: 267 -------TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRL 425 +NK+KE S E WSRLDAIVLQWIY TIS DLLHTIL P STA+QAW+RL Sbjct: 67 GASEPEGSNKEKEKSPKSDKELWSRLDAIVLQWIYGTISTDLLHTILKPGSTAKQAWERL 126 Query: 426 QSIFQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQL 605 ++IFQDNK SR ++LE QFT + +D + N+SAYCQELK+LADQLS+VG+PVS RLVLQL Sbjct: 127 ENIFQDNKASRVVHLENQFTRVHMDDYPNISAYCQELKMLADQLSSVGAPVSEQRLVLQL 186 Query: 606 VAGLNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXX 785 +AGLNE+YD VA +Q ++PLP F+EARSRLILEETRK KQ Sbjct: 187 IAGLNENYDGVAIFIQQSNPLPLFHEARSRLILEETRKAKQTAAAA-------------- 232 Query: 786 YSKANHAGGSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYP 965 S A + N N SG H Y Sbjct: 233 -SAVGTALLTTNGSNDDNSGRNH------------------------------DSVQGYG 261 Query: 966 GPQQQQPWAPYGPWAWQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQAYTMTYAP 1145 Q P P + +WT PP PT+ G RP QAY+ Sbjct: 262 NTANQNSHIP--PCPYPTSNWTRPPSNRPTAT-------------TGSRPPQAYS----- 301 Query: 1146 TVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPGIPEGDTTHEM 1310 ++ YAPTDID AM +++N PDENWY+DTGATSHMT S G +G+ T+EM Sbjct: 302 AHVSTPSNYAPTDIDAAMQNLSINIPDENWYMDTGATSHMTASQGYTDGNATNEM 356 >ref|XP_022041201.1| uncharacterized protein LOC110943776 [Helianthus annuus] Length = 523 Score = 331 bits (848), Expect = e-104 Identities = 187/417 (44%), Positives = 239/417 (57%), Gaps = 22/417 (5%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS--- 266 D K HPA TV NIK+L+P+TLEM+ Y+SW+ELF++HCRA+QVI H+ PKP A SS Sbjct: 2 DAKLHPASTVNNIKSLVPVTLEMDSGLYASWSELFRLHCRAFQVIQHLSPKPEAESSSSK 61 Query: 267 -TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQD 443 T KDK+T ++W RLDAIVLQWIY TIS+DLLHTIL P++TA +AW L++IF D Sbjct: 62 TTEKDKDTITKPADDSWDRLDAIVLQWIYATISSDLLHTILKPNATAHEAWVALENIFHD 121 Query: 444 NKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNE 623 NK SRA++L +F N RL F NVSAYCQ+LKVL+DQL++VGSPV N LVLQL++GLNE Sbjct: 122 NKSSRAIHLRHKFANTRLSGFPNVSAYCQQLKVLSDQLASVGSPVDNQSLVLQLISGLNE 181 Query: 624 SYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANH 803 Y+ +A+ LQ+ DPLP FY+ARS+LI E+RK +Q + + Sbjct: 182 QYEGIATILQNQDPLPSFYDARSKLIPVESRKAEQALQASQAAGTALT-------ANSTR 234 Query: 804 AGGSNNSRNT---TGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQ 974 + G+NNS T + Y H P P Sbjct: 235 STGNNNSGPADYRTDNNYGGRGRGRGSRGRGRGRNTWGRGRGNHSNHAWQQQQWHTPSPW 294 Query: 975 QQQPWAPYGPWAWQ--------------QQSWTPPPCPYPTSNWVRPTTQPGQS-GILGP 1109 Q W+ PWA Q QQ W PPCPYPT N P G + GILGP Sbjct: 295 APQSWSTPPPWAAQQWAAQAQQPWAATSQQQWAMPPCPYPTMN--PPAGNSGSAPGILGP 352 Query: 1110 RPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPG 1280 RP QA + Y PTDI+ AM++M+LN D +DTGAT++MT + G Sbjct: 353 RPNQA-----------NHAAYTPTDIEQAMYSMSLNQHDPVPVMDTGATTNMTNTQG 398 >ref|XP_022039984.1| uncharacterized protein LOC110942516 [Helianthus annuus] Length = 519 Score = 328 bits (840), Expect = e-103 Identities = 191/402 (47%), Positives = 232/402 (57%), Gaps = 7/402 (1%) Frame = +3 Query: 90 MADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKP---AAS 260 M K HPA TV+NIK IPITLEME QY+SW+ELFKI CRA+ VID + PKP AAS Sbjct: 1 MESTKLHPASTVSNIKGFIPITLEMETGQYASWSELFKIQCRAFLVIDQLSPKPPAPAAS 60 Query: 261 SSTNKDKETSDPITP--ETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSI 434 SS + DK+ TP + W RLDAIVLQWIY T SNDLLHTIL P++TA AW L+ I Sbjct: 61 SSKDTDKDKDKVTTPFDDQWDRLDAIVLQWIYATFSNDLLHTILKPNTTAYDAWTTLEGI 120 Query: 435 FQDNKHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAG 614 FQDNK SRA++L +F+N LD FSNVSAYCQ+L ++ADQL+NVGSPV NDRLVLQL++G Sbjct: 121 FQDNKSSRAIHLLHKFSNTHLDGFSNVSAYCQQLTMIADQLANVGSPVENDRLVLQLISG 180 Query: 615 LNESYDNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSK 794 LNE Y+ + + LQ DPLP FYEARS+L+L E+RK +Q + Sbjct: 181 LNEQYEGITTILQQQDPLPTFYEARSKLVLVESRKAEQAPSNQTHSDAERGRGRGRGCDR 240 Query: 795 ANHAGGSNNSRNTTGS-GYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGP 971 SN R +GS GY +++ Sbjct: 241 GRQ---SNRVRGASGSLGY--------PTGAPGQGWPPNTWAGNHQWAANNNFNNNWAAN 289 Query: 972 QQQQPWAPYGPWAWQQQSWTPPP-CPYPTSNWVRPTTQPGQSGILGPRPQQAYTMTYAPT 1148 QQ WA W SW PP CPYPT++ RP T GILG RP QA+ Sbjct: 290 QQ---WAAPPQW----NSWAAPPLCPYPTTS-PRPNTNTSGQGILGSRPNQAH------- 334 Query: 1149 VATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTS 1274 Y PTDID A+HTM+LN D Y+DTGAT +MT + Sbjct: 335 ----YTAYTPTDIDQALHTMSLNQQDPTLYMDTGATGNMTNN 372 Score = 78.6 bits (192), Expect = 1e-11 Identities = 35/66 (53%), Positives = 47/66 (71%) Frame = +1 Query: 1270 LLQEFPKGTPLMRCNSSGDLYPLSTTMLRHLTSPSTFAALSQDLWHHRLGHPGARILNSL 1449 L++++ P++RCNSSGDLYPL T+PSTFAA+S DLWH RLGHPG +L+SL Sbjct: 450 LVKDYKTRIPILRCNSSGDLYPLFLGSGSTTTTPSTFAAISSDLWHQRLGHPGQHLLHSL 509 Query: 1450 RNKRYI 1467 + +I Sbjct: 510 KQSCFI 515 >ref|XP_014489620.1| uncharacterized protein LOC106752450 [Vigna radiata var. radiata] Length = 371 Score = 313 bits (802), Expect = 3e-99 Identities = 175/349 (50%), Positives = 207/349 (59%), Gaps = 5/349 (1%) Frame = +3 Query: 90 MADN-KYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSS 266 MAD K+H AL V NIK+ I ITLEM+K YSSWAELFKIH RAYQV+DHI+P + S Sbjct: 1 MADQTKFHLALAVNNIKHFIFITLEMDKGHYSSWAELFKIHFRAYQVLDHILPPTSQSID 60 Query: 267 TNKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDN 446 TN ET + WSRL IVLQWIY+TISNDLLHTIL D TA+QAW+RLQ+IFQDN Sbjct: 61 TNT--ETIVEVDFALWSRLGTIVLQWIYSTISNDLLHTILQSDFTAQQAWERLQNIFQDN 118 Query: 447 KHSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNES 626 K S A YLE QFT++ +D + N+SAYCQELK+LADQLSNVG+PVSN RLVLQL+AGLNE+ Sbjct: 119 KPSCAAYLENQFTHVHMDDYPNISAYCQELKMLADQLSNVGAPVSNQRLVLQLIAGLNEN 178 Query: 627 YDNVASHLQHTDPLPQFYEARSRLILEETRKQKQ--XXXXXXXXXXXXXXXXXXXYSKAN 800 YD VA+ +Q T+PLP+FY+ARSRLILEET K KQ S N Sbjct: 179 YDGVATFIQQTNPLPRFYKARSRLILEETHKAKQVASTVNNVSTALLITNNSSDVVSNVN 238 Query: 801 HAGGSN-NSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQ 977 H S+ + +T G+ P Q Sbjct: 239 HINNSSTHPFSTHGNNGLKCGPNRGRGRNNGGRGRGRNNTHHHNGQQYGGPSHQRPWQNQ 298 Query: 978 QQPWAPYGPWA-WQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQ 1121 Q W Y PW W Q+W P CPYPT +W P Q IL RP + Sbjct: 299 QSQWTSYPPWGPWGPQTWVAPRCPYPTYSWATPQPPTRQPEILSNRPNR 347 >ref|XP_021974932.1| uncharacterized protein LOC110870042 [Helianthus annuus] Length = 454 Score = 310 bits (794), Expect = 6e-97 Identities = 177/418 (42%), Positives = 238/418 (56%), Gaps = 25/418 (5%) Frame = +3 Query: 96 DNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSSTNK 275 D K HPA+TVTNIK IP+TLE+E SQY++WAELFKIHCR YQV+DH+ P+ + +K Sbjct: 2 DPKNHPAITVTNIKAHIPVTLELETSQYATWAELFKIHCRVYQVVDHLSPRK--ETDKDK 59 Query: 276 DKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNKHS 455 +K+++ + + W RLDAIVLQWIY TIS+DLL TI+ PDSTA AW ++SIF DN+ + Sbjct: 60 EKDSAGSTSDDNWDRLDAIVLQWIYGTISSDLLSTIIRPDSTAAYAWTAIKSIFHDNQTT 119 Query: 456 RALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESYDN 635 RA+YLE++F N+RLDSF N+ +YCQE+KVL+DQL+NVGSPV+ +LV+QL++GL E+Y Sbjct: 120 RAVYLERKFANVRLDSFLNMPSYCQEVKVLSDQLANVGSPVTEQKLVIQLLSGLTEAYGG 179 Query: 636 VASHLQHTDPLPQFYEARSRLILEETRKQKQ----XXXXXXXXXXXXXXXXXXXYSKANH 803 +AS +Q+T PLP FYEARS+L LEET K + S+ N+ Sbjct: 180 IASIIQNTKPLPTFYEARSQLCLEETTKANRVSTDAAFHTSHPPNYPQSTGAHPPSQPNN 239 Query: 804 AGGSNNSRNT---TGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGP- 971 GG R GS Y + P P Sbjct: 240 RGGRTRGRGRGRGRGSSYSTNQTQSQNYNPKNNYLNTQPAPHWPHTQPQWPNTHNSPSPT 299 Query: 972 ---------QQQQPWAPYGPWAWQQQSWTPPPCPYPTS-------NWVRPTTQPG-QSGI 1100 P +PY +W PPCPYPT+ + P + P G+ Sbjct: 300 WPISYNHSTHSPGPNSPY-------PTWPAPPCPYPTALAPSLRPTYNGPPSYPSPHQGL 352 Query: 1101 LGPRPQQAYTMTYAPTVATSTMGYAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTS 1274 LG PQQ + YA + T T PT++D A + M+L+PPD+ WY+DTGAT MT + Sbjct: 353 LGSLPQQ--SQVYAASKHTCT----PTELDQAFNAMSLHPPDQTWYMDTGATGTMTNN 404 >dbj|GAU39490.1| hypothetical protein TSUD_279110 [Trifolium subterraneum] Length = 452 Score = 310 bits (793), Expect = 8e-97 Identities = 174/397 (43%), Positives = 217/397 (54%) Frame = +3 Query: 90 MADNKYHPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSST 269 M+ +HP L VTNIKN IP LE+EK Y+ WAELF++H A++VI+HIIP+P Sbjct: 28 MSKYDFHPTLAVTNIKNSIPFVLELEKDHYNMWAELFEVHAHAHKVINHIIPQPG----- 82 Query: 270 NKDKETSDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNK 449 K+K E W+ LD+ VLQWIY+TIS DLL TI+ STA AW+RL +F+DN+ Sbjct: 83 -KEKPAPTDANFEMWTVLDSTVLQWIYSTISFDLLTTIMEKGSTAMAAWNRLAGLFEDNQ 141 Query: 450 HSRALYLEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESY 629 +SRA+ LE+ F++ R++ F NVSAYCQ LK L+DQL NVG+PVS RLVLQL +GL E Y Sbjct: 142 NSRAVALEEDFSSTRMEDFPNVSAYCQCLKQLSDQLKNVGAPVSEQRLVLQLFSGLTEPY 201 Query: 630 DNVASHLQHTDPLPQFYEARSRLILEETRKQKQXXXXXXXXXXXXXXXXXXXYSKANHAG 809 VA+ ++ + PLP F EARS L LEE+ K SK Sbjct: 202 RGVATLIRQSKPLPLFLEARSMLTLEESDLAKMHSTNSPTALHTTAPRDIDDSSKQR--- 258 Query: 810 GSNNSRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQPW 989 + N GSG H S P QQ + Sbjct: 259 SNRRQNNRYGSGRHH-------------NNQSRNGGRGQRGVSRSNGPSWSSQPWQQPQY 305 Query: 990 APYGPWAWQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRPQQAYTMTYAPTVATSTMG 1169 + PW W W+ PPCPYPTS W RP Q GILG RP QA+T T +P Sbjct: 306 PSWSPWGWTPPPWSVPPCPYPTSQWTRPIGPSRQPGILGQRP-QAHTATASP-------- 356 Query: 1170 YAPTDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPG 1280 PTDI AMHTM+L PPD WY+DTGA+SH S G Sbjct: 357 -VPTDIAAAMHTMSLTPPDNMWYMDTGASSHTAASQG 392 >gb|PNY04738.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 409 Score = 303 bits (775), Expect = 1e-94 Identities = 175/394 (44%), Positives = 222/394 (56%), Gaps = 3/394 (0%) Frame = +3 Query: 108 HPALTVTNIKNLIPITLEMEKSQYSSWAELFKIHCRAYQVIDHIIPKPAASSSTNKDKET 287 HPALTV NI N I ITL MEK Y++W+ELFKIH R +QVIDHIIP + ST +T Sbjct: 27 HPALTVPNITNFIKITLSMEKGTYNTWSELFKIHARVFQVIDHIIP--STEPSTIPSLKT 84 Query: 288 SDPITPETWSRLDAIVLQWIYTTISNDLLHTILAPDSTARQAWDRLQSIFQDNKHSRALY 467 DP W RLDA+ WIY TIS DLL+ I+ DSTA AW+RL IF DNK+SRALY Sbjct: 85 IDP---SLWRRLDAV---WIYGTISEDLLNIIIEQDSTAETAWNRLFEIFYDNKNSRALY 138 Query: 468 LEQQFTNLRLDSFSNVSAYCQELKVLADQLSNVGSPVSNDRLVLQLVAGLNESYDNVASH 647 LEQ+F+ R++ FS+ S+YCQ LK L DQ++NVG+ +S +RLVLQL++GL ++Y V S Sbjct: 139 LEQEFSRTRMEQFSDASSYCQHLKSLFDQMANVGTSISKERLVLQLISGLTDAYAAVGSE 198 Query: 648 LQHTDPLPQFYEARSRLILEETRKQKQ--XXXXXXXXXXXXXXXXXXXYSKANHAGGSNN 821 ++H LP FY+ARS +ILEET QK+ S + GGSNN Sbjct: 199 IRHAAILPDFYKARSMIILEETALQKKVSNSVDNTALMASNNETSTDHTSSRPNRGGSNN 258 Query: 822 SRNTTGSGYFHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHYPGPQQQQPWAPYG 1001 + +G H PQ PW P Sbjct: 259 NHGGRSNGRGHRGRGRGQNHGRGGGRF----------------------PQHSPPWQP-- 294 Query: 1002 PWAWQQQSWTPPPCPYPTSNWVRPTTQPGQSGILGPRP-QQAYTMTYAPTVATSTMGYAP 1178 W PPCPYPT++ Q GILGP+P QA+ VA++ Y+P Sbjct: 295 --------WATPPCPYPTTD-----NSNKQPGILGPKPLHQAH-------VASTLPSYSP 334 Query: 1179 TDIDTAMHTMTLNPPDENWYLDTGATSHMTTSPG 1280 TDI TAMHT++L+PPD+ WY+DTGATSHMT + G Sbjct: 335 TDIQTAMHTLSLSPPDDQWYMDTGATSHMTANEG 368