BLASTX nr result
ID: Chrysanthemum21_contig00039972
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00039972
         (821 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gb|KYP71190.1| Putative ribonuclease H protein At1g65750 family ...     204   1e-59
dbj|GAU10476.1| hypothetical protein TSUD_420710, partial [Trifo...     203   5e-59
gb|KYP54863.1| Putative ribonuclease H protein At1g65750 family ...     204   2e-57
ref|XP_021995642.1| uncharacterized protein LOC110892803 [Helian...     199   3e-57
ref|XP_022032886.1| uncharacterized protein LOC110933974 [Helian...     207   6e-57
ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuc...     194   1e-56
gb|OTG26767.1| putative reverse transcriptase domain, Reverse tr...     203   3e-56
gb|KYP61726.1| Putative ribonuclease H protein At1g65750 family ...     198   3e-56
gb|OTG08794.1| putative RNA-directed DNA polymerase, eukaryota, ...     203   9e-56
gb|OTF94555.1| putative RNA-directed DNA polymerase, eukaryota, ...     203   9e-56
ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helian...     194   1e-55
gb|KYP47723.1| Putative ribonuclease H protein At1g65750 family ...     194   2e-55
gb|OTG29886.1| putative reverse transcriptase domain, Reverse tr...     199   9e-55
ref|XP_021995896.1| uncharacterized protein LOC110893084 [Helian...     200   1e-54
dbj|GAU22997.1| hypothetical protein TSUD_98260 [Trifolium subte...     197   1e-54
dbj|GAU29439.1| hypothetical protein TSUD_150090 [Trifolium subt...     199   2e-54
ref|XP_021987015.1| uncharacterized protein LOC110883607 [Helian...
184 4e-54 gb|KYP44439.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu... 198 6e-54 dbj|GAU34535.1| hypothetical protein TSUD_394090 [Trifolium subt... 197 9e-54 gb|OTF85059.1| putative RNA-directed DNA polymerase, eukaryota, ... 197 1e-53 >gb|KYP71190.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 417 Score = 204 bits (518), Expect = 1e-59 Identities = 109/274 (39%), Positives = 154/274 (56%), Gaps = 3/274 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+ I +IL+ F L +GL++N KS+L G ++ ++ A + C V + F YLG+ + Sbjct: 114 NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 173 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G P + W+ +I K++ RLSKWK TLS GGR LLKSVL S PIY LS FK P+G++ Sbjct: 174 GANPRCIKTWEPVISKVKKRLSKWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGII 233 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +ES+ F G D+ +KI WVAW + K HGGLG+ A N ALL KW WR L Sbjct: 234 SKLESLFKLFLWGGDENHRKIAWVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLV 293 Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718 + G W R++ ++Y G + + S FD F S C K VG+G Sbjct: 294 EKGRFWHRVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDSAPYTSFDWFSSRCTKVVGDG 353 Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 N+ FW D W G PL + + RLF++ DK+++V Sbjct: 354 RNTFFWKDGWSGQGPLCNPYSRLFSIASDKDVSV 387 >dbj|GAU10476.1| hypothetical protein TSUD_420710, partial [Trifolium subterraneum] Length = 441 Score = 203 bits (516), Expect = 5e-59 Identities = 109/276 (39%), Positives = 154/276 (55%), Gaps = 5/276 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N++ + IL F SGL++N KS L+GV + S + +AAS +GC V F YLG+ + Sbjct: 99 NVRALRVILVLFEKVSGLKVNFHKSMLVGVNIGESWLMEAASVLGCKVGKIPFMYLGLPI 158 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G P RL W+ I+ +R+RLS+WK + LS GGRL L+KSVL S P+Y LS FK P G++ Sbjct: 159 
GGDPRRLAFWEPIVSNIRSRLSRWKNRLLSFGGRLILIKSVLTSLPVYALSFFKAPSGII 218 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ES+ FF G +G +KI W++W + HGGLGV N ALL KW WR L Sbjct: 219 SSLESLLSSFFWGGGEGHRKIAWISWQTVCLGQEHGGLGVRQLREFNTALLGKWCWRMLV 278 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLK-----TKGFDFLSHCKKRVG 712 G +W R++ A YG REV ++ G F ++RVG Sbjct: 279 DKGGMWYRVLAARYG-EVAGRLAVGGRNGSAWWREVARIRDGDGAVGGAWFAESIERRVG 337 Query: 713 NGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 NG ++ FW D W+ PL + RLF L +++I+V Sbjct: 338 NGSDTSFWSDPWLDGVPLRVRYRRLFDLHFNQSISV 373 >gb|KYP54863.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 648 Score = 204 bits (518), Expect = 2e-57 Identities = 109/274 (39%), Positives = 154/274 (56%), Gaps = 3/274 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+ I +IL+ F L +GL++N KS+L G ++ ++ A + C V + F YLG+ + Sbjct: 104 NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 163 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G P + W+ +I K++ RLSKWK TLS GGR LLKSVL S PIY LS FK P+G++ Sbjct: 164 GANPRCIKTWEPVISKVKKRLSKWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGII 223 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +ES+ F G D+ +KI WVAW + K HGGLG+ A N ALL KW WR L Sbjct: 224 SKLESLFKLFLWGGDENHRKIAWVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLV 283 Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718 + G W R++ ++Y G + + S FD F S C K VG+G Sbjct: 284 EKGRFWHRVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDSTPYTSFDWFSSRCTKVVGDG 343 Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 N+ FW D W G PL + + RLF++ DK+++V Sbjct: 344 RNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSV 377 >ref|XP_021995642.1| uncharacterized protein LOC110892803 [Helianthus annuus] Length = 466 Score = 199 bits (506), Expect = 3e-57 Identities = 109/277 (39%), Positives = 149/277 (53%), Gaps = 4/277 (1%) Frame = +2 Query: 2 
DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181 + N+K + L+ F+LASGL+IN+ KS + G+GV EV + IGC + YLG+ Sbjct: 96 EDNVKNVARCLRIFYLASGLKINLQKSNIYGLGVGNDEVLNMCNVIGCKSDSIPLTYLGI 155 Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361 VG +R+N W II RLS WK KTLSIGGRLTL+ SVL S PIY S++K P G Sbjct: 156 SVGSNMNRINNWTPIIEVFDKRLSAWKAKTLSIGGRLTLINSVLESLPIYYFSLYKAPVG 215 Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541 V+K +E+ +F K+ WVAWD K GGLG++ +N ALL+KW WRF Sbjct: 216 VIKTLEAKMRKFLWVGSSNINKMNWVAWDWVTWPKNIGGLGINRLLEVNEALLVKWGWRF 275 Query: 542 LSQDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRV 709 ++ +LW +++ A +G + V LK G +V Sbjct: 276 RVENHNLWRKVVEACHGKANHWSFLPLNSNIAGCWKNVVKLLNKLKLNGRGLNRLILGKV 335 Query: 710 GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 GNG+ + FW D W+GD P +P LF LEL K+ V Sbjct: 336 GNGVETRFWIDSWLGDLPFMERWPLLFGLELFKSCRV 372 >ref|XP_022032886.1| uncharacterized protein LOC110933974 [Helianthus annuus] Length = 1354 Score = 207 bits (526), Expect = 6e-57 Identities = 111/275 (40%), Positives = 156/275 (56%), Gaps = 4/275 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 NL+ I +L+ F+L SGL+IN+ KS L GVGV S+++ ++ +GC + F YLG+ V Sbjct: 835 NLEKIHRLLRIFYLCSGLKINIHKSVLFGVGVEDSDIEAMSNVLGCRIGRLPFVYLGIKV 894 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G +R++ W+ ++ +R RL+ WK K LSIGGRLTL+KSVL S P+Y LS+F+ P+ V+ Sbjct: 895 GANMNRISNWEPVLEAIRDRLTSWKTKVLSIGGRLTLIKSVLTSLPVYYLSLFRAPKAVV 954 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 N+E I F + K + WV+W+ SK GGLG+S +N ALL KWVWRF + Sbjct: 955 DNIEKIMRHFLWAGCKVGKGLHWVSWEVATKSKKSGGLGISKIAEVNSALLAKWVWRFKN 1014 Query: 548 QDGSLWCRIIRALYG----XXXXXXXXXXXXXXXXXMREVQSLKTKGFDFLSHCKKRVGN 715 SLW RII ++G R V L G F + K +GN Sbjct: 1015 DKNSLWKRIIEDIHGGRKRWIFLPVNNSIKGCWKSISRHVDGLNFNGQPFKALFKGSIGN 1074 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G FW D W G PL + +PRL+A + 
+KN V Sbjct: 1075 GSRLRFWKDLWWGSTPLMNRWPRLYAQDSNKNAVV 1109 >ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuca sativa] Length = 355 Score = 194 bits (494), Expect = 1e-56 Identities = 103/275 (37%), Positives = 145/275 (52%), Gaps = 4/275 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+K + IL+ F ++SGL++N KSQ+ G+GV EV A +GC N F YLGV V Sbjct: 37 NIKNLAGILRCFHVSSGLKVNFKKSQVFGIGVDSQEVLSLARPLGCEPANLPFTYLGVPV 96 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G W +I + RLS WK K LS+GGRLTL KSV+GS P + S+F P G+L Sbjct: 97 GANMKLKKYWKPVIENFQLRLSAWKSKNLSLGGRLTLTKSVIGSLPTFYFSLFIAPAGIL 156 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 K +E IR RF G + +KI WV+W K K +GGLG+ S ALN +L++KW WR Sbjct: 157 KALEKIRRRFLWGGSEDSRKINWVSWGKVTTPKENGGLGLGSLKALNLSLIMKWWWRLRV 216 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQ----SLKTKGFDFLSHCKKRVGN 715 ++ LW ++I ++ + + L G + K VG Sbjct: 217 ENTCLWSKVIEGIHNLKNKPGDYMSKQSITGVWKNITQARGELMKVGINIEDVILKEVGT 276 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G ++FW D W G+ L + FP ++ LE K+ V Sbjct: 277 GEKTMFWHDRWTGNMTLKASFPEMYKLERHKHCMV 311 >gb|OTG26767.1| putative reverse transcriptase domain, Reverse transcriptase zinc-binding domain protein [Helianthus annuus] Length = 881 Score = 203 bits (517), Expect = 3e-56 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N + IL+ F+LASGL++N++KS + GVG+++ EV A+ +GC + F++LG++V Sbjct: 363 NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 422 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G + + W II + RL+ WK K LS GGR+TLLKSVL + P Y S++K P V+ Sbjct: 423 GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 482 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ +R FF G + + K+ WVAWD +A +GGLG S N A+L KW WRF Sbjct: 483 
DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 542 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715 ++ LW R+I A++ +++ SL+ KG D + VGN Sbjct: 543 ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 602 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALE 799 G N+LFW D WIG+ P H FP LF+LE Sbjct: 603 GSNTLFWLDLWIGNTPFHIRFPTLFSLE 630 >gb|KYP61726.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 554 Score = 198 bits (504), Expect = 3e-56 Identities = 105/274 (38%), Positives = 152/274 (55%), Gaps = 3/274 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+ I +IL+ F L +GL++N KS+L G ++ ++ A + C V + F YLG+ + Sbjct: 41 NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 100 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G+ P + W+ +I KL+ RLSKWK TLS GGR LLKSVL S P Y LS FK P+G++ Sbjct: 101 GENPHCIKTWEPVISKLKKRLSKWKSSTLSFGGRSALLKSVLNSIPTYYLSFFKAPQGII 160 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +ES+ F G D+ +KI WVAW + K HGGLG+ A N A+L KW W L Sbjct: 161 SKLESLFKLFLWGGDENHRKIAWVAWQEVCKGKEHGGLGILDLRAFNLAILEKWRWHLLV 220 Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718 + G W +++ ++Y G + + FD F S C K VG+G Sbjct: 221 EKGRFWHKVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDFAPYASFDWFSSRCTKVVGDG 280 Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 N+ FW D W G PL + + RLF++ DK+++V Sbjct: 281 QNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSV 314 >gb|OTG08794.1| putative RNA-directed DNA polymerase, eukaryota, Reverse transcriptase zinc-binding domain protein [Helianthus annuus] Length = 1217 Score = 203 bits (517), Expect = 9e-56 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N + IL+ F+LASGL++N++KS + GVG+++ EV A+ +GC + F++LG++V Sbjct: 699 NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 758 Query: 188 
GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G + + W II + RL+ WK K LS GGR+TLLKSVL + P Y S++K P V+ Sbjct: 759 GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 818 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ +R FF G + + K+ WVAWD +A +GGLG S N A+L KW WRF Sbjct: 819 DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 878 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715 ++ LW R+I A++ +++ SL+ KG D + VGN Sbjct: 879 ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 938 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALE 799 G N+LFW D WIG+ P H FP LF+LE Sbjct: 939 GSNTLFWLDLWIGNTPFHIRFPTLFSLE 966 >gb|OTF94555.1| putative RNA-directed DNA polymerase, eukaryota, Reverse transcriptase zinc-binding domain protein [Helianthus annuus] Length = 1282 Score = 203 bits (517), Expect = 9e-56 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N + IL+ F+LASGL++N++KS + GVG+++ EV A+ +GC + F++LG++V Sbjct: 562 NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 621 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G + + W II + RL+ WK K LS GGR+TLLKSVL + P Y S++K P V+ Sbjct: 622 GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 681 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ +R FF G + + K+ WVAWD +A +GGLG S N A+L KW WRF Sbjct: 682 DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 741 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715 ++ LW R+I A++ +++ SL+ KG D + VGN Sbjct: 742 ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 801 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALE 799 G N+LFW D WIG+ P H FP LF+LE Sbjct: 802 GSNTLFWLDLWIGNTPFHIRFPTLFSLE 829 >ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helianthus annuus] Length = 445 Score = 194 bits (494), Expect = 
1e-55 Identities = 110/275 (40%), Positives = 147/275 (53%), Gaps = 4/275 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N++ IL+ F+L SGL+IN+ KS L GVG EVD +GC F YLG+ V Sbjct: 100 NIQSTTKILRIFYLFSGLRINLYKSNLFGVGTEDMEVDNMMEILGCKRGGIPFVYLGIQV 159 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G +R++ W II ++ARL WK KTLSIGGRL L+KSVL S PIY LS++K P+ V+ Sbjct: 160 GAKMTRISNWTSIIEVIKARLVSWKAKTLSIGGRLILIKSVLESLPIYYLSLYKAPKVVI 219 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +E+I RF E+KI WVAWD K GGL V+ +N ALLLKW WRF Sbjct: 220 DIIEAIMRRFLWAGSSAERKIPWVAWDIITTPKKKGGLCVTKLQEVNEALLLKWTWRFKK 279 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTK----GFDFLSHCKKRVGN 715 + SLW +II +G +++ + K G S+ +G+ Sbjct: 280 EGNSLWKKIIMGCHGSSRPWAMLPCSASASGCWKQIVKVGEKKLPNGKSLNSYFVGMLGD 339 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G FW D W+ + PL +P LF LE K + V Sbjct: 340 GSTINFWGDTWLREEPLRITYPNLFRLEKKKWVKV 374 >gb|KYP47723.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 434 Score = 194 bits (492), Expect = 2e-55 Identities = 109/273 (39%), Positives = 148/273 (54%) Frame = +2 Query: 2 DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181 + N+ I +IL+ F LAS L+IN KSQLLG V + A + C + + F YLG+ Sbjct: 97 ESNIWAIKSILRLFELASRLKINFLKSQLLGFHVDTLWLQSMAMFLHCRIGSLPFTYLGL 156 Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361 +G P RL+ W +I K++ RLS WK ++S GGR+TLLKSVL S PIY LS FK PRG Sbjct: 157 PIGANPKRLDTWQPVIEKIQKRLSSWKCDSMSFGGRITLLKSVLHSIPIYFLSFFKAPRG 216 Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541 ++ +ES+ F G D KKI WVAWD K GGLG+ A N ALL KW WR Sbjct: 217 IISQLESLFKSFLWGGDADHKKIHWVAWDDVCREKNKGGLGIRDLIAFNLALLGKWKWRM 276 Query: 542 LSQDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFDFLSHCKKRVGNGI 721 L + SLW ++I +LYG + + KG + C+K VGNG Sbjct: 277 LVETNSLWVKVINSLYG---------------DHLSFSSGSRVKGSRW---CRKVVGNGK 318 
Query: 722 NSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 N+ FW + W+ L + RL+ + +K + Sbjct: 319 NTYFWEEDWLQGGRLSQRYNRLYLIAENKKAKI 351 >gb|OTG29886.1| putative reverse transcriptase domain, Reverse transcriptase zinc-binding domain protein [Helianthus annuus] Length = 853 Score = 199 bits (506), Expect = 9e-55 Identities = 108/275 (39%), Positives = 150/275 (54%), Gaps = 4/275 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 NL+ + IL+ F++ SGL+ N+ KS L GVG +EVD +GC F YLG+ V Sbjct: 333 NLQNMARILRIFYICSGLRTNIHKSNLFGVGTEDNEVDNMMEVLGCKRGAYPFFYLGIQV 392 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G SR++ W+ +I ++ RL WK +TLSIGGRL L+KSVL + PIY S+++ P V+ Sbjct: 393 GANMSRISNWNVVIEVVKRRLESWKARTLSIGGRLILIKSVLENLPIYYFSLYQAPMAVI 452 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ESI RF EKKI WVAWD K +GGLG+S +N ALLLKW WRF Sbjct: 453 NSIESIMRRFLWAGSSEEKKIPWVAWDVIARPKNNGGLGISRLQDINEALLLKWTWRFKL 512 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTK----GFDFLSHCKKRVGN 715 + SLW ++I G + + +K G + S+ VG Sbjct: 513 EGNSLWKKVIVGCNGSSRAWTMLPCSSSASGCWKRIVKTGSKKIDNGRELNSYFVADVGA 572 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G + FW D W+ D PL ++P LF LE +K + V Sbjct: 573 GSSVNFWTDTWLRDQPLRDVYPNLFRLEKNKWVNV 607 >ref|XP_021995896.1| uncharacterized protein LOC110893084 [Helianthus annuus] Length = 1152 Score = 200 bits (508), Expect = 1e-54 Identities = 109/275 (39%), Positives = 152/275 (55%), Gaps = 4/275 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+K I +L+ F+L SGL+IN+ KS + GV EVD +GC + F YLG+ V Sbjct: 635 NIKSIARVLRIFYLCSGLRINLHKSNIYGVCTDDLEVDNMMEVLGCKRGDFPFTYLGIKV 694 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G +R+ W+ ++ ++ RL+ WK K LSIGGRLTL+KSVL S P+Y S++K P+ V+ Sbjct: 695 GAKMTRIINWEPVVDVIKGRLASWKAKHLSIGGRLTLIKSVLESLPVYYFSLYKAPKAVI 754 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 
547 ++E RF EKKI+WVAW+ K GGLG+S +N ALLLKW WRF + Sbjct: 755 DSIEMCMRRFLWADSYVEKKISWVAWEIVTLPKNQGGLGISKLQEVNDALLLKWTWRFKT 814 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMR---EVQSLK-TKGFDFLSHCKKRVGN 715 +D LW ++I +G + +V +K G S +G+ Sbjct: 815 EDSCLWKKVIMGCHGSSRPWAMLPCNASSSGCWKYIVKVGDIKVANGMPLHSFFVGNLGD 874 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G + FW D W+ DAPL I+P LF LE DK I V Sbjct: 875 GRSIYFWGDVWLRDAPLRIIYPNLFRLEKDKWIKV 909 >dbj|GAU22997.1| hypothetical protein TSUD_98260 [Trifolium subterraneum] Length = 767 Score = 197 bits (502), Expect = 1e-54 Identities = 106/277 (38%), Positives = 158/277 (57%), Gaps = 6/277 (2%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N++ + +L F + SGL++N +KS L+GV ++ S + +AAS +GC V F YLG+ + Sbjct: 207 NVRALWAVLMLFEVVSGLRVNFNKSMLVGVNIADSWLIEAASVLGCRVGTMTFMYLGLPI 266 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G P RL+ W+ ++ ++R+RL WK + LS GGRL LLK VL S P+Y LS FK P G++ Sbjct: 267 GGDPRRLSFWEPVVNRIRSRLVGWKSRFLSFGGRLVLLKLVLTSLPVYALSFFKAPSGII 326 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ES+ FF G + +KI+W++W + GGLGV N ALL KW WR L Sbjct: 327 SSIESLLNNFFWGGCEDRRKISWISWKIVCLREEAGGLGVRQLREFNMALLGKWCWRLLV 386 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFD------FLSHCKKRV 709 LW R++ A YG RE+ ++ + D F + ++RV Sbjct: 387 NKSGLWYRVLAARYG-EEVGRLREGGRTGSAWWREIVRIRDEEGDVGERGWFAASIERRV 445 Query: 710 GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 GNG+++ FW D W+G PL + RLF L L+K+ TV Sbjct: 446 GNGVDTFFWTDPWLGGVPLSVKYMRLFDLSLNKHRTV 482 >dbj|GAU29439.1| hypothetical protein TSUD_150090 [Trifolium subterraneum] Length = 919 Score = 199 bits (505), Expect = 2e-54 Identities = 112/277 (40%), Positives = 155/277 (55%), Gaps = 6/277 (2%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N++ + L F SGL++N KS L GV V+ + + AA +GC F YLG+ + Sbjct: 508 NVRTLKVTLLLFEAISGLKVNFHKSMLFGVNVNATWLHDAAVVLGCRHGQLPFLYLGLPI 567 
Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G PS+L W ++ ++R +LS WK K LS GGRL LLKSVL S P+Y LS FK P G++ Sbjct: 568 GGDPSKLCFWHPLVDRIRKKLSGWKCKNLSFGGRLILLKSVLSSIPVYFLSFFKAPSGII 627 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +ESI C FF G + +KI W+ WD ++++GGLGV N ALL KWVWR L Sbjct: 628 STLESIFCHFFWGGCEVNRKIAWIKWDTICLNRVNGGLGVRRLKEFNIALLGKWVWRCLV 687 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFD------FLSHCKKRV 709 ++ SLW ++RA YG R + S+++ G L + K++V Sbjct: 688 ENDSLWSLVLRAKYGQEGGRVRFSEGVGSTWW-RALNSVRS-GVGVRDVRWLLDNIKRKV 745 Query: 710 GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G G SLFW D W+ D+P F RL+ L +DKNI V Sbjct: 746 GGGRGSLFWLDPWLEDSPFSRSFSRLYDLAVDKNILV 782 >ref|XP_021987015.1| uncharacterized protein LOC110883607 [Helianthus annuus] Length = 246 Score = 184 bits (468), Expect = 4e-54 Identities = 88/196 (44%), Positives = 129/196 (65%) Frame = +2 Query: 2 DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181 + + ++ I++ F+L SGL+I+ KS L G+GV S V A++I C V + +YLG+ Sbjct: 46 ENTIMNLVRIMRGFYLISGLKISHKKSHLFGIGVDPSTVHVTANNIHCKVGSFPCKYLGL 105 Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361 +VG ++ W +I L++RLSKWK TLSIGGR+TLLKSVL S P+Y S++K P G Sbjct: 106 LVGANMNQARHWSGVIEILKSRLSKWKASTLSIGGRITLLKSVLDSLPLYFFSLYKAPIG 165 Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541 VL +E IR RFF G D+ + K WV W++ + + GG G+ S +N +LL+KW WRF Sbjct: 166 VLDKLEVIRRRFFRGGDESKNKTNWVCWERVIGPREKGGTGIGSLRDMNLSLLVKWWWRF 225 Query: 542 LSQDGSLWCRIIRALY 589 ++DGSLW R+I A++ Sbjct: 226 KTEDGSLWKRVISAIH 241 >gb|KYP44439.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan] Length = 1142 Score = 198 bits (503), Expect = 6e-54 Identities = 105/274 (38%), Positives = 155/274 (56%), Gaps = 3/274 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N+ I +IL+ F LASGL++N SKS +G + + AS + V + F YLG+ + Sbjct: 619 
NIWTIKSILRLFELASGLKVNFSKSTFMGYNIESQWLQIMASVLHFRVGSTPFSYLGLPI 678 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G + W +I K++ RLS+WK TLS GGR+ LLKSVL S PIY LS K P+G++ Sbjct: 679 GANHRISSTWHPVIEKVKKRLSRWKCTTLSFGGRIALLKSVLHSIPIYFLSFLKAPKGII 738 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 ++ES+ F G DQ +KI WVAWD K+HGGLG+ A N +LL KW WR L Sbjct: 739 SSIESLFKSFLWGADQDNRKINWVAWDVVCRDKIHGGLGMKDLSAFNLSLLGKWHWRMLV 798 Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFDFL-SHCKKRVGNG 718 + SLW R+IR+LY + ++ +++ S+C K +GNG Sbjct: 799 EKNSLWVRVIRSLYDIASHLPNGSGAKGSRWWVDLNRIEEGDLVSNEWMSSNCCKVIGNG 858 Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 +++ FW D W+G L F RL+ + ++KN+++ Sbjct: 859 VDTKFWLDKWVGHGILAHTFSRLYQIAINKNVSI 892 >dbj|GAU34535.1| hypothetical protein TSUD_394090 [Trifolium subterraneum] Length = 916 Score = 197 bits (500), Expect = 9e-54 Identities = 109/276 (39%), Positives = 155/276 (56%), Gaps = 5/276 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N++ + IL F SGL+IN KS L GV ++ + + +AA +GC F YLG+ + Sbjct: 463 NVRALKAILLLFEATSGLKINFHKSMLFGVNINVTWLHEAAVVLGCRHGQLPFLYLGLPI 522 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G P +L W ++ ++R RLS WK K LS GGRL LLK VL S P+Y LS FK P G++ Sbjct: 523 GGDPRKLCFWYPLVDRIRKRLSGWKCKNLSYGGRLILLKFVLSSIPVYFLSFFKAPTGII 582 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 +ESI C+FF G + +KI W+ WD ++ +GGLGV N +LL KWVWR L Sbjct: 583 STLESIFCQFFWGGCEANRKIAWIKWDTICLNRENGGLGVRRLKEFNISLLGKWVWRCLV 642 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKT-----KGFDFLSHCKKRVG 712 ++ SLW ++RA YG R + ++++ G + + +++VG Sbjct: 643 ENDSLWSLVLRAKYG-EEGGRVRFSEGVGSSWWRGLNTVRSGVGLRDGRWLVDNIRRKVG 701 Query: 713 NGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820 G SLFW D W+ D PL F RL+ L +DKNI V Sbjct: 702 GGCGSLFWLDPWLEDNPLSRSFSRLYDLAVDKNILV 737 >gb|OTF85059.1| putative RNA-directed DNA polymerase, eukaryota, 
Reverse transcriptase zinc-binding domain protein [Helianthus annuus] Length = 1099 Score = 197 bits (500), Expect = 1e-53 Identities = 107/271 (39%), Positives = 149/271 (54%), Gaps = 4/271 (1%) Frame = +2 Query: 8 NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187 N + IL+ F LASGL++N+SK L GVGV EV A + C + FRYLG++V Sbjct: 627 NALNLRRILRCFNLASGLRVNLSKCSLYGVGVGDHEVSDMAYVLRCRAGSFPFRYLGLLV 686 Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367 G + + WD +I + RLS WK KTLS GGR+TL+KSVL + P Y S++K P VL Sbjct: 687 GANMNLVKNWDPVIKLFKNRLSIWKAKTLSFGGRITLIKSVLSALPTYFFSLYKAPLQVL 746 Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547 K +E +R FF G + + K+ W AW+KT+ +GGLG S N A+L KW WRF Sbjct: 747 KQLERLRRVFFWGGSEEKAKLNWTAWEKTIGPIEYGGLGFGSLQDANLAMLSKWWWRFKV 806 Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQS----LKTKGFDFLSHCKKRVGN 715 LW ++I AL+ +++ L+T+G D +G+ Sbjct: 807 DRNGLWRKVIWALHQSSRAWTFIPTKVSIIGPWKQITRCAGILETRGIDLSKSIIGILGS 866 Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDK 808 G++ FW D W G PL S+FP LFA+E +K Sbjct: 867 GVDIYFWVDIWFGTEPLASLFPNLFAIERNK 897
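
The per-hit statistics in the report above (Score, Expect, Identities, Positives, Gaps, Frame) follow a regular text pattern, so they can be pulled out with a regular expression even when the line breaks have been lost. The following is a minimal sketch, not an official parser: the function name `parse_hsp_stats` and the dictionary keys are assumptions chosen for illustration, and the pattern is only checked against the statistics lines as they appear in this report (the `Gaps` field is treated as optional because some hits omit it).

```python
import re

# Regex for the statistics block that precedes each alignment in legacy
# BLASTX text output. Whitespace is matched loosely so the pattern also
# works when the report has been flattened onto long lines.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s*"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics block found in `text` (hypothetical helper)."""
    hits = []
    for m in HSP_STATS.finditer(text):
        evalue = m.group("evalue")
        # Legacy BLAST can print very small E-values as "e-100"; make that parseable.
        if evalue.lower().startswith("e"):
            evalue = "1" + evalue
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": int(m.group("ident")),
            "align_len": int(m.group("length")),
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),  # 0 when the Gaps field is absent
            "frame": int(m.group("frame")),
        })
    return hits


if __name__ == "__main__":
    # Statistics line of the first hit (gb|KYP71190.1|), copied from the report.
    sample = (
        "Score = 204 bits (518), Expect = 1e-59 Identities = 109/274 (39%), "
        "Positives = 154/274 (56%), Gaps = 3/274 (1%) Frame = +2"
    )
    for hit in parse_hsp_stats(sample):
        pct = 100.0 * hit["identities"] / hit["align_len"]
        print(hit["bits"], hit["evalue"], f"{pct:.1f}% identity", hit["frame"])
```

Running the function over the whole report text would give one record per alignment, which makes it easier to sort or filter hits by E-value or percent identity than reading the flattened text.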