BLASTX nr result

ID: Chrysanthemum21_contig00039972 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00039972
         (821 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP71190.1| Putative ribonuclease H protein At1g65750 family ...   204   1e-59
dbj|GAU10476.1| hypothetical protein TSUD_420710, partial [Trifo...   203   5e-59
gb|KYP54863.1| Putative ribonuclease H protein At1g65750 family ...   204   2e-57
ref|XP_021995642.1| uncharacterized protein LOC110892803 [Helian...   199   3e-57
ref|XP_022032886.1| uncharacterized protein LOC110933974 [Helian...   207   6e-57
ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuc...   194   1e-56
gb|OTG26767.1| putative reverse transcriptase domain, Reverse tr...   203   3e-56
gb|KYP61726.1| Putative ribonuclease H protein At1g65750 family ...   198   3e-56
gb|OTG08794.1| putative RNA-directed DNA polymerase, eukaryota, ...   203   9e-56
gb|OTF94555.1| putative RNA-directed DNA polymerase, eukaryota, ...   203   9e-56
ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helian...   194   1e-55
gb|KYP47723.1| Putative ribonuclease H protein At1g65750 family ...   194   2e-55
gb|OTG29886.1| putative reverse transcriptase domain, Reverse tr...   199   9e-55
ref|XP_021995896.1| uncharacterized protein LOC110893084 [Helian...   200   1e-54
dbj|GAU22997.1| hypothetical protein TSUD_98260 [Trifolium subte...   197   1e-54
dbj|GAU29439.1| hypothetical protein TSUD_150090 [Trifolium subt...   199   2e-54
ref|XP_021987015.1| uncharacterized protein LOC110883607 [Helian...   184   4e-54
gb|KYP44439.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...   198   6e-54
dbj|GAU34535.1| hypothetical protein TSUD_394090 [Trifolium subt...   197   9e-54
gb|OTF85059.1| putative RNA-directed DNA polymerase, eukaryota, ...   197   1e-53

>gb|KYP71190.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 417

 Score =  204 bits (518), Expect = 1e-59
 Identities = 109/274 (39%), Positives = 154/274 (56%), Gaps = 3/274 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N+  I +IL+ F L +GL++N  KS+L G  ++   ++  A  + C V +  F YLG+ +
Sbjct: 114 NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 173

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G  P  +  W+ +I K++ RLSKWK  TLS GGR  LLKSVL S PIY LS FK P+G++
Sbjct: 174 GANPRCIKTWEPVISKVKKRLSKWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGII 233

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             +ES+   F  G D+  +KI WVAW +    K HGGLG+    A N ALL KW WR L 
Sbjct: 234 SKLESLFKLFLWGGDENHRKIAWVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLV 293

Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718
           + G  W R++ ++Y  G                 +  + S     FD F S C K VG+G
Sbjct: 294 EKGRFWHRVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDSAPYTSFDWFSSRCTKVVGDG 353

Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            N+ FW D W G  PL + + RLF++  DK+++V
Sbjct: 354 RNTFFWKDGWSGQGPLCNPYSRLFSIASDKDVSV 387


>dbj|GAU10476.1| hypothetical protein TSUD_420710, partial [Trifolium subterraneum]
          Length = 441

 Score =  203 bits (516), Expect = 5e-59
 Identities = 109/276 (39%), Positives = 154/276 (55%), Gaps = 5/276 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N++ +  IL  F   SGL++N  KS L+GV +  S + +AAS +GC V    F YLG+ +
Sbjct: 99  NVRALRVILVLFEKVSGLKVNFHKSMLVGVNIGESWLMEAASVLGCKVGKIPFMYLGLPI 158

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G  P RL  W+ I+  +R+RLS+WK + LS GGRL L+KSVL S P+Y LS FK P G++
Sbjct: 159 GGDPRRLAFWEPIVSNIRSRLSRWKNRLLSFGGRLILIKSVLTSLPVYALSFFKAPSGII 218

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
            ++ES+   FF G  +G +KI W++W      + HGGLGV      N ALL KW WR L 
Sbjct: 219 SSLESLLSSFFWGGGEGHRKIAWISWQTVCLGQEHGGLGVRQLREFNTALLGKWCWRMLV 278

Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLK-----TKGFDFLSHCKKRVG 712
             G +W R++ A YG                  REV  ++       G  F    ++RVG
Sbjct: 279 DKGGMWYRVLAARYG-EVAGRLAVGGRNGSAWWREVARIRDGDGAVGGAWFAESIERRVG 337

Query: 713 NGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
           NG ++ FW D W+   PL   + RLF L  +++I+V
Sbjct: 338 NGSDTSFWSDPWLDGVPLRVRYRRLFDLHFNQSISV 373


>gb|KYP54863.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 648

 Score =  204 bits (518), Expect = 2e-57
 Identities = 109/274 (39%), Positives = 154/274 (56%), Gaps = 3/274 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N+  I +IL+ F L +GL++N  KS+L G  ++   ++  A  + C V +  F YLG+ +
Sbjct: 104 NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 163

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G  P  +  W+ +I K++ RLSKWK  TLS GGR  LLKSVL S PIY LS FK P+G++
Sbjct: 164 GANPRCIKTWEPVISKVKKRLSKWKSSTLSFGGRSVLLKSVLNSIPIYYLSFFKAPQGII 223

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             +ES+   F  G D+  +KI WVAW +    K HGGLG+    A N ALL KW WR L 
Sbjct: 224 SKLESLFKLFLWGGDENHRKIAWVAWQEVCRGKEHGGLGILDLRAFNLALLGKWRWRLLV 283

Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718
           + G  W R++ ++Y  G                 +  + S     FD F S C K VG+G
Sbjct: 284 EKGRFWHRVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDSTPYTSFDWFSSRCTKVVGDG 343

Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            N+ FW D W G  PL + + RLF++  DK+++V
Sbjct: 344 RNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSV 377


>ref|XP_021995642.1| uncharacterized protein LOC110892803 [Helianthus annuus]
          Length = 466

 Score =  199 bits (506), Expect = 3e-57
 Identities = 109/277 (39%), Positives = 149/277 (53%), Gaps = 4/277 (1%)
 Frame = +2

Query: 2   DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181
           + N+K +   L+ F+LASGL+IN+ KS + G+GV   EV    + IGC   +    YLG+
Sbjct: 96  EDNVKNVARCLRIFYLASGLKINLQKSNIYGLGVGNDEVLNMCNVIGCKSDSIPLTYLGI 155

Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361
            VG   +R+N W  II     RLS WK KTLSIGGRLTL+ SVL S PIY  S++K P G
Sbjct: 156 SVGSNMNRINNWTPIIEVFDKRLSAWKAKTLSIGGRLTLINSVLESLPIYYFSLYKAPVG 215

Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541
           V+K +E+   +F         K+ WVAWD     K  GGLG++    +N ALL+KW WRF
Sbjct: 216 VIKTLEAKMRKFLWVGSSNINKMNWVAWDWVTWPKNIGGLGINRLLEVNEALLVKWGWRF 275

Query: 542 LSQDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRV 709
             ++ +LW +++ A +G                  + V      LK  G         +V
Sbjct: 276 RVENHNLWRKVVEACHGKANHWSFLPLNSNIAGCWKNVVKLLNKLKLNGRGLNRLILGKV 335

Query: 710 GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
           GNG+ + FW D W+GD P    +P LF LEL K+  V
Sbjct: 336 GNGVETRFWIDSWLGDLPFMERWPLLFGLELFKSCRV 372


>ref|XP_022032886.1| uncharacterized protein LOC110933974 [Helianthus annuus]
          Length = 1354

 Score =  207 bits (526), Expect = 6e-57
 Identities = 111/275 (40%), Positives = 156/275 (56%), Gaps = 4/275 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            NL+ I  +L+ F+L SGL+IN+ KS L GVGV  S+++  ++ +GC +    F YLG+ V
Sbjct: 835  NLEKIHRLLRIFYLCSGLKINIHKSVLFGVGVEDSDIEAMSNVLGCRIGRLPFVYLGIKV 894

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   +R++ W+ ++  +R RL+ WK K LSIGGRLTL+KSVL S P+Y LS+F+ P+ V+
Sbjct: 895  GANMNRISNWEPVLEAIRDRLTSWKTKVLSIGGRLTLIKSVLTSLPVYYLSLFRAPKAVV 954

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             N+E I   F     +  K + WV+W+    SK  GGLG+S    +N ALL KWVWRF +
Sbjct: 955  DNIEKIMRHFLWAGCKVGKGLHWVSWEVATKSKKSGGLGISKIAEVNSALLAKWVWRFKN 1014

Query: 548  QDGSLWCRIIRALYG----XXXXXXXXXXXXXXXXXMREVQSLKTKGFDFLSHCKKRVGN 715
               SLW RII  ++G                      R V  L   G  F +  K  +GN
Sbjct: 1015 DKNSLWKRIIEDIHGGRKRWIFLPVNNSIKGCWKSISRHVDGLNFNGQPFKALFKGSIGN 1074

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            G    FW D W G  PL + +PRL+A + +KN  V
Sbjct: 1075 GSRLRFWKDLWWGSTPLMNRWPRLYAQDSNKNAVV 1109


>ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuca sativa]
          Length = 355

 Score =  194 bits (494), Expect = 1e-56
 Identities = 103/275 (37%), Positives = 145/275 (52%), Gaps = 4/275 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N+K +  IL+ F ++SGL++N  KSQ+ G+GV   EV   A  +GC   N  F YLGV V
Sbjct: 37  NIKNLAGILRCFHVSSGLKVNFKKSQVFGIGVDSQEVLSLARPLGCEPANLPFTYLGVPV 96

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G        W  +I   + RLS WK K LS+GGRLTL KSV+GS P +  S+F  P G+L
Sbjct: 97  GANMKLKKYWKPVIENFQLRLSAWKSKNLSLGGRLTLTKSVIGSLPTFYFSLFIAPAGIL 156

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
           K +E IR RF  G  +  +KI WV+W K    K +GGLG+ S  ALN +L++KW WR   
Sbjct: 157 KALEKIRRRFLWGGSEDSRKINWVSWGKVTTPKENGGLGLGSLKALNLSLIMKWWWRLRV 216

Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQ----SLKTKGFDFLSHCKKRVGN 715
           ++  LW ++I  ++                   + +      L   G +      K VG 
Sbjct: 217 ENTCLWSKVIEGIHNLKNKPGDYMSKQSITGVWKNITQARGELMKVGINIEDVILKEVGT 276

Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
           G  ++FW D W G+  L + FP ++ LE  K+  V
Sbjct: 277 GEKTMFWHDRWTGNMTLKASFPEMYKLERHKHCMV 311


>gb|OTG26767.1| putative reverse transcriptase domain, Reverse transcriptase
            zinc-binding domain protein [Helianthus annuus]
          Length = 881

 Score =  203 bits (517), Expect = 3e-56
 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N   +  IL+ F+LASGL++N++KS + GVG+++ EV   A+ +GC   +  F++LG++V
Sbjct: 363  NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 422

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   + +  W  II   + RL+ WK K LS GGR+TLLKSVL + P Y  S++K P  V+
Sbjct: 423  GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 482

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
              ++ +R  FF G  + + K+ WVAWD  +A   +GGLG  S    N A+L KW WRF  
Sbjct: 483  DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 542

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715
            ++  LW R+I A++                   +++     SL+ KG D     +  VGN
Sbjct: 543  ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 602

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALE 799
            G N+LFW D WIG+ P H  FP LF+LE
Sbjct: 603  GSNTLFWLDLWIGNTPFHIRFPTLFSLE 630


>gb|KYP61726.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 554

 Score =  198 bits (504), Expect = 3e-56
 Identities = 105/274 (38%), Positives = 152/274 (55%), Gaps = 3/274 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N+  I +IL+ F L +GL++N  KS+L G  ++   ++  A  + C V +  F YLG+ +
Sbjct: 41  NVWAIKSILQIFELVAGLKVNFHKSKLFGFNINIEVLNLMAQFLNCKVGSLPFCYLGLPL 100

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G+ P  +  W+ +I KL+ RLSKWK  TLS GGR  LLKSVL S P Y LS FK P+G++
Sbjct: 101 GENPHCIKTWEPVISKLKKRLSKWKSSTLSFGGRSALLKSVLNSIPTYYLSFFKAPQGII 160

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             +ES+   F  G D+  +KI WVAW +    K HGGLG+    A N A+L KW W  L 
Sbjct: 161 SKLESLFKLFLWGGDENHRKIAWVAWQEVCKGKEHGGLGILDLRAFNLAILEKWRWHLLV 220

Query: 548 QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFD-FLSHCKKRVGNG 718
           + G  W +++ ++Y  G                 +  +       FD F S C K VG+G
Sbjct: 221 EKGRFWHKVVTSIYGEGCFQGVGDKVQSSKWWVDLWTIDFAPYASFDWFSSRCTKVVGDG 280

Query: 719 INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            N+ FW D W G  PL + + RLF++  DK+++V
Sbjct: 281 QNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSV 314


>gb|OTG08794.1| putative RNA-directed DNA polymerase, eukaryota, Reverse
            transcriptase zinc-binding domain protein [Helianthus
            annuus]
          Length = 1217

 Score =  203 bits (517), Expect = 9e-56
 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N   +  IL+ F+LASGL++N++KS + GVG+++ EV   A+ +GC   +  F++LG++V
Sbjct: 699  NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 758

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   + +  W  II   + RL+ WK K LS GGR+TLLKSVL + P Y  S++K P  V+
Sbjct: 759  GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 818

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
              ++ +R  FF G  + + K+ WVAWD  +A   +GGLG  S    N A+L KW WRF  
Sbjct: 819  DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 878

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715
            ++  LW R+I A++                   +++     SL+ KG D     +  VGN
Sbjct: 879  ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 938

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALE 799
            G N+LFW D WIG+ P H  FP LF+LE
Sbjct: 939  GSNTLFWLDLWIGNTPFHIRFPTLFSLE 966


>gb|OTF94555.1| putative RNA-directed DNA polymerase, eukaryota, Reverse
            transcriptase zinc-binding domain protein [Helianthus
            annuus]
          Length = 1282

 Score =  203 bits (517), Expect = 9e-56
 Identities = 105/268 (39%), Positives = 154/268 (57%), Gaps = 4/268 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N   +  IL+ F+LASGL++N++KS + GVG+++ EV   A+ +GC   +  F++LG++V
Sbjct: 562  NASNLRRILRCFYLASGLKVNLAKSSVYGVGINQHEVQSMATFLGCKSGSFPFKHLGLVV 621

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   + +  W  II   + RL+ WK K LS GGR+TLLKSVL + P Y  S++K P  V+
Sbjct: 622  GANMNLVKNWKPIIDLFKNRLAIWKAKQLSYGGRVTLLKSVLNALPTYFFSLYKAPNQVI 681

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
              ++ +R  FF G  + + K+ WVAWD  +A   +GGLG  S    N A+L KW WRF  
Sbjct: 682  DALDRLRRVFFWGGSEEKAKMNWVAWDNVIAPIEYGGLGFGSLKDANHAMLAKWWWRFKV 741

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREV----QSLKTKGFDFLSHCKKRVGN 715
            ++  LW R+I A++                   +++     SL+ KG D     +  VGN
Sbjct: 742  ENKGLWRRVIWAIHHNSRSWSAIPAKISMPGIWKQIVNIHHSLQQKGIDLFKAIRNVVGN 801

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALE 799
            G N+LFW D WIG+ P H  FP LF+LE
Sbjct: 802  GSNTLFWLDLWIGNTPFHIRFPTLFSLE 829


>ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helianthus annuus]
          Length = 445

 Score =  194 bits (494), Expect = 1e-55
 Identities = 110/275 (40%), Positives = 147/275 (53%), Gaps = 4/275 (1%)
 Frame = +2

Query: 8   NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
           N++    IL+ F+L SGL+IN+ KS L GVG    EVD     +GC      F YLG+ V
Sbjct: 100 NIQSTTKILRIFYLFSGLRINLYKSNLFGVGTEDMEVDNMMEILGCKRGGIPFVYLGIQV 159

Query: 188 GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
           G   +R++ W  II  ++ARL  WK KTLSIGGRL L+KSVL S PIY LS++K P+ V+
Sbjct: 160 GAKMTRISNWTSIIEVIKARLVSWKAKTLSIGGRLILIKSVLESLPIYYLSLYKAPKVVI 219

Query: 368 KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             +E+I  RF       E+KI WVAWD     K  GGL V+    +N ALLLKW WRF  
Sbjct: 220 DIIEAIMRRFLWAGSSAERKIPWVAWDIITTPKKKGGLCVTKLQEVNEALLLKWTWRFKK 279

Query: 548 QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTK----GFDFLSHCKKRVGN 715
           +  SLW +II   +G                  +++  +  K    G    S+    +G+
Sbjct: 280 EGNSLWKKIIMGCHGSSRPWAMLPCSASASGCWKQIVKVGEKKLPNGKSLNSYFVGMLGD 339

Query: 716 GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
           G    FW D W+ + PL   +P LF LE  K + V
Sbjct: 340 GSTINFWGDTWLREEPLRITYPNLFRLEKKKWVKV 374


>gb|KYP47723.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 434

 Score =  194 bits (492), Expect = 2e-55
 Identities = 109/273 (39%), Positives = 148/273 (54%)
 Frame = +2

Query: 2   DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181
           + N+  I +IL+ F LAS L+IN  KSQLLG  V    +   A  + C + +  F YLG+
Sbjct: 97  ESNIWAIKSILRLFELASRLKINFLKSQLLGFHVDTLWLQSMAMFLHCRIGSLPFTYLGL 156

Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361
            +G  P RL+ W  +I K++ RLS WK  ++S GGR+TLLKSVL S PIY LS FK PRG
Sbjct: 157 PIGANPKRLDTWQPVIEKIQKRLSSWKCDSMSFGGRITLLKSVLHSIPIYFLSFFKAPRG 216

Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541
           ++  +ES+   F  G D   KKI WVAWD     K  GGLG+    A N ALL KW WR 
Sbjct: 217 IISQLESLFKSFLWGGDADHKKIHWVAWDDVCREKNKGGLGIRDLIAFNLALLGKWKWRM 276

Query: 542 LSQDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFDFLSHCKKRVGNGI 721
           L +  SLW ++I +LYG                 +      + KG  +   C+K VGNG 
Sbjct: 277 LVETNSLWVKVINSLYG---------------DHLSFSSGSRVKGSRW---CRKVVGNGK 318

Query: 722 NSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
           N+ FW + W+    L   + RL+ +  +K   +
Sbjct: 319 NTYFWEEDWLQGGRLSQRYNRLYLIAENKKAKI 351


>gb|OTG29886.1| putative reverse transcriptase domain, Reverse transcriptase
            zinc-binding domain protein [Helianthus annuus]
          Length = 853

 Score =  199 bits (506), Expect = 9e-55
 Identities = 108/275 (39%), Positives = 150/275 (54%), Gaps = 4/275 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            NL+ +  IL+ F++ SGL+ N+ KS L GVG   +EVD     +GC      F YLG+ V
Sbjct: 333  NLQNMARILRIFYICSGLRTNIHKSNLFGVGTEDNEVDNMMEVLGCKRGAYPFFYLGIQV 392

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   SR++ W+ +I  ++ RL  WK +TLSIGGRL L+KSVL + PIY  S+++ P  V+
Sbjct: 393  GANMSRISNWNVVIEVVKRRLESWKARTLSIGGRLILIKSVLENLPIYYFSLYQAPMAVI 452

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             ++ESI  RF       EKKI WVAWD     K +GGLG+S    +N ALLLKW WRF  
Sbjct: 453  NSIESIMRRFLWAGSSEEKKIPWVAWDVIARPKNNGGLGISRLQDINEALLLKWTWRFKL 512

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTK----GFDFLSHCKKRVGN 715
            +  SLW ++I    G                  + +    +K    G +  S+    VG 
Sbjct: 513  EGNSLWKKVIVGCNGSSRAWTMLPCSSSASGCWKRIVKTGSKKIDNGRELNSYFVADVGA 572

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            G +  FW D W+ D PL  ++P LF LE +K + V
Sbjct: 573  GSSVNFWTDTWLRDQPLRDVYPNLFRLEKNKWVNV 607


>ref|XP_021995896.1| uncharacterized protein LOC110893084 [Helianthus annuus]
          Length = 1152

 Score =  200 bits (508), Expect = 1e-54
 Identities = 109/275 (39%), Positives = 152/275 (55%), Gaps = 4/275 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N+K I  +L+ F+L SGL+IN+ KS + GV     EVD     +GC   +  F YLG+ V
Sbjct: 635  NIKSIARVLRIFYLCSGLRINLHKSNIYGVCTDDLEVDNMMEVLGCKRGDFPFTYLGIKV 694

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   +R+  W+ ++  ++ RL+ WK K LSIGGRLTL+KSVL S P+Y  S++K P+ V+
Sbjct: 695  GAKMTRIINWEPVVDVIKGRLASWKAKHLSIGGRLTLIKSVLESLPVYYFSLYKAPKAVI 754

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             ++E    RF       EKKI+WVAW+     K  GGLG+S    +N ALLLKW WRF +
Sbjct: 755  DSIEMCMRRFLWADSYVEKKISWVAWEIVTLPKNQGGLGISKLQEVNDALLLKWTWRFKT 814

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMR---EVQSLK-TKGFDFLSHCKKRVGN 715
            +D  LW ++I   +G                  +   +V  +K   G    S     +G+
Sbjct: 815  EDSCLWKKVIMGCHGSSRPWAMLPCNASSSGCWKYIVKVGDIKVANGMPLHSFFVGNLGD 874

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            G +  FW D W+ DAPL  I+P LF LE DK I V
Sbjct: 875  GRSIYFWGDVWLRDAPLRIIYPNLFRLEKDKWIKV 909


>dbj|GAU22997.1| hypothetical protein TSUD_98260 [Trifolium subterraneum]
          Length = 767

 Score =  197 bits (502), Expect = 1e-54
 Identities = 106/277 (38%), Positives = 158/277 (57%), Gaps = 6/277 (2%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N++ +  +L  F + SGL++N +KS L+GV ++ S + +AAS +GC V    F YLG+ +
Sbjct: 207  NVRALWAVLMLFEVVSGLRVNFNKSMLVGVNIADSWLIEAASVLGCRVGTMTFMYLGLPI 266

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G  P RL+ W+ ++ ++R+RL  WK + LS GGRL LLK VL S P+Y LS FK P G++
Sbjct: 267  GGDPRRLSFWEPVVNRIRSRLVGWKSRFLSFGGRLVLLKLVLTSLPVYALSFFKAPSGII 326

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             ++ES+   FF G  +  +KI+W++W      +  GGLGV      N ALL KW WR L 
Sbjct: 327  SSIESLLNNFFWGGCEDRRKISWISWKIVCLREEAGGLGVRQLREFNMALLGKWCWRLLV 386

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFD------FLSHCKKRV 709
                LW R++ A YG                  RE+  ++ +  D      F +  ++RV
Sbjct: 387  NKSGLWYRVLAARYG-EEVGRLREGGRTGSAWWREIVRIRDEEGDVGERGWFAASIERRV 445

Query: 710  GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            GNG+++ FW D W+G  PL   + RLF L L+K+ TV
Sbjct: 446  GNGVDTFFWTDPWLGGVPLSVKYMRLFDLSLNKHRTV 482


>dbj|GAU29439.1| hypothetical protein TSUD_150090 [Trifolium subterraneum]
          Length = 919

 Score =  199 bits (505), Expect = 2e-54
 Identities = 112/277 (40%), Positives = 155/277 (55%), Gaps = 6/277 (2%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N++ +   L  F   SGL++N  KS L GV V+ + +  AA  +GC      F YLG+ +
Sbjct: 508  NVRTLKVTLLLFEAISGLKVNFHKSMLFGVNVNATWLHDAAVVLGCRHGQLPFLYLGLPI 567

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G  PS+L  W  ++ ++R +LS WK K LS GGRL LLKSVL S P+Y LS FK P G++
Sbjct: 568  GGDPSKLCFWHPLVDRIRKKLSGWKCKNLSFGGRLILLKSVLSSIPVYFLSFFKAPSGII 627

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
              +ESI C FF G  +  +KI W+ WD    ++++GGLGV      N ALL KWVWR L 
Sbjct: 628  STLESIFCHFFWGGCEVNRKIAWIKWDTICLNRVNGGLGVRRLKEFNIALLGKWVWRCLV 687

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKTKGFD------FLSHCKKRV 709
            ++ SLW  ++RA YG                  R + S+++ G         L + K++V
Sbjct: 688  ENDSLWSLVLRAKYGQEGGRVRFSEGVGSTWW-RALNSVRS-GVGVRDVRWLLDNIKRKV 745

Query: 710  GNGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            G G  SLFW D W+ D+P    F RL+ L +DKNI V
Sbjct: 746  GGGRGSLFWLDPWLEDSPFSRSFSRLYDLAVDKNILV 782


>ref|XP_021987015.1| uncharacterized protein LOC110883607 [Helianthus annuus]
          Length = 246

 Score =  184 bits (468), Expect = 4e-54
 Identities = 88/196 (44%), Positives = 129/196 (65%)
 Frame = +2

Query: 2   DGNLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGV 181
           +  +  ++ I++ F+L SGL+I+  KS L G+GV  S V   A++I C V +   +YLG+
Sbjct: 46  ENTIMNLVRIMRGFYLISGLKISHKKSHLFGIGVDPSTVHVTANNIHCKVGSFPCKYLGL 105

Query: 182 MVGQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRG 361
           +VG   ++   W  +I  L++RLSKWK  TLSIGGR+TLLKSVL S P+Y  S++K P G
Sbjct: 106 LVGANMNQARHWSGVIEILKSRLSKWKASTLSIGGRITLLKSVLDSLPLYFFSLYKAPIG 165

Query: 362 VLKNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRF 541
           VL  +E IR RFF G D+ + K  WV W++ +  +  GG G+ S   +N +LL+KW WRF
Sbjct: 166 VLDKLEVIRRRFFRGGDESKNKTNWVCWERVIGPREKGGTGIGSLRDMNLSLLVKWWWRF 225

Query: 542 LSQDGSLWCRIIRALY 589
            ++DGSLW R+I A++
Sbjct: 226 KTEDGSLWKRVISAIH 241


>gb|KYP44439.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 1142

 Score =  198 bits (503), Expect = 6e-54
 Identities = 105/274 (38%), Positives = 155/274 (56%), Gaps = 3/274 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N+  I +IL+ F LASGL++N SKS  +G  +    +   AS +   V +  F YLG+ +
Sbjct: 619  NIWTIKSILRLFELASGLKVNFSKSTFMGYNIESQWLQIMASVLHFRVGSTPFSYLGLPI 678

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G      + W  +I K++ RLS+WK  TLS GGR+ LLKSVL S PIY LS  K P+G++
Sbjct: 679  GANHRISSTWHPVIEKVKKRLSRWKCTTLSFGGRIALLKSVLHSIPIYFLSFLKAPKGII 738

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
             ++ES+   F  G DQ  +KI WVAWD     K+HGGLG+    A N +LL KW WR L 
Sbjct: 739  SSIESLFKSFLWGADQDNRKINWVAWDVVCRDKIHGGLGMKDLSAFNLSLLGKWHWRMLV 798

Query: 548  QDGSLWCRIIRALY--GXXXXXXXXXXXXXXXXXMREVQSLKTKGFDFL-SHCKKRVGNG 718
            +  SLW R+IR+LY                    +  ++       +++ S+C K +GNG
Sbjct: 799  EKNSLWVRVIRSLYDIASHLPNGSGAKGSRWWVDLNRIEEGDLVSNEWMSSNCCKVIGNG 858

Query: 719  INSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
            +++ FW D W+G   L   F RL+ + ++KN+++
Sbjct: 859  VDTKFWLDKWVGHGILAHTFSRLYQIAINKNVSI 892


>dbj|GAU34535.1| hypothetical protein TSUD_394090 [Trifolium subterraneum]
          Length = 916

 Score =  197 bits (500), Expect = 9e-54
 Identities = 109/276 (39%), Positives = 155/276 (56%), Gaps = 5/276 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N++ +  IL  F   SGL+IN  KS L GV ++ + + +AA  +GC      F YLG+ +
Sbjct: 463  NVRALKAILLLFEATSGLKINFHKSMLFGVNINVTWLHEAAVVLGCRHGQLPFLYLGLPI 522

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G  P +L  W  ++ ++R RLS WK K LS GGRL LLK VL S P+Y LS FK P G++
Sbjct: 523  GGDPRKLCFWYPLVDRIRKRLSGWKCKNLSYGGRLILLKFVLSSIPVYFLSFFKAPTGII 582

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
              +ESI C+FF G  +  +KI W+ WD    ++ +GGLGV      N +LL KWVWR L 
Sbjct: 583  STLESIFCQFFWGGCEANRKIAWIKWDTICLNRENGGLGVRRLKEFNISLLGKWVWRCLV 642

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQSLKT-----KGFDFLSHCKKRVG 712
            ++ SLW  ++RA YG                  R + ++++      G   + + +++VG
Sbjct: 643  ENDSLWSLVLRAKYG-EEGGRVRFSEGVGSSWWRGLNTVRSGVGLRDGRWLVDNIRRKVG 701

Query: 713  NGINSLFWFDCWIGDAPLHSIFPRLFALELDKNITV 820
             G  SLFW D W+ D PL   F RL+ L +DKNI V
Sbjct: 702  GGCGSLFWLDPWLEDNPLSRSFSRLYDLAVDKNILV 737


>gb|OTF85059.1| putative RNA-directed DNA polymerase, eukaryota, Reverse
            transcriptase zinc-binding domain protein [Helianthus
            annuus]
          Length = 1099

 Score =  197 bits (500), Expect = 1e-53
 Identities = 107/271 (39%), Positives = 149/271 (54%), Gaps = 4/271 (1%)
 Frame = +2

Query: 8    NLKGILNILKSFFLASGLQINVSKSQLLGVGVSRSEVDQAASSIGCSVMNNQFRYLGVMV 187
            N   +  IL+ F LASGL++N+SK  L GVGV   EV   A  + C   +  FRYLG++V
Sbjct: 627  NALNLRRILRCFNLASGLRVNLSKCSLYGVGVGDHEVSDMAYVLRCRAGSFPFRYLGLLV 686

Query: 188  GQCPSRLNAWDDIIFKLRARLSKWKVKTLSIGGRLTLLKSVLGSSPIYNLSIFKVPRGVL 367
            G   + +  WD +I   + RLS WK KTLS GGR+TL+KSVL + P Y  S++K P  VL
Sbjct: 687  GANMNLVKNWDPVIKLFKNRLSIWKAKTLSFGGRITLIKSVLSALPTYFFSLYKAPLQVL 746

Query: 368  KNMESIRCRFFNGMDQGEKKITWVAWDKTLASKLHGGLGVSSYFALNRALLLKWVWRFLS 547
            K +E +R  FF G  + + K+ W AW+KT+    +GGLG  S    N A+L KW WRF  
Sbjct: 747  KQLERLRRVFFWGGSEEKAKLNWTAWEKTIGPIEYGGLGFGSLQDANLAMLSKWWWRFKV 806

Query: 548  QDGSLWCRIIRALYGXXXXXXXXXXXXXXXXXMREVQS----LKTKGFDFLSHCKKRVGN 715
                LW ++I AL+                   +++      L+T+G D        +G+
Sbjct: 807  DRNGLWRKVIWALHQSSRAWTFIPTKVSIIGPWKQITRCAGILETRGIDLSKSIIGILGS 866

Query: 716  GINSLFWFDCWIGDAPLHSIFPRLFALELDK 808
            G++  FW D W G  PL S+FP LFA+E +K
Sbjct: 867  GVDIYFWVDIWFGTEPLASLFPNLFAIERNK 897

