BLASTX nr result

ID: Chrysanthemum22_contig00041120 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00041120
         (765 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|OMO92976.1| reverse transcriptase [Corchorus capsularis]           221   3e-62
gb|PNX84009.1| ribonuclease H [Trifolium pratense]                    196   4e-56
gb|EOY07997.1| Uncharacterized protein TCM_022315 [Theobroma cacao]   197   3e-55
ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuc...   188   2e-54
gb|PNY12727.1| ribonuclease H [Trifolium pratense]                    193   2e-53
dbj|GAU51457.1| hypothetical protein TSUD_413540, partial [Trifo...   188   4e-53
gb|PNX63822.1| ribonuclease H, partial [Trifolium pratense]           181   5e-53
gb|PNX99941.1| cysteine-rich receptor-like protein kinase, parti...   193   1e-52
ref|XP_022030427.1| uncharacterized protein LOC110931338 [Helian...   187   3e-52
gb|PNX83328.1| ribonuclease H, partial [Trifolium pratense]           187   4e-52
gb|OTG28613.1| putative RNA-directed DNA polymerase, eukaryota [...   191   5e-52
ref|XP_022007235.1| uncharacterized protein LOC110906407 [Helian...   179   6e-52
dbj|GAU30676.1| hypothetical protein TSUD_39000 [Trifolium subte...   189   8e-52
gb|KYP31897.1| Putative ribonuclease H protein At1g65750 family ...   191   1e-51
dbj|GAU27881.1| hypothetical protein TSUD_159750 [Trifolium subt...   190   2e-51
ref|XP_022027925.1| uncharacterized protein LOC110929112 isoform...   190   2e-51
ref|XP_022027924.1| uncharacterized protein LOC110929112 isoform...   190   2e-51
ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helian...   182   2e-51
gb|PNX79113.1| ribonuclease H, partial [Trifolium pratense]           186   3e-51
dbj|GAU19703.1| hypothetical protein TSUD_78280 [Trifolium subte...   181   4e-51

>gb|OMO92976.1| reverse transcriptase [Corchorus capsularis]
          Length = 1535

 Score =  221 bits (563), Expect = 3e-62
 Identities = 109/251 (43%), Positives = 153/251 (60%)
 Frame = -2

Query: 755  NVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPIGAN 576
            NV+R+LRCFQL +GLK+NF KS L G+  +  +++ W D++ C VG +P  YLGLP+GA 
Sbjct: 1236 NVRRILRCFQLVSGLKVNFAKSSLIGINANPDVIKQWADEVNCKVGSLPCSYLGLPLGAR 1295

Query: 575  PVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVMEEI 396
            P   ++W PVIE  + +L+ WK + +S AGR+VLIKSV  S P+YFMSLF +P  V  E+
Sbjct: 1296 PNAVAIWKPVIERCQKKLATWKARHLSTAGRLVLIKSVFASCPVYFMSLFNLPCAVKMEL 1355

Query: 395  EKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGNEKE 216
            EK+   F W     +RK+  +DW TI + K         L +RN ALL KW+WRYGNE E
Sbjct: 1356 EKLMRKFLWSGSDDKRKIHYVDWDTICKYKEFVGLGSIDLGLRNRALLNKWLWRYGNEHE 1415

Query: 215  ALWRKLIDAKYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHCVGKG 36
            +LW K+I  K   ++ S+ P+       S +W  I+  L S     +    GL+  VG G
Sbjct: 1416 SLWHKVIIGKNKLVDDSIIPS-GNIRHCSAIWNAISKPLRSGDSLSLFTKSGLMISVGDG 1474

Query: 35   DCTRFWDDHWV 3
               +FW+D+WV
Sbjct: 1475 KRVKFWEDNWV 1485


>gb|PNX84009.1| ribonuclease H [Trifolium pratense]
          Length = 471

 Score =  196 bits (497), Expect = 4e-56
 Identities = 103/263 (39%), Positives = 153/263 (58%), Gaps = 9/263 (3%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL  +K +LR F+L +GLK+NF KS L+G+ + D  +    D L C +G +PF YLGLP+
Sbjct: 146 NLWTIKAILRWFELISGLKVNFFKSKLFGINVGDDFINSAADFLNCKIGSLPFVYLGLPV 205

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           G NP   S W+PV+E  + RL+ WK K++S  GRVVL+ SVL ++P++++SLF++PV V 
Sbjct: 206 GGNPRRASTWNPVLEVLQRRLASWKNKYVSHGGRVVLLNSVLAAIPIFYLSLFKMPVGVW 265

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           ++I +++  F WG      K+  ++W  + RSK         LR+ N +LL KW WR  +
Sbjct: 266 KKIIRLQRRFLWGGAAGASKIPWVNWQDVCRSKKEGGLGVKDLRIMNISLLAKWKWRLLS 325

Query: 224 EKEALWRKLIDAKYGRIEY--SLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVH 51
           E +++W+ ++  KYG  E   +   TI   A ASP W  + +I       GVV   G+ H
Sbjct: 326 EGDSIWKNVLRDKYGGGESGDAWMSTIVPSAKASPWWNDLMNI-------GVV--NGIDH 376

Query: 50  CV-------GKGDCTRFWDDHWV 3
            V       G G  TRFW D W+
Sbjct: 377 LVGSFFKKIGNGGTTRFWHDSWL 399


>gb|EOY07997.1| Uncharacterized protein TCM_022315 [Theobroma cacao]
          Length = 667

 Score =  197 bits (501), Expect = 3e-55
 Identities = 100/253 (39%), Positives = 145/253 (57%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           +L+N KR+LRCFQ  +GL+INF KS L G+G ++  VR   + +     +IP  YLGLP+
Sbjct: 147 SLVNAKRILRCFQAISGLRINFHKSSLAGIGTNENFVRECAERINYMFEVIPMVYLGLPL 206

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            ANP     W P+IE F  RL+ WK K +S  GRV L++SVL+SLP+++MS+FQ+P  V+
Sbjct: 207 KANPNSIQTWKPIIEKFETRLAGWKAKTLSIGGRVALLRSVLSSLPIFYMSIFQIPKRVI 266

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           +E+EKI   F W     ++K+  ++W  +   K         + V+N ALL KW+WRYG+
Sbjct: 267 KELEKIERRFLWCGSEKKQKIHYIEWSKVCNYKENGGLGIINMEVKNRALLNKWLWRYGS 326

Query: 224 EKEALWRKLIDAKYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHCV 45
           E  +LWR++I  K G    +L P +      S VWK I   L   +   +  +  +   V
Sbjct: 327 EMGSLWREVIVKKVGGNLINLIPEMSANKRVSTVWKNIIKPLSPTNDFSLQVSTDMQLVV 386

Query: 44  GKGDCTRFWDDHW 6
           G G    FW D W
Sbjct: 387 GDGSRILFWADRW 399


>ref|XP_023737697.1| uncharacterized protein LOC111885685 [Lactuca sativa]
          Length = 355

 Score =  188 bits (477), Expect = 2e-54
 Identities = 96/253 (37%), Positives = 141/253 (55%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N+ N+  +LRCF +S+GLK+NF KS ++G+G+D   V      LGC    +PF YLG+P+
Sbjct: 37  NIKNLAGILRCFHVSSGLKVNFKKSQVFGIGVDSQEVLSLARPLGCEPANLPFTYLGVPV 96

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GAN   K  W PVIE+F+ RLS WK+K +S  GR+ L KSV+ SLP ++ SLF  P  ++
Sbjct: 97  GANMKLKKYWKPVIENFQLRLSAWKSKNLSLGGRLTLTKSVIGSLPTFYFSLFIAPAGIL 156

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           + +EKIR  F WG     RK+  + WG +T  K         L+  N +L+ KW WR   
Sbjct: 157 KALEKIRRRFLWGGSEDSRKINWVSWGKVTTPKENGGLGLGSLKALNLSLIMKWWWRLRV 216

Query: 224 EKEALWRKLIDAKYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHCV 45
           E   LW K+I+  +  ++      + ++++   VWK IT         G+     ++  V
Sbjct: 217 ENTCLWSKVIEGIH-NLKNKPGDYMSKQSITG-VWKNITQARGELMKVGINIEDVILKEV 274

Query: 44  GKGDCTRFWDDHW 6
           G G+ T FW D W
Sbjct: 275 GTGEKTMFWHDRW 287


>gb|PNY12727.1| ribonuclease H [Trifolium pratense]
          Length = 698

 Score =  193 bits (490), Expect = 2e-53
 Identities = 102/262 (38%), Positives = 151/262 (57%), Gaps = 8/262 (3%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL  +K +LR F+L +GLK+NF KS L+G+ + DG +      L C +G +PF YLGLP+
Sbjct: 166 NLWTIKAILRWFELISGLKVNFFKSKLFGINVGDGFINSATAFLKCKIGSLPFIYLGLPV 225

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP   S W+PVIE  + RL+ WK K++S  GRVVL+ SVL  +P++++SLF++PV V 
Sbjct: 226 GANPRRVSTWNPVIEVLQKRLASWKNKYVSLWGRVVLLNSVLAEIPIFYLSLFKMPVGVW 285

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           ++I +++  F WG      K+  ++W  + RSK         LR+ N +LL KW WR  +
Sbjct: 286 KKIIRLQRRFLWGGAAGASKISWVNWLDVCRSKKEGGLGVKDLRIMNISLLAKWKWRLLS 345

Query: 224 EKEALWRKLIDAKYGRIEYS---LSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLV 54
           E E++W+ ++  KYG  E     +S T+P  A  SP W  +  I       G+V     +
Sbjct: 346 EDESIWKNVLRDKYGSGEVGTVWMSRTLP-SAKVSPWWNDLMSI-------GMVVGVDHI 397

Query: 53  H-----CVGKGDCTRFWDDHWV 3
           H      +G G  T FW D W+
Sbjct: 398 HGMFFKKIGNGGTTSFWHDSWL 419


>dbj|GAU51457.1| hypothetical protein TSUD_413540, partial [Trifolium subterraneum]
          Length = 485

 Score =  188 bits (477), Expect = 4e-53
 Identities = 91/215 (42%), Positives = 129/215 (60%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL ++K VLR F+L +GLK+NF KS LYG+ +DD  +      L C V  IPFR+LG+P+
Sbjct: 163 NLWSLKTVLRSFELVSGLKVNFFKSKLYGINLDDNFLSAASSFLHCEVDSIPFRFLGIPV 222

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP  K  W+PV+E+ + RL+ W  + +S  G+V LI SVL+SLPLYF S F+VPVCV+
Sbjct: 223 GANPRRKITWNPVVEAMKKRLNAWNCRNLSIGGKVTLINSVLSSLPLYFFSFFKVPVCVL 282

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           +++  I+  F WG     +K+C + W TI   K         L   N+ALLCKW WR   
Sbjct: 283 QDLINIQRRFLWGGRSDIKKICWVSWDTICLPKDKGGLGIKNLNCFNQALLCKWKWRGLC 342

Query: 224 EKEALWRKLIDAKYGRIEYSLSPTIPRKALASPVW 120
           +   LW KL++ +YG +  ++     R      +W
Sbjct: 343 DHNTLWTKLLEHRYGSLADNVLRDTTRDVKGQSLW 377


>gb|PNX63822.1| ribonuclease H, partial [Trifolium pratense]
          Length = 254

 Score =  181 bits (460), Expect = 5e-53
 Identities = 85/201 (42%), Positives = 126/201 (62%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL  +K +LR F+L +GLKINF+KS LYGV +D  L+      L C    IPF++LGLP+
Sbjct: 41  NLWTIKSLLRGFELVSGLKINFVKSKLYGVNVDSSLLEAGAAFLSCKTAAIPFKFLGLPV 100

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP  +  W PV+++   RL+ W ++ +S  GR+ LI SVL S+PLYF S F+ P CV+
Sbjct: 101 GANPRRRETWKPVVDALTKRLNSWNSRQLSFGGRLSLINSVLASIPLYFFSFFKAPRCVL 160

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           + +E+I+ +FFWG    E+K+C + W  +   K         L + N ALL KW WR+ N
Sbjct: 161 KSLERIQRNFFWGGGTEEKKVCWIKWEQVCLPKEKGGLGVKNLELFNLALLSKWKWRFLN 220

Query: 224 EKEALWRKLIDAKYGRIEYSL 162
             +A+W +L+  +YG++  S+
Sbjct: 221 HNDAIWFELLRFRYGKLSSSV 241


>gb|PNX99941.1| cysteine-rich receptor-like protein kinase, partial [Trifolium
            pratense]
          Length = 1092

 Score =  193 bits (491), Expect = 1e-52
 Identities = 103/256 (40%), Positives = 148/256 (57%), Gaps = 3/256 (1%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            NL  +K +LR F+L +GLKINF+KS LYGV +D  L+      L C+   IPF++LGLP+
Sbjct: 698  NLWTIKSLLRGFELVSGLKINFVKSKLYGVNVDTRLLEAGAVFLSCNTAAIPFKFLGLPV 757

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GANP  +  W PV+E+   RL+ W ++ +S  GR+ LI SVL S+PLYF S ++ P CV+
Sbjct: 758  GANPRRRETWKPVVEALTKRLNSWNSRLLSFGGRLSLINSVLASIPLYFFSFYKAPRCVL 817

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
              +E+I+ +FFWG    ERKLC + W  +   K         L   N ALL KW WR+ N
Sbjct: 818  NSLERIQRNFFWGGGLEERKLCWIKWEQVCLPKEKGGLGVKNLEFFNLALLSKWKWRFLN 877

Query: 224  EKEALWRKLIDAKYGRIEYSL--SPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVH 51
            + +A+W  L+  +YG++  S+  S  +  + L+S  W+ I       S  G  +N   + 
Sbjct: 878  DNDAIWAALLRFRYGKLSSSVLTSRALGGRDLSSLWWRDIIYKGRDFSDGGFSSN---IS 934

Query: 50   C-VGKGDCTRFWDDHW 6
            C VG GD   FW+  W
Sbjct: 935  CRVGNGDNIDFWNFKW 950


>ref|XP_022030427.1| uncharacterized protein LOC110931338 [Helianthus annuus]
          Length = 543

 Score =  187 bits (475), Expect = 3e-52
 Identities = 101/238 (42%), Positives = 134/238 (56%), Gaps = 1/238 (0%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N+L + R+LR   L +GLK+N  KS L+G+G+DD  V      + C VG +PF +LG+PI
Sbjct: 100 NILALNRLLRWLNLLSGLKVNRQKSKLFGIGVDDAEVARLAQVVSCDVGSLPFTHLGIPI 159

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           G N      W PV+E F ARLSKWK   +S AGR+ LIKSVL SLP YF+SLF  P  V+
Sbjct: 160 GVNMKRAKCWKPVLEKFSARLSKWKAAHLSFAGRLTLIKSVLGSLPSYFLSLFAAPKGVI 219

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            ++EKIR  F WGK    RKL  M W  + +SK         +R  N A+L KW WR+  
Sbjct: 220 NKLEKIRRDFLWGKTSAGRKLRWMRWSLLLKSKKYGGLGVGSIRDFNLAMLAKWWWRFKE 279

Query: 224 EKEALWRKLIDAKY-GRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLV 54
               LW  ++DA + GR      P IP K     VWK +  +  S + +G+   + LV
Sbjct: 280 NPNQLWAIVVDAIHKGRASNGNPPFIPVKKTLPGVWKDVASVDGSLAKNGINIKENLV 337


>gb|PNX83328.1| ribonuclease H, partial [Trifolium pratense]
          Length = 573

 Score =  187 bits (475), Expect = 4e-52
 Identities = 87/195 (44%), Positives = 124/195 (63%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N+  +K +LR F+L +GLKINF+KS LYG+ +DD  +      L CS   IPF++LG+P+
Sbjct: 149 NVKTIKTILRGFELVSGLKINFVKSKLYGINVDDNFIAAAASFLNCSFDSIPFKFLGIPV 208

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP  +  W P++ES   RLS W  + +S  GRV LI SVL+SLPLYF S ++ P C++
Sbjct: 209 GANPRRQESWQPIVESLTKRLSSWSGRNLSIGGRVTLINSVLSSLPLYFFSFYKAPRCII 268

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            ++ +I+ +F WG    ++KLC + W  +   K         L + N ALLCKW WRY N
Sbjct: 269 NKLVRIQRNFLWGGGLEDKKLCWIKWEQVCLPKDKGGLGVKDLELFNTALLCKWKWRYLN 328

Query: 224 EKEALWRKLIDAKYG 180
           EK+ALW +L+  +YG
Sbjct: 329 EKDALWYELLCYRYG 343


>gb|OTG28613.1| putative RNA-directed DNA polymerase, eukaryota [Helianthus annuus]
          Length = 1008

 Score =  191 bits (486), Expect = 5e-52
 Identities = 99/254 (38%), Positives = 144/254 (56%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            N+  + R+LR F   +GLKINF KS L+GVG++D  + +    +GC VG  PF YLG+P+
Sbjct: 690  NVRVITRILRVFYACSGLKINFFKSHLFGVGVEDEALSLMASRVGCLVGEAPFNYLGIPL 749

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GAN      WDP+++ F+ RLS WK   +S  GRV+LIK+VL SLP+YF SLF+ PV V+
Sbjct: 750  GANMNRVKNWDPIVKIFKGRLSSWKASSLSIGGRVILIKAVLESLPIYFFSLFKAPVKVI 809

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            E++E +  +F WG     RK+  + W  +TR K         L++ NEALL KW+WR+  
Sbjct: 810  EKLEALMRNFLWGGSEEVRKMSWVAWDVVTRPKRYGGLGINKLKLVNEALLSKWIWRFKY 869

Query: 224  EKEALWRKLIDAKYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHCV 45
            + ++LW K++ A +G         +P     S  W  I  +      +G   N  +   +
Sbjct: 870  DVDSLWSKVVAACHG--NNRAWSVLPFNTAISGTWLNIVRLEKKLIINGQKINNLIKGVL 927

Query: 44   GKGDCTRFWDDHWV 3
            G G   RFW D W+
Sbjct: 928  GDGKRIRFWIDVWL 941


>ref|XP_022007235.1| uncharacterized protein LOC110906407 [Helianthus annuus]
          Length = 258

 Score =  179 bits (453), Expect = 6e-52
 Identities = 89/195 (45%), Positives = 122/195 (62%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N  N+KR+LR F L +GLKIN  K +LYGVG++D  V    + LGC  G +PF YLG+ +
Sbjct: 47  NFTNMKRILRIFYLCSGLKINLHKLVLYGVGVEDADVVGMAETLGCKQGALPFSYLGIKV 106

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GAN    S W+PV+ SF+ RLSKWK   +S AGR+ LIKSVL+SLP Y+ SLF+ P  V+
Sbjct: 107 GANMNRVSNWEPVVSSFKRRLSKWKANTLSIAGRLTLIKSVLDSLPTYYFSLFKAPKKVI 166

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            E+E +   F WG     RK+  + W  +T+           L V N A+L KW+WR+ N
Sbjct: 167 GELEGLMRRFLWGGTEDVRKMSWVSWEIVTKRIKDGGLGIAPLEVNNNAMLVKWLWRFLN 226

Query: 224 EKEALWRKLIDAKYG 180
           E  ALWR+++ + +G
Sbjct: 227 EPNALWRRVVVSIHG 241


>dbj|GAU30676.1| hypothetical protein TSUD_39000 [Trifolium subterraneum]
          Length = 748

 Score =  189 bits (480), Expect = 8e-52
 Identities = 96/256 (37%), Positives = 146/256 (57%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL  +K +LR F++ +GLKINF KS LYG+ +DD  +      L C   +IPF++LG+P+
Sbjct: 163 NLWTIKTMLRGFEMVSGLKINFTKSKLYGINVDDRFLEAGSTFLSCRSDVIPFKFLGIPV 222

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP  +  W PV+E+   RLS+W    +S  GR+ LI SVL SLPLYF S F+ P+CV+
Sbjct: 223 GANPRRRETWRPVVEAMSKRLSRWSGGHLSYGGRITLINSVLASLPLYFFSFFKAPICVL 282

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            ++  I+ +F WG    E+KLC + W  I   +         L++ N ALL KW WR  N
Sbjct: 283 NQLVSIQRNFLWGGGMEEKKLCWVKWDHICLPRDVGGLGVKNLKLFNIALLSKWKWRCVN 342

Query: 224 EKEALWRKLIDAKYGRIEYSLSPTIPR--KALASPVWKKITDILCSRSPSGVVANKGLVH 51
           + EA+W  ++  +YG +   +   +P   ++ AS  WK + +I  S       +N  ++ 
Sbjct: 343 DSEAIWMDVLRYRYGHLPSVILNGVPTTCESKASIWWKDLANIGESFGSDWFKSNISII- 401

Query: 50  CVGKGDCTRFWDDHWV 3
            +G G+   FW + W+
Sbjct: 402 -IGDGNNIAFWKNKWL 416


>gb|KYP31897.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1101

 Score =  191 bits (484), Expect = 1e-51
 Identities = 99/255 (38%), Positives = 150/255 (58%), Gaps = 1/255 (0%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            N++ +K ++RCF+L +GLK+NF+KS    +G++D ++  +   L C +  +PF YLGLPI
Sbjct: 607  NVVVIKAIMRCFELISGLKVNFIKSKFGAIGLEDQVLERFAQLLNCKLLKLPFNYLGLPI 666

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GANP  K  W PV+E  + RLS WK K +S AGRV LI SVL SLPLY++S +++P  V 
Sbjct: 667  GANPRRKVTWIPVVEKIKKRLSNWKGKVLSMAGRVQLINSVLTSLPLYYLSFYKIPKGVC 726

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
             EI +++  F WG +   +K+  + W  IT +K         + + N+ALL KW W   +
Sbjct: 727  SEISRLQRQFLWGSNQGSKKMAWIKWQKITVAKEKGGLNIKDINLFNKALLAKWRWCLFH 786

Query: 224  EKEALWRKLIDAKYGRIEYSLSPTIPRKALAS-PVWKKITDILCSRSPSGVVANKGLVHC 48
              EALW KL+ +KYG  +   +    +KAL +  +W K   ++C    S    ++ +   
Sbjct: 787  NPEALWVKLLCSKYGGFQNLCA----QKALQNDSIWWKDLIMVCGGLESEGWFDRQVEWK 842

Query: 47   VGKGDCTRFWDDHWV 3
            V +G   RFW D+WV
Sbjct: 843  VNRGSAVRFWLDNWV 857


>dbj|GAU27881.1| hypothetical protein TSUD_159750 [Trifolium subterraneum]
          Length = 1175

 Score =  190 bits (483), Expect = 2e-51
 Identities = 102/263 (38%), Positives = 147/263 (55%), Gaps = 9/263 (3%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            NL   K +LR F+L +GLK+NF KS LYG+ + D  + V      C VG +PF YLGLP+
Sbjct: 643  NLWTTKAILRWFELISGLKVNFFKSKLYGINVCDDFINVAASFFKCKVGKLPFIYLGLPV 702

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GANP   + W+PVIE  + RL+ WK K++S  GRVVL+ SVL+++P++++SLF++P  V 
Sbjct: 703  GANPRRAATWNPVIEVLQKRLASWKNKYVSLGGRVVLLNSVLSAIPIFYLSLFKMPAGVW 762

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            ++I  ++  F WG      K+  + W  + R K         LR+ N +LL KW WR  +
Sbjct: 763  KKIVSLQRRFLWGGAAGSSKISWVKWTDVCRPKKEGGLGVKDLRIMNISLLAKWKWRLLS 822

Query: 224  EKEALWRKLIDAKYGRIEYSLS--PTIPRKALASPVWKKITDILCSRSPSGVVANKGLVH 51
            E EA+W+ +I  +YG  E  +     +   + ASP W  +  I       GVVA  G+ H
Sbjct: 823  EGEAIWKNIIRERYGGGERGVGWMSKVRVSSKASPWWNDLMTI-------GVVA--GVDH 873

Query: 50   C-------VGKGDCTRFWDDHWV 3
                    +G G  T FW D WV
Sbjct: 874  LSGIFFKKIGNGGATSFWHDSWV 896


>ref|XP_022027925.1| uncharacterized protein LOC110929112 isoform X2 [Helianthus annuus]
          Length = 1080

 Score =  190 bits (482), Expect = 2e-51
 Identities = 103/255 (40%), Positives = 142/255 (55%), Gaps = 1/255 (0%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            N++N++R+L CF L++GLK+N  K  +YG+G+ D  V+     L C  G+ PF++LGL +
Sbjct: 559  NVINLRRILCCFYLTSGLKVNLAKCSVYGIGVSDQEVQSMAGLLNCKPGVFPFKHLGLLV 618

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GAN      W  V+E FR RLS WK K +S  GR+ L+KSVLNSLP Y+ SLF+ PV V+
Sbjct: 619  GANMNLVRNWKSVVEIFRNRLSIWKAKHLSYGGRITLLKSVLNSLPTYYFSLFKAPVQVL 678

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            E +E+IR  FFWG    + K+  + W                L+  N A+L KW WR+  
Sbjct: 679  ESLERIRRVFFWGGSEEKAKMNWVAWEKTIGPIEYGGLGFGSLKDANLAMLAKWWWRFKT 738

Query: 224  EKEALWRKLIDA-KYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHC 48
            EK  LWRK+I A  +    +S    IP K   +  WK+I  I  +    G+   K +   
Sbjct: 739  EKTGLWRKVIWAIHHNSRSWS---AIPAKVSIAGPWKQIVSIHDALLKVGIDLKKAISIS 795

Query: 47   VGKGDCTRFWDDHWV 3
            V  G CT FW D WV
Sbjct: 796  VANGSCTSFWLDSWV 810


>ref|XP_022027924.1| uncharacterized protein LOC110929112 isoform X1 [Helianthus annuus]
          Length = 1081

 Score =  190 bits (482), Expect = 2e-51
 Identities = 103/255 (40%), Positives = 142/255 (55%), Gaps = 1/255 (0%)
 Frame = -2

Query: 764  NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
            N++N++R+L CF L++GLK+N  K  +YG+G+ D  V+     L C  G+ PF++LGL +
Sbjct: 559  NVINLRRILCCFYLTSGLKVNLAKCSVYGIGVSDQEVQSMAGLLNCKPGVFPFKHLGLLV 618

Query: 584  GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
            GAN      W  V+E FR RLS WK K +S  GR+ L+KSVLNSLP Y+ SLF+ PV V+
Sbjct: 619  GANMNLVRNWKSVVEIFRNRLSIWKAKHLSYGGRITLLKSVLNSLPTYYFSLFKAPVQVL 678

Query: 404  EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
            E +E+IR  FFWG    + K+  + W                L+  N A+L KW WR+  
Sbjct: 679  ESLERIRRVFFWGGSEEKAKMNWVAWEKTIGPIEYGGLGFGSLKDANLAMLAKWWWRFKT 738

Query: 224  EKEALWRKLIDA-KYGRIEYSLSPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVHC 48
            EK  LWRK+I A  +    +S    IP K   +  WK+I  I  +    G+   K +   
Sbjct: 739  EKTGLWRKVIWAIHHNSRSWS---AIPAKVSIAGPWKQIVSIHDALLKVGIDLKKAISIS 795

Query: 47   VGKGDCTRFWDDHWV 3
            V  G CT FW D WV
Sbjct: 796  VANGSCTSFWLDSWV 810


>ref|XP_021986150.1| uncharacterized protein LOC110882438 [Helianthus annuus]
          Length = 445

 Score =  182 bits (463), Expect = 2e-51
 Identities = 103/256 (40%), Positives = 141/256 (55%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N+ +  ++LR F L +GL+IN  KS L+GVG +D  V    + LGC  G IPF YLG+ +
Sbjct: 100 NIQSTTKILRIFYLFSGLRINLYKSNLFGVGTEDMEVDNMMEILGCKRGGIPFVYLGIQV 159

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GA     S W  +IE  +ARL  WK K +S  GR++LIKSVL SLP+Y++SL++ P  V+
Sbjct: 160 GAKMTRISNWTSIIEVIKARLVSWKAKTLSIGGRLILIKSVLESLPIYYLSLYKAPKVVI 219

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           + IE I   F W     ERK+  + W  IT  K         L+  NEALL KW WR+  
Sbjct: 220 DIIEAIMRRFLWAGSSAERKIPWVAWDIITTPKKKGGLCVTKLQEVNEALLLKWTWRFKK 279

Query: 224 EKEALWRKLIDAKYGRIEYSLSP--TIPRKALASPVWKKITDILCSRSPSGVVANKGLVH 51
           E  +LW+K+I   +G    S  P   +P  A AS  WK+I  +   + P+G   N   V 
Sbjct: 280 EGNSLWKKIIMGCHG----SSRPWAMLPCSASASGCWKQIVKVGEKKLPNGKSLNSYFVG 335

Query: 50  CVGKGDCTRFWDDHWV 3
            +G G    FW D W+
Sbjct: 336 MLGDGSTINFWGDTWL 351


>gb|PNX79113.1| ribonuclease H, partial [Trifolium pratense]
          Length = 615

 Score =  186 bits (471), Expect = 3e-51
 Identities = 87/256 (33%), Positives = 150/256 (58%), Gaps = 2/256 (0%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           NL ++K +LR F+  +GLK+NF KS + GV + +  +R+    L C VG +PF+YLGLP+
Sbjct: 116 NLWSLKAILRGFEQVSGLKVNFFKSCVMGVNVSNDFIRLASAFLNCRVGSVPFKYLGLPV 175

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP     W+P++E+ R RL  W  K++S  GR+VL+ +VLN++P++++S  ++PV V 
Sbjct: 176 GANPRRARTWEPLLEALRQRLGVWGNKYVSLGGRIVLLNAVLNAIPIFYLSFIKIPVLVW 235

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           +++++I+  F WG      ++  + W T+ + K         +R  N +LL KW WR   
Sbjct: 236 KKVQRIQREFLWGSKGGRNRISWVKWDTVCKPKKLGGLGVRDIRAVNISLLAKWRWRLLE 295

Query: 224 EKEALWRKLIDAKYGRIEYSLSPTIP--RKALASPVWKKITDILCSRSPSGVVANKGLVH 51
           +  A+W++++ +KYG +   +       +   +S  WK I  I  + + +     +G++ 
Sbjct: 296 DDNAMWKEVLKSKYGELVTGMVTVGEDCKPWFSSTWWKDIWSIRVNLNTNWF--TQGVIK 353

Query: 50  CVGKGDCTRFWDDHWV 3
            +G GD T+FW D WV
Sbjct: 354 RIGCGDQTKFWRDIWV 369


>dbj|GAU19703.1| hypothetical protein TSUD_78280 [Trifolium subterraneum]
          Length = 424

 Score =  181 bits (460), Expect = 4e-51
 Identities = 96/256 (37%), Positives = 144/256 (56%), Gaps = 3/256 (1%)
 Frame = -2

Query: 764 NLLNVKRVLRCFQLSAGLKINFLKSMLYGVGIDDGLVRVWGDDLGCSVGLIPFRYLGLPI 585
           N+  +K +LR F+L + LKINF+KS LYG+ ID+ L+      L C    +PF++LG+P+
Sbjct: 41  NIWTIKSLLRGFELVSRLKINFVKSKLYGINIDNSLLNAGAAFLSCKTASVPFKFLGIPV 100

Query: 584 GANPVFKSVWDPVIESFRARLSKWKTKFISRAGRVVLIKSVLNSLPLYFMSLFQVPVCVM 405
           GANP  +  W+P++E+   RL+ W    +S  GRV LI SVL+S+PLYF S F+ P CV+
Sbjct: 101 GANPRRRETWNPILEALTKRLNSWTGHHLSYGGRVTLINSVLSSMPLYFFSFFKAPRCVI 160

Query: 404 EEIEKIRLSFFWGKDPCERKLCTMDWGTITRSKPXXXXXXXXLRVRNEALLCKWVWRYGN 225
           ++IEKI+ +F WG    E+K+  + W  +   K         L + N ALL KW WR+ N
Sbjct: 161 KDIEKIQRTFLWGGSLDEKKINWVKWDHVCLPKKNGGLGVKNLELFNIALLSKWRWRFLN 220

Query: 224 EKEALWRKLIDAKYGRIEYSL--SPTIPRKALASPVWKKITDILCSRSPSGVVANKGLVH 51
              A+W  L+  +YG +  SL     +    ++S  W+   D++ S +   V   K  + 
Sbjct: 221 HDNAIWNDLLRHRYGHLPSSLLGKHDLISGGISSLWWR---DVISSGNICNVDWFKSNIG 277

Query: 50  C-VGKGDCTRFWDDHW 6
           C VG G+   FW   W
Sbjct: 278 CRVGNGNDIEFWHFKW 293


Top