BLASTX nr result

ID: Chrysanthemum21_contig00017632 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00017632
         (2388 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022014041.1| uncharacterized protein LOC110913529 [Helian...   800   0.0  
gb|OTG21088.1| putative ribonuclease H-like domain-containing pr...   784   0.0  
gb|OTG33444.1| putative ribonuclease H-like domain-containing pr...   780   0.0  
ref|XP_022042162.1| uncharacterized protein LOC110944826 [Helian...   760   0.0  
gb|OTG37431.1| putative ribonuclease H-like domain-containing pr...   779   0.0  
gb|OTG16942.1| putative ribonuclease H-like domain-containing pr...   775   0.0  
ref|XP_021986042.1| uncharacterized protein LOC110882290 [Helian...   749   0.0  
ref|XP_021980336.1| uncharacterized protein LOC110876473 [Helian...   694   0.0  
ref|XP_021971692.1| uncharacterized protein LOC110866849 [Helian...   566   0.0  
ref|XP_022014401.1| uncharacterized protein LOC110913892 [Helian...   546   0.0  
ref|XP_021991826.1| uncharacterized protein LOC110888615 [Helian...   492   e-162
ref|XP_022023932.1| uncharacterized protein LOC110924205 [Helian...   471   e-154
emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]   478   e-147
gb|PNX93622.1| retrovirus-related Pol polyprotein from transposo...   474   e-146
gb|OMO88216.1| Integrase, catalytic core [Corchorus capsularis]       476   e-145
gb|PNX92904.1| retrovirus-related Pol polyprotein from transposo...   472   e-145
gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposo...   451   e-144
ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotei...   469   e-144
gb|PNX93517.1| retrovirus-related Pol polyprotein from transposo...   468   e-143
ref|XP_017415203.1| PREDICTED: retrovirus-related Pol polyprotei...   459   e-140

>ref|XP_022014041.1| uncharacterized protein LOC110913529 [Helianthus annuus]
          Length = 784

 Score =  800 bits (2067), Expect = 0.0
 Identities = 402/795 (50%), Positives = 536/795 (67%), Gaps = 10/795 (1%)
 Frame = -2

Query: 2357 NNESVINSSEL---KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHN 2187
            NN+S++++S     K+  GD LYLHPSD++   I+++KL G++NY +W+ AM  AL+  N
Sbjct: 5    NNDSLVSTSGTLVSKIDAGDPLYLHPSDSANLTIVNIKLKGTDNYNVWANAMNLALQVKN 64

Query: 2186 KLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKE 2007
            KLGF+DG+C + T +  L  QWD C+S+VLTWILNS+S EL+ G +YSK A  +W +LKE
Sbjct: 65   KLGFIDGSCARSTTDEVLGKQWDRCNSIVLTWILNSVSEELYLGHVYSKLASVVWKELKE 124

Query: 2006 TYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEK 1827
            TYDKVDGS VFN+++ INS SQNG  +++YY+ LN +WKQ D ++SLPTCTCDA   F  
Sbjct: 125  TYDKVDGSVVFNLYQKINSFSQNGMPVSEYYHKLNCMWKQLDQILSLPTCTCDASKQFND 184

Query: 1826 HNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPN 1647
             N LIKLMQFLMGLD  Y  +R+ +LT++ LPSVK AFS++S EESHRN ++ +  I  N
Sbjct: 185  FNHLIKLMQFLMGLDSVYQSVRTTLLTREVLPSVKEAFSVVSREESHRNSNNFSEKISNN 244

Query: 1646 VSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRC 1467
               FA                                      NLKCS+CNK GHT+DRC
Sbjct: 245  PVGFAVKTSQSFDSKKKNVRPPNP-------------------NLKCSHCNKTGHTVDRC 285

Query: 1466 YGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND 1287
            Y LVGYP+    +  TS  K               E ++S++   L++EQ+ +L+SLLND
Sbjct: 286  YELVGYPSWMKSK--TSGNKGGRASNNVVVDAS--ETTSSSTVNGLTNEQIAQLLSLLND 341

Query: 1286 SPAHSNMAGKCFSG----TFFNAS---VKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGAN 1128
              + S   G  F+G      FN+    +K    F+  +  + NF       GWI+DSGAN
Sbjct: 342  K-SRSETQGNNFAGRSNYVCFNSYADVLKPTCDFKPAYCFS-NFGNNGKKAGWIIDSGAN 399

Query: 1127 QHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSL 948
            QHM      LIN +DV++  + V HPNGT AL+T+IGD+K+++ +IL+DV V+P+Y V+L
Sbjct: 400  QHMITDDTNLINQMDVTEYNIKVKHPNGTSALVTKIGDVKLSDKVILYDVFVIPDYCVNL 459

Query: 947  LSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVS 768
            +SVHKLA+D  L+VSFDE  CYIQD  TK+ +  G+Q  GLY    +T    + N  ++ 
Sbjct: 460  VSVHKLAKDCNLTVSFDEHNCYIQDSQTKKVLVTGSQLDGLYFCGNSTMSDKVCN-ASLD 518

Query: 767  RNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQ 588
             NLWH RLGHP++ VLHVLK  L++ K       C+TCH+AKQ REPFPLS+HK+  +G 
Sbjct: 519  VNLWHARLGHPAEPVLHVLKDKLDIKKNV-KLEPCETCHRAKQHREPFPLSDHKTKSLGD 577

Query: 587  LVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFES 408
            LVHLDVWGPY++ S+DG++YFLT+VDDY+RAVWVY++KNK +V+ +I  F  ML  QF  
Sbjct: 578  LVHLDVWGPYRVQSRDGFRYFLTVVDDYTRAVWVYLMKNKDEVFYNIKGFFNMLKTQFNK 637

Query: 407  NIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQ 228
            +IK+FRSDNGTEF+N +M+ F +  GI HQTSCVYTPQQNGI ERKHRHLLNVAR+L+FQ
Sbjct: 638  HIKMFRSDNGTEFINKQMKDFCYNNGIIHQTSCVYTPQQNGIVERKHRHLLNVARALLFQ 697

Query: 227  GELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNN 48
               PL  W EC+LTA YLINR PSSVL+G+SP+ LV+G  P LS +R+ GCLC++T+LNN
Sbjct: 698  AGFPLKFWSECILTATYLINRTPSSVLNGRSPYELVFGFAPELSQLRIVGCLCFSTVLNN 757

Query: 47   SDKFYSRSEKSVLIG 3
             DKF S +EK VL+G
Sbjct: 758  FDKFNSHAEKCVLVG 772


>gb|OTG21088.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1460

 Score =  784 bits (2025), Expect = 0.0
 Identities = 390/778 (50%), Positives = 524/778 (67%), Gaps = 4/778 (0%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145
            KL  GD LYLHPSD+S   I+SVKL G+ENY +WS AM  AL   NK GF+DG  +K   
Sbjct: 16   KLDIGDPLYLHPSDSSSLTIVSVKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75

Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965
            +  LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++
Sbjct: 76   DEILAAQWDRCNSVVLTWLLNSVSEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135

Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785
            K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A   +   + LIKLMQFLMGL
Sbjct: 136  KKINCIAQNGSTVAEYYNRLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195

Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605
            DD Y P+R+NILT++  PSVK AFSI+S EESHR  S G+ S   +  A +S        
Sbjct: 196  DDVYQPVRTNILTRESFPSVKVAFSIVSREESHRLSSGGSKSQSVSYVARSSQPNQSSSR 255

Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425
                                          LKC++CN  GHT+DRC+ ++GYP G  KR+
Sbjct: 256  RNFRGSNSV---------------------LKCTHCNMLGHTVDRCFEIIGYPPGMKKRS 294

Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPA----HSNMAGK 1257
               + +             +   S+  S +  + EQ+ +LMSL+ +        SNMAG+
Sbjct: 295  VGQSGR---NNVNSRSNQSAAPSSSVASALPFTSEQITKLMSLIGEKSEGEQQKSNMAGE 351

Query: 1256 CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVS 1077
                            +  +F   ++F+       W+VDSGANQHM  + K +IN VDVS
Sbjct: 352  S--------------SYVNNFVSCSSFVNFEHGYRWVVDSGANQHMVNTDKDMINCVDVS 397

Query: 1076 KLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFD 897
            + GL VGHPNGT   + +IG++K+ NN++L DV  VP Y+V+LLSVHKLA+D+ ++V F+
Sbjct: 398  ECGLKVGHPNGTSVNVIKIGELKLINNVVLKDVFFVPGYSVNLLSVHKLAKDNNIAVLFN 457

Query: 896  ESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLH 717
            ES C +QD  +K+ + IGNQ +GLY    + N   +  N     N+WH RLGHPSDQVL 
Sbjct: 458  ESNCMLQDLKSKKVLVIGNQENGLYYVGRHGNSVNLCYNSVDKSNVWHSRLGHPSDQVLA 517

Query: 716  VLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDG 537
            VLK  L + KT +    C+ CHK+KQ R PFPLS+HKS  IG LVHLD+WGPY++TS +G
Sbjct: 518  VLKDKLEI-KTVEHDP-CEICHKSKQVRVPFPLSDHKSKGIGDLVHLDLWGPYRVTSYEG 575

Query: 536  YKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNK 357
            YKYFLT+VDDY+R+VW Y+L+NK +V++++ +F +++  QF++ +K+FRSDNGTEFVNN+
Sbjct: 576  YKYFLTVVDDYTRSVWCYLLRNKMEVFENLKDFYELILTQFKTKVKVFRSDNGTEFVNNQ 635

Query: 356  MQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAAY 177
            M  F  +KGI HQTSC YTPQQNG+ ERKHRHLLN+AR+LMFQG LPL  W +CVLTA Y
Sbjct: 636  MNFFMKQKGILHQTSCSYTPQQNGVVERKHRHLLNIARALMFQGGLPLRFWSDCVLTAVY 695

Query: 176  LINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKSVLIG 3
            LINRLPSSVL GKSP+ L++G +P+LSH+R FGCLC++T+LN SDKF   ++K VLIG
Sbjct: 696  LINRLPSSVLGGKSPYELMFGFEPSLSHLRSFGCLCFSTVLNESDKFAYHADKCVLIG 753


>gb|OTG33444.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1427

 Score =  780 bits (2014), Expect = 0.0
 Identities = 394/784 (50%), Positives = 518/784 (66%)
 Frame = -2

Query: 2354 NESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGF 2175
            NE++++    KL   D LYLH SD+S   I+++KL GSENY IWS AM  AL+  NK+GF
Sbjct: 10   NETLVS----KLDASDPLYLHASDSSNLTIVNIKLKGSENYTIWSSAMKLALQVKNKIGF 65

Query: 2174 VDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDK 1995
            +DG+C K   N  LA QWD C+SVV+TWILNS+S EL+ G ++SK A E+W DLKETYDK
Sbjct: 66   IDGSCTKSKDNDVLAKQWDRCNSVVITWILNSVSEELYMGQVFSKLASEVWADLKETYDK 125

Query: 1994 VDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQL 1815
            +DGS +F +H+ INSLSQNG+S+++YY+ LN++WKQFD MI LP+CTC A   F   + +
Sbjct: 126  IDGSVIFGLHQKINSLSQNGTSVSEYYHKLNTMWKQFDQMIQLPSCTCRASKEFNDFSHM 185

Query: 1814 IKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAF 1635
            IKLMQFLMGLDD Y P+R+N+LT + LPSVK+AFSIIS EESHRN  +       NV   
Sbjct: 186  IKLMQFLMGLDDVYHPVRTNLLTSETLPSVKTAFSIISREESHRNSKNPLKDQTQNVGFV 245

Query: 1634 ASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLV 1455
            +                                      NLKC++CNK GHT+DRCY +V
Sbjct: 246  SKTNQSFETKKKFNRGPNP--------------------NLKCTHCNKLGHTVDRCYEIV 285

Query: 1454 GYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPAH 1275
            GYP     R+  S                 +E S++++  +L+ +Q+ RL+ LLN+    
Sbjct: 286  GYPQNSKSRSNQSTKS----FASNNSVSNKVESSSASTIPALTPDQVSRLLGLLNERTGE 341

Query: 1274 SNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLI 1095
            S+                                 +N ++G   DSGANQHM  + + + 
Sbjct: 342  SS---------------------------------QNANVG---DSGANQHMVRTEEGIF 365

Query: 1094 NTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSK 915
            + +DVS+  + V HPNG+ A +T+IG  K+N+ +IL DV VVPEY V+LLSV+KLA+D+K
Sbjct: 366  DAIDVSEFNIKVKHPNGSDATVTKIGKYKLNDKVILTDVFVVPEYYVNLLSVYKLAKDNK 425

Query: 914  LSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHP 735
            L V FDE+ CYIQD  TK  +  GNQ  GLY     +    +  N   + NLWH RLGHP
Sbjct: 426  LRVLFDENNCYIQDSHTKNTLVTGNQVDGLYFCGDTSKTMKVCFNSHDTLNLWHSRLGHP 485

Query: 734  SDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYK 555
            S+ VL VLK SLNL K T++   C+ CH+AKQ R PFPLS+HK++ +G+L+HLDVWGPY+
Sbjct: 486  SNPVLSVLKDSLNL-KFTNNDIPCEVCHRAKQHRVPFPLSDHKTSSLGELIHLDVWGPYR 544

Query: 554  ITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGT 375
            I S++GYKYFL++VDDYSRAVWVY++++K +V+D+I +F  M+  QF   IK FRSDNGT
Sbjct: 545  IQSREGYKYFLSVVDDYSRAVWVYLMEHKNEVFDNIKSFFNMIKTQFGKTIKTFRSDNGT 604

Query: 374  EFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPEC 195
            EFVN++ + FF+  GI HQT+C YTPQQNGI ERKHRHLLNVARSL+FQG LPL  W EC
Sbjct: 605  EFVNHQTKNFFNTNGIIHQTTCPYTPQQNGIVERKHRHLLNVARSLLFQGGLPLRFWSEC 664

Query: 194  VLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKS 15
            +LTA YLINR PSS+L+GKSP+ LVYG  P L+H+R FGCLC++T+LNN DKF S +EK 
Sbjct: 665  ILTAVYLINRTPSSILNGKSPYDLVYGFKPFLNHLRNFGCLCFSTVLNNPDKFGSHAEKC 724

Query: 14   VLIG 3
            V +G
Sbjct: 725  VFLG 728


>ref|XP_022042162.1| uncharacterized protein LOC110944826 [Helianthus annuus]
          Length = 846

 Score =  760 bits (1962), Expect = 0.0
 Identities = 387/786 (49%), Positives = 523/786 (66%), Gaps = 12/786 (1%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGF--VDGTCKKD 2151
            KL   D LYLH SD+SG  ++++KL G ENY +WS AM  AL   NKLG   +DG+CK+ 
Sbjct: 16   KLDASDPLYLHASDSSGLTVVNIKLKGIENYVVWSNAMHLALMTKNKLGQKKIDGSCKRS 75

Query: 2150 TANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFN 1971
            T +  LA+QWD C+S+VLTWILNS+S EL+ G +YSK A E+W+DLKETY+K+DGS VF 
Sbjct: 76   TTDDVLASQWDRCNSIVLTWILNSVSDELYVGQVYSKLASEVWDDLKETYNKIDGSVVFG 135

Query: 1970 IHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLM 1791
            + + INS+SQNG+S++ YY+ +N++WKQFD+M+ LP+C+C A   F + N LIKLMQFLM
Sbjct: 136  LFQKINSVSQNGASVSKYYHKINTMWKQFDAMLQLPSCSCQASTKFNEFNHLIKLMQFLM 195

Query: 1790 GLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXX 1611
            GLDD Y P+R+N+LT+ PLP+VK+AFSIIS EESHR+  S + S  PNV   A       
Sbjct: 196  GLDDVYQPVRTNLLTRYPLPTVKTAFSIISREESHRD--SNSSSKVPNVGFAAKTNQFNE 253

Query: 1610 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGK 1431
                                           NLKC++CNK GH +D+C+ L GYP+ +  
Sbjct: 254  NKKRFTKVSNP--------------------NLKCTHCNKIGHVVDKCFELHGYPSNFRP 293

Query: 1430 RNFTSN---VKPXXXXXXXXXXXXSL---ECSTSNSPVSLSDEQMVRLMSLLN----DSP 1281
            R   +N    KP            +    + S SNS  SL+ +Q  +L+ LLN    D+ 
Sbjct: 294  RPNQNNNQWSKPNISANSSINSTVNHSFNDKSASNSLNSLTSDQFTKLLDLLNEKKTDNG 353

Query: 1280 AHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKP 1101
              +N+ GK     + N     +  +++ +  N+    +N ++ WI+DS ANQHM +SS+ 
Sbjct: 354  PKTNVRGK-----YHNVISSLDC-YKRSYCFNSKSWSQN-NMSWIIDSSANQHMIMSSEN 406

Query: 1100 LINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARD 921
            + N VDVS   +TV HPNGT A +T IG  K++N++IL DV VVP+Y V+L+ VHKLA+D
Sbjct: 407  MFNKVDVSDYNITVKHPNGTDAKVTIIGCYKLSNSVILRDVFVVPKYCVNLIFVHKLAKD 466

Query: 920  SKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLG 741
            ++L V FDE  CYIQD   K+ +    Q+ GLY      N      N   +  LWH RLG
Sbjct: 467  NQLRVVFDEDTCYIQDLYLKKTLVTSRQTDGLYFCGNYFNSVIACFNKAETIKLWHSRLG 526

Query: 740  HPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGP 561
            HP DQ L+VL    NL     +   C+ CHKAKQ R PFPLSEHK++K+G L+HLDVWGP
Sbjct: 527  HPVDQALNVL----NLKTDKANIDPCEVCHKAKQHRVPFPLSEHKTSKVGDLIHLDVWGP 582

Query: 560  YKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDN 381
            YK++S +G+KYFLT+VDDYSR+VWVY++K+K +V+++I +F  ++  QFE NIK FRSDN
Sbjct: 583  YKVSSIEGFKYFLTVVDDYSRSVWVYLMKSKVEVFENIQSFYNLVKTQFEVNIKAFRSDN 642

Query: 380  GTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWP 201
            GTEFVN++M  F +  GI HQTSC YTPQQNG+ ERKH HLLNVAR+L+FQ  +PL  W 
Sbjct: 643  GTEFVNSQMSNFVNTHGIIHQTSCAYTPQQNGVVERKHGHLLNVARALLFQSGVPLKFWS 702

Query: 200  ECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSE 21
            ECVLTA+YLINR PSSVL+GK+P+ L++G +P+LSH+++FGCLC+ T+LNN DK    +E
Sbjct: 703  ECVLTASYLINRTPSSVLNGKTPYELLFGFEPSLSHLKIFGCLCFFTVLNNPDKLDEEAE 762

Query: 20   KSVLIG 3
            K + +G
Sbjct: 763  KCIFMG 768


>gb|OTG37431.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1459

 Score =  779 bits (2012), Expect = 0.0
 Identities = 393/786 (50%), Positives = 525/786 (66%), Gaps = 12/786 (1%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145
            KL  GD LYLHPSD+S   I+S+KL G+ENY +WS AM  AL   NK GF+DG  +K   
Sbjct: 16   KLDIGDPLYLHPSDSSSLTIVSIKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75

Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965
            +  LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++
Sbjct: 76   DEILAAQWDRCNSVVLTWLLNSISEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135

Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785
            K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A   +   + LIKLMQFLMGL
Sbjct: 136  KKINCIAQNGSTVAEYYNRLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195

Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605
            DD Y P+R+NILT++  PSVK AFSI+S EESHR  SSG GS   NVS  +         
Sbjct: 196  DDVYQPVRTNILTRESFPSVKVAFSIVSREESHR-LSSG-GSKTQNVSFVSKPNQAFDPK 253

Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGY---G 1434
                                         NLKC++CN  GHT+DRC+ +VGYP G+   G
Sbjct: 254  RRNNRGPNP--------------------NLKCTHCNMIGHTVDRCFEIVGYPPGFRRKG 293

Query: 1433 KRNFTS--NVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND----SPAHS 1272
              N T+  N                   S  +S +  + EQ+ +L+SL+ +    S  ++
Sbjct: 294  TNNQTNKTNSSVNNNNSNKSNNVGGSSVSAVSSGLPFTSEQISKLLSLVGEKSGSSAQNT 353

Query: 1271 NMAGKCFSGTFF---NASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKP 1101
            ++ G+CF+ + F   ++SV FN  F                  WIVDSGA+QHM  S K 
Sbjct: 354  SVGGECFNVSNFVSCSSSVSFNNSFV-----------------WIVDSGASQHMIKSDKY 396

Query: 1100 LINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARD 921
            +IN VDVS+  +TVGHPNGT+  + +IGD+K+ +N++L DV  VP+Y V+LLSV+KLA+D
Sbjct: 397  MINVVDVSEFNITVGHPNGTKVKVLKIGDLKLTDNVVLRDVFYVPDYCVNLLSVYKLAKD 456

Query: 920  SKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLG 741
            + +SV F E+ C +QD ++++ +  G+Q SGLY  +   N   +  N +V    WH RLG
Sbjct: 457  NHISVIFKENSCVLQDSSSRKVLMNGSQDSGLYFVENYGNSVNVCLNSSVKSFTWHTRLG 516

Query: 740  HPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGP 561
            HPSDQVL VLK SL ++        CD CH+AKQ R PFPLSEHKS  +G L+HLD+WGP
Sbjct: 517  HPSDQVLAVLKGSLKINSNEHGP--CDVCHRAKQVRVPFPLSEHKSKFVGDLIHLDLWGP 574

Query: 560  YKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDN 381
            YK++S DG+KYFLT+VDDYSR+VW Y L NK +V++++ NF +++  QF+  IK+FRSDN
Sbjct: 575  YKVSSYDGFKYFLTVVDDYSRSVWCYFLTNKTEVFENLKNFYELVVTQFKKRIKVFRSDN 634

Query: 380  GTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWP 201
            GTEFVNN+M  F   KGI HQTSC YTPQQNG+ ERKHRHLLN AR+LMFQG LPL  W 
Sbjct: 635  GTEFVNNQMSMFCKSKGILHQTSCSYTPQQNGVVERKHRHLLNTARALMFQGGLPLRYWS 694

Query: 200  ECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSE 21
            +CVLTA YLINRLPSSVL+G+SPF +++G  P+LSH+R FGCLC++T+L +SDKF   ++
Sbjct: 695  DCVLTAVYLINRLPSSVLNGRSPFEMMFGFSPSLSHLRNFGCLCFSTVLTDSDKFAYHAD 754

Query: 20   KSVLIG 3
            K V +G
Sbjct: 755  KCVFLG 760


>gb|OTG16942.1| putative ribonuclease H-like domain-containing protein [Helianthus
            annuus]
          Length = 1458

 Score =  775 bits (2002), Expect = 0.0
 Identities = 390/780 (50%), Positives = 521/780 (66%), Gaps = 6/780 (0%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145
            KL  GD LYLHPSD+S   I+SVKL G+ENY +WS AM  AL   NK GF+DG  +K   
Sbjct: 16   KLDIGDPLYLHPSDSSSLTIVSVKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75

Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965
            +  LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++
Sbjct: 76   DEILAAQWDRCNSVVLTWLLNSVSEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135

Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785
            K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A   +   + LIKLMQFLMGL
Sbjct: 136  KKINCIAQNGSTVAEYYNKLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195

Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605
            DD Y P+R+NILT++  PSVK AFSI+S EESHR  SSG+ S   +  A ++        
Sbjct: 196  DDIYQPVRTNILTRETFPSVKVAFSIVSREESHRLSSSGSKSQSVSYVARSNQSNQNTSK 255

Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425
                                         NLKC++CN  GHT+DRC+ ++GYP G  KR 
Sbjct: 256  RNFRGPNS---------------------NLKCTHCNMIGHTVDRCFEIIGYPPGMKKRG 294

Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSP----AHSNMAGK 1257
              S  K                 S++ S ++ + EQ+ +LMSL+ + P      SNM G 
Sbjct: 295  NMSFGKNNGNNTSRSGMSSG-PSSSAVSALTFTPEQIAKLMSLVGEKPDGDQEKSNMGGM 353

Query: 1256 --CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVD 1083
              C SG     S   ++ F   +N             W+VDSGANQHM  S K + N +D
Sbjct: 354  SACMSGFL---SCSSSVCFSHEYN-------------WVVDSGANQHMIKSDKDMFNCID 397

Query: 1082 VSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVS 903
            VS+ GL VGHPNGT   + +IGD+K+ NN+I+ DV  VP Y+V+LLSVHKLA+D+K++V 
Sbjct: 398  VSECGLKVGHPNGTSVSVLKIGDLKLINNVIIKDVFYVPGYSVNLLSVHKLAKDNKIAVL 457

Query: 902  FDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQV 723
            F+E+ C +QD  +K+ + IG Q +GLY    N N   +  N +V  +LWH RLGHPSDQV
Sbjct: 458  FNENNCMLQDLRSKKILVIGRQENGLYFVGRNGNFANLCFNSSVKSDLWHSRLGHPSDQV 517

Query: 722  LHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSK 543
            L VLK SL++     +   C+ CH++KQ R PFPLS+HKS ++G L+HLD+WGPYK++S 
Sbjct: 518  LAVLKDSLDVKIVEHNP--CEVCHRSKQVRVPFPLSDHKSKELGDLIHLDLWGPYKVSSY 575

Query: 542  DGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVN 363
            +GYKYFLT+VDD++R VW YMLK+K +V++++  F +++  QF+  +K+FRSDNGTEF+N
Sbjct: 576  EGYKYFLTVVDDFTRTVWCYMLKSKVEVFENLKYFYELVLTQFKKKVKMFRSDNGTEFIN 635

Query: 362  NKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTA 183
            N+M  F  +KGI HQTSC YTPQQNG+ ERKHRHLLN AR+LMFQ  LPL  W +CVLTA
Sbjct: 636  NQMSTFCKQKGIVHQTSCSYTPQQNGVVERKHRHLLNTARTLMFQSGLPLRFWSDCVLTA 695

Query: 182  AYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKSVLIG 3
             Y+INRLPSSVLSGKSP+ L++G  P+LS+ R FGCLC++T LN  DKF   ++K VLIG
Sbjct: 696  VYIINRLPSSVLSGKSPYELMFGFRPSLSYFRNFGCLCFSTNLNEPDKFAYHADKCVLIG 755


>ref|XP_021986042.1| uncharacterized protein LOC110882290 [Helianthus annuus]
          Length = 851

 Score =  749 bits (1933), Expect = 0.0
 Identities = 377/746 (50%), Positives = 506/746 (67%), Gaps = 9/746 (1%)
 Frame = -2

Query: 2213 MTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTA 2034
            M  AL+  NK+GF+DGTC++ T +  L  QWD C+S+VLTWILNS+S +L+ G +YSK A
Sbjct: 1    MNLALQVKNKIGFIDGTCRRSTTDEVLGRQWDRCNSIVLTWILNSVSEDLYLGHVYSKLA 60

Query: 2033 YEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCT 1854
             ++W DLKETYDKVDGS VFN+++ INS +Q+G  +++YY+ LN +WKQ D +++LP CT
Sbjct: 61   SDVWKDLKETYDKVDGSVVFNLYQKINSFTQSGMPVSEYYHKLNCMWKQMDQLLALPACT 120

Query: 1853 CDAGIHFEKHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFS 1674
            CDA   F   N LIKLMQFLMGLD TY  +R+N+LT++ LPSVK AFSIIS EESH N  
Sbjct: 121  CDASKQFNDFNHLIKLMQFLMGLDSTYQSVRTNLLTREILPSVKDAFSIISREESHLNSK 180

Query: 1673 SGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCN 1494
            + +     +V  FA+                                     NLKCS+CN
Sbjct: 181  NFSDKTHNSVVGFATKTNQLIDTKKKGIRTPNP-------------------NLKCSHCN 221

Query: 1493 KPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECS-TSNSPVS-LSDE 1320
            K GHTI++C+ LVGYP+           KP             +E S T++S VS L+ +
Sbjct: 222  KTGHTIEKCFELVGYPSWMKS-------KPGGNKGSRVSNNSVVENSDTTSSAVSYLTSD 274

Query: 1319 QMVRLMSLLNDSPAH----SNMAGKCFSGTFFNASVKFNLKFEKHFNGN---TNFLKKNT 1161
            Q+ +L+SLL+D P +    SN AG+C S  +F+++V    K    F      +NF+    
Sbjct: 275  QIAQLLSLLHDKPKNDPSCSNFAGRCNS-VYFDSNVDVFSKPTSDFKPAYCFSNFINAGK 333

Query: 1160 SLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHD 981
             +GWI+DSGANQHM  +   LIN +DVS+  + V HPNGT AL+T+IGD+K++  +IL+D
Sbjct: 334  KVGWIIDSGANQHMVKNDIGLINQMDVSEYNIKVKHPNGTSALVTKIGDIKLSEKVILYD 393

Query: 980  VLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTN 801
            V VVP+Y V+L+SVHKLA+D  L+VSFDE  CYIQD  TK+    G+Q  GLY    +  
Sbjct: 394  VFVVPDYCVNLVSVHKLAKDCNLTVSFDEHNCYIQDSRTKKVQVTGSQLDGLYFCGGSAL 453

Query: 800  CKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFP 621
               + +  ++  N WH RLGHP++ VLHVLK  L++ K       C+TCH+AKQ REPFP
Sbjct: 454  SDKVCS-ASLDVNRWHARLGHPAEPVLHVLKNKLDI-KAGIKLEPCETCHRAKQHREPFP 511

Query: 620  LSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVN 441
            LSEHK+  + +L+HLDVWGPY++ S++G+++FLT+VDDY+RAVWVY++K+K DV+ +I +
Sbjct: 512  LSEHKTKNLSELIHLDVWGPYRVQSREGFRFFLTVVDDYTRAVWVYLMKSKEDVFYNIKD 571

Query: 440  FTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRH 261
            F  ML  QF+ ++K+FRSDNGTEF+N +M++F H  GI HQTS V+TPQQNGI ERKHRH
Sbjct: 572  FFNMLKTQFDKHVKMFRSDNGTEFINKQMKEFCHNHGIIHQTSGVHTPQQNGIVERKHRH 631

Query: 260  LLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVF 81
            LLNVAR+L+FQ   PL  W EC+LTA YLINR PSSVL+G+SP+ LVYG  P LS +RV 
Sbjct: 632  LLNVARALLFQVGFPLKFWSECILTATYLINRTPSSVLNGRSPYKLVYGFAPVLSQLRVI 691

Query: 80   GCLCYATILNNSDKFYSRSEKSVLIG 3
            GCLC++T+LNN+DKF S +EK VLIG
Sbjct: 692  GCLCFSTVLNNTDKFNSHAEKCVLIG 717


>ref|XP_021980336.1| uncharacterized protein LOC110876473 [Helianthus annuus]
          Length = 801

 Score =  694 bits (1791), Expect = 0.0
 Identities = 371/805 (46%), Positives = 491/805 (60%), Gaps = 20/805 (2%)
 Frame = -2

Query: 2357 NNESVINSSELKLVF----GDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190
            ++ +VINS    LV     GD L+LHPSD++   I+SVKL GSENY+IWS AM  AL+  
Sbjct: 4    DDNTVINSPGATLVSKIDAGDPLFLHPSDSANLSIVSVKLKGSENYRIWSNAMYLALQVK 63

Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010
            NK+GFVDG+C +   +  L  QWD C+S+VLTWILNS+S EL+ G +YSK A ++W DLK
Sbjct: 64   NKIGFVDGSCLRSKTDEVLGRQWDRCNSIVLTWILNSVSEELYLGLVYSKIASDVWKDLK 123

Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830
            +TYDK+DGS VFN+++ INS SQNG  +++YY+ LN +WKQ D +++LP C+CDA   F 
Sbjct: 124  DTYDKIDGSVVFNMYQKINSFSQNGMPISEYYHKLNCMWKQLDQLLALPACSCDASKQFN 183

Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650
              N LIKLMQFLMGLD +Y  +R+N+LT++ LPSVK AFS+IS EESH +  +       
Sbjct: 184  DFNHLIKLMQFLMGLDSSYQSVRTNLLTRETLPSVKDAFSVISREESHLHSKNIFDKTPN 243

Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470
            N   F+                                      NLKCS+CNK GHTI++
Sbjct: 244  NPVGFSVKTGQTIDSRKRNNRTLNP-------------------NLKCSHCNKTGHTIEK 284

Query: 1469 CYGLVGYPA------GYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPV--SLSDEQM 1314
            C+ LVGYP+      G  K N  SN                   +T  +P   SLS++Q+
Sbjct: 285  CFELVGYPSWIKTKPGGNKGNKVSN--------------NVTADTTDTTPAMSSLSNDQI 330

Query: 1313 VRLMSLLNDSPA----HSNMAGKCFS----GTFFNASVKFNLKFEKHFNGNTNFLKKNTS 1158
             +L+SLLND P      S  AG C +     +F N S K    F+  F  + NF+     
Sbjct: 331  AQLLSLLNDKPKGDPQSSGFAGMCVNPVCLNSFVNLSAKPICDFKPVFCFS-NFINDGKK 389

Query: 1157 LGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDV 978
            +GWIVDSGANQHM ++ + LIN  DV++  + V HPNGT AL+T+IGD+K+++       
Sbjct: 390  VGWIVDSGANQHMVMTDECLINQKDVTEFNIKVKHPNGTSALVTKIGDIKLSD------- 442

Query: 977  LVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNC 798
                                             +D  T++    G+Q  GLY        
Sbjct: 443  ---------------------------------KDSQTQKVQVTGSQFDGLYF------- 462

Query: 797  KAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPL 618
                              GHP++ VLHVLK +LN+ KT    + C+TCHKAKQ REPFPL
Sbjct: 463  -----------------CGHPAEPVLHVLKNNLNI-KTGAKLNPCETCHKAKQHREPFPL 504

Query: 617  SEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNF 438
            S+HKS  +G L+HLDV GPY++ S++G++YFLT+VDDYSRAVWVY++K+K +V+ +I  F
Sbjct: 505  SDHKSEALGDLIHLDVRGPYRVQSREGFRYFLTMVDDYSRAVWVYLMKSKDEVFYNIKGF 564

Query: 437  TQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHL 258
               L  QF  ++KIFRSDNGTEF N +M  F ++ GI HQTSCV+TPQQNGI ERKHRHL
Sbjct: 565  YNFLKTQFSKSVKIFRSDNGTEFTNKQMSNFCYENGILHQTSCVHTPQQNGIVERKHRHL 624

Query: 257  LNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFG 78
            LNVAR+L+FQG  P+  W EC+LTA+YLINR PSS+LSGKSP+ LV+G  P L  +RV G
Sbjct: 625  LNVARTLLFQGGFPIKFWSECILTASYLINRTPSSILSGKSPYELVFGFSPVLGQLRVIG 684

Query: 77   CLCYATILNNSDKFYSRSEKSVLIG 3
            CLC+ T+LNNSDKF + +EK VL+G
Sbjct: 685  CLCFNTVLNNSDKFTTHAEKCVLVG 709


>ref|XP_021971692.1| uncharacterized protein LOC110866849 [Helianthus annuus]
          Length = 828

 Score =  566 bits (1459), Expect = 0.0
 Identities = 294/683 (43%), Positives = 415/683 (60%), Gaps = 3/683 (0%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145
            KL   D LYLHPSD+S   I+SVKL GSENY +WS AM                      
Sbjct: 23   KLDASDPLYLHPSDSSNLTIVSVKLKGSENYTVWSNAM---------------------- 60

Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965
                  QWD C+S+VLTWILNS+S EL+ G ++SK A ++W+DLKETY+KV+GS VF ++
Sbjct: 61   -----QQWDRCNSIVLTWILNSISEELYMGQVFSKLACDVWSDLKETYNKVEGSVVFYLY 115

Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785
            K IN  +QNG+++++YY+ LN +W+Q D ++ LP+CTC+A   F   N +I+LMQFLMGL
Sbjct: 116  KKINGFTQNGTNVSEYYHKLNVMWRQLDEILQLPSCTCEAAKEFNNFNHMIELMQFLMGL 175

Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605
            DD Y  +R+N+L K+ LP+VK AF+I+S EESHRN  S   S +    AF S        
Sbjct: 176  DDVYQGVRTNLLMKETLPTVKEAFAIVSREESHRN--SSNSSKEGLTMAFVSKVSQPIEF 233

Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425
                                         NLKCS+CNK GH++D+C+ ++GYP       
Sbjct: 234  KRGNKVANQ--------------------NLKCSHCNKVGHSVDKCFEIIGYP------- 266

Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVS---LSDEQMVRLMSLLNDSPAHSNMAGKC 1254
              S +KP            +   S  ++ VS   L+ EQ+ RL+SL+ D P+ +  +   
Sbjct: 267  --SWMKPPRGNQVKKAVASNSSTSVESANVSVNSLTSEQITRLLSLIGDKPSGAPQSCSV 324

Query: 1253 FSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSK 1074
                F  ++V F     K     ++  K   S+GW++DSGANQHM    K L +++DVS+
Sbjct: 325  SGSNFLCSNVFF-----KPVICFSSESKDEQSVGWVIDSGANQHMIKDEKVLSHSIDVSE 379

Query: 1073 LGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDE 894
              +TV HPNGT AL+T+IG++K+ NN+IL DV +VPEY ++L+ VHKL +D+ L V FDE
Sbjct: 380  FKITVKHPNGTNALVTKIGNVKLVNNVILKDVFLVPEYNINLIYVHKLVKDNGLYVGFDE 439

Query: 893  SKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHV 714
            +KCY+QD +TK+ +  G+Q  GLY    +  C  +  N +   +LWH RLGHPSDQ + V
Sbjct: 440  NKCYVQDISTKKVLVTGSQVDGLYFCGSSFMCNKVCFNSSSLNDLWHVRLGHPSDQAIRV 499

Query: 713  LKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGY 534
             K  L L   +D++  C+ C +AKQ REPFPLS H+S+ +G LVHLDVWGPY++TS++G+
Sbjct: 500  FKYKLKLG-NSDTSLPCEVCQRAKQHREPFPLSSHRSSSLGDLVHLDVWGPYRVTSREGH 558

Query: 533  KYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKM 354
             +FLTIVDDYSR VWV ++K K +V++++V+F  ++  QF   +K FRSDNGTEF+N + 
Sbjct: 559  WFFLTIVDDYSRVVWVCLMKTKQEVFENVVDFVNIIKTQFHKEVKCFRSDNGTEFINQQT 618

Query: 353  QQFFHKKGIFHQTSCVYTPQQNG 285
             +F   K + H      TP   G
Sbjct: 619  NRFCKIKDVQHSK----TPDDEG 637


>ref|XP_022014401.1| uncharacterized protein LOC110913892 [Helianthus annuus]
          Length = 583

 Score =  546 bits (1407), Expect = 0.0
 Identities = 293/614 (47%), Positives = 395/614 (64%), Gaps = 7/614 (1%)
 Frame = -2

Query: 2369 TESMNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190
            TE     SV   S+L +   D LYLH SD+S   I+++KL G+ENY +WS AM  AL   
Sbjct: 4    TEKQGESSVTLVSKLDV--SDPLYLHASDSSSLFIVNIKLKGTENYVVWSNAMKLALTAK 61

Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010
            NKLGF++GTC K T +  LA+QWDMC+SV+LTWILNS+S EL+ G +YS  A E+W+DLK
Sbjct: 62   NKLGFINGTCTKSTKDDVLASQWDMCNSVILTWILNSVSKELYVGQVYSSLASEVWSDLK 121

Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830
            +TYD+VDGS VF +++ INS+SQNG+S+++YY+ LN++WKQFD+++ LP+CTCDA   + 
Sbjct: 122  DTYDRVDGSVVFGLYQKINSVSQNGTSVSEYYHRLNTMWKQFDAIVQLPSCTCDASSKYN 181

Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650
            + +QLIKLMQFLMGLDD Y P+R+N+LT+DPLP+VK+AFS+IS EESHR+  S   S  P
Sbjct: 182  EFSQLIKLMQFLMGLDDIYQPVRTNLLTRDPLPTVKTAFSVISREESHRD--SNKSSKTP 239

Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470
            NV   A                                      NLKC++CNK GH I++
Sbjct: 240  NVGFVAKATQYNDNKKRFNKGPNP--------------------NLKCTHCNKVGHVIEK 279

Query: 1469 CYGLVGYPAGY-GKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPV-SLSDEQMVRLMSL 1296
            C+ L GYP+ Y  K N  ++               ++    +NS + +L+ +Q  +L+ L
Sbjct: 280  CFKLHGYPSNYRNKSNQNNSQWSKTNLSANNSVANTMNDQPANSSLNALTVDQFSKLLGL 339

Query: 1295 LNDSPAH----SNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGAN 1128
            LN++       SNM+GK     F+NA       ++  +  N++F  +N  L WI+DSGAN
Sbjct: 340  LNENKLEDSHKSNMSGK-----FYNAFTSLG-SYKNTYCFNSSFFHQN-KLKWIIDSGAN 392

Query: 1127 QHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSL 948
            QHM  ++  + N VDVS+  +T+ HPNGT A + EIG  K++ ++IL DV VVPEY V+L
Sbjct: 393  QHMVTNNDNMFNLVDVSEYDITIKHPNGTDAKVKEIGCFKLSEDVILKDVFVVPEYCVNL 452

Query: 947  LSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAIS-NNCTV 771
            +SVHKLA+D+KL V FDE  CYIQD + K+N+ IG Q+ GLY F  N++   I+  N T 
Sbjct: 453  ISVHKLAKDNKLKVVFDEHNCYIQDVSLKKNLVIGRQTDGLY-FCGNSSISVIACFNKTE 511

Query: 770  SRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIG 591
            +  LWH RLGHPSDQVLHV     NL   +     CDTCHKAKQ R PFPLS+HKS ++ 
Sbjct: 512  TIKLWHSRLGHPSDQVLHV----RNLKNESGQAEPCDTCHKAKQHRIPFPLSDHKSKRVD 567

Query: 590  QLVHLDVWGPYKIT 549
             LVHLDVWGPYK T
Sbjct: 568  DLVHLDVWGPYKTT 581


>ref|XP_021991826.1| uncharacterized protein LOC110888615 [Helianthus annuus]
          Length = 555

 Score =  492 bits (1266), Expect = e-162
 Identities = 264/572 (46%), Positives = 359/572 (62%), Gaps = 4/572 (0%)
 Frame = -2

Query: 2369 TESMNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190
            TE+    SV   S  KL   D LYLH SD+S   I+++KL G+ENY +WS AM  AL   
Sbjct: 4    TENQGESSVTLIS--KLDASDPLYLHASDSSSLTIVNIKLKGTENYVVWSNAMKLALTAK 61

Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010
            NKLGF++GTC K T +  LA+QWD C+SVVLTWILNS+S EL+ G +YS+ A E+W+DLK
Sbjct: 62   NKLGFINGTCTKSTKDDVLASQWDRCNSVVLTWILNSVSEELYVGQVYSRLASEVWSDLK 121

Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830
            +TYD VDGS VF +++ INS++QNG+S+++YY+ LN++WKQFD+M+ LP+CTCDA   + 
Sbjct: 122  DTYDMVDGSVVFGLYQKINSVNQNGASVSEYYHKLNTMWKQFDAMVQLPSCTCDASTKYN 181

Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650
            + +QLIKL+ FLMGLDD Y P+R+N+LT+DPLP+VK+AFSIIS EESHR+  S   S  P
Sbjct: 182  EFSQLIKLVHFLMGLDDIYQPVRTNLLTRDPLPTVKTAFSIISREESHRD--SNKSSKIP 239

Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470
            NV   A                                      NLKC++CNK GH I++
Sbjct: 240  NVGFVAKETQFNENKKRFNKGPNP--------------------NLKCTHCNKVGHVIEK 279

Query: 1469 CYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLN 1290
            C+ + GYP  Y  +   +N +             + + S ++S  +L+ +Q  +L+ LLN
Sbjct: 280  CFEIHGYPLNYRNKPNQNNSQWSKANVSANSSVANNDQSANSSLNALTADQFSKLLGLLN 339

Query: 1289 DS----PAHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQH 1122
            ++     A SNM+G+CFS      S K    F      N++F  +N  L WIVDSGANQH
Sbjct: 340  ENKLEDSAKSNMSGECFSAFTPLGSYKNTYCF------NSSFFHQN-KLKWIVDSGANQH 392

Query: 1121 MTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLS 942
            M +++  + N VDVS+  +T+ HPNGT A + +IG  K++ ++IL DV VVPEY V+L+S
Sbjct: 393  MVMNNDNMFNLVDVSEYDITIKHPNGTDAKVKQIGCFKLSEDVILKDVFVVPEYCVNLIS 452

Query: 941  VHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRN 762
            VHKLA+D+KL V FDE  CYIQD + KRN+ IG Q  GLY    ++       N T +  
Sbjct: 453  VHKLAKDNKLKVVFDEHNCYIQDVSLKRNLVIGRQMGGLYFCGNSSKPVIACFNKTETIK 512

Query: 761  LWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHI 666
            LWH RLGHP DQVLHVLK  +  DK +  T +
Sbjct: 513  LWHSRLGHPEDQVLHVLKLKMKQDKLSRVTRV 544


>ref|XP_022023932.1| uncharacterized protein LOC110924205 [Helianthus annuus]
          Length = 541

 Score =  471 bits (1211), Expect = e-154
 Identities = 247/544 (45%), Positives = 340/544 (62%), Gaps = 7/544 (1%)
 Frame = -2

Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145
            K+  GD L+LHPSD +   I+++KL G+ENY +W+ +M  AL+  NK+GF+DG+C++ T 
Sbjct: 23   KIDAGDPLFLHPSDCANLSIVTIKLKGTENYTVWANSMNLALQVKNKIGFIDGSCRRSTT 82

Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965
            +  L  QWD C+S+VLTWILNS+S EL+ G +YSK A E+W DLKETYDKVDGS VFN++
Sbjct: 83   DEVLGRQWDRCNSIVLTWILNSVSDELYLGHVYSKLASEVWRDLKETYDKVDGSIVFNLY 142

Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785
            + I+S +Q+G  +++YY+ LN +WKQ D +++LP+CTCDA   F   N LIKLMQFLMGL
Sbjct: 143  QKIDSFTQSGMPVSEYYHKLNCMWKQLDQLLALPSCTCDASKQFNDFNHLIKLMQFLMGL 202

Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605
            D TY  +R+N+LT++ LP+VK AFSIIS EESH +    +  I  N   FA+        
Sbjct: 203  DSTYQSVRTNLLTRETLPTVKDAFSIISREESHLHMKIFSERIPNNTVGFAAKTNQSFES 262

Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425
                                         NLKCS+CNK GHTI++C+ LVG P     +N
Sbjct: 263  KKRGIRPPNP-------------------NLKCSHCNKTGHTIEKCFELVGNPTWMKSKN 303

Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND----SPAHSNMAGK 1257
               N               S   S S++  SL+ EQ+ +L+SLLND     P  S  AG+
Sbjct: 304  ---NGNKGSRVSNNVITETSDTVSPSSAMSSLTSEQVAQLLSLLNDKSKNDPQSSGFAGR 360

Query: 1256 CFSGTFFNASVKFNLKFE---KHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTV 1086
                  FN+ V  + K     K     +NF K    +GWI+DSGANQHM ++    IN +
Sbjct: 361  SDDSMCFNSFVDMSSKTSCDPKPVYCFSNFFKDGNRVGWIIDSGANQHMVMTDVGFINQI 420

Query: 1085 DVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSV 906
            DV++  + V HPNGT AL+T+IGD+K+++ +IL+DV +VP+Y V+L+SVHKLA+D KL+V
Sbjct: 421  DVTEYNIKVKHPNGTSALVTKIGDIKLSDKVILYDVFLVPDYCVNLVSVHKLAKDCKLTV 480

Query: 905  SFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQ 726
            +FDE+ CYIQD  TK+     NQ  GLY F  +++ K  +  C V  NLWH RLGHP++ 
Sbjct: 481  TFDENNCYIQDSQTKKIQVTDNQLDGLY-FCGSSSVKVCNAKCDV--NLWHARLGHPAEP 537

Query: 725  VLHV 714
            VLHV
Sbjct: 538  VLHV 541


>emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera]
          Length = 1523

 Score =  478 bits (1229), Expect = e-147
 Identities = 285/783 (36%), Positives = 417/783 (53%), Gaps = 12/783 (1%)
 Frame = -2

Query: 2315 FGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANAS 2136
            F   L+LH SD  G  ++S  L   +NY  W  +M  AL   NK GFVDGT  + T N +
Sbjct: 28   FNHPLFLHHSDQPGAVLVSQPLM-EDNYTTWVQSMDMALTIKNKKGFVDGTLNRPTHNPN 86

Query: 2135 LANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNI 1956
               QWD C+ +V TW+L ++S E+    I+ K A  MW +L+E +   +   +FNI   I
Sbjct: 87   EQQQWDRCNILVKTWLLGAISKEISNSVIHCKDAKTMWLELQERFSHTNTVQLFNIENAI 146

Query: 1955 NSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLDDT 1776
            +  +Q   ++  ++  L  LW + D++   P CTC      + + +  K M+FLMGL D 
Sbjct: 147  HECAQGTGTVTSFFTKLKGLWDEKDALCGFPPCTCATAAEVKTYMETQKTMKFLMGLGDN 206

Query: 1775 YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXX 1596
            Y  +RSNI+  DPLP+V  A+++    E     S+G  ++    SAF+            
Sbjct: 207  YATVRSNIIGMDPLPTVNKAYAMALRHEKQAEASNGKVAVPNEASAFS------VRKLDQ 260

Query: 1595 XXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCY---GLVGYPAGYGKRN 1425
                                      +LKC+ C   GHT D C      +G   G  K N
Sbjct: 261  DPNTTEREVKCEKCNMTNHSTKNCRAHLKCTYCGGKGHTYDYCRRRKNTMGGGQGRSKVN 320

Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSD-EQMVRLMSLLNDSP-AHSN---MAG 1260
              + +                    +N P+S S+ +QM+ L+S +  +  +HS+   M  
Sbjct: 321  HAATLNEGKE-------------DVTNFPLSQSECQQMMGLLSKIKTAATSHSDGHQMLE 367

Query: 1259 KCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTS-LGWIVDSGANQHMTLSSKPLINTVD 1083
               +    +A++  N+   +  +G    L ++     WI+DSGA+ H+   S  L +   
Sbjct: 368  MLHATKQASANLVGNVPNYEELSGRVFALSRDIKDTMWILDSGASDHIVCDSSFLTSFQP 427

Query: 1082 VSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVS 903
            V      V  P+GT A ++ IG +  +   +LH+VL VP + ++L+S+ KLA DS     
Sbjct: 428  VHNR--IVKLPDGTSAHVSHIGTVSFSAQFVLHNVLCVPLFYLNLISISKLAFDSFYVTI 485

Query: 902  FDESKCYIQDCATKRNVGIGNQSSGLYLFDV--NTNCKAISNNCTVSRNLWHQRLGHPSD 729
            F    C+IQD  + + +G+G +S GLY  ++     C  ++   T +++LWHQRLGHPS 
Sbjct: 486  FLRQVCFIQDLQSGKMIGMGTESEGLYCLNLPRKGTCNVVN---TKTQDLWHQRLGHPSS 542

Query: 728  QVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKIT 549
            +V  VL P L  +KT D +  C  C  AK TR PFPLS   S     L+H+D+WG Y + 
Sbjct: 543  KV-SVLFPFLQ-NKTLDVS-TCSICPLAKHTRTPFPLSVSSSDSCFDLIHVDIWGGYHVP 599

Query: 548  SKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEF 369
            S  G +YFLTIVDD+SR+ WVY++ +K +    +V+F  +++NQF S +KI RSDNG EF
Sbjct: 600  SLSGAQYFLTIVDDHSRSTWVYLMHHKSEARSLLVHFVNLVANQFGSQVKIVRSDNGPEF 659

Query: 368  VNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVL 189
               K  QF+  +GI HQTSC+ TPQQNG+ ERKHRHLLNVAR+L+FQ  LP   W + +L
Sbjct: 660  ---KHTQFYSSRGILHQTSCINTPQQNGVVERKHRHLLNVARALLFQSHLPKPFWGDAIL 716

Query: 188  TAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCY-ATILNNSDKFYSRSEKSV 12
            TAAYLINR P+ +L GK+PF  ++   P  SH+RVFGC C+ +T      KF  RS +SV
Sbjct: 717  TAAYLINRTPTPLLQGKTPFEKLFHKSPNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESV 776

Query: 11   LIG 3
             IG
Sbjct: 777  FIG 779


>gb|PNX93622.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1454

 Score =  474 bits (1219), Expect = e-146
 Identities = 273/791 (34%), Positives = 406/791 (51%), Gaps = 26/791 (3%)
 Frame = -2

Query: 2297 LHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWD 2118
            L+ +D  GN I  V+L G  NY  W+ AM  +LR   K GF++GT KK     +    W 
Sbjct: 32   LNSNDNPGNLITQVQLRGENNYDEWTRAMKTSLRARRKWGFIEGTVKKPDEGTAEIEDWW 91

Query: 2117 MCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQN 1938
               S++++WILN++ P L +   Y + A ++W D+KE +   +G  +  +  ++ +  Q 
Sbjct: 92   TVQSMLVSWILNTVEPNLRSTMTYMENARDLWEDIKERFSVANGPKIHQLKADLVACKQA 151

Query: 1937 GSSLADYYNNLNSLWKQFDSMISLPTCTCDA-----GIHFEKHNQLIKLMQFLMGLDDT- 1776
            G ++A YY  L  LW +  +   +P C+C+          EK  +  ++ QFLMGLDD  
Sbjct: 152  GMTIAAYYGKLKLLWDELANYEQVPVCSCEGCSCRITTKLEKRREEERVHQFLMGLDDVV 211

Query: 1775 YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXX 1596
            Y   RSN+L  DPLP++   +S++  EE  R  +      + +V   A            
Sbjct: 212  YGTARSNLLASDPLPNLNRIYSVMIQEERVRTIARNKEE-RGDVMGLAVQIGGKNRGRDE 270

Query: 1595 XXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKR---- 1428
                                        KC+NCN+ GH    C+ L+GYP  +G R    
Sbjct: 271  FKD-------------------------KCTNCNRDGHVAANCFQLIGYPDWWGDRPRGE 305

Query: 1427 ------NFTSNVKPXXXXXXXXXXXXSLECSTSNSP--------VSLSDEQMVRLMSLLN 1290
                    + N               + +   ++S           ++ +Q  +LM +LN
Sbjct: 306  GKSGTRGRSQNRGAGRGKGAAIVRANAAQAGGNSSAREAESHGFPGITSDQWQKLMEILN 365

Query: 1289 DSP-AHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTL 1113
              P     M GK  S                                WI+DSGA+ HMT 
Sbjct: 366  IQPDTAERMTGKSQSNE------------------------------WILDSGASNHMTG 395

Query: 1112 SSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHK 933
            + + +    D+      +G P+G  A  T+ G + ++  + L++VL VP    +L+S+ +
Sbjct: 396  TLEIMRELHDIQTC--PIGLPDGKNASATKEGVVLLDEGLKLYNVLYVPNLKCNLISLSQ 453

Query: 932  LARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWH 753
            L  D    V F +  C +QD  ++  +G G +  GLY F      +A S        LWH
Sbjct: 454  LMDDLDCIVHFSDKLCVMQDRTSRMLIGAGKRRDGLYYFRTIQRVQACSVVGVNQLELWH 513

Query: 752  QRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLD 573
            +RLGHPS +V  ++  +   +   +    CD C +AKQTRE F LSEH +    +L+H D
Sbjct: 514  RRLGHPSLKVTRLVSGTSKNNDHVELNKNCDVCLRAKQTREKFSLSEHVANDAFELIHCD 573

Query: 572  VWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIF 393
            +WGPY+  S  G  YF+TIVDDYSRAVW+Y++ +K +V  +++NF  ++  QF+  +KIF
Sbjct: 574  LWGPYRTASSCGAFYFVTIVDDYSRAVWIYLIGDKREVSQTLINFFTLIKRQFDKQVKIF 633

Query: 392  RSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPL 213
            RSDNGTEFV   M+++FH+ GI  QTSCV TPQQNG  ERKHRH+LNVAR+L FQ  LP+
Sbjct: 634  RSDNGTEFV--CMKRYFHENGIIFQTSCVGTPQQNGRVERKHRHILNVARALRFQSNLPI 691

Query: 212  NLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILN-NSDKF 36
            + W EC+L A YL+NR PS++L+GK+P+ +++G  P+L HIRVFGCLCYA  LN   DKF
Sbjct: 692  DFWGECILAAGYLLNRTPSAILNGKTPYEMLHGQAPSLEHIRVFGCLCYAHNLNRKGDKF 751

Query: 35   YSRSEKSVLIG 3
             S+S K + IG
Sbjct: 752  ASKSRKCIFIG 762


>gb|OMO88216.1| Integrase, catalytic core [Corchorus capsularis]
          Length = 1609

 Score =  476 bits (1224), Expect = e-145
 Identities = 272/771 (35%), Positives = 400/771 (51%), Gaps = 6/771 (0%)
 Frame = -2

Query: 2297 LHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWD 2118
            LH SD  G  +++  L   ENY  W  AMT AL+  +K GFVDG+  + +  +   + W 
Sbjct: 343  LHASDNPGTTLVTCLLK-EENYPTWRRAMTNALQAKSKFGFVDGSVPRPSLGSQEESSWV 401

Query: 2117 MCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQN 1938
             C+S+V++WI N+L P L     Y  TA EMWNDL+E + + + + +  +   + +  Q 
Sbjct: 402  KCNSMVISWIFNALHPTLHDSVAYCVTAQEMWNDLEERFSQGNAARIHQLKTEMVNTLQQ 461

Query: 1937 GSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLDDTYVPIRS 1758
            G S++ YY  L  +W +  +   +P CTC +        +  K+ QFLMGL++ Y  + S
Sbjct: 462  GMSVSAYYTKLKGIWDELGTYSHIPPCTCGSAKGLAAEREKEKVHQFLMGLNEKYNVVHS 521

Query: 1757 NILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXX 1578
             IL  DPL S+  A+++++ EE  +  ++   S  P+V   A                  
Sbjct: 522  QILNTDPLHSLSRAYALVAQEERQQLVAA---SRLPSVEGAAFMTNNANKSNFNRKPASN 578

Query: 1577 XXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXX 1398
                                 L C +C K  HT D C+ L+GYP  + K       KP  
Sbjct: 579  RDLS----------------KLFCEHCKKTRHTKDSCFELLGYPEWWDK-----GKKPSK 617

Query: 1397 XXXXXXXXXXSLECSTSNSPVS-LSDEQMVRLMSLLNDSPAH---SNMAGKCFSGTFFNA 1230
                            +N P++ L+ EQ  +L+S+LN        +N AGK  S      
Sbjct: 618  TKAANTAQHMETASGNNNVPINGLTSEQYAQLISMLNLDKIQIPTANFAGKATS------ 671

Query: 1229 SVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHP 1050
                                 NT++ WI+DSGA+ HMT     + +   V      +  P
Sbjct: 672  -------------------LSNTAIEWILDSGASDHMTCHKSAITSHKTVPHFS-PIKIP 711

Query: 1049 NGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDC 870
            +G+       GD+ +N+ + L+DVL +P ++ +L+S+ KL +       F  + C +QD 
Sbjct: 712  DGSFVPAKSCGDVPLNSLVTLNDVLYIPSFSCNLISISKLTQALNCVAHFFPTFCTLQDL 771

Query: 869  ATKRNVGIGNQSSGLYLFD-VNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNL 693
            AT++ +G+G    GLY F  +     A S +      LWH+RLGH S   L +L   L  
Sbjct: 772  ATRKLIGMGELRDGLYYFQAIKVPIAATSISRDSQLILWHRRLGHLSFDRLSLLN-DLGP 830

Query: 692  DKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIV 513
                     CD+CH+AKQTR PFP+S  K+ +  +L+H DVWGPY   S     YFL+IV
Sbjct: 831  FPVKSFNKCCDSCHRAKQTRPPFPISSIKTHEAFELIHCDVWGPYHTPSLSNAHYFLSIV 890

Query: 512  DDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKK 333
            DD+SR  WVY+LK K +VY  +++F  M++ QF   +K  RSDNGTEF N   Q F  + 
Sbjct: 891  DDFSRTSWVYLLKTKTEVYTWLLSFIAMVAKQFGKAVKQIRSDNGTEFTNQNFQLFCQQN 950

Query: 332  GIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSS 153
            GI  Q SCV TPQQNG+ ERKHRH+L VAR+L FQ  LP+  W ECVLTA YLIN +P+ 
Sbjct: 951  GILTQFSCVSTPQQNGVVERKHRHILEVARALRFQANLPIKFWGECVLTATYLINYVPTP 1010

Query: 152  VLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNS-DKFYSRSEKSVLIG 3
            +LSGKSP+ +++   P+ SH+RVFGCLCY +++  S DKF++R+   + +G
Sbjct: 1011 LLSGKSPYEVLFSRKPSYSHLRVFGCLCYTSVIPRSRDKFHARATACLFLG 1061


>gb|PNX92904.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1457

 Score =  472 bits (1214), Expect = e-145
 Identities = 281/795 (35%), Positives = 405/795 (50%), Gaps = 29/795 (3%)
 Frame = -2

Query: 2300 YLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQW 2121
            YLHPSD  G  I  ++L G +NY  W+ A+  ALR   KL F+DGT  +   +++    W
Sbjct: 33   YLHPSDNPGMIITPIQLKG-DNYDEWAKAIRNALRAKKKLAFIDGTLTEPKEDSADLEDW 91

Query: 2120 DMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQ 1941
                S+++ WILN++ P L +   Y +   ++W D+++ +   +G  ++ +  ++ +  Q
Sbjct: 92   WAVSSMLVAWILNTIEPGLRSTITYMENVKDLWEDIRQRFSIGNGPRIYQLKADLAACKQ 151

Query: 1940 NGSSLADYYNNLNSLWKQFDSMISLPTCTC-----DAGIHFEKHNQLIKLMQFLMGLDDT 1776
             G ++A+YY  +  +W +  S    PTC C     +      K  +  K+ QFLMGLDD 
Sbjct: 152  MGKTVAEYYGKIKVMWDELASYEPAPTCKCGGCKCNISKDLVKKREEEKVYQFLMGLDDV 211

Query: 1775 -YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXX 1599
             Y  +RSNIL+ DPLP++   ++I   EE HR+ + G       V               
Sbjct: 212  VYGTVRSNILSMDPLPNLSRVYAIAVQEERHRDIARGKEERSDAVGFTMQVGAGARAAVV 271

Query: 1598 XXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFT 1419
                                        + C++C K GH +  C+ + GYP  +G R  +
Sbjct: 272  RTKEK----------------------GMNCNHCGKTGHDVKGCFEVNGYPEWWGDRPRS 309

Query: 1418 S--------------------NVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMS 1299
            +                    +V+                   +  P  LS EQ   L++
Sbjct: 310  TGRHGNRGRGNAGSTGRGRGQSVRANATLVGGVEKQHGAGDEHAGIP-GLSGEQWTTLIN 368

Query: 1298 LLNDSPAHS--NMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQ 1125
            LLN   A S   ++GK                       N N         WI+DSGA+ 
Sbjct: 369  LLNTHKAGSVDRLSGK-----------------------NNN---------WIIDSGASH 396

Query: 1124 HMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLL 945
            HMT   + L    +++     VG PNG Q    + G + +  NI L +VL VP    +L+
Sbjct: 397  HMTGVIELLSEARNITPR--PVGLPNGKQTDAVKEGTLCLGENIYLQNVLYVPNMNCTLI 454

Query: 944  SVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSR 765
            SV KL +D +  V+F E+ C +QD  ++  +G+G +  G+++F       A        R
Sbjct: 455  SVSKLVQDLRCIVTFTENLCVMQDRTSRTLIGVGEECDGIFIFRRAAPMHANKAKVMDVR 514

Query: 764  NLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQL 585
             LWHQRLGHPS QVL  L  +++ D  +D    C+TC+KAKQTR+ F  S +K      L
Sbjct: 515  RLWHQRLGHPSKQVLSYLPETISSDLGSDLVDFCETCYKAKQTRDVFQESNNKVDDCFSL 574

Query: 584  VHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESN 405
            +H D+WGPYK+ +  G  YFLTIVDD+SRA+WVY+L  K +V  +I NF  M   QFE  
Sbjct: 575  IHCDLWGPYKVPASCGAFYFLTIVDDFSRAIWVYLLLEKKEVSQTIKNFCAMTERQFEKP 634

Query: 404  IKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQG 225
            +KI RSDNGTEF    ++ +F  KGI HQTSCV TPQQNG  ERKHRH+LNVAR+L FQ 
Sbjct: 635  VKIVRSDNGTEF--TCLKSYFEVKGILHQTSCVGTPQQNGRVERKHRHILNVARALRFQA 692

Query: 224  ELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATIL-NN 48
             LP+  W ECVLTA+YLINR PSS+L GKSP+ +++   P  + ++VFGCLCY      +
Sbjct: 693  NLPIQFWGECVLTASYLINRTPSSLLRGKSPYEVLFKKKPIYNQLKVFGCLCYVHHRGRD 752

Query: 47   SDKFYSRSEKSVLIG 3
             DKF  RS+K V +G
Sbjct: 753  KDKFSERSKKCVFLG 767


>gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 769

 Score =  451 bits (1161), Expect = e-144
 Identities = 260/751 (34%), Positives = 390/751 (51%), Gaps = 5/751 (0%)
 Frame = -2

Query: 2240 ENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELF 2061
            +NY+ W+ +M  ALR   KLGF+D + KK T+ +     W+  DS+V+ WI+NS  P L 
Sbjct: 5    DNYRNWARSMRTALRAKTKLGFIDRSIKKPTSTSPDYQHWERADSMVVAWIINSTDPILH 64

Query: 2060 AGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFD 1881
                ++ TA ++W DL++         +     N+        ++ +YY    S+  +  
Sbjct: 65   GSISHAMTAKDIWLDLRKL-------CLMQQETNV--------TVTEYYTKFKSIIDELR 109

Query: 1880 SMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLD-DTYVPIRSNILTKDPLPSVKSAFSII 1704
             +  LP CTC A  +  +  +  ++  FL GLD D +   +  IL  DPLPS+   F+ +
Sbjct: 110  ELQPLPECTCGAAKNLAQREEEHRVHLFLGGLDSDRFAHAKGIILNTDPLPSLLRVFNHV 169

Query: 1703 SGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1524
              EE+      G      + +AF S                                   
Sbjct: 170  LREETRVLTEKGKDHKIESGTAFHSSTFNKKKNRDGPKP--------------------- 208

Query: 1523 NLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSN 1344
                +C +C K GH   +C+ +VGYPA +  R  T N                   ST  
Sbjct: 209  ----RCDHCGKIGHDKTKCFEIVGYPANWNPRRNTRN-------------------STKR 245

Query: 1343 SPVSLSDEQMVRLMSLLNDSPAHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKN 1164
            +  S                   +N+A +   GT  +A          H + N +++  N
Sbjct: 246  TEHS-----------------GGANLAWENNQGTDGHALSGSQDSGGSHGSKN-DYMSGN 287

Query: 1163 TSLG--WIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNII 990
              +   W++DSGA+ HMT     L    D S + L +  P G   L+ + G MK+N NI 
Sbjct: 288  QMINDVWVLDSGASHHMTSLYSQLDEVQDFS-IPLRITVPIGDVVLVHKKGTMKLNENIK 346

Query: 989  LHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDV 810
            L++VL +PE+  +L+S+HKL  D    V++   +C IQD   KR +G G    G+Y+F  
Sbjct: 347  LYNVLFIPEFRCNLISIHKLTHDLNCVVTYSVDECVIQDQTRKRMIGFGRLCDGIYIFTQ 406

Query: 809  NTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLD-KTTDSTHICDTCHKAKQTR 633
                 ++  +      LWH R+GHPSDQVL  L   ++      +    CD CH++KQ R
Sbjct: 407  QVGGYSLVASSGDITTLWHARMGHPSDQVLSKLSTIISFSFNANNKMECCDICHRSKQCR 466

Query: 632  EPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYD 453
             PF L+ +K +K+  L+H D+WG Y   S +G  YFLTIVDD++RAVW+Y+LK+K +  +
Sbjct: 467  LPFSLNYNKVSKVFDLIHCDLWGKYHTASHNGSHYFLTIVDDFTRAVWIYLLKDKTETTN 526

Query: 452  SIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAER 273
             I+N+ +M+  QF++ +K+ RSDNGT+FVN+K+  FF + GI HQTSCV +PQQNG  ER
Sbjct: 527  VIINYYRMVQTQFDTKVKVVRSDNGTKFVNSKIHSFFQEVGILHQTSCVSSPQQNGRVER 586

Query: 272  KHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSH 93
            KHRH+LNVAR+L FQ  LPL  W ECVLTA +LINR P+    G +P+ ++YG  P+ +H
Sbjct: 587  KHRHILNVARALRFQANLPLTFWGECVLTAIHLINRTPTVANQGLTPYEMLYGKQPSYAH 646

Query: 92   IRVFGCLCYA-TILNNSDKFYSRSEKSVLIG 3
            IRVFGCLCYA T+   +DKF +++++ + IG
Sbjct: 647  IRVFGCLCYAKTLTKKTDKFEAQADRCIFIG 677


>ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
            1-94 isoform X1 [Vigna angularis]
          Length = 1472

 Score =  469 bits (1207), Expect = e-144
 Identities = 278/814 (34%), Positives = 417/814 (51%), Gaps = 28/814 (3%)
 Frame = -2

Query: 2360 MNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKL 2181
            M  E   + SE+         L  SD  GN I  V+L G ENY+ W+ A+  +LR   K 
Sbjct: 21   MAKEGEKSESEVVKKMSSPYDLSASDNPGNVITQVQLKG-ENYEEWAKAVKISLRARRKW 79

Query: 2180 GFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETY 2001
            GF+DGT  +   + S    W    S++++WILN++ P L +   Y + A ++W+D+KE +
Sbjct: 80   GFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNTIEPNLRSTIAYMENAKDLWDDIKERF 139

Query: 2000 DKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTC-----DAGIH 1836
              V+G  +  +   +    Q G ++  YY  L  LW +  +   +P C C     +    
Sbjct: 140  SIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKILWDELANYEQIPQCKCGGCKCNIATK 199

Query: 1835 FEKHNQLIKLMQFLMGLDDT-YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGS 1659
             EK  +  ++ QFLMGLDD  Y   RSN+L  DPLPS+   ++ +  EE  R  +     
Sbjct: 200  LEKRREEERVHQFLMGLDDEGYGTTRSNVLATDPLPSLNRVYATMVQEERVRMITRSKEE 259

Query: 1658 IKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHT 1479
                V                                          ++ C++C + GH 
Sbjct: 260  RGMIVGMVVQTETKGKLRNEVKEK-----------------------SIVCTHCGRTGHD 296

Query: 1478 IDRCYGLVGYPAGYGKRNFTSN-----------------VKPXXXXXXXXXXXXSLECST 1350
               C+ ++GYP  +G+R    N                 V P              +  T
Sbjct: 297  KRNCFEIIGYPDWWGERPRNENKSGGRHQQRTTFFRGKGVTPRVNIAHTSTSSSDSKSDT 356

Query: 1349 SNSPVS-LSDEQMVRLMSLLNDSPAHSN--MAGKCFSGTFFNASVKFNLKFEKHFNGNTN 1179
                V+ LS+EQ   L ++LN   A++   M GK                          
Sbjct: 357  KKPEVAGLSNEQWEILATMLNSHKANTTEKMTGK-------------------------- 390

Query: 1178 FLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINN 999
               K+  L WIVDSGA+ HMT +   L  +  +   G  VG PNG   L  + G + ++ 
Sbjct: 391  ---KSRDL-WIVDSGASNHMTGTLDNLWESRTLE--GCPVGLPNGELVLADKEGSVFLDG 444

Query: 998  NIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYL 819
             + L +VL VP+   +L+SV +L  ++K +V F +  C +QD   +  +G G +  GLY 
Sbjct: 445  GLKLENVLYVPKLNCNLISVSQLIDEAKCTVHFTDKFCAMQDHTLRMLIGAGERKDGLYW 504

Query: 818  FDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDST-HICDTCHKAK 642
            +   ++ KA   N      +WH+R+GHP+ Q++  + P++ + +   +T  +C+ C K+K
Sbjct: 505  YRGVSDVKAHHINTESQLEIWHKRMGHPAYQIVEKI-PNMTITRGDKNTSRVCEVCEKSK 563

Query: 641  QTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGD 462
            Q+R  FPLS+ +++ +  L+H D+WGPY+  S  G  YFLT+VDD SRAVW+Y+L +K  
Sbjct: 564  QSRNKFPLSDSQASNVFDLIHCDLWGPYRTLSSCGASYFLTLVDDCSRAVWIYLLNSKKG 623

Query: 461  VYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGI 282
            V  +++NF  ++  Q++  +K+ RSDNGTEF+   +QQ+F   GI HQTSC  TPQQNG 
Sbjct: 624  VSQTLMNFITLIERQYKKQVKMIRSDNGTEFM--CLQQYFQLHGILHQTSCTGTPQQNGR 681

Query: 281  AERKHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPT 102
             ERKH+H+LNVAR+L FQG+LPL  W EC+LTA YLINR PS++L GK+P+ +V G+ PT
Sbjct: 682  VERKHQHILNVARALRFQGQLPLKFWGECILTAGYLINRTPSTILQGKTPYEIVNGNPPT 741

Query: 101  LSHIRVFGCLCYATILN-NSDKFYSRSEKSVLIG 3
              H+RVFGCLCYA   + N DKF SRS KSV +G
Sbjct: 742  YDHLRVFGCLCYAHNQDRNGDKFASRSRKSVFVG 775


>gb|PNX93517.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1465

 Score =  468 bits (1204), Expect = e-143
 Identities = 279/782 (35%), Positives = 407/782 (52%), Gaps = 20/782 (2%)
 Frame = -2

Query: 2288 SDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCD 2109
            +D  G+ I  V+L G ENY  W+ ++  ALR   K GFVDGT  +    +S    W   +
Sbjct: 38   NDNPGSLITHVQLKG-ENYDEWASSIRTALRARKKFGFVDGTIGRPGEESSDLEDWWTNN 96

Query: 2108 SVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSS 1929
            S++++WI+N++ P L +   + + A ++WND+KE +   +G  +  +   +    Q G +
Sbjct: 97   SLLVSWIMNTIEPSLRSTMSHMEVAMDLWNDIKERFSIANGPRIQQLKAELVECKQKGLT 156

Query: 1928 LADYYNNLNSLWKQ---FDSMISLPT--CTCDAGIHFEKHNQLIKLMQFLMGLDDT-YVP 1767
            +  YY  L  LW++   +D +++     CTC+ G    K  +  K+ QFLMGLDDT Y  
Sbjct: 157  IVTYYGKLKKLWEELVNYDQILTCKCGLCTCNLGNQITKKREEEKIHQFLMGLDDTLYGT 216

Query: 1766 IRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXX 1587
            +RSN+L +DPLP++   ++ +  EE  R  +  T   +    AFA               
Sbjct: 217  VRSNLLAQDPLPTLNKVYATLVQEERLRMVTRVTEE-RGEAMAFAVHSKFKNKEKEE--- 272

Query: 1586 XXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYP------------- 1446
                                      CS+CN+ GH  + C+ L+GYP             
Sbjct: 273  -------------------------SCSHCNQVGHNSEGCFQLIGYPEWWGDRRRRPMKG 307

Query: 1445 AGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPAHSNM 1266
            +G GK   ++N                +    S +   L+ +Q+  L SLLN+    S  
Sbjct: 308  SGRGKPEQSNNRNRGGTAKAHVAQAKEITAEVSAADFGLTSDQLQTLSSLLNNVKLGS-- 365

Query: 1265 AGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTV 1086
                                EK  NG  +FL       WI+D+GA+ HMT   + L N  
Sbjct: 366  -------------------IEK-LNGKCSFLP------WIIDTGASHHMTGQLECLTNIR 399

Query: 1085 DVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSV 906
            ++ +   ++G PNG + + T+ G++ +N  + L +VL VP    +L+SV +L ++S   V
Sbjct: 400  NIFEC--SIGLPNGEETVATKEGNVVLNERLQLKNVLYVPSLQCNLISVSQLLKNSNYVV 457

Query: 905  SFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQ 726
             F +  C +QD   +  +G G Q  GLY         A+  N  VS +L HQRLGH S +
Sbjct: 458  QFTDKFCLVQDPTLRTPIGAGEQREGLYYLRGMVKAAAMKTNKEVSFDLLHQRLGHASLK 517

Query: 725  VLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITS 546
            VL +L       K    T  C+ C +AKQ+R+ FP+SE+K+     L+H D+WGPY+  +
Sbjct: 518  VLQMLPNVRPSSKNNSCTQTCEICLRAKQSRDNFPVSENKAATPFHLIHCDLWGPYRNAT 577

Query: 545  KDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFV 366
              G KYFLTIVDD+SRAVW+Y+L +K +V   +  F  M+  QF + +KI RSDNGTEF 
Sbjct: 578  FCGAKYFLTIVDDFSRAVWIYLLIDKTEVSKHLYQFLAMVERQFSAQVKIIRSDNGTEF- 636

Query: 365  NNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLT 186
               M+Q F   GI H+TSCV TPQQNG  ERKHRH+LNVAR+L FQ +LP+  W EC L 
Sbjct: 637  -TCMKQNFRDCGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQAQLPIEFWGECALA 695

Query: 185  AAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYA-TILNNSDKFYSRSEKSVL 9
            A YLINR P+  LSGK+P+ L+YG  P+L H+RV GCL YA    +  DKF +RS K V 
Sbjct: 696  ACYLINRTPTKTLSGKTPYELLYGKAPSLEHLRVVGCLAYAHNQHHKGDKFATRSRKCVF 755

Query: 8    IG 3
            +G
Sbjct: 756  VG 757


>ref|XP_017415203.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT
            1-94 isoform X2 [Vigna angularis]
          Length = 1435

 Score =  459 bits (1180), Expect = e-140
 Identities = 268/780 (34%), Positives = 405/780 (51%), Gaps = 28/780 (3%)
 Frame = -2

Query: 2258 VKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNS 2079
            V+L G ENY+ W+ A+  +LR   K GF+DGT  +   + S    W    S++++WILN+
Sbjct: 18   VQLKG-ENYEEWAKAVKISLRARRKWGFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNT 76

Query: 2078 LSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNS 1899
            + P L +   Y + A ++W+D+KE +  V+G  +  +   +    Q G ++  YY  L  
Sbjct: 77   IEPNLRSTIAYMENAKDLWDDIKERFSIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKI 136

Query: 1898 LWKQFDSMISLPTCTC-----DAGIHFEKHNQLIKLMQFLMGLDDT-YVPIRSNILTKDP 1737
            LW +  +   +P C C     +     EK  +  ++ QFLMGLDD  Y   RSN+L  DP
Sbjct: 137  LWDELANYEQIPQCKCGGCKCNIATKLEKRREEERVHQFLMGLDDEGYGTTRSNVLATDP 196

Query: 1736 LPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXX 1557
            LPS+   ++ +  EE  R  +         V                             
Sbjct: 197  LPSLNRVYATMVQEERVRMITRSKEERGMIVGMVVQTETKGKLRNEVKEK---------- 246

Query: 1556 XXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSN------------ 1413
                         ++ C++C + GH    C+ ++GYP  +G+R    N            
Sbjct: 247  -------------SIVCTHCGRTGHDKRNCFEIIGYPDWWGERPRNENKSGGRHQQRTTF 293

Query: 1412 -----VKPXXXXXXXXXXXXSLECSTSNSPVS-LSDEQMVRLMSLLNDSPAHSN--MAGK 1257
                 V P              +  T    V+ LS+EQ   L ++LN   A++   M GK
Sbjct: 294  FRGKGVTPRVNIAHTSTSSSDSKSDTKKPEVAGLSNEQWEILATMLNSHKANTTEKMTGK 353

Query: 1256 CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVS 1077
                                         K+  L WIVDSGA+ HMT +   L  +  + 
Sbjct: 354  -----------------------------KSRDL-WIVDSGASNHMTGTLDNLWESRTLE 383

Query: 1076 KLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFD 897
              G  VG PNG   L  + G + ++  + L +VL VP+   +L+SV +L  ++K +V F 
Sbjct: 384  --GCPVGLPNGELVLADKEGSVFLDGGLKLENVLYVPKLNCNLISVSQLIDEAKCTVHFT 441

Query: 896  ESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLH 717
            +  C +QD   +  +G G +  GLY +   ++ KA   N      +WH+R+GHP+ Q++ 
Sbjct: 442  DKFCAMQDHTLRMLIGAGERKDGLYWYRGVSDVKAHHINTESQLEIWHKRMGHPAYQIVE 501

Query: 716  VLKPSLNLDKTTDST-HICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKD 540
             + P++ + +   +T  +C+ C K+KQ+R  FPLS+ +++ +  L+H D+WGPY+  S  
Sbjct: 502  KI-PNMTITRGDKNTSRVCEVCEKSKQSRNKFPLSDSQASNVFDLIHCDLWGPYRTLSSC 560

Query: 539  GYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNN 360
            G  YFLT+VDD SRAVW+Y+L +K  V  +++NF  ++  Q++  +K+ RSDNGTEF+  
Sbjct: 561  GASYFLTLVDDCSRAVWIYLLNSKKGVSQTLMNFITLIERQYKKQVKMIRSDNGTEFM-- 618

Query: 359  KMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAA 180
             +QQ+F   GI HQTSC  TPQQNG  ERKH+H+LNVAR+L FQG+LPL  W EC+LTA 
Sbjct: 619  CLQQYFQLHGILHQTSCTGTPQQNGRVERKHQHILNVARALRFQGQLPLKFWGECILTAG 678

Query: 179  YLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILN-NSDKFYSRSEKSVLIG 3
            YLINR PS++L GK+P+ +V G+ PT  H+RVFGCLCYA   + N DKF SRS KSV +G
Sbjct: 679  YLINRTPSTILQGKTPYEIVNGNPPTYDHLRVFGCLCYAHNQDRNGDKFASRSRKSVFVG 738


Top