BLASTX nr result

ID: Chrysanthemum22_contig00041528 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00041528
         (806 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_023742945.1| uncharacterized protein LOC111891088 [Lactuc...   413   e-143
ref|XP_023750032.1| uncharacterized protein LOC111898332 [Lactuc...   412   e-141
ref|XP_021975260.1| uncharacterized protein LOC110870384 [Helian...   355   e-115
ref|XP_021995445.1| uncharacterized protein LOC110892596 [Helian...   339   e-115
ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helian...   341   e-114
ref|XP_022004432.1| uncharacterized protein LOC110901994 [Helian...   337   e-113
ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform...   339   e-113
ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform...   339   e-113
ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helian...   338   e-112
gb|OTF97514.1| putative GAG-pre-integrase domain, Gag-polypeptid...   343   e-112
ref|XP_022019315.1| uncharacterized protein LOC110919350 [Helian...   334   e-112
ref|XP_021991470.1| uncharacterized protein LOC110888246 [Helian...   332   e-111
ref|XP_021986813.1| uncharacterized protein LOC110883333 [Helian...   341   e-111
ref|XP_022007266.1| uncharacterized protein LOC110906440 [Helian...   329   e-111
ref|XP_022023560.1| uncharacterized protein LOC110923810 [Helian...   330   e-111
ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helian...   332   e-110
ref|XP_022008139.1| uncharacterized protein LOC110907465 [Helian...   328   e-109
gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA...   345   e-109
gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptid...   333   e-109
ref|XP_021991821.1| uncharacterized protein LOC110888610 [Helian...   326   e-108

>ref|XP_023742945.1| uncharacterized protein LOC111891088 [Lactuca sativa]
          Length = 278

 Score =  413 bits (1061), Expect = e-143
 Identities = 200/262 (76%), Positives = 219/262 (83%), Gaps = 3/262 (1%)
 Frame = +2

Query: 29  GDSGKKTH---EGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNK 199
           GD   KTH   E ++IDHNSPLYLHASDYPKQMHVN+VLTDKNY+DWEQEMMN +FAKNK
Sbjct: 3   GDDATKTHNQNEKEIIDHNSPLYLHASDYPKQMHVNEVLTDKNYNDWEQEMMNLLFAKNK 62

Query: 200 TGFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKER 379
           +GF+DGSIK+ ET+SEKYLPWM CDAMIKGWLTTAMEKEIRN+VKYAKTA EIW DLKER
Sbjct: 63  SGFVDGSIKRLETESEKYLPWMHCDAMIKGWLTTAMEKEIRNNVKYAKTATEIWQDLKER 122

Query: 380 FGKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXX 559
           FGKESAPKAYELKQ++ +TRQD TTVSAYYTRLRVLWDEME+ILPTPRC+C+ C      
Sbjct: 123 FGKESAPKAYELKQAMNNTRQDDTTVSAYYTRLRVLWDEMETILPTPRCSCNGCSCGLEK 182

Query: 560 XXXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGP 739
                    RTYEF +GLDDQFSVIKTQILAMKPTP LS  YHLVAED+QQR I +SK P
Sbjct: 183 KLTELKEKERTYEFFIGLDDQFSVIKTQILAMKPTPKLSTVYHLVAEDDQQRMITASKKP 242

Query: 740 VREVAAFQTGFQGVREQTRNQQ 805
            REVAAFQ  FQG RE TRN Q
Sbjct: 243 AREVAAFQASFQGRREPTRNSQ 264


>ref|XP_023750032.1| uncharacterized protein LOC111898332 [Lactuca sativa]
          Length = 451

 Score =  412 bits (1059), Expect = e-141
 Identities = 200/262 (76%), Positives = 219/262 (83%), Gaps = 3/262 (1%)
 Frame = +2

Query: 29  GDSGKKTH---EGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNK 199
           GD   KTH   E ++IDHNS LYL+ASDYPKQMHVN+VLTDKNY+DWEQEMMNF+FAKNK
Sbjct: 3   GDDATKTHNQNEKEIIDHNSLLYLYASDYPKQMHVNEVLTDKNYNDWEQEMMNFLFAKNK 62

Query: 200 TGFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKER 379
           TGF+DGSIK+PET+S+KYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKT  EIW DLKER
Sbjct: 63  TGFVDGSIKRPETESKKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTTTEIWQDLKER 122

Query: 380 FGKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXX 559
           FGKESAPKAYELKQ++ +TRQD TTVSAYYTRL VLWDEME+ILPTPRC+C+ C      
Sbjct: 123 FGKESAPKAYELKQAMNNTRQDDTTVSAYYTRLHVLWDEMETILPTPRCSCNGCLCGLAK 182

Query: 560 XXXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGP 739
                    RTYEFLMGLDDQFSVIKTQILAMKPTP LS  YHLVAEDEQQ+ I +SK P
Sbjct: 183 KLTELKEKERTYEFLMGLDDQFSVIKTQILAMKPTPKLSTVYHLVAEDEQQQMITASKKP 242

Query: 740 VREVAAFQTGFQGVREQTRNQQ 805
            RE+AAFQ  FQG RE  RN Q
Sbjct: 243 AREMAAFQASFQGRREPARNSQ 264


>ref|XP_021975260.1| uncharacterized protein LOC110870384 [Helianthus annuus]
          Length = 657

 Score =  355 bits (910), Expect = e-115
 Identities = 165/260 (63%), Positives = 198/260 (76%)
 Frame = +2

Query: 26  GGDSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTG 205
           G D  K  ++ + I+H+SP Y+HASDYP+QMHVNDVLTD NY DW QEMMNF+FAKNKTG
Sbjct: 3   GNDKEKPKNQENAINHDSPYYIHASDYPRQMHVNDVLTDNNYIDWAQEMMNFLFAKNKTG 62

Query: 206 FIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFG 385
           FIDG++KKPE  S +Y+PWMRCDAMIKGWL T+MEKEIRNSVKYA TA EIW+DLKERFG
Sbjct: 63  FIDGTMKKPEPTSTEYMPWMRCDAMIKGWLNTSMEKEIRNSVKYASTAEEIWSDLKERFG 122

Query: 386 KESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXX 565
           KESAP+AYELKQSL + RQDG ++SAYYT+LRVLWDE++S+LP P+C+C+ C        
Sbjct: 123 KESAPRAYELKQSLTNIRQDGASISAYYTKLRVLWDEIQSVLPIPKCSCNGCTYNVGNQL 182

Query: 566 XXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVR 745
                  + YEFL+GLD  F+ I+TQILAMKPTP L  AYHL AEDE+Q+ IA++  P  
Sbjct: 183 IELKEKEKLYEFLLGLDSDFTTIRTQILAMKPTPSLRTAYHLAAEDEKQQMIAATNRPAI 242

Query: 746 EVAAFQTGFQGVREQTRNQQ 805
              AFQ  F   RE    QQ
Sbjct: 243 NTTAFQVSFPSKREGGSGQQ 262


>ref|XP_021995445.1| uncharacterized protein LOC110892596 [Helianthus annuus]
          Length = 247

 Score =  339 bits (870), Expect = e-115
 Identities = 162/245 (66%), Positives = 187/245 (76%)
 Frame = +2

Query: 29  GDSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGF 208
           G  G KT  GD ID NSPLYLH SDYP+QM VND LTD N++DW QEM NF+FAKNK GF
Sbjct: 3   GGEGSKT--GDSIDPNSPLYLHPSDYPRQMQVNDALTDHNFNDWMQEMSNFLFAKNKIGF 60

Query: 209 IDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGK 388
           +DGSIKKPE   + Y+PWMRCDAM+KGWLTTAMEKEIR SVKYA TAAEIW DL ERFGK
Sbjct: 61  VDGSIKKPEDTDKDYMPWMRCDAMVKGWLTTAMEKEIRASVKYANTAAEIWKDLNERFGK 120

Query: 389 ESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXX 568
           ES P+AYELKQ+L  T+QDG +VSAYYT+LR +WDE+ ++LPTP C+C+ C         
Sbjct: 121 ESVPRAYELKQTLNVTKQDGASVSAYYTKLRRIWDEINTVLPTPNCSCNGCKCEVGKRLT 180

Query: 569 XXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVRE 748
                 R YEFL+GLD  F+VI+TQILAMKPTP LS AYH+VAEDEQQR +A+ K    E
Sbjct: 181 QLKEKERLYEFLLGLDSAFAVIRTQILAMKPTPSLSNAYHMVAEDEQQRNVATGKKIATE 240

Query: 749 VAAFQ 763
             AFQ
Sbjct: 241 SVAFQ 245


>ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helianthus annuus]
          Length = 369

 Score =  341 bits (875), Expect = e-114
 Identities = 163/258 (63%), Positives = 198/258 (76%)
 Frame = +2

Query: 32  DSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFI 211
           D G  + + D I+ NSP YLH SDYPKQ+HVND LTD N+SDW QEM NF+FAKNK GF+
Sbjct: 4   DQGSGSKDADQINPNSPYYLHPSDYPKQLHVNDSLTDSNFSDWIQEMTNFLFAKNKIGFV 63

Query: 212 DGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKE 391
           DG++KKPE  S++Y+ WMRCDAMIKGWLTTAMEKEIR SVKYA T+AEIW DL ERFGKE
Sbjct: 64  DGTLKKPEKTSKEYMAWMRCDAMIKGWLTTAMEKEIRTSVKYANTSAEIWKDLNERFGKE 123

Query: 392 SAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXX 571
           SAP+AY+LKQSL  TRQ+G +VSAYYT+LR +WDE+ ++LPTPRCTCD C          
Sbjct: 124 SAPRAYKLKQSLNVTRQNGVSVSAYYTKLRGIWDEINTVLPTPRCTCDGCSCEVGKKLVE 183

Query: 572 XXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVREV 751
                R YEFL+GLD  F+VI+TQILAMKPTP L AAYH+V+EDEQQR ++++K    E 
Sbjct: 184 LKEKERLYEFLLGLDADFAVIRTQILAMKPTPTLGAAYHMVSEDEQQRNLSTNKKGTVEN 243

Query: 752 AAFQTGFQGVREQTRNQQ 805
           AAFQ   Q  R++ + Q+
Sbjct: 244 AAFQAS-QFARKEGQTQR 260


>ref|XP_022004432.1| uncharacterized protein LOC110901994 [Helianthus annuus]
          Length = 276

 Score =  337 bits (863), Expect = e-113
 Identities = 160/247 (64%), Positives = 189/247 (76%), Gaps = 2/247 (0%)
 Frame = +2

Query: 29  GDSGK--KTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKT 202
           GD GK  K +EG ++DH+SP YLH SDYP+QMHVNDVLTD NY+DW QEM NF+FAKNK 
Sbjct: 3   GDEGKTEKKNEG-VLDHDSPYYLHPSDYPRQMHVNDVLTDGNYTDWSQEMQNFLFAKNKI 61

Query: 203 GFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERF 382
           GF+D +IKKPE  S  ++ WMRCDAMIKGWL TAMEKEIR SVKYA TA EIW DL+ERF
Sbjct: 62  GFVDRTIKKPEQGSSSHMAWMRCDAMIKGWLNTAMEKEIRTSVKYATTAREIWVDLRERF 121

Query: 383 GKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXX 562
           GKESAP+AYELKQSL  TRQ+GT+VS YYT+LR +WDE++S+LP PRC CD C       
Sbjct: 122 GKESAPRAYELKQSLTVTRQEGTSVSTYYTKLRTIWDEIQSVLPAPRCNCDGCTCGIGKK 181

Query: 563 XXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPV 742
                   R YE L+GLD +F  I+TQILAM+P P L AAYHLVA+DEQQR ++ +K P 
Sbjct: 182 LTELRDKERLYECLLGLDPEFGTIRTQILAMQPIPSLGAAYHLVADDEQQRAVSGTKRPT 241

Query: 743 REVAAFQ 763
            + AAFQ
Sbjct: 242 SDAAAFQ 248


>ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform X2 [Helianthus annuus]
          Length = 370

 Score =  339 bits (870), Expect = e-113
 Identities = 162/260 (62%), Positives = 196/260 (75%), Gaps = 2/260 (0%)
 Frame = +2

Query: 29  GDSGK--KTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKT 202
           GD GK  K +EG + DH+SP YLH S+YP+QMHVNDVLTD NY+DW QE+ NF+FAKNK 
Sbjct: 3   GDEGKTEKKNEG-VSDHDSPYYLHPSEYPRQMHVNDVLTDGNYTDWSQEIQNFLFAKNKI 61

Query: 203 GFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERF 382
           GF+DG+IKKPE  S  ++ WMRCDAMIKGWL TAMEKEIR SVKYA TA EIW DL+ERF
Sbjct: 62  GFVDGTIKKPEQGSSSHMAWMRCDAMIKGWLNTAMEKEIRTSVKYATTAREIWVDLRERF 121

Query: 383 GKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXX 562
           GKESAP+AYELKQSL  TRQ+GT+VS YYT+LR +WDE++S+LP PRC CD C       
Sbjct: 122 GKESAPRAYELKQSLTVTRQEGTSVSTYYTKLRTIWDEIQSVLPVPRCNCDGCTCGIGKK 181

Query: 563 XXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPV 742
                   R YEFL+GLD +F  I+TQILAM+P P L AAYHLVA+DEQQR ++ +K P 
Sbjct: 182 LTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPSLGAAYHLVADDEQQRAVSGTKRPT 241

Query: 743 REVAAFQTGFQGVREQTRNQ 802
            + AAFQ      R++ ++Q
Sbjct: 242 SDAAAFQAHVPIRRDKNQSQ 261


>ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform X1 [Helianthus annuus]
          Length = 374

 Score =  339 bits (870), Expect = e-113
 Identities = 162/260 (62%), Positives = 196/260 (75%), Gaps = 2/260 (0%)
 Frame = +2

Query: 29  GDSGK--KTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKT 202
           GD GK  K +EG + DH+SP YLH S+YP+QMHVNDVLTD NY+DW QE+ NF+FAKNK 
Sbjct: 3   GDEGKTEKKNEG-VSDHDSPYYLHPSEYPRQMHVNDVLTDGNYTDWSQEIQNFLFAKNKI 61

Query: 203 GFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERF 382
           GF+DG+IKKPE  S  ++ WMRCDAMIKGWL TAMEKEIR SVKYA TA EIW DL+ERF
Sbjct: 62  GFVDGTIKKPEQGSSSHMAWMRCDAMIKGWLNTAMEKEIRTSVKYATTAREIWVDLRERF 121

Query: 383 GKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXX 562
           GKESAP+AYELKQSL  TRQ+GT+VS YYT+LR +WDE++S+LP PRC CD C       
Sbjct: 122 GKESAPRAYELKQSLTVTRQEGTSVSTYYTKLRTIWDEIQSVLPVPRCNCDGCTCGIGKK 181

Query: 563 XXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPV 742
                   R YEFL+GLD +F  I+TQILAM+P P L AAYHLVA+DEQQR ++ +K P 
Sbjct: 182 LTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPSLGAAYHLVADDEQQRAVSGTKRPT 241

Query: 743 REVAAFQTGFQGVREQTRNQ 802
            + AAFQ      R++ ++Q
Sbjct: 242 SDAAAFQAHVPIRRDKNQSQ 261


>ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helianthus annuus]
          Length = 363

 Score =  338 bits (866), Expect = e-112
 Identities = 167/254 (65%), Positives = 193/254 (75%)
 Frame = +2

Query: 29  GDSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGF 208
           GD   K +E   ID +SPLYLH SDYP+QM VND LTD N++DW QEM NF+FAKNK GF
Sbjct: 3   GDDQPKPNEP--IDPSSPLYLHPSDYPRQMQVNDTLTDSNFNDWVQEMSNFLFAKNKIGF 60

Query: 209 IDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGK 388
           +DGSI+KPE  S++Y+PWMRCDAM+KGWLTT+M+KEIR SVKYA TA+EIW+DLKERFGK
Sbjct: 61  VDGSIRKPEHTSKEYMPWMRCDAMVKGWLTTSMDKEIRASVKYANTASEIWDDLKERFGK 120

Query: 389 ESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXX 568
           ESAP+AYELK+SL  TRQDG TVSAYYTRLR +WDE+  +LPTP CTCD C         
Sbjct: 121 ESAPRAYELKRSLHITRQDGGTVSAYYTRLRKIWDEINVVLPTPYCTCDGCKCDLGKKQV 180

Query: 569 XXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVRE 748
                 R YEFLMGLDD F VI+TQILAMKPTP L+ AYH+VAEDEQQR +   K    E
Sbjct: 181 QNKEKERLYEFLMGLDDDFGVIRTQILAMKPTPSLNNAYHMVAEDEQQRNMTGKKATF-E 239

Query: 749 VAAFQTGFQGVREQ 790
            AAFQ   Q  +EQ
Sbjct: 240 AAAFQVS-QNKKEQ 252


>gb|OTF97514.1| putative GAG-pre-integrase domain, Gag-polypeptide of LTR
           copia-type [Helianthus annuus]
          Length = 537

 Score =  343 bits (881), Expect = e-112
 Identities = 169/262 (64%), Positives = 193/262 (73%), Gaps = 2/262 (0%)
 Frame = +2

Query: 26  GGDSGKKTHEGDMI--DHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNK 199
           G +S K+  + D    DHNSP YLH SDYP+QMHVND LTD NY DW QEM NF+FAKNK
Sbjct: 3   GDNSNKENLKTDATGPDHNSPFYLHPSDYPRQMHVNDALTDNNYLDWVQEMENFLFAKNK 62

Query: 200 TGFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKER 379
            GFIDG+IKKPET    Y+ WMRCDAMIKGWLTTAMEKEIR SVKYA +A+EIW DL+ER
Sbjct: 63  RGFIDGTIKKPETDDINYMAWMRCDAMIKGWLTTAMEKEIRGSVKYANSASEIWKDLQER 122

Query: 380 FGKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXX 559
           FGKESAP+AYELKQ++++TRQDG TVSAYYT+LR LWDEM+S LPTP+C C+ C      
Sbjct: 123 FGKESAPRAYELKQAISNTRQDGMTVSAYYTKLRGLWDEMQSFLPTPKCKCNGCTCGLGK 182

Query: 560 XXXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGP 739
                    + YEFLMGLD +FS+I+TQILA KP P L  AYHLVAEDEQQRTIA  K  
Sbjct: 183 SLKELREKEQLYEFLMGLDREFSIIRTQILATKPIPSLGNAYHLVAEDEQQRTIAGGKRL 242

Query: 740 VREVAAFQTGFQGVREQTRNQQ 805
           V E  AFQ   +     TR  Q
Sbjct: 243 VNETVAFQATVKRNAPPTRTGQ 264


>ref|XP_022019315.1| uncharacterized protein LOC110919350 [Helianthus annuus]
          Length = 294

 Score =  334 bits (857), Expect = e-112
 Identities = 156/239 (65%), Positives = 183/239 (76%)
 Frame = +2

Query: 47  THEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFIDGSIK 226
           +H G+++D+NSP YLH SDYP+QMHVND L+DKNY+DW QEM NF+FAKNK GFIDGSIK
Sbjct: 13  SHGGNVVDYNSPFYLHPSDYPRQMHVNDALSDKNYADWVQEMENFLFAKNKIGFIDGSIK 72

Query: 227 KPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKESAPKA 406
           KPE  S+ Y+PWMR DAMIKGWLT AMEKEIR SVKYA TAA +W+DL ERFGKESAP+A
Sbjct: 73  KPEKTSKDYMPWMRVDAMIKGWLTAAMEKEIRGSVKYANTAAVMWSDLHERFGKESAPRA 132

Query: 407 YELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXXXXXXX 586
           YELK  +  T Q+G TVSAYYT+LR LWDE+ES+ P PRCTC+ C               
Sbjct: 133 YELKNKITATHQEGATVSAYYTKLRSLWDEIESVFPVPRCTCNGCTCDLGKRMVEHQEKE 192

Query: 587 RTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVREVAAFQ 763
           + YEFLMGLD++F VIKTQILA KPTP L   YHLVA+DE+Q+ I   K P  E AAF+
Sbjct: 193 KLYEFLMGLDNEFGVIKTQILATKPTPALGTVYHLVAKDERQKQITEDKKPSMETAAFK 251


>ref|XP_021991470.1| uncharacterized protein LOC110888246 [Helianthus annuus]
          Length = 278

 Score =  332 bits (850), Expect = e-111
 Identities = 160/253 (63%), Positives = 190/253 (75%), Gaps = 3/253 (1%)
 Frame = +2

Query: 29  GDSGKKTHE-GDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTG 205
           G+ G+ + + G+ ID NSP YLH SDYPKQ  VN+ L+D N++DW QEM NF+FAKNK G
Sbjct: 3   GEKGEGSKDNGETIDTNSPYYLHPSDYPKQFQVNENLSDSNFNDWSQEMTNFLFAKNKIG 62

Query: 206 FIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFG 385
           F+DGS+ KP+    KY+ WMRCDAMIKGWLTTAMEKEIR+SVKYA TA +IW DL ERFG
Sbjct: 63  FVDGSLLKPDKNDAKYMQWMRCDAMIKGWLTTAMEKEIRSSVKYANTALKIWKDLHERFG 122

Query: 386 KESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXX 565
           KESAP+AYELKQS+  TRQ+G +VSAY+T+LR LWDE++S+LPTPRC CD C        
Sbjct: 123 KESAPRAYELKQSVTQTRQEGVSVSAYFTKLRSLWDEIDSVLPTPRCECDGCTCNVVKKI 182

Query: 566 XXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSK--GP 739
                  R YEFLMGLD +FSV++TQILA KPTP L  AYHLVAEDEQQR IA+ K    
Sbjct: 183 TELKEKERLYEFLMGLDAEFSVMRTQILATKPTPSLGTAYHLVAEDEQQRNIAAGKKMAA 242

Query: 740 VREVAAFQTGFQG 778
             + AAFQT  QG
Sbjct: 243 TPDAAAFQTSQQG 255


>ref|XP_021986813.1| uncharacterized protein LOC110883333 [Helianthus annuus]
          Length = 565

 Score =  341 bits (874), Expect = e-111
 Identities = 163/260 (62%), Positives = 195/260 (75%), Gaps = 2/260 (0%)
 Frame = +2

Query: 29  GDSGK--KTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKT 202
           GD GK  K +EG + DH+SP YLH SDYP QMHVNDVLTD NY+DW QEM NF+FAKNK 
Sbjct: 3   GDEGKTEKKNEG-VSDHDSPYYLHPSDYPTQMHVNDVLTDGNYTDWSQEMQNFLFAKNKI 61

Query: 203 GFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERF 382
           GF+DG+IKKPE  S  ++ WMRCDAMIKGWL T MEKEIR SVKYA TA EIW DL+ERF
Sbjct: 62  GFVDGTIKKPEQGSSSHMAWMRCDAMIKGWLNTTMEKEIRTSVKYATTAREIWVDLRERF 121

Query: 383 GKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXX 562
           GKESAP+AYELKQSL  TRQ+GT+VS YYT+LR +WDE++S+LP PRC CD C       
Sbjct: 122 GKESAPRAYELKQSLTLTRQEGTSVSTYYTKLRTIWDEIQSVLPVPRCNCDGCTSGIGKK 181

Query: 563 XXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPV 742
                   R YEFL+GLD +F +I+TQILAM+P P L AAYHLVA+DEQQR ++ +K P 
Sbjct: 182 LIELRDKERLYEFLLGLDPEFGIIRTQILAMQPIPSLGAAYHLVADDEQQRAVSGTKRPT 241

Query: 743 REVAAFQTGFQGVREQTRNQ 802
            + AAFQ      R++ ++Q
Sbjct: 242 SDAAAFQAHVPIRRDKNQSQ 261


>ref|XP_022007266.1| uncharacterized protein LOC110906440 [Helianthus annuus]
          Length = 240

 Score =  329 bits (844), Expect = e-111
 Identities = 154/228 (67%), Positives = 181/228 (79%)
 Frame = +2

Query: 65  IDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFIDGSIKKPETKS 244
           ID +SPLYLH SDYP+QM VND LTD N++DW QEM NF+FAKNK GF+DGSI+KPE  S
Sbjct: 13  IDTSSPLYLHPSDYPRQMQVNDTLTDSNFNDWVQEMSNFLFAKNKIGFVDGSIRKPEHTS 72

Query: 245 EKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKESAPKAYELKQS 424
           ++Y+PWMRCDAM+KGWLTTAM+KEIR SVKYA TA+EIW DLKERFGKESAP+AYELK+S
Sbjct: 73  KEYMPWMRCDAMVKGWLTTAMDKEIRASVKYANTASEIWGDLKERFGKESAPRAYELKRS 132

Query: 425 LADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXXXXXXXRTYEFL 604
           L  TRQDG TVSAYYTRLR +WDE+  +LPTP CTC+ C               R YEFL
Sbjct: 133 LHITRQDGGTVSAYYTRLRKIWDEINVVLPTPYCTCNGCKCDLGRKQVQNKEKERLYEFL 192

Query: 605 MGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVRE 748
           MGLDD F VI+TQILAMKPTP L+ AYH+VAEDEQQR +   +  +++
Sbjct: 193 MGLDDDFGVIRTQILAMKPTPSLNNAYHMVAEDEQQRNMTGKRQHLKQ 240


>ref|XP_022023560.1| uncharacterized protein LOC110923810 [Helianthus annuus]
          Length = 272

 Score =  330 bits (845), Expect = e-111
 Identities = 158/247 (63%), Positives = 184/247 (74%)
 Frame = +2

Query: 29  GDSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGF 208
           G  G K +  D+ID NSPLYLH SDYP+QM VND LTD N++DW QEM  F+FAKNK GF
Sbjct: 3   GTDGSKAN--DLIDPNSPLYLHPSDYPRQMQVNDSLTDNNFNDWVQEMTEFLFAKNKFGF 60

Query: 209 IDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGK 388
           +D +IKKPE   + Y PWMRCDAM+KGWL TAMEK+IR SVKYA T  EIW DL ERFGK
Sbjct: 61  VDETIKKPEKSHKDYTPWMRCDAMVKGWLKTAMEKDIRASVKYANTTPEIWKDLNERFGK 120

Query: 389 ESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXX 568
           ES P+AYELKQSL  TRQDG +VSAYYT+LR +WDE+  +LP P+C+CD C         
Sbjct: 121 ESTPRAYELKQSLNVTRQDGASVSAYYTKLRRIWDEINEVLPIPQCSCDGCKCGVGKRLV 180

Query: 569 XXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVRE 748
                 R YEFL+GLD++F+VI+TQILAMKPTP LS AYH+VAEDEQQR +A  K    E
Sbjct: 181 ELKEKERLYEFLLGLDNEFAVIRTQILAMKPTPTLSNAYHMVAEDEQQRNLAFGKKTTSE 240

Query: 749 VAAFQTG 769
           VAAFQ G
Sbjct: 241 VAAFQAG 247


>ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helianthus annuus]
          Length = 356

 Score =  332 bits (852), Expect = e-110
 Identities = 161/264 (60%), Positives = 195/264 (73%), Gaps = 6/264 (2%)
 Frame = +2

Query: 32  DSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFI 211
           D+ KKT E   +D NSP ++HAS+YP+QMHVND LTD  Y+ W QEM+NF+FAKNK GFI
Sbjct: 4   DALKKTKETH-VDSNSPYFIHASEYPRQMHVNDALTDIIYNAWSQEMVNFLFAKNKIGFI 62

Query: 212 DGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKE 391
           DG++KKPE     Y+ WMRCDAMIKGWLTTAME++IRNSVKYA TA+E+W+DLKERFGKE
Sbjct: 63  DGTMKKPEKTDSTYMQWMRCDAMIKGWLTTAMERDIRNSVKYANTASEMWSDLKERFGKE 122

Query: 392 SAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXX 571
           SAP+AYELKQ+L +TRQDG++VSAYYTRLR LWDE+ ++ P PRC+C+ C          
Sbjct: 123 SAPRAYELKQTLNNTRQDGSSVSAYYTRLRALWDEIHTVFPAPRCSCNRCSCEVGKKISE 182

Query: 572 XXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVREV 751
                R YEFLMGLD +FSV++TQILAM PTP L   YHLVAEDEQQR I   K    EV
Sbjct: 183 QKEKERVYEFLMGLDGEFSVMRTQILAMNPTPSLGTTYHLVAEDEQQRAIIGGKKTNPEV 242

Query: 752 AAFQ------TGFQGVREQTRNQQ 805
           A FQ      +G QG +   R+ +
Sbjct: 243 ATFQAYAPRNSGTQGTKSTQRDSK 266


>ref|XP_022008139.1| uncharacterized protein LOC110907465 [Helianthus annuus]
          Length = 309

 Score =  328 bits (840), Expect = e-109
 Identities = 156/251 (62%), Positives = 185/251 (73%)
 Frame = +2

Query: 50  HEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFIDGSIKK 229
           +E  + DHNSP Y+H SDYP+Q+HVNDVLTD+NY+DW QEM+NF+FAKNK GFIDGSIKK
Sbjct: 12  NEDAVNDHNSPYYIHPSDYPRQLHVNDVLTDRNYTDWSQEMLNFLFAKNKMGFIDGSIKK 71

Query: 230 PETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKESAPKAY 409
           PE  S  Y+ WMRCDAM+KGWL TAMEKEIR SVKY  TA EIW DLKERFGK +AP+AY
Sbjct: 72  PEPNSSAYMAWMRCDAMLKGWLNTAMEKEIRTSVKYTCTAQEIWADLKERFGKGNAPRAY 131

Query: 410 ELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXXXXXXXR 589
           ELKQ L   +Q+GTTVSAYYT+L+ +WDE++S LPTP C C+ C               R
Sbjct: 132 ELKQLLTTMKQEGTTVSAYYTKLQSIWDEIQSALPTPVCGCNGCKCEIGKKLHDLREKER 191

Query: 590 TYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVREVAAFQTG 769
            YEFL+GLD +F  I+TQILAMKPTP L  AYHLVAEDEQQR I S +    + AAFQ  
Sbjct: 192 LYEFLLGLDCEFGTIRTQILAMKPTPSLGTAYHLVAEDEQQRAITSGRRSTVDAAAFQAF 251

Query: 770 FQGVREQTRNQ 802
               ++Q  +Q
Sbjct: 252 IPKRKDQNVSQ 262


>gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
           Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 938

 Score =  345 bits (886), Expect = e-109
 Identities = 167/260 (64%), Positives = 196/260 (75%)
 Frame = +2

Query: 26  GGDSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTG 205
           G D G K  EG   D NSPLY+HASDYPKQMHVND LTD NY+DW QEM+NF+FAKNK G
Sbjct: 3   GNDEGTKK-EGSSPDINSPLYIHASDYPKQMHVNDTLTDNNYTDWSQEMLNFLFAKNKVG 61

Query: 206 FIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFG 385
           F+DG++KKPE  +  Y+ WMRCDAM+KGWLTTAMEK+IR SVKYA TA+EIW+DL+ERFG
Sbjct: 62  FVDGTLKKPEKTATDYMAWMRCDAMVKGWLTTAMEKDIRGSVKYANTASEIWSDLRERFG 121

Query: 386 KESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXX 565
           K SAP+AYELKQ+L++T Q G++VSAYYT+LRVLWDE+ES+LP PRCTCD C        
Sbjct: 122 KASAPRAYELKQTLSNTHQSGSSVSAYYTKLRVLWDEIESVLPAPRCTCDKCSCGVGKKM 181

Query: 566 XXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVR 745
                  R YEFLMGLD  F+VIKTQILAM P P L  AYHLVAEDE+QR I+  K    
Sbjct: 182 NELREKERLYEFLMGLDADFAVIKTQILAMNPIPTLGNAYHLVAEDERQRMISGEKKTPT 241

Query: 746 EVAAFQTGFQGVREQTRNQQ 805
           E AAF+  F+ VR +    Q
Sbjct: 242 ENAAFK-AFKPVRRENSTSQ 260


>gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptide of LTR
           copia-type [Helianthus annuus]
          Length = 520

 Score =  333 bits (854), Expect = e-109
 Identities = 160/258 (62%), Positives = 189/258 (73%)
 Frame = +2

Query: 32  DSGKKTHEGDMIDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAKNKTGFI 211
           ++ K+  EG+ +D NSP Y+H SDYPKQM VND L D NY+DW QEM NF+FAKNK GF+
Sbjct: 6   ETNKRKTEGESLDSNSPYYIHPSDYPKQMQVNDALNDGNYNDWAQEMENFLFAKNKIGFV 65

Query: 212 DGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLKERFGKE 391
            GSIKKPE  S+ Y+PWMRCDAMIKGWLTTAMEKEIR+SVKYA TAAEIW+DLKERFGKE
Sbjct: 66  VGSIKKPEKGSQTYMPWMRCDAMIKGWLTTAMEKEIRSSVKYANTAAEIWSDLKERFGKE 125

Query: 392 SAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXXXXXXXX 571
           SAP AYELKQ+L+ T Q  T+VSAY+T+LR +WDEM+S  P PRC C  C          
Sbjct: 126 SAPHAYELKQTLSATVQGDTSVSAYFTKLRSIWDEMQSAFPIPRCKCSGCSCDVGRKLVE 185

Query: 572 XXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSKGPVREV 751
                R YEFLMGL+  FSVI+TQIL M PTP L+ AYHLVAEDE+QR I S + P  + 
Sbjct: 186 HKESERLYEFLMGLNSDFSVIRTQILTMNPTPTLTNAYHLVAEDERQRAITSERRPSTDA 245

Query: 752 AAFQTGFQGVREQTRNQQ 805
            AF+    G RE   +Q+
Sbjct: 246 VAFKAFVPGRRENNSSQR 263


>ref|XP_021991821.1| uncharacterized protein LOC110888610 [Helianthus annuus]
          Length = 332

 Score =  326 bits (835), Expect = e-108
 Identities = 156/250 (62%), Positives = 188/250 (75%), Gaps = 4/250 (1%)
 Frame = +2

Query: 26  GGDSGKKTHEGDM----IDHNSPLYLHASDYPKQMHVNDVLTDKNYSDWEQEMMNFMFAK 193
           G D+  KT +G+     I H+SP YLH SDYPKQ+HVNDVLTD NY+DW+QEMMNF+FAK
Sbjct: 3   GDDTSNKTKDGEGYGGGISHDSPYYLHPSDYPKQLHVNDVLTDCNYADWKQEMMNFLFAK 62

Query: 194 NKTGFIDGSIKKPETKSEKYLPWMRCDAMIKGWLTTAMEKEIRNSVKYAKTAAEIWNDLK 373
           NK  F+DGSIKKPE  S+ Y+PWMR DAMIKGWLTTAMEK IRNSVKYA TA+EIW+DL 
Sbjct: 63  NKAEFVDGSIKKPEKASKDYMPWMRVDAMIKGWLTTAMEKSIRNSVKYASTASEIWSDLD 122

Query: 374 ERFGKESAPKAYELKQSLADTRQDGTTVSAYYTRLRVLWDEMESILPTPRCTCDNCXXXX 553
           ERFGKESAP+AY+LKQ +A TRQ G +VSAY+T+LR LWDE +S+ P P+C+CD C    
Sbjct: 123 ERFGKESAPRAYKLKQKIAATRQGGNSVSAYFTQLRSLWDEAQSVQPFPQCSCDKCECDV 182

Query: 554 XXXXXXXXXXXRTYEFLMGLDDQFSVIKTQILAMKPTPDLSAAYHLVAEDEQQRTIASSK 733
                        YEFLMGLD +F+VIKTQILA KP P L+ AYH+V +DE+QR ++S  
Sbjct: 183 GKRIFEYQEKEHLYEFLMGLDTEFAVIKTQILATKPVPSLTVAYHMVHDDEKQRAVSSEN 242

Query: 734 GPVREVAAFQ 763
               E AAF+
Sbjct: 243 KTHTESAAFK 252

