BLASTX nr result

ID: Chrysanthemum21_contig00036170 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00036170
         (777 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022019676.1| uncharacterized protein LOC110919724 [Helian...   460   e-161
gb|OTF97514.1| putative GAG-pre-integrase domain, Gag-polypeptid...   443   e-152
ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform...   373   e-126
ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform...   373   e-126
ref|XP_023754965.1| uncharacterized protein LOC111903421 [Lactuc...   363   e-122
ref|XP_023740913.1| uncharacterized protein LOC111888988 [Lactuc...   362   e-122
ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helian...   358   e-121
gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptid...   362   e-120
ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helian...   356   e-120
ref|XP_022008139.1| uncharacterized protein LOC110907465 [Helian...   344   e-116
gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA...   363   e-116
ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helian...   344   e-115
ref|XP_023734935.1| uncharacterized protein LOC111882795 [Lactuc...   340   e-115
ref|XP_022004670.1| uncharacterized protein LOC110902278 [Helian...   348   e-114
ref|XP_021975260.1| uncharacterized protein LOC110870384 [Helian...   352   e-114
ref|XP_021975442.1| uncharacterized protein LOC110870567 [Helian...   343   e-113
ref|XP_021979891.1| uncharacterized protein LOC110876015 [Helian...   345   e-113
ref|XP_023750032.1| uncharacterized protein LOC111898332 [Lactuc...   340   e-112
ref|XP_022004456.1| uncharacterized protein LOC110902021 [Helian...   340   e-112
ref|XP_021999527.1| uncharacterized protein LOC110896564 [Helian...   335   e-111

>ref|XP_022019676.1| uncharacterized protein LOC110919724 [Helianthus annuus]
          Length = 373

 Score =  460 bits (1184), Expect = e-161
 Identities = 214/258 (82%), Positives = 234/258 (90%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NYLDWVQEMENFLFAKNK+GF+DGTIKRP+AGDA+ MAWMRCDAMIKGWLTTAMEKEIRG
Sbjct: 44  NYLDWVQEMENFLFAKNKIGFVDGTIKRPKAGDASYMAWMRCDAMIKGWLTTAMEKEIRG 103

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA SASEIWKDLKE+FGKES PRAYELKQAISN ++EGM VSAYYTKLRGLWDEM+S
Sbjct: 104 SVKYANSASEIWKDLKEQFGKESVPRAYELKQAISNIKQEGMMVSAYYTKLRGLWDEMQS 163

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
           FLP  ICKC GCTC IG+SL EL+EKEQLYEFL+GLDG+FSIIRTQILAT PIPSLG AY
Sbjct: 164 FLPATICKCNGCTCGIGRSLTELREKEQLYEFLLGLDGEFSIIRTQILATKPIPSLGTAY 223

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKRGGSSNRVNQKEEKSVAHCGECGKDGHTRD 722
           HLVAEDE+QR I GG+KP+SETMAFQ+ MKRGG+ NRV QK+ K   H   CG+DGHTRD
Sbjct: 224 HLVAEDEQQRTIAGGKKPVSETMAFQASMKRGGTPNRVGQKDNKGATHFNHCGRDGHTRD 283

Query: 723 GCFKIIGYPEWWNVKNKR 776
           GCFKI+GYPEWWNVKNKR
Sbjct: 284 GCFKIVGYPEWWNVKNKR 301


>gb|OTF97514.1| putative GAG-pre-integrase domain, Gag-polypeptide of LTR
           copia-type [Helianthus annuus]
          Length = 537

 Score =  443 bits (1140), Expect = e-152
 Identities = 208/258 (80%), Positives = 227/258 (87%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NYLDWVQEMENFLFAKNK GFIDGTIK+PE  D N MAWMRCDAMIKGWLTTAMEKEIRG
Sbjct: 45  NYLDWVQEMENFLFAKNKRGFIDGTIKKPETDDINYMAWMRCDAMIKGWLTTAMEKEIRG 104

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA SASEIWKDL+ERFGKESAPRAYELKQAISNTR++GMTVSAYYTKLRGLWDEM+S
Sbjct: 105 SVKYANSASEIWKDLQERFGKESAPRAYELKQAISNTRQDGMTVSAYYTKLRGLWDEMQS 164

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
           FLP P CKC GCTC +GKSL+EL+EKEQLYEFLMGLD +FSIIRTQILAT PIPSLGNAY
Sbjct: 165 FLPTPKCKCNGCTCGLGKSLKELREKEQLYEFLMGLDREFSIIRTQILATKPIPSLGNAY 224

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKRGGSSNRVNQKEEKSVAHCGECGKDGHTRD 722
           HLVAEDE+QR I GG++ ++ET+AFQ+ +KR     R  QK+EK   HC  CGKDGHTRD
Sbjct: 225 HLVAEDEQQRTIAGGKRLVNETVAFQATVKRNAPPTRTGQKDEKPSGHCDHCGKDGHTRD 284

Query: 723 GCFKIIGYPEWWNVKNKR 776
           GCFK +GYPEWW  KNKR
Sbjct: 285 GCFKRVGYPEWWPGKNKR 302


>ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform X2 [Helianthus annuus]
          Length = 370

 Score =  373 bits (957), Expect = e-126
 Identities = 175/267 (65%), Positives = 212/267 (79%), Gaps = 9/267 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QE++NFLFAKNK+GF+DGTIK+PE G ++ MAWMRCDAMIKGWL TAMEKEIR 
Sbjct: 43  NYTDWSQEIQNFLFAKNKIGFVDGTIKKPEQGSSSHMAWMRCDAMIKGWLNTAMEKEIRT 102

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYAT+A EIW DL+ERFGKESAPRAYELKQ+++ TR+EG +VS YYTKLR +WDE++S
Sbjct: 103 SVKYATTAREIWVDLRERFGKESAPRAYELKQSLTVTRQEGTSVSTYYTKLRTIWDEIQS 162

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP+P C C+GCTC IGK L EL++KE+LYEFL+GLD +F  IRTQILA  PIPSLG AY
Sbjct: 163 VLPVPRCNCDGCTCGIGKKLTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPSLGAAY 222

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFM----KRGGSSNRVNQKEEK-----SVAHCGE 695
           HLVA+DE+QRA+ G ++P S+  AFQ+ +     +  S NRV QK+ K      + HC  
Sbjct: 223 HLVADDEQQRAVSGTKRPTSDAAAFQAHVPIRRDKNQSQNRVKQKDAKRSGTDEIEHCTF 282

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNKR 776
           CGKDGH +DGCFK IGYPEWW  K K+
Sbjct: 283 CGKDGHNKDGCFKRIGYPEWWPGKGKQ 309


>ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform X1 [Helianthus annuus]
          Length = 374

 Score =  373 bits (957), Expect = e-126
 Identities = 175/267 (65%), Positives = 212/267 (79%), Gaps = 9/267 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QE++NFLFAKNK+GF+DGTIK+PE G ++ MAWMRCDAMIKGWL TAMEKEIR 
Sbjct: 43  NYTDWSQEIQNFLFAKNKIGFVDGTIKKPEQGSSSHMAWMRCDAMIKGWLNTAMEKEIRT 102

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYAT+A EIW DL+ERFGKESAPRAYELKQ+++ TR+EG +VS YYTKLR +WDE++S
Sbjct: 103 SVKYATTAREIWVDLRERFGKESAPRAYELKQSLTVTRQEGTSVSTYYTKLRTIWDEIQS 162

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP+P C C+GCTC IGK L EL++KE+LYEFL+GLD +F  IRTQILA  PIPSLG AY
Sbjct: 163 VLPVPRCNCDGCTCGIGKKLTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPSLGAAY 222

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFM----KRGGSSNRVNQKEEK-----SVAHCGE 695
           HLVA+DE+QRA+ G ++P S+  AFQ+ +     +  S NRV QK+ K      + HC  
Sbjct: 223 HLVADDEQQRAVSGTKRPTSDAAAFQAHVPIRRDKNQSQNRVKQKDAKRSGTDEIEHCTF 282

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNKR 776
           CGKDGH +DGCFK IGYPEWW  K K+
Sbjct: 283 CGKDGHNKDGCFKRIGYPEWWPGKGKQ 309


>ref|XP_023754965.1| uncharacterized protein LOC111903421 [Lactuca sativa]
          Length = 384

 Score =  363 bits (932), Expect = e-122
 Identities = 173/265 (65%), Positives = 204/265 (76%), Gaps = 8/265 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NYLDWVQEMENFLFAKNK+GF+DGT+++PE   AN M W+RCDAMIKGWLTTAME+EIR 
Sbjct: 47  NYLDWVQEMENFLFAKNKIGFVDGTLQKPEKTHANHMGWLRCDAMIKGWLTTAMEREIRS 106

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA++A EIW DL+ERFGKESAPRAYELKQ ++ T+++G +VSAYYTKLR LWDE+ S
Sbjct: 107 SVKYASTAEEIWNDLRERFGKESAPRAYELKQLLTTTKQDGASVSAYYTKLRALWDEISS 166

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
              IP C C GC C I K L EL++KE+LYEFL+GLD + S IRTQILA  PIP+LG AY
Sbjct: 167 VFNIPKCSCAGCKCGISKRLTELRDKERLYEFLLGLDSELSTIRTQILAMKPIPTLGEAY 226

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKRGG---SSNRVNQKEEK-----SVAHCGEC 698
           HLVAEDE+QRAI  G++  SET AFQ+ +K      S  ++  +  K      V HC  C
Sbjct: 227 HLVAEDEQQRAISTGKRTNSETAAFQAHIKHDADVWSQRKMGPRNGKRGGNDKVEHCDFC 286

Query: 699 GKDGHTRDGCFKIIGYPEWWNVKNK 773
           GKDGHTRDGCFK IGYPEWW  K K
Sbjct: 287 GKDGHTRDGCFKRIGYPEWWPGKGK 311


>ref|XP_023740913.1| uncharacterized protein LOC111888988 [Lactuca sativa]
          Length = 384

 Score =  362 bits (928), Expect = e-122
 Identities = 172/265 (64%), Positives = 203/265 (76%), Gaps = 8/265 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NYLDWVQEMENFLFAKNK+GF+DGT+++PE   AN M W+RCDAMIKGWLTTAME+EIR 
Sbjct: 47  NYLDWVQEMENFLFAKNKIGFVDGTLQKPEKTHANHMGWLRCDAMIKGWLTTAMEREIRS 106

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA++A EIW DL+ERFGKESAPRAYELKQ ++ T+++G +VSAYYTKLR LWDE+ S
Sbjct: 107 SVKYASTAEEIWNDLRERFGKESAPRAYELKQLLTTTKQDGASVSAYYTKLRALWDEISS 166

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
              IP C C GC C I K L EL++KE+LYEFL+GLD + S IRTQILA  PIP+LG AY
Sbjct: 167 VFNIPKCSCAGCKCGISKRLTELRDKERLYEFLLGLDSELSTIRTQILAMKPIPTLGEAY 226

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKRGG---SSNRVNQKEEK-----SVAHCGEC 698
           HLVAEDE+QRAI  G++  SET  FQ+ +K      S  ++  +  K      V HC  C
Sbjct: 227 HLVAEDEQQRAISTGKRTNSETATFQAHIKHDADVWSQRKMGPRNGKRGGNDKVEHCDFC 286

Query: 699 GKDGHTRDGCFKIIGYPEWWNVKNK 773
           GKDGHTRDGCFK IGYPEWW  K K
Sbjct: 287 GKDGHTRDGCFKRIGYPEWWPGKGK 311


>ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helianthus annuus]
          Length = 369

 Score =  358 bits (920), Expect = e-121
 Identities = 168/267 (62%), Positives = 206/267 (77%), Gaps = 9/267 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           N+ DW+QEM NFLFAKNK+GF+DGT+K+PE      MAWMRCDAMIKGWLTTAMEKEIR 
Sbjct: 42  NFSDWIQEMTNFLFAKNKIGFVDGTLKKPEKTSKEYMAWMRCDAMIKGWLTTAMEKEIRT 101

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +++EIWKDL ERFGKESAPRAY+LKQ+++ TR+ G++VSAYYTKLRG+WDE+ +
Sbjct: 102 SVKYANTSAEIWKDLNERFGKESAPRAYKLKQSLNVTRQNGVSVSAYYTKLRGIWDEINT 161

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C+GC+C++GK L ELKEKE+LYEFL+GLD DF++IRTQILA  P P+LG AY
Sbjct: 162 VLPTPRCTCDGCSCEVGKKLVELKEKERLYEFLLGLDADFAVIRTQILAMKPTPTLGAAY 221

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFM---KRGGSSNRVNQKEEKSVA------HCGE 695
           H+V+EDE+QR +   +K   E  AFQ+     K G +  R+  K+EK         HC  
Sbjct: 222 HMVSEDEQQRNLSTNKKGTVENAAFQASQFARKEGQTQRRIWSKQEKGSGLINKNEHCTF 281

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNKR 776
           CGKDGH RDGCFK IGYPEWW  K K+
Sbjct: 282 CGKDGHNRDGCFKRIGYPEWWPGKGKK 308


>gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptide of LTR
           copia-type [Helianthus annuus]
          Length = 520

 Score =  362 bits (929), Expect = e-120
 Identities = 172/265 (64%), Positives = 204/265 (76%), Gaps = 7/265 (2%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QEMENFLFAKNK+GF+ G+IK+PE G    M WMRCDAMIKGWLTTAMEKEIR 
Sbjct: 44  NYNDWAQEMENFLFAKNKIGFVVGSIKKPEKGSQTYMPWMRCDAMIKGWLTTAMEKEIRS 103

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +A+EIW DLKERFGKESAP AYELKQ +S T +   +VSAY+TKLR +WDEM+S
Sbjct: 104 SVKYANTAAEIWSDLKERFGKESAPHAYELKQTLSATVQGDTSVSAYFTKLRSIWDEMQS 163

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
             PIP CKC GC+CD+G+ L E KE E+LYEFLMGL+ DFS+IRTQIL  NP P+L NAY
Sbjct: 164 AFPIPRCKCSGCSCDVGRKLVEHKESERLYEFLMGLNSDFSVIRTQILTMNPTPTLTNAY 223

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFM---KRGGSSNRVNQKEEKSVA----HCGECG 701
           HLVAEDE+QRAI   R+P ++ +AF++F+   +   SS R ++   K V     HC  CG
Sbjct: 224 HLVAEDERQRAITSERRPSTDAVAFKAFVPGRRENNSSQRRDKPASKDVKHAADHCTFCG 283

Query: 702 KDGHTRDGCFKIIGYPEWWNVKNKR 776
           KDGHTRDGCFK+IG+PEWW    KR
Sbjct: 284 KDGHTRDGCFKLIGFPEWWPGNRKR 308


>ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helianthus annuus]
          Length = 356

 Score =  356 bits (913), Expect = e-120
 Identities = 165/264 (62%), Positives = 202/264 (76%), Gaps = 7/264 (2%)
 Frame = +3

Query: 6   YLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRGS 185
           Y  W QEM NFLFAKNK+GFIDGT+K+PE  D+  M WMRCDAMIKGWLTTAME++IR S
Sbjct: 42  YNAWSQEMVNFLFAKNKIGFIDGTMKKPEKTDSTYMQWMRCDAMIKGWLTTAMERDIRNS 101

Query: 186 VKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKSF 365
           VKYA +ASE+W DLKERFGKESAPRAYELKQ ++NTR++G +VSAYYT+LR LWDE+ + 
Sbjct: 102 VKYANTASEMWSDLKERFGKESAPRAYELKQTLNNTRQDGSSVSAYYTRLRALWDEIHTV 161

Query: 366 LPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAYH 545
            P P C C  C+C++GK + E KEKE++YEFLMGLDG+FS++RTQILA NP PSLG  YH
Sbjct: 162 FPAPRCSCNRCSCEVGKKISEQKEKERVYEFLMGLDGEFSVMRTQILAMNPTPSLGTTYH 221

Query: 546 LVAEDEKQRAIVGGRKPMSETMAFQSFMKR--GGSSNRVNQKEEKSV-----AHCGECGK 704
           LVAEDE+QRAI+GG+K   E   FQ++  R  G    +  Q++ K +      HC  CG+
Sbjct: 222 LVAEDEQQRAIIGGKKTNPEVATFQAYAPRNSGTQGTKSTQRDSKRIQNDRSEHCDFCGR 281

Query: 705 DGHTRDGCFKIIGYPEWWNVKNKR 776
           DGH ++GCFK I YPEWW  K KR
Sbjct: 282 DGHNKEGCFKRICYPEWWPGKGKR 305


>ref|XP_022008139.1| uncharacterized protein LOC110907465 [Helianthus annuus]
          Length = 309

 Score =  344 bits (883), Expect = e-116
 Identities = 165/266 (62%), Positives = 200/266 (75%), Gaps = 9/266 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QEM NFLFAKNKMGFIDG+IK+PE   +  MAWMRCDAM+KGWL TAMEKEIR 
Sbjct: 44  NYTDWSQEMLNFLFAKNKMGFIDGSIKKPEPNSSAYMAWMRCDAMLKGWLNTAMEKEIRT 103

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKY  +A EIW DLKERFGK +APRAYELKQ ++  ++EG TVSAYYTKL+ +WDE++S
Sbjct: 104 SVKYTCTAQEIWADLKERFGKGNAPRAYELKQLLTTMKQEGTTVSAYYTKLQSIWDEIQS 163

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P+C C GC C+IGK L +L+EKE+LYEFL+GLD +F  IRTQILA  P PSLG AY
Sbjct: 164 ALPTPVCGCNGCKCEIGKKLHDLREKERLYEFLLGLDCEFGTIRTQILAMKPTPSLGTAY 223

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKR----GGSSNRVNQKEEK-----SVAHCGE 695
           HLVAEDE+QRAI  GR+   +  AFQ+F+ +      S ++V +K+ K      + HC  
Sbjct: 224 HLVAEDEQQRAITSGRRSTVDAAAFQAFIPKRKDQNVSQSKVGKKDGKRTEVERLEHCDY 283

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNK 773
           CGKDGH ++GCFK IGYP+W   K K
Sbjct: 284 CGKDGHVQEGCFKKIGYPKWCPGKAK 309


>gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
           Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 938

 Score =  363 bits (933), Expect = e-116
 Identities = 171/264 (64%), Positives = 206/264 (78%), Gaps = 7/264 (2%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QEM NFLFAKNK+GF+DGT+K+PE    + MAWMRCDAM+KGWLTTAMEK+IRG
Sbjct: 42  NYTDWSQEMLNFLFAKNKVGFVDGTLKKPEKTATDYMAWMRCDAMVKGWLTTAMEKDIRG 101

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +ASEIW DL+ERFGK SAPRAYELKQ +SNT + G +VSAYYTKLR LWDE++S
Sbjct: 102 SVKYANTASEIWSDLRERFGKASAPRAYELKQTLSNTHQSGSSVSAYYTKLRVLWDEIES 161

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C+ C+C +GK + EL+EKE+LYEFLMGLD DF++I+TQILA NPIP+LGNAY
Sbjct: 162 VLPAPRCTCDKCSCGVGKKMNELREKERLYEFLMGLDADFAVIKTQILAMNPIPTLGNAY 221

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSF----MKRGGSSNRVNQKEEK---SVAHCGECG 701
           HLVAEDE+QR I G +K  +E  AF++F     +   S N+   K++K    V  C  CG
Sbjct: 222 HLVAEDERQRMISGEKKTPTENAAFKAFKPVRRENSTSQNKAAPKDQKHGDMVEQCTHCG 281

Query: 702 KDGHTRDGCFKIIGYPEWWNVKNK 773
           + GH RDGCFKIIGYP+WW  K K
Sbjct: 282 RSGHKRDGCFKIIGYPDWWPGKMK 305


>ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helianthus annuus]
          Length = 363

 Score =  344 bits (882), Expect = e-115
 Identities = 167/265 (63%), Positives = 197/265 (74%), Gaps = 7/265 (2%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           N+ DWVQEM NFLFAKNK+GF+DG+I++PE      M WMRCDAM+KGWLTT+M+KEIR 
Sbjct: 40  NFNDWVQEMSNFLFAKNKIGFVDGSIRKPEHTSKEYMPWMRCDAMVKGWLTTSMDKEIRA 99

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +ASEIW DLKERFGKESAPRAYELK+++  TR++G TVSAYYT+LR +WDE+  
Sbjct: 100 SVKYANTASEIWDDLKERFGKESAPRAYELKRSLHITRQDGGTVSAYYTRLRKIWDEINV 159

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C+GC CD+GK   + KEKE+LYEFLMGLD DF +IRTQILA  P PSL NAY
Sbjct: 160 VLPTPYCTCDGCKCDLGKKQVQNKEKERLYEFLMGLDDDFGVIRTQILAMKPTPSLNNAY 219

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQ--SFMKRGGSSNRVNQKEEKS-----VAHCGECG 701
           H+VAEDE+QR +  G+K   E  AFQ     K    S R  +K EKS       HC  CG
Sbjct: 220 HMVAEDEQQRNMT-GKKATFEAAAFQVSQNKKEQQPSKRPTEKAEKSSSMSRTEHCTFCG 278

Query: 702 KDGHTRDGCFKIIGYPEWWNVKNKR 776
           +DGH +DGCFK IGYPEWW  K KR
Sbjct: 279 EDGHNKDGCFKRIGYPEWWPGKTKR 303


>ref|XP_023734935.1| uncharacterized protein LOC111882795 [Lactuca sativa]
          Length = 278

 Score =  340 bits (872), Expect = e-115
 Identities = 164/260 (63%), Positives = 201/260 (77%), Gaps = 10/260 (3%)
 Frame = +3

Query: 27  MENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRGSVKYATSA 206
           M NFLFAKNK+GF+DGT+ +P+  +   MAWMRCDAMIKGWLTT M+KEIR SVKYA +A
Sbjct: 1   MTNFLFAKNKIGFVDGTLHKPDKSNEKYMAWMRCDAMIKGWLTTTMDKEIRSSVKYANTA 60

Query: 207 SEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKSFLPIPICK 386
            EIW DL ERFGKESAPR YELKQ+I+ TR+EG +VSAYYTKLRG+WDE+ S LP P C+
Sbjct: 61  LEIWNDLHERFGKESAPRGYELKQSITQTRQEGTSVSAYYTKLRGIWDEIDSILPTPRCE 120

Query: 387 CEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAYHLVAEDEK 566
           C+G  C+IGK + ELKEKE+LYEFLMGLD +FS++RTQILAT P PSLG AYHLVAEDE+
Sbjct: 121 CDGYECNIGKKITELKEKERLYEFLMGLDAEFSVMRTQILATKPTPSLGIAYHLVAEDEQ 180

Query: 567 QRAIVGGRK--PMSETMAFQSFMKRG--GSSNRVN-QKEEKSVA-----HCGECGKDGHT 716
           QR+IV GRK     +  AFQ+  +      +++ N QK +K+ +     HC  CGKDGH 
Sbjct: 181 QRSIVAGRKITTTPDAAAFQASQQVSCENQNHKKNWQKTDKTSSNNKSGHCTFCGKDGHV 240

Query: 717 RDGCFKIIGYPEWWNVKNKR 776
           RDGCFK++GYP+WW  K K+
Sbjct: 241 RDGCFKLVGYPDWWQGKGKK 260


>ref|XP_022004670.1| uncharacterized protein LOC110902278 [Helianthus annuus]
          Length = 543

 Score =  348 bits (893), Expect = e-114
 Identities = 166/267 (62%), Positives = 203/267 (76%), Gaps = 9/267 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           N+ DWVQEM NFLFAKNK+GF+DG+I +PE    + M WMRCDAM+KGWLTTAM+KEIRG
Sbjct: 41  NFNDWVQEMTNFLFAKNKIGFVDGSIAKPEKTSKDYMPWMRCDAMVKGWLTTAMDKEIRG 100

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +ASEIW DLKERFGKESAPRAYELKQ++  TR+EG TVSAY+T+LR +WDE+  
Sbjct: 101 SVKYANTASEIWADLKERFGKESAPRAYELKQSLHVTRQEGTTVSAYFTRLRKIWDEINM 160

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C GC C++GK + ELKEKE+LYEFLMGLD +FS+IRTQILA  P PSL NAY
Sbjct: 161 VLPAPRCTCSGCKCEVGKKIIELKEKERLYEFLMGLDDEFSVIRTQILAIKPTPSLSNAY 220

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKRGGSSNRVN----QKEEKSVA-----HCGE 695
           H+VAEDE QR++  G+K  ++ +AFQ+       + + +    +K EK+       HC  
Sbjct: 221 HMVAEDEHQRSVT-GKKQTNDAVAFQAVQANQKDAQQRSKKGWEKNEKAAPITKTDHCTF 279

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNKR 776
           CGKDGH ++GCFK IGYPEWW  K KR
Sbjct: 280 CGKDGHNKEGCFKRIGYPEWWPGKAKR 306


>ref|XP_021975260.1| uncharacterized protein LOC110870384 [Helianthus annuus]
          Length = 657

 Score =  352 bits (902), Expect = e-114
 Identities = 170/263 (64%), Positives = 197/263 (74%), Gaps = 11/263 (4%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY+DW QEM NFLFAKNK GFIDGT+K+PE      M WMRCDAMIKGWL T+MEKEIR 
Sbjct: 43  NYIDWAQEMMNFLFAKNKTGFIDGTMKKPEPTSTEYMPWMRCDAMIKGWLNTSMEKEIRN 102

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA++A EIW DLKERFGKESAPRAYELKQ+++N R++G ++SAYYTKLR LWDE++S
Sbjct: 103 SVKYASTAEEIWSDLKERFGKESAPRAYELKQSLTNIRQDGASISAYYTKLRVLWDEIQS 162

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LPIP C C GCT ++G  L ELKEKE+LYEFL+GLD DF+ IRTQILA  P PSL  AY
Sbjct: 163 VLPIPKCSCNGCTYNVGNQLIELKEKEKLYEFLLGLDSDFTTIRTQILAMKPTPSLRTAY 222

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQ-SF--MKRGGSSNRVNQKEEKS--------VAHC 689
           HL AEDEKQ+ I    +P   T AFQ SF   + GGS  +VN+   K         V HC
Sbjct: 223 HLAAEDEKQQMIAATNRPAINTTAFQVSFPSKREGGSGQQVNRTGPKEGRRNVSDRVEHC 282

Query: 690 GECGKDGHTRDGCFKIIGYPEWW 758
             CGKDGH +DGCFK IGYP+WW
Sbjct: 283 DFCGKDGHNKDGCFKRIGYPDWW 305


>ref|XP_021975442.1| uncharacterized protein LOC110870567 [Helianthus annuus]
          Length = 482

 Score =  343 bits (879), Expect = e-113
 Identities = 167/267 (62%), Positives = 201/267 (75%), Gaps = 9/267 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           N+ DW QEM NFLFAKNK+GF+DGT+ +PE  D N M WMRCDAMIKGWLTTAMEK+IRG
Sbjct: 43  NFNDWNQEMTNFLFAKNKIGFVDGTVVKPEKTDKNYMPWMRCDAMIKGWLTTAMEKDIRG 102

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKY  +A EIW DL ERFGKESAPR YELKQAI  TR++G +VSAYYT+LR LWDE+ S
Sbjct: 103 SVKYVNTAKEIWDDLNERFGKESAPRTYELKQAIVTTRQDGSSVSAYYTRLRALWDEIDS 162

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C+C GC CD+GK L E K+KE+LY+FLMGLD  FS+I+TQILA  P PS+G AY
Sbjct: 163 VLPYPRCECSGCECDLGKKLVEAKDKERLYQFLMGLDDSFSVIKTQILAMKPTPSMGAAY 222

Query: 543 HLVAEDEKQRAIVGGRKPM-SETMAFQSF--------MKRGGSSNRVNQKEEKSVAHCGE 695
           HL+AEDE+Q  IV GRK   ++T AFQ+         ++R  S+ +  +  EK+ AH   
Sbjct: 223 HLIAEDEQQWNIVVGRKGNGADTTAFQASHMTTRTTQIQRKSSAQKTERPGEKT-AHYTF 281

Query: 696 CGKDGHTRDGCFKIIGYPEWWNVKNKR 776
           CGKDGH  +GCFK IGYP+WW+ K KR
Sbjct: 282 CGKDGHIAEGCFKKIGYPDWWSGKGKR 308


>ref|XP_021979891.1| uncharacterized protein LOC110876015 [Helianthus annuus]
          Length = 561

 Score =  345 bits (884), Expect = e-113
 Identities = 165/268 (61%), Positives = 200/268 (74%), Gaps = 10/268 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           N+ DWVQEM NFLFAKNK+GF+DG+I +PE    + M WMRCDAM+KGWLT AM+KEIRG
Sbjct: 41  NFNDWVQEMTNFLFAKNKIGFVDGSIVKPETTSKDYMPWMRCDAMVKGWLTMAMDKEIRG 100

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA +ASEIW DLKERFGKESAPRAYELKQ++  TR+EG TVSAY+T+LR +WDE+  
Sbjct: 101 SVKYANTASEIWADLKERFGKESAPRAYELKQSLHVTRQEGTTVSAYFTRLRKIWDEINM 160

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C GC C++GK + ELKEKE+LYEFLMGLD +FS+IRTQILA  P PSL NAY
Sbjct: 161 VLPAPRCTCSGCKCEVGKKITELKEKERLYEFLMGLDDEFSVIRTQILAIKPTPSLSNAY 220

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFM----------KRGGSSNRVNQKEEKSVAHCG 692
           H+VAEDE QR++  G+K  ++ +AFQ+            K+G   N+      K+  HC 
Sbjct: 221 HMVAEDEHQRSVT-GKKQTNDAVAFQAVQANQKDAQQRSKKGWEKNKKAAPITKT-DHCT 278

Query: 693 ECGKDGHTRDGCFKIIGYPEWWNVKNKR 776
            CGKDGH ++GCFK  GYPEWW  K KR
Sbjct: 279 FCGKDGHNKEGCFKRFGYPEWWPGKAKR 306


>ref|XP_023750032.1| uncharacterized protein LOC111898332 [Lactuca sativa]
          Length = 451

 Score =  340 bits (873), Expect = e-112
 Identities = 165/265 (62%), Positives = 189/265 (71%), Gaps = 8/265 (3%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW QEM NFLFAKNK GF+DG+IKRPE      + WMRCDAMIKGWLTTAMEKEIR 
Sbjct: 45  NYNDWEQEMMNFLFAKNKTGFVDGSIKRPETESKKYLPWMRCDAMIKGWLTTAMEKEIRN 104

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYA + +EIW+DLKERFGKESAP+AYELKQA++NTR++  TVSAYYT+L  LWDEM++
Sbjct: 105 SVKYAKTTTEIWQDLKERFGKESAPKAYELKQAMNNTRQDDTTVSAYYTRLHVLWDEMET 164

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
            LP P C C GC C + K L ELKEKE+ YEFLMGLD  FS+I+TQILA  P P L   Y
Sbjct: 165 ILPTPRCSCNGCLCGLAKKLTELKEKERTYEFLMGLDDQFSVIKTQILAMKPTPKLSTVY 224

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQ-SFMKRGGSSNRVNQKEEK-------SVAHCGEC 698
           HLVAEDE+Q+ I   +KP  E  AFQ SF  R   +    +K  K       S   C  C
Sbjct: 225 HLVAEDEQQQMITASKKPAREMAAFQASFQGRREPARNSQEKGWKKTEKGTVSTESCSHC 284

Query: 699 GKDGHTRDGCFKIIGYPEWWNVKNK 773
           GK GH R+GCFK IGYPEWW  K K
Sbjct: 285 GKKGHDREGCFKRIGYPEWWPAKGK 309


>ref|XP_022004456.1| uncharacterized protein LOC110902021 [Helianthus annuus]
          Length = 458

 Score =  340 bits (871), Expect = e-112
 Identities = 167/261 (63%), Positives = 202/261 (77%), Gaps = 8/261 (3%)
 Frame = +3

Query: 18  VQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRGSVKYA 197
           + E+ NFLFAKNK+GF+DGTI +PE  D   M WMRCDAMIKGWLTTAMEKEIR SVKYA
Sbjct: 22  IDEITNFLFAKNKIGFVDGTINKPEKTDKKYMNWMRCDAMIKGWLTTAMEKEIRASVKYA 81

Query: 198 TSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKSFLPIP 377
            +A+EIWKDL E FGKESAPR YE+KQ+I+ TR+E  TVSAY+TKLRGLWDE+ + L +P
Sbjct: 82  NTAAEIWKDLHEIFGKESAPRTYEIKQSITMTRQEEATVSAYFTKLRGLWDEIDTMLQVP 141

Query: 378 ICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAYHLVAE 557
            C+C+GCTCDIGK L ELKEKE+LYEFLMGL+ DF++IRTQILA  P P+LG AYHLVAE
Sbjct: 142 RCECKGCTCDIGKKLVELKEKERLYEFLMGLNSDFAVIRTQILAMKPTPTLGVAYHLVAE 201

Query: 558 DEKQRAIVGGRKPMSETMAFQSFM--KRGGSSNRVN-QKEEKS-----VAHCGECGKDGH 713
           DE+QR I  G+K  +ETMAFQ     KR     + + QK EK+       HC  CG++GH
Sbjct: 202 DEQQRNITVGKKVATETMAFQVAQQGKRDVQPQKKSWQKTEKTQQNTKTGHCTFCGRNGH 261

Query: 714 TRDGCFKIIGYPEWWNVKNKR 776
            R+GCFK+IGYP+W+  K K+
Sbjct: 262 IREGCFKLIGYPDWFFGKEKK 282


>ref|XP_021999527.1| uncharacterized protein LOC110896564 [Helianthus annuus]
 gb|OTG04725.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 361

 Score =  335 bits (858), Expect = e-111
 Identities = 153/256 (59%), Positives = 191/256 (74%), Gaps = 4/256 (1%)
 Frame = +3

Query: 3   NYLDWVQEMENFLFAKNKMGFIDGTIKRPEAGDANRMAWMRCDAMIKGWLTTAMEKEIRG 182
           NY DW +EM NFLFAKNK+ F+DGT+K+PE    +   WMRCDAM+KGWLT AMEK IR 
Sbjct: 42  NYTDWAREMSNFLFAKNKIDFVDGTLKKPETTSLDYKPWMRCDAMVKGWLTAAMEKGIRD 101

Query: 183 SVKYATSASEIWKDLKERFGKESAPRAYELKQAISNTRKEGMTVSAYYTKLRGLWDEMKS 362
           SVKYAT+ASEIW DL+ERFGKESAPRAYELKQ I+ TR++G +VS YYT+LR LWDE +S
Sbjct: 102 SVKYATTASEIWTDLRERFGKESAPRAYELKQKIAGTRQDGSSVSIYYTRLRALWDESQS 161

Query: 363 FLPIPICKCEGCTCDIGKSLRELKEKEQLYEFLMGLDGDFSIIRTQILATNPIPSLGNAY 542
               P C C  CTC++GK + E  EKE+LYEFLMGLD DF++I+TQILAT P+P+LG AY
Sbjct: 162 IFSFPCCSCNKCTCELGKKITEHIEKERLYEFLMGLDTDFNVIKTQILATTPLPTLGIAY 221

Query: 543 HLVAEDEKQRAIVGGRKPMSETMAFQSFMKR----GGSSNRVNQKEEKSVAHCGECGKDG 710
           H+VAEDE+ R I    +  +E  AF++F KR    G S  +   KE K    C  CG++G
Sbjct: 222 HMVAEDERHRMISNVNQVTTEPAAFKAFQKRENGSGDSKEKTAGKESKQSDQCTFCGRNG 281

Query: 711 HTRDGCFKIIGYPEWW 758
           H ++GCFK++GYP+WW
Sbjct: 282 HKKEGCFKLVGYPDWW 297


Top