BLASTX nr result

ID: Chrysanthemum21_contig00031613 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00031613
         (911 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022022551.1| uncharacterized protein LOC110922584 [Helian...   307   e-100
ref|XP_022022421.1| uncharacterized protein LOC110922420 isoform...   291   1e-93
ref|XP_022022422.1| uncharacterized protein LOC110922420 isoform...   264   2e-83
ref|XP_021999527.1| uncharacterized protein LOC110896564 [Helian...   263   7e-83
ref|XP_022014825.1| uncharacterized protein LOC110914334 [Helian...   259   4e-80
ref|XP_021991821.1| uncharacterized protein LOC110888610 [Helian...   249   8e-78
gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA...   261   1e-76
gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA...   237   8e-71
ref|XP_022040234.1| uncharacterized protein LOC110942777 [Helian...   229   1e-68
ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform...   224   2e-67
ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform...   220   4e-66
ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helian...   218   2e-65
ref|XP_021990916.1| uncharacterized protein LOC110887643 [Helian...   210   6e-63
ref|XP_022032845.1| uncharacterized protein LOC110933954 [Helian...   211   5e-62
ref|XP_023757537.1| uncharacterized protein LOC111906028 [Lactuc...   204   5e-61
ref|XP_022004670.1| uncharacterized protein LOC110902278 [Helian...   211   8e-61
ref|XP_021979891.1| uncharacterized protein LOC110876015 [Helian...   211   1e-60
ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helian...   206   1e-60
gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptid...   208   1e-59
ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helian...   201   9e-59

>ref|XP_022022551.1| uncharacterized protein LOC110922584 [Helianthus annuus]
          Length = 370

 Score =  307 bits (786), Expect = e-100
 Identities = 146/212 (68%), Positives = 169/212 (79%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC+KCEC+I KKI EHQE+E LYEFLMGLDAEFTVIRTQILATKPIPS
Sbjct: 161 DEAQSVQPFPQCSCDKCECEIRKKIFEHQEREHLYEFLMGLDAEFTVIRTQILATKPIPS 220

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           L TAYHMV EDE+QRA+SNEN   +ESAAFKAFQKR       KE+  TK  ++  E+CT
Sbjct: 221 LTTAYHMVHEDEKQRAVSNENKSNVESAAFKAFQKRESG--HGKEKGGTKGSRDGTEHCT 278

Query: 551 ECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFS 372
            C RD HK+EGCFKL+GYP+WWPGKKGE+++GK ACVETE SPIPGL  E YQ  +K  S
Sbjct: 279 FCNRDEHKKEGCFKLVGYPDWWPGKKGEKARGKAACVETESSPIPGLTQEHYQTLVKHLS 338

Query: 371 GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           G+GN E +KP ANMA K ++EG  EEELDW G
Sbjct: 339 GSGNIETIKPLANMACKTNDEGFAEEELDWCG 370


>ref|XP_022022421.1| uncharacterized protein LOC110922420 isoform X1 [Helianthus annuus]
          Length = 371

 Score =  291 bits (745), Expect = 1e-93
 Identities = 136/212 (64%), Positives = 169/212 (79%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC KCEC++ KKI EHQ+KE LYEFLMGLD EF+VIRTQILATKPIPS
Sbjct: 162 DEAQSVQPFPQCSCGKCECEVRKKIFEHQDKEHLYEFLMGLDNEFSVIRTQILATKPIPS 221

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           L  AYHMV +DE+QRA+S+E+   +E+AAFKAFQ+++     NKE++  K +++  E+CT
Sbjct: 222 LTAAYHMVHDDEKQRAVSSESKGNVEAAAFKAFQRKDRD--HNKEKSGAKGVRDGTEHCT 279

Query: 551 ECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFS 372
            C +DGHKREGCF+L+GYP+WWPGKKGE++KGK ACVET+ S IPGL  E+YQ  LK FS
Sbjct: 280 FCNKDGHKREGCFRLVGYPDWWPGKKGEKTKGKAACVETDASSIPGLTQENYQTLLKHFS 339

Query: 371 GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           G+GN E  K  ANMA K ++EG  EEELDW G
Sbjct: 340 GSGNDEITKSLANMACKTNDEGIAEEELDWSG 371


>ref|XP_022022422.1| uncharacterized protein LOC110922420 isoform X2 [Helianthus annuus]
          Length = 352

 Score =  264 bits (675), Expect = 2e-83
 Identities = 127/212 (59%), Positives = 158/212 (74%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC KCEC++ KKI EHQ+KE LYEFLMGLD EF+VIRTQILATKPIPS
Sbjct: 162 DEAQSVQPFPQCSCGKCECEVRKKIFEHQDKEHLYEFLMGLDNEFSVIRTQILATKPIPS 221

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           L  AYHMV +DE+QRA+S+E+   +E+AAFKAFQ+++     NKE++  K +++  E+CT
Sbjct: 222 LTAAYHMVHDDEKQRAVSSESKGNVEAAAFKAFQRKDRD--HNKEKSGAKGVRDGTEHCT 279

Query: 551 ECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFS 372
            C +DGHKREGCF+L+GYP+WWPGKKGE++KGK ACVET+ S IPGL  E+YQ  LK FS
Sbjct: 280 FCNKDGHKREGCFRLVGYPDWWPGKKGEKTKGKAACVETDASSIPGLTQENYQTLLKHFS 339

Query: 371 GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           G+G A                   EEELDW G
Sbjct: 340 GSGIA-------------------EEELDWSG 352


>ref|XP_021999527.1| uncharacterized protein LOC110896564 [Helianthus annuus]
 gb|OTG04725.1| putative gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 361

 Score =  263 bits (672), Expect = 7e-83
 Identities = 133/212 (62%), Positives = 151/212 (71%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QSI  FP CSC KC C++GKKI EH EKERLYEFLMGLD +F VI+TQILAT P+P+
Sbjct: 157 DESQSIFSFPCCSCNKCTCELGKKITEHIEKERLYEFLMGLDTDFNVIKTQILATTPLPT 216

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           LG AYHMVAEDER R ISN N    E AAFKAFQKR      +KE+T  K  K+  + CT
Sbjct: 217 LGIAYHMVAEDERHRMISNVNQVTTEPAAFKAFQKRENGSGDSKEKTAGKESKQ-SDQCT 275

Query: 551 ECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFS 372
            CGR+GHK+EGCFKL+GYP+WWPGKK  + K K ACVE   SPIPGL  E YQ F+K FS
Sbjct: 276 FCGRNGHKKEGCFKLVGYPDWWPGKKDNKVKPKAACVEMGTSPIPGLSEEQYQEFVKFFS 335

Query: 371 GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           G+G    +KP ANMA      GAT E LDW G
Sbjct: 336 GSGKNAEIKPEANMA------GATFEGLDWIG 361


>ref|XP_022014825.1| uncharacterized protein LOC110914334 [Helianthus annuus]
          Length = 470

 Score =  259 bits (663), Expect = 4e-80
 Identities = 126/196 (64%), Positives = 150/196 (76%), Gaps = 2/196 (1%)
 Frame = -1

Query: 902 QSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPSLGT 723
           ++   FPRCSC KC C++ KK+ +H EKERLYEFLM LD++FTVI+TQILATKP  +LG 
Sbjct: 161 EATQSFPRCSCNKCTCELSKKVIQHLEKERLYEFLMVLDSDFTVIKTQILATKPTLTLGV 220

Query: 722 AYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVE--YCTE 549
           AYHMVAEDERQRAISNEN    ESAAFKAFQKR   +   KE+  TK  +E  E  +CT 
Sbjct: 221 AYHMVAEDERQRAISNENRVAPESAAFKAFQKREGNISQTKEKYTTKQGREGKENDHCTF 280

Query: 548 CGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFSG 369
           CG+DGHKREGCFKL+GYP+WWPGKKG+++K K ACVET  SPIPGL +E YQ F+K FSG
Sbjct: 281 CGKDGHKREGCFKLVGYPDWWPGKKGDKAKPKAACVETGNSPIPGLSDEQYQSFVKFFSG 340

Query: 368 TGNAEGVKPTANMAHK 321
           + +    KP ANMA K
Sbjct: 341 SNSGAETKPEANMAGK 356


>ref|XP_021991821.1| uncharacterized protein LOC110888610 [Helianthus annuus]
          Length = 332

 Score =  249 bits (636), Expect = 8e-78
 Identities = 116/170 (68%), Positives = 137/170 (80%), Gaps = 3/170 (1%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC+KCECD+GK+I E+QEKE LYEFLMGLD EF VI+TQILATKP+PS
Sbjct: 162 DEAQSVQPFPQCSCDKCECDVGKRIFEYQEKEHLYEFLMGLDTEFAVIKTQILATKPVPS 221

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLI---PNKERTRTKTLKEVVE 561
           L  AYHMV +DE+QRA+S+EN    ESAAFKAFQKR         N+ER   K +KE ++
Sbjct: 222 LTVAYHMVHDDEKQRAVSSENKTHTESAAFKAFQKRESNGTNGNHNRERGGIKGVKEGID 281

Query: 560 YCTECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGL 411
           +CT C +DGHKREGCFKL+GYP+WWPGKKGE+ KGK ACVE E SPIPGL
Sbjct: 282 HCTFCNKDGHKREGCFKLVGYPDWWPGKKGEKVKGKAACVEAEPSPIPGL 331


>gb|OTG24510.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
           Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 934

 Score =  261 bits (666), Expect = 1e-76
 Identities = 128/197 (64%), Positives = 148/197 (75%), Gaps = 2/197 (1%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE  SI PFP CSC KC C++GKKI EH EK++LYEFLMGLD +F VIRTQILATKP+P+
Sbjct: 156 DESHSIFPFPCCSCNKCTCELGKKIAEHLEKQQLYEFLMGLDNDFNVIRTQILATKPVPT 215

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVV--EY 558
           LGTAYHMVAEDERQRAISNEN    ESAAFK FQKR+      KE+  T   KE    + 
Sbjct: 216 LGTAYHMVAEDERQRAISNENRVAPESAAFKTFQKRHNNFKSPKEKYTTTQEKESKQNDQ 275

Query: 557 CTECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQ 378
           CT CGR+ HKREGCFKL+GYP+WWPGKK +++K K ACV+T  SPIPG+  E YQ F+K 
Sbjct: 276 CTFCGRNSHKREGCFKLVGYPDWWPGKKDDKAKPKAACVDTGTSPIPGISEEQYQAFVKF 335

Query: 377 FSGTGNAEGVKPTANMA 327
           FSG+GN    K  ANMA
Sbjct: 336 FSGSGNNVETKSEANMA 352


>gb|OTG09093.1| putative reverse transcriptase, RNA-dependent DNA polymerase,
           Gag-polypeptide of LTR copia-type [Helianthus annuus]
          Length = 938

 Score =  237 bits (604), Expect(2) = 8e-71
 Identities = 116/200 (58%), Positives = 142/200 (71%), Gaps = 2/200 (1%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE +S+ P PRC+C+KC C +GKK+NE +EKERLYEFLMGLDA+F VI+TQILA  PIP+
Sbjct: 157 DEIESVLPAPRCTCDKCSCGVGKKMNELREKERLYEFLMGLDADFAVIKTQILAMNPIPT 216

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQ--KRNIPLIPNKERTRTKTLKEVVEY 558
           LG AYH+VAEDERQR IS E   P E+AAFKAF+  +R      NK   + +   ++VE 
Sbjct: 217 LGNAYHLVAEDERQRMISGEKKTPTENAAFKAFKPVRRENSTSQNKAAPKDQKHGDMVEQ 276

Query: 557 CTECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQ 378
           CT CGR GHKR+GCFK+IGYP+WWPGK     K K A VET+ SP+PGL  E YQ FLK 
Sbjct: 277 CTHCGRSGHKRDGCFKIIGYPDWWPGK----MKPKAAHVETDASPVPGLTKEQYQSFLKH 332

Query: 377 FSGTGNAEGVKPTANMAHKE 318
           F+     +G    ANMA K+
Sbjct: 333 FAENDVKDGSVRMANMAGKK 352



 Score = 60.1 bits (144), Expect(2) = 8e-71
 Identities = 28/48 (58%), Positives = 36/48 (75%), Gaps = 1/48 (2%)
 Frame = -2

Query: 304 LQKRNLIGTGRCQGGLYRMKMIQG-RKAMATTIETWHRRLGHASKGKL 164
           L  R+LIG G+C+ GLYRM +  G R++M TT  TWH+RLGHAS+ KL
Sbjct: 360 LHTRSLIGAGKCRKGLYRMGLFSGERRSMMTTGNTWHKRLGHASEDKL 407


>ref|XP_022040234.1| uncharacterized protein LOC110942777 [Helianthus annuus]
          Length = 436

 Score =  229 bits (583), Expect = 1e-68
 Identities = 117/202 (57%), Positives = 142/202 (70%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC KCEC++GKKI EHQEKE LYEFLMGLD EF+VIRTQILATKPIPS
Sbjct: 162 DEAQSVQPFPQCSCGKCECEVGKKIFEHQEKEHLYEFLMGLDNEFSVIRTQILATKPIPS 221

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           L  AYHMV +DE+QRA+S+E+   +E+AAFKAFQK        K+R   K          
Sbjct: 222 LTAAYHMVHDDEKQRAVSSESKGNVEAAAFKAFQK--------KDRDHNK---------- 263

Query: 551 ECGRDGHKREGCFKLIGYPEWWPGKKGERSKGKVACVETEMSPIPGLRNEDYQLFLKQFS 372
                  ++ G   L+GYP+WWPGKKGE++KGK A VET+ SPIPGL  E+YQ  LK FS
Sbjct: 264 -------EKSGAKGLVGYPDWWPGKKGEKTKGKAAYVETDASPIPGLTQENYQTLLKHFS 316

Query: 371 GTGNAEGVKPTANMAHKESEEG 306
           G+ + E  K  ANMA K ++EG
Sbjct: 317 GSVSDEITKSLANMACKTNDEG 338


>ref|XP_021998956.1| uncharacterized protein LOC110895884 isoform X1 [Helianthus annuus]
          Length = 374

 Score =  224 bits (570), Expect = 2e-67
 Identities = 115/224 (51%), Positives = 152/224 (67%), Gaps = 12/224 (5%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ P PRC+C+ C C IGKK+ E ++KERLYEFL+GLD EF  IRTQILA +PIPS
Sbjct: 158 DEIQSVLPVPRCNCDGCTCGIGKKLTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPS 217

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLK------- 573
           LG AYH+VA+DE+QRA+S       ++AAF+A    ++P+  +K +++ +  +       
Sbjct: 218 LGAAYHLVADDEQQRAVSGTKRPTSDAAAFQA----HVPIRRDKNQSQNRVKQKDAKRSG 273

Query: 572 -EVVEYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNED 399
            + +E+CT CG+DGH ++GCFK IGYPEWWPGK K +  K K ACVE E SPIPGL N+ 
Sbjct: 274 TDEIEHCTFCGKDGHNKDGCFKRIGYPEWWPGKGKQDSVKPKAACVEGEESPIPGLTNKQ 333

Query: 398 YQLFLKQFS---GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           YQ F+  FS   G    EG  P ANMA   ++EG   E++DW G
Sbjct: 334 YQKFVNFFSKKDGVTEEEGA-PVANMA--GNKEGLMFEDIDWSG 374


>ref|XP_021998957.1| uncharacterized protein LOC110895884 isoform X2 [Helianthus annuus]
          Length = 370

 Score =  220 bits (561), Expect = 4e-66
 Identities = 114/224 (50%), Positives = 149/224 (66%), Gaps = 12/224 (5%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ P PRC+C+ C C IGKK+ E ++KERLYEFL+GLD EF  IRTQILA +PIPS
Sbjct: 158 DEIQSVLPVPRCNCDGCTCGIGKKLTELRDKERLYEFLLGLDPEFRTIRTQILAMQPIPS 217

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLK------- 573
           LG AYH+VA+DE+QRA+S       ++AAF+A    ++P+  +K +++ +  +       
Sbjct: 218 LGAAYHLVADDEQQRAVSGTKRPTSDAAAFQA----HVPIRRDKNQSQNRVKQKDAKRSG 273

Query: 572 -EVVEYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNED 399
            + +E+CT CG+DGH ++GCFK IGYPEWWPGK K +  K K ACVE E SPIPGL N+ 
Sbjct: 274 TDEIEHCTFCGKDGHNKDGCFKRIGYPEWWPGKGKQDSVKPKAACVEGEESPIPGLTNKQ 333

Query: 398 YQLFLKQFS---GTGNAEGVKPTANMAHKESEEGATEEELDWYG 276
           YQ F+  FS   G    EG  P ANMA      G   E++DW G
Sbjct: 334 YQKFVNFFSKKDGVTEEEGA-PVANMA------GLMFEDIDWSG 370


>ref|XP_022022818.1| uncharacterized protein LOC110922934 [Helianthus annuus]
          Length = 369

 Score =  218 bits (556), Expect = 2e-65
 Identities = 114/223 (51%), Positives = 148/223 (66%), Gaps = 11/223 (4%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE  ++ P PRC+C+ C C++GKK+ E +EKERLYEFL+GLDA+F VIRTQILA KP P+
Sbjct: 157 DEINTVLPTPRCTCDGCSCEVGKKLVELKEKERLYEFLLGLDADFAVIRTQILAMKPTPT 216

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQ--------KRNIPLIPNKERTRTKTL 576
           LG AYHMV+EDE+QR +S      +E+AAF+A Q        +R I      ++ +   L
Sbjct: 217 LGAAYHMVSEDEQQRNLSTNKKGTVENAAFQASQFARKEGQTQRRI----WSKQEKGSGL 272

Query: 575 KEVVEYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNED 399
               E+CT CG+DGH R+GCFK IGYPEWWPGK K + +K K A VE+  SP+PG+ +E 
Sbjct: 273 INKNEHCTFCGKDGHNRDGCFKRIGYPEWWPGKGKKDGAKPKAAMVESTSSPVPGMTDEQ 332

Query: 398 YQLFLKQFSGTGNAEGVK--PTANMAHKESEEGATEEELDWYG 276
           Y +FLK F G  N E  +  P+ANMA      G  +EELDW G
Sbjct: 333 YAMFLKLFGGNKNQEKEEPSPSANMA------GLEQEELDWSG 369


>ref|XP_021990916.1| uncharacterized protein LOC110887643 [Helianthus annuus]
          Length = 305

 Score =  210 bits (535), Expect = 6e-63
 Identities = 102/144 (70%), Positives = 112/144 (77%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ PFP+CSC KCECD+GKKI EHQEKE LYEFLMGLD +FTVIRTQILATKP+PS
Sbjct: 162 DEAQSVQPFPQCSCGKCECDVGKKIIEHQEKEHLYEFLMGLDTDFTVIRTQILATKPVPS 221

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEVVEYCT 552
           L  AYHMVAEDERQRAIS+EN    E AAFKAFQKR+      KE+   K   E  E+CT
Sbjct: 222 LSGAYHMVAEDERQRAISSENRTHTEPAAFKAFQKRDGNTNHYKEKGGAKGANEGSEHCT 281

Query: 551 ECGRDGHKREGCFKLIGYPEWWPG 480
              RD HKREGCFKLI YP+WWPG
Sbjct: 282 FYDRDSHKREGCFKLIVYPDWWPG 305


>ref|XP_022032845.1| uncharacterized protein LOC110933954 [Helianthus annuus]
          Length = 423

 Score =  211 bits (538), Expect = 5e-62
 Identities = 115/218 (52%), Positives = 142/218 (65%), Gaps = 8/218 (3%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS+ P P+C+C  C CD+ K++ EHQEKERLYEFLMGLD +F VI+TQILA KP P 
Sbjct: 77  DEIQSVFPMPQCTCNGCTCDVVKRLVEHQEKERLYEFLMGLDNQFVVIKTQILANKPTPG 136

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAF--QKRN-IPLIPNKERTRTKT-LKEVV 564
           LGTAYH+VAEDERQ  IS E    +E+A FKAF  QKR+    I  K +  +K   +E V
Sbjct: 137 LGTAYHLVAEDERQELISEEKRPAIEAAVFKAFVPQKRDGQGSINQKGKGYSKQGNQEEV 196

Query: 563 EYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNEDYQLF 387
            +CT C RDGH REGCFK IGYP+WW GK +G++ K K +CVE    PIPG   E Y++ 
Sbjct: 197 PHCTHCDRDGHSREGCFKRIGYPDWWLGKVRGDKPKPKPSCVEGGSCPIPGFTVEQYEIL 256

Query: 386 LKQF---SGTGNAEGVKPTANMAHKESEEGATEEELDW 282
           +K F   + TG+ E  K  ANM       G   +E DW
Sbjct: 257 VKHFKDGAETGDEEN-KCVANMT------GRFNDEADW 287



 Score = 65.1 bits (157), Expect = 4e-08
 Identities = 29/47 (61%), Positives = 38/47 (80%), Gaps = 1/47 (2%)
 Frame = -2

Query: 310 KGLQKRNLIGTGRCQGGLYRMKMI-QGRKAMATTIETWHRRLGHASK 173
           +GL+ RNLIG G+C+GGLY+M M  + RKAMA T+E WH+RLGH S+
Sbjct: 377 QGLRLRNLIGVGKCKGGLYQMGMFGEERKAMAVTVERWHKRLGHTSQ 423


>ref|XP_023757537.1| uncharacterized protein LOC111906028 [Lactuca sativa]
          Length = 261

 Score =  204 bits (518), Expect = 5e-61
 Identities = 98/151 (64%), Positives = 114/151 (75%), Gaps = 5/151 (3%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE  SI PFP+CSC+ C C++GK+INE QEKERLY+FLMGLDAE+ VI+TQILATKP PS
Sbjct: 108 DEVSSILPFPKCSCDGCSCNVGKEINEFQEKERLYQFLMGLDAEYAVIKTQILATKPTPS 167

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKE-----V 567
           L T YH+VAEDERQR ISN+   P E AAFKAFQ ++    P KER   K+ KE      
Sbjct: 168 LTTVYHLVAEDERQRTISNDKRAPPEPAAFKAFQGKSTNFNPPKERNFQKSFKENKEKME 227

Query: 566 VEYCTECGRDGHKREGCFKLIGYPEWWPGKK 474
            E CT CGR+GHKR+GC KLIGYPEWWP K+
Sbjct: 228 DEQCTFCGRNGHKRDGCSKLIGYPEWWPRKQ 258


>ref|XP_022004670.1| uncharacterized protein LOC110902278 [Helianthus annuus]
          Length = 543

 Score =  211 bits (538), Expect = 8e-61
 Identities = 108/200 (54%), Positives = 131/200 (65%), Gaps = 5/200 (2%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE   + P PRC+C  C+C++GKKI E +EKERLYEFLMGLD EF+VIRTQILA KP PS
Sbjct: 156 DEINMVLPAPRCTCSGCKCEVGKKIIELKEKERLYEFLMGLDDEFSVIRTQILAIKPTPS 215

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEV----V 564
           L  AYHMVAEDE QR+++ +  +  ++ AF+A Q          ++   K  K       
Sbjct: 216 LSNAYHMVAEDEHQRSVTGKK-QTNDAVAFQAVQANQKDAQQRSKKGWEKNEKAAPITKT 274

Query: 563 EYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNEDYQLF 387
           ++CT CG+DGH +EGCFK IGYPEWWPGK K E SK K A  ET  SP+PG+ NE Y LF
Sbjct: 275 DHCTFCGKDGHNKEGCFKRIGYPEWWPGKAKREASKPKAAYAETATSPVPGMSNEQYSLF 334

Query: 386 LKQFSGTGNAEGVKPTANMA 327
           LK F G    E   P ANMA
Sbjct: 335 LKLFGGNKAQEESAPQANMA 354



 Score = 85.1 bits (209), Expect = 1e-14
 Identities = 44/85 (51%), Positives = 57/85 (67%), Gaps = 5/85 (5%)
 Frame = -2

Query: 310 KGLQKRNLIGTGRCQGGLYRMKMIQG-RKAMATTIETWHRRLGHASKGKLARVDFLK--- 143
           +GL+ R+LIG G C GGLYRM++++  R+A+A T  TWH RLGH    KL+ +D+ K   
Sbjct: 454 QGLKTRSLIGAGNCVGGLYRMEVMKTERQALAVTSSTWHNRLGHTPFDKLSHMDYFKDVR 513

Query: 142 -TSINNLDNFCDSCARAKHTRLPFP 71
               NN  N CDSC RAK T+LPFP
Sbjct: 514 FDYCNN--NVCDSCQRAKFTKLPFP 536


>ref|XP_021979891.1| uncharacterized protein LOC110876015 [Helianthus annuus]
          Length = 561

 Score =  211 bits (538), Expect = 1e-60
 Identities = 107/200 (53%), Positives = 130/200 (65%), Gaps = 5/200 (2%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE   + P PRC+C  C+C++GKKI E +EKERLYEFLMGLD EF+VIRTQILA KP PS
Sbjct: 156 DEINMVLPAPRCTCSGCKCEVGKKITELKEKERLYEFLMGLDDEFSVIRTQILAIKPTPS 215

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNKERTRTKTLKEV----V 564
           L  AYHMVAEDE QR+++ +  +  ++ AF+A Q          ++   K  K       
Sbjct: 216 LSNAYHMVAEDEHQRSVTGKK-QTNDAVAFQAVQANQKDAQQRSKKGWEKNKKAAPITKT 274

Query: 563 EYCTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNEDYQLF 387
           ++CT CG+DGH +EGCFK  GYPEWWPGK K E SK K A  ET  SP+PG+ NE Y LF
Sbjct: 275 DHCTFCGKDGHNKEGCFKRFGYPEWWPGKAKREASKPKAAYAETATSPVPGMSNEQYSLF 334

Query: 386 LKQFSGTGNAEGVKPTANMA 327
           LK F G    E   P ANMA
Sbjct: 335 LKLFGGNKPQEESAPQANMA 354



 Score =  112 bits (280), Expect = 4e-24
 Identities = 57/108 (52%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
 Frame = -2

Query: 310 KGLQKRNLIGTGRCQGGLYRMKMIQG-RKAMATTIETWHRRLGHASKGKLARVDFLKTS- 137
           +GL+ R+LIG G C GGLYRM++++  R+A+A    TWH+RLGH    KL+ +D+ K   
Sbjct: 454 QGLKTRSLIGAGNCVGGLYRMEVMKTERQALAVYSSTWHKRLGHTPFDKLSHMDYFKDVR 513

Query: 136 ---INNLDNFCDSCARAKHTRLPFPSSSIKTNAPFELIHCDIWGGKKK 2
               NN  N CDSC RAK T+LPFP SSIKTN  FEL+H DIWG  ++
Sbjct: 514 FDYCNN--NVCDSCQRAKFTKLPFPLSSIKTNDSFELVHSDIWGDTER 559


>ref|XP_022006954.1| uncharacterized protein LOC110905727 [Helianthus annuus]
          Length = 363

 Score =  206 bits (524), Expect = 1e-60
 Identities = 111/216 (51%), Positives = 137/216 (63%), Gaps = 4/216 (1%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE   + P P C+C+ C+CD+GKK  +++EKERLYEFLMGLD +F VIRTQILA KP PS
Sbjct: 155 DEINVVLPTPYCTCDGCKCDLGKKQVQNKEKERLYEFLMGLDDDFGVIRTQILAMKPTPS 214

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQ--KRNIPLIPNKERTRTKTLKEVVEY 558
           L  AYHMVAEDE+QR ++ + A   E+AAF+  Q  K   P     E+    +     E+
Sbjct: 215 LNNAYHMVAEDEQQRNMTGKKA-TFEAAAFQVSQNKKEQQPSKRPTEKAEKSSSMSRTEH 273

Query: 557 CTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNEDYQLFLK 381
           CT CG DGH ++GCFK IGYPEWWPGK K E +K K A  E+  SP+ GL +E Y +FLK
Sbjct: 274 CTFCGEDGHNKDGCFKRIGYPEWWPGKTKRETTKPKAAYAESTASPVSGLTDEQYGMFLK 333

Query: 380 QFSG-TGNAEGVKPTANMAHKESEEGATEEELDWYG 276
            F G T   +   P ANMA      G   EELDW G
Sbjct: 334 LFGGKTQPQDESAPQANMA------GIENEELDWSG 363


>gb|OTG28537.1| putative GAG-pre-integrase domain, Gag-polypeptide of LTR
           copia-type [Helianthus annuus]
          Length = 520

 Score =  208 bits (529), Expect = 1e-59
 Identities = 99/175 (56%), Positives = 126/175 (72%), Gaps = 4/175 (2%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE QS  P PRC C  C CD+G+K+ EH+E ERLYEFLMGL+++F+VIRTQIL   P P+
Sbjct: 159 DEMQSAFPIPRCKCSGCSCDVGRKLVEHKESERLYEFLMGLNSDFSVIRTQILTMNPTPT 218

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAF--QKRNIPLIPNKERTRTKTLKEVVEY 558
           L  AYH+VAEDERQRAI++E     ++ AFKAF   +R       +++  +K +K   ++
Sbjct: 219 LTNAYHLVAEDERQRAITSERRPSTDAVAFKAFVPGRRENNSSQRRDKPASKDVKHAADH 278

Query: 557 CTECGRDGHKREGCFKLIGYPEWWPG-KKGERSKGKVACVETEMSPIPGLR-NED 399
           CT CG+DGH R+GCFKLIG+PEWWPG +K E +K KVACVET  SPIPG   NED
Sbjct: 279 CTFCGKDGHTRDGCFKLIGFPEWWPGNRKREETKPKVACVETNSSPIPGKNDNED 333



 Score =  105 bits (261), Expect = 1e-21
 Identities = 52/94 (55%), Positives = 69/94 (73%), Gaps = 2/94 (2%)
 Frame = -2

Query: 304 LQKRNLIGTGRCQGGLYRMKMIQ-GRKAMATTIETWHRRLGHASKGKLARVDFL-KTSIN 131
           L  R+LIG G+C+ GLYRM + + GR+A+ T+ +TWH+RLGHAS  KL  +DFL K   N
Sbjct: 427 LHTRSLIGAGKCKRGLYRMGLFEDGRRALMTSGDTWHKRLGHASNEKLTHIDFLSKVPFN 486

Query: 130 NLDNFCDSCARAKHTRLPFPSSSIKTNAPFELIH 29
            L   CDSC++AKH RLPF +S+IKT+  FEL+H
Sbjct: 487 KL---CDSCSKAKHARLPFSNSNIKTSGCFELLH 517


>ref|XP_022015104.1| uncharacterized protein LOC110914625 [Helianthus annuus]
          Length = 356

 Score =  201 bits (511), Expect = 9e-59
 Identities = 100/187 (53%), Positives = 123/187 (65%), Gaps = 3/187 (1%)
 Frame = -1

Query: 911 DEFQSINPFPRCSCEKCECDIGKKINEHQEKERLYEFLMGLDAEFTVIRTQILATKPIPS 732
           DE  ++ P PRCSC +C C++GKKI+E +EKER+YEFLMGLD EF+V+RTQILA  P PS
Sbjct: 156 DEIHTVFPAPRCSCNRCSCEVGKKISEQKEKERVYEFLMGLDGEFSVMRTQILAMNPTPS 215

Query: 731 LGTAYHMVAEDERQRAISNENAKPLESAAFKAFQKRNIPLIPNK--ERTRTKTLKEVVEY 558
           LGT YH+VAEDE+QRAI        E A F+A+  RN      K  +R   +   +  E+
Sbjct: 216 LGTTYHLVAEDEQQRAIIGGKKTNPEVATFQAYAPRNSGTQGTKSTQRDSKRIQNDRSEH 275

Query: 557 CTECGRDGHKREGCFKLIGYPEWWPGK-KGERSKGKVACVETEMSPIPGLRNEDYQLFLK 381
           C  CGRDGH +EGCFK I YPEWWPGK K ++ K K A VE   SPIPGL  E YQ  L 
Sbjct: 276 CDFCGRDGHNKEGCFKRICYPEWWPGKGKRDKVKAKAAFVEIGSSPIPGLSGEQYQALLA 335

Query: 380 QFSGTGN 360
             +  G+
Sbjct: 336 HLAEKGS 342


Top