BLASTX nr result

ID: Chrysanthemum22_contig00039557 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00039557
         (552 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI04791.1| DNA topoisomerase I, bacterial-type [Cynara cardu...   156   1e-40
gb|PLY70831.1| hypothetical protein LSAT_4X39420 [Lactuca sativa]     148   6e-38
ref|XP_023737661.1| uncharacterized protein LOC111885654 isoform...   148   6e-38
gb|OTF92370.1| putative DNA topoisomerase, type IA, core [Helian...   127   1e-30
ref|XP_022017569.1| uncharacterized protein LOC110917293 [Helian...   104   1e-22

>gb|KVI04791.1| DNA topoisomerase I, bacterial-type [Cynara cardunculus var.
           scolymus]
          Length = 1092

 Score =  156 bits (394), Expect = 1e-40
 Identities = 98/194 (50%), Positives = 112/194 (57%), Gaps = 13/194 (6%)
 Frame = +2

Query: 8   LKPRKVYSSIKPISNRSYSAFSFQPLLRSRS----EPMNTISCGLINGGFGLFAPYKSVI 175
           LKPR+   S +  +N SYSAF FQPLLRS S    EPMN    G+INGGFG FAPYK   
Sbjct: 33  LKPRRASFSFESRTNGSYSAFPFQPLLRSGSADYMEPMNY---GIINGGFGFFAPYKGAF 89

Query: 176 STRHFSQASKAVLPNIVAGDKEGRKKSTSFLAFNKHWKQSKSLTHKKPL---------LV 328
           +   FSQ  +AV    +A D E R KS SFLAFNKHWKQSK+LTHKKPL           
Sbjct: 90  T---FSQVPRAVTSKNIACDGEKRGKSKSFLAFNKHWKQSKTLTHKKPLEFGGRTYLKSS 146

Query: 329 TNAAKKDTASALKDPLLEDGKKEADVVVPALINNEGQLXXXXXXXXXXXXXXXXXXXXTI 508
            NA +   + A +DPLLEDGKKE +VVVP L N E QL                    T+
Sbjct: 147 INALENHDSIASRDPLLEDGKKEMNVVVPPLTNKEDQLSKAKVKKKQQTKIKKVKEKSTV 206

Query: 509 ASEELTTQTSSKKV 550
           ASEE   Q SS+KV
Sbjct: 207 ASEEPAPQASSRKV 220


>gb|PLY70831.1| hypothetical protein LSAT_4X39420 [Lactuca sativa]
          Length = 1096

 Score =  148 bits (374), Expect = 6e-38
 Identities = 95/199 (47%), Positives = 113/199 (56%), Gaps = 16/199 (8%)
 Frame = +2

Query: 2   PRLKPRKVYSSIKPISNRSYSAFSFQPLLRSRS--EPMNTISCGLINGGFGLFAPYKSVI 175
           P LKP+K Y S +  +N+SYS FS QPL RS     PM+TI+ GL+NGGFG+FAPY    
Sbjct: 45  PSLKPQKPYFSFRSHTNQSYSTFSIQPLSRSSPPIRPMSTINYGLMNGGFGIFAPY---- 100

Query: 176 STRHFSQASKAVLPNIVAGDKEGRKKSTSFLAFNKHWKQSKSLTHKKPLLV--------T 331
             RHFSQ  +AV+P   A D EGR+KSTSFL+FNK+WKQ+K LTHKKPL V         
Sbjct: 101 --RHFSQIPRAVIPKNDACDGEGRRKSTSFLSFNKNWKQTKRLTHKKPLEVGGRTSTKSI 158

Query: 332 NAAKKDTASALKDPLLEDGKKEADVVVPALINNEGQLXXXXXXXXXXXXXXXXXXXXTIA 511
            A KKD   ALKD  +EDGK E +     L+ NE QL                     +A
Sbjct: 159 KAVKKDDILALKDSPIEDGKIEVN-----LVENESQLPKTKAKKKPTPKVKKSQEKSAVA 213

Query: 512 SEE------LTTQTSSKKV 550
           SEE         Q SSKKV
Sbjct: 214 SEEPPPPPPPQPQASSKKV 232


>ref|XP_023737661.1| uncharacterized protein LOC111885654 isoform X1 [Lactuca sativa]
 ref|XP_023737662.1| uncharacterized protein LOC111885654 isoform X2 [Lactuca sativa]
          Length = 1126

 Score =  148 bits (374), Expect = 6e-38
 Identities = 95/199 (47%), Positives = 113/199 (56%), Gaps = 16/199 (8%)
 Frame = +2

Query: 2   PRLKPRKVYSSIKPISNRSYSAFSFQPLLRSRS--EPMNTISCGLINGGFGLFAPYKSVI 175
           P LKP+K Y S +  +N+SYS FS QPL RS     PM+TI+ GL+NGGFG+FAPY    
Sbjct: 45  PSLKPQKPYFSFRSHTNQSYSTFSIQPLSRSSPPIRPMSTINYGLMNGGFGIFAPY---- 100

Query: 176 STRHFSQASKAVLPNIVAGDKEGRKKSTSFLAFNKHWKQSKSLTHKKPLLV--------T 331
             RHFSQ  +AV+P   A D EGR+KSTSFL+FNK+WKQ+K LTHKKPL V         
Sbjct: 101 --RHFSQIPRAVIPKNDACDGEGRRKSTSFLSFNKNWKQTKRLTHKKPLEVGGRTSTKSI 158

Query: 332 NAAKKDTASALKDPLLEDGKKEADVVVPALINNEGQLXXXXXXXXXXXXXXXXXXXXTIA 511
            A KKD   ALKD  +EDGK E +     L+ NE QL                     +A
Sbjct: 159 KAVKKDDILALKDSPIEDGKIEVN-----LVENESQLPKTKAKKKPTPKVKKSQEKSAVA 213

Query: 512 SEE------LTTQTSSKKV 550
           SEE         Q SSKKV
Sbjct: 214 SEEPPPPPPPQPQASSKKV 232


>gb|OTF92370.1| putative DNA topoisomerase, type IA, core [Helianthus annuus]
          Length = 1150

 Score =  127 bits (319), Expect = 1e-30
 Identities = 75/140 (53%), Positives = 87/140 (62%), Gaps = 11/140 (7%)
 Frame = +2

Query: 11  KPRKVYSSIKPISNRSYSAFSFQPLLRSRSE----PMNTISCGLINGGFGLFAPYKSVIS 178
           KP K Y  +    NRSYSAF  QPL RS S     P+N+IS  L NGGFG FAPYK V+S
Sbjct: 51  KPLKPYFHLGSHINRSYSAFPIQPLFRSSSRDNMLPLNSISYSLTNGGFGTFAPYKGVLS 110

Query: 179 TRHFSQASKAVLPNIVAGDKEGRKKSTSFLAFNKHWKQSKSLTHK-----KPLLVTN--A 337
            R FS   +A+LP I  GD E R KSTSFLAFNKHW QSK L  +     KP + ++  A
Sbjct: 111 MRRFS-TPRAILPKIAVGDGEKRAKSTSFLAFNKHWTQSKKLKKRVGNEGKPNVKSSLKA 169

Query: 338 AKKDTASALKDPLLEDGKKE 397
            + D    LKDP +EDGKKE
Sbjct: 170 VENDEIVTLKDPPVEDGKKE 189


>ref|XP_022017569.1| uncharacterized protein LOC110917293 [Helianthus annuus]
          Length = 1067

 Score =  104 bits (260), Expect = 1e-22
 Identities = 58/105 (55%), Positives = 69/105 (65%), Gaps = 7/105 (6%)
 Frame = +2

Query: 104 PMNTISCGLINGGFGLFAPYKSVISTRHFSQASKAVLPNIVAGDKEGRKKSTSFLAFNKH 283
           P+N+IS  L NGGFG FAPYK V+S R FS   +A+LP I  GD E R KSTSFLAFNKH
Sbjct: 3   PLNSISYSLTNGGFGTFAPYKGVLSMRRFS-TPRAILPKIAVGDGEKRAKSTSFLAFNKH 61

Query: 284 WKQSKSLTHK-----KPLLVTN--AAKKDTASALKDPLLEDGKKE 397
           W QSK L  +     KP + ++  A + D    LKDP +EDGKKE
Sbjct: 62  WTQSKKLKKRVGNEGKPNVKSSLKAVENDEIVTLKDPPVEDGKKE 106

