BLASTX nr result

ID: Chrysanthemum22_contig00016471 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00016471
         (1047 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_021977590.1| nuclear pore complex protein DDB_G0274915 is...   169   7e-43
ref|XP_021977589.1| nuclear pore complex protein DDB_G0274915 is...   169   7e-43
gb|KVI04577.1| hypothetical protein Ccrd_017110 [Cynara carduncu...   141   2e-33
ref|XP_023771815.1| uncharacterized protein LOC111920464 isoform...    68   1e-08
ref|XP_023771811.1| uncharacterized protein LOC111920464 isoform...    68   1e-08
gb|PLY97869.1| hypothetical protein LSAT_2X134840 [Lactuca sativa]     68   1e-08

>ref|XP_021977590.1| nuclear pore complex protein DDB_G0274915 isoform X2 [Helianthus
           annuus]
          Length = 896

 Score =  169 bits (427), Expect = 7e-43
 Identities = 100/206 (48%), Positives = 123/206 (59%), Gaps = 8/206 (3%)
 Frame = -1

Query: 597 DGSRGKQDSAMPLGFFGDYVARENNTGPSVVGKSNG-AFPAANHGLVSTGFSKENSTTWN 421
           DGSRGK+DS+     FG  V RENN GPS++G S G + PA  HGLVS+GFS E++ TWN
Sbjct: 331 DGSRGKEDSSSFGPVFGRPVQRENNVGPSIIGASLGTSSPATGHGLVSSGFSNEHTFTWN 390

Query: 420 GNASYY--ETRPFFDDSSSALKSPSITKPSXXXXXXXXXXXXXXXXXXXXXSLFNSNNVL 247
             A  Y  ETR  F DSS+   SP  TK                         FN NN+ 
Sbjct: 391 DYAPNYSFETRSVFYDSSTDHLSPLTTKSQDISSTSP----------------FNKNNLS 434

Query: 246 DNNKPLKESEPGHPVGFQFKATSSG-----NNTEDVKSASNLSEHVDHHQAEEDSPCWKG 82
            + KPLKE+E    VGF++K++ S      N+TE+V SA ++SEH+DHH   EDSPCWKG
Sbjct: 435 VSPKPLKENESYPAVGFEYKSSFSTQVPDINSTEEVNSAEHVSEHLDHHNPGEDSPCWKG 494

Query: 81  ASTHFSLFGSLQGESSQHPIKKLQEN 4
           A T FS  GS + ESSQHP+KKLQ N
Sbjct: 495 APTRFSSSGSQEEESSQHPMKKLQSN 520



 Score = 94.0 bits (232), Expect = 3e-17
 Identities = 69/168 (41%), Positives = 87/168 (51%), Gaps = 21/168 (12%)
 Frame = -1

Query: 1044 MNSSVKSGLEGVGQFGGNLYGNVKPTTP---WAFGDGDREVNDVPVVGFDMLS------- 895
            MNSSVKSGLE V   GG+++G VK  T    W+ G+ D   +  P++  D +        
Sbjct: 250  MNSSVKSGLEHVDPPGGHVFG-VKSNTQSQKWSLGN-DGSDSGAPMMPLDYVLLPSRSFV 307

Query: 894  GDFGSSQVDYGQSLFEAKYGGQADGSRGKEGSPMFXXXXXXXXXXDKNVGPSVISKS-HD 718
             D+GSSQV Y QSL  + Y G ADGSRGKE S  F          + NVGPS+I  S   
Sbjct: 308  QDYGSSQVGYSQSLSGSMYSGPADGSRGKEDSSSFGPVFGRPVQRENNVGPSIIGASLGT 367

Query: 717  AYPATNH--------KENGLSWN--GPTVPFETRPLLWSSSGDIKSPL 604
            + PAT H         E+  +WN   P   FETR + + SS D  SPL
Sbjct: 368  SSPATGHGLVSSGFSNEHTFTWNDYAPNYSFETRSVFYDSSTDHLSPL 415


>ref|XP_021977589.1| nuclear pore complex protein DDB_G0274915 isoform X1 [Helianthus
           annuus]
 gb|OTG18706.1| putative outer protein D [Helianthus annuus]
          Length = 898

 Score =  169 bits (427), Expect = 7e-43
 Identities = 100/206 (48%), Positives = 123/206 (59%), Gaps = 8/206 (3%)
 Frame = -1

Query: 597 DGSRGKQDSAMPLGFFGDYVARENNTGPSVVGKSNG-AFPAANHGLVSTGFSKENSTTWN 421
           DGSRGK+DS+     FG  V RENN GPS++G S G + PA  HGLVS+GFS E++ TWN
Sbjct: 331 DGSRGKEDSSSFGPVFGRPVQRENNVGPSIIGASLGTSSPATGHGLVSSGFSNEHTFTWN 390

Query: 420 GNASYY--ETRPFFDDSSSALKSPSITKPSXXXXXXXXXXXXXXXXXXXXXSLFNSNNVL 247
             A  Y  ETR  F DSS+   SP  TK                         FN NN+ 
Sbjct: 391 DYAPNYSFETRSVFYDSSTDHLSPLTTKSQDISSTSP----------------FNKNNLS 434

Query: 246 DNNKPLKESEPGHPVGFQFKATSSG-----NNTEDVKSASNLSEHVDHHQAEEDSPCWKG 82
            + KPLKE+E    VGF++K++ S      N+TE+V SA ++SEH+DHH   EDSPCWKG
Sbjct: 435 VSPKPLKENESYPAVGFEYKSSFSTQVPDINSTEEVNSAEHVSEHLDHHNPGEDSPCWKG 494

Query: 81  ASTHFSLFGSLQGESSQHPIKKLQEN 4
           A T FS  GS + ESSQHP+KKLQ N
Sbjct: 495 APTRFSSSGSQEEESSQHPMKKLQSN 520



 Score = 94.0 bits (232), Expect = 3e-17
 Identities = 69/168 (41%), Positives = 87/168 (51%), Gaps = 21/168 (12%)
 Frame = -1

Query: 1044 MNSSVKSGLEGVGQFGGNLYGNVKPTTP---WAFGDGDREVNDVPVVGFDMLS------- 895
            MNSSVKSGLE V   GG+++G VK  T    W+ G+ D   +  P++  D +        
Sbjct: 250  MNSSVKSGLEHVDPPGGHVFG-VKSNTQSQKWSLGN-DGSDSGAPMMPLDYVLLPSRSFV 307

Query: 894  GDFGSSQVDYGQSLFEAKYGGQADGSRGKEGSPMFXXXXXXXXXXDKNVGPSVISKS-HD 718
             D+GSSQV Y QSL  + Y G ADGSRGKE S  F          + NVGPS+I  S   
Sbjct: 308  QDYGSSQVGYSQSLSGSMYSGPADGSRGKEDSSSFGPVFGRPVQRENNVGPSIIGASLGT 367

Query: 717  AYPATNH--------KENGLSWN--GPTVPFETRPLLWSSSGDIKSPL 604
            + PAT H         E+  +WN   P   FETR + + SS D  SPL
Sbjct: 368  SSPATGHGLVSSGFSNEHTFTWNDYAPNYSFETRSVFYDSSTDHLSPL 415


>gb|KVI04577.1| hypothetical protein Ccrd_017110 [Cynara cardunculus var. scolymus]
          Length = 1037

 Score =  141 bits (356), Expect = 2e-33
 Identities = 127/385 (32%), Positives = 157/385 (40%), Gaps = 91/385 (23%)
 Frame = -1

Query: 885  GSSQVDYGQSLFEAKY--------GGQADGSRGKEGSPMFXXXXXXXXXXDKNVGPSVIS 730
            GSSQVDY Q L   KY        GG ADGSRG                 + NV  S+  
Sbjct: 208  GSSQVDYSQHLSGLKYNAPQVGTWGGLADGSRGNN-----LDAEQSFFSEEANVAGSLAC 262

Query: 729  KSHDAYPATNHKENGLSWNGPTVPFETRPLLWSSSGDIKSPLINDGSRGKQDSAMPLGFF 550
            +S+                           +   + D++S      S+GK+D AM  G  
Sbjct: 263  RSY---------------------------MNQGAYDVESL-----SKGKEDPAMFPGRH 290

Query: 549  GDYVARENNTGPSVVGKSNGA-FPAANHGLVSTGFSKENSTT------------------ 427
             + V RE N GPS+VGKS+GA F A N GL   GFSKE + T                  
Sbjct: 291  SNLV-REKNIGPSIVGKSHGASFSAVNQGLNLNGFSKELTFTAFPEFSESHPLVPSPEPP 349

Query: 426  ---WNGNASY--YETRPFFDDSSSALKSPSITK---------PSXXXXXXXXXXXXXXXX 289
               WN ++SY  Y TR  FD  ++ LK PSITK         P+                
Sbjct: 350  KEPWNNHSSYTPYGTRSLFDTYTN-LKPPSITKSLPSVVIKPPASLSTFSAQGAVSSKNV 408

Query: 288  XXXXXSLFNSNNVLDNNKPLKESEPGHPVGFQFKATSSG--------------------- 172
                   FNSN+VL ++KPLKE E   P+GF+ K +S                       
Sbjct: 409  EISSTPAFNSNDVLVSHKPLKEKESHLPLGFEAKGSSLALNQLSFQIGRSDDHVLVDSSA 468

Query: 171  -----------------------------NNTEDVKSASNLSEHVDHHQAEEDSPCWKGA 79
                                         N T+D KSA+NLSEH+DHH   EDSPCWKGA
Sbjct: 469  RRDASNMMSTDDQLDFKFKSIPNVQFPDINITKDGKSATNLSEHLDHHNPAEDSPCWKGA 528

Query: 78   STHFSLFGSLQGESSQHPIKKLQEN 4
             THFS FGS   E  QHP+KK  E+
Sbjct: 529  PTHFSPFGSPDAEPPQHPMKKRHEH 553


>ref|XP_023771815.1| uncharacterized protein LOC111920464 isoform X2 [Lactuca sativa]
          Length = 836

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 117/443 (26%), Positives = 153/443 (34%), Gaps = 97/443 (21%)
 Frame = -1

Query: 1047 GMNSSVKSGLEGV------------------GQFGGNLYGNVKPTTPWAFGDGDREVNDV 922
            G+NS+ K GLE +                    F  NL    KP  P          +D+
Sbjct: 101  GLNSNGKHGLEQITPGRPHWSAVYPSPKRVADSFSYNL-SEAKPFYPPYASSSSVNNDDI 159

Query: 921  PVV-----GFDMLS----------GDFGSSQVDYGQSLFEAKYGGQADGSRGKEGSPMFX 787
            P+V     G+D+LS          GD  +SQVDY +SL   +Y  Q D S    G P   
Sbjct: 160  PLVTFSEPGYDLLSSSGLGLAHGHGD-ETSQVDYTRSLSGLEYNPQHD-SVWSTGLP--- 214

Query: 786  XXXXXXXXXDKNV--GPSVISKSHDAYPATNHKENGLSWNGPTVPFETRPLLWSSSGDIK 613
                      K +    S  S+  +     N+   G   N                    
Sbjct: 215  -----EGKQVKKIESDDSFFSEEANLAAYNNYFNQGAYGN-------------------- 249

Query: 612  SPLINDGSRGKQDSAMPLGFFGDYVARENNTGPSVVGKSNGAFPAAN---HGLVSTGFSK 442
                   S+ K D++  L ++ D + R NN+G    GKS+ A  +AN   H L     SK
Sbjct: 250  ----KSSSKSKDDAS--LFYYADIIRRANNSGH---GKSDAASFSANANTHSLDPNDLSK 300

Query: 441  ENST----------------------TWNGNASY----YETRPFFDDSSSALKSPSITKP 340
              ST                       WN   SY    YE R    D S       ITK 
Sbjct: 301  GGSTFRAFPMFSESHPLIQSPEPPEDLWNSQNSYNPNPYEKRYHMFDCSD----DHITKS 356

Query: 339  SXXXXXXXXXXXXXXXXXXXXXSLFNSNNVLDNNKPLKESEPG---------HPVGFQFK 187
            S                     +L N N+V+ +  PLKE E           + + FQ  
Sbjct: 357  SALVIKPPAAVSSKSLEIGNLSAL-NINDVVGS--PLKEKEQSLSGGSFLAPNQLSFQIG 413

Query: 186  ATSSGNN------------------------TEDVKSASNLSEHVDHHQAEEDSPCWKGA 79
             TSS  N                        T +VK ++N  E  DHH   EDSPCWKGA
Sbjct: 414  RTSSTKNEDMSASASDQLKFEFKAQVPDINTTGNVKISANSFEQFDHHNPAEDSPCWKGA 473

Query: 78   STHFSLFGSLQGESSQHPIKKLQ 10
             T  S F   + +S Q P+KKLQ
Sbjct: 474  PTSQSQFLPFESQSPQPPMKKLQ 496


>ref|XP_023771811.1| uncharacterized protein LOC111920464 isoform X1 [Lactuca sativa]
          Length = 838

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 117/443 (26%), Positives = 153/443 (34%), Gaps = 97/443 (21%)
 Frame = -1

Query: 1047 GMNSSVKSGLEGV------------------GQFGGNLYGNVKPTTPWAFGDGDREVNDV 922
            G+NS+ K GLE +                    F  NL    KP  P          +D+
Sbjct: 101  GLNSNGKHGLEQITPGRPHWSAVYPSPKRVADSFSYNL-SEAKPFYPPYASSSSVNNDDI 159

Query: 921  PVV-----GFDMLS----------GDFGSSQVDYGQSLFEAKYGGQADGSRGKEGSPMFX 787
            P+V     G+D+LS          GD  +SQVDY +SL   +Y  Q D S    G P   
Sbjct: 160  PLVTFSEPGYDLLSSSGLGLAHGHGD-ETSQVDYTRSLSGLEYNPQHD-SVWSTGLP--- 214

Query: 786  XXXXXXXXXDKNV--GPSVISKSHDAYPATNHKENGLSWNGPTVPFETRPLLWSSSGDIK 613
                      K +    S  S+  +     N+   G   N                    
Sbjct: 215  -----EGKQVKKIESDDSFFSEEANLAAYNNYFNQGAYGN-------------------- 249

Query: 612  SPLINDGSRGKQDSAMPLGFFGDYVARENNTGPSVVGKSNGAFPAAN---HGLVSTGFSK 442
                   S+ K D++  L ++ D + R NN+G    GKS+ A  +AN   H L     SK
Sbjct: 250  ----KSSSKSKDDAS--LFYYADIIRRANNSGH---GKSDAASFSANANTHSLDPNDLSK 300

Query: 441  ENST----------------------TWNGNASY----YETRPFFDDSSSALKSPSITKP 340
              ST                       WN   SY    YE R    D S       ITK 
Sbjct: 301  GGSTFRAFPMFSESHPLIQSPEPPEDLWNSQNSYNPNPYEKRYHMFDCSD----DHITKS 356

Query: 339  SXXXXXXXXXXXXXXXXXXXXXSLFNSNNVLDNNKPLKESEPG---------HPVGFQFK 187
            S                     +L N N+V+ +  PLKE E           + + FQ  
Sbjct: 357  SALVIKPPAAVSSKSLEIGNLSAL-NINDVVGS--PLKEKEQSLSGGSFLAPNQLSFQIG 413

Query: 186  ATSSGNN------------------------TEDVKSASNLSEHVDHHQAEEDSPCWKGA 79
             TSS  N                        T +VK ++N  E  DHH   EDSPCWKGA
Sbjct: 414  RTSSTKNEDMSASASDQLKFEFKAQVPDINTTGNVKISANSFEQFDHHNPAEDSPCWKGA 473

Query: 78   STHFSLFGSLQGESSQHPIKKLQ 10
             T  S F   + +S Q P+KKLQ
Sbjct: 474  PTSQSQFLPFESQSPQPPMKKLQ 496


>gb|PLY97869.1| hypothetical protein LSAT_2X134840 [Lactuca sativa]
          Length = 948

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 117/443 (26%), Positives = 153/443 (34%), Gaps = 97/443 (21%)
 Frame = -1

Query: 1047 GMNSSVKSGLEGV------------------GQFGGNLYGNVKPTTPWAFGDGDREVNDV 922
            G+NS+ K GLE +                    F  NL    KP  P          +D+
Sbjct: 101  GLNSNGKHGLEQITPGRPHWSAVYPSPKRVADSFSYNL-SEAKPFYPPYASSSSVNNDDI 159

Query: 921  PVV-----GFDMLS----------GDFGSSQVDYGQSLFEAKYGGQADGSRGKEGSPMFX 787
            P+V     G+D+LS          GD  +SQVDY +SL   +Y  Q D S    G P   
Sbjct: 160  PLVTFSEPGYDLLSSSGLGLAHGHGD-ETSQVDYTRSLSGLEYNPQHD-SVWSTGLP--- 214

Query: 786  XXXXXXXXXDKNV--GPSVISKSHDAYPATNHKENGLSWNGPTVPFETRPLLWSSSGDIK 613
                      K +    S  S+  +     N+   G   N                    
Sbjct: 215  -----EGKQVKKIESDDSFFSEEANLAAYNNYFNQGAYGN-------------------- 249

Query: 612  SPLINDGSRGKQDSAMPLGFFGDYVARENNTGPSVVGKSNGAFPAAN---HGLVSTGFSK 442
                   S+ K D++  L ++ D + R NN+G    GKS+ A  +AN   H L     SK
Sbjct: 250  ----KSSSKSKDDAS--LFYYADIIRRANNSGH---GKSDAASFSANANTHSLDPNDLSK 300

Query: 441  ENST----------------------TWNGNASY----YETRPFFDDSSSALKSPSITKP 340
              ST                       WN   SY    YE R    D S       ITK 
Sbjct: 301  GGSTFRAFPMFSESHPLIQSPEPPEDLWNSQNSYNPNPYEKRYHMFDCSD----DHITKS 356

Query: 339  SXXXXXXXXXXXXXXXXXXXXXSLFNSNNVLDNNKPLKESEPG---------HPVGFQFK 187
            S                     +L N N+V+ +  PLKE E           + + FQ  
Sbjct: 357  SALVIKPPAAVSSKSLEIGNLSAL-NINDVVGS--PLKEKEQSLSGGSFLAPNQLSFQIG 413

Query: 186  ATSSGNN------------------------TEDVKSASNLSEHVDHHQAEEDSPCWKGA 79
             TSS  N                        T +VK ++N  E  DHH   EDSPCWKGA
Sbjct: 414  RTSSTKNEDMSASASDQLKFEFKAQVPDINTTGNVKISANSFEQFDHHNPAEDSPCWKGA 473

Query: 78   STHFSLFGSLQGESSQHPIKKLQ 10
             T  S F   + +S Q P+KKLQ
Sbjct: 474  PTSQSQFLPFESQSPQPPMKKLQ 496

