BLASTX nr result

ID: Chrysanthemum21_contig00033400

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00033400
         (523 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KVI09310.1| Protein of unknown function DUF1635 [Cynara cardu...    85   1e-16
ref|XP_023753074.1| uncharacterized protein LOC111901455 [Lactuc...    84   1e-16
ref|XP_022005748.1| uncharacterized protein LOC110904226 [Helian...    78   3e-14
gb|OTF99027.1| Protein of unknown function (DUF1635) [Helianthus...    78   4e-14
ref|XP_021979578.1| uncharacterized protein LOC110875681 [Helian...    73   1e-12
ref|XP_023738124.1| uncharacterized protein LOC111886120 [Lactuc...    73   2e-12
gb|PLY70510.1| hypothetical protein LSAT_1X64540 [Lactuca sativa]      73   9e-12
ref|XP_019235603.1| PREDICTED: uncharacterized protein LOC109215...    67   2e-10
ref|XP_019235602.1| PREDICTED: uncharacterized protein LOC109215...    67   3e-10
ref|XP_009587804.1| PREDICTED: uncharacterized protein LOC104085...    67   3e-10
ref|XP_016487853.1| PREDICTED: uncharacterized protein LOC107807...    67   3e-10
dbj|GAU37764.1| hypothetical protein TSUD_102810 [Trifolium subt...    66   7e-10
dbj|GAV73452.1| DUF1635 domain-containing protein [Cephalotus fo...    66   8e-10
gb|KHN21073.1| hypothetical protein glysoja_006552 [Glycine soja]      65   1e-09
ref|XP_003539789.1| PREDICTED: uncharacterized protein LOC100782...    65   1e-09
ref|XP_003538191.1| PREDICTED: uncharacterized protein LOC100781...    65   2e-09
ref|XP_009788570.1| PREDICTED: uncharacterized protein LOC104236...    65   3e-09
ref|XP_006400802.1| uncharacterized protein LOC18016568 [Eutrema...    64   4e-09
ref|XP_013464838.1| DUF1635 family protein [Medicago truncatula]...    63   9e-09
ref|XP_021972097.1| uncharacterized protein LOC110867315 [Helian...    63   1e-08

>gb|KVI09310.1| Protein of unknown function DUF1635 [Cynara cardunculus var.
           scolymus]
          Length = 260

 Score = 84.7 bits (208), Expect = 1e-16
 Identities = 47/97 (48%), Positives = 53/97 (54%), Gaps = 10/97 (10%)
 Frame = -2

Query: 501 KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI----------XXXXXX 352
           KGLPE GKFLEAVMK              PHWRHPPPPL+++QI                
Sbjct: 162 KGLPEKGKFLEAVMKAAPLLQNLLLAGPLPHWRHPPPPLDTYQIPSPSLVIPTPPVRHLL 221

Query: 351 XXXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
                     NCGEF+KKR F ED DSS+ETKYQRI+
Sbjct: 222 SQESLRKITYNCGEFNKKRAFSEDRDSSTETKYQRIA 258


>ref|XP_023753074.1| uncharacterized protein LOC111901455 [Lactuca sativa]
 gb|PLY93585.1| hypothetical protein LSAT_2X99840 [Lactuca sativa]
          Length = 256

 Score = 84.3 bits (207), Expect = 1e-16
 Identities = 47/107 (43%), Positives = 56/107 (52%), Gaps = 13/107 (12%)
 Frame = -2

Query: 522 EKFEFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI------ 370
           ++  FP    KGLPE GKFLEA+MK              PHW+HPPPPL+++ I      
Sbjct: 148 QELRFPVVQPKGLPEKGKFLEAMMKAGPLLQNLLLAGPLPHWQHPPPPLDTYHIPSPPLV 207

Query: 369 ----XXXXXXXXXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
                               NCGEF+ KR F EDCDSSSETKYQRI+
Sbjct: 208 ISTPTSHHLSNQDFLRRITNNCGEFNTKRAFSEDCDSSSETKYQRIA 254


>ref|XP_022005748.1| uncharacterized protein LOC110904226 [Helianthus annuus]
          Length = 229

 Score = 77.8 bits (190), Expect = 3e-14
 Identities = 43/89 (48%), Positives = 49/89 (55%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXXXXXXX 328
           P KGLPE GKFLEAVMK              PHWRHPPP L++++I              
Sbjct: 139 PPKGLPEQGKFLEAVMKAGPLLQNLLLAGPLPHWRHPPPQLDAYRIPSPPLAIATQPLYY 198

Query: 327 XVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
             N  E +KKR F ED  SS+ETKYQRIS
Sbjct: 199 TNNAFEVNKKRGFSEDSLSSTETKYQRIS 227


>gb|OTF99027.1| Protein of unknown function (DUF1635) [Helianthus annuus]
          Length = 248

 Score = 77.8 bits (190), Expect = 4e-14
 Identities = 43/89 (48%), Positives = 49/89 (55%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXXXXXXX 328
           P KGLPE GKFLEAVMK              PHWRHPPP L++++I              
Sbjct: 158 PPKGLPEQGKFLEAVMKAGPLLQNLLLAGPLPHWRHPPPQLDAYRIPSPPLAIATQPLYY 217

Query: 327 XVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
             N  E +KKR F ED  SS+ETKYQRIS
Sbjct: 218 TNNAFEVNKKRGFSEDSLSSTETKYQRIS 246


>ref|XP_021979578.1| uncharacterized protein LOC110875681 [Helianthus annuus]
 gb|OTG37510.1| Protein of unknown function (DUF1635) [Helianthus annuus]
          Length = 215

 Score = 73.2 bits (178), Expect = 1e-12
 Identities = 43/94 (45%), Positives = 53/94 (56%), Gaps = 2/94 (2%)
 Frame = -2

Query: 516 FEFPAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXXXX 337
           F F  K LPE GKFLEAVMK              PHWRHPPPPL++++I           
Sbjct: 123 FGFSIKALPEKGKFLEAVMKAGPLLQNLLLAGPLPHWRHPPPPLDAYRI---PPPPVVPP 179

Query: 336 XXXXVNC--GEFSKKRVFYEDCDSSSETKYQRIS 241
               VNC  G+FS+KR F E CDSS +++ QRI+
Sbjct: 180 AMLPVNCLVGKFSRKRGFPEGCDSSIDSRCQRIN 213


>ref|XP_023738124.1| uncharacterized protein LOC111886120 [Lactuca sativa]
          Length = 240

 Score = 72.8 bits (177), Expect = 2e-12
 Identities = 39/94 (41%), Positives = 52/94 (55%)
 Frame = -2

Query: 522 EKFEFPAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXX 343
           ++  FP + LPE G+ LEAV+K              PHWRHPPPPL+++QI         
Sbjct: 146 QELGFPVRALPEKGRLLEAVIKAGPLLQNLLLAGPLPHWRHPPPPLDTYQIPPIPVVLPS 205

Query: 342 XXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
                 +  G F+KKR F E  DSS+ETKYQR++
Sbjct: 206 PPPINHL-LGAFTKKRGFPEGSDSSTETKYQRVA 238


>gb|PLY70510.1| hypothetical protein LSAT_1X64540 [Lactuca sativa]
          Length = 435

 Score = 72.8 bits (177), Expect = 9e-12
 Identities = 39/94 (41%), Positives = 52/94 (55%)
 Frame = -2

Query: 522 EKFEFPAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXX 343
           ++  FP + LPE G+ LEAV+K              PHWRHPPPPL+++QI         
Sbjct: 341 QELGFPVRALPEKGRLLEAVIKAGPLLQNLLLAGPLPHWRHPPPPLDTYQIPPIPVVLPS 400

Query: 342 XXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
                 +  G F+KKR F E  DSS+ETKYQR++
Sbjct: 401 PPPINHL-LGAFTKKRGFPEGSDSSTETKYQRVA 433


>ref|XP_019235603.1| PREDICTED: uncharacterized protein LOC109215934 isoform X2
           [Nicotiana attenuata]
          Length = 229

 Score = 67.4 bits (163), Expect = 2e-10
 Identities = 41/97 (42%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
 Frame = -2

Query: 513 EFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI--XXXXXXX 349
           EFP    K LPENGKFL+AVMK              P WRHPPPP++S++I         
Sbjct: 131 EFPIVIDKPLPENGKFLQAVMKAGPLLQTLLVAGPLPQWRHPPPPMDSYEIPPPPVVIPS 190

Query: 348 XXXXXXXXVNCGEFSKKR--VFYEDCDSSSETKYQRI 244
                    NCG  +KKR    ++D DSS  TKYQR+
Sbjct: 191 QDPIFNAFNNCGRLNKKRGGGLFDDSDSSIGTKYQRV 227


>ref|XP_019235602.1| PREDICTED: uncharacterized protein LOC109215934 isoform X1
           [Nicotiana attenuata]
 gb|OIT25499.1| hypothetical protein A4A49_37172 [Nicotiana attenuata]
          Length = 248

 Score = 67.4 bits (163), Expect = 3e-10
 Identities = 41/97 (42%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
 Frame = -2

Query: 513 EFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI--XXXXXXX 349
           EFP    K LPENGKFL+AVMK              P WRHPPPP++S++I         
Sbjct: 150 EFPIVIDKPLPENGKFLQAVMKAGPLLQTLLVAGPLPQWRHPPPPMDSYEIPPPPVVIPS 209

Query: 348 XXXXXXXXVNCGEFSKKR--VFYEDCDSSSETKYQRI 244
                    NCG  +KKR    ++D DSS  TKYQR+
Sbjct: 210 QDPIFNAFNNCGRLNKKRGGGLFDDSDSSIGTKYQRV 246


>ref|XP_009587804.1| PREDICTED: uncharacterized protein LOC104085471 [Nicotiana
           tomentosiformis]
          Length = 248

 Score = 67.4 bits (163), Expect = 3e-10
 Identities = 41/97 (42%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
 Frame = -2

Query: 513 EFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI--XXXXXXX 349
           EFP    K LPENGKFL+AVMK              P WRHPPPP++S++I         
Sbjct: 150 EFPIVIDKPLPENGKFLQAVMKAGPLLQTILVAGPLPQWRHPPPPMDSYEIPPPPVVIPS 209

Query: 348 XXXXXXXXVNCGEFSKKR--VFYEDCDSSSETKYQRI 244
                    NCG  +KKR    ++D DSS  TKYQR+
Sbjct: 210 QDPIFNAFNNCGRLNKKRGGGLFDDSDSSIGTKYQRV 246


>ref|XP_016487853.1| PREDICTED: uncharacterized protein LOC107807913 [Nicotiana tabacum]
          Length = 249

 Score = 67.4 bits (163), Expect = 3e-10
 Identities = 41/97 (42%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
 Frame = -2

Query: 513 EFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI--XXXXXXX 349
           EFP    K LPENGKFL+AVMK              P WRHPPPP++S++I         
Sbjct: 151 EFPIVIDKPLPENGKFLQAVMKAGPLLQTILVAGPLPQWRHPPPPMDSYEIPPPPVVIPS 210

Query: 348 XXXXXXXXVNCGEFSKKR--VFYEDCDSSSETKYQRI 244
                    NCG  +KKR    ++D DSS  TKYQR+
Sbjct: 211 QDPIFNAFNNCGRLNKKRGGGLFDDSDSSIGTKYQRV 247


>dbj|GAU37764.1| hypothetical protein TSUD_102810 [Trifolium subterraneum]
          Length = 245

 Score = 66.2 bits (160), Expect = 7e-10
 Identities = 37/98 (37%), Positives = 46/98 (46%), Gaps = 10/98 (10%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI----------XXXX 358
           P K LPE GK L+AVMK              P W+HPPPPL S QI              
Sbjct: 145 PNKPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWKHPPPPLESFQIPPVTIPQILHQDSI 204

Query: 357 XXXXXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
                       +CG  S+KRVF++D DS ++ KYQR+
Sbjct: 205 FSSNINTTNSDSHCGRVSRKRVFFDDSDSPNQNKYQRV 242


>dbj|GAV73452.1| DUF1635 domain-containing protein [Cephalotus follicularis]
          Length = 258

 Score = 66.2 bits (160), Expect = 8e-10
 Identities = 39/95 (41%), Positives = 45/95 (47%), Gaps = 7/95 (7%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI-------XXXXXXX 349
           P K LPE GK L+AVMK              P WRHPPPPL S +I              
Sbjct: 161 PDKPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWRHPPPPLESFEIPPVTIPSPPQLLQQ 220

Query: 348 XXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
                    NCG+ ++KRV  E  DS +ETKYQRI
Sbjct: 221 DSLMNIGLNNCGKVNRKRVLCEGSDSPTETKYQRI 255


>gb|KHN21073.1| hypothetical protein glysoja_006552 [Glycine soja]
          Length = 242

 Score = 65.5 bits (158), Expect = 1e-09
 Identities = 38/93 (40%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQ-----IXXXXXXXXX 343
           P K LPE GK L+AVMK              P WRHPPPPL S +     I         
Sbjct: 147 PDKPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWRHPPPPLESFEIPPVTIPSPPQPQLP 206

Query: 342 XXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
                  NCG  S+KRVF E  DS ++ K+QRI
Sbjct: 207 HQDSFTSNCGRVSRKRVFCEGTDSPTQNKFQRI 239


>ref|XP_003539789.1| PREDICTED: uncharacterized protein LOC100782615 [Glycine max]
 gb|KRH25102.1| hypothetical protein GLYMA_12G080800 [Glycine max]
          Length = 242

 Score = 65.5 bits (158), Expect = 1e-09
 Identities = 38/93 (40%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQ-----IXXXXXXXXX 343
           P K LPE GK L+AVMK              P WRHPPPPL S +     I         
Sbjct: 147 PDKPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWRHPPPPLESFEIPPVTIPSPPQPQLP 206

Query: 342 XXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
                  NCG  S+KRVF E  DS ++ K+QRI
Sbjct: 207 HQDSFTSNCGRVSRKRVFCEGTDSPTQNKFQRI 239


>ref|XP_003538191.1| PREDICTED: uncharacterized protein LOC100781323 [Glycine max]
 gb|KHN38708.1| hypothetical protein glysoja_008257 [Glycine soja]
 gb|KRH30569.1| hypothetical protein GLYMA_11G193200 [Glycine max]
          Length = 259

 Score = 65.1 bits (157), Expect = 2e-09
 Identities = 37/92 (40%), Positives = 44/92 (47%), Gaps = 6/92 (6%)
 Frame = -2

Query: 501 KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI------XXXXXXXXXX 340
           K LPE GK L+AVMK              P WRHPPPPL S +I                
Sbjct: 165 KPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWRHPPPPLESFEIPPVTIPSPPPQQQLHQ 224

Query: 339 XXXXXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
                 NCG  S+KRVF+E  DS ++ K+QRI
Sbjct: 225 DSFITSNCGRVSRKRVFFEGTDSPTQNKFQRI 256


>ref|XP_009788570.1| PREDICTED: uncharacterized protein LOC104236360 [Nicotiana
           sylvestris]
 ref|XP_016510747.1| PREDICTED: uncharacterized protein LOC107828009 [Nicotiana tabacum]
          Length = 247

 Score = 64.7 bits (156), Expect = 3e-09
 Identities = 40/97 (41%), Positives = 49/97 (50%), Gaps = 7/97 (7%)
 Frame = -2

Query: 513 EFPA---KGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI--XXXXXXX 349
           EFP    K LPENGKFL+AVMK              P WRHPPPP++S++I         
Sbjct: 149 EFPIIIDKPLPENGKFLQAVMKAGPLLQTLLVAGPLPQWRHPPPPMDSYEIPPPPVVIPS 208

Query: 348 XXXXXXXXVNCGEFSKKR--VFYEDCDSSSETKYQRI 244
                    NCG  +KKR    ++D DSS  TKY R+
Sbjct: 209 QDPIFNAFNNCGRLNKKRGGGLFDDSDSSIGTKYLRV 245


>ref|XP_006400802.1| uncharacterized protein LOC18016568 [Eutrema salsugineum]
 gb|ESQ42255.1| hypothetical protein EUTSA_v10014543mg [Eutrema salsugineum]
          Length = 239

 Score = 63.9 bits (154), Expect = 4e-09
 Identities = 37/89 (41%), Positives = 46/89 (51%)
 Frame = -2

Query: 510 FPAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQIXXXXXXXXXXXXX 331
           FP K LPE GK L+AVMK              P WRHPPPPL + +I             
Sbjct: 154 FPDKPLPEKGKLLQAVMKAGPLLQTLLLAGPLPQWRHPPPPLKTFEI----PPVTVQCPI 209

Query: 330 XXVNCGEFSKKRVFYEDCDSSSETKYQRI 244
               CG+F++KRVF +   S SETKYQ++
Sbjct: 210 VNNGCGKFNRKRVFSD--GSYSETKYQKV 236


>ref|XP_013464838.1| DUF1635 family protein [Medicago truncatula]
 gb|KEH38873.1| DUF1635 family protein [Medicago truncatula]
          Length = 240

 Score = 63.2 bits (152), Expect = 9e-09
 Identities = 36/101 (35%), Positives = 46/101 (45%), Gaps = 10/101 (9%)
 Frame = -2

Query: 507 PAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI----------XXXX 358
           P + LPE GK L+AVMK              P W+HPPPPL S +I              
Sbjct: 140 PNRPLPEKGKLLQAVMKAGPLLKTLLLAGPLPQWKHPPPPLESFEIPPVSIPRILHQDSI 199

Query: 357 XXXXXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS*Y 235
                       +CG  + KRVF++D DS +E KYQR+  Y
Sbjct: 200 FSSNIDTTNANSHCGRVNMKRVFFDDSDSPNENKYQRVVPY 240


>ref|XP_021972097.1| uncharacterized protein LOC110867315 [Helianthus annuus]
 gb|OTG20865.1| putative protein of unknown function DUF1635 [Helianthus annuus]
          Length = 225

 Score = 62.8 bits (151), Expect = 1e-08
 Identities = 42/101 (41%), Positives = 46/101 (45%), Gaps = 10/101 (9%)
 Frame = -2

Query: 513 EFPAKGLPENGKFLEAVMKXXXXXXXXXXXXXXPHWRHPPPPLNSHQI----------XX 364
           + P KGLPENGKFLEAVM               P+WR PPPPL+  QI            
Sbjct: 121 QVPTKGLPENGKFLEAVMNAGPLLQNLLLAGSLPNWRQPPPPLDGLQIPSPPLVVPTRPA 180

Query: 363 XXXXXXXXXXXXXVNCGEFSKKRVFYEDCDSSSETKYQRIS 241
                         NC  F KKR   ED  SS+ TKYQRIS
Sbjct: 181 HHLLSQDYLHNVSSNC-LFKKKRGISEDTVSSTNTKYQRIS 220

