BLASTX nr result

ID: Glycyrrhiza32_contig00032094 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00032094
         (472 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KHN09844.1 hypothetical protein glysoja_017145 [Glycine soja]          85   9e-18
XP_006600846.1 PREDICTED: uncharacterized protein DDB_G0284459 [...    85   3e-16
XP_006579644.2 PREDICTED: uncharacterized protein DDB_G0284459-l...    76   5e-14
XP_014627151.1 PREDICTED: uncharacterized protein LOC106797386 [...    77   1e-13
KRG94729.1 hypothetical protein GLYMA_19G105000 [Glycine max]          77   1e-13
KRH57469.1 hypothetical protein GLYMA_05G0628002, partial [Glyci...    71   3e-12
KHN44924.1 hypothetical protein glysoja_036135 [Glycine soja]          70   5e-12
BAD93737.1 glycoprotein homolog, partial [Arabidopsis thaliana]        60   2e-09
XP_012070042.1 PREDICTED: uncharacterized protein LOC105632305 [...    64   5e-09
KDP39910.1 hypothetical protein JCGZ_03441 [Jatropha curcas]           64   5e-09
KRH25518.1 hypothetical protein GLYMA_12G109100 [Glycine max]          64   6e-09
XP_006593062.1 PREDICTED: uncharacterized protein LOC102667948 [...    64   6e-09
CDX76590.1 BnaA08g08100D [Brassica napus]                              58   3e-08
XP_002868098.1 hydroxyproline-rich glycoprotein family protein [...    62   3e-08
XP_014631510.1 PREDICTED: uncharacterized protein LOC102667348 [...    60   4e-08
XP_016900571.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...    61   6e-08
XP_006282590.1 hypothetical protein CARUB_v10004732mg [Capsella ...    60   8e-08
XP_017640106.1 PREDICTED: uncharacterized protein LOC108481491 [...    59   9e-08
OAO97873.1 hypothetical protein AXX17_AT4G19780 [Arabidopsis tha...    60   1e-07
CAB88077.1 hypothetical protein, partial [Arabidopsis thaliana]        60   1e-07

>KHN09844.1 hypothetical protein glysoja_017145 [Glycine soja]
          Length = 168

 Score = 84.7 bits (208), Expect = 9e-18
 Identities = 63/144 (43%), Positives = 72/144 (50%), Gaps = 18/144 (12%)
 Frame = -2

Query: 384 SFNKGDSFDKELKRSFTSERNY-----------LKQRSSTGHNKLMGRASAPLVS-VEEK 241
           SFN+  SF+KELKRSF SERN              QR+S   +K+MG AS PLVS   EK
Sbjct: 16  SFNEEPSFNKELKRSFKSERNMPVGKKIDEENKPMQRTSFRSDKIMGHASVPLVSQPAEK 75

Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSN------GGDVDK 79
           ESF                         VQ +  +           S+      G DVDK
Sbjct: 76  ESFLVESNDDEDEDTETEDEDVGGGRIIVQNQNNDSGKGPQVIGGESSKTDGDEGPDVDK 135

Query: 78  KADEFIAKFREQIRLQRIDSIKRS 7
           KADEFIAKFREQIRLQRI+ IKRS
Sbjct: 136 KADEFIAKFREQIRLQRIECIKRS 159


>XP_006600846.1 PREDICTED: uncharacterized protein DDB_G0284459 [Glycine max]
           KRH04180.1 hypothetical protein GLYMA_17G144800 [Glycine
           max]
          Length = 574

 Score = 84.7 bits (208), Expect = 3e-16
 Identities = 63/144 (43%), Positives = 72/144 (50%), Gaps = 18/144 (12%)
 Frame = -2

Query: 384 SFNKGDSFDKELKRSFTSERNY-----------LKQRSSTGHNKLMGRASAPLVS-VEEK 241
           SFN+  SF+KELKRSF SERN              QR+S   +K+MG AS PLVS   EK
Sbjct: 422 SFNEEPSFNKELKRSFKSERNMPVGKKIDEENKPMQRTSFRSDKIMGHASVPLVSQPAEK 481

Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSN------GGDVDK 79
           ESF                         VQ +  +           S+      G DVDK
Sbjct: 482 ESFLVESNDDEDEDTETEDEDVGGGRIIVQNQNNDSGKGPQVIGGESSKTDGDEGPDVDK 541

Query: 78  KADEFIAKFREQIRLQRIDSIKRS 7
           KADEFIAKFREQIRLQRI+ IKRS
Sbjct: 542 KADEFIAKFREQIRLQRIECIKRS 565


>XP_006579644.2 PREDICTED: uncharacterized protein DDB_G0284459-like [Glycine max]
          Length = 231

 Score = 76.3 bits (186), Expect = 5e-14
 Identities = 64/161 (39%), Positives = 72/161 (44%), Gaps = 17/161 (10%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-----------NYLKQRSSTGH 292
           +FQKS      +K     S N+   F+KELKRSFTSER           N   Q +    
Sbjct: 66  MFQKSV----FMKPRFGGSSNEAPCFNKELKRSFTSERTTPVGKKSDEENKSMQPTLFRS 121

Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGE-----XX 130
           NK +G AS PLVS   EKES                        + V K           
Sbjct: 122 NKFIGHASVPLVSQPAEKESLLVESDDDDDDDDTEIEDQDVEGGRTVAKNNDSGKSPPVI 181

Query: 129 XXXXXXXXXSNGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7
                      G DVDKKADEFIAKFREQIRLQRI+SIKRS
Sbjct: 182 GGESSKTDGDEGPDVDKKADEFIAKFREQIRLQRIESIKRS 222


>XP_014627151.1 PREDICTED: uncharacterized protein LOC106797386 [Glycine max]
          Length = 363

 Score = 77.0 bits (188), Expect = 1e-13
 Identities = 65/161 (40%), Positives = 79/161 (49%), Gaps = 17/161 (10%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292
           +FQKS      +K     S N+  SF+KELKRSFTSER       ++ + +S  G     
Sbjct: 198 MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 253

Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121
           NK MG AS PLVS   EKES                        + V    +RG+     
Sbjct: 254 NKFMGHASVPLVSQPAEKESLLVESDDDDDDDDTETEDQDVEAGRIVAQNNDRGKGPPVI 313

Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRS 7
                  +G    DVDKKA+EFIAKF EQIRLQRI+SIKRS
Sbjct: 314 GGESSKIDGDEGPDVDKKANEFIAKFIEQIRLQRIESIKRS 354


>KRG94729.1 hypothetical protein GLYMA_19G105000 [Glycine max]
          Length = 377

 Score = 77.0 bits (188), Expect = 1e-13
 Identities = 65/161 (40%), Positives = 79/161 (49%), Gaps = 17/161 (10%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292
           +FQKS      +K     S N+  SF+KELKRSFTSER       ++ + +S  G     
Sbjct: 212 MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 267

Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121
           NK MG AS PLVS   EKES                        + V    +RG+     
Sbjct: 268 NKFMGHASVPLVSQPAEKESLLVESDDDDDDDDTETEDQDVEAGRIVAQNNDRGKGPPVI 327

Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRS 7
                  +G    DVDKKA+EFIAKF EQIRLQRI+SIKRS
Sbjct: 328 GGESSKIDGDEGPDVDKKANEFIAKFIEQIRLQRIESIKRS 368


>KRH57469.1 hypothetical protein GLYMA_05G0628002, partial [Glycine max]
          Length = 208

 Score = 71.2 bits (173), Expect = 3e-12
 Identities = 58/149 (38%), Positives = 66/149 (44%), Gaps = 5/149 (3%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSERNYLKQRSSTGHNKLMGRASAPL 259
           +FQKS      +K     S N+   F+KELKRSFTSER     + S   NK M     P 
Sbjct: 59  MFQKSV----FMKPRFGGSSNEAPCFNKELKRSFTSERTTPVGKKSDEENKSM----QPT 110

Query: 258 VSVEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGE-----XXXXXXXXXXXSNG 94
           +   EKES                        + V K                      G
Sbjct: 111 LFRTEKESLLVESDDDDDDDDTEIEDQDVEGGRTVAKNNDSGKSPPVIGGESSKTDGDEG 170

Query: 93  GDVDKKADEFIAKFREQIRLQRIDSIKRS 7
            DVDKKADEFIAKFREQIRLQRI+SIKRS
Sbjct: 171 PDVDKKADEFIAKFREQIRLQRIESIKRS 199


>KHN44924.1 hypothetical protein glysoja_036135 [Glycine soja]
          Length = 203

 Score = 70.5 bits (171), Expect = 5e-12
 Identities = 63/162 (38%), Positives = 79/162 (48%), Gaps = 17/162 (10%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292
           +FQKS      +K     S N+  SF+KELKRSFTSER       ++ + +S  G     
Sbjct: 27  MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 82

Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121
           NK MG AS  L+S   EKES                        + V    +RG+     
Sbjct: 83  NKFMGHASVRLMSQPAEKESLLVESDDDDDDDDTETKDQDIEGGRTVAQNNDRGKGPPVI 142

Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRSA 4
                 ++G    DVDKKA+EFIAKF EQ RLQRI+SIKRSA
Sbjct: 143 GGESSKTDGDEGPDVDKKANEFIAKFIEQPRLQRIESIKRSA 184


>BAD93737.1 glycoprotein homolog, partial [Arabidopsis thaliana]
          Length = 59

 Score = 60.1 bits (144), Expect = 2e-09
 Identities = 29/31 (93%), Positives = 30/31 (96%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7
           NG DVDKKADEFIAKFREQIRLQRI+SIKRS
Sbjct: 19  NGSDVDKKADEFIAKFREQIRLQRIESIKRS 49


>XP_012070042.1 PREDICTED: uncharacterized protein LOC105632305 [Jatropha curcas]
          Length = 453

 Score = 63.9 bits (154), Expect = 5e-09
 Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 11/131 (8%)
 Frame = -2

Query: 360 DKELKRSFTSE----RNYL-------KQRSSTGHNKLMGRASAPLVSVEEKESFFXXXXX 214
           +K+LKRSFT +    R  L       K++ S      M + S      EEKE +      
Sbjct: 307 EKDLKRSFTGKFGEGRQMLFDEAPPKKEKQSRDRVTFMAQPSFKEFPKEEKEEYVEKIVM 366

Query: 213 XXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRL 34
                           E+   ++ G              G DVDKKADEFIAKFREQIRL
Sbjct: 367 ESEDDDMETEYEDEEEEEIAGRDFGLTNSKKNEQVGSDGGPDVDKKADEFIAKFREQIRL 426

Query: 33  QRIDSIKRSAG 1
           QRI+SIKRS+G
Sbjct: 427 QRIESIKRSSG 437


>KDP39910.1 hypothetical protein JCGZ_03441 [Jatropha curcas]
          Length = 483

 Score = 63.9 bits (154), Expect = 5e-09
 Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 11/131 (8%)
 Frame = -2

Query: 360 DKELKRSFTSE----RNYL-------KQRSSTGHNKLMGRASAPLVSVEEKESFFXXXXX 214
           +K+LKRSFT +    R  L       K++ S      M + S      EEKE +      
Sbjct: 337 EKDLKRSFTGKFGEGRQMLFDEAPPKKEKQSRDRVTFMAQPSFKEFPKEEKEEYVEKIVM 396

Query: 213 XXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRL 34
                           E+   ++ G              G DVDKKADEFIAKFREQIRL
Sbjct: 397 ESEDDDMETEYEDEEEEEIAGRDFGLTNSKKNEQVGSDGGPDVDKKADEFIAKFREQIRL 456

Query: 33  QRIDSIKRSAG 1
           QRI+SIKRS+G
Sbjct: 457 QRIESIKRSSG 467


>KRH25518.1 hypothetical protein GLYMA_12G109100 [Glycine max]
          Length = 347

 Score = 63.5 bits (153), Expect = 6e-09
 Identities = 59/153 (38%), Positives = 70/153 (45%), Gaps = 18/153 (11%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPR-PSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG---- 295
           +FQKS     M   PR   S NK  SF+KELKR FTSER       ++ + +S  G    
Sbjct: 200 MFQKS-----MFMKPRFDGSSNKAPSFNKELKRCFTSERTTPVGKKSHEENKSMQGTLFR 254

Query: 294 HNKLMGRASAPLVSVE-EKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQK--ERGEXXXX 124
            NK MG AS PLVS   EKES                        + V +  +RG+    
Sbjct: 255 SNKFMGHASVPLVSQPVEKESLLVESDDDDDDNDTETEDQDVQGGRTVAQNNDRGKGPPV 314

Query: 123 XXXXXXXSNGG---DVDKKADEFIAKFREQIRL 34
                   NG    DVDKKADEFI KF EQ+RL
Sbjct: 315 IGGESSKINGDEGPDVDKKADEFIDKFIEQVRL 347


>XP_006593062.1 PREDICTED: uncharacterized protein LOC102667948 [Glycine max]
          Length = 372

 Score = 63.5 bits (153), Expect = 6e-09
 Identities = 59/153 (38%), Positives = 70/153 (45%), Gaps = 18/153 (11%)
 Frame = -2

Query: 438 IFQKSSTSNSMVKVPR-PSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG---- 295
           +FQKS     M   PR   S NK  SF+KELKR FTSER       ++ + +S  G    
Sbjct: 225 MFQKS-----MFMKPRFDGSSNKAPSFNKELKRCFTSERTTPVGKKSHEENKSMQGTLFR 279

Query: 294 HNKLMGRASAPLVSVE-EKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQK--ERGEXXXX 124
            NK MG AS PLVS   EKES                        + V +  +RG+    
Sbjct: 280 SNKFMGHASVPLVSQPVEKESLLVESDDDDDDNDTETEDQDVQGGRTVAQNNDRGKGPPV 339

Query: 123 XXXXXXXSNGG---DVDKKADEFIAKFREQIRL 34
                   NG    DVDKKADEFI KF EQ+RL
Sbjct: 340 IGGESSKINGDEGPDVDKKADEFIDKFIEQVRL 372


>CDX76590.1 BnaA08g08100D [Brassica napus]
          Length = 89

 Score = 57.8 bits (138), Expect = 3e-08
 Identities = 29/32 (90%), Positives = 30/32 (93%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRSA 4
           N  DVDKKADEFIAKFREQIRLQRI+SIKRSA
Sbjct: 49  NISDVDKKADEFIAKFREQIRLQRIESIKRSA 80


>XP_002868098.1 hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
           subsp. lyrata] EFH44357.1 hydroxyproline-rich
           glycoprotein family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 461

 Score = 61.6 bits (148), Expect = 3e-08
 Identities = 30/32 (93%), Positives = 31/32 (96%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRSA 4
           NG DVDKKADEFIAKFREQIRLQRI+SIKRSA
Sbjct: 421 NGSDVDKKADEFIAKFREQIRLQRIESIKRSA 452


>XP_014631510.1 PREDICTED: uncharacterized protein LOC102667348 [Glycine max]
          Length = 246

 Score = 60.5 bits (145), Expect = 4e-08
 Identities = 55/143 (38%), Positives = 64/143 (44%), Gaps = 17/143 (11%)
 Frame = -2

Query: 384 SFNKGDSFDKELKRSFTSER-----------NYLKQRSSTGHNKLMGRASAPLVS-VEEK 241
           S N+  SF+KELKRSFTSER           N  KQ +   +NK MG AS PLVS   EK
Sbjct: 103 SSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSKQGTLFRNNKFMGHASVPLVSQPAEK 162

Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXXXXXXXXSN---GGDVDKK 76
           ES                        + V    +RG+            +   G DVDKK
Sbjct: 163 ESLLVESNDDDDDNDTETEDQDVEGGRTVAQNNDRGKGPPVIGGESSKIDGHEGPDVDKK 222

Query: 75  ADEFIAKFREQIRLQRIDSIKRS 7
           ADEFIAK        RI+SIKRS
Sbjct: 223 ADEFIAK--------RIESIKRS 237


>XP_016900571.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           DDB_G0284459 [Cucumis melo]
          Length = 566

 Score = 60.8 bits (146), Expect = 6e-08
 Identities = 32/51 (62%), Positives = 35/51 (68%)
 Frame = -2

Query: 153 QKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1
           +KE  E             G DVDKKADEFIAKFREQIRLQRI+SIKRS+G
Sbjct: 506 EKEEEEEEAGSASNIGNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSG 556


>XP_006282590.1 hypothetical protein CARUB_v10004732mg [Capsella rubella]
           EOA15488.1 hypothetical protein CARUB_v10004732mg
           [Capsella rubella]
          Length = 471

 Score = 60.5 bits (145), Expect = 8e-08
 Identities = 30/33 (90%), Positives = 31/33 (93%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1
           N  DVDKKADEFIAKFREQIRLQRI+SIKRSAG
Sbjct: 431 NVSDVDKKADEFIAKFREQIRLQRIESIKRSAG 463


>XP_017640106.1 PREDICTED: uncharacterized protein LOC108481491 [Gossypium
           arboreum]
          Length = 221

 Score = 59.3 bits (142), Expect = 9e-08
 Identities = 29/32 (90%), Positives = 30/32 (93%)
 Frame = -2

Query: 96  GGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1
           G DVDKKADEFIAK REQIRLQRIDSIKRS+G
Sbjct: 182 GSDVDKKADEFIAKVREQIRLQRIDSIKRSSG 213


>OAO97873.1 hypothetical protein AXX17_AT4G19780 [Arabidopsis thaliana]
          Length = 468

 Score = 60.1 bits (144), Expect = 1e-07
 Identities = 29/31 (93%), Positives = 30/31 (96%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7
           NG DVDKKADEFIAKFREQIRLQRI+SIKRS
Sbjct: 428 NGSDVDKKADEFIAKFREQIRLQRIESIKRS 458


>CAB88077.1 hypothetical protein, partial [Arabidopsis thaliana]
          Length = 471

 Score = 60.1 bits (144), Expect = 1e-07
 Identities = 29/31 (93%), Positives = 30/31 (96%)
 Frame = -2

Query: 99  NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7
           NG DVDKKADEFIAKFREQIRLQRI+SIKRS
Sbjct: 431 NGSDVDKKADEFIAKFREQIRLQRIESIKRS 461

