BLASTX nr result
ID: Glycyrrhiza32_contig00032094
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00032094 (472 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

KHN09844.1 hypothetical protein glysoja_017145 [Glycine soja]         85    9e-18
XP_006600846.1 PREDICTED: uncharacterized protein DDB_G0284459 [...   85    3e-16
XP_006579644.2 PREDICTED: uncharacterized protein DDB_G0284459-l...   76    5e-14
XP_014627151.1 PREDICTED: uncharacterized protein LOC106797386 [...   77    1e-13
KRG94729.1 hypothetical protein GLYMA_19G105000 [Glycine max]         77    1e-13
KRH57469.1 hypothetical protein GLYMA_05G0628002, partial [Glyci...   71    3e-12
KHN44924.1 hypothetical protein glysoja_036135 [Glycine soja]         70    5e-12
BAD93737.1 glycoprotein homolog, partial [Arabidopsis thaliana]       60    2e-09
XP_012070042.1 PREDICTED: uncharacterized protein LOC105632305 [...   64    5e-09
KDP39910.1 hypothetical protein JCGZ_03441 [Jatropha curcas]          64    5e-09
KRH25518.1 hypothetical protein GLYMA_12G109100 [Glycine max]         64    6e-09
XP_006593062.1 PREDICTED: uncharacterized protein LOC102667948 [...   64    6e-09
CDX76590.1 BnaA08g08100D [Brassica napus]                             58    3e-08
XP_002868098.1 hydroxyproline-rich glycoprotein family protein [...   62    3e-08
XP_014631510.1 PREDICTED: uncharacterized protein LOC102667348 [...   60    4e-08
XP_016900571.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized p...   61    6e-08
XP_006282590.1 hypothetical protein CARUB_v10004732mg [Capsella ...   60    8e-08
XP_017640106.1 PREDICTED: uncharacterized protein LOC108481491 [...   59    9e-08
OAO97873.1 hypothetical protein AXX17_AT4G19780 [Arabidopsis tha...
60 1e-07 CAB88077.1 hypothetical protein, partial [Arabidopsis thaliana] 60 1e-07 >KHN09844.1 hypothetical protein glysoja_017145 [Glycine soja] Length = 168 Score = 84.7 bits (208), Expect = 9e-18 Identities = 63/144 (43%), Positives = 72/144 (50%), Gaps = 18/144 (12%) Frame = -2 Query: 384 SFNKGDSFDKELKRSFTSERNY-----------LKQRSSTGHNKLMGRASAPLVS-VEEK 241 SFN+ SF+KELKRSF SERN QR+S +K+MG AS PLVS EK Sbjct: 16 SFNEEPSFNKELKRSFKSERNMPVGKKIDEENKPMQRTSFRSDKIMGHASVPLVSQPAEK 75 Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSN------GGDVDK 79 ESF VQ + + S+ G DVDK Sbjct: 76 ESFLVESNDDEDEDTETEDEDVGGGRIIVQNQNNDSGKGPQVIGGESSKTDGDEGPDVDK 135 Query: 78 KADEFIAKFREQIRLQRIDSIKRS 7 KADEFIAKFREQIRLQRI+ IKRS Sbjct: 136 KADEFIAKFREQIRLQRIECIKRS 159 >XP_006600846.1 PREDICTED: uncharacterized protein DDB_G0284459 [Glycine max] KRH04180.1 hypothetical protein GLYMA_17G144800 [Glycine max] Length = 574 Score = 84.7 bits (208), Expect = 3e-16 Identities = 63/144 (43%), Positives = 72/144 (50%), Gaps = 18/144 (12%) Frame = -2 Query: 384 SFNKGDSFDKELKRSFTSERNY-----------LKQRSSTGHNKLMGRASAPLVS-VEEK 241 SFN+ SF+KELKRSF SERN QR+S +K+MG AS PLVS EK Sbjct: 422 SFNEEPSFNKELKRSFKSERNMPVGKKIDEENKPMQRTSFRSDKIMGHASVPLVSQPAEK 481 Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSN------GGDVDK 79 ESF VQ + + S+ G DVDK Sbjct: 482 ESFLVESNDDEDEDTETEDEDVGGGRIIVQNQNNDSGKGPQVIGGESSKTDGDEGPDVDK 541 Query: 78 KADEFIAKFREQIRLQRIDSIKRS 7 KADEFIAKFREQIRLQRI+ IKRS Sbjct: 542 KADEFIAKFREQIRLQRIECIKRS 565 >XP_006579644.2 PREDICTED: uncharacterized protein DDB_G0284459-like [Glycine max] Length = 231 Score = 76.3 bits (186), Expect = 5e-14 Identities = 64/161 (39%), Positives = 72/161 (44%), Gaps = 17/161 (10%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-----------NYLKQRSSTGH 292 +FQKS +K S N+ F+KELKRSFTSER N Q + Sbjct: 66 MFQKSV----FMKPRFGGSSNEAPCFNKELKRSFTSERTTPVGKKSDEENKSMQPTLFRS 121 Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGE-----XX 130 NK +G AS PLVS EKES + V K 
Sbjct: 122 NKFIGHASVPLVSQPAEKESLLVESDDDDDDDDTEIEDQDVEGGRTVAKNNDSGKSPPVI 181 Query: 129 XXXXXXXXXSNGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7 G DVDKKADEFIAKFREQIRLQRI+SIKRS Sbjct: 182 GGESSKTDGDEGPDVDKKADEFIAKFREQIRLQRIESIKRS 222 >XP_014627151.1 PREDICTED: uncharacterized protein LOC106797386 [Glycine max] Length = 363 Score = 77.0 bits (188), Expect = 1e-13 Identities = 65/161 (40%), Positives = 79/161 (49%), Gaps = 17/161 (10%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292 +FQKS +K S N+ SF+KELKRSFTSER ++ + +S G Sbjct: 198 MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 253 Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121 NK MG AS PLVS EKES + V +RG+ Sbjct: 254 NKFMGHASVPLVSQPAEKESLLVESDDDDDDDDTETEDQDVEAGRIVAQNNDRGKGPPVI 313 Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRS 7 +G DVDKKA+EFIAKF EQIRLQRI+SIKRS Sbjct: 314 GGESSKIDGDEGPDVDKKANEFIAKFIEQIRLQRIESIKRS 354 >KRG94729.1 hypothetical protein GLYMA_19G105000 [Glycine max] Length = 377 Score = 77.0 bits (188), Expect = 1e-13 Identities = 65/161 (40%), Positives = 79/161 (49%), Gaps = 17/161 (10%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292 +FQKS +K S N+ SF+KELKRSFTSER ++ + +S G Sbjct: 212 MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 267 Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121 NK MG AS PLVS EKES + V +RG+ Sbjct: 268 NKFMGHASVPLVSQPAEKESLLVESDDDDDDDDTETEDQDVEAGRIVAQNNDRGKGPPVI 327 Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRS 7 +G DVDKKA+EFIAKF EQIRLQRI+SIKRS Sbjct: 328 GGESSKIDGDEGPDVDKKANEFIAKFIEQIRLQRIESIKRS 368 >KRH57469.1 hypothetical protein GLYMA_05G0628002, partial [Glycine max] Length = 208 Score = 71.2 bits (173), Expect = 3e-12 Identities = 58/149 (38%), Positives = 66/149 (44%), Gaps = 5/149 (3%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSERNYLKQRSSTGHNKLMGRASAPL 259 +FQKS +K S N+ F+KELKRSFTSER + S NK M P Sbjct: 59 
MFQKSV----FMKPRFGGSSNEAPCFNKELKRSFTSERTTPVGKKSDEENKSM----QPT 110 Query: 258 VSVEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQKERGE-----XXXXXXXXXXXSNG 94 + EKES + V K G Sbjct: 111 LFRTEKESLLVESDDDDDDDDTEIEDQDVEGGRTVAKNNDSGKSPPVIGGESSKTDGDEG 170 Query: 93 GDVDKKADEFIAKFREQIRLQRIDSIKRS 7 DVDKKADEFIAKFREQIRLQRI+SIKRS Sbjct: 171 PDVDKKADEFIAKFREQIRLQRIESIKRS 199 >KHN44924.1 hypothetical protein glysoja_036135 [Glycine soja] Length = 203 Score = 70.5 bits (171), Expect = 5e-12 Identities = 63/162 (38%), Positives = 79/162 (48%), Gaps = 17/162 (10%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPRPSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG----H 292 +FQKS +K S N+ SF+KELKRSFTSER ++ + +S G Sbjct: 27 MFQKSV----FMKPRFDGSSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSMQGTLFRS 82 Query: 291 NKLMGRASAPLVS-VEEKESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXX 121 NK MG AS L+S EKES + V +RG+ Sbjct: 83 NKFMGHASVRLMSQPAEKESLLVESDDDDDDDDTETKDQDIEGGRTVAQNNDRGKGPPVI 142 Query: 120 XXXXXXSNGG---DVDKKADEFIAKFREQIRLQRIDSIKRSA 4 ++G DVDKKA+EFIAKF EQ RLQRI+SIKRSA Sbjct: 143 GGESSKTDGDEGPDVDKKANEFIAKFIEQPRLQRIESIKRSA 184 >BAD93737.1 glycoprotein homolog, partial [Arabidopsis thaliana] Length = 59 Score = 60.1 bits (144), Expect = 2e-09 Identities = 29/31 (93%), Positives = 30/31 (96%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7 NG DVDKKADEFIAKFREQIRLQRI+SIKRS Sbjct: 19 NGSDVDKKADEFIAKFREQIRLQRIESIKRS 49 >XP_012070042.1 PREDICTED: uncharacterized protein LOC105632305 [Jatropha curcas] Length = 453 Score = 63.9 bits (154), Expect = 5e-09 Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 11/131 (8%) Frame = -2 Query: 360 DKELKRSFTSE----RNYL-------KQRSSTGHNKLMGRASAPLVSVEEKESFFXXXXX 214 +K+LKRSFT + R L K++ S M + S EEKE + Sbjct: 307 EKDLKRSFTGKFGEGRQMLFDEAPPKKEKQSRDRVTFMAQPSFKEFPKEEKEEYVEKIVM 366 Query: 213 XXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRL 34 E+ ++ G G DVDKKADEFIAKFREQIRL Sbjct: 367 ESEDDDMETEYEDEEEEEIAGRDFGLTNSKKNEQVGSDGGPDVDKKADEFIAKFREQIRL 426 Query: 33 QRIDSIKRSAG 1 QRI+SIKRS+G Sbjct: 427 
QRIESIKRSSG 437 >KDP39910.1 hypothetical protein JCGZ_03441 [Jatropha curcas] Length = 483 Score = 63.9 bits (154), Expect = 5e-09 Identities = 48/131 (36%), Positives = 60/131 (45%), Gaps = 11/131 (8%) Frame = -2 Query: 360 DKELKRSFTSE----RNYL-------KQRSSTGHNKLMGRASAPLVSVEEKESFFXXXXX 214 +K+LKRSFT + R L K++ S M + S EEKE + Sbjct: 337 EKDLKRSFTGKFGEGRQMLFDEAPPKKEKQSRDRVTFMAQPSFKEFPKEEKEEYVEKIVM 396 Query: 213 XXXXXXXXXXXXXXXXEKFVQKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRL 34 E+ ++ G G DVDKKADEFIAKFREQIRL Sbjct: 397 ESEDDDMETEYEDEEEEEIAGRDFGLTNSKKNEQVGSDGGPDVDKKADEFIAKFREQIRL 456 Query: 33 QRIDSIKRSAG 1 QRI+SIKRS+G Sbjct: 457 QRIESIKRSSG 467 >KRH25518.1 hypothetical protein GLYMA_12G109100 [Glycine max] Length = 347 Score = 63.5 bits (153), Expect = 6e-09 Identities = 59/153 (38%), Positives = 70/153 (45%), Gaps = 18/153 (11%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPR-PSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG---- 295 +FQKS M PR S NK SF+KELKR FTSER ++ + +S G Sbjct: 200 MFQKS-----MFMKPRFDGSSNKAPSFNKELKRCFTSERTTPVGKKSHEENKSMQGTLFR 254 Query: 294 HNKLMGRASAPLVSVE-EKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQK--ERGEXXXX 124 NK MG AS PLVS EKES + V + +RG+ Sbjct: 255 SNKFMGHASVPLVSQPVEKESLLVESDDDDDDNDTETEDQDVQGGRTVAQNNDRGKGPPV 314 Query: 123 XXXXXXXSNGG---DVDKKADEFIAKFREQIRL 34 NG DVDKKADEFI KF EQ+RL Sbjct: 315 IGGESSKINGDEGPDVDKKADEFIDKFIEQVRL 347 >XP_006593062.1 PREDICTED: uncharacterized protein LOC102667948 [Glycine max] Length = 372 Score = 63.5 bits (153), Expect = 6e-09 Identities = 59/153 (38%), Positives = 70/153 (45%), Gaps = 18/153 (11%) Frame = -2 Query: 438 IFQKSSTSNSMVKVPR-PSSFNKGDSFDKELKRSFTSER-------NYLKQRSSTG---- 295 +FQKS M PR S NK SF+KELKR FTSER ++ + +S G Sbjct: 225 MFQKS-----MFMKPRFDGSSNKAPSFNKELKRCFTSERTTPVGKKSHEENKSMQGTLFR 279 Query: 294 HNKLMGRASAPLVSVE-EKESFFXXXXXXXXXXXXXXXXXXXXXEKFVQK--ERGEXXXX 124 NK MG AS PLVS EKES + V + +RG+ Sbjct: 280 SNKFMGHASVPLVSQPVEKESLLVESDDDDDDNDTETEDQDVQGGRTVAQNNDRGKGPPV 339 Query: 123 XXXXXXXSNGG---DVDKKADEFIAKFREQIRL 34 NG DVDKKADEFI KF 
EQ+RL Sbjct: 340 IGGESSKINGDEGPDVDKKADEFIDKFIEQVRL 372 >CDX76590.1 BnaA08g08100D [Brassica napus] Length = 89 Score = 57.8 bits (138), Expect = 3e-08 Identities = 29/32 (90%), Positives = 30/32 (93%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRSA 4 N DVDKKADEFIAKFREQIRLQRI+SIKRSA Sbjct: 49 NISDVDKKADEFIAKFREQIRLQRIESIKRSA 80 >XP_002868098.1 hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] EFH44357.1 hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] Length = 461 Score = 61.6 bits (148), Expect = 3e-08 Identities = 30/32 (93%), Positives = 31/32 (96%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRSA 4 NG DVDKKADEFIAKFREQIRLQRI+SIKRSA Sbjct: 421 NGSDVDKKADEFIAKFREQIRLQRIESIKRSA 452 >XP_014631510.1 PREDICTED: uncharacterized protein LOC102667348 [Glycine max] Length = 246 Score = 60.5 bits (145), Expect = 4e-08 Identities = 55/143 (38%), Positives = 64/143 (44%), Gaps = 17/143 (11%) Frame = -2 Query: 384 SFNKGDSFDKELKRSFTSER-----------NYLKQRSSTGHNKLMGRASAPLVS-VEEK 241 S N+ SF+KELKRSFTSER N KQ + +NK MG AS PLVS EK Sbjct: 103 SSNEAPSFNKELKRSFTSERTTPVGKKSHEENKSKQGTLFRNNKFMGHASVPLVSQPAEK 162 Query: 240 ESFFXXXXXXXXXXXXXXXXXXXXXEKFV--QKERGEXXXXXXXXXXXSN---GGDVDKK 76 ES + V +RG+ + G DVDKK Sbjct: 163 ESLLVESNDDDDDNDTETEDQDVEGGRTVAQNNDRGKGPPVIGGESSKIDGHEGPDVDKK 222 Query: 75 ADEFIAKFREQIRLQRIDSIKRS 7 ADEFIAK RI+SIKRS Sbjct: 223 ADEFIAK--------RIESIKRS 237 >XP_016900571.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis melo] Length = 566 Score = 60.8 bits (146), Expect = 6e-08 Identities = 32/51 (62%), Positives = 35/51 (68%) Frame = -2 Query: 153 QKERGEXXXXXXXXXXXSNGGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1 +KE E G DVDKKADEFIAKFREQIRLQRI+SIKRS+G Sbjct: 506 EKEEEEEEAGSASNIGNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSSG 556 >XP_006282590.1 hypothetical protein CARUB_v10004732mg [Capsella rubella] EOA15488.1 hypothetical protein CARUB_v10004732mg [Capsella rubella] Length = 471 Score = 60.5 
bits (145), Expect = 8e-08 Identities = 30/33 (90%), Positives = 31/33 (93%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1 N DVDKKADEFIAKFREQIRLQRI+SIKRSAG Sbjct: 431 NVSDVDKKADEFIAKFREQIRLQRIESIKRSAG 463 >XP_017640106.1 PREDICTED: uncharacterized protein LOC108481491 [Gossypium arboreum] Length = 221 Score = 59.3 bits (142), Expect = 9e-08 Identities = 29/32 (90%), Positives = 30/32 (93%) Frame = -2 Query: 96 GGDVDKKADEFIAKFREQIRLQRIDSIKRSAG 1 G DVDKKADEFIAK REQIRLQRIDSIKRS+G Sbjct: 182 GSDVDKKADEFIAKVREQIRLQRIDSIKRSSG 213 >OAO97873.1 hypothetical protein AXX17_AT4G19780 [Arabidopsis thaliana] Length = 468 Score = 60.1 bits (144), Expect = 1e-07 Identities = 29/31 (93%), Positives = 30/31 (96%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7 NG DVDKKADEFIAKFREQIRLQRI+SIKRS Sbjct: 428 NGSDVDKKADEFIAKFREQIRLQRIESIKRS 458 >CAB88077.1 hypothetical protein, partial [Arabidopsis thaliana] Length = 471 Score = 60.1 bits (144), Expect = 1e-07 Identities = 29/31 (93%), Positives = 30/31 (96%) Frame = -2 Query: 99 NGGDVDKKADEFIAKFREQIRLQRIDSIKRS 7 NG DVDKKADEFIAKFREQIRLQRI+SIKRS Sbjct: 431 NGSDVDKKADEFIAKFREQIRLQRIESIKRS 461