BLASTX nr result

ID: Glycyrrhiza23_contig00003544 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00003544
         (893 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003526811.1| PREDICTED: alpha-galactosidase-like isoform ...   526   e-147
ref|XP_003526810.1| PREDICTED: alpha-galactosidase-like isoform ...   511   e-143
ref|XP_002520852.1| alpha-galactosidase/alpha-n-acetylgalactosam...   484   e-135
ref|XP_002325481.1| predicted protein [Populus trichocarpa] gi|2...   470   e-130
ref|XP_002876361.1| hypothetical protein ARALYDRAFT_486071 [Arab...   467   e-129

>ref|XP_003526811.1| PREDICTED: alpha-galactosidase-like isoform 2 [Glycine max]
          Length = 407

 Score =  526 bits (1354), Expect = e-147
 Identities = 249/289 (86%), Positives = 261/289 (90%)
 Frame = +1

Query: 1   CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKKRYPPMRDALNTTGRKIFYSI 180
           CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPK+RYPPMRDALN TG+KIFYS+
Sbjct: 135 CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNATGQKIFYSL 194

Query: 181 CEWGVEDPALWAGKVGNSWRTTEDINDTWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 360
           CEWGVEDPALWA KVGNSWRTT DIND+WASMTTIADLNDKWAAYAGPGGWNDPDMLEVG
Sbjct: 195 CEWGVEDPALWADKVGNSWRTTGDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 254

Query: 361 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNMTAETLEILSNKEVIAINQDSLGVQGRKV 540
           NGGMTYQEYRAHFSIWALAKAPLLIGCDVRN+TAETLEILSNKEVIAINQDSLGVQGRKV
Sbjct: 255 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNLTAETLEILSNKEVIAINQDSLGVQGRKV 314

Query: 541 QFAGTDGCGQVWAGPLSGNRLAVALWNRCSKVAXXXXXXXXXXXXXXXXXTITASWEALG 720
           Q +G DGC QVWAGPLSGNRLAVALWNRCSKVA                 TITASWEALG
Sbjct: 315 QVSGADGCRQVWAGPLSGNRLAVALWNRCSKVA-----------------TITASWEALG 357

Query: 721 LESGIHVSVRDLWQHKVINGDAVSSFSARIDSHDCKLYIFTPSTASYSL 867
           LESG+HVSVRDLWQHKV+ GDAVSSFSAR+D HDC+LYIF P T S+S+
Sbjct: 358 LESGVHVSVRDLWQHKVVTGDAVSSFSARVDIHDCQLYIFAPFTVSHSV 406


>ref|XP_003526810.1| PREDICTED: alpha-galactosidase-like isoform 1 [Glycine max]
          Length = 418

 Score =  511 bits (1316), Expect = e-143
 Identities = 242/277 (87%), Positives = 252/277 (90%)
 Frame = +1

Query: 1   CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKKRYPPMRDALNTTGRKIFYSI 180
           CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPK+RYPPMRDALN TG+KIFYS+
Sbjct: 159 CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKERYPPMRDALNATGQKIFYSL 218

Query: 181 CEWGVEDPALWAGKVGNSWRTTEDINDTWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 360
           CEWGVEDPALWA KVGNSWRTT DIND+WASMTTIADLNDKWAAYAGPGGWNDPDMLEVG
Sbjct: 219 CEWGVEDPALWADKVGNSWRTTGDINDSWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 278

Query: 361 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNMTAETLEILSNKEVIAINQDSLGVQGRKV 540
           NGGMTYQEYRAHFSIWALAKAPLLIGCDVRN+TAETLEILSNKEVIAINQDSLGVQGRKV
Sbjct: 279 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNLTAETLEILSNKEVIAINQDSLGVQGRKV 338

Query: 541 QFAGTDGCGQVWAGPLSGNRLAVALWNRCSKVAXXXXXXXXXXXXXXXXXTITASWEALG 720
           Q +G DGC QVWAGPLSGNRLAVALWNRCSKVA                 TITASWEALG
Sbjct: 339 QVSGADGCRQVWAGPLSGNRLAVALWNRCSKVA-----------------TITASWEALG 381

Query: 721 LESGIHVSVRDLWQHKVINGDAVSSFSARIDSHDCKL 831
           LESG+HVSVRDLWQHKV+ GDAVSSFSAR+D HDC+L
Sbjct: 382 LESGVHVSVRDLWQHKVVTGDAVSSFSARVDIHDCQL 418


>ref|XP_002520852.1| alpha-galactosidase/alpha-n-acetylgalactosaminidase, putative
           [Ricinus communis] gi|223539983|gb|EEF41561.1|
           alpha-galactosidase/alpha-n-acetylgalactosaminidase,
           putative [Ricinus communis]
          Length = 360

 Score =  484 bits (1247), Expect = e-135
 Identities = 225/284 (79%), Positives = 245/284 (86%)
 Frame = +1

Query: 1   CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKKRYPPMRDALNTTGRKIFYSI 180
           CQVRPGS+ HE DDADLFASWGVDYLKYDNC+NLGI PK+RYPPMRDALN +GR IFYS+
Sbjct: 88  CQVRPGSLHHEEDDADLFASWGVDYLKYDNCFNLGIKPKERYPPMRDALNASGRTIFYSL 147

Query: 181 CEWGVEDPALWAGKVGNSWRTTEDINDTWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 360
           CEWGV+DPALWAGKVGNSWRTT+DIND+W SMTTIADLNDKWAAYAGPGGWNDPDMLEVG
Sbjct: 148 CEWGVDDPALWAGKVGNSWRTTDDINDSWVSMTTIADLNDKWAAYAGPGGWNDPDMLEVG 207

Query: 361 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNMTAETLEILSNKEVIAINQDSLGVQGRKV 540
           NGGMTYQEYRAHFSIWAL KAPLLIGCDVRNMTAET EIL+NKEVIA+NQDSLGVQGRKV
Sbjct: 208 NGGMTYQEYRAHFSIWALMKAPLLIGCDVRNMTAETYEILTNKEVIAVNQDSLGVQGRKV 267

Query: 541 QFAGTDGCGQVWAGPLSGNRLAVALWNRCSKVAXXXXXXXXXXXXXXXXXTITASWEALG 720
           Q +GTDGC QVWAGPLSG+R+AV LWNRCSK A                 TITA W+ALG
Sbjct: 268 QASGTDGCLQVWAGPLSGHRMAVVLWNRCSKAA-----------------TITARWDALG 310

Query: 721 LESGIHVSVRDLWQHKVINGDAVSSFSARIDSHDCKLYIFTPST 852
           LESG  V+VRDLWQHK I GD+V+SF  R+D+HDC +Y FTP T
Sbjct: 311 LESGTSVAVRDLWQHKDITGDSVASFGTRVDAHDCAMYTFTPKT 354


>ref|XP_002325481.1| predicted protein [Populus trichocarpa] gi|222862356|gb|EEE99862.1|
           predicted protein [Populus trichocarpa]
          Length = 380

 Score =  470 bits (1210), Expect = e-130
 Identities = 218/282 (77%), Positives = 241/282 (85%)
 Frame = +1

Query: 1   CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKKRYPPMRDALNTTGRKIFYSI 180
           CQVRPGS+ HE DDA+LFASWGVDYLKYDNC+NLGI PK+RYPPMRDALN+TGR +FYS+
Sbjct: 116 CQVRPGSLLHEKDDAELFASWGVDYLKYDNCFNLGINPKERYPPMRDALNSTGRTVFYSL 175

Query: 181 CEWGVEDPALWAGKVGNSWRTTEDINDTWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 360
           CEWGV+DPALWAGKVGNSWRTT+DIND+WASMTT ADLNDKWA+YAGPGGWNDPDMLEVG
Sbjct: 176 CEWGVDDPALWAGKVGNSWRTTDDINDSWASMTTTADLNDKWASYAGPGGWNDPDMLEVG 235

Query: 361 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNMTAETLEILSNKEVIAINQDSLGVQGRKV 540
           NGGMTY EYRAHFSIWAL KAPLLIGCDVRNMTAET+EIL+NKE+IA+NQD LG+QGRKV
Sbjct: 236 NGGMTYHEYRAHFSIWALMKAPLLIGCDVRNMTAETIEILTNKEIIAVNQDPLGIQGRKV 295

Query: 541 QFAGTDGCGQVWAGPLSGNRLAVALWNRCSKVAXXXXXXXXXXXXXXXXXTITASWEALG 720
              GTDGC QVWAGPLSG+R+ VALWNRCSK A                 TITA W ALG
Sbjct: 296 YSTGTDGCLQVWAGPLSGHRIVVALWNRCSKAA-----------------TITAGWGALG 338

Query: 721 LESGIHVSVRDLWQHKVINGDAVSSFSARIDSHDCKLYIFTP 846
           LES   VSVRDLWQ K I GDAV+SF AR+D+HDC ++IFTP
Sbjct: 339 LESSTSVSVRDLWQGKDIVGDAVASFGARVDAHDCLIFIFTP 380


>ref|XP_002876361.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
           lyrata] gi|297322199|gb|EFH52620.1| hypothetical protein
           ARALYDRAFT_486071 [Arabidopsis lyrata subsp. lyrata]
          Length = 430

 Score =  467 bits (1201), Expect = e-129
 Identities = 215/288 (74%), Positives = 241/288 (83%)
 Frame = +1

Query: 1   CQVRPGSIFHETDDADLFASWGVDYLKYDNCYNLGIPPKKRYPPMRDALNTTGRKIFYSI 180
           CQVRPGS+FHE DDAD+FASWGVDYLKYDNC+NLGI P KRYPPMRDALN TGR IFYS+
Sbjct: 158 CQVRPGSLFHEVDDADIFASWGVDYLKYDNCFNLGIKPIKRYPPMRDALNATGRSIFYSL 217

Query: 181 CEWGVEDPALWAGKVGNSWRTTEDINDTWASMTTIADLNDKWAAYAGPGGWNDPDMLEVG 360
           CEWGV+DPALWA +VGNSWRTT+DINDTWASMTTIADLN+KWAAYAGPGGWNDPDMLE+G
Sbjct: 218 CEWGVDDPALWAKEVGNSWRTTDDINDTWASMTTIADLNNKWAAYAGPGGWNDPDMLEIG 277

Query: 361 NGGMTYQEYRAHFSIWALAKAPLLIGCDVRNMTAETLEILSNKEVIAINQDSLGVQGRKV 540
           NGGMTY+EYR HFSIWAL KAPLLIGCDVRNMTAET EILSNKEVIA+NQD LGVQGRK+
Sbjct: 278 NGGMTYEEYRGHFSIWALMKAPLLIGCDVRNMTAETFEILSNKEVIAVNQDPLGVQGRKI 337

Query: 541 QFAGTDGCGQVWAGPLSGNRLAVALWNRCSKVAXXXXXXXXXXXXXXXXXTITASWEALG 720
           Q  G D C QVW+GPLSG+R+ VALWNRCS+ A                 TITASW+ +G
Sbjct: 338 QANGEDDCQQVWSGPLSGDRIVVALWNRCSEQA-----------------TITASWDVIG 380

Query: 721 LESGIHVSVRDLWQHKVINGDAVSSFSARIDSHDCKLYIFTPSTASYS 864
           LES I VSVRDLWQHK +  +A  SF A++D+HDC +Y+ TP T S+S
Sbjct: 381 LESTISVSVRDLWQHKDVTENASGSFEAQVDAHDCHMYVLTPQTVSHS 428


Top