BLASTX nr result

ID: Bupleurum21_contig00014007 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00014007
         (1308 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q42656.1|AGAL_COFAR RecName: Full=Alpha-galactosidase; AltNam...   632   e-179
emb|CAI47559.1| alpha galactosidase [Coffea arabica]                  628   e-177
ref|XP_002331416.1| predicted protein [Populus trichocarpa] gi|2...   627   e-177
gb|AEB98600.1| alpha-galactosidase [Nicotiana tabacum]                626   e-177
gb|AEB98601.1| alpha-galactosidase [Nicotiana tabacum]                626   e-177

>sp|Q42656.1|AGAL_COFAR RecName: Full=Alpha-galactosidase; AltName: Full=Alpha-D-galactoside
            galactohydrolase; AltName: Full=Melibiase; Flags:
            Precursor gi|504489|gb|AAA33022.1| alpha-galactosidase
            [Coffea arabica]
          Length = 378

 Score =  632 bits (1630), Expect = e-179
 Identities = 295/372 (79%), Positives = 327/372 (87%)
 Frame = +1

Query: 16   SDQQIRASLLANGLGLTPQMGWNSWNHFQCNINEQMIRDTADAMVSTGLAGAGYKYINLD 195
            ++   R SLLANGLGLTP MGWNSWNHF+CN++E++IR+TADAMVS GLA  GYKYINLD
Sbjct: 7    TEDYTRRSLLANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLD 66

Query: 196  DCWAELNRDSQGKMVAKHSTFPSGIKALADYVHSKGLKLGIYSDAGVQTCSGRMPGSLGH 375
            DCWAELNRDSQG +V K STFPSGIKALADYVHSKGLKLGIYSDAG QTCS  MPGSLGH
Sbjct: 67   DCWAELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGH 126

Query: 376  EEQDAKTFASWGIDYLKYDNCYNKNITARQRYPIMSKALLNSGRPIFFSMCEWGQEDPAT 555
            EEQDAKTFASWG+DYLKYDNC N NI+ ++RYPIMSKALLNSGR IFFS+CEWG+EDPAT
Sbjct: 127  EEQDAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPAT 186

Query: 556  WAPKIGNSWRTTGDISDNWNSMTSLADQNDKWASYAGPGGWNDPDMLEVGNGGMTVDEYR 735
            WA ++GNSWRTTGDI D+W+SMTS AD NDKWASYAGPGGWNDPDMLEVGNGGMT  EYR
Sbjct: 187  WAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYR 246

Query: 736  SHFSIWALSKAPLLIGCDLRAMSNATFELLSNKEVIAVNQDKLGIQGKKVKSNGGLEVWA 915
            SHFSIWAL+KAPLLIGCD+R+M  ATF+LLSN EVIAVNQDKLG+QG KVK+ G LEVWA
Sbjct: 247  SHFSIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGDLEVWA 306

Query: 916  GKLYKNRIAVVLWNRGASEALITASWSDIGLKSSTVVNARDLWAHRTQKSVKGKIYAKVK 1095
            G L   R+AV LWNRG+S A ITA WSD+GL S+ VVNARDLWAH T+KSVKG+I A V 
Sbjct: 307  GPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAAVD 366

Query: 1096 SHDCKMYVLTPQ 1131
            +HD KMYVLTPQ
Sbjct: 367  AHDSKMYVLTPQ 378


>emb|CAI47559.1| alpha galactosidase [Coffea arabica]
          Length = 420

 Score =  628 bits (1619), Expect = e-177
 Identities = 293/372 (78%), Positives = 325/372 (87%)
 Frame = +1

Query: 16   SDQQIRASLLANGLGLTPQMGWNSWNHFQCNINEQMIRDTADAMVSTGLAGAGYKYINLD 195
            ++   R SLLANGLGLTP MGWNSWNHF CN++E++IR+TADAM S GLA  GYKYINLD
Sbjct: 49   TEDYTRRSLLANGLGLTPPMGWNSWNHFSCNLDEKLIRETADAMASKGLAALGYKYINLD 108

Query: 196  DCWAELNRDSQGKMVAKHSTFPSGIKALADYVHSKGLKLGIYSDAGVQTCSGRMPGSLGH 375
            DCWAELNRDSQG +V K STFPSGIKALADYVHSKGLKLGIYSDAG QTCS  MPGSLGH
Sbjct: 109  DCWAELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGH 168

Query: 376  EEQDAKTFASWGIDYLKYDNCYNKNITARQRYPIMSKALLNSGRPIFFSMCEWGQEDPAT 555
            EEQDAKTFASWG+DYLKYDNC + NI+ ++RYPIMSKALLNSGR IFFS+CEWG EDPAT
Sbjct: 169  EEQDAKTFASWGVDYLKYDNCNDNNISPKERYPIMSKALLNSGRSIFFSLCEWGDEDPAT 228

Query: 556  WAPKIGNSWRTTGDISDNWNSMTSLADQNDKWASYAGPGGWNDPDMLEVGNGGMTVDEYR 735
            WA ++GNSWRTTGDI D+W+SMTS AD NDKWASYAGPGGWNDPDMLEVGNGGMT  EYR
Sbjct: 229  WAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYR 288

Query: 736  SHFSIWALSKAPLLIGCDLRAMSNATFELLSNKEVIAVNQDKLGIQGKKVKSNGGLEVWA 915
            SHFSIWAL+KAPLLIGCD+R++  ATF+LLSN EVIAVNQDKLG+QGKKVK+ G LEVWA
Sbjct: 289  SHFSIWALAKAPLLIGCDIRSIDGATFQLLSNAEVIAVNQDKLGVQGKKVKTYGDLEVWA 348

Query: 916  GKLYKNRIAVVLWNRGASEALITASWSDIGLKSSTVVNARDLWAHRTQKSVKGKIYAKVK 1095
            G L   R+AV LWNRG+S A ITA WSD+GL S+ VVNARDLWAH T+KSVKG+I A V 
Sbjct: 349  GPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAAVD 408

Query: 1096 SHDCKMYVLTPQ 1131
            +HD KMYVLTPQ
Sbjct: 409  AHDSKMYVLTPQ 420


>ref|XP_002331416.1| predicted protein [Populus trichocarpa] gi|222873630|gb|EEF10761.1|
            predicted protein [Populus trichocarpa]
          Length = 367

 Score =  627 bits (1618), Expect = e-177
 Identities = 289/364 (79%), Positives = 321/364 (88%)
 Frame = +1

Query: 40   LLANGLGLTPQMGWNSWNHFQCNINEQMIRDTADAMVSTGLAGAGYKYINLDDCWAELNR 219
            L ANGLGL P MGWNSWNHF CNI E++IRDTADAMVS+GLA  GY+++NLDDCWAELNR
Sbjct: 4    LSANGLGLAPPMGWNSWNHFHCNIEEKLIRDTADAMVSSGLAALGYEHVNLDDCWAELNR 63

Query: 220  DSQGKMVAKHSTFPSGIKALADYVHSKGLKLGIYSDAGVQTCSGRMPGSLGHEEQDAKTF 399
            DS+G +V K STFPSGIKALADY+H KGLKLGIYSDAG QTCSG MPGSLGHEEQDAKTF
Sbjct: 64   DSEGNLVPKASTFPSGIKALADYIHGKGLKLGIYSDAGSQTCSGTMPGSLGHEEQDAKTF 123

Query: 400  ASWGIDYLKYDNCYNKNITARQRYPIMSKALLNSGRPIFFSMCEWGQEDPATWAPKIGNS 579
            ASWG+DYLKYDNC N   + ++RYP+MSKALLNSGRPIFFS+CEWGQEDPATWA  +GNS
Sbjct: 124  ASWGVDYLKYDNCNNDGTSPKERYPVMSKALLNSGRPIFFSLCEWGQEDPATWASNVGNS 183

Query: 580  WRTTGDISDNWNSMTSLADQNDKWASYAGPGGWNDPDMLEVGNGGMTVDEYRSHFSIWAL 759
            WRTTGDISDNW+SMTS ADQND+WASYA PGGWNDPDMLEVGNGGMT +EYRSHFSIWAL
Sbjct: 184  WRTTGDISDNWDSMTSRADQNDQWASYAAPGGWNDPDMLEVGNGGMTTEEYRSHFSIWAL 243

Query: 760  SKAPLLIGCDLRAMSNATFELLSNKEVIAVNQDKLGIQGKKVKSNGGLEVWAGKLYKNRI 939
            +KAPLLIGCD+R MS+ T E+LSN+EVIAVNQDKLG+QGKKVK+NG LEVWAG L  N+I
Sbjct: 244  AKAPLLIGCDVRTMSDETIEILSNREVIAVNQDKLGVQGKKVKNNGDLEVWAGPLSNNKI 303

Query: 940  AVVLWNRGASEALITASWSDIGLKSSTVVNARDLWAHRTQKSVKGKIYAKVKSHDCKMYV 1119
            AVVLWNRG+S A +TA WSDIGL  +T VNARDLWAH  Q SVKG+I A + SH CKMYV
Sbjct: 304  AVVLWNRGSSRATVTAYWSDIGLDPTTTVNARDLWAHSNQPSVKGQISADLDSHACKMYV 363

Query: 1120 LTPQ 1131
            LTPQ
Sbjct: 364  LTPQ 367


>gb|AEB98600.1| alpha-galactosidase [Nicotiana tabacum]
          Length = 413

 Score =  626 bits (1615), Expect = e-177
 Identities = 292/372 (78%), Positives = 326/372 (87%)
 Frame = +1

Query: 16   SDQQIRASLLANGLGLTPQMGWNSWNHFQCNINEQMIRDTADAMVSTGLAGAGYKYINLD 195
            S+  IR SLL+NGLG TPQMGW+SWNHF CNI E+MIR+TADAMVSTGLA  GY+Y+N+D
Sbjct: 41   SNAYIRRSLLSNGLGRTPQMGWSSWNHFACNIEEKMIRETADAMVSTGLASLGYEYVNID 100

Query: 196  DCWAELNRDSQGKMVAKHSTFPSGIKALADYVHSKGLKLGIYSDAGVQTCSGRMPGSLGH 375
            DCWAELNRDSQG MV K STFPSGIKALADYVH KGLKLGIYSDAG QTCS +MPGSLGH
Sbjct: 101  DCWAELNRDSQGNMVPKSSTFPSGIKALADYVHGKGLKLGIYSDAGSQTCSKQMPGSLGH 160

Query: 376  EEQDAKTFASWGIDYLKYDNCYNKNITARQRYPIMSKALLNSGRPIFFSMCEWGQEDPAT 555
            EEQDAKTFASWG+DYLKYDNC N+N + R+RYPIMSKAL NSGR IF+S+CEWG +DPAT
Sbjct: 161  EEQDAKTFASWGVDYLKYDNCNNENRSPRERYPIMSKALQNSGRAIFYSLCEWGDDDPAT 220

Query: 556  WAPKIGNSWRTTGDISDNWNSMTSLADQNDKWASYAGPGGWNDPDMLEVGNGGMTVDEYR 735
            WA  +GNSWRTTGDISDNW+SMTS AD NDKWASYAGPGGWNDPDMLEVGNGGMT  EYR
Sbjct: 221  WASSVGNSWRTTGDISDNWDSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTAEYR 280

Query: 736  SHFSIWALSKAPLLIGCDLRAMSNATFELLSNKEVIAVNQDKLGIQGKKVKSNGGLEVWA 915
            SHFSIWAL+KAPL+IGCDLR+M     E+LSNKEVIAVNQDKLG+QGKKVK NG LEVWA
Sbjct: 281  SHFSIWALAKAPLIIGCDLRSMDQTAHEILSNKEVIAVNQDKLGVQGKKVKQNGDLEVWA 340

Query: 916  GKLYKNRIAVVLWNRGASEALITASWSDIGLKSSTVVNARDLWAHRTQKSVKGKIYAKVK 1095
            G L   R+A+VLWNR +S+A ITA WSDIGL SSTVV+ARDLWAH T+ SVKG++ A + 
Sbjct: 341  GPLSGKRLAMVLWNRSSSKADITAYWSDIGLDSSTVVDARDLWAHSTKGSVKGQLSASID 400

Query: 1096 SHDCKMYVLTPQ 1131
            SHDC+MYVLTP+
Sbjct: 401  SHDCRMYVLTPK 412


>gb|AEB98601.1| alpha-galactosidase [Nicotiana tabacum]
          Length = 413

 Score =  626 bits (1614), Expect = e-177
 Identities = 292/371 (78%), Positives = 325/371 (87%)
 Frame = +1

Query: 16   SDQQIRASLLANGLGLTPQMGWNSWNHFQCNINEQMIRDTADAMVSTGLAGAGYKYINLD 195
            S+  IR SLL+NGLG TPQMGW+SWNHF CNI E+MIR+TADAMVSTGLA  GY+Y+N+D
Sbjct: 41   SNAYIRRSLLSNGLGRTPQMGWSSWNHFACNIEEKMIRETADAMVSTGLASLGYEYVNID 100

Query: 196  DCWAELNRDSQGKMVAKHSTFPSGIKALADYVHSKGLKLGIYSDAGVQTCSGRMPGSLGH 375
            DCWAELNRDSQG MV K STFPSGIKALADYVH KGLKLGIYSDAG QTCS +MPGSLGH
Sbjct: 101  DCWAELNRDSQGNMVPKSSTFPSGIKALADYVHGKGLKLGIYSDAGSQTCSKQMPGSLGH 160

Query: 376  EEQDAKTFASWGIDYLKYDNCYNKNITARQRYPIMSKALLNSGRPIFFSMCEWGQEDPAT 555
            EEQDAKTFASWG+DYLKYDNC N+N + R+RYPIMSKAL NSGR IF+S+CEWG +DPAT
Sbjct: 161  EEQDAKTFASWGVDYLKYDNCNNENRSPRERYPIMSKALQNSGRAIFYSLCEWGDDDPAT 220

Query: 556  WAPKIGNSWRTTGDISDNWNSMTSLADQNDKWASYAGPGGWNDPDMLEVGNGGMTVDEYR 735
            WA  +GNSWRTTGDISDNW+SMTS AD NDKWASYAGPGGWNDPDMLEVGNGGMT  EYR
Sbjct: 221  WASSVGNSWRTTGDISDNWDSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTAEYR 280

Query: 736  SHFSIWALSKAPLLIGCDLRAMSNATFELLSNKEVIAVNQDKLGIQGKKVKSNGGLEVWA 915
            SHFSIWAL+KAPL+IGCDLR+M     E+LSNKEVIAVNQDKLG+QGKKVK NG LEVWA
Sbjct: 281  SHFSIWALAKAPLIIGCDLRSMDQTAHEILSNKEVIAVNQDKLGVQGKKVKQNGDLEVWA 340

Query: 916  GKLYKNRIAVVLWNRGASEALITASWSDIGLKSSTVVNARDLWAHRTQKSVKGKIYAKVK 1095
            G L   R+A+VLWNR +S+A ITA WSDIGL SSTVV+ARDLWAH T+ SVKG++ A + 
Sbjct: 341  GPLSGKRLAMVLWNRSSSKADITAYWSDIGLDSSTVVDARDLWAHSTKGSVKGQLSASID 400

Query: 1096 SHDCKMYVLTP 1128
            SHDC+MYVLTP
Sbjct: 401  SHDCRMYVLTP 411
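
Note on reading the coordinates above: Query positions are in nucleotides and Sbjct positions are in amino acids, so in frame +1 a query span of 16..1131 covers (1131 - 16 + 1) / 3 = 372 codons, which matches the 372-column alignment length reported in the Identities line. The short Python sketch below re-checks that arithmetic for the hits listed here. The function names are illustrative and are not part of BLAST. It assumes ungapped alignments (no Gaps field is reported for these hits) and assumes the percentages are truncated to whole numbers, which is consistent with every value printed in this report (for example 327/372 = 87.9% is shown as 87%).

```python
def aligned_codons(q_start: int, q_end: int) -> int:
    """Number of codons covered by a translated query span (nucleotide coords).

    Hypothetical helper: assumes an ungapped alignment, so the nucleotide
    span must be a whole multiple of three.
    """
    span = abs(q_end - q_start) + 1
    if span % 3 != 0:
        raise ValueError("query span is not a whole number of codons")
    return span // 3


def percent_truncated(matches: int, length: int) -> int:
    """Integer percentage, truncated (assumption inferred from this report)."""
    return (100 * matches) // length


# Values copied from the alignments in this report.
hits = [
    # (accession, q_start, q_end, identities, positives, alignment length)
    ("sp|Q42656.1", 16, 1131, 295, 327, 372),
    ("emb|CAI47559.1", 16, 1131, 293, 325, 372),
    ("ref|XP_002331416.1", 40, 1131, 289, 321, 364),
    ("gb|AEB98600.1", 16, 1131, 292, 326, 372),
    ("gb|AEB98601.1", 16, 1128, 292, 325, 371),
]

for acc, q_start, q_end, ident, pos, length in hits:
    codons = aligned_codons(q_start, q_end)
    print(acc, codons == length,
          percent_truncated(ident, length),
          percent_truncated(pos, length))
```

For alignments that contain gaps, the alignment length in the Identities line also counts gap columns, so the simple division by three would no longer be expected to match.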

