BLASTX nr result
ID: Cephaelis21_contig00003783
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00003783 (2249 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAI47559.1| alpha galactosidase [Coffea arabica] 672 0.0 sp|Q42656.1|AGAL_COFAR RecName: Full=Alpha-galactosidase; AltNam... 663 0.0 emb|CAI47560.1| alpha-galactosidase [Coffea canephora] 653 0.0 gb|AEB98600.1| alpha-galactosidase [Nicotiana tabacum] 645 0.0 gb|AEB98601.1| alpha-galactosidase [Nicotiana tabacum] 645 0.0 >emb|CAI47559.1| alpha galactosidase [Coffea arabica] Length = 420 Score = 672 bits (1734), Expect = 0.0 Identities = 336/434 (77%), Positives = 365/434 (84%), Gaps = 2/434 (0%) Frame = -3 Query: 1515 VAAANVYL-STKSHHQQLLLRRPXXXXXXXXXXXXLCCSCFM-FGSVNASGRQMMKSVVE 1342 +AAA YL S+K Q+L+LR CF+ +V AS R+M+KS Sbjct: 1 MAAAYYYLFSSKKATQKLVLRASLLMLL-----------CFLTVENVGASARRMVKSP-G 48 Query: 1341 THDAVHARRNLLNNGLGGTPPMGWNSWNHFHCSINEQLIRETADAMVSTGLAALGYQYIN 1162 T D + RR+LL NGLG TPPMGWNSWNHF C+++E+LIRETADAM S GLAALGY+YIN Sbjct: 49 TED--YTRRSLLANGLGLTPPMGWNSWNHFSCNLDEKLIRETADAMASKGLAALGYKYIN 106 Query: 1161 LDDCWGDYNRDSQGNLIAKASTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKIMPGSL 982 LDDCW + NRDSQGNL+ K STFPSGIKALADYVHSKGLKLGIYSDAGTQTCSK MPGSL Sbjct: 107 LDDCWAELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSL 166 Query: 981 GHEEQDAKTFASWGVDYLKYDNCNDNGISPKDRYPIMSKALLNSGRSIFFSLCEWGVEDP 802 GHEEQDAKTFASWGVDYLKYDNCNDN ISPK+RYPIMSKALLNSGRSIFFSLCEWG EDP Sbjct: 167 GHEEQDAKTFASWGVDYLKYDNCNDNNISPKERYPIMSKALLNSGRSIFFSLCEWGDEDP 226 Query: 801 ATWAKQLGNSWRTTGDIADNWASMTSRADENDKWANYASPGGWNDPDMLEVGNGGMTTGE 622 ATWAK++GNSWRTTGDI D+W+SMTSRAD NDKWA+YA PGGWNDPDMLEVGNGGMTT E Sbjct: 227 ATWAKEVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTE 286 Query: 621 YRSHFSIWALAKAPLLIGCDIRSMDSVTVQLLSNKEVIAVNQDKLGVQGKKRKKDGDLEV 442 YRSHFSIWALAKAPLLIGCDIRS+D T QLLSN EVIAVNQDKLGVQGKK K GDLEV Sbjct: 287 YRSHFSIWALAKAPLLIGCDIRSIDGATFQLLSNAEVIAVNQDKLGVQGKKVKTYGDLEV 346 Query: 441 WGGPLSGNRVAVVLWNRGSSKATITAYWSDLGLQSSTVVNARDLWAHSTQGAVKGQISAQ 262 W GPLSG RVAV LWNRGSS ATITAYWSD+GL S+ VVNARDLWAHST+ +VKGQISA Sbjct: 347 WAGPLSGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAA 406 Query: 261 VDPHDVKMYLLTPQ 220 VD HD KMY+LTPQ Sbjct: 407 VDAHDSKMYVLTPQ 420 >sp|Q42656.1|AGAL_COFAR RecName: Full=Alpha-galactosidase; AltName: Full=Alpha-D-galactoside galactohydrolase; AltName: Full=Melibiase; Flags: Precursor gi|504489|gb|AAA33022.1| alpha-galactosidase [Coffea arabica] Length = 378 Score = 663 bits (1710), Expect = 0.0 Identities = 314/369 (85%), Positives = 335/369 (90%) Frame = -3 Query: 1326 HARRNLLNNGLGGTPPMGWNSWNHFHCSINEQLIRETADAMVSTGLAALGYQYINLDDCW 1147 + RR+LL NGLG TPPMGWNSWNHF C+++E+LIRETADAMVS GLAALGY+YINLDDCW Sbjct: 10 YTRRSLLANGLGLTPPMGWNSWNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCW 69 Query: 1146 GDYNRDSQGNLIAKASTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKIMPGSLGHEEQ 967 + NRDSQGNL+ K STFPSGIKALADYVHSKGLKLGIYSDAGTQTCSK MPGSLGHEEQ Sbjct: 70 AELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGHEEQ 129 Query: 966 DAKTFASWGVDYLKYDNCNDNGISPKDRYPIMSKALLNSGRSIFFSLCEWGVEDPATWAK 787 DAKTFASWGVDYLKYDNCN+N ISPK+RYPIMSKALLNSGRSIFFSLCEWG EDPATWAK Sbjct: 130 DAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPATWAK 189 Query: 786 QLGNSWRTTGDIADNWASMTSRADENDKWANYASPGGWNDPDMLEVGNGGMTTGEYRSHF 607 ++GNSWRTTGDI D+W+SMTSRAD NDKWA+YA PGGWNDPDMLEVGNGGMTT EYRSHF Sbjct: 190 EVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYRSHF 249 Query: 606 SIWALAKAPLLIGCDIRSMDSVTVQLLSNKEVIAVNQDKLGVQGKKRKKDGDLEVWGGPL 427 SIWALAKAPLLIGCDIRSMD T QLLSN EVIAVNQDKLGVQG K K GDLEVW GPL Sbjct: 250 SIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGDLEVWAGPL 309 Query: 426 SGNRVAVVLWNRGSSKATITAYWSDLGLQSSTVVNARDLWAHSTQGAVKGQISAQVDPHD 247 SG RVAV LWNRGSS ATITAYWSD+GL S+ VVNARDLWAHST+ +VKGQISA VD HD Sbjct: 310 SGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAAVDAHD 369 Query: 246 VKMYLLTPQ 220 KMY+LTPQ Sbjct: 370 SKMYVLTPQ 378 >emb|CAI47560.1| alpha-galactosidase [Coffea canephora] Length = 378 Score = 653 bits (1685), Expect = 0.0 Identities = 311/369 (84%), Positives = 333/369 (90%) Frame = -3 Query: 1326 HARRNLLNNGLGGTPPMGWNSWNHFHCSINEQLIRETADAMVSTGLAALGYQYINLDDCW 1147 + RR+LL NGLG TPPMGWNS NHF C+++E+LIRETADAMVS GLAALGY+YINLDDCW Sbjct: 10 YTRRSLLANGLGLTPPMGWNSRNHFRCNLDEKLIRETADAMVSKGLAALGYKYINLDDCW 69 Query: 1146 GDYNRDSQGNLIAKASTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKIMPGSLGHEEQ 967 + NRDSQGNL+ K STFPSGIKALADYVHSKGLKLGIYSDAGTQTCSK MPGSLG+EEQ Sbjct: 70 AELNRDSQGNLVPKGSTFPSGIKALADYVHSKGLKLGIYSDAGTQTCSKTMPGSLGNEEQ 129 Query: 966 DAKTFASWGVDYLKYDNCNDNGISPKDRYPIMSKALLNSGRSIFFSLCEWGVEDPATWAK 787 DAKTFASWGVDYLKYDNCN+N ISPK+RYPIMSKALLNSGRSIFFSLCEWG EDPATWAK Sbjct: 130 DAKTFASWGVDYLKYDNCNNNNISPKERYPIMSKALLNSGRSIFFSLCEWGEEDPATWAK 189 Query: 786 QLGNSWRTTGDIADNWASMTSRADENDKWANYASPGGWNDPDMLEVGNGGMTTGEYRSHF 607 ++GNSWRTTGDI D+W+SMTSRAD NDKWA+YA PGGWNDPDMLEVGNGGMTT EYRSHF Sbjct: 190 EVGNSWRTTGDIDDSWSSMTSRADMNDKWASYAGPGGWNDPDMLEVGNGGMTTTEYRSHF 249 Query: 606 SIWALAKAPLLIGCDIRSMDSVTVQLLSNKEVIAVNQDKLGVQGKKRKKDGDLEVWGGPL 427 SIWALAKAPLLIGCDIRSMD T QLLSN EVIAVNQDKLGVQG K K GDLEVW GPL Sbjct: 250 SIWALAKAPLLIGCDIRSMDGATFQLLSNAEVIAVNQDKLGVQGNKVKTYGDLEVWAGPL 309 Query: 426 SGNRVAVVLWNRGSSKATITAYWSDLGLQSSTVVNARDLWAHSTQGAVKGQISAQVDPHD 247 SG RVAV LWNRGSS ATITAYWSD+GL S+ VVNARDLWAHST+ +VKGQISA D HD Sbjct: 310 SGKRVAVALWNRGSSTATITAYWSDVGLPSTAVVNARDLWAHSTEKSVKGQISAAADAHD 369 Query: 246 VKMYLLTPQ 220 KMY+LTPQ Sbjct: 370 SKMYVLTPQ 378 >gb|AEB98600.1| alpha-galactosidase [Nicotiana tabacum] Length = 413 Score = 645 bits (1665), Expect = 0.0 Identities = 298/402 (74%), Positives = 345/402 (85%), Gaps = 5/402 (1%) Frame = -3 Query: 1410 CCSCFMFGSVNASGRQMMKSVV-----ETHDAVHARRNLLNNGLGGTPPMGWNSWNHFHC 1246 CC C R +++++ T + RR+LL+NGLG TP MGW+SWNHF C Sbjct: 11 CCLCLCGVITTTYARPQLRNLIIADSNSTTSNAYIRRSLLSNGLGRTPQMGWSSWNHFAC 70 Query: 1245 SINEQLIRETADAMVSTGLAALGYQYINLDDCWGDYNRDSQGNLIAKASTFPSGIKALAD 1066 +I E++IRETADAMVSTGLA+LGY+Y+N+DDCW + NRDSQGN++ K+STFPSGIKALAD Sbjct: 71 NIEEKMIRETADAMVSTGLASLGYEYVNIDDCWAELNRDSQGNMVPKSSTFPSGIKALAD 130 Query: 1065 YVHSKGLKLGIYSDAGTQTCSKIMPGSLGHEEQDAKTFASWGVDYLKYDNCNDNGISPKD 886 YVH KGLKLGIYSDAG+QTCSK MPGSLGHEEQDAKTFASWGVDYLKYDNCN+ SP++ Sbjct: 131 YVHGKGLKLGIYSDAGSQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCNNENRSPRE 190 Query: 885 RYPIMSKALLNSGRSIFFSLCEWGVEDPATWAKQLGNSWRTTGDIADNWASMTSRADEND 706 RYPIMSKAL NSGR+IF+SLCEWG +DPATWA +GNSWRTTGDI+DNW SMTSRAD ND Sbjct: 191 RYPIMSKALQNSGRAIFYSLCEWGDDDPATWASSVGNSWRTTGDISDNWDSMTSRADMND 250 Query: 705 KWANYASPGGWNDPDMLEVGNGGMTTGEYRSHFSIWALAKAPLLIGCDIRSMDSVTVQLL 526 KWA+YA PGGWNDPDMLEVGNGGMTT EYRSHFSIWALAKAPL+IGCD+RSMD ++L Sbjct: 251 KWASYAGPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLIIGCDLRSMDQTAHEIL 310 Query: 525 SNKEVIAVNQDKLGVQGKKRKKDGDLEVWGGPLSGNRVAVVLWNRGSSKATITAYWSDLG 346 SNKEVIAVNQDKLGVQGKK K++GDLEVW GPLSG R+A+VLWNR SSKA ITAYWSD+G Sbjct: 311 SNKEVIAVNQDKLGVQGKKVKQNGDLEVWAGPLSGKRLAMVLWNRSSSKADITAYWSDIG 370 Query: 345 LQSSTVVNARDLWAHSTQGAVKGQISAQVDPHDVKMYLLTPQ 220 L SSTVV+ARDLWAHST+G+VKGQ+SA +D HD +MY+LTP+ Sbjct: 371 LDSSTVVDARDLWAHSTKGSVKGQLSASIDSHDCRMYVLTPK 412 >gb|AEB98601.1| alpha-galactosidase [Nicotiana tabacum] Length = 413 Score = 645 bits (1664), Expect = 0.0 Identities = 298/401 (74%), Positives = 344/401 (85%), Gaps = 5/401 (1%) Frame = -3 Query: 1410 CCSCFMFGSVNASGRQMMKSVV-----ETHDAVHARRNLLNNGLGGTPPMGWNSWNHFHC 1246 CC C R +++++ T + RR+LL+NGLG TP MGW+SWNHF C Sbjct: 11 CCLCLCGVITTTYARPQLRNLIIADSNSTTSNAYIRRSLLSNGLGRTPQMGWSSWNHFAC 70 Query: 1245 SINEQLIRETADAMVSTGLAALGYQYINLDDCWGDYNRDSQGNLIAKASTFPSGIKALAD 1066 +I E++IRETADAMVSTGLA+LGY+Y+N+DDCW + NRDSQGN++ K+STFPSGIKALAD Sbjct: 71 NIEEKMIRETADAMVSTGLASLGYEYVNIDDCWAELNRDSQGNMVPKSSTFPSGIKALAD 130 Query: 1065 YVHSKGLKLGIYSDAGTQTCSKIMPGSLGHEEQDAKTFASWGVDYLKYDNCNDNGISPKD 886 YVH KGLKLGIYSDAG+QTCSK MPGSLGHEEQDAKTFASWGVDYLKYDNCN+ SP++ Sbjct: 131 YVHGKGLKLGIYSDAGSQTCSKQMPGSLGHEEQDAKTFASWGVDYLKYDNCNNENRSPRE 190 Query: 885 RYPIMSKALLNSGRSIFFSLCEWGVEDPATWAKQLGNSWRTTGDIADNWASMTSRADEND 706 RYPIMSKAL NSGR+IF+SLCEWG +DPATWA +GNSWRTTGDI+DNW SMTSRAD ND Sbjct: 191 RYPIMSKALQNSGRAIFYSLCEWGDDDPATWASSVGNSWRTTGDISDNWDSMTSRADMND 250 Query: 705 KWANYASPGGWNDPDMLEVGNGGMTTGEYRSHFSIWALAKAPLLIGCDIRSMDSVTVQLL 526 KWA+YA PGGWNDPDMLEVGNGGMTT EYRSHFSIWALAKAPL+IGCD+RSMD ++L Sbjct: 251 KWASYAGPGGWNDPDMLEVGNGGMTTAEYRSHFSIWALAKAPLIIGCDLRSMDQTAHEIL 310 Query: 525 SNKEVIAVNQDKLGVQGKKRKKDGDLEVWGGPLSGNRVAVVLWNRGSSKATITAYWSDLG 346 SNKEVIAVNQDKLGVQGKK K++GDLEVW GPLSG R+A+VLWNR SSKA ITAYWSD+G Sbjct: 311 SNKEVIAVNQDKLGVQGKKVKQNGDLEVWAGPLSGKRLAMVLWNRSSSKADITAYWSDIG 370 Query: 345 LQSSTVVNARDLWAHSTQGAVKGQISAQVDPHDVKMYLLTP 223 L SSTVV+ARDLWAHST+G+VKGQ+SA +D HD +MY+LTP Sbjct: 371 LDSSTVVDARDLWAHSTKGSVKGQLSASIDSHDCRMYVLTP 411