BLASTX nr result

ID: Atropa21_contig00040925 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00040925
         (736 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006357149.1| PREDICTED: uncharacterized protein LOC102589...   411   e-112
ref|XP_004233331.1| PREDICTED: uncharacterized protein LOC101258...   410   e-112
ref|XP_002300037.1| hypothetical protein POPTR_0001s34970g [Popu...   370   e-100
ref|XP_006470330.1| PREDICTED: uncharacterized protein LOC102617...   364   1e-98
gb|EOY02396.1| Beta-galactosidase 9 isoform 2 [Theobroma cacao] ...   362   5e-98
gb|EOY02395.1| Beta-galactosidase 9 isoform 1 [Theobroma cacao]       362   5e-98
ref|XP_002278059.1| PREDICTED: uncharacterized protein LOC100243...   362   5e-98
ref|XP_002524639.1| conserved hypothetical protein [Ricinus comm...   362   9e-98
ref|XP_006446492.1| hypothetical protein CICLE_v10015353mg [Citr...   360   3e-97
gb|EMJ16614.1| hypothetical protein PRUPE_ppa006026mg [Prunus pe...   360   3e-97
ref|XP_004142805.1| PREDICTED: uncharacterized protein LOC101205...   358   8e-97
gb|EOY02398.1| Beta-galactosidase 9 isoform 4 [Theobroma cacao]       358   1e-96
ref|XP_004489278.1| PREDICTED: uncharacterized protein LOC101500...   334   2e-89
ref|XP_004306345.1| PREDICTED: uncharacterized protein LOC101294...   330   4e-88
gb|ESW23035.1| hypothetical protein PHAVU_004G013700g [Phaseolus...   328   9e-88
gb|EPS70531.1| hypothetical protein M569_04229, partial [Genlise...   327   2e-87
ref|XP_003618290.1| hypothetical protein MTR_6g007040 [Medicago ...   323   4e-86
gb|ACZ74705.1| hypothetical protein [Phaseolus vulgaris]              318   9e-85
ref|XP_006395389.1| hypothetical protein EUTSA_v10004293mg [Eutr...   318   1e-84
ref|NP_001189992.1| uncharacterized protein [Arabidopsis thalian...   316   6e-84

>ref|XP_006357149.1| PREDICTED: uncharacterized protein LOC102589054 [Solanum tuberosum]
          Length = 426

 Score =  411 bits (1057), Expect = e-112
 Identities = 203/213 (95%), Positives = 206/213 (96%), Gaps = 2/213 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNLENWSCAIGYGVGSGSPLSPSFNFGLELAK SQFIASFYQHVVVQRRVKNPLEEDEVV
Sbjct: 214 KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKSSQFIASFYQHVVVQRRVKNPLEEDEVV 273

Query: 181 GITNYIDFGFELQTRVNDEKTPS-SIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAF 357
           GITNYIDFGFELQT VNDEK P+ SI DSTFQV ASWQANKNFLVKGKVGPLSSS+ALAF
Sbjct: 274 GITNYIDFGFELQTSVNDEKAPAISIHDSTFQVAASWQANKNFLVKGKVGPLSSSVALAF 333

Query: 358 KSWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 534
           KSWWKPSFTFNVSATRDR+KG TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI
Sbjct: 334 KSWWKPSFTFNVSATRDRVKGATAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 393

Query: 535 HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML
Sbjct: 394 HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 426


>ref|XP_004233331.1| PREDICTED: uncharacterized protein LOC101258818 [Solanum
           lycopersicum]
          Length = 426

 Score =  410 bits (1054), Expect = e-112
 Identities = 203/213 (95%), Positives = 205/213 (96%), Gaps = 2/213 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNLENWSCAIGYGVGSGSPLSPSFNFGLELAK SQFIASFYQHVVVQRRVKNPLEEDEVV
Sbjct: 214 KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKSSQFIASFYQHVVVQRRVKNPLEEDEVV 273

Query: 181 GITNYIDFGFELQTRVNDEKTP-SSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAF 357
           GITNYIDFGFELQT VNDEK P SSI DSTFQV ASWQANKNFLVKGKVGPLSSS+ALAF
Sbjct: 274 GITNYIDFGFELQTSVNDEKAPASSIHDSTFQVAASWQANKNFLVKGKVGPLSSSVALAF 333

Query: 358 KSWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 534
           KSWWKPSFTFNVSATRDR+KG TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI
Sbjct: 334 KSWWKPSFTFNVSATRDRVKGATAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 393

Query: 535 HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           HWKVGKRPLLQSDVNSGNFDGMPRELRP GKML
Sbjct: 394 HWKVGKRPLLQSDVNSGNFDGMPRELRPIGKML 426


>ref|XP_002300037.1| hypothetical protein POPTR_0001s34970g [Populus trichocarpa]
           gi|222847295|gb|EEE84842.1| hypothetical protein
           POPTR_0001s34970g [Populus trichocarpa]
          Length = 423

 Score =  370 bits (950), Expect = e-100
 Identities = 176/212 (83%), Positives = 197/212 (92%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWS A+GYGVGSGSPLSPSFNF LELAK SQFIASFYQHVVVQRRVKNPLEE+E+V
Sbjct: 212 KNLMNWSAAVGYGVGSGSPLSPSFNFSLELAKTSQFIASFYQHVVVQRRVKNPLEENEIV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQTRV+D KT ++I DSTFQ  ASWQANKNFL+KGKVGPLSS++A AFK
Sbjct: 272 GITNYIDFGFELQTRVDDPKTSNNIPDSTFQAAASWQANKNFLLKGKVGPLSSTLAFAFK 331

Query: 361 SWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTFN+SATRDRI G T++GFGIR++N+RE SYQRADPNFVMLTP+KEHLAEGI 
Sbjct: 332 SWWKPSFTFNISATRDRIIGKTSYGFGIRIENLREASYQRADPNFVMLTPSKEHLAEGII 391

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK+GKRP+LQSDVN+GNFDG+PRELRP GK+L
Sbjct: 392 WKIGKRPMLQSDVNAGNFDGLPRELRPLGKIL 423


>ref|XP_006470330.1| PREDICTED: uncharacterized protein LOC102617442 [Citrus sinensis]
          Length = 441

 Score =  364 bits (935), Expect = 1e-98
 Identities = 172/212 (81%), Positives = 197/212 (92%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWS AIGYGVGSGSPLSPSFNFGLELAK S FIASFYQHVVVQRRVKNPLEEDE+V
Sbjct: 212 KNLMNWSYAIGYGVGSGSPLSPSFNFGLELAKSSVFIASFYQHVVVQRRVKNPLEEDEIV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQTR++D KT +SI +S+FQV ASWQANKNFL+KGKVGPLSSS+A+AFK
Sbjct: 272 GITNYIDFGFELQTRIDDAKTANSIPESSFQVAASWQANKNFLLKGKVGPLSSSVAMAFK 331

Query: 361 SWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTF++SAT+DR+ G T++GFGIRV+N+RE SYQRADPNFVMLTP+KEHLAEG+ 
Sbjct: 332 SWWKPSFTFSISATKDRVVGKTSYGFGIRVENLREASYQRADPNFVMLTPSKEHLAEGMV 391

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK G+RP+LQSDVN+GNF+G+P+ELRP GK L
Sbjct: 392 WKTGRRPMLQSDVNAGNFEGLPKELRPMGKFL 423


>gb|EOY02396.1| Beta-galactosidase 9 isoform 2 [Theobroma cacao]
           gi|508710500|gb|EOY02397.1| Beta-galactosidase 9 isoform
           2 [Theobroma cacao]
          Length = 314

 Score =  362 bits (930), Expect = 5e-98
 Identities = 172/212 (81%), Positives = 197/212 (92%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWSCAIGYGVGSGSPLSPSFNFGLELA+ SQFIASFYQHVVVQRRVKNPLEE+EVV
Sbjct: 103 KNLLNWSCAIGYGVGSGSPLSPSFNFGLELARSSQFIASFYQHVVVQRRVKNPLEENEVV 162

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQTR++D KT ++I DSTFQV ASWQANKNFL+KGK+GPLSSS+ALA K
Sbjct: 163 GITNYIDFGFELQTRMDDTKTSNNIPDSTFQVAASWQANKNFLLKGKMGPLSSSLALALK 222

Query: 361 SWWKPSFTFNVSATRDRI-KGTAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTF++SATRD I + TA+GFG+RV+N+RE SY+RADPNFVMLTP KEHLAEGI 
Sbjct: 223 SWWKPSFTFSISATRDHISRTTAYGFGLRVENLREASYERADPNFVMLTPNKEHLAEGIV 282

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK+GKRP+LQSDVN+GNF+ +P+ELRP G++L
Sbjct: 283 WKIGKRPMLQSDVNAGNFESLPKELRPQGRIL 314


>gb|EOY02395.1| Beta-galactosidase 9 isoform 1 [Theobroma cacao]
          Length = 423

 Score =  362 bits (930), Expect = 5e-98
 Identities = 172/212 (81%), Positives = 197/212 (92%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWSCAIGYGVGSGSPLSPSFNFGLELA+ SQFIASFYQHVVVQRRVKNPLEE+EVV
Sbjct: 212 KNLLNWSCAIGYGVGSGSPLSPSFNFGLELARSSQFIASFYQHVVVQRRVKNPLEENEVV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQTR++D KT ++I DSTFQV ASWQANKNFL+KGK+GPLSSS+ALA K
Sbjct: 272 GITNYIDFGFELQTRMDDTKTSNNIPDSTFQVAASWQANKNFLLKGKMGPLSSSLALALK 331

Query: 361 SWWKPSFTFNVSATRDRI-KGTAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTF++SATRD I + TA+GFG+RV+N+RE SY+RADPNFVMLTP KEHLAEGI 
Sbjct: 332 SWWKPSFTFSISATRDHISRTTAYGFGLRVENLREASYERADPNFVMLTPNKEHLAEGIV 391

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK+GKRP+LQSDVN+GNF+ +P+ELRP G++L
Sbjct: 392 WKIGKRPMLQSDVNAGNFESLPKELRPQGRIL 423


>ref|XP_002278059.1| PREDICTED: uncharacterized protein LOC100243971 [Vitis vinifera]
           gi|296087664|emb|CBI34920.3| unnamed protein product
           [Vitis vinifera]
          Length = 423

 Score =  362 bits (930), Expect = 5e-98
 Identities = 174/212 (82%), Positives = 192/212 (90%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWSCAIGYGVG+GSPLSPSF FGLELAK SQFIASFYQHVVVQRRVKNPLEE+EVV
Sbjct: 212 KNLRNWSCAIGYGVGTGSPLSPSFIFGLELAKSSQFIASFYQHVVVQRRVKNPLEENEVV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQ+RV D +T + + DSTFQV ASWQANKNFL+KGK GPLSSSI LAFK
Sbjct: 272 GITNYIDFGFELQSRVEDVETSNGLPDSTFQVAASWQANKNFLLKGKAGPLSSSIVLAFK 331

Query: 361 SWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTF++SATRDR  G TA GFGI V+NIRE SY+RADPNFVMLTP+KEHLAEGIH
Sbjct: 332 SWWKPSFTFSISATRDRTVGKTALGFGIHVENIREASYERADPNFVMLTPSKEHLAEGIH 391

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           W+ GKRP+LQSD+NS NFDG+PRELRP+G +L
Sbjct: 392 WRSGKRPMLQSDLNSENFDGLPRELRPYGNIL 423


>ref|XP_002524639.1| conserved hypothetical protein [Ricinus communis]
           gi|223536000|gb|EEF37658.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 426

 Score =  362 bits (928), Expect = 9e-98
 Identities = 175/212 (82%), Positives = 195/212 (91%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWS AIGYGVGSGSPLSPSFNF LELAK SQFIASFYQHVVVQRRVKNPLEE+E+V
Sbjct: 215 KNLMNWSAAIGYGVGSGSPLSPSFNFCLELAKNSQFIASFYQHVVVQRRVKNPLEENEIV 274

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNY DFGFELQTRV+D +T ++I DSTFQV ASWQANKNFL+KGKVGPLSSS+ LAFK
Sbjct: 275 GITNYFDFGFELQTRVDDVETSNNIPDSTFQVAASWQANKNFLLKGKVGPLSSSLTLAFK 334

Query: 361 SWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTFNVSATRDRI G TA+GFGIRV+N+RE SYQRADPNF+MLTP+KEHLAEGI 
Sbjct: 335 SWWKPSFTFNVSATRDRIIGKTAYGFGIRVENLREASYQRADPNFLMLTPSKEHLAEGIL 394

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK GKRP+LQS++N+GNF+ +PRELRP GK+L
Sbjct: 395 WKSGKRPMLQSEINAGNFNDLPRELRPLGKIL 426


>ref|XP_006446492.1| hypothetical protein CICLE_v10015353mg [Citrus clementina]
           gi|557549103|gb|ESR59732.1| hypothetical protein
           CICLE_v10015353mg [Citrus clementina]
          Length = 423

 Score =  360 bits (923), Expect = 3e-97
 Identities = 169/212 (79%), Positives = 197/212 (92%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWS AIGYGVGSGSPLSPSFNFGLELA  S FIASFYQHVVVQRRVKNPLEEDE+V
Sbjct: 212 KNLMNWSYAIGYGVGSGSPLSPSFNFGLELANSSVFIASFYQHVVVQRRVKNPLEEDEIV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQTR++D KT ++I +S+FQV ASWQANKNFL+KGKVGPLSSS+A+AFK
Sbjct: 272 GITNYIDFGFELQTRIDDAKTANNIPESSFQVAASWQANKNFLLKGKVGPLSSSVAMAFK 331

Query: 361 SWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIH 537
           SWWKPSFTF++SAT+DR+ G T++GFGIRV+N+RE SYQRADPNFVMLTP+KEHLAEG+ 
Sbjct: 332 SWWKPSFTFSISATKDRVVGKTSYGFGIRVENLREASYQRADPNFVMLTPSKEHLAEGMV 391

Query: 538 WKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           WK G+RP+LQSDVN+G+F+G+P+ELRP GK+L
Sbjct: 392 WKTGRRPMLQSDVNAGHFEGLPKELRPLGKIL 423


>gb|EMJ16614.1| hypothetical protein PRUPE_ppa006026mg [Prunus persica]
          Length = 432

 Score =  360 bits (923), Expect = 3e-97
 Identities = 174/221 (78%), Positives = 196/221 (88%), Gaps = 10/221 (4%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           +NL+NWSCAIGYGVGS SPLSPSFNFGLEL+K SQFIASFYQHVVVQRRVKNPLEE+EVV
Sbjct: 212 RNLKNWSCAIGYGVGSSSPLSPSFNFGLELSKSSQFIASFYQHVVVQRRVKNPLEENEVV 271

Query: 181 GITNYIDFGFELQTRVNDEKTPS---------SIQDSTFQVVASWQANKNFLVKGKVGPL 333
           GITNYIDFGFELQTRV+D K+ +          + DSTFQV ASWQANKNFL+KGKVGPL
Sbjct: 272 GITNYIDFGFELQTRVDDTKSSNDVPESSYVPQVPDSTFQVAASWQANKNFLLKGKVGPL 331

Query: 334 SSSIALAFKSWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPT 510
           SSSIALAFKSWWKPSFTF++SATRD + G T +GFGIRV+N+R+ SYQRADPNFVMLTP 
Sbjct: 332 SSSIALAFKSWWKPSFTFSISATRDHVVGETGYGFGIRVENLRQASYQRADPNFVMLTPN 391

Query: 511 KEHLAEGIHWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           KEHLAEGI WKVGKRP+LQSD+ +GNFDG+P+ELRP GK+L
Sbjct: 392 KEHLAEGIVWKVGKRPMLQSDITAGNFDGIPKELRPLGKIL 432


>ref|XP_004142805.1| PREDICTED: uncharacterized protein LOC101205581 [Cucumis sativus]
           gi|449483757|ref|XP_004156681.1| PREDICTED:
           uncharacterized LOC101205581 [Cucumis sativus]
          Length = 425

 Score =  358 bits (920), Expect = 8e-97
 Identities = 171/214 (79%), Positives = 197/214 (92%), Gaps = 3/214 (1%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           KNL NWSCAIGY VGSGSPLSPSFNFGLELAK SQFIASFYQHVVVQRRVKNPLEE+E+V
Sbjct: 212 KNLMNWSCAIGYDVGSGSPLSPSFNFGLELAKNSQFIASFYQHVVVQRRVKNPLEENEIV 271

Query: 181 GITNYIDFGFELQT--RVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALA 354
           GITNYIDFGFELQ+  RV+D +  ++I DSTFQ+ ASWQANKNFL+KGKVGPLSSS+A+A
Sbjct: 272 GITNYIDFGFELQSQMRVDDVQAANNIPDSTFQIAASWQANKNFLLKGKVGPLSSSLAMA 331

Query: 355 FKSWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEG 531
           FKSWWKPSFTF++SA RDRI G T++GFGIRV+N+RE SYQRADPNFVMLTP+KEHLAEG
Sbjct: 332 FKSWWKPSFTFSISAVRDRIVGRTSYGFGIRVENLREASYQRADPNFVMLTPSKEHLAEG 391

Query: 532 IHWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           + WK+GKRP+LQSD+N+GNFDG+P+ELRP  K+L
Sbjct: 392 MVWKIGKRPMLQSDINAGNFDGIPKELRPLNKIL 425


>gb|EOY02398.1| Beta-galactosidase 9 isoform 4 [Theobroma cacao]
          Length = 361

 Score =  358 bits (918), Expect = 1e-96
 Identities = 172/213 (80%), Positives = 197/213 (92%), Gaps = 2/213 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQ-FIASFYQHVVVQRRVKNPLEEDEV 177
           KNL NWSCAIGYGVGSGSPLSPSFNFGLELA+ SQ FIASFYQHVVVQRRVKNPLEE+EV
Sbjct: 149 KNLLNWSCAIGYGVGSGSPLSPSFNFGLELARSSQQFIASFYQHVVVQRRVKNPLEENEV 208

Query: 178 VGITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAF 357
           VGITNYIDFGFELQTR++D KT ++I DSTFQV ASWQANKNFL+KGK+GPLSSS+ALA 
Sbjct: 209 VGITNYIDFGFELQTRMDDTKTSNNIPDSTFQVAASWQANKNFLLKGKMGPLSSSLALAL 268

Query: 358 KSWWKPSFTFNVSATRDRI-KGTAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 534
           KSWWKPSFTF++SATRD I + TA+GFG+RV+N+RE SY+RADPNFVMLTP KEHLAEGI
Sbjct: 269 KSWWKPSFTFSISATRDHISRTTAYGFGLRVENLREASYERADPNFVMLTPNKEHLAEGI 328

Query: 535 HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
            WK+GKRP+LQSDVN+GNF+ +P+ELRP G++L
Sbjct: 329 VWKIGKRPMLQSDVNAGNFESLPKELRPQGRIL 361


>ref|XP_004489278.1| PREDICTED: uncharacterized protein LOC101500159 [Cicer arietinum]
          Length = 422

 Score =  334 bits (857), Expect = 2e-89
 Identities = 159/211 (75%), Positives = 185/211 (87%), Gaps = 1/211 (0%)
 Frame = +1

Query: 4   NLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVG 183
           NL NWSCA+GYGVGSGSPLSPSFNF LEL K SQF+ASFYQH+VVQRRVKNPLEE+ VVG
Sbjct: 212 NLMNWSCAMGYGVGSGSPLSPSFNFSLELVKSSQFVASFYQHMVVQRRVKNPLEENTVVG 271

Query: 184 ITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKS 363
           ITNYIDFGFELQT V+D    ++I DSTFQ+ ASWQANKNFLVK KVGP SS++ALAFKS
Sbjct: 272 ITNYIDFGFELQTSVDDAIATNNISDSTFQIGASWQANKNFLVKAKVGPRSSTMALAFKS 331

Query: 364 WWKPSFTFNVSATRDRIKGTA-FGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIHW 540
           WWKPSFTF+VSATRDR  G   +GFG++ +++RE SYQRADPNFVMLT +KEHLAEGI W
Sbjct: 332 WWKPSFTFSVSATRDRADGNVQYGFGVQSESLREASYQRADPNFVMLTQSKEHLAEGIVW 391

Query: 541 KVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           + GKRP+LQSD+N+G+F+G+PRELRP  K+L
Sbjct: 392 QTGKRPMLQSDINAGHFEGLPRELRPLDKIL 422


>ref|XP_004306345.1| PREDICTED: uncharacterized protein LOC101294392 [Fragaria vesca
           subsp. vesca]
          Length = 443

 Score =  330 bits (845), Expect = 4e-88
 Identities = 161/227 (70%), Positives = 193/227 (85%), Gaps = 16/227 (7%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           K+++NWS AIGYGVGS SPLSPSFNFGLEL++ +QFIASFYQHVVVQRRVKNPLEE+++V
Sbjct: 217 KDVKNWSGAIGYGVGSDSPLSPSFNFGLELSRSNQFIASFYQHVVVQRRVKNPLEENQIV 276

Query: 181 GITNYIDFGFELQTRV---NDEKT------------PSSIQDSTFQVVASWQANKNFLVK 315
           GITNYIDFGFELQ+R+   +D K+            P +I DSTFQV ASWQANKN L+K
Sbjct: 277 GITNYIDFGFELQSRLDRFDDTKSSDDESNSNRVPDPKNIPDSTFQVAASWQANKNCLLK 336

Query: 316 GKVGPLSSSIALAFKSWWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNF 492
           GKVGPLSSS+A AFKSWWKPSFTF++SA RD + G TA+GFGIRV+NIR+ SY+RADPNF
Sbjct: 337 GKVGPLSSSLAFAFKSWWKPSFTFSISAVRDHVFGNTAYGFGIRVENIRQASYERADPNF 396

Query: 493 VMLTPTKEHLAEGIHWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           VMLTPTKEHLA+GI W+ GKRP+L+S++N+GNFDG+P ELRP  K+L
Sbjct: 397 VMLTPTKEHLAQGIAWEAGKRPMLESNINAGNFDGVPMELRPLRKIL 443


>gb|ESW23035.1| hypothetical protein PHAVU_004G013700g [Phaseolus vulgaris]
          Length = 424

 Score =  328 bits (842), Expect = 9e-88
 Identities = 158/211 (74%), Positives = 181/211 (85%), Gaps = 1/211 (0%)
 Frame = +1

Query: 4   NLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVG 183
           NL NWSCA+GYGVGSGSPL PSFNF LEL K SQFIASFYQH+VVQRRVKNPLEE+ VVG
Sbjct: 214 NLMNWSCALGYGVGSGSPLCPSFNFNLELVKSSQFIASFYQHMVVQRRVKNPLEENSVVG 273

Query: 184 ITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKS 363
           ITNYIDFGFEL T V+D    ++I DSTFQ+ ASWQANKNFL+K K GP SSS+ALAFKS
Sbjct: 274 ITNYIDFGFELLTSVDDAIAANNISDSTFQIGASWQANKNFLLKAKAGPRSSSMALAFKS 333

Query: 364 WWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIHW 540
           WWKPSFT ++SATRDR  G   +GFGI+ +N+RE SYQRADPNFVMLTP+KEHLAEGI W
Sbjct: 334 WWKPSFTISISATRDRADGKMQYGFGIQSENLREASYQRADPNFVMLTPSKEHLAEGIVW 393

Query: 541 KVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           + GKRP+ QSDV++G+FD +PRELRPF K+L
Sbjct: 394 ETGKRPMFQSDVDAGHFDVLPRELRPFDKIL 424


>gb|EPS70531.1| hypothetical protein M569_04229, partial [Genlisea aurea]
          Length = 422

 Score =  327 bits (839), Expect = 2e-87
 Identities = 152/213 (71%), Positives = 182/213 (85%), Gaps = 2/213 (0%)
 Frame = +1

Query: 1   KNLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVV 180
           +N+ENWSCA+GYG+GSGSPLSPSFNF LE A+ SQFIASFYQHVVVQRRVKNP+EE E+V
Sbjct: 210 RNMENWSCAVGYGLGSGSPLSPSFNFALEFARNSQFIASFYQHVVVQRRVKNPIEEKEIV 269

Query: 181 GITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFK 360
           GITNYIDFGFELQT+VNDEK+P ++ + + Q  ASWQANKN L+KGKVG LSSSIALAFK
Sbjct: 270 GITNYIDFGFELQTKVNDEKSPKNLNEPSLQAAASWQANKNLLLKGKVGQLSSSIALAFK 329

Query: 361 SWWKPSFTFNVSATRDRI--KGTAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGI 534
           SWW PSFT ++SA  D       +FGFG+R+DNIR+ SY+RADPNFVMLTP KEHLA+G+
Sbjct: 330 SWWNPSFTLSMSAMWDHTAKANGSFGFGLRIDNIRKASYERADPNFVMLTPNKEHLAQGM 389

Query: 535 HWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
            W+ GKRPLLQS++ SG+F+G+P ELRP  K+L
Sbjct: 390 QWETGKRPLLQSELGSGDFNGIPNELRPLDKIL 422


>ref|XP_003618290.1| hypothetical protein MTR_6g007040 [Medicago truncatula]
           gi|355493305|gb|AES74508.1| hypothetical protein
           MTR_6g007040 [Medicago truncatula]
          Length = 465

 Score =  323 bits (828), Expect = 4e-86
 Identities = 155/214 (72%), Positives = 182/214 (85%), Gaps = 4/214 (1%)
 Frame = +1

Query: 4   NLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVG 183
           NL NWSCA+ YGVGS SPLSPSFNF LEL K SQF+ASFYQH+VVQRRVKNPLEE+ VVG
Sbjct: 252 NLMNWSCAMAYGVGSQSPLSPSFNFSLELVKSSQFVASFYQHMVVQRRVKNPLEENTVVG 311

Query: 184 ITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKS 363
           ITNYIDFGFELQT V+D    ++I DSTFQ+ ASWQANKNFLVK K GP SS++ALAFKS
Sbjct: 312 ITNYIDFGFELQTSVDDAIAANNISDSTFQIAASWQANKNFLVKAKAGPKSSTMALAFKS 371

Query: 364 WWKPSFTFNV---SATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEG 531
           WWKPSFTF++   S TRDR  G   +GFG++ +++RE SYQRADPNFVMLTP+KEHLAEG
Sbjct: 372 WWKPSFTFSISGTSTTRDRADGQVQYGFGLQSESLREASYQRADPNFVMLTPSKEHLAEG 431

Query: 532 IHWKVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           I W+ GKRP+LQSD+++G+FDG+PRELRP  K+L
Sbjct: 432 IVWETGKRPMLQSDIDAGHFDGLPRELRPLDKIL 465


>gb|ACZ74705.1| hypothetical protein [Phaseolus vulgaris]
          Length = 421

 Score =  318 bits (816), Expect = 9e-85
 Identities = 156/211 (73%), Positives = 177/211 (83%), Gaps = 1/211 (0%)
 Frame = +1

Query: 4   NLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVG 183
           NL NWSCA+GYGVGSGSPL PSFNF LEL K SQFIASFYQH+VVQRRVKNPLEE+ VVG
Sbjct: 214 NLMNWSCALGYGVGSGSPLCPSFNFNLELVKSSQFIASFYQHMVVQRRVKNPLEENSVVG 273

Query: 184 ITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKS 363
           ITNYIDFGFEL T V+D    ++I DSTFQ+ ASWQANKNFL+K K GP SSS+ALAFKS
Sbjct: 274 ITNYIDFGFELLTSVDDAIAANNISDSTFQIGASWQANKNFLLKAKAGPRSSSMALAFKS 333

Query: 364 WWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIHW 540
           WWKPSFT     TRDR  G   +GFGI+ +N+RE SYQRADPNFVMLTP+KEHLAEGI W
Sbjct: 334 WWKPSFTI---TTRDRADGKMQYGFGIQSENLREASYQRADPNFVMLTPSKEHLAEGIVW 390

Query: 541 KVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           + GKRP+ QSDV++G+FD +PRELRPF K+L
Sbjct: 391 ETGKRPMFQSDVDAGHFDVLPRELRPFDKIL 421


>ref|XP_006395389.1| hypothetical protein EUTSA_v10004293mg [Eutrema salsugineum]
           gi|557092028|gb|ESQ32675.1| hypothetical protein
           EUTSA_v10004293mg [Eutrema salsugineum]
          Length = 419

 Score =  318 bits (815), Expect = 1e-84
 Identities = 156/211 (73%), Positives = 178/211 (84%), Gaps = 1/211 (0%)
 Frame = +1

Query: 4   NLENWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVG 183
           +L NWSCA GYGVGS SPLSPSFNFGLELA+ SQFIASFYQHVVVQRRVKNP EE+EVVG
Sbjct: 213 DLRNWSCAAGYGVGSRSPLSPSFNFGLELARSSQFIASFYQHVVVQRRVKNPFEENEVVG 272

Query: 184 ITNYIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKS 363
           ITNYID GFELQTRV+D KTP    DS+ Q+ ASWQANKNFL+KGKVG LSS++ LAFKS
Sbjct: 273 ITNYIDLGFELQTRVDDSKTP----DSSLQMAASWQANKNFLLKGKVGALSSTLTLAFKS 328

Query: 364 WWKPSFTFNVSATRDRIKG-TAFGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIHW 540
           WWKPSF FN+SAT +   G    GFG+RVD++RE SYQRADPNFVMLTP KEHLAEGI W
Sbjct: 329 WWKPSFAFNISATTNHSTGEVECGFGLRVDSLREASYQRADPNFVMLTPNKEHLAEGIVW 388

Query: 541 KVGKRPLLQSDVNSGNFDGMPRELRPFGKML 633
           K+GKRP+ QSD+++ NF  +P+ELRP  K+L
Sbjct: 389 KMGKRPMFQSDLDAENFTELPKELRPPQKIL 419


>ref|NP_001189992.1| uncharacterized protein [Arabidopsis thaliana]
           gi|332643862|gb|AEE77383.1| uncharacterized protein
           AT3G27930 [Arabidopsis thaliana]
          Length = 420

 Score =  316 bits (809), Expect = 6e-84
 Identities = 151/208 (72%), Positives = 176/208 (84%), Gaps = 1/208 (0%)
 Frame = +1

Query: 13  NWSCAIGYGVGSGSPLSPSFNFGLELAKGSQFIASFYQHVVVQRRVKNPLEEDEVVGITN 192
           NWSCA GYGVGS SPL+PSFN G+ELA+ SQFIASFYQHVVVQRRV+NP EE++VVGITN
Sbjct: 213 NWSCAAGYGVGSQSPLTPSFNIGIELARSSQFIASFYQHVVVQRRVQNPFEENQVVGITN 272

Query: 193 YIDFGFELQTRVNDEKTPSSIQDSTFQVVASWQANKNFLVKGKVGPLSSSIALAFKSWWK 372
           YIDFGFELQ+RV+D KTP +  DS  QV ASWQANKNFL+KGKVG  SS+++LAFKSWWK
Sbjct: 273 YIDFGFELQSRVDDSKTPPNAPDSLLQVAASWQANKNFLLKGKVGAHSSTLSLAFKSWWK 332

Query: 373 PSFTFNVSATRDRIKGTA-FGFGIRVDNIREGSYQRADPNFVMLTPTKEHLAEGIHWKVG 549
           PSF FN+SAT +   G    GFG+RVDN+RE SYQRADPNFVMLTP KEHLAEGI WK+G
Sbjct: 333 PSFAFNISATTNHRTGNVQCGFGLRVDNLREASYQRADPNFVMLTPNKEHLAEGIVWKMG 392

Query: 550 KRPLLQSDVNSGNFDGMPRELRPFGKML 633
           KRP+ Q+DV++ NF  +P+ELRP  K+L
Sbjct: 393 KRPMYQADVDAENFSELPKELRPSQKIL 420

