BLASTX nr result

ID: Dioscorea21_contig00002949 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00002949
         (1133 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]                544   e-152
gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elae...   537   e-150
ref|XP_002326950.1| predicted protein [Populus trichocarpa] gi|2...   519   e-145
gb|ABK95110.1| unknown [Populus trichocarpa]                          519   e-145
ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [V...   515   e-144

>gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  544 bits (1401), Expect = e-152
 Identities = 260/341 (76%), Positives = 294/341 (86%), Gaps = 8/341 (2%)
 Frame = +1

Query: 133  DMSIIAYDEAHGVKGV-RSEAEIRNLYEGWLVKHGKAYNALGEKDQRYEIFKDNLRFIDE 309
            DMSII+YDEAHGV+G+ RSE E+R LYEGWL KHG+AYNALGEK++R+EIFKDN+ FID 
Sbjct: 24   DMSIISYDEAHGVRGLERSEEEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDA 83

Query: 310  HNA----GDHGFKLGLNRFADLTNEEFRAKFLGAKMAA---RKRVESDRYRGDVAGELPG 468
            HNA    G   F+LGLNRFAD+TNEE+RA +LG + A    R RV SDRYR +   +LP 
Sbjct: 84   HNAAADAGHRSFRLGLNRFADMTNEEYRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPE 143

Query: 469  SVDWRALGAVAPVKDQGGCGSCWAFSTVAAVEGINQIVTGDMIVLSEQELVDCDTTYNQG 648
            SVDWRA GAVA VKDQG CGSCWAFSTVAAVEGIN+IVTGD+I LSEQELVDCD  YNQG
Sbjct: 144  SVDWRAKGAVAAVKDQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQG 203

Query: 649  CNGGLMDYAFQFIIKNGGIDTEEDYPYSGKDGKCDPYRKNAKVVSITSYEDVPVNNEKAL 828
            CNGGLMDY F+FII NGGIDTEEDYPY+ +DGKCD YRKNAKVVSI  YEDVPVN+EKAL
Sbjct: 204  CNGGLMDYGFEFIINNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263

Query: 829  KTAVAHQPVSVAIEAGGREFQLYQSGVFTGRCGTELDHGVTAVGYGTDKGKDYWIVKNSW 1008
            + AVA+QPVSVAIEAGGREFQLY SG+FTGRCGT+LDHGV AVGYGT+ GKDYWIV+NSW
Sbjct: 264  QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323

Query: 1009 GKDWGENGFVRMERNINATTGKCGIAMEASYPIKKGQNPPK 1131
            G DWGE+G++RMERN+N +TGKCGIA+E SYP KKGQNPPK
Sbjct: 324  GGDWGESGYIRMERNVNTSTGKCGIAIEPSYPTKKGQNPPK 364


>gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  537 bits (1383), Expect = e-150
 Identities = 257/340 (75%), Positives = 292/340 (85%), Gaps = 8/340 (2%)
 Frame = +1

Query: 133  DMSIIAYDEAHGVKGV-RSEAEIRNLYEGWLVKHGKAYNALGEKDQRYEIFKDNLRFIDE 309
            DMSII+YDEAHGV+G+ RSE E+R LYEGWL KHG+A NALGEK++R+EIFKDN+RFID 
Sbjct: 24   DMSIISYDEAHGVQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDA 83

Query: 310  HNA----GDHGFKLGLNRFADLTNEEFRAKFLGAKMAA---RKRVESDRYRGDVAGELPG 468
            HNA    G   F+LGLNRFAD+TNEE+R  +LG + A+   R R+ SDRYR +   ELP 
Sbjct: 84   HNAAADSGHRSFRLGLNRFADMTNEEYRTVYLGTRPASHRRRARLGSDRYRYNAGEELPE 143

Query: 469  SVDWRALGAVAPVKDQGGCGSCWAFSTVAAVEGINQIVTGDMIVLSEQELVDCDTTYNQG 648
            SVDWR  GAV  VKDQG CGSCWAFST+AAVEGIN+IVTGD+I LSEQELVDCD   NQG
Sbjct: 144  SVDWRDKGAVTTVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQG 203

Query: 649  CNGGLMDYAFQFIIKNGGIDTEEDYPYSGKDGKCDPYRKNAKVVSITSYEDVPVNNEKAL 828
            CNGGLMDYAF+FII NGGIDTEEDYPY  +DGKCD YRKNAKVVSI  YEDVPVN+EKAL
Sbjct: 204  CNGGLMDYAFEFIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKAL 263

Query: 829  KTAVAHQPVSVAIEAGGREFQLYQSGVFTGRCGTELDHGVTAVGYGTDKGKDYWIVKNSW 1008
            + AVA+QPVSVAIEAGGREFQLY SG+FTGRCGT+LDHGV AVGYGT+ GKDYWIV+NSW
Sbjct: 264  QKAVANQPVSVAIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSW 323

Query: 1009 GKDWGENGFVRMERNINATTGKCGIAMEASYPIKKGQNPP 1128
            G DWGE+G++RMERN+NA+TGKCGIAME+SYP KKGQNPP
Sbjct: 324  GGDWGESGYIRMERNVNASTGKCGIAMESSYPTKKGQNPP 363


>ref|XP_002326950.1| predicted protein [Populus trichocarpa] gi|222835265|gb|EEE73700.1|
            predicted protein [Populus trichocarpa]
          Length = 456

 Score =  519 bits (1336), Expect = e-145
 Identities = 245/335 (73%), Positives = 282/335 (84%), Gaps = 3/335 (0%)
 Frame = +1

Query: 133  DMSIIAYDEAHGVKGV-RSEAEIRNLYEGWLVKHGKAYNALGEKDQRYEIFKDNLRFIDE 309
            DMSII+Y + H  K   R++ E+  +YE WLVKHGK YNALGEK++R+EIFKDNL FID+
Sbjct: 16   DMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQ 75

Query: 310  HNAGDHGFKLGLNRFADLTNEEFRAKFLGAKMAARKRVE--SDRYRGDVAGELPGSVDWR 483
            HN+ +  + +GLNRFADLTNEEFR+ +LG +   +KR+   SDRY   V   LP SVDWR
Sbjct: 76   HNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRVGDSLPDSVDWR 135

Query: 484  ALGAVAPVKDQGGCGSCWAFSTVAAVEGINQIVTGDMIVLSEQELVDCDTTYNQGCNGGL 663
              GAVA VKDQGGCGSCWAFST+AAVEGIN+IVTGD+I LSEQELVDCDT+YN+GCNGGL
Sbjct: 136  KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 195

Query: 664  MDYAFQFIIKNGGIDTEEDYPYSGKDGKCDPYRKNAKVVSITSYEDVPVNNEKALKTAVA 843
            MDYAF+FII NGGIDTE+DYPY G+DG+CD YRKNAKVVSI SYEDVP N+E ALK AVA
Sbjct: 196  MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVA 255

Query: 844  HQPVSVAIEAGGREFQLYQSGVFTGRCGTELDHGVTAVGYGTDKGKDYWIVKNSWGKDWG 1023
            +QPVSVAIE GGR FQLY SGVFTG CGT LDHGV AVGYGT+KGKDYWIV+NSWGK WG
Sbjct: 256  NQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWG 315

Query: 1024 ENGFVRMERNINATTGKCGIAMEASYPIKKGQNPP 1128
            E+G++RMERNI + TGKCGIA+E SYPIKKGQNPP
Sbjct: 316  ESGYIRMERNIASPTGKCGIAIEPSYPIKKGQNPP 350


>gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  519 bits (1336), Expect = e-145
 Identities = 245/335 (73%), Positives = 282/335 (84%), Gaps = 3/335 (0%)
 Frame = +1

Query: 133  DMSIIAYDEAHGVKGV-RSEAEIRNLYEGWLVKHGKAYNALGEKDQRYEIFKDNLRFIDE 309
            DMSII+Y + H  K   R++ E+  +YE WLVKHGK YNALGEK++R+EIFKDNL FID+
Sbjct: 25   DMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQ 84

Query: 310  HNAGDHGFKLGLNRFADLTNEEFRAKFLGAKMAARKRVE--SDRYRGDVAGELPGSVDWR 483
            HN+ +  + +GLNRFADLTNEEFR+ +LG +   +KR+   SDRY   V   LP SVDWR
Sbjct: 85   HNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRVGDSLPDSVDWR 144

Query: 484  ALGAVAPVKDQGGCGSCWAFSTVAAVEGINQIVTGDMIVLSEQELVDCDTTYNQGCNGGL 663
              GAVA VKDQGGCGSCWAFST+AAVEGIN+IVTGD+I LSEQELVDCDT+YN+GCNGGL
Sbjct: 145  KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 204

Query: 664  MDYAFQFIIKNGGIDTEEDYPYSGKDGKCDPYRKNAKVVSITSYEDVPVNNEKALKTAVA 843
            MDYAF+FII NGGIDTE+DYPY G+DG+CD YRKNAKVVSI SYEDVP N+E ALK AVA
Sbjct: 205  MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVA 264

Query: 844  HQPVSVAIEAGGREFQLYQSGVFTGRCGTELDHGVTAVGYGTDKGKDYWIVKNSWGKDWG 1023
            +QPVSVAIE GGR FQLY SGVFTG CGT LDHGV AVGYGT+KGKDYWIV+NSWGK WG
Sbjct: 265  NQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWG 324

Query: 1024 ENGFVRMERNINATTGKCGIAMEASYPIKKGQNPP 1128
            E+G++RMERNI + TGKCGIA+E SYPIKKGQNPP
Sbjct: 325  ESGYIRMERNIASPTGKCGIAIEPSYPIKKGQNPP 359


>ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  515 bits (1326), Expect = e-144
 Identities = 244/337 (72%), Positives = 283/337 (83%), Gaps = 5/337 (1%)
 Frame = +1

Query: 133  DMSIIAYDEAHGVKGV-RSEAEIRNLYEGWLVKHGKAYNALGEKDQRYEIFKDNLRFIDE 309
            DMSII YDE HG K   R++ ++  +YE WL KHGK+YNALGEK++R++IFKDNLRFIDE
Sbjct: 25   DMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDE 84

Query: 310  HNAGDHGFKLGLNRFADLTNEEFRAKFLGAKMAARKRVE---SDRYRGDVAGELPGSVDW 480
            HNA +  +K+GLNRFADLTNEE+R+ +LG + AA++R     SDRY   V   LP SVDW
Sbjct: 85   HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDW 144

Query: 481  RALGAVAPVKDQGGCGSCWAFSTVAAVEGINQIVTGDMIVLSEQELVDCDTTYNQGCNGG 660
            R  GAV  VKDQG CGSCWAFST+AAVEGIN+IVTG +I LSEQELVDCDT+YN+GCNGG
Sbjct: 145  RKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGG 204

Query: 661  LMDYAFQFIIKNGGIDTEEDYPYSGKDGKCDPYRKNAKVVSITSYEDVPVNNEKALKTAV 840
            LMDYAF+FII NGGID+EEDYPY   DG+CD YRKNAKVV+I  YEDVP N+EK+L+ AV
Sbjct: 205  LMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAV 264

Query: 841  AHQPVSVAIEAGGREFQLYQSGVFTGRCGTELDHGVTAVGYGTDKGKDYWIVKNSWGKDW 1020
            A+QPVSVAIEAGGREFQLYQSG+FTGRCGT LDHGVTAVGYGT+ G DYWIVKNSWG  W
Sbjct: 265  ANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGTENGVDYWIVKNSWGASW 324

Query: 1021 GENGFVRMERNI-NATTGKCGIAMEASYPIKKGQNPP 1128
            GE G++RMER++  + TGKCGIAMEASYPIKKGQNPP
Sbjct: 325  GEEGYIRMERDLATSATGKCGIAMEASYPIKKGQNPP 361


Top