BLASTX nr result

ID: Dioscorea21_contig00004475

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00004475
         (1853 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFH58568.1| aspartic acid protease [Ananas comosus]                732   0.0  
ref|XP_002298827.1| predicted protein [Populus trichocarpa] gi|2...   713   0.0  
ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group] g...   704   0.0  
ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ric...   704   0.0  
dbj|BAA06876.1| aspartic protease [Oryza sativa] gi|1711289|dbj|...   702   0.0  

>gb|AFH58568.1| aspartic acid protease [Ananas comosus]
          Length = 514

 Score =  732 bits (1889), Expect = 0.0
 Identities = 354/487 (72%), Positives = 402/487 (82%), Gaps = 1/487 (0%)
 Frame = -3

Query: 1686 DGLVKIGLKKKPIDDSSRLAARLT-LQEGKRLSGHRYGLRSGLSDGNTDTDIISLKNYMN 1510
            DGLV+IGLKK+PID+++R+AARL   +EG  L+  RYGLR        +TDII+LKNYMN
Sbjct: 28   DGLVRIGLKKRPIDENNRIAARLVEKEEGPLLAARRYGLRGAPLKEGEETDIIALKNYMN 87

Query: 1509 AQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGKS 1330
            AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFS+ACLFH            ++GKS
Sbjct: 88   AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACLFHTKYKSGRSSSYHKNGKS 147

Query: 1329 AEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEISV 1150
            A IHYGTGAISGFFS D+VKVGDLVVK Q FIEAT+EPS+TF++AKFDGILGLGF+EISV
Sbjct: 148  ASIHYGTGAISGFFSTDHVKVGDLVVKTQDFIEATKEPSVTFVVAKFDGILGLGFQEISV 207

Query: 1149 GDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQKG 970
            G+A PVWYNMV QGLIKEPVFSFWFNR+A             DPNHYKG HTYVPVTQKG
Sbjct: 208  GNAVPVWYNMVDQGLIKEPVFSFWFNRNANDGEGGEIVFGGADPNHYKGNHTYVPVTQKG 267

Query: 969  YWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECKA 790
            YWQF M DVL+GGQ+TGFC+GGC+AIADSGTSL+AGPTT+I EINQKIGA+GVVSQECKA
Sbjct: 268  YWQFEMGDVLVGGQSTGFCNGGCAAIADSGTSLLAGPTTIIAEINQKIGASGVVSQECKA 327

Query: 789  VVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDAM 610
            VVA+YG+QIL +LL E QP KICS IGLC FDG QGVS GIESVV+ D  R +AG +DAM
Sbjct: 328  VVAEYGQQILQMLLAEVQPGKICSSIGLCTFDGKQGVSAGIESVVNKDTRRSAAGLSDAM 387

Query: 609  CTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTIG 430
            C  CEMAVVWM            I +Y+N+LC++LPSPMGESSVDCSSV SMP +SFTIG
Sbjct: 388  CNVCEMAVVWMQNQISQNQTQELIFNYLNQLCEKLPSPMGESSVDCSSVASMPDISFTIG 447

Query: 429  GKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNLQ 250
            GK F+L PEQYIL+VGEG  AQCISGFTALDVPPPRGPLWILGD+FMGAYHTVFDYGN++
Sbjct: 448  GKKFSLKPEQYILQVGEGYAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNMR 507

Query: 249  VGFAEAA 229
            VGFA+AA
Sbjct: 508  VGFADAA 514


>ref|XP_002298827.1| predicted protein [Populus trichocarpa] gi|222846085|gb|EEE83632.1|
            predicted protein [Populus trichocarpa]
          Length = 494

 Score =  713 bits (1841), Expect = 0.0
 Identities = 344/485 (70%), Positives = 401/485 (82%)
 Frame = -3

Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLRSGLSDGNTDTDIISLKNYMNA 1507
            DGL++IGLKK+  + ++RLAA+L  +EG+ +   +Y L   L     DTDI+SLKNYM+A
Sbjct: 11   DGLIRIGLKKRKYERNNRLAAKLESKEGESIK--KYHLLRNLGGDAEDTDIVSLKNYMDA 68

Query: 1506 QYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGKSA 1327
            QYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSVAC FH           KE+GKSA
Sbjct: 69   QYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSHSRTYKENGKSA 128

Query: 1326 EIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEISVG 1147
            EIHYGTGAISGFFSQD+VKVGDLVVK+Q FIEATREPS+TF++AKFDGILGLGF+EISVG
Sbjct: 129  EIHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEISVG 188

Query: 1146 DAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQKGY 967
             A PVWYNMV+QGL+KEPVFSFWFNR+A            VDP+HYKGEHTYVPVTQKGY
Sbjct: 189  KAVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHYKGEHTYVPVTQKGY 248

Query: 966  WQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECKAV 787
            WQF+M DVLIGGQT+GFC+ GC+AIADSGTSL+AGPTT+ITE+N  IGA GVVSQECKAV
Sbjct: 249  WQFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAV 308

Query: 786  VAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDAMC 607
            VAQYG+ I+ +LL + QP KIC+QIGLC FDGT+GVS+GIESVV++   + S G +DAMC
Sbjct: 309  VAQYGDTIMEMLLAKDQPQKICAQIGLCTFDGTRGVSMGIESVVNEHAQKASDGFHDAMC 368

Query: 606  TACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTIGG 427
            + CEMAVVWM           +IL YVNELC+RLPSPMGES+VDC  + SMP VSFTIGG
Sbjct: 369  STCEMAVVWMQNQLKQNQTQERILDYVNELCERLPSPMGESAVDCDGLSSMPNVSFTIGG 428

Query: 426  KTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNLQV 247
            + F L PEQY+LKVGEG +AQCISGFTALDVPPPRGPLWILGD+FMG++HTVFDYGN++V
Sbjct: 429  RVFELSPEQYVLKVGEGDVAQCISGFTALDVPPPRGPLWILGDVFMGSFHTVFDYGNMRV 488

Query: 246  GFAEA 232
            GFAEA
Sbjct: 489  GFAEA 493


>ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
            gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName:
            Full=Aspartic proteinase oryzasin-1; Flags: Precursor
            gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1
            precursor [Oryza sativa Japonica Group]
            gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa
            Japonica Group] gi|125553350|gb|EAY99059.1| hypothetical
            protein OsI_21016 [Oryza sativa Indica Group]
            gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin
            1 [Oryza sativa Japonica Group]
            gi|215695381|dbj|BAG90572.1| unnamed protein product
            [Oryza sativa Japonica Group]
            gi|215737145|dbj|BAG96074.1| unnamed protein product
            [Oryza sativa Japonica Group]
            gi|215740829|dbj|BAG96985.1| unnamed protein product
            [Oryza sativa Japonica Group] gi|222632587|gb|EEE64719.1|
            hypothetical protein OsJ_19575 [Oryza sativa Japonica
            Group]
          Length = 509

 Score =  704 bits (1818), Expect = 0.0
 Identities = 338/488 (69%), Positives = 397/488 (81%), Gaps = 2/488 (0%)
 Frame = -3

Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLR--SGLSDGNTDTDIISLKNYM 1513
            +GLV+I LKK+PID++SR+AARL+ +EG R    R GLR  + L  G  + DI++LKNYM
Sbjct: 26   EGLVRIALKKRPIDENSRVAARLSGEEGAR----RLGLRGANSLGGGGGEGDIVALKNYM 81

Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333
            NAQYFGEIGVGTPPQ FTVIFDTGSSNLWVPSAKCYFS+AC FH           +++GK
Sbjct: 82   NAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNGK 141

Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153
             A I YGTG+I+GFFS+D+V VGDLVVKDQ FIEAT+EP +TFM+AKFDGILGLGF+EIS
Sbjct: 142  PAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEIS 201

Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973
            VGDA PVWY MV+QGL+ EPVFSFWFNRH+            +DP+HYKG HTYVPV+QK
Sbjct: 202  VGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQK 261

Query: 972  GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793
            GYWQF M DVLIGG+TTGFC+ GCSAIADSGTSL+AGPT +ITEIN+KIGA GVVSQECK
Sbjct: 262  GYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECK 321

Query: 792  AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613
             VV+QYG+QIL+LLL E QP+KICSQ+GLC FDG  GVS GI+SVVDD+ G  +  Q+  
Sbjct: 322  TVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDEAGESNGLQSGP 381

Query: 612  MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433
            MC ACEMAVVWM            IL+Y+N+LCD+LPSPMGESSVDC S+ SMP +SFTI
Sbjct: 382  MCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTI 441

Query: 432  GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253
            GGK F L PE+YILKVGEG+ AQCISGFTA+D+PPPRGPLWILGD+FMGAYHTVFDYG +
Sbjct: 442  GGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKM 501

Query: 252  QVGFAEAA 229
            +VGFA++A
Sbjct: 502  RVGFAKSA 509


>ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
            gi|223530603|gb|EEF32480.1| Aspartic proteinase
            precursor, putative [Ricinus communis]
          Length = 514

 Score =  704 bits (1817), Expect = 0.0
 Identities = 342/488 (70%), Positives = 398/488 (81%), Gaps = 2/488 (0%)
 Frame = -3

Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGK--RLSGHRYGLRSGLSDGNTDTDIISLKNYM 1513
            DGLV+IGLKK+  D ++R+AA+   +EG+  R S  +Y +R  L D   D DI+SLKNYM
Sbjct: 28   DGLVRIGLKKRKFDQNNRVAAQFESKEGEAFRASIKKYHIRGNLGDAE-DIDIVSLKNYM 86

Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333
            +AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSVAC FH           K++GK
Sbjct: 87   DAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSGQSSTYKKNGK 146

Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153
            SA+IHYGTGAISGFFSQDNVKVG+LV+K+Q FIEATREPSITF++AKFDGILGLGF+EIS
Sbjct: 147  SADIHYGTGAISGFFSQDNVKVGELVIKNQEFIEATREPSITFLVAKFDGILGLGFQEIS 206

Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973
            VG+A PVWYNMV QGL+KEPVFSFWFNR+A            +DPNHYKGEHTYVPVTQK
Sbjct: 207  VGNAVPVWYNMVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMDPNHYKGEHTYVPVTQK 266

Query: 972  GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793
            GYWQF+M DVLI G+TTG CS GC+AIADSGTSL+AGPTT+ITE+N  IGA GVVSQECK
Sbjct: 267  GYWQFDMGDVLIDGKTTGICSSGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 326

Query: 792  AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613
            AVVAQYGE I+ +LL + QP KICSQIGLC FDG++GVS+GIESVV++     + G +DA
Sbjct: 327  AVVAQYGETIIAMLLAKDQPQKICSQIGLCTFDGSRGVSMGIESVVNEKIQEVAGGLHDA 386

Query: 612  MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433
            MC+ CEMAVVWM            IL+YVNELC+RLPSPMGES+VDC S+ +MP VSFTI
Sbjct: 387  MCSTCEMAVVWMQNQLKQNQTQEHILNYVNELCERLPSPMGESAVDCGSLSTMPNVSFTI 446

Query: 432  GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253
            GG+ F+L PEQY+LKVG+G  AQCISGFTALDVPPPRGPLWILGD+FMG +HTVFDYGN 
Sbjct: 447  GGRVFDLAPEQYVLKVGDGEAAQCISGFTALDVPPPRGPLWILGDVFMGPFHTVFDYGNK 506

Query: 252  QVGFAEAA 229
            +VGFAE A
Sbjct: 507  RVGFAEVA 514


>dbj|BAA06876.1| aspartic protease [Oryza sativa] gi|1711289|dbj|BAA06875.1| aspartic
            protease [Oryza sativa]
          Length = 509

 Score =  702 bits (1812), Expect = 0.0
 Identities = 337/488 (69%), Positives = 396/488 (81%), Gaps = 2/488 (0%)
 Frame = -3

Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLR--SGLSDGNTDTDIISLKNYM 1513
            +GLV+I LKK+PID++SR+AARL+ +EG R    R GLR  + L  G  + DI++LKNYM
Sbjct: 26   EGLVRIALKKRPIDENSRVAARLSGEEGAR----RLGLRGANSLGGGGGEGDIVALKNYM 81

Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333
            NAQYFGEIGVGTPPQ FTVIFDTGSSNLWVPSAKCYFS+AC FH           +++GK
Sbjct: 82   NAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNGK 141

Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153
             A I YGTG+I+GFFS+D+V VGDLVVKDQ FIEAT+EP +TFM+AKFDGILGLGF+EIS
Sbjct: 142  PAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEIS 201

Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973
            VGDA PVWY MV+QGL+ EPVFSFWFNRH+            +DP+HYKG HTYVPV+QK
Sbjct: 202  VGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQK 261

Query: 972  GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793
            GYWQF M DVLIGG+TTGFC+ GCSAIADSGTSL+AGPT +ITEIN+KIGA GVVSQECK
Sbjct: 262  GYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECK 321

Query: 792  AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613
             VV+QYG+QIL+LLL E QP+KICSQ+GLC FDG  GVS GI+SVVDD+ G  +  Q+  
Sbjct: 322  TVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDEAGESNGLQSGP 381

Query: 612  MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433
            MC ACEMAVVWM            IL+Y+N+LCD+LPSPMGESSVDC S+ SMP +SFTI
Sbjct: 382  MCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTI 441

Query: 432  GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253
            G K F L PE+YILKVGEG+ AQCISGFTA+D+PPPRGPLWILGD+FMGAYHTVFDYG +
Sbjct: 442  GAKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKM 501

Query: 252  QVGFAEAA 229
            +VGFA++A
Sbjct: 502  RVGFAKSA 509

