BLASTX nr result
ID: Dioscorea21_contig00004475
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00004475 (1853 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AFH58568.1| aspartic acid protease [Ananas comosus] 732 0.0 ref|XP_002298827.1| predicted protein [Populus trichocarpa] gi|2... 713 0.0 ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group] g... 704 0.0 ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ric... 704 0.0 dbj|BAA06876.1| aspartic protease [Oryza sativa] gi|1711289|dbj|... 702 0.0 >gb|AFH58568.1| aspartic acid protease [Ananas comosus] Length = 514 Score = 732 bits (1889), Expect = 0.0 Identities = 354/487 (72%), Positives = 402/487 (82%), Gaps = 1/487 (0%) Frame = -3 Query: 1686 DGLVKIGLKKKPIDDSSRLAARLT-LQEGKRLSGHRYGLRSGLSDGNTDTDIISLKNYMN 1510 DGLV+IGLKK+PID+++R+AARL +EG L+ RYGLR +TDII+LKNYMN Sbjct: 28 DGLVRIGLKKRPIDENNRIAARLVEKEEGPLLAARRYGLRGAPLKEGEETDIIALKNYMN 87 Query: 1509 AQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGKS 1330 AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFS+ACLFH ++GKS Sbjct: 88 AQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSIACLFHTKYKSGRSSSYHKNGKS 147 Query: 1329 AEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEISV 1150 A IHYGTGAISGFFS D+VKVGDLVVK Q FIEAT+EPS+TF++AKFDGILGLGF+EISV Sbjct: 148 ASIHYGTGAISGFFSTDHVKVGDLVVKTQDFIEATKEPSVTFVVAKFDGILGLGFQEISV 207 Query: 1149 GDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQKG 970 G+A PVWYNMV QGLIKEPVFSFWFNR+A DPNHYKG HTYVPVTQKG Sbjct: 208 GNAVPVWYNMVDQGLIKEPVFSFWFNRNANDGEGGEIVFGGADPNHYKGNHTYVPVTQKG 267 Query: 969 YWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECKA 790 YWQF M DVL+GGQ+TGFC+GGC+AIADSGTSL+AGPTT+I EINQKIGA+GVVSQECKA Sbjct: 268 YWQFEMGDVLVGGQSTGFCNGGCAAIADSGTSLLAGPTTIIAEINQKIGASGVVSQECKA 327 Query: 789 VVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDAM 610 VVA+YG+QIL +LL E QP KICS IGLC FDG QGVS GIESVV+ D R +AG +DAM Sbjct: 328 VVAEYGQQILQMLLAEVQPGKICSSIGLCTFDGKQGVSAGIESVVNKDTRRSAAGLSDAM 387 Query: 609 CTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTIG 430 C CEMAVVWM I +Y+N+LC++LPSPMGESSVDCSSV SMP +SFTIG Sbjct: 388 CNVCEMAVVWMQNQISQNQTQELIFNYLNQLCEKLPSPMGESSVDCSSVASMPDISFTIG 447 Query: 429 GKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNLQ 250 GK F+L PEQYIL+VGEG AQCISGFTALDVPPPRGPLWILGD+FMGAYHTVFDYGN++ Sbjct: 448 GKKFSLKPEQYILQVGEGYAAQCISGFTALDVPPPRGPLWILGDVFMGAYHTVFDYGNMR 507 Query: 249 VGFAEAA 229 VGFA+AA Sbjct: 508 VGFADAA 514 >ref|XP_002298827.1| predicted protein [Populus trichocarpa] gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa] Length = 494 Score = 713 bits (1841), Expect = 0.0 Identities = 344/485 (70%), Positives = 401/485 (82%) Frame = -3 Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLRSGLSDGNTDTDIISLKNYMNA 1507 DGL++IGLKK+ + ++RLAA+L +EG+ + +Y L L DTDI+SLKNYM+A Sbjct: 11 DGLIRIGLKKRKYERNNRLAAKLESKEGESIK--KYHLLRNLGGDAEDTDIVSLKNYMDA 68 Query: 1506 QYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGKSA 1327 QYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSVAC FH KE+GKSA Sbjct: 69 QYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSHSRTYKENGKSA 128 Query: 1326 EIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEISVG 1147 EIHYGTGAISGFFSQD+VKVGDLVVK+Q FIEATREPS+TF++AKFDGILGLGF+EISVG Sbjct: 129 EIHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEISVG 188 Query: 1146 DAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQKGY 967 A PVWYNMV+QGL+KEPVFSFWFNR+A VDP+HYKGEHTYVPVTQKGY Sbjct: 189 KAVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHYKGEHTYVPVTQKGY 248 Query: 966 WQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECKAV 787 WQF+M DVLIGGQT+GFC+ GC+AIADSGTSL+AGPTT+ITE+N IGA GVVSQECKAV Sbjct: 249 WQFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAV 308 Query: 786 VAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDAMC 607 VAQYG+ I+ +LL + QP KIC+QIGLC FDGT+GVS+GIESVV++ + S G +DAMC Sbjct: 309 VAQYGDTIMEMLLAKDQPQKICAQIGLCTFDGTRGVSMGIESVVNEHAQKASDGFHDAMC 368 Query: 606 TACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTIGG 427 + CEMAVVWM +IL YVNELC+RLPSPMGES+VDC + SMP VSFTIGG Sbjct: 369 STCEMAVVWMQNQLKQNQTQERILDYVNELCERLPSPMGESAVDCDGLSSMPNVSFTIGG 428 Query: 426 KTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNLQV 247 + F L PEQY+LKVGEG +AQCISGFTALDVPPPRGPLWILGD+FMG++HTVFDYGN++V Sbjct: 429 RVFELSPEQYVLKVGEGDVAQCISGFTALDVPPPRGPLWILGDVFMGSFHTVFDYGNMRV 488 Query: 246 GFAEA 232 GFAEA Sbjct: 489 GFAEA 493 >ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group] gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica Group] gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group] gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group] gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group] gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group] gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group] gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group] gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group] Length = 509 Score = 704 bits (1818), Expect = 0.0 Identities = 338/488 (69%), Positives = 397/488 (81%), Gaps = 2/488 (0%) Frame = -3 Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLR--SGLSDGNTDTDIISLKNYM 1513 +GLV+I LKK+PID++SR+AARL+ +EG R R GLR + L G + DI++LKNYM Sbjct: 26 EGLVRIALKKRPIDENSRVAARLSGEEGAR----RLGLRGANSLGGGGGEGDIVALKNYM 81 Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333 NAQYFGEIGVGTPPQ FTVIFDTGSSNLWVPSAKCYFS+AC FH +++GK Sbjct: 82 NAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNGK 141 Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153 A I YGTG+I+GFFS+D+V VGDLVVKDQ FIEAT+EP +TFM+AKFDGILGLGF+EIS Sbjct: 142 PAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEIS 201 Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973 VGDA PVWY MV+QGL+ EPVFSFWFNRH+ +DP+HYKG HTYVPV+QK Sbjct: 202 VGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQK 261 Query: 972 GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793 GYWQF M DVLIGG+TTGFC+ GCSAIADSGTSL+AGPT +ITEIN+KIGA GVVSQECK Sbjct: 262 GYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECK 321 Query: 792 AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613 VV+QYG+QIL+LLL E QP+KICSQ+GLC FDG GVS GI+SVVDD+ G + Q+ Sbjct: 322 TVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDEAGESNGLQSGP 381 Query: 612 MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433 MC ACEMAVVWM IL+Y+N+LCD+LPSPMGESSVDC S+ SMP +SFTI Sbjct: 382 MCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTI 441 Query: 432 GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253 GGK F L PE+YILKVGEG+ AQCISGFTA+D+PPPRGPLWILGD+FMGAYHTVFDYG + Sbjct: 442 GGKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKM 501 Query: 252 QVGFAEAA 229 +VGFA++A Sbjct: 502 RVGFAKSA 509 >ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis] gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis] Length = 514 Score = 704 bits (1817), Expect = 0.0 Identities = 342/488 (70%), Positives = 398/488 (81%), Gaps = 2/488 (0%) Frame = -3 Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGK--RLSGHRYGLRSGLSDGNTDTDIISLKNYM 1513 DGLV+IGLKK+ D ++R+AA+ +EG+ R S +Y +R L D D DI+SLKNYM Sbjct: 28 DGLVRIGLKKRKFDQNNRVAAQFESKEGEAFRASIKKYHIRGNLGDAE-DIDIVSLKNYM 86 Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333 +AQYFGEIG+GTPPQ FTVIFDTGSSNLWVPS+KCYFSVAC FH K++GK Sbjct: 87 DAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSGQSSTYKKNGK 146 Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153 SA+IHYGTGAISGFFSQDNVKVG+LV+K+Q FIEATREPSITF++AKFDGILGLGF+EIS Sbjct: 147 SADIHYGTGAISGFFSQDNVKVGELVIKNQEFIEATREPSITFLVAKFDGILGLGFQEIS 206 Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973 VG+A PVWYNMV QGL+KEPVFSFWFNR+A +DPNHYKGEHTYVPVTQK Sbjct: 207 VGNAVPVWYNMVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMDPNHYKGEHTYVPVTQK 266 Query: 972 GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793 GYWQF+M DVLI G+TTG CS GC+AIADSGTSL+AGPTT+ITE+N IGA GVVSQECK Sbjct: 267 GYWQFDMGDVLIDGKTTGICSSGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 326 Query: 792 AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613 AVVAQYGE I+ +LL + QP KICSQIGLC FDG++GVS+GIESVV++ + G +DA Sbjct: 327 AVVAQYGETIIAMLLAKDQPQKICSQIGLCTFDGSRGVSMGIESVVNEKIQEVAGGLHDA 386 Query: 612 MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433 MC+ CEMAVVWM IL+YVNELC+RLPSPMGES+VDC S+ +MP VSFTI Sbjct: 387 MCSTCEMAVVWMQNQLKQNQTQEHILNYVNELCERLPSPMGESAVDCGSLSTMPNVSFTI 446 Query: 432 GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253 GG+ F+L PEQY+LKVG+G AQCISGFTALDVPPPRGPLWILGD+FMG +HTVFDYGN Sbjct: 447 GGRVFDLAPEQYVLKVGDGEAAQCISGFTALDVPPPRGPLWILGDVFMGPFHTVFDYGNK 506 Query: 252 QVGFAEAA 229 +VGFAE A Sbjct: 507 RVGFAEVA 514 >dbj|BAA06876.1| aspartic protease [Oryza sativa] gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa] Length = 509 Score = 702 bits (1812), Expect = 0.0 Identities = 337/488 (69%), Positives = 396/488 (81%), Gaps = 2/488 (0%) Frame = -3 Query: 1686 DGLVKIGLKKKPIDDSSRLAARLTLQEGKRLSGHRYGLR--SGLSDGNTDTDIISLKNYM 1513 +GLV+I LKK+PID++SR+AARL+ +EG R R GLR + L G + DI++LKNYM Sbjct: 26 EGLVRIALKKRPIDENSRVAARLSGEEGAR----RLGLRGANSLGGGGGEGDIVALKNYM 81 Query: 1512 NAQYFGEIGVGTPPQTFTVIFDTGSSNLWVPSAKCYFSVACLFHXXXXXXXXXXXKEDGK 1333 NAQYFGEIGVGTPPQ FTVIFDTGSSNLWVPSAKCYFS+AC FH +++GK Sbjct: 82 NAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKCYFSIACFFHSRYKSGQSSTYQKNGK 141 Query: 1332 SAEIHYGTGAISGFFSQDNVKVGDLVVKDQPFIEATREPSITFMMAKFDGILGLGFKEIS 1153 A I YGTG+I+GFFS+D+V VGDLVVKDQ FIEAT+EP +TFM+AKFDGILGLGF+EIS Sbjct: 142 PAAIQYGTGSIAGFFSEDSVTVGDLVVKDQEFIEATKEPGLTFMVAKFDGILGLGFQEIS 201 Query: 1152 VGDAEPVWYNMVKQGLIKEPVFSFWFNRHAXXXXXXXXXXXXVDPNHYKGEHTYVPVTQK 973 VGDA PVWY MV+QGL+ EPVFSFWFNRH+ +DP+HYKG HTYVPV+QK Sbjct: 202 VGDAVPVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQK 261 Query: 972 GYWQFNMEDVLIGGQTTGFCSGGCSAIADSGTSLIAGPTTVITEINQKIGAAGVVSQECK 793 GYWQF M DVLIGG+TTGFC+ GCSAIADSGTSL+AGPT +ITEIN+KIGA GVVSQECK Sbjct: 262 GYWQFEMGDVLIGGKTTGFCASGCSAIADSGTSLLAGPTAIITEINEKIGATGVVSQECK 321 Query: 792 AVVAQYGEQILNLLLLEAQPAKICSQIGLCAFDGTQGVSIGIESVVDDDGGRPSAGQNDA 613 VV+QYG+QIL+LLL E QP+KICSQ+GLC FDG GVS GI+SVVDD+ G + Q+ Sbjct: 322 TVVSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSVVDDEAGESNGLQSGP 381 Query: 612 MCTACEMAVVWMXXXXXXXXXXXQILSYVNELCDRLPSPMGESSVDCSSVPSMPTVSFTI 433 MC ACEMAVVWM IL+Y+N+LCD+LPSPMGESSVDC S+ SMP +SFTI Sbjct: 382 MCNACEMAVVWMQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTI 441 Query: 432 GGKTFNLGPEQYILKVGEGSMAQCISGFTALDVPPPRGPLWILGDIFMGAYHTVFDYGNL 253 G K F L PE+YILKVGEG+ AQCISGFTA+D+PPPRGPLWILGD+FMGAYHTVFDYG + Sbjct: 442 GAKKFALKPEEYILKVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVFDYGKM 501 Query: 252 QVGFAEAA 229 +VGFA++A Sbjct: 502 RVGFAKSA 509