BLASTX nr result
ID: Dioscorea21_contig00004163
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00004163
         (1348 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|ABG33750.1| cysteine protease [Hevea brasiliensis]                  592   e-167
gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]                 592   e-167
ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [V...    589   e-166
ref|XP_002510170.1| cysteine protease, putative [Ricinus communi...    588   e-166
emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]    587   e-165

>gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score = 592 bits (1527), Expect = e-167
 Identities = 277/416 (66%), Positives = 323/416 (77%)
 Frame = +1

Query: 13   YQEWLTKHGKYYNDLNEVSRRFEIFKDNLRYIKEHNAGDHGYKLGLNRFADLTNEEFREK 192
            Y++WL KHGK YN L E RRFE+FKDNLR+I EHN+ + Y++GLNRFADLTNEE+R
Sbjct: 42   YEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSM 101

Query: 193  HLGALIGGRRNQGRISSDRYEHREGDDLPESVDWRAKGAVGPVKDQGSCGSCWAFSSVAA 372
            +LGAL G RRN+ R SDRY R GD LP+SVDWR +GAV VKDQGSCGSCWAFS+VAA
Sbjct: 102  YLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAA 161

Query: 373  VEGINQIVTGDMILLSEQELVECDTESNRGCFGGLMDDAFNFIIDNGGIDTEEDYPYTAQ 552
            VEGIN+IVTGD+I LSEQELV+CD N GC GGLMD F FII+NGGID+EEDYPY A+
Sbjct: 162  VEGINKIVTGDLISLSEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLAR 221

Query: 553  DGQCDANRRNAKVVTIDSFEDVPENDEKALKKAVAHQPVSVAIEAGGKNFQLYESGIFTG 732
            DG+CD R+NA+VV+IDS+EDVP N+E AL+KAVA+QPVSVAIEAGG++FQLY SG+F+G
Sbjct: 222  DGRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSG 281

Query: 733  SCKTALDHGVTAVGYGTENGTDYWIVKNSWGKMWGEDGYIRMERNVNDATGKCGIAMMAS 912
            C TALDHGV AVGYGTENG DYWIV+NSWGK WGE GY+RM RN+ TG CGIAM AS
Sbjct: 282  RCGTALDHGVVAVGYGTENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEAS 341

Query: 913  YPIKKGKNXXXXXXXXXXXXXXXXXCDNSFSCSQGTTCCCVYEDHDNGCLAWGCCPFESA 1092
            YPIKKG+N CDN FSC + TCCC++E + N C WGCCP E A
Sbjct: 342  YPIKKGQNPPNPGPSPPSPVKPPSVCDNYFSCPESNTCCCIFE-YANFCFEWGCCPLEGA 400

Query: 1093 TCCEDHSSCCPHDYPICDVYHGTCLMSKDNPLGVKAFTRTPAMLNFRPNYEGRSDA 1260
            TCC+DH SCCPHDYPIC+V GTCLMSKDNPLGVKA RT A ++ EG+ +
Sbjct: 401  TCCDDHYSCCPHDYPICNVNQGTCLMSKDNPLGVKAIRRTRAKPHWALGAEGKKSS 456

>gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score = 592 bits (1526), Expect = e-167
 Identities = 278/408 (68%), Positives = 315/408 (77%), Gaps = 4/408 (0%)
 Frame = +1

Query: 7    LTYQEWLTKHGKYYNDLNEVSRRFEIFKDNLRYIKEHNA----GDHGYKLGLNRFADLTN 174
            + Y+ WL KHG+ YN L E RRFEIFKDN+ +I HNA G ++LGLNRFAD+TN
Sbjct: 48   ILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTN 107

Query: 175  EEFREKHLGALIGGRRNQGRISSDRYEHREGDDLPESVDWRAKGAVGPVKDQGSCGSCWA 354
            EE+R +LG G R + R+ SDRY + G+DLPESVDWRAKGAV VKDQGSCGSCWA
Sbjct: 108  EEYRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGSCWA 167

Query: 355  FSSVAAVEGINQIVTGDMILLSEQELVECDTESNRGCFGGLMDDAFNFIIDNGGIDTEED 534
            FS+VAAVEGIN+IVTGD+I LSEQELV+CD N+GC GGLMD F FII+NGGIDTEED
Sbjct: 168  FSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFIINNGGIDTEED 227

Query: 535  YPYTAQDGQCDANRRNAKVVTIDSFEDVPENDEKALKKAVAHQPVSVAIEAGGKNFQLYE 714
            YPYTA+DG+CD R+NAKVV+ID +EDVP NDEKAL+KAVA+QPVSVAIEAGG+ FQLY
Sbjct: 228  YPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQLYH 287

Query: 715  SGIFTGSCKTALDHGVTAVGYGTENGTDYWIVKNSWGKMWGEDGYIRMERNVNDATGKCG 894
            SGIFTG C T LDHGV AVGYGTENG DYWIV+NSWG WGE GYIRMERNVN +TGKCG
Sbjct: 288  SGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIRMERNVNTSTGKCG 347

Query: 895  IAMMASYPIKKGKNXXXXXXXXXXXXXXXXXCDNSFSCSQGTTCCCVYEDHDNGCLAWGC 1074
            IA+ SYP KKG+N CDN +SC TTCCCVYE + C AWGC
Sbjct: 348  IAIEPSYPTKKGQNPPKPAPSPPSPVSPPTVCDNYYSCPSSTTCCCVYE-YGRYCFAWGC 406

Query: 1075 CPFESATCCEDHSSCCPHDYPICDVYHGTCLMSKDNPLGVKAFTRTPA 1218
            CP E ATCCEDH SCCPHDYP+C+V GTC +SKDNPLGVKA RTPA
Sbjct: 407  CPLEGATCCEDHYSCCPHDYPVCNVKAGTCQLSKDNPLGVKALARTPA 454

>ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score = 589 bits (1518), Expect = e-166
 Identities = 277/414 (66%), Positives = 320/414 (77%), Gaps = 1/414 (0%)
 Frame = +1

Query: 13   YQEWLTKHGKYYNDLNEVSRRFEIFKDNLRYIKEHNAGDHGYKLGLNRFADLTNEEFREK 192
            Y+ WL KHGK YN L E RRF+IFKDNLR+I EHNA + YK+GLNRFADLTNEE+R
Sbjct: 51   YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 110

Query: 193  HLGALIGGRRNQGRISSDRYEHREGDDLPESVDWRAKGAVGPVKDQGSCGSCWAFSSVAA 372
            +LG +R SDRY R GD LPESVDWR KGAV VKDQGSCGSCWAFS++AA
Sbjct: 111  YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 170

Query: 373  VEGINQIVTGDMILLSEQELVECDTESNRGCFGGLMDDAFNFIIDNGGIDTEEDYPYTAQ 552
            VEGIN+IVTG +I LSEQELV+CDT N GC GGLMD AF FII+NGGID+EEDYPY A
Sbjct: 171  VEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKAS 230

Query: 553  DGQCDANRRNAKVVTIDSFEDVPENDEKALKKAVAHQPVSVAIEAGGKNFQLYESGIFTG 732
            DG+CD R+NAKVVTID +EDVPENDEK+L+KAVA+QPVSVAIEAGG+ FQLY+SGIFTG
Sbjct: 231  DGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTG 290

Query: 733  SCKTALDHGVTAVGYGTENGTDYWIVKNSWGKMWGEDGYIRMERNV-NDATGKCGIAMMA 909
            C TALDHGVTAVGYGTENG DYWIVKNSWG WGE+GYIRMER++ ATGKCGIAM A
Sbjct: 291  RCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEA 350

Query: 910  SYPIKKGKNXXXXXXXXXXXXXXXXXCDNSFSCSQGTTCCCVYEDHDNGCLAWGCCPFES 1089
            SYPIKKG+N CDN ++C + +TCCC++E + C WGCCP E+
Sbjct: 351  SYPIKKGQNPPNPGPSPPSPIKPPTVCDNYYACPESSTCCCIFE-YAKYCFQWGCCPLEA 409

Query: 1090 ATCCEDHSSCCPHDYPICDVYHGTCLMSKDNPLGVKAFTRTPAMLNFRPNYEGR 1251
            ATCCEDH SCCP +YP+C+V GTC+MSKDNPLGVKA RT A ++ +G+
Sbjct: 410  ATCCEDHDSCCPQEYPVCNVRAGTCMMSKDNPLGVKALKRTAAKPHWAYGGDGK 463

>ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score = 588 bits (1517), Expect = e-166
 Identities = 275/419 (65%), Positives = 324/419 (77%), Gaps = 3/419 (0%)
 Frame = +1

Query: 13   YQEWLTKHGKYY---NDLNEVSRRFEIFKDNLRYIKEHNAGDHGYKLGLNRFADLTNEEF 183
            Y+EWL K+GK + N L E RRF++FKDNLR+I EHN+ + YK+GLNRFADLTNEE+
Sbjct: 51   YEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEY 110

Query: 184  REKHLGALIGGRRNQGRISSDRYEHREGDDLPESVDWRAKGAVGPVKDQGSCGSCWAFSS 363
            R +LGA G +RN+ SS+RY R GD LP+SVDWR +GAV VKDQGSCGSCWAFS+
Sbjct: 111  RSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFST 170

Query: 364  VAAVEGINQIVTGDMILLSEQELVECDTESNRGCFGGLMDDAFNFIIDNGGIDTEEDYPY 543
            +AAVEGIN+IVTGD+I LSEQELV+CD N GC GGLMD AF FII+NGGID+EEDYPY
Sbjct: 171  IAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDSEEDYPY 230

Query: 544  TAQDGQCDANRRNAKVVTIDSFEDVPENDEKALKKAVAHQPVSVAIEAGGKNFQLYESGI 723
            A+DG CD R+NAKVVTID++EDVP NDEKAL+KAVA+QPVSVAIEAGG+ FQ Y+SGI
Sbjct: 231  LARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQSGI 290

Query: 724  FTGSCKTALDHGVTAVGYGTENGTDYWIVKNSWGKMWGEDGYIRMERNVNDATGKCGIAM 903
            FTG C TALDHGV AVGYGTENG DYWIV+NSWGK WGE GYIRMERN+ ATGKCGIA+
Sbjct: 291  FTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRMERNIATATGKCGIAI 350

Query: 904  MASYPIKKGKNXXXXXXXXXXXXXXXXXCDNSFSCSQGTTCCCVYEDHDNGCLAWGCCPF 1083
            SYPIKKG+N CD+ FSC + TTCCC++E + C WGCCP
Sbjct: 351  EPSYPIKKGQNPPNPGPSPPSPIKPPSVCDSYFSCPESTTCCCIFE-YAKYCFEWGCCPL 409

Query: 1084 ESATCCEDHSSCCPHDYPICDVYHGTCLMSKDNPLGVKAFTRTPAMLNFRPNYEGRSDA 1260
            E ATCC+DH SCCPHDYP+C++ GTCL+ KDNP GVKA RTPA ++ EGR ++
Sbjct: 410  EGATCCDDHYSCCPHDYPVCNINEGTCLIGKDNPFGVKAMRRTPAKPHWAYGLEGRKNS 468

>emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score = 587 bits (1512), Expect = e-165
 Identities = 276/414 (66%), Positives = 319/414 (77%), Gaps = 1/414 (0%)
 Frame = +1

Query: 13   YQEWLTKHGKYYNDLNEVSRRFEIFKDNLRYIKEHNAGDHGYKLGLNRFADLTNEEFREK 192
            Y+ WL KHGK YN L E RRF+IFKDNLR+I EHNA + YK+GLNRFADLTNEE+R
Sbjct: 53   YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSM 112

Query: 193  HLGALIGGRRNQGRISSDRYEHREGDDLPESVDWRAKGAVGPVKDQGSCGSCWAFSSVAA 372
            +LG +R SDRY R GD LPESVDWR KGAV VKDQGSCGSCWAFS++AA
Sbjct: 113  YLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAA 172

Query: 373  VEGINQIVTGDMILLSEQELVECDTESNRGCFGGLMDDAFNFIIDNGGIDTEEDYPYTAQ 552
            VEGIN+IVTG +I LSEQELV+CDT N GC GGLMD AF FII+NGGID+EEDYPY A
Sbjct: 173  VEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKAS 232

Query: 553  DGQCDANRRNAKVVTIDSFEDVPENDEKALKKAVAHQPVSVAIEAGGKNFQLYESGIFTG 732
            DG+CD R+NA VVTID +EDVPENDEK+L+KAVA+QPVSVAIEAGG+ FQLY+SGIFTG
Sbjct: 233  DGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTG 292

Query: 733  SCKTALDHGVTAVGYGTENGTDYWIVKNSWGKMWGEDGYIRMERNV-NDATGKCGIAMMA 909
            C TALDHGVTAVGYGTENG DYWIVKNSWG WGE+GYIRMER++ ATGKCGIAM A
Sbjct: 293  RCGTALDHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEA 352

Query: 910  SYPIKKGKNXXXXXXXXXXXXXXXXXCDNSFSCSQGTTCCCVYEDHDNGCLAWGCCPFES 1089
            SYPIKKG+N CDN ++C + +TCCC++E + C WGCCP E+
Sbjct: 353  SYPIKKGQNPPNPGPSPPSPIKPPTVCDNYYACPESSTCCCIFE-YAKYCFQWGCCPLEA 411

Query: 1090 ATCCEDHSSCCPHDYPICDVYHGTCLMSKDNPLGVKAFTRTPAMLNFRPNYEGR 1251
            ATCCEDH SCCP +YP+C+V GTC+MSKDNPLGVKA RT A ++ +G+
Sbjct: 412  ATCCEDHDSCCPQEYPVCNVRAGTCMMSKDNPLGVKALKRTAAKPHWAYGGDGK 465
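
The hit summary in this report lists, for each subject sequence, an identifier, a description, a bit score and an E-value, and the E-values are printed in the shortened form "e-167". The following is a minimal sketch of reading such rows in Python. The names HIT_ROW, parse_evalue, parse_hit and percent are hypothetical helpers written for this illustration and are not part of BLAST. Reading "e-167" as 1e-167 is an assumption about the dropped mantissa, and the truncating percentage is only an observation that matches the figures printed above (323/416 is about 77.6% and is shown as 77%).

```python
import re

# Hypothetical helpers for illustration; they are not part of BLAST.
# One row of the "Sequences producing significant alignments" table:
# identifier, free-text description, bit score, E-value.
HIT_ROW = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert an E-value field to float; a bare 'e-167' is read as 1e-167."""
    # Assumption: the missing mantissa is taken to be 1.
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hit(row):
    """Split one summary row into its four fields."""
    match = HIT_ROW.match(row.strip())
    if match is None:
        raise ValueError(f"not a hit row: {row!r}")
    return {
        "id": match["id"],
        "description": match["desc"],
        "bits": float(match["bits"]),
        "evalue": parse_evalue(match["evalue"]),
    }


def percent(count, length):
    """Whole-number percentage, truncated rather than rounded."""
    return 100 * count // length


# Two rows copied from the summary table of this report.
rows = [
    "gb|ABG33750.1| cysteine protease [Hevea brasiliensis] 592 e-167",
    "emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera] 587 e-165",
]
hits = [parse_hit(row) for row in rows]
for hit in hits:
    print(hit["id"], hit["bits"], hit["evalue"])

# Identities and Positives of the first alignment (277/416 and 323/416).
print(percent(277, 416), percent(323, 416))
```

For anything beyond a quick check, a maintained BLAST parser working from XML or tabular output is likely a safer choice than a regular expression over the text report.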