BLASTX nr result
ID: Glycyrrhiza23_contig00020298
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00020298 (1957 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula] gi... 795 0.0 ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [G... 748 0.0 ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|3... 743 0.0 ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis v... 684 0.0 ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|2... 669 0.0 >ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula] gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula] Length = 475 Score = 795 bits (2053), Expect = 0.0 Identities = 383/506 (75%), Positives = 417/506 (82%), Gaps = 1/506 (0%) Frame = -2 Query: 1851 MGSHQKNQXXXXXXXLIWGSWAFL-CYGISSDEYSILALDLDKLPSEEQVVELFQQWKQD 1675 MGSHQKN L+W S FL CYGI S EYSILA DL+K PSEEQVVELFQQWK++ Sbjct: 1 MGSHQKN--LLLLFTLLWCSLTFLSCYGIPS-EYSILAFDLNKFPSEEQVVELFQQWKKE 57 Query: 1674 HQKFYRHPEEAALRLENFKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISK 1495 HQKFY HPEEAALRLENFKRNLKYI+E+NAMRNSP+ H LGLNRFADMSNEEF+NKFISK Sbjct: 58 HQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISK 117 Query: 1494 VKKPFGKRNSLTRSSGLHVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAM 1315 V +SC+DAP SLDWRKKG VTGVKDQGNCGSCW+FSSTGA+ Sbjct: 118 V--------------------ESCDDAPYSLDWRKKGVVTGVKDQGNCGSCWSFSSTGAI 157 Query: 1314 EGVNAIVTGDLISLSEQELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDG 1135 EGVNAIVTGDLISLSEQELVDCDT+NDGC+GGYMDYAFEWVINNGGIDTEA+YPY G+ G Sbjct: 158 
EGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGG 217 Query: 1134 TCNVTKEETKVVSIDGYTDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCS 955 TCNVTKEETKVV+IDGYTDV QSDSA+ CATVKQPIS GIDG +LDFQLYTGGIYDGDCS Sbjct: 218 TCNVTKEETKVVTIDGYTDVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCS 277 Query: 954 SNPDDIDHAILIVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMAS 775 SNPDDIDHA+LIVGYGS+G++DYWIVKNSWGTSWG+EG+IYIRRNTNLKYGVCAINYMAS Sbjct: 278 SNPDDIDHAVLIVGYGSDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMAS 337 Query: 774 YPTKESTAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFF 595 +PTKEST+ S+CGDFSYC +ETCCCLYE F Sbjct: 338 FPTKESTS--------ISPTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYELF 389 Query: 594 DFCLIYGCCEYENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGRHKF 415 DFCL YGCCEYENAVCCTGT YCCPSDYPICD EDGLCLQN GD+MGVAAKKKKMG+HKF Sbjct: 390 DFCLAYGCCEYENAVCCTGTKYCCPSDYPICDTEDGLCLQNYGDLMGVAAKKKKMGKHKF 449 Query: 414 PWTKFEQTKMTHYILQMRRNPFAAMR 337 PWTK+EQTK THY LQ+RR FA +R Sbjct: 450 PWTKYEQTKKTHYPLQLRRGAFATVR 475 >ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max] Length = 517 Score = 748 bits (1931), Expect = 0.0 Identities = 357/509 (70%), Positives = 395/509 (77%), Gaps = 21/509 (4%) Frame = -2 Query: 1803 IWGSWAFLCYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLEN 1624 +WGSW FLCYG+ S EYSILAL++DK PSEE V+ELFQ+WK++++K YR P++ LR EN Sbjct: 15 VWGSWTFLCYGLPS-EYSILALEIDKFPSEEGVIELFQRWKEENKKIYRSPDQEKLRFEN 73 Query: 1623 FKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGL 1444 FKRNLKYI EKN+ R SP LGLNRFADMSNEEF++KF SKVKKPF KRN GL Sbjct: 74 FKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFTSKVKKPFSKRN------GL 127 Query: 1443 HVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQ 1264 KD SCEDAP SLDWRKKG VT VKDQG CG CWAFSSTGA+EG+NAIV+GDLISLSE Sbjct: 128 SGKDHSCEDAPYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEGINAIVSGDLISLSEP 187 Query: 1263 ELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGY 1084 ELVDCD +NDGCDGG+MDYAFEWV++NGGIDTE NYPY+G DGTCNV KEETKV+ IDGY Sbjct: 188 
ELVDCDRTNDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGTCNVAKEETKVIGIDGY 247 Query: 1083 TDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGS 904 +V QSD ++LCATVKQPISAGIDG S DFQLY GGIYDGDCSS+PDDIDHAIL+VGYGS Sbjct: 248 YNVEQSDRSLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCSSDPDDIDHAILVVGYGS 307 Query: 903 EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTA--------- 751 EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKE TA Sbjct: 308 EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKEPTAPSPSSPPSP 367 Query: 750 ------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCL 607 S+CG FSYCP ETCCCL Sbjct: 368 PSSPPPSPLTPPALPPPSPPATPPLSPPLPPATPPPLPPPPPSKCGQFSYCPAHETCCCL 427 Query: 606 YEFFDFCLIYGCCEYENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMG 427 YEFF FCL+YGCCEY+NAVCC T+YCCPSDYPICDI DGLCLQ GD+MGVAAKK K G Sbjct: 428 YEFFGFCLVYGCCEYKNAVCCIWTEYCCPSDYPICDIRDGLCLQKHGDLMGVAAKKIKKG 487 Query: 426 RHKFPWTKFEQTKMTHYILQMRRNPFAAM 340 RHK PWTKFEQT+ T++ LQ RN FAA+ Sbjct: 488 RHKLPWTKFEQTEKTYHHLQTGRNAFAAV 516 >ref|XP_003590277.1| Cysteine protease [Medicago truncatula] gi|355479325|gb|AES60528.1| Cysteine protease [Medicago truncatula] Length = 514 Score = 743 bits (1917), Expect = 0.0 Identities = 366/519 (70%), Positives = 399/519 (76%), Gaps = 45/519 (8%) Frame = -2 Query: 1845 SHQKNQXXXXXXXLIWGSWAFL-CYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQ 1669 S Q + L+W S FL CYGI S EYSILA DL+K PSEEQVVELFQQWK++HQ Sbjct: 2 SQQSKKNLLLLFTLLWCSLTFLSCYGIPS-EYSILAFDLNKFPSEEQVVELFQQWKKEHQ 60 Query: 1668 KFYRHPEEAALRLENFKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVK 1489 KFY HPEEAALRLENFKRNLKYI+E+NAMRNSP+ H LGLNRFADMSNEEF+NKFISKVK Sbjct: 61 KFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISKVK 120 Query: 1488 KPFGKRNSLTRSSGLHVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCG------------- 1348 KP KR +S LHVK +SC+DAP SLDWRKKG VTGVKDQGNCG Sbjct: 121 KPISKR-----ASNLHVKVESCDDAPYSLDWRKKGVVTGVKDQGNCGKLLYFMHFKSFLV 175 Query: 1347 -------------------------------SCWAFSSTGAMEGVNAIVTGDLISLSEQE 1261 SCW+FSSTGA+EGVNAIVTGDLISLSEQE Sbjct: 176 
IYILELTTNFPLYSFESQFCILEKKKLDFVGSCWSFSSTGAIEGVNAIVTGDLISLSEQE 235 Query: 1260 LVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGYT 1081 LVDCDT+NDGC+GGYMDYAFEWVINNGGIDTEA+YPY G+ GTCNVTKEETKVV+IDGYT Sbjct: 236 LVDCDTTNDGCEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYT 295 Query: 1080 DVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGSE 901 DV QSDSA+ CATVKQPIS GIDG +LDFQLYTGGIYDGDCSSNPDDIDHA+LIVGYGS+ Sbjct: 296 DVTQSDSALFCATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSD 355 Query: 900 GDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTAXXXXXXXXXX 721 G++DYWIVKNSWGTSWG+EG+IYIRRNTNLKYGVCAINYMAS+PTKEST+ Sbjct: 356 GNQDYWIVKNSWGTSWGIEGFIYIRRNTNLKYGVCAINYMASFPTKESTS--------IS 407 Query: 720 XXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEYENAVCCT 541 S+CGDFSYC +ETCCCLYE FDFCL YGCCEYENAVCCT Sbjct: 408 PTSPPSPPSPPPPTPPSPTPSKCGDFSYCTTEETCCCLYELFDFCLAYGCCEYENAVCCT 467 Query: 540 GTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGR 424 GT YCCPSDYPICD EDGLCLQN GD+MGVAAKKKK G+ Sbjct: 468 GTKYCCPSDYPICDTEDGLCLQNYGDLMGVAAKKKKNGK 506 >ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera] Length = 501 Score = 684 bits (1766), Expect = 0.0 Identities = 321/495 (64%), Positives = 381/495 (76%), Gaps = 6/495 (1%) Frame = -2 Query: 1803 IWGSWAFLCYGISSDEYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLEN 1624 IW S A L + ++ Y + ++ SEE+V ELF WK+ H++ Y+H EE A R E Sbjct: 14 IWASLACLSSSLPTEFY----ITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEI 69 Query: 1623 FKRNLKYIIEKNAMRNSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGL 1444 FK NLKY+IE+N+ + H LG+N+FADMSNEEF+ K++SK+KKP K+N+ R S Sbjct: 70 FKENLKYVIERNSKGHR---HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126 Query: 1443 HVKDDSCEDAPSSLDWRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQ 1264 K + +APSSLDWRKKG VTG+KDQG+CGSCWAFSSTGAMEG+NAIVTGDLISLSEQ Sbjct: 127 QKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQ 186 Query: 1263 ELVDCDTSNDGCDGGYMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGY 1084 ELVDCDT+N GC+GGYMDYAFEWVI+NGGID+E++YPYTG DGTCN 
TKE+TKVVSIDGY Sbjct: 187 ELVDCDTTNYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDGTCNTTKEDTKVVSIDGY 246 Query: 1083 TDVAQSDSAVLCATVKQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGS 904 DV +SDSA+LCA V QPIS G+DG +LDFQLYT GIY GDCS +PDDIDHA+LIVGYGS Sbjct: 247 KDVDESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHAVLIVGYGS 306 Query: 903 EGDEDYWIVKNSWGTSWGMEGYIYIRRNTNLKYGVCAINYMASYPTKEST------AXXX 742 E EDYWI KNSWGTSWGMEGY YI+RNT+L YG CAIN MASYPTKES+ + Sbjct: 307 EDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKESSSPSPYPSPAV 366 Query: 741 XXXXXXXXXXXXXXXXXXXXXXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEY 562 S+CGDFSYCP DETCCC+YEF+DFCLIYGCCEY Sbjct: 367 PPPPPPPPSPPPPPPPSPPPPSPGPSPSECGDFSYCPSDETCCCIYEFYDFCLIYGCCEY 426 Query: 561 ENAVCCTGTDYCCPSDYPICDIEDGLCLQNSGDIMGVAAKKKKMGRHKFPWTKFEQTKMT 382 ENAVCCTGT+YCCPSDYPICD+E+GLCL+N GD +GVAAKK+KM +HKFPWTK E+T+ T Sbjct: 427 ENAVCCTGTEYCCPSDYPICDVEEGLCLKNQGDYLGVAAKKRKMAKHKFPWTKIEETQKT 486 Query: 381 HYILQMRRNPFAAMR 337 + L+ +RN FAAMR Sbjct: 487 YQPLEWKRNRFAAMR 501 >ref|XP_002305743.1| predicted protein [Populus trichocarpa] gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa] Length = 494 Score = 669 bits (1727), Expect = 0.0 Identities = 314/476 (65%), Positives = 365/476 (76%), Gaps = 2/476 (0%) Frame = -2 Query: 1758 EYSILALDLDKLPSEEQVVELFQQWKQDHQKFYRHPEEAALRLENFKRNLKYIIEKNAMR 1579 EYSI+ D +LP +E ++E+FQQW+ HQK Y+H EEA R NFKRNLKYIIEK + Sbjct: 22 EYSIVGNDFSELPPDESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTG-K 80 Query: 1578 NSPLSHRLGLNRFADMSNEEFRNKFISKVKKPFGKRNSLTRSSGLHVKDDSCEDAPSSLD 1399 + L HR+GLN+FAD+SNEEF+ ++SKVKKP K + SC DAPSSLD Sbjct: 81 ETTLRHRVGLNKFADLSNEEFKQLYLSKVKKPINKTR-IDAEDRSRRNLQSC-DAPSSLD 138 Query: 1398 WRKKGAVTGVKDQGNCGSCWAFSSTGAMEGVNAIVTGDLISLSEQELVDCDTSNDGCDGG 1219 WRKKG VT VKDQG+CGSCW+FS+TGA+EG+NAIVT DLISLSEQELVDCDT+N GC+GG Sbjct: 139 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGG 198 Query: 1218 YMDYAFEWVINNGGIDTEANYPYTGLDGTCNVTKEETKVVSIDGYTDVAQSDSAVLCATV 1039 YMDYAFEWVINNGGIDTEANYPYTG+DGTCN KEE KVVSIDGY DV 
++DSA+LCA Sbjct: 199 YMDYAFEWVINNGGIDTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETDSALLCAAA 258 Query: 1038 KQPISAGIDGGSLDFQLYTGGIYDGDCSSNPDDIDHAILIVGYGSEGDEDYWIVKNSWGT 859 +QPIS GIDG ++DFQLYTGGIYDGDCS +PDDIDHA+LIVGYGSE EDYWIVKNSWGT Sbjct: 259 QQPISVGIDGSAIDFQLYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGT 318 Query: 858 SWGMEGYIYIRRNTNLKYGVCAINYMASYPTKESTA-XXXXXXXXXXXXXXXXXXXXXXX 682 SWG+EGY YI+RNT+L YGVCAIN MASYPTKE++A Sbjct: 319 SWGIEGYFYIKRNTDLPYGVCAINAMASYPTKEASAQSPTSPPSPPSPPPPPPPPPTPVP 378 Query: 681 XXXXXXXSQCGDFSYCPFDETCCCLYEFFDFCLIYGCCEYENAVCCTGTDYCCPSDYPIC 502 S CGDFSYCP DETCCC+ FD+CL+YGCC YENAVCC + YCCPSDYPIC Sbjct: 379 PPPSPQPSDCGDFSYCPSDETCCCILNVFDYCLVYGCCAYENAVCCADSVYCCPSDYPIC 438 Query: 501 DIEDGLCLQNSGDIMGVAAKKKKMGRHKFPWTKF-EQTKMTHYILQMRRNPFAAMR 337 D+E+GLCL+ GD +GVAA K+ M +HKFPWTK E+ K H +LQ +RNPFAAMR Sbjct: 439 DVEEGLCLKGQGDYLGVAASKRHMAKHKFPWTKLQERAKTDHRVLQWKRNPFAAMR 494
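The figures in the report above can be cross-checked with a short script. The following is a minimal sketch, not part of the BLAST distribution: the function names are illustrative, the statistics lines and the first Query line are copied from the alignments above, and the assumption that percentages are truncated rather than rounded is inferred only from the numbers in this report. It also checks that, for a translated (Frame = -2) hit, the nucleotide span of a Query line equals three times the number of residues shown.

```python
import re

# Statistics lines copied from the five alignments in the report above.
STATS = [
    "Identities = 383/506 (75%), Positives = 417/506 (82%), Gaps = 1/506 (0%)",
    "Identities = 357/509 (70%), Positives = 395/509 (77%), Gaps = 21/509 (4%)",
    "Identities = 366/519 (70%), Positives = 399/519 (76%), Gaps = 45/519 (8%)",
    "Identities = 321/495 (64%), Positives = 381/495 (76%), Gaps = 6/495 (1%)",
    "Identities = 314/476 (65%), Positives = 365/476 (76%), Gaps = 2/476 (0%)",
]

# First Query line of the top hit: start, end, aligned translation.
FIRST_BLOCK = (
    1851,
    1675,
    "MGSHQKNQXXXXXXXLIWGSWAFL-CYGISSDEYSILALDLDKLPSEEQVVELFQQWKQD",
)

FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in FIELD.finditer(line)
    }


def floor_percent(count, length):
    """Integer percentage, truncated (assumption inferred from this report)."""
    return 100 * count // length


def stats_consistent(line):
    """True if every reported percentage equals the truncated ratio."""
    return all(
        floor_percent(count, length) == reported
        for count, length, reported in parse_stats(line).values()
    )


def query_span_matches(q_start, q_end, aligned_query):
    """True if the nucleotide span is 3 x the residues shown (gaps excluded)."""
    residues = len(aligned_query.replace("-", ""))
    return abs(q_start - q_end) + 1 == 3 * residues


if __name__ == "__main__":
    for line in STATS:
        print(stats_consistent(line), parse_stats(line)["Identities"])
    print(query_span_matches(*FIRST_BLOCK))
```

The descending Query coordinates (1851 down to 1675) reflect the reverse-strand frame, which is why the span check uses the absolute difference.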