BLASTX nr result
ID: Glycyrrhiza23_contig00002876
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00002876
         (1830 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                        (bits) Value

gb|ACU23369.1| unknown [Glycine max]                                  839   0.0
ref|XP_003547951.1| PREDICTED: probable serine protease EDA2-lik...   838   0.0
ref|NP_001242784.1| uncharacterized protein LOC100805858 precurs...   831   0.0
ref|XP_003629354.1| Thymus-specific serine protease [Medicago tr...   808   0.0
ref|XP_003612122.1| Thymus-specific serine protease [Medicago tr...   788   0.0

>gb|ACU23369.1| unknown [Glycine max]
          Length = 490

 Score = 839 bits (2168), Expect = 0.0
 Identities = 398/477 (83%), Positives = 433/477 (90%)
 Frame = -3

Query: 1723 LLFLILSLAAVSCVFAFVPPRTLLNNLSEGKILTNEELWFNQTLDHFSPYDHRQFRQRYY 1544
            LL  + S  A+S  +  VPPRTLLN LS+G  LT +E WFNQTLDHFSPYDH QFRQRY+
Sbjct: 17   LLVFVSSFPALS--YGVVPPRTLLNKLSQGSYLTTQEQWFNQTLDHFSPYDHHQFRQRYF 74

Query: 1543 EFLDYFRVPDGPIFLVICGEGPCNGIANDYIGVLAKKFGAALVSLEHRYYGKSSPFDSLA 1364
            EFLDYFR+PDGPIFLVI GEGPCNGI NDYIGVLAKKFGAA+V+LEHRYYGKSSPF+SL
Sbjct: 75   EFLDYFRIPDGPIFLVIGGEGPCNGITNDYIGVLAKKFGAAMVTLEHRYYGKSSPFNSLE 134

Query: 1363 TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRSEVENPWFFFGGSYPGALSAWFRLKFP 1184
            TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNR+++ENPWF FGGSY GALSAWFRLKFP
Sbjct: 135  TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRTKIENPWFVFGGSYAGALSAWFRLKFP 194

Query: 1183 HLTCGSLASSAVVLAVYNYTEFDQQIGESAGAECKAALQETTQLIEKNLSTNGKALRAYF 1004
            HLTCGSLASSAVVLAVYN+TE+DQQIGESAGAECKA LQETTQLIE  L+TNGK L+A F
Sbjct: 195  HLTCGSLASSAVVLAVYNFTEYDQQIGESAGAECKAVLQETTQLIEHKLATNGKELKASF 254

Query: 1003 NADDLEIDGDFMYFLADAAVIAFQYGNPDKLCKPLVEAKKDEKDLVDAYAKYVKEYYVGT 824
            NADDLE DGDFMY +ADAA +AFQYGNPDK+CKP+VEAK   +DLVDAYAKYVKEYY+GT
Sbjct: 255  NADDLEKDGDFMYLIADAAAVAFQYGNPDKVCKPMVEAKNAGEDLVDAYAKYVKEYYIGT 314

Query: 823  FGVNVQTYDQKYLKKTAIGEDSSARLWWFQVCTEVAYFQVAPSNDSIRSSKVDTKYHLDL 644
            FGVNVQTYDQ+YLKKTAI EDSS RLWWFQVCTEVA+FQVAPSNDSIRSS++D KYH+DL
Sbjct: 315  FGVNVQTYDQEYLKKTAINEDSSTRLWWFQVCTEVAFFQVAPSNDSIRSSEIDAKYHMDL 374

Query: 643  CKNVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQISSPDMPSYTITCY 464
            CKN+FGEGIFPDVDATNLYYGGTKIAGSKI+F NGSQDPWRHASKQ SSPD+PSYTITC
Sbjct: 375  CKNIFGEGIFPDVDATNLYYGGTKIAGSKIVFANGSQDPWRHASKQTSSPDLPSYTITCS 434

Query: 463  NCGHCTDMRGCPQAPLVLEGNEKNCTSPDAVHKVRQKIQEQMDLWLSECQNTGWSFI 293
            NC HCTD RGCPQ+PLVLEGNEKNC+SPDAVHKVRQ+I E MDLWLSECQ  G S+I
Sbjct: 435  NCAHCTDFRGCPQSPLVLEGNEKNCSSPDAVHKVRQQITEHMDLWLSECQE-GSSYI 490

>ref|XP_003547951.1| PREDICTED: probable serine protease EDA2-like [Glycine max]
          Length = 490

 Score = 838 bits (2165), Expect = 0.0
 Identities = 398/477 (83%), Positives = 432/477 (90%)
 Frame = -3

Query: 1723 LLFLILSLAAVSCVFAFVPPRTLLNNLSEGKILTNEELWFNQTLDHFSPYDHRQFRQRYY 1544
            LL  + S  A+S  +  VPPRTLLN LS+G  LT +E WFNQTLDHFSPYDH QFRQRY+
Sbjct: 17   LLVFVSSFPALS--YGVVPPRTLLNKLSQGSYLTTQEQWFNQTLDHFSPYDHHQFRQRYF 74

Query: 1543 EFLDYFRVPDGPIFLVICGEGPCNGIANDYIGVLAKKFGAALVSLEHRYYGKSSPFDSLA 1364
            EFLDYFR+PDGPIFLVI GEGPCNGI NDYIGVLAKKFGAA+V+LEHRYYGKSSPF+SL
Sbjct: 75   EFLDYFRIPDGPIFLVIGGEGPCNGITNDYIGVLAKKFGAAMVTLEHRYYGKSSPFNSLE 134

Query: 1363 TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRSEVENPWFFFGGSYPGALSAWFRLKFP 1184
            TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNR++ ENPWF FGGSY GALSAWFRLKFP
Sbjct: 135  TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRTKTENPWFVFGGSYAGALSAWFRLKFP 194

Query: 1183 HLTCGSLASSAVVLAVYNYTEFDQQIGESAGAECKAALQETTQLIEKNLSTNGKALRAYF 1004
            HLTCGSLASSAVVLAVYN+TE+DQQIGESAGAECKA LQETTQLIE  L+TNGK L+A F
Sbjct: 195  HLTCGSLASSAVVLAVYNFTEYDQQIGESAGAECKAVLQETTQLIEHKLATNGKELKASF 254

Query: 1003 NADDLEIDGDFMYFLADAAVIAFQYGNPDKLCKPLVEAKKDEKDLVDAYAKYVKEYYVGT 824
            NADDLE DGDFMY +ADAA +AFQYGNPDK+CKP+VEAK   +DLVDAYAKYVKEYY+GT
Sbjct: 255  NADDLEKDGDFMYLIADAAAVAFQYGNPDKVCKPMVEAKNAGEDLVDAYAKYVKEYYIGT 314

Query: 823  FGVNVQTYDQKYLKKTAIGEDSSARLWWFQVCTEVAYFQVAPSNDSIRSSKVDTKYHLDL 644
            FGVNVQTYDQ+YLKKTAI EDSS RLWWFQVCTEVA+FQVAPSNDSIRSS++D KYH+DL
Sbjct: 315  FGVNVQTYDQEYLKKTAINEDSSTRLWWFQVCTEVAFFQVAPSNDSIRSSEIDAKYHMDL 374

Query: 643  CKNVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQISSPDMPSYTITCY 464
            CKN+FGEGIFPDVDATNLYYGGTKIAGSKI+F NGSQDPWRHASKQ SSPD+PSYTITC
Sbjct: 375  CKNIFGEGIFPDVDATNLYYGGTKIAGSKIVFANGSQDPWRHASKQTSSPDLPSYTITCS 434

Query: 463  NCGHCTDMRGCPQAPLVLEGNEKNCTSPDAVHKVRQKIQEQMDLWLSECQNTGWSFI 293
            NC HCTD RGCPQ+PLVLEGNEKNC+SPDAVHKVRQ+I E MDLWLSECQ  G S+I
Sbjct: 435  NCAHCTDFRGCPQSPLVLEGNEKNCSSPDAVHKVRQQITEHMDLWLSECQE-GSSYI 490

>ref|NP_001242784.1| uncharacterized protein LOC100805858 precursor [Glycine max]
 gi|255635884|gb|ACU18289.1| unknown [Glycine max]
          Length = 488

 Score = 831 bits (2147), Expect = 0.0
 Identities = 398/477 (83%), Positives = 428/477 (89%), Gaps = 4/477 (0%)
 Frame = -3

Query: 1711 ILSLAAVSCV----FAFVPPRTLLNNLSEGKILTNEELWFNQTLDHFSPYDHRQFRQRYY 1544
            +LSL  VS      +  VPPRTLLN LSEGK L  +ELWF+QTLDHFSPYDHRQFRQRYY
Sbjct: 12   LLSLLFVSSFPPLSYGVVPPRTLLNKLSEGKYLNTQELWFDQTLDHFSPYDHRQFRQRYY 71

Query: 1543 EFLDYFRVPDGPIFLVICGEGPCNGIANDYIGVLAKKFGAALVSLEHRYYGKSSPFDSLA 1364
            EFLDYFR+PDGPIFLVI GEG  NG+ANDY+ VLAKKFGAA+V+LEHRYYGKS+PF+SL
Sbjct: 72   EFLDYFRIPDGPIFLVIGGEGILNGVANDYLAVLAKKFGAAMVTLEHRYYGKSTPFNSLE 131

Query: 1363 TENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRSEVENPWFFFGGSYPGALSAWFRLKFP 1184
            TENLKYLSSKQAL DLAVFRQYYQDS+NAKLNR+++ENPWF FGGSY GALSAWFRLKFP
Sbjct: 132  TENLKYLSSKQALSDLAVFRQYYQDSINAKLNRAKIENPWFIFGGSYSGALSAWFRLKFP 191

Query: 1183 HLTCGSLASSAVVLAVYNYTEFDQQIGESAGAECKAALQETTQLIEKNLSTNGKALRAYF 1004
            HLTCGSLASSAVVLAVYNYTEFDQQIGESAG ECK ALQETTQLIE  L+T+GK L+A F
Sbjct: 192  HLTCGSLASSAVVLAVYNYTEFDQQIGESAGPECKEALQETTQLIEHKLATSGKELKASF 251

Query: 1003 NADDLEIDGDFMYFLADAAVIAFQYGNPDKLCKPLVEAKKDEKDLVDAYAKYVKEYYVGT 824
            +A DLEIDGDF YFLADA  IAFQYGNPDK+CKPLVEAKK  +DLVDAYAKYVKEYY+GT
Sbjct: 252  DAADLEIDGDFFYFLADATAIAFQYGNPDKVCKPLVEAKKAGEDLVDAYAKYVKEYYIGT 311

Query: 823  FGVNVQTYDQKYLKKTAIGEDSSARLWWFQVCTEVAYFQVAPSNDSIRSSKVDTKYHLDL 644
            FG +VQTYDQKYLK+TA+ ED+SARLWWFQVCTEVAYFQVAPSNDSIRSSKVD KYH DL
Sbjct: 312  FGTDVQTYDQKYLKRTAMNEDNSARLWWFQVCTEVAYFQVAPSNDSIRSSKVDIKYHFDL 371

Query: 643  CKNVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQISSPDMPSYTITCY 464
            CKNVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQ SSPDMPSY + CY
Sbjct: 372  CKNVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQTSSPDMPSYIVKCY 431

Query: 463  NCGHCTDMRGCPQAPLVLEGNEKNCTSPDAVHKVRQKIQEQMDLWLSECQNTGWSFI 293
            NCGHC+D RGCPQ P  +EGNEKNCTSPDAVHKVRQKI E MDLWLSEC +TG SFI
Sbjct: 432  NCGHCSDYRGCPQFPFSIEGNEKNCTSPDAVHKVRQKISEHMDLWLSECVDTGRSFI 488

>ref|XP_003629354.1| Thymus-specific serine protease [Medicago truncatula]
 gi|355523376|gb|AET03830.1| Thymus-specific serine protease [Medicago truncatula]
          Length = 455

 Score = 808 bits (2087), Expect = 0.0
 Identities = 378/448 (84%), Positives = 414/448 (92%)
 Frame = -3

Query: 1642 SEGKILTNEELWFNQTLDHFSPYDHRQFRQRYYEFLDYFRVPDGPIFLVICGEGPCNGIA 1463
            S G+ L+ + +WFNQTLDHFSPYDHRQFRQRYYEFLDYFR PDGPIFLVI GE  CNGI
Sbjct: 6    SLGRFLSTDVIWFNQTLDHFSPYDHRQFRQRYYEFLDYFRAPDGPIFLVIGGEATCNGIV 65

Query: 1462 NDYIGVLAKKFGAALVSLEHRYYGKSSPFDSLATENLKYLSSKQALFDLAVFRQYYQDSL 1283
            NDYIGVLAKKFGAA+VSLEHRYYG+S+PFD+ +TENLKYLSSKQALFDLAVFRQYYQDSL
Sbjct: 66   NDYIGVLAKKFGAAVVSLEHRYYGESTPFDTFSTENLKYLSSKQALFDLAVFRQYYQDSL 125

Query: 1282 NAKLNRSEVENPWFFFGGSYPGALSAWFRLKFPHLTCGSLASSAVVLAVYNYTEFDQQIG 1103
            NAKLNRS VENPWFFFGGSY GALSAWFRLKFPHLTCGSLASSAVVLAV ++ EFDQQIG
Sbjct: 126  NAKLNRSGVENPWFFFGGSYSGALSAWFRLKFPHLTCGSLASSAVVLAVQDFAEFDQQIG 185

Query: 1102 ESAGAECKAALQETTQLIEKNLSTNGKALRAYFNADDLEIDGDFMYFLADAAVIAFQYGN 923
            ESAG ECKA LQETTQL+E  L+ +GKALR+ FNADDLEIDGDF+Y+LADAAVIAFQYGN
Sbjct: 186  ESAGPECKAVLQETTQLVETKLADDGKALRSIFNADDLEIDGDFLYYLADAAVIAFQYGN 245

Query: 922  PDKLCKPLVEAKKDEKDLVDAYAKYVKEYYVGTFGVNVQTYDQKYLKKTAIGEDSSARLW 743
            PDKLCKPLV+AK   +DLVDAYAKYVKEYYVGTFG+  ++YDQ+YLKKTAI EDSS RLW
Sbjct: 246  PDKLCKPLVDAKNAGEDLVDAYAKYVKEYYVGTFGITPKSYDQEYLKKTAINEDSSTRLW 305

Query: 742  WFQVCTEVAYFQVAPSNDSIRSSKVDTKYHLDLCKNVFGEGIFPDVDATNLYYGGTKIAG 563
            WFQVCTEVAYFQVAPSNDSIRSSK+DTKYHLDLCKN+FG+G+FPDVDATNLYYGGTK+AG
Sbjct: 306  WFQVCTEVAYFQVAPSNDSIRSSKIDTKYHLDLCKNIFGDGVFPDVDATNLYYGGTKVAG 365

Query: 562  SKIIFTNGSQDPWRHASKQISSPDMPSYTITCYNCGHCTDMRGCPQAPLVLEGNEKNCTS 383
            SKIIFTNGSQDPWRHASKQ SSPD+PSY I C NCGHCTD+RGCPQ+PLV+EGNEKNC+S
Sbjct: 366  SKIIFTNGSQDPWRHASKQTSSPDLPSYLIKCNNCGHCTDLRGCPQSPLVIEGNEKNCSS 425

Query: 382  PDAVHKVRQKIQEQMDLWLSECQNTGWS 299
            PDAVHKVRQK+QE MDLWLSEC ++G S
Sbjct: 426  PDAVHKVRQKVQEDMDLWLSECIDSGRS 453

>ref|XP_003612122.1| Thymus-specific serine protease [Medicago truncatula]
 gi|355513457|gb|AES95080.1| Thymus-specific serine protease [Medicago truncatula]
          Length = 478

 Score = 788 bits (2034), Expect = 0.0
 Identities = 381/471 (80%), Positives = 420/471 (89%), Gaps = 1/471 (0%)
 Frame = -3

Query: 1714 LILSLAAVSCVFAFVPPRTLLNNLSEG-KILTNEELWFNQTLDHFSPYDHRQFRQRYYEF 1538
            L++ L  +S V A   P  L   LSE  + LT EELWF QTLDH+SPYDHR+F+QRYYEF
Sbjct: 6    LLVFLFFISTVSA--TPHLLRRRLSESARYLTKEELWFPQTLDHYSPYDHRKFQQRYYEF 63

Query: 1537 LDYFRVPDGPIFLVICGEGPCNGIANDYIGVLAKKFGAALVSLEHRYYGKSSPFDSLATE 1358
            LD+FR+PDGP+FLVICGE  C+GI NDYIGVLAKKFGAA+VSLEHRYYGKSSPF SLAT+
Sbjct: 64   LDHFRIPDGPVFLVICGEYSCDGIRNDYIGVLAKKFGAAVVSLEHRYYGKSSPFKSLATK 123

Query: 1357 NLKYLSSKQALFDLAVFRQYYQDSLNAKLNRSEVENPWFFFGGSYPGALSAWFRLKFPHL 1178
            NL+YLSSKQALFDLAVFRQ YQDSLNAKLNR+  +NPWF FG SYPGALSAWFRLKFPHL
Sbjct: 124  NLRYLSSKQALFDLAVFRQNYQDSLNAKLNRTNADNPWFVFGVSYPGALSAWFRLKFPHL 183

Query: 1177 TCGSLASSAVVLAVYNYTEFDQQIGESAGAECKAALQETTQLIEKNLSTNGKALRAYFNA 998
            TCGSLASSAVVLAVYN+TEFDQQIGESAG ECKAALQETT+LIE+ L TNGKAL+A FNA
Sbjct: 184  TCGSLASSAVVLAVYNFTEFDQQIGESAGVECKAALQETTRLIERKLVTNGKALKASFNA 243

Query: 997  DDLEIDGDFMYFLADAAVIAFQYGNPDKLCKPLVEAKKDEKDLVDAYAKYVKEYYVGTFG 818
             DLEIDGDF+YFLADAAV AFQYGNPD LCKPLV+AKKD +DLVDAYAK++KE+Y+GT G
Sbjct: 244  ADLEIDGDFLYFLADAAVTAFQYGNPDILCKPLVKAKKDGEDLVDAYAKFIKEFYLGTEG 303

Query: 817  VNVQTYDQKYLKKTAIGEDSSARLWWFQVCTEVAYFQVAPSNDSIRSSKVDTKYHLDLCK 638
             + Q Y+Q  LK  AI E+SS RLWWFQVCTEVAYFQVAPSNDSIRSSKVDT+YHLDLCK
Sbjct: 304  ESTQDYNQNNLKNAAITENSSGRLWWFQVCTEVAYFQVAPSNDSIRSSKVDTRYHLDLCK 363

Query: 637  NVFGEGIFPDVDATNLYYGGTKIAGSKIIFTNGSQDPWRHASKQISSPDMPSYTITCYNC 458
            NVFGEGIFPDVDATN+YYGGTKIAGSKI+FTNGSQDPWR ASKQISSP+MPSYTITC+NC
Sbjct: 364  NVFGEGIFPDVDATNIYYGGTKIAGSKIVFTNGSQDPWRRASKQISSPNMPSYTITCHNC 423

Query: 457  GHCTDMRGCPQAPLVLEGNEKNCTSPDAVHKVRQKIQEQMDLWLSECQNTG 305
            GH TDMRGCPQ+P  +EGNEKNCTSPDAVHKVRQKI E MDLWLS+CQ+TG
Sbjct: 424  GHGTDMRGCPQSPFNIEGNEKNCTSPDAVHKVRQKIIEHMDLWLSQCQDTG 474