BLASTX nr result
ID: Glycyrrhiza24_contig00001645
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00001645
         (2254 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_003542713.1| PREDICTED: uncharacterized protein LOC100806...   798   0.0
ref|XP_003529356.1| PREDICTED: uncharacterized protein LOC100798...   790   0.0
ref|XP_002522505.1| conserved hypothetical protein [Ricinus comm...   698   0.0
ref|XP_003537055.1| PREDICTED: uncharacterized protein LOC100820...   691   0.0
ref|XP_003541452.1| PREDICTED: uncharacterized protein LOC100784...   688   0.0

>ref|XP_003542713.1| PREDICTED: uncharacterized protein LOC100806521 [Glycine max]
          Length = 478

 Score = 798 bits (2061), Expect = 0.0
 Identities = 385/479 (80%), Positives = 415/479 (86%)
 Frame = -1

Query: 2032 MKKISKGSCKSASHRLFKDKAKNHVDDLQVMFLDLQFARKESRAVDAALLEEQVHQMLRE 1853
            MKK+ K SCKSASHRLFKDKAKN VDDLQ+MFLDLQFARKESR VDAA+LEEQVHQMLRE
Sbjct: 1    MKKVPKNSCKSASHRLFKDKAKNRVDDLQLMFLDLQFARKESRTVDAAVLEEQVHQMLRE 60

Query: 1852 WKAELNEPSPAXXXXXXXXXXSFSTDICRLLQLCEEEDDATSPLAVPKPEPDDQTLQADA 1673
            WKAELNEPSPA SFSTDICRLLQLCEEEDDA+S LA PKPEP+DQTLQ
Sbjct: 61   WKAELNEPSPASSLQQGGSLGSFSTDICRLLQLCEEEDDASSQLAAPKPEPNDQTLQVGG 120

Query: 1672 KVIFQEGQHQHDFLSIDESKHSTLGVPNIAANNSDGQSGVELECHQFDLNEDFKHSFYAG 1493
            KV+FQEGQ QHDF +DE KH T V N+AANN DG +LE HQFDL++D+ H FY G
Sbjct: 121  KVMFQEGQQQHDFPFVDECKHFTSSVQNVAANNPDGH---DLEYHQFDLHQDYDHGFYTG 177

Query: 1492 LNGTGFCEEDVVXXXXXXXXXXXXXXSAFLGPKCALWDCSRPMQGLDWCQDYCSSFHAAL 1313
            NGTG+CEED + SAFLGPKCALWDC RP QGLDWCQDYCSSFHAAL
Sbjct: 178  FNGTGYCEEDAIPHISSYLPSICPPPSAFLGPKCALWDCPRPAQGLDWCQDYCSSFHAAL 237

Query: 1312 ALNEGPQGMAPVLRPGGIGLKDNLLFAALSAKAEGKDVGIPECAGAATAKSPWNAPELFD 1133
            ALNEGP GMAPVLRPGGIGLKDNLLFAALSAKA+GKDVGIPEC GAATAKSPWNAPELFD
Sbjct: 238  ALNEGPPGMAPVLRPGGIGLKDNLLFAALSAKAQGKDVGIPECEGAATAKSPWNAPELFD 297

Query: 1132 LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGGMKRSYYM 953
            LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGG+KRSYYM
Sbjct: 298  LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGGLKRSYYM 357

Query: 952  DPQPLNRYEWHLYEYEISKCDACALYRLELKLVDGKKSSRAKITSDSVTDLQKRIGRLSA 773
            DPQPLN +EWHLYEYEISKCDACALYRLELKLVDGKK+S+ K +DSV DLQK++GRLSA
Sbjct: 358  DPQPLNHFEWHLYEYEISKCDACALYRLELKLVDGKKNSKTKAANDSVADLQKQMGRLSA 417

Query: 772  EFPYDNKRSAKGRAKLNAKVGIGGGVYSALNKVTPLNGTYEYGLAAPYDHLVDSTGDYF 596
            EFP+DNKR+AKGRAK+N KVGIGGGVYSA ++VTPLNGTYEYGLAAPYD+LVD+ GDY+
Sbjct: 418  EFPHDNKRAAKGRAKINTKVGIGGGVYSASHRVTPLNGTYEYGLAAPYDYLVDNMGDYY 476

>ref|XP_003529356.1| PREDICTED: uncharacterized protein LOC100798565 [Glycine max]
          Length = 477

 Score = 790 bits (2040), Expect = 0.0
 Identities = 384/479 (80%), Positives = 414/479 (86%)
 Frame = -1

Query: 2032 MKKISKGSCKSASHRLFKDKAKNHVDDLQVMFLDLQFARKESRAVDAALLEEQVHQMLRE 1853
            MKK+ K SCKSASHRLFKDKAKN VDDLQ+MFLDLQFARKESR VDA +LEEQVHQMLRE
Sbjct: 1    MKKVPKNSCKSASHRLFKDKAKNRVDDLQLMFLDLQFARKESRTVDAVVLEEQVHQMLRE 60

Query: 1852 WKAELNEPSPAXXXXXXXXXXSFSTDICRLLQLCEEEDDATSPLAVPKPEPDDQTLQADA 1673
            WKAELNEPSPA SFSTDICRLLQLCEEEDDA+SPLA PKPEP+DQTLQA
Sbjct: 61   WKAELNEPSPASSLQQGGSLGSFSTDICRLLQLCEEEDDASSPLAAPKPEPNDQTLQAGG 120

Query: 1672 KVIFQEGQHQHDFLSIDESKHSTLGVPNIAANNSDGQSGVELECHQFDLNEDFKHSFYAG 1493
            KV+FQEGQ QH F +DE KHST V N+AANN DG + LE HQFDL++D+ H Y G
Sbjct: 121  KVMFQEGQQQHYFPLVDECKHSTSSVQNVAANNPDGHA---LEYHQFDLHQDYDHGLYTG 177

Query: 1492 LNGTGFCEEDVVXXXXXXXXXXXXXXSAFLGPKCALWDCSRPMQGLDWCQDYCSSFHAAL 1313
            NGTG+CEED + SAFLGPKCALWDC RP QGLDWCQDYCSSFHAAL
Sbjct: 178  FNGTGYCEEDAIPHISSYLPSICPPPSAFLGPKCALWDCPRPAQGLDWCQDYCSSFHAAL 237

Query: 1312 ALNEGPQGMAPVLRPGGIGLKDNLLFAALSAKAEGKDVGIPECAGAATAKSPWNAPELFD 1133
            ALNEGP GMAPVLRPGGIGLKDNLLFAALSAKA+GKDVGIPEC GAATAKSPWNAPELFD
Sbjct: 238  ALNEGPPGMAPVLRPGGIGLKDNLLFAALSAKAQGKDVGIPECEGAATAKSPWNAPELFD 297

Query: 1132 LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGGMKRSYYM 953
            LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGG+KRSYYM
Sbjct: 298  LSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGGLKRSYYM 357

Query: 952  DPQPLNRYEWHLYEYEISKCDACALYRLELKLVDGKKSSRAKITSDSVTDLQKRIGRLSA 773
            DPQPLN +EWHLYEYEISKCDACALYRLELKLVDGKK+S+ K +DSV DLQK++GRLSA
Sbjct: 358  DPQPLNHFEWHLYEYEISKCDACALYRLELKLVDGKKNSKTKAANDSVADLQKQMGRLSA 417

Query: 772  EFPYDNKRSAKGRAKLNAKVGIGGGVYSALNKVTPLNGTYEYGLAAPYDHLVDSTGDYF 596
            EFP+DNKR+AKGRAK+NAKVGI GGVY A ++VTPLNGTYEYGLAAPYD+LVD+ GDY+
Sbjct: 418  EFPHDNKRAAKGRAKINAKVGI-GGVYPASHRVTPLNGTYEYGLAAPYDYLVDNMGDYY 475

>ref|XP_002522505.1| conserved hypothetical protein [Ricinus communis]
 gi|223538196|gb|EEF39805.1| conserved hypothetical protein [Ricinus communis]
          Length = 484

 Score = 698 bits (1802), Expect = 0.0
 Identities = 349/487 (71%), Positives = 390/487 (80%), Gaps = 7/487 (1%)
 Frame = -1

Query: 2032 MKKISKGSCKSASHRLFKDKAKNHVDDLQVMFLDLQFARKESRAVDAALLEEQVHQMLRE 1853
            M K SK +CKSASH+LFKDKAKN VDDLQ MF+DLQFARKESR+VD A+LEEQVHQMLRE
Sbjct: 1    MGKGSKINCKSASHKLFKDKAKNRVDDLQGMFMDLQFARKESRSVDVAVLEEQVHQMLRE 60

Query: 1852 WKAELNEPSPAXXXXXXXXXXSFSTDICRLLQLCEEEDDATSPLAVPKPEPDDQTLQADA 1673
            WKAELNEPSPA SFS+DICRLLQLCEEEDDATS LA PKPEP+D +LQ
Sbjct: 61   WKAELNEPSPASSLQHGASLGSFSSDICRLLQLCEEEDDATSALAAPKPEPNDHSLQIGN 120

Query: 1672 KVIFQE------GQHQHDFLSIDESKHSTLGVPNIAANNSDGQSGVELECHQFDLNEDFK 1511
            V+FQE GQ H F +D+ K S GV + NN +G G +LE H FDL+++++
Sbjct: 121  NVVFQEEFGVNQGQQNHSFPFVDQCKESPSGVHGMVVNNLEG--GAQLEFHHFDLSQNYE 178

Query: 1510 HSFYAGLNGTGFCEEDVVXXXXXXXXXXXXXXSAFLGPKCALWDCSRPMQG-LDWCQDYC 1334
            +FYA N T C ED V SAFLGPKCALWDC RP QG LDWCQDYC
Sbjct: 179  ESNFYADFNSTDLCAEDGVPQVSGYLPSICPPPSAFLGPKCALWDCPRPAQGGLDWCQDYC 238

Query: 1333 SSFHAALALNEGPQGMAPVLRPGGIGLKDNLLFAALSAKAEGKDVGIPECAGAATAKSPW 1154
            SSFH ALALNEGP GM+PVLRPGGIGLKD LLFAALSAKA+GKDVGIPEC GAATAKSPW
Sbjct: 239  SSFHHALALNEGPPGMSPVLRPGGIGLKDGLLFAALSAKAQGKDVGIPECEGAATAKSPW 298

Query: 1153 NAPELFDLSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGG 974
            NAPELFDLSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGG
Sbjct: 299  NAPELFDLSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMNEFGG 358

Query: 973  MKRSYYMDPQPLNRYEWHLYEYEISKCDACALYRLELKLVDGKKSSRAKITSDSVTDLQK 794
            +KRSYYMDPQPLN +EWHLYEYEI+KCDACALYRLELK VDGKK ++ KIT++SV DLQK
Sbjct: 359  LKRSYYMDPQPLNTFEWHLYEYEINKCDACALYRLELKAVDGKKGAKGKITNESVADLQK 418

Query: 793  RIGRLSAEFPYDNKRSAKGRAKLNAKVGIGGGVYSALNKVTPLNGTYEYGLAAPYDHLVD 614
            ++GRL+AEFP DNKRS KGR K++ KVG+ G VYS N+V P N TY+Y L PY++LVD
Sbjct: 419  QMGRLTAEFPSDNKRSVKGRTKVSVKVGV-GNVYSTTNRVVPTNETYDYEL-GPYNYLVD 476

Query: 613  STGDYFV 593
            + GDY+V
Sbjct: 477  NLGDYYV 483

>ref|XP_003537055.1| PREDICTED: uncharacterized protein LOC100820163 [Glycine max]
          Length = 479

 Score = 691 bits (1782), Expect = 0.0
 Identities = 367/499 (73%), Positives = 391/499 (78%), Gaps = 19/499 (3%)
 Frame = -1

Query: 2032 MKKISKGSCKSASHRLFKDKAKNHVDDLQVMFLDLQFARKESRAVDAALLEEQVHQMLRE 1853
            MKKISK SCKSASHRLFKDKA+NHVDDLQVMFLDLQFARKESR +DAALLEEQVHQMLRE
Sbjct: 1    MKKISKSSCKSASHRLFKDKARNHVDDLQVMFLDLQFARKESRTIDAALLEEQVHQMLRE 60

Query: 1852 WKAELNEPSPAXXXXXXXXXXSFSTDICRLLQLCEEEDDATSPLAVPKPEPDDQTLQADA 1673
            WKAELNE SPA SFSTD+ RLLQLCEEEDDATSPL PK EP+DQ +QA A
Sbjct: 61   WKAELNETSPASSLQQGGSLGSFSTDVYRLLQLCEEEDDATSPLVAPKSEPNDQIMQAGA 120

Query: 1672 KVIFQE--------------GQHQHDFLSIDESKHSTLGVPNIAANNSDGQSGVELECHQ 1535
            KVI QE GQHQ +F + E KHST+GVPNIAANN DG + LE HQ
Sbjct: 121  KVIVQEVSNCNEFLGLTVNQGQHQQEFRLLKECKHSTVGVPNIAANNLDGTA---LEYHQ 177

Query: 1534 FDLNEDFKHSFYAGLNGTGFCEEDVVXXXXXXXXXXXXXXSAFLGPKCALWDCSRPMQGL 1355
            FD+ +D HSFYAG TGFCEE V SAFLGPKCALWDC RP+QGL
Sbjct: 178  FDI-KDLDHSFYAG---TGFCEEGGVPHISSYLPSVCPPPSAFLGPKCALWDCPRPVQGL 233

Query: 1354 DWCQDYCSSFHAALALNEGPQGMAPVLRPGGIGLKDNLLFAALSAKAEGKDVGIPECAGA 1175
            DWCQDYCSSFHA LALNEGP GM PVLRPGGIGLKDNLLFAAL AKA+GK VGIPEC GA
Sbjct: 234  DWCQDYCSSFHATLALNEGPPGMTPVLRPGGIGLKDNLLFAALGAKAQGKVVGIPECEGA 293

Query: 1174 ATAKSPWNAPELFDLSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQ 995
            ATAKSPWNAPELFD VLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQ
Sbjct: 294  ATAKSPWNAPELFDTCVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQ 353

Query: 994  VMNEFGGMKRSYYMDPQPLNRYEWHLYEYEISKCDACALYRLELKLVDGKKSSRAKI--T 821
            VMNEFGG+KRSYYMDPQPLN +EWHLYEYEISKCDA ALYRLELKLVDGKKSS+AKI T
Sbjct: 354  VMNEFGGLKRSYYMDPQPLNLFEWHLYEYEISKCDARALYRLELKLVDGKKSSKAKIVVT 413

Query: 820  SDSVTDLQKRIGRLSAEFPYDNKRSAKGRAKLNAKVGIGGGVYSALNKVT-PLNGTYEYG 644
            SDSV LQK IG L SAKGR KL GIGGGVYSA N+V LN Y+YG
Sbjct: 414  SDSVAGLQKHIGSL----------SAKGRTKL----GIGGGVYSASNRVALQLNAPYQYG 459

Query: 643  L-AAPY-DHLVDSTGDYFV 593
            L ++PY D++VD+T DY+V
Sbjct: 460  LTSSPYDDYVVDNTRDYYV 478

>ref|XP_003541452.1| PREDICTED: uncharacterized protein LOC100784593 [Glycine max]
          Length = 470

 Score = 688 bits (1775), Expect = 0.0
 Identities = 357/493 (72%), Positives = 383/493 (77%), Gaps = 13/493 (2%)
 Frame = -1

Query: 2032 MKKISKGSCKSASHRLFKDKAKNHVDDLQVMFLDLQFARKESRAVDAALLEEQVHQMLRE 1853
            MKKISK SCK ASHRLFKDKA+NHVDDLQVMFLDLQFARKESR +DAALLEEQVHQMLRE
Sbjct: 1    MKKISKSSCKLASHRLFKDKARNHVDDLQVMFLDLQFARKESRTIDAALLEEQVHQMLRE 60

Query: 1852 WKAELNEPSPAXXXXXXXXXXSFSTDICRLLQLCEEEDDATSPLAVPKPEPDDQTLQADA 1673
            WKAELNEPSPA SFSTD+ RLLQLCEEEDDATSPL PKPEP+DQ +QA A
Sbjct: 61   WKAELNEPSPASSLQQGGSLGSFSTDVYRLLQLCEEEDDATSPLVAPKPEPNDQIMQAGA 120

Query: 1672 KVIFQE-----------GQHQHDFLSIDESKHSTLGVPNIAANNSDGQSGVELECHQFDL 1526
            KVI QE GQHQ + L + E K T+G PN+ G LE HQFD+
Sbjct: 121  KVIVQEVNEFLGLTINQGQHQQELLLLKECKPFTVGFPNL--------DGTALEYHQFDI 172

Query: 1525 NEDFKHSFYAGLNGTGFCEEDVVXXXXXXXXXXXXXXSAFLGPKCALWDCSRPMQGLDWC 1346
            N+D SFYAG TGFCEE V SAFLGPKCALWDC RP+QGLDWC
Sbjct: 173  NKDMDDSFYAG---TGFCEEGRVPHISSYLPSVCPPPSAFLGPKCALWDCPRPVQGLDWC 229

Query: 1345 QDYCSSFHAALALNEGPQGMAPVLRPGGIGLKDNLLFAALSAKAEGKDVGIPECAGAATA 1166
            QDYCSSFHAALALNEGP GM PVLRPGGIGLKDNLLFAALSAKA+GK VGIPEC GAATA
Sbjct: 230  QDYCSSFHAALALNEGPPGMTPVLRPGGIGLKDNLLFAALSAKAQGKVVGIPECEGAATA 289

Query: 1165 KSPWNAPELFDLSVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMN 986
            KSPWNAPELFD+ VLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMN
Sbjct: 290  KSPWNAPELFDICVLEGETIREWLFFDKPRRAFESGNRKQRSLPDYSGRGWHESRKQVMN 349

Query: 985  EFGGMKRSYYMDPQPLNRYEWHLYEYEISKCDACALYRLELKLVDGKKSSRAKI-TSDSV 809
            EFGG+KRSYYMDPQPLN +EWHLYEYEISKCD CALYRLELKLVDGKKSS+AKI T DSV
Sbjct: 350  EFGGLKRSYYMDPQPLNLFEWHLYEYEISKCDVCALYRLELKLVDGKKSSKAKIVTGDSV 409

Query: 808  TDLQKRIGRLSAEFPYDNKRSAKGRAKLNAKVGIGGGVYSALNKVTPLNGTYEYGLAAPY 629
            DLQK +G L SAKG K+GIGGGVYSA N+V LN Y+YGL +PY
Sbjct: 410  ADLQKHMGSL----------SAKGS---RTKLGIGGGVYSASNRVALLNAPYQYGLTSPY 456

Query: 628  -DHLVDSTGDYFV 593
            D++V +T DY+V
Sbjct: 457  DDYVVHNTLDYYV 469