BLASTX nr result
ID: Glycyrrhiza23_contig00004010
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00004010 (2224 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003521435.1| PREDICTED: uncharacterized protein LOC100809... 846 0.0 ref|XP_003553579.1| PREDICTED: uncharacterized protein LOC100808... 835 0.0 ref|XP_003537035.1| PREDICTED: uncharacterized protein LOC100804... 661 0.0 ref|XP_003590387.1| hypothetical protein MTR_1g061540 [Medicago ... 567 e-159 ref|XP_002528655.1| hypothetical protein RCOM_0841800 [Ricinus c... 498 e-138 >ref|XP_003521435.1| PREDICTED: uncharacterized protein LOC100809482 [Glycine max] Length = 570 Score = 846 bits (2185), Expect = 0.0 Identities = 435/571 (76%), Positives = 468/571 (81%), Gaps = 2/571 (0%) Frame = +2 Query: 380 MGGETLNGSWLSALWPVSRKNASDNKAVVGVLASEVAGLMLKVVNLWQGLSDGEVLSLRE 559 MGGET+NGSW S LWPVSRK+ASDNKAVVGVLA EVAGLMLKVVNLWQ LSD EVLSLRE Sbjct: 1 MGGETMNGSWFSVLWPVSRKSASDNKAVVGVLALEVAGLMLKVVNLWQSLSDAEVLSLRE 60 Query: 560 GTVNSVGVKMLVSENDDYLMELALNEILDNFQSLAMSVARLGKRCTDPVYHRFEHFVCNP 739 G VNSVGVK LVS++DDYLMELALNEILDNFQSLA SVARLGK+C DPVYH+FEHFV NP Sbjct: 61 GIVNSVGVKTLVSDDDDYLMELALNEILDNFQSLARSVARLGKKCVDPVYHQFEHFVHNP 120 Query: 740 AQNYVQWSGWEYXXXXXXXXXXXXXXFAAAMTQFCQELEVLAEVEQTFRRMQANPELHRV 919 AQNY QWS WEY F +AMTQFCQE+EVLAEVEQTFRRMQANP+LH+V Sbjct: 121 AQNYFQWSEWEYRWKKMERKVKKMEKFVSAMTQFCQEVEVLAEVEQTFRRMQANPDLHKV 180 Query: 920 KLLEFQKKVTSQRQEVRNLRDMSPWNRSYDYVVRLLAKSLFTILERIILVFGNNHLPTLQ 1099 K LEFQKKV RQEVRNLRDMSPW+RSYDYVVRLLA+SLFTILERIILVF N PT+Q Sbjct: 181 KFLEFQKKVMLHRQEVRNLRDMSPWSRSYDYVVRLLARSLFTILERIILVFAINQPPTVQ 240 Query: 1100 LENDSQNMNANNLLRXXXXXXXXXXXXXPSENDLYGFNSGPAGRRPASNSGFSVHKSKRK 1279 +ND Q+MNANNLLR PSENDLYGFNSGP G RP S SGF V K +RK Sbjct: 241 EQNDYQHMNANNLLR-SHSFSVMHSSVHPSENDLYGFNSGPVGGRPVSKSGFLVDKGRRK 299 Query: 1280 KEQQYALHPPDLSGKHIRSESKQLGHIGPFKSCMSLANDSPVIQSCVQTNGGSMRLADCH 1459 K+QQ ALH P L K++ SESKQLGHI FK CMS AN+SPVIQSC+QTNGGSMRL DC Sbjct: 300 KKQQQALHEPALFRKNLHSESKQLGHIVAFKGCMSAANNSPVIQSCMQTNGGSMRLTDCQ 359 Query: 1460 MKYVDKMKTVEKSSRSNRIRIYSKLCINNRLKPASFTLGDAALALHYANMIVLIERMASL 1639 +K DKMKTV+K S SNRIRIYSKL I N LKP S TLGDAALALHYANMIVLIERM S Sbjct: 360 LKSFDKMKTVDKLSLSNRIRIYSKLSIKNWLKPVSLTLGDAALALHYANMIVLIERMLSS 419 Query: 1640 PHLIDLETRDDLYNMLPTTIRTVLRAKLKCHAKGKSSS-VHDANLAAEWSPVLAQILEWL 1816 PHL+DL RDDLYNMLPTT+ T LRAKLKCHAK KSSS HDAN AAEWSPVLAQILEWL Sbjct: 420 PHLVDLAARDDLYNMLPTTVTTALRAKLKCHAKSKSSSNAHDANPAAEWSPVLAQILEWL 479 Query: 1817 APLAQNTIRWHSERNFEKEHTTEKANILLVQTLYFANQAKTEAAMVDLLVGLNYVCRVDT 1996 APLA N + WHSERNFEKEH+ AN+LLVQTLYFANQAKTEAA++DLLVGLNYVCR+DT Sbjct: 480 APLAHNMLSWHSERNFEKEHSVFNANVLLVQTLYFANQAKTEAAIIDLLVGLNYVCRIDT 539 Query: 1997 SVGMRDTLEFASTRSVNGLRLRK-GMYDGFL 2086 VG RDTL+ STRS NG+ LRK GMY FL Sbjct: 540 KVGTRDTLDCVSTRSFNGVHLRKNGMYTEFL 570 >ref|XP_003553579.1| PREDICTED: uncharacterized protein LOC100808409 [Glycine max] Length = 577 Score = 835 bits (2157), Expect = 0.0 Identities = 435/572 (76%), Positives = 468/572 (81%), Gaps = 3/572 (0%) Frame = +2 Query: 380 MGGETLNGSWLSALWPVSRKNASDNKAVVGVLASEVAGLMLKVVNLWQGLSDGEVLSLRE 559 MGGET+NGSW S LWPVSRK+ASDNKAVVGVLA EVAGLMLKVVNLWQ LSD EVLSLRE Sbjct: 1 MGGETVNGSWFSVLWPVSRKSASDNKAVVGVLALEVAGLMLKVVNLWQSLSDAEVLSLRE 60 Query: 560 GTVNSVGVKMLVSENDDYLMELALNEILDNFQSLAMSVARLGKRCTDPVYHRFEHFVCNP 739 G VNSVGVK LVS++DDYLMELALNEILDNFQSLA SVARLGK+C DPVYHRFEHFV NP Sbjct: 61 GIVNSVGVKTLVSDDDDYLMELALNEILDNFQSLARSVARLGKKCVDPVYHRFEHFVHNP 120 Query: 740 AQNYVQWSGWEYXXXXXXXXXXXXXXFAAAMTQFCQELEVLAEVEQTFRRMQANPELHRV 919 AQNY QWSGWEY F AAMTQ CQE+EVLAEVEQTFRRMQANPELH++ Sbjct: 121 AQNYFQWSGWEYRWKKMERKVKKMEKFVAAMTQLCQEVEVLAEVEQTFRRMQANPELHKL 180 Query: 920 KLLEFQKKVTSQRQEVRNLRDMSPWNRSYDYVVRLLAKSLFTILERIILVFGNNHLPTLQ 1099 KLLEFQKKV Q QEVRNLRDMSPWNRSYDYVVRLLA+SLFTILERIILVF NNH T+Q Sbjct: 181 KLLEFQKKVMLQCQEVRNLRDMSPWNRSYDYVVRLLARSLFTILERIILVFANNHPSTVQ 240 Query: 1100 LENDSQNMNANNLLRXXXXXXXXXXXXXPSENDLYGFNSGPAGRRPASNSGFSVHKSKRK 1279 +ND Q+MNANNLLR PSE+DL GFNSGP G RP S SGF V K +RK Sbjct: 241 EQNDYQHMNANNLLR-SHSFSVIHSSVHPSEHDLCGFNSGPVGGRPVSKSGFLVDKGRRK 299 Query: 1280 KEQQYALHPPDLSGKHIRSESKQLGHIGPFKSCMSLANDSPVIQSCVQTNGGSMRLADCH 1459 K+ Q A H P L ++ SESKQLGHI FK CMS AN+SPVIQSC+QTNGGSMRL DCH Sbjct: 300 KKLQQARHEPALFRNNLHSESKQLGHIVTFKGCMSAANNSPVIQSCMQTNGGSMRLTDCH 359 Query: 1460 MKYVDKMKTVEKSSRSNRIRIYSKLCINNRLKPASFTLGDAALALHYANMIVLIERMASL 1639 +K +DKMKTV+K S SNRIRIYSKL I NRLK +S TLGDAALALHYA MIVLIERMAS Sbjct: 360 LKSIDKMKTVDKLSPSNRIRIYSKLSIKNRLKASSLTLGDAALALHYAKMIVLIERMASS 419 Query: 1640 PHLIDLETRDDLYNMLPTTIRTVLRAKLKCHAKGKSSS-VHDANLAAEWSPVLAQILEWL 1816 PHL+DL RDDLYNMLPTT+RT LRAKLK H K KSSS HDANLAAEWSPVLAQIL+WL Sbjct: 420 PHLVDLAARDDLYNMLPTTVRTALRAKLKRHVKSKSSSNGHDANLAAEWSPVLAQILDWL 479 Query: 1817 APLAQNTIRWHSERNFEKEHTTEKANILLVQTLYFANQAKTEAAMVDLLVGLNYVCRVDT 1996 APLA N I WHSERNFEKE + N+LLVQTLYFANQ KTEAA++DLLV LNYVCRVDT Sbjct: 480 APLAHNMISWHSERNFEKEQSIFNTNVLLVQTLYFANQPKTEAAIIDLLVALNYVCRVDT 539 Query: 1997 SVGMRDTLEFA-STRSVNGLRLRK-GMYDGFL 2086 VG RDTL+ A STRS NG+RLRK GMY+ FL Sbjct: 540 KVGTRDTLDCANSTRSFNGVRLRKNGMYNEFL 571 >ref|XP_003537035.1| PREDICTED: uncharacterized protein LOC100804666 [Glycine max] Length = 583 Score = 661 bits (1705), Expect = 0.0 Identities = 355/573 (61%), Positives = 420/573 (73%), Gaps = 9/573 (1%) Frame = +2 Query: 380 MGGETLNGSWLSALWPVSRKNASDNKAVVGVLASEVAGLMLKVVNLWQGLSDGEVLSLRE 559 MGGET+NG+WLSA W VSRK+ASD K V+GVLA EVAGLM KVVNLW+ LSD E+++ + Sbjct: 1 MGGETVNGTWLSAFWSVSRKSASDGKEVIGVLAFEVAGLMSKVVNLWRSLSDREIMNTKA 60 Query: 560 GTVNSVGVKMLVSENDDYLMELALNEILDNFQSLAMSVARLGKRCTDPVYHRFEHFVCNP 739 + SVGVKMLVS++D +LM+LAL EIL+NF+SLA SVARL K+C PVYH +EHFV NP Sbjct: 61 WIMKSVGVKMLVSDDDYFLMDLALCEILNNFESLAWSVARLSKKCKGPVYHGYEHFVDNP 120 Query: 740 AQNYVQWSGWEYXXXXXXXXXXXXXXFAAAMTQFCQELEVLAEVEQTFRRMQANPELHRV 919 AQNY+QWSGWEY F A M+ QELEVLA+ EQTFRRM+AN ELH V Sbjct: 121 AQNYLQWSGWEYAWKKMERKVKKMDRFVACMSLLSQELEVLADREQTFRRMKANRELHGV 180 Query: 920 KLLEFQKKVTSQRQEVRNLRDMSPWNRSYDYVVRLLAKSLFTILERIILVFGNNHLPTLQ 1099 KLLEFQKKV QRQ+V+NLRDM+PWNRSYDYVVRLLA+SLFTILERII+VFGN+H+P Sbjct: 181 KLLEFQKKVMWQRQQVKNLRDMAPWNRSYDYVVRLLARSLFTILERIIVVFGNSHIPIEN 240 Query: 1100 LENDSQN----MNANNLLR-XXXXXXXXXXXXXPSENDLYGFNSGPAGRRPASNSGFSV- 1261 +NDS + N N L R PS+ + YGF S P + NSGF V Sbjct: 241 QQNDSLSPPVTTNNNRLTRSHSFSTLRHTTSVHPSKTNSYGFCSQPIESKSVLNSGFEVD 300 Query: 1262 -HKSKRKKEQQYALHPPDLSGKHIRSESKQLGHIGPFKSCMSLANDSPVIQSCVQTNGGS 1438 KSK+KK++Q LH SESKQ HI PF MS+ N SP +QSCV T GGS Sbjct: 301 KSKSKKKKKEQQVLH----------SESKQFEHIVPFTGFMSVGNKSPFVQSCVPTKGGS 350 Query: 1439 MRLADCHMKYVDKMKTVEKSSRSNRIRIYSKLCINNRLKPASFTLGDAALALHYANMIVL 1618 MRL DCH+K D MKTV+KSS R RIY KL + RLKP TLGDAALALHYAN+IVL Sbjct: 351 MRLVDCHVKNNDNMKTVDKSSLICRTRIYLKLSMKGRLKPGPSTLGDAALALHYANVIVL 410 Query: 1619 IERM-ASLPHLIDLETRDDLYNMLPTTIRTVLRAKLKCHAKGKSSSVHDANLAAEWSPVL 1795 IE+M S PHLID ETRDDLYNMLPTTIRT LR KLK +AK + ++VH+A+LA EWS V+ Sbjct: 411 IEKMVVSAPHLIDHETRDDLYNMLPTTIRTALRGKLKWYAKSQRATVHEASLAVEWSMVV 470 Query: 1796 AQILEWLAPLAQNTIRWHSERNFEKEHTTEKA-NILLVQTLYFANQAKTEAAMVDLLVGL 1972 AQILEWLAPLA N I+WHSERNFE+E KA N+LLV TLYFA+QAK EAAMV+LLVG+ Sbjct: 471 AQILEWLAPLAHNMIKWHSERNFEREQCASKAKNVLLVHTLYFADQAKAEAAMVELLVGV 530 Query: 1973 NYVCRVDTSVGMRDTLEFASTRSVNGLRLRKGM 2071 +YVCR+D R+ EFA +R++NG+RLRK + Sbjct: 531 HYVCRID-----REAQEFAGSRALNGVRLRKNV 558 >ref|XP_003590387.1| hypothetical protein MTR_1g061540 [Medicago truncatula] gi|92870930|gb|ABE80130.1| Protein of unknown function DUF668 [Medicago truncatula] gi|355479435|gb|AES60638.1| hypothetical protein MTR_1g061540 [Medicago truncatula] Length = 529 Score = 567 bits (1462), Expect = e-159 Identities = 309/550 (56%), Positives = 381/550 (69%), Gaps = 6/550 (1%) Frame = +2 Query: 380 MGGETLNGSWLSALWPVSRKNASDNKAVVGVLASEVAGLMLKVVNLWQGLSDGEVLSLRE 559 M GET+N +WL +WPVSRK+ SD +G++A EVAGLM KVVNLW LSD E+++LRE Sbjct: 1 MKGETVNVTWLGGIWPVSRKSGSDENNEIGIMAFEVAGLMSKVVNLWHSLSDNELMNLRE 60 Query: 560 GTVNSVGVKMLVSENDDYLMELALNEILDNFQSLAMSVARLGKRCTDPVYHRFEHFVCNP 739 V+SVGVKMLVS+++ +LMEL NEIL+NFQSL+ SVARL K+C DP+YH +E FV NP Sbjct: 61 WIVSSVGVKMLVSDDEYFLMELTRNEILNNFQSLSQSVARLSKKCKDPMYHSYESFVHNP 120 Query: 740 AQNYVQWSGWEYXXXXXXXXXXXXXXFAAAMTQFCQELEVLAEVEQTFRRMQANPE-LHR 916 +NYVQWSGWEY F +++ QELEVLAE EQT RRM+ + +++ Sbjct: 121 FENYVQWSGWEYRLKKMEKKVKKMERFVCSLSLLSQELEVLAECEQTLRRMKLTRDVVNK 180 Query: 917 VKLLEFQKKVTSQRQEVRNLRDMSPWNRSYDYVVRLLAKSLFTILERIILVFGNNHLPTL 1096 KLLEFQKKV QRQ+V+N+RD+SPW+RSYDY+VRLLA+SLFTILERIILVFGN+HLP Sbjct: 181 AKLLEFQKKVMCQRQQVQNVRDLSPWSRSYDYIVRLLARSLFTILERIILVFGNSHLPIE 240 Query: 1097 QLENDSQNMNANNLLRXXXXXXXXXXXXXPSENDLYGFNSGPAGRRPASNSGFSVHKSKR 1276 L+ND+ N A N P E +L F SGP GR+ S K+ Sbjct: 241 NLKNDTNNRLARNHSSPALHVMHSSVHPSP-ETNLNEFCSGPIGRKNKS--------KKK 291 Query: 1277 KKEQQYALHPPDLS-GKHIRSESKQLGHIGPFKSCMSLANDSPVIQSCVQTNGGSMRLAD 1453 KK+Q LH D S K + SE KQL +IG FK C+S+ NDS V+QSC+ +NG SMR Sbjct: 292 KKDQPVLLHSQDSSCEKLLPSEGKQLTYIGSFKGCISVQNDSHVVQSCIPSNGSSMR--- 348 Query: 1454 CHMKYVD--KMKTVEKSSRSNRIRIYSKLCINNRLKPASFTLGDAALALHYANMIVLIER 1627 K +D V K S +R R+Y KL + +LKP TLGDAALA+HYAN+IVLIE+ Sbjct: 349 ---KNIDVNTKSLVNKPSLFHRSRVYFKLSLKEKLKPIPSTLGDAALAIHYANVIVLIEK 405 Query: 1628 MAS--LPHLIDLETRDDLYNMLPTTIRTVLRAKLKCHAKGKSSSVHDANLAAEWSPVLAQ 1801 + S + ID+ TRDDLYN LPTTIRT LR KLK +AK K L EW+ VL Q Sbjct: 406 IVSSRRTNTIDVRTRDDLYNKLPTTIRTALRGKLKWYAKSK--------LETEWNVVLKQ 457 Query: 1802 ILEWLAPLAQNTIRWHSERNFEKEHTTEKANILLVQTLYFANQAKTEAAMVDLLVGLNYV 1981 ILEWLAPLA N ++W+SERNFEKE+T+ KAN+LLVQTLYFANQAKTEAAMV+LLVGL+YV Sbjct: 458 ILEWLAPLAHNMVKWYSERNFEKEYTSLKANVLLVQTLYFANQAKTEAAMVELLVGLHYV 517 Query: 1982 CRVDTSVGMR 2011 CR+D R Sbjct: 518 CRIDVETRFR 527 >ref|XP_002528655.1| hypothetical protein RCOM_0841800 [Ricinus communis] gi|223531944|gb|EEF33758.1| hypothetical protein RCOM_0841800 [Ricinus communis] Length = 576 Score = 498 bits (1281), Expect = e-138 Identities = 279/570 (48%), Positives = 363/570 (63%), Gaps = 5/570 (0%) Frame = +2 Query: 374 KSMGGETLNGSWLSALWPVSRKNA-SDNKAVVGVLASEVAGLMLKVVNLWQGLSDGEVLS 550 + MG E SW S LW SRK+A KA +GVLA EVA LM KV LW L + E+ Sbjct: 7 REMGTE----SWFSNLWWNSRKDALQTEKAAIGVLAFEVASLMSKVAKLWHFLGENEMFR 62 Query: 551 LREGTVNSVGVKMLVSENDDYLMELALNEILDNFQSLAMSVARLGKRCTDPVYHRFEHFV 730 LR +NS+G++ LVS+ DDYLM+LALNEI++NF L+ SVARLG+RC DP + RFEHFV Sbjct: 63 LRGDILNSIGIQKLVSDKDDYLMDLALNEIMENFGLLSRSVARLGRRCIDPHFRRFEHFV 122 Query: 731 CNPAQNYVQWSGWEYXXXXXXXXXXXXXXFAAAMTQFCQELEVLAEVEQTFRRMQANPEL 910 +P N ++W GWEY F A Q QELE+LAE+EQT RRM+ANP L Sbjct: 123 NDPLANNLEWIGWEYRLTKMERKVKKMERFVAVTMQLSQELEILAELEQTLRRMRANPVL 182 Query: 911 HRVKLLEFQKKVTSQRQEVRNLRDMSPWNRSYDYVVRLLAKSLFTILERIILVFGNNHLP 1090 R KLLE Q+KV QRQEVRNLR+MSPW R+YDY+VRLLA+SL TIL+RI+ VF + LP Sbjct: 183 SRRKLLEMQQKVMWQRQEVRNLREMSPWIRTYDYIVRLLARSLLTILQRIMNVFEISQLP 242 Query: 1091 TLQLENDSQNMNANNLLRXXXXXXXXXXXXXPSENDLYGFNSGPAGRRPASNSGFSVHKS 1270 + + D ++MN+ R PSEN L G + GP GR S SG + + Sbjct: 243 SPEENIDQEHMNSTRFPRSLSFSVLMQSSIYPSENFLCGISPGPPGRLD-SKSGVTSANN 301 Query: 1271 KRKKEQQYALHPPDLSGKHIRSESKQLGHIGPFKSCMSLANDSPVIQSCVQTNGGSMRLA 1450 K K Q+ LH S R ++ QL H+G FK CM+ + SP++Q+ GS + Sbjct: 302 KVNKTQRQLLH--QSSTVLPRFKTNQLAHVGLFKECMTSGSRSPILQAGKPAFCGSAKFT 359 Query: 1451 DCHMKYVDKMKTVEKSSRSNRIRIYSKLCI----NNRLKPASFTLGDAALALHYANMIVL 1618 +M DKM+ + RIY KL + + L S TLG AALALHYAN+IV Sbjct: 360 VDYMTVADKMENLNLWPFICSNRIYYKLALFSSKHGLLNAPSSTLGHAALALHYANVIVF 419 Query: 1619 IERMASLPHLIDLETRDDLYNMLPTTIRTVLRAKLKCHAKGKSSSVHDANLAAEWSPVLA 1798 IE++AS P+ +D ETRDDLYNMLPTTIR LR++LK + K S+S +DA+LA EWS L Sbjct: 420 IEKLASSPYTVDYETRDDLYNMLPTTIRAALRSRLKAYGKALSTSAYDASLAQEWSLALT 479 Query: 1799 QILEWLAPLAQNTIRWHSERNFEKEHTTEKANILLVQTLYFANQAKTEAAMVDLLVGLNY 1978 +LEWL+PLA + I+WHSERNFE++ + N+LL+QTL++ANQAKTEAA+V+LLVGLNY Sbjct: 480 YMLEWLSPLAHDMIKWHSERNFERDQEVSRTNVLLLQTLHYANQAKTEAAIVELLVGLNY 539 Query: 1979 VCRVDTSVGMRDTLEFASTRSVNGLRLRKG 2068 +C ++ + + E + R VN +KG Sbjct: 540 ICTINQDLDEKGWPESSRCR-VNSFIHQKG 568