BLASTX nr result
ID: Glycyrrhiza23_contig00022257
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00022257
         (1935 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_003627939.1| Flavonol sulfotransferase-like protein [Medi...   304   2e-98
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...   258   3e-66
ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783...   245   3e-62
ref|XP_003551446.1| PREDICTED: uncharacterized protein LOC100819...   244   8e-62
ref|XP_003549005.1| PREDICTED: uncharacterized protein LOC100789...   243   1e-61

>ref|XP_003627939.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
 gi|355521961|gb|AET02415.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
          Length = 640

 Score = 304 bits (779), Expect(2) = 2e-98
 Identities = 150/337 (44%), Positives = 230/337 (68%), Gaps = 8/337 (2%)
 Frame = -2

Query: 1385 MTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWIMHSLDSDVAQSVLWMTVA 1206
            + V+L +K+K+ F++G++P+P + +S+AW RCN+M++SW+ +S+D ++QS+LWM A
Sbjct: 42   LQVALRSKHKLHFINGALPRPCDDDHDSIAWDRCNTMIMSWLSNSVDPQISQSILWMDTA 101

Query: 1205 SEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQELDNYRPIPSCT 1026
            E+W+EL++R+YQGD+FRI ++QEEIY+L+QG+ SI+SY+T++K LWQELDN RPIP+
Sbjct: 102  LEIWNELKERFYQGDIFRISDLQEEIYTLKQGESSISSYYTKMKKLWQELDNVRPIPTSN 161

Query: 1025 CTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDINIVFSLLIQQE 846
            C C + R Y+D+D VIRFL+GLNEQ+++VRSQIML P+P I V+ LL+QQE
Sbjct: 162  CVDDCKAIAK--MREYKDSDQVIRFLKGLNEQFSAVRSQIMLMDPLPSIGKVYYLLVQQE 219

Query: 845  RQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTS------GGKSSSDKRKFNNKVCTYCN 684
            RQ+ + E ++LA +S G GRG S GG+SS + K + C++C
Sbjct: 220  RQVVIPLDESKLLAVSNNSFSGHSSYGRGHMNASRGSGDRGGRSSYGRGK-GIRACSFCG 278

Query: 683  KLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDTCSGSF 504
            K HTVD C++KYG PP++++ INNCT D +++E S ED+ D+ + F
Sbjct: 279  KSNHTVDTCFKKYGFPPHYQQENS--INNCT---DVSGNEEEQNSAHFEDDQNDSTTEKF 333

Query: 503  NFTAEQRDVLLAILQNQSS--SHATNQVSTSRQNSSG 399
            + T+EQ LLA+LQN S+ SH+ N ++T+ + +G
Sbjct: 334  SLTSEQHKALLALLQNSSTLPSHSLNHITTTPASRTG 370

 Score = 83.6 bits (205), Expect(2) = 2e-98
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 7/97 (7%)
 Frame = -3

Query: 406  VQDINTLKMIGVAESKDGLYYLSAETLQL---PPTKTVNTFHSNT----CNIWHLRFGHP 248
            +Q ++ KMIG AE +DGLY L + + P +N + T CN+WH+RFGH
Sbjct: 406  IQAKHSQKMIGAAELEDGLYLLKTPLVSIVHTPYLHCINNVKNMTLNKDCNLWHMRFGHA 465

Query: 247  SHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPF 137
            SH+KL ++ FP I ++ PC+ C +AKQKRLPF
Sbjct: 466  SHDKLIEIKKKFPCISIDTSSDPCDICFYAKQKRLPF 502

>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score = 258 bits (660), Expect = 3e-66
 Identities = 138/346 (39%), Positives = 205/346 (59%), Gaps = 13/346 (3%)
 Frame = -2

Query: 1439 LSSVLTSSGENYNSWARSMTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWI 1260
            +S VL S+ NY+SW+RSM +L KNK+ F+DGS P+PL AW RCN+MV+SWI
Sbjct: 373  VSPVLDST--NYHSWSRSMVTALSAKNKVEFIDGSAPEPLKTDRMHGAWCRCNNMVVSWI 430

Query: 1259 MHSLDSDVAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTR 1080
            +HS+ + + QS+LWM A E+W +L+ RY QGD+ RI ++Q+E +++QG +++T YFT
Sbjct: 431  VHSVATSIRQSILWMDKAEEIWRDLKSRYSQGDLLRISDLQQEASTMKQGTLTVTEYFTC 490

Query: 1079 LKGLWQELDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIML 900
            L+ +W E++N+RP P C+C I+C+C + T + D ++FLRGLNEQYA++RS ++L
Sbjct: 491  LRVIWDEIENFRPDPICSCNIRCSCNAFTIIAQRKLEDRAMQFLRGLNEQYANIRSHVLL 550

Query: 899  SKPIPDINIVFSLLIQQERQM--------NAEMQEPRVLA--NVADSRGGPKGKGRGQNV 750
            PIP I+ +FS + QQERQ+ N E ++ + A V D G
Sbjct: 551  MDPIPTISKIFSYVAQQERQLLGNTGPGINFEPKDISINAAKTVCDFCGRIGHVESTCYK 610

Query: 749  TSGGKSSSDKRKFNN--KVCTYCNKLGHTVDKCYQKYGHPPNFKKNEGK-MINNCTQDND 579
            G S+ D R +N K CT+C K+GHTVD CY+K+G+PP +K G+ +NN
Sbjct: 611  KHGVPSNYDARNKSNGRKACTHCGKIGHTVDVCYRKHGYPPGYKPYSGRTTVNNVVAVES 670

Query: 578  QFKDDDESESIVQEDNMKDTCSGSFNFTAEQRDVLLAILQNQSSSH 441
            + DD E F+ EQ LLA++Q S+ +
Sbjct: 671  KATDDQAQHHESHE---------FVRFSPEQYKALLALIQEPSAGN 707

 Score = 120 bits (300), Expect = 2e-24
 Identities = 64/142 (45%), Positives = 89/142 (62%), Gaps = 8/142 (5%)
 Frame = -3

Query: 406  VQDINTLKMIGVAESKDGLYYLSAETLQLPPTKTVN-TFHSNTCNI-----WHLRFGHPS 245
            +Q++N IG+ E+K GLY+L L TK VN T CN+ WH R GHPS
Sbjct: 831  LQEMNNHMKIGIVEAKHGLYHLIPNQLT---TKAVNSTITHPRCNVIPIDLWHFRLGHPS 887

Query: 244  HEKLSQLHDLFPFIVCSKNNKP--CETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWGPL 71
            E++ + +P + +NNK C TCH+AK K++PF S+S ++ FDL+HMDI GP
Sbjct: 888  AERIQCMKTYYPLL---RNNKNFVCNTCHYAKHKKMPFSLSNSHASHAFDLLHMDIRGPC 944

Query: 70   AHPSMMGHRYFLTVVDNKTRFT 5
            + PSM GH+YFLT+VD+ +RFT
Sbjct: 945  SKPSMHGHKYFLTIVDDCSRFT 966

>ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783177 [Glycine max]
          Length = 2219

 Score = 245 bits (625), Expect = 3e-62
 Identities = 126/338 (37%), Positives = 205/338 (60%), Gaps = 1/338 (0%)
 Frame = -2

Query: 1415 GENYNSWARSMTVSLDTKNKIGFVDGSIPQPL-ANSANSLAWKRCNSMVLSWIMHSLDSD 1239
            G NY+SWARS+ +L K K F+DG+IP P+ A + AW RCN ++ SWI++S++
Sbjct: 233  GSNYHSWARSLRRALGAKLKFEFLDGTIPMPVDAFDPSFRAWNRCNMLIHSWILNSVEPS 292

Query: 1238 VAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQE 1059
            +++S+++M AS+VW +L++R+ QGD+ R+ E+Q+EIY+L QG S+T++++ LK LW+E
Sbjct: 293  ISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQGTRSVTTFYSDLKALWEE 352

Query: 1058 LDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDI 879
            L+ Y PIP+CTC +C+C + AR + T +V+RFL GLN+++ +V+SQI+L +P+P I
Sbjct: 353  LEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDEFNAVKSQILLIEPLPSI 412

Query: 878  NIVFSLLIQQERQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTSGGKSSSDKRKFNNKV 699
            +FS++IQ ERQ + N+ DS+ ++ G+S+S +++
Sbjct: 413  TKIFSMVIQFERQ--------NCVPNLDDSKALVNASTSKSQGSANGRSNSGSKRY---- 460

Query: 698  CTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDT 519
            CTYC+K H V+ C+QK+G PP+ KN ++ D + + S ++ T
Sbjct: 461  CTYCHKTNHFVENCFQKHGVPPHMMKNHSGSAHHSAVDGGERVE----SSTASQNTTSVT 516

Query: 518  CSGSFNFTAEQRDVLLAILQNQSSSHATNQVSTSRQNS 405
            + S T EQ D LL ++Q S +H S Q S
Sbjct: 517  MTPS--LTQEQLDKLLQLIQPPSVNHCNASTSKQEQKS 552

>ref|XP_003551446.1| PREDICTED: uncharacterized protein LOC100819074 [Glycine max]
          Length = 1750

 Score = 244 bits (622), Expect = 8e-62
 Identities = 127/336 (37%), Positives = 207/336 (61%), Gaps = 1/336 (0%)
 Frame = -2

Query: 1415 GENYNSWARSMTVSLDTKNKIGFVDGSIPQPL-ANSANSLAWKRCNSMVLSWIMHSLDSD 1239
            G NY+SWARS+ +L K K F+DG+IP P+ A + AW RCN ++ SWI++S++
Sbjct: 40   GSNYHSWARSLRRALGAKLKFEFLDGTIPMPVDAFDPSFRAWNRCNMLIHSWILNSVEPS 99

Query: 1238 VAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQE 1059
            +++S+++M AS+VW +L++R+ QGD+ R+ E+Q+EIY+L QG S+T++++ LK LW+E
Sbjct: 100  ISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQGTRSVTTFYSDLKALWEE 159

Query: 1058 LDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDI 879
            L+ Y PIP+CTC +C+C + AR + T +V+RFL GLN+++ +V+SQI+L +P+P I
Sbjct: 160  LEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDEFNAVKSQILLIEPLPSI 219

Query: 878  NIVFSLLIQQERQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTSGGKSSSDKRKFNNKV 699
            +FS++IQ ERQ + N+ DS+ ++ G+S+S +++
Sbjct: 220  TKIFSMVIQFERQ--------NCVPNLDDSKALVNASTSKSQGSANGRSNSGSKRY---- 267

Query: 698  CTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDT 519
            CTYC+K H V+ C+QK+G PP+ KN ++ D + + S ++ T
Sbjct: 268  CTYCHKTNHFVENCFQKHGVPPHMMKNHSGSAHHSAVDGGERVE----SSTASQNTTSVT 323

Query: 518  CSGSFNFTAEQRDVLLAILQNQSSSHATNQVSTSRQ 411
            + S T EQ D LL ++Q S +H STS+Q
Sbjct: 324  MTPS--LTQEQLDKLLQLIQPPSVNHC--NASTSKQ 355

 Score = 122 bits (305), Expect = 4e-25
 Identities = 59/143 (41%), Positives = 87/143 (60%), Gaps = 8/143 (5%)
 Frame = -3

Query: 409  TVQDINTLKMIGVAESKDGLYYLSAETLQLPPTK--------TVNTFHSNTCNIWHLRFG 254
            ++Q+ +LKMIG+ ES+DGLYYL+ + + + N H IWH R G
Sbjct: 478  SIQEQKSLKMIGLGESRDGLYYLTQTNKECASSNYNISSIFSSANNVHIPENAIWHFRLG 537

Query: 253  HPSHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWGP 74
            H S +++ LH FPFIV + ++ C+ CHFAK ++LPF S + + FDL+H DIWGP
Sbjct: 538  HLSSSRIALLHSQFPFIV-NDSSSVCDICHFAKHRKLPFVHSYHKAIKCFDLIHFDIWGP 596

Query: 73   LAHPSMMGHRYFLTVVDNKTRFT 5
            ++ S+ H YFLT VD+ +R+T
Sbjct: 597  ISIKSVHNHAYFLTAVDDHSRYT 619

>ref|XP_003549005.1| PREDICTED: uncharacterized protein LOC100789964 [Glycine max]
          Length = 2412

 Score = 243 bits (621), Expect = 1e-61
 Identities = 131/354 (37%), Positives = 201/354 (56%), Gaps = 21/354 (5%)
 Frame = -2

Query: 1397 WARSMTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWIMHSLDSDVAQSVLW 1218
            W RSM V+L +KNK+ FVDG++ P + W RCN +VLSW+ S+ ++A+S+LW
Sbjct: 293  WCRSMKVALISKNKVKFVDGTLSPPPISDPLYEPWLRCNKLVLSWLQRSISEEIAKSLLW 352

Query: 1217 MTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQELDNYRPI 1038
            AS VW L R+ QGD+FR+ ++QEE+ L+QG + I+SYFT+L W+E++N+ PI
Sbjct: 353  CDRASLVWKSLANRFSQGDIFRVADIQEEVARLQQGTLDISSYFTKLMTPWEEIENFCPI 412

Query: 1037 PSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDINIVFSLL 858
            CTC I C+C + T R +++ D VI+FL+G+ +QY+ VRSQIML P+P ++ F+L+
Sbjct: 413  RDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGIGDQYSHVRSQIMLMSPLPTLDNAFNLI 472

Query: 857  IQQERQMN------AEMQEPRVLANVADSRGGPK---GKGRGQNVTSGGKSSSDKRKFNN 705
            +QQERQ N + ++ + + + + P G+G G+ +SGG+ N
Sbjct: 473  LQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNFGRGCGRGYSSGGR--------GN 524

Query: 704  KVCTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDND-QFKDDDESESIVQEDNM 528
            ++CT+CN+ HTV+ C+ K+G+PP F+ + N + N Q S
Sbjct: 525  RLCTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASVVNSVQDAGSAHISSSSSASTS 584

Query: 527  KDTCSGSFNFTAEQRDVLLAILQNQ----------SSSHATNQVS-TSRQNSSG 399
            + S S + EQ +L +LQ +S ATN VS TS SSG
Sbjct: 585  TNGSSASLSTIQEQYTQILQLLQQSNLQSTSPSSVNSVFATNFVSHTSPSPSSG 638

 Score = 118 bits (295), Expect = 6e-24
 Identities = 64/144 (44%), Positives = 84/144 (58%), Gaps = 4/144 (2%)
 Frame = -3

Query: 424  PHLDKTVQDINTLKMIGVAESKDGLYYLSAETLQL---PPTKTVNTFHSNT-CNIWHLRF 257
            P+ K VQ ++ KM G + G Y+ T L P +VNT S+ IWH R
Sbjct: 732  PNSCKIVQLLHP-KMTGFTARRIGKLYVLDTTSPLVFSPTPGSVNTSISHDPATIWHFRL 790

Query: 256  GHPSHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWG 77
            GH S + FPF+ + N+KPC TCH AKQ+ LPF S + S +FDL+H DIWG
Sbjct: 791  GHLSSHIHKCISSYFPFVTFNDNHKPCNTCHLAKQRNLPFAHSTTKSVAIFDLIHADIWG 850

Query: 76   PLAHPSMMGHRYFLTVVDNKTRFT 5
            PL+ PS+ GH+YFLT+VD+ RFT
Sbjct: 851  PLSTPSISGHKYFLTLVDDYNRFT 874