BLASTX nr result

ID: Glycyrrhiza23_contig00022257 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00022257
         (1935 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003627939.1| Flavonol sulfotransferase-like protein [Medi...   304   2e-98
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...   258   3e-66
ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783...   245   3e-62
ref|XP_003551446.1| PREDICTED: uncharacterized protein LOC100819...   244   8e-62
ref|XP_003549005.1| PREDICTED: uncharacterized protein LOC100789...   243   1e-61

>ref|XP_003627939.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
            gi|355521961|gb|AET02415.1| Flavonol
            sulfotransferase-like protein [Medicago truncatula]
          Length = 640

 Score =  304 bits (779), Expect(2) = 2e-98
 Identities = 150/337 (44%), Positives = 230/337 (68%), Gaps = 8/337 (2%)
 Frame = -2

Query: 1385 MTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWIMHSLDSDVAQSVLWMTVA 1206
            + V+L +K+K+ F++G++P+P  +  +S+AW RCN+M++SW+ +S+D  ++QS+LWM  A
Sbjct: 42   LQVALRSKHKLHFINGALPRPCDDDHDSIAWDRCNTMIMSWLSNSVDPQISQSILWMDTA 101

Query: 1205 SEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQELDNYRPIPSCT 1026
             E+W+EL++R+YQGD+FRI ++QEEIY+L+QG+ SI+SY+T++K LWQELDN RPIP+  
Sbjct: 102  LEIWNELKERFYQGDIFRISDLQEEIYTLKQGESSISSYYTKMKKLWQELDNVRPIPTSN 161

Query: 1025 CTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDINIVFSLLIQQE 846
            C   C   +    R Y+D+D VIRFL+GLNEQ+++VRSQIML  P+P I  V+ LL+QQE
Sbjct: 162  CVDDCKAIAK--MREYKDSDQVIRFLKGLNEQFSAVRSQIMLMDPLPSIGKVYYLLVQQE 219

Query: 845  RQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTS------GGKSSSDKRKFNNKVCTYCN 684
            RQ+   + E ++LA   +S  G    GRG    S      GG+SS  + K   + C++C 
Sbjct: 220  RQVVIPLDESKLLAVSNNSFSGHSSYGRGHMNASRGSGDRGGRSSYGRGK-GIRACSFCG 278

Query: 683  KLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDTCSGSF 504
            K  HTVD C++KYG PP++++     INNCT   D   +++E  S   ED+  D+ +  F
Sbjct: 279  KSNHTVDTCFKKYGFPPHYQQENS--INNCT---DVSGNEEEQNSAHFEDDQNDSTTEKF 333

Query: 503  NFTAEQRDVLLAILQNQSS--SHATNQVSTSRQNSSG 399
            + T+EQ   LLA+LQN S+  SH+ N ++T+  + +G
Sbjct: 334  SLTSEQHKALLALLQNSSTLPSHSLNHITTTPASRTG 370



 Score = 83.6 bits (205), Expect(2) = 2e-98
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 7/97 (7%)
 Frame = -3

Query: 406 VQDINTLKMIGVAESKDGLYYLSAETLQL---PPTKTVNTFHSNT----CNIWHLRFGHP 248
           +Q  ++ KMIG AE +DGLY L    + +   P    +N   + T    CN+WH+RFGH 
Sbjct: 406 IQAKHSQKMIGAAELEDGLYLLKTPLVSIVHTPYLHCINNVKNMTLNKDCNLWHMRFGHA 465

Query: 247 SHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPF 137
           SH+KL ++   FP I    ++ PC+ C +AKQKRLPF
Sbjct: 466 SHDKLIEIKKKFPCISIDTSSDPCDICFYAKQKRLPF 502


>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score =  258 bits (660), Expect = 3e-66
 Identities = 138/346 (39%), Positives = 205/346 (59%), Gaps = 13/346 (3%)
 Frame = -2

Query: 1439 LSSVLTSSGENYNSWARSMTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWI 1260
            +S VL S+  NY+SW+RSM  +L  KNK+ F+DGS P+PL       AW RCN+MV+SWI
Sbjct: 373  VSPVLDST--NYHSWSRSMVTALSAKNKVEFIDGSAPEPLKTDRMHGAWCRCNNMVVSWI 430

Query: 1259 MHSLDSDVAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTR 1080
            +HS+ + + QS+LWM  A E+W +L+ RY QGD+ RI ++Q+E  +++QG +++T YFT 
Sbjct: 431  VHSVATSIRQSILWMDKAEEIWRDLKSRYSQGDLLRISDLQQEASTMKQGTLTVTEYFTC 490

Query: 1079 LKGLWQELDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIML 900
            L+ +W E++N+RP P C+C I+C+C + T     +  D  ++FLRGLNEQYA++RS ++L
Sbjct: 491  LRVIWDEIENFRPDPICSCNIRCSCNAFTIIAQRKLEDRAMQFLRGLNEQYANIRSHVLL 550

Query: 899  SKPIPDINIVFSLLIQQERQM--------NAEMQEPRVLA--NVADSRGGPKGKGRGQNV 750
              PIP I+ +FS + QQERQ+        N E ++  + A   V D  G           
Sbjct: 551  MDPIPTISKIFSYVAQQERQLLGNTGPGINFEPKDISINAAKTVCDFCGRIGHVESTCYK 610

Query: 749  TSGGKSSSDKRKFNN--KVCTYCNKLGHTVDKCYQKYGHPPNFKKNEGK-MINNCTQDND 579
              G  S+ D R  +N  K CT+C K+GHTVD CY+K+G+PP +K   G+  +NN      
Sbjct: 611  KHGVPSNYDARNKSNGRKACTHCGKIGHTVDVCYRKHGYPPGYKPYSGRTTVNNVVAVES 670

Query: 578  QFKDDDESESIVQEDNMKDTCSGSFNFTAEQRDVLLAILQNQSSSH 441
            +  DD        E            F+ EQ   LLA++Q  S+ +
Sbjct: 671  KATDDQAQHHESHE---------FVRFSPEQYKALLALIQEPSAGN 707



 Score =  120 bits (300), Expect = 2e-24
 Identities = 64/142 (45%), Positives = 89/142 (62%), Gaps = 8/142 (5%)
 Frame = -3

Query: 406  VQDINTLKMIGVAESKDGLYYLSAETLQLPPTKTVN-TFHSNTCNI-----WHLRFGHPS 245
            +Q++N    IG+ E+K GLY+L    L    TK VN T     CN+     WH R GHPS
Sbjct: 831  LQEMNNHMKIGIVEAKHGLYHLIPNQLT---TKAVNSTITHPRCNVIPIDLWHFRLGHPS 887

Query: 244  HEKLSQLHDLFPFIVCSKNNKP--CETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWGPL 71
             E++  +   +P +   +NNK   C TCH+AK K++PF  S+S ++  FDL+HMDI GP 
Sbjct: 888  AERIQCMKTYYPLL---RNNKNFVCNTCHYAKHKKMPFSLSNSHASHAFDLLHMDIRGPC 944

Query: 70   AHPSMMGHRYFLTVVDNKTRFT 5
            + PSM GH+YFLT+VD+ +RFT
Sbjct: 945  SKPSMHGHKYFLTIVDDCSRFT 966


>ref|XP_003557045.1| PREDICTED: uncharacterized protein LOC100783177 [Glycine max]
          Length = 2219

 Score =  245 bits (625), Expect = 3e-62
 Identities = 126/338 (37%), Positives = 205/338 (60%), Gaps = 1/338 (0%)
 Frame = -2

Query: 1415 GENYNSWARSMTVSLDTKNKIGFVDGSIPQPL-ANSANSLAWKRCNSMVLSWIMHSLDSD 1239
            G NY+SWARS+  +L  K K  F+DG+IP P+ A   +  AW RCN ++ SWI++S++  
Sbjct: 233  GSNYHSWARSLRRALGAKLKFEFLDGTIPMPVDAFDPSFRAWNRCNMLIHSWILNSVEPS 292

Query: 1238 VAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQE 1059
            +++S+++M  AS+VW +L++R+ QGD+ R+ E+Q+EIY+L QG  S+T++++ LK LW+E
Sbjct: 293  ISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQGTRSVTTFYSDLKALWEE 352

Query: 1058 LDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDI 879
            L+ Y PIP+CTC  +C+C +   AR +  T +V+RFL GLN+++ +V+SQI+L +P+P I
Sbjct: 353  LEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDEFNAVKSQILLIEPLPSI 412

Query: 878  NIVFSLLIQQERQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTSGGKSSSDKRKFNNKV 699
              +FS++IQ ERQ          + N+ DS+            ++ G+S+S  +++    
Sbjct: 413  TKIFSMVIQFERQ--------NCVPNLDDSKALVNASTSKSQGSANGRSNSGSKRY---- 460

Query: 698  CTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDT 519
            CTYC+K  H V+ C+QK+G PP+  KN     ++   D  +  +     S   ++    T
Sbjct: 461  CTYCHKTNHFVENCFQKHGVPPHMMKNHSGSAHHSAVDGGERVE----SSTASQNTTSVT 516

Query: 518  CSGSFNFTAEQRDVLLAILQNQSSSHATNQVSTSRQNS 405
             + S   T EQ D LL ++Q  S +H     S   Q S
Sbjct: 517  MTPS--LTQEQLDKLLQLIQPPSVNHCNASTSKQEQKS 552


>ref|XP_003551446.1| PREDICTED: uncharacterized protein LOC100819074 [Glycine max]
          Length = 1750

 Score =  244 bits (622), Expect = 8e-62
 Identities = 127/336 (37%), Positives = 207/336 (61%), Gaps = 1/336 (0%)
 Frame = -2

Query: 1415 GENYNSWARSMTVSLDTKNKIGFVDGSIPQPL-ANSANSLAWKRCNSMVLSWIMHSLDSD 1239
            G NY+SWARS+  +L  K K  F+DG+IP P+ A   +  AW RCN ++ SWI++S++  
Sbjct: 40   GSNYHSWARSLRRALGAKLKFEFLDGTIPMPVDAFDPSFRAWNRCNMLIHSWILNSVEPS 99

Query: 1238 VAQSVLWMTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQE 1059
            +++S+++M  AS+VW +L++R+ QGD+ R+ E+Q+EIY+L QG  S+T++++ LK LW+E
Sbjct: 100  ISRSIVFMDNASDVWLDLKERFSQGDLVRVSEIQQEIYALTQGTRSVTTFYSDLKALWEE 159

Query: 1058 LDNYRPIPSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDI 879
            L+ Y PIP+CTC  +C+C +   AR +  T +V+RFL GLN+++ +V+SQI+L +P+P I
Sbjct: 160  LEIYMPIPNCTCHHRCSCDAMRLARRHHHTLHVMRFLTGLNDEFNAVKSQILLIEPLPSI 219

Query: 878  NIVFSLLIQQERQMNAEMQEPRVLANVADSRGGPKGKGRGQNVTSGGKSSSDKRKFNNKV 699
              +FS++IQ ERQ          + N+ DS+            ++ G+S+S  +++    
Sbjct: 220  TKIFSMVIQFERQ--------NCVPNLDDSKALVNASTSKSQGSANGRSNSGSKRY---- 267

Query: 698  CTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDNDQFKDDDESESIVQEDNMKDT 519
            CTYC+K  H V+ C+QK+G PP+  KN     ++   D  +  +     S   ++    T
Sbjct: 268  CTYCHKTNHFVENCFQKHGVPPHMMKNHSGSAHHSAVDGGERVE----SSTASQNTTSVT 323

Query: 518  CSGSFNFTAEQRDVLLAILQNQSSSHATNQVSTSRQ 411
             + S   T EQ D LL ++Q  S +H     STS+Q
Sbjct: 324  MTPS--LTQEQLDKLLQLIQPPSVNHC--NASTSKQ 355



 Score =  122 bits (305), Expect = 4e-25
 Identities = 59/143 (41%), Positives = 87/143 (60%), Gaps = 8/143 (5%)
 Frame = -3

Query: 409 TVQDINTLKMIGVAESKDGLYYLSAETLQLPPTK--------TVNTFHSNTCNIWHLRFG 254
           ++Q+  +LKMIG+ ES+DGLYYL+    +   +         + N  H     IWH R G
Sbjct: 478 SIQEQKSLKMIGLGESRDGLYYLTQTNKECASSNYNISSIFSSANNVHIPENAIWHFRLG 537

Query: 253 HPSHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWGP 74
           H S  +++ LH  FPFIV + ++  C+ CHFAK ++LPF  S   + + FDL+H DIWGP
Sbjct: 538 HLSSSRIALLHSQFPFIV-NDSSSVCDICHFAKHRKLPFVHSYHKAIKCFDLIHFDIWGP 596

Query: 73  LAHPSMMGHRYFLTVVDNKTRFT 5
           ++  S+  H YFLT VD+ +R+T
Sbjct: 597 ISIKSVHNHAYFLTAVDDHSRYT 619


>ref|XP_003549005.1| PREDICTED: uncharacterized protein LOC100789964 [Glycine max]
          Length = 2412

 Score =  243 bits (621), Expect = 1e-61
 Identities = 131/354 (37%), Positives = 201/354 (56%), Gaps = 21/354 (5%)
 Frame = -2

Query: 1397 WARSMTVSLDTKNKIGFVDGSIPQPLANSANSLAWKRCNSMVLSWIMHSLDSDVAQSVLW 1218
            W RSM V+L +KNK+ FVDG++  P  +      W RCN +VLSW+  S+  ++A+S+LW
Sbjct: 293  WCRSMKVALISKNKVKFVDGTLSPPPISDPLYEPWLRCNKLVLSWLQRSISEEIAKSLLW 352

Query: 1217 MTVASEVWSELRQRYYQGDVFRICEVQEEIYSLRQGDVSITSYFTRLKGLWQELDNYRPI 1038
               AS VW  L  R+ QGD+FR+ ++QEE+  L+QG + I+SYFT+L   W+E++N+ PI
Sbjct: 353  CDRASLVWKSLANRFSQGDIFRVADIQEEVARLQQGTLDISSYFTKLMTPWEEIENFCPI 412

Query: 1037 PSCTCTIQCTCASNTAARGYRDTDYVIRFLRGLNEQYASVRSQIMLSKPIPDINIVFSLL 858
              CTC I C+C + T  R +++ D VI+FL+G+ +QY+ VRSQIML  P+P ++  F+L+
Sbjct: 413  RDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGIGDQYSHVRSQIMLMSPLPTLDNAFNLI 472

Query: 857  IQQERQMN------AEMQEPRVLANVADSRGGPK---GKGRGQNVTSGGKSSSDKRKFNN 705
            +QQERQ N      + ++    + + + +   P    G+G G+  +SGG+         N
Sbjct: 473  LQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNFGRGCGRGYSSGGR--------GN 524

Query: 704  KVCTYCNKLGHTVDKCYQKYGHPPNFKKNEGKMINNCTQDND-QFKDDDESESIVQEDNM 528
            ++CT+CN+  HTV+ C+ K+G+PP F+  +     N +  N  Q        S       
Sbjct: 525  RLCTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASVVNSVQDAGSAHISSSSSASTS 584

Query: 527  KDTCSGSFNFTAEQRDVLLAILQNQ----------SSSHATNQVS-TSRQNSSG 399
             +  S S +   EQ   +L +LQ            +S  ATN VS TS   SSG
Sbjct: 585  TNGSSASLSTIQEQYTQILQLLQQSNLQSTSPSSVNSVFATNFVSHTSPSPSSG 638



 Score =  118 bits (295), Expect = 6e-24
 Identities = 64/144 (44%), Positives = 84/144 (58%), Gaps = 4/144 (2%)
 Frame = -3

Query: 424  PHLDKTVQDINTLKMIGVAESKDGLYYLSAETLQL---PPTKTVNTFHSNT-CNIWHLRF 257
            P+  K VQ ++  KM G    + G  Y+   T  L   P   +VNT  S+    IWH R 
Sbjct: 732  PNSCKIVQLLHP-KMTGFTARRIGKLYVLDTTSPLVFSPTPGSVNTSISHDPATIWHFRL 790

Query: 256  GHPSHEKLSQLHDLFPFIVCSKNNKPCETCHFAKQKRLPFPDSDSVSAQMFDLVHMDIWG 77
            GH S      +   FPF+  + N+KPC TCH AKQ+ LPF  S + S  +FDL+H DIWG
Sbjct: 791  GHLSSHIHKCISSYFPFVTFNDNHKPCNTCHLAKQRNLPFAHSTTKSVAIFDLIHADIWG 850

Query: 76   PLAHPSMMGHRYFLTVVDNKTRFT 5
            PL+ PS+ GH+YFLT+VD+  RFT
Sbjct: 851  PLSTPSISGHKYFLTLVDDYNRFT 874


Top