BLASTX nr result

ID: Glycyrrhiza23_contig00001477 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00001477
         (2327 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003519334.1| PREDICTED: uncharacterized protein LOC100795...   764   0.0  
ref|XP_003544992.1| PREDICTED: uncharacterized protein LOC100805...   736   0.0  
ref|XP_003519335.1| PREDICTED: uncharacterized protein LOC100795...   664   0.0  
ref|XP_003633736.1| PREDICTED: uncharacterized protein LOC100853...   586   e-164
emb|CAN74204.1| hypothetical protein VITISV_021204 [Vitis vinifera]   575   e-161

>ref|XP_003519334.1| PREDICTED: uncharacterized protein LOC100795617 isoform 1 [Glycine
            max]
          Length = 592

 Score =  764 bits (1972), Expect = 0.0
 Identities = 419/609 (68%), Positives = 465/609 (76%), Gaps = 16/609 (2%)
 Frame = -2

Query: 2113 MDENSALIEAILREQEEEQANLRGNR------KEQNGNVNEWQTVSYKKRNRNKSASKQP 1952
            MDE SALIEAILREQEEE+      R      K  N N NEWQTVSY KRNRN++ +++P
Sbjct: 1    MDETSALIEAILREQEEEEEEAHRRRRNHTTIKNNNNNNNEWQTVSYTKRNRNRNNNRKP 60

Query: 1951 LAADD---DGFSSDVFRSVEQHSEERRHRLMEAQIAAASTES-----SGSKLHSYDDIEE 1796
            LA D+   D  SSDVF SV++HSE+RR RL+++QIAAA   +     S SK HS D+ E+
Sbjct: 61   LADDNFAADPSSSDVFSSVQRHSEDRRLRLLKSQIAAAEAAAAEATPSRSKRHS-DNEED 119

Query: 1795 EDKDSYKQLNRNGSSSDSXXXXXXXXXXXXXXKQPKVTVAEAASGINADDLGAFLAEITA 1616
             D +   ++ +                      +PKVTVAEAASGI+ADDL AFLAEITA
Sbjct: 120  GDAEPEAEVKKAKQKKPK---------------KPKVTVAEAASGISADDLDAFLAEITA 164

Query: 1615 SYEDSQQDVQLMRFADYFGRAFSSVGGAQFPWVKTFRESTVAAIVDIPLLHISGDVYKIS 1436
            SYE SQQD+ LMRFADYFGRAFSSV GAQFPW+KTF+ESTVA IVDIPLLHIS D+YKIS
Sbjct: 165  SYE-SQQDIMLMRFADYFGRAFSSVSGAQFPWLKTFKESTVAKIVDIPLLHISEDIYKIS 223

Query: 1435 TDWIGHRSSEALGSFVLWSLDSFLADFASHQGNXXXXXXXXXXXXXXXXVAIFVVLAMVL 1256
            TDW+ HRS EALGSFVLWSLDS LAD ASHQG                 VA+FVVLAMVL
Sbjct: 224  TDWVSHRSYEALGSFVLWSLDSILADLASHQGVVKGSKKAVQQSSPKSQVAMFVVLAMVL 283

Query: 1255 RRKPDVMISLLPRMKESQKYQGQDKLPVTVWVIAQASQADLAVGLYLWVSLLLPMLNGKS 1076
            RRKPDV+ISLLP +KE++KYQGQDKLPV VWVI QASQ DL +GLYLWV LLLPML+ KS
Sbjct: 284  RRKPDVLISLLPIIKENKKYQGQDKLPVIVWVITQASQGDLVMGLYLWVYLLLPMLSVKS 343

Query: 1075 GCNPQSRDLILQLVERIITFPKARPILINGAVRKGERVVPPWALDSLLRATFPLPSARVK 896
            GCNPQSRDLILQLVERIIT PKAR IL+NGAVR+GERVVPPWALDSLLR TFPLPSARVK
Sbjct: 344  GCNPQSRDLILQLVERIITSPKARSILLNGAVRRGERVVPPWALDSLLRVTFPLPSARVK 403

Query: 895  ATERFGAVYPILKEVALAGSPGSKAIKHLAQQILSFAIKAAGEANRDLSKEACDIFIWCL 716
            ATERF AVYP L+EVALA SPGSKAIKHLAQQILSFAIKAAGEAN DLSKEA DIFIWCL
Sbjct: 404  ATERFEAVYPTLREVALASSPGSKAIKHLAQQILSFAIKAAGEANSDLSKEASDIFIWCL 463

Query: 715  TQNPECFKQWDLLYMDNLEASIVVLRKLSDEWK-HIVKHDTLHPFTETLKSFSQKNEKAL 539
            TQNPEC+KQWD LYMDNLEAS+VVLRKLS EWK + VKH TL P  E LKSFSQKNEKAL
Sbjct: 464  TQNPECYKQWDFLYMDNLEASVVVLRKLSGEWKEYFVKHPTLDPLRENLKSFSQKNEKAL 523

Query: 538  AKV-DGARDALLKDADKYCKAILRQMSQGRGXXXXXXXXXXXXXXXXXFLSQNMHFLDYQ 362
            AKV DGAR ALLKDADKYCK +L Q+SQG G                 F+SQN+H  DY 
Sbjct: 524  AKVDDGARHALLKDADKYCKVLLGQLSQGHGCLKSMIVLSVVLAVGAVFMSQNLHLWDYS 583

Query: 361  KLSEMWNLS 335
            +L+EM NLS
Sbjct: 584  QLTEMLNLS 592


>ref|XP_003544992.1| PREDICTED: uncharacterized protein LOC100805286 isoform 1 [Glycine
            max]
          Length = 588

 Score =  736 bits (1901), Expect = 0.0
 Identities = 416/608 (68%), Positives = 458/608 (75%), Gaps = 15/608 (2%)
 Frame = -2

Query: 2113 MDENSALIEAILREQEEEQANLRGNRKE---QNGNV---NEWQTVSYKKRNRN--KSASK 1958
            MDE SALIEAILREQEEE+      R+    QN  +   N+WQTVSY KRNRN  KS+SK
Sbjct: 1    MDETSALIEAILREQEEEEEEAHRRRRNLTTQNTTIKSNNQWQTVSYHKRNRNNNKSSSK 60

Query: 1957 QPLAADDDGFSSDVFRSVEQHSEERRHRLMEAQIA-----AASTESSGSKLHSYDDIEEE 1793
            QPLAAD    S DVF SV++HSE  R RL+E+QIA     AA+   S SK HS D   E+
Sbjct: 61   QPLAADP---SPDVFSSVQRHSEHSRRRLLESQIASEAEAAAAAAPSRSKRHSDD---ED 114

Query: 1792 DKDSYKQLNRNGSSSDSXXXXXXXXXXXXXXKQPKVTVAEAASGINADDLGAFLAEITAS 1613
            D D+           ++              K+PKVTVAEAAS I+ADDL AFLAEITAS
Sbjct: 115  DGDA---------EHEASAVQEVKKAKQKKPKKPKVTVAEAASRISADDLDAFLAEITAS 165

Query: 1612 YEDSQQDVQLMRFADYFGRAFSSVGGAQFPWVKTFRESTVAAIVDIPLLHISGDVYKIST 1433
            YE SQQD+ LMRFADYFGRAFSSV  AQFPW+KTF+ESTVA IVDIPLLHIS D+YKIST
Sbjct: 166  YE-SQQDIMLMRFADYFGRAFSSVSAAQFPWLKTFKESTVAKIVDIPLLHISEDIYKIST 224

Query: 1432 DWIGHRSSEALGSFVLWSLDSFLADFASHQGNXXXXXXXXXXXXXXXXVAIFVVLAMVLR 1253
            DWI HRS EALGSFVLWSLDS L+D ASHQG                 VA+FVVL MVLR
Sbjct: 225  DWISHRSYEALGSFVLWSLDSILSDLASHQG----VKKAVQQSSSKSQVAMFVVLTMVLR 280

Query: 1252 RKPDVMISLLPRMKESQKYQGQDKLPVTVWVIAQASQADLAVGLYLWVSLLLPMLNGKSG 1073
            RKPDV+ISLLP +KE++KYQGQDKLPV VWVI QASQ DL +GLYLWV LLLPML+ KSG
Sbjct: 281  RKPDVLISLLPILKENKKYQGQDKLPVIVWVITQASQGDLVMGLYLWVYLLLPMLSVKSG 340

Query: 1072 CNPQSRDLILQLVERIITFPKARPILINGAVRKGERVVPPWALDSLLRATFPLPSARVKA 893
            CNPQSRDLILQLVERIITFPKA  IL++GAVRKGERVVPPWALDSLLR TFPL SARVKA
Sbjct: 341  CNPQSRDLILQLVERIITFPKAHSILLSGAVRKGERVVPPWALDSLLRVTFPLHSARVKA 400

Query: 892  TERFGAVYPILKEVALAGSPGSKAIKHLAQQILSFAIKAAGEANRDLSKEACDIFIWCLT 713
            TERF AVYP L+EVALAGSPGSKAIKHLAQQILSFAIKAAG+AN DLSKEA DIFIWCLT
Sbjct: 401  TERFEAVYPTLREVALAGSPGSKAIKHLAQQILSFAIKAAGKANLDLSKEASDIFIWCLT 460

Query: 712  QNPECFKQWDLLYMDNLEASIVVLRKLSDEWK-HIVKHDTLHPFTETLKSFSQKNEKALA 536
            QNPEC+KQWDLLYMDNLEASIVVLR LS EWK + +KH TL P  ETLKSFSQKNEKALA
Sbjct: 461  QNPECYKQWDLLYMDNLEASIVVLRILSGEWKEYFIKHPTLDPLRETLKSFSQKNEKALA 520

Query: 535  KV-DGARDALLKDADKYCKAILRQMSQGRGXXXXXXXXXXXXXXXXXFLSQNMHFLDYQK 359
            K  D AR ALLKDADKYCKA+L ++SQ  G                 F+ QN+H  DY +
Sbjct: 521  KADDAARHALLKDADKYCKALLGRLSQDHGCMKSVTILSVVFAVGAIFVYQNLHLWDYSQ 580

Query: 358  LSEMWNLS 335
            L+EM NLS
Sbjct: 581  LTEMLNLS 588


>ref|XP_003519335.1| PREDICTED: uncharacterized protein LOC100795617 isoform 2 [Glycine
            max]
          Length = 546

 Score =  664 bits (1714), Expect = 0.0
 Identities = 379/609 (62%), Positives = 422/609 (69%), Gaps = 16/609 (2%)
 Frame = -2

Query: 2113 MDENSALIEAILREQEEEQANLRGNR------KEQNGNVNEWQTVSYKKRNRNKSASKQP 1952
            MDE SALIEAILREQEEE+      R      K  N N NEWQTVSY KRNRN++ +++P
Sbjct: 1    MDETSALIEAILREQEEEEEEAHRRRRNHTTIKNNNNNNNEWQTVSYTKRNRNRNNNRKP 60

Query: 1951 LAADD---DGFSSDVFRSVEQHSEERRHRLMEAQIAAASTES-----SGSKLHSYDDIEE 1796
            LA D+   D  SSDVF SV++HSE+RR RL+++QIAAA   +     S SK HS D+ E+
Sbjct: 61   LADDNFAADPSSSDVFSSVQRHSEDRRLRLLKSQIAAAEAAAAEATPSRSKRHS-DNEED 119

Query: 1795 EDKDSYKQLNRNGSSSDSXXXXXXXXXXXXXXKQPKVTVAEAASGINADDLGAFLAEITA 1616
             D +   ++ +                      +PKVTVAEAASGI+ADDL AFLAEIT 
Sbjct: 120  GDAEPEAEVKKAKQKKPK---------------KPKVTVAEAASGISADDLDAFLAEIT- 163

Query: 1615 SYEDSQQDVQLMRFADYFGRAFSSVGGAQFPWVKTFRESTVAAIVDIPLLHISGDVYKIS 1436
                                                          IPLLHIS D+YKIS
Sbjct: 164  ----------------------------------------------IPLLHISEDIYKIS 177

Query: 1435 TDWIGHRSSEALGSFVLWSLDSFLADFASHQGNXXXXXXXXXXXXXXXXVAIFVVLAMVL 1256
            TDW+ HRS EALGSFVLWSLDS LAD ASHQG                 VA+FVVLAMVL
Sbjct: 178  TDWVSHRSYEALGSFVLWSLDSILADLASHQGVVKGSKKAVQQSSPKSQVAMFVVLAMVL 237

Query: 1255 RRKPDVMISLLPRMKESQKYQGQDKLPVTVWVIAQASQADLAVGLYLWVSLLLPMLNGKS 1076
            RRKPDV+ISLLP +KE++KYQGQDKLPV VWVI QASQ DL +GLYLWV LLLPML+ KS
Sbjct: 238  RRKPDVLISLLPIIKENKKYQGQDKLPVIVWVITQASQGDLVMGLYLWVYLLLPMLSVKS 297

Query: 1075 GCNPQSRDLILQLVERIITFPKARPILINGAVRKGERVVPPWALDSLLRATFPLPSARVK 896
            GCNPQSRDLILQLVERIIT PKAR IL+NGAVR+GERVVPPWALDSLLR TFPLPSARVK
Sbjct: 298  GCNPQSRDLILQLVERIITSPKARSILLNGAVRRGERVVPPWALDSLLRVTFPLPSARVK 357

Query: 895  ATERFGAVYPILKEVALAGSPGSKAIKHLAQQILSFAIKAAGEANRDLSKEACDIFIWCL 716
            ATERF AVYP L+EVALA SPGSKAIKHLAQQILSFAIKAAGEAN DLSKEA DIFIWCL
Sbjct: 358  ATERFEAVYPTLREVALASSPGSKAIKHLAQQILSFAIKAAGEANSDLSKEASDIFIWCL 417

Query: 715  TQNPECFKQWDLLYMDNLEASIVVLRKLSDEWK-HIVKHDTLHPFTETLKSFSQKNEKAL 539
            TQNPEC+KQWD LYMDNLEAS+VVLRKLS EWK + VKH TL P  E LKSFSQKNEKAL
Sbjct: 418  TQNPECYKQWDFLYMDNLEASVVVLRKLSGEWKEYFVKHPTLDPLRENLKSFSQKNEKAL 477

Query: 538  AKV-DGARDALLKDADKYCKAILRQMSQGRGXXXXXXXXXXXXXXXXXFLSQNMHFLDYQ 362
            AKV DGAR ALLKDADKYCK +L Q+SQG G                 F+SQN+H  DY 
Sbjct: 478  AKVDDGARHALLKDADKYCKVLLGQLSQGHGCLKSMIVLSVVLAVGAVFMSQNLHLWDYS 537

Query: 361  KLSEMWNLS 335
            +L+EM NLS
Sbjct: 538  QLTEMLNLS 546


>ref|XP_003633736.1| PREDICTED: uncharacterized protein LOC100853921 [Vitis vinifera]
          Length = 587

 Score =  586 bits (1510), Expect = e-164
 Identities = 333/602 (55%), Positives = 410/602 (68%), Gaps = 12/602 (1%)
 Frame = -2

Query: 2113 MDENSALIEAILREQEEEQANLRGNRKEQNGNVNEWQTVSYKKRNRNKSA-SKQPLAADD 1937
            MDENS +IEAILR  ++   NL  ++ + +G    W+TVSY KR +N    S QP     
Sbjct: 1    MDENSEIIEAILRG-DDHATNLNDHQSQDSG----WKTVSYSKRRKNPPQNSLQPSLTPF 55

Query: 1936 DGFSSDVFRSVEQHSEERRHRLMEAQIAAASTESS-----GSKLHSYDDIEEEDKDSYKQ 1772
               +SDVFRSV+QHSE+R  R  EA   AA+  ++      SK HS DD  + D +    
Sbjct: 56   H--NSDVFRSVDQHSEDRLRRAQEAAATAAAAAAALQSAVRSKQHSDDD--DSDAEIPAG 111

Query: 1771 LNRNGSSSDSXXXXXXXXXXXXXXKQPKVTVAEAASGINADDLGAFLAEITASYEDSQQD 1592
               NG +                 K+PKV+V +AAS ++ADDL AFL +I+ASYE + QD
Sbjct: 112  AVDNGGAE-------VKKVKPKKPKKPKVSVGDAASKMDADDLSAFLLDISASYE-THQD 163

Query: 1591 VQLMRFADYFGRAFSSVGGAQFPWVKTFRESTVAAIVDIPLLHISGDVYKISTDWIGHRS 1412
            +QLMRFADYFGRAF+ V  AQFPW+K  +ESTVA ++++PL HI   VYK S DWI  RS
Sbjct: 164  IQLMRFADYFGRAFAPVSAAQFPWMKILKESTVAKMIEVPLSHIPEAVYKTSGDWINQRS 223

Query: 1411 SEALGSFVLWSLDSFLADFASHQGNXXXXXXXXXXXXXXXXVAIFVVLAMVLRRKPDVMI 1232
             EA+GSFVLW LD+  AD A HQG                 VAIFVVLAM LRRKP+V+I
Sbjct: 224  FEAVGSFVLWLLDNIHADLAIHQGTVKGSKKVAQQAPSKSQVAIFVVLAMSLRRKPEVLI 283

Query: 1231 SLLPRMKESQKYQGQDKLPVTVWVIAQASQADLAVGLYLWVSLLLPMLNGKSGCNPQSRD 1052
            SLLP MKE+ KYQ QDKLPVTVW+I+QASQ DLAVGLY+W  +LLPML+GKS CNPQSRD
Sbjct: 284  SLLPIMKENPKYQAQDKLPVTVWMISQASQGDLAVGLYMWTHMLLPMLSGKSSCNPQSRD 343

Query: 1051 LILQLVERIITFPKARPILINGAVRKGERVVPPWALDSLLRATFPLPSARVKATERFGAV 872
            LILQLVERI++ PK+R ILINGAVRKGER+VPP AL+ L+RATFP PSARVKATERF A+
Sbjct: 344  LILQLVERILSSPKSRTILINGAVRKGERLVPPSALELLMRATFPAPSARVKATERFEAM 403

Query: 871  YPILKEVALAGSPGSKAIKHLAQQILSFAIKAAGEANRDLSKEACDIFIWCLTQNPECFK 692
            YP LKEVALAGS  SKA+K +  QI++FAIKAAGE   DLS+EA DIF WCL QNP+C+K
Sbjct: 404  YPTLKEVALAGSSRSKAMKQVLLQIMNFAIKAAGEGILDLSREAVDIFTWCLNQNPDCYK 463

Query: 691  QWDLLYMDNLEASIVVLRKLSDEWKHI-VKHDTLHPFTETLKSFSQKNEKALAKVD-GAR 518
            QWDL+Y+DNLEAS++VL+ LS EWK +  K+ +L P  + LKSF QKNEK L   + GAR
Sbjct: 464  QWDLIYLDNLEASVLVLKMLSHEWKELSAKNPSLDPLKDALKSFQQKNEKELGGGEHGAR 523

Query: 517  DALLKDADKYCKAILRQMSQGRG----XXXXXXXXXXXXXXXXXFLSQNMHFLDYQKLSE 350
             A LKDADKYCK IL ++S+G G                      LS N+   D+++L E
Sbjct: 524  HASLKDADKYCKVILGRLSRGHGCTVSKVFASAALALGAAAGFALLSPNLQSYDWKRLPE 583

Query: 349  MW 344
            ++
Sbjct: 584  LF 585


>emb|CAN74204.1| hypothetical protein VITISV_021204 [Vitis vinifera]
          Length = 583

 Score =  575 bits (1482), Expect = e-161
 Identities = 326/602 (54%), Positives = 403/602 (66%), Gaps = 12/602 (1%)
 Frame = -2

Query: 2113 MDENSALIEAILREQEEEQANLRGNRKEQNGNVNEWQTVSYKKRNRNKSA-SKQPLAADD 1937
            MDENS +IEAILR  ++   NL  ++ + +G    W+TVSY KR +N    S QP     
Sbjct: 1    MDENSEIIEAILRG-DDHATNLNDHQSQDSG----WKTVSYSKRRKNPPQNSLQPSLTPF 55

Query: 1936 DGFSSDVFRSVEQHSEERRHRLMEAQIAAASTESS-----GSKLHSYDDIEEEDKDSYKQ 1772
               +SDVFRSV+QHSE+R  R  EA   AA+  ++      SK HS DD      DS  +
Sbjct: 56   H--NSDVFRSVDQHSEDRLRRAQEAAATAAAAAAALQSAVRSKQHSDDD------DSDAE 107

Query: 1771 LNRNGSSSDSXXXXXXXXXXXXXXKQPKVTVAEAASGINADDLGAFLAEITASYEDSQQD 1592
            +      +                K+PKV+V +AAS ++ADDL AFL +I+        D
Sbjct: 108  IPAGAVDNGGAEVKKVKPKKPKKPKKPKVSVGDAASKMDADDLSAFLLDIS--------D 159

Query: 1591 VQLMRFADYFGRAFSSVGGAQFPWVKTFRESTVAAIVDIPLLHISGDVYKISTDWIGHRS 1412
            +QLMRFADYFGRAF+ V  AQFPW+K  +ESTVA ++++PL HI   VYK S DWI  RS
Sbjct: 160  IQLMRFADYFGRAFAPVSAAQFPWMKILKESTVAKMIEVPLSHIPEAVYKTSGDWINQRS 219

Query: 1411 SEALGSFVLWSLDSFLADFASHQGNXXXXXXXXXXXXXXXXVAIFVVLAMVLRRKPDVMI 1232
             EA+GSFVLW LD+  AD A HQG                 VAIFVVLAM LRRKP+V+I
Sbjct: 220  FEAVGSFVLWLLDNIHADLAIHQGTVKGSKKVAQQAPSKSLVAIFVVLAMSLRRKPEVLI 279

Query: 1231 SLLPRMKESQKYQGQDKLPVTVWVIAQASQADLAVGLYLWVSLLLPMLNGKSGCNPQSRD 1052
            SLLP MKE+ KYQ QDKLPVTVW+I+QASQ DLAVGLY+W  +LLPML+GKS CNPQSRD
Sbjct: 280  SLLPIMKENPKYQAQDKLPVTVWMISQASQGDLAVGLYMWTHMLLPMLSGKSSCNPQSRD 339

Query: 1051 LILQLVERIITFPKARPILINGAVRKGERVVPPWALDSLLRATFPLPSARVKATERFGAV 872
            LILQLVER+++ PK+R ILINGAVRKGER+VPP AL+ L+RATFP PSARVKATERF A+
Sbjct: 340  LILQLVERVLSSPKSRTILINGAVRKGERLVPPSALELLMRATFPAPSARVKATERFEAM 399

Query: 871  YPILKEVALAGSPGSKAIKHLAQQILSFAIKAAGEANRDLSKEACDIFIWCLTQNPECFK 692
            YP LKEVALAGS  SKA+K +  QI++FAIKAAGE   DLS+EA DIF WCL QNP+C+K
Sbjct: 400  YPTLKEVALAGSSRSKAMKQVLLQIMNFAIKAAGEGILDLSREAVDIFTWCLNQNPDCYK 459

Query: 691  QWDLLYMDNLEASIVVLRKLSDEWKHI-VKHDTLHPFTETLKSFSQKNEKALAKVD-GAR 518
            QWDL+Y+DNLEAS++VL+ LS EWK +  K+ +L P  + LKSF QKNEK L   + GAR
Sbjct: 460  QWDLIYLDNLEASVLVLKMLSHEWKELSAKNPSLDPLKDALKSFQQKNEKELGGGEHGAR 519

Query: 517  DALLKDADKYCKAILRQMSQGRG----XXXXXXXXXXXXXXXXXFLSQNMHFLDYQKLSE 350
             A LKDADKYCK IL ++S+G G                      LS N+   D+++L E
Sbjct: 520  HASLKDADKYCKVILGRLSRGHGCTVSKVFASAALALGAAAGFALLSPNLQSYDWKRLPE 579

Query: 349  MW 344
            ++
Sbjct: 580  LF 581


Top