BLASTX nr result

ID: Glycyrrhiza23_contig00020447 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00020447
         (1472 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1...   558   e-156
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,...   556   e-156
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t...   550   e-154
ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2...   545   e-152
ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arab...   544   e-152

>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  558 bits (1437), Expect = e-156
 Identities = 267/389 (68%), Positives = 312/389 (80%), Gaps = 1/389 (0%)
 Frame = -1

Query: 1316 RAAKTPLVSGAFTGSGQYFADLRIGSPPQRLLLVADTGSDLVWVKCSACRNCSDHRPGSA 1137
            ++ K+P+VSGA TGSGQYF DLR+G+PPQ+LLLVADTGSDLVWVKCSACRNC+ H PGSA
Sbjct: 72   QSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSA 131

Query: 1136 FLARHSKTFSPHHCYDSPCRLLPHPKLATPCNNHTRIHTPCRYEYSYADGSTTSGFFSKE 957
            FLARHS TFSP+HCYDS C+L+P PK    CN H R+H+PCRYEYSY DGS TSGFFSKE
Sbjct: 132  FLARHSTTFSPNHCYDSACQLVPLPKHHR-CN-HARLHSPCRYEYSYGDGSKTSGFFSKE 189

Query: 956  KTTFNTSNGNEVKIKNLSFGCGFRISGPSVTGASFNGAQGVMGLGRGPISFSSQLGRRFG 777
             TT NTS+G E K+K ++FGC FRISGPSV+GASFNGA GVMGLGRGPIS SSQLG RFG
Sbjct: 190  TTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFG 249

Query: 776  NRFSYCLLDYTLSPPPKSYLTLGGSINDDV-SRKKFSYTPLLTNPLSPTFYYVAIEGVTV 600
            N+FSYCL+D+ +SP P SYL +G + ND    +++  +TPL  NPLSPTFYY+ IE V+V
Sbjct: 250  NKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSV 309

Query: 599  DGAKLPISPSVWAIDEGGNGGTVVDSGTTLTFLAEQAYRQVLAAFRRRVRLPEAEGPALG 420
            DG KLPI+PSVWA+DE GNGGT+VDSGTTLTFL E AY Q+L   +RRVRLP    P  G
Sbjct: 310  DGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPG 369

Query: 419  FDLCVNVSGMSRPKLPRLRFGLTGKVALLPPVRNYFIEAADRVLCLAIQPVKPGSGFSVI 240
            FDLCVNVS +  P+LP+L F L G     PP RNYF++  + V CLA+Q V   SGFSVI
Sbjct: 370  FDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVI 429

Query: 239  GNLMQQGYLFEFDSDRSRLGFSRHGCAIP 153
            GNLMQQG+L EFD DR+RLGFSRHGCA+P
Sbjct: 430  GNLMQQGFLLEFDKDRTRLGFSRHGCALP 458


>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223536362|gb|EEF38012.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 455

 Score =  556 bits (1433), Expect = e-156
 Identities = 268/415 (64%), Positives = 323/415 (77%), Gaps = 12/415 (2%)
 Frame = -1

Query: 1361 AELLSADLRR---LFGHRR--------AAKTPLVSGAFTGSGQYFADLRIGSPPQRLLLV 1215
            +E L+ D+ R   L  H R        + ++P++SGA +GSGQYF  LRIG+PPQ LLLV
Sbjct: 43   SEALAFDINRRLSLLHHHRHQQQHKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLV 102

Query: 1214 ADTGSDLVWVKCSACRNCSDHRPGSAFLARHSKTFSPHHCYDSPCRLLPHPKLATPCNNH 1035
            ADTGSDL+WVKCS CRNCS   PGSAF ARHS T+S  HCY   C+L+PHP    PCN  
Sbjct: 103  ADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPH-PNPCNR- 160

Query: 1034 TRIHTPCRYEYSYADGSTTSGFFSKEKTTFNTSNGNEVKIKNLSFGCGFRISGPSVTGAS 855
            TR+H+PCRY+Y+YAD STT+GFFSKE  T NTS G   K+  LSFGCGFRISGPS+TGAS
Sbjct: 161  TRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGAS 220

Query: 854  FNGAQGVMGLGRGPISFSSQLGRRFGNRFSYCLLDYTLSPPPKSYLTLGGSINDDVSRKK 675
            F GAQGVMGLGR PISFSSQLGRRFG++FSYCL+DYTLSPPP S+LT+GG+ N  VS+K 
Sbjct: 221  FEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKG 280

Query: 674  F-SYTPLLTNPLSPTFYYVAIEGVTVDGAKLPISPSVWAIDEGGNGGTVVDSGTTLTFLA 498
              S+TPLL NPLSPTFYY+AI+GV V+G KLPI+PSVW+ID+ GNGGT++DSGTTLTF+ 
Sbjct: 281  IMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFIT 340

Query: 497  EQAYRQVLAAFRRRVRLPEAEGPALGFDLCVNVSGMSRPKLPRLRFGLTGKVALLPPVRN 318
            E AY ++L AF++RV+LP    P  GFDLC+NVSG++RP LPR+ F L G     PP RN
Sbjct: 341  EPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRN 400

Query: 317  YFIEAADRVLCLAIQPVKPGSGFSVIGNLMQQGYLFEFDSDRSRLGFSRHGCAIP 153
            YFIE  D++ CLA+QPV    GFSV+GNLMQQG+L EFD D+SRLGF+R GCA+P
Sbjct: 401  YFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455


>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA
            binding protein-like; nucellin-like protein [Arabidopsis
            thaliana] gi|189339286|gb|ACD89063.1| At3g25700
            [Arabidopsis thaliana] gi|332643533|gb|AEE77054.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  550 bits (1417), Expect = e-154
 Identities = 273/410 (66%), Positives = 318/410 (77%), Gaps = 8/410 (1%)
 Frame = -1

Query: 1358 ELLSADLRRL--FGHRRA----AKTPLVSGAFTGSGQYFADLRIGSPPQRLLLVADTGSD 1197
            + L+ D RRL     RR      K+P+VSGA +GSGQYF DLRIG PPQ LLL+ADTGSD
Sbjct: 47   QALALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSD 106

Query: 1196 LVWVKCSACRNCSDHRPGSAFLARHSKTFSPHHCYDSPCRLLPHPKLATPCNNHTRIHTP 1017
            LVWVKCSACRNCS H P + F  RHS TFSP HCYD  CRL+P P  A  CN HTRIH+ 
Sbjct: 107  LVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICN-HTRIHST 165

Query: 1016 CRYEYSYADGSTTSGFFSKEKTTFNTSNGNEVKIKNLSFGCGFRISGPSVTGASFNGAQG 837
            C YEY YADGS TSG F++E T+  TS+G E ++K+++FGCGFRISG SV+G SFNGA G
Sbjct: 166  CHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANG 225

Query: 836  VMGLGRGPISFSSQLGRRFGNRFSYCLLDYTLSPPPKSYLTLGGSINDDVSRKKFSYTPL 657
            VMGLGRGPISF+SQLGRRFGN+FSYCL+DYTLSPPP SYL +G    D +S  K  +TPL
Sbjct: 226  VMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNG-GDGIS--KLFFTPL 282

Query: 656  LTNPLSPTFYYVAIEGVTVDGAKLPISPSVWAIDEGGNGGTVVDSGTTLTFLAEQAYRQV 477
            LTNPLSPTFYYV ++ V V+GAKL I PS+W ID+ GNGGTVVDSGTTL FLAE AYR V
Sbjct: 283  LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSV 342

Query: 476  LAAFRRRVRLPEAEGPALGFDLCVNVSGMSRPK--LPRLRFGLTGKVALLPPVRNYFIEA 303
            +AA RRRV+LP A+    GFDLCVNVSG+++P+  LPRL+F  +G    +PP RNYFIE 
Sbjct: 343  IAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 402

Query: 302  ADRVLCLAIQPVKPGSGFSVIGNLMQQGYLFEFDSDRSRLGFSRHGCAIP 153
             +++ CLAIQ V P  GFSVIGNLMQQG+LFEFD DRSRLGFSR GCA+P
Sbjct: 403  EEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452


>ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
            gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic
            proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  545 bits (1404), Expect = e-152
 Identities = 269/410 (65%), Positives = 317/410 (77%), Gaps = 7/410 (1%)
 Frame = -1

Query: 1361 AELLSADLRRL---FGHRRAA-KTPLVSGAFTGSGQYFADLRIGSPPQRLLLVADTGSDL 1194
            ++ LS+D  RL   F       K+PL+SGA TGSGQYF D+R+G+PPQ LLLVADTGSDL
Sbjct: 52   SQSLSSDTHRLSLLFSRPNPTLKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDL 111

Query: 1193 VWVKCSACRNCSDHRPGSAFLARHSKTFSPHHCYDSPCRLLPHPKLATPCNNHTRIHTPC 1014
            VWVKCSACRNCS H P SAFL RHS +FSP HC+D  CRLLPH      CN HTR+H+PC
Sbjct: 112  VWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHL-CN-HTRLHSPC 169

Query: 1013 RYEYSYADGSTTSGFFSKEKTTFNTSNGNEVKIKNLSFGCGFRISGPSVTGASFNGAQGV 834
            R+ YSYADGS +SGFFSKE TT  + +G+E+ +K LSFGCGFRISGPSV+GA FNGA+GV
Sbjct: 170  RFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGV 229

Query: 833  MGLGRGPISFSSQLGRRFGNRFSYCLLDYTLSPPPKSYLTLGGSIND--DVSRKKFSYTP 660
            MGLGRG ISFSSQLGRRFGN+FSYCL+DYTLSPPP S+L +GG ++     +  K SYTP
Sbjct: 230  MGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTP 289

Query: 659  LLTNPLSPTFYYVAIEGVTVDGAKLPISPSVWAIDEGGNGGTVVDSGTTLTFLAEQAYRQ 480
            L  NPLSPTFYY+ I  +T+DG KLPI+P+VW IDE GNGGTVVDSGTTLT+L + AY +
Sbjct: 290  LQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEE 349

Query: 479  VLAAFRRRVRLPEAEGPALGFDLCVNVSGMS-RPKLPRLRFGLTGKVALLPPVRNYFIEA 303
            VL + RRRV+LP A     GFDLCVN SG S RP LPRLRF L G     PP RNYF+E 
Sbjct: 350  VLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLET 409

Query: 302  ADRVLCLAIQPVKPGSGFSVIGNLMQQGYLFEFDSDRSRLGFSRHGCAIP 153
             + V+CLAI+ V+ G+GFSVIGNLMQQG+L EFD + SRLGF+R GC +P
Sbjct: 410  EEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGCGLP 459


>ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
            lyrata] gi|297321109|gb|EFH51530.1| hypothetical protein
            ARALYDRAFT_484331 [Arabidopsis lyrata subsp. lyrata]
          Length = 451

 Score =  544 bits (1402), Expect = e-152
 Identities = 270/410 (65%), Positives = 318/410 (77%), Gaps = 8/410 (1%)
 Frame = -1

Query: 1358 ELLSADLRRL--FGHRRA----AKTPLVSGAFTGSGQYFADLRIGSPPQRLLLVADTGSD 1197
            + L+ D RRL     RR      K+P+VSGA +GSGQYF DLRIG PPQ LLL+ADTGSD
Sbjct: 46   QALALDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSD 105

Query: 1196 LVWVKCSACRNCSDHRPGSAFLARHSKTFSPHHCYDSPCRLLPHPKLATPCNNHTRIHTP 1017
            LVWVKCSACRNCS H P + F  RHS TFSP HCYD  CRL+P P  A  CN HTRIH+ 
Sbjct: 106  LVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCN-HTRIHST 164

Query: 1016 CRYEYSYADGSTTSGFFSKEKTTFNTSNGNEVKIKNLSFGCGFRISGPSVTGASFNGAQG 837
            C YEY YADGS TSG F++E T+  TS+G E K+K+++FGCGFRISG SV+G SFNGA G
Sbjct: 165  CPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANG 224

Query: 836  VMGLGRGPISFSSQLGRRFGNRFSYCLLDYTLSPPPKSYLTLGGSINDDVSRKKFSYTPL 657
            VMGLGRGPISF+SQLGRRFGN+FSYCL+DYTLSPPP SYL +G    D VS  K  +TPL
Sbjct: 225  VMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDG-GDAVS--KLFFTPL 281

Query: 656  LTNPLSPTFYYVAIEGVTVDGAKLPISPSVWAIDEGGNGGTVVDSGTTLTFLAEQAYRQV 477
            LTNPLSPTFYYV ++ V V+GAKL I PS+W ID+ GNGGTV+DSGTTL FLA+ AYR V
Sbjct: 282  LTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341

Query: 476  LAAFRRRVRLPEAEGPALGFDLCVNVSGMSRPK--LPRLRFGLTGKVALLPPVRNYFIEA 303
            +AA ++R++LP A+    GFDLCVNVSG+++P+  LPRL+F  +G    +PP RNYFIE 
Sbjct: 342  IAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIET 401

Query: 302  ADRVLCLAIQPVKPGSGFSVIGNLMQQGYLFEFDSDRSRLGFSRHGCAIP 153
             +++ CLAIQ V P  GFSVIGNLMQQG+LFEFD DRSRLGFSR GCA+P
Sbjct: 402  EEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 451

