BLASTX nr result

ID: Glycyrrhiza23_contig00011543

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00011543
         (1516 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003554966.1| PREDICTED: uncharacterized protein LOC100811...   671   0.0  
ref|XP_004140425.1| PREDICTED: uncharacterized protein LOC101206...   441   e-121
ref|XP_002305971.1| predicted protein [Populus trichocarpa] gi|1...   409   e-111
ref|XP_002888081.1| hypothetical protein ARALYDRAFT_475167 [Arab...   372   e-100
ref|NP_176354.1| uncharacterized protein [Arabidopsis thaliana] ...   368   3e-99

>ref|XP_003554966.1| PREDICTED: uncharacterized protein LOC100811822 [Glycine max]
          Length = 406

 Score =  671 bits (1731), Expect = 0.0
 Identities = 319/408 (78%), Positives = 350/408 (85%), Gaps = 8/408 (1%)
 Frame = +3

Query: 24   MYVTRPLSMYRRSPSSLEIPPPDAPYSGYLVITDEEAEAEDTCCWRLCRRKRVKKLPFPQ 203
            MYVTRPLSMYRRSPS+L +PPPD PYSGYLVITDEEAEAEDTCCWRLCRRK+VKKLPFPQ
Sbjct: 1    MYVTRPLSMYRRSPSTLSMPPPDGPYSGYLVITDEEAEAEDTCCWRLCRRKKVKKLPFPQ 60

Query: 204  DKIFSVSHASEYEQTSNTKVWFLPVPDHPLSSNRYYVIRAKGRHKGKAYKCSREGDIVSC 383
            DKIFSV+HASEYEQTS+TKVWFLPVPDHPL+SNRYYVIRAKGR KGKAYKCSRE DIV+C
Sbjct: 61   DKIFSVTHASEYEQTSSTKVWFLPVPDHPLASNRYYVIRAKGRQKGKAYKCSREADIVTC 120

Query: 384  CFTDLLNDKRPKPFNLKDLNQIFKIHSHQSGGFFARSITPDGIPPKFLRKKGWRVRISGS 563
            CFTD+LND+RPKPFNLKDL QIFKIHSHQSGGFFARSITPDGIPP FLRKKGWR+R SGS
Sbjct: 121  CFTDILNDQRPKPFNLKDLYQIFKIHSHQSGGFFARSITPDGIPPSFLRKKGWRIRTSGS 180

Query: 564  YRSCRITEALGVDSTLREKLPDFNFPISRKRSPPLVVGKWYCPFIFVKEGTKKFKHQMKK 743
            YRSC+++EALGVD+ LREKLPDFNFPISRKRSPP+ VG+WYCPFIFV++   + KHQMKK
Sbjct: 181  YRSCKLSEALGVDAPLREKLPDFNFPISRKRSPPVTVGRWYCPFIFVRDNGTRVKHQMKK 240

Query: 744  SMFYKMTLEQKWEEVYSCVNDH---ESSEGD-----GNXXXXXXXXXXXXXXXXXSGMEA 899
            SM+Y MTLEQ+WEEVY+C NDH   +S EGD     G                  SGMEA
Sbjct: 241  SMYYSMTLEQRWEEVYTCGNDHHHEKSDEGDDNGGGGGGGVVIVNVCVEREVVLVSGMEA 300

Query: 900  TKNNGKGDANGFLWYRACNPYNRRRVNVGLSLAIVEHMRWVQEQGGWAYGNGREKVVRMR 1079
            T+ NG  DANGF WYRA + YN+RR NVGLS AIVEHMRWVQE GGWAYG+GRE+VVR+R
Sbjct: 301  TR-NGSTDANGFFWYRAYDAYNKRRANVGLSSAIVEHMRWVQEAGGWAYGHGRERVVRVR 359

Query: 1080 EEVTCQQSEWHRFGFYVLVESFCLRTLDGKLVFRYEFRHTGRVKCKWE 1223
            EE  C QSEW RFG YVLVESFCLRTLDGKLV RY+FRHT +VKCKWE
Sbjct: 360  EEARC-QSEWLRFGCYVLVESFCLRTLDGKLVLRYDFRHTHKVKCKWE 406


>ref|XP_004140425.1| PREDICTED: uncharacterized protein LOC101206442 [Cucumis sativus]
            gi|449481269|ref|XP_004156133.1| PREDICTED:
            uncharacterized LOC101206442 [Cucumis sativus]
          Length = 407

 Score =  441 bits (1135), Expect = e-121
 Identities = 215/413 (52%), Positives = 285/413 (69%), Gaps = 13/413 (3%)
 Frame = +3

Query: 24   MYVTRPLSMYRRSPSSLEIP--PPDAPYSGYLVITDEEAEAEDTCCWRLCRRKRVKKLPF 197
            M V+RPLS++RRSPSS+ +P    + P+SG  V+ DEEAEA+D+ CW +C+R+ +KK PF
Sbjct: 1    MLVSRPLSLFRRSPSSISMPVAASEGPFSGVFVVKDEEAEADDSYCWGICKRRSIKKAPF 60

Query: 198  PQDKIFSVSHAS-EYEQTSNTKVWFLPVPDHPLSSNRYYVIRAKGRHKGKAYKCSREGDI 374
            PQD+I ++ H+S +YE+T +TKVW LPV D PLSSNRYY+I+A G+HKGKAYKCSRE DI
Sbjct: 61   PQDRILTILHSSSQYEETKSTKVWLLPVLDRPLSSNRYYLIKAGGKHKGKAYKCSREDDI 120

Query: 375  VSCCFTDLLNDKRPKPFNLKDLNQIFKIHSHQSGGFFARSITPDGIPPKFLRKKGWRVRI 554
             +CCF D+L+DK+P PFNLKD+ Q F+IH H SGGFFA+S+ PDG+PPKFLR KGW++R 
Sbjct: 121  RTCCFGDVLSDKKPSPFNLKDIYQQFQIHRHHSGGFFAQSVAPDGVPPKFLRTKGWKLRS 180

Query: 555  SGSYRSCRI--TEALGVDSTLREKLPDFNFPISRKRSPPLVVGKWYCPFIFVKEGTKKFK 728
            S S  +  +   EALG+DS+ RE LPDFNFPI   RSPP+VVGKW CPF+FV+E +   +
Sbjct: 181  SSSSSTFHLPFQEALGLDSSSRELLPDFNFPIFTTRSPPVVVGKWLCPFVFVRENSMSIR 240

Query: 729  HQMKKSMFYKMTLEQKWEEVYSCVNDHESSEGDGNXXXXXXXXXXXXXXXXXSGMEATKN 908
             QMK+S  Y +TLEQ WE+++SC      S  D                   +G EA + 
Sbjct: 241  KQMKRSPIYSLTLEQCWEQMFSC-----ESPNDETSSIVTVTIDVAREVVLLAGREAERE 295

Query: 909  NGKGDANGFLWYRACNPYN--RRRVNVGLSLAIVEHMRWVQEQGGWAYG----NGREKVV 1070
             G     GF+W++ CN  +     + +GLS+A++E +RWVQE GGW        G EKVV
Sbjct: 296  KGDEHRKGFIWFKVCNRLDGGGTAMGIGLSIALLEKIRWVQEAGGWFSSGDNDKGGEKVV 355

Query: 1071 RMR--EEVTCQQSEWHRFGFYVLVESFCLRTLDGKLVFRYEFRHTGRVKCKWE 1223
            R+   EE+T  ++ W RF  Y+LVESF LR  +G LV++Y FRHT  +KCKWE
Sbjct: 356  RVEKVEEIT-SENGWRRFSLYMLVESFVLRRSNGGLVWKYNFRHTHTIKCKWE 407


>ref|XP_002305971.1| predicted protein [Populus trichocarpa] gi|118484298|gb|ABK94028.1|
            unknown [Populus trichocarpa] gi|222848935|gb|EEE86482.1|
            predicted protein [Populus trichocarpa]
          Length = 392

 Score =  409 bits (1051), Expect = e-111
 Identities = 203/401 (50%), Positives = 268/401 (66%), Gaps = 1/401 (0%)
 Frame = +3

Query: 24   MYVTRPLSMYRRSPSSLEIPPPDAPYSGYLVITDEEAEAEDTCCWRLCRRKRVKKLPFPQ 203
            MYVTRPLS+YR  PS+L   PP+ P++GYLVITDEEAEA++T CW + + +RVKKLPFPQ
Sbjct: 1    MYVTRPLSLYRNFPSALSREPPEGPHTGYLVITDEEAEAQETYCWGIRKSRRVKKLPFPQ 60

Query: 204  DKIFSVSHASEYEQTSNTKVWFLPVPDHPLSSNRYYVIRAKGRHKGKAYKCSREGDIVSC 383
            DKI SV H+S++E+T   K WF+PV D PLSSN YYVI+AKG HKG+A  CSRE D+   
Sbjct: 61   DKILSVVHSSDHEETIVKKAWFIPVLDQPLSSNCYYVIKAKGSHKGQACTCSREMDMGLW 120

Query: 384  CFTDLLNDKRPKPFNLKDLNQIFKIHSHQSGGFFARSITPDGIPPKFLRKKGWRVRISGS 563
            CF  ++ND +PKPF+ +++ Q FKIH H    FF++S+ PDG PP+FLRKKGW VR S S
Sbjct: 121  CFKSVINDIKPKPFDYRNIYQQFKIHRHHGKSFFSKSLAPDGFPPRFLRKKGWEVRSSRS 180

Query: 564  YRSCRITEALGVDSTLREKLPDFNFPISRKRSPPLVVGKWYCPFIFVKEGTKKFKHQMKK 743
            Y+  +++EALG+D  LR +LP F+FP+S K S  + VG+WYCPF+ V+E   + + QMK+
Sbjct: 181  YK-FQLSEALGLDVPLRSQLPSFDFPLSTKSSSRVTVGRWYCPFVLVRE-EPRIREQMKR 238

Query: 744  SMFYKMTLEQKWEEVYSCVNDHESSEGDGNXXXXXXXXXXXXXXXXXSGMEATKNNGKGD 923
            +M Y MTLEQ W+E+YSC N +  +E                      GMEAT++ G   
Sbjct: 239  TMLYSMTLEQYWKEIYSCENANNEAES-----TIMVSVNVQREMDLVFGMEATRDGGVSH 293

Query: 924  ANGFLWYRACNPYNR-RRVNVGLSLAIVEHMRWVQEQGGWAYGNGREKVVRMREEVTCQQ 1100
              G +WYR  +  +  R   VG+S A VE M+WVQE GGW  G    + V    E+   +
Sbjct: 294  -GGVIWYRVVSRNSSGRGFKVGVSAATVEKMKWVQEAGGWIDGGDVNETVERAVEIR-SE 351

Query: 1101 SEWHRFGFYVLVESFCLRTLDGKLVFRYEFRHTGRVKCKWE 1223
            + W +FG Y LVESF LR +DG LV R +FRHT ++K KWE
Sbjct: 352  NGWRKFGCYALVESFVLRRMDGSLVLRCDFRHTHKIKSKWE 392


>ref|XP_002888081.1| hypothetical protein ARALYDRAFT_475167 [Arabidopsis lyrata subsp.
            lyrata] gi|297333922|gb|EFH64340.1| hypothetical protein
            ARALYDRAFT_475167 [Arabidopsis lyrata subsp. lyrata]
          Length = 414

 Score =  372 bits (955), Expect = e-100
 Identities = 200/418 (47%), Positives = 255/418 (61%), Gaps = 18/418 (4%)
 Frame = +3

Query: 24   MYVTRPLSMYRRSPSSLEIPPPDAPYSGYLVITDEEAEAEDTCCWRLCRRKRVKKLPFPQ 203
            MYVTR LS +R+   +L    P+ P+SG LVITDEEAE EDT C+ +C R +++KLP PQ
Sbjct: 1    MYVTRTLSQFRKYQKTLSEESPEGPFSGVLVITDEEAETEDTFCFGMCTRTKIEKLPLPQ 60

Query: 204  DKIFSVSH--ASEYEQTSNTKVWFLPVPDHPLSSNRYYVIRAKGRHKGKAYKCSREGDIV 377
            DKI SV H  +S   +TS  KV F+P  D PLSSNRYYV+ A+GR+KGK   CSRE +  
Sbjct: 61   DKILSVVHLDSSGNRETSVKKVLFIPALDQPLSSNRYYVVHARGRYKGKVSVCSREIEKG 120

Query: 378  SCCFTDLLNDKRPKPFNLKDLNQIFKIHSHQSGGFFARSITPDGIPPKFLRKKGWRVRIS 557
             CCF D+L+DK+PKP + +++ Q  KI+ H    FF +S+ PDG PP FL+KKGW +R S
Sbjct: 121  VCCFPDILHDKKPKPLDPRNIYQTVKINRHHDRTFFGKSVAPDGTPPSFLKKKGWELRTS 180

Query: 558  GSYRSCRITEALGVDSTLREKLPDFNFPISRKRSPPLVVGKWYCPFIFVKEGTKKFKHQM 737
             S    R  EALG+D  LR +LP F FP+S  RS  ++VG+WYCPF+FVKE      +QM
Sbjct: 181  RSLHPRRPREALGLDDELRARLPAFGFPVSTIRSGSVIVGEWYCPFMFVKENC-SLSYQM 239

Query: 738  KKSMFYKMTLEQKWEEVYSCVNDH-----ESSEGDGNXXXXXXXXXXXXXXXXXSGMEAT 902
            +KSMFY++TL Q WE +Y C N+      + +  D                    GMEA 
Sbjct: 240  RKSMFYRITLSQYWERIYHCENNDAHNNIDENNDDNEEEVVSVEANVVREANYVKGMEAV 299

Query: 903  KNNGKGDANGFLWYRAC----NPYNRRRVN-----VGLSLAIVEHMRWVQEQGGWAYGNG 1055
            K   +G   GF WYR       P  RRR       VGLS  +VE MR V E+GGW  G G
Sbjct: 300  KGEKEGH-GGFHWYRQVQGPRGPGERRRKRGVSSPVGLSFVVVERMRRVMEEGGWV-GGG 357

Query: 1056 REKVVRMR--EEVTCQQSEWHRFGFYVLVESFCLRTLDGKLVFRYEFRHTGRVKCKWE 1223
            R KVVR+   E +   + +W RFG YVLVESF LR  DG L+ +  FRHT R++C WE
Sbjct: 358  R-KVVRVERDEPIRISRRDWRRFGCYVLVESFGLRRADGVLLVKCVFRHTNRLRCNWE 414


>ref|NP_176354.1| uncharacterized protein [Arabidopsis thaliana]
            gi|4585877|gb|AAD25550.1|AC005850_7 Hypothetical protein
            [Arabidopsis thaliana] gi|332195740|gb|AEE33861.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 421

 Score =  368 bits (944), Expect = 3e-99
 Identities = 201/425 (47%), Positives = 255/425 (60%), Gaps = 25/425 (5%)
 Frame = +3

Query: 24   MYVTRPLSMYRRSPSSLEIPPPDAPYSGYLVITDEEAEAEDTCCWRLCRRKRVKKLPFPQ 203
            MYVTR LS +R+   +L    P+ P+SG LVITDEEAE EDT C+ +C R +++KLP PQ
Sbjct: 1    MYVTRTLSQFRKYQKTLSEESPEGPFSGVLVITDEEAETEDTFCFGMCTRTKIEKLPLPQ 60

Query: 204  DKIFSVSH--ASEYEQTSNTKVWFLPVPDHPLSSNRYYVIRAKGRHKGKAYKCSREGDIV 377
            DKI SV H  +S   +TS  KV F+P  D PLSSNRYYV+ A+GRHKGK   CSRE +  
Sbjct: 61   DKILSVVHLDSSGNRETSVKKVLFIPALDQPLSSNRYYVVHARGRHKGKVSVCSREIEKG 120

Query: 378  SCCFTDLLNDKRPKPFNLKDLNQIFKIHSHQSGGFFARSITPDGIPPKFLRKKGWRVRIS 557
             CCF D+L+DK+PKP + +++ Q  KI+ H    F+A+S+ PDG PP FL+KKGW +R S
Sbjct: 121  VCCFPDILHDKKPKPLDPRNIYQTVKINRHHDRTFYAKSVAPDGTPPTFLKKKGWELRTS 180

Query: 558  GSYRSCRITEALGVDSTLREKLPDFNFPISRKRSPPLVVGKWYCPFIFVKEGTKKFKHQM 737
             S    R  EALG+D  LR +LP F FP+S  RS  ++VG+WYCPF+FVKE       QM
Sbjct: 181  RSLHPRRPREALGLDEELRARLPAFGFPVSTIRSGSVIVGEWYCPFMFVKENC-SVSQQM 239

Query: 738  KKSMFYKMTLEQKWEEVYSC-VNDHESSEGDGNXXXXXXXXXXXXXXXXXSGMEATKNNG 914
            +KSMFY++TL Q WE +Y C  ND + +  +                    GMEA K   
Sbjct: 240  RKSMFYRITLSQYWERIYHCGNNDLDENNDENEEEVVRVEANVVREANYVKGMEAVKGEK 299

Query: 915  KGDANGFLWYRAC----NPYNRRRVN-----VGLSLAIVEHMRWVQEQGGWAYGNGREKV 1067
            +G   GF WYR       P  RRR       VGLS  +VE MR V E+GGW  G GR KV
Sbjct: 300  EGH-GGFYWYRQVQGPRGPGERRRKTGLRSPVGLSFVVVERMRRVMEEGGWV-GGGR-KV 356

Query: 1068 VRMREEV---TCQQS----------EWHRFGFYVLVESFCLRTLDGKLVFRYEFRHTGRV 1208
            VR+  +     C++            W RFG YVLVESF LR  DG L+ +  FRHT R+
Sbjct: 357  VRVERDEPIRVCRRDGRNMNGNNDRNWRRFGCYVLVESFGLRRADGVLLVKCVFRHTNRL 416

Query: 1209 KCKWE 1223
            +C WE
Sbjct: 417  RCNWE 421

