BLASTX nr result

ID: Glycyrrhiza23_contig00016106 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00016106
         (2108 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003520361.1| PREDICTED: uncharacterized protein LOC100813...   595   e-167
ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana] ...   556   e-156
ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arab...   555   e-155
ref|XP_002319228.1| predicted protein [Populus trichocarpa] gi|2...   389   e-105
gb|AAF01580.1|AC009895_1 hypothetical protein [Arabidopsis thali...   385   e-104

>ref|XP_003520361.1| PREDICTED: uncharacterized protein LOC100813936 [Glycine max]
          Length = 607

 Score =  595 bits (1533), Expect = e-167
 Identities = 296/375 (78%), Positives = 319/375 (85%), Gaps = 10/375 (2%)
 Frame = +2

Query: 551  LHDQLQWRIRNSH----------DRIGELNSVLESHADNGNHVVESPGSGNLTSHIHNEF 700
            LH  + W ++  +          DR+GEL SVLES ADNGNHVVESPGSGNLTSH HN+F
Sbjct: 235  LHSDVNWGLKTFNYQQTSNADRSDRMGELTSVLESRADNGNHVVESPGSGNLTSHTHNDF 294

Query: 701  MFQHNFPQQNLIGNEQSPQPMSNITGYMNPVFNGDINGAFKRVNYQDISKADRDLSSFRH 880
            MFQHNFPQQNLIGNEQS QPMSN+ GYM+P  + D+N   K  NYQ  S ADR +SSF H
Sbjct: 295  MFQHNFPQQNLIGNEQSHQPMSNVAGYMHPALHSDVNWGLKTFNYQQTSNADRGISSFPH 354

Query: 881  GSINTIGVQERTGERKFVNGNGNLYQPPPEHDETASSVSEDGPGIENFQICGDAIPGEKL 1060
             SI+ IGVQ++  ER F  GNGN YQ PP+ DETASSVSEDGPGIENFQ+ GDAIPGEKL
Sbjct: 355  ASIDKIGVQDKNMERNF--GNGNFYQHPPDLDETASSVSEDGPGIENFQVSGDAIPGEKL 412

Query: 1061 LGCGYPVRGTSLCMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGR 1240
            LGCGYPVRGTSLCMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGR
Sbjct: 413  LGCGYPVRGTSLCMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGR 472

Query: 1241 QGEIVRLFANDQNKIKCDSDMQREIDTYLSKGEATFSVLLLMDSSENWEQATLFLRRSGY 1420
            QGE+V+LFANDQNKI CDS+M+ EI T LSKGEATFSVLLL DSSENWEQATLFLRRSGY
Sbjct: 473  QGELVKLFANDQNKITCDSEMKHEIGTNLSKGEATFSVLLLRDSSENWEQATLFLRRSGY 532

Query: 1421 QIKINGTEAPVVYEKLSKDLSIKVPCGLSTQFVLTCSDGSSHPLSTYSVRMRDTLVLTMR 1600
            QIKINGTEA VV EK SK+LSIKVPCGLS QFVLT S+GSSHPLSTYSVRMRDTLVLTMR
Sbjct: 533  QIKINGTEATVVDEKFSKELSIKVPCGLSAQFVLTSSNGSSHPLSTYSVRMRDTLVLTMR 592

Query: 1601 IFQSKVLDDKRKGRA 1645
            +FQSK LDDKRKGRA
Sbjct: 593  LFQSKALDDKRKGRA 607



 Score =  417 bits (1073), Expect = e-114
 Identities = 209/279 (74%), Positives = 235/279 (84%), Gaps = 3/279 (1%)
 Frame = +2

Query: 125 TQLAQSNFKSNDVHNHMKDLDTMELYSRERRQEEEILSLREQIAIACMKELQLLNEKCKL 304
           TQLAQ NFKSND  NH+++ +TMELYSR R QEEEILSLREQI IACMKELQLLNEKCKL
Sbjct: 12  TQLAQRNFKSNDTQNHIQEQNTMELYSRAREQEEEILSLREQIGIACMKELQLLNEKCKL 71

Query: 305 EREFSELRMAIDDKQNEAITSASNELARRKGYLEENLKLAHDLKVVEDERYMFMSSMLGL 484
           ER+FSELRMA+D+KQNEAI+SASN+L +RKGYLEENLKLAHDLK V+DERY+FMSSMLGL
Sbjct: 72  ERQFSELRMAVDEKQNEAISSASNDLVQRKGYLEENLKLAHDLKAVDDERYIFMSSMLGL 131

Query: 485 LAEYGLWPRVMNASSISNCVKHLHDQLQWRIRNSHDRIGELNSVLESHADNGNHVVESPG 664
           LAEYGLWPRVMNASSIS+CVKHLHDQLQWRIR+SHDR+GEL SVLES ADNGNHVVESPG
Sbjct: 132 LAEYGLWPRVMNASSISSCVKHLHDQLQWRIRSSHDRMGELTSVLESRADNGNHVVESPG 191

Query: 665 SGNLTSHIHNEFMFQHNFPQQNLIGNEQSPQPMSNITGYMNPVFNGDINGAFKRVNYQDI 844
           SGNLTSH HN+FMFQHNFPQQNLIGNEQS QPMSN+ GYM+P  + D+N   K  NYQ  
Sbjct: 192 SGNLTSHTHNDFMFQHNFPQQNLIGNEQSHQPMSNVAGYMHPALHSDVNWGLKTFNYQQT 251

Query: 845 SKADRDLSSFRHGSINTIGVQERTGERKFV---NGNGNL 952
           S ADR   S R G + ++ ++ R      V    G+GNL
Sbjct: 252 SNADR---SDRMGELTSV-LESRADNGNHVVESPGSGNL 286


>ref|NP_187006.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332640436|gb|AEE73957.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 521

 Score =  556 bits (1434), Expect = e-156
 Identities = 300/540 (55%), Positives = 387/540 (71%), Gaps = 19/540 (3%)
 Frame = +2

Query: 80   NQGKSSEILGRHILGTQLAQSNFKSNDVHNHMKDLDTMELYSRERRQEEEILSLREQIAI 259
            +  +SSE + RH +      S    +     ++D + M LY++ R QEEEI SL+E+IA 
Sbjct: 2    DDNRSSESIKRHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAA 61

Query: 260  ACMKELQLLNEKCKLEREFSELRMAIDDKQNEAITSASNELARRKGYLEENLKLAHDLKV 439
            AC+K++QLLNEK  LER+ ++LR+AID+KQNE++TSA NELARRKG LEENLKLAHDLKV
Sbjct: 62   ACLKDMQLLNEKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKV 121

Query: 440  VEDERYMFMSSMLGLLAEYGLWPRVMNASSISNCVKHLHDQLQWRIRNSHDRIGELNSVL 619
             EDERY+FM+S+LGLLAEYG+WPRV NA++IS+ +KHLHDQLQW+ +  +DRI EL+S++
Sbjct: 122  TEDERYIFMTSLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIV 181

Query: 620  ESHADNGNHVVESPGSGNLTSHIHNEFMFQHNFPQQNLIG----------NEQSPQPMSN 769
            E+           PG+  ++   H+      N   Q   G          NEQ   PM N
Sbjct: 182  ENQ----------PGTDFISKDNHDP----RNSKTQASYGSTDRGNDYQTNEQLLPPMEN 227

Query: 770  ITG--YMNPV-------FNGDINGAFKRVNYQDISKADRDLSSFRHGSINTIGVQERTGE 922
            +T   Y N +       FN  I G  + +      +  R+   +    ++++  +E   E
Sbjct: 228  VTRNPYHNIMQDTESLRFNNQIGGGSQGI----FPQPKRENFGY---PLSSVAGKEMIQE 280

Query: 923  RKFVNGNGNLYQPPPEHDETASSVSEDGPGIENFQICGDAIPGEKLLGCGYPVRGTSLCM 1102
            R+    N +++     ++E AS V E+GPGI+ FQI GDAIPGEK+LGCG+PVRGT+LCM
Sbjct: 281  REEKAENSSMFDAYNGNEEFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCM 340

Query: 1103 FQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGRQGEIVRLFANDQNK 1282
            FQWVRHLEDGTRQYIEGAT+PEY+VTADDVDKLIAVECIPMDD+GRQGE+VRLFANDQNK
Sbjct: 341  FQWVRHLEDGTRQYIEGATHPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNK 400

Query: 1283 IKCDSDMQREIDTYLSKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKINGTEAPVVYE 1462
            I+CD++MQ EIDTY+S+G+A+F+V LLMDSSE+WE AT+ L+RS YQIK N TEA V+ E
Sbjct: 401  IRCDTEMQTEIDTYISRGQASFNVQLLMDSSESWEPATVVLKRSSYQIKTNTTEAVVISE 460

Query: 1463 KLSKDLSIKVPCGLSTQFVLTCSDGSSHPLSTYSVRMRDTLVLTMRIFQSKVLDDKRKGR 1642
            K SK+L I+VP G STQFVL   DGSSHP+ST +VRMRDTLVLTMR+ QSK LD++RKGR
Sbjct: 461  KYSKELQIRVPSGESTQFVLISYDGSSHPISTLNVRMRDTLVLTMRMLQSKALDERRKGR 520


>ref|XP_002884395.1| hypothetical protein ARALYDRAFT_477601 [Arabidopsis lyrata subsp.
            lyrata] gi|297330235|gb|EFH60654.1| hypothetical protein
            ARALYDRAFT_477601 [Arabidopsis lyrata subsp. lyrata]
          Length = 519

 Score =  555 bits (1430), Expect = e-155
 Identities = 301/540 (55%), Positives = 387/540 (71%), Gaps = 19/540 (3%)
 Frame = +2

Query: 80   NQGKSSEILGRHILGTQLAQSNFKSNDVHNHMKDLDTMELYSRERRQEEEILSLREQIAI 259
            +  +SSE + RH +      S    +     ++D + M LY++ R QEEEI SL+E+IA 
Sbjct: 2    DDNRSSESIKRHEIEKDTIASRKLEDSNAKLIQDPEEMALYAKVRSQEEEIHSLQERIAA 61

Query: 260  ACMKELQLLNEKCKLEREFSELRMAIDDKQNEAITSASNELARRKGYLEENLKLAHDLKV 439
            AC+K++QLLNEK  LER+ ++LR+AID+KQNE++TSA NELARRKG LEEN KLAHDLKV
Sbjct: 62   ACLKDMQLLNEKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENSKLAHDLKV 121

Query: 440  VEDERYMFMSSMLGLLAEYGLWPRVMNASSISNCVKHLHDQLQWRIRNSHDRIGELNSVL 619
             EDERY+FM+S+LGLLAEYG+WPRV NA++IS+ +KHLHDQLQW+ +  +DRI EL+S++
Sbjct: 122  TEDERYIFMTSLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIV 181

Query: 620  ESHADNGNHVVESPGSGNLTSHIHNEFMFQHNFPQQNLIG----------NEQSPQPMSN 769
            E+           PG+  ++   H+      N   Q   G          NEQ   PM N
Sbjct: 182  ENQ----------PGTDFISKDNHDP----RNSKSQASYGSTDRGNDYQTNEQLLPPMEN 227

Query: 770  ITG--YMNPV-------FNGDINGAFKRVNYQDISKADRDLSSFRHGSINTIGVQERTGE 922
            +T   Y N +       FN  I G  + +  Q   +      +F +  ++++  +E   E
Sbjct: 228  VTRNPYHNVMQDTEGLRFNNQIGGGSQGIFQQPKRE------NFGY-PLSSVAGKEMIRE 280

Query: 923  RKFVNGNGNLYQPPPEHDETASSVSEDGPGIENFQICGDAIPGEKLLGCGYPVRGTSLCM 1102
            R+    + +++     ++E AS V E+GPGI+ FQI GDAIPGEK+LGCG+PVRGT+LCM
Sbjct: 281  REEKAESSSMFDAYNGNEEFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTLCM 340

Query: 1103 FQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGRQGEIVRLFANDQNK 1282
            FQWVRHLEDGTRQYIEGAT+PEYVVTADDVDKLIAVECIPMDD+GRQGE+VRLFANDQNK
Sbjct: 341  FQWVRHLEDGTRQYIEGATHPEYVVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNK 400

Query: 1283 IKCDSDMQREIDTYLSKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKINGTEAPVVYE 1462
            I+CD++MQ EIDTY+S+G+A+F+V LLMDSSE+WE AT+ L+RS YQIK N TE  V+ E
Sbjct: 401  IRCDTEMQAEIDTYISRGQASFNVQLLMDSSESWETATVILKRSSYQIKTNTTE--VISE 458

Query: 1463 KLSKDLSIKVPCGLSTQFVLTCSDGSSHPLSTYSVRMRDTLVLTMRIFQSKVLDDKRKGR 1642
            K SK+L IKVPCG STQFVL   DGSSHP+ST +VRMRDTLVLTMR+ QSK LD++RKGR
Sbjct: 459  KYSKELQIKVPCGFSTQFVLISYDGSSHPISTLNVRMRDTLVLTMRMLQSKALDERRKGR 518


>ref|XP_002319228.1| predicted protein [Populus trichocarpa] gi|222857604|gb|EEE95151.1|
            predicted protein [Populus trichocarpa]
          Length = 243

 Score =  389 bits (999), Expect = e-105
 Identities = 191/229 (83%), Positives = 204/229 (89%)
 Frame = +2

Query: 959  PPPEHDETASSVSEDGPGIENFQICGDAIPGEKLLGCGYPVRGTSLCMFQWVRHLEDGTR 1138
            P   +DE ASSVS+D PGIE FQI GDA PGEKLLGCG+PVRGTSLCMFQWV HLEDGTR
Sbjct: 15   PSSMNDEIASSVSDDLPGIEGFQIIGDATPGEKLLGCGFPVRGTSLCMFQWVHHLEDGTR 74

Query: 1139 QYIEGATNPEYVVTADDVDKLIAVECIPMDDKGRQGEIVRLFANDQNKIKCDSDMQREID 1318
            QYIEGATNPEY+VTADDVDKLIAVECIPMDD+GRQGE+VRLFANDQNKIKCD DMQREID
Sbjct: 75   QYIEGATNPEYIVTADDVDKLIAVECIPMDDQGRQGELVRLFANDQNKIKCDPDMQREID 134

Query: 1319 TYLSKGEATFSVLLLMDSSENWEQATLFLRRSGYQIKINGTEAPVVYEKLSKDLSIKVPC 1498
            TY+SKGEATFSVLLL DSSENW+  TL LRRSGYQIK +G    V+ EK SKDLSIK+P 
Sbjct: 135  TYISKGEATFSVLLLTDSSENWDSTTLVLRRSGYQIKSDGRGNVVIAEKFSKDLSIKIPA 194

Query: 1499 GLSTQFVLTCSDGSSHPLSTYSVRMRDTLVLTMRIFQSKVLDDKRKGRA 1645
            GLSTQFVLTCS+GSSHPLSTY VRMRDTLVL MR+FQSK LDDKRKGRA
Sbjct: 195  GLSTQFVLTCSNGSSHPLSTYDVRMRDTLVLAMRMFQSKALDDKRKGRA 243


>gb|AAF01580.1|AC009895_1 hypothetical protein [Arabidopsis thaliana]
            gi|6091766|gb|AAF03476.1|AC009327_15 hypothetical protein
            [Arabidopsis thaliana]
          Length = 436

 Score =  385 bits (989), Expect = e-104
 Identities = 220/451 (48%), Positives = 288/451 (63%), Gaps = 47/451 (10%)
 Frame = +2

Query: 80   NQGKSSEILGRHILGTQLAQSNFKSNDVHNHMKDLDTMELYSRERRQEEEILSLREQIAI 259
            +  +SSE + RH +      S    +     ++D + M LY++ R QEEEI SL+E+IA 
Sbjct: 2    DDNRSSESIKRHEIEKDTIASRKLEDTNTKLIQDPEEMALYAKVRSQEEEIHSLQERIAA 61

Query: 260  ACMKELQLLNEKCKLEREFSELRMAIDDKQNEAITSASNELARRKGYLEENLKLAHDLKV 439
            AC+K++QLLNEK  LER+ ++LR+AID+KQNE++TSA NELARRKG LEENLKLAHDLKV
Sbjct: 62   ACLKDMQLLNEKYGLERKCADLRVAIDEKQNESVTSALNELARRKGDLEENLKLAHDLKV 121

Query: 440  VEDERYMFMSSMLGLLAEYGLWPRVMNASSISNCVKHLHDQLQWRIRNSHDRIGELNSVL 619
             EDERY+FM+S+LGLLAEYG+WPRV NA++IS+ +KHLHDQLQW+ +  +DRI EL+S++
Sbjct: 122  TEDERYIFMTSLLGLLAEYGVWPRVANATAISSGIKHLHDQLQWKTKACNDRIRELSSIV 181

Query: 620  ESHADNGNHVVESPGSGNLTSHIHNEFMFQHNFPQQNLIG----------NEQSPQPMSN 769
            E+           PG+  ++   H+      N   Q   G          NEQ   PM N
Sbjct: 182  EN----------QPGTDFISKDNHD----PRNSKTQASYGSTDRGNDYQTNEQLLPPMEN 227

Query: 770  ITGYMNPV-----------FNGDINGAFKRVNYQDISKADRDLSSFRHGSINTIGVQERT 916
            +T   NP            FN  I G  + +      +  R+   +    ++++  +E  
Sbjct: 228  VT--RNPYHNIMQDTESLRFNNQIGGGSQGI----FPQPKRENFGY---PLSSVAGKEMI 278

Query: 917  GERKFVNGNGNLYQPPPEHDETASSVSEDGPGIENFQICGDAIPGEKLLGCGYPVRGTSL 1096
             ER+    N +++     ++E AS V E+GPGI+ FQI GDAIPGEK+LGCG+PVRGT+L
Sbjct: 279  QEREEKAENSSMFDAYNGNEEFASHVYEEGPGIDGFQIIGDAIPGEKVLGCGFPVRGTTL 338

Query: 1097 CMFQWVRHLEDGTRQYIEGATNPEYVVTADDVDKLIAVECIPMDDKGR------------ 1240
            CMFQWVRHLEDGTRQYIEGAT+PEY+VTADDVDKLIAVECIPMDD+GR            
Sbjct: 339  CMFQWVRHLEDGTRQYIEGATHPEYIVTADDVDKLIAVECIPMDDQGRQVKYRDFSGIYS 398

Query: 1241 --------------QGEIVRLFANDQNKIKC 1291
                          QGE+VRLFANDQNKI+C
Sbjct: 399  FNESVVSKDVLLIMQGELVRLFANDQNKIRC 429

