BLASTX nr result

ID: Glycyrrhiza24_contig00012407 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00012407
         (1845 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003523678.1| PREDICTED: uncharacterized protein LOC100776...   405   e-110
ref|XP_003524965.1| PREDICTED: uncharacterized protein LOC100791...   378   e-102
ref|XP_003547177.1| PREDICTED: pentatricopeptide repeat-containi...   375   e-101
ref|XP_003527782.1| PREDICTED: uncharacterized protein LOC100785...   346   1e-92
gb|ACU20003.1| unknown [Glycine max]                                  344   5e-92

>ref|XP_003523678.1| PREDICTED: uncharacterized protein LOC100776373 [Glycine max]
          Length = 289

 Score =  405 bits (1040), Expect = e-110
 Identities = 218/312 (69%), Positives = 237/312 (75%)
 Frame = -2

Query: 1202 MDQGKGGDEMLDPVHEVMGENLSRLVEGVAIDAVDGSSLGDRACINRQVGTSSEVDEDSG 1023
            MDQ +   E+LDPVH+    NLS   EGVAI          R  I+ + GTS EV ED G
Sbjct: 1    MDQHRS--EVLDPVHD---GNLSHSAEGVAIA---------RDNISGEAGTSGEVGEDLG 46

Query: 1022 PNNXXXXXXXXXXKGQDKPDQENICDELLGVVDQGTSYDSTNLVNQEVLETVVVIESVQS 843
             +            GQDK DQENI D L GV DQGTSY+S +L NQEV+ETVVVIESVQ+
Sbjct: 47   SDKEPKEEVK----GQDKRDQENISDNLPGV-DQGTSYNSRHLANQEVIETVVVIESVQT 101

Query: 842  EYVNGDNRKLEAKAQESGLSLVSMKAPKGVSETDEDSCVIDIKCSSRKGFYENSQGERIC 663
            EY N DNRKLEA   ESGLSLVSMK PKGVSETD++SCVIDIKCSSRK FYE+S+GERIC
Sbjct: 102  EYANEDNRKLEAIVDESGLSLVSMKTPKGVSETDKNSCVIDIKCSSRKEFYESSEGERIC 161

Query: 662  RICHLASGQPLESDDATTVSGTANSATSADLIQLGCACKDELGIAHTHCAEAWFKLKGNR 483
            RICHL SGQ L   +ATTV GT  SATS DLIQLGCACKDELGIAH HCAEAWFKLKGNR
Sbjct: 162  RICHLTSGQSL---NATTV-GTVESATSEDLIQLGCACKDELGIAHGHCAEAWFKLKGNR 217

Query: 482  LCEICGETAKNVSGIANTGFMEEWNERRFMDNDSNSSHSFGGCWRGQPFCNFLMACLVIA 303
            LCEICGE AKNVSG+ +  FM+EWNERRF+D D NSSH    CWRGQPFCNFLMACLVIA
Sbjct: 218  LCEICGEAAKNVSGVTSNAFMDEWNERRFVDIDGNSSHRVVRCWRGQPFCNFLMACLVIA 277

Query: 302  FVLPWFFRVNMF 267
            FVLPWFFRVNMF
Sbjct: 278  FVLPWFFRVNMF 289


>ref|XP_003524965.1| PREDICTED: uncharacterized protein LOC100791129 [Glycine max]
          Length = 310

 Score =  378 bits (970), Expect = e-102
 Identities = 203/316 (64%), Positives = 232/316 (73%), Gaps = 4/316 (1%)
 Frame = -2

Query: 1202 MDQGK--GGDEMLDPVHEVMGENLSRLVEGVAIDAVDGSSLGDRACINRQVGTSSEVDED 1029
            MDQ K  G  E+LD V E + E+ ++LVEG AIDA  G SLG+   ++ + GTS E+  D
Sbjct: 1    MDQAKSKGSGEILDHVDERVDEDFNQLVEGNAIDAGGGHSLGEGVNVSGEDGTSEEMCND 60

Query: 1028 SGPNNXXXXXXXXXXKGQDKPDQENICDELLGVVDQGTSYDSTNLVNQEVLETVVVIE-S 852
               N             Q+  D + I D L GV DQGTSY S NLV+ EV ET VV++ S
Sbjct: 61   LRLNRELKEGSQEVND-QNMGDPQEINDTLHGV-DQGTSYSSRNLVSWEVPETCVVVDPS 118

Query: 851  VQSEYVNGDNRKLEAKAQESGLSLVSMKAPKGVSETDEDSCVIDIKCSSRKGFYENSQGE 672
             Q E VNGDNRKLEAK  ESGL+ VSMK   GVSETD++SCVIDI C S  GF EN +GE
Sbjct: 119  TQIECVNGDNRKLEAKPNESGLNKVSMKVTNGVSETDKNSCVIDINCHSCDGFSENLEGE 178

Query: 671  RICRICHLASGQPLESDDATTVSGTANSATS-ADLIQLGCACKDELGIAHTHCAEAWFKL 495
             ICRICHLASGQPLE+ D     GTA+SAT+  DLIQLGCACKDELGI H+HCAEAWFKL
Sbjct: 179  MICRICHLASGQPLEAADV----GTASSATTNTDLIQLGCACKDELGIVHSHCAEAWFKL 234

Query: 494  KGNRLCEICGETAKNVSGIANTGFMEEWNERRFMDNDSNSSHSFGGCWRGQPFCNFLMAC 315
            KGNRLCEICGETAKNVS + + GF+EEWN+ RFMD+D+ SS  FGGCWRGQPFCNFLMAC
Sbjct: 235  KGNRLCEICGETAKNVSDVTDNGFIEEWNDTRFMDSDNTSSRRFGGCWRGQPFCNFLMAC 294

Query: 314  LVIAFVLPWFFRVNMF 267
            LVIAFVLPWFFRVNMF
Sbjct: 295  LVIAFVLPWFFRVNMF 310


>ref|XP_003547177.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18750,
            chloroplastic-like [Glycine max]
          Length = 1227

 Score =  375 bits (962), Expect = e-101
 Identities = 209/350 (59%), Positives = 237/350 (67%), Gaps = 53/350 (15%)
 Frame = -2

Query: 1202 MDQG--KGGDEMLDPVHEVMGENLSRLVEGVA--IDAVDGSSLGDRACINRQVGTSSEVD 1035
            MDQ   +G  E+L PV+E    NLS LVEGV   IDAVDG+SLG R  I+ + GTS EV 
Sbjct: 1    MDQNTSEGRGEVLGPVYEGEDGNLSNLVEGVVVVIDAVDGNSLGGRVSISEEAGTSGEVG 60

Query: 1034 EDSGPNNXXXXXXXXXXKGQDKPDQENICDELLGVVDQGTSYDSTNLVNQEVLETVVVIE 855
            +D G ++          K QD  DQENI D+ L  VDQGTSY+S +LVNQ V+ETVV IE
Sbjct: 61   KDFG-SDMELEEGIEEVKDQDNHDQENISDDKLLEVDQGTSYNSNHLVNQVVIETVVAIE 119

Query: 854  SVQSEYVNGDNRKLEAKAQESGLSLVSMKAPKGVSETDEDSCVIDIKCSSRKGFYENSQG 675
            SVQ++YV  DNRKLE K  ESGLS+VSMKAP+GVSETD+DS VIDIKCSS K  YE+S+G
Sbjct: 120  SVQTKYVTEDNRKLETKVDESGLSMVSMKAPEGVSETDKDSRVIDIKCSSCKKVYEDSEG 179

Query: 674  ERICRICHLASGQPLESDDATTVSGTANSATSADLIQLGCACKDELGIAHTHCAEAWFKL 495
            ER+CRICHL S   ++S D TTV GTA+SATSADLIQLGCACKDELGIAH HCAEAWFKL
Sbjct: 180  ERVCRICHLTS---VQSSDETTV-GTASSATSADLIQLGCACKDELGIAHVHCAEAWFKL 235

Query: 494  KGNR-------------------------------------------------LCEICGE 462
            KGNR                                                 LCEICGE
Sbjct: 236  KGNRELVSVAHCYLPWIGTSLVGEQLASLLHSDIAVQCISVAISLFGIPIVKELCEICGE 295

Query: 461  TAKNVSGIANTGFMEEWNERRFMDNDSNSSHSFGGCWRGQPFCNFLMACL 312
            TA+NVSG+ N GFME+WNERRFMD+D NSSH FGGCWRGQPFCNFLMACL
Sbjct: 296  TAENVSGVTNYGFMEKWNERRFMDDDGNSSHRFGGCWRGQPFCNFLMACL 345


>ref|XP_003527782.1| PREDICTED: uncharacterized protein LOC100785323 [Glycine max]
          Length = 258

 Score =  346 bits (887), Expect = 1e-92
 Identities = 192/284 (67%), Positives = 213/284 (75%)
 Frame = -2

Query: 1202 MDQGKGGDEMLDPVHEVMGENLSRLVEGVAIDAVDGSSLGDRACINRQVGTSSEVDEDSG 1023
            MDQ +G  E+LDPVH+    NLS  VEGVAI          RA I+ +VG   EVDED G
Sbjct: 1    MDQHRG--EVLDPVHD---RNLSHSVEGVAIA---------RASISGEVG---EVDEDLG 43

Query: 1022 PNNXXXXXXXXXXKGQDKPDQENICDELLGVVDQGTSYDSTNLVNQEVLETVVVIESVQS 843
             +            GQDK DQENI D+L GV DQGTSY+S NLVNQEV+ETVVVIESVQ+
Sbjct: 44   SDKELKDDVK----GQDKHDQENISDKLPGV-DQGTSYNSNNLVNQEVIETVVVIESVQT 98

Query: 842  EYVNGDNRKLEAKAQESGLSLVSMKAPKGVSETDEDSCVIDIKCSSRKGFYENSQGERIC 663
            EY N DN KLEAK  ESGLSLVSMKAPKGVSETD++SCVIDIKCSSRK  Y++S+GERIC
Sbjct: 99   EYANEDNTKLEAKVDESGLSLVSMKAPKGVSETDKNSCVIDIKCSSRKKIYKSSEGERIC 158

Query: 662  RICHLASGQPLESDDATTVSGTANSATSADLIQLGCACKDELGIAHTHCAEAWFKLKGNR 483
            RICHL SGQ   S DATTV GT++SATSADLIQLGCACK + GIAH HCA AWFKLKGN 
Sbjct: 159  RICHLTSGQ---SSDATTV-GTSDSATSADLIQLGCACKGKPGIAHVHCALAWFKLKGNM 214

Query: 482  LCEICGETAKNVSGIANTGFMEEWNERRFMDNDSNSSHSFGGCW 351
            LCEICGE AKNVSG+   GFMEEWNERR MD + N+SH   GCW
Sbjct: 215  LCEICGEAAKNVSGVTINGFMEEWNERRLMDTEGNASHRVVGCW 258


>gb|ACU20003.1| unknown [Glycine max]
          Length = 254

 Score =  344 bits (882), Expect = 5e-92
 Identities = 173/239 (72%), Positives = 193/239 (80%), Gaps = 2/239 (0%)
 Frame = -2

Query: 977 QDKPDQENICDELLGVVDQGTSYDSTNLVNQEVLETVVVIE-SVQSEYVNGDNRKLEAKA 801
           Q+  D + I D L GV DQGTSY S NLV+ EV ET VV++ S Q E VNGDNRKLEAK 
Sbjct: 21  QNMGDPQEINDTLHGV-DQGTSYSSRNLVSWEVPETCVVVDPSTQIECVNGDNRKLEAKP 79

Query: 800 QESGLSLVSMKAPKGVSETDEDSCVIDIKCSSRKGFYENSQGERICRICHLASGQPLESD 621
            ESGL+ VSMK   GVSETD++SCVIDI C S  GF EN +GE ICR+CHLASGQPLE+ 
Sbjct: 80  NESGLNKVSMKVTNGVSETDKNSCVIDINCHSCDGFSENLEGEMICRVCHLASGQPLEAA 139

Query: 620 DATTVSGTANSATS-ADLIQLGCACKDELGIAHTHCAEAWFKLKGNRLCEICGETAKNVS 444
           D     GTA+SAT+  DLIQLGCACKDELGI H+HCAEAWFKLKGNRLCEICGETAKNVS
Sbjct: 140 DV----GTASSATTNTDLIQLGCACKDELGIVHSHCAEAWFKLKGNRLCEICGETAKNVS 195

Query: 443 GIANTGFMEEWNERRFMDNDSNSSHSFGGCWRGQPFCNFLMACLVIAFVLPWFFRVNMF 267
            + + GF+EEWN+ RFMD+D+ SS  FGGCWRGQPFCNFLMACLVIAFVLPWFFRVNMF
Sbjct: 196 DVTDNGFIEEWNDTRFMDSDNTSSRRFGGCWRGQPFCNFLMACLVIAFVLPWFFRVNMF 254

