BLASTX nr result

ID: Glycyrrhiza23_contig00003818 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00003818
         (1477 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003520278.1| PREDICTED: uncharacterized protein LOC100814...   659   0.0  
ref|XP_003534779.1| PREDICTED: uncharacterized protein LOC100796...   646   0.0  
ref|XP_002282555.1| PREDICTED: uncharacterized protein LOC100251...   604   e-170
ref|XP_002522160.1| conserved hypothetical protein [Ricinus comm...   592   e-166
ref|XP_004144331.1| PREDICTED: uncharacterized protein LOC101219...   582   e-164

>ref|XP_003520278.1| PREDICTED: uncharacterized protein LOC100814636 [Glycine max]
          Length = 490

 Score =  659 bits (1701), Expect = 0.0
 Identities = 331/399 (82%), Positives = 347/399 (86%), Gaps = 2/399 (0%)
 Frame = +3

Query: 39   IAMAFHRSSKVGKPKAPRSAIVVLGVAVTAIALLFLLXXXXXXXXXXXXXXXXXXHVEVE 218
            I MAFHR+SK  KPK+ RS I++L  AVTAIALLFL                       E
Sbjct: 99   IQMAFHRASKA-KPKSARSRILLLAAAVTAIALLFLFSSLLSTTVSKANLLQRRQQQPFE 157

Query: 219  VGHSEKKYVYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEAIFLRRTFVMPSRMCINPI 398
                  KY+YWGTRIDCPGKHCGSCEGLGHQESSLRCALEEA+FL RTFVMPSRMCINPI
Sbjct: 158  ------KYLYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEALFLGRTFVMPSRMCINPI 211

Query: 399  HNKKGILHR--NATSEEQWAASSCAMDSLYDLELISDTVPVILDNSKEWYRVLSTGMKLG 572
            HNKKGILH   NA+SEE W ASSCAMDSLYD EL+SDTVPVILDNSKEWYRVLST MKLG
Sbjct: 212  HNKKGILHHSTNASSEELWDASSCAMDSLYDTELMSDTVPVILDNSKEWYRVLSTSMKLG 271

Query: 573  ARGVAHVAGVSRVELKENNRYSDLLLINRTASPLSWFMECKDRNNRSSIMLPYSFLPSMA 752
            ARGVAHV GVSR ELKEN+RYSDLLLINRTASPLSWFMECKDRNNRS+IMLPYSFLPSMA
Sbjct: 272  ARGVAHVEGVSRFELKENSRYSDLLLINRTASPLSWFMECKDRNNRSAIMLPYSFLPSMA 331

Query: 753  AKKLRDVAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEFILCRIA 932
            A KLRD AEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEF+LCRIA
Sbjct: 332  AGKLRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEFMLCRIA 391

Query: 933  KWVPPGRTLFIASNERTPGFFSPLSVRYRLAYSSNYSHMLDPMIENNYQLFMIERLILMG 1112
            KWVPPGRTLFIASNERTPGFFSPLS RYRLAYSSNYSH+LDP+IENNYQLFMIERLI+MG
Sbjct: 392  KWVPPGRTLFIASNERTPGFFSPLSARYRLAYSSNYSHILDPLIENNYQLFMIERLIMMG 451

Query: 1113 AKTFIRTFKEDKTDLSLTDDPKKNTKAWQIPVYHADETC 1229
             KTFIRTFKED+TDLSLTDDPKKNTK WQIPVY+ DETC
Sbjct: 452  GKTFIRTFKEDETDLSLTDDPKKNTKLWQIPVYNVDETC 490


>ref|XP_003534779.1| PREDICTED: uncharacterized protein LOC100796131 [Glycine max]
          Length = 385

 Score =  646 bits (1667), Expect = 0.0
 Identities = 325/396 (82%), Positives = 342/396 (86%), Gaps = 1/396 (0%)
 Frame = +3

Query: 45   MAFHRSSKVGKPKAPRSAIVVLGVAVTAIALLFLLXXXXXXXXXXXXXXXXXXHVEVEVG 224
            MAFHR+SK  KPK+ RS I++L  AVTAIALL L                          
Sbjct: 1    MAFHRASKA-KPKSARSRILLLAAAVTAIALLLLFSSLLSTTVSKANLLQQPF------- 52

Query: 225  HSEKKYVYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEAIFLRRTFVMPSRMCINPIHN 404
               +KY+YWGTRIDCPGKHC SCEGLGHQESSLRCALEEA+FL RTFVMPS MCINPIHN
Sbjct: 53   ---EKYLYWGTRIDCPGKHCRSCEGLGHQESSLRCALEEALFLGRTFVMPSGMCINPIHN 109

Query: 405  KKGILHR-NATSEEQWAASSCAMDSLYDLELISDTVPVILDNSKEWYRVLSTGMKLGARG 581
            KKGILH  NA+SEE W ASSCAMDSLYD EL+SDTVPVILDNSKEWYRVLST MKLGARG
Sbjct: 110  KKGILHSTNASSEELWDASSCAMDSLYDTELMSDTVPVILDNSKEWYRVLSTSMKLGARG 169

Query: 582  VAHVAGVSRVELKENNRYSDLLLINRTASPLSWFMECKDRNNRSSIMLPYSFLPSMAAKK 761
            VAHV GVSR ELKEN+RYSDLLLINRTASPLSWFMECKDRNN S+IMLPYSFLPSMAA K
Sbjct: 170  VAHVGGVSRFELKENSRYSDLLLINRTASPLSWFMECKDRNNGSAIMLPYSFLPSMAAGK 229

Query: 762  LRDVAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEFILCRIAKWV 941
            LRD AEKIKALLGDYDAIHVRRGDKIKTRKDRFGV RSLHPHLDRDTRPEF+LCRIAKWV
Sbjct: 230  LRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVVRSLHPHLDRDTRPEFMLCRIAKWV 289

Query: 942  PPGRTLFIASNERTPGFFSPLSVRYRLAYSSNYSHMLDPMIENNYQLFMIERLILMGAKT 1121
            PPGRTLFIASNERTPGFFSPLS RYRLAYSSNYSH+LDP+IENNYQLFMIERLI+MGAKT
Sbjct: 290  PPGRTLFIASNERTPGFFSPLSARYRLAYSSNYSHILDPLIENNYQLFMIERLIMMGAKT 349

Query: 1122 FIRTFKEDKTDLSLTDDPKKNTKAWQIPVYHADETC 1229
            FIRTFKED+TDLSLTDDPKKNTK WQIPVY+ DETC
Sbjct: 350  FIRTFKEDETDLSLTDDPKKNTKLWQIPVYNVDETC 385


>ref|XP_002282555.1| PREDICTED: uncharacterized protein LOC100251727 [Vitis vinifera]
            gi|297741651|emb|CBI32783.3| unnamed protein product
            [Vitis vinifera]
          Length = 398

 Score =  604 bits (1557), Expect = e-170
 Identities = 304/397 (76%), Positives = 337/397 (84%), Gaps = 5/397 (1%)
 Frame = +3

Query: 45   MAFHRSSKVGKPKAPRSAIVVLGVAVTAIALLFLLXXXXXXXXXXXXXXXXXXHV---EV 215
            MA +++ +  KPK  RS I+ L +++ AIA L +L                  +      
Sbjct: 1    MAIYKTLRT-KPKPNRSPILFLVLSILAIAFLCILSSSISINGSLFRSEKTQLNTLKSRH 59

Query: 216  EVGHSEKKYVYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEAIFLRRTFVMPSRMCINP 395
             VGH  +KY+YWG +IDCPGKHC SCEGLGHQESSLRCALEEA+FL+RT VMPSRMCINP
Sbjct: 60   SVGH--EKYLYWGYKIDCPGKHCDSCEGLGHQESSLRCALEEALFLQRTLVMPSRMCINP 117

Query: 396  IHNKKGILHR--NATSEEQWAASSCAMDSLYDLELISDTVPVILDNSKEWYRVLSTGMKL 569
            IHNKKGILH   NATSEE WAA+SCAMDSLYDL+L+S+TVPVILDNSK WYRVLST MKL
Sbjct: 118  IHNKKGILHHSSNATSEEMWAANSCAMDSLYDLDLMSNTVPVILDNSKMWYRVLSTSMKL 177

Query: 570  GARGVAHVAGVSRVELKENNRYSDLLLINRTASPLSWFMECKDRNNRSSIMLPYSFLPSM 749
            GARGVAHVAGVSR+ L++N+ YS+LLLINRTASPLSWFMECKDRNNRS+IMLPYSFLPSM
Sbjct: 178  GARGVAHVAGVSRIALRDNSHYSNLLLINRTASPLSWFMECKDRNNRSAIMLPYSFLPSM 237

Query: 750  AAKKLRDVAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEFILCRI 929
            AAKKLRD A KIK LLGDYDA+HVRRGDKIKTRKDRFGV R+LHPHLDRDTRPEFIL RI
Sbjct: 238  AAKKLRDAAGKIKVLLGDYDAMHVRRGDKIKTRKDRFGVDRTLHPHLDRDTRPEFILRRI 297

Query: 930  AKWVPPGRTLFIASNERTPGFFSPLSVRYRLAYSSNYSHMLDPMIENNYQLFMIERLILM 1109
             KWVPPGRTLFIASNERTPGFFSPLS RYRLAYSSNYS +LDP++ENNYQLFMIERLILM
Sbjct: 298  EKWVPPGRTLFIASNERTPGFFSPLSARYRLAYSSNYSKILDPLVENNYQLFMIERLILM 357

Query: 1110 GAKTFIRTFKEDKTDLSLTDDPKKNTKAWQIPVYHAD 1220
            GAKT+IRTFKED+T LSLTDDPKKNTKAWQIPVY +D
Sbjct: 358  GAKTYIRTFKEDETYLSLTDDPKKNTKAWQIPVYTSD 394


>ref|XP_002522160.1| conserved hypothetical protein [Ricinus communis]
            gi|223538598|gb|EEF40201.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 405

 Score =  592 bits (1525), Expect = e-166
 Identities = 295/400 (73%), Positives = 333/400 (83%), Gaps = 15/400 (3%)
 Frame = +3

Query: 69   VGKPKAPR----SAIVVLGVAVTAIALLFLLXXXXXXXXXXXXXXXXXXHVEVEVGHSEK 236
            + KP  P+    S I+   + +++IALLFL                   ++ V++    K
Sbjct: 4    INKPLRPKPKLSSPILTTVICISSIALLFLFSSLISTNGFSFSSPNTH-NIIVDIIRPTK 62

Query: 237  ---------KYVYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEAIFLRRTFVMPSRMCI 389
                     KY+YWG RIDCPGKHC SCEGLGHQESSLRCALEEAIFL RTFVMPS MCI
Sbjct: 63   PTRPHSLHDKYLYWGNRIDCPGKHCDSCEGLGHQESSLRCALEEAIFLNRTFVMPSAMCI 122

Query: 390  NPIHNKKGILHR--NATSEEQWAASSCAMDSLYDLELISDTVPVILDNSKEWYRVLSTGM 563
            NPIHNKKGILH   N+++EE+WAA+SCAMDSLYD++LIS+T+PVILDNSK WY+VLST M
Sbjct: 123  NPIHNKKGILHHSSNSSAEERWAANSCAMDSLYDIDLISETIPVILDNSKIWYQVLSTSM 182

Query: 564  KLGARGVAHVAGVSRVELKENNRYSDLLLINRTASPLSWFMECKDRNNRSSIMLPYSFLP 743
            KLG RG+AHV GVSRV+LKEN+ YS+LLLINRTASPLSWFMECKDRNNRS+I+LPYSFLP
Sbjct: 183  KLGDRGIAHVEGVSRVDLKENSGYSNLLLINRTASPLSWFMECKDRNNRSAILLPYSFLP 242

Query: 744  SMAAKKLRDVAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPEFILC 923
            SMAA+KLRD A+KIK LLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRP+F+L 
Sbjct: 243  SMAAEKLRDAADKIKTLLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPDFMLL 302

Query: 924  RIAKWVPPGRTLFIASNERTPGFFSPLSVRYRLAYSSNYSHMLDPMIENNYQLFMIERLI 1103
            RI KWVPPGRTLFIASNE+TPGFFSPLSVRY+LAYS NYS +LDP+IENNYQLFMIERLI
Sbjct: 303  RIEKWVPPGRTLFIASNEKTPGFFSPLSVRYKLAYSLNYSWILDPLIENNYQLFMIERLI 362

Query: 1104 LMGAKTFIRTFKEDKTDLSLTDDPKKNTKAWQIPVYHADE 1223
            +MGA+TFIRTFKED TDLSLTDDPKKNTK WQIPVY  DE
Sbjct: 363  MMGARTFIRTFKEDDTDLSLTDDPKKNTKKWQIPVYTMDE 402


>ref|XP_004144331.1| PREDICTED: uncharacterized protein LOC101219097 [Cucumis sativus]
          Length = 407

 Score =  582 bits (1501), Expect = e-164
 Identities = 293/407 (71%), Positives = 331/407 (81%), Gaps = 14/407 (3%)
 Frame = +3

Query: 45   MAFHRSSKVGKPKAPRSAIVVLGVAVTAIALLFLLXXXXXXXXXXXXXXXXXXHVEVEVG 224
            MAF R+ K  KPK PRS ++   V+++AIA LFL                        + 
Sbjct: 1    MAFPRTQKP-KPK-PRSPLIFFFVSLSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFRLK 58

Query: 225  HSEKK-------------YVYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEAIFLRRTF 365
            +  +K             ++YWG RIDCPGKHC SCEGLGHQESSLRCALEEA+FL+RTF
Sbjct: 59   NLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQRTF 118

Query: 366  VMPSRMCINPIHNKKGILHR-NATSEEQWAASSCAMDSLYDLELISDTVPVILDNSKEWY 542
            VMPSRMCINPIHNKKG+LH+ N++SEE W A+SCAMDSLYD++LISDTVPVILDNSK WY
Sbjct: 119  VMPSRMCINPIHNKKGLLHQSNSSSEESWEANSCAMDSLYDMDLISDTVPVILDNSKSWY 178

Query: 543  RVLSTGMKLGARGVAHVAGVSRVELKENNRYSDLLLINRTASPLSWFMECKDRNNRSSIM 722
            +VLSTGMKLGAR V HV  VSR+EL++++RYS+LLLINRTASPLSWFMECKDRNN S++M
Sbjct: 179  QVLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSWFMECKDRNNHSAVM 238

Query: 723  LPYSFLPSMAAKKLRDVAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDT 902
            LPY FLPSMAA+ LRD AEKIK LLGDYDAIHVRRGDKIKTRKDRFGV RSLHPHLDRDT
Sbjct: 239  LPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVDRSLHPHLDRDT 298

Query: 903  RPEFILCRIAKWVPPGRTLFIASNERTPGFFSPLSVRYRLAYSSNYSHMLDPMIENNYQL 1082
            RPEF+L RIAKWVP GRTLFIASNER PGFFSPLS RY+LAYSSNYS +LDP+++NNYQL
Sbjct: 299  RPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSARYKLAYSSNYSDILDPVVQNNYQL 358

Query: 1083 FMIERLILMGAKTFIRTFKEDKTDLSLTDDPKKNTKAWQIPVYHADE 1223
            FMIERLI+ GAKT IRTFKED TDLSLTDDPKKNTKAWQIPVY  +E
Sbjct: 359  FMIERLIMAGAKTLIRTFKEDDTDLSLTDDPKKNTKAWQIPVYTDEE 405


Top