BLASTX nr result

ID: Glycyrrhiza24_contig00024242

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00024242
         (1628 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003517770.1| PREDICTED: transcription factor IIIB 50 kDa ...   703   0.0  
ref|XP_003594604.1| hypothetical protein MTR_2g031410 [Medicago ...   676   0.0  
ref|XP_004149065.1| PREDICTED: uncharacterized protein LOC101208...   512   e-143
ref|XP_002517218.1| conserved hypothetical protein [Ricinus comm...   494   e-137
ref|XP_002275055.1| PREDICTED: uncharacterized protein LOC100260...   487   e-135

>ref|XP_003517770.1| PREDICTED: transcription factor IIIB 50 kDa subunit-like [Glycine
            max]
          Length = 536

 Score =  703 bits (1815), Expect = 0.0
 Identities = 354/536 (66%), Positives = 415/536 (77%), Gaps = 1/536 (0%)
 Frame = +3

Query: 24   MSTSRSCSNCGNTSFIRXXXXXXXXXXXXXXLQEFDQFEAQIGGISGPQGTFIHLGTAGS 203
            MS S  C+ CG  SFIR              +Q+FDQF+AQIGGI GPQGTFIH+GTAGS
Sbjct: 1    MSKSPPCTYCGRASFIRDDISGELICSSCGGVQQFDQFDAQIGGIDGPQGTFIHVGTAGS 60

Query: 204  GSIYNYRDRKLFAARNSIDELTNRLGLGSKSVDITSMVSIITEGEFGQGDWFQVLIGACA 383
            GS+Y+YR+RKLFAA+N IDE+TN+LGL SKS D+ SM+S ITEGEFGQG+WF VLIGACA
Sbjct: 61   GSLYSYRERKLFAAQNLIDEVTNQLGLSSKSGDVRSMISTITEGEFGQGEWFHVLIGACA 120

Query: 384  YVVMRKDDRPLPMAEVASAVGCGVYELGRMILRVIDFLDLKRPDFPEFDIVHSLERTLKN 563
            YVVMRK+DRPLPMAEVASA+GC VYE+GRMILRV+DFL+L RPDFPEFDIVH LERT++N
Sbjct: 121  YVVMRKEDRPLPMAEVASAIGCDVYEIGRMILRVVDFLNL-RPDFPEFDIVHLLERTIRN 179

Query: 564  SPSFSAVERSQLDRMRKQGIFLIQCAVKWFLSTGRRXXXXXXXXXXXXXEMNQVEVRLDD 743
               F++VER  ++RMRKQG+FLIQCAVKW+LSTGRR             E+N V V +++
Sbjct: 180  CNGFASVERDLIERMRKQGVFLIQCAVKWYLSTGRRPVPLVVAVLVFVAELNGVGVGMEE 239

Query: 744  LAKEVHAMVSTCRTRYRELLDTLVKAAQVLPWGKDITTKNVVKNAPFVIQYMEKKSMSKH 923
            LAKEVHA VSTCR RY+ELL+TLVK AQVLPWGKD+T KN+VKNAP VIQYME+K+M K 
Sbjct: 240  LAKEVHAKVSTCRARYKELLETLVKVAQVLPWGKDVTVKNIVKNAPIVIQYMERKAMLKP 299

Query: 924  VEKRKNLDRPGLDLADVVTECLRQDDEYEEYGNDGLTSQKDPQYLSLGSNAYREGIQDVD 1103
             +KR+ LDR  +DL DVV ECLR D E+ EYG DG+T +KD QY SL SNA R G  D D
Sbjct: 300  GKKREGLDRAAVDLEDVVAECLRNDGEF-EYGVDGMTKRKDSQYFSLESNAGRVGGGDSD 358

Query: 1104 RLQISPECLSMLYEKFLNE-XXXXXXXXXXXXXXXXXLGLDLWDCREWWDGKSEMSKKLI 1280
             LQISPECLS+LY+KFL+E                  L  DL +CREWWDGKSE+SKKL+
Sbjct: 359  GLQISPECLSLLYKKFLDENCCIESLRGSGNAQKRRVLRFDLLECREWWDGKSELSKKLL 418

Query: 1281 LKHLLEKDVGVDTMPPSFVTGQLKCKMRREKIDAAKRRIKRITHPXXXXXXXXXXXXXXX 1460
            L  LLEKDVGVDTMPPSFV GQLKC+MRRE+I+AAK RIKRI HP               
Sbjct: 419  LNRLLEKDVGVDTMPPSFVNGQLKCEMRRERINAAKVRIKRIMHPSDADLGDAEIPCPLD 478

Query: 1461 TTYPERRKTKRRGMAVNDIDWEDLIIETLILHQVKEEEIEKGHYNTLLDLYVFNSG 1628
            ++YPERR+ KR+GM V+D+DWEDLIIETL+LH+VKEEEIEKGHYNTLLDL+VFNSG
Sbjct: 479  SSYPERRRKKRKGMVVDDVDWEDLIIETLVLHRVKEEEIEKGHYNTLLDLHVFNSG 534


>ref|XP_003594604.1| hypothetical protein MTR_2g031410 [Medicago truncatula]
            gi|355483652|gb|AES64855.1| hypothetical protein
            MTR_2g031410 [Medicago truncatula]
          Length = 532

 Score =  676 bits (1744), Expect = 0.0
 Identities = 343/535 (64%), Positives = 410/535 (76%)
 Frame = +3

Query: 24   MSTSRSCSNCGNTSFIRXXXXXXXXXXXXXXLQEFDQFEAQIGGISGPQGTFIHLGTAGS 203
            MSTSR C NC  TSF R               Q FDQFE    GI+GPQGTFIH+GT+GS
Sbjct: 1    MSTSRRCINCSKTSFTRDDETGGSFCSSCGAEQHFDQFETYTIGINGPQGTFIHIGTSGS 60

Query: 204  GSIYNYRDRKLFAARNSIDELTNRLGLGSKSVDITSMVSIITEGEFGQGDWFQVLIGACA 383
            G+IY+Y+DRKLF+ARNSI++ T +LGL SK ++I +M+S IT+GEFGQGDWFQVLIGAC 
Sbjct: 61   GTIYSYKDRKLFSARNSIEQFTIKLGLSSKKIEINTMISDITDGEFGQGDWFQVLIGACC 120

Query: 384  YVVMRKDDRPLPMAEVASAVGCGVYELGRMILRVIDFLDLKRPDFPEFDIVHSLERTLKN 563
            YVVMR+D+R L M EVA+AVGC V+ELG+M++RV+D+LDL+  DFP+FDIVH L+R++ +
Sbjct: 121  YVVMRRDERALSMNEVANAVGCDVFELGKMVIRVVDYLDLRGSDFPDFDIVHLLKRSVDS 180

Query: 564  SPSFSAVERSQLDRMRKQGIFLIQCAVKWFLSTGRRXXXXXXXXXXXXXEMNQVEVRLDD 743
               F  V+RS +DRM+KQG+FL+QCAVK FLSTGRR             E+N V++RL+D
Sbjct: 181  CRCFREVDRSLVDRMKKQGVFLLQCAVKLFLSTGRRPLPLVVAVLVLVAEINGVDIRLED 240

Query: 744  LAKEVHAMVSTCRTRYRELLDTLVKAAQVLPWGKDITTKNVVKNAPFVIQYMEKKSMSKH 923
            LAKEVHA+VSTCRTRYRELL+TLV  AQVLPWGKDIT KN++KNAPFVIQYMEKKSMSK 
Sbjct: 241  LAKEVHAIVSTCRTRYRELLETLVNVAQVLPWGKDITKKNIIKNAPFVIQYMEKKSMSKP 300

Query: 924  VEKRKNLDRPGLDLADVVTECLRQDDEYEEYGNDGLTSQKDPQYLSLGSNAYREGIQDVD 1103
            VEKRK++D+ GLDLADVV ECL Q+ +Y EYG DGL  +KD QY SL SN  REGI D +
Sbjct: 301  VEKRKDVDQTGLDLADVVDECLTQEGQY-EYGVDGLIHRKDSQYFSLQSNCDREGIVDDE 359

Query: 1104 RLQISPECLSMLYEKFLNEXXXXXXXXXXXXXXXXXLGLDLWDCREWWDGKSEMSKKLIL 1283
            RLQISPECLS++Y+KFLNE                 L LD    +EWW+ +SE+S+KLIL
Sbjct: 360  RLQISPECLSLMYDKFLNENRDAMSSRSANVQKRKRLELDF---QEWWNEESELSRKLIL 416

Query: 1284 KHLLEKDVGVDTMPPSFVTGQLKCKMRREKIDAAKRRIKRITHPXXXXXXXXXXXXXXXT 1463
            K LLEKD+GV+TMPPSFV GQLKCKMRREKI+AAK+RIKRITHP               +
Sbjct: 417  KELLEKDIGVETMPPSFVNGQLKCKMRREKINAAKKRIKRITHPLHSDLGDTANLGILDS 476

Query: 1464 TYPERRKTKRRGMAVNDIDWEDLIIETLILHQVKEEEIEKGHYNTLLDLYVFNSG 1628
            T  ER+K KRRGMAV+ IDWEDLIIETL+LHQVK+EEIEKGHYNTLL LYVFNSG
Sbjct: 477  TCTERKK-KRRGMAVDGIDWEDLIIETLVLHQVKDEEIEKGHYNTLLGLYVFNSG 530


>ref|XP_004149065.1| PREDICTED: uncharacterized protein LOC101208099 [Cucumis sativus]
            gi|449509052|ref|XP_004163479.1| PREDICTED:
            uncharacterized protein LOC101223951 [Cucumis sativus]
          Length = 525

 Score =  512 bits (1319), Expect = e-143
 Identities = 268/536 (50%), Positives = 360/536 (67%), Gaps = 5/536 (0%)
 Frame = +3

Query: 33   SRSCSNCGNTSFIRXXXXXXXXXXXXXXLQEFDQFEAQIGGISGPQGTFIHLGTAGSGSI 212
            S SC NC + S  R              +QEFD ++AQ+GGI+GPQGTF+ +GT+GSGS+
Sbjct: 2    SGSCKNCHSRSIFRDDVSGNQICSSCGIVQEFDNYDAQLGGINGPQGTFVRVGTSGSGSV 61

Query: 213  YNYRDRKLFAARNSIDELTNRLGLG-SKSVDITSMVSIITEGEFGQGDWFQVLIGACAYV 389
             NY+D+K++ A+  I+++T RLG   SKS D+  +VS ITEGE+G GDWF +L+GACAYV
Sbjct: 62   LNYKDKKIYEAQKVIEDITFRLGFSASKSNDVRILVSTITEGEYGLGDWFPILVGACAYV 121

Query: 390  VMRKDDRPLPMAEVASAVGCGVYELGRMILRVIDFLDLKRPDFPEFDIVHSLERTLKNSP 569
             MRKD RPL M+EVASAV C ++ELGRM++RV++FLDL+  +FP FDIV SLER  +NSP
Sbjct: 122  SMRKDSRPLSMSEVASAVECDLHELGRMVMRVVEFLDLRGSEFPVFDIVGSLERAARNSP 181

Query: 570  SFSAVERSQLDRMRKQGIFLIQCAVKWFLSTGRRXXXXXXXXXXXXXEMNQVEVRLDDLA 749
            SFS +E   L+R+ KQGIFL+QCA+KWFL+TGR+             ++N+V+V ++++ 
Sbjct: 182  SFSRLEADILERIVKQGIFLLQCAMKWFLTTGRQPLPMVAAVLVLVSKLNEVDVSIENVG 241

Query: 750  KEVHAMVSTCRTRYRELLDTLVKAAQVLPWGKDITTKNVVKNAPFVIQYMEKKSMSKHVE 929
             EVHA VSTC+ RYRELL+ LV+  + LPWGKDITTKN+VKNAPFVIQYME KSMSK   
Sbjct: 242  MEVHANVSTCKKRYRELLEALVEVGKKLPWGKDITTKNIVKNAPFVIQYMELKSMSKASG 301

Query: 930  KRKNLDRPGLDLADVVTECLRQDDEYEEYGNDGLTSQKDPQYLSLGSNAYRE--GIQDVD 1103
            K K+L+   +DL   V+ECLR++ EYE   ++    + D QY  L  + +++     + +
Sbjct: 302  KGKDLENVEIDLQSAVSECLRKELEYE---SEVYNLEDDSQYFELQRSRWQDESNRDNGN 358

Query: 1104 RLQISPECLSMLYEKFLNEXXXXXXXXXXXXXXXXXLG--LDLWDCREWWDGKSEMSKKL 1277
            RL IS ECLS++Y KFL+E                  G     +   EWW+GKSE+SKKL
Sbjct: 359  RLNISHECLSLIYNKFLDEMAELRSSGGINEVYGTKQGRKTGFYSSTEWWEGKSELSKKL 418

Query: 1278 ILKHLLEKDVGVDTMPPSFVTGQLKCKMRREKIDAAKRRIKRITHPXXXXXXXXXXXXXX 1457
            +L+ LLE D+G   +PPSFV+     + R+EK++AAK+RI+RI HP              
Sbjct: 419  LLQQLLETDIGSQGIPPSFVSSCNAYERRKEKVNAAKKRIQRIMHPSTAPADDVNI---- 474

Query: 1458 XTTYPERRKTKRRGMAVNDIDWEDLIIETLILHQVKEEEIEKGHYNTLLDLYVFNS 1625
                  ++K KR+G  V  I+WED+IIETL+LH V+EEEIEKGHY  LLDLYVF S
Sbjct: 475  ------KKKRKRKGADV--IEWEDIIIETLLLHGVQEEEIEKGHYKVLLDLYVFTS 522


>ref|XP_002517218.1| conserved hypothetical protein [Ricinus communis]
            gi|223543589|gb|EEF45118.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 532

 Score =  494 bits (1273), Expect = e-137
 Identities = 264/537 (49%), Positives = 348/537 (64%), Gaps = 8/537 (1%)
 Frame = +3

Query: 42   CSNCGNTSFIRXXXXXXXXXXXXXXLQEFDQFEAQIGGISGPQGTFIHLGTAGSGSIYNY 221
            C +CG+ S IR              +Q+FD +E   GG++GPQG F+ +GT+G+GS  NY
Sbjct: 3    CYSCGHRSLIRDDITGSLVCDSCGTVQKFDNYETHTGGVNGPQGVFVRVGTSGTGSTLNY 62

Query: 222  RDRKLFAARNSIDELTNRLGL-GSKSVDITSMVSIITEGEFGQGDWFQVLIGACAYVVMR 398
            +++K+F A   ID++  +L L G K  DI SM+  IT+GE+GQGDWF VLIGACAYVV+R
Sbjct: 63   KEKKIFEANKLIDDIAYKLNLLGQKVTDIKSMIDNITDGEYGQGDWFPVLIGACAYVVVR 122

Query: 399  KDDRP-LPMAEVASAVGCGVYELGRMILRVIDFLDLKRPDFPEFDIVHSLERTLKNSPSF 575
             +++  L +AE+   +GC VYELGRM+ RV+D L++K    PEFDIV S E+ ++N  + 
Sbjct: 123  NENKTTLSIAEIGDLIGCDVYELGRMVTRVVDHLNIK---LPEFDIVTSFEKVVRNLFNL 179

Query: 576  SAVERSQLDRMRKQGIFLIQCAVKWFLSTGRRXXXXXXXXXXXXXEMNQVE-VRLDDLAK 752
              VE  + +RMR+QG+FLIQC + WFL+TGRR             E+N VE VR++D+A+
Sbjct: 180  GRVESDKFERMREQGVFLIQCMINWFLTTGRRPLPIVAAVLVLVAELNGVEGVRIEDVAR 239

Query: 753  EVHAMVSTCRTRYRELLDTLVKAAQVLPWGKDITTKNVVKNAPFVIQYMEKKSMSKHVEK 932
            EVHA VSTC+ RY+ELL+ LVK AQVLPWGKD+T KNVVKN PFV++YME KSM K    
Sbjct: 240  EVHAAVSTCKLRYKELLEALVKVAQVLPWGKDVTVKNVVKNGPFVLRYMEMKSMEKCDGH 299

Query: 933  RKNLDRPGLDLADVVTECLRQDDEYEEYGNDGLTSQ-KDPQYLSL--GSNAYREGIQDVD 1103
            RK L   G DL +VV++CLR+DD   EYG +  + +  D +Y  +  GS   + G   + 
Sbjct: 300  RKGLHYGGFDLGEVVSQCLRKDD--VEYGVEEKSVECGDSRYFEVETGSELSKMGDDGMK 357

Query: 1104 RLQISPECLSMLYEKFLNE--XXXXXXXXXXXXXXXXXLGLDLWDCREWWDGKSEMSKKL 1277
            +LQ+S ECLSM+Y KFLNE                      +L+   +WW+GKSE+SKK+
Sbjct: 358  KLQLSHECLSMVYNKFLNEASCGKYKEEIGRAYRRKSKRAFELF-ATDWWNGKSELSKKI 416

Query: 1278 ILKHLLEKDVGVDTMPPSFVTGQLKCKMRREKIDAAKRRIKRITHPXXXXXXXXXXXXXX 1457
             LK +LEKDVG+D MPPSFV G +  + RR KI+AAK RI+RI HP              
Sbjct: 417  FLKQILEKDVGLDLMPPSFVNGCVVVERRRAKINAAKLRIERIVHPWTADSGDCSDIDIL 476

Query: 1458 XTTYPERRKTKRRGMAVNDIDWEDLIIETLILHQVKEEEIEKGHYNTLLDLYVFNSG 1628
               +  +RK K        IDWED +IETL+LHQVKEEEIEKGHYNTLLDL+VFNSG
Sbjct: 477  QDLHTNKRKRK---TPAKGIDWEDFVIETLLLHQVKEEEIEKGHYNTLLDLHVFNSG 530


>ref|XP_002275055.1| PREDICTED: uncharacterized protein LOC100260157 [Vitis vinifera]
            gi|297745306|emb|CBI40386.3| unnamed protein product
            [Vitis vinifera]
          Length = 535

 Score =  487 bits (1254), Expect = e-135
 Identities = 264/545 (48%), Positives = 350/545 (64%), Gaps = 10/545 (1%)
 Frame = +3

Query: 24   MSTSRSCSNCGNTSFIRXXXXXXXXXXXXXXLQEFDQFEAQIGGISGPQGTFIHLGTAGS 203
            M  S SC  C   S IR              +Q FD ++ Q+GG++GPQGTF+ +GTAG+
Sbjct: 1    MEASGSCKGCKKNSLIRDDVTGSLVCSSCGLIQPFDNYDPQLGGLNGPQGTFVRVGTAGT 60

Query: 204  GSIYNYRDRKLFAARNSIDELTNRLGLGS-KSVDITSMVSIITEGEFGQGDWFQVLIGAC 380
            GS  NY+D+K+F A+  ID+L  +LG  S KS ++ +MVS ITEGEFGQGDWF VL+GAC
Sbjct: 61   GSSLNYKDKKIFEAQKLIDDLMFKLGFSSEKSNEVRTMVSTITEGEFGQGDWFPVLVGAC 120

Query: 381  AYVVMRKDDRPLPMAEVASAVGCGVYELGRMILRVIDFLDLKRPDFPEFDIVHSLERTLK 560
            +YVV R+ +R LP+AEV + +GC VYELGRMI RV++FL+LK    PE DIV+SLE   +
Sbjct: 121  SYVVRRRSNRALPIAEVGAVIGCDVYELGRMIGRVVEFLNLK---LPELDIVNSLELAFR 177

Query: 561  NSPSFSAVERSQLDRMRKQGIFLIQCAVKWFLSTGRRXXXXXXXXXXXXXEMNQVEVRLD 740
               S + V + ++D+M KQG FL+Q AVKWFL+TGRR             E+NQV+VR++
Sbjct: 178  KCGSLNRVSKDKVDQMLKQGKFLVQWAVKWFLTTGRRPLPMIAAVLMFVAELNQVDVRIE 237

Query: 741  DLAKEVHAMVSTCRTRYRELLDTLVKAAQVLPWGKDITTKNVVKNAPFVIQYMEKKSMS- 917
            ++A E+HA V+T R RY+EL + LVK AQ LPWG D++TKN+VKNAPFVIQYME K  S 
Sbjct: 238  NIANEIHAGVATSRLRYKELSEALVKVAQSLPWGSDVSTKNIVKNAPFVIQYMEMKLRSQ 297

Query: 918  ---KHVEKRKNLDRPGLDLADVVTECLRQDDEY--EEYGNDGLTSQKDPQYLSLGSNAYR 1082
               K  + +KNL+R G DL  VV+ECL+++ +Y  E Y  +   +  D   L L      
Sbjct: 298  LSGKPRKGKKNLERIGFDLDSVVSECLKKEFDYVSEGYSIENGAAADDRNGLLL------ 351

Query: 1083 EGIQDVDRLQISPECLSMLYEKFLNEXXXXXXXXXXXXXXXXXLGLDLWD---CREWWDG 1253
              I D D+L++S E LS++Y KF+NE                      +D    + WW+G
Sbjct: 352  -DIDDSDKLKLSQESLSLMYFKFVNEGSRVNPMGDDGGDNRRRKRRREYDPPVIKNWWNG 410

Query: 1254 KSEMSKKLILKHLLEKDVGVDTMPPSFVTGQLKCKMRREKIDAAKRRIKRITHPXXXXXX 1433
            KS+ SKKL+LK +LEKDVG++ +PPSFV+G L  + RREKI+AAK RIK++ +P      
Sbjct: 411  KSDKSKKLLLKQILEKDVGLNALPPSFVSGCLAYERRREKINAAKLRIKKVMYPSNTSSD 470

Query: 1434 XXXXXXXXXTTYPERRKTKRRGMAVNDIDWEDLIIETLILHQVKEEEIEKGHYNTLLDLY 1613
                       +    K +RR  A  DIDWED  IETL+LH VKE+EIEKGHYNTLLDL+
Sbjct: 471  DTDDFSAKEQEHLHAEKKRRRAKA--DIDWEDFAIETLLLHHVKEDEIEKGHYNTLLDLH 528

Query: 1614 VFNSG 1628
            VFNSG
Sbjct: 529  VFNSG 533

