BLASTX nr result

ID: Glycyrrhiza24_contig00004007 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00004007
         (1395 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003522290.1| PREDICTED: uncharacterized protein LOC100787...   630   e-178
ref|XP_003528229.1| PREDICTED: uncharacterized protein LOC100805...   592   e-167
ref|XP_003525442.1| PREDICTED: uncharacterized protein LOC100775...   461   e-127
ref|XP_002328635.1| predicted protein [Populus trichocarpa] gi|2...   449   e-124
ref|XP_002514640.1| conserved hypothetical protein [Ricinus comm...   449   e-123

>ref|XP_003522290.1| PREDICTED: uncharacterized protein LOC100787391 [Glycine max]
          Length = 1247

 Score =  630 bits (1626), Expect = e-178
 Identities = 337/429 (78%), Positives = 362/429 (84%), Gaps = 2/429 (0%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQSGKKAT+SGSPGG+ R K LN  H DEQLSEISNE RSLSFQGDEISLQS+
Sbjct: 821  SDSNKPRRQSGKKATESGSPGGRQRPKSLNVPHGDEQLSEISNEPRSLSFQGDEISLQSN 880

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            S+TVNSKMDMEVTSSL++ EI+DSQSPSLKA+KQL+SETVQKKSTPRLDEDET+AELAT 
Sbjct: 881  SLTVNSKMDMEVTSSLQTVEIDDSQSPSLKAVKQLISETVQKKSTPRLDEDETVAELATD 940

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
             PEHPSPISVLD GSVYRDD+ SPVKQIS   K +DAQES+ENE+KDQW PADSLSF+ T
Sbjct: 941  TPEHPSPISVLD-GSVYRDDMPSPVKQISEDSKGEDAQESKENEIKDQWNPADSLSFNCT 999

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
            GS EINRKKLQ+IDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEI        
Sbjct: 1000 GSLEINRKKLQNIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEILLASGLLL 1059

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQLHS G+PINPELFLVLEQT           SPGK A +K N EK HRKL
Sbjct: 1060 RDLSSELLTFQLHSSGHPINPELFLVLEQTKASSLLSKEESSPGKDANMKLNKEKFHRKL 1119

Query: 494  IFDAVNEILGAKLGSSPEPWFQP--NRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXX 321
            IFD+VNEILGAK GSSPEP FQP  NRLTKKTLSAQKLLKELCFEIEK QAK+PECCL  
Sbjct: 1120 IFDSVNEILGAKFGSSPEPCFQPNSNRLTKKTLSAQKLLKELCFEIEKIQAKKPECCL-E 1178

Query: 320  XXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSS 141
               DGLK ML EDVMHGSESW +F G LPGVVLDVERL+FKDLVDE+VIGE++GLRVK S
Sbjct: 1179 DDHDGLKNMLCEDVMHGSESWTDFHGYLPGVVLDVERLLFKDLVDEVVIGESSGLRVKPS 1238

Query: 140  VRRRKLFGK 114
            VRRRKLFGK
Sbjct: 1239 VRRRKLFGK 1247


>ref|XP_003528229.1| PREDICTED: uncharacterized protein LOC100805643 [Glycine max]
          Length = 1092

 Score =  592 bits (1526), Expect = e-167
 Identities = 321/429 (74%), Positives = 348/429 (81%), Gaps = 2/429 (0%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQSGKKAT+ GSPGG+ R K LN  H DEQLSEISNESRSLS QGD +SLQSD
Sbjct: 674  SDSNKPRRQSGKKATELGSPGGRQRPKSLNLPHGDEQLSEISNESRSLSCQGDGVSLQSD 733

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            S+TVNSKMDMEVTSSLR+ EI+DS+SPSLKA K+L+SETVQKKSTPRLDE+ET+AELAT 
Sbjct: 734  SLTVNSKMDMEVTSSLRTVEIDDSRSPSLKAAKRLISETVQKKSTPRLDEEETVAELATD 793

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            APEHPSPISVLD GSVYRDDV SPVKQIS        ++S+ENE+KDQW P DSLSF+ST
Sbjct: 794  APEHPSPISVLD-GSVYRDDVPSPVKQIS--------EDSKENEIKDQWNPEDSLSFNST 844

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
            G  EINRKKLQ+I+HLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEI        
Sbjct: 845  GPLEINRKKLQNINHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEILLASGLLL 904

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQLHS  +PINPELFLVLEQT            PGK A  K N EK HRKL
Sbjct: 905  RDLSSELLTFQLHSSVHPINPELFLVLEQTKASSLLSKEESIPGKDANSKLNKEKFHRKL 964

Query: 494  IFDAVNEILGAKLGSSPEPWFQP--NRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXX 321
            IFD+VNEILGAK  SSPEPW QP  NRLTKKTLSAQKLLKELCFEIEK QAK+ EC L  
Sbjct: 965  IFDSVNEILGAKFSSSPEPWIQPNSNRLTKKTLSAQKLLKELCFEIEKIQAKKTECSL-E 1023

Query: 320  XXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSS 141
               DGLK +L EDV+HGSESW +F G LPGVVLDVERL+FKDLVDE+VIGE+ GLRVKS 
Sbjct: 1024 EEDDGLKNILCEDVLHGSESWTDFHGYLPGVVLDVERLIFKDLVDEVVIGESTGLRVKSL 1083

Query: 140  VRRRKLFGK 114
            VRRRKLFGK
Sbjct: 1084 VRRRKLFGK 1092


>ref|XP_003525442.1| PREDICTED: uncharacterized protein LOC100775311 [Glycine max]
          Length = 1051

 Score =  461 bits (1187), Expect = e-127
 Identities = 263/421 (62%), Positives = 302/421 (71%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSN PRRQS K+ T+SGSP  K+R KV NS +SD++LSE SNE RSLS Q DEISLQSD
Sbjct: 637  SDSNNPRRQSCKQTTESGSPSRKLRPKVANSWYSDDRLSETSNELRSLSSQWDEISLQSD 696

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
            SITV+SKMD+EVTSSL+S +  DSQ  S+KA + LVS +  KKST R DEDE+IAE AT 
Sbjct: 697  SITVDSKMDIEVTSSLQSDDTIDSQFRSMKANEHLVSGSTHKKSTLRWDEDESIAEPATD 756

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            A +HPS  SV D  SVY+ D+ SPVK  S  PKAD+ QE + N+  D W PAD    ++T
Sbjct: 757  ASDHPSLDSV-DDVSVYKYDMPSPVKSKSNAPKADNGQEYKANDNTDHWNPADGFFVNNT 815

Query: 854  GSGEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXXXXX 675
                INRKK QS+D L+QKLR+LNSSHDE RIDYIASLCENTNPDHRYI+EI        
Sbjct: 816  ----INRKKFQSVDCLIQKLRQLNSSHDETRIDYIASLCENTNPDHRYIAEILLASGLLL 871

Query: 674  XXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLHRKL 495
                   LTFQ HS G+PINPELFLVLEQT           S GKVA+++ NTEK HRKL
Sbjct: 872  RALSSELLTFQHHSSGHPINPELFLVLEQTKLSSLLSKDESSFGKVAYMRLNTEKWHRKL 931

Query: 494  IFDAVNEILGAKLGSSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPECCLXXXX 315
            IFDAVNEILG KLGS  EP  +PN L  K +SAQKLLKELCFE++K Q  +P+C L    
Sbjct: 932  IFDAVNEILGEKLGSFVEPCLKPNGLATKFVSAQKLLKELCFEVQKLQYVKPDCSL-EDE 990

Query: 314  XDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLRVKSSVR 135
             DGLK ML EDVM  SE+W  F GELPGVVLDVERL+FKDL+DE VI E A LRVK S  
Sbjct: 991  GDGLKSMLREDVMCHSENWTGFPGELPGVVLDVERLIFKDLIDEFVIDEMASLRVKFSKH 1050

Query: 134  R 132
            R
Sbjct: 1051 R 1051


>ref|XP_002328635.1| predicted protein [Populus trichocarpa] gi|222838811|gb|EEE77162.1|
            predicted protein [Populus trichocarpa]
          Length = 1027

 Score =  449 bits (1155), Expect = e-124
 Identities = 255/433 (58%), Positives = 298/433 (68%), Gaps = 6/433 (1%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SD++K R QS ++ T+ GSPG K R K      SD+QLS+ISNESR+ S QGD+ISLQSD
Sbjct: 598  SDTSKQRTQSNRQPTEIGSPGRKHRVKYPKVPPSDDQLSQISNESRTSSHQGDDISLQSD 657

Query: 1214 SITVNSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELATV 1035
              T + K DMEVTS+ RS +    QSP+L A  +LVS ++QKKST   +ED T AELA V
Sbjct: 658  GTTFDLKTDMEVTSTERSTDNYSGQSPTLNAASRLVSGSLQKKSTFMFEEDRTSAELAVV 717

Query: 1034 APEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSST 855
            APEHPSP+SVLD  SVYRDD  SPVKQ+  + K D  ++    + +DQW PAD+L  +S 
Sbjct: 718  APEHPSPVSVLD-ASVYRDDALSPVKQMPNLIKGDVPKDFHYQQSEDQWNPADNLLSNSV 776

Query: 854  GSG---EINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXXX 684
             SG   +INRKKLQ I++LVQKLR+LNS+HDE+  DYIASLCENTNPDHRYISEI     
Sbjct: 777  ASGLSSDINRKKLQKIENLVQKLRQLNSTHDESSTDYIASLCENTNPDHRYISEILLASG 836

Query: 683  XXXXXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKLH 504
                       TFQLH  G+PINPELF VLEQT           SPGK    K N EK H
Sbjct: 837  LLLRDLSSGLSTFQLHPSGHPINPELFFVLEQTKASNLVSKEECSPGKSFHSKPNPEKFH 896

Query: 503  RKLIFDAVNEILGAKLG---SSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPEC 333
            RKLIFDAVNEIL  KL     SPEPW + ++L KKTLSAQKLLKELC E+E+   K+ EC
Sbjct: 897  RKLIFDAVNEILVKKLALVEPSPEPWLKSDKLAKKTLSAQKLLKELCSEMEQLLVKKSEC 956

Query: 332  CLXXXXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGLR 153
             L     DGLK +L  DVMH SESW +F  E  GVVLDVERLVFKDLVDEIVIGEAAG+R
Sbjct: 957  SL--EEEDGLKSILCYDVMHRSESWIDFHSETSGVVLDVERLVFKDLVDEIVIGEAAGIR 1014

Query: 152  VKSSVRRRKLFGK 114
             K    RR+LFGK
Sbjct: 1015 TKPGRSRRQLFGK 1027


>ref|XP_002514640.1| conserved hypothetical protein [Ricinus communis]
            gi|223546244|gb|EEF47746.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1094

 Score =  449 bits (1154), Expect = e-123
 Identities = 256/434 (58%), Positives = 298/434 (68%), Gaps = 7/434 (1%)
 Frame = -2

Query: 1394 SDSNKPRRQSGKKATDSGSPGGKVRAKVLNSQHSDEQLSEISNESRSLSFQGDEISLQSD 1215
            SDSNKPRRQS K   + GSPGGK R K      SD+QLS+ISNESR+ S QGD+ISLQSD
Sbjct: 669  SDSNKPRRQSKKMLNELGSPGGKNRPKSHKLPTSDDQLSQISNESRTSSHQGDDISLQSD 728

Query: 1214 SITV-NSKMDMEVTSSLRSCEINDSQSPSLKAMKQLVSETVQKKSTPRLDEDETIAELAT 1038
            +  V + K DMEVTS+ +  E+N   SPS  A+  +VS + Q   TPRL+ED T+A+ A 
Sbjct: 729  NTVVFDLKTDMEVTSTEQPNELNIDHSPSSNAVSHVVSGSKQNNPTPRLEEDGTLADFAV 788

Query: 1037 VAPEHPSPISVLDGGSVYRDDVSSPVKQISRVPKADDAQESQENEVKDQWKPADSLSFSS 858
              PEHPSPISVLD  SVYRDD  SPVKQI  +PK D A+ S     KDQW PAD+    S
Sbjct: 789  DTPEHPSPISVLDA-SVYRDDALSPVKQIPNLPKGDSAEAS-----KDQWDPADNFLSDS 842

Query: 857  TGS---GEINRKKLQSIDHLVQKLRRLNSSHDEARIDYIASLCENTNPDHRYISEIXXXX 687
             GS    EI+RKKLQ++++LV+KLRRLNS+HDEA  DYIASLCENTNPDHRYISEI    
Sbjct: 843  VGSVLTSEISRKKLQNVENLVKKLRRLNSTHDEASTDYIASLCENTNPDHRYISEILLAS 902

Query: 686  XXXXXXXXXXXLTFQLHSLGNPINPELFLVLEQTXXXXXXXXXXXSPGKVAFLKQNTEKL 507
                        TFQLHS G+PINPELF VLEQT           +PGK    K N E+ 
Sbjct: 903  GLLLRDLGSGMTTFQLHSSGHPINPELFFVLEQTKASTLASKEECNPGKTYHSKPNPERF 962

Query: 506  HRKLIFDAVNEILGAKLG---SSPEPWFQPNRLTKKTLSAQKLLKELCFEIEKAQAKEPE 336
            HRKLIFDAVNE++  KL     SPEPW + ++L KKTLSAQKLLKELC EIE+ Q K+ E
Sbjct: 963  HRKLIFDAVNEMIVKKLALEEQSPEPWLKSDKLAKKTLSAQKLLKELCSEIEQLQDKKSE 1022

Query: 335  CCLXXXXXDGLKCMLWEDVMHGSESWENFTGELPGVVLDVERLVFKDLVDEIVIGEAAGL 156
            C L     D LK +LW+DVM  SESW +F  EL GVVLDVER +FKDLVDEIVIGEAAG 
Sbjct: 1023 CSL-EDEEDDLKGVLWDDVMRRSESWTDFHSELSGVVLDVERSIFKDLVDEIVIGEAAGS 1081

Query: 155  RVKSSVRRRKLFGK 114
            R+K   RRR+LF K
Sbjct: 1082 RIKPG-RRRQLFAK 1094


Top