BLASTX nr result

ID: Glycyrrhiza23_contig00009371 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00009371
         (1753 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003551988.1| PREDICTED: uncharacterized protein LOC100808...   756   0.0  
ref|XP_003530682.1| PREDICTED: uncharacterized protein LOC100796...   744   0.0  
ref|XP_002311880.1| predicted protein [Populus trichocarpa] gi|2...   553   e-155
ref|XP_002520139.1| conserved hypothetical protein [Ricinus comm...   541   e-151
ref|XP_002269942.1| PREDICTED: uncharacterized protein LOC100255...   535   e-149

>ref|XP_003551988.1| PREDICTED: uncharacterized protein LOC100808045 [Glycine max]
          Length = 858

 Score =  756 bits (1953), Expect = 0.0
 Identities = 394/540 (72%), Positives = 427/540 (79%), Gaps = 7/540 (1%)
 Frame = -3

Query: 1601 MNFLMRSTTHVYSEREKXXXXXXXPEHR-------ADVPAPQGSPSLESLMYEDPYSQLS 1443
            MNFLMRSTTHVYS+REK                     P   G+ SLESLM +DPY+Q  
Sbjct: 1    MNFLMRSTTHVYSDREKPSSTATATAATPTTTVMTTTTPPTDGASSLESLMSDDPYAQ-- 58

Query: 1442 TTVERFDGEIDAENGTQESKIDATVLVAKHLDVSEEEGWIAIPYKELPEDWNHVSDMQSL 1263
              VE FDGE + ENG Q SK DA VL AKHLDVSE+EGWI IPYKELPE+WNHVSDMQSL
Sbjct: 59   --VEHFDGEFEGENGAQSSKNDAPVL-AKHLDVSEDEGWITIPYKELPENWNHVSDMQSL 115

Query: 1262 RSLDRSFLFPGEQVHIVACLSACKQDTEIITPFKVAAVMSKNGMGHSPSKENGNIENRNN 1083
            RSLDRSFLFPGEQVHI+ACLSACKQDTEIITPFKVAAVMSKNGMGHS  KENGN+ENRN+
Sbjct: 116  RSLDRSFLFPGEQVHILACLSACKQDTEIITPFKVAAVMSKNGMGHSSDKENGNVENRND 175

Query: 1082 SVSGEGQLSPSGQDQNMENLPKVKTDHPADVSAGESLLRMEVHRRQTALLLEKFKNSHFF 903
            SVSGEGQLSPS Q+Q  + L KVKTDHPAD SAGESLLRMEVH+RQTALLLEKF++SHFF
Sbjct: 176  SVSGEGQLSPSKQEQKEDKLEKVKTDHPADASAGESLLRMEVHKRQTALLLEKFESSHFF 235

Query: 902  VRICESGQPLWXXXXXXXXXXXXETNDQTISTIEVKETAKNVSSVSAIIDRANFDATISG 723
             RI ES +PLW            E N Q IS+ E+K+TAKN SS+SA+IDRANFDATISG
Sbjct: 236  ARISESDEPLW-SKRGSSEKSYSELNGQRISSFEIKDTAKNASSISAVIDRANFDATISG 294

Query: 722  GAARNSVKCCALPNGDIVVLLQVNVGVNFLRDPCIEILQYEKYEEKILSSENQDNSVYTN 543
            G ARNSV CCALPNGDIVVLLQVNVGV+FLRDPCIEILQYEKY++KILSSENQ+NSV+TN
Sbjct: 295  GVARNSVNCCALPNGDIVVLLQVNVGVDFLRDPCIEILQYEKYQDKILSSENQNNSVHTN 354

Query: 542  QDPCGELLKWILPLDNTLPPATRSFSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHL 363
            QDPCG LLKWILPLDNTLP A+R  SP                              SH 
Sbjct: 355  QDPCGALLKWILPLDNTLPLASRPLSPPQFSLNSGIGNTSQRSNSSASPGSQLFSFGSHF 414

Query: 362  RSYSMSSLPQNTSAPTVPLKAASSKPSFDIDEWDQISSQKFLRKKNGVEELLSFRGVSLE 183
            RSYSMS+LPQNT+AP  PLKAASSKPSFDI++WDQ  SQK LRKKNGVEELLSFRGVSLE
Sbjct: 415  RSYSMSALPQNTNAPNPPLKAASSKPSFDIEDWDQFPSQK-LRKKNGVEELLSFRGVSLE 473

Query: 182  RERFSVCCGLEGIYTPGRRWKRKLEIIQPVEIHSFAADFNSQDLLCVQIKNVAPAHAPDI 3
            RERFSVCCGLEGIYTPGRRW+RK EIIQPVEIHSFAAD NS+DLLCVQIKNVAPAH P I
Sbjct: 474  RERFSVCCGLEGIYTPGRRWRRKFEIIQPVEIHSFAADCNSEDLLCVQIKNVAPAHVPGI 533


>ref|XP_003530682.1| PREDICTED: uncharacterized protein LOC100796980 [Glycine max]
          Length = 849

 Score =  744 bits (1922), Expect = 0.0
 Identities = 382/530 (72%), Positives = 420/530 (79%), Gaps = 1/530 (0%)
 Frame = -3

Query: 1589 MRSTTHVYSEREKXXXXXXXPEHR-ADVPAPQGSPSLESLMYEDPYSQLSTTVERFDGEI 1413
            MRST+HVYS+REK               P   G+ SLESLM +DPY+Q    VE FDGE 
Sbjct: 1    MRSTSHVYSDREKPPSSSTAATTTPTTTPHADGASSLESLMSDDPYAQ----VEHFDGEA 56

Query: 1412 DAENGTQESKIDATVLVAKHLDVSEEEGWIAIPYKELPEDWNHVSDMQSLRSLDRSFLFP 1233
            + ENG Q S+ DA VL AKH+DVSE+EGWI IPYKE+PE+WNHVSDMQSLRSLDRSFLFP
Sbjct: 57   EGENGAQSSRNDAPVL-AKHVDVSEDEGWITIPYKEIPENWNHVSDMQSLRSLDRSFLFP 115

Query: 1232 GEQVHIVACLSACKQDTEIITPFKVAAVMSKNGMGHSPSKENGNIENRNNSVSGEGQLSP 1053
            GEQVHI+ACLSACKQD EIITPFKVAAVMSKNGMGH P KENGN+ENRN+SVSGEG+LSP
Sbjct: 116  GEQVHILACLSACKQDMEIITPFKVAAVMSKNGMGHGPDKENGNVENRNDSVSGEGKLSP 175

Query: 1052 SGQDQNMENLPKVKTDHPADVSAGESLLRMEVHRRQTALLLEKFKNSHFFVRICESGQPL 873
            S Q+Q  E   KVKTDH AD SAGESLLRMEVH+RQTALLL+KF+NSHFF  I ES +PL
Sbjct: 176  SRQEQKEEKQEKVKTDHQADASAGESLLRMEVHKRQTALLLQKFENSHFFATISESDEPL 235

Query: 872  WXXXXXXXXXXXXETNDQTISTIEVKETAKNVSSVSAIIDRANFDATISGGAARNSVKCC 693
            W            E N   IS+ E+K+TAKN SS+SA+IDRANFDATISGG ARNSV+CC
Sbjct: 236  WSKRGSSEKFNSSELNGPKISSFEIKDTAKNASSISAVIDRANFDATISGGVARNSVQCC 295

Query: 692  ALPNGDIVVLLQVNVGVNFLRDPCIEILQYEKYEEKILSSENQDNSVYTNQDPCGELLKW 513
            ALPNGDIVVLLQVNVGV+FLRDPCIEILQYEKY+EK+LSSENQ+NSV+TNQDPCG LLKW
Sbjct: 296  ALPNGDIVVLLQVNVGVDFLRDPCIEILQYEKYQEKVLSSENQNNSVHTNQDPCGALLKW 355

Query: 512  ILPLDNTLPPATRSFSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHLRSYSMSSLPQ 333
            ILPLDNTLPPATR  SP                              SH RSYSMS+LPQ
Sbjct: 356  ILPLDNTLPPATRPLSPPQFSLNSGIGNTSQRSNSSASPGSQLFSFGSHFRSYSMSALPQ 415

Query: 332  NTSAPTVPLKAASSKPSFDIDEWDQISSQKFLRKKNGVEELLSFRGVSLERERFSVCCGL 153
            NT+AP+ PLKAASSKPSFDI++WDQ  SQK LRKKNGVEELLSFRGVSLE ERFSVCCGL
Sbjct: 416  NTNAPSPPLKAASSKPSFDIEDWDQFPSQK-LRKKNGVEELLSFRGVSLEPERFSVCCGL 474

Query: 152  EGIYTPGRRWKRKLEIIQPVEIHSFAADFNSQDLLCVQIKNVAPAHAPDI 3
            EGIYTPGRRW+RK EIIQPVEIHSFAAD NS+DLLCVQIKNV PAH PDI
Sbjct: 475  EGIYTPGRRWRRKFEIIQPVEIHSFAADCNSEDLLCVQIKNVTPAHVPDI 524


>ref|XP_002311880.1| predicted protein [Populus trichocarpa] gi|222851700|gb|EEE89247.1|
            predicted protein [Populus trichocarpa]
          Length = 827

 Score =  553 bits (1426), Expect = e-155
 Identities = 305/547 (55%), Positives = 366/547 (66%), Gaps = 14/547 (2%)
 Frame = -3

Query: 1601 MNFLMRSTTHVYSEREKXXXXXXXPEHRADVPA---PQGSPSLESLMYEDPYSQLSTTVE 1431
            MNFL+R TTH   + +            A VPA   P  + +LE L+ ED + Q      
Sbjct: 1    MNFLLRPTTHQVIKEQVS----------APVPALESPSPAVTLEGLIAEDSFPQSEVR-- 48

Query: 1430 RFDGEIDAENGT-QESKIDATVLVAKHLDVSEEEGWIAIPYKELPEDWNHVSDMQSLRSL 1254
              D  I  ENG+   +K D+++++  H DVSEEEGWI IP+ ELP+DW +  D+ SLRSL
Sbjct: 49   --DMGIGGENGSVAATKNDSSLVLENHSDVSEEEGWIVIPFGELPDDWKNAPDIHSLRSL 106

Query: 1253 DRSFLFPGEQVHIVACLSACKQDTEIITPFKVAAVMSKNGMGHSPSKENGNIENRNNSVS 1074
            DRSF+FPGEQVHI+ACLSA KQDTEIITPFKVAAVMSKNG+G SP K+NGN+++  +SVS
Sbjct: 107  DRSFVFPGEQVHILACLSAYKQDTEIITPFKVAAVMSKNGIGQSPEKQNGNLKDGGSSVS 166

Query: 1073 GEGQLSPSGQ--DQNMENLPKVKTDHPADVSAGESLLRMEVHRRQTALLLEKFKNSHFFV 900
             +G++S   Q    N     K KTD   D+SA +S LRME ++RQT +LL++FKNSHFFV
Sbjct: 167  AQGEVSSDSQVIGLNGNGASKQKTDPQGDISASKSFLRMEDYKRQTEMLLQRFKNSHFFV 226

Query: 899  RICESGQPLWXXXXXXXXXXXXETNDQTISTIE-------VKETAKNVSSVSAIIDRANF 741
            RI ESG+PLW               DQ  S ++        K+TA N   +SA+IDR NF
Sbjct: 227  RIAESGEPLWSRKSAL---------DQEYSEVDSQNKPQRTKKTADNTFHLSALIDRGNF 277

Query: 740  DATISGGAARNSVKCCALPNGDIVVLLQVNVGVNFLRDPCIEILQYEKYEEKILSSENQD 561
            DA +SGGAARN V CC+L NGDIVVLLQVNVGVNF RDP IEILQ+EKY+E+    ENQD
Sbjct: 278  DANVSGGAARNGVSCCSLSNGDIVVLLQVNVGVNFFRDPVIEILQFEKYQERNRFPENQD 337

Query: 560  NSVYTNQDPCGELLKWILPLDNTLPPATRSFSPXXXXXXXXXXXXXXXXXXXXXXXXXXX 381
            N  Y+N DPCGELLKW+LP+DNTL    RS  P                           
Sbjct: 338  NLNYSNYDPCGELLKWLLPVDNTLSSPARSLPPPQLGSNSGFGGASQKSSSSGSQLFS-- 395

Query: 380  XXXSHLRSYSMSSLPQNTSAPTVPLKAASSKPSFDIDEWDQISSQKFLR-KKNGVEELLS 204
                H RSYSMSSLPQN++ P  P+KA SSKP+FD+++WDQ SSQK  + +K   EELLS
Sbjct: 396  ----HFRSYSMSSLPQNSAPPPQPVKAQSSKPNFDLEDWDQYSSQKLWKSQKPADEELLS 451

Query: 203  FRGVSLERERFSVCCGLEGIYTPGRRWKRKLEIIQPVEIHSFAADFNSQDLLCVQIKNVA 24
             RGVSLERERFSV CGLEGIY PGRRW RKLEIIQPVEIHSFAAD N+ DLLCVQIKNV+
Sbjct: 452  IRGVSLERERFSVRCGLEGIYIPGRRWLRKLEIIQPVEIHSFAADCNTDDLLCVQIKNVS 511

Query: 23   PAHAPDI 3
            PA  PDI
Sbjct: 512  PAITPDI 518


>ref|XP_002520139.1| conserved hypothetical protein [Ricinus communis]
            gi|223540631|gb|EEF42194.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 843

 Score =  541 bits (1394), Expect = e-151
 Identities = 301/545 (55%), Positives = 362/545 (66%), Gaps = 12/545 (2%)
 Frame = -3

Query: 1601 MNFLMRSTTHVYSEREKXXXXXXXPEHRADVPAPQGSPSLESLMYEDPYSQLSTTVERFD 1422
            MNFL R TT  ++   +       P       + + S +LE L+ EDP+ Q  T  E  D
Sbjct: 1    MNFLQRYTTTHHNAVTEHVPPVYEPPIDTRYASSKPSATLEGLIAEDPFQQSPTATEAHD 60

Query: 1421 GE------IDAENGTQESKIDA---TVLVAKHLDVSEEEGWIAIPYKELPEDWNHVSDMQ 1269
             +      +  ENG       A   ++ V  H DVSEEEGWI IP+ +LP+ WN+  D+ 
Sbjct: 61   DDAAHGSTVAGENGRAGGGASAKNESIDVENHSDVSEEEGWITIPHGKLPDGWNNAPDIN 120

Query: 1268 SLRSLDRSFLFPGEQVHIVACLSACKQDTEIITPFKVAAVMSKNGMGHSPSKENGNIENR 1089
            SLRSLDRSF+FPGEQVHI+ACLSA KQDTEIITPFKVAAVMSKNG+G SP K+NGN+++R
Sbjct: 121  SLRSLDRSFVFPGEQVHILACLSAYKQDTEIITPFKVAAVMSKNGIGQSPEKQNGNMKDR 180

Query: 1088 NNSVSGEGQLSPSG-QDQNMENLPKVKTDHPADVSAGESLLRMEVHRRQTALLLEKFKNS 912
             N  SGE   S +   DQN     K + D   D+SA ES LRME H+RQT  LL++F+NS
Sbjct: 181  TNLESGEEMGSGNQLMDQNQNEPLKQEIDSQKDISASESFLRMEDHKRQTESLLQRFRNS 240

Query: 911  HFFVRICESGQPLWXXXXXXXXXXXXETNDQTISTIEVKE-TAKNVSSVSAIIDRANFDA 735
            HFFVRI ESG+PLW             T D   S ++ +  TA N+S + A++DR NFD 
Sbjct: 241  HFFVRIAESGEPLWSKKG---------TFDPRSSEMDGQNSTANNISRLGALVDRGNFDL 291

Query: 734  TISGGAARNSVKCCALPNGDIVVLLQVNVGVNFLRDPCIEILQYEKYEEKILSSENQDNS 555
             +SGGAARN+V C +L NGDIVVLLQVN+GVNFLRDP IEILQ+EKY+E+ LS ENQ+N 
Sbjct: 292  NVSGGAARNTVNCYSLSNGDIVVLLQVNIGVNFLRDPIIEILQFEKYQERNLSPENQENL 351

Query: 554  VYTNQDPCGELLKWILPLDNTLPPATRSFSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 375
               N DPCGELLKW+LPLDNTLPP  RS SP                             
Sbjct: 352  NCVNYDPCGELLKWLLPLDNTLPPPARSLSP------TRLGSGSGIVGASQKPSPSGSQL 405

Query: 374  XSHLRSYSMSSLPQNTSAPTVPLKAASSKPSFDIDEWDQISSQKFLR-KKNGVEELLSFR 198
             SH RSYSMSSLPQNT++   P+K  SSKPSFDI +W+Q SSQK  + +K GVE LLSFR
Sbjct: 406  FSHFRSYSMSSLPQNTASSPQPVKTQSSKPSFDIGDWNQYSSQKLWKSQKVGVEGLLSFR 465

Query: 197  GVSLERERFSVCCGLEGIYTPGRRWKRKLEIIQPVEIHSFAADFNSQDLLCVQIKNVAPA 18
            GVSLER+RFSV CGLEGIY PGRRW+RKLEIIQPVEI SFAAD N+ DLLCVQIKN++P+
Sbjct: 466  GVSLERQRFSVRCGLEGIYIPGRRWRRKLEIIQPVEIRSFAADCNTDDLLCVQIKNISPS 525

Query: 17   HAPDI 3
               DI
Sbjct: 526  SNADI 530


>ref|XP_002269942.1| PREDICTED: uncharacterized protein LOC100255337 [Vitis vinifera]
          Length = 868

 Score =  535 bits (1379), Expect = e-149
 Identities = 291/541 (53%), Positives = 363/541 (67%), Gaps = 8/541 (1%)
 Frame = -3

Query: 1601 MNFLMRSTTHVYSEREKXXXXXXXPEHRADVPAPQGSPSLESLMYEDPYSQLSTTVERFD 1422
            MNFLMR +   +++           +H       + + +LE L+ ED +      V+   
Sbjct: 1    MNFLMRPSHTAHADEPPVHEISKGTQH-----VTKPTATLEGLIAEDSFPNYF--VDEIH 53

Query: 1421 GEIDAENGTQ---ESKIDATVLVAKHLDVSEEEGWIAIPYKELPEDWNHVSDMQSLRSLD 1251
            GE+  ENG+     SK D+  LV    DV+EEEGWI IP KELP++W    D+ S RSLD
Sbjct: 54   GEVGGENGSVAGLSSKSDSPDLVNLS-DVTEEEGWIIIPQKELPDNWRDAPDICSFRSLD 112

Query: 1250 RSFLFPGEQVHIVACLSACKQDTEIITPFKVAAVMSKNGMGHSPSKENGNIENRNNSVSG 1071
            RSF+FPGEQVHI+ACLS+ KQ+T+IITPFKVAA+MSKNG+G S  K++G  E+  NS+ G
Sbjct: 113  RSFVFPGEQVHILACLSSSKQETQIITPFKVAAMMSKNGIGQSTKKQSGETEDETNSMLG 172

Query: 1070 EGQLSPSGQD--QNMENLPKVKTDHPADVSAGESLLRMEVHRRQTALLLEKFKNSHFFVR 897
            + + +P+G+D   N ENL K K D   D+SA ESLLRME H+RQT +LL+KFKNSHFFVR
Sbjct: 173  KVEANPAGEDTYHNGENLLKEKIDSEKDISASESLLRMEDHKRQTEILLQKFKNSHFFVR 232

Query: 896  ICESGQPLWXXXXXXXXXXXXETNDQTIST-IEVKETAKNVSSVSAIIDRANFDATISGG 720
            I ESG+PLW                   ST I+ ++TAK ++ ++A+ID+ NF+A +SGG
Sbjct: 233  IAESGEPLWSKRNAAETSLQFSEMSAPKSTAIKTRKTAKEITPLTAVIDKGNFNANVSGG 292

Query: 719  AARNSVKCCALPNGDIVVLLQVNVGVNFLRDPCIEILQYEKYEEKILSSENQDNSVYTNQ 540
             ARN V CC+L NGDIVVLLQVNV V+  RDP +EILQ+EKY     SSEN+D+ VY NQ
Sbjct: 293  VARNIVDCCSLSNGDIVVLLQVNVAVDSQRDPVLEILQFEKYNNDKFSSENKDSLVYANQ 352

Query: 539  DPCGELLKWILPLDNTLPPATRSFSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSHLR 360
            DPCGELLKW+LPLDNTLPP T + SP                               H R
Sbjct: 353  DPCGELLKWLLPLDNTLPPPTPALSP-PPLSSSSGIGSTSQRSTLSASSGSQLFSFGHFR 411

Query: 359  SYSMSSL-PQNTSAPTVPLKAASSKPSFDIDEWDQISSQKFLR-KKNGVEELLSFRGVSL 186
            SYSMSSL PQ+T  P   +   SSKP+F++++WD+ S QKF++ KK G EELLSFRGVSL
Sbjct: 412  SYSMSSLPPQSTPPPPPSVATPSSKPNFELEDWDRSSPQKFVKSKKTGSEELLSFRGVSL 471

Query: 185  ERERFSVCCGLEGIYTPGRRWKRKLEIIQPVEIHSFAADFNSQDLLCVQIKNVAPAHAPD 6
            E +RFSVCCGLEGIY PGRRW+RKLEIIQPVEI SFAAD N+ DLLCVQIKNV+PAH PD
Sbjct: 472  EPKRFSVCCGLEGIYIPGRRWRRKLEIIQPVEIRSFAADCNTDDLLCVQIKNVSPAHTPD 531

Query: 5    I 3
            I
Sbjct: 532  I 532


Top