BLASTX nr result

ID: Glycyrrhiza23_contig00015384 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00015384
         (1735 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003536716.1| PREDICTED: uncharacterized protein LOC100788...   541   e-151
ref|XP_003555215.1| PREDICTED: uncharacterized protein LOC100811...   518   e-144
ref|XP_003604055.1| hypothetical protein MTR_3g118020 [Medicago ...   267   6e-69
ref|XP_002313701.1| predicted protein [Populus trichocarpa] gi|2...   230   1e-57
ref|XP_002521290.1| conserved hypothetical protein [Ricinus comm...   224   6e-56

>ref|XP_003536716.1| PREDICTED: uncharacterized protein LOC100788258 [Glycine max]
          Length = 437

 Score =  541 bits (1395), Expect = e-151
 Identities = 299/456 (65%), Positives = 340/456 (74%), Gaps = 7/456 (1%)
 Frame = -2

Query: 1611 FLVTLGLVRVKGEEGDTRKEMAQNTRVMPNCVYASNPYHECTDACLIKMKEA--AKPPKN 1438
            F  T+GLV  KGE G+T+KEMA+   VMP+CVYA NPYHECT+AC+ ++KEA   KP K 
Sbjct: 5    FCFTVGLVGTKGEAGETQKEMAELGGVMPDCVYAPNPYHECTEACVHRIKEAKPGKPSKT 64

Query: 1437 KK-----DYRRSVTDGELGKKMNEEKRTQSGCPKASNPYHTCDENCHKRMSGADSGVIPV 1273
            KK     DYRRSVTDGELGK M EEKR  SGCPKASNPYH CDE C K    ADSG + +
Sbjct: 65   KKGTAFKDYRRSVTDGELGKTMKEEKRRPSGCPKASNPYHVCDEYCQK----ADSGTMSL 120

Query: 1272 NFGRKKKLGSKPELPVLDNVPASKIGAIYLSDAASPISNYYEKKKLEPKSNELPVPVSGE 1093
            NF R+KK+GSKPELPVLD+VP SKIGAIYLSDA+SP+SNY EK K E KSNEL +PVSGE
Sbjct: 121  NFDRRKKVGSKPELPVLDSVPPSKIGAIYLSDASSPLSNYSEKTKEESKSNEL-IPVSGE 179

Query: 1092 IHARDVKPVDPKGQPKDGSENLAGLIPTKQAAGDKNPSPKVVPITSVDDTGEGLTKSAGG 913
            IH  DV P + K Q K                GDKN SPKVVPITSVDDTG  LTK  GG
Sbjct: 180  IHVLDVMPTNHKVQAKHN--------------GDKNASPKVVPITSVDDTG-CLTKPDGG 224

Query: 912  STHFCYSGVPHDDDEECSDGEETESVVSENRVPVGRYHVKESFASILRAIFDKHGDIGES 733
            S +FC+SG+ HD+++  SDGEETESVVSE+RVPVG+YHVKESFA ILR+IF+K+GDIG S
Sbjct: 225  SMNFCFSGL-HDNED--SDGEETESVVSESRVPVGKYHVKESFAPILRSIFEKYGDIGAS 281

Query: 732  CHLESVVMRSYYVECVCFVVQELQSTSIMHLTKSKVKELLAILNDVESAQLRVAWLRNIV 553
            CHLESVVMRSYYVECVCFVVQELQST IM LTKSK+KEL+AIL DVESAQLRVAWLR+IV
Sbjct: 282  CHLESVVMRSYYVECVCFVVQELQSTPIMQLTKSKIKELMAILKDVESAQLRVAWLRSIV 341

Query: 552  NEIAENIELINQHRATEAAKANSXXXXXXXXXXXXXXXETLAQKEQEVNDVKTRIEGMXX 373
            +EI ENIELI++H   E AKANS               E LAQKEQEV D+KTRIE +  
Sbjct: 342  DEITENIELIDEHCVAETAKANSDREVESLNKELESNLEILAQKEQEVTDIKTRIEAIRE 401

Query: 372  XXXXXXXXXXXXXKNMVSIKSKVDNLDSKNLLYELL 265
                         KN++SIKSKVDNLDSK+LL EL+
Sbjct: 402  RLSELELKSCDLDKNILSIKSKVDNLDSKSLLDELV 437


>ref|XP_003555215.1| PREDICTED: uncharacterized protein LOC100811116 [Glycine max]
          Length = 411

 Score =  518 bits (1335), Expect = e-144
 Identities = 288/434 (66%), Positives = 325/434 (74%), Gaps = 5/434 (1%)
 Frame = -2

Query: 1551 MAQNTRVMPNCVYASNPYHECTDACLIKMKEA--AKPPKNKK---DYRRSVTDGELGKKM 1387
            MA++ RVM NCVYA NPYHECT+AC+ ++KEA   KP K KK   DYRRSVTDGELGKKM
Sbjct: 1    MAEHGRVMANCVYAPNPYHECTEACVQRIKEAKPGKPSKTKKVFKDYRRSVTDGELGKKM 60

Query: 1386 NEEKRTQSGCPKASNPYHTCDENCHKRMSGADSGVIPVNFGRKKKLGSKPELPVLDNVPA 1207
             EEKR  SGCPKASNPYH CDE C K    ADSG + +NF R+KK+GSKPELPVLD+VP 
Sbjct: 61   KEEKRRPSGCPKASNPYHVCDEYCQK----ADSGTMSLNFDRRKKVGSKPELPVLDSVPP 116

Query: 1206 SKIGAIYLSDAASPISNYYEKKKLEPKSNELPVPVSGEIHARDVKPVDPKGQPKDGSENL 1027
            SKIGAIYLSDA+SP+SNY EK K E KSNEL +PVSGEIH  DV P + K Q K      
Sbjct: 117  SKIGAIYLSDASSPLSNYSEKTKEESKSNEL-IPVSGEIHVLDVMPTNHKVQSKHN---- 171

Query: 1026 AGLIPTKQAAGDKNPSPKVVPITSVDDTGEGLTKSAGGSTHFCYSGVPHDDDEECSDGEE 847
                      GDKN SPKVVPITSVDDTG  LTK  GGS +FC SG+ HD+++  SDG E
Sbjct: 172  ----------GDKNASPKVVPITSVDDTG-CLTKPDGGSMNFCLSGL-HDNED--SDGGE 217

Query: 846  TESVVSENRVPVGRYHVKESFASILRAIFDKHGDIGESCHLESVVMRSYYVECVCFVVQE 667
            TESVVSE+RVPVG+YHVKESFA ILR+IF+K+GDIG SCHLESVVMRSYYVECVCFVVQE
Sbjct: 218  TESVVSESRVPVGKYHVKESFAPILRSIFEKYGDIGASCHLESVVMRSYYVECVCFVVQE 277

Query: 666  LQSTSIMHLTKSKVKELLAILNDVESAQLRVAWLRNIVNEIAENIELINQHRATEAAKAN 487
            LQST IM L KSK+ EL+AIL DVESAQLRVAWLRNIV+EIAENIELI++H   E AKAN
Sbjct: 278  LQSTPIMQLAKSKIMELMAILKDVESAQLRVAWLRNIVDEIAENIELIDEHCMAEMAKAN 337

Query: 486  SXXXXXXXXXXXXXXXETLAQKEQEVNDVKTRIEGMXXXXXXXXXXXXXXXKNMVSIKSK 307
            S               E+LAQKEQEV D+KTRIE +               KN++SIKSK
Sbjct: 338  SDREMETLNKELESNLESLAQKEQEVTDIKTRIEEIREHLSELELKSSDLAKNILSIKSK 397

Query: 306  VDNLDSKNLLYELL 265
            VDNLDSK+LL EL+
Sbjct: 398  VDNLDSKSLLDELV 411


>ref|XP_003604055.1| hypothetical protein MTR_3g118020 [Medicago truncatula]
            gi|355493103|gb|AES74306.1| hypothetical protein
            MTR_3g118020 [Medicago truncatula]
          Length = 412

 Score =  267 bits (683), Expect = 6e-69
 Identities = 179/397 (45%), Positives = 228/397 (57%), Gaps = 18/397 (4%)
 Frame = -2

Query: 1524 NCVYASNPYHECTDACLIKMK--------EAAKPPKNKKDYRRSVTDGELGKKMNEEKRT 1369
            +C  ASNPYH+CT AC  K K         AA    N     R V +G        E+RT
Sbjct: 32   SCANASNPYHQCTQACSQKTKGTKTHHAPTAAVTASNSNSNNRKVINGG-------ERRT 84

Query: 1368 ----QSGCPKASNPYHTCDENCHKRMSGADSGVIPVN-FGRKKKLGSKPELPVLDNVPAS 1204
                 S CPK+SNPYH CD NC+      +SG  P +    +KK+GSKP+ PVL +VP +
Sbjct: 85   YASSSSSCPKSSNPYHKCDANCN------NSGATPHSKIDHRKKVGSKPQPPVLHSVPPT 138

Query: 1203 KIGAIYLSDAASPISNYYEKKKLEPKSNELPVPVSGEIHARDVKPVDPKGQPKDGSENLA 1024
            K+ A   +D   P S                 P+S ++H  DV P D   Q KDG+  + 
Sbjct: 139  KLVATK-NDEIIPTSG----------------PISAQLHIPDVMPKD---QVKDGATEV- 177

Query: 1023 GLIPTKQAAGDKNPSPKVVPITSVDDTGEGLTKSAGGSTHFCYSGVP----HDDDEECSD 856
                 K AA     S ++VP+T+ ++T EG      GS  F +SG P    + + +  S+
Sbjct: 178  -----KVAA-----SHEIVPVTNSNETHEG------GSKDFSFSGNPLPLHNKEIDTSSE 221

Query: 855  GE-ETESVVSENRVPVGRYHVKESFASILRAIFDKHGDIGESCHLESVVMRSYYVECVCF 679
            GE ++ SVVSE+RV +G+Y+VKESF SIL+ I DK+GDIG SC LESVVMRSYY+ECVCF
Sbjct: 222  GEADSVSVVSESRVSIGKYNVKESFGSILQTIVDKYGDIGASCDLESVVMRSYYMECVCF 281

Query: 678  VVQELQSTSIMHLTKSKVKELLAILNDVESAQLRVAWLRNIVNEIAENIELINQHRATEA 499
            VVQELQS+S   ++KSKV ELL I+ DVESA LRVAWL N ++EI ENIELI+ H+  E 
Sbjct: 282  VVQELQSSS-DSISKSKVSELLDIVKDVESAHLRVAWLHNTLDEIVENIELISHHQDMEM 340

Query: 498  AKANSXXXXXXXXXXXXXXXETLAQKEQEVNDVKTRI 388
             KAN                ETLAQKEQEV D+  RI
Sbjct: 341  EKANYDREMESLREQLESELETLAQKEQEVADINIRI 377


>ref|XP_002313701.1| predicted protein [Populus trichocarpa] gi|222850109|gb|EEE87656.1|
            predicted protein [Populus trichocarpa]
          Length = 541

 Score =  230 bits (586), Expect = 1e-57
 Identities = 182/493 (36%), Positives = 236/493 (47%), Gaps = 62/493 (12%)
 Frame = -2

Query: 1557 KEMAQNTRVMPNCVYASNPYHECTDACLIKMKE-----------AAKP-PKNKKDYR--- 1423
            K M    R  P C  ASNPYH+C + C  +  E            AKP PK    Y    
Sbjct: 54   KNMDGERRAQPTCPKASNPYHKCEEFCSNRTAEPKPGGVKKETGGAKPCPKASNPYHKCE 113

Query: 1422 -----RSVTDGELGKKMNEEKRTQSGCPKASNPYHTCDENCHKRMSGAD-SGV------- 1282
                 R+      G K   E+     CP+ASNP H CDE C  R S A+  GV       
Sbjct: 114  EFCSNRTADANPRGVKKQSERAQP--CPRASNPSHKCDEFCSNRTSEANPQGVEKESGSF 171

Query: 1281 --IPVNFGRKKKLG-SKPELP-VLDNVPASKIGAIYLSDA------ASPISNYYEKKKLE 1132
                ++FGRKKK   S+   P  ++N PA K GA+  + A      A P       KK E
Sbjct: 172  LDTALSFGRKKKESESQQNSPRAVNNAPAVK-GAVNNAPAVKAVRRAPPSPLILPTKKDE 230

Query: 1131 PKSNELPVPVS--------GEIHARDVKPVDPKGQ-----------PKDGSE-NLAGL-I 1015
               N      S         E HA D  PV   G            PK  S+ +LA   I
Sbjct: 231  EPENSRSFSSSQPHSDESYSEDHALDKVPVQSPGPMHVSGKITPDPPKSPSKISLACYKI 290

Query: 1014 PTK---QAAGDKNPSPKVVPITSVDDTGEGLTKSAGGSTHFCYSGVPHDDDEECSDGEET 844
            PT    Q  G  + SPK  P  S +  G           +F +SG+      E SDGEE 
Sbjct: 291  PTPAEPQQNGKLHGSPKAAPYPSANHVGRVTNGPITEYLNFSFSGISRAS--EGSDGEEV 348

Query: 843  ESVVSENRVPVGRYHVKESFASILRAIFDKHGDIGESCHLESVVMRSYYVECVCFVVQEL 664
            +SVVS++ V VG+YHV+ + ASIL+ IF+K+GDI     LES  MR+YY+EC+CFVVQEL
Sbjct: 349  QSVVSDSCVSVGKYHVRANVASILQLIFEKYGDIATGSRLESASMRAYYLECLCFVVQEL 408

Query: 663  QSTSIMHLTKSKVKELLAILNDVESAQLRVAWLRNIVNEIAENIELINQHRATEAAKANS 484
            Q T    LTKSKV+E+LA+L DVESAQ+ V+WLR+I+N++AE +EL NQH+A E +K+N 
Sbjct: 409  QCTPFKQLTKSKVREMLAVLKDVESAQIDVSWLRDILNDLAEGMELSNQHQAAEESKSNC 468

Query: 483  XXXXXXXXXXXXXXXETLAQKEQEVNDVKTRIEGMXXXXXXXXXXXXXXXKNMVSIKSKV 304
                           E LA KE+ V D K +I                  + + SI+S+V
Sbjct: 469  DDLIESKKKELESMMEDLALKEKAVADAKAQITETRTHLSNLELESSKLGETISSIQSRV 528

Query: 303  DNLDSKNLLYELL 265
            +    K L  E+L
Sbjct: 529  EKFHEKPLADEIL 541



 Score = 94.7 bits (234), Expect = 7e-17
 Identities = 50/113 (44%), Positives = 61/113 (53%), Gaps = 1/113 (0%)
 Frame = -2

Query: 1551 MAQNTRVMPNCVYASNPYHECTDACLIKMKEA-AKPPKNKKDYRRSVTDGELGKKMNEEK 1375
            MA+N RV P+CV A+NPYHEC  ACL K+ +   +  K K DY   V    L K M+ E+
Sbjct: 1    MAENGRVHPDCVNAANPYHECGVACLEKISQGQGRKEKKKSDYHNGVNGSWLSKNMDGER 60

Query: 1374 RTQSGCPKASNPYHTCDENCHKRMSGADSGVIPVNFGRKKKLGSKPELPVLDN 1216
            R Q  CPKASNPYH C+E C  R +    G      G KK+ G     P   N
Sbjct: 61   RAQPTCPKASNPYHKCEEFCSNRTAEPKPG------GVKKETGGAKPCPKASN 107


>ref|XP_002521290.1| conserved hypothetical protein [Ricinus communis]
            gi|223539558|gb|EEF41146.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 400

 Score =  224 bits (571), Expect = 6e-56
 Identities = 165/432 (38%), Positives = 229/432 (53%), Gaps = 3/432 (0%)
 Frame = -2

Query: 1551 MAQNTRVMPNCVYASNPYHECTDACLIKMKEAAKPPKNKKDYRRSVTDGELG---KKMNE 1381
            M  N RV P+CV ASNPYHEC  ACL K+ +  +  K KK     + D  L    KK   
Sbjct: 1    MTDNGRVHPDCVNASNPYHECGVACLEKIAQ-GQGWKEKKKSGSFILDTSLSFGRKKKGS 59

Query: 1380 EKRTQSGCPKASNPYHTCDENCHKRMSGADSGVIPVNFGRKKKLGSKPELPVLDNVPASK 1201
            E + +S  PK +N     + +  K +  AD       F  KKK+ S       DN  +S 
Sbjct: 60   ESQPRS--PKVAN-----NVSAAKAVYPADLSSPRSPFPTKKKVES-------DNGHSSS 105

Query: 1200 IGAIYLSDAASPISNYYEKKKLEPKSNELPVPVSGEIHARDVKPVDPKGQPKDGSENLAG 1021
                +  ++ S   ++ + + L P+     VPVSG     ++KP  PK          A 
Sbjct: 106  SSRQHSEESYSQDHSFDKGQVLAPEL----VPVSG-----NLKPDGPKNLSLGSFTCFAI 156

Query: 1020 LIPTKQAAGDKNPSPKVVPITSVDDTGEGLTKSAGGSTHFCYSGVPHDDDEECSDGEETE 841
              PT+Q   DK  SP    I +V+ T    T+    S +F +SG+      E SD EE  
Sbjct: 157  APPTEQ--DDKEKSPLPGAIKNVEITNGRSTE----SLNFTFSGISR--ATEGSDDEEIL 208

Query: 840  SVVSENRVPVGRYHVKESFASILRAIFDKHGDIGESCHLESVVMRSYYVECVCFVVQELQ 661
            SV+S++ V VG+YHV+ + ASIL++I DK+GDI  +C LES  +R+YY+EC+C VVQELQ
Sbjct: 209  SVISDSCVSVGKYHVRANSASILQSIIDKYGDIAANCRLESTSLRTYYLECLCSVVQELQ 268

Query: 660  STSIMHLTKSKVKELLAILNDVESAQLRVAWLRNIVNEIAENIELINQHRATEAAKANSX 481
            STS+  LTKSKV+ELLA+L DVESAQ+ V+WLR+I+N + E +EL N+ +A E AK N  
Sbjct: 269  STSLNQLTKSKVRELLAVLKDVESAQIDVSWLRSILNGLTEAVELNNKQQAAEEAKTNCD 328

Query: 480  XXXXXXXXXXXXXXETLAQKEQEVNDVKTRIEGMXXXXXXXXXXXXXXXKNMVSIKSKVD 301
                          E L QKEQ V + K RIE +                 ++S++SK+D
Sbjct: 329  HVIESTRKELESMVEELGQKEQAVANTKARIEEISAHLSELELESSELSDTILSLRSKID 388

Query: 300  NLDSKNLLYELL 265
            N  SK L  ++L
Sbjct: 389  NFHSKPLRDQIL 400

