BLASTX nr result

ID: Glycyrrhiza34_contig00020511 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00020511
         (375 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP39878.1 hypothetical protein KK1_038796 [Cajanus cajan]             81   9e-16
XP_014632447.1 PREDICTED: uncharacterized protein LOC100798029 [...    78   8e-15
KYP78202.1 Retrovirus-related Pol polyprotein from transposon TN...    77   3e-14
XP_003597128.2 Myb/SANT-like DNA-binding domain protein [Medicag...    77   6e-14
XP_014623011.1 PREDICTED: uncharacterized protein LOC102665640 [...    77   7e-14
XP_003538716.1 PREDICTED: uncharacterized protein LOC100798851 [...    76   1e-13
XP_006589869.1 PREDICTED: uncharacterized protein LOC102663555 [...    72   2e-13
KYP35038.1 hypothetical protein KK1_043951 [Cajanus cajan]             74   5e-13
XP_006573751.1 PREDICTED: uncharacterized protein LOC100807274 i...    74   5e-13
KHN25746.1 hypothetical protein glysoja_018320 [Glycine soja]          74   5e-13
XP_003516682.1 PREDICTED: uncharacterized protein LOC100807274 i...    74   5e-13
XP_014619055.1 PREDICTED: uncharacterized protein LOC106795037 [...    70   7e-13
XP_013441863.1 Ulp1 protease family, carboxy-terminal domain pro...    73   8e-13
XP_006582171.1 PREDICTED: uncharacterized protein LOC102668599 [...    69   2e-12
XP_006589878.2 PREDICTED: uncharacterized protein LOC102665245 [...    72   3e-12
XP_014622322.1 PREDICTED: uncharacterized protein LOC100818274 [...    71   6e-12
KYP40410.1 hypothetical protein KK1_038259 [Cajanus cajan]             71   8e-12
XP_003627090.1 Ulp1 protease family, carboxy-terminal domain pro...    70   1e-11
XP_014630124.1 PREDICTED: uncharacterized protein LOC102667062 [...    70   1e-11
KHM99138.1 hypothetical protein glysoja_032029 [Glycine soja]          70   1e-11

>KYP39878.1 hypothetical protein KK1_038796 [Cajanus cajan]
          Length = 300

 Score = 80.9 bits (198), Expect = 9e-16
 Identities = 41/105 (39%), Positives = 58/105 (55%), Gaps = 1/105 (0%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVM 200
           + V +CSL   P    +  ++ A+EGY   K  K KK + W++     Q  N ECG+YV+
Sbjct: 194 IAVCICSLHKLPPMDFRQLLDRAMEGYHILKSLKLKKKMSWVSPKSHKQKGNYECGYYVL 253

Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHY 335
           + M  IV  +IV  W E F D+SP   E I   RE WA F+++HY
Sbjct: 254 KIMHTIVDSKIVSGWTEIFIDRSPLPLEDINTIREQWATFFIDHY 298


>XP_014632447.1 PREDICTED: uncharacterized protein LOC100798029 [Glycine max]
          Length = 265

 Score = 77.8 bits (190), Expect = 8e-15
 Identities = 40/104 (38%), Positives = 63/104 (60%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197
           VVVW CSL  +P+ ++K  VN A++   +T +GK  ++   WI A   +Q  N ECG+YV
Sbjct: 159 VVVWFCSLRKRPDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECGYYV 218

Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           M ++  IV+G +  DW   F+D+S  +EE I   R  WA ++++
Sbjct: 219 MHWIWCIVTGGLKDDWIHWFSDRSAVTEETITTLRHKWAAYFIQ 262


>KYP78202.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 294

 Score = 76.6 bits (187), Expect = 3e-14
 Identities = 41/105 (39%), Positives = 59/105 (56%), Gaps = 1/105 (0%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVM 200
           + V +CSL   P    +  ++ A+EGY   KG K KK + W++     Q  N EC +YVM
Sbjct: 188 IAVCICSLHKPPPMDFRQLLDRAMEGYHILKGSKLKKKMSWVSPKSHKQKGNYECEYYVM 247

Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHY 335
           + M  IV  +IV  W + F D+SP S E I   RE WA F++++Y
Sbjct: 248 KTMHTIVDLQIVSRWTKIFIDQSPLSLEDINTIREQWATFFIDYY 292


>XP_003597128.2 Myb/SANT-like DNA-binding domain protein [Medicago truncatula]
            AES67379.2 Myb/SANT-like DNA-binding domain protein
            [Medicago truncatula]
          Length = 1223

 Score = 77.0 bits (188), Expect = 6e-14
 Identities = 38/101 (37%), Positives = 60/101 (59%)
 Frame = +3

Query: 27   VVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLIWINATVPIQVSN*ECGHYVMRY 206
            VV++CSL+ KP+ ++  TV+ AL+ Y + +G +KK   WI      Q  + ECG+Y+M +
Sbjct: 1115 VVFLCSLERKPDKNIIQTVDSALDEYHKLQGVQKKKPTWIVPVCQRQPESYECGYYIMIH 1174

Query: 207  MLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
            ML IVS  I+  W++ F +  PF E+ +   R+  A   LE
Sbjct: 1175 MLKIVSDGIIDSWKKIFGNPEPFDEDELINVRQRCASLILE 1215


>XP_014623011.1 PREDICTED: uncharacterized protein LOC102665640 [Glycine max]
          Length = 530

 Score = 76.6 bits (187), Expect = 7e-14
 Identities = 39/104 (37%), Positives = 64/104 (61%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197
           VVVW  SL  KP+ ++K  VN A++   +T +GK  ++   WI A   +Q  N ECG+YV
Sbjct: 415 VVVWFFSLKKKPDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECGYYV 474

Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           M+++  I+SG +  DW   F+++SP +EE +   R  WA ++++
Sbjct: 475 MQWIWCIISGGLKDDWIHWFSNRSPLTEETMTTLRHKWAAYFIQ 518


>XP_003538716.1 PREDICTED: uncharacterized protein LOC100798851 [Glycine max]
           KHN34985.1 hypothetical protein glysoja_004751 [Glycine
           soja]
          Length = 736

 Score = 76.3 bits (186), Expect = 1e-13
 Identities = 44/109 (40%), Positives = 59/109 (54%), Gaps = 5/109 (4%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPEN-SMKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182
           Q VVV +CSL  K  N  MK TV++A++ Y R  G     R+K   WI      Q    E
Sbjct: 624 QNVVVLLCSLHKKTINREMKTTVDLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQTEGYE 683

Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           CG+YVM+ ML +V+  IV  W++ F    PF EE I   ++ WA F L+
Sbjct: 684 CGYYVMKQMLTVVTVDIVDSWKKIFNSSGPFPEEDIADIQQRWAAFLLQ 732


>XP_006589869.1 PREDICTED: uncharacterized protein LOC102663555 [Glycine max]
          Length = 137

 Score = 71.6 bits (174), Expect = 2e-13
 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 1/103 (0%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKNLIWINATVPIQVSN*ECGHYVM 200
           +VVW CSL ++ +N +K  +N AL+G   T + K K    WI      Q  + ECG+YVM
Sbjct: 30  LVVWFCSLHNRLDNYLKGIINSALKGLDDTPQPKSKVGARWIVVKCNRQKGSIECGYYVM 89

Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
            +M  I+ G    +WE  F D  P   E +   R  WA++YL+
Sbjct: 90  HWMSTIILGSFKNNWEMYFNDVRPLEAERLNALRIQWAKYYLK 132


>KYP35038.1 hypothetical protein KK1_043951 [Cajanus cajan]
          Length = 564

 Score = 74.3 bits (181), Expect = 5e-13
 Identities = 38/100 (38%), Positives = 51/100 (51%), Gaps = 3/100 (3%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRTKGK---RKKNLIWINATVPIQVSN*ECGHY 194
           V VW CS  HKP   +KN +   +  Y    G+   + KNL WI      Q    ECG+Y
Sbjct: 402 VAVWFCSFYHKPNVQIKNLIKSVMVVYNVMGGRSTAQPKNLDWIYPMSNQQQGGYECGYY 461

Query: 195 VMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWA 314
           VM +ML+I+ G +  DW E F D +P  E  ++  R  WA
Sbjct: 462 VMNWMLDIIEGEVTNDWIELFDDVAPLPETKLEDIRSQWA 501


>XP_006573751.1 PREDICTED: uncharacterized protein LOC100807274 isoform X2 [Glycine
           max]
          Length = 647

 Score = 74.3 bits (181), Expect = 5e-13
 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182
           Q VVV +CSL  K  N  MK  VN+A++ Y R  G     R+K   WI      Q    E
Sbjct: 535 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 594

Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           CG+YVM+ M  +V+  IV  W++ F +  PF EE I   ++ WA F L+
Sbjct: 595 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 643


>KHN25746.1 hypothetical protein glysoja_018320 [Glycine soja]
          Length = 736

 Score = 74.3 bits (181), Expect = 5e-13
 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182
           Q VVV +CSL  K  N  MK  VN+A++ Y R  G     R+K   WI      Q    E
Sbjct: 624 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 683

Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           CG+YVM+ M  +V+  IV  W++ F +  PF EE I   ++ WA F L+
Sbjct: 684 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 732


>XP_003516682.1 PREDICTED: uncharacterized protein LOC100807274 isoform X1 [Glycine
           max]
          Length = 736

 Score = 74.3 bits (181), Expect = 5e-13
 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182
           Q VVV +CSL  K  N  MK  VN+A++ Y R  G     R+K   WI      Q    E
Sbjct: 624 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 683

Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           CG+YVM+ M  +V+  IV  W++ F +  PF EE I   ++ WA F L+
Sbjct: 684 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 732


>XP_014619055.1 PREDICTED: uncharacterized protein LOC106795037 [Glycine max]
          Length = 139

 Score = 70.1 bits (170), Expect = 7e-13
 Identities = 35/105 (33%), Positives = 56/105 (53%), Gaps = 3/105 (2%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI---WINATVPIQVSN*ECGHY 194
           +VVW  SL H+P+N +K  +N AL+G   T   + K  +   WI      Q  + +CG+Y
Sbjct: 30  LVVWFYSLHHRPDNYLKGIINSALKGVDGTPQPKSKAAVGARWIVVKCNKQKGSTKCGYY 89

Query: 195 VMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           V+ ++  I+SG    +WE  F D  P   + ++  R  WA +YL+
Sbjct: 90  VIHWLSTIISGSFRNNWELYFNDVRPLELDTLKAFRIQWANYYLK 134


>XP_013441863.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago
           truncatula] KEH15888.1 Ulp1 protease family,
           carboxy-terminal domain protein [Medicago truncatula]
          Length = 296

 Score = 72.8 bits (177), Expect = 8e-13
 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 1/102 (0%)
 Frame = +3

Query: 27  VVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRK-KNLIWINATVPIQVSN*ECGHYVMR 203
           VV++CS+  KP+  +   VN A+EGY    G RK +  IW       Q  N ECG+++M 
Sbjct: 190 VVFLCSMGWKPDKILVQIVNSAIEGYNMLSGFRKARKPIWEIPACQRQPFNYECGYFIMI 249

Query: 204 YMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           +MLNIVS  I   W   F D++PF+++ +   +E  A F LE
Sbjct: 250 HMLNIVSAGITDSWNMIFGDETPFTDDEMTKVQERCANFILE 291


>XP_006582171.1 PREDICTED: uncharacterized protein LOC102668599 [Glycine max]
           XP_014632186.1 PREDICTED: uncharacterized protein
           LOC102668599 [Glycine max]
          Length = 127

 Score = 68.9 bits (167), Expect = 2e-12
 Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24  VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG--KRKKNLIWINATVPIQVSN*ECGHYV 197
           VV W CSL  KP+  +K  +N A++    T      +    WI     +Q    ECG+YV
Sbjct: 15  VVAWFCSLRKKPDTHIKTAINNAMKTANTTANGTNNQGTPKWIEVKSHVQSGGYECGYYV 74

Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           M +M NI+SG +  DW   F D +   +E I    + WA ++L+
Sbjct: 75  MHWMWNIISGGLKNDWTMWFLDGTTLDKETITTIHQKWASYFLK 118


>XP_006589878.2 PREDICTED: uncharacterized protein LOC102665245 [Glycine max]
          Length = 339

 Score = 71.6 bits (174), Expect = 3e-12
 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 2/106 (1%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI--WINATVPIQVSN*ECGH 191
           Q VVVW CSL  KP+  +K T+N  +    +T     K +   WI     +Q    ECG+
Sbjct: 227 QHVVVWFCSLRRKPDMHIKATINSVMTKLKKTLSPETKAVAPKWIEVKSHVQTGCYECGY 286

Query: 192 YVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           Y+M ++ NI++  I  DW   F + +P    +I   R+ WA F+L+
Sbjct: 287 YIMHWIWNIIASDIKSDWSMWFANDTPLDIGIITTIRKKWATFFLK 332


>XP_014622322.1 PREDICTED: uncharacterized protein LOC100818274 [Glycine max]
          Length = 857

 Score = 71.2 bits (173), Expect = 6e-12
 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 2/104 (1%)
 Frame = +3

Query: 24   VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197
            V VW CSL  K + ++K  VN A++   +T +GK  ++   WI A   +Q  N EC +YV
Sbjct: 745  VAVWFCSLRKKLDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECEYYV 804

Query: 198  MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
            M ++  IVSG +   W + F+D+SP  EE +   R  WA ++++
Sbjct: 805  MHWIWCIVSGGLKDGWIDWFSDRSPIPEETMTTLRHKWAAYFIQ 848


>KYP40410.1 hypothetical protein KK1_038259 [Cajanus cajan]
          Length = 571

 Score = 70.9 bits (172), Expect = 8e-12
 Identities = 39/98 (39%), Positives = 52/98 (53%), Gaps = 1/98 (1%)
 Frame = +3

Query: 39  CSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVMRYMLN 215
           CS+   P    K  ++  +EGY   KG K KK + W+      Q  N ECG+YVM+ M  
Sbjct: 470 CSMYKPPPTEFKQLLDKTMEGYHILKGSKSKKKMQWLFVKSHKQNGNYECGYYVMKAMHT 529

Query: 216 IVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           IV+ +IV  W E F D+S    E I   RE WA F++E
Sbjct: 530 IVNSQIVSGWTEIFIDRSSLPLEDINIIREQWATFFIE 567


>XP_003627090.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago
           truncatula] AET01566.1 Ulp1 protease family,
           carboxy-terminal domain protein [Medicago truncatula]
          Length = 694

 Score = 70.5 bits (171), Expect = 1e-11
 Identities = 37/103 (35%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
 Frame = +3

Query: 27  VVWMCSLDHKP-ENSMKNTVNVALEGYWRTKGKRKKN-LIWINATVPIQVSN*ECGHYVM 200
           V ++CSL  KP + ++   V+ ALEGY++ +G RK + ++W   T   Q  + E G++VM
Sbjct: 583 VTFLCSLGKKPSDKNLPVIVDSALEGYYKLQGVRKHSKVVWFYPTSRRQSVSYESGYFVM 642

Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
            +MLNI+S  +V  W + F D +PF ++ ++  +E  A   LE
Sbjct: 643 LHMLNIISSGVVDSWMQIFADSTPFQKDEVKNVQERCANLILE 685


>XP_014630124.1 PREDICTED: uncharacterized protein LOC102667062 [Glycine max]
           XP_014630126.1 PREDICTED: uncharacterized protein
           LOC102667062 [Glycine max]
          Length = 767

 Score = 70.5 bits (171), Expect = 1e-11
 Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 2/106 (1%)
 Frame = +3

Query: 18  QPVVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI--WINATVPIQVSN*ECGH 191
           Q VVVW CSL  KP+  +K T+N  +    +T     K +   WI     +Q    ECG+
Sbjct: 655 QHVVVWFCSLRRKPDMHIKATINSVMTKLKKTLSPETKAVAPKWIEVKSHVQTGCYECGY 714

Query: 192 YVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329
           Y+M ++ NI+   I  DW   F + +P    +I   R+ WA F+L+
Sbjct: 715 YIMHWIWNIIVSDIKSDWSMWFANDTPLDIGIITTIRKKWATFFLK 760


>KHM99138.1 hypothetical protein glysoja_032029 [Glycine soja]
          Length = 315

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 2/112 (1%)
 Frame = +3

Query: 27  VVWMCSLDHKPENSMKNTVNVALEGYWR-TKGKRKKNLI-WINATVPIQVSN*ECGHYVM 200
           VVW CSL  K + S+K TVN A++   +  KG   + +  WI   + +Q    EC +YVM
Sbjct: 204 VVWFCSLRKKLDASIKATVNSAMKTLSKGDKGNTDQPMPQWIEPMIHVQTGAYECVYYVM 263

Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHYERSIKKT 356
            ++ NIVSG +  +W   F+D +P ++E +   R  WA ++L+     ++K+
Sbjct: 264 HWIWNIVSGGLKDEWITWFSDGTPLTKETMTTLRHKWAAYFLQIKNLEVRKS 315


Top