content-embedded-secrets¶

Detect potential API keys, tokens, and passwords in instruction files


Severity	error (auto)
Autofix	-
Since	v0.7.0
Category	Content Intelligence

Why¶

Instruction files are checked into version control and often read by multiple agents and users. A hardcoded API key, token, or password in an instruction file is a credential leak — it is visible in the git history even after removal and may be harvested by automated scanners.

Detection¶

Two classes of match are handled differently:

Structured token formats (AKIA…, ghp_…, sk-ant-…, private-key blocks, JWTs, …) are high-confidence and always reported.
Generic credential assignments (password = "…", api_key: "…", secret_key, access_token) are gated to avoid flagging documentation examples:
- Placeholder allowlist: values containing obvious placeholder markers (example, placeholder, dummy, changeme, your-…, hunter2, …), template syntax (<your-key>, ${VAR}, {{ var }}), or a single repeated character are skipped. Extend the list with additional-placeholders.
- Entropy gating: the value's Shannon entropy must reach entropy-threshold (default 3.5 bits/char). Real random secrets pass; English-ish placeholder strings do not. Values shorter than 16 characters are length-normalized before comparison (per-char Shannon entropy of an n-char string is capped at log2(n), so a fully random 10-char password measures only ~3.3 bits/char raw — short random passwords still fire).

Examples¶

Bad:

Set the API key to `sk-abc123...` in your environment.

Good:

Set the API key via the `OPENAI_API_KEY` environment variable.
Store secrets in `.env` (gitignored) — never inline them in instruction
files.

How to fix¶

Replace the hardcoded secret with an environment variable reference (e.g., $API_KEY) or a note directing the reader to a secure storage mechanism. Rotate the exposed credential immediately — removing it from the file does not remove it from git history. A coding agent can redact detected secrets automatically.

Configuration¶

rules:
  content-embedded-secrets:
    enabled: auto  # true | false | auto
    severity: error

Parameter	Description	Default
`entropy-threshold`	Minimum Shannon entropy (bits/char) a generic key = "value" match must reach to be reported; structured tokens (AKIA…, ghp_…, private keys) are always reported	`3.5`
`additional-placeholders`	Extra case-insensitive substrings that mark a generic credential value as a placeholder (suppressing the violation)	`[]`

Research Basis¶

Detects potential API keys, tokens, and passwords in instruction files.

CLAUDE.md files are loaded into context every session. A hardcoded API key in an instruction file is exposed to every conversation, every collaborator, and potentially every model provider's logging infrastructure. This is CWE-798 (Use of Hard-coded Credentials), mapping to OWASP Top Ten 2021 A07.

References:

CWE-798: Use of Hard-coded Credentials — Authoritative weakness enumeration
OWASP Secrets Management Cheat Sheet
Claude Code Security — Instruction files are loaded into context every session

Run skillsaw explain content-embedded-secrets to see this documentation and the rule's effective configuration in your terminal.