Originally created by: TheoV823
PR [#43] shipped a narrow CI hotfix after site/llms.txt triggered a false-positive deploy_001 governance verdict.
The immediate site issue was resolved by excluding site/llms.txt and site/llms-full.txt from the mneme-check CI file scope. That is acceptable as a hotfix, but it exposed a deeper product-level retrieval limitation.
deploy_001 includes a multi-word anti-pattern such as:
"python scripts/deploy_site.py"
The current retrieval behavior appears to tokenize anti-patterns word-by-word. As a result, static content that mentions Python can retrieve a deployment-script rule even when the file is not executable code and has no deploy-script context.
This is noisy because the rule is about running a deployment script from the wrong directory, not about the word Python appearing in documentation or AI-discovery files.
This issue tracks product improvements to make retrieval more scope-aware and less token-noisy:
Phrase-level anti-pattern matching
Example: python scripts/deploy_site.py should only fire when that sequence or an equivalent command pattern appears, not when a file merely contains the word Python.
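To make the difference concrete, here is a minimal sketch contrasting word-by-word matching with whole-phrase matching. It is not Mneme's actual retrieval code: the function names token_match and phrase_match are hypothetical, and the token-based version only approximates the behavior the issue suspects.

```python
import re

ANTI_PATTERN = "python scripts/deploy_site.py"


def token_match(anti_pattern: str, content: str) -> bool:
    """Word-by-word matching: fires if ANY token of the anti-pattern appears.

    Approximates the noisy behavior described in this issue (assumption).
    """
    tokens = {t.lower() for t in re.findall(r"\w+", anti_pattern)}
    words = {w.lower() for w in re.findall(r"\w+", content)}
    return bool(tokens & words)


def phrase_match(anti_pattern: str, content: str) -> bool:
    """Phrase matching: fires only if the whole command appears in order.

    Any run of whitespace is allowed between the parts of the command.
    """
    parts = [re.escape(part) for part in anti_pattern.split()]
    pattern = r"(?<!\w)" + r"\s+".join(parts) + r"(?!\w)"
    return re.search(pattern, content, flags=re.IGNORECASE) is not None


doc_text = "This site documents a tool written in Python."
workflow_text = "run: cd site && python  scripts/deploy_site.py"

print(token_match(ANTI_PATTERN, doc_text))       # fires on the word alone
print(phrase_match(ANTI_PATTERN, doc_text))      # does not fire
print(phrase_match(ANTI_PATTERN, workflow_text)) # fires on the real command
```

Phrase matching stays deterministic, since the same rule text and file content always produce the same result. What counts as an "equivalent command pattern" (for example python3 or a relative path prefix) is left open here.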
Rule-level applies_to scope
Example: deployment-script rules could apply to scripts/*.py, workflow files, or shell commands, but not site/*.txt.
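One way a rule-level scope could work is sketched below, assuming applies_to is a list of glob patterns stored on the rule. The Rule class, the in_scope helper, and the example patterns are hypothetical and are not taken from Mneme's schema.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatchcase


@dataclass
class Rule:
    rule_id: str
    # Hypothetical field: glob patterns for the files this rule may apply to.
    applies_to: list[str] = field(default_factory=list)


def in_scope(rule: Rule, path: str) -> bool:
    """Return True if the rule may be retrieved for this file path."""
    if not rule.applies_to:
        # Assumption: rules without a scope keep today's behavior.
        return True
    # fnmatchcase is case-sensitive on every platform, which keeps results
    # deterministic. Note that "*" also matches "/" in fnmatch patterns.
    return any(fnmatchcase(path, pattern) for pattern in rule.applies_to)


deploy_001 = Rule(
    rule_id="deploy_001",
    applies_to=["scripts/*.py", ".github/workflows/*.yml", "*.sh"],
)

print(in_scope(deploy_001, "scripts/deploy_site.py"))  # in scope
print(in_scope(deploy_001, "site/llms.txt"))           # out of scope
```

With a scope like this, the CI-level exclusion of site/llms.txt and site/llms-full.txt could move into the rule itself rather than living in the workflow configuration.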
Content-type awareness
Distinguish static content from .htaccess, Python, shell, and workflow files.
Deployment/code-execution rules should not be retrieved for static content unless explicitly scoped there.
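A rough sketch of content-type awareness follows, assuming the file type is inferred from the file name alone. The ContentType categories, the suffix table, and both function names are illustrative assumptions, not an existing Mneme API.

```python
from enum import Enum
from pathlib import PurePosixPath


class ContentType(Enum):
    STATIC = "static"    # prose or data that is never executed
    ACTIVE = "active"    # code or configuration that changes behavior
    UNKNOWN = "unknown"  # unrecognized, treated conservatively


# Hypothetical lookup tables; a real design would likely make these configurable.
_BY_NAME = {".htaccess": ContentType.ACTIVE}
_BY_SUFFIX = {
    ".py": ContentType.ACTIVE,
    ".sh": ContentType.ACTIVE,
    ".yml": ContentType.ACTIVE,
    ".yaml": ContentType.ACTIVE,
    ".txt": ContentType.STATIC,
    ".md": ContentType.STATIC,
    ".html": ContentType.STATIC,
}


def classify(path: str) -> ContentType:
    """Classify a repository path by file name, then by suffix."""
    p = PurePosixPath(path)
    if p.name in _BY_NAME:
        return _BY_NAME[p.name]
    return _BY_SUFFIX.get(p.suffix.lower(), ContentType.UNKNOWN)


def retrieves_execution_rules(path: str, explicitly_scoped: bool = False) -> bool:
    """Decide whether deployment/code-execution rules are candidates for a file.

    Static files are skipped unless a rule explicitly scopes itself to them.
    Unknown files are still checked, so nothing is silently exempted.
    """
    if explicitly_scoped:
        return True
    return classify(path) is not ContentType.STATIC


print(classify("site/llms.txt"))
print(classify("site/.htaccess"))
print(retrieves_execution_rules("site/llms.txt"))
```

Treating unrecognized files as UNKNOWN rather than STATIC is a deliberate choice in this sketch: it avoids exempting a file from governance just because its extension is missing from the table.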
Explicit no-scope verdict
When no rule is in scope for a file, return "SKIP — no rules apply" rather than an ambiguous UNKNOWN.
Before implementation, propose an architecture-compatible design that preserves Mneme's deterministic governance model:
site/llms.txt mentions Python but does not retrieve/fail deploy_001.
Post-freeze product improvement. Important, but not an emergency fix.