15 Detection Engines
Counters, content scanners, ML security models, and rule engines. Under 25ms total latency.
SAFE MODE
Safe Mode is a local MCP proxy that sits between your AI editor and your tools. 15 detection engines, automatic rollback, and phone approvals — no cloud required.
npx safemode initHOW IT WORKS
Safe Mode detects your MCP clients, patches their configs to route through the proxy, and runs every tool call through the detection engine pipeline. Original configs are backed up to ~/.safemode/backup/.
SUPPORTED CLIENTS
Plus any MCP-compatible client.
FEATURES
Counters, content scanners, ML security models, and rule engines. Under 25ms total latency.
Start with coding, personal, strict, trading, or yolo. Tune every parameter individually.
Automatic file snapshots before every write. Instant rollback to any restore point.
Route high-risk actions to Telegram or Discord. Approve or deny from your phone.
No cloud required. No telemetry. No data leaves your machine unless you opt in.
Connect to TrustScope for +12 cloud engines, Agent DNA, compliance evidence, and team dashboards.
PRESETS
Balanced for software development. Scoped writes, terminal guards, budget caps.
Budget cap: $20/session
For Claude Desktop personal use. No terminal, no git, filesystem scoped.
Budget cap: $10/session
Maximum restrictions. Everything requires approval. Read-only filesystem.
Budget cap: $5/session
Circuit breakers and hard caps for financial operations.
Budget cap: $50/session
Maximum autonomy. Catch catastrophes only. For sandboxed environments.
Budget cap: $100/session
Switch anytime with safemode preset <name>. Every knob is individually overridable.
GRANULAR CONTROL
Every knob is allow, approve, or block. Presets set sensible defaults. Override any knob individually.
| Category | Knobs | Controls |
|---|---|---|
| Terminal | 10 | command exec, destructive commands, sudo, package installs, daemons |
| Filesystem | 8 | read, write, delete, symlinks, permissions |
| Git | 6 | commit, push, force push, branch delete, rebase |
| Network | 5 | HTTP, WebSocket, DNS, domain allowlist/blocklist |
| Database | 5 | read, write, delete, schema change, admin |
| Financial | 5 | payments, transfers, subscriptions, refunds |
| API | 5 | read, write, delete, admin, rate limit |
| Communication | 5 | email, messages, notifications, calendar |
| Cloud Infrastructure | 5 | instances, storage, network, IAM |
| Container | 4 | create, delete, image pull, volume mount |
| Package Management | 4 | install, uninstall, update, publish |
| Deployment | 4 | staging, production, rollback, scale |
| Data | 4 | export, import, backup, transform |
| Scheduling | 3 | cron, timers, scheduled tasks |
| Authentication | 3 | credential read/write, sessions |
| Monitoring | 3 | logs, metrics, alerts |
| Browser | 3 | navigate, form submit, download |
| Physical | 3 | IoT commands, hardware, sensors |
| Custom | — | your own rules |
allow
Action proceeds without interruption
approve
Action paused until human approval
block
Action denied completely
2 knobs are hardcoded and cannot be overridden: destructive_commands and pipe_to_shell are always blocked.
DETECTION ENGINES
ML security models (prompt injection & jailbreak) are optional. Enable with safemode ml --enable (~85MB download).
CLI
$ safemode init # detect clients, pick preset, patch configs
$ safemode start # start proxy
$ safemode status # show engines, clients, preset
$ safemode summary # session stats + restore points
$ safemode restore # rollback to snapshot
$ safemode scan # security scan current directory
$ safemode doctor # health check + diagnostics
$ safemode activity -f # live activity feedSafe Mode works entirely offline. When you're ready for team visibility, compliance evidence, and advanced detection, connect to TrustScope with safemode connect.
+12 cloud engines
PII cloud, anomaly detection, Agent DNA, behavioral fingerprinting
Team dashboard
Centralized visibility across all agents and developers
Compliance evidence
Signed receipts, hash chains, and framework-mapped exports
One command. No account. No cloud. Just guardrails.
npx safemode init