🔤
L0
Input Normalizer
Anti-obfuscation preprocessing: base64 decoding, Unicode normalization, zero-width character stripping, and leetspeak expansion. Ensures downstream layers see the true payload.
💉
L1
Prompt Injection Detection
12+ regex patterns covering DAN, SYSTEM overrides, Llama-format injection, and null-byte attacks. Sub-3ms detection before any token is processed.
🧹
L1
PII Stripping
SSN, credit cards, emails stripped before the model sees them. Mutate not block — clean requests still pass with redacted content. Optional Presidio NER.
🔑
L2.1–L2.3
Auth + Trust Chain
API key/JWT validation, agent-to-agent HMAC trust tokens with 60s replay protection, and live threat signature matching from OWASP + CrowdSec.
🔌
L2.4–L2.5
MCP + RAG Guard
MCP environment hardening validates tool server manifests and parameter schemas. RAG poisoning detection inspects retrieved context for injected instructions and adversarial chunks.
🧠
L3
Intent Guard
Semantic drift detection across multi-turn conversations, topic correlation analysis, and command-and-control pattern recognition. Catches slow-burn manipulation attacks.
🛠
L4
Tool RBAC
Deny-by-default for destructive tools (drop, rm, wipe, purge). Allowlist per role. CVE-mapped tool patterns from NVD. Prevents agents from calling what they shouldn't.
⏱
L5
Rate + Cost Cap
Per-session token bucket and daily cost accumulator. Single request with $101 spend triggers immediate rejection. Global E-Stop flag for instant kill-switch.
🌐
L6
SSRF Prevention
RFC-1918 private ranges, localhost, AWS/Azure/GCP metadata endpoints all blocked in model egress. Private infrastructure can never be reached via a model response.
🚫
L6
Exfiltration Block
AWS IAM keys, PGP/RSA private key blocks, JWTs, SQL dumps, and large base64 blobs caught in model output before they reach the client.
📜
L7
Tamper-Evident Audit
SHA-256 hash-chained record per request. Chain verification detects any modification. Every pass, reject, and mutation logged with caller, layer, and timestamp.
⚡
L8
Semantic Cache + Routing
Intelligent response caching and model routing. Token budget tracking per request. Reduces redundant LLM calls while maintaining security coverage on every interaction.