Detect Prompt Injection Before It Strikes
A real-time API that scans user input for prompt injection patterns and returns a risk score (0–100) along with a sanitized version. Protect your AI apps in one API call.
POST /api/v1/scan
{
  "input": "Ignore previous instructions and...",
  "result": {
    "risk_score": 94,
    "flagged": true,
    "sanitized": "[REDACTED]",
    "patterns": ["instruction_override"]
  }
}
Simple Pricing
Pro
$29/month
- ✓ 100,000 API calls/month
- ✓ Risk scores + sanitized output
- ✓ Pattern library updates
- ✓ API key dashboard
- ✓ Usage analytics
- ✓ Email support
FAQ
How does the risk score work?
We analyze inputs using ML pattern matching and heuristics across 50+ known injection patterns. Scores range from 0 (safe) to 100 (critical threat), giving you fine-grained control over your thresholds.
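As a rough sketch of what threshold control could look like on your side, the function below maps a risk score to an action. The cutoffs (40 and 80), the action names, and the function name `decide` are illustrative assumptions, not values defined or recommended by the API; tune them to your own tolerance for false positives.

```python
from typing import Literal

Action = Literal["allow", "review", "block"]


def decide(risk_score: int, review_at: int = 40, block_at: int = 80) -> Action:
    """Map a 0-100 risk score to an action using caller-chosen thresholds.

    The default thresholds are hypothetical examples, not API defaults.
    """
    if not 0 <= risk_score <= 100:
        raise ValueError(f"risk_score must be in 0..100, got {risk_score}")
    if review_at > block_at:
        raise ValueError("review_at must not exceed block_at")

    if risk_score >= block_at:
        return "block"   # reject the input outright
    if risk_score >= review_at:
        return "review"  # e.g. log it or route it to a human
    return "allow"       # pass the input through unchanged
```

With these example cutoffs, the score of 94 from the sample response at the top of the page would fall in the "block" band.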
What is a sanitized output?
When injection patterns are detected, we return a cleaned version of the input with the malicious instructions removed or redacted, so you can choose to pass that version downstream instead of the original.
Can I integrate this with any LLM?
Yes. The API is model-agnostic: make a single REST call before you send input to OpenAI, Anthropic, or any other LLM. It works with any language or framework that can make an HTTP request.
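A minimal sketch of that pre-LLM call in Python, using only the standard library. Only the `/api/v1/scan` path and the response fields (`flagged`, `sanitized`) come from the sample at the top of this page; the host `api.example.com`, the Bearer-token header, the `{"input": ...}` request body, and all function names are assumptions made for illustration, so check your dashboard for the real values.

```python
import json
import urllib.request
from typing import Callable

# Hypothetical host; only the /api/v1/scan path appears on this page.
SCAN_URL = "https://api.example.com/api/v1/scan"


def build_scan_request(user_input: str, api_key: str) -> urllib.request.Request:
    """Build the POST request. Body shape and auth scheme are assumed."""
    body = json.dumps({"input": user_input}).encode("utf-8")
    return urllib.request.Request(
        SCAN_URL,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
    )


def scan(user_input: str, api_key: str) -> dict:
    """Send the request and return the "result" object from the response."""
    request = build_scan_request(user_input, api_key)
    with urllib.request.urlopen(request, timeout=5) as response:
        return json.load(response)["result"]


def text_for_llm(user_input: str, scan_fn: Callable[[str], dict]) -> str:
    """Return the sanitized text if the input was flagged, else the original."""
    result = scan_fn(user_input)
    return result["sanitized"] if result["flagged"] else user_input
```

In an application you might call `text_for_llm(prompt, lambda s: scan(s, api_key))` and hand the returned string to whichever model client you already use. Passing the scan function in as a parameter keeps the check independent of the LLM provider and easy to stub in tests.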