CONCEPT Cited by 1 source
WAF attack score¶
Definition¶
WAF Attack Score is Cloudflare's ML-based request classification layer that assigns a score (1–99) to every HTTP request based on structural similarity to historical attack traffic — independent of signature-based rules. Lower scores indicate higher confidence the request is malicious.
How it differs from signature matching¶
Traditional WAF rules match against a list of known-bad patterns (regex, content strings, CVE-specific signatures). WAF Attack Score is trained on the shape of past attacks: a novel SQL injection or RCE chain is almost always a rearrangement of attack shapes the model has seen before, even when the specific exploit is brand new. This makes it effective against zero-day variants and frontier-model-generated payloads.
Operational detail¶
- Score range: 1–99
- Scoring runs on every request (not sampled)
- Lower score → more aggressively treated (challenge, block, or rate-limit)
- Same methodology applied to AI prompts via "AI Security for Apps" — scoring prompt similarity to known attack prompts rather than checking against a list
(Source: sources/2026-06-09-cloudflare-defend-against-frontier-cyber-models)
Seen in¶
- sources/2026-06-09-cloudflare-defend-against-frontier-cyber-models — "We run the model on every request and assign a WAF Attack Score between 1 and 99, based on how closely the request resembles those underlying shapes, not against a list of known-bad signatures."