Skip to content

CONCEPT Cited by 1 source

WAF attack score

Definition

WAF Attack Score is Cloudflare's ML-based request classification layer that assigns a score (1–99) to every HTTP request based on structural similarity to historical attack traffic — independent of signature-based rules. Lower scores indicate higher confidence the request is malicious.

How it differs from signature matching

Traditional WAF rules match against a list of known-bad patterns (regex, content strings, CVE-specific signatures). WAF Attack Score is trained on the shape of past attacks: a novel SQL injection or RCE chain is almost always a rearrangement of attack shapes the model has seen before, even when the specific exploit is brand new. This makes it effective against zero-day variants and frontier-model-generated payloads.

Operational detail

  • Score range: 1–99
  • Scoring runs on every request (not sampled)
  • Lower score → more aggressively treated (challenge, block, or rate-limit)
  • Same methodology applied to AI prompts via "AI Security for Apps" — scoring prompt similarity to known attack prompts rather than checking against a list

(Source: sources/2026-06-09-cloudflare-defend-against-frontier-cyber-models)

Seen in

Last updated · 542 distilled / 1,571 read