[WAF] AI security updates (#28919)

pedrosousa · web-flow · commit e4bb62b2c43e · 2026-03-11T15:15:51.000Z
diff --git a/src/content/docs/waf/detections/ai-security-for-apps/index.mdx b/src/content/docs/waf/detections/ai-security-for-apps/index.mdx
@@ -29,12 +29,13 @@ Cloudflare will populate [AI detection fields](/waf/detections/ai-security-for-a
 
 AI Security for Apps capabilities vary by Cloudflare plan:
 
-| Capability                                                                                               | Free | Pro | Business | Enterprise |
-| -------------------------------------------------------------------------------------------------------- | ---- | --- | -------- | ---------- |
-| **LLM endpoint discovery** — Automatically identify AI-powered endpoints across your web properties      | Yes  | Yes | Yes      | Yes        |
-| **AI detection fields** — PII detection, prompt injection scoring, unsafe topic detection, custom topics | No   | No  | No       | Yes        |
+| Capability                                                                                                       | Free | Pro | Business | Enterprise |
+| ---------------------------------------------------------------------------------------------------------------- | ---- | --- | -------- | ---------- |
+| **LLM endpoint discovery** — Automatically identify AI-powered endpoints across your web properties              | Yes  | Yes | Yes      | Yes        |
+| **AI Security Log Mode Ruleset** — Pre-built ruleset that logs the full request body alongside detection results | No   | No  | No       | Yes        |
+| **AI detection fields** — PII detection, prompt injection scoring, unsafe topic detection, custom topics         | No   | No  | No       | Yes        |
 
-To enable AI detection fields, contact your account team.
+To get access to the [AI Security Log Mode Ruleset](/waf/detections/ai-security-for-apps/log-mode-vs-production-mode/#log-mode) and enable [AI detection fields](/waf/detections/ai-security-for-apps/fields/), contact your account team.
 
 AI Security for Apps is built into the Cloudflare [Web Application Firewall (WAF)](/waf/) — the WAF must be enabled on your zone before detection fields can be populated and used in rule expressions.
 
diff --git a/src/content/docs/waf/detections/ai-security-for-apps/log-mode-vs-production-mode.mdx b/src/content/docs/waf/detections/ai-security-for-apps/log-mode-vs-production-mode.mdx
@@ -23,7 +23,7 @@ AI Security for Apps can operate in two distinct modes. Understanding the trade-
 
 | Feature                | Production mode                                                                              | Log mode                                                                                                                              |
 | ---------------------- | -------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------- |
-| **How it works**       | You write WAF [custom rules](/waf/custom-rules/) using AI Security for Apps detection fields | You enable the **AI Security Log Mode Ruleset** with pre-built rules                                                                  |
+| **How it works**       | You write WAF [custom rules](/waf/custom-rules/) using AI Security for Apps detection fields | You enable the AI Security Log Mode Ruleset with pre-built rules                                                                      |
 | **Prompt logging**     | No — only request metadata is logged                                                         | Yes — the full request body is logged (encrypted via [payload logging](/waf/managed-rules/payload-logging/))                          |
 | **Response logging**   | No — use [AI Gateway](/ai-gateway/) if response visibility is required                       | No — same limitation                                                                                                                  |
 | **Policy flexibility** | Full — combine injection scores, PII categories, bot scores, custom topics, and more         | Limited — three fixed rules (PII detected, unsafe topic detected, prompt injection detected) with no score-based or subcategory logic |
@@ -48,7 +48,7 @@ In production mode, the prompt text is not logged. You can see detection metadat
 
 ## Log mode
 
-Log mode uses the **AI Security Log Mode Ruleset** — a pre-built ruleset that logs the full request body alongside detection results. This mode is designed for evaluation and tuning rather than production enforcement.
+Log mode uses the AI Security Log Mode Ruleset — a pre-built ruleset that logs the full request body alongside detection results. This mode is designed for evaluation and tuning rather than production enforcement.
 
 In log mode:
 
diff --git a/src/content/fields/index.yaml b/src/content/fields/index.yaml
@@ -1339,23 +1339,24 @@ entries:
 
       | Value | Category name             | Description                                                                                                       |
       | ----- | ------------------------- | ----------------------------------------------------------------------------------------------------------------- |
-      | `S1`  | Violent Crimes            | Violent crimes against people or animals.                                                                         |
-      | `S2`  | Non-Violent Crimes        | Non-violent offenses such as fraud, theft, drug creation, or hacking.                                             |
-      | `S3`  | Sex-Related Crimes        | Sex-related crimes, including trafficking, assault, and harassment.                                               |
-      | `S4`  | Child Sexual Exploitation | Sexual exploitation of children.                                                                                  |
+      | `S1`  | Violent crimes            | Violent crimes against people or animals.                                                                         |
+      | `S2`  | Non-violent crimes        | Non-violent offenses such as fraud, theft, drug creation, or hacking.                                             |
+      | `S3`  | Sex-related crimes        | Sex-related crimes, including trafficking, assault, and harassment.                                               |
+      | `S4`  | Child sexual exploitation | Sexual exploitation of children.                                                                                  |
       | `S5`  | Defamation                | False statements that are likely to damage a living person's reputation.                                          |
-      | `S6`  | Specialized Advice        | Specialized financial, medical, or legal advice, or misrepresent dangerous things as safe.                        |
+      | `S6`  | Specialized advice        | Specialized financial, medical, or legal advice, or misrepresenting dangerous things as safe.                     |
       | `S7`  | Privacy                   | Sensitive, nonpublic personal information that could endanger an individual.                                      |
-      | `S8`  | Intellectual Property     | Violate a third party's intellectual property rights.                                                             |
-      | `S9`  | Indiscriminate Weapons    | Creation of indiscriminate weapons like chemical, biological, or nuclear arms.                                    |
+      | `S8`  | Intellectual property     | Violate a third party's intellectual property rights.                                                             |
+      | `S9`  | Indiscriminate weapons    | Creation of indiscriminate weapons like chemical, biological, or nuclear arms.                                    |
       | `S10` | Hate                      | Demean or dehumanize people based on their race, religion, sexual orientation, or other personal characteristics. |
-      | `S11` | Suicide & Self-Harm       | Encourage or endorse suicide, self-injury, or disordered eating.                                                  |
-      | `S12` | Sexual Content            | Erotic content.                                                                                                   |
+      | `S11` | Suicide and self-harm     | Encourage or endorse suicide, self-injury, or disordered eating.                                                  |
+      | `S12` | Sexual content            | Erotic content.                                                                                                   |
       | `S13` | Elections                 | False information about the time, place, or manner of voting in elections.                                        |
+      | `S14` | Code interpreter abuse    | Misuse of code execution capabilities.                                                                            |
 
       Requires a Cloudflare Enterprise plan. You must also enable [AI Security for Apps](/waf/detections/ai-security-for-apps/).
     example_block: |-
-      # Matches requests where an unsafe topic categorized as "S2" (Non-Violent Crimes) or "S10" (Hate) was detected in the LLM prompt:
+      # Matches requests where an unsafe topic categorized as "S2" (Non-violent crimes) or "S10" (Hate) was detected in the LLM prompt:
       (cf.llm.prompt.unsafe_topic_detected and any(cf.llm.prompt.unsafe_topic_categories[*] in {"S2" "S10"}))
 
   - name: cf.llm.prompt.injection_score
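Reviewer note: the `example_block` expression for `cf.llm.prompt.unsafe_topic_categories` in the hunk above combines a boolean flag with an "any element is in this set" check. The sketch below mirrors that logic on plain Python values to make the matching behavior easier to reason about, including for the newly added `S14` category. It is only an illustration: `matches_rule` and `WATCHED_CATEGORIES` are hypothetical names, not part of any Cloudflare API, and the real evaluation happens inside the WAF rules engine.

```python
from typing import FrozenSet, Iterable

# Hypothetical constant: the category codes from the diff's example expression.
WATCHED_CATEGORIES: FrozenSet[str] = frozenset({"S2", "S10"})


def matches_rule(
    unsafe_topic_detected: bool,
    unsafe_topic_categories: Iterable[str],
    watched: FrozenSet[str] = WATCHED_CATEGORIES,
) -> bool:
    """Approximate, in plain Python, the logic of the expression:

    (cf.llm.prompt.unsafe_topic_detected and
     any(cf.llm.prompt.unsafe_topic_categories[*] in {"S2" "S10"}))
    """
    # The rule matches only when the flag is set AND at least one
    # detected category is in the watched set.
    return unsafe_topic_detected and any(
        category in watched for category in unsafe_topic_categories
    )


if __name__ == "__main__":
    print(matches_rule(True, ["S1", "S10"]))  # one watched category present
    print(matches_rule(True, ["S14"]))        # S14 is not in the watched set
    print(matches_rule(False, ["S2"]))        # flag not set
```

Under this reading, a rule written against `{"S2" "S10"}` would not match a prompt flagged only as `S14` (Code interpreter abuse), so existing rules would need the new code added explicitly to cover it.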