Skip to content

Commit 094ebb7

Browse files
authored
Clarify /crawl changelog (#28914)
* Clarify /crawl changelog * minor changes
1 parent e4bb62b commit 094ebb7

1 file changed

Lines changed: 6 additions & 2 deletions

File tree

src/content/changelog/browser-rendering/2026-03-10-br-crawl-endpoint.mdx

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,9 @@ products:
66
date: 2026-03-10
77
---
88

9-
You can now crawl an entire website with a single API call using [Browser Rendering](/browser-rendering/)'s new [`/crawl` endpoint](/browser-rendering/rest-api/crawl-endpoint/), available in open beta. Submit a starting URL, and pages are automatically discovered, rendered in a headless browser, and returned in multiple formats, including HTML, Markdown, and structured JSON. This is great for training models, building RAG pipelines, and researching or monitoring content across a site.
9+
_Edit: this post has been edited to clarify crawling behavior with respect to site guidance._
10+
11+
You can now crawl an entire website with a single API call using [Browser Rendering](/browser-rendering/)'s new [`/crawl` endpoint](/browser-rendering/rest-api/crawl-endpoint/), available in open beta. Submit a starting URL, and pages are automatically discovered, rendered in a headless browser, and returned in multiple formats, including HTML, Markdown, and structured JSON. The endpoint is a [signed-agent](https://developers.cloudflare.com/bots/concepts/bot/signed-agents/) that respects robots.txt and [AI Crawl Control](https://www.cloudflare.com/ai-crawl-control/) by default, making it easy for developers to comply with website rules, and making it less likely for crawlers to ignore web-owner guidance. This is great for training models, building RAG pipelines, and researching or monitoring content across a site.
1012

1113
Crawl jobs run asynchronously. You submit a URL, receive a job ID, and check back for results as pages are processed.
1214

@@ -33,7 +35,9 @@ Key features:
3335
- **Static mode** - Set `render: false` to fetch static HTML without spinning up a browser, for faster crawling of static sites
3436
- **Well-behaved bot** - Honors `robots.txt` directives, including `crawl-delay`
3537

36-
Available on both the Workers Free and Paid plans.
38+
Available on both the Workers Free and Paid plans.
39+
40+
**Note**: the /crawl endpoint cannot bypass Cloudflare bot detection or captchas, and self-identifies as a bot.
3741

3842
To get started, refer to the [crawl endpoint documentation](/browser-rendering/rest-api/crawl-endpoint/).
3943
If you are setting up your own site to be crawled, review the [robots.txt and sitemaps best practices](/browser-rendering/reference/robots-txt/).

0 commit comments

Comments
 (0)