News: 0178245910

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Cloudflare Flips AI Scraping Model With Pay-Per-Crawl System For Publishers (cloudflare.com)

(Tuesday July 01, 2025 @05:20PM (msmash) from the AI-toll-booth dept.)


Cloudflare today announced a " [1]Pay Per Crawl " program that allows website owners to charge AI companies for accessing their content, a potential revenue stream for publishers whose work is increasingly being scraped to train AI models. The system uses HTTP response code 402 to enable content creators to set per-request prices across their sites. Publishers can choose to allow free access, require payment at a configured rate, or block crawlers entirely.

When an AI crawler requests paid content, it either presents payment intent via request headers for successful access or receives a "402 Payment Required" response with pricing information. Cloudflare acts as the merchant of record and handles the underlying technical infrastructure. The company aggregates billing events, charges crawlers, and distributes earnings to publishers.

Alongside Pay Per Crawl, Cloudflare has [2]switched to blocking AI crawlers by default for its customers, becoming the first major internet infrastructure provider to require explicit permission for AI access. The company handles traffic for 20% of the web and more than one million customers have already activated its AI-blocking tools since their September 2024 launch, it wrote in a blog post.



[1] https://blog.cloudflare.com/introducing-pay-per-crawl/

[2] https://www.cloudflare.com/ru-ru/press-releases/2025/cloudflare-just-changed-how-ai-crawlers-scrape-the-internet-at-large/



The great internet paywall begins (Score:5, Interesting)

by xack ( 5304745 )

All because you couldn't behave, now we have the equivalent of the TSA for the internet. Expect browsers like Pale Moon, Ladybird and Seamonkey to get put on the wrong side of the wall again. Expect adblocker users to get hit by this soon too, as now they have the technology.

Re: (Score:3)

by JamesTRexx ( 675890 )

I did see a lot more Cloudflare prompts lately on sites which had no issue with Pale Moon before.

I prefer my method (Score:1)

by CEC-P ( 10248912 )

Someone already developed an AI Tar Pit code for website. When it detects bot traffic or robots.txt violations, you just keep creating more and more dummy pages in an infinite chain and then slow down the response time for the server to like 750ms. It traps the bots there forever without using much bandwidth.

Re: I prefer my method (Score:2)

by EldoranDark ( 10182303 )

Why not both though?

Re: (Score:2)

by OverlordQ ( 264228 )

That's what CF was already doing.

So when are the lawsuits coming? (Score:2)

by Sebby ( 238625 )

No doubt Meta[stasize], Google, OpenAI and all other major AI shops will whine about having to pay for anything and conjure up some reasoning why this system is illegal because reasons and sue Cloudflare to tie them up in litigation - so my question is: when is that happening?

Re: (Score:2)

by timeOday ( 582209 )

I actually think that's the way to go. There needs to be a law about whether AI is allowed to learn from content on the same terms as a living individual, or not. Then there also need to be technical means to enact whatever policy is legislated, which is here this Cloudflare technology could fit in.

Re: (Score:2)

by smooth wombat ( 796938 )

will whine about having to pay for anything

You mean like people on here and elsewhere who brag about stealing music/movies/software because they don't want to pay?

If it's okay for you to steal someone else's work, why is not acceptable for these companies to scrape available content?

Re: (Score:2)

by Sebby ( 238625 )

> will whine about having to pay for anything You mean like people on here and elsewhere who brag about stealing music/movies/software because they don't want to pay?

"brag" != "whine"

> If it's okay for you to steal someone else's work

WTF here said that in this discussion. Provide citations.

> why is not acceptable for these companies to scrape available content?

Comparing apples to oranges.

Multi-billion$ AI companies scrape content, then repeatedly sell access to services that use that content at scale without compensation to the creators, without whose content those companies would have nothing to offer in the first place.

Quite different than some individual "stealing" a song for their own use (sure there's some level of deprivation of funding to the creator, but they're not making money of

Interesting but too late... (Score:2)

by PsychoSlashDot ( 207849 )

I'd imagine most of the AI crawlers out there have already ingested most of the freely available content. I don't imagine many places that have new content who haven't already struck up some agreement with the scrapers. For instance Ars Technica has.

Had this existed three years ago, it might've been interesting.

That said, there's something missing. There's two ways a crawler can work. Either they request content, get told a price and have to reconnect and agree to the price OR they can declare in ad

but muh profits! (Score:2)

by Growlley ( 6732614 )

how is big tech supposed to profit from stealing your content if they have to pay for it?

So AI only crawls spam content for AI from now on (Score:1)

by jkechel ( 1101181 )

Former it was like: AI might hurt itself when it gets more and more ai generated input

Now its like: AI will only get ai generated content at all

Re: (Score:2)

by usedtobestine ( 7476084 )

But not the spam behind cloudflare...

Brief History Of Linux (#15)
Too many hyphens: Traf-O-Data and Micro-soft

Bill Gates and Paul Allen attended an exclusive private school in Seattle.
In 1968, after raising $3,000 from a yard sale, they gained access to a
timeshare computer and became addicted. After depleting their money
learning BASIC and playing Solitaire, they convinced a company to give
them free computer time in exchange for reporting bugs -- ironically, an
early form of Open Source development!

The two then founded a small company called Traf-O-Data that collected and
analyzed traffic counts for municipalities using a crude device based on
the Intel "Pretanium" 8008 CPU. They had some success at first, but ran
into problems when they were unable to deliver their much hyped
next-generation device called "TrafficX". An engineer is quoted as saying
that "Traf-O-Data is the local leader in vaporware", the first documented
usage of the term that has come to be synonymous with Bill Gates.

Soon thereafter, the two developed their own BASIC interpreter, and sold
it to MITS for their new Altair computer. April 4, 1975 is the fateful day
that Micro-soft was founded in Albuquerque, NM as a language vendor.