Ad-supported AI is not a fantasy. One search ad every 39 minutes, or one display ad every 3 minutes, covers the full compute cost of running a trillion-parameter model on 4 NVIDIA B200 Blackwell GPUs at $4.50 per hour spot pricing, serving 300 concurrent users at 50% peak capacity. Google Search CPM is $38.40. Google Display CPM is $3.12. The break-even math is straightforward and it works.

The frequency ceiling is higher than assumed. Hyper-casual mobile games already serve 6 ads per session, roughly one per minute. At the effective CPM floor of $1.50, after fill rates and network revenue share, the required frequency rises to one display ad every 90 seconds. That is still inside what mobile users accept. Rewarded video, clearing at $40 to $50 CPM with near-100% fill in gaming, pushes the model further: a single rewarded impression across 300 users covers 83% of one hour of cluster cost. The article walks through each scenario with exact numbers, and the footnotes cite the underlying benchmarks.

The ceiling is agentic workloads. Coding agents burn 10 to 20 times more tokens than passive chat, with Claude Code consuming roughly 33,000 tokens per task and heavy users running 20 to 60 tasks per day. Ads alone cannot cover that. But a hybrid model does: $10 per month plus 8 rewarded ads per day funds 2 million tokens daily per user. Anthropic just pulled Claude Code from its $20 plan. The article makes the case that the answer is not higher paywalls, but a different funding structure entirely.

[READ ORIGINAL →]