Graph showing OpenAI API token usage surge from 6B to 15B per minute, highlighting compute strain.

Editorial illustration for OpenAI API token usage rises from 6 bn to 15 bn per minute, straining compute

OpenAI API Token Usage Surges to 15 Billion per Minute

OpenAI API token usage rises from 6 bn to 15 bn per minute, straining compute

April 13, 2026 • 2 min read

The surge in demand for OpenAI’s services is hitting the back‑end hard. Over the past half‑year, the volume of tokens processed by the company’s API has more than doubled, pushing the infrastructure to its limits. Engineers report frequent outages, while the firm has begun rationing access to keep key customers online.

At the same time, the market for graphics processing units—essential for training and inference—has tightened, driving prices upward and squeezing margins. Inside the organization, senior leaders are forced to prioritize short‑term capacity, often juggling trade‑offs that would have been unthinkable a few months ago. It’s a pressure cooker scenario: relentless growth meets a finite pool of compute resources, and every additional request adds strain.

This context frames what OpenAI’s chief financial officer, Sarah Friar, told the Wall Street Journal about her day‑to‑day focus and the tough choices the company now faces.

Token usage across OpenAI's API jumped from 6 billion per minute in October to 15 billion per minute by the end of March, according to the WSJ. OpenAI CFO Sarah Friar told the WSJ that she spends much of her time hunting for near-term compute capacity and that the company is making difficult decisions about which projects to shelve because resources simply aren't available. Providers have been rolling out new limits since January to manage the agent boom The capacity crisis is also reshaping plans for developer tools, which increasingly run agentic workloads that consume far more tokens.

The AI industry is running out of compute, with outages, rationing, and rising GPU prices - THE DECODER

Is the AI sector hitting a hard ceiling? Token consumption on OpenAI’s API surged from six billion per minute in October to fifteen billion by March, a spike that the Wall Street Journal says is straining the available compute pool. Outages are now common.

And enterprises are feeling the pinch as GPU prices climb and providers resort to rationing. While Anthropic reports an API availability of 98.95 percent, well below the 99.99 percent benchmark, it is already losing enterprise customers to OpenAI. OpenAI is cutting Sora.

The company shut down its video‑generation app Sora to reallocate GPU cycles toward coding tools and its enterprise professional tier, a move CFO Sarah Friar said is necessary while she hunts for near‑term compute capacity. Can the sector secure enough silicon to keep pace? Unclear whether demand will subside.

Until providers can expand capacity or find sustainable pricing for GPUs, enterprises may continue to face throttled services, and the current compute crunch could shape short‑term product strategies across the industry.

Common Questions Answered

How much has OpenAI's API token usage increased between October and March?

OpenAI's API token usage surged from 6 billion tokens per minute in October to 15 billion tokens per minute by the end of March. This dramatic increase represents more than a 150% growth in just five months, putting significant strain on the company's computational infrastructure.

What challenges is OpenAI facing due to the massive increase in token usage?

OpenAI is experiencing frequent infrastructure outages and is being forced to ration access to its services to keep key customers online. The company's CFO, Sarah Friar, is spending considerable time searching for near-term compute capacity and making difficult decisions about which projects to postpone due to resource constraints.

How is the current GPU market affecting OpenAI's operations?

The graphics processing unit (GPU) market has tightened significantly, driving prices upward and squeezing profit margins for AI companies. This scarcity of computational resources is forcing providers like OpenAI to implement new limits and carefully manage their available compute capacity.

🎓

Featured Review

No Code MBA

Build AI apps without coding. Our in-depth course review.

Read Review

OpenAI API Token Usage Surges to 15 Billion per Minute

Further Reading

Common Questions Answered

How much has OpenAI's API token usage increased between October and March?

What challenges is OpenAI facing due to the massive increase in token usage?

How is the current GPU market affecting OpenAI's operations?

Most Popular

Intuit turns months of tax code work into hours with proprietary DSL

Two new AI sandbox architectures limit credential exposure after prompt injection

Google Vids adds Veo, Lyria AI models and directable avatars for flyers, reels

Alibaba’s Tongyi Lab launches VimRAG, a memory‑graph multimodal RAG framework

Guide to Building Document Intelligence Pipelines with LangExtract and OpenAI

Meta's structured prompting lifts LLM code review accuracy to 93%

Nvidia unveils Agentforce AI platform with Adobe, Salesforce, SAP at GTC 2026

Sam Altman proposes new AI 'social contract' in You.com guide

Anthropic ends free OpenClaw access to Claude, adds extra fee April 4

Batch Mode VC-6 and NVIDIA Nsight Speed Up Vision AI Pipelines

Further Reading

Related Reading

OpenAI, a Series F San Francisco startup founded in 2015 by eight pioneers

Terminal-Bench 2.0 launches with Harbor, testing any container-installable agent

Zuckerberg Unveils Meta Compute to Build Global AI Infrastructure

Gen AI app sessions up fivefold, downloads jump 778% as ChatGPT leads traffic

GPT-5 helps mathematicians offload tedious tasks, says Timothy Gowers

We refined facial expressions, clothing, and lighting for AI article image

Molotov cocktail thrown at OpenAI CEO Sam Altman's home in the middle of the night

Researchers say OpenAI's Sora and Google's Veo aren't true world models

Cursor, Windsurf get funding for tools; OpenAI, Google, Anthropic add products

Common Questions Answered

How much has OpenAI's API token usage increased between October and March?

What challenges is OpenAI facing due to the massive increase in token usage?

How is the current GPU market affecting OpenAI's operations?

Most Popular

Intuit turns months of tax code work into hours with proprietary DSL

Two new AI sandbox architectures limit credential exposure after prompt injection

Google Vids adds Veo, Lyria AI models and directable avatars for flyers, reels

Alibaba’s Tongyi Lab launches VimRAG, a memory‑graph multimodal RAG framework

Guide to Building Document Intelligence Pipelines with LangExtract and OpenAI

Meta's structured prompting lifts LLM code review accuracy to 93%

Nvidia unveils Agentforce AI platform with Adobe, Salesforce, SAP at GTC 2026

Sam Altman proposes new AI 'social contract' in You.com guide

Anthropic ends free OpenClaw access to Claude, adds extra fee April 4

Batch Mode VC-6 and NVIDIA Nsight Speed Up Vision AI Pipelines