AI Models Learn to Admit When They Don't Know
Reinforcement learning trains AI like OpenAI's o1 to admit uncertainty
Why does it matter when a model can openly admit it doesn’t know? While the hype around ever‑larger language models persists, a quieter shift is happening in how those systems are taught to think. Researchers are moving beyond raw prediction toward a framework where an algorithm is rewarded for arriving at a correct conclusion and penalized when it strays.
OpenAI’s latest system, dubbed o1, exemplifies this trend: it isn’t just spitting out answers, it’s being nudged to follow a chain of reasoning that can be checked and, if necessary, halted. The goal isn’t flashier output; it’s a disciplined process that forces the model to weigh evidence, backtrack, and ultimately say “I’m not sure” when the data don’t line up. That discipline, built into the training loop, promises more reliable answers—especially in high‑stakes settings where a confident mistake can be costly.
The following excerpt spells out the mechanics behind that approach.
The reinforcement learning (RL) methods behind recent breakthroughs in AI reasoning, including the training approach used in systems like OpenAI's o1, reward models for getting the right answer, and penalize them for getting it wrong. A model that arrives at the correct answer through careful reasoning receives the same reward as one that guesses correctly by chance. Over time, this trains models to confidently answer every question they are asked, whether they have strong evidence or are effectively flipping a coin.
When models are deployed in medicine, law, finance, or any setting where users make decisions based on AI outputs, a system that expresses high confidence regardless of its actual certainty becomes unreliable in ways that are difficult to detect from the outside. A model that says "I'm 95 percent sure" when it is right only half the time is more dangerous than one that simply gets the answer wrong, because users have no signal to seek a second opinion. "The standard training approach is simple and powerful, but it gives the model no incentive to express uncertainty or say I don't know," says Mehul Damani, an MIT PhD student and co-lead author on the paper.
"So the model naturally learns to guess when it is unsure." RLCR addresses this by adding a single term to the reward function: a Brier score, a well-established measure that penalizes the gap between a model's stated confidence and its actual accuracy. During training, models learn to reason about both the problem and their own uncertainty, producing an answer and a confidence estimate together. The math backs it up: the team proved formally that this type of reward structure guarantees models that are both accurate and well-calibrated.
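The reward structure described above can be sketched in a few lines. This is a minimal illustration, not the paper's exact formulation: the function names (`brier_penalty`, `rlcr_reward`) and the equal weighting of the correctness term and the calibration term are assumptions made for clarity.

```python
def brier_penalty(confidence: float, correct: bool) -> float:
    """Brier score: squared gap between stated confidence and the actual outcome."""
    outcome = 1.0 if correct else 0.0
    return (confidence - outcome) ** 2

def rlcr_reward(correct: bool, confidence: float) -> float:
    """Correctness reward minus a calibration penalty (illustrative weighting)."""
    base = 1.0 if correct else 0.0
    return base - brier_penalty(confidence, correct)

# A confident wrong answer is punished harder than a hedged one,
# so guessing with high stated certainty no longer pays off.
print(rlcr_reward(correct=False, confidence=0.95))  # strongly negative
print(rlcr_reward(correct=False, confidence=0.10))  # mildly negative
print(rlcr_reward(correct=True,  confidence=0.95))  # close to the full reward
```

Under a pure correctness reward, the confident guess and the hedged guess above would score identically; the Brier term is what separates them, which is why the authors can show the optimal policy is both accurate and calibrated.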
Confidence is persuasive. Yet today's top reasoning models speak with unshakable certainty, even when they are guessing. MIT's CSAIL team traced that overconfidence to a flaw in the reinforcement‑learning pipeline that rewards correct answers and penalizes wrong ones.
By tweaking the reward structure, they taught a model to flag uncertainty without sacrificing accuracy. The adjustment works on systems similar to OpenAI's o1, which already rely on RL‑based reasoning. However, the paper doesn't show how the technique performs on larger, more diverse datasets, and whether it will survive the pressure of real‑world deployments remains unclear.
The researchers report that the method preserves the models' problem-solving abilities while adding a calibrated confidence signal. If the technique is adopted, developers could present users with answers that carry an explicit "I'm not sure" tag, potentially reducing the persuasive power of misplaced certainty. Still, the broader AI community has yet to evaluate the trade-offs between added caution and user trust.
The work marks a step toward more honest AI, though its impact will depend on future testing.
Further Reading
- OpenAI o1: A New Paradigm For AI - The Algorithmic Bridge
- o1: A Technical Primer - LessWrong
- OpenAI o1 System Card - OpenAI
- A Systematic Assessment of OpenAI o1-Preview for Higher ... - arXiv
Common Questions Answered
How does reinforcement learning change the way AI models like o1 approach uncertainty?
Reinforcement learning introduces a reward system that encourages AI models to be more transparent about their knowledge gaps. Instead of always providing an answer, models are trained to recognize and admit when they lack sufficient evidence to confidently respond.
What problem does the current reinforcement learning approach create in AI reasoning?
The current reinforcement learning method tends to train AI models to answer every question with unwarranted confidence, even when they are essentially guessing. This approach rewards correct answers regardless of whether they are reached through careful reasoning or pure chance.
Why is admitting uncertainty important for AI reasoning systems like o1?
Admitting uncertainty helps prevent AI models from spreading misinformation or providing potentially incorrect answers. By developing a mechanism to flag uncertainty, AI systems can become more reliable and trustworthy sources of information.