Thinking Machines develops AI that processes input and replies simultaneously

By AI Daily Post. Edited by Brian Petersen, Editor-in-Chief

May 12, 2026 • 2 min read

Thinking Machines Lab, the startup Mira Murati launched after leaving OpenAI, unveiled a new class of “interaction models” on Monday. The core idea is simple yet unfamiliar: an AI that can listen and talk at the same time, mimicking the flow of a phone call rather than the stop‑start of a text chat. The company calls this capability “full duplex.” Its first prototype, TML‑Interaction‑Small, reportedly generates a reply in 0.40 seconds, about the pace of a natural human exchange. According to the firm’s own benchmarks, that is noticeably quicker than comparable offerings from OpenAI and Google.

Still, the technology is in a research preview stage, not a consumer product. A limited preview is slated for the next few months, with a broader rollout expected later in the year. While the numbers look promising, the real‑world experience remains untested.

The move raises questions about whether native interactivity will translate into a smoother conversational feel once the model reaches a wider audience.

“Thinking Machines Lab, the AI startup founded last year by former OpenAI CTO Mira Murati, on Monday announced something called interaction models, which, at its essence, sounds like AI that can interrupt you.”

Source: “Thinking Machines wants to build an AI that actually listens while it talks,” TechCrunch AI

Why this matters

We see Thinking Machines Lab attempting a shift from the classic turn‑taking dialogue model to a “full duplex” interaction where the AI listens and talks simultaneously. If TML‑Interaction‑Small truly delivers responses in 0.40 seconds while still processing incoming speech, developers could prototype more fluid conversational agents without the latency of back‑and‑forth exchanges. Founders may wonder whether this architecture reduces the need for complex state‑management logic. The announcement, however, offers no data on accuracy or resource consumption, which leaves the question of scalability open.
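
For developers, the difference between turn-taking and full duplex can be illustrated with a toy sketch. The Python below is a hypothetical simulation, not Thinking Machines' method or API: the names `speak`, `listen`, and `full_duplex`, the "wait" cue, and the sample sentences are all assumptions made for illustration. It runs a speaking task and a listening task concurrently with `asyncio`, and the speaker stops once the listener flags an interruption.

```python
import asyncio


async def speak(reply, interrupt, spoken):
    """Emit the reply one word at a time; stop once an interruption is flagged."""
    for word in reply:
        if interrupt.is_set():
            break
        spoken.append(word)
        await asyncio.sleep(0)  # yield control so the listener can run


async def listen(incoming, interrupt, heard):
    """Consume incoming chunks while the speaker is still talking."""
    for chunk in incoming:
        heard.append(chunk)
        if chunk == "wait":  # hypothetical cue that the user is cutting in
            interrupt.set()
        await asyncio.sleep(0)  # yield control so the speaker can run


async def full_duplex(reply, incoming):
    """Run speaking and listening concurrently instead of in turns."""
    interrupt = asyncio.Event()
    spoken, heard = [], []
    await asyncio.gather(
        speak(reply, interrupt, spoken),
        listen(incoming, interrupt, heard),
    )
    return spoken, heard


# Illustrative data only (assumed, not from the announcement).
reply = "the forecast for tomorrow is sunny with light wind".split()
incoming = ["uh", "huh", "wait", "which", "city"]

spoken, heard = asyncio.run(full_duplex(reply, incoming))
print("spoken:", spoken)
print("heard:", heard)
```

In a turn-taking design, `listen` would start only after `speak` had finished. Here the two tasks share an event, so incoming input can cut the reply short while the rest of the input is still being heard.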

Researchers will have a new question to test: can an AI stay coherent when it has to cut off or adjust its own output mid‑reply? The claim sounds promising, but without independent evaluation we cannot confirm whether the model handles overlapping inputs without degradation. Moreover, the brief description does not address how the system deals with ambiguous or conflicting cues when both streams operate together.

As we explore these interaction models, we should remain cautious, tracking real‑world performance before assuming they will redefine conversational AI design.
