OpenAI GPT-5.4 interface on a laptop, showing Excel plugin integration and BrowseComp boost.

Editorial illustration for OpenAI launches GPT-5.4 with computer-use, Excel plugins, 17% BrowseComp boost

GPT-5.4 Transforms Finance with Excel Plugin Magic

OpenAI launches GPT-5.4 with computer-use, Excel plugins, 17% BrowseComp boost

By AI Daily Post Edited by Brian Petersen, Editor-in-Chief

March 5, 2026 • Updated: July 4, 2026 • 3 min read

Forget answering questions. GPT-5.4 has learned to push buttons.

OpenAI’s latest model can now control a computer. It clicks. It types.

It scrolls through spreadsheets. The benchmarks prove it. On OSWorld, a test of desktop navigation using screenshots and simulated mouse movements, GPT-5.4 scored 75% success.

The reported human performance on the same test is 72.4%. The machine just edged past the human baseline.

On BrowseComp, which measures how well AI agents can persistently browse the web to find hard-to-locate information, OpenAI reports GPT-5.4 improving by 17% absolute over GPT-5.2, and GPT-5.4 Pro reaching 89.3%, described as a new state of the art. On OSWorld-Verified, which measures desktop navigation using screenshots plus keyboard and mouse actions, OpenAI reports GPT-5.4 at 75.0% success, compared to 47.3% for GPT-5.2, and notes reported human performance at 72.4%. On WebArena-Verified, GPT-5.4 reaches 67.3% success using both DOM- and screenshot-driven interaction, compared to 65.4% for GPT-5.2.

On Online-Mind2Web, OpenAI reports 92.8% success using screenshot-based observations alone. OpenAI also links computer use to improvements in vision and document handling. On MMMU-Pro, GPT-5.4 reaches 81.2% success without tool use, compared with 79.5% for GPT-5.2, and OpenAI says it achieves that result using a fraction of the "thinking tokens." On OmniDocBench, GPT-5.4's average error is reported at 0.109, improved from 0.140 for GPT-5.2.

OpenAI launches GPT-5.4 with native computer use mode, financial plugins for Microsoft Excel, Google Sheets - VentureBeat AI

This is not a small improvement. It is a functional leap. That 17% absolute boost on BrowseComp means the model can now persistently hunt for obscure information online, a task that once required a patient human.

New plugins for Excel and Google Sheets mean it can operate inside finance software, manipulating data directly. It got better at visual reasoning on the MMMU-Pro benchmark while using fewer computational resources, a sign of tighter engineering.

The abstraction layer is thinning. The AI is no longer trapped behind a chat window. It can reach into your software and start moving things around.

Common Questions Answered

How does GPT-5.4 improve web browsing capabilities compared to previous versions?

OpenAI reports that GPT-5.4 significantly enhances web browsing performance on the BrowseComp benchmark, improving by 17% absolute over GPT-5.2. The Pro version reaches an impressive 89.3% success rate, which the company describes as a new state of the art in persistent web information retrieval.

What new features does GPT-5.4 introduce for productivity tools?

GPT-5.4 adds a native 'computer-use' mode and introduces direct plugins for Microsoft Excel and Google Sheets, making it a more hands-on assistant for finance teams. The model also demonstrates improved desktop navigation, achieving a 75.0% success rate on OSWorld-Verified tests, compared to 47.3% for the previous version.

What are the different versions of GPT-5.4 being released?

OpenAI is launching two variants of GPT-5.4: a Thinking version for general use and a Pro version designed for the most demanding tasks. These will be distributed through OpenAI's paid API and the Codex development platform, with the Thinking variant also being made available through additional channels.

Ship an AI product this weekend — no engineers required.

Structured, in-depth lessons on the exact no-code tools — not scattered tutorials.

The exact platforms, taught in depth
Build real, working projects
Our honest review + a reader discount

Read the review →

GPT-5.4 Transforms Finance with Excel Plugin Magic

Common Questions Answered

How does GPT-5.4 improve web browsing capabilities compared to previous versions?

What new features does GPT-5.4 introduce for productivity tools?

What are the different versions of GPT-5.4 being released?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

OpenAI's Miles Wang in Talks for USD 2B AI Drug Discovery Startup

Mistral Vibe for Code Leads in Multi-Agent Programming Benchmark

OpenAI's First Hardware Device Is a Movable, Screenless Speaker

PrismML's Bonsai 27B Runs Qwen3.6 on Laptops With 1-bit and Ternary Builds

OpenAI Targets 2027 for First Major Hardware: A ChatGPT Speaker

Publishers sue Google over unauthorized AI book training

Anthropic's Claude for Teachers Vows Not to Train on Student Data

DeepSeek Seeks More Capital Weeks After USD 7B Funding Round

Anthropic's New AI Ad Campaign Draws Criticism for 'Creepy' Tactics

DeepMind CEO proposes independent AI regulator as White House advisor voices skepticism

Related Reading

Nordic pilot adds Gemini for Education, NotebookLM to boost AI literacy

Kling launches Video O1, all-in-one model with MVL bridge using transformer

DeepSeek Seeks More Capital Weeks After USD 7B Funding Round

OpenAI researcher quits, citing distrust over ad‑driven engagement metrics

OpenAI launches GPT-Image 1.5 with precise editing for enterprise visuals

Apple Music introduces optional AI labels to boost transparency

Anthropic CEO Dario Amodei returns to Pentagon talks to salvage deal

OpenAI launches GPT-5.4 and ChatGPT Agent, enabling computer‑task automation

Amodei slams OpenAI in memo, urges automated audit‑ready evidence collection

Common Questions Answered

How does GPT-5.4 improve web browsing capabilities compared to previous versions?

What new features does GPT-5.4 introduce for productivity tools?

What are the different versions of GPT-5.4 being released?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

OpenAI's Miles Wang in Talks for USD 2B AI Drug Discovery Startup

Mistral Vibe for Code Leads in Multi-Agent Programming Benchmark

OpenAI's First Hardware Device Is a Movable, Screenless Speaker

PrismML's Bonsai 27B Runs Qwen3.6 on Laptops With 1-bit and Ternary Builds

OpenAI Targets 2027 for First Major Hardware: A ChatGPT Speaker

Publishers sue Google over unauthorized AI book training

Anthropic's Claude for Teachers Vows Not to Train on Student Data

DeepSeek Seeks More Capital Weeks After USD 7B Funding Round

Anthropic's New AI Ad Campaign Draws Criticism for 'Creepy' Tactics

DeepMind CEO proposes independent AI regulator as White House advisor voices skepticism