GPT-5.4: The Game-Changing New Capabilities of OpenAI’s Latest Model – AI News – #2 March 2026

3min.

Comments:0

09 March 2026

GPT-5.4: The Game-Changing New Capabilities of OpenAI’s Latest Model – AI News – #2 March 2026d-tags
OpenAI has officially released the GPT-5.4 model, and it is far more than just a cosmetic update. It represents a massive leap toward unified AI models that don't just generate text, but comprehensively plan, code, and execute actual professional work on your behalf. With scores hitting 83.3% on the advanced ARC-AGI-2 benchmark and a 75.0% success rate in desktop UI navigation, GPT-5.4 is no longer just a virtual assistant—it has become a fully-fledged digital coworker.

3min.

Comments:0

09 March 2026

Why GPT-5.4 Is Much More Than a Simple Update

Just a few weeks ago, if you asked professionals about the best model for coding and analytical work, they often pointed to competing solutions like Claude. With the launch of GPT-5.4, OpenAI has effectively closed that gap.

What makes this update absolutely crucial is the convergence of advanced reasoning and coding into a single model. Instead of creating separate, highly specialized tools for different tasks, we are witnessing a shift toward unified models. GPT-5.4 can think through a problem from various angles: it will write the code itself, open a browser, gather data, and finally prepare a professional presentation from it. This direction of AI development is progressing much faster than most experts anticipated.

GDPval: Proof That AI Delivers Ready-to-Use Work

The most important indicator of the new model’s value is its performance on the GDPval benchmark. Why does this matter? Because unlike abstract benchmarks and flashy tech demos, GDPval measures the AI’s ability to generate actual, ready-to-deploy work deliverables across 44 different occupations.

[Placeholder: Image of an automated investment banking spreadsheet model]

The GPT-5.4 model set a new record here. In head-to-head matchups with human professionals, it matched or beat their results a staggering 83.0% of the time (for comparison, GPT-5.2 scored 70.9%). Its achievements in working with data look particularly impressive:

  • The new ChatGPT add-in for Excel solves complex investment banking modeling tasks with an 87.3% success rate (up from 68.4% in the previous version).
  • High visual quality of presentations: Expert evaluations show a drastic improvement in the aesthetics, visual variety, and usability of slides generated directly by the model.

A Breakthrough in Computer Use and Interface Navigation

One of the most fascinating new features is equipping GPT-5.4 natively with computer use capabilities. The model perfectly handles mouse and keyboard operations based on screenshot analysis.

In the rigorous OSWorld-Verified test, which checks the ability to navigate a desktop interface, GPT-5.4 achieved a success rate of 75.0%. Interestingly, the average human score in the same test is 72.4%. This means that in standardized tasks involving clicking through applications, the model performs more efficiently than an average user.

Greater Control and Fewer Hallucinations in Daily Work

OpenAI has also introduced a number of changes regarding the user experience:

  • The ability to interrupt: GPT-5.4 Thinking operates transparently. It presents an outline of its reasoning upfront, and you can interrupt it at any time to add new details while the model is still “thinking.” This allows you to course-correct its workflow without having to start over or use multiple prompts.
  • Fewer errors on the web: GPT-5.3 Instant now becomes the default model in standard ChatGPT, which—as OpenAI declares—translates to a 26.8% reduction in hallucinations when using web search.

Does OpenAI’s New Model Have Any Flaws?

Every revolution comes with a cost. To make GPT-5.4 more flexible and conversational, OpenAI engineers had to “loosen” its rigid boundaries slightly. As a result, the model gained a personality that some developers describe as “chaotic neutral.”

It sometimes overinterprets a task—it might implement a feature that no one asked for (e.g., adding GDPR consent checkboxes to a simple demo site), or “leak” part of the prompt directly into the UI elements. It’s akin to working with a brilliant, but occasionally rogue coworker.

Model Availability on the Market

GPT-5.4 is gradually rolling out across the OpenAI ecosystem. Currently, access to the “Thinking” variant and the “Pro” version (optimized for the most complex and compute-heavy tasks) is available to users on the ChatGPT Plus, Team, and Pro plans.

Regardless of minor hiccups, the convergence of analytics, programming, and understanding the visual world means that GPT-5.4 sets an entirely new standard in the AI industry.

Want to stay up to date with the latest changes in the AI and SEO world? The world of technology and search engines changes day by day. Subscribe to the Delante newsletter to regularly receive expert analyses, proven strategies, and the most important news that will help you stay ahead of the competition. Join the ranks of professionals today!

Source: https://openai.com/pl-PL/index/introducing-gpt-5-4/

Author
Maciej Jakubiec - Junior SEO Specialist
Author
Maciej Jakubiec

SEO Specialist

A marketing graduate specializing in e-commerce from the University of Economics in Kraków – part of Delante’s SEO team since 2022. A firm believer in the importance of well-crafted content, and apart from being an SEO, a passionate music producer crafting sounds since his early teens.

Author
Maciej Jakubiec - Junior SEO Specialist
Author
Maciej Jakubiec

SEO Specialist

A marketing graduate specializing in e-commerce from the University of Economics in Kraków – part of Delante’s SEO team since 2022. A firm believer in the importance of well-crafted content, and apart from being an SEO, a passionate music producer crafting sounds since his early teens.