Technologies

OpenAI unveils GPT-5.4 Pro and Thinking models

07.03.2026

OpenAI released GPT-5.4 on Thursday, introducing a new foundation model available in standard, Thinking, and Pro versions.

The launch introduces a model with a 1 million token context window and improved token efficiency, targeting professional workloads. The release includes new benchmark records and a system to manage tool calling within the API.

GPT-5.4 is available in three versions: standard, a reasoning model (GPT-5.4 Thinking), and an optimized high-performance version (GPT-5.4 Pro). The API version supports context windows as large as 1 million tokens, the largest available from OpenAI. OpenAI stated GPT-5.4 solves the same problems with significantly fewer tokens than its predecessor.

The model achieved record scores in computer-use benchmarks OSWorld-Verified and WebArena Verified. It scored a record 83% on OpenAI’s GDPval test for knowledge work tasks. GPT-5.4 also took the lead on Mercor’s APEX-Agents benchmark, which tests professional skills in law and finance.

Mercor CEO Brendan Foody stated that GPT-5.4 excels at creating long-horizon deliverables such as slide decks, financial models, and legal analysis. Foody said the model delivers top performance while running faster and at lower cost than competitive frontier models.

OpenAI reported GPT-5.4 is 33% less likely to make errors in individual claims compared to GPT 5.2. Overall responses are 18% less likely to contain errors. OpenAI introduced Tool Search, a new system for managing tool calling in the API that allows models to look up tool definitions as needed.

Tool Search reduces token use and improves speed and cost in systems with many tools. OpenAI added a new safety evaluation to test chain-of-thought monitoring, addressing concerns that reasoning models could misrepresent their reasoning process.

The new evaluation shows deception is less likely in the GPT-5.4 Thinking version. OpenAI stated this suggests the model lacks the ability to hide its reasoning and that CoT monitoring remains an effective safety tool.

Featured image credit

Occupiers are deporting orphans from the TOT to Russian military camps…

Russia launched over 30 strikes on Dnipropetrovsk region – a woman…

Enemy launched over 5.5 thousand drones and carried out 177 attacks…

Russia attacked a DTEK energy facility, dozens of settlements without power

Russia has launched over 80 drones at Ukraine since morning, with…

SpaceX warns investors that Grok’s NSFW AI is risky business

The biggest revelations from SpaceX’s S-1 filing

Meta says the quiet part out loud about layoffs as the…

Sam Altman has a proposition for startup founders: AI tokens for…

Barnes & Noble CEO is fine with stocking AI-written books —…

SpaceX Listed Grok’s ‘Spicy’ Mode as a Risk in Its IPO…

SpaceX IPO Filing Reveals Anthropic Is Paying $15 Billion a Year…

I Gave My OpenClaw Agent a Physical Body

Demis Hassabis Thinks AI Job Cuts Are Dumb

Meta Employees Are Scrambling to Use Up Benefits Ahead of Layoffs

The Kremlin has called Armenia's course toward EU accession unacceptable

Trump announced the final stage of negotiations with Iran and warned…

Sweden supports opening all negotiation clusters with the EU for Ukraine…

Xi Jinping and Putin discussed the war in Ukraine during a…

EU agrees to extend sanctions against Russia for human rights violations

What Are Dropshipping Spy Tools and How Do They Work in…

What Makes a Browser Secure? Key Security Features Explained

Why Insurance Agencies Need Structured Data Before AI Can Help

The Step-by-Step Guide to Automating Your Accounts Payable Cycle

A Connected Experience on Telegram

The EU Battery Passport Explained: Requirements, Timeline and Compliance Steps f…

The Rise of Software-Defined Hydrogen Vehicles

US Department of Energy Initiatives Supporting Software Defined Vehicle with Gre…

How Software-Defined Vehicles are Redefining Architecture and Transforming Field…

Bridging the Lifecycle Gap: Managing Semiconductor Obsolescence in Automotive EC…

OpenAI unveils GPT-5.4 Pro and Thinking models

SpaceX warns investors that Grok’s NSFW AI is risky business

SpaceX Listed Grok’s ‘Spicy’ Mode as a Risk in Its IPO Filing

What Are Dropshipping Spy Tools and How Do They Work in 2026?

Recent Posts

SpaceX warns investors that Grok’s NSFW AI is risky business

SpaceX Listed Grok’s ‘Spicy’ Mode as a Risk in Its IPO Filing

What Are Dropshipping Spy Tools and How Do They Work in 2026?

Occupiers are deporting orphans from the TOT to Russian military camps – NRC

The Kremlin has called Armenia's course toward EU accession unacceptable