{"id":47049,"date":"2026-03-07T10:32:26","date_gmt":"2026-03-07T10:32:26","guid":{"rendered":"https:\/\/agooka.com\/news\/technologies\/openai-unveils-gpt-5-4-pro-and-thinking-models\/"},"modified":"2026-03-07T10:32:26","modified_gmt":"2026-03-07T10:32:26","slug":"openai-unveils-gpt-5-4-pro-and-thinking-models","status":"publish","type":"post","link":"https:\/\/agooka.com\/news\/technologies\/openai-unveils-gpt-5-4-pro-and-thinking-models\/","title":{"rendered":"OpenAI unveils GPT-5.4 Pro and Thinking models"},"content":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/dataconomy.com\/wp-content\/uploads\/2026\/03\/1121722.jpg\" alt=\"OpenAI unveils GPT-5.4 Pro and Thinking models\" title=\"OpenAI unveils GPT-5.4 Pro and Thinking models\"\/><\/p>\n<p>OpenAI released GPT-5.4 on Thursday, introducing a new foundation model available in standard, Thinking, and Pro versions.<\/p>\n<p>The launch introduces a model with a 1 million token context window and improved token efficiency, targeting professional workloads. The release includes new benchmark records and a system to manage tool calling within the API.<\/p>\n<p>GPT-5.4 is available in three versions: standard, a reasoning model (GPT-5.4 Thinking), and an optimized high-performance version (GPT-5.4 Pro). The API version supports context windows as large as 1 million tokens, the largest available from OpenAI. OpenAI stated GPT-5.4 solves the same problems with significantly fewer tokens than its predecessor.<\/p>\n<p>The model achieved record scores in computer-use benchmarks OSWorld-Verified and WebArena Verified. It scored a record 83% on OpenAI\u2019s GDPval test for knowledge work tasks. GPT-5.4 also took the lead on Mercor\u2019s APEX-Agents benchmark, which tests professional skills in law and finance.<\/p>\n<p>Mercor CEO Brendan Foody stated that GPT-5.4 excels at creating long-horizon deliverables such as slide decks, financial models, and legal analysis. 
Foody said the model delivers top performance while running faster and at lower cost than competing frontier models.<\/p>\n<p>OpenAI reported that GPT-5.4 is 33% less likely to make errors in individual claims than GPT-5.2, and that its overall responses are 18% less likely to contain errors.<\/p>\n<p>OpenAI introduced Tool Search, a new system for managing tool calling in the API that allows models to look up tool definitions as needed. Tool Search reduces token use, improves speed, and lowers cost in systems with many tools.<\/p>\n<p>OpenAI added a new safety evaluation to test chain-of-thought (CoT) monitoring, addressing concerns that reasoning models could misrepresent their reasoning process. The new evaluation shows deception is less likely in the GPT-5.4 Thinking version. OpenAI stated this suggests the model lacks the ability to hide its reasoning and that CoT monitoring remains an effective safety tool.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI released GPT-5.4 on Thursday, introducing a new foundation model available in standard, Thinking, and Pro versions. The launch introduces a model with a 1 million token context window and improved token efficiency, targeting professional workloads. The release includes new benchmark records and a system to manage tool calling within the API. 
GPT-5.4 is available [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":47050,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[],"class_list":["post-47049","post","type-post","status-publish","format-standard","has-post-thumbnail","category-technologies"],"_links":{"self":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/47049","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/comments?post=47049"}],"version-history":[{"count":0,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/posts\/47049\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media\/47050"}],"wp:attachment":[{"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/media?parent=47049"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/categories?post=47049"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/agooka.com\/news\/wp-json\/wp\/v2\/tags?post=47049"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}