Meta scraps advanced AI training chip after design roadblocks

0
19

Meta scraps advanced AI training chip after design roadblocks

Meta scrapped its most advanced in-house AI training chip due to design challenges and is shifting to a simpler version, according to The Information.

The setback highlights the difficulty of building custom silicon to rival Nvidia and raises questions about Meta’s ability to reduce its dependence on external suppliers. The company’s internal chip design efforts “hit roadblocks,” forcing the abandonment of its most ambitious training chip.

The Meta Training and Inference Accelerator program has a history of setbacks. Meta scrapped an earlier inference chip after it underperformed in small-scale testing and pivoted in 2022 to billions of dollars’ worth of Nvidia GPUs. The company eventually deployed an MTIA chip for inference tasks on Facebook and Instagram, but the training chip has proven more elusive.

Meta began testing its first in-house AI training chip manufactured by TSMC after completing a tape-out. Analyst Jeff Pu noted in January that Meta appeared to be scaling back its in-house ASIC program, turning to AMD instead of its own chips or Google’s TPUs.

Meta announced a multiyear agreement with AMD on February 24 worth more than $100 billion for up to six gigawatts of MI450 GPUs. Shipments are scheduled to begin in the second half of this year. AMD issued Meta a performance-based warrant for up to 160 million shares of its common stock under the deal.

A week earlier, Meta expanded its partnership with Nvidia for millions of next-generation Vera Rubin GPUs and Grace CPUs. That deal is likely worth tens of billions of dollars. Meta also signed a deal to rent Google TPUs for developing new AI models.

Meta has committed up to $135 billion in capital expenditures for 2026 to build out AI infrastructure. The company plans to expand across more than 30 data centers. Meta co-developed its MTIA chips with Broadcom, which also partners with Google on its TPUs.

Meta Chief Product Officer Chris Cox described the company’s chip development journey last year as a “walk, crawl, run situation.” The company scrapped an earlier inference chip after it underperformed in small-scale testing. The Financial Times previously reported that Meta experienced technical challenges with its next-generation training chips.

Featured image credit