OpenAI and Broadcom unveil LLM-optimized inference chip
Summary
OpenAI and Broadcom have unveiled Jalapeño, a specialized AI processor designed specifically for Large Language Model (LLM) inference. Developed in just nine months, the chip features a unique architecture that balances compute, memory, and networking to improve performance-per-watt efficiency over existing hardware. As the first entry in a multi-generation partnership, Jalapeño is set for deployment in gigawatt-scale data centers starting in 2026, aiming to make advanced AI more affordable, reliable, and accessible.
(Source:OpenAI)