OpenaiLiveAppeal 9.045 sec read

OpenAI Unveils Custom Inference Chip

24 June 2026By Pulse24 desk
← Back
Share →

What happened

OpenAI, in collaboration with Broadcom, unveiled Jalapeño, its first custom AI inference processor. Designed with assistance from OpenAI's own AI models, the chip targets significantly better performance-per-watt and lower operating costs for inference workloads compared to current AI GPUs. OpenAI stated the chip aims to make its models faster, more reliable, and more affordable.

Why it matters

This custom chip reduces OpenAI's reliance on general-purpose GPUs, directly impacting operational expenditure for running its AI models. Procurement teams and platform engineers will see a shift in unit economics, as specialised hardware offers lower inference costs and improved efficiency for real-time AI applications. This follows similar moves by Google and Amazon with their custom TPUs and Trainium chips, signalling a broader industry trend towards vertically integrated AI infrastructure to control costs and optimise performance. Teams should prepare for increased hardware specialisation across the AI stack.

Source · techcrunch.comAI-processed content may differ from the original.
Published 24 June 2026