OpenAI has unveiled Jalapeño, its first custom AI inference chip developed in partnership with Broadcom.
The chip was specifically designed to run large language models more efficiently and was optimized around OpenAI’s model architectures and real-world product workloads.
The company plans to deploy the first Jalapeño-powered data centers before the end of the year.
Custom Silicon for AI
Jalapeño represents OpenAI’s latest effort to reduce dependence on third-party AI hardware while improving performance and efficiency across its infrastructure.
According to OpenAI, the chip delivers significantly better performance per watt than existing solutions, although detailed benchmark results have not yet been released.
The design focuses on inference workloads, which involve generating responses from trained AI models rather than training them.
As AI adoption accelerates, inference has become one of the fastest-growing segments of data center demand.
AI Infrastructure Race Intensifies
The launch places OpenAI among a growing list of technology companies developing custom AI silicon to optimize costs and performance.
Major technology firms including Google, Amazon, Microsoft, and Meta have invested heavily in proprietary AI chips as demand for computing power continues to surge.
Analysts view custom hardware as a key competitive advantage in the next phase of AI development.
The broader takeaway is that OpenAI is expanding beyond software and models into custom semiconductor design, seeking greater control over the infrastructure powering its AI systems.