OpenAI and Broadcom unveil Jalapeño, a data-center chip for scalable LLM inference

TL;DR Summary
OpenAI and Broadcom introduced Jalapeño, a purpose-built ASIC designed from scratch for large-language-model inference in data centers, with early testing claiming substantially better performance per watt; development took nine months and is part of a broader effort to own more of the AI stack and reduce reliance on Nvidia, with deployments planned by year-end as the silicon race heats up.
- OpenAI and Broadcom announce chip designed for LLM inference at scale Ars Technica
- OpenAI and Broadcom unveil LLM-optimized inference chip OpenAI
- OpenAI unveils first chip as part of Broadcom deal in effort to 'build the full stack' CNBC
- OpenAI just announced its first custom chip to help ChatGPT run better CNN
- OpenAI tests homegrown AI chips Axios
Reading Insights
Total Reads
1
Unique Readers
4
Time Saved
3 min
vs 4 min read
Condensed
92%
714 → 60 words
Want the full story? Read the original article
Read on Ars Technica