Hardware

OpenAI unveils first custom AI inference chip built by Broadcom, codenamed Jalapeño

Tags AI · Infrastructure

TechCrunch·June 24, 2026

OpenAI unveils first custom AI inference chip built by Broadcom, codenamed Jalapeño

OpenAI announced its first custom-designed processor, codenamed Jalapeño, built in partnership with Broadcom and optimized specifically for AI inference workloads. The chip represents OpenAI's push to reduce dependence on general-purpose GPU clusters and tailor silicon to its model serving infrastructure. Broadcom CEO Hock Tan joined OpenAI CEO Sam Altman at the unveiling. The announcement signals that leading AI labs are following Google's TPU path toward custom inference hardware to improve cost-efficiency and latency at scale.

Technical significance

Custom inference silicon designed specifically for transformer serving patterns could meaningfully reduce per-token costs compared to general-purpose GPUs. This accelerates the trend of AI labs vertically integrating hardware, potentially reshaping the inference market dominated by NVIDIA and creating a new axis of competition around inference efficiency rather than just training scale.

Sources

TechCrunch

← Today's Digest