Explainer

OpenAI Just Built Its Own AI Chip — What Jalapeño Means for the Cost of Every No-Code App You Build

OpenAI announced Jalapeño, its first custom AI chip built with Broadcom, targeting late 2026 deployment. This is the single biggest structural signal that AI inference costs will continue dropping for years. For no-code builders, it means the AI features in Bubble, Webflow, Zapier, and Stacker will get cheaper, faster, and more capable — without subscription prices needing to rise. Here's how custom silicon changes the economics of every no-code app that touches AI.

On June 24, OpenAI did something that changes the economics of every no-code app that touches AI. It announced Jalapeño, its first custom AI chip, built in partnership with Broadcom, with initial deployment targeted by the end of 2026.

This isn't another model release. It's not a benchmark war with Anthropic or a plugin announcement. It's OpenAI making the same move Apple made in 2020 when it ditched Intel for M1: vertical integration into the silicon layer. And the implications for no-code builders are bigger than they look.

TL;DR: OpenAI's Jalapeño chip signals that AI inference costs will keep dropping for years. Every no-code platform embedding AI — Bubble's AI Agent, Webflow's designer, Zapier's AI steps, Stacker's portal generation — rests on the assumption that API costs trend downward. OpenAI building its own silicon makes that bet safer. And it ends Nvidia's monopoly on the hardware that powers the models you build with.

What is Jalapeño, actually?

Let's be precise. Jalapeño is not a general-purpose chip you'd put in a laptop. It's an AI inference accelerator, designed specifically to run large language models at the lowest possible cost per token. Broadcom, the semiconductor giant, is handling manufacturing. OpenAI designed the architecture.

The chip targets deployment by late 2026, which in silicon terms is aggressive but credible. The CNBC report that broke the story cited people familiar with the project who described it as OpenAI's bid to "reduce dependence on Nvidia." That dependence is not trivial. Training a frontier model requires tens of thousands of Nvidia H100 GPUs, each costing roughly $30,000. Inference, the bit where the model actually answers your prompt, requires its own fleet of GPUs running 24/7. Nvidia captures roughly 80% of that market, and its margins reflect it.

OpenAI's calculus is straightforward. If you're spending billions on inference every year, owning the silicon saves billions. Every dollar saved on compute is a dollar you don't need to pass on to developers in API pricing. Or, more cynically, it's a dollar you keep as margin while your competitors pay the Nvidia tax.

Why this matters for no-code builders

Most people reading this will never touch a GPU in their lives. You build with Bubble, Webflow, Stacker, Glide, Softr, or any of the platforms that are bolting AI onto their core experience. Why should a chip announcement matter to you?

Because the AI features you use every day are running on someone else's inference hardware, and the cost of that hardware determines what you pay.

When Bubble's AI Agent generates a responsive layout from a description, it costs Bubble something in API calls. When Webflow's AI code components write JavaScript for your custom elements, there's a model inference cost. When Zapier's AI steps classify emails and route them, each classification burns tokens. When Stacker generates an entire customer portal from your data, the AI behind it is consuming compute.

None of these platforms charge you per token today. Most bundle AI into their subscription pricing. But that bundling only works if the underlying inference costs stay manageable. If Nvidia raises GPU prices, or if demand outpaces supply, the platforms either eat the cost, which limits how much AI they can offer, or they start metering it, which limits how much AI you can use.

Jalapeño is a structural bet that those costs will trend down, not up. OpenAI isn't the only player. Google has its TPUs. Amazon has Trainium. Microsoft is reportedly working on its own silicon. Every major cloud provider and AI lab is racing to build custom inference hardware, and the net effect is that the per-token cost of running an LLM is going to fall, possibly dramatically, over the next three to five years.

For no-code builders, that means the AI features you use today will get cheaper, faster, and more capable without your subscription price needing to rise. The platforms absorb the silicon savings and pass some of them on as better features. That was always the assumption. Jalapeño makes it closer to reality.

The end of the Nvidia bottleneck

This is worth understanding because it shapes the economics of everything that touches AI.

Nvidia's dominance in AI chips is not a secret. The H100 GPU became the de facto standard for AI training and inference almost by accident — it was originally designed for graphics rendering, and its parallel processing architecture happened to be perfectly suited to the matrix operations that transformers require. Nvidia leaned into this advantage with CUDA, a software ecosystem that made it trivially easy for researchers to run their models on Nvidia hardware. Competitors like AMD have comparable hardware. They don't have CUDA's fifteen-year head start on developer tooling.

The result is a near-monopoly. Nvidia GPUs are backordered for months. Cloud providers allocate them through waitlists. Startups describe getting GPU capacity as harder than raising money. And Nvidia's pricing reflects its market power.

Jalapeño breaks that dependency in one specific, important way. It's designed for inference, not training. Inference is where the long-term cost lives. You train a model once, in a massive burst of compute. You run inference on that model millions or billions of times, every time someone uses it. Training is the capital expenditure. Inference is the operating expenditure. Making inference cheaper means the whole economics of AI deployment shift.

The HN thread on the announcement hit 653 points and 369 comments, and the sentiment was consistent: this was overdue, it's the right strategic move, and it signals that the AI industry is maturing from a research sprint into a real business. Custom silicon is what real businesses do.

What platforms should do with cheaper AI

If inference costs drop by 50%, or 80%, or 90% over the next few years — and custom silicon combined with model efficiency improvements makes that plausible — the downstream effects on no-code platforms are significant.

First, AI stops being a premium feature. When token costs are negligible, platforms can offer AI as a baseline, not an upsell. The distinction between "AI" and "not AI" features dissolves. Everything gets AI assistance built in, because the marginal cost is too low to meter.

Second, real-time AI becomes viable. Today, most no-code AI features operate on a request-response model. You prompt, it generates, you review. With cheap inference, persistent AI agents that monitor your app, suggest improvements, catch errors before they reach users, and optimise workflows in the background become practical at scale. The compute bill doesn't make them uneconomic.

Third, AI personalisation becomes the default. Currently, most AI features are generic. The model doesn't know your app, your users, or your business. With cheap inference, on-the-fly fine-tuning and context injection become standard. The AI adapts to your specific database schema, your design system, your business rules. Not as a premium feature. As the default behaviour.

None of this happens overnight. Jalapeño won't be in production until late 2026 at the earliest, and even then it will take time for the full effects to ripple through the ecosystem. But the direction is set. The most expensive part of running AI — inference — is being commoditised, and that commoditisation is the single most important trend for no-code builders to track.

The counterarguments worth acknowledging

I should note what this doesn't solve.

Custom silicon reduces inference cost. It doesn't improve model quality. Cheaper inference means you can afford to run bigger models more often, but the models themselves still need to get better. And model improvement, for now, requires training runs that cost hundreds of millions of dollars. Jalapeño is an inference chip. Training is a different problem.

It also doesn't solve the provider dependency issue we've been covering heavily. Your no-code platform might rely on OpenAI's API, and OpenAI's API might run on Jalapeño, but OpenAI can still change its pricing, deprecate models, or restrict access. Cheaper chips don't give you provider independence. They just reduce the cost of your dependency.

And it's worth remembering that OpenAI's incentives are not necessarily aligned with keeping your costs low. If Jalapeño cuts OpenAI's inference costs by 70%, OpenAI gets to decide how much of that saving it passes on to API customers and how much it keeps as margin. The history of cloud pricing suggests that cost savings do get passed on, eventually, because competition forces it. But "eventually" can be a long time, and "competition" assumes multiple viable chip options exist.

The takeaway

Jalapeño isn't going to change your Bubble project this week. But it's a structural signal that the cost of AI is heading in one direction: down. For no-code builders, that means the AI-powered features you're already using will get cheaper and better, and the AI features that are currently too expensive to ship will become viable.

The platforms that win will be the ones that invest in AI now, on the assumption that inference costs will fall, rather than waiting for the costs to fall before investing. The builders who win will be the ones who understand that AI is becoming a utility — cheap, ubiquitous, and expected — rather than a premium feature you charge extra for.

Jalapeño isn't the destination. It's the confirmation that the destination exists.

Want to read
more articles
like these?

Become a NoCode Member and get access to our community, discounts and - of course - our latest articles delivered straight to your inbox twice a month!

Join 10,000+ NoCoders already reading!