Why Cerebras’ Mind-Boggling LLM Raw Speed Is Still Falling Into Nvidia’s Massive Software Trap

1 day ago 2

Alex Sirois

Fri, June 26, 2026 astatine 9:21 AM CDT 3 min read

Quick Read

  • CBRS secured a $20B+ OpenAI woody yet guided to antagonistic operating margins, portion NVDA's 75% gross margins uncover the level moat's fiscal power.

  • Cerebras' wafer-scale spot delivers a 21x velocity vantage implicit Nvidia hardware, but each large LLM model natively optimizes for CUDA, requiring costly customized engineering.

  • Don't wait: the expert who called NVIDIA successful 2010 conscionable revealed his apical 10 AI stocks. See the afloat database FREE now.

NVIDIA (NASDAQ: NVDA) and Cerebras Systems (NASDAQ: CBRS) conscionable delivered net that framework the aforesaid question from other ends. Nvidia posted different blowout 4th built connected its CUDA bundle stack. Cerebras, caller disconnected its May IPO, showed jaw-dropping inference velocity yet guided full-year operating margins negative. The moat is developer gravity.

 CBRS" are acceptable   against a acheronian  bluish  inheritance  with abstract information  streams and glowing reddish  and orangish  vigor  beams. A large, aureate  "VS." symbol, surrounded by light, dominates the halfway  of the image.

24/7 Wall St.

One Sells Platforms. The Other Sells Speed.

Nvidia's Q1 FY27 deed $81.61 cardinal successful revenue, up 85.2% YoY, with Data Center unsocial reaching $75.25 cardinal connected 92% growth. Networking soared 199% arsenic InfiniBand, NVLink and Spectrum-X locked customers deeper into the stack. Jensen Huang told investors NVIDIA is "the lone level that runs successful each cloud, powers each frontier and unfastened root model, and scales everyplace AI is produced", and the numbers backmost the claim.

Cerebras' archetypal study arsenic a nationalist institution landed differently. Q1 GAAP gross reached $193.4 million, up 94% YoY, with unreality services increasing 178%. A multi-year, $20 billion-plus OpenAI inference woody covering 750 megawatts anchors near-term growth. Yet absorption guided full-year operating margins to antagonistic 28% to antagonistic 32%. Speed sells. Scaling it economically is harder.

Software Gravity Beats Wafer-Scale Throughput

Independent benchmarks amusement Cerebras' wafer-scale plan delivering a 21x velocity vantage implicit Nvidia hardware for latency-sensitive, low-batch inference. The drawback is that each large LLM model and endeavor developer stack is natively optimized for Nvidia architecture retired of the box, portion Cerebras requires specialized compilation and customized engineering enactment for thing disconnected the well-trodden path.

Don't wait: the expert who called NVIDIA successful 2010 conscionable revealed his apical 10 AI stocks. See the afloat database FREE now.

Lens

NVIDIA

Cerebras

Core Bet

CUDA full-stack level

Wafer-scale inference velocity

Q1 Gross Margin

75.0% non-GAAP

44.6% GAAP

Anchor Customers

Meta, OpenAI, Anthropic, Google

OpenAI, AWS, G42

Biggest Vulnerability

OpenAI's Jalapeño customized spot

Negative operating margins

Nvidia's $119 cardinal successful proviso commitments and $80 cardinal added to its buyback authorization awesome absorption is doubling down connected the platform. Cerebras raised $5.6 cardinal astatine IPO and is funneling it into information halfway capableness for OpenAI's decode workloads portion AWS Trainium 3 handles prefill. That is simply a focused inference stake riding connected 1 customer's roadmap.

Read Entire Article