Skip to content
Chip Beat
Explainers AI & GPU Accelerators Chip Design & Architecture Foundries & Manufacturing
Memory & Storage Advanced Packaging Geopolitics & Supply Chain Startups & Funding Industry Analysis

#ai-inference

Conceptual illustration of data flowing efficiently between memory and compute units on a chip.
AI & GPU Accelerators

Semidynamics Funding Signals Shift to Memory-Centric AI

The AI hardware race isn't just about raw cores anymore. Semidynamics' latest funding round underscores a critical shift: tackling the "memory wall" with a new architectural paradigm.

6 min read 3 days, 3 hours ago
Fractile's logo with a background suggesting complex circuitry or data flow.
AI & GPU Accelerators

Fractile's $220M bet: Supercharging AI inference hardware

Forget raw AI capability; the real bottleneck is now time. Fractile just raised $220 million on the audacious premise that the future of AI hinges on radically faster inference hardware.

6 min read 1 week, 3 days ago
Conceptual illustration of a satellite equipped with large solar panels and cooling radiators, orbiting Earth.
Startups & Funding

Space Data Centers: Orbiting AI's Insatiable Energy Demand

As AI's energy appetite explodes, one startup is looking skyward, proposing data centers on satellites. It's a bold vision, aiming to harness solar power and escape terrestrial grid limitations.

6 min read 1 week, 6 days ago
💻
Chip Design & Architecture

SiPearl & Semidynamics: A New Rack-Scale AI Inference Play?

Arm meets RISC-V in a bid to conquer the AI inference server market. SiPearl and Semidynamics are launching a new rack-scale platform, but the real question is who benefits and at what cost?

5 min read 2 weeks, 1 day ago
Illustration of abstract neural network connections with data flowing through them.
Chip Design & Architecture

Anthropic Eyes UK Startup for 100x Faster AI Inference

AI labs are desperate for faster, cheaper compute. Anthropic's reported interest in UK startup Fractile suggests a serious play for performance gains, potentially shaking up the chip landscape.

6 min read 2 weeks, 5 days ago
Tenstorrent Galaxy Blackhole server rack, showcasing a dense configuration of AI compute hardware.
AI & GPU Accelerators

Tenstorrent's Galaxy Claims 350 Tokens/s, 5x NVIDIA TCO Cut

Tenstorrent isn't just entering the AI hardware race; they're declaring war. Their new Galaxy Blackhole servers, powered by RISC-V, claim performance metrics that could fundamentally shift the landscape, and with prices that might just make hyperscalers listen.

6 min read 3 weeks ago
Abstract representation of interconnected chips and data streams symbolizing AI infrastructure.
Chip Design & Architecture

Intel & SambaNova: The Quiet AI Hardware Shakeup

The AI gold rush is hitting a wall, and chipmakers are quietly scrambling. Intel and SambaNova's latest move isn't just another partnership; it's a pragmatic pivot away from the GPU-only playbook.

6 min read 1 month, 1 week ago
Jensen Huang at GTC 2026 stacking R200 GPU and Groq LP30 chip
AI & GPU Accelerators

Nvidia's $20 Billion Groq Heist: GPU Kingpin Bows to Inference Rebels

Nvidia shelled out $20 billion for Groq's team and tech. Now they're admitting it: GPUs alone won't cut it for low-latency AI inference anymore.

5 min read 1 month, 1 week ago
VSORA Jotunn8 chip with flowing data streams in AI inference architecture
AI & GPU Accelerators

VSORA's Jotunn8: Silicon That Starves Never Again for AI Inference

Deep in a blazing data center, VSORA's Jotunn8 chip devours inference workloads like a Norse giant feasting endlessly. No more data droughts—just pure, relentless AI power.

4 min read 1 month, 1 week ago
Nvidia MLPerf inference benchmark charts showing record highs
AI & GPU Accelerators

Nvidia's Software Flexes Hard in MLPerf Inference Blowout

Jensen Huang's full-platform sermon just got empirical backup. Nvidia's software tweaks propel MLPerf inference benchmarks to absurd new heights, leaving rivals in the dust.

5 min read 1 month, 1 week ago
Jensen Huang unveiling Nvidia Groq 3 LPU on stage at GTC with inference architecture diagram
AI & GPU Accelerators

Nvidia's Groq 3 LPU: Why Inference Just Ate Training's Lunch

Forget the training hype—Nvidia's surprise Groq 3 LPU at GTC signals inference is now king. This lean beast prioritizes speed over brute force, reshaping data centers.

5 min read 1 month, 1 week ago
VSORA engineers celebrating AI inference chip tape-out in Paris lab
AI & GPU Accelerators

VSORA's Memory Wall Breakthrough: Can This French Upstart Reshape AI Inference?

A fresh tape-out from France's VSORA targets the memory wall strangling AI inference. Sandra Rivera explains why this could flip the script on power-hungry GPUs.

5 min read 1 month, 1 week ago

Categories

Explainers AI & GPU Accelerators Chip Design & Architecture Foundries & Manufacturing Memory & Storage Advanced Packaging Geopolitics & Supply Chain Startups & Funding
Chip Beat

Silicon. Signals. Systems.

More

  • RSS Feed
  • Sitemap
  • About
  • Editorial Process
  • Advertise

Legal

  • Privacy
  • Terms
  • Work With Us

Our Network

The AI Catchup AI & Machine Learning Threat Digest Cybersecurity Legal AI Beat Legal Tech Fintech Rundown Finance & Banking DevTools Feed Developer Tools Open Source Beat Open Source Fintech Dose Crypto & DeFi Chip Beat Semiconductors AdTech Beat Ad Technology Supply Chain Beat Logistics

© 2026 Chip Beat. All rights reserved.

🏠Home 🔍Search 🔖Saved 📂Categories
Privacy & cookies

We use a privacy-respecting analytics tool to count page views — no personal profiles, no ad tracking, no third-party cookies. Accept to help us understand which stories matter to readers.

Details