Topic

#inference

2 articles exploring inference. Expert insights and analysis from our editorial team.


Articles

AI Research

Executing Programs Inside Transformers: The Inference Breakthrough Nobody Expected

A new architecture from Percepta embeds a program interpreter directly into transformer weights, achieving logarithmic-time execution lookups that could reshape how AI agents handle deterministic computation—if the early claims survive scrutiny.

8 min read
AI Infrastructure

IonRouter: The YC Startup Solving the LLM Inference Cost Crisis

IonRouter by Cumulus Labs (YC W26) is a high-throughput inference API built on a custom C++ runtime for NVIDIA GH200 hardware, delivering roughly 2x the throughput of comparable providers at half the cost. As inference spending scales into the billions, Cumulus Labs is one of the first startups to compete at the infrastructure layer with a runtime optimized for specific silicon.

8 min read