Topic
#inference
2 articles exploring inference. Expert insights and analysis from our editorial team.
Articles
AI Research
Executing Programs Inside Transformers: The Inference Breakthrough Nobody Expected
A new architecture from Percepta embeds a program interpreter directly into transformer weights, achieving logarithmic-time execution lookups that could reshape how AI agents handle deterministic computation—if the early claims survive scrutiny.
AI Infrastructure
IonRouter: The YC Startup Solving the LLM Inference Cost Crisis
IonRouter by Cumulus Labs (YC W26) is a high-throughput inference API built on a custom C++ runtime for NVIDIA GH200 hardware, delivering roughly 2x the throughput of comparable providers at half the cost. As inference spending scales into the billions, it is one of the first startups to compete at the infrastructure layer with a runtime purpose-built for specific silicon.