Sail builds efficient inference systems for background agents. In this talk, I’ll describe how we optimize for throughput over latency at every level of the stack, from silicon to API. I’ll also share what we’ve learned from customers about designing effective background agents: how to build an async harness, handle very long contexts, and run agent sandboxes at scale.