2026 Talks
Inference for Async Agents in Production
Missing value detected...
Video will be populated after the conference
- Agent Infrastructure
As models improve, we are starting to build long-running, asynchronous agents such as deep research agents and browser agents that can execute multi-step workflows autonomously. These systems unlock new use cases, but they use orders of magnitude more tokens and compute, creating scaling bottlenecks.
This talk discusses practical strategies builders can use to maximize async agent performance while keeping inference costs under control. Topics covered include context engineering, compaction, cache maintenance, model routing, and batch inference. This talk is aimed at use case developers, with secondary relevance to platform engineers.
CEO
Meryem Arik
Doubleword
Meryem Arik is Co-founder and CEO of Doubleword, the inference provider built for high-volume async workloads - and the team behind one of Europe's leading inference provider. Forbes 30 Under 30 honoree and ex-Physics alumna of Oxford, she is a regular conference speaker, having spoken at TEDx, QCon, and leading AI engineering conferences.
The AI Conference for Humans Who Ship
While other conferences theorize, AI Council features the engineers shipping tomorrow's breakthroughs today.