It is hard to tell whether a complex agent is improving over time, and there are many ways it can go wrong. AI engineers therefore need a disciplined evaluation process. In this session, Curtis Galione (Solutions Engineer, Braintrust) will walk through evaluating an agent end-to-end with a hands-on example. Feel free to follow along!