Technical Talks

Five years of OpenLineage: How we built an industry standard and why agents need it

Harel Shein | Senior Engineering Manager | Datadog

Over the past five years, OpenLineage has become the de facto standard for data lineage metadata, adopted across the industry by leading platforms and enterprises. In this talk, we'll trace the journey of building an open standard. You'll learn what changed in the ecosystem to make standardization possible, which features drove adoption (column-level lineage, streaming support, unified facets), and where OpenLineage stands today, five years after its initial release. Most importantly, we'll explore why this matters now: as AI agents increasingly make decisions about data (where to read from, what to trust, how fresh it is), they need a shared understanding of data context. Lineage metadata is the knowledge graph that transforms agents from black boxes into informed decision-makers. The talk covers the standards perspective, the pragmatic integration challenges, and a forward-looking vision of how great metadata enables intelligent data systems.

Harel Shein is an Engineering Manager II at Datadog, a leading observability and security SaaS platform. He works on data lineage and integrations for Data Observability, and he is a committer on OpenLineage and a member of its Technical Steering Committee (TSC). Before joining Datadog, he held product engineering leadership positions at Astronomer and data engineering leadership positions at WeWork.
