Reinforcement Learning from Verifiable Rewards (RLVR) is increasingly common in post-training pipelines, but the practical details are often glossed over. How do you design reward functions that programmatically verify model outputs? What makes synthetic training data effective? How do you build a custom RL environment that doesn't silently break your training?
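To ground what "verifiable" means here: the reward is computed by a program that checks the output against a known answer or test, not by a learned reward model. A minimal sketch of such a check is below; the function name and the last-number parsing convention are illustrative assumptions, not a reference to any specific library.

```python
import re

def math_reward(completion: str, ground_truth: str) -> float:
    """Binary verifiable reward: 1.0 if the model's final numeric answer
    matches the reference answer, else 0.0.

    Because the check is programmatic, fluent-but-wrong completions score 0,
    which is the property RLVR relies on.
    """
    # Deliberately simple convention: treat the last number in the completion
    # as the final answer. Real pipelines usually parse a structured format
    # such as \boxed{...} or an <answer> tag instead.
    numbers = re.findall(r"-?\d+(?:\.\d+)?", completion)
    if not numbers:
        return 0.0
    return 1.0 if float(numbers[-1]) == float(ground_truth) else 0.0


if __name__ == "__main__":
    print(math_reward("The total is 12 apples, so the answer is 42.", "42"))  # 1.0
    print(math_reward("I think the answer is 41.", "42"))                      # 0.0
```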