Published On Aug 30, 2024
We discussed:
The essentials of building a distilabel pipeline by exploring two key use cases: cleaning an existing dataset and generating a preference dataset for DPO/ORPO. You’ll also learn how to make the most of it, integrating Argilla to gather human feedback and improve its quality.
This session is perfect for you
if you’re getting started with distilabel or synthetic data
if you want to discover new functionalities
if you want to provide us with new feedback
You can find an overview of the shared documents here: https://drive.google.com/drive/folder...
Signup for coming meetups here: https://lu.ma/d720wy9f
show more