Generating and cleaning a preference dataset for DPO / ORPO with LLMs and distilabel
Argilla

Published on Aug 30, 2024

We discussed the essentials of building a distilabel pipeline through two key use cases: cleaning an existing dataset and generating a preference dataset for DPO/ORPO. You’ll also learn how to integrate Argilla to gather human feedback and improve the quality of the resulting dataset.
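
For readers new to preference data, here is a minimal plain-Python sketch of what a single DPO/ORPO-style record looks like once several generations for one prompt have been rated. It is not distilabel's API: the function name `to_preference_pair` and the example ratings are hypothetical, and the `prompt` / `chosen` / `rejected` field names are the commonly used convention, assumed here rather than taken from the session.

```python
from typing import Optional


def to_preference_pair(
    prompt: str, generations: list[str], ratings: list[float]
) -> Optional[dict]:
    """Build one preference record from rated generations (hypothetical helper).

    Returns None when the input carries no usable preference signal.
    """
    # Need one rating per generation and at least two candidates to compare.
    if len(generations) != len(ratings) or len(generations) < 2:
        return None

    # Index of the highest- and lowest-rated generation.
    best = max(range(len(ratings)), key=ratings.__getitem__)
    worst = min(range(len(ratings)), key=ratings.__getitem__)

    # A tie between best and worst means there is nothing to prefer.
    if ratings[best] == ratings[worst]:
        return None

    return {
        "prompt": prompt,
        "chosen": generations[best],
        "rejected": generations[worst],
    }


# Illustrative data only: three candidate answers with made-up ratings.
pair = to_preference_pair(
    "What is DPO?",
    ["It is a database.", "A method for tuning models on preference pairs.", "Not sure."],
    [1.0, 5.0, 2.0],
)
```

Dropping tied records is one possible cleaning rule; a human review step, such as the Argilla feedback loop mentioned above, could be used to resolve such cases instead.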

This session is perfect for you:
if you’re getting started with distilabel or synthetic data
if you want to discover new functionalities
if you want to provide us with new feedback

You can find an overview of the shared documents here: https://drive.google.com/drive/folder...

Sign up for upcoming meetups here: https://lu.ma/d720wy9f
