Generating and cleaning a preference dataset for DPO / ORPO with LLMs and distilabel

445 subscribers

293 views

About
Share

Published On Aug 30, 2024

We discussed:
The essentials of building a distilabel pipeline by exploring two key use cases: cleaning an existing dataset and generating a preference dataset for DPO/ORPO. You’ll also learn how to make the most of it, integrating Argilla to gather human feedback and improve its quality.

This session is perfect for you
if you’re getting started with distilabel or synthetic data
if you want to discover new functionalities
if you want to provide us with new feedback

You can find an overview of the shared documents here: https://drive.google.com/drive/folder...

Signup for coming meetups here: https://lu.ma/d720wy9f

Published On Aug 30, 2024

Share/Embed

Video Link