Argilla
445 subscribers
44:03
Argilla Community Everything image: from fine-tuning CLIP models to synthetic image datasets
Argilla
121 views • 2 weeks ago
0:59
What is distilabel? A brief feature overview.
Argilla
170 views • 1 month ago
31:19
Generating and cleaning a preference dataset for DPO / ORPO with LLMs and distilabel
Argilla
292 views • 1 month ago
45:54
Optimizing RAG Pipelines by fine-tuning custom embedding models on synthetic data with ZenML
Argilla
304 views • 1 month ago
0:32
ZenML a way to streamline your complex projects with ease
Argilla
35 views • 1 month ago
0:32
cosine similarity as proxy for quality of sentence pair data
Argilla
58 views • 1 month ago
0:39
optimizing RAG by choosing the right model
Argilla
63 views • 1 month ago
0:36
model pooling for diverse synthetic data generation
Argilla
32 views • 1 month ago
30:12
Ellamind on synthetic data generation with distilabel for pipelining and LLM finetuning
Argilla
185 views • 2 months ago
4:45
Scaling Synthetic Data Creation with 1 Billion Personas | PersonaHub Dataset Explained
Argilla
699 views • 2 months ago
50:02
Javier Alonso on lead optimisation at Idealista
Argilla
95 views • 3 months ago
6:29
Exploring the PRISM Dataset: Conversations, Insights, and Model Performance
Argilla
196 views • 3 months ago
39:06
Ben Burtenshaw on the Argilla 2.0 SDK refactor
Argilla
91 views • 3 months ago
1:02:59
Louis Guitton on NER with Argilla
Argilla
128 views • 4 months ago
54:39
Weights & Biases on Wandbot
Argilla
27 views • 4 months ago
32:18
Datamaran on using Argilla in MLOps workflows for ESG governance
Argilla
102 views • 4 months ago
39:00
Understanding and reproducing DEITA with MantisNLP using distilabel=1.0.0
Argilla
168 views • 6 months ago
38:11
Elad Levi on AutoPrompt and intent-based prompt calibration and prompt engineering
Argilla
551 views • 6 months ago
55:51
Daniel van Strien on the Hugging Face hub and synthetic creation of a DPO dataset for Haiku
Argilla
230 views • 7 months ago
51:10
Seth Levine on the usage of SetFit and BERTopic for unsupervised clustering
Argilla
450 views • 7 months ago
45:40
Red Cross 510 on NLP for good with SetFit for chat message classification
Argilla
209 views • 7 months ago
49:29
Prolific on workload distribution, LLM preference data annotation and Phi2 fine-tune Colab
Argilla
323 views • 8 months ago
1:03:58
Pitching AI to your boss, SLMs vs LLMs and contributing to open source projects
Argilla
92 views • 8 months ago
43:59
Kickstart NLP with synthetic data and running LLMs on Google Colab using vLLM
Argilla
234 views • 9 months ago
36:03
How we cleaned OpenBMB UltraFeedback and Notus
Argilla
197 views • 9 months ago
36:40
An introduction to distilabel for AI feedback and synthetic data generation
Argilla
621 views • 10 months ago
46:16
Deploy Argilla on a private Hugging Face space and how to contribute to open source
Argilla
164 views • 10 months ago
45:29
Deploy Argilla on a public Hugging Face space and create multi-modal datasets
Argilla
251 views • 11 months ago
2:17
Meet Argilla
Argilla
4.2K views • 1 year ago
0:20
Collect human feedback for evaluating fine-tuned LLMs
Argilla
339 views • 1 year ago