FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages • davanstrien • Jul 8, 2025 • 35
FineWeb2-C: Help Build Better Language Models in Your Language • davanstrien • Dec 23, 2024 • 21
Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required • nataliaElv, burtenshaw, dvilasuero +1 • Nov 4, 2024 • 45
How to build a custom text classifier without days of human labeling • sdiazlor • Oct 17, 2024 • 57
How to optimize your data labelling project with custom interfaces • burtenshaw • Oct 16, 2024 • 20