Quentin Gallouédec's picture

Hiring 💼

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

posted an update about 23 hours ago

TRL v1.3 ships day-one training support for Qwen 3.6 🚀 The new Qwen 3.6 family (`Qwen/Qwen3.6-27B`, `Qwen/Qwen3.6-35B-A3B`) reuses the Qwen3.5-MoE architecture but ships a slightly different chat template, so we updated the stack end-to-end: new training template with `{% generation %}` markers, tool-call response schema routing, tiny test models for the VLM matrix. SFT with assistant-only loss works out of the box: ```python from trl import SFTConfig, SFTTrainer trainer = SFTTrainer( model="Qwen/Qwen3.6-27B", args=SFTConfig(assistant_only_loss=True), train_dataset=dataset, ) trainer.train() ``` So does GRPO tool-calling — just hand `tools=[...]` to `GRPOTrainer`. v1.3 also brings a new experimental TPO trainer (Triple Preference Optimization), speculative decoding in `trl vllm-serve` (Qwen3 MTP / Eagle3 drafts), 12 more KTO ↔ DPO alignment PRs (KTO promotion to stable is now in reach), three more `{% generation %}` chat templates (Gemma/Gemma 2, Phi-3, GLM-4-MoE), and a chunky SFT entropy bug fix. Full release notes: https://github.com/huggingface/trl/releases/tag/v1.3.0

updated a dataset 1 day ago

hf-doc-build/doc-build

updated a bucket 1 day ago

hf-doc-build/doc

View all activity

Organizations

qgallouedec 's models 789

qgallouedec/Qwen3-0.6B-SFT-20251113165959

Text Generation • 0.6B • Updated 19 days ago • 291

qgallouedec/tiny-aya-global-SFT

qgallouedec/tiny-aya-global-tool-calling-SFT

qgallouedec/my-other-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 12

qgallouedec/my-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 13

qgallouedec/trainer_output

Text Generation • 0.5B • Updated Feb 14 • 10

qgallouedec/test_push_output_4

Text Classification • 87.5k • Updated Feb 14 • 6

qgallouedec/qwen2-0.5b-deepmath-grpo

qgallouedec/my-finetuned-model

0.8B • Updated Jan 2 • 1

qgallouedec/Qwen3-0.6B-SFT-20251113163732

Updated Nov 13, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112173255

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112165832

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171926

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171823

Updated Nov 12, 2025

qgallouedec/gold-model

Updated Oct 30, 2025

qgallouedec/custom-resnet50d

Feature Extraction • 25.6M • Updated Oct 1, 2025 • 4

qgallouedec/Qwen3-1.7B-parsing

Text Generation • 2B • Updated Sep 27, 2025 • 5

qgallouedec/Qwen2.5-0.5B-SFT

Text Generation • 0.5B • Updated Sep 14, 2025 • 7

qgallouedec/Qwen2-0.5B-Reward

Token Classification • 0.5B • Updated Sep 14, 2025 • 6

qgallouedec/Qwen3-0.6B-SFT-20250911031144

Text Generation • 0.6B • Updated Sep 11, 2025 • 10

qgallouedec/Qwen3-0.6B-SFT-20250911023224

Text Generation • 0.6B • Updated Sep 11, 2025 • 6

qgallouedec/Qwen3-0.6B-Base-SFT-20250911020040

Text Generation • 0.6B • Updated Sep 11, 2025 • 4

qgallouedec/Qwen3-0.6B-SFT-20250911021538

Text Generation • 0.6B • Updated Sep 11, 2025 • 2

qgallouedec/Qwen3-0.6B-Base-SFT-20250911021314

Text Generation • 0.6B • Updated Sep 11, 2025 • 3

qgallouedec/Qwen3-0.6B-Base-SFT-20250911014759

Text Generation • 0.6B • Updated Sep 11, 2025 • 2

qgallouedec/Qwen3-0.6B-Base-SFT-20250911011255

Text Generation • 0.6B • Updated Sep 11, 2025 • 4

qgallouedec/after

Text Generation • 0.5B • Updated Sep 11, 2025 • 7

qgallouedec/before

Text Generation • 0.5B • Updated Sep 11, 2025 • 7

qgallouedec/Qwen3-1.7B-SFT-20250910184326

Text Generation • 2B • Updated Sep 10, 2025 • 10

qgallouedec/Qwen3-4B-SFT-20250910180651

Text Generation • 4B • Updated Sep 10, 2025 • 2