8 2

Ivy

FURUF

AI & ML interests

NLP RL

Recent Activity

upvoted a paper about 2 months ago

Shaping capabilities with token-level data filtering

upvoted a paper about 2 months ago

Reinforcement Learning via Self-Distillation

upvoted a paper 2 months ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

View all activity

Organizations

None yet

upvoted 2 papers about 2 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 27

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 42

upvoted a paper 2 months ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published Jan 7 • 14

updated a Space 3 months ago

Trackio

🚀

published a Space 3 months ago

Trackio

🚀

liked a Space 4 months ago

The Smol Training Playbook

📚

3.06k

The secrets to building world-class LLMs

updated a dataset 6 months ago

FURUF/Entity4Hallucination

Preview • Updated Oct 7, 2025 • 8

published a dataset 6 months ago

FURUF/Entity4Hallucination

Preview • Updated Oct 7, 2025 • 8

upvoted a paper 6 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

upvoted an article 8 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29, 2025

•

219

upvoted a paper 10 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 339

upvoted a paper about 1 year ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 154

liked a model over 1 year ago

EleutherAI/sae-llama-3-8b-32x-v2

Updated Jul 16, 2024 • 17

upvoted an article over 1 year ago

Article

Vision Language Models Explained

Apr 11, 2024

•

529

Ivy

AI & ML interests

Recent Activity

Organizations

FURUF's activity

Trackio

Trackio

The Smol Training Playbook

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Vision Language Models Explained