Blanc Swan PRO

blancsw

https://www.infomaniak.com

swan-blanc-no-code-team

AI & ML interests

ChatBot

Recent Activity

upvoted 2 papers about 15 hours ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 14 days ago • 167

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 16 days ago • 175

upvoted a paper 6 days ago

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 14 days ago • 228

upvoted a changelog 8 days ago

Changelog

Community Evals and Benchmark Repositories

20 days ago • 60

liked a model 9 days ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 2 days ago • 483k • 1.04k

updated a collection 10 days ago

Distilation

3 items • Updated 10 days ago

upvoted a collection 10 days ago

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 67 items • Updated 8 days ago • 8

updated 2 collections 10 days ago

Distilation

3 items • Updated 10 days ago

TranslateGemma VLLM

Modified version of google/translategemma-4/12/27b-it optimized for deployment with vLLM. • 3 items • Updated 2 days ago • 2

upvoted an article 10 days ago

Article

Custom Kernels for All from Codex and Claude

12 days ago • 60

upvoted a paper 10 days ago

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Paper • 2512.20908 • Published Dec 24, 2025 • 29

upvoted a paper 11 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 20 days ago • 333

upvoted a paper 12 days ago

MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents

Paper • 2601.03236 • Published Jan 6 • 7

liked a model 13 days ago

a-m-team/AM-Thinking-v1

Text Generation • 33B • Updated May 14, 2025 • 135 • 203

liked a model 14 days ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 171B • Updated 20 days ago • 1.38M • 2.12k

upvoted 2 papers 22 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 228

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 27 days ago • 156

New activity in Infomaniak-AI/vllm-translategemma-4b-it 22 days ago

Any plan to release a 12B version?

#1 opened 27 days ago by

updated a model 22 days ago

Infomaniak-AI/vllm-translategemma-12b-it

Image-Text-to-Text • Updated 22 days ago • 1.72k • 1

updated a collection 22 days ago

TranslateGemma VLLM

Modified version of google/translategemma-4/12/27b-it optimized for deployment with vLLM. • 3 items • Updated 2 days ago • 2