-
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper • 2402.01739 • Published • 28 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 141 -
Rethinking Interpretability in the Era of Large Language Models
Paper • 2402.01761 • Published • 23 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 161
Michel Chaduteau
michadu
·
AI & ML interests
None yet
Recent Activity
commented on
a paper
14 days ago
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies updated
a collection
4 months ago
LLM_papers updated
a collection
about 2 years ago
LLM_papers Organizations
None yet