DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published Jan 22, 2025 β’ 444
Improved Baselines with Visual Instruction Tuning Paper β’ 2310.03744 β’ Published Oct 5, 2023 β’ 39
view changelog Hugging Face Changelog Connect Your MCP Client to the Hugging Face Hub Jun 6, 2025 β’ 114
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL +4 Jun 3, 2025 β’ 101
sarvam-m Collection Collection of all variations of the sarvam-m model β’ 3 items β’ Updated May 24, 2025 β’ 26
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 β’ 128
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 β’ 176
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context +6 Jul 23, 2024 β’ 241