Article How I contributed a new model to the Transformers library using Codex 24 days ago • 48
SmolLM2 Automatic Essay Grading Collection Automatic Essay Grading - SmolLM2 • 15 items • Updated Jun 9, 2025 • 1
Qwen2.5 Automatic Essay Grading Collection Automatic Essay Grading - Qwen2.5 • 15 items • Updated Jun 9, 2025 • 1
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025 • 129
Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 70
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7, 2025 • 289
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31, 2025 • 32
Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: Nov 30, 2024 • 28
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Paper • 2404.05726 • Published Apr 8, 2024 • 23
Model Merging Collection Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 253
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models Paper • 2312.17661 • Published Dec 29, 2023 • 15
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 264
Distributed Representations of Words and Phrases and their Compositionality Paper • 1310.4546 • Published Oct 16, 2013 • 3
Efficient Estimation of Word Representations in Vector Space Paper • 1301.3781 • Published Jan 16, 2013 • 8
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 60
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning Paper • 2012.13255 • Published Dec 22, 2020 • 5