Reasoning - a kd303 Collection

kd303 's Collections

Dataset - speech

Books-data-training

Data Quality Models

Reasoning-lastest

Synthetic Data papers

Reasoning

updated Aug 17, 2025

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 73
Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 20
Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 16
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published Dec 22, 2024 • 9
o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 44
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation

Paper • 2411.11053 • Published Nov 17, 2024 • 4
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay

Paper • 2410.12236 • Published Oct 16, 2024 • 1
Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36
nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 4k • 531
nvidia/OpenCodeReasoning-2

Viewer • Updated May 17, 2025 • 2.16M • 4.07k • 53