Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Paper • 2505.19914 • Published May 26, 2025 • 46
Running 29 Llama-4-Maverick-03-26-Experimental Battles 🔥 29 Display and filter chat conversations between models
ValueFX9507/Tifa-Deepsex-14b-CoT Reinforcement Learning • 15B • Updated Feb 13, 2025 • 343 • 218
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published Jan 20, 2025 • 109