Nuclear clamp: every reward source in the codebase now returns (0.05, 0.95) 719c147 ar9avg committed on Apr 12
Clamp all remaining score leak paths: /state, step_rewards, demo SSE e99d0aa ar9avg committed on Apr 11
Bulletproof _safe_score for all bad inputs (None, NaN, strings, bool) 2014920 ar9avg committed on Apr 11
Fix task scores to be strictly in (0, 1) exclusive per OpenEnv spec d2d92b8 ar9avg committed on Apr 11
Refactor README.md by removing metadata and updating content 805743c unverified ar9avg committed on Apr 11
Reset SQL on new attempt so attempts don't concatenate in the same box b15235a ar9avg committed on Apr 11
Fix UnboundLocalError: remove duplicate local import of REPAIR_ACTION_BY_NAME c2894a4 ar9avg committed on Apr 11
Surface GEPA optimization: prompt history, live banner, smart retry aa3ae1f ar9avg committed on Apr 11
Add LLM diagnostics: /api/test-llm endpoint + startup/error logging 63cbec3 ar9avg committed on Apr 11
Fix chat SSE events to match frontend protocol (result+done instead of success) f4110fc ar9avg committed on Apr 11
fix: GEPA current_generation, task_id mapping, Connect DB button, remove difficulty from header f0b682f ar9avg committed on Apr 11
fix: default LLM to HF Router + Qwen2.5-72B, no custom Space variables needed 68ebe84 ar9avg committed on Apr 11
feat: demo mode with reward chart, github diff, single-difficulty rounds, no loop 2d33bcd ar9avg committed on Apr 11
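
The score-clamping commits above (719c147, e99d0aa, 2014920, d2d92b8) describe the behavior only in their messages. As a rough illustration, here is a minimal sketch of what such a guard might look like. The function name safe_score, its signature, the fallback value, and the choice to treat the bounds as inclusive are all assumptions; only the 0.05 and 0.95 bounds and the list of bad inputs (None, NaN, strings, bool) come from the commit messages. This is not the repository's actual _safe_score.

```python
import math

# Bounds taken from the "Nuclear clamp" commit message; treating them as
# inclusive limits is an assumption of this sketch.
LOW, HIGH = 0.05, 0.95


def safe_score(value, default=LOW):
    """Hypothetical stand-in for _safe_score: always return a float in [LOW, HIGH]."""
    # bool is a subclass of int in Python, so it must be rejected explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        return default  # covers None, strings, and any other non-numeric type
    score = float(value)
    if math.isnan(score):
        return default  # NaN cannot be ordered, so fall back instead of clamping
    # Clamp everything else, including +/- infinity, into the allowed range.
    return min(HIGH, max(LOW, score))
```

Under these assumptions, safe_score(1.0) returns 0.95 and safe_score(None) returns 0.05, so every result stays strictly inside (0, 1) as the d2d92b8 message requires.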