v19b best model — stage 11 (Hard, 15 opponents), 90% win rate, update 1260 672903a verified JoshuaFreeman commited on 2 days ago
v19b best model (update 580, reward 1.29, best sweep params + potential shaping alpha=1) 573abc2 verified JoshuaFreeman commited on 3 days ago
Update best_model.pt to v18b (obs_dim=96, 512-512-256, reward=0.584, loss penalty) 2e8eda4 verified JoshuaFreeman commited on 3 days ago
Update best_model.pt to v17 (obs_dim=80, best_reward=0.535) 9d3b760 verified JoshuaFreeman commited on 4 days ago
Upload v16 best model (wilderness + boat attack fix, reward 0.516) 5ccfee2 verified JoshuaFreeman commited on 5 days ago
Upload v15 best model (wilderness + optimized BFS, reward 0.555) 9e935c8 verified JoshuaFreeman commited on 5 days ago
v13b (update 1550): normalized elim + winner bonus, vf=0.5, best generalization 2296e2d verified JoshuaFreeman commited on 5 days ago
v12a: 100% win rate on Easy/2, normalized elimination reward 2d620cc verified JoshuaFreeman commited on 6 days ago