Optimal last-iterate convergence in matrix games with bandit feedback using the log-barrier
Paper • 2604.15242 • Published
None defined yet.
MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts
T-REGS: Minimum Spanning Tree Regularization for Self-Supervised Learning