PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657
We're the McAuley Lab at UC San Diego with PI Prof. Julian McAuley, focusing on cool machine learning and natural language processing applications!
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses