McAuley-Lab

university

https://cseweb.ucsd.edu/~jmcauley/

AI & ML interests

We're the McAuley Lab at UC San Diego with PI Prof. Julian McAuley, focusing on cool machine learning and natural language processing applications!

Recent Activity

ZhankuiHe authored a paper 27 days ago

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

hyp1231 authored a paper 27 days ago

Deriving Character Logic from Storyline as Codified Decision Trees

hyp1231 authored a paper about 1 month ago

Codified Foreshadowing-Payoff Text Generation

View all activity

Papers

When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

View all Papers

ZhankuiHe

authored a paper 27 days ago

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Paper • 2601.10657 • Published 28 days ago • 20

hyp1231

authored a paper 27 days ago

Deriving Character Logic from Storyline as Codified Decision Trees

Paper • 2601.10080 • Published 29 days ago • 6

hyp1231

authored a paper about 1 month ago

Codified Foreshadowing-Payoff Text Generation

Paper • 2601.07033 • Published Jan 11 • 3

XinXuNLPer

authored a paper 4 months ago

When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation

Paper • 2510.07238 • Published Oct 8, 2025 • 15

Ethan2003

updated a dataset 4 months ago

McAuley-Lab/BLaIR-Bench-API

Viewer • Updated Oct 3, 2025 • 1.63k • 44

hyp1231

updated a dataset 4 months ago

McAuley-Lab/BLaIR-Bench-API

Viewer • Updated Oct 3, 2025 • 1.63k • 44

XinXuNLPer

authored a paper 4 months ago

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

Paper • 2510.00232 • Published Sep 30, 2025 • 16

Ethan2003

published a dataset 5 months ago

McAuley-Lab/BLaIR-Bench-API

Viewer • Updated Oct 3, 2025 • 1.63k • 44

XinXuNLPer

authored a paper 5 months ago

WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning

Paper • 2509.04744 • Published Sep 5, 2025 • 12

Ethan2003

updated a dataset 6 months ago

McAuley-Lab/BLaIR-Benchmark-Testset

Preview • Updated Aug 14, 2025 • 16

tianyang

authored 2 papers 8 months ago

Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models

Paper • 2411.08733 • Published Nov 13, 2024 • 1

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 50

XinXuNLPer

authored a paper 9 months ago

Improving In-Context Learning with Reasoning Distillation

Paper • 2504.10647 • Published Apr 14, 2025

hyp1231

published a dataset 9 months ago

McAuley-Lab/blair-bench

Viewer • Updated Jan 14, 2025 • 27.6k • 5

XtremSup

authored a paper 9 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 80

hyp1231

in McAuley-Lab/Amazon-Reviews-2023 10 months ago

Dataset Viewer issue: DatasetWithScriptNotSupportedError

#9 opened over 1 year ago by

MuajNSTU

XtremSup

authored a paper 10 months ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16, 2025 • 48

hyp1231

in McAuley-Lab/Amazon-Reviews-2023 11 months ago

Nullity of Meta data

#13 opened 11 months ago by

S4m2357

5-Core Full Reviews?

#14 opened 11 months ago by

bcobos

Ethan2003

published a dataset 11 months ago

McAuley-Lab/BLaIR-Benchmark-Testset

Preview • Updated Aug 14, 2025 • 16

AI & ML interests

Recent Activity

Papers

Team members 12

McAuley-Lab's activity

Dataset Viewer issue: DatasetWithScriptNotSupportedError

Nullity of Meta data

5-Core Full Reviews?