Darshan Deshpande
DarshanDeshpande
AI & ML interests
Explainability, Robustness, Evaluations
Recent Activity
liked a dataset 29 days ago
PatronusAI/trace-dataset
upvoted a paper about 1 month ago
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
submitted a paper about 1 month ago
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis