Joseph Tang

lilvjosephtang

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

authored a paper 20 days ago

ChessQA: Evaluating Large Language Models for Chess Understanding

authored a paper 20 days ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

upvoted a paper 1 day ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

Paper • 2604.18519 • Published 8 days ago • 19

upvoted a paper 24 days ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 26 days ago • 42

upvoted an article 6 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article • Feb 11, 2025 • 119

upvoted a paper 8 months ago

SEAM: Semantically Equivalent Across Modalities Benchmark for Vision-Language Models

Paper • 2508.18179 • Published Aug 25, 2025 • 9