Difan Jiao

difanjiao · difanj0713

AI & ML interests

Generative Models & Mech Interp

Recent Activity

authored a paper about 5 hours ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

upvoted a paper 7 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

updated a model 11 days ago

difanjiao/vanilla_grpo_math_Qwen3-4B

upvoted a paper 7 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 14 days ago • 94

upvoted a paper 18 days ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 20 days ago • 41

upvoted an article 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 288