Open to Collab

Zijun Wang

Olivia714

https://asillycat.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 minutes ago

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

Paper • 2604.21375 • Published 2 days ago • 10

upvoted a paper about 19 hours ago

Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

Paper • 2604.20200 • Published 3 days ago • 4

upvoted a paper 3 days ago

Target-Oriented Pretraining Data Selection via Neuron-Activated Graph

Paper • 2604.15706 • Published 8 days ago • 10

submitted a paper to Daily Papers 3 days ago

Target-Oriented Pretraining Data Selection via Neuron-Activated Graph

Paper • 2604.15706 • Published 8 days ago • 10

authored 7 papers 18 days ago

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Paper • 2504.01903 • Published Apr 2, 2025

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29, 2025 • 9

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

Where on Earth? A Vision-Language Benchmark for Probing Model Geolocation Skills Across Scales

Paper • 2510.10880 • Published Oct 13, 2025

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

Paper • 2311.16101 • Published Nov 27, 2023 • 1

AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation

Paper • 2410.09040 • Published Oct 11, 2024

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Paper • 2604.04759 • Published 19 days ago • 24

upvoted a paper 18 days ago

Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Paper • 2604.04759 • Published 19 days ago • 24

upvoted a paper about 1 month ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139

upvoted a paper 6 months ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

updated 2 models 7 months ago

Olivia714/qwen7b-distill-thinkflag1-all_10_or_1_9_plus_2_9_5k-epoch0

Text Generation • 8B • Updated Oct 9, 2025 • 2

Olivia714/llama8b-distill-thinkflag1-all_10_or_1_9_plus_2_9_5k-epoch0

Text Generation • 8B • Updated Oct 9, 2025 • 2

published 2 models 7 months ago

Olivia714/qwen7b-distill-thinkflag1-all_10_or_1_9_plus_2_9_5k-epoch0

Text Generation • 8B • Updated Oct 9, 2025 • 2

Olivia714/llama8b-distill-thinkflag1-all_10_or_1_9_plus_2_9_5k-epoch0

Text Generation • 8B • Updated Oct 9, 2025 • 2

upvoted a paper 8 months ago

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29, 2025 • 9

updated a model about 1 year ago

UCSC-VLAA/STAR1-R1-Distill-32B

Text Generation • 33B • Updated Apr 4, 2025 • 247
