6 413 31

Young-Jun Lee PRO

passing2961

Sterzhang's profile picture

starsuzi's profile picture

chano12's profile picture

https://sites.google.com/view/passing2961/home

passing2961
passing2961
passing2961

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 4 days ago

Towards a Science of AI Agent Reliability

upvoted a paper 5 days ago

HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam

upvoted a paper 5 days ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all activity

Organizations

passing2961 's collections 5

Multi-Turn Evaluation Benchmarks

A collection of benchmarks for evaluating LMs or VLMs under multi-turn interaction

passing2961/MultiVerse

Viewer • Updated Nov 1, 2025 • 647 • 65 • 1
passing2961/photochat_plus

Viewer • Updated Dec 3, 2024 • 968 • 67 • 4
RefineBench/RefineBench

Viewer • Updated Dec 2, 2025 • 1k • 1.24k • 5

Stark

Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

passing2961/stark-face-image

Viewer • Updated Nov 6, 2024 • 93.6k • 78 • 3
passing2961/stark-summary

Viewer • Updated Nov 6, 2024 • 53.3k • 76 • 2
passing2961/stark-image-url

Viewer • Updated Nov 6, 2024 • 899k • 170 • 1
passing2961/stark-image

Viewer • Updated Nov 6, 2024 • 1.72M • 66 • 3

DialogCC

General multi-modal conversation datasets

passing2961/dialogcc

Viewer • Updated Jun 24, 2024 • 83.4k • 50 • 10

Thanos

Skill-of-Mind-Infused LLM

passing2961/Thanos-1B

1B • Updated Nov 8, 2024 • 7
passing2961/Thanos-3B

3B • Updated Nov 8, 2024 • 7 • 4
passing2961/Thanos-8B

8B • Updated Nov 8, 2024 • 8 • 3
passing2961/multifaceted-skill-of-mind

Viewer • Updated Nov 8, 2024 • 100k • 69 • 5

Ultron

Multi-modal conversation model & Multi-modal dialogue summarization model

passing2961/Ultron-Summarizer-1B

1B • Updated Nov 6, 2024 • 3
passing2961/Ultron-Summarizer-3B

3B • Updated Nov 6, 2024 • 2 • 3
passing2961/Ultron-Summarizer-8B

8B • Updated Nov 6, 2024 • 4 • 2
passing2961/Ultron-11B

11B • Updated Nov 6, 2024 • 2 • 1

Multi-Turn Evaluation Benchmarks

A collection of benchmarks for evaluating LMs or VLMs under multi-turn interaction

passing2961/MultiVerse

Viewer • Updated Nov 1, 2025 • 647 • 65 • 1
passing2961/photochat_plus

Viewer • Updated Dec 3, 2024 • 968 • 67 • 4
RefineBench/RefineBench

Viewer • Updated Dec 2, 2025 • 1k • 1.24k • 5

Thanos

Skill-of-Mind-Infused LLM

passing2961/Thanos-1B

1B • Updated Nov 8, 2024 • 7
passing2961/Thanos-3B

3B • Updated Nov 8, 2024 • 7 • 4
passing2961/Thanos-8B

8B • Updated Nov 8, 2024 • 8 • 3
passing2961/multifaceted-skill-of-mind

Viewer • Updated Nov 8, 2024 • 100k • 69 • 5

Stark

Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

passing2961/stark-face-image

Viewer • Updated Nov 6, 2024 • 93.6k • 78 • 3
passing2961/stark-summary

Viewer • Updated Nov 6, 2024 • 53.3k • 76 • 2
passing2961/stark-image-url

Viewer • Updated Nov 6, 2024 • 899k • 170 • 1
passing2961/stark-image

Viewer • Updated Nov 6, 2024 • 1.72M • 66 • 3

Ultron

Multi-modal conversation model & Multi-modal dialogue summarization model

passing2961/Ultron-Summarizer-1B

1B • Updated Nov 6, 2024 • 3
passing2961/Ultron-Summarizer-3B

3B • Updated Nov 6, 2024 • 2 • 3
passing2961/Ultron-Summarizer-8B

8B • Updated Nov 6, 2024 • 4 • 2
passing2961/Ultron-11B

11B • Updated Nov 6, 2024 • 2 • 1

DialogCC

General multi-modal conversation datasets

passing2961/dialogcc

Viewer • Updated Jun 24, 2024 • 83.4k • 50 • 10