3 6

Fuyang Cui

scottcfy

https://www.cs.toronto.edu/~scottc/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

liked a dataset 25 days ago

stepfun-ai/Step-3.5-Flash-SFT

liked a model 4 months ago

deepseek-ai/DeepSeek-Math-V2

View all activity

Organizations

None yet

upvoted a paper 1 day ago

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 8 days ago • 33

liked a dataset 25 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 26 days ago • 1.62M • 57.9k • 309

liked a model 4 months ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • Updated Nov 27, 2025 • 2.96k • 687

liked a model 6 months ago

nvidia/omni-embed-nemotron-3b

liked a dataset 6 months ago

nvidia/TechQA-RAG-Eval

Viewer • Updated May 27, 2025 • 910 • 236 • 4

liked a Space 6 months ago

MTEB Leaderboard

🥇

7.24k

Embedding Leaderboard

updated a model 6 months ago

scottcfy/Qwen2-VL-2B-Instruct-pdf2latex

Image-Text-to-Text • 2B • Updated Oct 4, 2025 • 1

published a model 6 months ago

scottcfy/Qwen2-VL-2B-Instruct-pdf2latex

Image-Text-to-Text • 2B • Updated Oct 4, 2025 • 1

authored a paper 6 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29, 2025 • 12

upvoted a paper 6 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29, 2025 • 12

liked a dataset about 1 year ago

openai/openai_humaneval

Viewer • Updated Jan 4, 2024 • 164 • 227k • 378

authored a paper over 1 year ago

Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries

Paper • 2409.00844 • Published Sep 1, 2024 • 12

upvoted a paper over 1 year ago