spectacle

spectaclecs

AI & ML interests

Multimodal LLM, Agent

Recent Activity

upvoted a paper 11 days ago

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published 20 days ago • 52

upvoted a collection 18 days ago

Qwen3-Next

4 items • Updated Dec 31, 2025 • 185

upvoted a paper 24 days ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published 25 days ago • 57

upvoted a collection about 1 month ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.71k

upvoted a paper about 1 month ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 45

upvoted a collection 2 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 531

upvoted 3 papers 3 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 260

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Paper • 2412.10302 • Published Dec 13, 2024 • 22

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 91

upvoted a paper 4 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

upvoted a collection 4 months ago

CapRL

Data & Models for CapRL1.0 series & 2.0 series • 10 items • Updated Dec 25, 2025 • 6

upvoted a paper 7 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

upvoted an article over 1 year ago

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

Article • Oct 21, 2022 • 43

upvoted a collection over 1 year ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated about 1 month ago • 80

upvoted an article over 1 year ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 292

upvoted a collection over 1 year ago

MiniCPM-o & MiniCPM-V

Multimodal models with leading performance. • 29 items • Updated 5 days ago • 71