SEAL: Entangled White-box Watermarks on Low-Rank Adaptation Paper • 2501.09284 • Published Jan 16, 2025 • 10
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks Paper • 2505.11881 • Published May 17, 2025 • 4
MAVL: A Multilingual Audio-Video Lyrics Dataset for Animated Song Translation Paper • 2505.18614 • Published May 24, 2025
Running on CPU Upgrade Featured 3.03k The Smol Training Playbook 📚 3.03k The secrets to building world-class LLMs
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks Paper • 2505.11881 • Published May 17, 2025 • 4
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 143
FuseCodec: Semantic-Contextual Fusion and Supervision for Neural Codecs Paper • 2509.11425 • Published Sep 14, 2025 • 4
Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues Paper • 2506.00958 • Published Jun 1, 2025 • 20
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms Paper • 2503.14427 • Published Mar 18, 2025 • 19
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published Nov 4, 2024 • 34