FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Paper • 2510.10868 • Published Oct 13, 2025 • 13
When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains Paper • 2603.01301 • Published 24 days ago • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published Feb 11 • 15
PuzzleCraft Collection Qwen2.5-VL-3B & 7B models trained with PuzzleCraft • 9 items • Updated 6 days ago • 3
LoopFormer Collection Models trained in the ICLR2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated Feb 19 • 2
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network Paper • 2310.16288 • Published Oct 25, 2023
Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset Paper • 2306.11167 • Published Jun 19, 2023 • 2
Pain in 3D: Generating Controllable Synthetic Faces for Automated Pain Assessment Paper • 2509.16727 • Published Sep 20, 2025
SUM: Saliency Unification through Mamba for Visual Attention Modeling Paper • 2406.17815 • Published Jun 25, 2024 • 1
PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Paper • 2510.07546 • Published Oct 8, 2025 • 22