CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets Paper • 2406.13897 • Published May 30, 2024
BANG: Dividing 3D Assets via Generative Exploded Dynamics Paper • 2507.21493 • Published Jul 29, 2025
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Paper • 2502.12894 • Published Feb 18, 2025
X-Part: high fidelity and structure coherent shape decomposition Paper • 2509.08643 • Published Sep 10, 2025
PREF: Phasorial Embedding Fields for Compact Neural Representations Paper • 2205.13524 • Published May 26, 2022
Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces Paper • 2208.14851 • Published Aug 31, 2022
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs Paper • 2508.18264 • Published Aug 25, 2025
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published Aug 21, 2025
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning Paper • 2401.10727 • Published Jan 19, 2024
TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting Paper • 2204.01018 • Published Apr 3, 2022
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos Paper • 2303.12370 • Published Mar 22, 2023
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models Paper • 2506.08552 • Published Jun 10, 2025
Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives Paper • 2506.24124 • Published Jun 30, 2025
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering Paper • 2409.07441 • Published Sep 11, 2024
Efficient Detection of Toxic Prompts in Large Language Models Paper • 2408.11727 • Published Aug 21, 2024