AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models Paper • 2410.18325 • Published Oct 23, 2024 • 3
LaughTalk: Expressive 3D Talking Head Generation with Laughter Paper • 2311.00994 • Published Nov 2, 2023 • 3
FedPara: Low-Rank Hadamard Product for Communication-Efficient Federated Learning Paper • 2108.06098 • Published Aug 13, 2021 • 3
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset Paper • 2406.14272 • Published Jun 20, 2024 • 3
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers Paper • 2505.00482 • Published May 1, 2025 • 2
AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation Paper • 2504.20629 • Published Apr 29, 2025 • 1
Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration Paper • 2502.16652 • Published Feb 23, 2025 • 1
Scratching Visual Transformer's Back with Uniform Attention Paper • 2210.08457 • Published Oct 16, 2022 • 1
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published Mar 26, 2025 • 23