multASD

university
https://plnguyen2908.github.io/
Activity Feed

AI & ML interests

None defined yet.

Le Thien Phuc Nguyen, Cao Quang Nhat Khoa, Yuwei Guo, Lucas Poon, Tu Ho Manh Pham, Kiet Pham, Toan Vo, JIE REN, Tuan Tai Nguyen, Anh Duc Duong

plnguyen2908 authored 2 papers 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

LASER: Lip Landmark Assisted Speaker Detection for Robustness

Paper • 2501.11899 • Published Jan 21, 2025

plnguyen2908 submitted a paper to Daily Papers 4 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

plnguyen2908 authored a paper 10 months ago

UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

Paper • 2505.21954 • Published May 28, 2025 • 1