16 9 6

Le Thien Phuc Nguyen

plnguyen2908

https://plnguyen2908.github.io/

plnguyen2908

AI & ML interests

Computer Vision, NLP, Applied AI

Recent Activity

upvoted a paper 10 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

published a dataset 16 days ago

plnguyen2908/AVHBench_clone

upvoted a collection 25 days ago

VideoLLaMA2

View all activity

Organizations

upvoted a paper 10 days ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published 13 days ago • 24

upvoted a collection 25 days ago

VideoLLaMA2

Collection

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 13 items • Updated Sep 2, 2025 • 20

upvoted a collection 3 months ago

VisionLM

Collection

1884 items • Updated Jan 12 • 146

upvoted an article 4 months ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

110

upvoted 2 papers 5 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 25

upvoted an article 5 months ago

Article

The Annotated Diffusion Model

Jun 7, 2022

•

348

upvoted a paper 7 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

upvoted a paper 11 months ago

UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios

Paper • 2505.21954 • Published May 28, 2025 • 1