26 23

Phuong Pham

mp1704

AI & ML interests

None yet

Recent Activity

upvoted an article 5 days ago

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

liked a model 21 days ago

utter-project/mHuBERT-147

upvoted an article 22 days ago

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

View all activity

Organizations

upvoted an article 5 days ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Nov 5, 2025

•

liked a model 21 days ago

utter-project/mHuBERT-147

Feature Extraction • 94.4M • Updated Dec 19, 2024 • 28k • • 98

upvoted an article 22 days ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Dec 4, 2025

•

upvoted an article 25 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

234

liked a model 28 days ago

hynt/Zipformer-30M-RNNT-Streaming-6000h

Updated 28 days ago • 85 • 8

upvoted an article about 1 month ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

Jan 28

•

144

liked a dataset about 1 month ago

mazesmazes/sift-audio

Viewer • Updated Jan 29 • 293k • 663 • 4

liked a Space about 2 months ago

TTSDataset

🐠

Process audio files and create transcription datasets

updated a collection about 2 months ago

vi-audio

Collection

5 items • Updated Jan 12

liked 2 datasets about 2 months ago

pnnbao-ump/VieNeu-TTS-1000h

Viewer • Updated Nov 25, 2025 • 421k • 26 • 15

pnnbao-ump/VieNeu-TTS-140h

Viewer • Updated Nov 18, 2025 • 73.5k • 386 • 24

updated a collection about 2 months ago

vi-audio

Collection

5 items • Updated Jan 12

upvoted an article about 2 months ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

•

upvoted a paper 2 months ago

An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges

Paper • 2512.11362 • Published Dec 12, 2025 • 22

liked a Space 2 months ago

The Eiffel Tower Llama

📝

108

Explore the Eiffel Tower Llama experiment with open-source models

liked a dataset 2 months ago

nguyendv02/ViMD_Dataset

Viewer • Updated Jan 28 • 19k • 996 • 16

updated a collection 3 months ago

vi-audio

Collection

5 items • Updated Jan 12

liked a dataset 3 months ago

dolly-vn/dolly-audio-1000h-vietnamese

Viewer • Updated Nov 24, 2025 • 664k • 1.27k • 47

liked a Space 4 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Visualize on-policy distillation for any model family

upvoted an article 4 months ago

Article

Vision Language Models Explained

Apr 11, 2024

•

521

Phuong Pham

AI & ML interests

Recent Activity

Organizations

mp1704's activity

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

KV Caching Explained: Optimizing Transformer Inference Efficiency

We Got Claude to Build CUDA Kernels and teach open models!

TTSDataset

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

The Eiffel Tower Llama

Unlocking On-Policy Distillation for Any Model Family

Vision Language Models Explained