23 5

Austin Liu

Austin362667

austin362667

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

updated a model 20 days ago

Austin362667/Qwen3-1.7B-MLX-bf16-python-18k-alpaca

updated a model 20 days ago

Austin362667/Qwen3-0.6B-MLX-bf16-python-18k-alpaca

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 3 days ago • 23

upvoted an article 27 days ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

•

upvoted an article 28 days ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

•

128

upvoted a collection 30 days ago

SiliconMind-V1

Collection

4 items • Updated Feb 11 • 2

upvoted 2 articles about 2 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025

•

Article

KV Cache from scratch in nanoVLM

Jun 4, 2025

•

114

upvoted an article 2 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

upvoted an article 4 months ago

Article

Continuous batching from first principles

Nov 25, 2025

•

354

upvoted 2 articles 6 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

Sep 2, 2024

•

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

276

upvoted 4 articles 8 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Jun 11, 2024

•

Article

Parquet Content-Defined Chunking

Jul 25, 2025

•

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3, 2025

•

342

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23, 2025

•

upvoted 2 articles 9 months ago

Article

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Jun 28, 2025

•

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20, 2025

•

333

upvoted 2 articles 10 months ago

Article

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

Jun 17, 2025

•

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted 2 articles about 1 year ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

•

132

Article

Mixture of Experts Explained

Dec 11, 2023

•

1.11k

Austin Liu

AI & ML interests

Recent Activity

Organizations

Austin362667's activity

Assisted Generation: a new direction toward low-latency text generation

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

KV Cache from scratch in nanoVLM

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Continuous batching from first principles

Key Insights into the Law of Vision Representations in MLLMs

KV Caching Explained: Optimizing Transformer Inference Efficiency

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Parquet Content-Defined Chunking

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

TimeScope: How Long Can Your Video Large Multimodal Model Go?

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

SmolVLM2: Bringing Video Understanding to Every Device

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

🐯 Liger GRPO meets TRL

Introduction to 3D Gaussian Splatting

Mixture of Experts Explained