Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
Article: Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation (Sep 16, 2025)
Article: You could have designed state of the art positional encoding (Nov 25, 2024)
Space: The Ultra-Scale Playbook 🌌, the ultimate guide to training LLMs on large GPU clusters