4 9 4

Xiangchendong

Xiang-cd

Xiang-cd

AI & ML interests

pre-train models

Recent Activity

authored a paper 1 day ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

upvoted a paper 1 day ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

upvoted a paper 1 day ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

View all activity

Organizations

None yet

authored a paper 1 day ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 8 days ago • 33

upvoted 2 papers 1 day ago

SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

Paper • 2602.13515 • Published 8 days ago • 33

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published 13 days ago • 8

authored a paper 3 days ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

Paper • 2602.07854 • Published 13 days ago • 8

upvoted a paper 19 days ago

Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation

Paper • 2602.02214 • Published 19 days ago • 24

liked a model about 1 month ago

Xiang-cd/vidar

Updated Dec 18, 2025 • 2

published a model 2 months ago

Xiang-cd/vidar

Updated Dec 18, 2025 • 2

updated a model 2 months ago

Xiang-cd/vidar

Updated Dec 18, 2025 • 2

upvoted an article 4 months ago

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

Jun 13, 2024

•

upvoted a paper 5 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 118

liked a dataset 9 months ago

waltsun/MOAT

Viewer • Updated Dec 21, 2025 • 1.01k • 50 • 6

upvoted a paper 9 months ago

SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published May 27, 2025 • 45

upvoted an article 9 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

450

New activity in Xiang-cd/sparge-attention-model-zoo 10 months ago

Add link to paper

#1 opened 10 months ago by

nielsr

updated a model 10 months ago

Xiang-cd/sparge-attention-model-zoo

Updated Apr 22, 2025 • 6

liked a model 10 months ago

Xiang-cd/sparge-attention-model-zoo

Updated Apr 22, 2025 • 6

published a model 11 months ago

Xiang-cd/sparge-attention-model-zoo

Updated Apr 22, 2025 • 6

upvoted a paper 12 months ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25, 2025 • 60

authored a paper 12 months ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25, 2025 • 60

upvoted a paper almost 2 years ago

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

Paper • 2403.05034 • Published Mar 8, 2024 • 21

Xiangchendong

AI & ML interests

Recent Activity

Organizations

Xiang-cd's activity

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

You could have designed state of the art positional encoding

Add link to paper