nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B
Text Generation
•
4B
•
Updated
•
6
None defined yet.
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models