| --- |
| license: apache-2.0 |
| tags: |
| - slidesparse |
| - sparse |
| - quantization |
| - int8 |
| - fp8 |
| - llama |
| - qwen |
| --- |
| |
| # SlideSparse Checkpoints |
|
|
| Pre-converted sparse model checkpoints using the **SlideSparse** technique. |
|
|
| ## Overview |
|
|
| This repository contains model weights converted with various sparsity configurations: |
| - **2:4** - Standard N:M sparsity (50% sparse) |
| - **2:6** - Extended sparsity (67% sparse) |
| - **2:8** - Higher sparsity (75% sparse) |
| - **2:10** - Maximum sparsity (80% sparse) |
|
|
| ## Models Included |
|
|
| | Base Model | Quantization | Sparsity Variants | |
| |------------|--------------|-------------------| |
| | Llama-3.2-1B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 | |
| | Llama-3.2-3B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 | |
| | Qwen2.5-7B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 | |
| | Qwen2.5-14B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 | |
|
|
| ## Source Models |
|
|
| These checkpoints are derived from: |
| - [RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8) |
| - [RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8) |
| - [RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8) |
| - [RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8) |
|
|
| ## License |
|
|
| - **Qwen models**: Apache 2.0 |
| - **Llama models**: Please refer to [Meta's Llama license](https://llama.meta.com/llama3/license/) |
|
|
| ## Usage |
|
|
| ```bash |
| # Download all checkpoints |
| huggingface-cli download bcacdwk/slidesparse-checkpoints --local-dir ./checkpoints_slidesparse |
| |
| # Download specific model |
| huggingface-cli download bcacdwk/slidesparse-checkpoints Llama3.2-1B-INT8-SlideSparse-2_4 --local-dir ./checkpoints_slidesparse/Llama3.2-1B-INT8-SlideSparse-2_4 |
| ``` |
|
|
| ## Citation |
|
|
| If you use these checkpoints, please cite the SlideSparse paper (coming soon). |
|
|