wenhua cheng
wenhuach
AI & ML interests
Model Compression, CV
Recent Activity
reacted to theirpost with π₯ about 2 hours ago
We demonstrate that AutoRound achieves SOTA or near SOTA performance under INT4 (W4A4) quantization.
Check out the accuracy data at https://github.com/intel/auto-round/blob/main/docs/int4_acc.md
This capability is currently a research-only feature, with no production model export. posted an update about 2 hours ago
We demonstrate that AutoRound achieves SOTA or near SOTA performance under INT4 (W4A4) quantization.
Check out the accuracy data at https://github.com/intel/auto-round/blob/main/docs/int4_acc.md
This capability is currently a research-only feature, with no production model export. new activity about 4 hours ago
Intel/gemma-4-26B-A4B-it-int4-mixed-AutoRound:any plan for an Ampere compatible version?