AI & ML interests
Quantization
Recent Activity
Organizations
view article Getting More from Your Test-Time Compute Budget with Portfolio Beam Search
view article Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models


upvoted a paper over 1 year ago