RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published 5 days ago • 6