Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published Feb 23 • 5