Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization
Abstract
Stackelberg PPO addresses morphology-control co-design by modeling their coupling as a Stackelberg game, improving training stability and efficiency over standard PPO.
Morphology-control co-design concerns the coupled optimization of an agent's body structure and control policy. This problem exhibits a bi-level structure, where the control dynamically adapts to the morphology to maximize performance. Existing methods typically neglect the control's adaptation dynamics by adopting a single-level formulation that treats the control policy as fixed when optimizing morphology. This can lead to inefficient optimization, as morphology updates may be misaligned with control adaptation. In this paper, we revisit the co-design problem from a game-theoretic perspective, modeling the intrinsic coupling between morphology and control as a novel variant of a Stackelberg game. We propose Stackelberg Proximal Policy Optimization (Stackelberg PPO), which explicitly incorporates the control's adaptation dynamics into morphology optimization. By modeling this intrinsic coupling, our method aligns morphology updates with control adaptation, thereby stabilizing training and improving learning efficiency. Experiments across diverse co-design tasks demonstrate that Stackelberg PPO outperforms standard PPO in both stability and final performance, opening the way for dramatically more efficient robotics designs.
Community
Using only the signal of box-forwarding speed, our Stackelberg PPO automatically evolves robots 🤖 with arms 💪🏻 for pushing and legs 🦵🏻 for moving.
The key idea is a novel game-theoretic view of structure–control co-design — yielding more effective optimization and dramatically better designs.
🎉 Accepted at ICLR 2026
💻 Code: https://github.com/YanningDai/StackelbergPPO
🌐 Project: https://yanningdai.github.io/stackelberg-ppo-co-design/
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper