Papers
arxiv:2603.15388

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization

Published on Mar 16
Authors:
,
,
,

Abstract

Stackelberg PPO addresses morphology-control co-design by modeling their coupling as a Stackelberg game, improving training stability and efficiency over standard PPO.

AI-generated summary

Morphology-control co-design concerns the coupled optimization of an agent's body structure and control policy. This problem exhibits a bi-level structure, where the control dynamically adapts to the morphology to maximize performance. Existing methods typically neglect the control's adaptation dynamics by adopting a single-level formulation that treats the control policy as fixed when optimizing morphology. This can lead to inefficient optimization, as morphology updates may be misaligned with control adaptation. In this paper, we revisit the co-design problem from a game-theoretic perspective, modeling the intrinsic coupling between morphology and control as a novel variant of a Stackelberg game. We propose Stackelberg Proximal Policy Optimization (Stackelberg PPO), which explicitly incorporates the control's adaptation dynamics into morphology optimization. By modeling this intrinsic coupling, our method aligns morphology updates with control adaptation, thereby stabilizing training and improving learning efficiency. Experiments across diverse co-design tasks demonstrate that Stackelberg PPO outperforms standard PPO in both stability and final performance, opening the way for dramatically more efficient robotics designs.

Community

Using only the signal of box-forwarding speed, our Stackelberg PPO automatically evolves robots 🤖 with arms 💪🏻 for pushing and legs 🦵🏻 for moving.
The key idea is a novel game-theoretic view of structure–control co-design — yielding more effective optimization and dramatically better designs.
🎉 Accepted at ICLR 2026

💻 Code: https://github.com/YanningDai/StackelbergPPO
🌐 Project: https://yanningdai.github.io/stackelberg-ppo-co-design/

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2603.15388 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.15388 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.15388 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.