Abstract
Reinforcement learning (RL) has been widely adopted for intelligent decision-making in embodied agents due to its effective trial-and-error learning capabilities. However, most RL methods overlook the causal relationships among states and actions during policy exploration and lack the human-like ability to distinguish signal from noise and reason with important abstractions, resulting in poor sample efficiency. To address this issue, we propose a novel method named causal action empowerment (CAE) for efficient RL, designed to improve sample efficiency in policy learning for embodied agents. CAE identifies and leverages causal relationships among states, actions, and rewards to extract controllable state variables and reweight actions for prioritizing high-impact behaviors. Moreover, by integrating a causality-aware empowerment term, CAE significantly enhances an embodied agent’s execution of causally-aware behavior for more efficient exploration via boosting controllability in complex embodied environments. Benefiting from these two improvements, CAE bridges the gap between local causal discovery and global causal empowerment. To comprehensively evaluate the effectiveness of CAE, we conduct extensive experiments across 25 tasks in 5 diverse embodied environments, encompassing both locomotion and manipulation skill learning with dense and sparse reward settings. Experimental results demonstrate that CAE consistently outperforms existing methods across this wide range of scenarios, offering a promising avenue for improving sample efficiency in RL. © Science China Press 2025.
Original language | English |
---|---|
Article number | 150201 |
Journal | Science China Information Sciences |
Volume | 68 |
Issue number | 5 |
Online published | 24 Apr 2025 |
DOIs | |
Publication status | Published - May 2025 |
Funding
This work was supported in part by National Natural Science Foundation of China (Grant Nos. 62276128, 62192783), Jiangsu Science and Technology Major Project (Grant No. BG2024031), Natural Science Foundation of Jiangsu Province (Grant No. BK20243051), Fundamental Research Funds for the Central Universities (Grant No. 14380128), and Collaborative Innovation Center of Novel Software Technology and Industrialization.
Research Keywords
- causal
- embodied agents
- empowerment
- reinforcement learning
- sample efficiency