Abstract
Imitation learning has emerged as a promising approach for addressing sequential decision-making problems, with the assumption that expert demonstrations are optimal. However, in real-world scenarios, most demonstrations are often imperfect, leading to challenges in the effectiveness of imitation learning. While existing research has focused on optimizing with imperfect demonstrations, the training typically requires a certain proportion of optimal demonstrations to guarantee performance. To tackle these problems, we propose to purify the potential noises in imperfect demonstrations first, and subsequently conduct imitation learning from these purified demonstrations. Motivated by the success of diffusion model, we introduce a two-step purification via diffusion process. In the first step, we apply a forward diffusion process to smooth potential noises in imperfect demonstrations by introducing additional noise. Subsequently, a reverse generative process is utilized to recover the optimal demonstration from the diffused ones. We provide theoretical evidence supporting our approach, demonstrating that the distance between the purified and optimal demonstration can be bounded. Empirical results on MuJoCo and RoboSuite demonstrate the effectiveness of our method from different aspects. Copyright 2024 by the author(s)
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 41st International Conference on Machine Learning |
| Editors | Ruslan Salakhutdinov, Zico Kolter, Katherine Heller |
| Publisher | ML Research Press |
| Pages | 50313-50331 |
| Publication status | Published - Jul 2024 |
| Event | 41st International Conference on Machine Learning (ICML 2024) - Messe Wien Exhibition Congress Center, Vienna, Austria Duration: 21 Jul 2024 → 27 Jul 2024 https://proceedings.mlr.press/v235/ https://icml.cc/ |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 235 |
| ISSN (Print) | 2640-3498 |
Conference
| Conference | 41st International Conference on Machine Learning (ICML 2024) |
|---|---|
| Place | Austria |
| City | Vienna |
| Period | 21/07/24 → 27/07/24 |
| Internet address |
Funding
This work was supported by the National Key Research and Development Program of China 2023YFC2705700, National Natural Science Foundation of China under Grants 62225113, the Innovative Research Group Project of Hubei Province under Grants 2024AFA017, the Australian Research Council under Projects DP210101859 and FT230100549, and the CityU APRC Project No. 9610680.
Fingerprint
Dive into the research topics of 'Imitation Learning from Purified Demonstrations'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver