Skip to main navigation Skip to search Skip to main content

Imitation Learning from Purified Demonstrations

  • Yunke Wang
  • , Minjing Dong
  • , Yukun Zhao
  • , Bo Du*
  • , Chang Xu
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Imitation learning has emerged as a promising approach for addressing sequential decision-making problems, with the assumption that expert demonstrations are optimal. However, in real-world scenarios, most demonstrations are often imperfect, leading to challenges in the effectiveness of imitation learning. While existing research has focused on optimizing with imperfect demonstrations, the training typically requires a certain proportion of optimal demonstrations to guarantee performance. To tackle these problems, we propose to purify the potential noises in imperfect demonstrations first, and subsequently conduct imitation learning from these purified demonstrations. Motivated by the success of diffusion model, we introduce a two-step purification via diffusion process. In the first step, we apply a forward diffusion process to smooth potential noises in imperfect demonstrations by introducing additional noise. Subsequently, a reverse generative process is utilized to recover the optimal demonstration from the diffused ones. We provide theoretical evidence supporting our approach, demonstrating that the distance between the purified and optimal demonstration can be bounded. Empirical results on MuJoCo and RoboSuite demonstrate the effectiveness of our method from different aspects. Copyright 2024 by the author(s)
Original languageEnglish
Title of host publicationProceedings of the 41st International Conference on Machine Learning
EditorsRuslan Salakhutdinov, Zico Kolter, Katherine Heller
PublisherML Research Press
Pages50313-50331
Publication statusPublished - Jul 2024
Event41st International Conference on Machine Learning (ICML 2024) - Messe Wien Exhibition Congress Center, Vienna, Austria
Duration: 21 Jul 202427 Jul 2024
https://proceedings.mlr.press/v235/
https://icml.cc/

Publication series

NameProceedings of Machine Learning Research
Volume235
ISSN (Print)2640-3498

Conference

Conference41st International Conference on Machine Learning (ICML 2024)
PlaceAustria
CityVienna
Period21/07/2427/07/24
Internet address

Funding

This work was supported by the National Key Research and Development Program of China 2023YFC2705700, National Natural Science Foundation of China under Grants 62225113, the Innovative Research Group Project of Hubei Province under Grants 2024AFA017, the Australian Research Council under Projects DP210101859 and FT230100549, and the CityU APRC Project No. 9610680.

Fingerprint

Dive into the research topics of 'Imitation Learning from Purified Demonstrations'. Together they form a unique fingerprint.

Cite this