Discounted Sampling Policy Gradient for Robot Multi-objective Visual Control
Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | Evolutionary Multi-Criterion Optimization |
Subtitle of host publication | 11th International Conference, EMO 2021, Shenzhen, China, March 28–31, 2021, Proceedings |
Editors | Hisao Ishibuchi, Qingfu Zhang, Ran Cheng, Ke Li, Hui Li, Handing Wang, Aimin Zhou |
Place of Publication | Cham |
Publisher | Springer |
Pages | 441-452 |
Number of pages | 12 |
ISBN (Electronic) | 9783030720629 |
ISBN (Print) | 9783030720612 |
Publication status | Published - 28 Mar 2021 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 12654 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Title | 11th International Conference on Evolutionary Multi-Criterion Optimization (EMO 2021) |
---|---|
Location | Hampton by Hilton Hotel (on-site & on-line) |
Place | China |
City | Shenzhen |
Period | 28 - 31 March 2021 |
Link(s)
Abstract
Robot visual control often involves multiple objectives such as achieving high efficiency, maintaining stability, and avoiding failure. This paper proposes a novel Vision-Based Control method (VBC) with the Discounted Sampling Policy Gradient (DSPG) and Cosine Annealing (CA) to achieve excellent multi-objective control performance. In our proposed visual control framework, a DSPG learning agent is employed to learn a policy estimating continuous kinematics for VBC. The deep policy maps the visual observation to a specific action in an end-to-end manner. The DSPG agent finally can update the policy to obtain the optimal or near-optimal solution using shaped rewards from the environment. The proposed VBC-DSPG model is optimized using a heuristic method. Experimental results demonstrate that the proposed method performs very well compared with some classical competitors in the multi-objective visual control scenario.
Research Area(s)
- Multi-objective visual control, Kinematics, Discounted sampling policy gradient, Cosine annealing
Citation Format(s)
Discounted Sampling Policy Gradient for Robot Multi-objective Visual Control. / Xu, Meng; Zhang, Qingfu; Wang, Jianping.
Evolutionary Multi-Criterion Optimization: 11th International Conference, EMO 2021, Shenzhen, China, March 28–31, 2021, Proceedings. ed. / Hisao Ishibuchi; Qingfu Zhang; Ran Cheng; Ke Li; Hui Li; Handing Wang; Aimin Zhou. Cham : Springer, 2021. p. 441-452 (Lecture Notes in Computer Science; Vol. 12654).Research output: Chapters, Conference Papers, Creative and Literary Works (RGC: 12, 32, 41, 45) › 32_Refereed conference paper (with ISBN/ISSN) › peer-review