Projects per year
Abstract
Unlike autonomous ground vehicles (AGVs), unmanned aerial vehicles (UAVs) have a higher dimensional configuration space, which makes the motion planning of multi-UAVs a challenging task. In addition, uncertainties and noises are more significant in UAV scenarios, which increases the difficulty of autonomous navigation for multi-UAV. In this letter, we proposed a two-stage reinforcement learning (RL) based multi-UAV collision avoidance approach without explicitly modeling the uncertainty and noise in the environment. Our goal is to train a policy to plan a collision-free trajectory by leveraging local noisy observations. However, the reinforcement learned collision avoidance policies usually suffer from high variance and low reproducibility, because unlike supervised learning, RL does not have a fixed training set with ground-truth labels. To address these issues, we introduced a two-stage training method for RL based collision avoidance. For the first stage, we optimize the policy using a supervised training method with a loss function that encourages the agent to follow the well-known reciprocal collision avoidance strategy. For the second stage, we use policy gradient to refine the policy. We validate our policy in a variety of simulated scenarios, and the extensive numerical simulations demonstrate that our policy can generate time-efficient and collision-free paths under imperfect sensing, and can well handle noisy local observations with unknown noise levels.
| Original language | English |
|---|---|
| Article number | 9001167 |
| Pages (from-to) | 3098-3105 |
| Journal | IEEE Robotics and Automation Letters |
| Volume | 5 |
| Issue number | 2 |
| Online published | 18 Feb 2020 |
| DOIs | |
| Publication status | Published - Apr 2020 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Research Keywords
- Collision avoidance
- deep learning in robotics and automation
Fingerprint
Dive into the research topics of 'A Two-Stage Reinforcement Learning Approach for Multi-UAV Collision Avoidance under Imperfect Sensing'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Fully-decentralized and Near-optimal Large-scale Multi-robot Collision Avoidance via Deep Learning
PAN, J. (Principal Investigator / Project Coordinator)
1/01/19 → 2/01/19
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver