Abstract
Semantic bird-eye-view (BEV) map is an efficient data representation for environment perception in autonomous driving. In real driving scenarios, the collected sensory data usually exhibit class imbalance. For example, road layouts are often the majority classes and road objects are the minority. Such imbalanced data could lead to inferior performance in BEV map generation, particularly for minority objects due to insufficient learning samples. This work attempts to mitigate this issue from the perspective of network and loss function design. To this end, a diffusion-guided semantic BEV map generation network with a boundary-aware loss is proposed. The network learns the underlying distribution of the data, including the relationship between majority and minority classes. The boundary-aware loss increases weighting for minority classes during training, making the network focus on these classes. Experimental results on a public dataset demonstrate our superiority over the state-of-the-art methods, and our effectiveness in addressing the class imbalance issue. © 2025 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 10188-10198 |
| Number of pages | 11 |
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| Volume | 35 |
| Issue number | 10 |
| Online published | 24 Apr 2025 |
| DOIs | |
| Publication status | Published - Oct 2025 |
Funding
This work was supported in part by Guangdong Basic and Applied Basic Research Foundation under Grant 2022A1515010116, and in part by City University of Hong Kong under Grant 9610675.
Research Keywords
- Autonomous Driving
- Class Imbalance
- Semantic BEV Map
- Semantic Scene Understanding
Fingerprint
Dive into the research topics of 'Boundary-aware Semantic Bird-Eye-View Map Generation based on Conditional Diffusion Models'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver