Abstract
Dynamic Vision Sensor (DVS) is a compelling neuromorphic camera compared to conventional camera, but it suffers from fiercer noise. Due to the nature of irregular format and asynchronous readout, DVS data is always transformed into a regular tensor (e.g., 3D voxel or image) for deep learning method, which corrupts its own asynchronous properties. To maintain asynchronous, we establish an innovative asynchronous event denoise neural network, named AEDNet, which directly consumes the correlation of the irregular signal in spatial-temporal range without destroying its original structural property. Based on the property of continuation in temporal domain and discreteness in spatial domain, we decompose the DVS signal into two parts, i.e., temporal correlation and spatial affinity, and separately process these two parts. Our spatial feature embedding unit is a unique feature extraction module that extracts feature from event-level, which perfectly maintains its spatial-temporal correlation. To test effectiveness, we build a novel dataset named DVSCLEAN containing both simulated and real-world data. The experimental results of AEDNet achieve SOTA. © 2022 Association for Computing Machinery.
| Original language | English |
|---|---|
| Title of host publication | MM ’22 |
| Subtitle of host publication | Proceedings of the 30th ACM International Conference on Multimedia |
| Publisher | Association for Computing Machinery |
| Pages | 1427-1435 |
| ISBN (Print) | 9781450392037 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 30th ACM International Conference on Multimedia (MM 2022) - Lisbon, Portugal Duration: 10 Oct 2022 → 14 Oct 2022 https://2022.acmmm.org/ |
Publication series
| Name | MM - Proceedings of the ACM International Conference on Multimedia |
|---|
Conference
| Conference | 30th ACM International Conference on Multimedia (MM 2022) |
|---|---|
| Abbreviated title | ACM Multimedia 2022 |
| Place | Portugal |
| City | Lisbon |
| Period | 10/10/22 → 14/10/22 |
| Internet address |
Research Keywords
- datasets
- dynamic vision sensor
- event denoise
- event-based neural networks