ADStereo: Efficient Stereo Matching with Adaptive Downsampling and Disparity Alignment

Yun Wang, Kunhong Li, Longguang Wang, Junjie Hu, Dapeng Oliver Wu, Yulan Guo*

*Corresponding author for this work

Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review

1 Citation (Scopus)

Abstract

Balancing accuracy and computational efficiency is crucial for deploying deep learning-based stereo matching algorithms in real-world scenarios. Since matching cost aggregation is usually the most computationally expensive component, a common practice is to construct cost volumes at a low resolution for aggregation and then directly regress a high-resolution disparity map. However, current solutions often suffer from two limitations: the loss of discriminative features caused by downsampling operations that treat all pixels equally, and spatial misalignment resulting from repeated downsampling and upsampling. To overcome these challenges, this paper presents two sampling strategies, the Adaptive Downsampling Module (ADM) and the Disparity Alignment Module (DAM), which prioritize real-time inference while preserving accuracy. The ADM leverages local features to learn adaptive weights, enabling more effective downsampling while preserving crucial structural information. The DAM employs a learnable interpolation strategy to predict transformation offsets of pixels, thereby mitigating the spatial misalignment issue. Building upon these modules, we introduce ADStereo, a real-time yet accurate network that achieves highly competitive performance on multiple public benchmarks. Specifically, ADStereo runs over 5× faster than the current state-of-the-art CREStereo (0.054 s vs. 0.29 s) on the same hardware while achieving comparable accuracy (1.82% vs. 1.69%) on the KITTI stereo 2015 benchmark. The code is available at: https://github.com/cocowy1/ADStereo.
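The two sampling ideas in the abstract can be illustrated with a toy sketch. This is not the authors' implementation (see the linked repository for that); it only shows the general pattern of weighted downsampling and offset-guided upsampling. The function names, the `logits` and `offsets` inputs (stand-ins for what a learned layer would predict), the softmax weighting, the linear interpolation, and the scaling of disparity by the upsampling factor are all assumptions made for illustration.

```python
import math


def adaptive_downsample(feat, logits, k=2):
    """Downsample a 2D map by k with per-pixel softmax weights per k x k window.

    `logits` is a hypothetical stand-in for scores a network would predict
    from local features; here they are simply passed in by the caller.
    """
    h, w = len(feat), len(feat[0])
    out = []
    for i in range(0, h - h % k, k):
        row = []
        for j in range(0, w - w % k, k):
            cells = [(i + di, j + dj) for di in range(k) for dj in range(k)]
            # Numerically stable softmax over the window's logits.
            m = max(logits[y][x] for y, x in cells)
            exps = [math.exp(logits[y][x] - m) for y, x in cells]
            total = sum(exps)
            row.append(sum(e / total * feat[y][x] for e, (y, x) in zip(exps, cells)))
        out.append(row)
    return out


def aligned_upsample_row(disp_low, offsets, scale=2):
    """Upsample one disparity row by `scale`, shifting each sample by an offset.

    `offsets[x]` (hypothetical, in low-resolution pixels) moves the sampling
    position of output pixel x before linear interpolation. Disparity values
    are multiplied by `scale`, assuming disparity is measured in pixels.
    """
    n = len(disp_low)
    out = []
    for x in range(n * scale):
        pos = (x + 0.5) / scale - 0.5 + offsets[x]
        pos = min(max(pos, 0.0), n - 1.0)  # clamp to the valid range
        x0 = int(math.floor(pos))
        x1 = min(x0 + 1, n - 1)
        t = pos - x0
        out.append(scale * ((1 - t) * disp_low[x0] + t * disp_low[x1]))
    return out
```

With all-zero logits the first function reduces to plain average pooling, which treats every pixel equally; a large logit on one pixel makes the output follow that pixel instead. With all-zero offsets the second function is ordinary linear upsampling; nonzero offsets shift where each output pixel reads from.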

© 2025 IEEE. All rights reserved, including rights for text and data mining, and training of artificial intelligence and similar technologies. Personal use is permitted, but republication/redistribution requires IEEE permission.
Original language: English
Pages (from-to): 1204-1218
Journal: IEEE Transactions on Image Processing
Volume: 34
Online published: 14 Feb 2025
Publication status: Published - 2025

Research Keywords

  • Depth estimation
  • disparity estimation
  • stereo matching
