Abstract
Glass surfaces challenge object detection models as they mix the transmitted background with the reflected surrounding, creating confusing visual patterns. Previous methods relying on low-level cues (e.g., reflections and boundaries) or surrounding semantics are often unreliable in complex real-world scenarios. A glass image inherently comprises three distinct semantic components: semantics of the transmitted content, semantics of the reflected content, and semantics of the surrounding content. In this work, we observe that there is a relationship among these three types of semantics, where reflection semantics closely resembles surrounding semantics, while these two types of semantics tend to be different from the transmission semantics. For example, when on a street, we may see into a cafeteria through a glass wall, intermixed with reflection of the street, while the glass is surrounded by other street contents like shops and pedestrians, thereby creating a unique multi-semantic signature. Based on this observation, we propose the Multi-Semantic Net, MSNet, which identifies transmission, reflection, and surrounding semantics from glass images and exploits their relationships for glass surface detection. MSNet consists of two novel modules: (1) A Semantic Decomposition Module (SDM) containing Dual-Semantics Extraction Block to extract original image and reflection semantics and Semantic Elimination Block to progressively derive transmission and surrounding semantics, and (2) An Adaptive Semantic Fusion Module (ASFM) to fuse these semantic components and adaptively learn their relationships to handle varying reflection conditions. Extensive experiments demonstrate that MSNet surpasses SOTA methods on public glass detection benchmarks. © 2026, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 40th Annual AAAI Conference on Artificial Intelligence |
| Editors | Sven Koenig, Chad Jenkins, Matthew E. Taylor |
| Place of Publication | Washington, DC |
| Publisher | AAAI Press |
| Pages | 3273-3281 |
| Number of pages | 9 |
| ISBN (Print) | 978-1-57735-906-7, 1-57735-906-2 |
| DOIs | |
| Publication status | Published - 2026 |
| Event | 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026) - Singapore EXPO, Singapore Duration: 20 Jan 2026 → 27 Jan 2026 https://aaai.org/conference/aaai/aaai-26/ |
Publication series
| Name | Proceedings of the AAAI Conference on Artificial Intelligence |
|---|---|
| Publisher | Association for the Advancement of Artificial Intelligence |
| Number | 5 |
| Volume | 40 |
| ISSN (Print) | 2159-5399 |
| ISSN (Electronic) | 2374-3468 |
Conference
| Conference | 40th Annual AAAI Conference on Artificial Intelligence (AAAI 2026) |
|---|---|
| Abbreviated title | AAAI-26 |
| Place | Singapore |
| Period | 20/01/26 → 27/01/26 |
| Internet address |
Funding
This work is in part supported by two GRFs from the Research Grants Council of Hong Kong (Ref: 11211223 and 11220724).
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Multi-Semantic Modeling for Glass Surface Detection in the Wild'. Together they form a unique fingerprint.Projects
- 2 Active
-
GRF: Mirror Detection with Intrinsic Mirror Cues
LAU, R. W. H. (Principal Investigator / Project Coordinator) & HANCKE, G. P. (Co-Investigator)
1/01/25 → …
Project: Research
-
GRF: Analyses of the Reflection Cues for Glass Surface Detection
LAU, R. W. H. (Principal Investigator / Project Coordinator) & WAH, B. W. S. (Co-Investigator)
1/10/23 → …
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver