Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation


Ritsuki Kurohiji (Wakayama University), Hirotaka Hachiya (Wakayama University)
The 35th British Machine Vision Conference

Abstract

We propose a novel depth-aware mirror and specular object segmentation framework that leverages the contextual inconsistency between two types of depth information: time-of-flight (ToF) measurements and depth predicted by a deep neural network. ToF depth values are often significantly overestimated near mirrors due to indirect light paths, while predicted depth maps tend to be overly smoothed in those regions. We observe that this contextual discrepancy serves as a strong cue for identifying mirror regions. However, direct comparison between these depths in either the image or feature space results in high false positive rates, primarily due to their inherently different value ranges and responses to scene content. To address this issue, we introduce a depth inconsistency-based spatial-channel Attention Gate (discAG), which adaptively highlights informative regions in ToF feature maps based on spatial- and channel-wise extracted depth context inconsistencies. By integrating discAG into a segmentation architecture, our method achieves superior performance compared to state-of-the-art approaches. Experimental results on the RGB-D Mirror Segmentation dataset and our newly constructed Specular Object dataset demonstrate the effectiveness of our approach, particularly in challenging scenarios involving small or frameless mirrors.

Citation

@inproceedings{Kurohiji_2025_BMVC,
author    = {Ritsuki Kurohiji and Hirotaka Hachiya},
title     = {Depth Inconsistency-based spatial-channel attention gate for Mirror Segmentation},
booktitle = {36th British Machine Vision Conference 2025, {BMVC} 2025, Sheffield, UK, November 24-27, 2025},
publisher = {BMVA},
year      = {2025},
url       = {https://bmva-archive.org.uk/bmvc/2025/assets/papers/Paper_391/paper.pdf}
}


Copyright © 2025 The British Machine Vision Association and Society for Pattern Recognition
The British Machine Vision Conference is organised by The British Machine Vision Association and Society for Pattern Recognition. The Association is a Company limited by guarantee, No.2543446, and a non-profit-making body, registered in England and Wales as Charity No.1002307 (Registered Office: Dept. of Computer Science, Durham University, South Road, Durham, DH1 3LE, UK).

Imprint | Data Protection