WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors
dc.citation.volume | 154 | |
dc.contributor.author | Wang Y | |
dc.contributor.author | Wang R | |
dc.contributor.author | He X | |
dc.contributor.author | Lin C | |
dc.contributor.author | Wang T | |
dc.contributor.author | Jia Q | |
dc.contributor.author | Fan X | |
dc.date.accessioned | 2024-08-08T23:49:02Z | |
dc.date.available | 2024-08-08T23:49:02Z | |
dc.date.issued | 2024-10 | |
dc.description.abstract | Weakly supervised salient object detection (WSOD) methods endeavor to boost sparse labels to get more salient cues in various ways. Among them, an effective approach is using pseudo labels from multiple unsupervised self-learning methods, but inaccurate and inconsistent pseudo labels could ultimately lead to detection performance degradation. To tackle this problem, we develop a new multi-source WSOD framework, WBNet, that can effectively utilize pseudo-background (non-salient region) labels combined with scribble labels to obtain more accurate salient features. We first design a comprehensive salient pseudo-mask generator from multiple self-learning features. Then, we pioneer the exploration of generating salient pseudo-labels via point-prompted and box-prompted Segment Anything Models (SAM). Then, WBNet leverages a pixel-level Feature Aggregation Module (FAM), a mask-level Transformer-decoder (TFD), and an auxiliary Boundary Prediction Module (EPM) with a hybrid loss function to handle complex saliency detection tasks. Comprehensively evaluated with state-of-the-art methods on five widely used datasets, the proposed method significantly improves saliency detection performance. The code and results are publicly available at https://github.com/yiwangtz/WBNet. | |
dc.description.confidential | false | |
dc.edition.edition | October 2024 | |
dc.identifier.citation | Wang Y, Wang R, He X, Lin C, Wang T, Jia Q, Fan X. (2024). WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors. Pattern Recognition. 154. | |
dc.identifier.doi | 10.1016/j.patcog.2024.110579 | |
dc.identifier.eissn | 1873-5142 | |
dc.identifier.elements-type | journal-article | |
dc.identifier.issn | 0031-3203 | |
dc.identifier.number | 110579 | |
dc.identifier.pii | S0031320324003303 | |
dc.identifier.uri | https://mro.massey.ac.nz/handle/10179/71246 | |
dc.language | English | |
dc.publisher | Elsevier Ltd | |
dc.publisher.uri | https://www.sciencedirect.com/science/article/pii/S0031320324003303 | |
dc.relation.isPartOf | Pattern Recognition | |
dc.rights | (c) 2024 The Author/s | |
dc.rights | CC BY 4.0 | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Weakly supervision | |
dc.subject | Salient object detection | |
dc.subject | Neural networks | |
dc.subject | Transformer | |
dc.subject | Pseudo labels | |
dc.subject | Scribble labels | |
dc.title | WBNet: Weakly-supervised salient object detection via scribble and pseudo-background priors | |
dc.type | Journal article | |
pubs.elements-id | 489036 | |
pubs.organisational-group | Other |