
[WACV 2025] Feature-level and Spatial-level Activation Expansion for Weakly-Supervised Semantic Segmentation


obeychoi0120/FSAE


Feature-level and Spatial-level Activation Expansion for Weakly-Supervised Semantic Segmentation [WACV 2025]

This repository is the official implementation of "Feature-level and Spatial-level Activation Expansion for Weakly-Supervised Semantic Segmentation", accepted to WACV 2025.

Abstract

Weakly-supervised Semantic Segmentation (WSSS) aims to provide precise semantic segmentation results without expensive pixel-wise segmentation labels. Given the supervision gap between classification and segmentation, image-level WSSS mainly relies on Class Activation Maps (CAMs) from the classification model to emulate pixel-wise annotations. However, CAMs often fail to cover the entire object region because classification models tend to focus on narrow discriminative regions of an object. To improve CAM coverage, existing WSSS methods have tried to boost feature representation learning or impose consistency regularization on the classification models, but they remain limited in activating non-discriminative areas, where the model's focus is weak. To tackle this issue, we propose the FSAE framework, which provides explicit supervision for non-discriminative areas, encouraging the CAMs to activate on various object features. We leverage weak-strong consistency with a pseudo-label expansion strategy to obtain reliable supervision and to enhance learning of non-discriminative object boundaries. Specifically, we use strong perturbation to create a challenging inference target, and focus on generating reliable pixel-wise supervision signals for broad object regions. Extensive experiments on WSSS benchmark datasets show that our method boosts initial seed quality and segmentation performance by a large margin, achieving new state-of-the-art performance.

Setup (WIP)

Environments

Our method was tested on a single NVIDIA RTX 3090 with CUDA 11.7 on Ubuntu 18.04. Essential packages are as below:

For maximum reproducibility, we recommend creating a separate virtualenv for each base model from the forked branches in this repository,

or following the setup details in the original repositories:

Datasets (WIP)

PASCAL VOC 2012

  • Download PASCAL VOC 2012 from this link.
    wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar
    tar -xvf VOCtrainval_11-May-2012.tar
  • Download the augmented training set, SegmentationClassAug.zip, from the SBD dataset via this link.
  • The directory structure should look like this:
    ├── VOCdevkit/
    │    └── VOC2012
    │     ├── Annotations
    │     ├── ImageSets
    │     ├── JPEGImages
    │     ├── SegmentationClass
    │     ├── SegmentationClassAug
    │     └── SegmentationObject
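
A quick way to confirm the layout above before training is to check that each listed folder exists. The following is a minimal sketch, not part of this repository: the helper name `missing_voc_dirs` and the constant `EXPECTED_VOC_DIRS` are hypothetical, and it assumes you pass the path to your own `VOCdevkit` folder.

```python
from pathlib import Path

# Sub-directories of VOC2012 listed in the layout above.
EXPECTED_VOC_DIRS = (
    "Annotations",
    "ImageSets",
    "JPEGImages",
    "SegmentationClass",
    "SegmentationClassAug",
    "SegmentationObject",
)


def missing_voc_dirs(devkit_root):
    """Return the expected VOC2012 sub-directories that are absent.

    devkit_root is the path to the VOCdevkit folder (hypothetical helper,
    not provided by this repository).
    """
    voc_root = Path(devkit_root) / "VOC2012"
    return [name for name in EXPECTED_VOC_DIRS if not (voc_root / name).is_dir()]
```

Calling `missing_voc_dirs("VOCdevkit")` returns an empty list when every folder is in place; a typical omission is `SegmentationClassAug`, which comes from the separate SBD download.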

MS COCO 2014

  • Download MS COCO 2014 dataset
    wget http://images.cocodataset.org/zips/train2014.zip
    wget http://images.cocodataset.org/zips/val2014.zip
  • The directory structure should look like this:
    ├── COCO2014/
    │   ├── train/              
    │   │   ├── image/     
    │   │   ├── mask/        
    │   │   └── xml/
    │   └── validation/
    │       ├── image/     
    │       ├── mask/        
    │       └── xml/
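
The downloaded archives still have to be extracted and arranged into the tree above. As a starting point, the sketch below only creates the empty folder skeleton; it does not move images or generate the `mask/` and `xml/` contents, since this README does not describe how those are produced. The function name `make_coco_skeleton` is hypothetical and not part of this repository.

```python
from pathlib import Path

# Split and sub-folder names taken from the layout above.
SPLITS = ("train", "validation")
SUBDIRS = ("image", "mask", "xml")


def make_coco_skeleton(parent="."):
    """Create COCO2014/<split>/<subdir> under parent and return the root path.

    Existing folders are left untouched, so the call is safe to repeat.
    """
    root = Path(parent) / "COCO2014"
    for split in SPLITS:
        for sub in SUBDIRS:
            (root / split / sub).mkdir(parents=True, exist_ok=True)
    return root
```

Running `make_coco_skeleton()` in the dataset directory gives you the six leaf folders to populate from the extracted archives.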

Usage

For usage instructions, please see the other branches of this repository.

Performance

An asterisk (*) denotes reproduced results.

  • PASCAL VOC 2012

    Base model        Mask mIoU (%)   val mIoU (%)   test mIoU (%)
    PPC               73.3            74.4           75.0
    PPC+Ours          77.0            74.4           75.0
    SIPE              64.7            68.8           73.6
    SIPE+Ours         70.5            69.9           71.2
    MCTFormer*        67.9            69.0           69.8
    MCTFormer+Ours    68.8            69.8           70.5
  • COCO 2014

    Base model    val mIoU (%)
    PPC*          33.7
    PPC+Ours      35.4
    SIPE          39.3
    SIPE+Ours     39.5
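
For readers unfamiliar with the metric in the tables above, mIoU is the per-class intersection-over-union averaged over classes. The sketch below is a generic illustration of that definition, not the evaluation script used for these results. The function name `mean_iou` is hypothetical, labels are assumed to be flat sequences of integer class indices, and `ignore_index=255` assumes the common VOC convention for void pixels.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean intersection-over-union over classes present in pred or target.

    pred and target are equal-length flat sequences of integer class labels.
    Pixels whose target equals ignore_index are skipped (an assumption here,
    following the usual VOC void label).
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        if p == t:
            # Correct pixel: counts once in both intersection and union.
            intersection[t] += 1
            union[t] += 1
        else:
            # Wrong pixel: enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0
```

For example, `mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)` averages an IoU of 1/2 for class 0 and 2/3 for class 1. Benchmark numbers are normally accumulated over all pixels of the evaluation set rather than averaged per image, and are reported as percentages.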
