HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization
Accepted by IEEE Transactions on Medical Imaging.
It is an extended version of our AAAI paper "Weakly-Supervised Semantic Segmentation for Histopathology Images Based on Dataset Synthesis and Feature Consistency Constraint".
Tissue semantic segmentation is one of the key tasks in computational pathology. To avoid the expensive and laborious acquisition of pixel-level annotations, a wide range of studies attempt to adopt the class activation map (CAM), a weakly-supervised learning scheme, to achieve pixel-level tissue segmentation. However, CAM-based methods are prone to under-activation and over-activation issues, leading to poor segmentation performance. To address this problem, we propose a novel weakly-supervised semantic segmentation framework for histopathological images based on image-mixing synthesis and consistency regularization, dubbed HisynSeg. Specifically, synthesized histopathological images with pixel-level masks are generated for fully-supervised model training, where two synthesis strategies are proposed based on Mosaic transformation and Bézier mask generation. In addition, an image filtering module is developed to guarantee the authenticity of the synthesized images. To further prevent the model from overfitting to occasional synthesis artifacts, we additionally propose a novel self-supervised consistency regularization, which enables real images without segmentation masks to supervise the training of the segmentation model. By integrating the proposed techniques, the HisynSeg framework successfully transforms the weakly-supervised semantic segmentation problem into a fully-supervised one, greatly improving the segmentation accuracy. Experimental results on three datasets demonstrate that the proposed method achieves state-of-the-art performance. Code is available at https://github.com/Vison307/HisynSeg.
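For intuition, the consistency regularization lets real images without masks supervise training by requiring the model to produce matching predictions for two views of the same image. Below is a minimal PyTorch sketch of this general idea; the Gaussian perturbation, the stop-gradient on one branch, and the MSE loss form are our assumptions for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def consistency_loss(model, image, noise_std=0.05):
    """Penalize disagreement between predictions on a real (unlabeled)
    image and a slightly perturbed copy of it."""
    with torch.no_grad():
        target = torch.softmax(model(image), dim=1)  # (B, C, H, W), treated as fixed
    pred = torch.softmax(model(image + noise_std * torch.randn_like(image)), dim=1)
    return F.mse_loss(pred, target)
```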
Code tested on:
- Ubuntu 18.04
- A single NVIDIA GeForce RTX 3090
- Python 3.8
- PyTorch 1.12.1
- PyTorch Lightning 1.7.1
- Albumentations 1.2.1
- segmentation-models-pytorch 0.3.3
- timm 0.9.2
Please use the following command to install the dependencies:

```bash
conda env create -f environment.yaml
```

For more details, you can check `Dockerfile` and `requirements.in` for reference.
- Download the WSSS4LUAD dataset and put it in `./data/WSSS4LUAD`.
- Download the BCSS-WSSS dataset and put it in `./data/BCSS-WSSS` (thanks to Han et al.).
- Download the LUAD-HistoSeg dataset and put it in `./data/LUAD-HistoSeg` (thanks to Han et al.).
- Synthesize datasets with Mosaic transformation: run `./create_synthesis_datasets/mosaic_{wsss4luad|bcss|luad}.ipynb`.
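  For intuition, a minimal NumPy sketch of the Mosaic idea: stitch four single-label patches into a 2x2 grid, so the pixel-level mask is known by construction. The patch size, layout, and label encoding here are illustrative assumptions, not the notebook's exact implementation.

  ```python
  import numpy as np

  def mosaic_synthesize(patches, labels, s=112):
      """Stitch four single-label patches (each at least s x s x 3) into a
      2x2 mosaic; the pixel-level mask comes for free because each
      quadrant's image-level label is known."""
      image = np.zeros((2 * s, 2 * s, 3), dtype=np.uint8)
      mask = np.zeros((2 * s, 2 * s), dtype=np.uint8)
      for k, (patch, label) in enumerate(zip(patches, labels)):
          r, c = (k // 2) * s, (k % 2) * s
          image[r:r + s, c:c + s] = patch[:s, :s]
          mask[r:r + s, c:c + s] = label
      return image, mask
  ```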
- Synthesize datasets with Bézier mask generation: run `./create_synthesis_datasets/bezier_{wsss4luad|bcss|luad}.ipynb`.
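  A rough sketch of generating a smooth random closed mask from Bézier segments is shown below; the control-point sampling scheme and curve order are assumptions made for illustration, not the notebook's exact procedure.

  ```python
  import numpy as np
  from PIL import Image, ImageDraw

  def bezier_mask(size=224, n_ctrl=8, samples_per_seg=25, rng=None):
      """Rasterize a smooth random closed outline, built from quadratic
      Bezier segments through consecutive control-point midpoints, into
      a binary mask."""
      rng = rng or np.random.default_rng()
      # Random control points around the image center, ordered by angle
      angles = np.sort(rng.uniform(0, 2 * np.pi, n_ctrl))
      radii = rng.uniform(0.2, 0.45, n_ctrl) * size
      ctrl = np.stack([size / 2 + radii * np.cos(angles),
                       size / 2 + radii * np.sin(angles)], axis=1)
      mids = (ctrl + np.roll(ctrl, -1, axis=0)) / 2  # midpoints of consecutive points
      pts = []
      for i in range(n_ctrl):
          # Quadratic Bezier from one midpoint to the next, using the
          # control point itself as the handle; this keeps the curve closed
          p0, p1, p2 = mids[i - 1], ctrl[i], mids[i]
          t = np.linspace(0.0, 1.0, samples_per_seg)[:, None]
          pts.append((1 - t) ** 2 * p0 + 2 * (1 - t) * t * p1 + t ** 2 * p2)
      outline = [tuple(p) for p in np.concatenate(pts)]
      mask = Image.new("L", (size, size), 0)
      ImageDraw.Draw(mask).polygon(outline, fill=255)
      return np.array(mask) > 0
  ```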
- Train the synthesized image filtering module: run `./create_synthesis_datasets/discriminate_{wsss4luad|bcss|luad}.ipynb`.
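  Conceptually, the filtering module is a real-vs-synthesized classifier, and synthesized images it confidently flags as unrealistic are discarded. The sketch below illustrates this idea only; the backbone, score threshold, and interface are assumptions, not the repository's actual code.

  ```python
  import torch
  import timm

  def keep_realistic(classifier, images, threshold=0.5):
      """Keep synthesized images that a trained real-vs-synthesized
      classifier scores as realistic enough."""
      classifier.eval()
      with torch.no_grad():
          # Assume one logit per image, where positive means "looks real"
          scores = torch.sigmoid(classifier(images)).squeeze(1)
      return images[scores >= threshold]

  # Hypothetical usage:
  # classifier = timm.create_model("resnet18", num_classes=1)  # trained on real vs. synthesized
  # realistic = keep_realistic(classifier, synthesized_batch)
  ```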
- Obtain the filtered synthesized images:

  ```bash
  CUDA_VISIBLE_DEVICES=0 python ./create_synthesis_datasets/filter_mosaic_{bcss|luad}.py --run 0
  CUDA_VISIBLE_DEVICES=0 python ./create_synthesis_datasets/filter_mosaic_wsss4luad.py --run 0 --idx [0-9]
  CUDA_VISIBLE_DEVICES=0 python ./create_synthesis_datasets/filter_bezier_{wsss4luad|bcss|luad}.py --run 0
  ```

  NOTE: Generating the filtered synthesized images for WSSS4LUAD takes some time. To accelerate the process, use the `idx` argument to enable multiprocessing, as in the example below.
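  For example, one possible pattern (our assumption, not a script shipped with the repository) is to launch one process per `idx` shard on a single GPU:

  ```bash
  for i in $(seq 0 9); do
      CUDA_VISIBLE_DEVICES=0 python ./create_synthesis_datasets/filter_mosaic_wsss4luad.py --run 0 --idx $i &
  done
  wait
  ```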
Since the validation and test images of the WSSS4LUAD dataset do not share the same shape, we first split them with a multi-scale sliding-window strategy. For the validation set, we use a window size of 224 and a stride of 224; for the test set, we use a window size of 224 and a stride of 112. You can do this pre-processing by running `split_validation.ipynb`.
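For reference, the splitting is conceptually a standard sliding-window crop. The sketch below uses the window/stride values quoted above and is illustrative only (the notebook additionally handles multiple scales); it assumes images at least 224 px on each side.

```python
import numpy as np

def sliding_window_crops(image, window=224, stride=112):
    """Yield (row, col, crop) tuples covering an image (H, W, C) with a
    square sliding window; the last row/column of windows is shifted
    inward so every crop keeps the full window size."""
    h, w = image.shape[:2]
    rows = list(range(0, h - window + 1, stride)) or [0]
    cols = list(range(0, w - window + 1, stride)) or [0]
    if rows[-1] != h - window:
        rows.append(h - window)  # align the last window with the bottom edge
    if cols[-1] != w - window:
        cols.append(w - window)  # align the last window with the right edge
    for r in rows:
        for c in cols:
            yield r, c, image[r:r + window, c:c + window]
```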
Please check the `scripts` directory. For example, to train on the WSSS4LUAD dataset, run:

```bash
bash scripts/run-wsss4luad.sh
```
We tried our best to ensure the reproducibility of the results, but since the `torch.nn.functional.interpolate` function is not deterministic, the results may differ across runs if you train from scratch. If you want to fully reproduce the reported results, you can use the following Docker image with built-in weights, available on Baidu Disk (code: fjm9) or OneDrive, and then run:

```bash
docker load < hisynseg_test.tar.gz
docker run --gpus "device=0" --rm -it --shm-size 8G -v /path/to/your/data:/opt/app/data -v /path/to/your/outputs:/opt/app/outputs hisynseg:test
```

Make sure you have granted 777 permissions to the `./outputs` directory, for example as shown below.
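```bash
chmod -R 777 ./outputs
```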
If you find our work helpful, please cite our paper:
```bibtex
@article{fang2024hisynseg,
  title={HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization},
  author={Fang, Zijie and Wang, Yifeng and Xie, Peizhang and Wang, Zhi and Zhang, Yongbing},
  journal={IEEE Transactions on Medical Imaging},
  year={2024},
  publisher={IEEE}
}
```