This is the PyTorch implementation of the CVPR 2024 paper "Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation" and of its extended version, "ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation".
pip install torch torchvision
# We use python==3.9, torch==1.11.0, and torchvision==0.12.0
pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git
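To confirm the environment is set up correctly, a quick sanity check along these lines can help (a minimal sketch; the ViT-B/16 backbone here is an assumption matching the `ViT16` option used below):

```python
import torch
import clip

# Assumption: "ViT16" in the scripts below corresponds to CLIP's ViT-B/16 backbone.
device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/16", device=device)
print("CLIP loaded; input resolution:", model.visual.input_resolution)
```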
Maskclip
├── data
│ ├── VOCdevkit
│ │ ├── VOC2012
│ │ │ ├── JPEGImages
│ │ │ ├── SegmentationClass
│ │ │ ├── ImageSets
│ │ │ │ ├── Segmentation
│ │ ├── VOC2010
│ │ │ ├── JPEGImages
│ │ │ ├── SegmentationClassContext
│ │ │ ├── ImageSets
│ │ │ │ ├── SegmentationContext
│ │ │ │ │ ├── train.txt
│ │ │ │ │ ├── val.txt
│ │ │ ├── trainval_merged.json
│ ├── ADEChallengeData2016
│ │ ├── annotations
│ │ │ ├── training
│ │ │ ├── validation
│ │ ├── images
│ │ │ ├── training
│ │ │ ├── validation
│ ├── Cityscapes
│ │ ├── gtFine
│ │ │ ├── train
│ │ │ ├── val
│ │ ├── leftImg8bit
│ │ │ ├── train
│ │ │ ├── val
│ ├── coco_stuff164k
│ │ ├── images
│ │ │ ├── train2017
│ │ │ ├── val2017
│ │ ├── annotations
│ │ │ ├── train2017
│ │ │ ├── val2017
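Before training, it can be useful to verify that the datasets are laid out as shown above. The helper below is not part of this repo, just a minimal sketch using paths from the tree:

```python
from pathlib import Path

# Directories taken from the tree above; extend the list for the datasets you actually use.
required = [
    "data/VOCdevkit/VOC2012/JPEGImages",
    "data/VOCdevkit/VOC2012/SegmentationClass",
    "data/ADEChallengeData2016/images/training",
    "data/Cityscapes/leftImg8bit/train",
    "data/coco_stuff164k/images/train2017",
]
missing = [p for p in required if not Path(p).is_dir()]
if missing:
    raise FileNotFoundError(f"missing dataset directories: {missing}")
```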
python utils/prompt_engineering.py --model ViT16 --class-set voc
# The text embeddings will be saved to 'text/voc_ViT16_clip_text.pth'
# Options for dataset: voc, context, ade, cityscapes, coco
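For reference, prompt engineering ensembles the text embeddings of each class name rendered through multiple prompt templates. The sketch below illustrates the idea with the openai/CLIP API; the templates and class names are illustrative, not the exact ones used by utils/prompt_engineering.py:

```python
import torch
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/16", device=device)

templates = ["a photo of a {}.", "a photo of the {}."]   # illustrative subset
classes = ["aeroplane", "bicycle", "bird"]               # illustrative subset of VOC classes

with torch.no_grad():
    per_class = []
    for name in classes:
        tokens = clip.tokenize([t.format(name) for t in templates]).to(device)
        feats = model.encode_text(tokens)
        feats = feats / feats.norm(dim=-1, keepdim=True)  # L2-normalize each prompt embedding
        mean = feats.mean(dim=0)
        per_class.append(mean / mean.norm())              # re-normalize the ensembled embedding
    text_embeddings = torch.stack(per_class)              # [num_classes, embed_dim]
print(text_embeddings.shape)
```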
python tools/pseudo_class.py --cfg 'config/voc_train_ori_cfg.yaml' --model 'RECLIPPP'
# The image-level multi-label hypothesis will be saved to 'text/voc_pseudo_label_ReCLIPPP.json'
# Options for dataset: voc, context, ade, cityscapes, coco
# Options for model: RECLIPPP (ReCLIP++), ReCLIP (ReCLIP)
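The multi-label hypothesis records which classes are likely present in each image. One common way to form such a hypothesis, sketched below for illustration (not the exact procedure of tools/pseudo_class.py), is to threshold the normalized similarity between the CLIP image embedding and the class text embeddings:

```python
import torch

def multilabel_hypothesis(image_feat, text_feats, threshold=0.5):
    """Pick classes whose similarity to the image passes a cutoff.

    image_feat: [dim] L2-normalized CLIP image embedding.
    text_feats: [num_classes, dim] L2-normalized class text embeddings.
    threshold:  illustrative cutoff on min-max normalized scores.
    """
    sims = text_feats @ image_feat                          # cosine similarity per class
    scores = (sims - sims.min()) / (sims.max() - sims.min() + 1e-8)
    return torch.nonzero(scores > threshold).flatten().tolist()
```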
python tools/train.py --cfg 'config/voc_train_ori_cfg.yaml' --model 'RECLIPPP'
# Options for dataset: voc, context, ade, cityscapes, coco
# Options for model: RECLIPPP (ReCLIP++), ReCLIP (ReCLIP)
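The `--cfg` argument points to a YAML config. If you need to adapt paths or hyperparameters, you can inspect it as ordinary YAML (requires pyyaml; the snippet below only lists the top-level keys and assumes nothing about their names):

```python
import yaml

# Load the training config to inspect or tweak options before launching tools/train.py.
with open("config/voc_train_ori_cfg.yaml") as f:
    cfg = yaml.safe_load(f)
print(sorted(cfg))  # list the top-level option names
```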
python tools/distill.py --cfg 'config/voc_distill_ori_cfg.yaml'
# Options for dataset: voc, context, ade, cityscapes, coco
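Distillation trains a standalone segmentation network on the masks produced by the rectified CLIP. A typical objective for this kind of step is pixel-wise cross-entropy against the pseudo masks, sketched below for illustration (not necessarily the exact loss used in tools/distill.py):

```python
import torch.nn.functional as F

def distillation_loss(student_logits, pseudo_mask, ignore_index=255):
    """Pixel-wise cross-entropy of a student segmenter against pseudo masks.

    student_logits: [B, num_classes, H, W] raw scores from the student network.
    pseudo_mask:    [B, H, W] integer class map produced by the rectified CLIP.
    """
    return F.cross_entropy(student_logits, pseudo_mask, ignore_index=ignore_index)
```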
python tools/test.py --cfg 'config/voc_test_ori_cfg.yaml' --model 'RECLIPPP'
# Options for dataset: voc, context, ade, cityscapes, coco
# Options for model: RECLIPPP (ReCLIP++), ReCLIP (ReCLIP)
python tools/distill_val.py --cfg 'config/voc_distill_ori_cfg.yaml'
# Options for dataset: voc, context, ade, cityscapes, coco
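Both evaluation scripts score the predicted segmentation masks; the standard metric on these benchmarks is mean IoU, which can be computed from a confusion matrix as in this generic sketch (not the repo's evaluation code):

```python
import numpy as np

def mean_iou(pred, gt, num_classes, ignore_index=255):
    """pred, gt: flat integer arrays of per-pixel class predictions / labels."""
    mask = gt != ignore_index
    hist = np.bincount(
        num_classes * gt[mask] + pred[mask], minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)                     # rows: gt, cols: pred
    inter = np.diag(hist)
    union = hist.sum(0) + hist.sum(1) - inter
    return float(np.nanmean(inter / np.maximum(union, 1)))
```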
Dataset | Rectification | Distillation
---|---|---
PASCAL VOC | 58.5 | 75.4 |
PASCAL Context | 25.8 | 33.8 |
ADE20K | 11.1 | 14.3 |
Dataset | Rectification
---|---
PASCAL VOC | 85.4 |
PASCAL Context | 36.1 |
ADE20K | 16.4 |
Cityscapes | 26.5 |
COCO Stuff | 23.8 |
Please cite our papers if you use our code in your research:
@inproceedings{wang2024learn,
  title={Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation},
  author={Wang, Jingyun and Kang, Guoliang},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={4102--4112},
  year={2024}
}
@article{wang2024reclip++,
  title={ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation},
  author={Wang, Jingyun and Kang, Guoliang},
  journal={arXiv preprint arXiv:2408.06747},
  year={2024}
}
For questions about our paper or code, please contact [email protected].