CVPR24-MedSAM-on-Laptop

This repository contains the code for our submission to the CVPR 2024: SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP competition:

DAFT: Data-Aware Fine-Tuning of Foundation Models for Efficient and Effective Medical Image Segmentation

BibTex citation:

@inproceedings{pfefferle2024daft,
  title={DAFT: Data-Aware Fine-Tuning of Foundation Models for Efficient and Effective Medical Image Segmentation},
  author={Pfefferle, Alexander Tobias and Purucker, Lennart and Hutter, Frank},
  booktitle={CVPR 2024: Segment Anything In Medical Images On Laptop}
}

The competition was organized by BoWang's Lab at University of Toronto and hosted on Codabench.

The main branch contains the code of the performance booster challenge that followed the competition. Check out the finalsubmission branch for our final submission to the competition.

Environments and Requirements

We trained on the JUWELS Booster (4x A100 40GB, Python 3.11.3, CUDA 12.2) using the packages in requirements.txt.

Dataset and Models

download the LiteMedSAM weights and put them in work_dir/LiteMedSAM/, also download the EfficientViT-SAM l0 checkpoint
download the training data (we only used the data provided by the organizers, no other external allowed datasets)
cd train_npz and unzip all files
prepare the new datasets
mv NewTrainingData/totalseg_mr MR
cd CVPR24-PostChallenge-Train/ && mkdir XRay PET
for file in *; do [ -f "$file" ] && mv "$file" "${file:3}"; done (we want all files of a modality to have the modality name as prefix)
mv X-Ray_Teeth_* XRay/ && mv PET_Lesion_psma_* PET/

Now we just need to split the data into training and validation sets and foundation modalities for DAFT:

go to wherever you cloned the repository and run python split_dataset.py <path_to_data>
cd datasplit && python modalities3D.py train.csv val.csv

Preprocessing

we didn't do any further preprocessing outside the distillation/finetuning code
during distillation/finetuning we did the same as the baseline training code
resize and pad images to 256x256
normalize intensities
flip horizontally with 50% probability
flip vertically with 50% probability
randomly pertubate the box by up to 5 pixels

Training

Make sure to use different workdirs for distillation, general finetuning and each DAFT run.

use python distill.py -num_epochs 70 -batch_size 7 -device cuda -work_dir work_dir_distill/ -resume work_dir_distill/medsam_lite_latest.pth -pretrained_checkpoint l0.pt to distill the TinyViT in LiteMedSAM to EfficientViT, use python modelmerge.py work_dir_distill/medsam_lite_best.pth distilled.pth to create weights for EfficientViT-SAM by using the distilled EfficientViT image encoder checkpoint and copying the prompt encoder and mask decoder from LiteMedSAM
fine tune our the model from step 3 on all the data by using python finetune.py -pretrained_checkpoint distilled.pth -num_epochs 70 -batch_size 96 -device cuda -work_dir work_dir_general -resume work_dir_general/medsam_lite_latest.pth, use python extract_evit.py work_dir_general/medsam_lite_best.pth general_finetuned.pth to extract the weights from the latest training checkpoint afterwards
fine tune our model from step 2 on different subsets of the data, depending on the modality, by using python daft.py -pretrained_checkpoint general_finetuned.pth -num_epochs 70 -batch_size 96 -device cuda -work_dir work_dir_modalities3D/<modality>/ -resume work_dir_modalities3D/<modality>/medsam_lite_latest.pth --traincsv datasplit/modalities3D/<modality>.train.csv --valcsv datasplit/modalities3D/<modality>.val.csv, afterwards use mkdir models && ./extract_modalities3D.sh to extract the weights from the checkpoints
cp general_finetuned.pth models/general.pth to use the generally finetuned model as a fallback if we can't read the modality from the file name
convert all PyTorch models to ONNX by running python export_onnx.py
cd onnxmodels && mv 3D_prompt_encoder.onnx prompt_encoder.onnx && rm *_prompt_encoder.onnx && cd .. since the prompt encoder is shared
convert ONNX to OpenVINO IR via python onnx2openvino.py

You can now use PerfectMetaOpenVINO.py for inference.

The OpenVINO IR artifacts of our performance booster submission and the distilled and generally finetuned models are also available here.

Inference

Download the demo data and run python PerfectMetaOpenVINO.py

Our C++ implementation is available in cpp/.

Evaluation

Run python evaluation/compute_metrics.py -s test_demo/segs -g test_demo/gts -csv_dir ./metrics.csv

Docker submission

To create the docker you can use ./build_docker_cpp.sh (or ./build_docker_openvino.sh for the python version) and docker save automlfreiburg | gzip -c > automlfreiburg.tar.gz to save it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CVPR24-MedSAM-on-Laptop

Environments and Requirements

Dataset and Models

Preprocessing

Training

Inference

Evaluation

Docker submission

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
cpp		cpp
datasplit		datasplit
evaluation		evaluation
imgs		imgs
segment_anything		segment_anything
work_dir/LiteMedSAM		work_dir/LiteMedSAM
.gitignore		.gitignore
.gitmodules		.gitmodules
CITATION.bib		CITATION.bib
DockerfileCpp		DockerfileCpp
DockerfileCpp.dockerignore		DockerfileCpp.dockerignore
DockerfileOpenVINO		DockerfileOpenVINO
DockerfileOpenVINO.dockerignore		DockerfileOpenVINO.dockerignore
LICENSE		LICENSE
PerfectMetaOpenVINO.py		PerfectMetaOpenVINO.py
README.md		README.md
build_docker_cpp.sh		build_docker_cpp.sh
build_docker_openvino.sh		build_docker_openvino.sh
daft.py		daft.py
distill.py		distill.py
export_onnx.py		export_onnx.py
extract_evit.py		extract_evit.py
extract_modalities3D.sh		extract_modalities3D.sh
finetune.py		finetune.py
modelmerge.py		modelmerge.py
onnx2openvino.py		onnx2openvino.py
predict_cpp.sh		predict_cpp.sh
predict_openvino.sh		predict_openvino.sh
requirements.txt		requirements.txt
split_dataset.py		split_dataset.py
tiny_vit_sam.py		tiny_vit_sam.py

License

CRC-1597-Small-Data/CVPR24-MedSAM-on-Laptop

Folders and files

Latest commit

History

Repository files navigation

CVPR24-MedSAM-on-Laptop

Environments and Requirements

Dataset and Models

Preprocessing

Training

Inference

Evaluation

Docker submission

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages