Track 3 - Robust Occupancy Prediction

Preparation
- Installation
- Datasets
Getting Started
- Training
- Evaluation
Customized Dataset
Baseline Results
References

Preparation

We implemented SurroundOcc as the baseline model for Track 3. The baseline model was trained on the official train split of the nuScenes dataset and evaluated on our robustness probing sets under different corruptions.

This codebase provides basic instructions for the reproduction of the baseline model in the RoboDrive Challenge.

⚙️ Installation

Kindly refer to the INSTALL.md to set up environments and download necessary checkpoints.

♨️ Datasets

We use data under the nuScenes train split as the training set and the RoboDrive robustness probing data as the evaluation sets. For training data preparation, kindly refer to DATA_PREPARE.md.

For evaluation data preparation, kindly download the dataset from the following resources:

Type	Phase 1	Phase 2
Google Drive	`link1` or `link2`	`link1` or `link2`

Uncompress the downloaded dataset and organize the folder structure as follows:

.
├── ckpts
│   └── surroundocc.pth
├── data
│   ├── nuscenes
│   ├── nuscenes_infos_val.pkl
│   ├── nuscenes_occ
│   ├── robodrive_infos_temporal_test.pkl
│   └── robodrive-release
├── docs
├── extensions
└── projects

Next, run the following command to generate the .pkl file for the evaluation sets:

bash tools/create_data.sh

🚙 Hint: You can download our generated robodrive_infos_temporal_test.pkl file from this Google Drive link.

The nuscenes folder should end up looking like this:

.
├── basemap
├── can_bus
├── can_bus.zip
├── expansion
├── lidarseg
├── maps
├── nuscenes_infos_temporal_train.pkl
├── nuscenes_infos_temporal_val.pkl
├── nuScenes-panoptic-v1.0-all
├── prediction
├── robodrive_infos_temporal_test.pkl
├── robodrive-v1.0-test
├── samples
├── sweeps
├── v1.0-mini
├── v1.0-test
└── v1.0-trainval

Getting Started

The training and evaluation instructions are summarized as follows.

🚀 Training

Kindly refer to GET_STARTED.md for the details regarding model training.

📊 Evaluation

Simply run the following command to evaluate the trained baseline model on the RoboDrive robustness probing sets.

cd SurroundOcc
bash tools/dist_test_corruption.sh

We provide the script to generate prediction files. Kindly refer to robodrive_multi_gpu_test.py.

The generated results will be saved in the folder structure as follows. Each results.pkl is a dictionary, its key is sample_idx and its value is np.ndarray.

.
├── brightness
│   └── results.pkl
├── color_quant
│   └── results.pkl
├── contrast
│   └── results.pkl
...
├── snow
└── zoom_blur

Next, kindly merge all the .pkl files into a single pred.pkl file and zip compress it.

You can merge the results using the following command:

python ./tools/convert_submit.py

⚠️ Note: The prediction file MUST be named as pred.pkl. The .zip file can be named as you like.

Finally, upload the compressed file to Track 3's evaluation server for model evaluation.

🚙 Hint: We provided the baseline submission file at this Google Drive link. Feel free to download and check it for reference and learn how to correctly submit the prediction files to the server.

Customized Dataset

To customize your own dataset, simply build your dataset based on NuScenesCorruptionDataset.

We mainly modified the data loading part. We only consider the subset of scenes for each corruption type, below is an example showing how to load a subset of scenes under each corruption type.

For more information, kindly refer to corruption_dataset.py.

data = mmcv.load(ann_file)
        
data_infos = data['infos']
sample_data_infos = []
for data_info in data_infos:
    if self.corruption is not None:
        if data_info['scene_token'] in self.sample_scenes_dict[self.corruption]:
            sample_data_infos.append(data_info)
        else:
            sample_data_infos.append(data_info)

Baseline Results

Phase 1

Corruption	SC IoU	SSC mIoU	barrier	bicycle	car	const. veh	motorcycle	pedestrain	traffic cone	trailer	trunk	drive. suf	other flat	sidewalk	terrian	manmade	vegetation
Bright	0.2923	0.1613	0.2028	0.1209	0.2327	0.2937	0.0096	0.1923	0.1173	0.0766	0.0000	0.2356	0.3451	0.0438	0.1908	0.2145	0.1083
Dark	0.2404	0.1232	0.1030	0.0508	0.1821	0.2305	0.0591	0.0843	0.0294	0.0583	0.1201	0.1846	0.2859	0.1057	0.1503	0.1247	0.0802
Fog	0.2815	0.1815	0.2795	0.1067	0.1999	0.2944	0.0536	0.1204	0.1478	0.0803	0.1692	0.2228	0.3258	0.2652	0.1817	0.1876	0.1029
Frost	0.1417	0.0349	0.0062	0.0041	0.0241	0.1240	0.0002	0.0037	0.0111	0.0196	0.0052	0.0438	0.1365	0.0118	0.0241	0.0314	0.0373
Snow	0.2233	0.1085	0.1271	0.0298	0.1902	0.2272	0.0469	0.0149	0.0595	0.0913	0.0233	0.1110	0.2780	0.1520	0.1107	0.0935	0.0606
Contrast	0.2217	0.0768	0.0000	0.0060	0.1457	0.2217	0.0000	0.0237	0.0511	0.0020	0.0000	0.1725	0.2534	0.0174	0.1069	0.0518	0.0650
Defocus Blur	0.2484	0.1067	0.0635	0.0070	0.0871	0.2857	0.0660	0.0307	0.0494	0.0155	0.0279	0.1546	0.2901	0.1083	0.1811	0.0978	0.0994
Glass Blur	0.2397	0.0978	0.0685	0.0222	0.0726	0.2577	0.0018	0.0079	0.0477	0.0677	0.0394	0.1171	0.2793	0.1006	0.1704	0.1257	0.0820
Motion Blur	0.1675	0.1002	0.1448	0.0549	0.2610	0.2073	0.0092	0.0270	0.0685	0.0541	0.0000	0.1628	0.2161	0.1223	0.0791	0.0706	0.0528
Zoom Blur	0.1775	0.0566	0.0441	0.0000	0.1167	0.0855	0.0085	0.0008	0.0103	0.0043	0.0265	0.0616	0.2267	0.0400	0.0871	0.0702	0.0637
Elastic Transform	0.2890	0.1671	0.1173	0.0660	0.1935	0.2881	0.0601	0.0506	0.1180	0.0913	0.1838	0.2277	0.3395	0.2026	0.2208	0.1694	0.1239
Color Quant	0.2282	0.0974	0.0274	0.0160	0.1895	0.2417	0.0032	0.0195	0.0481	0.0238	0.0551	0.1456	0.2725	0.1350	0.1460	0.0428	0.0849
Gaussian Noise	0.2420	0.0875	0.0201	0.0098	0.0692	0.2420	0.0334	0.0448	0.0248	0.0218	0.0267	0.1075	0.2735	0.0344	0.1719	0.0904	0.0987
Impluse Noise	0.2256	0.1000	0.0338	0.0205	0.1510	0.2209	0.0166	0.0350	0.0399	0.0266	0.0838	0.1580	0.2376	0.1008	0.1621	0.0863	0.0897
Shot Noise	0.2604	0.1263	0.0746	0.0267	0.2982	0.2810	0.0694	0.0506	0.0400	0.0210	0.0973	0.1150	0.3019	0.1424	0.1828	0.0767	0.0988
ISO Noise	0.2334	0.1114	0.0786	0.0076	0.2046	0.2726	0.0891	0.0169	0.0294	0.0223	0.0271	0.1519	0.2580	0.1207	0.1684	0.0865	0.0855
Pixelate	0.3012	0.1677	0.0777	0.0415	0.3107	0.3057	0.0879	0.1745	0.0428	0.0518	0.0559	0.1871	0.3377	0.2475	0.2573	0.1448	0.1462
JPEG	0.2549	0.1287	0.0781	0.0189	0.1998	0.2455	0.0932	0.1151	0.0686	0.0530	0.1465	0.1369	0.2963	0.1495	0.1814	0.1052	0.0784

Phase 2

To be updated.

References

Kindly cite the corresponding paper(s) once you use the baseline model in this track.

@inproceedings{wei2023surroundocc,
    title = {SurroundOCC: Multi-Camera 3D Occupancy Prediction for Autonomous Driving},
    author = {Wei, Yi and Zhao, Linqing and Zheng, Wenzhao and Zhu, Zheng and Zhou, Jie and Lu, Jiwen},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
    pages = {21729-21740},
    year = {2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Track 3 - Robust Occupancy Prediction

Preparation

⚙️ Installation

♨️ Datasets

Getting Started

🚀 Training

📊 Evaluation

Customized Dataset

Baseline Results

Phase 1

Phase 2

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

Track 3 - Robust Occupancy Prediction

Preparation

⚙️ Installation

♨️ Datasets

Getting Started

🚀 Training

📊 Evaluation

Customized Dataset

Baseline Results

Phase 1

Phase 2

References