Skip to content

Latest commit

 

History

History
96 lines (67 loc) · 5.81 KB

Dataset.md

File metadata and controls

96 lines (67 loc) · 5.81 KB

SHHQ Dataset

Overview

SHHQ is a dataset with high-quality full-body human images in a resolution of 1024 × 512. Since we need to follow a rigorous legal review in our institute, we can not release all of the data at once.

For now, SHHQ-1.0 with 40K images is released! More data will be released in the later versions.

Annotation

🔥🔥🔥 Now we release the human parsing annotations along with 2D body keypoints for SHHQ-1.0. 🔥🔥🔥

Parsing images are manually annotated, and keypoints are the combination of detection results from OpenPose and Dlib.

The released 25 body-keypoints is inline with OpenPose, and all the keypoints are stored in a single text file in the following format:

<file_name> <x_0> <y_0> <x_1> <y_1> <x_2> <y_2> ... <x_24> <y_24>

The color labels (in RGB) for 16 categories in the annotation file listed below:

Class ID Label Color Code Class ID Label Color Code
1 Headwear (127, 255, 212) 2 Hair (255, 0, 0)
3 Glove (213, 140, 88) 4 Eyeglasses (0, 100, 0)
5 Tops (255, 250, 250) 6 Dress (255, 250, 205)
7 Coat (220, 220, 220) 8 Socks (160, 140, 88)
9 Pants (211, 211, 211) 10 Skin (144, 238, 144)
11 Scarf (150, 26, 181) 12 Skirt (250, 235, 215)
13 Face (16, 78, 139) 14 Shoes (245, 222, 179)
15 Bag (255, 140, 0) 16 Accessories (50, 205, 50)

Note: all the parsing images are in ''.png'' format

Data Sources

Images are collected in two main ways:

  1. From the Internet. We developed a crawler tool with an official API, mainly downloading images from Flickr, Pixabay and Pexels. So you need to meet all the following licenses when using the dataset: CC0, Pixabay License, and Pexels Licenses.
  2. From the data providers. We purchased images from databases of individual photographers, modeling agencies and other suppliers. Images were reviewed by our legal team prior to purchase to ensure permission for use in research.

Note:

The composition of SHHQ-1.0:

  1. Images obtained from the above sources.
  2. Processed 9991 DeepFashion [1] images (retain only full body images).
  3. 1940 African images from the InFashAI [2] dataset to increase data diversity.

Data License

We are aware of privacy concerns and seriously treat the license and privacy issues. All released data will be ensured under the license of CC0 and free for research use. Also, persons in the dataset are anonymised without additional private or sensitive metadata.

Agreement

The SHHQ is available for non-commercial research purposes only.

You agree not to reproduce, duplicate, copy, sell, trade, resell or exploit any portion of the images and any portion of the derived data for commercial purposes.

You agree NOT to further copy, publish or distribute any portion of SHHQ to any third party for any purpose. Except, for internal use at a single site within the same organization it is allowed to make copies of the dataset.

Shanghai AI Lab reserves the right to terminate your access to the SHHQ at any time.

Dataset Preview

For those interested in our dataset, we provide a preview version with 100 images randomly sampled from SHHQ-1.0: SHHQ-1.0_samples.

In SHHQ-1.0, we provide aligned raw images along with machine-calculated segmentation masks. Later we are planning to release manually annotated human-parsing version of these 40,000 images. Please stay tuned.

We also provide script bg_white.py to whiten the background of the raw image using its segmentation mask.

If you want to access the full SHHQ-1.0, please read the following instructions.

Model trained using SHHQ-1.0

Structure 1024x512 Metric Scores 512x256 Metric Scores
StyleGAN1 to be released - - to be released - -
StyleGAN2 SHHQ-1.0_sg2_1024.pkl fid50k_full 3.56 SHHQ-1.0_sg2_512.pkl fid50k_full 3.68
StyleGAN3 to be released - - to be released - -

Download Instructions

We are pleased to announce that our dataset is now publicly available. By default, it is only permitted for academic research purposes. Please note that any use of the dataset must comply with relevant laws and regulations. We reserve the right to take legal action against any unauthorized use. Dataset Link: Link Password: [StylisH-HumanS-hq_1.0]

We hope this dataset will contribute to the academic community and look forward to seeing the valuable research results it may inspire.

References

[1] Liu, Ziwei and Luo, Ping and Qiu, Shi and Wang, Xiaogang and Tang, Xiaoou. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. CVPR (2016)

[2] Hacheme, Gilles and Sayouti, Noureini. Neural fashion image captioning: Accounting for data diversity. arXiv preprint arXiv:2106.12154 (2021)