Datapipeline

Easy dataset preparation tool for speaking avatar model training

Installation

First, fetch all LFS data and initialize the submodules:

git lfs fetch --all
git submodule update --init --recursive

Docker

The Docker container saves you a lot of setup pain, so use it:

# Build an image
docker build -t IMAGE_NAME .
# Run container and launch interactive shell
docker run -it --gpus all --entrypoint=/bin/bash IMAGE_NAME 

To ensure your image is built properly, run the tests inside the container:

pytest tests

Local (for development)

The tool has been tested only on machines with Python 3.9 and CUDA 11.3.

The most critical packages are torch, pytorch3d, and transformers; if you install those correctly for your Python and CUDA versions, the rest of the tool is very likely to work.

Create a Python venv and install dependencies

python3 -m venv dpl-venv
source dpl-venv/bin/activate
./scripts/install_deps.sh

Install jpegHDF5 plugin

Some computation nodes use the h5py module to pack data into HDF5 containers. In addition, we use a JPEG plugin to save space when storing images.

Choose one of the two options below to build it on your machine.

1. Stable, but with Anaconda

  • Download and install Anaconda
  • Run the installation script:
sudo bash scripts/build_jpeghdf5_anaconda.sh \
    thirdparty/jpegHDF5 \
    PATH/TO/ANACONDA

2. Unstable, but without Anaconda

  • Ensure you have the latest versions of libjpeg and libhdf5:
# On Ubuntu
sudo apt-get update -y
sudo apt-get install -y libjpeg-dev libhdf5-dev
  • Run the installation script:
sudo bash scripts/build_jpeghdf5.sh thirdparty/jpegHDF5
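
Once the plugin is built, you can quickly check that HDF5 picks it up. A minimal sketch, assuming the plugin is installed to HDF5's default plugin directory and registers under the standard HDF5 JPEG filter ID (32019); verify both against your build:

import os

# HDF5 discovers dynamically loaded filters via HDF5_PLUGIN_PATH; set it
# before importing h5py. The path below is HDF5's default plugin directory --
# adjust it to wherever the build script installed the plugin.
os.environ.setdefault("HDF5_PLUGIN_PATH", "/usr/local/hdf5/lib/plugin")

import h5py

JPEG_FILTER_ID = 32019  # registered HDF5 filter ID for JPEG (assumed to match this plugin)
print("JPEG filter available:", bool(h5py.h5z.filter_avail(JPEG_FILTER_ID)))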

Test your environment

Run the tests to check that all dependencies are installed correctly:

pytest tests

Usage

1. Define computation graph

Like this:

graph = [
    {
        'name': 'VideoToWavNode',
        'params': {
            'num_jobs': 32,
        }
    },
    {
        'name': 'Wav2vecNode',
        'params': {
            'device': 'cuda:0',
        }
    }
]

Explanation: name is the class name of a computation node from dpl/processor/nodes, and params is a dict of its parameters.

The graph above extracts audio in .wav format from each given video and then computes wav2vec features.
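
For intuition, here is a rough sketch of what a wav2vec feature extraction step looks like with the transformers library. This is an illustration only, not the repository's actual Wav2vecNode code; the checkpoint name and file paths are assumptions:

import torch
import torchaudio
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

name = "facebook/wav2vec2-base-960h"  # hypothetical checkpoint; the node may use another
extractor = Wav2Vec2FeatureExtractor.from_pretrained(name)
model = Wav2Vec2Model.from_pretrained(name).eval().to("cuda:0")

# "clip.wav" stands in for audio produced by VideoToWavNode.
waveform, sr = torchaudio.load("clip.wav")
waveform = torchaudio.functional.resample(waveform, sr, 16_000)  # wav2vec 2.0 expects 16 kHz audio

inputs = extractor(waveform.squeeze(0).numpy(), sampling_rate=16_000, return_tensors="pt")
with torch.no_grad():
    features = model(inputs.input_values.to("cuda:0")).last_hidden_state  # shape: (1, frames, 768)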

More useful graphs can be found in the configs folder.

2. Specify inputs

You already have your input data, right? If so, just put the root path for each data type in a dict like this:

inputs = {
    'video': 'root/path/to/videos',
    # If your computation graph takes more data types
    # then all of them should be specified here
    # 'wav': 'root/path/to/audio'
}

All available data types and their file extensions can be found in dpl/processor/datatypes.py.

3. Run!

Save the graph and inputs to JSON files, then run:

python run.py \
    --graph "path/to/graph/file.json" \
    --inputs "path/to/inputs/file.json" \
    --output_dir "path/to/output/root"
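
If you defined graph and inputs in Python as in the steps above, a minimal sketch for writing the two JSON files (file names are placeholders; use whatever paths you pass to --graph and --inputs):

import json

with open("graph.json", "w") as f:
    json.dump(graph, f, indent=2)  # the list of node specs from step 1

with open("inputs.json", "w") as f:
    json.dump(inputs, f, indent=2)  # the data-type roots from step 2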

Backlog

  • Dockerize
  • Nodes for visualizations
  • Better logging
  • Better error handling
  • Parallelize across GPUs
  • Parametrizable image format: support JPEG and PNG
  • Post-check output shapes for Numpy datatypes
  • Support filtering input paths
  • Datatypes with multiple allowed extensions: listdir currently returns files matching ALL extensions, which is not desirable
  • Mypy and format pre-commit check
  • Tests
