Memory management for training on large data sets #137

Closed
alanakbik opened this issue Oct 10, 2018 · 1 comment
Labels
feature A new feature

Comments

@alanakbik (Collaborator)

In use cases where training data sets are large or little RAM is available, language model embeddings cannot be stored in memory (see #135).

Current solution: The only way to train a model in such cases is to set the embeddings_in_memory flag to False in the trainer classes (TextClassifierTrainer or SequenceTaggerTrainer). With this flag set, embeddings are generated on the fly at each epoch and discarded immediately after use. This solves the memory issue but is computationally expensive, since already computed embeddings are never re-used.
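
For illustration, a minimal sketch of passing this flag to a trainer. It assumes a flair setup from around this time; the import paths, corpus setup, and all train() arguments other than embeddings_in_memory are assumptions, not a verbatim API reference:

```python
# Sketch only: everything except the embeddings_in_memory flag discussed above
# is an assumption about the surrounding flair API of this era.
from flair.embeddings import CharLMEmbeddings
from flair.models import SequenceTagger
from flair.trainers import SequenceTaggerTrainer

corpus = ...  # a tagged corpus loaded elsewhere
tag_dictionary = corpus.make_tag_dictionary(tag_type='ner')

tagger = SequenceTagger(hidden_size=256,
                        embeddings=CharLMEmbeddings('news-forward'),
                        tag_dictionary=tag_dictionary,
                        tag_type='ner')

trainer = SequenceTaggerTrainer(tagger, corpus)

# With embeddings_in_memory=False, language model embeddings are recomputed
# on the fly each epoch instead of being kept in RAM.
trainer.train('resources/taggers/example-ner',
              embeddings_in_memory=False)
```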

Idea: Use a key-value store to persist embeddings to disk and enable quick lookup of already computed embeddings. A nice side effect is that if we run several experiments on the same dataset, embeddings from earlier runs can be re-used, speeding up parameter-sweep experiments. A sketch of the idea follows below.
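
A generic sketch of the proposed mechanism, using Python's built-in sqlite3 as the key-value store; this is not the implementation that later landed in flair, and the names (EmbeddingCache, cache path) are hypothetical:

```python
# Disk-backed key-value cache for computed embeddings, keyed by sentence text.
import pickle
import sqlite3


class EmbeddingCache:
    """Persist computed embeddings to disk and look them up by key."""

    def __init__(self, path: str = 'embedding_cache.sqlite'):
        self.db = sqlite3.connect(path)
        self.db.execute(
            'CREATE TABLE IF NOT EXISTS embeddings (key TEXT PRIMARY KEY, value BLOB)'
        )

    def get(self, key: str):
        row = self.db.execute(
            'SELECT value FROM embeddings WHERE key = ?', (key,)
        ).fetchone()
        return pickle.loads(row[0]) if row else None

    def put(self, key: str, embedding) -> None:
        self.db.execute(
            'INSERT OR REPLACE INTO embeddings (key, value) VALUES (?, ?)',
            (key, pickle.dumps(embedding)),
        )
        self.db.commit()


def embed_with_cache(sentence_text: str, compute_embedding, cache: EmbeddingCache):
    """Return the cached embedding if present; otherwise compute and persist it."""
    cached = cache.get(sentence_text)
    if cached is not None:
        return cached
    embedding = compute_embedding(sentence_text)  # expensive language model forward pass
    cache.put(sentence_text, embedding)
    return embedding
```

Because the cache lives on disk, a second experiment over the same dataset hits the cache instead of recomputing embeddings, which is where the parameter-sweep speed-up mentioned above would come from.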

@alanakbik (Collaborator, Author)

Will be part of release 0.3 and activated by default for CharLMEmbeddings (it can still be turned off to save disk space).
