Fix transformer smaller training vocab #3155

helpmefindaname · 2023-03-20T18:25:33Z

This PR fixes the usage of the transformer smaller training vocab and improves the documentation:

- Fix the transformer smaller training vocab so that the reduced vocabulary is actually used for training, instead of only temporarily reducing the vocab size before training starts.
- Enforce using a newer version of the library, which brings 2 changes:
  - Embedding parameters that are not trainable but are present in the optimizer are now correctly kept non-trainable. Hence, running `embeddings.model.embeddings.word_embeddings.requires_grad_(False)` before training will work with `reduce=True`.
  - The config now correctly sets the vocab size, so reduced models can be saved and later loaded as such.
- The transformer smaller training vocab is now documented in the tutorials.
- The ONNX tutorial is slightly improved to fit the newer structure.
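
For readers unfamiliar with the idea, here is a rough conceptual sketch of what training with a smaller vocab means. It is not the library's code: `reduce_vocab` and `restore_vocab` are hypothetical helper names, and the embedding matrix is a plain list of rows instead of a tensor. The point is that only the rows for tokens seen in the training data are kept, token ids are remapped, and the trained rows are written back into the full matrix afterwards.

```python
from typing import Dict, List, Sequence, Tuple

Matrix = List[List[float]]


def reduce_vocab(
    embeddings: Matrix,
    corpus_token_ids: Sequence[Sequence[int]],
    special_ids: Sequence[int] = (),
) -> Tuple[Matrix, Dict[int, int]]:
    """Keep only the embedding rows for tokens that occur in the corpus.

    Hypothetical helper for illustration only.
    """
    used = sorted(set(special_ids) | {t for seq in corpus_token_ids for t in seq})
    old_to_new = {old: new for new, old in enumerate(used)}
    reduced = [list(embeddings[old]) for old in used]
    return reduced, old_to_new


def restore_vocab(full: Matrix, reduced: Matrix, old_to_new: Dict[int, int]) -> Matrix:
    """Copy the (trained) reduced rows back into a copy of the full matrix."""
    restored = [list(row) for row in full]
    for old, new in old_to_new.items():
        restored[old] = list(reduced[new])
    return restored


# Toy setup: a vocab of 10 tokens with 2-dimensional embeddings.
full = [[float(i), float(i) + 0.5] for i in range(10)]
corpus = [[0, 7, 3, 2], [0, 3, 9, 2]]  # token ids of the training sentences

reduced, mapping = reduce_vocab(full, corpus, special_ids=(0, 2))
remapped = [[mapping[t] for t in seq] for seq in corpus]

# Stand-in for a training step: only the reduced matrix is updated.
reduced[mapping[7]][0] += 1.0

restored = restore_vocab(full, reduced, mapping)
```

In this sketch the update only takes effect because the reduced matrix is the one that is modified during the "training step" and then written back. That is the property the first bullet above is about: the reduction has to stay active for the whole training run.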

alanakbik · 2023-03-22T23:29:49Z

Thanks for this @helpmefindaname! I tested on our cluster - it works with 'distilbert-base-uncased', but running a FLERT training script with 'xlm-roberta-large' throws a weird CUDA error (`RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling cublasCreate(handle)`). Can you check?

helpmefindaname · 2023-03-27T08:36:55Z

> Thanks for this @helpmefindaname! I tested on our cluster - it works with 'distilbert-base-uncased', but running a FLERT training script with 'xlm-roberta-large' throws a weird CUDA error (`RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling cublasCreate(handle)`). Can you check?

Thanks for pointing this out. This should be fixed with version 0.2.3.

alanakbik · 2023-03-29T09:58:56Z

@helpmefindaname thanks, it works now!

Benedikt Fuchs and others added 3 commits March 20, 2023 18:57

document transformer smaller training vocab and onnx transformers

8a9baac

fix transformer smaller training vocab

e700365

fix black formatting

9d186ec

alanakbik merged commit 25ebf38 into flairNLP:master Mar 29, 2023

helpmefindaname deleted the fix_transformer_smaller_training_vocab branch March 29, 2023 13:52
