
Upgrade to HF transformers 4.3.2 #16

Merged · 1 commit · Feb 28, 2021

Conversation

vblagoje (Contributor)
@lvwerra here is a proposal to update HF Transformers to 4.3.2. The only compatibility-breaking change I found was a parameter rename in the GPT2Model#forward function (see the sketch below). I also upgraded simpletransformers to a version that uses Transformers 4.3.2. I ran all the notebooks in the nbs directory to make sure everything works as before. I did not upgrade any other libs, as I am not that familiar with the nbdev environment.
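For context, a minimal sketch of the kind of rename involved, assuming it is the `past` → `past_key_values` change that Transformers 4.0 introduced for GPT-2; the prompt and the greedy decoding step below are illustrative, not the PR diff:

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

input_ids = tokenizer("The movie was", return_tensors="pt").input_ids
out = model(input_ids, use_cache=True)  # returns logits plus the key/value cache

# pick the greedy next token just to have something to feed back in
next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)

# transformers < 4.0 passed the cache as `past=out.past_key_values`;
# from 4.x on, the keyword is `past_key_values`:
out = model(next_token, past_key_values=out.past_key_values)
```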

04-gpt2-sentiment-ppo-training.ipynb works as expected and trains in about 2 hours. For 05-gpt2-sentiment-control.ipynb, however, I had to lower `batch_size` to 128 and `forward_batch_size` to 4 to avoid CUDA out-of-memory errors (see the sketch below); tqdm estimates a running time of about 5 hours.
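A sketch of the adjusted settings, assuming the notebook's dict-style PPO config; the remaining keys and values are unchanged and omitted here:

```python
config = {
    # lowered from the notebook defaults to fit in GPU memory
    "batch_size": 128,        # examples per PPO step
    "forward_batch_size": 4,  # chunk size for forward passes through the models
    # ... remaining PPO hyperparameters as in the notebook
}
```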

vblagoje (Contributor, Author) commented Feb 26, 2021

@lvwerra I tried this branch on both IMDB PPO notebooks (the basic PPO sentiment training and the controlled-sentiment PPO). Both work as expected; please try it as well. Let me know if any other checks should be done.

lvwerra (Member) commented Feb 26, 2021

Awesome! Did you also use Weights & Biases? In case you did, would you mind sharing the logs?

vblagoje (Contributor, Author)

Yes, I did, but I deleted the first report for 04-gpt2-sentiment-ppo-training.ipynb. Here is the report for 05-gpt2-sentiment-control.ipynb.

lvwerra merged commit 750f5fd into huggingface:master on Feb 28, 2021