Accept Upstream Changes #1

will-rice · 2021-05-06T01:17:41Z

No description provided.

* Add more metadata to the user agent * Fix typo * Use DISABLE_TELEMETRY * Address review comments * Use global env * Add clean envs on circle CI

* First third * Styling and fix mistake * Quality * All the rest * Treat %s and %d * typo * Missing ) * Apply suggestions from code review Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

* Replace is_sagemaker_distributed_available * Merge SageMakerTrainer into Trainer * Test with shorter condition * Put back deleted line * Deprecate SageMakerTrainer and SageMakerTrainingArguments * Apply suggestions from code review Co-authored-by: Philipp Schmid <[email protected]> Co-authored-by: Philipp Schmid <[email protected]>

In the group by length documentation length is misspelled as legnth

* Add initial script for finetuning MLM models with accelerate * Add evaluation metric calculation * Fix bugs * Use no_grad on evaluation * update script docstring * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <[email protected]> * PR feedback * Fix CI failure * Update examples/language-modeling/run_mlm_no_trainer.py Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* Update optimization.py Fix documentation to reflect optimal settings for Adafactor * update and expand on the recommendations * style * Apply suggestions from code review Co-authored-by: Sylvain Gugger <[email protected]> * flip scale_parameter to True for the 2nd recommendatoin Co-authored-by: Stas Bekman <[email protected]> Co-authored-by: Stas Bekman <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* use bisect to add one token to unique_no_split_tokens * fix style

@sgugger

* Squash all commits into one * Update ViTFeatureExtractor to use image_utils instead of torchvision * Remove torchvision and add Pillow * Small docs improvement * Address most comments by @sgugger * Fix tests * Clean up conversion script * Pooler first draft * Fix quality * Improve conversion script * Make style and quality * Make fix-copies * Minor docs improvements * Should use fix-copies instead of manual handling * Revert "Should use fix-copies instead of manual handling" This reverts commit fd4e591. * Place ViT in alphabetical order Co-authored-by: Lysandre <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

@sgugger

* closes #10258 * typo * reworked deberta test * implemented the comments from BigBird01 regarding sequence pair encoding of deberta * Update style * VOCAB_FILES_NAMES is now a oneliner as suggested by @sgugger Co-authored-by: Sylvain Gugger <[email protected]> * added #fmt: on as requested by @sgugger * Style Co-authored-by: Lysandre <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

*negative* log-likelihood

* added new notebook and merge of trainer * Update docs/source/sagemaker.md Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

double : prevents code-block section to be rendered, so made it single :

* Pin docutils * Versions table

* Refactor AutoModel classes and add Flax Auto classes * Add new objects to the init * Fix hubconf and sort models * Fix TF tests * Missing coma * Update src/transformers/models/auto/auto_factory.py Co-authored-by: Lysandre Debut <[email protected]> * Fix init * Fix dummies * Other init to fix Co-authored-by: Lysandre Debut <[email protected]>

) * Documentation about loading a fast tokenizer within Transformers * Apply suggestions from code review Co-authored-by: Sylvain Gugger <[email protected]> * style Co-authored-by: Sylvain Gugger <[email protected]>

* Add example for callback registry Resolves: #9036 * Update callback registry documentation * Added comments for other ways to register callback

* Initial draft for clm no trainer * Remove unwanted args * Fix bug * Update examples/language-modeling/run_clm_no_trainer.py Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* Replace pkg_resources with importlib_metadata Fixes #10964. The other reason for this change is that pkg_resources has been [deprecated](pypa/setuptools@8fe85c2) in favor of importlib_metadata. * Reduce to a single importlib_metadata import switch * Trigger CI Co-authored-by: Stas Bekman <[email protected]>

* Fixed the doc for the shape of return scores tuples in generation_utils.py. * Fix the output shape of `scores` for `DecoderOnlyOutput`. * style fix

Replaces `tok` with `tokenizer` so examples can run with copy-paste

* small fixes * style

* push * small change * correct other typo

@patil-suraj

* Rebase with master * Minor bug fix in docs * Copy files from adding_luke_v2 and improve docs * change the default value of use_entity_aware_attention to True * remove word_hidden_states * fix head models * fix tests * fix the conversion script * add integration tests for the pretrained large model * improve docstring * Improve docs, make style * fix _init_weights for pytorch 1.8 * improve docs * fix tokenizer to construct entity sequence with [MASK] entity when entities=None * Make fix-copies * Make style & quality * Bug fixes * Add LukeTokenizer to init * Address most comments by @patil-suraj and @LysandreJik * rename _compute_extended_attention_mask to get_extended_attention_mask * add comments to LukeSelfAttention * fix the documentation of the tokenizer * address comments by @patil-suraj, @LysandreJik, and @sgugger * improve docs * Make style, quality and fix-copies * Improve docs * fix docs * add "entity_span_classification" task * update example code for LukeForEntitySpanClassification * improve docs * improve docs * improve the code example in luke.rst * rename the classification layer in LukeForEntityClassification from typing to classifier * add bias to the classifier in LukeForEntitySpanClassification * update docs to use fine-tuned hub models in code examples of the head models * update the example sentences * Make style & quality * Add require_torch to tokenizer tests * Add require_torch to tokenizer tests * Address comments by @sgugger and add community notebooks * Make fix-copies Co-authored-by: Ikuya Yamada <[email protected]>

…s to tokenizer (#11538) * Fixed tokenization mistakes while adding single-char tokens to tokenizer * Added tests and Removed unnecessary comments. * finalize wav2vec2 tok * add more aggressive tests * Apply suggestions from code review * fix useless import Co-authored-by: Patrick von Platen <[email protected]>

Fixes #11525

* Update training tutorial * Apply suggestions from code review Co-authored-by: Hamel Husain <[email protected]> * Address review comments * Update docs/source/training.rst Co-authored-by: Lysandre Debut <[email protected]> * More review comments * Last review comments Co-authored-by: Hamel Husain <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

* add to bert * review comments * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <[email protected]> * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <[email protected]> * self.config.problem_type * fix style * fix * fin * fix * update doc * fix * test * Test more problem types * Update src/transformers/configuration_utils.py Co-authored-by: Sylvain Gugger <[email protected]> * fix * remove * fix * quality * make fix-copies * remove test Co-authored-by: abhishek thakur <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Lysandre <[email protected]>

* Fix tests * Reorganize * Update tests/test_modeling_mobilebert.py * Remove unnecessary addition

* Make quality scripts work when one backend is missing. * Check env variable is properly set * Add default * With print statements * Fix typo * Set env variable * Remove debug code

* add flax roberta * make style * correct initialiazation * modify model to save weights * fix copied from * fix copied from * correct some more code * add more roberta models * Apply suggestions from code review * merge from master * finish * finish docs Co-authored-by: Patrick von Platen <[email protected]>

* removed all old code * make quality

* add electra model to flax * Remove Electra Next Sentence Prediction model added by mistake * fix parameter sharing and loosen equality threshold * fix styling issues * add mistaken removen imports * fix electra table * Add FlaxElectra to automodels and fixe docs * fix issues pointed out the PR * fix flax electra to comply with latest changes * remove stale class * add copied from Co-authored-by: Patrick von Platen <[email protected]>

* Set generator in dataloader * Use generator in all random samplers * Checkpoint all RNG states * Final version * Quality * Test * Address review comments * Quality * Remove debug util * Add python and numpy RNGs * Split states in different files in distributed * Quality * local_rank for TPUs * Only use generator when accepted * Add test * Set seed to avoid flakiness * Make test less flaky * Quality

* document resume randomness * fix link * reword * fix * reword * style

… recipe (#11591) * add importlib_metadata as dependency (#11490) Co-authored-by: Deepali Chourasia <[email protected]> * add huggingface_hub dependency Co-authored-by: Deepali Chourasia <[email protected]>

* lazy_init_weights * remove ipdb * save int * add necessary code * remove unnecessary utils * Update src/transformers/models/t5/modeling_t5.py * clean * add tests * correct * finish tests * finish tests * fix some more tests * fix xlnet & transfo-xl * fix more tests * make sure tests are independent * fix tests more * finist tests * final touches * Update src/transformers/modeling_utils.py * Apply suggestions from code review * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <[email protected]> * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <[email protected]> * clean tests * give arg positive name * add more mock weights to xlnet Co-authored-by: Stas Bekman <[email protected]>

sgugger and others added 30 commits March 31, 2021 09:36

Add more metadata to the user agent (#10972)

d0b3797

* Add more metadata to the user agent * Fix typo * Use DISABLE_TELEMETRY * Address review comments * Use global env * Add clean envs on circle CI

add notebook (#10995)

b6dddda

add blog to docs (#10997)

01068ab

Update training_args.py (#11000)

455f817

In the group by length documentation length is misspelled as legnth

Improve the speed of adding tokens from added_tokens.json (#10780)

af67322

* use bisect to add one token to unique_no_split_tokens * fix style

minor typo fix

f4ad3d8

*negative* log-likelihood

[doc] no more bucket

e8da77d

added new notebook and merge of trainer (#11015)

34e1bec

* added new notebook and merge of trainer * Update docs/source/sagemaker.md Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

fixed typo: logging instead of logger (#11025)

335c0ca

Add a script to check inits are consistent (#11024)

b0d49fd

s|Pretrained|PreTrained| (#11048)

3d39226

[doc] update code-block rendering (#11053)

6e31014

double : prevents code-block section to be rendered, so made it single :

Pin docutils (#11062)

ef62f03

* Pin docutils * Versions table

Remove unnecessary space (#11060)

773e4c7

Some models have no tokenizers (#11064)

eb3479e

Add example for registering callbacks with trainers (#10928)

e1c02e0

* Add example for callback registry Resolves: #9036 * Update callback registry documentation * Added comments for other ways to register callback

Add center_crop to ImageFeatureExtractoMixin (#11066)

090e3e6

Document common config attributes (#11070)

f05a8a0

Fix distributed gather for tuples of tensors of varying sizes (#11071)

04ceee7

Make a base init in FeatureExtractionMixin (#11074)

2199608

kylie-box and others added 26 commits May 2, 2021 10:10

Fixed docs for the shape of scores in generate() (#10057)

9802086

* Fixed the doc for the shape of return scores tuples in generation_utils.py. * Fix the output shape of `scores` for `DecoderOnlyOutput`. * style fix

Fix examples in M2M100 docstrings (#11540)

a5d2967

Replaces `tok` with `tokenizer` so examples can run with copy-paste

[Flax BERT/Roberta] few small fixes (#11558)

623281a

* small fixes * style

[Wav2Vec2] Fix convert (#11562)

c448c01

* push * small change * correct other typo

Remove datasets submodule. (#11563)

1c86157

fix the mlm longformer example by changing [MASK] to <mask> (#11559)

6a11e4c

Fix metric computation in run_glue_no_trainer (#11569)

87dd1a0

Fixes a useless warning. (#11566)

1e8e068

Fixes #11525

Accumulate opt state dict on do_rank 0 (#11481)

f4c9a7e

fix resize_token_embeddings (#11572)

7c62248

Enable added tokens (#11325)

09b0bcf

* Fix tests * Reorganize * Update tests/test_modeling_mobilebert.py * Remove unnecessary addition

Make quality scripts work when one backend is missing. (#11573)

2ce0fb8

* Make quality scripts work when one backend is missing. * Check env variable is properly set * Add default * With print statements * Fix typo * Set env variable * Remove debug code

Removes SageMakerTrainer code but keeps class as wrapper (#11587)

226e74b

* removed all old code * make quality

[trainer] document resume randomness (#11588)

c065025

* document resume randomness * fix link * reword * fix * reword * style

copies need to be fixed too (#11585)

bf0dfa9

add importlib_metadata and huggingface_hub as dependency in the conda…

83e59d8

… recipe (#11591) * add importlib_metadata as dependency (#11490) Co-authored-by: Deepali Chourasia <[email protected]> * add huggingface_hub dependency Co-authored-by: Deepali Chourasia <[email protected]>

Skip Funnel test

8fa8e19

Accept tensorflow-rocm package when checking TF availability (#11595)

864c1df

will-rice merged commit 412e295 into will-rice:master May 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accept Upstream Changes #1

Accept Upstream Changes #1

will-rice commented May 6, 2021

Accept Upstream Changes #1

Accept Upstream Changes #1

Conversation

will-rice commented May 6, 2021