v0.8.0 -> v0.9.0 #1452

myleott · 2019-12-03T22:30:28Z

Possibly breaking changes:

Set global numpy seed (4a7cd58)
Split in_proj_weight into separate k, v, q projections in MultiheadAttention (fdf4c3e)
TransformerEncoder returns namedtuples instead of dict (27568a7)

New features:

Add --fast-stat-sync option (e1ba32a)
Add --empty-cache-freq option (315c463)
Support criterions with parameters (ba5f829)

New papers:

Simple and Effective Noisy Channel Modeling for Neural Machine Translation (49177c9)
Levenshtein Transformer (86857a5, ...)
Cross+Self-Attention for Transformer Models (4ac2c5f)
Jointly Learning to Align and Translate with Transformer Models (1c66792)
Reducing Transformer Depth on Demand with Structured Dropout (dabbef4)
Unsupervised Cross-lingual Representation Learning at Scale (XLM-RoBERTa) (e23e5ea)
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension (a92bcda)
CamemBERT: a French BERT (b31849a)

Speed improvements:

Add CUDA kernels for LightConv and DynamicConv (f840564)
Cythonization of various dataloading components (4fc3953, ...)
Don't project mask tokens for MLM training (718677e)

facebook-github-bot

@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: Possibly breaking changes: - Set global numpy seed (4a7cd58) - Split `in_proj_weight` into separate k, v, q projections in MultiheadAttention (fdf4c3e) - TransformerEncoder returns namedtuples instead of dict (27568a7) New features: - Add `--fast-stat-sync` option (e1ba32a) - Add `--empty-cache-freq` option (315c463) - Support criterions with parameters (ba5f829) New papers: - Simple and Effective Noisy Channel Modeling for Neural Machine Translation (49177c9) - Levenshtein Transformer (86857a5, ...) - Cross+Self-Attention for Transformer Models (4ac2c5f) - Jointly Learning to Align and Translate with Transformer Models (1c66792) - Reducing Transformer Depth on Demand with Structured Dropout (dabbef4) - Unsupervised Cross-lingual Representation Learning at Scale (XLM-RoBERTa) (e23e5ea) - BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension (a92bcda) - CamemBERT: a French BERT (b31849a) Speed improvements: - Add CUDA kernels for LightConv and DynamicConv (f840564) - Cythonization of various dataloading components (4fc3953, ...) - Don't project mask tokens for MLM training (718677e) Pull Request resolved: facebookresearch#1452 Differential Revision: D18798409 Pulled By: myleott fbshipit-source-id: 860a0d5aaf7377c8c9bd63cdb3b33d464f0e1727

Summary: Pull Request resolved: fairinternal/fairseq-py#1452 Test Plan: Imported from OSS Reviewed By: lematt1991 Differential Revision: D25108462 Pulled By: myleott fbshipit-source-id: 3c17a9937a4c3edb69f64130dfd866c5f42a4aaf

Summary: Possibly breaking changes: - Set global numpy seed (4a7cd58) - Split `in_proj_weight` into separate k, v, q projections in MultiheadAttention (fdf4c3e) - TransformerEncoder returns namedtuples instead of dict (27568a7) New features: - Add `--fast-stat-sync` option (e1ba32a) - Add `--empty-cache-freq` option (315c463) - Support criterions with parameters (ba5f829) New papers: - Simple and Effective Noisy Channel Modeling for Neural Machine Translation (49177c9) - Levenshtein Transformer (86857a5, ...) - Cross+Self-Attention for Transformer Models (4ac2c5f) - Jointly Learning to Align and Translate with Transformer Models (1c66792) - Reducing Transformer Depth on Demand with Structured Dropout (dabbef4) - Unsupervised Cross-lingual Representation Learning at Scale (XLM-RoBERTa) (e23e5ea) - BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension (a92bcda) - CamemBERT: a French BERT (b31849a) Speed improvements: - Add CUDA kernels for LightConv and DynamicConv (f840564) - Cythonization of various dataloading components (4fc3953, ...) - Don't project mask tokens for MLM training (718677e) Pull Request resolved: facebookresearch/fairseq#1452 Differential Revision: D18798409 Pulled By: myleott fbshipit-source-id: 860a0d5aaf7377c8c9bd63cdb3b33d464f0e1727

Summary: Pull Request resolved: fairinternal/fairseq-py#1452 Test Plan: Imported from OSS Reviewed By: lematt1991 Differential Revision: D25108462 Pulled By: myleott fbshipit-source-id: 3c17a9937a4c3edb69f64130dfd866c5f42a4aaf

v0.8.0 -> v0.9.0

ecb1783

facebook-github-bot added the CLA Signed label Dec 3, 2019

facebook-github-bot reviewed Dec 3, 2019

View reviewed changes

facebook-github-bot closed this in df2f84c Dec 3, 2019

myleott deleted the v0.9.0 branch December 6, 2019 22:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.8.0 -> v0.9.0 #1452

v0.8.0 -> v0.9.0 #1452

myleott commented Dec 3, 2019

facebook-github-bot left a comment

v0.8.0 -> v0.9.0 #1452

v0.8.0 -> v0.9.0 #1452

Conversation

myleott commented Dec 3, 2019

facebook-github-bot left a comment

Choose a reason for hiding this comment