Tags: taolei87/sru
Tags
Merge pull request asappresearch#191 from asappresearch/remove_rezero Update layer norm options
Merge pull request asappresearch#188 from asappresearch/torchscript_g… …pu_v2.5 support GPU inference in torchscript model for v2.5 / v2.6
Merge pull request asappresearch#184 from asappresearch/3.0.0-dev-tao update version
Merge pull request asappresearch#169 from asappresearch/3.0.0-dev-tao Speed up data loading / batching for ONE BILLION WORD experiment
PreviousNext