Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace FeedforwardBlock with a correct implementation #211

Merged
merged 4 commits into from
Apr 8, 2021
Merged

Conversation

mryab
Copy link
Member

@mryab mryab commented Apr 7, 2021

Previously, we had a larger version of the FFN layer that had more parameters and ReLU as an activation. This PR replaces the default FFN layer with an implementation used in ALBERT by the transformers library as a reasonable default.

@mryab mryab requested a review from justheuristic April 7, 2021 07:54
Copy link
Member

@justheuristic justheuristic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Request: changing FeedforwardBlock will alter the benchmarking performance reported here

image

Can you please update the performance stats there?

@mryab
Copy link
Member Author

mryab commented Apr 8, 2021

Re: benchmarks, we'll update the entire docs section when most of the pending server-side features are implemented and merged to master

@mryab mryab merged commit ca6d87a into master Apr 8, 2021
@mryab mryab deleted the correct_ffn branch April 8, 2021 09:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants