A Bhat edited this page Apr 14, 2024 · 11 revisions
- Google: American multinational technology company that specializes in Internet-related services and products
Small CPU models that are embedded in application servers, invoked within the application process instead of via RPC/REST calls, remain critical for some of Google’s business workloads from users like Ads.
Source: https://blog.tensorflow.org/2022/02/tfrt-progress-update.html
Name | Program | Size (MB) (Avg/Max) | Programs | Std Dev | Range |
---|---|---|---|---|---|
MLP0 | - | 580/2500 | 27 | ±17 | 1-93 |
MLP1 | RankBrain | 90/NA | 5 | ±0.3 | 1-5 |
CNN0 | AlphaZero | 60/454 | 1 | - | 1 |
CNN1 | Image Classification Model | 120/680 | 6 | ±10 | 1-34 |
RNN0 | Transformer encoder + LSTM decoder | 1300/1300 | 13 | ±3 | 1-29 |
RNN1 | Wave RNN | 120/400 | 1 | - | - |
BERT0 | - | 3000/3000 | 9 | ±2 | 1-14 |
BERT1 | DeepRank | 90/NA | 5 | ±0.3 | 1-5 |

Type | % use in 2016 | % use in 2020 |
---|---|---|
MLP | 61% | 25% |
CNN | 5% | 18% |
RNN | 0% | 29% |
BERT | 0% | 28% |
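
The shift in workload mix between the two years is easier to read as percentage-point changes. The following is a minimal Python sketch that only restates the numbers from the table above; the names `share_2016`, `share_2020`, and `share_change` are made up for illustration. Note that the four listed types sum to 66% in 2016 and 100% in 2020, and the table does not say what made up the rest in 2016.

```python
# Workload share by model type, copied from the table above (percent).
share_2016 = {"MLP": 61, "CNN": 5, "RNN": 0, "BERT": 0}
share_2020 = {"MLP": 25, "CNN": 18, "RNN": 29, "BERT": 28}


def share_change(before, after):
    """Return the percentage-point change for each model type."""
    return {name: after[name] - before[name] for name in before}


change = share_change(share_2016, share_2020)

# Totals covered by the four listed types in each year.
listed_2016 = sum(share_2016.values())
listed_2020 = sum(share_2020.values())

for name, delta in change.items():
    print(f"{name}: {delta:+d} percentage points")
print(f"Listed types cover {listed_2016}% in 2016 and {listed_2020}% in 2020")
```

Read this way, MLP loses 36 points of share while RNN and BERT, both absent from the 2016 column, together account for 57% in 2020.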