NNPACK
Ashok Bhat edited this page Nov 11, 2020 · 9 revisions
- An acceleration package for neural network computations
- https://github.com/Maratyszcza/NNPACK
- No longer actively developed
- Multiple algorithms for convolutional layers:
- Fast convolution based on Fourier transform (for kernels up to 16x16 without stride)
- Fast convolution based on Winograd transform (for 3x3 kernels without stride)
- Implicit matrix-matrix multiplication algorithm (no limitations)
- Direct convolution algorithm (for 1x1 kernels without stride)
- Multi-threaded SIMD-aware implementations of neural network layers
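The applicability limits listed for the convolution algorithms can be summarised in a small selector. This is only an illustrative sketch: `ConvShape`, `applicable_algorithms`, and the algorithm labels are hypothetical names, not part of NNPACK's API, and "without stride" is assumed to mean a stride of 1 in both dimensions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvShape:
    """Hypothetical description of a convolution layer (not an NNPACK type)."""
    kernel_h: int
    kernel_w: int
    stride_h: int = 1
    stride_w: int = 1


def applicable_algorithms(shape: ConvShape) -> list[str]:
    """Return the algorithm families whose stated limits admit this shape.

    The rules mirror the list above; the string labels are illustrative.
    """
    # Assumption: "without stride" means stride 1 in both dimensions.
    unit_stride = shape.stride_h == 1 and shape.stride_w == 1
    kernel = (shape.kernel_h, shape.kernel_w)

    algorithms = []
    # Fourier transform: kernels up to 16x16, without stride.
    if unit_stride and max(kernel) <= 16:
        algorithms.append("fourier")
    # Winograd transform: 3x3 kernels, without stride.
    if unit_stride and kernel == (3, 3):
        algorithms.append("winograd")
    # Implicit matrix-matrix multiplication: no limitations.
    algorithms.append("implicit_gemm")
    # Direct convolution: 1x1 kernels, without stride.
    if unit_stride and kernel == (1, 1):
        algorithms.append("direct")
    return algorithms


print(applicable_algorithms(ConvShape(3, 3)))        # ['fourier', 'winograd', 'implicit_gemm']
print(applicable_algorithms(ConvShape(5, 5, 2, 2)))  # ['implicit_gemm']
```

The sketch only reports which algorithms are permitted for a given shape; it says nothing about which one is fastest, which depends on the layer dimensions and the hardware.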
- Originally developed by Marat Dukhan of Georgia Tech with guidance from Facebook AI Research
- Marat Dukhan is now one of the main contributors to XNNPACK
A comment on GitHub:
"NNPACK is not actively developed anymore and XNNPACK is the successor and is in my experience faster (as well as having a better and more complete interface)"
Projects and products that have used NNPACK:
- PyTorch - on mobile, for inference in convolutional layers
- TVM - for inference in convolutional layers
- MXNet - for inference in convolutional, fully-connected, and max-pooling layers
- Facebook - in production
- Prisma - in the mobile app