Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test impact of symmetric vs. affine quantization #4

Open
etrommer opened this issue Sep 20, 2023 · 2 comments
Open

Test impact of symmetric vs. affine quantization #4

etrommer opened this issue Sep 20, 2023 · 2 comments
Assignees
Labels
research Experiments and Evaluations

Comments

@etrommer
Copy link
Owner

TorchApprox has been extended to use PyTorch's internal affine quantization: https://github.com/etrommer/torch-approx/tree/feature/torchquant

It needs to be benchmarked in order to assess whether this improves inference accuracy.

Experimental setups:

  • 8-bit affine vs. 8-bit symmetric for a choice of networks
  • lower bitwidth (4-bit?) as it is expected to produce more pronounced results
@etrommer etrommer added the research Experiments and Evaluations label Sep 20, 2023
@etrommer etrommer self-assigned this Sep 20, 2023
@etrommer
Copy link
Owner Author

etrommer commented Oct 13, 2023

231013_affine_vs_per_tensor_lenet5.csv

  • Results for LeNet5/MNIST
  • 2,3,4-Bit EvoApprox multipliers
  • Default gradient_clip_val = 0.5
  • 3-Bit multipliers rerun with gradient_clip_val = 1.0

@etrommer
Copy link
Owner Author

231016_affine_vs_per_tensor_resnet8.csv

  • Results for ResNet8/CIFAR10
  • 8x4-Bit EvoApprox multipliers
  • Default gradient_clip_val = 0.5

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
research Experiments and Evaluations
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant