
Implement generate_vbe_metadata cpu #3715

Open

wants to merge 2 commits into base: main
Conversation

@spcyppt (Contributor) commented Feb 19, 2025

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/796

This diff implements `generate_vbe_metadata` for CPU, so that the function returns the same output on CPU, CUDA, and MTIA.

To support VBE on CPU with the existing fixed-batch-size CPU kernel, we need to recompute offsets, which was previously done in Python. This diff moves the offsets recomputation into C++, so that all manipulations happen in C++.

Note that reshaping offsets and grad_input to work with the existing fixed-batch-size CPU kernels is done in the Autograd function instead of the wrapper, to avoid redundant computation.

VBE CPU tests are in the next diff.
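To illustrate the idea, here is a hypothetical sketch of the kind of offsets recomputation VBE-on-CPU needs: padding each feature's ragged offsets out to the maximum batch size so a fixed-batch-size kernel can consume them. The function name, CSR-style layout, and padding convention are illustrative assumptions, not FBGEMM's actual implementation.

```python
def recompute_offsets_fixed_batch(offsets, batch_sizes):
    """Pad variable-batch-size (VBE) offsets to a fixed-batch-size layout.

    offsets: CSR-style offsets over all (feature, sample) pairs,
             length sum(batch_sizes) + 1.
    batch_sizes[f]: batch size of feature f.

    Returns offsets of length len(batch_sizes) * max_B + 1, where padded
    samples are empty bags (their offset repeats the feature's last offset).
    """
    max_b = max(batch_sizes)
    fixed = [offsets[0]]
    pos = 0  # start index of the current feature's samples in `offsets`
    for b in batch_sizes:
        for s in range(max_b):
            # real sample -> its own end offset; padding -> feature's end
            fixed.append(offsets[pos + min(s + 1, b)])
        pos += b
    return fixed
```

For example, with batch sizes `[1, 3]` and offsets `[0, 2, 3, 5, 6]`, the result is `[0, 2, 2, 2, 3, 5, 6]`: feature 0 is padded with two empty bags so both features present three samples to the kernel.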

Reviewed By: sryap

Differential Revision: D69162870

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D69162870

netlify bot commented Feb 19, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

🔨 Latest commit: ae43025
🔍 Latest deploy log: https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/67b912405fa47900086acaa7
😎 Deploy Preview: https://deploy-preview-3715--pytorch-fbgemm-docs.netlify.app

spcyppt added a commit to spcyppt/FBGEMM that referenced this pull request Feb 21, 2025

spcyppt added a commit to spcyppt/FBGEMM that referenced this pull request Feb 21, 2025
