Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement GPU support #1972

Closed
5 of 6 tasks
muhamadazmy opened this issue Jun 1, 2023 · 1 comment · Fixed by #1973
Closed
5 of 6 tasks

Implement GPU support #1972

muhamadazmy opened this issue Jun 1, 2023 · 1 comment · Fixed by #1973

Comments

@muhamadazmy
Copy link
Member

muhamadazmy commented Jun 1, 2023

Limitations:

  • We start by supporting only one (and only one) GPU for now, while keeping in mind that more GPU can be available in zos node in the future

Changes

  • Identify and list all attached GPUs
  • On registration, a node need to update its set of features on the chain (depends on chain change extended Node features  tfchain#720)
  • ZMachine can have an extra gpu flag (list of GPUs)
    • This is only accepted if node is dedicated (otherwise an error)
    • A GPU can be only used by one ZMachine
  • Expose the gpu device to the VM (based on research here GPU Support #1293 (comment))
@muhamadazmy muhamadazmy moved this to In Progress in 3.11.x Jun 1, 2023
@muhamadazmy
Copy link
Member Author

muhamadazmy commented Jun 1, 2023

Sub-tasks

will be updated regularly

  • Identify GPU devices (scan the system PCI and filter via class, then figure out the vendor/device names

@github-project-automation github-project-automation bot moved this from In Progress to Done in 3.11.x Jun 19, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Status: Done
Development

Successfully merging a pull request may close this issue.

1 participant