-
Notifications
You must be signed in to change notification settings - Fork 10.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Support RPC with -dev/-devd #10609
Comments
We should remove |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
Early register RPC devices and do not propagate RPC specifics in the llama model structures. ref: ggerganov#10609
Early register RPC devices and do not propagate RPC specifics in the llama model structures. ref: ggerganov#10609
PR #11262 will resolve this but note that
vs.
The same applies for |
Early register RPC devices and do not propagate RPC specifics in the llama model structures. ref: #10609
Prerequisites
Feature Description
-dev
/-devd
currently doesn't appear to work with RPC, due to RPC devices getting created later down the line:Motivation
I have one computer that can run a large model and fit nothing else. I have another computer that can fit a smaller draft model and run it pretty quickly, so it'd be pretty nice if I could run the draft model over RPC. To do so, I need to set
-dev
to my local machine's gpu, and-devd
to the system over RPC.Possible Implementation
RPC's device creation would need to happen much earlier, before the arguments for
-dev
are validated. I was trying to see if I could hack the feature in but wasn't sure how to approach it.The text was updated successfully, but these errors were encountered: