Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU Deployment in Milvus 2.4 #263

Open
qchenzi opened this issue Feb 11, 2025 · 3 comments
Open

GPU Deployment in Milvus 2.4 #263

qchenzi opened this issue Feb 11, 2025 · 3 comments
Labels
documentation Improvements or additions to documentation
Milestone

Comments

@qchenzi
Copy link

qchenzi commented Feb 11, 2025

I am considering deploying Milvus 2.4 in K8S and would like to confirm the feasibility of the following configuration:

  • GPU Nodes: Deploying only the IndexNode and QueryNode components on GPU-equipped nodes.
  • Non-GPU Nodes: Deploying the remaining components—DataNode, MixCoord, Proxy—on nodes without GPU capabilities.

This setup aims to leverage GPU acceleration for indexing and query processing while utilizing non-GPU nodes for other tasks. Could you please confirm if this configuration is supported in Milvus.

Thank you for your assistance.

@haorenfsa
Copy link
Collaborator

Yes, it's supported.
By the way, 2 things to be noted:

  1. Check our prerequisite guides for setup in https://milvus.io/docs/prerequisite-gpu.md.
  2. Use GPU Index Types, otherwise Milvus will still use CPU for Indexing https://milvus.io/docs/gpu_index.md

@haorenfsa haorenfsa added the documentation Improvements or additions to documentation label Feb 11, 2025
@haorenfsa haorenfsa added this to the v1.3.0 milestone Feb 11, 2025
@haorenfsa
Copy link
Collaborator

  • Add docs for GPU deployment using milvus-operator

@qchenzi
Copy link
Author

qchenzi commented Feb 17, 2025

Thank you for confirming that the configuration is supported.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

2 participants