[Feature] Add device filter for selecting or excluding a subset of devices #1189

Open
wants to merge 3 commits into
base: main
from

manhtukhang
Fix

Description

Selecting or excluding specific devices on specific nodes is a common use case when users want to reserve some GPUs for purposes outside Kubernetes, or for testing and debugging.

This PR introduces new flags (and configuration options) for this feature.
Combined with Per-Node Configuration With a Node Label, it allows more granular control over selecting or excluding devices on specific nodes.
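
As a rough illustration of the intended behavior, here is a minimal Go sketch of a select/exclude filter over device IDs. It is not the code from this PR: the function name `filterDevices`, the device ID strings, and the precedence rules (an empty select list means "all devices", and exclusion wins over selection) are all assumptions made for the example.

```go
package main

import "fmt"

// filterDevices is a hypothetical helper, not the PR's implementation.
// Assumed semantics: an empty select list keeps every device, and a
// device named in the exclude list is dropped even if it was selected.
func filterDevices(all, selected, excluded []string) []string {
	sel := make(map[string]bool, len(selected))
	for _, id := range selected {
		sel[id] = true
	}
	exc := make(map[string]bool, len(excluded))
	for _, id := range excluded {
		exc[id] = true
	}

	kept := []string{}
	for _, id := range all {
		if len(sel) > 0 && !sel[id] {
			continue // a select list was given and this device is not on it
		}
		if exc[id] {
			continue // explicitly excluded
		}
		kept = append(kept, id)
	}
	return kept
}

func main() {
	// Illustrative IDs only; real devices would be identified by index or UUID.
	devices := []string{"GPU-0", "GPU-1", "GPU-2", "GPU-3"}

	// Keep everything except one GPU reserved for use outside Kubernetes.
	fmt.Println(filterDevices(devices, nil, []string{"GPU-3"}))

	// Select a subset, then exclude one member of it.
	fmt.Println(filterDevices(devices, []string{"GPU-0", "GPU-1"}, []string{"GPU-1"}))
}
```

With a per-node configuration, each node would presumably carry its own select and exclude lists, so the same filter could yield different device sets on different nodes.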

copy-pr-bot bot commented Mar 5, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@manhtukhang manhtukhang changed the title [WIP][Feature] Add device filter for selecting or excluding a subset of devices [Feature] Add device filter for selecting or excluding a subset of devices Mar 6, 2025
@manhtukhang
Author

Hi @elezar @ArangoGutierrez,

We are an NVIDIA customer and have a real use case for filtering devices on specific nodes.
Could you please review this Feature PR?

Many thanks!

Signed-off-by: manhtukhang <[email protected]>
2 participants