Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for provisioning GPUs from GCP #1794

Merged
merged 5 commits into from
Sep 11, 2024

Conversation

bdattoma
Copy link
Contributor

@bdattoma bdattoma commented Sep 10, 2024

Based on OCP documentation https://docs.openshift.com/container-platform/4.16/machine_management/creating_machinesets/creating-machineset-gcp.html#machineset-gcp-enabling-gpu-support_creating-machineset-gcp
- GPUs like T4, V100, etc must be attached to a N1 virtual machine
- higher profile GPUs like A100, H100, etc have their own virtual machine flavor, hence can be provisioned without attaching to the N1

PR validation (done locally, CI not yet supported):

  • Deploy a N1-attached (e.g., nvidia T4) GPU on GCP cluster: PASS
  • deploy a GPU node on GCP cluster (e.g., L4): PASS
  • (regression) deploy a GPU on AWS: PASS

Copy link
Contributor

Robot Results

✅ Passed ❌ Failed ⏭️ Skipped Total Pass %
535 0 0 535 100

@bdattoma bdattoma self-assigned this Sep 10, 2024
@bdattoma bdattoma added needs testing Needs to be tested in Jenkins enhancements Bugfixes, enhancements, refactoring, ... in tests or libraries (PR will be listed in release-notes) labels Sep 10, 2024
@apodhrad
Copy link
Contributor

Hi @bdattoma what is N1?

@bdattoma
Copy link
Contributor Author

Hi @bdattoma what is N1?

a series of virtual machine on GCP: https://cloud.google.com/compute/docs/general-purpose-machines#n1_machines

@bdattoma bdattoma added verified This PR has been tested with Jenkins and removed needs testing Needs to be tested in Jenkins labels Sep 11, 2024
@bdattoma bdattoma requested a review from kobihk September 11, 2024 11:22
Copy link

@bdattoma bdattoma merged commit 56b7ed4 into red-hat-data-services:master Sep 11, 2024
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancements Bugfixes, enhancements, refactoring, ... in tests or libraries (PR will be listed in release-notes) verified This PR has been tested with Jenkins
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants