Publish NVIDIA ubi-9 version of container-toolkit docker image #412
Comments
@faiq as a matter of interest, have you tried running a
I have not. I can give it a try today.
@faiq the main reason to select a different image is the GLIBC version on the host. In the case of rocky8, I would assume that any of the images that we publish should be compatible. Otherwise, I would recommend the If you're seeing specific failures when using this, please let us know.
@elezar I'm trying to target rocky9, not 8.
Using containers, it looks like the glibc versions are off, and I think this would likely not work. This is the result of
this is the result for running
There's a version mismatch, and I think I would get an error similar to the one described in the issue NVIDIA/gpu-operator#72. It looks like these are built on the base cuda images are available and the
It seems like the process to add this OS support is straightforward: add some Dockerfiles and Make targets. Is this correct, and can I work on this? Thanks.
The issue is the MINIMUM glibc version. As long as the glibc version on the host is newer than that in the container, the NVIDIA Container Toolkit is expected to work.
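For illustration, the minimum-version rule described above can be sketched as a small check. This is only a sketch under stated assumptions, not something the toolkit ships: the helper names `parse_version` and `host_meets_minimum` are hypothetical, the version strings are illustrative values rather than output taken from this thread, and the check treats an equal version as satisfying the minimum.

```python
import platform


def parse_version(text):
    """Turn a dotted version string such as '2.34' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def host_meets_minimum(host_glibc, container_glibc):
    """Return True when the host glibc is at least as new as the container's.

    Assumption: the container's glibc acts as a minimum requirement,
    so an equal version on the host is treated as compatible.
    """
    return parse_version(host_glibc) >= parse_version(container_glibc)


if __name__ == "__main__":
    # platform.libc_ver() reports the libc the Python interpreter is linked
    # against; it returns empty strings when it cannot be determined.
    lib, version = platform.libc_ver()
    container_glibc = "2.28"  # illustrative value, not taken from the thread
    if lib == "glibc" and version:
        print(version, "meets minimum:", host_meets_minimum(version, container_glibc))
    else:
        print("could not determine the host glibc version")
```

Comparing the parsed tuples rather than the raw strings matters because a plain string comparison would rank "2.9" above "2.10".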
I see. The previous comments did not make it clear that this was a minimum version requirement. I'm going to try the ubi-8 container image today to see if it works. However, it's still very confusing to have these toolkit images map to the actual hosts that the binaries will get copied onto. It led me to believe that matching the OS major version, in this case ubi9 to rocky9, was a strict requirement for getting it to work. Is it possible to consolidate these images into just one now? There was talk about static linking as a solution. However, since a lot of older OSes (centos7.9/rhel7.9) are being deprecated in just a few months, would the minimum glibc version available in the ubuntu toolkit container be compatible with all the other systems?
Hey folks,
I am requesting that we get a container-toolkit docker image published for ubi-9 based systems. From my understanding, this would be straightforward: add a commit like this one to create targets for ubi9 05dd438. The base image seems to be available on NGC
nvcr.io/nvidia/cuda:12.3.2-base-ubi9
This would enable us to run gpu-operator on OSes like rocky-9, because it seems like there is a dependency on the host NVIDIA/gpu-operator#72 (comment)
Let me know if I can start working on it.