Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Changes to Lustre installation (v5) #42

Merged
merged 2 commits into from
Feb 15, 2024

Conversation

tpdownes
Copy link
Member

@tpdownes tpdownes commented Feb 14, 2024

The existing solution for installing lustre breaks whenever new point releases of Rocky linux are made because it upgrades the kernel packages without also upgrading rocky-release (which is what tells the OS whether it is 8.8 or 8.9 via the /etc/os-release file).

The new solution makes the assumption that, after upgrading the kernel and rocky-release packages that:

  • we are running the latest version of Rocky Linux (safe assumption)
  • that the latest-release of DDN's packaged lustre supports the latest version of Rocky linux (probably unsafe immediately after release)

This solution is "less unsafe" because, if DDN has not yet packaged a version of lustre for the latest Rocky, then Ansible will fail. The current solution will successfully install lustre, but for the older kernel it is no longer booting. So the failure mode will not be detected during image build.

The existing solution breaks whenever new point releases of Rocky linux
are made. The new solution makes the assumption that, after upgrading the
kernel and package for /etc/os-release, that we are running the latest
version of Rocky Linux AND that the latest-release of DDN's packaged
lustre supports the latest version of Rocky linux.
@tpdownes tpdownes merged commit ad6ba70 into GoogleCloudPlatform:v5 Feb 15, 2024
2 checks passed
@tpdownes tpdownes deleted the fix_v5_lustre branch February 15, 2024 00:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants