Ending on January 8th, the Tetralith login and compute nodes were upgraded from CentOS 7 to Rocky Linux 9.
The changes were kept to a minimum. You can still use SSH or Thinlinc to log in, you can still submit your jobs to Slurm, etc.
Storage (/proj, /home) are not affected (i.e no need to move your data).
To continue using Tetralith without interruption, you should move your jobs to the upgraded part of Tetralith sometime between November 13th and January 8th.
You yourself choose when to do this, but we recommend that you do it sooner rather than later to leave as much time as possible for you (and possibly NSC) to fix any problems you may encounter.
Typically, it will look something like this:
Once your jobs are working in the upgraded part of Tetralith, please stop using the non-upgraded part. NSC will monitor the demand for nodes in both parts of the system and adjust the number of nodes in each part accordingly, i.e it will not be easier to run jobs in the old part.
This section covers how to get software you have been running on CentOS 7 to run on Rocky Linux 9.
Most applications provided by NSC will continue to work.
Initially (i.e on November 13th) we will provide the latest and most commonly used versions of the most common applications.
Additional applications and versions will be added later, but only if requested via NSC Support.
If you don’t find your application/version in the output from module
avail
(on upgraded nodes) and it’s not mentioned on this page
already, please contact NSC Support and ask about it.
Some of the NSC-provided applications will run natively in the new operating system, some will have been recompiled and others will be run in a CentOS 7 container using Apptainer. We aim to make all this as transparent to the user as possible, but minor adjustments to job scripts may be needed (and documented on this page).
If you or someone in your project has built or installed your own application you will need to choose a suitable way to run the application in the new environment.
There are several ways to do this (recompile, run as-is, run in a container). Which one to use depends on the application. NSC Support can assist you with this. Some documentation is provided below, more will follow later.
Inside a supercomputer job, please try the following steps
(i.e., inside a submit script, or after issuing interactive
).
Note that it is a good idea to verify not just that the application
starts, but also to try an example job to check that it performs as expected.
Run your application as usual, i.e., as mpprun <application>
.
Some applications just work without any additional steps.
However, you may see error messages about “missing symbols” or
“missing libraries”. In that case, go to the next step.
Try mpprun --compat el7 <application>
. This runs your application
inside a “compatibility configuration” designed to mimic CentOS 7
as closely as possible. The compatibility consists of running your application,
leveraging modern OS features, inside an NSC provided OS container image of
the old CentOS 7 OS mimicking the old software environment to very high fidelity.
(Note: if you have already recompiled your binary on Rocky 9, you should NOT run it with the --compat el7
flag.)
If step 2 also fails, the next step is to try to rebuild
the application with one of the build environments provided
in the Rocky Linux 9 environment. Load an appropriate buildenv-<something>
module and follow the usual instructions for how to install software.
Note: make sure to completely rebuild the application, i.e., do a make dist-clean
, make clean
, or equivalent as the first step to make sure all components are rebuilt from scratch.
If you or your project have installed a software application that you like to use on the login node for data analysis etc., please try the following steps:
Try to run your application as usual, e.g., as ./<application>
.
Some applications just work without any additional steps.
However, you may see error messages about “missing symbols” or
“missing libraries”. In that case, go to the next step.
hpc_compat_el7 <application>
. This runs your application
inside a “compatibility configuration” designed to mimic CentOS 7
as closely as possible. The compatibility solution is the same as that
used for mpprun --compat el7
.
(Note: if you have already recompiled your binary on Rocky 9, you should NOT run it via hpc_compat_el7
.)
`buildenv-<something>
module and follow the usual instructions for how to install software.
Note: make sure to completely rebuild the application, i.e., do a make dist-clean
, make clean
, or equivalent as the first step to make sure all components are rebuilt from scratch.If your application includes a GUI:
2D graphics: test using the “compatibility configuration” (step 2 above), if this fails please try to use an interactive session. If this also fails you may have to rebuild your application (step 3 above).
3D graphics: Your application will have to be reinstalled (step 3 above)
The Rocky Linux 9 environment provides Anaconda and Condaforge modules that work the same way as those available in CentOS 7. You should be able to load these modules and just keep using the conda environments you installed under CentOS 7.
If you’d rather use the modules provided under Python/
Note that the mpprun
application differs substantially on Rocky Linux 9 compared to CentOS 7.
Use mpprun -h
to see the help, and the option mpprun -i <binary>
can be helpful for power
users to understand more precisely how mpprun
will launch a binary through the various
MPI-specific launchers.
The old build environments of CentOS 7, buildenv-<something>/<old-version>
, will not be carried over to EL9, but there will be newer version replacement modules available corresponding to them. If you can not make use of these refreshed modules, please contact NSC Support for assistance on your migration to these environments.
NSC has run applications on the upgraded part of Tetralith (both recompiled, run as-is, and run through a CentOS 7 container) and not seen any significant negative performance impact. In fact, some applications are even running faster.
If you see a significant performance loss in the upgraded part of Tetralith, please let NSC Support know as soon as possible.
Users can choose which part to use by logging in to the corresponding login node (using SSH or Thinlinc):
The “devel” reservation will contain CentOS 7 nodes but will be removed ON December 12th. Halfway through the migration window there should be no or very little need for test and development of CentOS 7 jobs.
For Rocky 9, nodes for test and development will be available in a reservation named “now”. The same policy (test and development only, max wall time 1h, max 64 cores per user) as for CentOS 7 apply (but might be relaxed later on).
The “lsda” nodes were upgraded to EL9 November 13th.
The number of upgraded compute nodes will gradually increase during the upgrade window. The rate at which this happens will be adjusted based on how fast users are moving their jobs.
The current operating system CentOS 7 (which is based on RedHat Enterprise Linux 7) will not receive any security updates after 2024-06-30.
As Tetralith is planned to continue operating longer than this, possibly until late 2025, we need to upgrade the operating system.
The Sigma cluster was upgraded December 8th.
Known issues regarding the Tetralith OS upgrade to EL9
Guides, documentation and FAQ.
Applying for projects and login accounts.