Skip to content

bump existing SPH-EXA easyconfigs to v0.96.2#25606

Merged
ocaisa merged 4 commits into
easybuilders:developfrom
pescobar:sph-exa-0.96.2
Apr 8, 2026
Merged

bump existing SPH-EXA easyconfigs to v0.96.2#25606
ocaisa merged 4 commits into
easybuilders:developfrom
pescobar:sph-exa-0.96.2

Conversation

@pescobar
Copy link
Copy Markdown
Member

@pescobar pescobar commented Mar 25, 2026

@github-actions github-actions Bot added 2023b 2025b issues & PRs related to 2025b common toolchains update labels Mar 25, 2026
@github-actions
Copy link
Copy Markdown

Updated software SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb

Diff against SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb

easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb

diff --git a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb
index 08c6c5e672..139574fbbc 100644
--- a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb
+++ b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb
@@ -6,29 +6,29 @@
 easyblock = 'CMakeMake'
 
 name = 'SPH-EXA'
-version = '0.96.1'
+version = '0.96.2'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/sphexa-org/sphexa'
 description = """ SPH-EXA is a C++20 simulation code for hydrodynamics simulations
 (with gravity and other physics), parallelized with MPI, OpenMP, CUDA, and HIP."""
 
-toolchain = {'name': 'foss', 'version': '2025b'}
+toolchain = {'name': 'foss', 'version': '2023b'}
 toolchainopts = {'usempi': True}
 
 source_urls = ['https://github.com/sphexa-org/sphexa/archive/refs/tags/']
 sources = ['v%(version)s.tar.gz']
-checksums = ['3b7bf2c2b24d5d00e1eb66d74820888a87a38e017272d2c5977c4aa82f16368e']
+checksums = ['2ff6edb422eadf47634f98d20258458c82374efe617de0d0c3d5bb3d7945be23']
 
 builddependencies = [
-    ('CMake', '4.0.3'),
+    ('CMake', '3.27.6'),
 ]
 
 dependencies = [
-    ('HDF5', '1.14.6'),
-    ('CUDA', '12.9.1', '', SYSTEM),
-    ('UCX-CUDA', '1.19.0', versionsuffix),
-    ('UCC-CUDA', '1.4.4', versionsuffix),
+    ('HDF5', '1.14.3'),
+    ('CUDA', '12.4.0', '', SYSTEM),
+    ('UCX-CUDA', '1.15.0', versionsuffix),
+    ('UCC-CUDA', '1.2.0', versionsuffix),
 ]
 
 cuda_compute_capabilities = ['8.0', '8.6', '8.9', '9.0']
Diff against SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb

easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb

diff --git a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb
index 3c284fee21..139574fbbc 100644
--- a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb
+++ b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb
@@ -6,7 +6,7 @@
 easyblock = 'CMakeMake'
 
 name = 'SPH-EXA'
-version = '0.96.1'
+version = '0.96.2'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/sphexa-org/sphexa'
@@ -18,7 +18,7 @@ toolchainopts = {'usempi': True}
 
 source_urls = ['https://github.com/sphexa-org/sphexa/archive/refs/tags/']
 sources = ['v%(version)s.tar.gz']
-checksums = ['3b7bf2c2b24d5d00e1eb66d74820888a87a38e017272d2c5977c4aa82f16368e']
+checksums = ['2ff6edb422eadf47634f98d20258458c82374efe617de0d0c3d5bb3d7945be23']
 
 builddependencies = [
     ('CMake', '3.27.6'),

Updated software SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb

Diff against SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb

easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb

diff --git a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb
index 08c6c5e672..0aeaf80db2 100644
--- a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2025b-CUDA-12.9.1.eb
+++ b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb
@@ -6,7 +6,7 @@
 easyblock = 'CMakeMake'
 
 name = 'SPH-EXA'
-version = '0.96.1'
+version = '0.96.2'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/sphexa-org/sphexa'
@@ -18,7 +18,7 @@ toolchainopts = {'usempi': True}
 
 source_urls = ['https://github.com/sphexa-org/sphexa/archive/refs/tags/']
 sources = ['v%(version)s.tar.gz']
-checksums = ['3b7bf2c2b24d5d00e1eb66d74820888a87a38e017272d2c5977c4aa82f16368e']
+checksums = ['2ff6edb422eadf47634f98d20258458c82374efe617de0d0c3d5bb3d7945be23']
 
 builddependencies = [
     ('CMake', '4.0.3'),
Diff against SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb

easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb

diff --git a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb
index 3c284fee21..0aeaf80db2 100644
--- a/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.1-foss-2023b-CUDA-12.4.0.eb
+++ b/easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb
@@ -6,29 +6,29 @@
 easyblock = 'CMakeMake'
 
 name = 'SPH-EXA'
-version = '0.96.1'
+version = '0.96.2'
 versionsuffix = '-CUDA-%(cudaver)s'
 
 homepage = 'https://github.com/sphexa-org/sphexa'
 description = """ SPH-EXA is a C++20 simulation code for hydrodynamics simulations
 (with gravity and other physics), parallelized with MPI, OpenMP, CUDA, and HIP."""
 
-toolchain = {'name': 'foss', 'version': '2023b'}
+toolchain = {'name': 'foss', 'version': '2025b'}
 toolchainopts = {'usempi': True}
 
 source_urls = ['https://github.com/sphexa-org/sphexa/archive/refs/tags/']
 sources = ['v%(version)s.tar.gz']
-checksums = ['3b7bf2c2b24d5d00e1eb66d74820888a87a38e017272d2c5977c4aa82f16368e']
+checksums = ['2ff6edb422eadf47634f98d20258458c82374efe617de0d0c3d5bb3d7945be23']
 
 builddependencies = [
-    ('CMake', '3.27.6'),
+    ('CMake', '4.0.3'),
 ]
 
 dependencies = [
-    ('HDF5', '1.14.3'),
-    ('CUDA', '12.4.0', '', SYSTEM),
-    ('UCX-CUDA', '1.15.0', versionsuffix),
-    ('UCC-CUDA', '1.2.0', versionsuffix),
+    ('HDF5', '1.14.6'),
+    ('CUDA', '12.9.1', '', SYSTEM),
+    ('UCX-CUDA', '1.19.0', versionsuffix),
+    ('UCC-CUDA', '1.4.4', versionsuffix),
 ]
 
 cuda_compute_capabilities = ['8.0', '8.6', '8.9', '9.0']

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Mar 25, 2026

The previous versions were only merged last week and are not in an EB release, let's remove those as part of this PR so we just retain the latest version?

@github-actions github-actions Bot added change and removed update labels Mar 25, 2026
@pescobar
Copy link
Copy Markdown
Member Author

@ocaisa I have removed previous release as suggested

ocaisa
ocaisa previously approved these changes Mar 25, 2026
Copy link
Copy Markdown
Member

@ocaisa ocaisa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM these original versions were only merged last week so safe to replace

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Mar 25, 2026

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Copy Markdown
Collaborator

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25606 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25606 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 10085

Test results coming soon (I hope)...

Details

- notification for comment with ID 4125432886 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@pescobar
Copy link
Copy Markdown
Member Author

@ocaisa is this one ok?

I will update the EESSI PRs as soon as this one gets merged

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Mar 25, 2026

Just waiting on the bot to report back, I guess build jobs are backed up

@boegel boegel changed the title Add latest release SPH-EXA bump existing SPH-EXA easyconfigs to v0.96.2 Mar 25, 2026
boegel
boegel previously approved these changes Mar 25, 2026
Copy link
Copy Markdown
Member

@boegel boegel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@boegel
Copy link
Copy Markdown
Member

boegel commented Mar 25, 2026

Just waiting on the bot to report back, I guess build jobs are backed up

That's going to take a while, the only GPU node we have in jsc-zen3 is chewing on PyTorch 2.9.1, see #24365 (comment)

I'll submit a test report from our GPU cluster so this can be merged...

@boegel boegel added this to the next release (5.2.2?) milestone Mar 25, 2026
@pescobar
Copy link
Copy Markdown
Member Author

I'll submit a test report from our GPU cluster so this can be merged...

Thank you!

@boegel
Copy link
Copy Markdown
Member

boegel commented Mar 25, 2026

Test report by @boegel
FAILED
Build succeeded for 0 out of 2 (total: 7 mins 40 secs) (2 easyconfigs in total)
node3900.accelgor.os - Linux RHEL 9.6, x86_64, AMD EPYC 7413 24-Core Processor (zen3), 1 x NVIDIA NVIDIA A100-SXM4-80GB, 590.48.01, Python 3.9.21
See https://gist.github.com/boegel/f9627cd096ba558c3d959cf58b8f21d7 for a full test report.

@boegel
Copy link
Copy Markdown
Member

boegel commented Mar 25, 2026

@pescobar

cannot find -lhdf5-static: No such file or directory

am I missing something?

@pescobar
Copy link
Copy Markdown
Member Author

@boegel I didn't try a local build because I was busy and I assumed such a small upgrade won't break anything but I guess I was wrong

I will debug it locally and reply here as soon as I get rid of my boring meetings

@boegelbot
Copy link
Copy Markdown
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 1 out of 2 (total: 23 mins 33 secs) (2 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.48.01, Python 3.9.25
See https://gist.github.com/boegelbot/4b78fd1c1694559c2687d9d3581429d5 for a full test report.

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Mar 26, 2026

Seems the dependency H5hut is missing (and built on the fly by CMake): https://github.com/sphexa-org/sphexa/blob/6a1b00e8d115e542318f598d7e9dfcdc34fcab79/cmake/Fetch_H5hut.cmake

@pescobar
Copy link
Copy Markdown
Member Author

I could reproduce the build error locally and I have reported it in sphexa-org/sphexa#587

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

We've had to patch for that issue in the past

-set (gatb-core-libraries gatbcore-static dl pthread z hdf5-static ${gatb-core-extra-libraries})
+set (gatb-core-libraries gatbcore-static dl pthread z hdf5 ${gatb-core-extra-libraries})

I wonder are they (or the h5hut dependency they pull in) vendoring hdf5?

Copy link
Copy Markdown
Member

@ocaisa ocaisa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's see if we can just disable using static libraries

Comment thread easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb Outdated
Comment thread easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb Outdated
Co-authored-by: ocaisa <alan.ocais@cecam.org>
@ocaisa ocaisa dismissed stale reviews from boegel and themself via 7e0ccbe April 8, 2026 14:38
@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Copy Markdown
Collaborator

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25606 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25606 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 10174

Test results coming soon (I hope)...

Details

- notification for comment with ID 4207075065 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Copy Markdown
Collaborator

Test report by @boegelbot
FAILED
Build succeeded for 1 out of 2 (total: 24 mins 4 secs) (2 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.48.01, Python 3.9.25
See https://gist.github.com/boegelbot/cae3657b771291b30699f0915f7383f3 for a full test report.

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

Ah, it was only the 2023b build that failed. In that case, I think CMake is the guilty party: https://github.com/Kitware/CMake/blob/v3.27.6/Modules/FindHDF5.cmake#L532-L536

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

Hmm, this looks like a smoking gun: H5hut/H5hut#5

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

That might actually be the fix that is needed: sphexa-org/sphexa#588

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

Ok, I found it. They are pulling in a branch at build time for their h5hut dependency https://github.com/eth-cscs/h5hut/commits/cmake/ , this should be (at least) pinned to a commit

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

@pescobar you need a patch to pin https://github.com/sphexa-org/sphexa/blob/develop/CMakeLists.txt#L87 to a commit (probably best is eth-cscs/h5hut@74cd84c)

@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

The other option is to disable h5hut altogether -DSPH_EXA_WITH_H5HUT=OFF.

Comment thread easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2023b-CUDA-12.4.0.eb Outdated
Comment thread easybuild/easyconfigs/s/SPH-EXA/SPH-EXA-0.96.2-foss-2025b-CUDA-12.9.1.eb Outdated
Co-authored-by: ocaisa <alan.ocais@cecam.org>
@ocaisa
Copy link
Copy Markdown
Member

ocaisa commented Apr 8, 2026

@boegelbot please test @ jsc-zen3-a100

@boegelbot
Copy link
Copy Markdown
Collaborator

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25606 EB_ARGS= EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25606 --ntasks=8 --partition=jsczen3g --gres=gpu:1 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 10178

Test results coming soon (I hope)...

Details

- notification for comment with ID 4207607569 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Copy Markdown
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (total: 36 mins 12 secs) (2 easyconfigs in total)
jsczen3g1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), 1 x NVIDIA NVIDIA A100 80GB PCIe, 590.48.01, Python 3.9.25
See https://gist.github.com/boegelbot/9af785bded7ca1143e6b833f478488fe for a full test report.

Copy link
Copy Markdown
Member

@ocaisa ocaisa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't like having easyconfigs in the release that we know are now broken. I'm going to merge this as is and @pescobar can follow up after with a PR that adds a specific commit for h5hut and re-enables support for that

@ocaisa ocaisa merged commit 236b292 into easybuilders:develop Apr 8, 2026
6 checks passed
@pescobar
Copy link
Copy Markdown
Member Author

pescobar commented Apr 8, 2026

@ocaisa thanks for finding where the problem is, I was expecting a fix from the developers and I didn't look at it

I have managed to build the application with cmake -DH5hut_VERSION=git.30ff892 .. to point to commit eth-cscs/h5hut@30ff892 which is the one right before the commit breaking the build eth-cscs/h5hut@0269c70

I am going to create a new PR with the working easyconfig

@pescobar
Copy link
Copy Markdown
Member Author

pescobar commented Apr 8, 2026

@ocaisa I have created a new PR #25693

Let me know if you want any change

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2023b 2025b issues & PRs related to 2025b common toolchains change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants