Recipes¶
A recipe is a description of all of the compilers and software packages to be installed, along with the configuration of modules and environment scripts that the stack will provide to users. A recipe comprises the following files in a directory:
`config.yaml`
: common configuration for the stack.

`compilers.yaml`
: the compilers provided by the stack.

`environments.yaml`
: environments that contain all the software packages.

`modules.yaml`
: optional module generation rules; follows the spec for Spack module configuration.

`packages.yaml`
: optional; defines external packages, following the spec for Spack package configuration.

`repo`
: optional custom Spack package definitions.

`extra`
: optional additional meta data to copy to the meta data of the stack.

`post-install`
: an optional script to run after Spack has been executed to build the stack.

`pre-install`
: an optional script to run before any packages have been built.
Configuration¶
```yaml
name: prgenv-gnu
store: /user-environment
spack:
  repo: https://github.com/spack/spack.git
  commit: releases/v0.20
modules: true
description: "HPC development tools for building MPI applications with the GNU compiler toolchain"
```
`name`
: a plain text name for the environment.

`store`
: the location where the environment will be mounted.

`spack`
: which Spack repository to use for installation.

`modules`
: optional; enable/disable module file generation (default `true`).

`description`
: optional; a string that describes the environment (default empty).
Compilers¶
Take an example configuration:
```yaml
bootstrap:
  spec: gcc@11
gcc:
  specs:
  - gcc@11
llvm:
  requires: gcc@11
  specs:
  - nvhpc@21.7
  - llvm@14
```
The compilers are built in multiple stages:
- bootstrap: A bootstrap gcc compiler is built using the system compiler (currently gcc 4.7.5).
    - `bootstrap:spec`: a single spec of the form `gcc@version`.
    - The selected version should have full support for the target architecture in order to build optimised gcc toolchains in step 2.
- gcc: The bootstrap compiler is then used to build the gcc version(s) provided by the stack.
    - `gcc:specs`: a list of at least one spec of the form `gcc@version`.
- llvm: (optional) The nvhpc and/or llvm toolchains are built using one of the gcc toolchains installed in step 2.
    - `llvm:specs`: a list of specs of the form `nvhpc@version` or `llvm@version`.
    - `llvm:requires`: the version of gcc from step 2 that is used to build the llvm compilers.
The first two steps are required, so that the simplest stack will provide at least one version of gcc compiled for the target architecture.
Note
Don't provide full specs, because the tool will insert "opinionated" specs for the target node type, for example:
- `nvhpc@21.7` generates `nvhpc@21.7 ~mpi~blas~lapack`
- `llvm@14` generates `llvm@14 +clang targets=x86 ~gold ^ninja@kitware`
- `gcc@11` generates `gcc@11 build_type=Release +profiled +strip`
Environments¶
The software packages to install using the compiler toolchains are configured as disjoint environments, each built with the same compiler, and configured with an optional implementation of MPI.
These are specified in the `environments.yaml` file.
For example, consider a workflow that has to build multiple applications - some of which require Fortran+OpenACC and others that are CPU-only C code that can be built with GCC. To provide a single Spack stack that meets the workflow's needs, we would create two environments, one for each of the `nvhpc` and `gcc` compiler toolchains:
```yaml
# A GCC-based programming environment
prgenv-gnu:
  compiler:   # ... compiler toolchain
  mpi:        # ... mpi configuration
  deprecated: # ... whether to allow usage of deprecated packages or not
  unify:      # ... configure Spack concretizer
  specs:      # ... list of packages to install
  variants:   # ... variants to apply to packages (e.g. +mpi)
  packages:   # ... list of external packages to use
  views:      # ... environment views to provide to users

# An NVIDIA programming environment
prgenv-nvgpu:
  # ... same structure as prgenv-gnu
```
In the following sections, we will explore each of the environment configuration fields in detail.
Compilers¶
The `compiler` field describes a list of compilers to use to build the software stack. Each compiler toolchain is specified with a `toolchain` and a `spec`.

Sometimes two compiler toolchains are required: for example, when using the `nvhpc` compilers there are often dependencies that can't be built with the NVIDIA compilers, or that are better built with GCC (for example `cmake`, `perl` and `netcdf-c`).

The example below uses the `nvhpc` compilers with `gcc@11.3`:
```yaml
compiler:
- toolchain: gcc
  spec: gcc@11.3
- toolchain: llvm
  spec: nvhpc@22.7
```
Note
If more than one version of gcc has been installed, use the same version that was used to install `nvhpc`.
Warning
Stackinator does not test or support using two versions of gcc in the same toolchain.
The order of the compilers is significant. The first compiler is the default, and the other compilers will only be used to build packages when explicitly added to a spec.
For example, in the recipe below, only `netcdf-fortran` will be built with the `nvhpc` toolchain, while the root specs `cmake` and `netcdf-c` and all dependencies will be built using the `gcc` toolchain.
```yaml
compiler:
- toolchain: gcc
  spec: gcc
- toolchain: llvm
  spec: nvhpc
specs:
- cmake
- netcdf-c
- netcdf-fortran%nvhpc
```
Note
This approach is typically used to build Fortran applications and packages with one toolchain (e.g. `nvhpc`), and all of the C/C++ dependencies with a different toolchain (e.g. `gcc`).
MPI¶
Stackinator can configure cray-mpich (CUDA, ROCm, or non-GPU aware) on a per-environment basis, by setting the `mpi` field in an environment.
Note
Future versions of Stackinator will support OpenMPI, MPICH and MVAPICH when (and if) they develop robust support for the HPE Slingshot 11 interconnect.
If the `mpi` field is not set, or is set to `null`, MPI will not be configured in an environment, as sketched below.
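A minimal sketch, assuming a hypothetical environment name `serial-env`:

```yaml
serial-env:
  mpi: null
  # ...
```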
To configure MPI without GPU support, set the `spec` field with an optional version, as in the sketch below.
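A sketch, assuming a hypothetical environment name `mpi-env`; the version is optional and purely illustrative:

```yaml
mpi-env:
  mpi:
    spec: cray-mpich@8.1.23
  # ...
```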
GPU-aware MPI can be configured by setting the optional `gpu` field to specify whether to support `cuda` or `rocm` GPUs:
```yaml
cuda-env:
  mpi:
    spec: cray-mpich
    gpu: cuda
  # ...

rocm-env:
  mpi:
    spec: cray-mpich
    gpu: rocm
  # ...
```
Alps
As new versions of cray-mpich are released with CPE, they are provided on Alps vClusters, via the Spack package repo in the CSCS cluster configuration repo. The following versions of cray-mpich are currently provided:
| cray-mpich | CPE   | notes                      |
|------------|-------|----------------------------|
| 8.1.29     | 24.03 | pre-release                |
| 8.1.28     | 23.12 | released 2023-12 (default) |
| 8.1.27     | 23.09 | released 2023-09           |
| 8.1.26     | 23.06 | released 2023-06           |
| 8.1.25     | 23.03 | released 2023-02-26        |
| 8.1.24     | 23.02 | released 2023-01-19        |
| 8.1.23     | 22.12 | released 2022-11-29        |
| 8.1.21.1   | 22.11 | released 2022-10-25        |
| 8.1.18.4   | 22.08 | released 2022-07-21        |
All versions of cray-mpich in the table have been validated on Alps vClusters with Slingshot 11 and libfabric 1.15.2.
Note
The `cray-mpich` spec is added to the list of package specs automatically, and all packages that use the virtual dependency `+mpi` will use this `cray-mpich`.
Specs¶
The list of software packages to install is configured in the `specs:` field of an environment. The specs follow standard Spack practice.
The `deprecated:` field controls whether Spack should consider versions marked as deprecated, and can be set to `true` or `false` (to allow or disallow deprecated versions, respectively).
The `unify:` field controls the Spack concretiser, and can be set to three values: `true`, `false` or `when_possible`. To install more than one version of the same package, or to concretise some more challenging combinations of packages, you might have to relax the concretiser to `when_possible` or `false`.
For example, an environment can provide `hdf5` both with and without MPI support, as sketched below.
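A sketch, assuming a hypothetical environment name `hdf5-env`; the concretiser is relaxed so that the two `hdf5` specs can coexist:

```yaml
hdf5-env:
  unify: when_possible
  specs:
  - hdf5 ~mpi
  - hdf5 +mpi
  # ...
```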
Note
Use `unify:true` when possible, then `unify:when_possible`, and finally `unify:false`.
Warning
Don't provide a spec for MPI or compilers, which are configured in the `mpi:` and `compiler:` fields respectively.
Warning
Stackinator does not support "spec matrices", and likely won't, because they use multiple compiler toolchains in a manner that is contrary to the Stackinator "keep it simple" principle.
Packages¶
To specify external packages that should be used instead of building them, use the `packages` field.
For example, if the `perl`, `python@3` and `git` packages are build dependencies of an environment and the versions available in the base CrayOS installation are sufficient, a `packages` list along the lines of the sketch below would be specified.
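A minimal sketch of the corresponding `packages` entry inside an environment (only this field is shown; the surrounding environment entry is omitted):

```yaml
packages:
- perl
- python@3
- git
```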
Note
If a package is not found, it will be built by Spack.
Note
External packages specified in this manner will only be used when concretising this environment, and will not affect downstream users.
Expand if you are curious how Stackinator configures Spack for packages

The following Spack call is used to generate `packages.yaml` in the Spack environment that Stackinator generates in the build path to concretise and build the packages in the example above.
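A sketch of that call; the exact invocation used by Stackinator is an assumption here, based on Spack's `spack external find` command and its `--not-buildable` flag:

```bash
# register the system-provided perl, python and git as external, non-buildable packages
spack external find --not-buildable perl python git
```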
Variants¶
To specify variants that should be applied to all package specs in the environment by default (unless overridden explicitly in a package spec), use the `variants` field. For example, to concretise all specs that support MPI or CUDA in an environment targeting A100 GPUs, the following `variants` could be set:
```yaml
cuda-env:
  variants:
  - +mpi
  - +cuda
  - cuda_arch=80
```
Expand if you are curious how Stackinator configures Spack for variants

The above will add the variants to the generated `spack.yaml` file used internally by Spack, as sketched below.
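A sketch of the resulting fragment, assuming the variants are applied through Spack's `packages:all:variants` setting:

```yaml
spack:
  packages:
    all:
      variants:
      - +mpi
      - +cuda
      - cuda_arch=80
```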
Views¶
File system views are an optional way to provide the software from an environment in a directory structure similar to `/usr/local`, based on Spack's filesystem views. Each environment can provide more than one view, and the structure of the YAML is the same as used by the version of Spack used to build the Spack stack.
For example, consider the `views` description sketched below.
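A sketch of such a description; the `exclude` list is an assumption based on Spack's filesystem view configuration:

```yaml
views:
  default:
  no-python:
    exclude:
    - python
```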
This will configure two views:

`default`
: a view of all the software in the environment using the default settings of Spack.

`no-python`
: everything in the default view, except any versions of `python`.
See the interfaces documentation for more information about how the environment views are provided to users of a stack.
Modules¶
Modules are generated for the installed compilers and packages by Spack. The default module generation rules set by the version of Spack specified in `config.yaml` will be used if no `modules.yaml` file is provided.

To set rules for module generation, provide a `modules.yaml` file as per the Spack documentation.

To disable module generation, set the field `config:modules:False` in `config.yaml`, as sketched below.
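A minimal sketch of the corresponding line in `config.yaml`:

```yaml
# config.yaml
modules: false
```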
Custom Spack Packages¶
An optional package repository can be added to a recipe to provide new or customized Spack packages in addition to Spack's `builtin` package repository, if a `repo` path is provided in the recipe.

For example, a `repo` path like the one sketched below will add custom package definitions for the `hdf5` and `nvhpc` packages.
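An illustrative layout; the structure mirrors a standard Spack package repository, and only the two package directories are shown:

```
repo
└── packages
    ├── hdf5
    │   └── package.py
    └── nvhpc
        └── package.py
```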
Additional custom packages can be provided as part of the cluster configuration, as well as additional site packages. These packages are all optional, and will be installed together in a single Spack package repository that is made available to downstream users of the generated uenv stack. See the documentation for cluster configuration for more detail.
Alps
All packages are installed under a single Spack package repository called `alps`.

The CSCS configurations in github.com/eth-cscs/alps-cluster-config provide a site configuration that defines cray-mpich, its dependencies, and the most up-to-date versions of cuda, nvhpc, etc. to all clusters on Alps.
Warning
Unlike Spack package repositories, any `repos.yaml` file in the `repo` path will be ignored. This is because the provided packages are added to the `alps` namespace.
Post install configuration¶
If a script `post-install` is provided in the recipe, it will be run during the build process: after the stack has been built, and just before the final squashfs image is generated.
Post install scripts can be used to modify or extend an environment with operations that can't be performed in Spack, for example:
- configure a license file;
- install additional software outside of Spack;
- generate activation scripts.
The following steps are effectively run, where we assume that the recipe is in `$recipe` and the mount point is the default `/user-environment`:
```bash
# copy the post-install script to the mount point
cp "$recipe"/post-install /user-environment
chmod +x /user-environment/post-install

# apply Jinja templates
jinja -d env.json /user-environment/post-install > /user-environment/post-install

# execute the script from the mount point
cd /user-environment
/user-environment/post-install
```
The post-install script is templated using Jinja, with the following variables available for use in a script:
| Variable     | Description                                                                                  |
|--------------|----------------------------------------------------------------------------------------------|
| `env.mount`  | The mount point of the image (default `/user-environment`)                                    |
| `env.config` | The installation tree of the Spack installation that was built in previous steps              |
| `env.build`  | The build path                                                                                 |
| `env.spack`  | The location of Spack used to build the software stack (only available during installation)   |
The use of Jinja templates is demonstrated in the following example of a bash script that generates an activation script that adds the installation path of GROMACS to the system PATH:
```bash
#!/bin/bash
gmx_path=$(spack -C {{ env.config }} location -i gromacs)/bin
echo "export PATH=$gmx_path:$PATH" >> {{ env.mount }}/activate.sh
```
Note
The copy of Spack used to build the stack is available in the environment in which `post-install` runs, and can be called directly.
Note
The script does not have to be bash - it can be in any scripting language, such as Python or Perl, that is available on the target system.
Pre install configuration¶
Similarly to the post-install hook, if a `pre-install` script is provided in the recipe, it will be run during the build process:
- directly after the initial test that Spack has been installed correctly;
- directly before the build cache is configured, and/or the first compiler environment is concretised.
The pre-install script is copied, templated and executed similarly to the post-install hook (see above).
Meta-Data¶
Stackinator generates meta-data about the stack in the `extra` path of the installation path.

A recipe can install arbitrary meta-data by providing an `extra` path, the contents of which will be copied to the `meta/extra` path in the installation path, as sketched below.
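For example, assuming the default mount point and a purely hypothetical file name `ci.yaml`:

```
$recipe/extra/ci.yaml  ->  /user-environment/meta/extra/ci.yaml
```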
Alps
This is used to provide additional information required by ReFrame as part of the CI/CD pipeline for software stacks on Alps, defined in the GitHub eth-cscs/alps-spack-stacks repository.