Software

The following software is available on the HPC systems. This initial list only shows software installed on Juno. Future updates will include software installed on Ganymede and Ganymede2.

NoSW Suite NameSW NameDescriptionSW URLKeywordsSystemModule
1EasyBuildEasyBuildEasyBuild is a software build and installation framework that allows you to manage (scientific) software on High Performance Computing (HPC) systems in an efficient way.https://easybuild.iohttps://easybuild.iobuild tools, software installation, python, HPC, automationJunoEasyBuild/4.9.4
2RRFree software environment for statistical computing and graphics with bug fixes and improved complex number handlinghttps://www.r-project.orgstatistics, data analysis, programming, scientific computing, graphics, mathematicsJunoR/4.4.1
3ADIOS2ADIOS2Adaptable I/O system featuring unified Python API, enhanced campaign management tools, and improved performance with GPU-aware staging engineshttps://github.com/ornladios/ADIOS2scientific computing, data management, GPU, MPI, Python,data analysisJunoadios2/2.10.1
4Amber24Amber24Suite of biomolecular simulation programs for molecular dynamics with enhanced GPU performance and updated force fieldshttps://ambermd.orgmolecular dynamics, biomolecular simulation, GPU computing, protein simulation, force fields, drug design, computational chemistry,Junoamber/24
5Amber24Amber24-betaSuite of biomolecular simulation programs for molecular dynamics with enhanced GPU performance and updated force fieldshttps://ambermd.orgmolecular dynamics, biomolecular simulation, GPU computing, protein simulation, force fields, drug design, computational chemistry,Junoamber/24-beta
6AOCCAOCCAMD’s optimizing C/C++/Fortran compiler based on LLVM 17.0.6 with Zen 5 support and enhanced vectorization optimizationhttps://www.amd.com/en/developer/aocc.htmlAMD, Zen, C++, Fortran, LLVM, optimization, vectorization, Linux, x86Junoaocc/5.0
7ApptainerApptainerContainer platform for HPC environments with improved overlay support, AppArmor fixes, and enhanced monitoring capabilitieshttps://apptainer.orgvirtualization, OCI, Docker, reproducibility, portability, containersJunoapptainer/1.3.4
8GNUGNU AutotoolsBuild automation toolset featuring configure script generation, cross-platform build support, and portable software installation capabilitieshttps://www.gnu.org/software/automakebuild system, configure, automake, autoconf, libtool, cross-platform, software installation, make, compilation, package managementJunoautotools/
9BoostBoostC++ libraries collection with major updates to URL parsing, Container Hash, Locale support and new unordered containershttps://www.boost.orgC++, libraries, templates, cross-platform, metaprogramming, algorithms, iterators, threading, serialization, networkingJunoboost/1.81.0
10CharliecloudCharliecloudLightweight container runtime for HPC with user namespaces support, shell/Python libraries, and unprivileged container executionhttps://hpc.github.io/charliecloudcontainers, virtualization, Docker, user namespaces, portability, reproducibilityJunocharliecloud/0.15
11CMakeCMakeCross-platform build system generator with enhanced header-only library support, CMake presets, and improved CUDA capabilitieshttps://cmake.org/build system, compilation, cross-platform, configuration, C++, software installation, make, ninja, Visual Studio, compilation toolsJunocmake/3.24.2
12CubeLibCubeLibGeneral purpose C++ library for reading/writing multi-dimensional performance data with enhanced scalability and data format supporthttps://www.scalasca.org/software/cube-4.xperformance analysis, profiling, Score-P, Scalasca, visualization, metrics, call path analysis, system monitoring, C++Junocubelib/4.8.2
13CubeWCubeWGeneral purpose C++ library for reading/writing multi-dimensional performance data with enhanced scalability and data format supporthttps://www.scalasca.org/software/cube-4.xperformance analysis, profiling, Score-P, Scalasca, visualization, metrics, call path analysis, system monitoring, C++Junocubew/4.8.2
14CUDA 12.4CUDA 12.4GPU-accelerated computing toolkit with compiler tools, and libraries for high-performance applicationshttps://developer.nvidia.com/cuda-toolkitGPU computing, parallel computing, NVIDIA,, deep learning, scientific computing, acceleration, machine learningJunocuda/12.4
15CUDA 12.6CUDA 12.6GPU-accelerated computing toolkit with compiler tools, and libraries for high-performance applicationshttps://developer.nvidia.com/cuda-toolkitGPU computing, parallel computing, NVIDIA,, deep learning, scientific computing, acceleration, machine learningJunocuda/12.6
16DimemasDimemasHigh-abstracted network simulator for message-passing programs with performance prediction and analysis capabilitieshttps://tools.bsc.es/dimemasperformance analysis, network simulation, message passing, parallel computing, trace analysis, workload prediction, architecture modelingJunodimemas/5.4.2
17ExtraeExtraeDynamic instrumentation and tracing package for performance analysis of parallel applications with MPI, OpenMP, CUDA supporthttps://tools.bsc.es/extraeperformance analysis, tracing, profiling, HPC, MPI, OpenMP, CUDA, instrumentation, sampling, debuggingJunoextrae/3.8.3
18FFTWFFTWHigh-performance C library for computing discrete Fourier transforms with adaptive algorithm selection and multi-threading supporthttps://www.fftw.orgFFT, numerical computing, signal processing, scientific computing, C, MPI, OpenMPJunofftw/3.3.10
19GNUGNU12GNU Compiler Collection with optimizing compilers for C, C++, and other languageshttps://gcc.gnu.orgcompiler, C, C++, FortranJunognu12/12.4.0
20GNUGNU13GNU Compiler Collection with optimizing compilers for C, C++, and other languageshttps://gcc.gnu.orgcompiler, C, C++, FortranJunognu13/13.2.0
21GNUGNU14GNU Compiler Collection with optimizing compilers for C, C++, and other languageshttps://gcc.gnu.orgcompiler, C, C++, FortranJunognu14/14.2.0
22GSLGSLGNU Scientific Library for numerical analysis with over 1000 mathematical routines including random number generators, special functions and FFThttps://www.gnu.org/software/gslnumerical computing, scientific computing, C, C++, mathematics, statistics, FFT, linear algebra, special functions, random numbers, optimization, BLASJunogsl/2.7.1
23HDF5HDF5High-performance data management suite with signed binary support, enhanced error stack management, and improved cloud optimizationhttps://www.hdfgroup.org/solutions/hdf5data management, parallel I/O, compression, cloud storage, big data, file format, C, FortranJunohdf5/1.14.5
24HWLOCHWLOCPortable hardware topology detection and management library with enhanced memory attributes, NUMA support, and device locality featureshttps://www.open-mpi.org/projects/hwlochardware topology, NUMA, hardware locality, system architecture, CPU binding, cache hierarchy, GPU locality, process binding, hardware detectionJunohwloc/2.11.1
25HYPREHYPREHigh-performance parallel library of preconditioners and solvers featuring multigrid methods for structured and unstructured grid problemshttps://github.com/hypre-space/hyprelinear solvers, parallel computing, preconditioners, numerical methodsJunohypre/2.18.1
26IMBIMBIntel MPI Benchmarks suite with enhanced GPU benchmarks, message pause control, and increased default window size for performance testinghttps://github.com/intel/mpi-benchmarksMPI, benchmarking, performance testing, GPU computing, message passing, communication testingJunoimb/2021.3
27Intel OneAPIIntel OneAPIComprehensive HPC toolkit featuring new F2023 support, MPI 4.0 implementation, enhanced GPU optimizations, and advanced AI development capabilitieshttps://www.intel.com/oneapiHPC, compiler, MPI, OpenMP, GPU computing, Fortran, C++, SYCL, AI acceleration, parallel computing, optimization, DPC++, oneAPIJunointel/2025.0
28LibfabricLibfabricCore OFI component library providing user-space API for high-performance networking with TCP support, provider libraries integration, and enhanced NUMA capabilitieshttps://ofiwg.github.io/libfabricnetworking, OFI, TCP, NUMA, provider libraries, MPI, OpenMPIJunolibfabric/1.18.0
29LIKWIDLIKWIDPerformance monitoring and benchmarking suite featuring support for Intel SapphireRapids, AMD Zen4, Apple M1, and AMD GPUs with enhanced ROCm backendhttps://github.com/rrze-hpc/likwidperformance monitoring, benchmarking, hardware counters, profiling, CPU topology, power monitoring, thread pinning, NUMA, GPU monitoringJunolikwid/5.3.0
30METISMETISSerial library for graph partitioning and sparse matrix ordering featuring multilevel k-way partitioning and multi-constraint schemeshttps://dssl.cs.vt.edu/metis-head.htmlgraph partitioning, sparse matrices, mesh partitioning, matrix ordering, finite elements, numerical computing, optimizationJunometis/5.1.0
31MFEMMFEMHigh-performance finite element library featuring AMG solvers for AMD GPUs, high-order mesh adaptation, and enhanced GPU-enabled assembly capabilitieshttps://mfem.orgfinite elements,, numerical computing, PDE solver, GPU computing, mesh optimization, AMD, C++, AMGJunomfem/4.4
32MinicondaMinicondaMinimal Anaconda distribution with conda package manager, Python, and essential dependencies featuring enhanced uninstallation options and telemetryhttps://docs.anaconda.com/miniconda/Python, package management, conda, environment management, data science, machine learning, AI, cross-platform, scientific computingJunominiconda/24.11.1
33MPICH OFIMPICH OFIHigh-performance MPI implementation with OFI network module support for optimized fabric interface communications and enhanced portabilityhttps://www.mpich.org/MPI, message passing, OFI, libfabric, network communicationJunompich/3.4.3-ofi
34MPICH UCXMPICH UCXHigh-performance MPI implementation with UCX network module support featuring enhanced GPU awareness and hardware-optimized communicationhttps://www.mpich.orgMPI, UCX, GPU computing, message passing, CUDA, Junompich/3.4.3-ucx
35MUMPSMUMPSParallel sparse direct solver for large linear systems with multifrontal method, symmetric matrix support, and distributed memory capabilitieshttps://mumps-solver.orgsparse solver, linear algebra, multifrontal method, MPI, ScaLAPACK, BLAS, matrix factorizationJunomumps/5.2.1
36NAMDNAMDParallel molecular dynamics code for high-performance biomolecular simulation with enhanced GPU-resident mode and improved pressure tensor calculationshttps://www.ks.uiuc.edu/Research/namdmolecular dynamics, biomolecular simulation, VMD, AMBER, CHARMM, Charm++Junonamd/3.0.1
37NetCDF C++NetCDF C++C++ interface library for scientific data access to NetCDF with enhanced data model support and improved binary data handling capabilitieshttps://www.unidata.ucar.edu/software/netcdfC++, parallel I/O, array-oriented data, data format, data access, machine-independent, self-describing formatJunonetcdf-cxx/4.3.1
38NetCDF FortranNetCDF FortranFortran interface for NetCDF library with enhanced quantize functionality, zstandard compression support, and parallel I/O capabilitieshttps://www.unidata.ucar.edu/software/netcdf Fortran, parallel I/O, array-oriented data, compression, quantization, zstandard, data format, MPIJunonetcdf-fortran/4.6.1
39NetCDFNetCDFScientific data format library featuring HDF5 1.14.0 support, enhanced error stack management, and improved parallel I/O with compressionhttps://www.unidata.ucar.edu/software/netcdfparallel I/O, HDF5, data compression, array-oriented data, self-describing format, high performance computing, C, Fortran, data managementJunonetcdf/4.9.2
40NVIDIA HPC SDKNVIDIA HPC SDKComprehensive HPC compiler suite featuring enhanced C++23 support, improved GPU optimizations, and CUDA 12.6 update 2 with advanced cuBLAS performancehttps://developer.nvidia.com/hpc-sdkCUDA, GPU computing, OpenACC, C++, Fortran, , scientific computing, cuBLAS, optimization, machine learningJunonvhpc/24.11
41OpenHPCOpenHPCCommunity-driven HPC software stack providing pre-built components for cluster deployment including provisioning tools, resource managers, and scientific librarieshttps://openhpc.communitycluster management, provisioning, resource management, compilers, MPI, libraries, parallel computing, software stackJunoohpc/
42OSU Micro-BenchmarksOSU Micro-BenchmarksPerformance testing suite for MPI operations featuring point-to-point benchmarks, collective operations, and GPU-enabled tests with CUDA/ROCm supporthttps://sarus.readthedocs.io/en/stable/cookbook/osu_mb/osu_mb.htmlMPI, benchmarking, performance testing, collective operations, ROCm, latency testing, bandwidth testingJunoomb/7.3
43OPARI2OPARI2Source-to-source instrumentation tool for OpenMP and hybrid codes with enhanced multi-directory build support and OpenMP task trackinghttps://www.vi-hps.org/tools/opari2.htmlinstrumentation, performance analysis, hybrid codes, source-to-source, POMP2, profilingJunoopari2/2.0.8
44OpenBLASOpenBLASOpenBLAS implements low-level routines for performing linear algebra operations such as vector addition, scalar multiplication, dot products, linear combinations, and matrix multiplication.http://www.openmathlib.org/OpenBLAS/BLAS, linear algebra, matrix operations, Fortran, C, optimizationJunoopenblas/0.3.28
45OpenCoarraysOpenCoarraysOpen-source software project that produces an application binary interface (ABI) used by the GNU Compiler Collection (GCC) Fortran front-end to build executable programs that leverage the parallel programming features of Fortran 2018http://www.opencoarrays.orgFortran,coarray, GCC, gfortran, MPIJunoopencoarrays/2.10.2
46OpenMPI 4OpenMPI 4High-performance Message Passing Interface implementation featuring fault tolerance, GPU acceleration, and enhanced collective operationshttps://www.open-mpi.orgMPI, HPC, message passing, distributed computing, GPU computing, scientific computing, UCX, PMIx, fault toleranceJunoopenmpi4/4.1.6
47OpenMPI 5OpenMPI 5High-performance Message Passing Interface implementation featuring fault tolerance, GPU acceleration, and enhanced collective operationshttps://www.open-mpi.orgMPI, HPC, message passing, distributed computing, GPU computing, scientific computing, UCX, PMIx, fault toleranceJunoopenmpi5/5.0.5
48ORCAORCAQuantum chemistry package featuring enhanced spectroscopic calculations and integrated XTB 6.7.1 support for semiempirical methodshttps://orcaforum.kofo.mpg.de/app.php/portalquantum, chemistry, DFT, ab initio, spectroscopy, molecular modeling, computational chemistry, semiempirical methods, XTBJunoorca/6.0.1
50OFT2OFT2Highly scalable event trace data format library with Python bindings support, multi-substrate read/write capabilities, and in-place timestamp manipulationhttps://www.vi-hps.org/projects/score-p/index.htmlperformance analysis, tracing, Scalasca, Vampir, Tau, profiling, event trace, data formatJunootf2/3.0.3
51PAPIPAPIPerformance monitoring interface featuring Software Defined Events API, high-level API revision, and enhanced GPU support for AMD ROCm and NVIDIA componentshttps://icl.utk.edu/papi/performance monitoring, hardware counters, profiling, ROCm, CUDA, power monitoring, instrumentation, software eventsJunopapi/6.0.0
52Program Database ToolkitProgram Database ToolkitFramework for analyzing source code written in several programming languages featuring EDG C/C++ parsers, ROSE parser integration, and program database generation capabilitieshttps://www.cs.uoregon.edu/research/pdtsource code analysis, program analysis, static analysis, dynamic analysis, C++, Fortran, instrumentation, compilersJunopdtoolkit/3.25.1
53PETScPETScScientific computing toolkit featuring parallel linear/nonlinear solvers, ODE integrators, and optimization algorithms with enhanced error handlinghttps://petsc.org/release/ linear solvers, nonlinear solvers, optimization, C, C++, Fortran, Python, PDE solvers, matrices, vectorsJunopetsc/3.18.1
54PHDF5PHDF5Parallel version of HDF5 data management suite featuring enhanced MPI-IO support, improved parallel I/O performance, and virtual object layer optimizationshttps://www.hdfgroup.orgparallel I/O, data management, MPI-IO, big data, file format, data compressionJunophdf5/1.14.0
55PLASMAPLASMAParallel Linear Algebra Software for Multicore Architectures featuring tile algorithms, dynamic scheduling, and enhanced GPU accelerationhttps://icl.utk.edu/plasma/linear algebra, LAPACK, BLAS, matrices, numerical computingJunoplasma/21.8.29
56PMIxPMIxProcess Management Interface library for Exascale computing featuring enhanced dstore component support, improved Python regex for documentation, and fixed test buildshttps://openpmix.github.ioprocess management, MPI, exascale computing, tool support, runtime systems, debuggingJunopmix/4.2.9
57PnetCDFPnetCDFHigh-performance parallel I/O library for NetCDF file access featuring intra-node aggregation and improved nonblocking performancehttps://parallel-netcdf.github.ioparallel I/O, NetCDF,MPI-IO, file format, collective I/O, data management,CDF formatJunopnetcdf/1.12.3
58PRUNPRUNMPI wrapper utility that automatically handles MPI implementation selection and Slurm configurations for simplified parallel job execution in HPC environmentshttps://www.open-mpi.org/doc/v3.1/man1/prun.1.phpMPI, Slurm, process management, job launcher, OpenMPI, resource management, batch systemJunoprun/2.2
59PT-ScotchPT-ScotchMPI-parallel version of SCOTCH featuring dynamic thread management, hybrid MPI+threads parallelism, and improved computational reproducibility for large-scale graph/mesh partitioninghttps://www.labri.fr/perso/pelegrin/scotch/graph partitioning, mesh partitioning, MPI, sparse matrices, matrix ordering, static mappingJunoptscotch/7.0.1
60MPI4PyMPI4PyPython bindings for MPI featuring pickle-based communication, NumPy array support, and enhanced error handling with ERRORS_RETURNhttps://pypi.org/project/mpi4py/MPI, Python, parallel computing, NumPy, pickling, message passing, HPC, collective operations, point-to-point communication, scientific computingJunopy3-mpi4py/3.1.5
61NumPyNumPyFundamental scientific computing package for Python featurin highly optimized array API functionalityhttps://numpy.orgnumerical analysis, array operations, Python, linear algebra, matrices, statistics, mathematics, data scienceJunopy3-numpy/1.26.4
62Q-ChemQ-ChemComprehensive quantum chemistry software featuring enhanced DFT capabilities, improved TDDFT gradients, and machine learning force field integrationhttps://www.q-chem.comquantum chemistry, DFT, ab initio, molecular modeling, electronic structure, spectroscopy, excited states, machine learningJunoqchem/6.2.2
63ScaLAPACK ScaLAPACK High-performance parallel linear algebra library featuring ILP64 mode support, pointer macro improvements, and enhanced shared library capabilitieshttps://www.netlib.org/scalapack/inear algebra,matrices, eigenvalues, linear systems, LAPACKJunoscalapack/2.2.0
64ScalascaScalascaTrace-based performance analysis toolset for parallel applications featuring automatic wait-state detection, enhanced MPI/OpenMP analysis, and Score-P integratiohttps://www.scalasca.orgperformance analysis, trace analysis, MPI, OpenMP, profiling, Score-P, wait-state detectionJunoscalasca/2.6.1
65Score-PScore-PHighly scalable profiling and event tracing infrastructure featuring enhanced memory management, Python bindings, and multi-substrate read/write capabilitieshttps://www.vi-hps.org/projects/score-pperformance analysis, profiling, tracing,, MPI, OpenMP, CUDA, instrumentation, parallel computing, Vampir, Scalasca, TAUJunoscorep/8.4
66ScotchScotchGraph and mesh partitioning package featuring parallel matrix ordering, dynamic thread management, and efficient memory usage for large-scale computationshttps://www.labri.fr/perso/pelegrin/scotch/graph partitioning, mesh partitioning, sparse matrices, matrix ordering, MPI, POSIX threads, static mappingJunoscotch/6.0.6
67SIONlibSIONlibScalable I/O library for parallel access to task-local files featuring collective I/O operations, CUDA-aware interfaces, and MSA-aware algorithmshttps://apps.fz-juelich.de/jsc/sionlib/docu/index.htmlparallel, I/O, binary data, task local files, MPI, OpenMP, CUDA, collective I/O, file handlingJunosionlib/1.7.7
68SLEPcSLEPcScalable library for large sparse eigenvalue computations featuring spectrum slicing, matrix function evaluation, and enhanced GPU supporthttps://slepc.upv.eseigenvalue problems, sparse matrices, parallel computing, PETSc, SVD, linear algebra , matrix functionsJunoslepc/3.18.0
69SpackSpackPackage manager for HPC featuring forward compatibility with v0.23, enhanced urllib support for Python 3.12.6+, and improved aarch64 architecture supporthttps://spack.iopackage management, software installation, compilers, build system, dependencies, environments, modules, configuration management, reproducibilityJunospack/0.22.2
70SuperLUSuperLUGeneral purpose library for direct solution of large sparse linear systems featuring enhanced GPU support, 64-bit indexing, and improved threshold-based ILU factorizationhttps://github.com/xiaoyeli/superlusparse solver, LU decomposition, Gaussian elimination, matrix factorizationJunosuperlu/7.0.0
71SuperLU_DISTSuperLU_DISTParallel sparse linear system solver featuring static pivoting, OpenMP/MPI hybrid parallelism, and distributed memory capabilities for large-scale computationshttps://github.com/xiaoyeli/superlu_distsparse solver, parallel computing, linear algebra, matrix factorizationJunosuperlu_dist/6.4.0
72TAU
(Tuning and Analysis Utilities)
TAU
(Tuning and Analysis Utilities)
Performance analysis toolkit featuring profiling, tracing, sampling capabilities with hardware counter support and multi-language instrumentation for parallel programshttps://www.cs.uoregon.edu/research/tau/home.phpperformance analysis, profiling, tracing, sampling, MPI, OpenMP, CUDA, instrumentation, PAPI, callpath profiling, memory analysisJunotau/2.31.1
73TrilinosTrilinosCollection of scientific libraries for large-scale multiphysics simulations featuring enhanced Kokkos ecosystem, Tpetra improvements, and GPU accelerationhttps://trilinos.github.io linear solvers, optimization, preconditioners, finite elements, multiphysics, KokkosJunotrilinos/13.4.0
74UCXUCXHigh-performance communication framework featuring improved rendezvous protocols, enhanced GPU support, and optimized RDMA capabilities with Intel Level-Zero integratiohttps://openucx.orgRDMA, communication, InfiniBand, RoCE, shared memory, network protocolsJunoucx/1.17.0
75ValgrindValgrindDynamic binary instrumentation framework featuring memory debugging, cache profiling, and thread error detection with enhanced POWER10 supporthttps://valgrind.orgdebugging, memory analysis, profiling, leak detection, cache analysis, thread checking, binary instrumentation, performance analysis, development toolsJunovalgrind/3.23.0
76VASPVASPAb initio quantum mechanical molecular dynamics package featuring enhanced GPU acceleration, improved VTST tools, and optimized hybrid functional calculationshttps://www.vasp.atquantum mechanics, molecular dynamics, DFT, electronic structure, materials science,ab initio, solid state physicsJunovasp/6.4.2

This table was updated on February 7, 2025.