Slurm
Slurm Workload Manager, formerly known as Simple Linux Utility for Resource Management (SLURM), is a free, open-source job scheduler and cluster management system for Linux and Unix-like kernels. It is designed to manage compute jobs on high-performance computing (HPC) clusters and in high-throughput computing (HTC) environments, and is used by many of the world's supercomputers and computer clusters.
AWS ParallelCluster
AWS ParallelCluster is an open-source cluster management tool that simplifies the deployment and management of high-performance computing (HPC) clusters on AWS. It automates the setup of required resources, including compute nodes, a shared filesystem, and a job scheduler, and supports multiple instance types and job submission queues. Users can interact with ParallelCluster through a graphical user interface, command-line interface, or API, and can use a simple text file to model, provision, and dynamically scale the resources their applications need in an automated and secure manner. The tool integrates with job schedulers like AWS Batch and Slurm, so existing HPC workloads can be migrated to the cloud with minimal modifications. AWS ParallelCluster is available at no additional charge; users pay only for the AWS resources consumed by their applications.
HPE Performance Cluster Manager
HPE Performance Cluster Manager (HPCM) delivers an integrated system management solution for Linux®-based high-performance computing (HPC) clusters. It provides complete provisioning, management, and monitoring for clusters scaling up to exascale-sized supercomputers. The software enables fast system setup from bare metal, comprehensive hardware monitoring and management, image management, software updates, power management, and cluster health management. It also makes scaling HPC clusters easier and more efficient, and integrates with many third-party tools for running and managing workloads. HPE Performance Cluster Manager reduces the time and resources spent administering HPC systems, lowering total cost of ownership, increasing productivity, and providing a better return on hardware investments.
Rocky Linux
CIQ empowers people to do amazing things by providing innovative and stable software infrastructure solutions for all computing needs. From the base operating system through containers, orchestration, provisioning, computing, and cloud applications, CIQ works with every part of the technology stack to deliver stable, scalable, and secure production environments for customers and communities. CIQ is the founding support and services partner of Rocky Linux, and the creator of the next generation federated computing stack.
- Rocky Linux: open, secure enterprise Linux
- Apptainer: application containers for high-performance computing
- Warewulf: cluster management and operating system provisioning
- HPC2.0: the next generation of high-performance computing, a cloud-native federated computing platform
- Traditional HPC: turnkey computing stack for traditional HPC