Updated February 2025

Blueprints for serious HPC programs

hpctutorials tracks how top labs—from ORNL to Argonne—run clusters with thousands of nodes, GPUs, and impatient scientists. We distill their patterns into actionable runbooks so your team can stand up reliable, policy-compliant compute without the guesswork.

Every guide mirrors the thought-leader design system behind pranavkulkarni.org: single-column focus, ruthless clarity, and data pulled from the latest TOP500, MLPerf, and Slurm releases.

Maintained by Mandar Gurav & Pranav Kulkarni — operators who live inside Slurm, Flux, and exascale programs daily.

Where we go deep

Latest briefings

Cluster playbooks ready today

Access & environment hygiene

From bastion policies to module stacks so new researchers ship jobs in under 30 minutes.

Scheduler fluency

Modern Slurm patterns, job arrays, heterogeneous allocations, and QoS dashboards.

Accelerated science + AI

How labs fuse MPI, CUDA, and inference with profiling and governance guardrails.

Benchmark pulse · reality, not hype

Frontier · ORNL

Aurora · Argonne

El Capitan · LLNL

Operational KPIs