Cluster Schema

ClusterCockpit Cluster Schema Reference

The following schema in its raw form can be found in the ClusterCockpit GitHub repository.

HPC cluster description

Title: HPC cluster description

Type: object
Required: No
Additional properties: Any type allowed

Description: Metadata describing an HPC cluster

1. [Required] Property HPC cluster description > name
Type: string
Required: Yes

Description: The unique identifier of a cluster

2. [Required] Property HPC cluster description > metricConfig
Type: array of object
Required: Yes

Description: Metric specifications

Array restrictions
Min items: 1
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: metricConfig items

2.1. HPC cluster description > metricConfig > metricConfig items

Type: object
Required: No
Additional properties: Any type allowed

2.1.1. [Required] Property HPC cluster description > metricConfig > metricConfig items > name
Type: string
Required: Yes

Description: Metric name

2.1.2. [Required] Property HPC cluster description > metricConfig > metricConfig items > unit
Type: object
Required: Yes
Additional properties: Any type allowed
Defined in: unit.schema.json

Description: Metric unit

2.1.2.1. [Required] Property HPC cluster description > metricConfig > metricConfig items > unit > base
Type: enum (of string)
Required: Yes

Description: Metric base unit

Must be one of:

  • “B”
  • “F”
  • “B/s”
  • “F/s”
  • “CPI”
  • “IPC”
  • “Hz”
  • “W”
  • “°C”
  • "" (empty string)

2.1.2.2. [Optional] Property HPC cluster description > metricConfig > metricConfig items > unit > prefix
Type: enum (of string)
Required: No

Description: Unit prefix

Must be one of:

  • “K”
  • “M”
  • “G”
  • “T”
  • “P”
  • “E”
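
As an illustration of the two enums above, the following minimal Python sketch checks a unit object against the listed base and prefix values. The helper names `check_unit` and `format_unit` are hypothetical and not part of ClusterCockpit; the allowed values are copied from the lists above.

```python
# Allowed values copied from the "base" and "prefix" enums above.
BASE_UNITS = ("B", "F", "B/s", "F/s", "CPI", "IPC", "Hz", "W", "°C", "")
PREFIXES = ("K", "M", "G", "T", "P", "E")


def check_unit(unit):
    """Return a list of problems found in a unit object (empty if valid)."""
    problems = []
    if "base" not in unit:
        problems.append("missing required property 'base'")
    elif unit["base"] not in BASE_UNITS:
        problems.append(f"invalid base: {unit['base']!r}")
    # "prefix" is optional, so it is only checked when present.
    if "prefix" in unit and unit["prefix"] not in PREFIXES:
        problems.append(f"invalid prefix: {unit['prefix']!r}")
    return problems


def format_unit(unit):
    """Join the optional prefix and the base, e.g. 'G' + 'F/s'."""
    return unit.get("prefix", "") + unit["base"]
```

For example, `{"base": "F/s", "prefix": "G"}` passes the check, while a lowercase prefix such as "k" is reported because only the uppercase letters listed above are allowed.
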
2.1.3. [Required] Property HPC cluster description > metricConfig > metricConfig items > scope
Type: string
Required: Yes

Description: Native measurement resolution

2.1.4. [Required] Property HPC cluster description > metricConfig > metricConfig items > timestep
Type: integer
Required: Yes

Description: Frequency of timeseries points, given as the interval between consecutive points

2.1.5. [Required] Property HPC cluster description > metricConfig > metricConfig items > aggregation
Type: enum (of string)
Required: Yes

Description: How the metric is aggregated

Must be one of:

  • “sum”
  • “avg”
2.1.6. [Required] Property HPC cluster description > metricConfig > metricConfig items > peak
Type: number
Required: Yes

Description: Metric peak threshold (Upper metric limit)

2.1.7. [Required] Property HPC cluster description > metricConfig > metricConfig items > normal
Type: number
Required: Yes

Description: Metric normal threshold

2.1.8. [Required] Property HPC cluster description > metricConfig > metricConfig items > caution
Type: number
Required: Yes

Description: Metric caution threshold (Suspicious but does not require immediate action)

2.1.9. [Required] Property HPC cluster description > metricConfig > metricConfig items > alert
Type: number
Required: Yes

Description: Metric alert threshold (Requires immediate action)
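
To make the required properties 2.1.1 to 2.1.9 concrete, here is a minimal Python sketch of one metricConfig item together with a simple check. All values, including the metric name "mem_bw" and the scope "socket", are illustrative assumptions and not taken from a real cluster; `check_metric` is a hypothetical helper.

```python
REQUIRED_KEYS = ("name", "unit", "scope", "timestep", "aggregation",
                 "peak", "normal", "caution", "alert")

# Illustrative values only; they are not taken from a real cluster.
example_metric = {
    "name": "mem_bw",
    "unit": {"base": "B/s", "prefix": "G"},
    "scope": "socket",
    "timestep": 60,
    "aggregation": "sum",
    "peak": 350,
    "normal": 100,
    "caution": 50,
    "alert": 10,
}


def check_metric(item):
    """Return a list of problems for one metricConfig item."""
    problems = [f"missing required property '{k}'"
                for k in REQUIRED_KEYS if k not in item]
    if "aggregation" in item and item["aggregation"] not in ("sum", "avg"):
        problems.append("aggregation must be 'sum' or 'avg'")
    for key in ("peak", "normal", "caution", "alert"):
        value = item.get(key)
        # JSON numbers map to int or float; bool is excluded explicitly.
        if key in item and (isinstance(value, bool)
                            or not isinstance(value, (int, float))):
            problems.append(f"'{key}' must be a number")
    step = item.get("timestep")
    if "timestep" in item and (isinstance(step, bool)
                               or not isinstance(step, int)):
        problems.append("'timestep' must be an integer")
    return problems
```

Calling `check_metric(example_metric)` returns an empty list. The check covers only the constraints listed in this reference and does not replace validation against the raw schema.
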

2.1.10. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters
Type: array of object
Required: No

Description: Array of cluster hardware partition metric thresholds

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: subClusters items

2.1.10.1. HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items
Type: object
Required: No
Additional properties: Any type allowed

2.1.10.1.1. [Required] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > name
Type: string
Required: Yes

Description: Hardware partition name

2.1.10.1.2. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > peak
Type: number
Required: No

2.1.10.1.3. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > normal
Type: number
Required: No

2.1.10.1.4. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > caution
Type: number
Required: No

2.1.10.1.5. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > alert
Type: number
Required: No

2.1.10.1.6. [Optional] Property HPC cluster description > metricConfig > metricConfig items > subClusters > subClusters items > remove
Type: boolean
Required: No
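
The schema does not spell out how these per-partition entries are applied. The Python sketch below assumes that an entry replaces the metric-wide thresholds for the named hardware partition and that `remove: true` marks the metric as unavailable there. Both the interpretation and the helper name `thresholds_for` are assumptions made for illustration.

```python
THRESHOLDS = ("peak", "normal", "caution", "alert")


def thresholds_for(metric, subcluster):
    """Return the effective thresholds of `metric` on `subcluster`.

    Returns None if a matching entry sets "remove" to true
    (assumed meaning: the metric is not available on that partition).
    """
    # Start from the metric-wide values, which are all required.
    result = {key: metric[key] for key in THRESHOLDS}
    for override in metric.get("subClusters", []):
        if override["name"] != subcluster:
            continue
        if override.get("remove", False):
            return None
        # Only thresholds present in the override replace the defaults.
        for key in THRESHOLDS:
            if key in override:
                result[key] = override[key]
    return result
```

With metric-wide thresholds of 100/50/20/5 and an override `{"name": "gpu", "peak": 200}`, the sketch yields a peak of 200 for "gpu" and leaves the other three values unchanged.
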
3. [Required] Property HPC cluster description > subClusters
Type: array of object
Required: Yes

Description: Array of cluster hardware partitions

Array restrictions
Min items: 1
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: subClusters items

3.1. HPC cluster description > subClusters > subClusters items

Type: object
Required: No
Additional properties: Any type allowed

3.1.1. [Required] Property HPC cluster description > subClusters > subClusters items > name
Type: string
Required: Yes

Description: Hardware partition name

3.1.2. [Required] Property HPC cluster description > subClusters > subClusters items > processorType
Type: string
Required: Yes

Description: Processor type

3.1.3. [Required] Property HPC cluster description > subClusters > subClusters items > socketsPerNode
Type: integer
Required: Yes

Description: Number of sockets per node

3.1.4. [Required] Property HPC cluster description > subClusters > subClusters items > coresPerSocket
Type: integer
Required: Yes

Description: Number of cores per socket

3.1.5. [Required] Property HPC cluster description > subClusters > subClusters items > threadsPerCore
Type: integer
Required: Yes

Description: Number of SMT threads per core

3.1.6. [Required] Property HPC cluster description > subClusters > subClusters items > flopRateScalar
Type: object
Required: Yes
Additional properties: Any type allowed

Description: Theoretical node peak flop rate for scalar code in GFlop/s

3.1.6.1. [Optional] Property HPC cluster description > subClusters > subClusters items > flopRateScalar > unit
Type: object
Required: No
Additional properties: Any type allowed
Same definition as: unit

Description: Metric unit

3.1.6.2. [Optional] Property HPC cluster description > subClusters > subClusters items > flopRateScalar > value
Type: number
Required: No

3.1.7. [Required] Property HPC cluster description > subClusters > subClusters items > flopRateSimd
Type: object
Required: Yes
Additional properties: Any type allowed

Description: Theoretical node peak flop rate for SIMD code in GFlop/s

3.1.7.1. [Optional] Property HPC cluster description > subClusters > subClusters items > flopRateSimd > unit
Type: object
Required: No
Additional properties: Any type allowed
Same definition as: unit

Description: Metric unit

3.1.7.2. [Optional] Property HPC cluster description > subClusters > subClusters items > flopRateSimd > value
Type: number
Required: No

3.1.8. [Required] Property HPC cluster description > subClusters > subClusters items > memoryBandwidth
Type: object
Required: Yes
Additional properties: Any type allowed

Description: Theoretical node peak memory bandwidth in GB/s

3.1.8.1. [Optional] Property HPC cluster description > subClusters > subClusters items > memoryBandwidth > unit
Type: object
Required: No
Additional properties: Any type allowed
Same definition as: unit

Description: Metric unit

3.1.8.2. [Optional] Property HPC cluster description > subClusters > subClusters items > memoryBandwidth > value
Type: number
Required: No
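
flopRateScalar, flopRateSimd and memoryBandwidth share the same shape: an optional unit object and an optional numeric value. The Python sketch below scales such a value to base units. It assumes decimal (SI) multipliers for the prefix letters, which the schema itself does not state; `to_base_units` is a hypothetical helper and the example number is made up.

```python
# Assumed decimal (SI) multipliers for the prefix letters of the unit schema.
PREFIX_FACTOR = {"": 1, "K": 10**3, "M": 10**6, "G": 10**9,
                 "T": 10**12, "P": 10**15, "E": 10**18}


def to_base_units(quantity):
    """Scale quantity['value'] by the prefix of quantity['unit'], if any."""
    # A missing unit or a missing prefix is treated as "no scaling".
    prefix = quantity.get("unit", {}).get("prefix", "")
    return quantity["value"] * PREFIX_FACTOR[prefix]


# Illustrative entry; the number is not taken from a real machine.
memory_bandwidth = {"unit": {"base": "B/s", "prefix": "G"}, "value": 350}
```

Under that assumption, `to_base_units(memory_bandwidth)` gives 350 * 10**9, that is, the bandwidth in B/s.
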
3.1.9. [Required] Property HPC cluster description > subClusters > subClusters items > nodes
Type: string
Required: Yes

Description: Node list expression

3.1.10. [Required] Property HPC cluster description > subClusters > subClusters items > topology
Type: object
Required: Yes
Additional properties: Any type allowed

Description: Node topology

3.1.10.1. [Required] Property HPC cluster description > subClusters > subClusters items > topology > node
Type: array of integer
Required: Yes

Description: HwThread list of the node

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: node items

3.1.10.1.1. HPC cluster description > subClusters > subClusters items > topology > node > node items
Type: integer
Required: No

3.1.10.2. [Required] Property HPC cluster description > subClusters > subClusters items > topology > socket
Type: array of array
Required: Yes

Description: HwThread lists of sockets

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: socket items

3.1.10.2.1. HPC cluster description > subClusters > subClusters items > topology > socket > socket items
Type: array of integer
Required: No

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: socket items items

3.1.10.2.1.1. HPC cluster description > subClusters > subClusters items > topology > socket > socket items > socket items items
Type: integer
Required: No

3.1.10.3. [Required] Property HPC cluster description > subClusters > subClusters items > topology > memoryDomain
Type: array of array
Required: Yes

Description: HwThread lists of memory domains

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: memoryDomain items

3.1.10.3.1. HPC cluster description > subClusters > subClusters items > topology > memoryDomain > memoryDomain items
Type: array of integer
Required: No

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: memoryDomain items items

3.1.10.3.1.1. HPC cluster description > subClusters > subClusters items > topology > memoryDomain > memoryDomain items > memoryDomain items items
Type: integer
Required: No

3.1.10.4. [Optional] Property HPC cluster description > subClusters > subClusters items > topology > die
Type: array of array
Required: No

Description: HwThread lists of dies

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: die items

3.1.10.4.1. HPC cluster description > subClusters > subClusters items > topology > die > die items
Type: array of integer
Required: No

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: die items items

3.1.10.4.1.1. HPC cluster description > subClusters > subClusters items > topology > die > die items > die items items
Type: integer
Required: No

3.1.10.5. [Optional] Property HPC cluster description > subClusters > subClusters items > topology > core
Type: array of array
Required: No

Description: HwThread lists of cores

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: core items

3.1.10.5.1. HPC cluster description > subClusters > subClusters items > topology > core > core items
Type: array of integer
Required: No

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: core items items

3.1.10.5.1.1. HPC cluster description > subClusters > subClusters items > topology > core > core items > core items items
Type: integer
Required: No
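
The node, socket, memoryDomain and core properties all list hardware thread ids, either as one flat array or as an array of arrays. The Python sketch below builds these lists for a regular node. It assumes that hardware threads are numbered contiguously (core by core, socket by socket) and that each socket forms exactly one memory domain. Real machines often number SMT siblings differently, so treat this as an illustration of the data layout only; `regular_topology` is a hypothetical helper.

```python
def regular_topology(sockets, cores_per_socket, threads_per_core):
    """Build topology lists for a node with contiguous thread numbering."""
    n_cores = sockets * cores_per_socket
    n_threads = n_cores * threads_per_core
    threads_per_socket = cores_per_socket * threads_per_core

    # "node" is a flat list of all hardware thread ids.
    node = list(range(n_threads))
    # "core" groups the ids of the SMT threads sharing one core.
    core = [node[i * threads_per_core:(i + 1) * threads_per_core]
            for i in range(n_cores)]
    # "socket" groups the ids of all threads on one socket.
    socket = [node[i * threads_per_socket:(i + 1) * threads_per_socket]
              for i in range(sockets)]
    # Assumption: one memory domain per socket.
    memory_domain = [list(ids) for ids in socket]

    return {"node": node, "socket": socket,
            "memoryDomain": memory_domain, "core": core}
```

The three arguments correspond to socketsPerNode, coresPerSocket and threadsPerCore, so the length of the node list equals their product.
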
3.1.10.6. [Optional] Property HPC cluster description > subClusters > subClusters items > topology > accelerators
Type: array of object
Required: No

Description: List of accelerator devices

Array restrictions
Min items: N/A
Max items: N/A
Items unicity: False
Additional items: False
Tuple validation: See below

Each item of this array must be: accelerators items

3.1.10.6.1. HPC cluster description > subClusters > subClusters items > topology > accelerators > accelerators items
Type: object
Required: No
Additional properties: Any type allowed

3.1.10.6.1.1. [Required] Property HPC cluster description > subClusters > subClusters items > topology > accelerators > accelerators items > id
Type: string
Required: Yes

Description: The unique device id

3.1.10.6.1.2. [Required] Property HPC cluster description > subClusters > subClusters items > topology > accelerators > accelerators items > type
Type: enum (of string)
Required: Yes

Description: The accelerator type

Must be one of:

  • “Nvidia GPU”
  • “AMD GPU”
  • “Intel GPU”
3.1.10.6.1.3. [Required] Property HPC cluster description > subClusters > subClusters items > topology > accelerators > accelerators items > model
Type: string
Required: Yes

Description: The accelerator model
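
As a minimal sketch of the accelerators array, the Python snippet below shows two entries and a simple check of the three required properties and the type enum. The ids and model names are made up. The duplicate-id check follows from the wording "unique device id" and is an extra assumption, not something the schema itself enforces; `check_accelerators` is a hypothetical helper.

```python
ACCELERATOR_TYPES = ("Nvidia GPU", "AMD GPU", "Intel GPU")

# Illustrative entries; the ids and model names are made up.
accelerators = [
    {"id": "0000:01:00.0", "type": "Nvidia GPU", "model": "example-model-a"},
    {"id": "0000:41:00.0", "type": "Nvidia GPU", "model": "example-model-a"},
]


def check_accelerators(items):
    """Return problems: missing properties, unknown types, duplicate ids."""
    problems = []
    seen = set()
    for index, item in enumerate(items):
        for key in ("id", "type", "model"):
            if key not in item:
                problems.append(
                    f"item {index}: missing required property '{key}'")
        if "type" in item and item["type"] not in ACCELERATOR_TYPES:
            problems.append(f"item {index}: unknown type {item['type']!r}")
        if "id" in item:
            # Extra check (assumption): ids should not repeat.
            if item["id"] in seen:
                problems.append(f"item {index}: duplicate id {item['id']!r}")
            seen.add(item["id"])
    return problems
```

`check_accelerators(accelerators)` returns an empty list for the two entries above.
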


Generated using json-schema-for-humans on 2024-02-02 at 14:36:54 +0100