The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core.
- Throughput at design limit(s)
- Good (low) CPI
- Integral ratio of cycles to specific instruction count(s).
Use a hardware-counter tool like:
- LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
- Same information can be provided by perf or PAPI