Difference between revisions of "ALUSaturation"

From HPC Wiki
Jump to navigation Jump to search
(Created page with "== Description == The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. == Symptoms == * Throughput...")
 
Line 9: Line 9:
 
== Detection ==
 
== Detection ==
 
Use a hardware-counter tool like:
 
Use a hardware-counter tool like:
* LIKWID with performance groups FLOPS_SP, FLOPS
+
* LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
 +
* Same information can be provided by perf or PAPI
  
 
== Possible optimizations and/or fixes ==
 
== Possible optimizations and/or fixes ==

Revision as of 11:59, 6 March 2019

Description

The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core.

Symptoms

  • Throughput at design limit(s)
  • Good (low) CPI
  • Integral ratio of cycles to specific instruction count(s).

Detection

Use a hardware-counter tool like:

  • LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
  • Same information can be provided by perf or PAPI

Possible optimizations and/or fixes