Difference between revisions of "ALUSaturation"
Revision as of 10:59, 6 March 2019
Description
The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core.
Symptoms
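- Instruction throughput at the design limit(s) of the saturated functional unit
- Good (low) CPI (cycles per instruction)
- Integral ratio of cycles to the count(s) of the specific instruction type(s) that occupy the unit.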
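The symptoms above can be checked from raw counter readings. The following is a minimal sketch, not part of the original pattern description: the function names, the 5% tolerance, and the sample counter values are all hypothetical, and treating the reciprocal ratio (several instructions per cycle) as "integral" too is an assumption.

```python
def cpi(cycles, instructions):
    """Cycles per retired instruction; lower is better."""
    return cycles / instructions


def is_near_integral(ratio, tol=0.05):
    """Return True if ratio (or its reciprocal) is close to an integer >= 1.

    The reciprocal is considered so that a unit retiring, e.g., two
    operations per cycle (ratio 0.5) also counts as an integral ratio.
    The relative tolerance `tol` is an arbitrary illustrative choice.
    """
    if ratio <= 0:
        return False
    r = max(ratio, 1.0 / ratio)
    nearest = round(r)
    return abs(r - nearest) <= tol * nearest


# Hypothetical counter readings, for illustration only.
cycles = 4_000_000
instructions = 8_000_000
divides = 1_000_000  # count of one specific instruction type

print("CPI:", cpi(cycles, instructions))
print("cycles per divide:", cycles / divides)
print("integral ratio:", is_near_integral(cycles / divides))
```

A low CPI together with an integral cycles-per-instruction ratio for one instruction type is only a hint; the throughput still has to be compared against the documented limit of the unit in question.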
Detection
Use a hardware-counter tool such as:
- LIKWID with the performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
- The same information can be obtained with perf or PAPI
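As a rough sketch of how the LIKWID groups listed above might be measured one after another, the helper below only assembles `likwid-perfctr` command lines (`-C` pins to a core, `-g` selects the group). The helper name, the core number and the executable `./a.out` are placeholders, and which groups are available depends on the CPU architecture.

```python
# Performance groups named in the Detection list.
GROUPS = ["FLOPS_SP", "FLOPS_DP", "DATA", "CLOCK"]


def likwid_commands(executable, core=0, groups=GROUPS):
    """Build one likwid-perfctr command line (as an argument list) per group."""
    return [
        ["likwid-perfctr", "-C", str(core), "-g", group, executable]
        for group in groups
    ]


# Print the commands; they could be passed to subprocess.run() unchanged.
for cmd in likwid_commands("./a.out"):
    print(" ".join(cmd))
```

Each group requires a separate run, so the measured code should behave reproducibly between runs.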