Difference between revisions of "ALUSaturation"
Jump to navigation
Jump to search
(Created page with "== Description == The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. == Symptoms == * Throughput...") |
m |
||
(One intermediate revision by one other user not shown) | |||
Line 1: | Line 1: | ||
+ | [[Category:Performance Pattern]] | ||
== Description == | == Description == | ||
The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. | The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. | ||
Line 9: | Line 10: | ||
== Detection == | == Detection == | ||
Use a hardware-counter tool like: | Use a hardware-counter tool like: | ||
− | * LIKWID with performance groups FLOPS_SP, | + | * LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK |
+ | * Same information can be provided by perf or PAPI | ||
== Possible optimizations and/or fixes == | == Possible optimizations and/or fixes == |
Latest revision as of 15:13, 3 September 2019
Description
The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core.
Symptoms
- Throughput at design limit(s)
- Good (low) CPI
- Integral ratio of cycles to specific instruction count(s).
Detection
Use a hardware-counter tool like:
- LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
- Same information can be provided by perf or PAPI