<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://hpc-wiki.info/hpc/index.php?action=history&amp;feed=atom&amp;title=Load_Imbalance</id>
	<title>Load Imbalance - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://hpc-wiki.info/hpc/index.php?action=history&amp;feed=atom&amp;title=Load_Imbalance"/>
	<link rel="alternate" type="text/html" href="https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;action=history"/>
	<updated>2026-05-26T11:17:49Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.35.9</generator>
	<entry>
		<id>https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=2407&amp;oldid=prev</id>
		<title>Paul-kapinos-e26d@rwth-aachen.de: Paul-kapinos-e26d@rwth-aachen.de moved page LoadImbalance to Load Imbalance: pretty-printing</title>
		<link rel="alternate" type="text/html" href="https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=2407&amp;oldid=prev"/>
		<updated>2020-07-22T12:56:33Z</updated>

		<summary type="html">&lt;p&gt;Paul-kapinos-e26d@rwth-aachen.de moved page &lt;a href=&quot;/hpc/LoadImbalance&quot; class=&quot;mw-redirect&quot; title=&quot;LoadImbalance&quot;&gt;Load Imbalance&lt;/a&gt; to &lt;a href=&quot;/hpc/Load_Imbalance&quot; title=&quot;Load Imbalance&quot;&gt;Load Imbalance&lt;/a&gt;: pretty-printing&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left diff-editfont-monospace&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 12:56, 22 July 2020&lt;/td&gt;
				&lt;/tr&gt;
&lt;!-- diff cache key hpc_wiki:diff::1.12:old-2104:rev-2407 --&gt;
&lt;/table&gt;</summary>
		<author><name>Paul-kapinos-e26d@rwth-aachen.de</name></author>
	</entry>
	<entry>
		<id>https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=2104&amp;oldid=prev</id>
		<title>Daniel-schurhoff-de23@rwth-aachen.de at 06:25, 4 September 2019</title>
		<link rel="alternate" type="text/html" href="https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=2104&amp;oldid=prev"/>
		<updated>2019-09-04T06:25:03Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left diff-editfont-monospace&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 06:25, 4 September 2019&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot; &gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt; &lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[Category:Performance Pattern]]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;== Description ==&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;== Description ==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The pattern &amp;quot;Load Imbalance&amp;quot; describes a common problem in parallelized applications. It describes the problem when work is not equally distributed over all processing units and consequently some unit(s) do more work than others. This commonly results in wait time for the processing units being faster (less work) until the slower ones (more work) finished their task at a synchronization point.&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The pattern &amp;quot;Load Imbalance&amp;quot; describes a common problem in parallelized applications. It describes the problem when work is not equally distributed over all processing units and consequently some unit(s) do more work than others. This commonly results in wait time for the processing units being faster (less work) until the slower ones (more work) finished their task at a synchronization point.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key hpc_wiki:diff::1.12:old-1476:rev-2104 --&gt;
&lt;/table&gt;</summary>
		<author><name>Daniel-schurhoff-de23@rwth-aachen.de</name></author>
	</entry>
	<entry>
		<id>https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=1476&amp;oldid=prev</id>
		<title>Thomas-gruber-5537@uni-erlangen.de: Created page with &quot;== Description == The pattern &quot;Load Imbalance&quot; describes a common problem in parallelized applications. It describes the problem when work is not equally distributed over all...&quot;</title>
		<link rel="alternate" type="text/html" href="https://hpc-wiki.info/hpc/index.php?title=Load_Imbalance&amp;diff=1476&amp;oldid=prev"/>
		<updated>2019-03-06T11:41:26Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Description == The pattern &amp;quot;Load Imbalance&amp;quot; describes a common problem in parallelized applications. It describes the problem when work is not equally distributed over all...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Description ==&lt;br /&gt;
The pattern &amp;quot;Load Imbalance&amp;quot; describes a common problem in parallelized applications. It describes the problem when work is not equally distributed over all processing units and consequently some unit(s) do more work than others. This commonly results in wait time for the processing units being faster (less work) until the slower ones (more work) finished their task at a synchronization point.&lt;br /&gt;
&lt;br /&gt;
== Symptoms ==&lt;br /&gt;
* Saturating/sub-linear speedup&lt;br /&gt;
&lt;br /&gt;
== Detection ==&lt;br /&gt;
The detection mechanisms depend on the definition of &amp;#039;work&amp;#039; for the application. If floating-point calculations are the smallest task of processing, you can use hardware performance monitoring tools:&lt;br /&gt;
* LIKWID with performance groups FLOPS_DP and FLOPS_SP&lt;br /&gt;
* PAPI with papi_mflops() or PAPI_SP_OPS and PAPI_DP_OPS events&lt;br /&gt;
* perf offers fp_arith_inst_retired.* events&lt;br /&gt;
If other operations are your smallest task and there are no hardware performance events available to count them, use measurements near to the processing units which regards data transfers, the inputs for your work.&lt;br /&gt;
* LIKWID with performance groups DATA and L1&lt;br /&gt;
* PAPI and perf also provide events for load/store counting at each CPU core and data transfers between core and L1 cache&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Possible optimizations and/or fixes ==&lt;br /&gt;
* Balance the work over all processing units as good as possible.&lt;/div&gt;</summary>
		<author><name>Thomas-gruber-5537@uni-erlangen.de</name></author>
	</entry>
</feed>