diff --git a/index.html b/index.html
index f4cec6e..5850538 100644
--- a/index.html
+++ b/index.html
@@ -1254,493 +1254,494 @@ pre{ white-space:pre }
 <li><a href="#gem5-arm-platforms">19.17. gem5 ARM platforms</a></li>
 <li><a href="#gem5-upstream-images">19.18. gem5 upstream images</a></li>
 <li><a href="#gem5-bootloaders">19.19. gem5 bootloaders</a></li>
-<li><a href="#gem5-internals">19.20. gem5 internals</a>
+<li><a href="#gem5-commmonitor">19.20. gem5 <code>CommMonitor</code></a></li>
+<li><a href="#gem5-internals">19.21. gem5 internals</a>
 <ul class="sectlevel3">
-<li><a href="#gem5-eclipse-configuration">19.20.1. gem5 Eclipse configuration</a></li>
-<li><a href="#gem5-python-c-interaction">19.20.2. gem5 Python C++ interaction</a></li>
-<li><a href="#gem5-entry-point">19.20.3. gem5 entry point</a>
+<li><a href="#gem5-eclipse-configuration">19.21.1. gem5 Eclipse configuration</a></li>
+<li><a href="#gem5-python-c-interaction">19.21.2. gem5 Python C++ interaction</a></li>
+<li><a href="#gem5-entry-point">19.21.3. gem5 entry point</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-m5-objects-module">19.20.3.1. gem5 <code>m5.objects</code> module</a></li>
+<li><a href="#gem5-m5-objects-module">19.21.3.1. gem5 <code>m5.objects</code> module</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue">19.20.4. gem5 event queue</a>
+<li><a href="#gem5-event-queue">19.21.4. gem5 event queue</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis">19.20.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis</a>
+<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis">19.21.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis</a>
 <ul class="sectlevel5">
-<li><a href="#atomicsimplecpu-initial-events">19.20.4.1.1. AtomicSimpleCPU initial events</a></li>
-<li><a href="#atomicsimplecpu-tick-reschedule-timing">19.20.4.1.2. AtomicSimpleCPU tick reschedule timing</a></li>
-<li><a href="#atomicsimplecpu-memory-access">19.20.4.1.3. AtomicSimpleCPU memory access</a></li>
-<li><a href="#gem5-se-py-page-translation">19.20.4.1.4. gem5 se.py page translation</a></li>
+<li><a href="#atomicsimplecpu-initial-events">19.21.4.1.1. AtomicSimpleCPU initial events</a></li>
+<li><a href="#atomicsimplecpu-tick-reschedule-timing">19.21.4.1.2. AtomicSimpleCPU tick reschedule timing</a></li>
+<li><a href="#atomicsimplecpu-memory-access">19.21.4.1.3. AtomicSimpleCPU memory access</a></li>
+<li><a href="#gem5-se-py-page-translation">19.21.4.1.4. gem5 se.py page translation</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">19.20.4.2. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis</a>
+<li><a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">19.21.4.2. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis</a>
 <ul class="sectlevel5">
-<li><a href="#timingsimplecpu-analysis-0">19.20.4.2.1. TimingSimpleCPU analysis #0</a></li>
-<li><a href="#timingsimplecpu-analysis-1">19.20.4.2.2. TimingSimpleCPU analysis #1</a></li>
-<li><a href="#timingsimplecpu-analysis-2">19.20.4.2.3. TimingSimpleCPU analysis #2</a></li>
-<li><a href="#timingsimplecpu-analysis-3-and-4">19.20.4.2.4. TimingSimpleCPU analysis #3 and #4</a></li>
-<li><a href="#timingsimplecpu-analysis-5">19.20.4.2.5. TimingSimpleCPU analysis #5</a></li>
-<li><a href="#timingsimplecpu-analysis-6">19.20.4.2.6. TimingSimpleCPU analysis #6</a></li>
-<li><a href="#timingsimplecpu-analysis-7">19.20.4.2.7. TimingSimpleCPU analysis #7</a></li>
-<li><a href="#timingsimplecpu-analysis-8">19.20.4.2.8. TimingSimpleCPU analysis #8</a></li>
-<li><a href="#timingsimplecpu-analysis-9">19.20.4.2.9. TimingSimpleCPU analysis #9</a></li>
-<li><a href="#timingsimplecpu-analysis-10">19.20.4.2.10. TimingSimpleCPU analysis #10</a></li>
-<li><a href="#timingsimplecpu-analysis-11">19.20.4.2.11. TimingSimpleCPU analysis #11</a></li>
-<li><a href="#timingsimplecpu-analysis-12">19.20.4.2.12. TimingSimpleCPU analysis #12</a></li>
-<li><a href="#timingsimplecpu-analysis-13">19.20.4.2.13. TimingSimpleCPU analysis #13</a></li>
-<li><a href="#timingsimplecpu-analysis-14">19.20.4.2.14. TimingSimpleCPU analysis #14</a></li>
-<li><a href="#timingsimplecpu-analysis-15">19.20.4.2.15. TimingSimpleCPU analysis #15</a></li>
-<li><a href="#timingsimplecpu-analysis-16">19.20.4.2.16. TimingSimpleCPU analysis #16</a></li>
-<li><a href="#timingsimplecpu-analysis-17">19.20.4.2.17. TimingSimpleCPU analysis #17</a></li>
-<li><a href="#timingsimplecpu-analysis-18">19.20.4.2.18. TimingSimpleCPU analysis #18</a></li>
-<li><a href="#timingsimplecpu-analysis-19">19.20.4.2.19. TimingSimpleCPU analysis #19</a></li>
-<li><a href="#timingsimplecpu-analysis-20">19.20.4.2.20. TimingSimpleCPU analysis #20</a></li>
-<li><a href="#timingsimplecpu-analysis-21">19.20.4.2.21. TimingSimpleCPU analysis #21</a></li>
-<li><a href="#timingsimplecpu-analysis-22">19.20.4.2.22. TimingSimpleCPU analysis #22</a></li>
-<li><a href="#timingsimplecpu-analysis-23">19.20.4.2.23. TimingSimpleCPU analysis #23</a></li>
-<li><a href="#timingsimplecpu-analysis-24">19.20.4.2.24. TimingSimpleCPU analysis #24</a></li>
-<li><a href="#timingsimplecpu-analysis-25">19.20.4.2.25. TimingSimpleCPU analysis #25</a></li>
-<li><a href="#timingsimplecpu-analysis-26">19.20.4.2.26. TimingSimpleCPU analysis #26</a></li>
-<li><a href="#timingsimplecpu-analysis-27">19.20.4.2.27. TimingSimpleCPU analysis #27</a></li>
-<li><a href="#timingsimplecpu-analysis-28">19.20.4.2.28. TimingSimpleCPU analysis #28</a></li>
-<li><a href="#timingsimplecpu-analysis-29">19.20.4.2.29. TimingSimpleCPU analysis #29</a></li>
-<li><a href="#timingsimplecpu-analysis-ldr-stall">19.20.4.2.30. TimingSimpleCPU analysis: LDR stall</a></li>
+<li><a href="#timingsimplecpu-analysis-0">19.21.4.2.1. TimingSimpleCPU analysis #0</a></li>
+<li><a href="#timingsimplecpu-analysis-1">19.21.4.2.2. TimingSimpleCPU analysis #1</a></li>
+<li><a href="#timingsimplecpu-analysis-2">19.21.4.2.3. TimingSimpleCPU analysis #2</a></li>
+<li><a href="#timingsimplecpu-analysis-3-and-4">19.21.4.2.4. TimingSimpleCPU analysis #3 and #4</a></li>
+<li><a href="#timingsimplecpu-analysis-5">19.21.4.2.5. TimingSimpleCPU analysis #5</a></li>
+<li><a href="#timingsimplecpu-analysis-6">19.21.4.2.6. TimingSimpleCPU analysis #6</a></li>
+<li><a href="#timingsimplecpu-analysis-7">19.21.4.2.7. TimingSimpleCPU analysis #7</a></li>
+<li><a href="#timingsimplecpu-analysis-8">19.21.4.2.8. TimingSimpleCPU analysis #8</a></li>
+<li><a href="#timingsimplecpu-analysis-9">19.21.4.2.9. TimingSimpleCPU analysis #9</a></li>
+<li><a href="#timingsimplecpu-analysis-10">19.21.4.2.10. TimingSimpleCPU analysis #10</a></li>
+<li><a href="#timingsimplecpu-analysis-11">19.21.4.2.11. TimingSimpleCPU analysis #11</a></li>
+<li><a href="#timingsimplecpu-analysis-12">19.21.4.2.12. TimingSimpleCPU analysis #12</a></li>
+<li><a href="#timingsimplecpu-analysis-13">19.21.4.2.13. TimingSimpleCPU analysis #13</a></li>
+<li><a href="#timingsimplecpu-analysis-14">19.21.4.2.14. TimingSimpleCPU analysis #14</a></li>
+<li><a href="#timingsimplecpu-analysis-15">19.21.4.2.15. TimingSimpleCPU analysis #15</a></li>
+<li><a href="#timingsimplecpu-analysis-16">19.21.4.2.16. TimingSimpleCPU analysis #16</a></li>
+<li><a href="#timingsimplecpu-analysis-17">19.21.4.2.17. TimingSimpleCPU analysis #17</a></li>
+<li><a href="#timingsimplecpu-analysis-18">19.21.4.2.18. TimingSimpleCPU analysis #18</a></li>
+<li><a href="#timingsimplecpu-analysis-19">19.21.4.2.19. TimingSimpleCPU analysis #19</a></li>
+<li><a href="#timingsimplecpu-analysis-20">19.21.4.2.20. TimingSimpleCPU analysis #20</a></li>
+<li><a href="#timingsimplecpu-analysis-21">19.21.4.2.21. TimingSimpleCPU analysis #21</a></li>
+<li><a href="#timingsimplecpu-analysis-22">19.21.4.2.22. TimingSimpleCPU analysis #22</a></li>
+<li><a href="#timingsimplecpu-analysis-23">19.21.4.2.23. TimingSimpleCPU analysis #23</a></li>
+<li><a href="#timingsimplecpu-analysis-24">19.21.4.2.24. TimingSimpleCPU analysis #24</a></li>
+<li><a href="#timingsimplecpu-analysis-25">19.21.4.2.25. TimingSimpleCPU analysis #25</a></li>
+<li><a href="#timingsimplecpu-analysis-26">19.21.4.2.26. TimingSimpleCPU analysis #26</a></li>
+<li><a href="#timingsimplecpu-analysis-27">19.21.4.2.27. TimingSimpleCPU analysis #27</a></li>
+<li><a href="#timingsimplecpu-analysis-28">19.21.4.2.28. TimingSimpleCPU analysis #28</a></li>
+<li><a href="#timingsimplecpu-analysis-29">19.21.4.2.29. TimingSimpleCPU analysis #29</a></li>
+<li><a href="#timingsimplecpu-analysis-ldr-stall">19.21.4.2.30. TimingSimpleCPU analysis: LDR stall</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">19.20.4.3. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a>
+<li><a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">19.21.4.3. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a>
 <ul class="sectlevel5">
-<li><a href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">19.20.4.3.1. What is the coherency protocol implemented by the classic cache system in gem5?</a></li>
+<li><a href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">19.21.4.3.1. What is the coherency protocol implemented by the classic cache system in gem5?</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">19.20.4.4. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a>
+<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">19.21.4.4. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a>
 <ul class="sectlevel5">
-<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby">19.20.4.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs and Ruby</a></li>
+<li><a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby">19.21.4.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs and Ruby</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis">19.20.4.5. gem5 event queue MinorCPU syscall emulation freestanding example analysis</a>
+<li><a href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis">19.21.4.5. gem5 event queue MinorCPU syscall emulation freestanding example analysis</a>
 <ul class="sectlevel5">
-<li><a href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard">19.20.4.5.1. gem5 event queue MinorCPU syscall emulation freestanding example analysis: hazard</a></li>
+<li><a href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard">19.21.4.5.1. gem5 event queue MinorCPU syscall emulation freestanding example analysis: hazard</a></li>
 </ul>
 </li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis">19.20.4.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis</a>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis">19.21.4.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis</a>
 <ul class="sectlevel5">
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless">19.20.4.6.1. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazardless</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">19.20.4.6.2. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4">19.20.4.6.3. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard4</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall">19.20.4.6.4. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain">19.20.4.6.5. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-gain</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4">19.20.4.6.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-hazard4</a></li>
-<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative">19.20.4.6.7. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: speculative</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless">19.21.4.6.1. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazardless</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">19.21.4.6.2. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4">19.21.4.6.3. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard4</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall">19.21.4.6.4. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain">19.21.4.6.5. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-gain</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4">19.21.4.6.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-hazard4</a></li>
+<li><a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative">19.21.4.6.7. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: speculative</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#gem5-instruction-definitions">19.20.5. gem5 instruction definitions</a>
+<li><a href="#gem5-instruction-definitions">19.21.5. gem5 instruction definitions</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-execute-vs-initiateacc-vs-completeacc">19.20.5.1. gem5 <code>execute</code> vs <code>initiateAcc</code> vs <code>completeAcc</code></a>
+<li><a href="#gem5-execute-vs-initiateacc-vs-completeacc">19.21.5.1. gem5 <code>execute</code> vs <code>initiateAcc</code> vs <code>completeAcc</code></a>
 <ul class="sectlevel5">
-<li><a href="#gem5-completeacc">19.20.5.1.1. gem5 <code>completeAcc</code></a></li>
+<li><a href="#gem5-completeacc">19.21.5.1.1. gem5 <code>completeAcc</code></a></li>
 </ul>
 </li>
-<li><a href="#gem5-microops">19.20.5.2. gem5 microops</a></li>
+<li><a href="#gem5-microops">19.21.5.2. gem5 microops</a></li>
 </ul>
 </li>
-<li><a href="#gem5-port-system">19.20.6. gem5 port system</a>
+<li><a href="#gem5-port-system">19.21.6. gem5 port system</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-functional-vs-atomic-vs-timing-memory-requests">19.20.6.1. gem5 functional vs atomic vs timing memory requests</a>
+<li><a href="#gem5-functional-vs-atomic-vs-timing-memory-requests">19.21.6.1. gem5 functional vs atomic vs timing memory requests</a>
 <ul class="sectlevel5">
-<li><a href="#gem5-functional-requests">19.20.6.1.1. gem5 functional requests</a></li>
+<li><a href="#gem5-functional-requests">19.21.6.1.1. gem5 functional requests</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process">19.20.7. gem5 <code>ThreadContext</code> vs <code>ThreadState</code> vs <code>ExecContext</code> vs <code>Process</code></a>
+<li><a href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process">19.21.7. gem5 <code>ThreadContext</code> vs <code>ThreadState</code> vs <code>ExecContext</code> vs <code>Process</code></a>
 <ul class="sectlevel4">
-<li><a href="#gem5-threadcontext">19.20.7.1. gem5 <code>ThreadContext</code></a>
+<li><a href="#gem5-threadcontext">19.21.7.1. gem5 <code>ThreadContext</code></a>
 <ul class="sectlevel5">
-<li><a href="#gem5-simplethread">19.20.7.1.1. gem5 <code>SimpleThread</code></a></li>
-<li><a href="#gem5-o3threadcontext">19.20.7.1.2. gem5 <code>O3ThreadContext</code></a></li>
+<li><a href="#gem5-simplethread">19.21.7.1.1. gem5 <code>SimpleThread</code></a></li>
+<li><a href="#gem5-o3threadcontext">19.21.7.1.2. gem5 <code>O3ThreadContext</code></a></li>
 </ul>
 </li>
-<li><a href="#gem5-threadstate">19.20.7.2. gem5 <code>ThreadState</code></a></li>
-<li><a href="#gem5-execcontext">19.20.7.3. gem5 <code>ExecContext</code></a>
+<li><a href="#gem5-threadstate">19.21.7.2. gem5 <code>ThreadState</code></a></li>
+<li><a href="#gem5-execcontext">19.21.7.3. gem5 <code>ExecContext</code></a>
 <ul class="sectlevel5">
-<li><a href="#gem5-execcontext-readintregoperand-register-resolution">19.20.7.3.1. gem5 <code>ExecContext::readIntRegOperand</code> register resolution</a></li>
+<li><a href="#gem5-execcontext-readintregoperand-register-resolution">19.21.7.3.1. gem5 <code>ExecContext::readIntRegOperand</code> register resolution</a></li>
 </ul>
 </li>
-<li><a href="#gem5-process">19.20.7.4. gem5 <code>Process</code></a></li>
+<li><a href="#gem5-process">19.21.7.4. gem5 <code>Process</code></a></li>
 </ul>
 </li>
-<li><a href="#gem5-functional-units">19.20.8. gem5 functional units</a>
+<li><a href="#gem5-functional-units">19.21.8. gem5 functional units</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-minorcpu-default-functional-units">19.20.8.1. gem5 <code>MinorCPU</code> default functional units</a></li>
-<li><a href="#gem5-derivo3cpu-default-functional-units">19.20.8.2. gem5 DerivO3CPU default functional units</a></li>
+<li><a href="#gem5-minorcpu-default-functional-units">19.21.8.1. gem5 <code>MinorCPU</code> default functional units</a></li>
+<li><a href="#gem5-derivo3cpu-default-functional-units">19.21.8.2. gem5 DerivO3CPU default functional units</a></li>
 </ul>
 </li>
-<li><a href="#gem5-code-generation">19.20.9. gem5 code generation</a>
+<li><a href="#gem5-code-generation">19.21.9. gem5 code generation</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-the-isa">19.20.9.1. gem5 THE_ISA</a></li>
+<li><a href="#gem5-the-isa">19.21.9.1. gem5 THE_ISA</a></li>
 </ul>
 </li>
-<li><a href="#gem5-build-system">19.20.10. gem5 build system</a>
+<li><a href="#gem5-build-system">19.21.10. gem5 build system</a>
 <ul class="sectlevel4">
-<li><a href="#m5-override-py-source">19.20.10.1. M5_OVERRIDE_PY_SOURCE</a></li>
-<li><a href="#gem5-build-broken-on-recent-compiler-version">19.20.10.2. gem5 build broken on recent compiler version</a></li>
-<li><a href="#gem5-polymorphic-isa-includes">19.20.10.3. gem5 polymorphic ISA includes</a></li>
-<li><a href="#why-are-all-c-symlinked-into-the-gem5-build-dir">19.20.10.4. Why are all C++ symlinked into the gem5 build dir?</a></li>
+<li><a href="#m5-override-py-source">19.21.10.1. M5_OVERRIDE_PY_SOURCE</a></li>
+<li><a href="#gem5-build-broken-on-recent-compiler-version">19.21.10.2. gem5 build broken on recent compiler version</a></li>
+<li><a href="#gem5-polymorphic-isa-includes">19.21.10.3. gem5 polymorphic ISA includes</a></li>
+<li><a href="#why-are-all-c-symlinked-into-the-gem5-build-dir">19.21.10.4. Why are all C++ symlinked into the gem5 build dir?</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#gensim">19.21. Gensim</a></li>
 </ul>
 </li>
-<li><a href="#buildroot">20. Buildroot</a>
+<li><a href="#gensim">20. Gensim</a></li>
+<li><a href="#buildroot">21. Buildroot</a>
 <ul class="sectlevel2">
-<li><a href="#introduction-to-buildroot">20.1. Introduction to Buildroot</a></li>
-<li><a href="#custom-buildroot-configs">20.2. Custom Buildroot configs</a>
+<li><a href="#introduction-to-buildroot">21.1. Introduction to Buildroot</a></li>
+<li><a href="#custom-buildroot-configs">21.2. Custom Buildroot configs</a>
 <ul class="sectlevel3">
-<li><a href="#enable-buildroot-compiler-optimizations">20.2.1. Enable Buildroot compiler optimizations</a></li>
+<li><a href="#enable-buildroot-compiler-optimizations">21.2.1. Enable Buildroot compiler optimizations</a></li>
 </ul>
 </li>
-<li><a href="#find-buildroot-options-with-make-menuconfig">20.3. Find Buildroot options with make menuconfig</a></li>
-<li><a href="#change-user">20.4. Change user</a>
+<li><a href="#find-buildroot-options-with-make-menuconfig">21.3. Find Buildroot options with make menuconfig</a></li>
+<li><a href="#change-user">21.4. Change user</a>
 <ul class="sectlevel3">
-<li><a href="#login-as-a-non-root-user-without-password">20.4.1. Login as a non-root user without password</a></li>
+<li><a href="#login-as-a-non-root-user-without-password">21.4.1. Login as a non-root user without password</a></li>
 </ul>
 </li>
-<li><a href="#add-new-files-to-the-buildroot-image">20.5. Add new files to the Buildroot image</a>
+<li><a href="#add-new-files-to-the-buildroot-image">21.5. Add new files to the Buildroot image</a>
 <ul class="sectlevel3">
-<li><a href="#add-new-buildroot-packages">20.5.1. Add new Buildroot packages</a></li>
+<li><a href="#add-new-buildroot-packages">21.5.1. Add new Buildroot packages</a></li>
 </ul>
 </li>
-<li><a href="#remove-buildroot-packages">20.6. Remove Buildroot packages</a></li>
-<li><a href="#br2-target-rootfs-ext2-size">20.7. BR2_TARGET_ROOTFS_EXT2_SIZE</a>
+<li><a href="#remove-buildroot-packages">21.6. Remove Buildroot packages</a></li>
+<li><a href="#br2-target-rootfs-ext2-size">21.7. BR2_TARGET_ROOTFS_EXT2_SIZE</a>
 <ul class="sectlevel3">
-<li><a href="#squashfs">20.7.1. SquashFS</a></li>
+<li><a href="#squashfs">21.7.1. SquashFS</a></li>
 </ul>
 </li>
-<li><a href="#rpath">20.8. Buildroot rebuild is slow when the root filesystem is large</a></li>
-<li><a href="#report-upstream-bugs">20.9. Report upstream bugs</a></li>
-<li><a href="#libc-choice">20.10. libc choice</a></li>
-<li><a href="#buildroot-hello-world">20.11. Buildroot hello world</a></li>
-<li><a href="#update-the-buildroot-toolchain">20.12. Update the Buildroot toolchain</a>
+<li><a href="#rpath">21.8. Buildroot rebuild is slow when the root filesystem is large</a></li>
+<li><a href="#report-upstream-bugs">21.9. Report upstream bugs</a></li>
+<li><a href="#libc-choice">21.10. libc choice</a></li>
+<li><a href="#buildroot-hello-world">21.11. Buildroot hello world</a></li>
+<li><a href="#update-the-buildroot-toolchain">21.12. Update the Buildroot toolchain</a>
 <ul class="sectlevel3">
-<li><a href="#update-gcc-gcc-supported-by-buildroot">20.12.1. Update GCC: GCC supported by Buildroot</a></li>
-<li><a href="#update-gcc-gcc-not-supported-by-buildroot">20.12.2. Update GCC: GCC not supported by Buildroot</a></li>
+<li><a href="#update-gcc-gcc-supported-by-buildroot">21.12.1. Update GCC: GCC supported by Buildroot</a></li>
+<li><a href="#update-gcc-gcc-not-supported-by-buildroot">21.12.2. Update GCC: GCC not supported by Buildroot</a></li>
 </ul>
 </li>
-<li><a href="#buildroot-vanilla-kernel">20.13. Buildroot vanilla kernel</a></li>
+<li><a href="#buildroot-vanilla-kernel">21.13. Buildroot vanilla kernel</a></li>
 </ul>
 </li>
-<li><a href="#userland-content">21. Userland content</a>
+<li><a href="#userland-content">22. Userland content</a>
 <ul class="sectlevel2">
-<li><a href="#c">21.1. C</a>
+<li><a href="#c">22.1. C</a>
 <ul class="sectlevel3">
-<li><a href="#malloc">21.1.1. malloc</a>
+<li><a href="#malloc">22.1.1. malloc</a>
 <ul class="sectlevel4">
-<li><a href="#malloc-implementation">21.1.1.1. malloc implementation</a></li>
-<li><a href="#malloc-maximum-size">21.1.1.2. malloc maximum size</a>
+<li><a href="#malloc-implementation">22.1.1.1. malloc implementation</a></li>
+<li><a href="#malloc-maximum-size">22.1.1.2. malloc maximum size</a>
 <ul class="sectlevel5">
-<li><a href="#linux-out-of-memory-killer">21.1.1.2.1. Linux out-of-memory killer</a></li>
+<li><a href="#linux-out-of-memory-killer">22.1.1.2.1. Linux out-of-memory killer</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#c-multithreading">21.1.2. C multithreading</a>
+<li><a href="#c-multithreading">22.1.2. C multithreading</a>
 <ul class="sectlevel4">
-<li><a href="#atomic-c">21.1.2.1. atomic.c</a></li>
+<li><a href="#atomic-c">22.1.2.1. atomic.c</a></li>
 </ul>
 </li>
-<li><a href="#gcc-c-extensions">21.1.3. GCC C extensions</a>
+<li><a href="#gcc-c-extensions">22.1.3. GCC C extensions</a>
 <ul class="sectlevel4">
-<li><a href="#c-empty-struct">21.1.3.1. C empty struct</a></li>
-<li><a href="#openmp">21.1.3.2. OpenMP</a>
+<li><a href="#c-empty-struct">22.1.3.1. C empty struct</a></li>
+<li><a href="#openmp">22.1.3.2. OpenMP</a>
 <ul class="sectlevel5">
-<li><a href="#openmp-validation">21.1.3.2.1. OpenMP validation</a></li>
+<li><a href="#openmp-validation">22.1.3.2.1. OpenMP validation</a></li>
 </ul>
 </li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#cpp">21.2. C++</a>
+<li><a href="#cpp">22.2. C++</a>
 <ul class="sectlevel3">
-<li><a href="#cpp-initialization-types">21.2.1. C++ initialization types</a></li>
-<li><a href="#cpp-multithreading">21.2.2. C++ multithreading</a>
+<li><a href="#cpp-initialization-types">22.2.1. C++ initialization types</a></li>
+<li><a href="#cpp-multithreading">22.2.2. C++ multithreading</a>
 <ul class="sectlevel4">
-<li><a href="#atomic-cpp">21.2.2.1. atomic.cpp</a>
+<li><a href="#atomic-cpp">22.2.2.1. atomic.cpp</a>
 <ul class="sectlevel5">
-<li><a href="#detailed-gem5-analysis-of-how-data-races-happen">21.2.2.1.1. Detailed gem5 analysis of how data races happen</a></li>
+<li><a href="#detailed-gem5-analysis-of-how-data-races-happen">22.2.2.1.1. Detailed gem5 analysis of how data races happen</a></li>
 </ul>
 </li>
-<li><a href="#cpp-memory-order">21.2.2.2. C++ std::memory_order</a></li>
-<li><a href="#cpp-parallel-algorithms">21.2.2.3. C++ parallel algorithms</a></li>
+<li><a href="#cpp-memory-order">22.2.2.2. C++ std::memory_order</a></li>
+<li><a href="#cpp-parallel-algorithms">22.2.2.3. C++ parallel algorithms</a></li>
 </ul>
 </li>
-<li><a href="#cpp-standards">21.2.3. C++ standards</a>
+<li><a href="#cpp-standards">22.2.3. C++ standards</a>
 <ul class="sectlevel4">
-<li><a href="#cpp17">21.2.3.1. C++17 N4659 standards draft</a></li>
+<li><a href="#cpp17">22.2.3.1. C++17 N4659 standards draft</a></li>
 </ul>
 </li>
-<li><a href="#cpp-type-casting">21.2.4. C++ type casting</a></li>
+<li><a href="#cpp-type-casting">22.2.4. C++ type casting</a></li>
 </ul>
 </li>
-<li><a href="#posix">21.3. POSIX</a>
+<li><a href="#posix">22.3. POSIX</a>
 <ul class="sectlevel3">
-<li><a href="#environment-variables">21.3.1. Environment variables</a></li>
-<li><a href="#unistd-h">21.3.2. unistd.h</a></li>
-<li><a href="#fork">21.3.3. fork</a>
+<li><a href="#environment-variables">22.3.1. Environment variables</a></li>
+<li><a href="#unistd-h">22.3.2. unistd.h</a></li>
+<li><a href="#fork">22.3.3. fork</a>
 <ul class="sectlevel4">
-<li><a href="#getpid">21.3.3.1. getpid</a></li>
-<li><a href="#fork-bomb">21.3.3.2. Fork bomb</a></li>
+<li><a href="#getpid">22.3.3.1. getpid</a></li>
+<li><a href="#fork-bomb">22.3.3.2. Fork bomb</a></li>
 </ul>
 </li>
-<li><a href="#pthreads">21.3.4. pthreads</a>
+<li><a href="#pthreads">22.3.4. pthreads</a>
 <ul class="sectlevel4">
-<li><a href="#pthread-mutex">21.3.4.1. pthread_mutex</a></li>
+<li><a href="#pthread-mutex">22.3.4.1. pthread_mutex</a></li>
 </ul>
 </li>
-<li><a href="#sysconf">21.3.5. sysconf</a></li>
-<li><a href="#mmap-2">21.3.6. mmap</a>
+<li><a href="#sysconf">22.3.5. sysconf</a></li>
+<li><a href="#mmap-2">22.3.6. mmap</a>
 <ul class="sectlevel4">
-<li><a href="#mmap-map-anonymous">21.3.6.1. mmap MAP_ANONYMOUS</a></li>
-<li><a href="#mmap-file">21.3.6.2. mmap file</a></li>
-<li><a href="#brk">21.3.6.3. brk</a></li>
+<li><a href="#mmap-map-anonymous">22.3.6.1. mmap MAP_ANONYMOUS</a></li>
+<li><a href="#mmap-file">22.3.6.2. mmap file</a></li>
+<li><a href="#brk">22.3.6.3. brk</a></li>
 </ul>
 </li>
-<li><a href="#socket">21.3.7. socket</a></li>
+<li><a href="#socket">22.3.7. socket</a></li>
 </ul>
 </li>
-<li><a href="#userland-multithreading">21.4. Userland multithreading</a></li>
-<li><a href="#c-debugging">21.5. C debugging</a>
+<li><a href="#userland-multithreading">22.4. Userland multithreading</a></li>
+<li><a href="#c-debugging">22.5. C debugging</a>
 <ul class="sectlevel3">
-<li><a href="#stack-smashing">21.5.1. Stack smashing</a></li>
-<li><a href="#memory-leaks">21.5.2. Memory leaks</a></li>
-<li><a href="#profiling-userland-programs">21.5.3. Profiling userland programs</a></li>
+<li><a href="#stack-smashing">22.5.1. Stack smashing</a></li>
+<li><a href="#memory-leaks">22.5.2. Memory leaks</a></li>
+<li><a href="#profiling-userland-programs">22.5.3. Profiling userland programs</a></li>
 </ul>
 </li>
-<li><a href="#interpreted-languages">21.6. Interpreted languages</a>
+<li><a href="#interpreted-languages">22.6. Interpreted languages</a>
 <ul class="sectlevel3">
-<li><a href="#python">21.6.1. Python</a>
+<li><a href="#python">22.6.1. Python</a>
 <ul class="sectlevel4">
-<li><a href="#build-and-install-the-interpreter">21.6.1.1. Build and install the interpreter</a></li>
-<li><a href="#python-gem5-user-mode-simulation">21.6.1.2. Python gem5 user mode simulation</a></li>
-<li><a href="#embedding-python-in-another-application">21.6.1.3. Embedding Python in another application</a></li>
-<li><a href="#pybind11">21.6.1.4. pybind11</a></li>
+<li><a href="#build-and-install-the-interpreter">22.6.1.1. Build and install the interpreter</a></li>
+<li><a href="#python-gem5-user-mode-simulation">22.6.1.2. Python gem5 user mode simulation</a></li>
+<li><a href="#embedding-python-in-another-application">22.6.1.3. Embedding Python in another application</a></li>
+<li><a href="#pybind11">22.6.1.4. pybind11</a></li>
 </ul>
 </li>
-<li><a href="#node-js">21.6.2. Node.js</a>
+<li><a href="#node-js">22.6.2. Node.js</a>
 <ul class="sectlevel4">
-<li><a href="#npm">21.6.2.1. NPM</a>
+<li><a href="#npm">22.6.2.1. NPM</a>
 <ul class="sectlevel5">
-<li><a href="#npm-data-files">21.6.2.1.1. NPM data-files</a></li>
+<li><a href="#npm-data-files">22.6.2.1.1. NPM data-files</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#java">21.6.3. Java</a></li>
+<li><a href="#java">22.6.3. Java</a></li>
 </ul>
 </li>
-<li><a href="#algorithms">21.7. Algorithms</a>
+<li><a href="#algorithms">22.7. Algorithms</a>
 <ul class="sectlevel3">
-<li><a href="#bst-vs-heap-vs-hashmap">21.7.1. BST vs heap vs hashmap</a></li>
-<li><a href="#blas">21.7.2. BLAS</a></li>
-<li><a href="#eigen">21.7.3. Eigen</a></li>
+<li><a href="#bst-vs-heap-vs-hashmap">22.7.1. BST vs heap vs hashmap</a></li>
+<li><a href="#blas">22.7.2. BLAS</a></li>
+<li><a href="#eigen">22.7.3. Eigen</a></li>
 </ul>
 </li>
-<li><a href="#benchmarks">21.8. Benchmarks</a>
+<li><a href="#benchmarks">22.8. Benchmarks</a>
 <ul class="sectlevel3">
-<li><a href="#parsec-benchmark">21.8.1. PARSEC benchmark</a>
+<li><a href="#parsec-benchmark">22.8.1. PARSEC benchmark</a>
 <ul class="sectlevel4">
-<li><a href="#parsec-benchmark-without-parsecmgmt">21.8.1.1. PARSEC benchmark without parsecmgmt</a></li>
-<li><a href="#parsec-change-the-input-size">21.8.1.2. PARSEC change the input size</a></li>
-<li><a href="#parsec-benchmark-with-parsecmgmt">21.8.1.3. PARSEC benchmark with parsecmgmt</a></li>
-<li><a href="#parsec-uninstall">21.8.1.4. PARSEC uninstall</a></li>
-<li><a href="#parsec-benchmark-hacking">21.8.1.5. PARSEC benchmark hacking</a></li>
-<li><a href="#coremark">21.8.1.6. Coremark</a></li>
+<li><a href="#parsec-benchmark-without-parsecmgmt">22.8.1.1. PARSEC benchmark without parsecmgmt</a></li>
+<li><a href="#parsec-change-the-input-size">22.8.1.2. PARSEC change the input size</a></li>
+<li><a href="#parsec-benchmark-with-parsecmgmt">22.8.1.3. PARSEC benchmark with parsecmgmt</a></li>
+<li><a href="#parsec-uninstall">22.8.1.4. PARSEC uninstall</a></li>
+<li><a href="#parsec-benchmark-hacking">22.8.1.5. PARSEC benchmark hacking</a></li>
+<li><a href="#coremark">22.8.1.6. Coremark</a></li>
 </ul>
 </li>
-<li><a href="#microbenchmarks">21.8.2. Microbenchmarks</a>
+<li><a href="#microbenchmarks">22.8.2. Microbenchmarks</a>
 <ul class="sectlevel4">
-<li><a href="#dhrystone">21.8.2.1. Dhrystone</a></li>
-<li><a href="#lmbench">21.8.2.2. LMbench</a></li>
-<li><a href="#stream-benchmark">21.8.2.3. STREAM benchmark</a></li>
+<li><a href="#dhrystone">22.8.2.1. Dhrystone</a></li>
+<li><a href="#lmbench">22.8.2.2. LMbench</a></li>
+<li><a href="#stream-benchmark">22.8.2.3. STREAM benchmark</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#userland-libs-directory">21.9. userland/libs directory</a>
+<li><a href="#userland-libs-directory">22.9. userland/libs directory</a>
 <ul class="sectlevel3">
-<li><a href="#boost">21.9.1. Boost</a></li>
-<li><a href="#hdf5">21.9.2. HDF5</a></li>
+<li><a href="#boost">22.9.1. Boost</a></li>
+<li><a href="#hdf5">22.9.2. HDF5</a></li>
 </ul>
 </li>
-<li><a href="#userland-content-filename-conventions">21.10. Userland content filename conventions</a></li>
-<li><a href="#userland-content-bibliography">21.11. Userland content bibliography</a></li>
+<li><a href="#userland-content-filename-conventions">22.10. Userland content filename conventions</a></li>
+<li><a href="#userland-content-bibliography">22.11. Userland content bibliography</a></li>
 </ul>
 </li>
-<li><a href="#userland-assembly">22. Userland assembly</a>
+<li><a href="#userland-assembly">23. Userland assembly</a>
 <ul class="sectlevel2">
-<li><a href="#assembly-registers">22.1. Assembly registers</a>
+<li><a href="#assembly-registers">23.1. Assembly registers</a>
 <ul class="sectlevel3">
-<li><a href="#armv8-aarch64-x31-register">22.1.1. ARMv8 aarch64 x31 register</a></li>
+<li><a href="#armv8-aarch64-x31-register">23.1.1. ARMv8 aarch64 x31 register</a></li>
 </ul>
 </li>
-<li><a href="#floating-point-assembly">22.2. Floating point assembly</a></li>
-<li><a href="#simd-assembly">22.3. SIMD assembly</a>
+<li><a href="#floating-point-assembly">23.2. Floating point assembly</a></li>
+<li><a href="#simd-assembly">23.3. SIMD assembly</a>
 <ul class="sectlevel3">
-<li><a href="#fma-instruction">22.3.1. FMA instruction</a></li>
+<li><a href="#fma-instruction">23.3.1. FMA instruction</a></li>
 </ul>
 </li>
-<li><a href="#user-vs-system-assembly">22.4. User vs system assembly</a></li>
-<li><a href="#userland-assembly-c-standard-library">22.5. Userland assembly C standard library</a>
+<li><a href="#user-vs-system-assembly">23.4. User vs system assembly</a></li>
+<li><a href="#userland-assembly-c-standard-library">23.5. Userland assembly C standard library</a>
 <ul class="sectlevel3">
-<li><a href="#freestanding-programs">22.5.1. Freestanding programs</a>
+<li><a href="#freestanding-programs">23.5.1. Freestanding programs</a>
 <ul class="sectlevel4">
-<li><a href="#nostartfiles-programs">22.5.1.1. nostartfiles programs</a></li>
+<li><a href="#nostartfiles-programs">23.5.1.1. nostartfiles programs</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#gcc-inline-assembly">22.6. GCC inline assembly</a>
+<li><a href="#gcc-inline-assembly">23.6. GCC inline assembly</a>
 <ul class="sectlevel3">
-<li><a href="#gcc-inline-assembly-register-variables">22.6.1. GCC inline assembly register variables</a></li>
-<li><a href="#gcc-inline-assembly-scratch-registers">22.6.2. GCC inline assembly scratch registers</a></li>
-<li><a href="#gcc-inline-assembly-early-clobbers">22.6.3. GCC inline assembly early-clobbers</a></li>
-<li><a href="#gcc-inline-assembly-floating-point-arm">22.6.4. GCC inline assembly floating point ARM</a></li>
-<li><a href="#gcc-intrinsics">22.6.5. GCC intrinsics</a>
+<li><a href="#gcc-inline-assembly-register-variables">23.6.1. GCC inline assembly register variables</a></li>
+<li><a href="#gcc-inline-assembly-scratch-registers">23.6.2. GCC inline assembly scratch registers</a></li>
+<li><a href="#gcc-inline-assembly-early-clobbers">23.6.3. GCC inline assembly early-clobbers</a></li>
+<li><a href="#gcc-inline-assembly-floating-point-arm">23.6.4. GCC inline assembly floating point ARM</a></li>
+<li><a href="#gcc-intrinsics">23.6.5. GCC intrinsics</a>
 <ul class="sectlevel4">
-<li><a href="#gcc-x86-intrinsics">22.6.5.1. GCC x86 intrinsics</a></li>
+<li><a href="#gcc-x86-intrinsics">23.6.5.1. GCC x86 intrinsics</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#linux-system-calls">22.7. Linux system calls</a>
+<li><a href="#linux-system-calls">23.7. Linux system calls</a>
 <ul class="sectlevel3">
-<li><a href="#futex-system-call">22.7.1. futex system call</a>
+<li><a href="#futex-system-call">23.7.1. futex system call</a>
 <ul class="sectlevel4">
-<li><a href="#userland-mutex-implementation">22.7.1.1. Userland mutex implementation</a></li>
+<li><a href="#userland-mutex-implementation">23.7.1.1. Userland mutex implementation</a></li>
 </ul>
 </li>
-<li><a href="#getcpu">22.7.2. <code>getcpu</code> system call and the <code>sched_getaffinity</code> glibc wrapper</a></li>
+<li><a href="#getcpu">23.7.2. <code>getcpu</code> system call and the <code>sched_getaffinity</code> glibc wrapper</a></li>
 </ul>
 </li>
-<li><a href="#linux-calling-conventions">22.8. Linux calling conventions</a>
+<li><a href="#linux-calling-conventions">23.8. Linux calling conventions</a>
 <ul class="sectlevel3">
-<li><a href="#x86_64-calling-convention">22.8.1. x86_64 calling convention</a></li>
-<li><a href="#arm-calling-convention">22.8.2. ARM calling convention</a></li>
+<li><a href="#x86_64-calling-convention">23.8.1. x86_64 calling convention</a></li>
+<li><a href="#arm-calling-convention">23.8.2. ARM calling convention</a></li>
 </ul>
 </li>
-<li><a href="#gnu-gas-assembler">22.9. GNU GAS assembler</a>
+<li><a href="#gnu-gas-assembler">23.9. GNU GAS assembler</a>
 <ul class="sectlevel3">
-<li><a href="#gnu-gas-assembler-comments">22.9.1. GNU GAS assembler comments</a></li>
-<li><a href="#gnu-gas-assembler-immediates">22.9.2. GNU GAS assembler immediates</a></li>
-<li><a href="#gnu-gas-assembler-data-sizes">22.9.3. GNU GAS assembler data sizes</a>
+<li><a href="#gnu-gas-assembler-comments">23.9.1. GNU GAS assembler comments</a></li>
+<li><a href="#gnu-gas-assembler-immediates">23.9.2. GNU GAS assembler immediates</a></li>
+<li><a href="#gnu-gas-assembler-data-sizes">23.9.3. GNU GAS assembler data sizes</a>
 <ul class="sectlevel4">
-<li><a href="#gnu-gas-assembler-arm-specifics">22.9.3.1. GNU GAS assembler ARM specifics</a>
+<li><a href="#gnu-gas-assembler-arm-specifics">23.9.3.1. GNU GAS assembler ARM specifics</a>
 <ul class="sectlevel5">
-<li><a href="#gnu-gas-assembler-arm-unified-syntax">22.9.3.1.1. GNU GAS assembler ARM unified syntax</a></li>
+<li><a href="#gnu-gas-assembler-arm-unified-syntax">23.9.3.1.1. GNU GAS assembler ARM unified syntax</a></li>
 </ul>
 </li>
-<li><a href="#gnu-gas-assembler-arm-n-and-w-suffixes">22.9.3.2. GNU GAS assembler ARM .n and .w suffixes</a></li>
+<li><a href="#gnu-gas-assembler-arm-n-and-w-suffixes">23.9.3.2. GNU GAS assembler ARM .n and .w suffixes</a></li>
 </ul>
 </li>
-<li><a href="#gnu-gas-assembler-char-literals">22.9.4. GNU GAS assembler char literals</a></li>
+<li><a href="#gnu-gas-assembler-char-literals">23.9.4. GNU GAS assembler char literals</a></li>
 </ul>
 </li>
-<li><a href="#nop-instructions">22.10. NOP instructions</a></li>
+<li><a href="#nop-instructions">23.10. NOP instructions</a></li>
 </ul>
 </li>
-<li><a href="#x86-userland-assembly">23. x86 userland assembly</a>
+<li><a href="#x86-userland-assembly">24. x86 userland assembly</a>
 <ul class="sectlevel2">
-<li><a href="#x86-registers">23.1. x86 registers</a>
+<li><a href="#x86-registers">24.1. x86 registers</a>
 <ul class="sectlevel3">
-<li><a href="#x86-flags-registers">23.1.1. x86 FLAGS registers</a></li>
+<li><a href="#x86-flags-registers">24.1.1. x86 FLAGS registers</a></li>
 </ul>
 </li>
-<li><a href="#x86-addressing-modes">23.2. x86 addressing modes</a></li>
-<li><a href="#x86-data-transfer-instructions">23.3. x86 data transfer instructions</a>
+<li><a href="#x86-addressing-modes">24.2. x86 addressing modes</a></li>
+<li><a href="#x86-data-transfer-instructions">24.3. x86 data transfer instructions</a>
 <ul class="sectlevel3">
-<li><a href="#x86-exchange-instructions">23.3.1. x86 exchange instructions</a>
+<li><a href="#x86-exchange-instructions">24.3.1. x86 exchange instructions</a>
 <ul class="sectlevel4">
-<li><a href="#x86-cmpxchg-instruction">23.3.1.1. x86 CMPXCHG instruction</a></li>
+<li><a href="#x86-cmpxchg-instruction">24.3.1.1. x86 CMPXCHG instruction</a></li>
 </ul>
 </li>
-<li><a href="#x86-push-and-pop-instructions">23.3.2. x86 PUSH and POP instructions</a></li>
-<li><a href="#x86-cqto-and-cltq-instructions">23.3.3. x86 CQTO and CLTQ instructions</a></li>
-<li><a href="#x86-cmovcc-instructions">23.3.4. x86 CMOVcc instructions</a></li>
+<li><a href="#x86-push-and-pop-instructions">24.3.2. x86 PUSH and POP instructions</a></li>
+<li><a href="#x86-cqto-and-cltq-instructions">24.3.3. x86 CQTO and CLTQ instructions</a></li>
+<li><a href="#x86-cmovcc-instructions">24.3.4. x86 CMOVcc instructions</a></li>
 </ul>
 </li>
-<li><a href="#x86-binary-arithmetic-instructions">23.4. x86 binary arithmetic instructions</a></li>
-<li><a href="#x86-logical-instructions">23.5. x86 logical instructions</a></li>
-<li><a href="#x86-shift-and-rotate-instructions">23.6. x86 shift and rotate instructions</a></li>
-<li><a href="#x86-bit-and-byte-instructions">23.7. x86 bit and byte instructions</a></li>
-<li><a href="#x86-control-transfer-instructions">23.8. x86 control transfer instructions</a>
+<li><a href="#x86-binary-arithmetic-instructions">24.4. x86 binary arithmetic instructions</a></li>
+<li><a href="#x86-logical-instructions">24.5. x86 logical instructions</a></li>
+<li><a href="#x86-shift-and-rotate-instructions">24.6. x86 shift and rotate instructions</a></li>
+<li><a href="#x86-bit-and-byte-instructions">24.7. x86 bit and byte instructions</a></li>
+<li><a href="#x86-control-transfer-instructions">24.8. x86 control transfer instructions</a>
 <ul class="sectlevel3">
-<li><a href="#x86-jcc-instructions">23.8.1. x86 Jcc instructions</a></li>
-<li><a href="#x86-loop-instruction">23.8.2. x86 LOOP instruction</a></li>
-<li><a href="#x86-string-instructions">23.8.3. x86 string instructions</a>
+<li><a href="#x86-jcc-instructions">24.8.1. x86 Jcc instructions</a></li>
+<li><a href="#x86-loop-instruction">24.8.2. x86 LOOP instruction</a></li>
+<li><a href="#x86-string-instructions">24.8.3. x86 string instructions</a>
 <ul class="sectlevel4">
-<li><a href="#x86-rep-prefix">23.8.3.1. x86 REP prefix</a></li>
+<li><a href="#x86-rep-prefix">24.8.3.1. x86 REP prefix</a></li>
 </ul>
 </li>
-<li><a href="#x86-enter-and-leave-instructions">23.8.4. x86 ENTER and LEAVE instructions</a></li>
+<li><a href="#x86-enter-and-leave-instructions">24.8.4. x86 ENTER and LEAVE instructions</a></li>
 </ul>
 </li>
-<li><a href="#x86-miscellaneous-instructions">23.9. x86 miscellaneous instructions</a></li>
-<li><a href="#x86-random-number-generator-instructions">23.10. x86 random number generator instructions</a>
+<li><a href="#x86-miscellaneous-instructions">24.9. x86 miscellaneous instructions</a></li>
+<li><a href="#x86-random-number-generator-instructions">24.10. x86 random number generator instructions</a>
 <ul class="sectlevel3">
-<li><a href="#x86-cpuid-instruction">23.10.1. x86 CPUID instruction</a></li>
+<li><a href="#x86-cpuid-instruction">24.10.1. x86 CPUID instruction</a></li>
 </ul>
 </li>
-<li><a href="#x86-x87-fpu-instructions">23.11. x86 x87 FPU instructions</a>
+<li><a href="#x86-x87-fpu-instructions">24.11. x86 x87 FPU instructions</a>
 <ul class="sectlevel3">
-<li><a href="#x86-x87-fpu-vs-simd">23.11.1. x86 x87 FPU vs SIMD</a></li>
+<li><a href="#x86-x87-fpu-vs-simd">24.11.1. x86 x87 FPU vs SIMD</a></li>
 </ul>
 </li>
-<li><a href="#x86-simd">23.12. x86 SIMD</a>
+<li><a href="#x86-simd">24.12. x86 SIMD</a>
 <ul class="sectlevel3">
-<li><a href="#x86-sse-instructions">23.12.1. x86 SSE instructions</a>
+<li><a href="#x86-sse-instructions">24.12.1. x86 SSE instructions</a>
 <ul class="sectlevel4">
-<li><a href="#x86-sse-data-transfer-instructions">23.12.1.1. x86 SSE data transfer instructions</a></li>
-<li><a href="#x86-sse-packed-arithmetic-instructions">23.12.1.2. x86 SSE packed arithmetic instructions</a></li>
-<li><a href="#x86-sse-conversion-instructions">23.12.1.3. x86 SSE conversion instructions</a></li>
+<li><a href="#x86-sse-data-transfer-instructions">24.12.1.1. x86 SSE data transfer instructions</a></li>
+<li><a href="#x86-sse-packed-arithmetic-instructions">24.12.1.2. x86 SSE packed arithmetic instructions</a></li>
+<li><a href="#x86-sse-conversion-instructions">24.12.1.3. x86 SSE conversion instructions</a></li>
 </ul>
 </li>
-<li><a href="#x86-sse2-instructions">23.12.2. x86 SSE2 instructions</a>
+<li><a href="#x86-sse2-instructions">24.12.2. x86 SSE2 instructions</a>
 <ul class="sectlevel4">
-<li><a href="#x86-paddq-instruction">23.12.2.1. x86 PADDQ instruction</a></li>
+<li><a href="#x86-paddq-instruction">24.12.2.1. x86 PADDQ instruction</a></li>
 </ul>
 </li>
-<li><a href="#x86-fma">23.12.3. x86 fused multiply add (FMA)</a></li>
+<li><a href="#x86-fma">24.12.3. x86 fused multiply add (FMA)</a></li>
 </ul>
 </li>
-<li><a href="#x86-system-instructions">23.13. x86 system instructions</a>
+<li><a href="#x86-system-instructions">24.13. x86 system instructions</a>
 <ul class="sectlevel3">
-<li><a href="#x86-rdtsc-instruction">23.13.1. x86 RDTSC instruction</a>
+<li><a href="#x86-rdtsc-instruction">24.13.1. x86 RDTSC instruction</a>
 <ul class="sectlevel4">
-<li><a href="#x86-rdtscp-instruction">23.13.1.1. x86 RDTSCP instruction</a></li>
-<li><a href="#arm-pmccntr-register">23.13.1.2. ARM PMCCNTR register</a></li>
+<li><a href="#x86-rdtscp-instruction">24.13.1.1. x86 RDTSCP instruction</a></li>
+<li><a href="#arm-pmccntr-register">24.13.1.2. ARM PMCCNTR register</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#x86-thread-synchronization-primitives">23.14. x86 thread synchronization primitives</a>
+<li><a href="#x86-thread-synchronization-primitives">24.14. x86 thread synchronization primitives</a>
 <ul class="sectlevel3">
-<li><a href="#x86-lock-prefix">23.14.1. x86 LOCK prefix</a></li>
+<li><a href="#x86-lock-prefix">24.14.1. x86 LOCK prefix</a></li>
 </ul>
 </li>
-<li><a href="#x86-assembly-bibliography">23.15. x86 assembly bibliography</a>
+<li><a href="#x86-assembly-bibliography">24.15. x86 assembly bibliography</a>
 <ul class="sectlevel3">
-<li><a href="#x86-official-bibliography">23.15.1. x86 official bibliography</a>
+<li><a href="#x86-official-bibliography">24.15.1. x86 official bibliography</a>
 <ul class="sectlevel4">
-<li><a href="#intel-manual">23.15.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals</a>
+<li><a href="#intel-manual">24.15.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals</a>
 <ul class="sectlevel5">
-<li><a href="#intel-manual-1">23.15.1.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a></li>
-<li><a href="#intel-manual-2">23.15.1.1.2. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 2</a></li>
-<li><a href="#intel-manual-3">23.15.1.1.3. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 3</a></li>
-<li><a href="#intel-manual-4">23.15.1.1.4. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 4</a></li>
+<li><a href="#intel-manual-1">24.15.1.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a></li>
+<li><a href="#intel-manual-2">24.15.1.1.2. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 2</a></li>
+<li><a href="#intel-manual-3">24.15.1.1.3. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 3</a></li>
+<li><a href="#intel-manual-4">24.15.1.1.4. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 4</a></li>
 </ul>
 </li>
 </ul>
@@ -1749,532 +1750,532 @@ pre{ white-space:pre }
 </li>
 </ul>
 </li>
-<li><a href="#arm-userland-assembly">24. ARM userland assembly</a>
+<li><a href="#arm-userland-assembly">25. ARM userland assembly</a>
 <ul class="sectlevel2">
-<li><a href="#introduction-to-the-arm-architecture">24.1. Introduction to the ARM architecture</a>
+<li><a href="#introduction-to-the-arm-architecture">25.1. Introduction to the ARM architecture</a>
 <ul class="sectlevel3">
-<li><a href="#armv8-vs-armv7-vs-aarch64-vs-aarch32">24.1.1. ARMv8 vs ARMv7 vs AArch64 vs AArch32</a>
+<li><a href="#armv8-vs-armv7-vs-aarch64-vs-aarch32">25.1.1. ARMv8 vs ARMv7 vs AArch64 vs AArch32</a>
 <ul class="sectlevel4">
-<li><a href="#aarch32">24.1.1.1. AArch32</a></li>
-<li><a href="#aarch32-vs-aarch64">24.1.1.2. AArch32 vs AArch64</a></li>
+<li><a href="#aarch32">25.1.1.1. AArch32</a></li>
+<li><a href="#aarch32-vs-aarch64">25.1.1.2. AArch32 vs AArch64</a></li>
 </ul>
 </li>
-<li><a href="#free-arm-implementations">24.1.2. Free ARM implementations</a></li>
-<li><a href="#arm-instruction-encodings">24.1.3. ARM instruction encodings</a>
+<li><a href="#free-arm-implementations">25.1.2. Free ARM implementations</a></li>
+<li><a href="#arm-instruction-encodings">25.1.3. ARM instruction encodings</a>
 <ul class="sectlevel4">
-<li><a href="#arm-thumb-encoding">24.1.3.1. ARM Thumb encoding</a></li>
-<li><a href="#arm-big-endian-mode">24.1.3.2. ARM big endian mode</a></li>
+<li><a href="#arm-thumb-encoding">25.1.3.1. ARM Thumb encoding</a></li>
+<li><a href="#arm-big-endian-mode">25.1.3.2. ARM big endian mode</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-branch-instructions">24.2. ARM branch instructions</a>
+<li><a href="#arm-branch-instructions">25.2. ARM branch instructions</a>
 <ul class="sectlevel3">
-<li><a href="#arm-b-instruction">24.2.1. ARM B instruction</a></li>
-<li><a href="#arm-beq-instruction">24.2.2. ARM BEQ instruction</a></li>
-<li><a href="#arm-bl-instruction">24.2.3. ARM BL instruction</a>
+<li><a href="#arm-b-instruction">25.2.1. ARM B instruction</a></li>
+<li><a href="#arm-beq-instruction">25.2.2. ARM BEQ instruction</a></li>
+<li><a href="#arm-bl-instruction">25.2.3. ARM BL instruction</a>
 <ul class="sectlevel4">
-<li><a href="#arm-bx-instruction">24.2.3.1. ARM BX instruction</a></li>
-<li><a href="#armv8-aarch64-ret-instruction">24.2.3.2. ARMv8 aarch64 ret instruction</a></li>
+<li><a href="#arm-bx-instruction">25.2.3.1. ARM BX instruction</a></li>
+<li><a href="#armv8-aarch64-ret-instruction">25.2.3.2. ARMv8 aarch64 ret instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-cbz-instruction">24.2.4. ARM CBZ instruction</a></li>
-<li><a href="#arm-conditional-execution">24.2.5. ARM conditional execution</a></li>
+<li><a href="#arm-cbz-instruction">25.2.4. ARM CBZ instruction</a></li>
+<li><a href="#arm-conditional-execution">25.2.5. ARM conditional execution</a></li>
 </ul>
 </li>
-<li><a href="#arm-load-and-store-instructions">24.3. ARM load and store instructions</a>
+<li><a href="#arm-load-and-store-instructions">25.3. ARM load and store instructions</a>
 <ul class="sectlevel3">
-<li><a href="#arm-ldr-instruction">24.3.1. ARM LDR instruction</a>
+<li><a href="#arm-ldr-instruction">25.3.1. ARM LDR instruction</a>
 <ul class="sectlevel4">
-<li><a href="#arm-ldr-pseudo-instruction">24.3.1.1. ARM LDR pseudo-instruction</a></li>
-<li><a href="#arm-addressing-modes">24.3.1.2. ARM addressing modes</a>
+<li><a href="#arm-ldr-pseudo-instruction">25.3.1.1. ARM LDR pseudo-instruction</a></li>
+<li><a href="#arm-addressing-modes">25.3.1.2. ARM addressing modes</a>
 <ul class="sectlevel5">
-<li><a href="#arm-loop-over-array">24.3.1.2.1. ARM loop over array</a></li>
+<li><a href="#arm-loop-over-array">25.3.1.2.1. ARM loop over array</a></li>
 </ul>
 </li>
-<li><a href="#arm-ldrh-and-ldrb-instructions">24.3.1.3. ARM LDRH and LDRB instructions</a></li>
+<li><a href="#arm-ldrh-and-ldrb-instructions">25.3.1.3. ARM LDRH and LDRB instructions</a></li>
 </ul>
 </li>
-<li><a href="#arm-str-instruction">24.3.2. ARM STR instruction</a>
+<li><a href="#arm-str-instruction">25.3.2. ARM STR instruction</a>
 <ul class="sectlevel4">
-<li><a href="#armv8-aarch64-str-instruction">24.3.2.1. ARMv8 aarch64 STR instruction</a></li>
-<li><a href="#armv8-aarch64-ldp-and-stp-instructions">24.3.2.2. ARMv8 aarch64 LDP and STP instructions</a>
+<li><a href="#armv8-aarch64-str-instruction">25.3.2.1. ARMv8 aarch64 STR instruction</a></li>
+<li><a href="#armv8-aarch64-ldp-and-stp-instructions">25.3.2.2. ARMv8 aarch64 LDP and STP instructions</a>
 <ul class="sectlevel5">
-<li><a href="#armv8-aarch64-stack-alignment">24.3.2.2.1. ARMV8 aarch64 stack alignment</a></li>
+<li><a href="#armv8-aarch64-stack-alignment">25.3.2.2.1. ARMv8 aarch64 stack alignment</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-ldmia-instruction">24.3.3. ARM LDMIA instruction</a></li>
+<li><a href="#arm-ldmia-instruction">25.3.3. ARM LDMIA instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-data-processing-instructions">24.4. ARM data processing instructions</a>
+<li><a href="#arm-data-processing-instructions">25.4. ARM data processing instructions</a>
 <ul class="sectlevel3">
-<li><a href="#arm-cset-instruction">24.4.1. ARM CSET instruction</a></li>
-<li><a href="#arm-bitwise-instructions">24.4.2. ARM bitwise instructions</a>
+<li><a href="#arm-cset-instruction">25.4.1. ARM CSET instruction</a></li>
+<li><a href="#arm-bitwise-instructions">25.4.2. ARM bitwise instructions</a>
 <ul class="sectlevel4">
-<li><a href="#arm-bic-instruction">24.4.2.1. ARM BIC instruction</a></li>
-<li><a href="#arm-ubfm-instruction">24.4.2.2. ARM UBFM instruction</a>
+<li><a href="#arm-bic-instruction">25.4.2.1. ARM BIC instruction</a></li>
+<li><a href="#arm-ubfm-instruction">25.4.2.2. ARM UBFM instruction</a>
 <ul class="sectlevel5">
-<li><a href="#arm-ubfx-instruction">24.4.2.2.1. ARM UBFX instruction</a></li>
+<li><a href="#arm-ubfx-instruction">25.4.2.2.1. ARM UBFX instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-bfm-instruction">24.4.2.3. ARM BFM instruction</a>
+<li><a href="#arm-bfm-instruction">25.4.2.3. ARM BFM instruction</a>
 <ul class="sectlevel5">
-<li><a href="#arm-bfi-instruction">24.4.2.3.1. ARM BFI instruction</a></li>
+<li><a href="#arm-bfi-instruction">25.4.2.3.1. ARM BFI instruction</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-mov-instruction">24.4.3. ARM MOV instruction</a>
+<li><a href="#arm-mov-instruction">25.4.3. ARM MOV instruction</a>
 <ul class="sectlevel4">
-<li><a href="#arm-movw-and-movt-instructions">24.4.3.1. ARM movw and movt instructions</a></li>
-<li><a href="#armv8-aarch64-movk-instruction">24.4.3.2. ARMv8 aarch64 movk instruction</a></li>
-<li><a href="#armv8-aarch64-movn-instruction">24.4.3.3. ARMv8 aarch64 movn instruction</a></li>
+<li><a href="#arm-movw-and-movt-instructions">25.4.3.1. ARM movw and movt instructions</a></li>
+<li><a href="#armv8-aarch64-movk-instruction">25.4.3.2. ARMv8 aarch64 movk instruction</a></li>
+<li><a href="#armv8-aarch64-movn-instruction">25.4.3.3. ARMv8 aarch64 movn instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-data-processing-instruction-suffixes">24.4.4. ARM data processing instruction suffixes</a>
+<li><a href="#arm-data-processing-instruction-suffixes">25.4.4. ARM data processing instruction suffixes</a>
 <ul class="sectlevel4">
-<li><a href="#arm-shift-suffixes">24.4.4.1. ARM shift suffixes</a></li>
-<li><a href="#arm-s-suffix">24.4.4.2. ARM S suffix</a></li>
+<li><a href="#arm-shift-suffixes">25.4.4.1. ARM shift suffixes</a></li>
+<li><a href="#arm-s-suffix">25.4.4.2. ARM S suffix</a></li>
 </ul>
 </li>
-<li><a href="#arm-adr-instruction">24.4.5. ARM ADR instruction</a>
+<li><a href="#arm-adr-instruction">25.4.5. ARM ADR instruction</a>
 <ul class="sectlevel4">
-<li><a href="#arm-adrl-instruction">24.4.5.1. ARM ADRL instruction</a></li>
+<li><a href="#arm-adrl-instruction">25.4.5.1. ARM ADRL instruction</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-miscellaneous-instructions">24.5. ARM miscellaneous instructions</a>
+<li><a href="#arm-miscellaneous-instructions">25.5. ARM miscellaneous instructions</a>
 <ul class="sectlevel3">
-<li><a href="#arm-nop-instruction">24.5.1. ARM NOP instruction</a></li>
-<li><a href="#arm-udf-instruction">24.5.2. ARM UDF instruction</a></li>
-<li><a href="#arm-system-register-instructions">24.5.3. ARM system register instructions</a>
+<li><a href="#arm-nop-instruction">25.5.1. ARM NOP instruction</a></li>
+<li><a href="#arm-udf-instruction">25.5.2. ARM UDF instruction</a></li>
+<li><a href="#arm-system-register-instructions">25.5.3. ARM system register instructions</a>
 <ul class="sectlevel4">
-<li><a href="#arm-system-register-encodings">24.5.3.1. ARM system register encodings</a></li>
+<li><a href="#arm-system-register-encodings">25.5.3.1. ARM system register encodings</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-simd">24.6. ARM SIMD</a>
+<li><a href="#arm-simd">25.6. ARM SIMD</a>
 <ul class="sectlevel3">
-<li><a href="#arm-vfp">24.6.1. ARM VFP</a>
+<li><a href="#arm-vfp">25.6.1. ARM VFP</a>
 <ul class="sectlevel4">
-<li><a href="#arm-vfp-registers">24.6.1.1. ARM VFP registers</a></li>
-<li><a href="#arm-vadd-instruction">24.6.1.2. ARM VADD instruction</a></li>
-<li><a href="#arm-vcvt-instruction">24.6.1.3. ARM VCVT instruction</a>
+<li><a href="#arm-vfp-registers">25.6.1.1. ARM VFP registers</a></li>
+<li><a href="#arm-vadd-instruction">25.6.1.2. ARM VADD instruction</a></li>
+<li><a href="#arm-vcvt-instruction">25.6.1.3. ARM VCVT instruction</a>
 <ul class="sectlevel5">
-<li><a href="#arm-vcvtr-instruction">24.6.1.3.1. ARM VCVTR instruction</a></li>
-<li><a href="#armv8-aarch32-vcvta-instruction">24.6.1.3.2. ARMv8 AArch32 VCVTA instruction</a></li>
+<li><a href="#arm-vcvtr-instruction">25.6.1.3.1. ARM VCVTR instruction</a></li>
+<li><a href="#armv8-aarch32-vcvta-instruction">25.6.1.3.2. ARMv8 AArch32 VCVTA instruction</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#armv8-advanced-simd-and-floating-point-support">24.6.2. ARMv8 Advanced SIMD and floating-point support</a>
+<li><a href="#armv8-advanced-simd-and-floating-point-support">25.6.2. ARMv8 Advanced SIMD and floating-point support</a>
 <ul class="sectlevel4">
-<li><a href="#armv8-floating-point-availability">24.6.2.1. ARMv8 floating point availability</a></li>
-<li><a href="#arm-neon">24.6.2.2. ARM NEON</a></li>
+<li><a href="#armv8-floating-point-availability">25.6.2.1. ARMv8 floating point availability</a></li>
+<li><a href="#arm-neon">25.6.2.2. ARM NEON</a></li>
 </ul>
 </li>
-<li><a href="#armv8-aarch64-floating-point-registers">24.6.3. ARMv8 AArch64 floating point registers</a>
+<li><a href="#armv8-aarch64-floating-point-registers">25.6.3. ARMv8 AArch64 floating point registers</a>
 <ul class="sectlevel4">
-<li><a href="#armv8-aarch64-add-vector-instruction">24.6.3.1. ARMv8 aarch64 add vector instruction</a></li>
-<li><a href="#armv8-aarch64-fadd-instruction">24.6.3.2. ARMv8 aarch64 FADD instruction</a>
+<li><a href="#armv8-aarch64-add-vector-instruction">25.6.3.1. ARMv8 aarch64 add vector instruction</a></li>
+<li><a href="#armv8-aarch64-fadd-instruction">25.6.3.2. ARMv8 aarch64 FADD instruction</a>
 <ul class="sectlevel5">
-<li><a href="#arm-fadd-vs-vadd">24.6.3.2.1. ARM FADD vs VADD</a></li>
+<li><a href="#arm-fadd-vs-vadd">25.6.3.2.1. ARM FADD vs VADD</a></li>
 </ul>
 </li>
-<li><a href="#armv8-aarch64-ld2-instruction">24.6.3.3. ARMv8 aarch64 LD2 instruction</a></li>
+<li><a href="#armv8-aarch64-ld2-instruction">25.6.3.3. ARMv8 aarch64 LD2 instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-simd-bibliography">24.6.4. ARM SIMD bibliography</a></li>
-<li><a href="#arm-sve">24.6.5. ARM SVE</a>
+<li><a href="#arm-simd-bibliography">25.6.4. ARM SIMD bibliography</a></li>
+<li><a href="#arm-sve">25.6.5. ARM SVE</a>
 <ul class="sectlevel4">
-<li><a href="#arm-sve-vaddl-instruction">24.6.5.1. ARM SVE VADDL instruction</a></li>
-<li><a href="#change-arm-sve-vector-length-in-emulators">24.6.5.2. Change ARM SVE vector length in emulators</a></li>
-<li><a href="#sve-bibliography">24.6.5.3. SVE bibliography</a>
+<li><a href="#arm-sve-vaddl-instruction">25.6.5.1. ARM SVE VADDL instruction</a></li>
+<li><a href="#change-arm-sve-vector-length-in-emulators">25.6.5.2. Change ARM SVE vector length in emulators</a></li>
+<li><a href="#sve-bibliography">25.6.5.3. SVE bibliography</a>
 <ul class="sectlevel5">
-<li><a href="#sve-spec">24.6.5.3.1. SVE spec</a></li>
+<li><a href="#sve-spec">25.6.5.3.1. SVE spec</a></li>
 </ul>
 </li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#arm-thread-synchronization-primitives">24.7. ARM thread synchronization primitives</a>
+<li><a href="#arm-thread-synchronization-primitives">25.7. ARM thread synchronization primitives</a>
 <ul class="sectlevel3">
-<li><a href="#arm-ldxr-and-stxr-instructions">24.7.1. ARM LDXR and STXR instructions</a></li>
-<li><a href="#arm-lse">24.7.2. ARM Large System Extensions (LSE)</a></li>
+<li><a href="#arm-ldxr-and-stxr-instructions">25.7.1. ARM LDXR and STXR instructions</a></li>
+<li><a href="#arm-lse">25.7.2. ARM Large System Extensions (LSE)</a></li>
 </ul>
 </li>
-<li><a href="#armv8-architecture-extensions">24.8. ARMv8 architecture extensions</a>
+<li><a href="#armv8-architecture-extensions">25.8. ARMv8 architecture extensions</a>
 <ul class="sectlevel3">
-<li><a href="#armv8-1-architecture-extension">24.8.1. ARMv8.1 architecture extension</a></li>
+<li><a href="#armv8-1-architecture-extension">25.8.1. ARMv8.1 architecture extension</a></li>
 </ul>
 </li>
-<li><a href="#arm-assembly-bibliography">24.9. ARM assembly bibliography</a>
+<li><a href="#arm-assembly-bibliography">25.9. ARM assembly bibliography</a>
 <ul class="sectlevel3">
-<li><a href="#arm-non-official-bibliography">24.9.1. ARM non-official bibliography</a></li>
-<li><a href="#arm-official-bibliography">24.9.2. ARM official bibliography</a>
+<li><a href="#arm-non-official-bibliography">25.9.1. ARM non-official bibliography</a></li>
+<li><a href="#arm-official-bibliography">25.9.2. ARM official bibliography</a>
 <ul class="sectlevel4">
-<li><a href="#armarm7">24.9.2.1. ARMv7 architecture reference manual</a></li>
-<li><a href="#armarm8">24.9.2.2. ARMv8 architecture reference manual</a></li>
-<li><a href="#armarm8-db">24.9.2.3. ARMv8 architecture reference manual db</a></li>
-<li><a href="#armarm8-fa">24.9.2.4. ARMv8 architecture reference manual db</a></li>
-<li><a href="#armv8-programmers-guide">24.9.2.5. Programmer&#8217;s Guide for ARMv8-A</a></li>
-<li><a href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation">24.9.2.6. Arm A64 Instruction Set Architecture: Future Architecture Technologies in the A architecture profile Documentation</a></li>
-<li><a href="#arm-processor-documentation">24.9.2.7. ARM processor documentation</a>
+<li><a href="#armarm7">25.9.2.1. ARMv7 architecture reference manual</a></li>
+<li><a href="#armarm8">25.9.2.2. ARMv8 architecture reference manual</a></li>
+<li><a href="#armarm8-db">25.9.2.3. ARMv8 architecture reference manual db</a></li>
+<li><a href="#armarm8-fa">25.9.2.4. ARMv8 architecture reference manual db</a></li>
+<li><a href="#armv8-programmers-guide">25.9.2.5. Programmer&#8217;s Guide for ARMv8-A</a></li>
+<li><a href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation">25.9.2.6. Arm A64 Instruction Set Architecture: Future Architecture Technologies in the A architecture profile Documentation</a></li>
+<li><a href="#arm-processor-documentation">25.9.2.7. ARM processor documentation</a>
 <ul class="sectlevel5">
-<li><a href="#arm-cortex15-trm">24.9.2.7.1. ARM Cortex-A15 MPCore Processor Technical Reference Manual r4p0</a></li>
+<li><a href="#arm-cortex15-trm">25.9.2.7.1. ARM Cortex-A15 MPCore Processor Technical Reference Manual r4p0</a></li>
 </ul>
 </li>
-<li><a href="#arm-cortex-a77-trm">24.9.2.8. Arm Cortex‑A77 Technical Reference Manual r1p1</a></li>
-<li><a href="#arm-cortex-a77-sog">24.9.2.9. Arm Cortex‑A77 Software Optimization Guide r1p1</a></li>
+<li><a href="#arm-cortex-a77-trm">25.9.2.8. Arm Cortex‑A77 Technical Reference Manual r1p1</a></li>
+<li><a href="#arm-cortex-a77-sog">25.9.2.9. Arm Cortex‑A77 Software Optimization Guide r1p1</a></li>
 </ul>
 </li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#elf">25. ELF</a></li>
-<li><a href="#ieee-754">26. IEEE 754</a></li>
-<li><a href="#baremetal">27. Baremetal</a>
+<li><a href="#elf">26. ELF</a></li>
+<li><a href="#ieee-754">27. IEEE 754</a></li>
+<li><a href="#baremetal">28. Baremetal</a>
 <ul class="sectlevel2">
-<li><a href="#baremetal-gdb-step-debug">27.1. Baremetal GDB step debug</a></li>
-<li><a href="#baremetal-bootloaders">27.2. Baremetal bootloaders</a></li>
-<li><a href="#baremetal-linker-script">27.3. Baremetal linker script</a></li>
-<li><a href="#baremetal-command-line-arguments">27.4. Baremetal command line arguments</a>
+<li><a href="#baremetal-gdb-step-debug">28.1. Baremetal GDB step debug</a></li>
+<li><a href="#baremetal-bootloaders">28.2. Baremetal bootloaders</a></li>
+<li><a href="#baremetal-linker-script">28.3. Baremetal linker script</a></li>
+<li><a href="#baremetal-command-line-arguments">28.4. Baremetal command line arguments</a>
 <ul class="sectlevel3">
-<li><a href="#gem5-baremetal-arm-cli-args">27.4.1. gem5 baremetal arm CLI args</a></li>
+<li><a href="#gem5-baremetal-arm-cli-args">28.4.1. gem5 baremetal arm CLI args</a></li>
 </ul>
 </li>
-<li><a href="#semihosting">27.5. Semihosting</a>
+<li><a href="#semihosting">28.5. Semihosting</a>
 <ul class="sectlevel3">
-<li><a href="#gem5-semihosting">27.5.1. gem5 semihosting</a></li>
+<li><a href="#gem5-semihosting">28.5.1. gem5 semihosting</a></li>
 </ul>
 </li>
-<li><a href="#gem5-baremetal-carriage-return">27.6. gem5 baremetal carriage return</a></li>
-<li><a href="#baremetal-host-packaged-toolchain">27.7. Baremetal host packaged toolchain</a></li>
-<li><a href="#baremetal-cpp">27.8. Baremetal C++</a></li>
-<li><a href="#gdb-builtin-cpu-simulator">27.9. GDB builtin CPU simulator</a>
+<li><a href="#gem5-baremetal-carriage-return">28.6. gem5 baremetal carriage return</a></li>
+<li><a href="#baremetal-host-packaged-toolchain">28.7. Baremetal host packaged toolchain</a></li>
+<li><a href="#baremetal-cpp">28.8. Baremetal C++</a></li>
+<li><a href="#gdb-builtin-cpu-simulator">28.9. GDB builtin CPU simulator</a>
 <ul class="sectlevel3">
-<li><a href="#gdb-builtin-cpu-simulator-userland">27.9.1. GDB builtin CPU simulator userland</a></li>
+<li><a href="#gdb-builtin-cpu-simulator-userland">28.9.1. GDB builtin CPU simulator userland</a></li>
 </ul>
 </li>
-<li><a href="#arm-baremetal">27.10. ARM baremetal</a>
+<li><a href="#arm-baremetal">28.10. ARM baremetal</a>
 <ul class="sectlevel3">
-<li><a href="#arm-exception-levels">27.10.1. ARM exception levels</a>
+<li><a href="#arm-exception-levels">28.10.1. ARM exception levels</a>
 <ul class="sectlevel4">
-<li><a href="#arm-change-exception-level">27.10.1.1. ARM change exception level</a></li>
-<li><a href="#arm-sp0-vs-spx">27.10.1.2. ARM SP0 vs SPx</a></li>
+<li><a href="#arm-change-exception-level">28.10.1.1. ARM change exception level</a></li>
+<li><a href="#arm-sp0-vs-spx">28.10.1.2. ARM SP0 vs SPx</a></li>
 </ul>
 </li>
-<li><a href="#arm-svc-instruction">27.10.2. ARM SVC instruction</a>
+<li><a href="#arm-svc-instruction">28.10.2. ARM SVC instruction</a>
 <ul class="sectlevel4">
-<li><a href="#armv8-exception-vector-table-format">27.10.2.1. ARMv8 exception vector table format</a></li>
-<li><a href="#arm-esr-register">27.10.2.2. ARM ESR register</a></li>
-<li><a href="#arm-elr-register">27.10.2.3. ARM ELR register</a></li>
+<li><a href="#armv8-exception-vector-table-format">28.10.2.1. ARMv8 exception vector table format</a></li>
+<li><a href="#arm-esr-register">28.10.2.2. ARM ESR register</a></li>
+<li><a href="#arm-elr-register">28.10.2.3. ARM ELR register</a></li>
 </ul>
 </li>
-<li><a href="#arm-baremetal-multicore">27.10.3. ARM baremetal multicore</a>
+<li><a href="#arm-baremetal-multicore">28.10.3. ARM baremetal multicore</a>
 <ul class="sectlevel4">
-<li><a href="#arm-wfe-and-sev-instructions">27.10.3.1. ARM WFE and SEV instructions</a>
+<li><a href="#arm-wfe-and-sev-instructions">28.10.3.1. ARM WFE and SEV instructions</a>
 <ul class="sectlevel5">
-<li><a href="#arm-wfe-global-monitor-events">27.10.3.1.1. ARM WFE global monitor events</a></li>
-<li><a href="#wfe-from-userland">27.10.3.1.2. WFE from userland</a></li>
-<li><a href="#armv8-spinlock-pattern">27.10.3.1.3. ARMv8 spinlock pattern</a></li>
-<li><a href="#gem5-arm-wfe">27.10.3.1.4. gem5 ARM WFE</a></li>
-<li><a href="#arm-yield-instruction">27.10.3.1.5. ARM YIELD instruction</a></li>
+<li><a href="#arm-wfe-global-monitor-events">28.10.3.1.1. ARM WFE global monitor events</a></li>
+<li><a href="#wfe-from-userland">28.10.3.1.2. WFE from userland</a></li>
+<li><a href="#armv8-spinlock-pattern">28.10.3.1.3. ARMv8 spinlock pattern</a></li>
+<li><a href="#gem5-arm-wfe">28.10.3.1.4. gem5 ARM WFE</a></li>
+<li><a href="#arm-yield-instruction">28.10.3.1.5. ARM YIELD instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-ldaxr-and-stlxr-instructions">27.10.3.2. ARM LDAXR and STLXR instructions</a></li>
-<li><a href="#arm-psci">27.10.3.3. ARM PSCI</a></li>
-<li><a href="#arm-dmb-instruction">27.10.3.4. ARM DMB instruction</a></li>
+<li><a href="#arm-ldaxr-and-stlxr-instructions">28.10.3.2. ARM LDAXR and STLXR instructions</a></li>
+<li><a href="#arm-psci">28.10.3.3. ARM PSCI</a></li>
+<li><a href="#arm-dmb-instruction">28.10.3.4. ARM DMB instruction</a></li>
 </ul>
 </li>
-<li><a href="#arm-timer">27.10.4. ARM timer</a></li>
-<li><a href="#arm-gic">27.10.5. ARM GIC</a></li>
-<li><a href="#arm-paging">27.10.6. ARM paging</a></li>
-<li><a href="#arm-baremetal-bibliography">27.10.7. ARM baremetal bibliography</a>
+<li><a href="#arm-timer">28.10.4. ARM timer</a></li>
+<li><a href="#arm-gic">28.10.5. ARM GIC</a></li>
+<li><a href="#arm-paging">28.10.6. ARM paging</a></li>
+<li><a href="#arm-baremetal-bibliography">28.10.7. ARM baremetal bibliography</a>
 <ul class="sectlevel4">
-<li><a href="#nienfengyaoarmv8-bare-metal">27.10.7.1. NienfengYao/armv8-bare-metal</a></li>
-<li><a href="#tukl-msdgem5-bare-metal">27.10.7.2. tukl-msd/gem5.bare-metal</a></li>
+<li><a href="#nienfengyaoarmv8-bare-metal">28.10.7.1. NienfengYao/armv8-bare-metal</a></li>
+<li><a href="#tukl-msdgem5-bare-metal">28.10.7.2. tukl-msd/gem5.bare-metal</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#how-we-got-some-baremetal-stuff-to-work">27.11. How we got some baremetal stuff to work</a>
+<li><a href="#how-we-got-some-baremetal-stuff-to-work">28.11. How we got some baremetal stuff to work</a>
 <ul class="sectlevel3">
-<li><a href="#find-the-uart-address">27.11.1. Find the UART address</a></li>
-<li><a href="#aarch64-baremetal-neon-setup">27.11.2. aarch64 baremetal NEON setup</a></li>
+<li><a href="#find-the-uart-address">28.11.1. Find the UART address</a></li>
+<li><a href="#aarch64-baremetal-neon-setup">28.11.2. aarch64 baremetal NEON setup</a></li>
 </ul>
 </li>
-<li><a href="#baremetal-tests">27.12. Baremetal tests</a></li>
+<li><a href="#baremetal-tests">28.12. Baremetal tests</a></li>
 </ul>
 </li>
-<li><a href="#android">28. Android</a>
+<li><a href="#android">29. Android</a>
 <ul class="sectlevel2">
-<li><a href="#android-image-structure">28.1. Android image structure</a>
+<li><a href="#android-image-structure">29.1. Android image structure</a>
 <ul class="sectlevel3">
-<li><a href="#android-images-read-only">28.1.1. Android images read-only</a></li>
-<li><a href="#android-data-partition">28.1.2. Android /data partition</a></li>
+<li><a href="#android-images-read-only">29.1.1. Android images read-only</a></li>
+<li><a href="#android-data-partition">29.1.2. Android /data partition</a></li>
 </ul>
 </li>
-<li><a href="#install-android-apps">28.2. Install Android apps</a></li>
-<li><a href="#android-init">28.3. Android init</a></li>
+<li><a href="#install-android-apps">29.2. Install Android apps</a></li>
+<li><a href="#android-init">29.3. Android init</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-this-repo">29. Benchmark this repo</a>
+<li><a href="#benchmark-this-repo">30. Benchmark this repo</a>
 <ul class="sectlevel2">
-<li><a href="#continuous-integration">29.1. Continuous integration</a>
+<li><a href="#continuous-integration">30.1. Continuous integration</a>
 <ul class="sectlevel3">
-<li><a href="#travis">29.1.1. Travis</a></li>
-<li><a href="#circleci">29.1.2. CircleCI</a></li>
+<li><a href="#travis">30.1.1. Travis</a></li>
+<li><a href="#circleci">30.1.2. CircleCI</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-this-repo-benchmarks">29.2. Benchmark this repo benchmarks</a>
+<li><a href="#benchmark-this-repo-benchmarks">30.2. Benchmark this repo benchmarks</a>
 <ul class="sectlevel3">
-<li><a href="#benchmark-linux-kernel-boot">29.2.1. Benchmark Linux kernel boot</a>
+<li><a href="#benchmark-linux-kernel-boot">30.2.1. Benchmark Linux kernel boot</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64">29.2.1.1. gem5 arm HPI boot takes much longer than aarch64</a></li>
-<li><a href="#gem5-x86_64-derivo3cpu-boot-panics">29.2.1.2. gem5 x86_64 DerivO3CPU boot panics</a></li>
+<li><a href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64">30.2.1.1. gem5 arm HPI boot takes much longer than aarch64</a></li>
+<li><a href="#gem5-x86_64-derivo3cpu-boot-panics">30.2.1.2. gem5 x86_64 DerivO3CPU boot panics</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-emulators-on-userland-executables">29.2.2. Benchmark emulators on userland executables</a>
+<li><a href="#benchmark-emulators-on-userland-executables">30.2.2. Benchmark emulators on userland executables</a>
 <ul class="sectlevel4">
-<li><a href="#user-mode-vs-full-system-benchmark">29.2.2.1. User mode vs full system benchmark</a></li>
+<li><a href="#user-mode-vs-full-system-benchmark">30.2.2.1. User mode vs full system benchmark</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-builds">29.2.3. Benchmark builds</a>
+<li><a href="#benchmark-builds">30.2.3. Benchmark builds</a>
 <ul class="sectlevel4">
-<li><a href="#find-which-buildroot-packages-are-making-the-build-slow-and-big">29.2.3.1. Find which Buildroot packages are making the build slow and big</a>
+<li><a href="#find-which-buildroot-packages-are-making-the-build-slow-and-big">30.2.3.1. Find which Buildroot packages are making the build slow and big</a>
 <ul class="sectlevel5">
-<li><a href="#prebuilt-toolchain">29.2.3.1.1. Buildroot use prebuilt host toolchain</a></li>
+<li><a href="#prebuilt-toolchain">30.2.3.1.1. Buildroot use prebuilt host toolchain</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-buildroot-build-baseline">29.2.3.2. Benchmark Buildroot build baseline</a></li>
-<li><a href="#benchmark-gem5-build">29.2.3.3. Benchmark gem5 build</a>
+<li><a href="#benchmark-buildroot-build-baseline">30.2.3.2. Benchmark Buildroot build baseline</a></li>
+<li><a href="#benchmark-gem5-build">30.2.3.3. Benchmark gem5 build</a>
 <ul class="sectlevel5">
-<li><a href="#pybind11-accounts-for-50-of-gem5-build-time">29.2.3.3.1. pybind11 accounts for 50% of gem5 build time</a></li>
-<li><a href="#benchmark-gem5-single-file-change-rebuild-time">29.2.3.3.2. Benchmark gem5 single file change rebuild time</a></li>
+<li><a href="#pybind11-accounts-for-50-of-gem5-build-time">30.2.3.3.1. pybind11 accounts for 50% of gem5 build time</a></li>
+<li><a href="#benchmark-gem5-single-file-change-rebuild-time">30.2.3.3.2. Benchmark gem5 single file change rebuild time</a></li>
 </ul>
 </li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#benchmark-machines">29.3. Benchmark machines</a>
+<li><a href="#benchmark-machines">30.3. Benchmark machines</a>
 <ul class="sectlevel3">
-<li><a href="#p51">29.3.1. 2017 Lenovo ThinkPad P51</a>
+<li><a href="#p51">30.3.1. 2017 Lenovo ThinkPad P51</a>
 <ul class="sectlevel4">
-<li><a href="#p51-benchmarks">29.3.1.1. P51 benchmarks</a>
+<li><a href="#p51-benchmarks">30.3.1.1. P51 benchmarks</a>
 <ul class="sectlevel5">
-<li><a href="#p51-coremark-pro">29.3.1.1.1. P51 CoreMark-Pro</a></li>
+<li><a href="#p51-coremark-pro">30.3.1.1.1. P51 CoreMark-Pro</a></li>
 </ul>
 </li>
-<li><a href="#p51-maintenance-history">29.3.1.2. P51 maintenance history</a></li>
-<li><a href="#intel-core-i7-7820hq-cpu">29.3.1.3. Intel Core i7-7820HQ CPU</a></li>
-<li><a href="#samsung-m471a2k43bb1-crc-16gb-dram">29.3.1.4. Samsung M471A2K43BB1-CRC 16GB DRAM</a></li>
-<li><a href="#samsung-mzvlb512hajq-000l7-512gb-ssd">29.3.1.5. Samsung MZVLB512HAJQ-000L7 512GB SSD</a></li>
-<li><a href="#seagate-st1000lm035-1rk1-1tb-hard-disk">29.3.1.6. Seagate ST1000LM035-1RK1 1TB hard disk</a></li>
-<li><a href="#nvidia-quadro-m1200-4gb-gddr5-gpu">29.3.1.7. NVIDIA Quadro M1200 4GB GDDR5 GPU</a></li>
+<li><a href="#p51-maintenance-history">30.3.1.2. P51 maintenance history</a></li>
+<li><a href="#intel-core-i7-7820hq-cpu">30.3.1.3. Intel Core i7-7820HQ CPU</a></li>
+<li><a href="#samsung-m471a2k43bb1-crc-16gb-dram">30.3.1.4. Samsung M471A2K43BB1-CRC 16GB DRAM</a></li>
+<li><a href="#samsung-mzvlb512hajq-000l7-512gb-ssd">30.3.1.5. Samsung MZVLB512HAJQ-000L7 512GB SSD</a></li>
+<li><a href="#seagate-st1000lm035-1rk1-1tb-hard-disk">30.3.1.6. Seagate ST1000LM035-1RK1 1TB hard disk</a></li>
+<li><a href="#nvidia-quadro-m1200-4gb-gddr5-gpu">30.3.1.7. NVIDIA Quadro M1200 4GB GDDR5 GPU</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#benchmark-internets">29.4. Benchmark Internets</a>
+<li><a href="#benchmark-internets">30.4. Benchmark Internets</a>
 <ul class="sectlevel3">
-<li><a href="#38mbps-internet">29.4.1. 38Mbps internet</a></li>
+<li><a href="#38mbps-internet">30.4.1. 38Mbps internet</a></li>
 </ul>
 </li>
-<li><a href="#benchmark-this-repo-bibliography">29.5. Benchmark this repo bibliography</a></li>
+<li><a href="#benchmark-this-repo-bibliography">30.5. Benchmark this repo bibliography</a></li>
 </ul>
 </li>
-<li><a href="#rtos">30. RTOS</a>
+<li><a href="#rtos">31. RTOS</a>
 <ul class="sectlevel2">
-<li><a href="#zephyr">30.1. Zephyr</a></li>
-<li><a href="#arm-mbed">30.2. ARM Mbed</a></li>
+<li><a href="#zephyr">31.1. Zephyr</a></li>
+<li><a href="#arm-mbed">31.2. ARM Mbed</a></li>
 </ul>
 </li>
-<li><a href="#compilers">31. Compilers</a>
+<li><a href="#compilers">32. Compilers</a>
 <ul class="sectlevel2">
-<li><a href="#prevent-statement-reordering">31.1. Prevent statement reordering</a></li>
-<li><a href="#c-busy-loop">31.2. C busy loop</a></li>
+<li><a href="#prevent-statement-reordering">32.1. Prevent statement reordering</a></li>
+<li><a href="#c-busy-loop">32.2. C busy loop</a></li>
 </ul>
 </li>
-<li><a href="#computer-architecture">32. Computer architecture</a>
+<li><a href="#computer-architecture">33. Computer architecture</a>
 <ul class="sectlevel2">
-<li><a href="#instruction-pipelining">32.1. Instruction pipelining</a>
+<li><a href="#instruction-pipelining">33.1. Instruction pipelining</a>
 <ul class="sectlevel3">
-<li><a href="#classic-risc-pipeline">32.1.1. Classic RISC pipeline</a></li>
+<li><a href="#classic-risc-pipeline">33.1.1. Classic RISC pipeline</a></li>
 </ul>
 </li>
-<li><a href="#superscalar-processor">32.2. Superscalar processor</a>
+<li><a href="#superscalar-processor">33.2. Superscalar processor</a>
 <ul class="sectlevel3">
-<li><a href="#execution-unit">32.2.1. Execution unit</a></li>
+<li><a href="#execution-unit">33.2.1. Execution unit</a></li>
 </ul>
 </li>
-<li><a href="#out-of-order-execution">32.3. Out-of-order execution</a>
+<li><a href="#out-of-order-execution">33.3. Out-of-order execution</a>
 <ul class="sectlevel3">
-<li><a href="#speculative-execution">32.3.1. Speculative execution</a>
+<li><a href="#speculative-execution">33.3.1. Speculative execution</a>
 <ul class="sectlevel4">
-<li><a href="#branch-predictor">32.3.1.1. Branch predictor</a></li>
+<li><a href="#branch-predictor">33.3.1.1. Branch predictor</a></li>
 </ul>
 </li>
-<li><a href="#re-order-buffer">32.3.2. Re-order buffer</a></li>
-<li><a href="#register-renaming">32.3.3. Register renaming</a></li>
+<li><a href="#re-order-buffer">33.3.2. Re-order buffer</a></li>
+<li><a href="#register-renaming">33.3.3. Register renaming</a></li>
 </ul>
 </li>
-<li><a href="#instruction-level-parallelism">32.4. Instruction level parallelism</a></li>
-<li><a href="#hardware-threads">32.5. Hardware threads</a></li>
-<li><a href="#cache-coherence">32.6. Cache coherence</a>
+<li><a href="#instruction-level-parallelism">33.4. Instruction level parallelism</a></li>
+<li><a href="#hardware-threads">33.5. Hardware threads</a></li>
+<li><a href="#cache-coherence">33.6. Cache coherence</a>
 <ul class="sectlevel3">
-<li><a href="#memory-consistency">32.6.1. Memory consistency</a>
+<li><a href="#memory-consistency">33.6.1. Memory consistency</a>
 <ul class="sectlevel4">
-<li><a href="#sequential-consistency">32.6.1.1. Sequential Consistency</a></li>
+<li><a href="#sequential-consistency">33.6.1.1. Sequential Consistency</a></li>
 </ul>
 </li>
-<li><a href="#can-caches-snoop-data-from-other-caches">32.6.2. Can caches snoop data from other caches?</a></li>
-<li><a href="#vi-cache-coherence-protocol">32.6.3. VI cache coherence protocol</a></li>
-<li><a href="#msi-cache-coherence-protocol">32.6.4. MSI cache coherence protocol</a>
+<li><a href="#can-caches-snoop-data-from-other-caches">33.6.2. Can caches snoop data from other caches?</a></li>
+<li><a href="#vi-cache-coherence-protocol">33.6.3. VI cache coherence protocol</a></li>
+<li><a href="#msi-cache-coherence-protocol">33.6.4. MSI cache coherence protocol</a>
 <ul class="sectlevel4">
-<li><a href="#msi-cache-coherence-protocol-with-transient-states">32.6.4.1. MSI cache coherence protocol with transient states</a></li>
+<li><a href="#msi-cache-coherence-protocol-with-transient-states">33.6.4.1. MSI cache coherence protocol with transient states</a></li>
 </ul>
 </li>
-<li><a href="#mesi-cache-coherence-protocol">32.6.5. MESI cache coherence protocol</a></li>
-<li><a href="#mosi-cache-coherence-protocol">32.6.6. MOSI cache coherence protocol</a></li>
-<li><a href="#moesi">32.6.7. MOESI cache coherence protocol</a></li>
+<li><a href="#mesi-cache-coherence-protocol">33.6.5. MESI cache coherence protocol</a></li>
+<li><a href="#mosi-cache-coherence-protocol">33.6.6. MOSI cache coherence protocol</a></li>
+<li><a href="#moesi">33.6.7. MOESI cache coherence protocol</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#about-this-repo">33. About this repo</a>
+<li><a href="#about-this-repo">34. About this repo</a>
 <ul class="sectlevel2">
-<li><a href="#supported-hosts">33.1. Supported hosts</a></li>
-<li><a href="#common-build-issues">33.2. Common build issues</a>
+<li><a href="#supported-hosts">34.1. Supported hosts</a></li>
+<li><a href="#common-build-issues">34.2. Common build issues</a>
 <ul class="sectlevel3">
-<li><a href="#put-source-uris-in-sources">33.2.1. You must put some 'source' URIs in your sources.list</a></li>
-<li><a href="#build-from-downloaded-source-zip-files">33.2.2. Build from downloaded source zip files</a></li>
+<li><a href="#put-source-uris-in-sources">34.2.1. You must put some 'source' URIs in your sources.list</a></li>
+<li><a href="#build-from-downloaded-source-zip-files">34.2.2. Build from downloaded source zip files</a></li>
 </ul>
 </li>
-<li><a href="#run-command-after-boot">33.3. Run command after boot</a></li>
-<li><a href="#default-command-line-arguments">33.4. Default command line arguments</a></li>
-<li><a href="#documentation">33.5. Documentation</a>
+<li><a href="#run-command-after-boot">34.3. Run command after boot</a></li>
+<li><a href="#default-command-line-arguments">34.4. Default command line arguments</a></li>
+<li><a href="#documentation">34.5. Documentation</a>
 <ul class="sectlevel3">
-<li><a href="#documentation-verification">33.5.1. Documentation verification</a>
+<li><a href="#documentation-verification">34.5.1. Documentation verification</a>
 <ul class="sectlevel4">
-<li><a href="#asciidoctor-extract-link-targets">33.5.1.1. asciidoctor/extract-link-targets</a></li>
-<li><a href="#asciidoctor-extract-header-ids">33.5.1.2. asciidoctor/extract-header-ids</a></li>
+<li><a href="#asciidoctor-extract-link-targets">34.5.1.1. asciidoctor/extract-link-targets</a></li>
+<li><a href="#asciidoctor-extract-header-ids">34.5.1.2. asciidoctor/extract-header-ids</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#asciidoctor-link-target-up-rb">33.6. asciidoctor/link-target-up.rb</a>
+<li><a href="#asciidoctor-link-target-up-rb">34.6. asciidoctor/link-target-up.rb</a>
 <ul class="sectlevel3">
-<li><a href="#github-pages">33.6.1. GitHub pages</a></li>
+<li><a href="#github-pages">34.6.1. GitHub pages</a></li>
 </ul>
 </li>
-<li><a href="#clean-the-build">33.7. Clean the build</a></li>
-<li><a href="#custom-build-directory">33.8. Custom build directory</a></li>
-<li><a href="#ccache">33.9. ccache</a></li>
-<li><a href="#getvar">33.10. getvar</a>
+<li><a href="#clean-the-build">34.7. Clean the build</a></li>
+<li><a href="#custom-build-directory">34.8. Custom build directory</a></li>
+<li><a href="#ccache">34.9. ccache</a></li>
+<li><a href="#getvar">34.10. getvar</a>
 <ul class="sectlevel3">
-<li><a href="#run-toolchain">33.10.1. run-toolchain</a>
+<li><a href="#run-toolchain">34.10.1. run-toolchain</a>
 <ul class="sectlevel4">
-<li><a href="#disas">33.10.1.1. disas</a></li>
+<li><a href="#disas">34.10.1.1. disas</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#rebuild-buildroot-while-running">33.11. Rebuild Buildroot while running</a></li>
-<li><a href="#simultaneous-runs">33.12. Simultaneous runs</a></li>
-<li><a href="#build-variants">33.13. Build variants</a>
+<li><a href="#rebuild-buildroot-while-running">34.11. Rebuild Buildroot while running</a></li>
+<li><a href="#simultaneous-runs">34.12. Simultaneous runs</a></li>
+<li><a href="#build-variants">34.13. Build variants</a>
 <ul class="sectlevel3">
-<li><a href="#linux-kernel-build-variants">33.13.1. Linux kernel build variants</a></li>
-<li><a href="#qemu-build-variants">33.13.2. QEMU build variants</a></li>
-<li><a href="#gem5-build-variants">33.13.3. gem5 build variants</a>
+<li><a href="#linux-kernel-build-variants">34.13.1. Linux kernel build variants</a></li>
+<li><a href="#qemu-build-variants">34.13.2. QEMU build variants</a></li>
+<li><a href="#gem5-build-variants">34.13.3. gem5 build variants</a>
 <ul class="sectlevel4">
-<li><a href="#gem5-worktree">33.13.3.1. gem5 worktree</a></li>
-<li><a href="#gem5-private-source-trees">33.13.3.2. gem5 private source trees</a></li>
+<li><a href="#gem5-worktree">34.13.3.1. gem5 worktree</a></li>
+<li><a href="#gem5-private-source-trees">34.13.3.2. gem5 private source trees</a></li>
 </ul>
 </li>
-<li><a href="#buildroot-build-variants">33.13.4. Buildroot build variants</a></li>
+<li><a href="#buildroot-build-variants">34.13.4. Buildroot build variants</a></li>
 </ul>
 </li>
-<li><a href="#optimization-level-of-a-build">33.14. Optimization level of a build</a></li>
-<li><a href="#directory-structure">33.15. Directory structure</a>
+<li><a href="#optimization-level-of-a-build">34.14. Optimization level of a build</a></li>
+<li><a href="#directory-structure">34.15. Directory structure</a>
 <ul class="sectlevel3">
-<li><a href="#lkmc-directory">33.15.1. lkmc directory</a>
+<li><a href="#lkmc-directory">34.15.1. lkmc directory</a>
 <ul class="sectlevel4">
-<li><a href="#userland-objects-vs-header-only">33.15.1.1. Userland objects vs header-only</a></li>
+<li><a href="#userland-objects-vs-header-only">34.15.1.1. Userland objects vs header-only</a></li>
 </ul>
 </li>
-<li><a href="#buildroot_packages-directory">33.15.2. buildroot_packages directory</a>
+<li><a href="#buildroot_packages-directory">34.15.2. buildroot_packages directory</a>
 <ul class="sectlevel4">
-<li><a href="#kernel-modules-buildroot-package">33.15.2.1. kernel_modules buildroot package</a></li>
+<li><a href="#kernel-modules-buildroot-package">34.15.2.1. kernel_modules buildroot package</a></li>
 </ul>
 </li>
-<li><a href="#patches-directory">33.15.3. patches directory</a>
+<li><a href="#patches-directory">34.15.3. patches directory</a>
 <ul class="sectlevel4">
-<li><a href="#patches-global-directory">33.15.3.1. patches/global directory</a></li>
-<li><a href="#patches-manual-directory">33.15.3.2. patches/manual directory</a></li>
+<li><a href="#patches-global-directory">34.15.3.1. patches/global directory</a></li>
+<li><a href="#patches-manual-directory">34.15.3.2. patches/manual directory</a></li>
 </ul>
 </li>
-<li><a href="#rootfs_overlay">33.15.4. rootfs_overlay</a>
+<li><a href="#rootfs_overlay">34.15.4. rootfs_overlay</a>
 <ul class="sectlevel4">
-<li><a href="#out_rootfs_overlay_dir">33.15.4.1. out_rootfs_overlay_dir</a></li>
+<li><a href="#out_rootfs_overlay_dir">34.15.4.1. out_rootfs_overlay_dir</a></li>
 </ul>
 </li>
-<li><a href="#lkmc-c">33.15.5. lkmc.c</a></li>
-<li><a href="#lkmc_home">33.15.6. lkmc_home</a></li>
-<li><a href="#path-properties">33.15.7. path_properties.py</a></li>
-<li><a href="#rand_check-out">33.15.8. rand_check.out</a></li>
+<li><a href="#lkmc-c">34.15.5. lkmc.c</a></li>
+<li><a href="#lkmc_home">34.15.6. lkmc_home</a></li>
+<li><a href="#path-properties">34.15.7. path_properties.py</a></li>
+<li><a href="#rand_check-out">34.15.8. rand_check.out</a></li>
 </ul>
 </li>
-<li><a href="#test-this-repo">33.16. Test this repo</a>
+<li><a href="#test-this-repo">34.16. Test this repo</a>
 <ul class="sectlevel3">
-<li><a href="#automated-tests">33.16.1. Automated tests</a>
+<li><a href="#automated-tests">34.16.1. Automated tests</a>
 <ul class="sectlevel4">
-<li><a href="#test-arch-and-emulator-selection">33.16.1.1. Test arch and emulator selection</a></li>
-<li><a href="#quit-on-fail">33.16.1.2. Quit on fail</a></li>
-<li><a href="#test-userland-in-full-system">33.16.1.3. Test userland in full system</a></li>
-<li><a href="#gdb-tests">33.16.1.4. GDB tests</a></li>
-<li><a href="#magic-failure-string">33.16.1.5. Magic failure string</a></li>
+<li><a href="#test-arch-and-emulator-selection">34.16.1.1. Test arch and emulator selection</a></li>
+<li><a href="#quit-on-fail">34.16.1.2. Quit on fail</a></li>
+<li><a href="#test-userland-in-full-system">34.16.1.3. Test userland in full system</a></li>
+<li><a href="#gdb-tests">34.16.1.4. GDB tests</a></li>
+<li><a href="#magic-failure-string">34.16.1.5. Magic failure string</a></li>
 </ul>
 </li>
-<li><a href="#non-automated-tests">33.16.2. Non-automated tests</a>
+<li><a href="#non-automated-tests">34.16.2. Non-automated tests</a>
 <ul class="sectlevel4">
-<li><a href="#test-gdb-linux-kernel">33.16.2.1. Test GDB Linux kernel</a></li>
-<li><a href="#test-the-internet">33.16.2.2. Test the Internet</a></li>
-<li><a href="#cli-script-tests">33.16.2.3. CLI script tests</a></li>
+<li><a href="#test-gdb-linux-kernel">34.16.2.1. Test GDB Linux kernel</a></li>
+<li><a href="#test-the-internet">34.16.2.2. Test the Internet</a></li>
+<li><a href="#cli-script-tests">34.16.2.3. CLI script tests</a></li>
 </ul>
 </li>
 </ul>
 </li>
-<li><a href="#bisection">33.17. Bisection</a></li>
-<li><a href="#update-a-forked-submodule">33.18. Update a forked submodule</a></li>
-<li><a href="#release">33.19. Release</a>
+<li><a href="#bisection">34.17. Bisection</a></li>
+<li><a href="#update-a-forked-submodule">34.18. Update a forked submodule</a></li>
+<li><a href="#release">34.19. Release</a>
 <ul class="sectlevel3">
-<li><a href="#release-procedure">33.19.1. Release procedure</a></li>
-<li><a href="#release-zip">33.19.2. release-zip</a></li>
-<li><a href="#release-upload">33.19.3. release-upload</a></li>
+<li><a href="#release-procedure">34.19.1. Release procedure</a></li>
+<li><a href="#release-zip">34.19.2. release-zip</a></li>
+<li><a href="#release-upload">34.19.3. release-upload</a></li>
 </ul>
 </li>
-<li><a href="#design-rationale">33.20. Design rationale</a>
+<li><a href="#design-rationale">34.20. Design rationale</a>
 <ul class="sectlevel3">
-<li><a href="#design-goals">33.20.1. Design goals</a></li>
-<li><a href="#setup-trade-offs">33.20.2. Setup trade-offs</a></li>
-<li><a href="#resource-tradeoff-guidelines">33.20.3. Resource tradeoff guidelines</a></li>
-<li><a href="#linux-distro-choice">33.20.4. Linux distro choice</a></li>
+<li><a href="#design-goals">34.20.1. Design goals</a></li>
+<li><a href="#setup-trade-offs">34.20.2. Setup trade-offs</a></li>
+<li><a href="#resource-tradeoff-guidelines">34.20.3. Resource tradeoff guidelines</a></li>
+<li><a href="#linux-distro-choice">34.20.4. Linux distro choice</a></li>
 </ul>
 </li>
-<li><a href="#soft-topics">33.21. Soft topics</a>
+<li><a href="#soft-topics">34.21. Soft topics</a>
 <ul class="sectlevel3">
-<li><a href="#fairy-tale">33.21.1. Fairy tale</a></li>
+<li><a href="#fairy-tale">34.21.1. Fairy tale</a></li>
 </ul>
 </li>
-<li><a href="#bibliography">33.22. Bibliography</a></li>
+<li><a href="#bibliography">34.22. Bibliography</a></li>
 </ul>
 </li>
 </ul>
@@ -2291,7 +2292,7 @@ pre{ white-space:pre }
 <p>If you don&#8217;t know which one to go for, start with <a href="#qemu-buildroot-setup-getting-started">QEMU Buildroot setup getting started</a>.</p>
 </div>
 <div class="paragraph">
-<p>Design goals of this project are documented at: <a href="#design-goals">Section 33.20.1, &#8220;Design goals&#8221;</a>.</p>
+<p>Design goals of this project are documented at: <a href="#design-goals">Section 34.20.1, &#8220;Design goals&#8221;</a>.</p>
 </div>
 <div class="sect2">
 <h3 id="should-you-waste-your-life-with-systems-programming"><a class="anchor" href="#should-you-waste-your-life-with-systems-programming"></a><a class="link" href="#should-you-waste-your-life-with-systems-programming">1.1. Should you waste your life with systems programming?</a></h3>
@@ -2388,7 +2389,7 @@ pre{ white-space:pre }
 <div class="sect3">
 <h4 id="qemu-buildroot-setup-getting-started"><a class="anchor" href="#qemu-buildroot-setup-getting-started"></a><a class="link" href="#qemu-buildroot-setup-getting-started">1.2.1. QEMU Buildroot setup getting started</a></h4>
 <div class="paragraph">
-<p>This setup has been mostly tested on Ubuntu. For other host operating systems see: <a href="#supported-hosts">Section 33.1, &#8220;Supported hosts&#8221;</a>. For greater stability, consider using the <a href="#release-procedure">latest release</a> instead of master: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/releases" class="bare">https://github.com/cirosantilli/linux-kernel-module-cheat/releases</a></p>
+<p>This setup has been mostly tested on Ubuntu. For other host operating systems see: <a href="#supported-hosts">Section 34.1, &#8220;Supported hosts&#8221;</a>. For greater stability, consider using the <a href="#release-procedure">latest release</a> instead of master: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/releases" class="bare">https://github.com/cirosantilli/linux-kernel-module-cheat/releases</a></p>
 </div>
 <div class="paragraph">
 <p>Reserve 12Gb of disk and run:</p>
@@ -2405,7 +2406,7 @@ cd linux-kernel-module-cheat
 <p>You don&#8217;t need to clone recursively even though we have <code>.git</code> submodules: <code>download-dependencies</code> fetches just the submodules that you need for this build to save time.</p>
 </div>
 <div class="paragraph">
-<p>If something goes wrong, see: <a href="#common-build-issues">Section 33.2, &#8220;Common build issues&#8221;</a> and use our issue tracker: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/issues" class="bare">https://github.com/cirosantilli/linux-kernel-module-cheat/issues</a></p>
+<p>If something goes wrong, see: <a href="#common-build-issues">Section 34.2, &#8220;Common build issues&#8221;</a> and use our issue tracker: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/issues" class="bare">https://github.com/cirosantilli/linux-kernel-module-cheat/issues</a></p>
 </div>
 <div class="paragraph">
 <p>The initial build will take a while (30 minutes to 2 hours) to clone and build, see <a href="#benchmark-builds">Benchmark builds</a> for more details.</p>
@@ -2488,7 +2489,7 @@ hello2 cleanup</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>To avoid typing <code>--arch aarch64</code> many times, you can set the default arch as explained at: <a href="#default-command-line-arguments">Section 33.4, &#8220;Default command line arguments&#8221;</a></p>
+<p>To avoid typing <code>--arch aarch64</code> many times, you can set the default arch as explained at: <a href="#default-command-line-arguments">Section 34.4, &#8220;Default command line arguments&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>I now urge you to read the following sections which contain widely applicable information:</p>
@@ -3329,7 +3330,7 @@ j = 0</pre>
 <p>This repository has been tested inside clean <a href="https://en.wikipedia.org/wiki/Docker_(software)">Docker</a> containers.</p>
 </div>
 <div class="paragraph">
-<p>This is a good option if you are on a Linux host, but the native setup failed due to your weird host distribution, and you have better things to do with your life than to debug it. See also: <a href="#supported-hosts">Section 33.1, &#8220;Supported hosts&#8221;</a>.</p>
+<p>This is a good option if you are on a Linux host, but the native setup failed due to your weird host distribution, and you have better things to do with your life than to debug it. See also: <a href="#supported-hosts">Section 34.1, &#8220;Supported hosts&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>For example, to do a <a href="#qemu-buildroot-setup">QEMU Buildroot setup</a> inside Docker, run:</p>
@@ -3517,7 +3518,7 @@ j = 0</pre>
 <div class="ulist">
 <ul>
 <li>
-<p>can&#8217;t <a href="#gdb">GDB step debug the kernel</a>, since the source and cross toolchain with GDB are not available. Buildroot cannot easily use a host toolchain: <a href="#prebuilt-toolchain">Section 29.2.3.1.1, &#8220;Buildroot use prebuilt host toolchain&#8221;</a>.</p>
+<p>can&#8217;t <a href="#gdb">GDB step debug the kernel</a>, since the source and cross toolchain with GDB are not available. Buildroot cannot easily use a host toolchain: <a href="#prebuilt-toolchain">Section 30.2.3.1.1, &#8220;Buildroot use prebuilt host toolchain&#8221;</a>.</p>
 <div class="paragraph">
 <p>Maybe we could work around this by just downloading the kernel source somehow, and using a host prebuilt GDB, but we felt that it would be too messy and unreliable.</p>
 </div>
@@ -4291,7 +4292,7 @@ error: simulation error detected by parsing logs</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>TODO: the carriage returns are a bit different than in QEMU, see: <a href="#gem5-baremetal-carriage-return">Section 27.6, &#8220;gem5 baremetal carriage return&#8221;</a>.</p>
+<p>TODO: the carriage returns are a bit different than in QEMU, see: <a href="#gem5-baremetal-carriage-return">Section 28.6, &#8220;gem5 baremetal carriage return&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Note that <code>./build-baremetal</code> requires the <code>--emulator gem5</code> option, and generates separate executable images for both, as can be seen from:</p>
@@ -4339,10 +4340,10 @@ echo "$(./getvar --arch aarch64 --baremetal userland/c/hello.c --emulator gem5 -
 <p>But just stick to newer and better <code>VExpress_GEM5_V1</code> unless you have a good reason to use <code>RealViewPBX</code>.</p>
 </div>
 <div class="paragraph">
-<p>When doing baremetal programming, it is likely that you will want to learn userland assembly first, see: <a href="#userland-assembly">Section 22, &#8220;Userland assembly&#8221;</a>.</p>
+<p>When doing baremetal programming, it is likely that you will want to learn userland assembly first, see: <a href="#userland-assembly">Section 23, &#8220;Userland assembly&#8221;</a>.</p>
 </div>
 <div class="paragraph">
-<p>For more information on baremetal, see the section: <a href="#baremetal">Section 27, &#8220;Baremetal&#8221;</a>.</p>
+<p>For more information on baremetal, see the section: <a href="#baremetal">Section 28, &#8220;Baremetal&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>The following subjects are particularly important:</p>
@@ -4407,7 +4408,7 @@ xdg-open README.html</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>More information about our documentation internals can be found at: <a href="#documentation">Section 33.5, &#8220;Documentation&#8221;</a></p>
+<p>More information about our documentation internals can be found at: <a href="#documentation">Section 34.5, &#8220;Documentation&#8221;</a></p>
 </div>
 </div>
 </div>
@@ -5643,7 +5644,7 @@ Breakpoint 3 at 0xffffffff811615e3: fdget_pos. (9 locations)
 <div class="sect2">
 <h3 id="gdb-step-debug-multicore-userland"><a class="anchor" href="#gdb-step-debug-multicore-userland"></a><a class="link" href="#gdb-step-debug-multicore-userland">2.9. GDB step debug multicore userland</a></h3>
 <div class="paragraph">
-<p>For a more minimal baremetal multicore setup, see: <a href="#arm-baremetal-multicore">Section 27.10.3, &#8220;ARM baremetal multicore&#8221;</a>.</p>
+<p>For a more minimal baremetal multicore setup, see: <a href="#arm-baremetal-multicore">Section 28.10.3, &#8220;ARM baremetal multicore&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>We can set and get which cores the Linux kernel allows a program to run on with <code>sched_getaffinity</code> and <code>sched_setaffinity</code>:</p>
@@ -8023,7 +8024,7 @@ qw er</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>To stop at the very first instruction of a freestanding program, just use <code>--no-continue</code>. A good example of this is shown at: <a href="#freestanding-programs">Section 22.5.1, &#8220;Freestanding programs&#8221;</a>.</p>
+<p>To stop at the very first instruction of a freestanding program, just use <code>--no-continue</code>. A good example of this is shown at: <a href="#freestanding-programs">Section 23.5.1, &#8220;Freestanding programs&#8221;</a>.</p>
 </div>
 </div>
 </div>
@@ -8076,7 +8077,7 @@ qw er</pre>
 <p>The gem5 tests require building statically with build id <code>static</code>, see also: <a href="#gem5-syscall-emulation-mode">Section 10.7, &#8220;gem5 syscall emulation mode&#8221;</a>. TODO automate this better.</p>
 </div>
 <div class="paragraph">
-<p>See: <a href="#test-this-repo">Section 33.16, &#8220;Test this repo&#8221;</a> for more useful testing tips.</p>
+<p>See: <a href="#test-this-repo">Section 34.16, &#8220;Test this repo&#8221;</a> for more useful testing tips.</p>
 </div>
 </div>
 <div class="sect2">
@@ -8491,7 +8492,7 @@ qemu: uncaught target signal 6 (Aborted) - core dumped</pre>
 <p>Support for dynamic linking was added in November 2019: <a href="https://stackoverflow.com/questions/50542222/how-to-run-a-dynamically-linked-executable-syscall-emulation-mode-se-py-in-gem5/50696098#50696098" class="bare">https://stackoverflow.com/questions/50542222/how-to-run-a-dynamically-linked-executable-syscall-emulation-mode-se-py-in-gem5/50696098#50696098</a></p>
 </div>
 <div class="paragraph">
-<p>Note that as shown at <a href="#benchmark-emulators-on-userland-executables">Section 29.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a>, the dynamic version runs 200x more instructions, which might have an impact on smaller simulations in detailed CPUs.</p>
+<p>Note that as shown at <a href="#benchmark-emulators-on-userland-executables">Section 30.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a>, the dynamic version runs 200x more instructions, which might have an impact on smaller simulations in detailed CPUs.</p>
 </div>
 </div>
 <div class="sect3">
@@ -8928,7 +8929,7 @@ Program aborted at tick 0</pre>
 <div class="ulist">
 <ul>
 <li>
-<p>modules built with Buildroot, see: <a href="#kernel-modules-buildroot-package">Section 33.15.2.1, &#8220;kernel_modules buildroot package&#8221;</a></p>
+<p>modules built with Buildroot, see: <a href="#kernel-modules-buildroot-package">Section 34.15.2.1, &#8220;kernel_modules buildroot package&#8221;</a></p>
 </li>
 <li>
 <p>modules built from the kernel tree itself, see: <a href="#dummy-irq">Section 15.12.2, &#8220;dummy-irq&#8221;</a></p>
@@ -9052,7 +9053,7 @@ Program aborted at tick 0</pre>
 <p>no need to regenerate the root filesystem at all and reboot</p>
 </li>
 <li>
-<p>overcomes the <code>check_bin_arch</code> problem as shown at: <a href="#rpath">Section 20.8, &#8220;Buildroot rebuild is slow when the root filesystem is large&#8221;</a></p>
+<p>overcomes the <code>check_bin_arch</code> problem as shown at: <a href="#rpath">Section 21.8, &#8220;Buildroot rebuild is slow when the root filesystem is large&#8221;</a></p>
 </li>
 </ul>
 </div>
@@ -9835,7 +9836,7 @@ xeyes</pre>
 <div class="sect2">
 <h3 id="enable-networking"><a class="anchor" href="#enable-networking"></a><a class="link" href="#enable-networking">14.1. Enable networking</a></h3>
 <div class="paragraph">
-<p>We disable networking by default because it starts an userland process, and we want to keep the number of userland processes to a minimum to make the system more understandable as explained at: <a href="#resource-tradeoff-guidelines">Section 33.20.3, &#8220;Resource tradeoff guidelines&#8221;</a></p>
+<p>We disable networking by default because it starts a userland process, and we want to keep the number of userland processes to a minimum to make the system more understandable as explained at: <a href="#resource-tradeoff-guidelines">Section 34.20.3, &#8220;Resource tradeoff guidelines&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>To enable networking on Buildroot, simply run:</p>
@@ -10684,15 +10685,15 @@ git log | grep -E '    Linux [0-9]+\.' | head</pre>
 <p>This also makes this repo the perfect setup to develop the Linux kernel.</p>
 </div>
 <div class="paragraph">
-<p>In case something breaks while updating the Linux kernel, you can try to bisect it to understand the root cause, see: <a href="#bisection">Section 33.17, &#8220;Bisection&#8221;</a>.</p>
+<p>In case something breaks while updating the Linux kernel, you can try to bisect it to understand the root cause, see: <a href="#bisection">Section 34.17, &#8220;Bisection&#8221;</a>.</p>
 </div>
 <div class="sect4">
 <h5 id="update-the-linux-kernel-lkmc-procedure"><a class="anchor" href="#update-the-linux-kernel-lkmc-procedure"></a><a class="link" href="#update-the-linux-kernel-lkmc-procedure">15.2.2.1. Update the Linux kernel LKMC procedure</a></h5>
 <div class="paragraph">
-<p>First, use use the branching procedure described at: <a href="#update-a-forked-submodule">Section 33.18, &#8220;Update a forked submodule&#8221;</a></p>
+<p>First, use the branching procedure described at: <a href="#update-a-forked-submodule">Section 34.18, &#8220;Update a forked submodule&#8221;</a></p>
 </div>
 <div class="paragraph">
-<p>Because the kernel is so central to this repository, almost all tests must be re-run, so basically just follow the full testing procedure described at: <a href="#test-this-repo">Section 33.16, &#8220;Test this repo&#8221;</a>. The only tests that can be skipped are essentially the <a href="#baremetal">Baremetal</a> tests.</p>
+<p>Because the kernel is so central to this repository, almost all tests must be re-run, so basically just follow the full testing procedure described at: <a href="#test-this-repo">Section 34.16, &#8220;Test this repo&#8221;</a>. The only tests that can be skipped are essentially the <a href="#baremetal">Baremetal</a> tests.</p>
 </div>
 <div class="paragraph">
 <p>Before comitting, don&#8217;t forget to update:</p>
@@ -15240,7 +15241,7 @@ detected buffer overflow in strlen
 </div>
 </div>
 <div class="paragraph">
-<p>SELinux requires glibc as mentioned at: <a href="#libc-choice">Section 20.10, &#8220;libc choice&#8221;</a>.</p>
+<p>SELinux requires glibc as mentioned at: <a href="#libc-choice">Section 21.10, &#8220;libc choice&#8221;</a>.</p>
 </div>
 </div>
 </div>
@@ -16371,7 +16372,7 @@ wget \
 </div>
 </div>
 <div class="paragraph">
-<p><code>STRESS_NG</code> is likely the best, but it requires glibc, see: <a href="#libc-choice">Section 20.10, &#8220;libc choice&#8221;</a>.</p>
+<p><code>STRESS_NG</code> is likely the best, but it requires glibc, see: <a href="#libc-choice">Section 21.10, &#8220;libc choice&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Websites:</p>
@@ -17809,10 +17810,10 @@ run
 <p>The build outputs are automatically stored in a different directories for optimized and debug builds, which prevents <code>debug</code> files from overwriting <code>opt</code> ones. Therefore, <code>--gem5-build-id</code> is not required.</p>
 </div>
 <div class="paragraph">
-<p>The price to pay for debuggability is high however: a Linux kernel boot was about 3x slower in QEMU and 14 times slower in gem5 debug compared to opt, see benchmarks at: <a href="#benchmark-linux-kernel-boot">Section 29.2.1, &#8220;Benchmark Linux kernel boot&#8221;</a>.</p>
+<p>The price to pay for debuggability is high however: a Linux kernel boot was about 3x slower in QEMU and 14 times slower in gem5 debug compared to opt, see benchmarks at: <a href="#benchmark-linux-kernel-boot">Section 30.2.1, &#8220;Benchmark Linux kernel boot&#8221;</a>.</p>
 </div>
 <div class="paragraph">
-<p>Similar slowdowns can be observed at: <a href="#benchmark-emulators-on-userland-executables">Section 29.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a>.</p>
+<p>Similar slowdowns can be observed at: <a href="#benchmark-emulators-on-userland-executables">Section 30.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>When in <a href="#qemu-text-mode">QEMU text mode</a>, using <code>--debug-vm</code> makes Ctrl-C not get passed to the QEMU guest anymore: it is instead captured by GDB itself, so allow breaking. So e.g. you won&#8217;t be able to easily quit from a guest program like:</p>
@@ -18546,7 +18547,7 @@ extern SimpleFlag ExecEnable;
 <p><code>25007500</code>: time count in some unit. Note how the microops execute at further timestamps.</p>
 </li>
 <li>
-<p><code>system.cpu</code>: distinguishes between CPUs when there are more than one. For example, running <a href="#arm-baremetal-multicore">Section 27.10.3, &#8220;ARM baremetal multicore&#8221;</a> with two cores produces <code>system.cpu0</code> and <code>system.cpu1</code></p>
+<p><code>system.cpu</code>: distinguishes between CPUs when there are more than one. For example, running <a href="#arm-baremetal-multicore">Section 28.10.3, &#8220;ARM baremetal multicore&#8221;</a> with two cores produces <code>system.cpu0</code> and <code>system.cpu1</code></p>
 </li>
 <li>
 <p><code>T0</code>: thread number. TODO: <a href="https://superuser.com/questions/133082/hyper-threading-and-dual-core-whats-the-difference/995858#995858">hyperthread</a>? How to play with it?</p>
@@ -18839,7 +18840,7 @@ root</pre>
 <p>runs are deterministic by default, unlike QEMU which has a special <a href="#qemu-record-and-replay">QEMU record and replay</a> mode, that requires first playing the content once and then replaying</p>
 </li>
 <li>
-<p>gem5 ARM at least appears to implement more low level CPU functionality than QEMU, e.g. QEMU only added EL2 in 2018: <a href="https://stackoverflow.com/questions/42824706/qemu-system-aarch64-entering-el1-when-emulating-a53-power-up" class="bare">https://stackoverflow.com/questions/42824706/qemu-system-aarch64-entering-el1-when-emulating-a53-power-up</a> See also: <a href="#arm-exception-levels">Section 27.10.1, &#8220;ARM exception levels&#8221;</a></p>
+<p>gem5 ARM at least appears to implement more low level CPU functionality than QEMU, e.g. QEMU only added EL2 in 2018: <a href="https://stackoverflow.com/questions/42824706/qemu-system-aarch64-entering-el1-when-emulating-a53-power-up" class="bare">https://stackoverflow.com/questions/42824706/qemu-system-aarch64-entering-el1-when-emulating-a53-power-up</a> See also: <a href="#arm-exception-levels">Section 28.10.1, &#8220;ARM exception levels&#8221;</a></p>
 </li>
 <li>
 <p>gem5 offers more advanced logging, even for non micro architectural things which QEMU models in some way, e.g. <a href="#qemu-trace-memory-accesses">QEMU trace memory accesses</a>, because QEMU&#8217;s binary translation optimizations reduce visibility</p>
@@ -18852,7 +18853,7 @@ root</pre>
 <div class="ulist">
 <ul>
 <li>
-<p>slower than QEMU, see: <a href="#benchmark-linux-kernel-boot">Section 29.2.1, &#8220;Benchmark Linux kernel boot&#8221;</a></p>
+<p>slower than QEMU, see: <a href="#benchmark-linux-kernel-boot">Section 30.2.1, &#8220;Benchmark Linux kernel boot&#8221;</a></p>
 <div class="paragraph">
 <p>This implies that the user base is much smaller, since no Android devs.</p>
 </div>
@@ -19480,7 +19481,7 @@ instructions 91738770</pre>
 <p>we have no caches, each instruction is fetched from memory</p>
 </li>
 <li>
-<p>each loop contains 11 instructions as shown at <a href="#c-busy-loop">Section 31.2, &#8220;C busy loop&#8221;</a></p>
+<p>each loop contains 11 instructions as shown at <a href="#c-busy-loop">Section 32.2, &#8220;C busy loop&#8221;</a></p>
 </li>
 <li>
 <p>and supposing that the loop dominated executable pre/post <code>main</code>, which we know is true since as shown in <a href="#benchmark-emulators-on-userland-executables">Benchmark emulators on userland executables</a> an empty dynamically linked C program only as about 100k instructions, while our loop runs 1000000 * 11 = 12M.</p>
@@ -21954,7 +21955,7 @@ Exiting @ tick 18446744073709551615 because simulate() limit reached</pre>
 <div class="ulist">
 <ul>
 <li>
-<p><a href="#benchmark-emulators-on-userland-executables">Section 29.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a></p>
+<p><a href="#benchmark-emulators-on-userland-executables">Section 30.2.2, &#8220;Benchmark emulators on userland executables&#8221;</a></p>
 </li>
 </ul>
 </div>
@@ -22255,7 +22256,7 @@ cat "$(./getvar --arch aarch64 --emulator gem5 trace_txt_file)"</pre>
 <p>It presumably implements a crossbar switch along the lines of: <a href="https://en.wikipedia.org/wiki/Crossbar_switch" class="bare">https://en.wikipedia.org/wiki/Crossbar_switch</a></p>
 </div>
 <div class="paragraph">
-<p>One simple example of its operation can be seen at: <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">Section 19.20.4.2, &#8220;gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis&#8221;</a></p>
+<p>One simple example of its operation can be seen at: <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">Section 19.21.4.2, &#8220;gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>But arguably interesting effects can only be observed when we have more than 1 CPUs as in <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a>.</p>
@@ -22762,7 +22763,50 @@ cd ..
 </div>
 </div>
 <div class="sect2">
-<h3 id="gem5-internals"><a class="anchor" href="#gem5-internals"></a><a class="link" href="#gem5-internals">19.20. gem5 internals</a></h3>
+<h3 id="gem5-commmonitor"><a class="anchor" href="#gem5-commmonitor"></a><a class="link" href="#gem5-commmonitor">19.20. gem5 <code>CommMonitor</code></a></h3>
+<div class="paragraph">
+<p>You can place this <a href="#gem5-python-c-interaction">SimObject</a> in between two <a href="#gem5-port-system">ports</a> to get extra statistics about the packets that are going through.</p>
+</div>
+<div class="paragraph">
+<p>It only works on timing CPUs, and does not seem to dump any memory values: it only adds extra <a href="#gem5-m5out-stats-txt-file">statistics</a>.</p>
+</div>
+<div class="paragraph">
+<p>For example, the patch <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/patches/manual/gem5-commmonitor-se.patch">patches/manual/gem5-commmonitor-se.patch</a> hacks a <code>CommMonitor</code> between the CPU and the L1 cache on top of gem5 1c3662c9557c85f0d25490dc4fbde3f8ab0cb350:</p>
+</div>
+<div class="literalblock">
+<div class="content">
+<pre>patch -d "$(./getvar gem5_source_dir)" -p 1 &lt; patches/manual/gem5-commmonitor-se.patch</pre>
+</div>
+</div>
+<div class="paragraph">
+<p>which you can run with:</p>
+</div>
+<div class="literalblock">
+<div class="content">
+<pre>./run \
+  --arch aarch64 \
+  --emulator gem5 \
+  --userland userland/arch/aarch64/freestanding/linux/hello.S \
+  -- \
+  --caches \
+  --cpu-type TimingSimpleCPU \
+;</pre>
+</div>
+</div>
+<div class="paragraph">
+<p>and now we have some extra histogram statistics such as:</p>
+</div>
+<div class="literalblock">
+<div class="content">
+<pre>system.cpu.dcache_mon.readBurstLengthHist::samples            1</pre>
+</div>
+</div>
+<div class="paragraph">
+<p>One neat thing about this is that it is agnostic to the memory object type, so you don&#8217;t have to recode those statistics for every new type of object that operates on memory packets.</p>
+</div>
+</div>
+<div class="sect2">
+<h3 id="gem5-internals"><a class="anchor" href="#gem5-internals"></a><a class="link" href="#gem5-internals">19.21. gem5 internals</a></h3>
 <div class="paragraph">
 <p>Internals under other sections:</p>
 </div>
@@ -22780,7 +22824,7 @@ cd ..
 </ul>
 </div>
 <div class="sect3">
-<h4 id="gem5-eclipse-configuration"><a class="anchor" href="#gem5-eclipse-configuration"></a><a class="link" href="#gem5-eclipse-configuration">19.20.1. gem5 Eclipse configuration</a></h4>
+<h4 id="gem5-eclipse-configuration"><a class="anchor" href="#gem5-eclipse-configuration"></a><a class="link" href="#gem5-eclipse-configuration">19.21.1. gem5 Eclipse configuration</a></h4>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/61656709/how-to-setup-eclipse-ide-for-gem5-development" class="bare">https://stackoverflow.com/questions/61656709/how-to-setup-eclipse-ide-for-gem5-development</a></p>
 </div>
@@ -22842,7 +22886,7 @@ cd ..
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-python-c-interaction"><a class="anchor" href="#gem5-python-c-interaction"></a><a class="link" href="#gem5-python-c-interaction">19.20.2. gem5 Python C++ interaction</a></h4>
+<h4 id="gem5-python-c-interaction"><a class="anchor" href="#gem5-python-c-interaction"></a><a class="link" href="#gem5-python-c-interaction">19.21.2. gem5 Python C++ interaction</a></h4>
 <div class="paragraph">
 <p>The interaction uses the Python C extension interface <a href="https://docs.python.org/2/extending/extending.html" class="bare">https://docs.python.org/2/extending/extending.html</a> interface through the <a href="#pybind11">pybind11</a> helper library: <a href="https://github.com/pybind/pybind11" class="bare">https://github.com/pybind/pybind11</a></p>
 </div>
@@ -23017,6 +23061,9 @@ static EmbeddedPyBind embed_obj("BadDevice", module_init, "BasicPioDevice");</pr
 <li>
 <p><a href="https://stackoverflow.com/questions/61910993/viewing-the-parameters-of-the-branch-predictor-in-gem5/61914449#61914449" class="bare">https://stackoverflow.com/questions/61910993/viewing-the-parameters-of-the-branch-predictor-in-gem5/61914449#61914449</a></p>
 </li>
+<li>
+<p><a href="https://stackoverflow.com/questions/62969566/attributes-of-system-object-in-gem5/62970092#62970092" class="bare">https://stackoverflow.com/questions/62969566/attributes-of-system-object-in-gem5/62970092#62970092</a></p>
+</li>
 </ul>
 </div>
 <div class="paragraph">
@@ -23024,7 +23071,7 @@ static EmbeddedPyBind embed_obj("BadDevice", module_init, "BasicPioDevice");</pr
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-entry-point"><a class="anchor" href="#gem5-entry-point"></a><a class="link" href="#gem5-entry-point">19.20.3. gem5 entry point</a></h4>
+<h4 id="gem5-entry-point"><a class="anchor" href="#gem5-entry-point"></a><a class="link" href="#gem5-entry-point">19.21.3. gem5 entry point</a></h4>
 <div class="paragraph">
 <p>The main is at: <code>src/sim/main.cc</code>. It calls:</p>
 </div>
@@ -23112,7 +23159,7 @@ exec filecode in scope</pre>
 <p>Tested at gem5 b4879ae5b0b6644e6836b0881e4da05c64a6550d.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-m5-objects-module"><a class="anchor" href="#gem5-m5-objects-module"></a><a class="link" href="#gem5-m5-objects-module">19.20.3.1. gem5 <code>m5.objects</code> module</a></h5>
+<h5 id="gem5-m5-objects-module"><a class="anchor" href="#gem5-m5-objects-module"></a><a class="link" href="#gem5-m5-objects-module">19.21.3.1. gem5 <code>m5.objects</code> module</a></h5>
 <div class="paragraph">
 <p>All <code>SimObjects</code> seem to be automatically added to the <code>m5.objects</code> namespace, and this is done in a very convoluted way, let&#8217;s try to understand a bit:</p>
 </div>
@@ -23277,7 +23324,7 @@ for source in PySource.all:
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-event-queue"><a class="anchor" href="#gem5-event-queue"></a><a class="link" href="#gem5-event-queue">19.20.4. gem5 event queue</a></h4>
+<h4 id="gem5-event-queue"><a class="anchor" href="#gem5-event-queue"></a><a class="link" href="#gem5-event-queue">19.21.4. gem5 event queue</a></h4>
 <div class="paragraph">
 <p>gem5 is an event based simulator, and as such the event queue is of of the crucial elements in the system.</p>
 </div>
@@ -23383,7 +23430,7 @@ b EventFunctionWrapper::process</pre>
 <p>Then, once we had that, the most perfect thing ever would be to make the full event graph containing which events schedule which events!</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis">19.20.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis</a></h5>
+<h5 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis">19.21.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis</a></h5>
 <div class="paragraph">
 <p>Let&#8217;s now analyze every single event on a minimal <a href="#gem5-syscall-emulation-mode">gem5 syscall emulation mode</a> in the <a href="#gem5-cpu-types">simplest CPU that we have</a>:</p>
 </div>
@@ -23519,7 +23566,7 @@ AtomicSimpleCPU::tick() at atomic.cc:757 0x55555907834c</pre>
 <p>Tested in gem5 12c917de54145d2d50260035ba7fa614e25317a3.</p>
 </div>
 <div class="sect5">
-<h6 id="atomicsimplecpu-initial-events"><a class="anchor" href="#atomicsimplecpu-initial-events"></a><a class="link" href="#atomicsimplecpu-initial-events">19.20.4.1.1. AtomicSimpleCPU initial events</a></h6>
+<h6 id="atomicsimplecpu-initial-events"><a class="anchor" href="#atomicsimplecpu-initial-events"></a><a class="link" href="#atomicsimplecpu-initial-events">19.21.4.1.1. AtomicSimpleCPU initial events</a></h6>
 <div class="paragraph">
 <p>Let&#8217;s have a closer look at the initial magically scheduled events of the simulation.</p>
 </div>
@@ -23738,7 +23785,7 @@ simulate() at simulate.cc:104 0x555559476d6f</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="atomicsimplecpu-tick-reschedule-timing"><a class="anchor" href="#atomicsimplecpu-tick-reschedule-timing"></a><a class="link" href="#atomicsimplecpu-tick-reschedule-timing">19.20.4.1.2. AtomicSimpleCPU tick reschedule timing</a></h6>
+<h6 id="atomicsimplecpu-tick-reschedule-timing"><a class="anchor" href="#atomicsimplecpu-tick-reschedule-timing"></a><a class="link" href="#atomicsimplecpu-tick-reschedule-timing">19.21.4.1.2. AtomicSimpleCPU tick reschedule timing</a></h6>
 <div class="paragraph">
 <p>Inside <code>AtomicSimpleCPU::tick()</code> we saw previously that the reschedule happens at:</p>
 </div>
@@ -23778,7 +23825,7 @@ clock=500</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="atomicsimplecpu-memory-access"><a class="anchor" href="#atomicsimplecpu-memory-access"></a><a class="link" href="#atomicsimplecpu-memory-access">19.20.4.1.3. AtomicSimpleCPU memory access</a></h6>
+<h6 id="atomicsimplecpu-memory-access"><a class="anchor" href="#atomicsimplecpu-memory-access"></a><a class="link" href="#atomicsimplecpu-memory-access">19.21.4.1.3. AtomicSimpleCPU memory access</a></h6>
 <div class="paragraph">
 <p>It will be interesting to see how <code>AtomicSimpleCPU</code> makes memory access on GDB and to compare that with <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis"><code>TimingSimpleCPU</code></a>.</p>
 </div>
@@ -23832,7 +23879,7 @@ clock=500</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-se-py-page-translation"><a class="anchor" href="#gem5-se-py-page-translation"></a><a class="link" href="#gem5-se-py-page-translation">19.20.4.1.4. gem5 se.py page translation</a></h6>
+<h6 id="gem5-se-py-page-translation"><a class="anchor" href="#gem5-se-py-page-translation"></a><a class="link" href="#gem5-se-py-page-translation">19.21.4.1.4. gem5 se.py page translation</a></h6>
 <div class="paragraph">
 <p>Happens on <code>EmulationPageTable</code>, and seems to happen atomically without making any extra memory requests.</p>
 </div>
@@ -23903,7 +23950,7 @@ Exiting @ tick 3500 because exiting with last active thread context
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">19.20.4.2. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis</a></h5>
+<h5 id="gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">19.21.4.2. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis</a></h5>
 <div class="paragraph">
 <p>Now, let&#8217;s move on to <code>TimingSimpleCPU</code>, which is just like <code>AtomicSimpleCPU</code> internally, but now the memory requests don&#8217;t actually finish immediately: <a href="#gem5-cpu-types">gem5 CPU types</a>!</p>
 </div>
@@ -24184,7 +24231,7 @@ info: Entering event queue @ 0.  Starting simulation...
 </ul>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-0"><a class="anchor" href="#timingsimplecpu-analysis-0"></a><a class="link" href="#timingsimplecpu-analysis-0">19.20.4.2.1. TimingSimpleCPU analysis #0</a></h6>
+<h6 id="timingsimplecpu-analysis-0"><a class="anchor" href="#timingsimplecpu-analysis-0"></a><a class="link" href="#timingsimplecpu-analysis-0">19.21.4.2.1. TimingSimpleCPU analysis #0</a></h6>
 <div class="paragraph">
 <p>Schedules <code>TimingSimpleCPU::fetch</code> through:</p>
 </div>
@@ -24229,7 +24276,7 @@ ArmLinuxProcess64::initState</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-1"><a class="anchor" href="#timingsimplecpu-analysis-1"></a><a class="link" href="#timingsimplecpu-analysis-1">19.20.4.2.2. TimingSimpleCPU analysis #1</a></h6>
+<h6 id="timingsimplecpu-analysis-1"><a class="anchor" href="#timingsimplecpu-analysis-1"></a><a class="link" href="#timingsimplecpu-analysis-1">19.21.4.2.2. TimingSimpleCPU analysis #1</a></h6>
 <div class="paragraph">
 <p>Backtrace:</p>
 </div>
@@ -24360,7 +24407,7 @@ DRAMCtrl::Rank::startup(Tick ref_tick)
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-2"><a class="anchor" href="#timingsimplecpu-analysis-2"></a><a class="link" href="#timingsimplecpu-analysis-2">19.20.4.2.3. TimingSimpleCPU analysis #2</a></h6>
+<h6 id="timingsimplecpu-analysis-2"><a class="anchor" href="#timingsimplecpu-analysis-2"></a><a class="link" href="#timingsimplecpu-analysis-2">19.21.4.2.3. TimingSimpleCPU analysis #2</a></h6>
 <div class="paragraph">
 <p>This is just the startup of the second rank, see: <a href="#timingsimplecpu-analysis-1">TimingSimpleCPU analysis #1</a>.</p>
 </div>
@@ -24393,13 +24440,13 @@ DRAMCtrl::Rank::startup(Tick ref_tick)
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-3-and-4"><a class="anchor" href="#timingsimplecpu-analysis-3-and-4"></a><a class="link" href="#timingsimplecpu-analysis-3-and-4">19.20.4.2.4. TimingSimpleCPU analysis #3 and #4</a></h6>
+<h6 id="timingsimplecpu-analysis-3-and-4"><a class="anchor" href="#timingsimplecpu-analysis-3-and-4"></a><a class="link" href="#timingsimplecpu-analysis-3-and-4">19.21.4.2.4. TimingSimpleCPU analysis #3 and #4</a></h6>
 <div class="paragraph">
 <p>From the timing we know what that one is: the end of time exit event, like for <code>AtomicSimpleCPU</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-5"><a class="anchor" href="#timingsimplecpu-analysis-5"></a><a class="link" href="#timingsimplecpu-analysis-5">19.20.4.2.5. TimingSimpleCPU analysis #5</a></h6>
+<h6 id="timingsimplecpu-analysis-5"><a class="anchor" href="#timingsimplecpu-analysis-5"></a><a class="link" href="#timingsimplecpu-analysis-5">19.21.4.2.5. TimingSimpleCPU analysis #5</a></h6>
 <div class="paragraph">
 <p>Executes <code>TimingSimpleCPU::fetch()</code>.</p>
 </div>
@@ -24507,7 +24554,7 @@ DRAMCtrl::Rank::startup(Tick ref_tick)
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-6"><a class="anchor" href="#timingsimplecpu-analysis-6"></a><a class="link" href="#timingsimplecpu-analysis-6">19.20.4.2.6. TimingSimpleCPU analysis #6</a></h6>
+<h6 id="timingsimplecpu-analysis-6"><a class="anchor" href="#timingsimplecpu-analysis-6"></a><a class="link" href="#timingsimplecpu-analysis-6">19.21.4.2.6. TimingSimpleCPU analysis #6</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::processNextReqEvent</code> through:</p>
 </div>
@@ -24644,7 +24691,7 @@ TimingSimpleCPU::fetch</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-7"><a class="anchor" href="#timingsimplecpu-analysis-7"></a><a class="link" href="#timingsimplecpu-analysis-7">19.20.4.2.7. TimingSimpleCPU analysis #7</a></h6>
+<h6 id="timingsimplecpu-analysis-7"><a class="anchor" href="#timingsimplecpu-analysis-7"></a><a class="link" href="#timingsimplecpu-analysis-7">19.21.4.2.7. TimingSimpleCPU analysis #7</a></h6>
 <div class="paragraph">
 <p>Schedules <code>BaseXBar::Layer::releaseLayer</code> through:</p>
 </div>
@@ -24670,13 +24717,13 @@ TimingSimpleCPU::fetch</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-8"><a class="anchor" href="#timingsimplecpu-analysis-8"></a><a class="link" href="#timingsimplecpu-analysis-8">19.20.4.2.8. TimingSimpleCPU analysis #8</a></h6>
+<h6 id="timingsimplecpu-analysis-8"><a class="anchor" href="#timingsimplecpu-analysis-8"></a><a class="link" href="#timingsimplecpu-analysis-8">19.21.4.2.8. TimingSimpleCPU analysis #8</a></h6>
 <div class="paragraph">
 <p>Executes <code>DRAMCtrl::processNextReqEvent</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-9"><a class="anchor" href="#timingsimplecpu-analysis-9"></a><a class="link" href="#timingsimplecpu-analysis-9">19.20.4.2.9. TimingSimpleCPU analysis #9</a></h6>
+<h6 id="timingsimplecpu-analysis-9"><a class="anchor" href="#timingsimplecpu-analysis-9"></a><a class="link" href="#timingsimplecpu-analysis-9">19.21.4.2.9. TimingSimpleCPU analysis #9</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::Rank::processActivateEvent</code> through:</p>
 </div>
@@ -24690,7 +24737,7 @@ DRAMCtrl::processNextReqEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-10"><a class="anchor" href="#timingsimplecpu-analysis-10"></a><a class="link" href="#timingsimplecpu-analysis-10">19.20.4.2.10. TimingSimpleCPU analysis #10</a></h6>
+<h6 id="timingsimplecpu-analysis-10"><a class="anchor" href="#timingsimplecpu-analysis-10"></a><a class="link" href="#timingsimplecpu-analysis-10">19.21.4.2.10. TimingSimpleCPU analysis #10</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::processRespondEvent</code> through:</p>
 </div>
@@ -24702,7 +24749,7 @@ DRAMCtrl::processNextReqEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-11"><a class="anchor" href="#timingsimplecpu-analysis-11"></a><a class="link" href="#timingsimplecpu-analysis-11">19.20.4.2.11. TimingSimpleCPU analysis #11</a></h6>
+<h6 id="timingsimplecpu-analysis-11"><a class="anchor" href="#timingsimplecpu-analysis-11"></a><a class="link" href="#timingsimplecpu-analysis-11">19.21.4.2.11. TimingSimpleCPU analysis #11</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::processNextReqEvent</code> through:</p>
 </div>
@@ -24714,7 +24761,7 @@ DRAMCtrl::processNextReqEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-12"><a class="anchor" href="#timingsimplecpu-analysis-12"></a><a class="link" href="#timingsimplecpu-analysis-12">19.20.4.2.12. TimingSimpleCPU analysis #12</a></h6>
+<h6 id="timingsimplecpu-analysis-12"><a class="anchor" href="#timingsimplecpu-analysis-12"></a><a class="link" href="#timingsimplecpu-analysis-12">19.21.4.2.12. TimingSimpleCPU analysis #12</a></h6>
 <div class="paragraph">
 <p>Executes <code>DRAMCtrl::Rank::processActivateEvent</code>.</p>
 </div>
@@ -24723,7 +24770,7 @@ DRAMCtrl::processNextReqEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-13"><a class="anchor" href="#timingsimplecpu-analysis-13"></a><a class="link" href="#timingsimplecpu-analysis-13">19.20.4.2.13. TimingSimpleCPU analysis #13</a></h6>
+<h6 id="timingsimplecpu-analysis-13"><a class="anchor" href="#timingsimplecpu-analysis-13"></a><a class="link" href="#timingsimplecpu-analysis-13">19.21.4.2.13. TimingSimpleCPU analysis #13</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::Rank::processPowerEvent</code> through:</p>
 </div>
@@ -24736,7 +24783,7 @@ DRAMCtrl::Rank::processActivateEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-14"><a class="anchor" href="#timingsimplecpu-analysis-14"></a><a class="link" href="#timingsimplecpu-analysis-14">19.20.4.2.14. TimingSimpleCPU analysis #14</a></h6>
+<h6 id="timingsimplecpu-analysis-14"><a class="anchor" href="#timingsimplecpu-analysis-14"></a><a class="link" href="#timingsimplecpu-analysis-14">19.21.4.2.14. TimingSimpleCPU analysis #14</a></h6>
 <div class="paragraph">
 <p>Executes <code>DRAMCtrl::Rank::processPowerEvent</code>.</p>
 </div>
@@ -24745,25 +24792,25 @@ DRAMCtrl::Rank::processActivateEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-15"><a class="anchor" href="#timingsimplecpu-analysis-15"></a><a class="link" href="#timingsimplecpu-analysis-15">19.20.4.2.15. TimingSimpleCPU analysis #15</a></h6>
+<h6 id="timingsimplecpu-analysis-15"><a class="anchor" href="#timingsimplecpu-analysis-15"></a><a class="link" href="#timingsimplecpu-analysis-15">19.21.4.2.15. TimingSimpleCPU analysis #15</a></h6>
 <div class="paragraph">
 <p>Executes <code>BaseXBar::Layer&lt;SrcType, DstType&gt;::releaseLayer</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-16"><a class="anchor" href="#timingsimplecpu-analysis-16"></a><a class="link" href="#timingsimplecpu-analysis-16">19.20.4.2.16. TimingSimpleCPU analysis #16</a></h6>
+<h6 id="timingsimplecpu-analysis-16"><a class="anchor" href="#timingsimplecpu-analysis-16"></a><a class="link" href="#timingsimplecpu-analysis-16">19.21.4.2.16. TimingSimpleCPU analysis #16</a></h6>
 <div class="paragraph">
 <p>Executes <code>DRAMCtrl::processNextReqEvent()</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-17"><a class="anchor" href="#timingsimplecpu-analysis-17"></a><a class="link" href="#timingsimplecpu-analysis-17">19.20.4.2.17. TimingSimpleCPU analysis #17</a></h6>
+<h6 id="timingsimplecpu-analysis-17"><a class="anchor" href="#timingsimplecpu-analysis-17"></a><a class="link" href="#timingsimplecpu-analysis-17">19.21.4.2.17. TimingSimpleCPU analysis #17</a></h6>
 <div class="paragraph">
 <p>Executes <code>DRAMCtrl::processRespondEvent()</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-18"><a class="anchor" href="#timingsimplecpu-analysis-18"></a><a class="link" href="#timingsimplecpu-analysis-18">19.20.4.2.18. TimingSimpleCPU analysis #18</a></h6>
+<h6 id="timingsimplecpu-analysis-18"><a class="anchor" href="#timingsimplecpu-analysis-18"></a><a class="link" href="#timingsimplecpu-analysis-18">19.21.4.2.18. TimingSimpleCPU analysis #18</a></h6>
 <div class="paragraph">
 <p>Schedules <code>PacketQueue::processSendEvent()</code> through:</p>
 </div>
@@ -24778,13 +24825,13 @@ DRAMCtrl::processRespondEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-19"><a class="anchor" href="#timingsimplecpu-analysis-19"></a><a class="link" href="#timingsimplecpu-analysis-19">19.20.4.2.19. TimingSimpleCPU analysis #19</a></h6>
+<h6 id="timingsimplecpu-analysis-19"><a class="anchor" href="#timingsimplecpu-analysis-19"></a><a class="link" href="#timingsimplecpu-analysis-19">19.21.4.2.19. TimingSimpleCPU analysis #19</a></h6>
 <div class="paragraph">
 <p>Executes <code>PacketQueue::processSendEvent()</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-20"><a class="anchor" href="#timingsimplecpu-analysis-20"></a><a class="link" href="#timingsimplecpu-analysis-20">19.20.4.2.20. TimingSimpleCPU analysis #20</a></h6>
+<h6 id="timingsimplecpu-analysis-20"><a class="anchor" href="#timingsimplecpu-analysis-20"></a><a class="link" href="#timingsimplecpu-analysis-20">19.21.4.2.20. TimingSimpleCPU analysis #20</a></h6>
 <div class="paragraph">
 <p>Schedules <code>PacketQueue::processSendEvent</code> through:</p>
 </div>
@@ -24808,7 +24855,7 @@ PacketQueue::processSendEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-21"><a class="anchor" href="#timingsimplecpu-analysis-21"></a><a class="link" href="#timingsimplecpu-analysis-21">19.20.4.2.21. TimingSimpleCPU analysis #21</a></h6>
+<h6 id="timingsimplecpu-analysis-21"><a class="anchor" href="#timingsimplecpu-analysis-21"></a><a class="link" href="#timingsimplecpu-analysis-21">19.21.4.2.21. TimingSimpleCPU analysis #21</a></h6>
 <div class="paragraph">
 <p>Schedules <code>BaseXBar::Layer&lt;SrcType, DstType&gt;::releaseLayer</code> through:</p>
 </div>
@@ -24828,19 +24875,19 @@ PacketQueue::processSendEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-22"><a class="anchor" href="#timingsimplecpu-analysis-22"></a><a class="link" href="#timingsimplecpu-analysis-22">19.20.4.2.22. TimingSimpleCPU analysis #22</a></h6>
+<h6 id="timingsimplecpu-analysis-22"><a class="anchor" href="#timingsimplecpu-analysis-22"></a><a class="link" href="#timingsimplecpu-analysis-22">19.21.4.2.22. TimingSimpleCPU analysis #22</a></h6>
 <div class="paragraph">
 <p>Executes <code>BaseXBar::Layer&lt;SrcType, DstType&gt;::releaseLayer</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-23"><a class="anchor" href="#timingsimplecpu-analysis-23"></a><a class="link" href="#timingsimplecpu-analysis-23">19.20.4.2.23. TimingSimpleCPU analysis #23</a></h6>
+<h6 id="timingsimplecpu-analysis-23"><a class="anchor" href="#timingsimplecpu-analysis-23"></a><a class="link" href="#timingsimplecpu-analysis-23">19.21.4.2.23. TimingSimpleCPU analysis #23</a></h6>
 <div class="paragraph">
 <p>Executes <code>PacketQueue::processSendEvent</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-24"><a class="anchor" href="#timingsimplecpu-analysis-24"></a><a class="link" href="#timingsimplecpu-analysis-24">19.20.4.2.24. TimingSimpleCPU analysis #24</a></h6>
+<h6 id="timingsimplecpu-analysis-24"><a class="anchor" href="#timingsimplecpu-analysis-24"></a><a class="link" href="#timingsimplecpu-analysis-24">19.21.4.2.24. TimingSimpleCPU analysis #24</a></h6>
 <div class="paragraph">
 <p>Schedules <code>TimingSimpleCPU::IcachePort::ITickEvent::process()</code> through:</p>
 </div>
@@ -24858,7 +24905,7 @@ PacketQueue::processSendEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-25"><a class="anchor" href="#timingsimplecpu-analysis-25"></a><a class="link" href="#timingsimplecpu-analysis-25">19.20.4.2.25. TimingSimpleCPU analysis #25</a></h6>
+<h6 id="timingsimplecpu-analysis-25"><a class="anchor" href="#timingsimplecpu-analysis-25"></a><a class="link" href="#timingsimplecpu-analysis-25">19.21.4.2.25. TimingSimpleCPU analysis #25</a></h6>
 <div class="paragraph">
 <p>Executes <code>TimingSimpleCPU::IcachePort::ITickEvent::process()</code>.</p>
 </div>
@@ -24878,7 +24925,7 @@ PacketQueue::processSendEvent</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-26"><a class="anchor" href="#timingsimplecpu-analysis-26"></a><a class="link" href="#timingsimplecpu-analysis-26">19.20.4.2.26. TimingSimpleCPU analysis #26</a></h6>
+<h6 id="timingsimplecpu-analysis-26"><a class="anchor" href="#timingsimplecpu-analysis-26"></a><a class="link" href="#timingsimplecpu-analysis-26">19.21.4.2.26. TimingSimpleCPU analysis #26</a></h6>
 <div class="paragraph">
 <p>Schedules <code>DRAMCtrl::processNextReqEvent</code> through:</p>
 </div>
@@ -24907,7 +24954,7 @@ TimingSimpleCPU::IcachePort::ITickEvent::process</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-27"><a class="anchor" href="#timingsimplecpu-analysis-27"></a><a class="link" href="#timingsimplecpu-analysis-27">19.20.4.2.27. TimingSimpleCPU analysis #27</a></h6>
+<h6 id="timingsimplecpu-analysis-27"><a class="anchor" href="#timingsimplecpu-analysis-27"></a><a class="link" href="#timingsimplecpu-analysis-27">19.21.4.2.27. TimingSimpleCPU analysis #27</a></h6>
 <div class="paragraph">
 <p>Schedules <code>BaseXBar::Layer&lt;SrcType, DstType&gt;::releaseLayer</code> through:</p>
 </div>
@@ -24933,19 +24980,19 @@ TimingSimpleCPU::IcachePort::ITickEvent::process</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-28"><a class="anchor" href="#timingsimplecpu-analysis-28"></a><a class="link" href="#timingsimplecpu-analysis-28">19.20.4.2.28. TimingSimpleCPU analysis #28</a></h6>
+<h6 id="timingsimplecpu-analysis-28"><a class="anchor" href="#timingsimplecpu-analysis-28"></a><a class="link" href="#timingsimplecpu-analysis-28">19.21.4.2.28. TimingSimpleCPU analysis #28</a></h6>
 <div class="paragraph">
 <p>Execute <code>DRAMCtrl::processNextReqEvent</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-29"><a class="anchor" href="#timingsimplecpu-analysis-29"></a><a class="link" href="#timingsimplecpu-analysis-29">19.20.4.2.29. TimingSimpleCPU analysis #29</a></h6>
+<h6 id="timingsimplecpu-analysis-29"><a class="anchor" href="#timingsimplecpu-analysis-29"></a><a class="link" href="#timingsimplecpu-analysis-29">19.21.4.2.29. TimingSimpleCPU analysis #29</a></h6>
 <div class="paragraph">
 <p>Schedule <code>DRAMCtrl::processRespondEvent()</code>.</p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="timingsimplecpu-analysis-ldr-stall"><a class="anchor" href="#timingsimplecpu-analysis-ldr-stall"></a><a class="link" href="#timingsimplecpu-analysis-ldr-stall">19.20.4.2.30. TimingSimpleCPU analysis: LDR stall</a></h6>
+<h6 id="timingsimplecpu-analysis-ldr-stall"><a class="anchor" href="#timingsimplecpu-analysis-ldr-stall"></a><a class="link" href="#timingsimplecpu-analysis-ldr-stall">19.21.4.2.30. TimingSimpleCPU analysis: LDR stall</a></h6>
 <div class="paragraph">
 <p>One important thing we want to check now, is how the memory reads are going to make the processor stall in the middle of an instruction.</p>
 </div>
@@ -25063,7 +25110,7 @@ TimingSimpleCPU::IcachePort::ITickEvent::process</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches"><a class="anchor" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches"></a><a class="link" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">19.20.4.3. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a></h5>
+<h5 id="gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches"><a class="anchor" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches"></a><a class="link" href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">19.21.4.3. gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a></h5>
 <div class="paragraph">
 <p>Let&#8217;s just add <code>--caches</code> to <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis">gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis</a> to see if things go any faster, and add <code>Cache</code> to <code>--trace</code> as in:</p>
 </div>
@@ -25358,7 +25405,7 @@ type=SetAssociative</pre>
 </ul>
 </div>
 <div class="sect5">
-<h6 id="what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5"><a class="anchor" href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5"></a><a class="link" href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">19.20.4.3.1. What is the coherency protocol implemented by the classic cache system in gem5?</a></h6>
+<h6 id="what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5"><a class="anchor" href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5"></a><a class="link" href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">19.21.4.3.1. What is the coherency protocol implemented by the classic cache system in gem5?</a></h6>
 <div class="paragraph">
 <p><a href="#moesi">MOESI cache coherence protocol</a>: <a href="https://github.com/gem5/gem5/blob/9fc9c67b4242c03f165951775be5cd0812f2a705/src/mem/cache/cache_blk.hh#L352" class="bare">https://github.com/gem5/gem5/blob/9fc9c67b4242c03f165951775be5cd0812f2a705/src/mem/cache/cache_blk.hh#L352</a></p>
 </div>
@@ -25366,12 +25413,12 @@ type=SetAssociative</pre>
 <p>The actual representation is done via separate state bits: <a href="https://github.com/gem5/gem5/blob/9fc9c67b4242c03f165951775be5cd0812f2a705/src/mem/cache/cache_blk.hh#L66" class="bare">https://github.com/gem5/gem5/blob/9fc9c67b4242c03f165951775be5cd0812f2a705/src/mem/cache/cache_blk.hh#L66</a> and MOESI appears explicitly only on the pretty printing.</p>
 </div>
 <div class="paragraph">
-<p>This pretty printing appears for example in the <code>--trace Cache</code> lines as shown at <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a> and with a few more transitions visible at <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">Section 19.20.4.4, &#8220;gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs&#8221;</a>.</p>
+<p>This pretty printing appears for example in the <code>--trace Cache</code> lines as shown at <a href="#gem5-event-queue-timingsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches">gem5 event queue TimingSimpleCPU syscall emulation freestanding example analysis with caches</a> and with a few more transitions visible at <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">Section 19.21.4.4, &#8220;gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs&#8221;</a>.</p>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">19.20.4.4. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a></h5>
+<h5 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">19.21.4.4. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a></h5>
 <div class="paragraph">
 <p>It would be amazing to analyze a simple example with interconnect packets possibly invalidating caches of other CPUs.</p>
 </div>
@@ -25581,7 +25628,7 @@ type=SetAssociative</pre>
 <p>and so on, they just keep fighting over that address and changing one another&#8217;s state.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby">19.20.4.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs and Ruby</a></h6>
+<h6 id="gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby"><a class="anchor" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby"></a><a class="link" href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus-and-ruby">19.21.4.4.1. gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs and Ruby</a></h6>
 <div class="paragraph">
 <p>Now let&#8217;s do the exact same we did for <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs</a>, but with <a href="#gem5-ruby-build">Ruby</a> rather than the classic system.</p>
 </div>
@@ -25624,7 +25671,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis">19.20.4.5. gem5 event queue MinorCPU syscall emulation freestanding example analysis</a></h5>
+<h5 id="gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis">19.21.4.5. gem5 event queue MinorCPU syscall emulation freestanding example analysis</a></h5>
 <div class="paragraph">
 <p>The events <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis">for the Atomic CPU</a> were pretty simple: basically just ticks.</p>
 </div>
@@ -25757,11 +25804,11 @@ non-atomic 19</pre>
 <div class="paragraph">
 <p>so now we are ready to run the third and fourth instructions of the program:</p>
 </div>
-<div class="paragraph">
-<p>,&#8230;&#8203;
-    ldr x2, =len
-    mov x8, 64
-,&#8230;&#8203;</p>
+<div class="literalblock">
+<div class="content">
+<pre>    ldr x2, =len
+    mov x8, 64</pre>
+</div>
 </div>
 <div class="paragraph">
 <p>The <a href="#arm-ldr-instruction">LDR</a> goes all the way down to FU 6 which is the memory one:</p>
@@ -25794,14 +25841,14 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard"><a class="anchor" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard"></a><a class="link" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard">19.20.4.5.1. gem5 event queue MinorCPU syscall emulation freestanding example analysis: hazard</a></h6>
+<h6 id="gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard"><a class="anchor" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard"></a><a class="link" href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis-hazard">19.21.4.5.1. gem5 event queue MinorCPU syscall emulation freestanding example analysis: hazard</a></h6>
 <div class="paragraph">
 <p>TODO like <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a> but with the hazard.</p>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis">19.20.4.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis</a></h5>
+<h5 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis">19.21.4.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis</a></h5>
 <div class="paragraph">
 <p>Like <a href="#gem5-event-queue-minorcpu-syscall-emulation-freestanding-example-analysis">gem5 event queue MinorCPU syscall emulation freestanding example analysis</a> but even more complex since for the <a href="#gem5-derivo3cpu">gem5 <code>DerivO3CPU</code></a>!</p>
 </div>
@@ -25829,7 +25876,7 @@ non-atomic 19</pre>
 <p>This section and children are tested at LKMC 144a552cf926ea630ef9eadbb22b79fe2468c456.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless">19.20.4.6.1. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazardless</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless">19.21.4.6.1. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazardless</a></h6>
 <div class="paragraph">
 <p>Let&#8217;s  have a look at the arguably simplest example <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/hazardless.S">userland/arch/aarch64/freestanding/linux/hazardless.S</a>.</p>
 </div>
@@ -26068,7 +26115,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">19.20.4.6.2. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">19.21.4.6.2. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a></h6>
 <div class="paragraph">
 <p>Now let&#8217;s do the same as in <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazardless">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazardless</a> but with a hazard: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/hazard.S">userland/arch/aarch64/freestanding/linux/hazard.S</a>.</p>
 </div>
@@ -26112,7 +26159,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4">19.20.4.6.3. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard4</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard4">19.21.4.6.3. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard4</a></h6>
 <div class="paragraph">
 <p>Like <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a> but a hazard of depth 4: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/hazard.S">userland/arch/aarch64/freestanding/linux/hazard.S</a>.</p>
 </div>
@@ -26153,7 +26200,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall">19.20.4.6.4. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall">19.21.4.6.4. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall</a></h6>
 <div class="paragraph">
 <p>Like <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-hazard">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: hazard</a> but now with an LDR stall: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/stall.S">userland/arch/aarch64/freestanding/linux/stall.S</a>.</p>
 </div>
@@ -26204,7 +26251,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain">19.20.4.6.5. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-gain</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain">19.21.4.6.5. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-gain</a></h6>
 <div class="paragraph">
 <p>Like <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall</a> but now with an LDR stall: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/stall-gain.S">userland/arch/aarch64/freestanding/linux/stall-gain.S</a>.</p>
 </div>
@@ -26291,7 +26338,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4">19.20.4.6.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-hazard4</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-hazard4">19.21.4.6.6. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-hazard4</a></h6>
 <div class="paragraph">
 <p>Like <a href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-stall-gain">gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: stall-gain</a> but now with some dependencies after the LDR: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/stall-hazard4.S">userland/arch/aarch64/freestanding/linux/stall-hazard4.S</a>.</p>
 </div>
@@ -26358,7 +26405,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative">19.20.4.6.7. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: speculative</a></h6>
+<h6 id="gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative"><a class="anchor" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative"></a><a class="link" href="#gem5-event-queue-derivo3cpu-syscall-emulation-freestanding-example-analysis-speculative">19.21.4.6.7. gem5 event queue DerivO3CPU syscall emulation freestanding example analysis: speculative</a></h6>
 <div class="paragraph">
 <p>Now let&#8217;s try to see some <a href="#speculative-execution">Speculative execution</a> in action with <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/freestanding/linux/speculative.S">userland/arch/aarch64/freestanding/linux/speculative.S</a>.</p>
 </div>
@@ -26547,7 +26594,7 @@ wbActual:0
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-instruction-definitions"><a class="anchor" href="#gem5-instruction-definitions"></a><a class="link" href="#gem5-instruction-definitions">19.20.5. gem5 instruction definitions</a></h4>
+<h4 id="gem5-instruction-definitions"><a class="anchor" href="#gem5-instruction-definitions"></a><a class="link" href="#gem5-instruction-definitions">19.21.5. gem5 instruction definitions</a></h4>
 <div class="paragraph">
 <p>This is one of the parts of gem5 that rely on semi-useless <a href="#gem5-code-generation">code generation</a> inside the <code>.isa</code> sublanguage.</p>
 </div>
@@ -26590,7 +26637,7 @@ wbActual:0
 </div>
 </div>
 <div class="paragraph">
-<p>We also notice that the key argument passed to those instructions is of type <code>ExecContext</code>, which is discussed further at: <a href="#gem5-execcontext">Section 19.20.7.3, &#8220;gem5 <code>ExecContext</code>&#8221;</a>.</p>
+<p>We also notice that the key argument passed to those instructions is of type <code>ExecContext</code>, which is discussed further at: <a href="#gem5-execcontext">Section 19.21.7.3, &#8220;gem5 <code>ExecContext</code>&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>The file is an include so that compilation can be split up into chunks by the autogenerated includers</p>
@@ -26795,7 +26842,7 @@ namespace ArmISAInst {
 <p>Tested in gem5 b1623cb2087873f64197e503ab8894b5e4d4c7b4.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-execute-vs-initiateacc-vs-completeacc"><a class="anchor" href="#gem5-execute-vs-initiateacc-vs-completeacc"></a><a class="link" href="#gem5-execute-vs-initiateacc-vs-completeacc">19.20.5.1. gem5 <code>execute</code> vs <code>initiateAcc</code> vs <code>completeAcc</code></a></h5>
+<h5 id="gem5-execute-vs-initiateacc-vs-completeacc"><a class="anchor" href="#gem5-execute-vs-initiateacc-vs-completeacc"></a><a class="link" href="#gem5-execute-vs-initiateacc-vs-completeacc">19.21.5.1. gem5 <code>execute</code> vs <code>initiateAcc</code> vs <code>completeAcc</code></a></h5>
 <div class="paragraph">
 <p>These are the key methods defined in instruction definitions, so let&#8217;s see when each one gets called and roughly what it does.</p>
 </div>
@@ -26849,7 +26896,7 @@ namespace ArmISAInst {
 <p>This can be seen concretely in GDB from the analysis done at: <a href="#timingsimplecpu-analysis-ldr-stall">TimingSimpleCPU analysis: LDR stall</a> and for more memory details see <a href="#gem5-functional-vs-atomic-vs-timing-memory-requests">gem5 functional vs atomic vs timing memory requests</a>.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-completeacc"><a class="anchor" href="#gem5-completeacc"></a><a class="link" href="#gem5-completeacc">19.20.5.1.1. gem5 <code>completeAcc</code></a></h6>
+<h6 id="gem5-completeacc"><a class="anchor" href="#gem5-completeacc"></a><a class="link" href="#gem5-completeacc">19.21.5.1.1. gem5 <code>completeAcc</code></a></h6>
 <div class="paragraph">
 <p><code>completeAcc</code> is boring on most simple store memory instructions, e.g. a simple STR:</p>
 </div>
@@ -26902,7 +26949,7 @@ namespace ArmISAInst {
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-microops"><a class="anchor" href="#gem5-microops"></a><a class="link" href="#gem5-microops">19.20.5.2. gem5 microops</a></h5>
+<h5 id="gem5-microops"><a class="anchor" href="#gem5-microops"></a><a class="link" href="#gem5-microops">19.21.5.2. gem5 microops</a></h5>
 <div class="paragraph">
 <p>Some gem5 instructions break down into multiple microops.</p>
 </div>
@@ -26963,7 +27010,7 @@ namespace ArmISAInst {
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-port-system"><a class="anchor" href="#gem5-port-system"></a><a class="link" href="#gem5-port-system">19.20.6. gem5 port system</a></h4>
+<h4 id="gem5-port-system"><a class="anchor" href="#gem5-port-system"></a><a class="link" href="#gem5-port-system">19.21.6. gem5 port system</a></h4>
 <div class="paragraph">
 <p>The gem5 memory system is connected in a very flexible way through the port system.</p>
 </div>
@@ -26971,7 +27018,7 @@ namespace ArmISAInst {
 <p>This system exists to allow seamlessly connecting any combination of CPU, caches, interconnects, DRAM and peripherals.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-functional-vs-atomic-vs-timing-memory-requests"><a class="anchor" href="#gem5-functional-vs-atomic-vs-timing-memory-requests"></a><a class="link" href="#gem5-functional-vs-atomic-vs-timing-memory-requests">19.20.6.1. gem5 functional vs atomic vs timing memory requests</a></h5>
+<h5 id="gem5-functional-vs-atomic-vs-timing-memory-requests"><a class="anchor" href="#gem5-functional-vs-atomic-vs-timing-memory-requests"></a><a class="link" href="#gem5-functional-vs-atomic-vs-timing-memory-requests">19.21.6.1. gem5 functional vs atomic vs timing memory requests</a></h5>
 <div class="paragraph">
 <p>gem5 memory requests can be classified in the following broad categories:</p>
 </div>
@@ -27181,7 +27228,7 @@ TimingSimpleCPU::finishTranslation(WholeTranslationState *state)
 <p>Tested in gem5 b1623cb2087873f64197e503ab8894b5e4d4c7b4.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-functional-requests"><a class="anchor" href="#gem5-functional-requests"></a><a class="link" href="#gem5-functional-requests">19.20.6.1.1. gem5 functional requests</a></h6>
+<h6 id="gem5-functional-requests"><a class="anchor" href="#gem5-functional-requests"></a><a class="link" href="#gem5-functional-requests">19.21.6.1.1. gem5 functional requests</a></h6>
 <div class="paragraph">
 <p>As seen at <a href="#gem5-functional-vs-atomic-vs-timing-memory-requests">gem5 functional vs atomic vs timing memory requests</a>, functional requests are not used in common simulation, since the core must always go through caches.</p>
 </div>
@@ -27228,7 +27275,7 @@ TimingSimpleCPU::finishTranslation(WholeTranslationState *state)
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process"><a class="anchor" href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process"></a><a class="link" href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process">19.20.7. gem5 <code>ThreadContext</code> vs <code>ThreadState</code> vs <code>ExecContext</code> vs <code>Process</code></a></h4>
+<h4 id="gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process"><a class="anchor" href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process"></a><a class="link" href="#gem5-threadcontext-vs-threadstate-vs-execcontext-vs-process">19.21.7. gem5 <code>ThreadContext</code> vs <code>ThreadState</code> vs <code>ExecContext</code> vs <code>Process</code></a></h4>
 <div class="paragraph">
 <p>These classes get used everywhere, and they have a somewhat convoluted relation with one another, so let&#8217;s figure out this mess.</p>
 </div>
@@ -27239,7 +27286,7 @@ TimingSimpleCPU::finishTranslation(WholeTranslationState *state)
 <p>This section and all children tested at gem5 b1623cb2087873f64197e503ab8894b5e4d4c7b4.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-threadcontext"><a class="anchor" href="#gem5-threadcontext"></a><a class="link" href="#gem5-threadcontext">19.20.7.1. gem5 <code>ThreadContext</code></a></h5>
+<h5 id="gem5-threadcontext"><a class="anchor" href="#gem5-threadcontext"></a><a class="link" href="#gem5-threadcontext">19.21.7.1. gem5 <code>ThreadContext</code></a></h5>
 <div class="paragraph">
 <p>As we delve into more details below, we will reach the following conclusion: a <code>ThreadContext</code> represents one thread of a CPU with multiple <a href="#hardware-threads">Hardware threads</a>.</p>
 </div>
@@ -27289,7 +27336,7 @@ typedef SimpleThread MinorThread;</pre>
 <p>Essentially all methods of the base <code>ThreadContext</code> are pure virtual.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-simplethread"><a class="anchor" href="#gem5-simplethread"></a><a class="link" href="#gem5-simplethread">19.20.7.1.1. gem5 <code>SimpleThread</code></a></h6>
+<h6 id="gem5-simplethread"><a class="anchor" href="#gem5-simplethread"></a><a class="link" href="#gem5-simplethread">19.21.7.1.1. gem5 <code>SimpleThread</code></a></h6>
 <div class="paragraph">
 <p><code>SimpleThread</code> storage is defined in <a href="#gem5-basesimplecpu"><code>BaseSimpleCPU</code></a> for simple CPUs like <code>AtomicSimpleCPU</code>:</p>
 </div>
@@ -27384,7 +27431,7 @@ typedef SimpleThread MinorThread;</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-o3threadcontext"><a class="anchor" href="#gem5-o3threadcontext"></a><a class="link" href="#gem5-o3threadcontext">19.20.7.1.2. gem5 <code>O3ThreadContext</code></a></h6>
+<h6 id="gem5-o3threadcontext"><a class="anchor" href="#gem5-o3threadcontext"></a><a class="link" href="#gem5-o3threadcontext">19.21.7.1.2. gem5 <code>O3ThreadContext</code></a></h6>
 <div class="paragraph">
 <p>Instantiation happens in the <code>FullO3CPU</code> constructor:</p>
 </div>
@@ -27485,7 +27532,7 @@ FullO3CPU&lt;Impl&gt;::readArchIntReg(int reg_idx, ThreadID tid)
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-threadstate"><a class="anchor" href="#gem5-threadstate"></a><a class="link" href="#gem5-threadstate">19.20.7.2. gem5 <code>ThreadState</code></a></h5>
+<h5 id="gem5-threadstate"><a class="anchor" href="#gem5-threadstate"></a><a class="link" href="#gem5-threadstate">19.21.7.2. gem5 <code>ThreadState</code></a></h5>
 <div class="paragraph">
 <p>One is owned per <code>ThreadContext</code>.</p>
 </div>
@@ -27531,7 +27578,7 @@ class O3ThreadContext : public ThreadContext
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-execcontext"><a class="anchor" href="#gem5-execcontext"></a><a class="link" href="#gem5-execcontext">19.20.7.3. gem5 <code>ExecContext</code></a></h5>
+<h5 id="gem5-execcontext"><a class="anchor" href="#gem5-execcontext"></a><a class="link" href="#gem5-execcontext">19.21.7.3. gem5 <code>ExecContext</code></a></h5>
 <div class="paragraph">
 <p><code>ExecContext</code> gets used in <a href="#gem5-instruction-definitions">gem5 instruction definitions</a>, e.g.:</p>
 </div>
@@ -27691,7 +27738,7 @@ class O3ThreadContext : public ThreadContext
 <p>This makes sense, since each <code>ThreadContext</code> represents one CPU register set, and therefore needs a separate <code>ExecContext</code> which allows instruction implementations to access those registers.</p>
 </div>
 <div class="sect5">
-<h6 id="gem5-execcontext-readintregoperand-register-resolution"><a class="anchor" href="#gem5-execcontext-readintregoperand-register-resolution"></a><a class="link" href="#gem5-execcontext-readintregoperand-register-resolution">19.20.7.3.1. gem5 <code>ExecContext::readIntRegOperand</code> register resolution</a></h6>
+<h6 id="gem5-execcontext-readintregoperand-register-resolution"><a class="anchor" href="#gem5-execcontext-readintregoperand-register-resolution"></a><a class="link" href="#gem5-execcontext-readintregoperand-register-resolution">19.21.7.3.1. gem5 <code>ExecContext::readIntRegOperand</code> register resolution</a></h6>
 <div class="paragraph">
 <p>Let&#8217;s have a look at how <code>ExecContext::readIntRegOperand</code> actually matches registers to decoded register IDs, since it is not obvious.</p>
 </div>
@@ -27730,7 +27777,7 @@ class O3ThreadContext : public ThreadContext
 <p>First, we guess that they must be related to the reading of <code>x1</code> and <code>x2</code>, which are the inputs of the addition.</p>
 </div>
 <div class="paragraph">
-<p>Next, we also guess that the <code>0</code> read must correspond to <code>x2</code>, since it later gets potentially shifted as mentioned at <a href="#arm-shift-suffixes">Section 24.4.4.1, &#8220;ARM shift suffixes&#8221;</a>.</p>
+<p>Next, we also guess that the <code>0</code> read must correspond to <code>x2</code>, since it later gets potentially shifted as mentioned at <a href="#arm-shift-suffixes">Section 25.4.4.1, &#8220;ARM shift suffixes&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Let&#8217;s also have a look at the decoder code that builds the instruction instance in <code>build/ARM/arch/arm/generated/decoder-ns.cc.inc</code>:</p>
@@ -27964,7 +28011,7 @@ flattenIntIndex(int reg) const
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-process"><a class="anchor" href="#gem5-process"></a><a class="link" href="#gem5-process">19.20.7.4. gem5 <code>Process</code></a></h5>
+<h5 id="gem5-process"><a class="anchor" href="#gem5-process"></a><a class="link" href="#gem5-process">19.21.7.4. gem5 <code>Process</code></a></h5>
 <div class="paragraph">
 <p>The <code>Process</code> class is used only for <a href="#gem5-syscall-emulation-mode">gem5 syscall emulation mode</a>, and it represents a process like a Linux userland process, in addition to any further gem5 specific data needed to represent the process.</p>
 </div>
@@ -28052,12 +28099,12 @@ readFunc(SyscallDesc *desc, ThreadContext *tc,
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-functional-units"><a class="anchor" href="#gem5-functional-units"></a><a class="link" href="#gem5-functional-units">19.20.8. gem5 functional units</a></h4>
+<h4 id="gem5-functional-units"><a class="anchor" href="#gem5-functional-units"></a><a class="link" href="#gem5-functional-units">19.21.8. gem5 functional units</a></h4>
 <div class="paragraph">
 <p>Each instruction is marked with a class, and each class can execute in a given <a href="#execution-unit">functional unit</a>.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-minorcpu-default-functional-units"><a class="anchor" href="#gem5-minorcpu-default-functional-units"></a><a class="link" href="#gem5-minorcpu-default-functional-units">19.20.8.1. gem5 <code>MinorCPU</code> default functional units</a></h5>
+<h5 id="gem5-minorcpu-default-functional-units"><a class="anchor" href="#gem5-minorcpu-default-functional-units"></a><a class="link" href="#gem5-minorcpu-default-functional-units">19.21.8.1. gem5 <code>MinorCPU</code> default functional units</a></h5>
 <div class="paragraph">
 <p>Which units are available is visible for example on the <a href="#gem5-config-ini">gem5 config.ini</a> of a <a href="#gem5-minorcpu">gem5 MinorCPU</a> run. Functional units are not present in simple CPUs like <a href="#gem5-timingsimplecpu">gem5 <code>TimingSimpleCPU</code></a>.</p>
 </div>
@@ -28216,7 +28263,7 @@ opClass=IntAlu</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-derivo3cpu-default-functional-units"><a class="anchor" href="#gem5-derivo3cpu-default-functional-units"></a><a class="link" href="#gem5-derivo3cpu-default-functional-units">19.20.8.2. gem5 DerivO3CPU default functional units</a></h5>
+<h5 id="gem5-derivo3cpu-default-functional-units"><a class="anchor" href="#gem5-derivo3cpu-default-functional-units"></a><a class="link" href="#gem5-derivo3cpu-default-functional-units">19.21.8.2. gem5 DerivO3CPU default functional units</a></h5>
 <div class="paragraph">
 <p>On gem5 3ca404da175a66e0b958165ad75eb5f54cb5e772, after running:</p>
 </div>
@@ -28314,7 +28361,7 @@ pipelined=false</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-code-generation"><a class="anchor" href="#gem5-code-generation"></a><a class="link" href="#gem5-code-generation">19.20.9. gem5 code generation</a></h4>
+<h4 id="gem5-code-generation"><a class="anchor" href="#gem5-code-generation"></a><a class="link" href="#gem5-code-generation">19.21.9. gem5 code generation</a></h4>
 <div class="paragraph">
 <p>gem5 uses a ton of code generation, which makes the project horrendous:</p>
 </div>
@@ -28359,7 +28406,7 @@ pipelined=false</pre>
 <p>But it has been widely overused to insanity. It likely also exists partly because when the project started in 2003 C++ compilers weren&#8217;t that good, so you couldn&#8217;t rely on features like templates that much.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-the-isa"><a class="anchor" href="#gem5-the-isa"></a><a class="link" href="#gem5-the-isa">19.20.9.1. gem5 THE_ISA</a></h5>
+<h5 id="gem5-the-isa"><a class="anchor" href="#gem5-the-isa"></a><a class="link" href="#gem5-the-isa">19.21.9.1. gem5 THE_ISA</a></h5>
 <div class="paragraph">
 <p>Generated code at: <code>build/&lt;ISA&gt;/config/the_isa.hh</code> which e.g. for ARM contains:</p>
 </div>
@@ -28405,9 +28452,9 @@ enum class Arch {
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-build-system"><a class="anchor" href="#gem5-build-system"></a><a class="link" href="#gem5-build-system">19.20.10. gem5 build system</a></h4>
+<h4 id="gem5-build-system"><a class="anchor" href="#gem5-build-system"></a><a class="link" href="#gem5-build-system">19.21.10. gem5 build system</a></h4>
 <div class="sect4">
-<h5 id="m5-override-py-source"><a class="anchor" href="#m5-override-py-source"></a><a class="link" href="#m5-override-py-source">19.20.10.1. M5_OVERRIDE_PY_SOURCE</a></h5>
+<h5 id="m5-override-py-source"><a class="anchor" href="#m5-override-py-source"></a><a class="link" href="#m5-override-py-source">19.21.10.1. M5_OVERRIDE_PY_SOURCE</a></h5>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/52312070/how-to-modify-a-file-under-src-python-and-run-it-without-rebuilding-in-gem5" class="bare">https://stackoverflow.com/questions/52312070/how-to-modify-a-file-under-src-python-and-run-it-without-rebuilding-in-gem5</a></p>
 </div>
@@ -28422,7 +28469,7 @@ enum class Arch {
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-build-broken-on-recent-compiler-version"><a class="anchor" href="#gem5-build-broken-on-recent-compiler-version"></a><a class="link" href="#gem5-build-broken-on-recent-compiler-version">19.20.10.2. gem5 build broken on recent compiler version</a></h5>
+<h5 id="gem5-build-broken-on-recent-compiler-version"><a class="anchor" href="#gem5-build-broken-on-recent-compiler-version"></a><a class="link" href="#gem5-build-broken-on-recent-compiler-version">19.21.10.2. gem5 build broken on recent compiler version</a></h5>
 <div class="paragraph">
 <p>gem5 moves a bit slowly, and if your host compiler is very new, the gem5 build might be broken for it, e.g. this was the case for Ubuntu 19.10 with GCC 9 and gem5 62d75e7105fe172eb906d4f80f360ff8591d4178 from Dec 2019.</p>
 </div>
@@ -28447,7 +28494,7 @@ enum class Arch {
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-polymorphic-isa-includes"><a class="anchor" href="#gem5-polymorphic-isa-includes"></a><a class="link" href="#gem5-polymorphic-isa-includes">19.20.10.3. gem5 polymorphic ISA includes</a></h5>
+<h5 id="gem5-polymorphic-isa-includes"><a class="anchor" href="#gem5-polymorphic-isa-includes"></a><a class="link" href="#gem5-polymorphic-isa-includes">19.21.10.3. gem5 polymorphic ISA includes</a></h5>
 <div class="paragraph">
 <p>E.g. <code>src/cpu/decode_cache.hh</code> includes:</p>
 </div>
@@ -28526,7 +28573,7 @@ build/ARM/config/the_isa.hh
 </div>
 </div>
 <div class="sect4">
-<h5 id="why-are-all-c-symlinked-into-the-gem5-build-dir"><a class="anchor" href="#why-are-all-c-symlinked-into-the-gem5-build-dir"></a><a class="link" href="#why-are-all-c-symlinked-into-the-gem5-build-dir">19.20.10.4. Why are all C++ symlinked into the gem5 build dir?</a></h5>
+<h5 id="why-are-all-c-symlinked-into-the-gem5-build-dir"><a class="anchor" href="#why-are-all-c-symlinked-into-the-gem5-build-dir"></a><a class="link" href="#why-are-all-c-symlinked-into-the-gem5-build-dir">19.21.10.4. Why are all C++ symlinked into the gem5 build dir?</a></h5>
 <div class="paragraph">
 <p>Upstream request: <a href="https://gem5.atlassian.net/browse/GEM5-469" class="bare">https://gem5.atlassian.net/browse/GEM5-469</a></p>
 </div>
@@ -28564,8 +28611,11 @@ build/ARM/config/the_isa.hh
 </div>
 </div>
 </div>
-<div class="sect2">
-<h3 id="gensim"><a class="anchor" href="#gensim"></a><a class="link" href="#gensim">19.21. Gensim</a></h3>
+</div>
+</div>
+<div class="sect1">
+<h2 id="gensim"><a class="anchor" href="#gensim"></a><a class="link" href="#gensim">20. Gensim</a></h2>
+<div class="sectionbody">
 <div class="paragraph">
 <p><a href="https://gensim.org" class="bare">https://gensim.org</a></p>
 </div>
@@ -28667,12 +28717,11 @@ gensim/models/armv8/isa.ac
 </div>
 </div>
 </div>
-</div>
 <div class="sect1">
-<h2 id="buildroot"><a class="anchor" href="#buildroot"></a><a class="link" href="#buildroot">20. Buildroot</a></h2>
+<h2 id="buildroot"><a class="anchor" href="#buildroot"></a><a class="link" href="#buildroot">21. Buildroot</a></h2>
 <div class="sectionbody">
 <div class="sect2">
-<h3 id="introduction-to-buildroot"><a class="anchor" href="#introduction-to-buildroot"></a><a class="link" href="#introduction-to-buildroot">20.1. Introduction to Buildroot</a></h3>
+<h3 id="introduction-to-buildroot"><a class="anchor" href="#introduction-to-buildroot"></a><a class="link" href="#introduction-to-buildroot">21.1. Introduction to Buildroot</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Buildroot">Buildroot</a> is a set of Make scripts that download and compile from source compatible versions of:</p>
 </div>
@@ -28685,7 +28734,7 @@ gensim/models/armv8/isa.ac
 <p>Linux kernel</p>
 </li>
 <li>
-<p>C standard library: Buildroot supports several implementations, see: <a href="#libc-choice">Section 20.10, &#8220;libc choice&#8221;</a></p>
+<p>C standard library: Buildroot supports several implementations, see: <a href="#libc-choice">Section 21.10, &#8220;libc choice&#8221;</a></p>
 </li>
 <li>
 <p><a href="https://en.wikipedia.org/wiki/BusyBox">BusyBox</a>: provides the shell and basic command line utilities</p>
@@ -28696,7 +28745,7 @@ gensim/models/armv8/isa.ac
 <p>It therefore produces a pristine, blob-less, debuggable setup, where all moving parts are configured to work perfectly together.</p>
 </div>
 <div class="paragraph">
-<p>Perhaps the awesomeness of Buildroot only sinks in once you notice that all it takes is 4 commands as explained at <a href="#buildroot-hello-world">Section 20.11, &#8220;Buildroot hello world&#8221;</a>.</p>
+<p>Perhaps the awesomeness of Buildroot only sinks in once you notice that all it takes is 4 commands as explained at <a href="#buildroot-hello-world">Section 21.11, &#8220;Buildroot hello world&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>The downsides of Buildroot are:</p>
@@ -28741,7 +28790,7 @@ gensim/models/armv8/isa.ac
 </div>
 </div>
 <div class="sect2">
-<h3 id="custom-buildroot-configs"><a class="anchor" href="#custom-buildroot-configs"></a><a class="link" href="#custom-buildroot-configs">20.2. Custom Buildroot configs</a></h3>
+<h3 id="custom-buildroot-configs"><a class="anchor" href="#custom-buildroot-configs"></a><a class="link" href="#custom-buildroot-configs">21.2. Custom Buildroot configs</a></h3>
 <div class="paragraph">
 <p>We provide the following mechanisms:</p>
 </div>
@@ -28776,10 +28825,10 @@ gensim/models/armv8/isa.ac
 <p>The clean is necessary because the source files didn&#8217;t change, so <code>make</code> would just check the timestamps and not build anything.</p>
 </div>
 <div class="paragraph">
-<p>You will then likely want to make those more permanent as explained at: <a href="#default-command-line-arguments">Section 33.4, &#8220;Default command line arguments&#8221;</a>.</p>
+<p>You will then likely want to make those more permanent as explained at: <a href="#default-command-line-arguments">Section 34.4, &#8220;Default command line arguments&#8221;</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="enable-buildroot-compiler-optimizations"><a class="anchor" href="#enable-buildroot-compiler-optimizations"></a><a class="link" href="#enable-buildroot-compiler-optimizations">20.2.1. Enable Buildroot compiler optimizations</a></h4>
+<h4 id="enable-buildroot-compiler-optimizations"><a class="anchor" href="#enable-buildroot-compiler-optimizations"></a><a class="link" href="#enable-buildroot-compiler-optimizations">21.2.1. Enable Buildroot compiler optimizations</a></h4>
 <div class="paragraph">
 <p>If you are benchmarking compiled programs instead of hand written assembly, remember that we configure Buildroot to disable optimizations by default with:</p>
 </div>
@@ -28811,7 +28860,7 @@ gensim/models/armv8/isa.ac
 <div class="ulist">
 <ul>
 <li>
-<p>if you already have a full <code>-O0</code> build, you can choose to rebuild just your package of interest to save some time as described at: <a href="#custom-buildroot-configs">Section 20.2, &#8220;Custom Buildroot configs&#8221;</a></p>
+<p>if you already have a full <code>-O0</code> build, you can choose to rebuild just your package of interest to save some time as described at: <a href="#custom-buildroot-configs">Section 21.2, &#8220;Custom Buildroot configs&#8221;</a></p>
 <div class="literalblock">
 <div class="content">
 <pre>./build-buildroot \
@@ -28847,7 +28896,7 @@ gensim/models/armv8/isa.ac
 </div>
 </div>
 <div class="sect2">
-<h3 id="find-buildroot-options-with-make-menuconfig"><a class="anchor" href="#find-buildroot-options-with-make-menuconfig"></a><a class="link" href="#find-buildroot-options-with-make-menuconfig">20.3. Find Buildroot options with make menuconfig</a></h3>
+<h3 id="find-buildroot-options-with-make-menuconfig"><a class="anchor" href="#find-buildroot-options-with-make-menuconfig"></a><a class="link" href="#find-buildroot-options-with-make-menuconfig">21.3. Find Buildroot options with make menuconfig</a></h3>
 <div class="paragraph">
 <p><code>make menuconfig</code> is a convenient way to find Buildroot configurations:</p>
 </div>
@@ -28873,7 +28922,7 @@ make menuconfig</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="change-user"><a class="anchor" href="#change-user"></a><a class="link" href="#change-user">20.4. Change user</a></h3>
+<h3 id="change-user"><a class="anchor" href="#change-user"></a><a class="link" href="#change-user">21.4. Change user</a></h3>
 <div class="paragraph">
 <p>At startup, we login automatically as the <code>root</code> user.</p>
 </div>
@@ -28910,7 +28959,7 @@ make menuconfig</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="login-as-a-non-root-user-without-password"><a class="anchor" href="#login-as-a-non-root-user-without-password"></a><a class="link" href="#login-as-a-non-root-user-without-password">20.4.1. Login as a non-root user without password</a></h4>
+<h4 id="login-as-a-non-root-user-without-password"><a class="anchor" href="#login-as-a-non-root-user-without-password"></a><a class="link" href="#login-as-a-non-root-user-without-password">21.4.1. Login as a non-root user without password</a></h4>
 <div class="paragraph">
 <p>Replace on <code>inittab</code>:</p>
 </div>
@@ -28933,7 +28982,7 @@ make menuconfig</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="add-new-files-to-the-buildroot-image"><a class="anchor" href="#add-new-files-to-the-buildroot-image"></a><a class="link" href="#add-new-files-to-the-buildroot-image">20.5. Add new files to the Buildroot image</a></h3>
+<h3 id="add-new-files-to-the-buildroot-image"><a class="anchor" href="#add-new-files-to-the-buildroot-image"></a><a class="link" href="#add-new-files-to-the-buildroot-image">21.5. Add new files to the Buildroot image</a></h3>
 <div class="paragraph">
 <p>There are basically two choices:</p>
 </div>
@@ -28986,7 +29035,7 @@ make menuconfig</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="add-new-buildroot-packages"><a class="anchor" href="#add-new-buildroot-packages"></a><a class="link" href="#add-new-buildroot-packages">20.5.1. Add new Buildroot packages</a></h4>
+<h4 id="add-new-buildroot-packages"><a class="anchor" href="#add-new-buildroot-packages"></a><a class="link" href="#add-new-buildroot-packages">21.5.1. Add new Buildroot packages</a></h4>
 <div class="paragraph">
 <p>First, see if you can&#8217;t get away without actually adding a new package, for example:</p>
 </div>
@@ -28996,7 +29045,7 @@ make menuconfig</pre>
 <p>if you have a standalone C file with no dependencies besides the C standard library to be compiled with GCC, just add a new file under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/buildroot_packages/sample_package">buildroot_packages/sample_package</a> and you are done</p>
 </li>
 <li>
-<p>if you have a dependency on a library, first check if Buildroot doesn&#8217;t have a package for it already with <code>ls buildroot/package</code>. If yes, just enable that package as explained at: <a href="#custom-buildroot-configs">Section 20.2, &#8220;Custom Buildroot configs&#8221;</a></p>
+<p>if you have a dependency on a library, first check if Buildroot doesn&#8217;t have a package for it already with <code>ls buildroot/package</code>. If yes, just enable that package as explained at: <a href="#custom-buildroot-configs">Section 21.2, &#8220;Custom Buildroot configs&#8221;</a></p>
 </li>
 </ul>
 </div>
@@ -29004,7 +29053,7 @@ make menuconfig</pre>
 <p>If none of those methods are flexible enough for you, you can just fork or hack up <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/buildroot_packages/sample_package">buildroot_packages/sample_package</a> the sample package to do what you want.</p>
 </div>
 <div class="paragraph">
-<p>For how to use that package, see: <a href="#buildroot_packages-directory">Section 33.15.2, &#8220;buildroot_packages directory&#8221;</a>.</p>
+<p>For how to use that package, see: <a href="#buildroot_packages-directory">Section 34.15.2, &#8220;buildroot_packages directory&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Then iterate trying to do what you want and reading the manual until it works: <a href="https://buildroot.org/downloads/manual/manual.html" class="bare">https://buildroot.org/downloads/manual/manual.html</a></p>
@@ -29012,7 +29061,7 @@ make menuconfig</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="remove-buildroot-packages"><a class="anchor" href="#remove-buildroot-packages"></a><a class="link" href="#remove-buildroot-packages">20.6. Remove Buildroot packages</a></h3>
+<h3 id="remove-buildroot-packages"><a class="anchor" href="#remove-buildroot-packages"></a><a class="link" href="#remove-buildroot-packages">21.6. Remove Buildroot packages</a></h3>
 <div class="paragraph">
 <p>Once you&#8217;ve built a package in to the image, there is no easy way to remove it.</p>
 </div>
@@ -29023,11 +29072,11 @@ make menuconfig</pre>
 <p>Also mentioned at: <a href="https://stackoverflow.com/questions/47320800/how-to-clean-only-target-in-buildroot" class="bare">https://stackoverflow.com/questions/47320800/how-to-clean-only-target-in-buildroot</a></p>
 </div>
 <div class="paragraph">
-<p>See this for a sample manual workaround: <a href="#parsec-uninstall">Section 21.8.1.4, &#8220;PARSEC uninstall&#8221;</a>.</p>
+<p>See this for a sample manual workaround: <a href="#parsec-uninstall">Section 22.8.1.4, &#8220;PARSEC uninstall&#8221;</a>.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="br2-target-rootfs-ext2-size"><a class="anchor" href="#br2-target-rootfs-ext2-size"></a><a class="link" href="#br2-target-rootfs-ext2-size">20.7. BR2_TARGET_ROOTFS_EXT2_SIZE</a></h3>
+<h3 id="br2-target-rootfs-ext2-size"><a class="anchor" href="#br2-target-rootfs-ext2-size"></a><a class="link" href="#br2-target-rootfs-ext2-size">21.7. BR2_TARGET_ROOTFS_EXT2_SIZE</a></h3>
 <div class="paragraph">
 <p>When adding new large package to the Buildroot root filesystem, it may fail with the message:</p>
 </div>
@@ -29079,7 +29128,7 @@ TODO benchmark: would gem5 suffer a considerable disk read performance hit due t
 <p>Bibliography: <a href="https://stackoverflow.com/questions/49211241/is-there-a-way-to-automatically-detect-the-minimum-required-br2-target-rootfs-ex" class="bare">https://stackoverflow.com/questions/49211241/is-there-a-way-to-automatically-detect-the-minimum-required-br2-target-rootfs-ex</a></p>
 </div>
 <div class="sect3">
-<h4 id="squashfs"><a class="anchor" href="#squashfs"></a><a class="link" href="#squashfs">20.7.1. SquashFS</a></h4>
+<h4 id="squashfs"><a class="anchor" href="#squashfs"></a><a class="link" href="#squashfs">21.7.1. SquashFS</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/SquashFS">SquashFS</a> creation with <code>mksquashfs</code> does not take fixed sizes, and I have successfully booted from it, but it is readonly, which is unacceptable.</p>
 </div>
@@ -29092,7 +29141,7 @@ TODO benchmark: would gem5 suffer a considerable disk read performance hit due t
 </div>
 </div>
 <div class="sect2">
-<h3 id="rpath"><a class="anchor" href="#rpath"></a><a class="link" href="#rpath">20.8. Buildroot rebuild is slow when the root filesystem is large</a></h3>
+<h3 id="rpath"><a class="anchor" href="#rpath"></a><a class="link" href="#rpath">21.8. Buildroot rebuild is slow when the root filesystem is large</a></h3>
 <div class="paragraph">
 <p>Buildroot is not designed for large root filesystem images, and the rebuild becomes very slow when we add a large package to it.</p>
 </div>
@@ -29130,7 +29179,7 @@ TODO benchmark: would gem5 suffer a considerable disk read performance hit due t
 </div>
 </div>
 <div class="sect2">
-<h3 id="report-upstream-bugs"><a class="anchor" href="#report-upstream-bugs"></a><a class="link" href="#report-upstream-bugs">20.9. Report upstream bugs</a></h3>
+<h3 id="report-upstream-bugs"><a class="anchor" href="#report-upstream-bugs"></a><a class="link" href="#report-upstream-bugs">21.9. Report upstream bugs</a></h3>
 <div class="paragraph">
 <p>When asking for help on upstream repositories outside of this repository, you will need to provide the commands that you are running in detail without referencing our scripts.</p>
 </div>
@@ -29190,7 +29239,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 <p>Then, you will also want to do a <a href="#bisection">Bisection</a> to pinpoint the exact commit to blame, and CC that developer.</p>
 </div>
 <div class="paragraph">
-<p>Finally, give the images you used save upstream developers' time as shown at: <a href="#release-zip">Section 33.19.2, &#8220;release-zip&#8221;</a>.</p>
+<p>Finally, provide the images you used, to save upstream developers' time, as shown at: <a href="#release-zip">Section 34.19.2, &#8220;release-zip&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>For Buildroot problems, you should either provide the config you have:</p>
@@ -29205,7 +29254,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 </div>
 </div>
 <div class="sect2">
-<h3 id="libc-choice"><a class="anchor" href="#libc-choice"></a><a class="link" href="#libc-choice">20.10. libc choice</a></h3>
+<h3 id="libc-choice"><a class="anchor" href="#libc-choice"></a><a class="link" href="#libc-choice">21.10. libc choice</a></h3>
 <div class="paragraph">
 <p>Buildroot supports several libc implementations, including:</p>
 </div>
@@ -29253,7 +29302,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 </div>
 </div>
 <div class="sect2">
-<h3 id="buildroot-hello-world"><a class="anchor" href="#buildroot-hello-world"></a><a class="link" href="#buildroot-hello-world">20.11. Buildroot hello world</a></h3>
+<h3 id="buildroot-hello-world"><a class="anchor" href="#buildroot-hello-world"></a><a class="link" href="#buildroot-hello-world">21.11. Buildroot hello world</a></h3>
 <div class="paragraph">
 <p>This repo doesn&#8217;t do much more other than setting a bunch of Buildroot configurations and building it.</p>
 </div>
@@ -29298,7 +29347,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 </div>
 </div>
 <div class="sect2">
-<h3 id="update-the-buildroot-toolchain"><a class="anchor" href="#update-the-buildroot-toolchain"></a><a class="link" href="#update-the-buildroot-toolchain">20.12. Update the Buildroot toolchain</a></h3>
+<h3 id="update-the-buildroot-toolchain"><a class="anchor" href="#update-the-buildroot-toolchain"></a><a class="link" href="#update-the-buildroot-toolchain">21.12. Update the Buildroot toolchain</a></h3>
 <div class="paragraph">
 <p>Users of this repo will often want to update the compilation toolchain to the latest version to get fresh new features like new ISA instructions.</p>
 </div>
@@ -29312,7 +29361,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 <p>In this section we cover the most common cases.</p>
 </div>
 <div class="sect3">
-<h4 id="update-gcc-gcc-supported-by-buildroot"><a class="anchor" href="#update-gcc-gcc-supported-by-buildroot"></a><a class="link" href="#update-gcc-gcc-supported-by-buildroot">20.12.1. Update GCC: GCC supported by Buildroot</a></h4>
+<h4 id="update-gcc-gcc-supported-by-buildroot"><a class="anchor" href="#update-gcc-gcc-supported-by-buildroot"></a><a class="link" href="#update-gcc-gcc-supported-by-buildroot">21.12.1. Update GCC: GCC supported by Buildroot</a></h4>
 <div class="paragraph">
 <p>This is of course the simplest case.</p>
 </div>
@@ -29430,9 +29479,9 @@ cd ../..
 </div>
 </div>
 <div class="sect3">
-<h4 id="update-gcc-gcc-not-supported-by-buildroot"><a class="anchor" href="#update-gcc-gcc-not-supported-by-buildroot"></a><a class="link" href="#update-gcc-gcc-not-supported-by-buildroot">20.12.2. Update GCC: GCC not supported by Buildroot</a></h4>
+<h4 id="update-gcc-gcc-not-supported-by-buildroot"><a class="anchor" href="#update-gcc-gcc-not-supported-by-buildroot"></a><a class="link" href="#update-gcc-gcc-not-supported-by-buildroot">21.12.2. Update GCC: GCC not supported by Buildroot</a></h4>
 <div class="paragraph">
-<p>Now it gets fun, but well, guess what, we will try to do the same as <a href="#update-gcc-gcc-supported-by-buildroot">Section 20.12.1, &#8220;Update GCC: GCC supported by Buildroot&#8221;</a> but:</p>
+<p>Now it gets fun, but well, guess what, we will try to do the same as <a href="#update-gcc-gcc-supported-by-buildroot">Section 21.12.1, &#8220;Update GCC: GCC supported by Buildroot&#8221;</a> but:</p>
 </div>
 <div class="ulist">
 <ul>
@@ -29490,7 +29539,7 @@ cd ../..
 </div>
 </div>
 <div class="sect2">
-<h3 id="buildroot-vanilla-kernel"><a class="anchor" href="#buildroot-vanilla-kernel"></a><a class="link" href="#buildroot-vanilla-kernel">20.13. Buildroot vanilla kernel</a></h3>
+<h3 id="buildroot-vanilla-kernel"><a class="anchor" href="#buildroot-vanilla-kernel"></a><a class="link" href="#buildroot-vanilla-kernel">21.13. Buildroot vanilla kernel</a></h3>
 <div class="paragraph">
 <p>By default, our build system uses <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/build-linux">build-linux</a>, and the Buildroot kernel build is disabled: <a href="https://stackoverflow.com/questions/52231793/can-buildroot-build-the-root-filesystem-without-building-the-linux-kernel" class="bare">https://stackoverflow.com/questions/52231793/can-buildroot-build-the-root-filesystem-without-building-the-linux-kernel</a></p>
 </div>
@@ -29522,7 +29571,7 @@ cd ../..
 </div>
 </div>
 <div class="sect1">
-<h2 id="userland-content"><a class="anchor" href="#userland-content"></a><a class="link" href="#userland-content">21. Userland content</a></h2>
+<h2 id="userland-content"><a class="anchor" href="#userland-content"></a><a class="link" href="#userland-content">22. Userland content</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>This section documents our test and educational userland content, such as <a href="#c">C</a>, <a href="#cpp">C++</a> and <a href="#posix">POSIX</a> examples, present mostly under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/">userland/</a>.</p>
@@ -29531,7 +29580,7 @@ cd ../..
 <p>Getting started at: <a href="#userland-setup">Section 1.8, &#8220;Userland setup&#8221;</a></p>
 </div>
 <div class="paragraph">
-<p>Userland assembly content is located at: <a href="#userland-assembly">Section 22, &#8220;Userland assembly&#8221;</a>. It was split from this section basically because we were hitting the HTML <code>h6</code> limit, stupid web :-)</p>
+<p>Userland assembly content is located at: <a href="#userland-assembly">Section 23, &#8220;Userland assembly&#8221;</a>. It was split from this section basically because we were hitting the HTML <code>h6</code> limit, stupid web :-)</p>
 </div>
 <div class="paragraph">
 <p>This content makes up the bulk of the <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/">userland/</a> directory.</p>
@@ -29543,7 +29592,7 @@ cd ../..
 <p>This section was originally moved in here from: <a href="https://github.com/cirosantilli/cpp-cheat" class="bare">https://github.com/cirosantilli/cpp-cheat</a></p>
 </div>
 <div class="sect2">
-<h3 id="c"><a class="anchor" href="#c"></a><a class="link" href="#c">21.1. C</a></h3>
+<h3 id="c"><a class="anchor" href="#c"></a><a class="link" href="#c">22.1. C</a></h3>
 <div class="paragraph">
 <p>Programs under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/c/">userland/c/</a> are examples of <a href="https://en.wikipedia.org/wiki/ANSI_C">ANSI C</a> programming:</p>
 </div>
@@ -29682,7 +29731,7 @@ cd ../..
 </ul>
 </div>
 <div class="sect3">
-<h4 id="malloc"><a class="anchor" href="#malloc"></a><a class="link" href="#malloc">21.1.1. malloc</a></h4>
+<h4 id="malloc"><a class="anchor" href="#malloc"></a><a class="link" href="#malloc">22.1.1. malloc</a></h4>
 <div class="paragraph">
 <p>Allocate memory! Vs using the stack: <a href="https://stackoverflow.com/questions/4584089/what-is-the-function-of-the-push-pop-instructions-used-on-registers-in-x86-ass/33583134#33583134" class="bare">https://stackoverflow.com/questions/4584089/what-is-the-function-of-the-push-pop-instructions-used-on-registers-in-x86-ass/33583134#33583134</a></p>
 </div>
@@ -29696,7 +29745,7 @@ cd ../..
 <p><code>malloc</code> leads to the infinite joys of <a href="#memory-leaks">Memory leaks</a>.</p>
 </div>
 <div class="sect4">
-<h5 id="malloc-implementation"><a class="anchor" href="#malloc-implementation"></a><a class="link" href="#malloc-implementation">21.1.1.1. malloc implementation</a></h5>
+<h5 id="malloc-implementation"><a class="anchor" href="#malloc-implementation"></a><a class="link" href="#malloc-implementation">22.1.1.1. malloc implementation</a></h5>
 <div class="paragraph">
 <p>TODO: the exact answer is going to be hard.</p>
 </div>
@@ -29741,7 +29790,7 @@ printf '%x\n' 4198400
 </div>
 </div>
 <div class="sect4">
-<h5 id="malloc-maximum-size"><a class="anchor" href="#malloc-maximum-size"></a><a class="link" href="#malloc-maximum-size">21.1.1.2. malloc maximum size</a></h5>
+<h5 id="malloc-maximum-size"><a class="anchor" href="#malloc-maximum-size"></a><a class="link" href="#malloc-maximum-size">22.1.1.2. malloc maximum size</a></h5>
 <div class="paragraph">
 <p>General overview at: <a href="https://stackoverflow.com/questions/2798330/maximum-memory-which-malloc-can-allocate" class="bare">https://stackoverflow.com/questions/2798330/maximum-memory-which-malloc-can-allocate</a></p>
 </div>
@@ -29807,7 +29856,7 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 <p>If we start using the pages, the OOM killer would sooner or later step in and kill our process: <a href="#linux-out-of-memory-killer">Linux out-of-memory killer</a>.</p>
 </div>
 <div class="sect5">
-<h6 id="linux-out-of-memory-killer"><a class="anchor" href="#linux-out-of-memory-killer"></a><a class="link" href="#linux-out-of-memory-killer">21.1.1.2.1. Linux out-of-memory killer</a></h6>
+<h6 id="linux-out-of-memory-killer"><a class="anchor" href="#linux-out-of-memory-killer"></a><a class="link" href="#linux-out-of-memory-killer">22.1.1.2.1. Linux out-of-memory killer</a></h6>
 <div class="paragraph">
 <p>We can observe the OOM in LKMC 1e969e832f66cb5a72d12d57c53fb09e9721d589 which defaults to 256MiB of memory with:</p>
 </div>
@@ -29833,7 +29882,7 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 </div>
 </div>
 <div class="sect3">
-<h4 id="c-multithreading"><a class="anchor" href="#c-multithreading"></a><a class="link" href="#c-multithreading">21.1.2. C multithreading</a></h4>
+<h4 id="c-multithreading"><a class="anchor" href="#c-multithreading"></a><a class="link" href="#c-multithreading">22.1.2. C multithreading</a></h4>
 <div class="paragraph">
 <p>Added in C11!</p>
 </div>
@@ -29851,7 +29900,7 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 </ul>
 </div>
 <div class="sect4">
-<h5 id="atomic-c"><a class="anchor" href="#atomic-c"></a><a class="link" href="#atomic-c">21.1.2.1. atomic.c</a></h5>
+<h5 id="atomic-c"><a class="anchor" href="#atomic-c"></a><a class="link" href="#atomic-c">22.1.2.1. atomic.c</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/c/atomic.c">userland/c/atomic.c</a></p>
 </div>
@@ -29927,9 +29976,9 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 </div>
 </div>
 <div class="sect3">
-<h4 id="gcc-c-extensions"><a class="anchor" href="#gcc-c-extensions"></a><a class="link" href="#gcc-c-extensions">21.1.3. GCC C extensions</a></h4>
+<h4 id="gcc-c-extensions"><a class="anchor" href="#gcc-c-extensions"></a><a class="link" href="#gcc-c-extensions">22.1.3. GCC C extensions</a></h4>
 <div class="sect4">
-<h5 id="c-empty-struct"><a class="anchor" href="#c-empty-struct"></a><a class="link" href="#c-empty-struct">21.1.3.1. C empty struct</a></h5>
+<h5 id="c-empty-struct"><a class="anchor" href="#c-empty-struct"></a><a class="link" href="#c-empty-struct">22.1.3.1. C empty struct</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/gcc/empty_struct.c">userland/gcc/empty_struct.c</a></p>
 </div>
@@ -29941,7 +29990,7 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 </div>
 </div>
 <div class="sect4">
-<h5 id="openmp"><a class="anchor" href="#openmp"></a><a class="link" href="#openmp">21.1.3.2. OpenMP</a></h5>
+<h5 id="openmp"><a class="anchor" href="#openmp"></a><a class="link" href="#openmp">22.1.3.2. OpenMP</a></h5>
 <div class="paragraph">
 <p>GCC implements the <a href="#openmp">OpenMP</a> threading implementation: <a href="https://stackoverflow.com/questions/3949901/pthreads-vs-openmp" class="bare">https://stackoverflow.com/questions/3949901/pthreads-vs-openmp</a></p>
 </div>
@@ -29964,7 +30013,7 @@ echo 1 &gt; /proc/sys/vm/overcommit_memory
 <p><code>strace</code> shows that OpenMP makes <code>clone()</code> syscalls in Linux. TODO: does it actually call <code>pthread_</code> functions, or does it make syscalls directly? Or in other words, can it work on <a href="#freestanding-programs">Freestanding programs</a>? A quick grep shows many references to pthreads.</p>
 </div>
 <div class="sect5">
-<h6 id="openmp-validation"><a class="anchor" href="#openmp-validation"></a><a class="link" href="#openmp-validation">21.1.3.2.1. OpenMP validation</a></h6>
+<h6 id="openmp-validation"><a class="anchor" href="#openmp-validation"></a><a class="link" href="#openmp-validation">22.1.3.2.1. OpenMP validation</a></h6>
 <div class="paragraph">
 <p><a href="https://github.com/uhhpctools/omp-validation" class="bare">https://github.com/uhhpctools/omp-validation</a></p>
 </div>
@@ -30062,7 +30111,7 @@ mkdir -p bin/c
 </div>
 </div>
 <div class="sect2">
-<h3 id="cpp"><a class="anchor" href="#cpp"></a><a class="link" href="#cpp">21.2. C++</a></h3>
+<h3 id="cpp"><a class="anchor" href="#cpp"></a><a class="link" href="#cpp">22.2. C++</a></h3>
 <div class="paragraph">
 <p>Programs under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/cpp/">userland/cpp/</a> are examples of <a href="https://en.wikipedia.org/wiki/C%2B%2B#Standardization">ISO C++</a> programming.</p>
 </div>
@@ -30186,7 +30235,7 @@ mkdir -p bin/c
 </ul>
 </div>
 <div class="sect3">
-<h4 id="cpp-initialization-types"><a class="anchor" href="#cpp-initialization-types"></a><a class="link" href="#cpp-initialization-types">21.2.1. C++ initialization types</a></h4>
+<h4 id="cpp-initialization-types"><a class="anchor" href="#cpp-initialization-types"></a><a class="link" href="#cpp-initialization-types">22.2.1. C++ initialization types</a></h4>
 <div class="paragraph">
 <p>OMG this is hell, understand when primitive variables are initialized or not:</p>
 </div>
@@ -30234,7 +30283,7 @@ mkdir -p bin/c
 </div>
 </div>
 <div class="sect3">
-<h4 id="cpp-multithreading"><a class="anchor" href="#cpp-multithreading"></a><a class="link" href="#cpp-multithreading">21.2.2. C++ multithreading</a></h4>
+<h4 id="cpp-multithreading"><a class="anchor" href="#cpp-multithreading"></a><a class="link" href="#cpp-multithreading">22.2.2. C++ multithreading</a></h4>
 <div class="ulist">
 <ul>
 <li>
@@ -30262,7 +30311,7 @@ mkdir -p bin/c
 </ul>
 </div>
 <div class="sect4">
-<h5 id="atomic-cpp"><a class="anchor" href="#atomic-cpp"></a><a class="link" href="#atomic-cpp">21.2.2.1. atomic.cpp</a></h5>
+<h5 id="atomic-cpp"><a class="anchor" href="#atomic-cpp"></a><a class="link" href="#atomic-cpp">22.2.2.1. atomic.cpp</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/cpp/atomic/">userland/cpp/atomic/</a></p>
 </div>
@@ -30465,7 +30514,7 @@ time ./mutex.out 4 100000000</pre>
 </ul>
 </div>
 <div class="sect5">
-<h6 id="detailed-gem5-analysis-of-how-data-races-happen"><a class="anchor" href="#detailed-gem5-analysis-of-how-data-races-happen"></a><a class="link" href="#detailed-gem5-analysis-of-how-data-races-happen">21.2.2.1.1. Detailed gem5 analysis of how data races happen</a></h6>
+<h6 id="detailed-gem5-analysis-of-how-data-races-happen"><a class="anchor" href="#detailed-gem5-analysis-of-how-data-races-happen"></a><a class="link" href="#detailed-gem5-analysis-of-how-data-races-happen">22.2.2.1.1. Detailed gem5 analysis of how data races happen</a></h6>
 <div class="paragraph">
 <p>The smallest data race we managed to come up as of LKMC 7c01b29f1ee7da878c7cc9cb4565f3f3cf516a92 and gem5 872cb227fdc0b4d60acc7840889d567a6936b6e1 was with <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/c/atomic.c">userland/c/atomic.c</a> (see also <a href="#c-multithreading">C multithreading</a>):</p>
 </div>
@@ -30570,7 +30619,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="cpp-memory-order"><a class="anchor" href="#cpp-memory-order"></a><a class="link" href="#cpp-memory-order">21.2.2.2. C++ std::memory_order</a></h5>
+<h5 id="cpp-memory-order"><a class="anchor" href="#cpp-memory-order"></a><a class="link" href="#cpp-memory-order">22.2.2.2. C++ std::memory_order</a></h5>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/12346487/what-do-each-memory-order-mean" class="bare">https://stackoverflow.com/questions/12346487/what-do-each-memory-order-mean</a></p>
 </div>
@@ -30582,7 +30631,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="cpp-parallel-algorithms"><a class="anchor" href="#cpp-parallel-algorithms"></a><a class="link" href="#cpp-parallel-algorithms">21.2.2.3. C++ parallel algorithms</a></h5>
+<h5 id="cpp-parallel-algorithms"><a class="anchor" href="#cpp-parallel-algorithms"></a><a class="link" href="#cpp-parallel-algorithms">22.2.2.3. C++ parallel algorithms</a></h5>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/51031060/are-c17-parallel-algorithms-implemented-already/55989883#55989883" class="bare">https://stackoverflow.com/questions/51031060/are-c17-parallel-algorithms-implemented-already/55989883#55989883</a></p>
 </div>
@@ -30592,7 +30641,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="cpp-standards"><a class="anchor" href="#cpp-standards"></a><a class="link" href="#cpp-standards">21.2.3. C++ standards</a></h4>
+<h4 id="cpp-standards"><a class="anchor" href="#cpp-standards"></a><a class="link" href="#cpp-standards">22.2.3. C++ standards</a></h4>
 <div class="paragraph">
 <p>Like for C, you have to pay for the standards&#8230;&#8203; insane. So we just use the closest free drafts instead.</p>
 </div>
@@ -30600,14 +30649,14 @@ non-atomic 19</pre>
 <p><a href="https://stackoverflow.com/questions/81656/where-do-i-find-the-current-c-or-c-standard-documents" class="bare">https://stackoverflow.com/questions/81656/where-do-i-find-the-current-c-or-c-standard-documents</a></p>
 </div>
 <div class="sect4">
-<h5 id="cpp17"><a class="anchor" href="#cpp17"></a><a class="link" href="#cpp17">21.2.3.1. C++17 N4659 standards draft</a></h5>
+<h5 id="cpp17"><a class="anchor" href="#cpp17"></a><a class="link" href="#cpp17">22.2.3.1. C++17 N4659 standards draft</a></h5>
 <div class="paragraph">
 <p><a href="http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2017/n4659.pdf" class="bare">http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2017/n4659.pdf</a></p>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="cpp-type-casting"><a class="anchor" href="#cpp-type-casting"></a><a class="link" href="#cpp-type-casting">21.2.4. C++ type casting</a></h4>
+<h4 id="cpp-type-casting"><a class="anchor" href="#cpp-type-casting"></a><a class="link" href="#cpp-type-casting">22.2.4. C++ type casting</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/cpp/static_dynamic_reinterpret_cast.cpp">userland/cpp/static_dynamic_reinterpret_cast.cpp</a></p>
 </div>
@@ -30617,7 +30666,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="posix"><a class="anchor" href="#posix"></a><a class="link" href="#posix">21.3. POSIX</a></h3>
+<h3 id="posix"><a class="anchor" href="#posix"></a><a class="link" href="#posix">22.3. POSIX</a></h3>
 <div class="paragraph">
 <p>Programs under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/posix/">userland/posix/</a> are examples of POSIX C programming.</p>
 </div>
@@ -30635,13 +30684,13 @@ non-atomic 19</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="environment-variables"><a class="anchor" href="#environment-variables"></a><a class="link" href="#environment-variables">21.3.1. Environment variables</a></h4>
+<h4 id="environment-variables"><a class="anchor" href="#environment-variables"></a><a class="link" href="#environment-variables">22.3.1. Environment variables</a></h4>
 <div class="paragraph">
 <p>POSIX C example that prints all environment variables: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/posix/environ.c">userland/posix/environ.c</a></p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="unistd-h"><a class="anchor" href="#unistd-h"></a><a class="link" href="#unistd-h">21.3.2. unistd.h</a></h4>
+<h4 id="unistd-h"><a class="anchor" href="#unistd-h"></a><a class="link" href="#unistd-h">22.3.2. unistd.h</a></h4>
 <div class="ulist">
 <ul>
 <li>
@@ -30654,7 +30703,7 @@ non-atomic 19</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="fork"><a class="anchor" href="#fork"></a><a class="link" href="#fork">21.3.3. fork</a></h4>
+<h4 id="fork"><a class="anchor" href="#fork"></a><a class="link" href="#fork">22.3.3. fork</a></h4>
 <div class="paragraph">
 <p>POSIX' multiprocess API. Contrast with <a href="#pthreads">pthreads</a> which are for threads.</p>
 </div>
@@ -30679,7 +30728,7 @@ fork() return = 13039</pre>
 <p>Read the source comments and understand everything that is going on!</p>
 </div>
 <div class="sect4">
-<h5 id="getpid"><a class="anchor" href="#getpid"></a><a class="link" href="#getpid">21.3.3.1. getpid</a></h5>
+<h5 id="getpid"><a class="anchor" href="#getpid"></a><a class="link" href="#getpid">22.3.3.1. getpid</a></h5>
 <div class="paragraph">
 <p>The minimal interesting example is to use fork and observe different PIDs.</p>
 </div>
@@ -30691,7 +30740,7 @@ fork() return = 13039</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="fork-bomb"><a class="anchor" href="#fork-bomb"></a><a class="link" href="#fork-bomb">21.3.3.2. Fork bomb</a></h5>
+<h5 id="fork-bomb"><a class="anchor" href="#fork-bomb"></a><a class="link" href="#fork-bomb">22.3.3.2. Fork bomb</a></h5>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Fork_bomb" class="bare">https://en.wikipedia.org/wiki/Fork_bomb</a></p>
 </div>
@@ -30726,7 +30775,7 @@ fork() return = 13039</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="pthreads"><a class="anchor" href="#pthreads"></a><a class="link" href="#pthreads">21.3.4. pthreads</a></h4>
+<h4 id="pthreads"><a class="anchor" href="#pthreads"></a><a class="link" href="#pthreads">22.3.4. pthreads</a></h4>
 <div class="paragraph">
 <p>POSIX' multithreading API. Contrast with <a href="#fork">fork</a> which is for processes.</p>
 </div>
@@ -30750,7 +30799,7 @@ fork() return = 13039</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="pthread-mutex"><a class="anchor" href="#pthread-mutex"></a><a class="link" href="#pthread-mutex">21.3.4.1. pthread_mutex</a></h5>
+<h5 id="pthread-mutex"><a class="anchor" href="#pthread-mutex"></a><a class="link" href="#pthread-mutex">22.3.4.1. pthread_mutex</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/posix/pthread_count.c">userland/posix/pthread_count.c</a> exemplifies the functions:</p>
 </div>
@@ -30787,7 +30836,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="sysconf"><a class="anchor" href="#sysconf"></a><a class="link" href="#sysconf">21.3.5. sysconf</a></h4>
+<h4 id="sysconf"><a class="anchor" href="#sysconf"></a><a class="link" href="#sysconf">22.3.5. sysconf</a></h4>
 <div class="paragraph">
 <p><a href="https://pubs.opengroup.org/onlinepubs/9699919799/functions/sysconf.html" class="bare">https://pubs.opengroup.org/onlinepubs/9699919799/functions/sysconf.html</a></p>
 </div>
@@ -30833,7 +30882,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="mmap-2"><a class="anchor" href="#mmap-2"></a><a class="link" href="#mmap-2">21.3.6. mmap</a></h4>
+<h4 id="mmap-2"><a class="anchor" href="#mmap-2"></a><a class="link" href="#mmap-2">22.3.6. mmap</a></h4>
 <div class="paragraph">
 <p>The mmap system call allows advanced memory operations.</p>
 </div>
@@ -30844,7 +30893,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 <p>Linux adds several non-POSIX extension flags to it.</p>
 </div>
 <div class="sect4">
-<h5 id="mmap-map-anonymous"><a class="anchor" href="#mmap-map-anonymous"></a><a class="link" href="#mmap-map-anonymous">21.3.6.1. mmap MAP_ANONYMOUS</a></h5>
+<h5 id="mmap-map-anonymous"><a class="anchor" href="#mmap-map-anonymous"></a><a class="link" href="#mmap-map-anonymous">22.3.6.1. mmap MAP_ANONYMOUS</a></h5>
 <div class="paragraph">
 <p>Basic <code>mmap</code> example that does the same as <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/c/malloc.c">userland/c/malloc.c</a>, but with <code>mmap</code>.</p>
 </div>
@@ -30862,7 +30911,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect4">
-<h5 id="mmap-file"><a class="anchor" href="#mmap-file"></a><a class="link" href="#mmap-file">21.3.6.2. mmap file</a></h5>
+<h5 id="mmap-file"><a class="anchor" href="#mmap-file"></a><a class="link" href="#mmap-file">22.3.6.2. mmap file</a></h5>
 <div class="paragraph">
 <p>Memory mapped file example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/posix/mmap_file.c">userland/posix/mmap_file.c</a></p>
 </div>
@@ -30874,7 +30923,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect4">
-<h5 id="brk"><a class="anchor" href="#brk"></a><a class="link" href="#brk">21.3.6.3. brk</a></h5>
+<h5 id="brk"><a class="anchor" href="#brk"></a><a class="link" href="#brk">22.3.6.3. brk</a></h5>
 <div class="paragraph">
 <p>Previously part of <a href="#posix">POSIX</a>, but deprecated in favor of <a href="#malloc">malloc</a>.</p>
 </div>
@@ -30890,7 +30939,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="socket"><a class="anchor" href="#socket"></a><a class="link" href="#socket">21.3.7. socket</a></h4>
+<h4 id="socket"><a class="anchor" href="#socket"></a><a class="link" href="#socket">22.3.7. socket</a></h4>
 <div class="paragraph">
 <p>A bit like <code>read</code> and <code>write</code>, but from / to the Internet!</p>
 </div>
@@ -30904,7 +30953,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect2">
-<h3 id="userland-multithreading"><a class="anchor" href="#userland-multithreading"></a><a class="link" href="#userland-multithreading">21.4. Userland multithreading</a></h3>
+<h3 id="userland-multithreading"><a class="anchor" href="#userland-multithreading"></a><a class="link" href="#userland-multithreading">22.4. Userland multithreading</a></h3>
 <div class="paragraph">
 <p>The following sections are related to multithreading in userland:</p>
 </div>
@@ -30966,12 +31015,12 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect2">
-<h3 id="c-debugging"><a class="anchor" href="#c-debugging"></a><a class="link" href="#c-debugging">21.5. C debugging</a></h3>
+<h3 id="c-debugging"><a class="anchor" href="#c-debugging"></a><a class="link" href="#c-debugging">22.5. C debugging</a></h3>
 <div class="paragraph">
 <p>Let&#8217;s group the hard-to-debug undefined-behaviour-like stuff found in C / C++ here, along with how to tackle those problems.</p>
 </div>
 <div class="sect3">
-<h4 id="stack-smashing"><a class="anchor" href="#stack-smashing"></a><a class="link" href="#stack-smashing">21.5.1. Stack smashing</a></h4>
+<h4 id="stack-smashing"><a class="anchor" href="#stack-smashing"></a><a class="link" href="#stack-smashing">22.5.1. Stack smashing</a></h4>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/1345670/stack-smashing-detected/51897264#51897264" class="bare">https://stackoverflow.com/questions/1345670/stack-smashing-detected/51897264#51897264</a></p>
 </div>
@@ -30991,7 +31040,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="memory-leaks"><a class="anchor" href="#memory-leaks"></a><a class="link" href="#memory-leaks">21.5.2. Memory leaks</a></h4>
+<h4 id="memory-leaks"><a class="anchor" href="#memory-leaks"></a><a class="link" href="#memory-leaks">22.5.2. Memory leaks</a></h4>
 <div class="paragraph">
 <p>How to debug: <a href="https://stackoverflow.com/questions/6261201/how-to-find-memory-leak-in-a-c-code-project/57877190#57877190" class="bare">https://stackoverflow.com/questions/6261201/how-to-find-memory-leak-in-a-c-code-project/57877190#57877190</a></p>
 </div>
@@ -31000,7 +31049,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="profiling-userland-programs"><a class="anchor" href="#profiling-userland-programs"></a><a class="link" href="#profiling-userland-programs">21.5.3. Profiling userland programs</a></h4>
+<h4 id="profiling-userland-programs"><a class="anchor" href="#profiling-userland-programs"></a><a class="link" href="#profiling-userland-programs">22.5.3. Profiling userland programs</a></h4>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/375913/how-can-i-profile-c-code-running-on-linux/60265409#60265409" class="bare">https://stackoverflow.com/questions/375913/how-can-i-profile-c-code-running-on-linux/60265409#60265409</a></p>
 </div>
@@ -31020,12 +31069,12 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect2">
-<h3 id="interpreted-languages"><a class="anchor" href="#interpreted-languages"></a><a class="link" href="#interpreted-languages">21.6. Interpreted languages</a></h3>
+<h3 id="interpreted-languages"><a class="anchor" href="#interpreted-languages"></a><a class="link" href="#interpreted-languages">22.6. Interpreted languages</a></h3>
 <div class="paragraph">
 <p>Maybe some day someone will use this setup to study the performance of interpreters.</p>
 </div>
 <div class="sect3">
-<h4 id="python"><a class="anchor" href="#python"></a><a class="link" href="#python">21.6.1. Python</a></h4>
+<h4 id="python"><a class="anchor" href="#python"></a><a class="link" href="#python">22.6.1. Python</a></h4>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -31050,7 +31099,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </ul>
 </div>
 <div class="sect4">
-<h5 id="build-and-install-the-interpreter"><a class="anchor" href="#build-and-install-the-interpreter"></a><a class="link" href="#build-and-install-the-interpreter">21.6.1.1. Build and install the interpreter</a></h5>
+<h5 id="build-and-install-the-interpreter"><a class="anchor" href="#build-and-install-the-interpreter"></a><a class="link" href="#build-and-install-the-interpreter">22.6.1.1. Build and install the interpreter</a></h5>
 <div class="paragraph">
 <p>Buildroot has a Python package that can be added to the guest image:</p>
 </div>
@@ -31109,7 +31158,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect4">
-<h5 id="python-gem5-user-mode-simulation"><a class="anchor" href="#python-gem5-user-mode-simulation"></a><a class="link" href="#python-gem5-user-mode-simulation">21.6.1.2. Python gem5 user mode simulation</a></h5>
+<h5 id="python-gem5-user-mode-simulation"><a class="anchor" href="#python-gem5-user-mode-simulation"></a><a class="link" href="#python-gem5-user-mode-simulation">22.6.1.2. Python gem5 user mode simulation</a></h5>
 <div class="paragraph">
 <p>At LKMC 50ac89b779363774325c81157ec8b9a6bdb50a2f gem5 390a74f59934b85d91489f8a563450d8321b602da:</p>
 </div>
@@ -31159,7 +31208,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect4">
-<h5 id="embedding-python-in-another-application"><a class="anchor" href="#embedding-python-in-another-application"></a><a class="link" href="#embedding-python-in-another-application">21.6.1.3. Embedding Python in another application</a></h5>
+<h5 id="embedding-python-in-another-application"><a class="anchor" href="#embedding-python-in-another-application"></a><a class="link" href="#embedding-python-in-another-application">22.6.1.3. Embedding Python in another application</a></h5>
 <div class="paragraph">
 <p>Here we will add some better examples and explanations for: <a href="https://docs.python.org/3/extending/embedding.html#very-high-level-embedding" class="bare">https://docs.python.org/3/extending/embedding.html#very-high-level-embedding</a></p>
 </div>
@@ -31210,7 +31259,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect4">
-<h5 id="pybind11"><a class="anchor" href="#pybind11"></a><a class="link" href="#pybind11">21.6.1.4. pybind11</a></h5>
+<h5 id="pybind11"><a class="anchor" href="#pybind11"></a><a class="link" href="#pybind11">22.6.1.4. pybind11</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/libs/pybind11">userland/libs/pybind11</a></p>
 </div>
@@ -31233,7 +31282,7 @@ There are no non-locking atomic types or atomic primitives in POSIX: <a href="ht
 </div>
 </div>
 <div class="sect3">
-<h4 id="node-js"><a class="anchor" href="#node-js"></a><a class="link" href="#node-js">21.6.2. Node.js</a></h4>
+<h4 id="node-js"><a class="anchor" href="#node-js"></a><a class="link" href="#node-js">22.6.2. Node.js</a></h4>
 <div class="paragraph">
 <p>Host installation shown at: <a href="https://askubuntu.com/questions/594656/how-to-install-the-latest-versions-of-nodejs-and-npm/971612#971612" class="bare">https://askubuntu.com/questions/594656/how-to-install-the-latest-versions-of-nodejs-and-npm/971612#971612</a></p>
 </div>
@@ -31330,7 +31379,7 @@ my type is MyClassToString and a is 1 and b is 2</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="npm"><a class="anchor" href="#npm"></a><a class="link" href="#npm">21.6.2.1. NPM</a></h5>
+<h5 id="npm"><a class="anchor" href="#npm"></a><a class="link" href="#npm">22.6.2.1. NPM</a></h5>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Npm_(software)" class="bare">https://en.wikipedia.org/wiki/Npm_(software)</a></p>
 </div>
@@ -31349,7 +31398,7 @@ my type is MyClassToString and a is 1 and b is 2</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="npm-data-files"><a class="anchor" href="#npm-data-files"></a><a class="link" href="#npm-data-files">21.6.2.1.1. NPM data-files</a></h6>
+<h6 id="npm-data-files"><a class="anchor" href="#npm-data-files"></a><a class="link" href="#npm-data-files">22.6.2.1.1. NPM data-files</a></h6>
 <div class="paragraph">
 <p>Illustrates how to add extra non-code data files to an NPM package, and then use those files at runtime.</p>
 </div>
@@ -31360,7 +31409,7 @@ my type is MyClassToString and a is 1 and b is 2</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="java"><a class="anchor" href="#java"></a><a class="link" href="#java">21.6.3. Java</a></h4>
+<h4 id="java"><a class="anchor" href="#java"></a><a class="link" href="#java">22.6.3. Java</a></h4>
 <div class="paragraph">
 <p>No OpenJDK package as of 2018.08: <a href="https://stackoverflow.com/questions/28874150/buildroot-with-jamvm-2-0-for-java-8/59290927#59290927" class="bare">https://stackoverflow.com/questions/28874150/buildroot-with-jamvm-2-0-for-java-8/59290927#59290927</a> partly because their build system is shit like the rest of the project&#8217;s setup.</p>
 </div>
@@ -31376,7 +31425,7 @@ my type is MyClassToString and a is 1 and b is 2</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="algorithms"><a class="anchor" href="#algorithms"></a><a class="link" href="#algorithms">21.7. Algorithms</a></h3>
+<h3 id="algorithms"><a class="anchor" href="#algorithms"></a><a class="link" href="#algorithms">22.7. Algorithms</a></h3>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/algorithm">userland/algorithm</a></p>
 </div>
@@ -31536,7 +31585,7 @@ cmp tmp.o tmp.e</pre>
 <p>These are good targets for <a href="#gem5-run-benchmark">performance analysis with gem5</a>, and there is some overlap between this section and <a href="#benchmarks">Benchmarks</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="bst-vs-heap-vs-hashmap"><a class="anchor" href="#bst-vs-heap-vs-hashmap"></a><a class="link" href="#bst-vs-heap-vs-hashmap">21.7.1. BST vs heap vs hashmap</a></h4>
+<h4 id="bst-vs-heap-vs-hashmap"><a class="anchor" href="#bst-vs-heap-vs-hashmap"></a><a class="link" href="#bst-vs-heap-vs-hashmap">22.7.1. BST vs heap vs hashmap</a></h4>
 <div class="paragraph">
 <p>TODO: move benchmark graph from <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/cpp/bst_vs_heap_vs_hashmap.cpp">userland/cpp/bst_vs_heap_vs_hashmap.cpp</a> to <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/algorithm/set">userland/algorithm/set</a>.</p>
 </div>
@@ -31654,7 +31703,7 @@ xdg-open bst_vs_heap_vs_hashmap_gem5.tmp.png</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="blas"><a class="anchor" href="#blas"></a><a class="link" href="#blas">21.7.2. BLAS</a></h4>
+<h4 id="blas"><a class="anchor" href="#blas"></a><a class="link" href="#blas">22.7.2. BLAS</a></h4>
 <div class="paragraph">
 <p>Buildroot supports it, which makes everything just trivial:</p>
 </div>
@@ -31706,7 +31755,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </div>
 </div>
 <div class="sect3">
-<h4 id="eigen"><a class="anchor" href="#eigen"></a><a class="link" href="#eigen">21.7.3. Eigen</a></h4>
+<h4 id="eigen"><a class="anchor" href="#eigen"></a><a class="link" href="#eigen">22.7.3. Eigen</a></h4>
 <div class="paragraph">
 <p>Header only linear algebra library with a mainline Buildroot package:</p>
 </div>
@@ -31745,7 +31794,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </div>
 </div>
 <div class="sect2">
-<h3 id="benchmarks"><a class="anchor" href="#benchmarks"></a><a class="link" href="#benchmarks">21.8. Benchmarks</a></h3>
+<h3 id="benchmarks"><a class="anchor" href="#benchmarks"></a><a class="link" href="#benchmarks">22.8. Benchmarks</a></h3>
 <div class="paragraph">
 <p>These are good targets for <a href="#gem5-run-benchmark">performance analysis with gem5</a>.</p>
 </div>
@@ -31763,7 +31812,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </ul>
 </div>
 <div class="sect3">
-<h4 id="parsec-benchmark"><a class="anchor" href="#parsec-benchmark"></a><a class="link" href="#parsec-benchmark">21.8.1. PARSEC benchmark</a></h4>
+<h4 id="parsec-benchmark"><a class="anchor" href="#parsec-benchmark"></a><a class="link" href="#parsec-benchmark">22.8.1. PARSEC benchmark</a></h4>
 <div class="paragraph">
 <p>We have ported parts of the <a href="http://parsec.cs.princeton.edu">PARSEC benchmark</a> for cross compilation at: <a href="https://github.com/cirosantilli/parsec-benchmark" class="bare">https://github.com/cirosantilli/parsec-benchmark</a>. See the documentation in that repo to find out which benchmarks have been ported. Some of the benchmarks were segfaulting; they are documented in that repo.</p>
 </div>
@@ -31781,7 +31830,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </ul>
 </div>
 <div class="sect4">
-<h5 id="parsec-benchmark-without-parsecmgmt"><a class="anchor" href="#parsec-benchmark-without-parsecmgmt"></a><a class="link" href="#parsec-benchmark-without-parsecmgmt">21.8.1.1. PARSEC benchmark without parsecmgmt</a></h5>
+<h5 id="parsec-benchmark-without-parsecmgmt"><a class="anchor" href="#parsec-benchmark-without-parsecmgmt"></a><a class="link" href="#parsec-benchmark-without-parsecmgmt">22.8.1.1. PARSEC benchmark without parsecmgmt</a></h5>
 <div class="literalblock">
 <div class="content">
 <pre>./build --arch arm --download-dependencies gem5-buildroot parsec-benchmark
@@ -31815,7 +31864,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </div>
 </div>
 <div class="sect4">
-<h5 id="parsec-change-the-input-size"><a class="anchor" href="#parsec-change-the-input-size"></a><a class="link" href="#parsec-change-the-input-size">21.8.1.2. PARSEC change the input size</a></h5>
+<h5 id="parsec-change-the-input-size"><a class="anchor" href="#parsec-change-the-input-size"></a><a class="link" href="#parsec-change-the-input-size">22.8.1.2. PARSEC change the input size</a></h5>
 <div class="paragraph">
 <p>Running a benchmark of a size different than <code>test</code>, e.g. <code>simsmall</code>, requires a rebuild with:</p>
 </div>
@@ -31879,7 +31928,7 @@ cblas_dgemm(      CblasColMajor, CblasNoTrans, CblasTrans,3,3,2  ,1,    A,3,  B,
 </div>
 </div>
 <div class="sect4">
-<h5 id="parsec-benchmark-with-parsecmgmt"><a class="anchor" href="#parsec-benchmark-with-parsecmgmt"></a><a class="link" href="#parsec-benchmark-with-parsecmgmt">21.8.1.3. PARSEC benchmark with parsecmgmt</a></h5>
+<h5 id="parsec-benchmark-with-parsecmgmt"><a class="anchor" href="#parsec-benchmark-with-parsecmgmt"></a><a class="link" href="#parsec-benchmark-with-parsecmgmt">22.8.1.3. PARSEC benchmark with parsecmgmt</a></h5>
 <div class="paragraph">
 <p>Most users won&#8217;t want to use this method because:</p>
 </div>
@@ -31942,9 +31991,9 @@ parsecmgmt -a run -p splash2x.fmm -i test</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="parsec-uninstall"><a class="anchor" href="#parsec-uninstall"></a><a class="link" href="#parsec-uninstall">21.8.1.4. PARSEC uninstall</a></h5>
+<h5 id="parsec-uninstall"><a class="anchor" href="#parsec-uninstall"></a><a class="link" href="#parsec-uninstall">22.8.1.4. PARSEC uninstall</a></h5>
 <div class="paragraph">
-<p>If you want to remove PARSEC later, Buildroot doesn&#8217;t provide an automated package removal mechanism as mentioned at: <a href="#remove-buildroot-packages">Section 20.6, &#8220;Remove Buildroot packages&#8221;</a>, but the following procedure should be satisfactory:</p>
+<p>If you want to remove PARSEC later, Buildroot doesn&#8217;t provide an automated package removal mechanism as mentioned at: <a href="#remove-buildroot-packages">Section 21.6, &#8220;Remove Buildroot packages&#8221;</a>, but the following procedure should be satisfactory:</p>
 </div>
 <div class="literalblock">
 <div class="content">
@@ -31960,7 +32009,7 @@ parsecmgmt -a run -p splash2x.fmm -i test</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="parsec-benchmark-hacking"><a class="anchor" href="#parsec-benchmark-hacking"></a><a class="link" href="#parsec-benchmark-hacking">21.8.1.5. PARSEC benchmark hacking</a></h5>
+<h5 id="parsec-benchmark-hacking"><a class="anchor" href="#parsec-benchmark-hacking"></a><a class="link" href="#parsec-benchmark-hacking">22.8.1.5. PARSEC benchmark hacking</a></h5>
 <div class="paragraph">
 <p>If you end up going inside <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/submodules/parsec-benchmark">submodules/parsec-benchmark</a> to hack up the benchmark (you will!), these tips will be helpful.</p>
 </div>
@@ -32012,7 +32061,7 @@ git clean -xdf .</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="coremark"><a class="anchor" href="#coremark"></a><a class="link" href="#coremark">21.8.1.6. Coremark</a></h5>
+<h5 id="coremark"><a class="anchor" href="#coremark"></a><a class="link" href="#coremark">22.8.1.6. Coremark</a></h5>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Coremark" class="bare">https://en.wikipedia.org/wiki/Coremark</a></p>
 </div>
@@ -32225,7 +32274,7 @@ RUN_FLAGS =</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="microbenchmarks"><a class="anchor" href="#microbenchmarks"></a><a class="link" href="#microbenchmarks">21.8.2. Microbenchmarks</a></h4>
+<h4 id="microbenchmarks"><a class="anchor" href="#microbenchmarks"></a><a class="link" href="#microbenchmarks">22.8.2. Microbenchmarks</a></h4>
 <div class="paragraph">
 <p>It eventually has to come to that, hasn&#8217;t it?</p>
 </div>
@@ -32262,7 +32311,7 @@ RUN_FLAGS =</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="dhrystone"><a class="anchor" href="#dhrystone"></a><a class="link" href="#dhrystone">21.8.2.1. Dhrystone</a></h5>
+<h5 id="dhrystone"><a class="anchor" href="#dhrystone"></a><a class="link" href="#dhrystone">22.8.2.1. Dhrystone</a></h5>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Dhrystone" class="bare">https://en.wikipedia.org/wiki/Dhrystone</a></p>
 </div>
@@ -32379,7 +32428,7 @@ Dhrystones per Second:                      16152479.0</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="lmbench"><a class="anchor" href="#lmbench"></a><a class="link" href="#lmbench">21.8.2.2. LMbench</a></h5>
+<h5 id="lmbench"><a class="anchor" href="#lmbench"></a><a class="link" href="#lmbench">22.8.2.2. LMbench</a></h5>
 <div class="paragraph">
 <p><a href="http://www.bitmover.com/lmbench/" class="bare">http://www.bitmover.com/lmbench/</a></p>
 </div>
@@ -32497,7 +32546,7 @@ make</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="stream-benchmark"><a class="anchor" href="#stream-benchmark"></a><a class="link" href="#stream-benchmark">21.8.2.3. STREAM benchmark</a></h5>
+<h5 id="stream-benchmark"><a class="anchor" href="#stream-benchmark"></a><a class="link" href="#stream-benchmark">22.8.2.3. STREAM benchmark</a></h5>
 <div class="paragraph">
 <p><a href="http://www.cs.virginia.edu/stream/ref.html" class="bare">http://www.cs.virginia.edu/stream/ref.html</a></p>
 </div>
@@ -32623,7 +32672,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </div>
 <div class="sect2">
-<h3 id="userland-libs-directory"><a class="anchor" href="#userland-libs-directory"></a><a class="link" href="#userland-libs-directory">21.9. userland/libs directory</a></h3>
+<h3 id="userland-libs-directory"><a class="anchor" href="#userland-libs-directory"></a><a class="link" href="#userland-libs-directory">22.9. userland/libs directory</a></h3>
 <div class="paragraph">
 <p>Tests under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/libs">userland/libs</a> require certain optional libraries to be installed on the target, and are not built or tested by default. You must enable them with either:</p>
 </div>
@@ -32637,7 +32686,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 <p>See for example <a href="#blas">BLAS</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="boost"><a class="anchor" href="#boost"></a><a class="link" href="#boost">21.9.1. Boost</a></h4>
+<h4 id="boost"><a class="anchor" href="#boost"></a><a class="link" href="#boost">22.9.1. Boost</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Boost_(C%2B%2B_libraries)" class="bare">https://en.wikipedia.org/wiki/Boost_(C%2B%2B_libraries)</a></p>
 </div>
@@ -32653,7 +32702,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </div>
 <div class="sect3">
-<h4 id="hdf5"><a class="anchor" href="#hdf5"></a><a class="link" href="#hdf5">21.9.2. HDF5</a></h4>
+<h4 id="hdf5"><a class="anchor" href="#hdf5"></a><a class="link" href="#hdf5">22.9.2. HDF5</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Hierarchical_Data_Format" class="bare">https://en.wikipedia.org/wiki/Hierarchical_Data_Format</a></p>
 </div>
@@ -32676,7 +32725,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </div>
 <div class="sect2">
-<h3 id="userland-content-filename-conventions"><a class="anchor" href="#userland-content-filename-conventions"></a><a class="link" href="#userland-content-filename-conventions">21.10. Userland content filename conventions</a></h3>
+<h3 id="userland-content-filename-conventions"><a class="anchor" href="#userland-content-filename-conventions"></a><a class="link" href="#userland-content-filename-conventions">22.10. Userland content filename conventions</a></h3>
 <div class="paragraph">
 <p>The following basenames should always refer to programs that do the same thing, but in different languages:</p>
 </div>
@@ -32705,7 +32754,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </div>
 <div class="sect2">
-<h3 id="userland-content-bibliography"><a class="anchor" href="#userland-content-bibliography"></a><a class="link" href="#userland-content-bibliography">21.11. Userland content bibliography</a></h3>
+<h3 id="userland-content-bibliography"><a class="anchor" href="#userland-content-bibliography"></a><a class="link" href="#userland-content-bibliography">22.11. Userland content bibliography</a></h3>
 <div class="ulist">
 <ul>
 <li>
@@ -32717,7 +32766,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </div>
 <div class="sect1">
-<h2 id="userland-assembly"><a class="anchor" href="#userland-assembly"></a><a class="link" href="#userland-assembly">22. Userland assembly</a></h2>
+<h2 id="userland-assembly"><a class="anchor" href="#userland-assembly"></a><a class="link" href="#userland-assembly">23. Userland assembly</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>Programs under <code>userland/arch/&lt;arch&gt;/</code> are examples of userland assembly programming.</p>
@@ -32818,7 +32867,7 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
 </div>
 </li>
 <li>
-<p>registers, see: <a href="#assembly-registers">Section 22.1, &#8220;Assembly registers&#8221;</a></p>
+<p>registers, see: <a href="#assembly-registers">Section 23.1, &#8220;Assembly registers&#8221;</a></p>
 </li>
 <li>
 <p>jumping:</p>
@@ -32961,14 +33010,14 @@ error: asm_main returned 1 at line 8</pre>
 </ul>
 </div>
 <div class="sect2">
-<h3 id="assembly-registers"><a class="anchor" href="#assembly-registers"></a><a class="link" href="#assembly-registers">22.1. Assembly registers</a></h3>
+<h3 id="assembly-registers"><a class="anchor" href="#assembly-registers"></a><a class="link" href="#assembly-registers">23.1. Assembly registers</a></h3>
 <div class="paragraph">
 <p>After seeing an <a href="#userland-assembly">ADD hello world</a>, you need to learn the general registers:</p>
 </div>
 <div class="ulist">
 <ul>
 <li>
-<p>x86, see: <a href="#x86-registers">Section 23.1, &#8220;x86 registers&#8221;</a></p>
+<p>x86, see: <a href="#x86-registers">Section 24.1, &#8220;x86 registers&#8221;</a></p>
 </li>
 <li>
 <p>arm</p>
@@ -32999,7 +33048,7 @@ error: asm_main returned 1 at line 8</pre>
 <p>Bibliography: <a href="#armarm7">ARMv7 architecture reference manual</a> A2.3 "ARM core registers".</p>
 </div>
 <div class="sect3">
-<h4 id="armv8-aarch64-x31-register"><a class="anchor" href="#armv8-aarch64-x31-register"></a><a class="link" href="#armv8-aarch64-x31-register">22.1.1. ARMv8 aarch64 x31 register</a></h4>
+<h4 id="armv8-aarch64-x31-register"><a class="anchor" href="#armv8-aarch64-x31-register"></a><a class="link" href="#armv8-aarch64-x31-register">23.1.1. ARMv8 aarch64 x31 register</a></h4>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/x31.S">userland/arch/aarch64/x31.S</a></p>
 </div>
@@ -33084,7 +33133,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 </div>
 </div>
 <div class="sect2">
-<h3 id="floating-point-assembly"><a class="anchor" href="#floating-point-assembly"></a><a class="link" href="#floating-point-assembly">22.2. Floating point assembly</a></h3>
+<h3 id="floating-point-assembly"><a class="anchor" href="#floating-point-assembly"></a><a class="link" href="#floating-point-assembly">23.2. Floating point assembly</a></h3>
 <div class="paragraph">
 <p>Keep in mind that many ISAs started floating point as an optional thing, and it later got better integrated into the main CPU, side by side with SIMD.</p>
 </div>
@@ -33126,7 +33175,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 </div>
 </div>
 <div class="sect2">
-<h3 id="simd-assembly"><a class="anchor" href="#simd-assembly"></a><a class="link" href="#simd-assembly">22.3. SIMD assembly</a></h3>
+<h3 id="simd-assembly"><a class="anchor" href="#simd-assembly"></a><a class="link" href="#simd-assembly">23.3. SIMD assembly</a></h3>
 <div class="paragraph">
 <p>Much like ADD for non-SIMD, start learning SIMD instructions by looking at the integer and floating point SIMD ADD instructions of each ISA:</p>
 </div>
@@ -33216,14 +33265,14 @@ When instructions do not interpret this operand encoding as the zero register, u
 <p>Bibliography: <a href="https://stackoverflow.com/questions/1389712/getting-started-with-intel-x86-sse-simd-instructions/56409539#56409539" class="bare">https://stackoverflow.com/questions/1389712/getting-started-with-intel-x86-sse-simd-instructions/56409539#56409539</a></p>
 </div>
 <div class="sect3">
-<h4 id="fma-instruction"><a class="anchor" href="#fma-instruction"></a><a class="link" href="#fma-instruction">22.3.1. FMA instruction</a></h4>
+<h4 id="fma-instruction"><a class="anchor" href="#fma-instruction"></a><a class="link" href="#fma-instruction">23.3.1. FMA instruction</a></h4>
 <div class="paragraph">
 <p>Fused multiply add:</p>
 </div>
 <div class="ulist">
 <ul>
 <li>
-<p>x86: <a href="#x86-fma">Section 23.12.3, &#8220;x86 fused multiply add (FMA)&#8221;</a></p>
+<p>x86: <a href="#x86-fma">Section 24.12.3, &#8220;x86 fused multiply add (FMA)&#8221;</a></p>
 </li>
 </ul>
 </div>
@@ -33265,7 +33314,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 </div>
 </div>
 <div class="sect2">
-<h3 id="user-vs-system-assembly"><a class="anchor" href="#user-vs-system-assembly"></a><a class="link" href="#user-vs-system-assembly">22.4. User vs system assembly</a></h3>
+<h3 id="user-vs-system-assembly"><a class="anchor" href="#user-vs-system-assembly"></a><a class="link" href="#user-vs-system-assembly">23.4. User vs system assembly</a></h3>
 <div class="paragraph">
 <p>By "userland assembly", we mean "the parts of the ISA which can be freely used from userland".</p>
 </div>
@@ -33276,7 +33325,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 <p>One big difference between both is that we can run userland assembly on <a href="#userland-setup">Userland setup</a>, which is easier to get running and debug.</p>
 </div>
 <div class="paragraph">
-<p>In particular, most userland assembly examples link to the C standard library, see: <a href="#userland-assembly-c-standard-library">Section 22.5, &#8220;Userland assembly C standard library&#8221;</a>.</p>
+<p>In particular, most userland assembly examples link to the C standard library, see: <a href="#userland-assembly-c-standard-library">Section 23.5, &#8220;Userland assembly C standard library&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Userland assembly is generally simpler, and a pre-requisite for <a href="#baremetal-setup">Baremetal setup</a>.</p>
@@ -33286,7 +33335,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 </div>
 </div>
 <div class="sect2">
-<h3 id="userland-assembly-c-standard-library"><a class="anchor" href="#userland-assembly-c-standard-library"></a><a class="link" href="#userland-assembly-c-standard-library">22.5. Userland assembly C standard library</a></h3>
+<h3 id="userland-assembly-c-standard-library"><a class="anchor" href="#userland-assembly-c-standard-library"></a><a class="link" href="#userland-assembly-c-standard-library">23.5. Userland assembly C standard library</a></h3>
 <div class="paragraph">
 <p>All examples except the <a href="#freestanding-programs">Freestanding programs</a> link to the C standard library.</p>
 </div>
@@ -33319,7 +33368,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 </ul>
 </div>
 <div class="sect3">
-<h4 id="freestanding-programs"><a class="anchor" href="#freestanding-programs"></a><a class="link" href="#freestanding-programs">22.5.1. Freestanding programs</a></h4>
+<h4 id="freestanding-programs"><a class="anchor" href="#freestanding-programs"></a><a class="link" href="#freestanding-programs">23.5.1. Freestanding programs</a></h4>
 <div class="paragraph">
 <p>Unlike most of our other assembly examples, which use the C standard library for portability, examples under <code>freestanding/</code> directories don&#8217;t link to the C standard library:</p>
 </div>
@@ -33374,7 +33423,7 @@ When instructions do not interpret this operand encoding as the zero register, u
 <p>This is analogous to <a href="#baremetal-gdb-step-debug">step debugging baremetal examples</a>.</p>
 </div>
 <div class="sect4">
-<h5 id="nostartfiles-programs"><a class="anchor" href="#nostartfiles-programs"></a><a class="link" href="#nostartfiles-programs">22.5.1.1. nostartfiles programs</a></h5>
+<h5 id="nostartfiles-programs"><a class="anchor" href="#nostartfiles-programs"></a><a class="link" href="#nostartfiles-programs">23.5.1.1. nostartfiles programs</a></h5>
 <div class="paragraph">
 <p>Assembly examples under <code>nostartfiles</code> directories can use the standard library, but they don&#8217;t use the pre-<code>main</code> boilerplate and start directly at our explicitly given <code>_start</code>:</p>
 </div>
@@ -33457,7 +33506,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </div>
 </div>
 <div class="sect2">
-<h3 id="gcc-inline-assembly"><a class="anchor" href="#gcc-inline-assembly"></a><a class="link" href="#gcc-inline-assembly">22.6. GCC inline assembly</a></h3>
+<h3 id="gcc-inline-assembly"><a class="anchor" href="#gcc-inline-assembly"></a><a class="link" href="#gcc-inline-assembly">23.6. GCC inline assembly</a></h3>
 <div class="paragraph">
 <p>Examples under <code>arch/&lt;arch&gt;/c/</code> directories show how to use inline assembly from higher-level languages such as C:</p>
 </div>
@@ -33520,7 +33569,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </ul>
 </div>
 <div class="sect3">
-<h4 id="gcc-inline-assembly-register-variables"><a class="anchor" href="#gcc-inline-assembly-register-variables"></a><a class="link" href="#gcc-inline-assembly-register-variables">22.6.1. GCC inline assembly register variables</a></h4>
+<h4 id="gcc-inline-assembly-register-variables"><a class="anchor" href="#gcc-inline-assembly-register-variables"></a><a class="link" href="#gcc-inline-assembly-register-variables">23.6.1. GCC inline assembly register variables</a></h4>
 <div class="paragraph">
 <p>Used notably in some of the <a href="#linux-system-calls">Linux system calls</a> setups:</p>
 </div>
@@ -33544,14 +33593,14 @@ Is it any easy to determine which functions I can use or not, in case there are
 <p>In arm, it is the only way to achieve this effect: <a href="https://stackoverflow.com/questions/10831792/how-to-use-specific-register-in-arm-inline-assembler" class="bare">https://stackoverflow.com/questions/10831792/how-to-use-specific-register-in-arm-inline-assembler</a></p>
 </div>
 <div class="paragraph">
-<p>This feature notably useful for making system calls from C, see: <a href="#linux-system-calls">Section 22.7, &#8220;Linux system calls&#8221;</a>.</p>
+<p>This feature is notably useful for making system calls from C, see: <a href="#linux-system-calls">Section 23.7, &#8220;Linux system calls&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Documentation: <a href="https://gcc.gnu.org/onlinedocs/gcc-4.4.2/gcc/Explicit-Reg-Vars.html" class="bare">https://gcc.gnu.org/onlinedocs/gcc-4.4.2/gcc/Explicit-Reg-Vars.html</a></p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="gcc-inline-assembly-scratch-registers"><a class="anchor" href="#gcc-inline-assembly-scratch-registers"></a><a class="link" href="#gcc-inline-assembly-scratch-registers">22.6.2. GCC inline assembly scratch registers</a></h4>
+<h4 id="gcc-inline-assembly-scratch-registers"><a class="anchor" href="#gcc-inline-assembly-scratch-registers"></a><a class="link" href="#gcc-inline-assembly-scratch-registers">23.6.2. GCC inline assembly scratch registers</a></h4>
 <div class="paragraph">
 <p>How to use temporary registers in inline assembly:</p>
 </div>
@@ -33577,7 +33626,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </div>
 </div>
 <div class="sect3">
-<h4 id="gcc-inline-assembly-early-clobbers"><a class="anchor" href="#gcc-inline-assembly-early-clobbers"></a><a class="link" href="#gcc-inline-assembly-early-clobbers">22.6.3. GCC inline assembly early-clobbers</a></h4>
+<h4 id="gcc-inline-assembly-early-clobbers"><a class="anchor" href="#gcc-inline-assembly-early-clobbers"></a><a class="link" href="#gcc-inline-assembly-early-clobbers">23.6.3. GCC inline assembly early-clobbers</a></h4>
 <div class="paragraph">
 <p>An example of using the <code>&amp;</code> early-clobber modifier: link:userland/arch/aarch64/earlyclobber.c</p>
 </div>
@@ -33589,7 +33638,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </div>
 </div>
 <div class="sect3">
-<h4 id="gcc-inline-assembly-floating-point-arm"><a class="anchor" href="#gcc-inline-assembly-floating-point-arm"></a><a class="link" href="#gcc-inline-assembly-floating-point-arm">22.6.4. GCC inline assembly floating point ARM</a></h4>
+<h4 id="gcc-inline-assembly-floating-point-arm"><a class="anchor" href="#gcc-inline-assembly-floating-point-arm"></a><a class="link" href="#gcc-inline-assembly-floating-point-arm">23.6.4. GCC inline assembly floating point ARM</a></h4>
 <div class="paragraph">
 <p>Not documented as of GCC 8.2, but possible: <a href="https://stackoverflow.com/questions/53960240/armv8-floating-point-output-inline-assembly" class="bare">https://stackoverflow.com/questions/53960240/armv8-floating-point-output-inline-assembly</a></p>
 </div>
@@ -33605,7 +33654,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </div>
 </div>
 <div class="sect3">
-<h4 id="gcc-intrinsics"><a class="anchor" href="#gcc-intrinsics"></a><a class="link" href="#gcc-intrinsics">22.6.5. GCC intrinsics</a></h4>
+<h4 id="gcc-intrinsics"><a class="anchor" href="#gcc-intrinsics"></a><a class="link" href="#gcc-intrinsics">23.6.5. GCC intrinsics</a></h4>
 <div class="paragraph">
 <p>Pre-existing C wrappers using inline assembly. This is what production programs should use instead of inline assembly for SIMD:</p>
 </div>
@@ -33627,7 +33676,7 @@ Is it any easy to determine which functions I can use or not, in case there are
 </ul>
 </div>
 <div class="sect4">
-<h5 id="gcc-x86-intrinsics"><a class="anchor" href="#gcc-x86-intrinsics"></a><a class="link" href="#gcc-x86-intrinsics">22.6.5.1. GCC x86 intrinsics</a></h5>
+<h5 id="gcc-x86-intrinsics"><a class="anchor" href="#gcc-x86-intrinsics"></a><a class="link" href="#gcc-x86-intrinsics">23.6.5.1. GCC x86 intrinsics</a></h5>
 <div class="paragraph">
 <p>Good official cheatsheet with all intrinsics and what they expand to: <a href="https://software.intel.com/sites/landingpage/IntrinsicsGuide" class="bare">https://software.intel.com/sites/landingpage/IntrinsicsGuide</a></p>
 </div>
@@ -33755,7 +33804,7 @@ zmmintrin.h AVX512</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="linux-system-calls"><a class="anchor" href="#linux-system-calls"></a><a class="link" href="#linux-system-calls">22.7. Linux system calls</a></h3>
+<h3 id="linux-system-calls"><a class="anchor" href="#linux-system-calls"></a><a class="link" href="#linux-system-calls">23.7. Linux system calls</a></h3>
 <div class="paragraph">
 <p>The following <a href="#userland-setup">Userland setup</a> programs illustrate how to make system calls:</p>
 </div>
@@ -33854,7 +33903,7 @@ zmmintrin.h AVX512</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="futex-system-call"><a class="anchor" href="#futex-system-call"></a><a class="link" href="#futex-system-call">22.7.1. futex system call</a></h4>
+<h4 id="futex-system-call"><a class="anchor" href="#futex-system-call"></a><a class="link" href="#futex-system-call">23.7.1. futex system call</a></h4>
 <div class="paragraph">
 <p>This is how threads either:</p>
 </div>
@@ -33916,7 +33965,7 @@ child after parent sleep</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="userland-mutex-implementation"><a class="anchor" href="#userland-mutex-implementation"></a><a class="link" href="#userland-mutex-implementation">22.7.1.1. Userland mutex implementation</a></h5>
+<h5 id="userland-mutex-implementation"><a class="anchor" href="#userland-mutex-implementation"></a><a class="link" href="#userland-mutex-implementation">23.7.1.1. Userland mutex implementation</a></h5>
 <div class="paragraph">
 <p>The best article to understand spinlocks is: <a href="https://eli.thegreenplace.net/2018/basics-of-futexes/" class="bare">https://eli.thegreenplace.net/2018/basics-of-futexes/</a></p>
 </div>
@@ -33926,7 +33975,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="getcpu"><a class="anchor" href="#getcpu"></a><a class="link" href="#getcpu">22.7.2. <code>getcpu</code> system call and the <code>sched_getaffinity</code> glibc wrapper</a></h4>
+<h4 id="getcpu"><a class="anchor" href="#getcpu"></a><a class="link" href="#getcpu">23.7.2. <code>getcpu</code> system call and the <code>sched_getaffinity</code> glibc wrapper</a></h4>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -34001,7 +34050,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="linux-calling-conventions"><a class="anchor" href="#linux-calling-conventions"></a><a class="link" href="#linux-calling-conventions">22.8. Linux calling conventions</a></h3>
+<h3 id="linux-calling-conventions"><a class="anchor" href="#linux-calling-conventions"></a><a class="link" href="#linux-calling-conventions">23.8. Linux calling conventions</a></h3>
 <div class="paragraph">
 <p>A summary of results is shown at: <a href="#table-linux-calling-conventions">Table 3, &#8220;Summary of Linux calling conventions for several architectures&#8221;</a>.</p>
 </div>
@@ -34043,7 +34092,7 @@ child after parent sleep</pre>
 </tbody>
 </table>
 <div class="sect3">
-<h4 id="x86_64-calling-convention"><a class="anchor" href="#x86_64-calling-convention"></a><a class="link" href="#x86_64-calling-convention">22.8.1. x86_64 calling convention</a></h4>
+<h4 id="x86_64-calling-convention"><a class="anchor" href="#x86_64-calling-convention"></a><a class="link" href="#x86_64-calling-convention">23.8.1. x86_64 calling convention</a></h4>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -34072,7 +34121,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-calling-convention"><a class="anchor" href="#arm-calling-convention"></a><a class="link" href="#arm-calling-convention">22.8.2. ARM calling convention</a></h4>
+<h4 id="arm-calling-convention"><a class="anchor" href="#arm-calling-convention"></a><a class="link" href="#arm-calling-convention">23.8.2. ARM calling convention</a></h4>
 <div class="paragraph">
 <p>Call C standard library functions from assembly and vice versa.</p>
 </div>
@@ -34134,7 +34183,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="gnu-gas-assembler"><a class="anchor" href="#gnu-gas-assembler"></a><a class="link" href="#gnu-gas-assembler">22.9. GNU GAS assembler</a></h3>
+<h3 id="gnu-gas-assembler"><a class="anchor" href="#gnu-gas-assembler"></a><a class="link" href="#gnu-gas-assembler">23.9. GNU GAS assembler</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/GNU_Assembler">GNU GAS</a> is the default assembler used by GCC, and therefore it completely dominates in Linux.</p>
 </div>
@@ -34142,7 +34191,7 @@ child after parent sleep</pre>
 <p>The Linux kernel in particular uses GNU GAS assembly extensively for the arch specific parts under <code>arch/</code>.</p>
 </div>
 <div class="sect3">
-<h4 id="gnu-gas-assembler-comments"><a class="anchor" href="#gnu-gas-assembler-comments"></a><a class="link" href="#gnu-gas-assembler-comments">22.9.1. GNU GAS assembler comments</a></h4>
+<h4 id="gnu-gas-assembler-comments"><a class="anchor" href="#gnu-gas-assembler-comments"></a><a class="link" href="#gnu-gas-assembler-comments">23.9.1. GNU GAS assembler comments</a></h4>
 <div class="paragraph">
 <p>In this tutorial, we use exclusively C Preprocessor <code>/**/</code> comments because:</p>
 </div>
@@ -34177,7 +34226,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="gnu-gas-assembler-immediates"><a class="anchor" href="#gnu-gas-assembler-immediates"></a><a class="link" href="#gnu-gas-assembler-immediates">22.9.2. GNU GAS assembler immediates</a></h4>
+<h4 id="gnu-gas-assembler-immediates"><a class="anchor" href="#gnu-gas-assembler-immediates"></a><a class="link" href="#gnu-gas-assembler-immediates">23.9.2. GNU GAS assembler immediates</a></h4>
 <div class="paragraph">
 <p>Summary:</p>
 </div>
@@ -34209,7 +34258,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="gnu-gas-assembler-data-sizes"><a class="anchor" href="#gnu-gas-assembler-data-sizes"></a><a class="link" href="#gnu-gas-assembler-data-sizes">22.9.3. GNU GAS assembler data sizes</a></h4>
+<h4 id="gnu-gas-assembler-data-sizes"><a class="anchor" href="#gnu-gas-assembler-data-sizes"></a><a class="link" href="#gnu-gas-assembler-data-sizes">23.9.3. GNU GAS assembler data sizes</a></h4>
 <div class="paragraph">
 <p>Let&#8217;s see how many bytes go into each data type:</p>
 </div>
@@ -34301,9 +34350,9 @@ child after parent sleep</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="gnu-gas-assembler-arm-specifics"><a class="anchor" href="#gnu-gas-assembler-arm-specifics"></a><a class="link" href="#gnu-gas-assembler-arm-specifics">22.9.3.1. GNU GAS assembler ARM specifics</a></h5>
+<h5 id="gnu-gas-assembler-arm-specifics"><a class="anchor" href="#gnu-gas-assembler-arm-specifics"></a><a class="link" href="#gnu-gas-assembler-arm-specifics">23.9.3.1. GNU GAS assembler ARM specifics</a></h5>
 <div class="sect5">
-<h6 id="gnu-gas-assembler-arm-unified-syntax"><a class="anchor" href="#gnu-gas-assembler-arm-unified-syntax"></a><a class="link" href="#gnu-gas-assembler-arm-unified-syntax">22.9.3.1.1. GNU GAS assembler ARM unified syntax</a></h6>
+<h6 id="gnu-gas-assembler-arm-unified-syntax"><a class="anchor" href="#gnu-gas-assembler-arm-unified-syntax"></a><a class="link" href="#gnu-gas-assembler-arm-unified-syntax">23.9.3.1.1. GNU GAS assembler ARM unified syntax</a></h6>
 <div class="paragraph">
 <p>There are two types of ARMv7 assemblies:</p>
 </div>
@@ -34348,14 +34397,14 @@ child after parent sleep</pre>
 </div>
 </li>
 <li>
-<p>cannot have implicit destination with shift, see: <a href="#arm-shift-suffixes">Section 24.4.4.1, &#8220;ARM shift suffixes&#8221;</a></p>
+<p>cannot have implicit destination with shift, see: <a href="#arm-shift-suffixes">Section 25.4.4.1, &#8220;ARM shift suffixes&#8221;</a></p>
 </li>
 </ul>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gnu-gas-assembler-arm-n-and-w-suffixes"><a class="anchor" href="#gnu-gas-assembler-arm-n-and-w-suffixes"></a><a class="link" href="#gnu-gas-assembler-arm-n-and-w-suffixes">22.9.3.2. GNU GAS assembler ARM .n and .w suffixes</a></h5>
+<h5 id="gnu-gas-assembler-arm-n-and-w-suffixes"><a class="anchor" href="#gnu-gas-assembler-arm-n-and-w-suffixes"></a><a class="link" href="#gnu-gas-assembler-arm-n-and-w-suffixes">23.9.3.2. GNU GAS assembler ARM .n and .w suffixes</a></h5>
 <div class="paragraph">
 <p>When reading disassembly, many instructions have either a <code>.n</code> or <code>.w</code> suffix.</p>
 </div>
@@ -34368,7 +34417,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="gnu-gas-assembler-char-literals"><a class="anchor" href="#gnu-gas-assembler-char-literals"></a><a class="link" href="#gnu-gas-assembler-char-literals">22.9.4. GNU GAS assembler char literals</a></h4>
+<h4 id="gnu-gas-assembler-char-literals"><a class="anchor" href="#gnu-gas-assembler-char-literals"></a><a class="link" href="#gnu-gas-assembler-char-literals">23.9.4. GNU GAS assembler char literals</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/char_literals.S">userland/arch/x86_64/char_literals.S</a></p>
 </div>
@@ -34389,14 +34438,14 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="nop-instructions"><a class="anchor" href="#nop-instructions"></a><a class="link" href="#nop-instructions">22.10. NOP instructions</a></h3>
+<h3 id="nop-instructions"><a class="anchor" href="#nop-instructions"></a><a class="link" href="#nop-instructions">23.10. NOP instructions</a></h3>
 <div class="ulist">
 <ul>
 <li>
 <p>x86: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/nop.S">NOP</a></p>
 </li>
 <li>
-<p>ARM: <a href="#arm-nop-instruction">Section 24.5.1, &#8220;ARM NOP instruction&#8221;</a></p>
+<p>ARM: <a href="#arm-nop-instruction">Section 25.5.1, &#8220;ARM NOP instruction&#8221;</a></p>
 </li>
 </ul>
 </div>
@@ -34413,13 +34462,13 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="x86-userland-assembly"><a class="anchor" href="#x86-userland-assembly"></a><a class="link" href="#x86-userland-assembly">23. x86 userland assembly</a></h2>
+<h2 id="x86-userland-assembly"><a class="anchor" href="#x86-userland-assembly"></a><a class="link" href="#x86-userland-assembly">24. x86 userland assembly</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
-<p>Arch agnostic infrastructure getting started at: <a href="#userland-assembly">Section 22, &#8220;Userland assembly&#8221;</a>.</p>
+<p>Getting started with the arch-agnostic infrastructure: <a href="#userland-assembly">Section 23, &#8220;Userland assembly&#8221;</a>.</p>
 </div>
 <div class="sect2">
-<h3 id="x86-registers"><a class="anchor" href="#x86-registers"></a><a class="link" href="#x86-registers">23.1. x86 registers</a></h3>
+<h3 id="x86-registers"><a class="anchor" href="#x86-registers"></a><a class="link" href="#x86-registers">24.1. x86 registers</a></h3>
 <div class="paragraph">
 <p>link:userland/arch/x86_64/registers.S</p>
 </div>
@@ -34470,7 +34519,7 @@ child after parent sleep</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="x86-flags-registers"><a class="anchor" href="#x86-flags-registers"></a><a class="link" href="#x86-flags-registers">23.1.1. x86 FLAGS registers</a></h4>
+<h4 id="x86-flags-registers"><a class="anchor" href="#x86-flags-registers"></a><a class="link" href="#x86-flags-registers">24.1.1. x86 FLAGS registers</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/FLAGS_register" class="bare">https://en.wikipedia.org/wiki/FLAGS_register</a></p>
 </div>
@@ -34480,7 +34529,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-addressing-modes"><a class="anchor" href="#x86-addressing-modes"></a><a class="link" href="#x86-addressing-modes">23.2. x86 addressing modes</a></h3>
+<h3 id="x86-addressing-modes"><a class="anchor" href="#x86-addressing-modes"></a><a class="link" href="#x86-addressing-modes">24.2. x86 addressing modes</a></h3>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/address_modes.S">userland/arch/x86_64/address_modes.S</a></p>
 </div>
@@ -34553,7 +34602,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-data-transfer-instructions"><a class="anchor" href="#x86-data-transfer-instructions"></a><a class="link" href="#x86-data-transfer-instructions">23.3. x86 data transfer instructions</a></h3>
+<h3 id="x86-data-transfer-instructions"><a class="anchor" href="#x86-data-transfer-instructions"></a><a class="link" href="#x86-data-transfer-instructions">24.3. x86 data transfer instructions</a></h3>
 <div class="paragraph">
 <p>5.1.1 "Data Transfer Instructions"</p>
 </div>
@@ -34584,7 +34633,7 @@ child after parent sleep</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="x86-exchange-instructions"><a class="anchor" href="#x86-exchange-instructions"></a><a class="link" href="#x86-exchange-instructions">23.3.1. x86 exchange instructions</a></h4>
+<h4 id="x86-exchange-instructions"><a class="anchor" href="#x86-exchange-instructions"></a><a class="link" href="#x86-exchange-instructions">24.3.1. x86 exchange instructions</a></h4>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 7.3.1.2 "Exchange Instructions":</p>
 </div>
@@ -34602,7 +34651,7 @@ child after parent sleep</pre>
 <p>TODO: concrete multi-thread <a href="#gcc-inline-assembly">GCC inline assembly</a> examples of how all those instructions are normally used as synchronization primitives.</p>
 </div>
 <div class="sect4">
-<h5 id="x86-cmpxchg-instruction"><a class="anchor" href="#x86-cmpxchg-instruction"></a><a class="link" href="#x86-cmpxchg-instruction">23.3.1.1. x86 CMPXCHG instruction</a></h5>
+<h5 id="x86-cmpxchg-instruction"><a class="anchor" href="#x86-cmpxchg-instruction"></a><a class="link" href="#x86-cmpxchg-instruction">24.3.1.1. x86 CMPXCHG instruction</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/cmpxchg.S">userland/arch/x86_64/cmpxchg.S</a></p>
 </div>
@@ -34626,7 +34675,7 @@ child after parent sleep</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-push-and-pop-instructions"><a class="anchor" href="#x86-push-and-pop-instructions"></a><a class="link" href="#x86-push-and-pop-instructions">23.3.2. x86 PUSH and POP instructions</a></h4>
+<h4 id="x86-push-and-pop-instructions"><a class="anchor" href="#x86-push-and-pop-instructions"></a><a class="link" href="#x86-push-and-pop-instructions">24.3.2. x86 PUSH and POP instructions</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/push.S">userland/arch/x86_64/push.S</a></p>
 </div>
@@ -34653,7 +34702,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-cqto-and-cltq-instructions"><a class="anchor" href="#x86-cqto-and-cltq-instructions"></a><a class="link" href="#x86-cqto-and-cltq-instructions">23.3.3. x86 CQTO and CLTQ instructions</a></h4>
+<h4 id="x86-cqto-and-cltq-instructions"><a class="anchor" href="#x86-cqto-and-cltq-instructions"></a><a class="link" href="#x86-cqto-and-cltq-instructions">24.3.3. x86 CQTO and CLTQ instructions</a></h4>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -34754,7 +34803,7 @@ add $8, %rsp</pre>
 </table>
 </div>
 <div class="sect3">
-<h4 id="x86-cmovcc-instructions"><a class="anchor" href="#x86-cmovcc-instructions"></a><a class="link" href="#x86-cmovcc-instructions">23.3.4. x86 CMOVcc instructions</a></h4>
+<h4 id="x86-cmovcc-instructions"><a class="anchor" href="#x86-cmovcc-instructions"></a><a class="link" href="#x86-cmovcc-instructions">24.3.4. x86 CMOVcc instructions</a></h4>
 <div class="ulist">
 <ul>
 <li>
@@ -34807,12 +34856,12 @@ add $8, %rsp</pre>
 <p>This is partly why the ternary <code>?</code> C operator exists: <a href="https://stackoverflow.com/questions/3565368/ternary-operator-vs-if-else" class="bare">https://stackoverflow.com/questions/3565368/ternary-operator-vs-if-else</a></p>
 </div>
 <div class="paragraph">
-<p>It is interesting to compare this with ARMv7 conditional execution: which is available for all instructions, as shown at: <a href="#arm-conditional-execution">Section 24.2.5, &#8220;ARM conditional execution&#8221;</a>.</p>
+<p>It is interesting to compare this with ARMv7 conditional execution, which is available for all instructions, as shown at: <a href="#arm-conditional-execution">Section 25.2.5, &#8220;ARM conditional execution&#8221;</a>.</p>
 </div>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-binary-arithmetic-instructions"><a class="anchor" href="#x86-binary-arithmetic-instructions"></a><a class="link" href="#x86-binary-arithmetic-instructions">23.4. x86 binary arithmetic instructions</a></h3>
+<h3 id="x86-binary-arithmetic-instructions"><a class="anchor" href="#x86-binary-arithmetic-instructions"></a><a class="link" href="#x86-binary-arithmetic-instructions">24.4. x86 binary arithmetic instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.2 "Binary Arithmetic Instructions":</p>
 </div>
@@ -34880,7 +34929,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-logical-instructions"><a class="anchor" href="#x86-logical-instructions"></a><a class="link" href="#x86-logical-instructions">23.5. x86 logical instructions</a></h3>
+<h3 id="x86-logical-instructions"><a class="anchor" href="#x86-logical-instructions"></a><a class="link" href="#x86-logical-instructions">24.5. x86 logical instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.4 "Logical Instructions"</p>
 </div>
@@ -34902,7 +34951,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-shift-and-rotate-instructions"><a class="anchor" href="#x86-shift-and-rotate-instructions"></a><a class="link" href="#x86-shift-and-rotate-instructions">23.6. x86 shift and rotate instructions</a></h3>
+<h3 id="x86-shift-and-rotate-instructions"><a class="anchor" href="#x86-shift-and-rotate-instructions"></a><a class="link" href="#x86-shift-and-rotate-instructions">24.6. x86 shift and rotate instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.5 "Shift and Rotate Instructions"</p>
 </div>
@@ -34954,7 +35003,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-bit-and-byte-instructions"><a class="anchor" href="#x86-bit-and-byte-instructions"></a><a class="link" href="#x86-bit-and-byte-instructions">23.7. x86 bit and byte instructions</a></h3>
+<h3 id="x86-bit-and-byte-instructions"><a class="anchor" href="#x86-bit-and-byte-instructions"></a><a class="link" href="#x86-bit-and-byte-instructions">24.7. x86 bit and byte instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.6 "Bit and Byte Instructions"</p>
 </div>
@@ -35013,7 +35062,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-control-transfer-instructions"><a class="anchor" href="#x86-control-transfer-instructions"></a><a class="link" href="#x86-control-transfer-instructions">23.8. x86 control transfer instructions</a></h3>
+<h3 id="x86-control-transfer-instructions"><a class="anchor" href="#x86-control-transfer-instructions"></a><a class="link" href="#x86-control-transfer-instructions">24.8. x86 control transfer instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.7 "Control Transfer Instructions"</p>
 </div>
@@ -35032,7 +35081,7 @@ add $8, %rsp</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="x86-jcc-instructions"><a class="anchor" href="#x86-jcc-instructions"></a><a class="link" href="#x86-jcc-instructions">23.8.1. x86 Jcc instructions</a></h4>
+<h4 id="x86-jcc-instructions"><a class="anchor" href="#x86-jcc-instructions"></a><a class="link" href="#x86-jcc-instructions">24.8.1. x86 Jcc instructions</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/jcc.S">userland/arch/x86_64/jcc.S</a></p>
 </div>
@@ -35106,7 +35155,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-loop-instruction"><a class="anchor" href="#x86-loop-instruction"></a><a class="link" href="#x86-loop-instruction">23.8.2. x86 LOOP instruction</a></h4>
+<h4 id="x86-loop-instruction"><a class="anchor" href="#x86-loop-instruction"></a><a class="link" href="#x86-loop-instruction">24.8.2. x86 LOOP instruction</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/loop.S">userland/arch/x86_64/loop.S</a></p>
 </div>
@@ -35115,7 +35164,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-string-instructions"><a class="anchor" href="#x86-string-instructions"></a><a class="link" href="#x86-string-instructions">23.8.3. x86 string instructions</a></h4>
+<h4 id="x86-string-instructions"><a class="anchor" href="#x86-string-instructions"></a><a class="link" href="#x86-string-instructions">24.8.3. x86 string instructions</a></h4>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.8 "String Instructions"</p>
 </div>
@@ -35168,7 +35217,7 @@ add $8, %rsp</pre>
 <p>However, as computer architecture evolved, those instructions might not offer considerable speedups anymore, and modern glibc such as 2.29 just uses <a href="#x86-simd">x86 SIMD</a> operations instead, see also: <a href="https://stackoverflow.com/questions/33480999/how-can-the-rep-stosb-instruction-execute-faster-than-the-equivalent-loop" class="bare">https://stackoverflow.com/questions/33480999/how-can-the-rep-stosb-instruction-execute-faster-than-the-equivalent-loop</a></p>
 </div>
 <div class="sect4">
-<h5 id="x86-rep-prefix"><a class="anchor" href="#x86-rep-prefix"></a><a class="link" href="#x86-rep-prefix">23.8.3.1. x86 REP prefix</a></h5>
+<h5 id="x86-rep-prefix"><a class="anchor" href="#x86-rep-prefix"></a><a class="link" href="#x86-rep-prefix">24.8.3.1. x86 REP prefix</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/rep.S">userland/arch/x86_64/rep.S</a></p>
 </div>
@@ -35207,7 +35256,7 @@ add $8, %rsp</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-enter-and-leave-instructions"><a class="anchor" href="#x86-enter-and-leave-instructions"></a><a class="link" href="#x86-enter-and-leave-instructions">23.8.4. x86 ENTER and LEAVE instructions</a></h4>
+<h4 id="x86-enter-and-leave-instructions"><a class="anchor" href="#x86-enter-and-leave-instructions"></a><a class="link" href="#x86-enter-and-leave-instructions">24.8.4. x86 ENTER and LEAVE instructions</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/enter.S">userland/arch/x86_64/enter.S</a></p>
 </div>
@@ -35258,16 +35307,16 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-miscellaneous-instructions"><a class="anchor" href="#x86-miscellaneous-instructions"></a><a class="link" href="#x86-miscellaneous-instructions">23.9. x86 miscellaneous instructions</a></h3>
+<h3 id="x86-miscellaneous-instructions"><a class="anchor" href="#x86-miscellaneous-instructions"></a><a class="link" href="#x86-miscellaneous-instructions">24.9. x86 miscellaneous instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.13 "Miscellaneous Instructions"</p>
 </div>
 <div class="paragraph">
-<p>NOP: <a href="#nop-instructions">Section 22.10, &#8220;NOP instructions&#8221;</a></p>
+<p>NOP: <a href="#nop-instructions">Section 23.10, &#8220;NOP instructions&#8221;</a></p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-random-number-generator-instructions"><a class="anchor" href="#x86-random-number-generator-instructions"></a><a class="link" href="#x86-random-number-generator-instructions">23.10. x86 random number generator instructions</a></h3>
+<h3 id="x86-random-number-generator-instructions"><a class="anchor" href="#x86-random-number-generator-instructions"></a><a class="link" href="#x86-random-number-generator-instructions">24.10. x86 random number generator instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.1.15 Random Number Generator Instructions</p>
 </div>
@@ -35290,7 +35339,7 @@ pop %rbp</pre>
 <p>RDRAND sets the carry flag when data is ready so we must loop if the carry flag isn&#8217;t set.</p>
 </div>
 <div class="sect3">
-<h4 id="x86-cpuid-instruction"><a class="anchor" href="#x86-cpuid-instruction"></a><a class="link" href="#x86-cpuid-instruction">23.10.1. x86 CPUID instruction</a></h4>
+<h4 id="x86-cpuid-instruction"><a class="anchor" href="#x86-cpuid-instruction"></a><a class="link" href="#x86-cpuid-instruction">24.10.1. x86 CPUID instruction</a></h4>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/cpuid.S">userland/arch/x86_64/cpuid.S</a></p>
 </div>
@@ -35361,7 +35410,7 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-x87-fpu-instructions"><a class="anchor" href="#x86-x87-fpu-instructions"></a><a class="link" href="#x86-x87-fpu-instructions">23.11. x86 x87 FPU instructions</a></h3>
+<h3 id="x86-x87-fpu-instructions"><a class="anchor" href="#x86-x87-fpu-instructions"></a><a class="link" href="#x86-x87-fpu-instructions">24.11. x86 x87 FPU instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.2 "X87 FPU INSTRUCTIONS"</p>
 </div>
@@ -35454,7 +35503,7 @@ pop %rbp</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="x86-x87-fpu-vs-simd"><a class="anchor" href="#x86-x87-fpu-vs-simd"></a><a class="link" href="#x86-x87-fpu-vs-simd">23.11.1. x86 x87 FPU vs SIMD</a></h4>
+<h4 id="x86-x87-fpu-vs-simd"><a class="anchor" href="#x86-x87-fpu-vs-simd"></a><a class="link" href="#x86-x87-fpu-vs-simd">24.11.1. x86 x87 FPU vs SIMD</a></h4>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/1844669/benefits-of-x87-over-sse" class="bare">https://stackoverflow.com/questions/1844669/benefits-of-x87-over-sse</a></p>
 </div>
@@ -35493,9 +35542,9 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-simd"><a class="anchor" href="#x86-simd"></a><a class="link" href="#x86-simd">23.12. x86 SIMD</a></h3>
+<h3 id="x86-simd"><a class="anchor" href="#x86-simd"></a><a class="link" href="#x86-simd">24.12. x86 SIMD</a></h3>
 <div class="paragraph">
-<p>Parent section: <a href="#simd-assembly">Section 22.3, &#8220;SIMD assembly&#8221;</a></p>
+<p>Parent section: <a href="#simd-assembly">Section 23.3, &#8220;SIMD assembly&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>History:</p>
@@ -35529,12 +35578,12 @@ pop %rbp</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="x86-sse-instructions"><a class="anchor" href="#x86-sse-instructions"></a><a class="link" href="#x86-sse-instructions">23.12.1. x86 SSE instructions</a></h4>
+<h4 id="x86-sse-instructions"><a class="anchor" href="#x86-sse-instructions"></a><a class="link" href="#x86-sse-instructions">24.12.1. x86 SSE instructions</a></h4>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.5 "SSE INSTRUCTIONS"</p>
 </div>
 <div class="sect4">
-<h5 id="x86-sse-data-transfer-instructions"><a class="anchor" href="#x86-sse-data-transfer-instructions"></a><a class="link" href="#x86-sse-data-transfer-instructions">23.12.1.1. x86 SSE data transfer instructions</a></h5>
+<h5 id="x86-sse-data-transfer-instructions"><a class="anchor" href="#x86-sse-data-transfer-instructions"></a><a class="link" href="#x86-sse-data-transfer-instructions">24.12.1.1. x86 SSE data transfer instructions</a></h5>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.5.1.1 "SSE Data Transfer Instructions"</p>
 </div>
@@ -35553,7 +35602,7 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="x86-sse-packed-arithmetic-instructions"><a class="anchor" href="#x86-sse-packed-arithmetic-instructions"></a><a class="link" href="#x86-sse-packed-arithmetic-instructions">23.12.1.2. x86 SSE packed arithmetic instructions</a></h5>
+<h5 id="x86-sse-packed-arithmetic-instructions"><a class="anchor" href="#x86-sse-packed-arithmetic-instructions"></a><a class="link" href="#x86-sse-packed-arithmetic-instructions">24.12.1.2. x86 SSE packed arithmetic instructions</a></h5>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.5.1.2 "SSE Packed Arithmetic Instructions"</p>
 </div>
@@ -35566,14 +35615,14 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="x86-sse-conversion-instructions"><a class="anchor" href="#x86-sse-conversion-instructions"></a><a class="link" href="#x86-sse-conversion-instructions">23.12.1.3. x86 SSE conversion instructions</a></h5>
+<h5 id="x86-sse-conversion-instructions"><a class="anchor" href="#x86-sse-conversion-instructions"></a><a class="link" href="#x86-sse-conversion-instructions">24.12.1.3. x86 SSE conversion instructions</a></h5>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.5.1.6 "SSE Conversion Instructions"</p>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-sse2-instructions"><a class="anchor" href="#x86-sse2-instructions"></a><a class="link" href="#x86-sse2-instructions">23.12.2. x86 SSE2 instructions</a></h4>
+<h4 id="x86-sse2-instructions"><a class="anchor" href="#x86-sse2-instructions"></a><a class="link" href="#x86-sse2-instructions">24.12.2. x86 SSE2 instructions</a></h4>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.6 "SSE2 INSTRUCTIONS"</p>
 </div>
@@ -35585,7 +35634,7 @@ pop %rbp</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="x86-paddq-instruction"><a class="anchor" href="#x86-paddq-instruction"></a><a class="link" href="#x86-paddq-instruction">23.12.2.1. x86 PADDQ instruction</a></h5>
+<h5 id="x86-paddq-instruction"><a class="anchor" href="#x86-paddq-instruction"></a><a class="link" href="#x86-paddq-instruction">24.12.2.1. x86 PADDQ instruction</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/x86_64/paddq.S">userland/arch/x86_64/paddq.S</a>: PADDQ, PADDL, PADDW, PADDB</p>
 </div>
@@ -35595,7 +35644,7 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="x86-fma"><a class="anchor" href="#x86-fma"></a><a class="link" href="#x86-fma">23.12.3. x86 fused multiply add (FMA)</a></h4>
+<h4 id="x86-fma"><a class="anchor" href="#x86-fma"></a><a class="link" href="#x86-fma">24.12.3. x86 fused multiply add (FMA)</a></h4>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.15 "FUSED-MULTIPLY-ADD (FMA)"</p>
 </div>
@@ -35615,12 +35664,12 @@ pop %rbp</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-system-instructions"><a class="anchor" href="#x86-system-instructions"></a><a class="link" href="#x86-system-instructions">23.13. x86 system instructions</a></h3>
+<h3 id="x86-system-instructions"><a class="anchor" href="#x86-system-instructions"></a><a class="link" href="#x86-system-instructions">24.13. x86 system instructions</a></h3>
 <div class="paragraph">
 <p><a href="#intel-manual-1">Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a> 5.20 "SYSTEM INSTRUCTIONS"</p>
 </div>
 <div class="sect3">
-<h4 id="x86-rdtsc-instruction"><a class="anchor" href="#x86-rdtsc-instruction"></a><a class="link" href="#x86-rdtsc-instruction">23.13.1. x86 RDTSC instruction</a></h4>
+<h4 id="x86-rdtsc-instruction"><a class="anchor" href="#x86-rdtsc-instruction"></a><a class="link" href="#x86-rdtsc-instruction">24.13.1. x86 RDTSC instruction</a></h4>
 <div class="paragraph">
 <p>Sources:</p>
 </div>
@@ -35694,7 +35743,7 @@ pop %rbp</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="x86-rdtscp-instruction"><a class="anchor" href="#x86-rdtscp-instruction"></a><a class="link" href="#x86-rdtscp-instruction">23.13.1.1. x86 RDTSCP instruction</a></h5>
+<h5 id="x86-rdtscp-instruction"><a class="anchor" href="#x86-rdtscp-instruction"></a><a class="link" href="#x86-rdtscp-instruction">24.13.1.1. x86 RDTSCP instruction</a></h5>
 <div class="paragraph">
 <p>RDTSCP is like RDTSC, but it also stores the CPU ID into ECX: this is convenient because the value of RDTSC depends on which core we are currently on, so you often also want the core ID when you want the RDTSC.</p>
 </div>
@@ -35737,7 +35786,7 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-pmccntr-register"><a class="anchor" href="#arm-pmccntr-register"></a><a class="link" href="#arm-pmccntr-register">23.13.1.2. ARM PMCCNTR register</a></h5>
+<h5 id="arm-pmccntr-register"><a class="anchor" href="#arm-pmccntr-register"></a><a class="link" href="#arm-pmccntr-register">24.13.1.2. ARM PMCCNTR register</a></h5>
 <div class="paragraph">
 <p>TODO We didn&#8217;t manage to find a working ARM analogue to <a href="#x86-rdtsc-instruction">x86 RDTSC instruction</a>: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/kernel_modules/pmccntr.c">kernel_modules/pmccntr.c</a> is oopsing, and even if it weren&#8217;t, it likely won&#8217;t give the cycle count since boot since it needs to be activated before it starts counting anything:</p>
 </div>
@@ -35758,9 +35807,9 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-thread-synchronization-primitives"><a class="anchor" href="#x86-thread-synchronization-primitives"></a><a class="link" href="#x86-thread-synchronization-primitives">23.14. x86 thread synchronization primitives</a></h3>
+<h3 id="x86-thread-synchronization-primitives"><a class="anchor" href="#x86-thread-synchronization-primitives"></a><a class="link" href="#x86-thread-synchronization-primitives">24.14. x86 thread synchronization primitives</a></h3>
 <div class="sect3">
-<h4 id="x86-lock-prefix"><a class="anchor" href="#x86-lock-prefix"></a><a class="link" href="#x86-lock-prefix">23.14.1. x86 LOCK prefix</a></h4>
+<h4 id="x86-lock-prefix"><a class="anchor" href="#x86-lock-prefix"></a><a class="link" href="#x86-lock-prefix">24.14.1. x86 LOCK prefix</a></h4>
 <div class="paragraph">
 <p>Inline assembly example at: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/cpp/atomic/x86_64_lock_inc.cpp">userland/cpp/atomic/x86_64_lock_inc.cpp</a>, see also: <a href="#atomic-cpp">atomic.cpp</a>.</p>
 </div>
@@ -35786,11 +35835,11 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="x86-assembly-bibliography"><a class="anchor" href="#x86-assembly-bibliography"></a><a class="link" href="#x86-assembly-bibliography">23.15. x86 assembly bibliography</a></h3>
+<h3 id="x86-assembly-bibliography"><a class="anchor" href="#x86-assembly-bibliography"></a><a class="link" href="#x86-assembly-bibliography">24.15. x86 assembly bibliography</a></h3>
 <div class="sect3">
-<h4 id="x86-official-bibliography"><a class="anchor" href="#x86-official-bibliography"></a><a class="link" href="#x86-official-bibliography">23.15.1. x86 official bibliography</a></h4>
+<h4 id="x86-official-bibliography"><a class="anchor" href="#x86-official-bibliography"></a><a class="link" href="#x86-official-bibliography">24.15.1. x86 official bibliography</a></h4>
 <div class="sect4">
-<h5 id="intel-manual"><a class="anchor" href="#intel-manual"></a><a class="link" href="#intel-manual">23.15.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals</a></h5>
+<h5 id="intel-manual"><a class="anchor" href="#intel-manual"></a><a class="link" href="#intel-manual">24.15.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals</a></h5>
 <div class="paragraph">
 <p>We are using the May 2019 version unless otherwise noted.</p>
 </div>
@@ -35807,25 +35856,25 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 <p>Also I can&#8217;t find older versions on the website easily, so I just web archive everything.</p>
 </div>
 <div class="sect5">
-<h6 id="intel-manual-1"><a class="anchor" href="#intel-manual-1"></a><a class="link" href="#intel-manual-1">23.15.1.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a></h6>
+<h6 id="intel-manual-1"><a class="anchor" href="#intel-manual-1"></a><a class="link" href="#intel-manual-1">24.15.1.1.1. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 1</a></h6>
 <div class="paragraph">
 <p>Userland basics: <a href="http://web.archive.org/web/20190606075544/https://software.intel.com/sites/default/files/managed/a4/60/253665-sdm-vol-1.pdf" class="bare">http://web.archive.org/web/20190606075544/https://software.intel.com/sites/default/files/managed/a4/60/253665-sdm-vol-1.pdf</a></p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="intel-manual-2"><a class="anchor" href="#intel-manual-2"></a><a class="link" href="#intel-manual-2">23.15.1.1.2. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 2</a></h6>
+<h6 id="intel-manual-2"><a class="anchor" href="#intel-manual-2"></a><a class="link" href="#intel-manual-2">24.15.1.1.2. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 2</a></h6>
 <div class="paragraph">
 <p>Instruction list: <a href="http://web.archive.org/web/20190606075330/https://software.intel.com/sites/default/files/managed/a4/60/325383-sdm-vol-2abcd.pdf" class="bare">http://web.archive.org/web/20190606075330/https://software.intel.com/sites/default/files/managed/a4/60/325383-sdm-vol-2abcd.pdf</a></p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="intel-manual-3"><a class="anchor" href="#intel-manual-3"></a><a class="link" href="#intel-manual-3">23.15.1.1.3. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 3</a></h6>
+<h6 id="intel-manual-3"><a class="anchor" href="#intel-manual-3"></a><a class="link" href="#intel-manual-3">24.15.1.1.3. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 3</a></h6>
 <div class="paragraph">
 <p>Kernel land: <a href="http://web.archive.org/web/20190606075534/https://software.intel.com/sites/default/files/managed/a4/60/325384-sdm-vol-3abcd.pdf" class="bare">http://web.archive.org/web/20190606075534/https://software.intel.com/sites/default/files/managed/a4/60/325384-sdm-vol-3abcd.pdf</a></p>
 </div>
 </div>
 <div class="sect5">
-<h6 id="intel-manual-4"><a class="anchor" href="#intel-manual-4"></a><a class="link" href="#intel-manual-4">23.15.1.1.4. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 4</a></h6>
+<h6 id="intel-manual-4"><a class="anchor" href="#intel-manual-4"></a><a class="link" href="#intel-manual-4">24.15.1.1.4. Intel 64 and IA-32 Architectures Software Developer&#8217;s Manuals Volume 4</a></h6>
 <div class="paragraph">
 <p>Model specific extensions: <a href="http://web.archive.org/web/20190606075325/https://software.intel.com/sites/default/files/managed/22/0d/335592-sdm-vol-4.pdf" class="bare">http://web.archive.org/web/20190606075325/https://software.intel.com/sites/default/files/managed/22/0d/335592-sdm-vol-4.pdf</a></p>
 </div>
@@ -35836,10 +35885,10 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="arm-userland-assembly"><a class="anchor" href="#arm-userland-assembly"></a><a class="link" href="#arm-userland-assembly">24. ARM userland assembly</a></h2>
+<h2 id="arm-userland-assembly"><a class="anchor" href="#arm-userland-assembly"></a><a class="link" href="#arm-userland-assembly">25. ARM userland assembly</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
-<p>Arch general getting started at: <a href="#userland-assembly">Section 22, &#8220;Userland assembly&#8221;</a>.</p>
+<p>Arch general getting started at: <a href="#userland-assembly">Section 23, &#8220;Userland assembly&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Instructions here are loosely grouped following the <a href="#armarm7">ARMv7 architecture reference manual</a> Chapter A4 "The Instruction Sets".</p>
@@ -35848,7 +35897,7 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 <p>We cover here mostly ARMv7, and then treat aarch64 differentially, since much of the ARMv7 userland is the same in aarch32.</p>
 </div>
 <div class="sect2">
-<h3 id="introduction-to-the-arm-architecture"><a class="anchor" href="#introduction-to-the-arm-architecture"></a><a class="link" href="#introduction-to-the-arm-architecture">24.1. Introduction to the ARM architecture</a></h3>
+<h3 id="introduction-to-the-arm-architecture"><a class="anchor" href="#introduction-to-the-arm-architecture"></a><a class="link" href="#introduction-to-the-arm-architecture">25.1. Introduction to the ARM architecture</a></h3>
 <div class="paragraph">
 <p>The <a href="https://en.wikipedia.org/wiki/ARM_architecture">ARM architecture</a> has been used on the vast majority of mobile phones in the 2010s, and on a large fraction of microcontrollers.</p>
 </div>
@@ -35865,7 +35914,7 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 <p>ARM Holdings was bought by the Japanese giant SoftBank in 2016.</p>
 </div>
 <div class="sect3">
-<h4 id="armv8-vs-armv7-vs-aarch64-vs-aarch32"><a class="anchor" href="#armv8-vs-armv7-vs-aarch64-vs-aarch32"></a><a class="link" href="#armv8-vs-armv7-vs-aarch64-vs-aarch32">24.1.1. ARMv8 vs ARMv7 vs AArch64 vs AArch32</a></h4>
+<h4 id="armv8-vs-armv7-vs-aarch64-vs-aarch32"><a class="anchor" href="#armv8-vs-armv7-vs-aarch64-vs-aarch32"></a><a class="link" href="#armv8-vs-armv7-vs-aarch64-vs-aarch32">25.1.1. ARMv8 vs ARMv7 vs AArch64 vs AArch32</a></h4>
 <div class="paragraph">
 <p>ARMv7 is the older architecture described at: <a href="#armarm7">ARMv7 architecture reference manual</a>.</p>
 </div>
@@ -35921,7 +35970,7 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 <p>They are described at: <a href="#armarm8">ARMv8 architecture reference manual</a> A1.7 "ARMv8 architecture extensions".</p>
 </div>
 <div class="sect4">
-<h5 id="aarch32"><a class="anchor" href="#aarch32"></a><a class="link" href="#aarch32">24.1.1.1. AArch32</a></h5>
+<h5 id="aarch32"><a class="anchor" href="#aarch32"></a><a class="link" href="#aarch32">25.1.1.1. AArch32</a></h5>
 <div class="paragraph">
 <p>32-bit mode of operation of ARMv8.</p>
 </div>
@@ -35953,7 +36002,7 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="aarch32-vs-aarch64"><a class="anchor" href="#aarch32-vs-aarch64"></a><a class="link" href="#aarch32-vs-aarch64">24.1.1.2. AArch32 vs AArch64</a></h5>
+<h5 id="aarch32-vs-aarch64"><a class="anchor" href="#aarch32-vs-aarch64"></a><a class="link" href="#aarch32-vs-aarch64">25.1.1.2. AArch32 vs AArch64</a></h5>
 <div class="paragraph">
 <p>A great summary of differences can be found at: <a href="https://en.wikipedia.org/wiki/ARM_architecture#AArch64_features" class="bare">https://en.wikipedia.org/wiki/ARM_architecture#AArch64_features</a></p>
 </div>
@@ -35963,17 +36012,17 @@ taskset -c 1 ./userland/arch/x86_64/rdtscp.out | tail -n 1</pre>
 <div class="ulist">
 <ul>
 <li>
-<p>aarch32 has two encodings: Thumb and ARM: <a href="#arm-instruction-encodings">Section 24.1.3, &#8220;ARM instruction encodings&#8221;</a></p>
+<p>aarch32 has two encodings: Thumb and ARM: <a href="#arm-instruction-encodings">Section 25.1.3, &#8220;ARM instruction encodings&#8221;</a></p>
 </li>
 <li>
-<p>in ARMv8, the stack can be enforced to 16-byte alignment: <a href="#armv8-aarch64-stack-alignment">Section 24.3.2.2.1, &#8220;ARMV8 aarch64 stack alignment&#8221;</a></p>
+<p>in ARMv8, the stack can be enforced to 16-byte alignment: <a href="#armv8-aarch64-stack-alignment">Section 25.3.2.2.1, &#8220;ARMV8 aarch64 stack alignment&#8221;</a></p>
 </li>
 </ul>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="free-arm-implementations"><a class="anchor" href="#free-arm-implementations"></a><a class="link" href="#free-arm-implementations">24.1.2. Free ARM implementations</a></h4>
+<h4 id="free-arm-implementations"><a class="anchor" href="#free-arm-implementations"></a><a class="link" href="#free-arm-implementations">25.1.2. Free ARM implementations</a></h4>
 <div class="paragraph">
 <p>The ARM instruction set is itself protected by patents / copyright / whatever, and you have to pay ARM Holdings for a licence to implement it, even if you are creating your own custom Verilog code.</p>
 </div>
@@ -36012,7 +36061,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-instruction-encodings"><a class="anchor" href="#arm-instruction-encodings"></a><a class="link" href="#arm-instruction-encodings">24.1.3. ARM instruction encodings</a></h4>
+<h4 id="arm-instruction-encodings"><a class="anchor" href="#arm-instruction-encodings"></a><a class="link" href="#arm-instruction-encodings">25.1.3. ARM instruction encodings</a></h4>
 <div class="paragraph">
 <p>Understanding the basics of instruction encodings is fundamental to help you to remember what instructions do and why some things are possible or not, notably the <a href="#arm-ldr-pseudo-instruction">ARM LDR pseudo-instruction</a> and the <a href="#arm-adr-instruction">ADRP instruction</a>.</p>
 </div>
@@ -36124,7 +36173,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </ul>
 </div>
 <div class="sect4">
-<h5 id="arm-thumb-encoding"><a class="anchor" href="#arm-thumb-encoding"></a><a class="link" href="#arm-thumb-encoding">24.1.3.1. ARM Thumb encoding</a></h5>
+<h5 id="arm-thumb-encoding"><a class="anchor" href="#arm-thumb-encoding"></a><a class="link" href="#arm-thumb-encoding">25.1.3.1. ARM Thumb encoding</a></h5>
 <div class="paragraph">
 <p>Thumb examples are available at:</p>
 </div>
@@ -36183,7 +36232,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-big-endian-mode"><a class="anchor" href="#arm-big-endian-mode"></a><a class="link" href="#arm-big-endian-mode">24.1.3.2. ARM big endian mode</a></h5>
+<h5 id="arm-big-endian-mode"><a class="anchor" href="#arm-big-endian-mode"></a><a class="link" href="#arm-big-endian-mode">25.1.3.2. ARM big endian mode</a></h5>
 <div class="paragraph">
 <p>ARM can switch between big and little endian mode on the fly!</p>
 </div>
@@ -36279,9 +36328,9 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-branch-instructions"><a class="anchor" href="#arm-branch-instructions"></a><a class="link" href="#arm-branch-instructions">24.2. ARM branch instructions</a></h3>
+<h3 id="arm-branch-instructions"><a class="anchor" href="#arm-branch-instructions"></a><a class="link" href="#arm-branch-instructions">25.2. ARM branch instructions</a></h3>
 <div class="sect3">
-<h4 id="arm-b-instruction"><a class="anchor" href="#arm-b-instruction"></a><a class="link" href="#arm-b-instruction">24.2.1. ARM B instruction</a></h4>
+<h4 id="arm-b-instruction"><a class="anchor" href="#arm-b-instruction"></a><a class="link" href="#arm-b-instruction">25.2.1. ARM B instruction</a></h4>
 <div class="paragraph">
 <p>Unconditional branch.</p>
 </div>
@@ -36299,7 +36348,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-beq-instruction"><a class="anchor" href="#arm-beq-instruction"></a><a class="link" href="#arm-beq-instruction">24.2.2. ARM BEQ instruction</a></h4>
+<h4 id="arm-beq-instruction"><a class="anchor" href="#arm-beq-instruction"></a><a class="link" href="#arm-beq-instruction">25.2.2. ARM BEQ instruction</a></h4>
 <div class="paragraph">
 <p>Branch if equal based on the status registers.</p>
 </div>
@@ -36343,7 +36392,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-bl-instruction"><a class="anchor" href="#arm-bl-instruction"></a><a class="link" href="#arm-bl-instruction">24.2.3. ARM BL instruction</a></h4>
+<h4 id="arm-bl-instruction"><a class="anchor" href="#arm-bl-instruction"></a><a class="link" href="#arm-bl-instruction">25.2.3. ARM BL instruction</a></h4>
 <div class="paragraph">
 <p>Branch with link, i.e. branch and store the return address in the LR register.</p>
 </div>
@@ -36357,13 +36406,13 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 <p>The current ARM / Thumb mode is encoded in the least significant bit of lr.</p>
 </div>
 <div class="sect4">
-<h5 id="arm-bx-instruction"><a class="anchor" href="#arm-bx-instruction"></a><a class="link" href="#arm-bx-instruction">24.2.3.1. ARM BX instruction</a></h5>
+<h5 id="arm-bx-instruction"><a class="anchor" href="#arm-bx-instruction"></a><a class="link" href="#arm-bx-instruction">25.2.3.1. ARM BX instruction</a></h5>
 <div class="paragraph">
-<p>See: <a href="#arm-thumb-encoding">Section 24.1.3.1, &#8220;ARM Thumb encoding&#8221;</a></p>
+<p>See: <a href="#arm-thumb-encoding">Section 25.1.3.1, &#8220;ARM Thumb encoding&#8221;</a></p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-ret-instruction"><a class="anchor" href="#armv8-aarch64-ret-instruction"></a><a class="link" href="#armv8-aarch64-ret-instruction">24.2.3.2. ARMv8 aarch64 ret instruction</a></h5>
+<h5 id="armv8-aarch64-ret-instruction"><a class="anchor" href="#armv8-aarch64-ret-instruction"></a><a class="link" href="#armv8-aarch64-ret-instruction">25.2.3.2. ARMv8 aarch64 ret instruction</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/ret.S">userland/arch/aarch64/ret.S</a></p>
 </div>
@@ -36396,7 +36445,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-cbz-instruction"><a class="anchor" href="#arm-cbz-instruction"></a><a class="link" href="#arm-cbz-instruction">24.2.4. ARM CBZ instruction</a></h4>
+<h4 id="arm-cbz-instruction"><a class="anchor" href="#arm-cbz-instruction"></a><a class="link" href="#arm-cbz-instruction">25.2.4. ARM CBZ instruction</a></h4>
 <div class="paragraph">
 <p>Compare and branch if zero.</p>
 </div>
@@ -36411,7 +36460,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-conditional-execution"><a class="anchor" href="#arm-conditional-execution"></a><a class="link" href="#arm-conditional-execution">24.2.5. ARM conditional execution</a></h4>
+<h4 id="arm-conditional-execution"><a class="anchor" href="#arm-conditional-execution"></a><a class="link" href="#arm-conditional-execution">25.2.5. ARM conditional execution</a></h4>
 <div class="paragraph">
 <p>Weirdly, <a href="#arm-b-instruction">ARM B instruction</a> and family are not the only instructions that can execute conditionally on the flags: the same also applies to most instructions, e.g. ADD.</p>
 </div>
@@ -36427,7 +36476,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-load-and-store-instructions"><a class="anchor" href="#arm-load-and-store-instructions"></a><a class="link" href="#arm-load-and-store-instructions">24.3. ARM load and store instructions</a></h3>
+<h3 id="arm-load-and-store-instructions"><a class="anchor" href="#arm-load-and-store-instructions"></a><a class="link" href="#arm-load-and-store-instructions">25.3. ARM load and store instructions</a></h3>
 <div class="paragraph">
 <p>In ARM, there are only two instruction families that do memory access:</p>
 </div>
@@ -36451,9 +36500,9 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 <p>This kind of architecture is called a <a href="https://en.wikipedia.org/wiki/Load/store_architecture">Load/store architecture</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="arm-ldr-instruction"><a class="anchor" href="#arm-ldr-instruction"></a><a class="link" href="#arm-ldr-instruction">24.3.1. ARM LDR instruction</a></h4>
+<h4 id="arm-ldr-instruction"><a class="anchor" href="#arm-ldr-instruction"></a><a class="link" href="#arm-ldr-instruction">25.3.1. ARM LDR instruction</a></h4>
 <div class="sect4">
-<h5 id="arm-ldr-pseudo-instruction"><a class="anchor" href="#arm-ldr-pseudo-instruction"></a><a class="link" href="#arm-ldr-pseudo-instruction">24.3.1.1. ARM LDR pseudo-instruction</a></h5>
+<h5 id="arm-ldr-pseudo-instruction"><a class="anchor" href="#arm-ldr-pseudo-instruction"></a><a class="link" href="#arm-ldr-pseudo-instruction">25.3.1.1. ARM LDR pseudo-instruction</a></h5>
 <div class="paragraph">
 <p>LDR can be either a regular instruction that loads data from memory into a register, or a pseudo-instruction (assembler magic): <a href="http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0041c/Babbfdih.html" class="bare">http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0041c/Babbfdih.html</a></p>
 </div>
@@ -36487,7 +36536,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-addressing-modes"><a class="anchor" href="#arm-addressing-modes"></a><a class="link" href="#arm-addressing-modes">24.3.1.2. ARM addressing modes</a></h5>
+<h5 id="arm-addressing-modes"><a class="anchor" href="#arm-addressing-modes"></a><a class="link" href="#arm-addressing-modes">25.3.1.2. ARM addressing modes</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/address_modes.S">userland/arch/arm/address_modes.S</a></p>
 </div>
@@ -36558,7 +36607,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 <p><a href="#armarm8">ARMv8 architecture reference manual</a>: C1.3.3 "Load/Store addressing modes"</p>
 </div>
 <div class="sect5">
-<h6 id="arm-loop-over-array"><a class="anchor" href="#arm-loop-over-array"></a><a class="link" href="#arm-loop-over-array">24.3.1.2.1. ARM loop over array</a></h6>
+<h6 id="arm-loop-over-array"><a class="anchor" href="#arm-loop-over-array"></a><a class="link" href="#arm-loop-over-array">25.3.1.2.1. ARM loop over array</a></h6>
 <div class="paragraph">
 <p>As an application of the post-indexed addressing mode, let&#8217;s increment an array.</p>
 </div>
@@ -36568,7 +36617,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-ldrh-and-ldrb-instructions"><a class="anchor" href="#arm-ldrh-and-ldrb-instructions"></a><a class="link" href="#arm-ldrh-and-ldrb-instructions">24.3.1.3. ARM LDRH and LDRB instructions</a></h5>
+<h5 id="arm-ldrh-and-ldrb-instructions"><a class="anchor" href="#arm-ldrh-and-ldrb-instructions"></a><a class="link" href="#arm-ldrh-and-ldrb-instructions">25.3.1.3. ARM LDRH and LDRB instructions</a></h5>
 <div class="paragraph">
 <p>There are LDR variants that load fewer than the full 4 bytes:</p>
 </div>
@@ -36595,7 +36644,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-str-instruction"><a class="anchor" href="#arm-str-instruction"></a><a class="link" href="#arm-str-instruction">24.3.2. ARM STR instruction</a></h4>
+<h4 id="arm-str-instruction"><a class="anchor" href="#arm-str-instruction"></a><a class="link" href="#arm-str-instruction">25.3.2. ARM STR instruction</a></h4>
 <div class="paragraph">
 <p>Store from registers into memory.</p>
 </div>
@@ -36606,7 +36655,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 <p>Basically everything that applies to <a href="#arm-ldr-instruction">ARM LDR instruction</a> also applies here so we won&#8217;t go into much detail.</p>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-str-instruction"><a class="anchor" href="#armv8-aarch64-str-instruction"></a><a class="link" href="#armv8-aarch64-str-instruction">24.3.2.1. ARMv8 aarch64 STR instruction</a></h5>
+<h5 id="armv8-aarch64-str-instruction"><a class="anchor" href="#armv8-aarch64-str-instruction"></a><a class="link" href="#armv8-aarch64-str-instruction">25.3.2.1. ARMv8 aarch64 STR instruction</a></h5>
 <div class="paragraph">
 <p>PC-relative STR is not possible in aarch64.</p>
 </div>
@@ -36624,7 +36673,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-ldp-and-stp-instructions"><a class="anchor" href="#armv8-aarch64-ldp-and-stp-instructions"></a><a class="link" href="#armv8-aarch64-ldp-and-stp-instructions">24.3.2.2. ARMv8 aarch64 LDP and STP instructions</a></h5>
+<h5 id="armv8-aarch64-ldp-and-stp-instructions"><a class="anchor" href="#armv8-aarch64-ldp-and-stp-instructions"></a><a class="link" href="#armv8-aarch64-ldp-and-stp-instructions">25.3.2.2. ARMv8 aarch64 LDP and STP instructions</a></h5>
 <div class="paragraph">
 <p>Push a pair of registers to the stack.</p>
 </div>
@@ -36632,7 +36681,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 <p>TODO minimal example. Currently used in <code>LKMC_PROLOGUE</code> at <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/lkmc/aarch64.h">lkmc/aarch64.h</a> since it is the main way to restore register state.</p>
 </div>
 <div class="sect5">
-<h6 id="armv8-aarch64-stack-alignment"><a class="anchor" href="#armv8-aarch64-stack-alignment"></a><a class="link" href="#armv8-aarch64-stack-alignment">24.3.2.2.1. ARMV8 aarch64 stack alignment</a></h6>
+<h6 id="armv8-aarch64-stack-alignment"><a class="anchor" href="#armv8-aarch64-stack-alignment"></a><a class="link" href="#armv8-aarch64-stack-alignment">25.3.2.2.1. ARMV8 aarch64 stack alignment</a></h6>
 <div class="paragraph">
 <p>In ARMv8, 16-byte alignment of the stack pointer can be enforced.</p>
 </div>
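 <div class="paragraph">
 <p>The alignment condition itself can be sketched in C as below. The helper names <code>is_sp_aligned</code> and <code>align_sp_down</code> are hypothetical, and the stack pointer is modelled as a plain 64-bit integer.</p>
 </div>

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: true if the value is a multiple of 16,
 * i.e. its low 4 bits are all zero. */
bool is_sp_aligned(uint64_t sp) {
    return (sp & 0xF) == 0;
}

/* Hypothetical helper: round down to the previous 16-byte boundary,
 * the direction in which a descending stack grows. */
uint64_t align_sp_down(uint64_t sp) {
    return sp & ~(uint64_t)0xF;
}
```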
@@ -36679,7 +36728,7 @@ Bibliography: <a href="https://www.quora.com/Why-is-it-that-you-need-a-license-f
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-ldmia-instruction"><a class="anchor" href="#arm-ldmia-instruction"></a><a class="link" href="#arm-ldmia-instruction">24.3.3. ARM LDMIA instruction</a></h4>
+<h4 id="arm-ldmia-instruction"><a class="anchor" href="#arm-ldmia-instruction"></a><a class="link" href="#arm-ldmia-instruction">25.3.3. ARM LDMIA instruction</a></h4>
 <div class="paragraph">
 <p>Pop values from the stack into registers and optionally update the address register.</p>
 </div>
@@ -36729,7 +36778,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-data-processing-instructions"><a class="anchor" href="#arm-data-processing-instructions"></a><a class="link" href="#arm-data-processing-instructions">24.4. ARM data processing instructions</a></h3>
+<h3 id="arm-data-processing-instructions"><a class="anchor" href="#arm-data-processing-instructions"></a><a class="link" href="#arm-data-processing-instructions">25.4. ARM data processing instructions</a></h3>
 <div class="paragraph">
 <p>Arithmetic:</p>
 </div>
@@ -36753,7 +36802,7 @@ ldmia sp!, reglist</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="arm-cset-instruction"><a class="anchor" href="#arm-cset-instruction"></a><a class="link" href="#arm-cset-instruction">24.4.1. ARM CSET instruction</a></h4>
+<h4 id="arm-cset-instruction"><a class="anchor" href="#arm-cset-instruction"></a><a class="link" href="#arm-cset-instruction">25.4.1. ARM CSET instruction</a></h4>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/cset.S">userland/arch/aarch64/cset.S</a></p>
 </div>
@@ -36765,7 +36814,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-bitwise-instructions"><a class="anchor" href="#arm-bitwise-instructions"></a><a class="link" href="#arm-bitwise-instructions">24.4.2. ARM bitwise instructions</a></h4>
+<h4 id="arm-bitwise-instructions"><a class="anchor" href="#arm-bitwise-instructions"></a><a class="link" href="#arm-bitwise-instructions">25.4.2. ARM bitwise instructions</a></h4>
 <div class="ulist">
 <ul>
 <li>
@@ -36783,7 +36832,7 @@ ldmia sp!, reglist</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="arm-bic-instruction"><a class="anchor" href="#arm-bic-instruction"></a><a class="link" href="#arm-bic-instruction">24.4.2.1. ARM BIC instruction</a></h5>
+<h5 id="arm-bic-instruction"><a class="anchor" href="#arm-bic-instruction"></a><a class="link" href="#arm-bic-instruction">25.4.2.1. ARM BIC instruction</a></h5>
 <div class="paragraph">
 <p>Bitwise Bit Clear: clear some bits.</p>
 </div>
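 <div class="paragraph">
 <p>A minimal C model of the operation, assuming the three-register form <code>bic rd, rn, rm</code> on 32-bit registers. The function name <code>bic</code> is chosen here purely for illustration.</p>
 </div>

```c
#include <stdint.h>

/* Sketch of `bic rd, rn, rm`: clear in rn every bit that is set in rm,
 * leaving all other bits of rn unchanged. */
uint32_t bic(uint32_t rn, uint32_t rm) {
    return rn & ~rm;
}
```

 <div class="paragraph">
 <p>For instance, <code>bic(0xFF, 0x0F)</code> gives <code>0xF0</code>: the low nibble is cleared and the high nibble is kept.</p>
 </div>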
@@ -36797,7 +36846,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-ubfm-instruction"><a class="anchor" href="#arm-ubfm-instruction"></a><a class="link" href="#arm-ubfm-instruction">24.4.2.2. ARM UBFM instruction</a></h5>
+<h5 id="arm-ubfm-instruction"><a class="anchor" href="#arm-ubfm-instruction"></a><a class="link" href="#arm-ubfm-instruction">25.4.2.2. ARM UBFM instruction</a></h5>
 <div class="paragraph">
 <p>Unsigned Bitfield Move.</p>
 </div>
@@ -36815,7 +36864,7 @@ ldmia sp!, reglist</pre>
 <p>TODO: explain full behaviour. Very complicated. Has several simpler to understand aliases.</p>
 </div>
 <div class="sect5">
-<h6 id="arm-ubfx-instruction"><a class="anchor" href="#arm-ubfx-instruction"></a><a class="link" href="#arm-ubfx-instruction">24.4.2.2.1. ARM UBFX instruction</a></h6>
+<h6 id="arm-ubfx-instruction"><a class="anchor" href="#arm-ubfx-instruction"></a><a class="link" href="#arm-ubfx-instruction">25.4.2.2.1. ARM UBFX instruction</a></h6>
 <div class="paragraph">
 <p>Alias for:</p>
 </div>
@@ -36849,12 +36898,12 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-bfm-instruction"><a class="anchor" href="#arm-bfm-instruction"></a><a class="link" href="#arm-bfm-instruction">24.4.2.3. ARM BFM instruction</a></h5>
+<h5 id="arm-bfm-instruction"><a class="anchor" href="#arm-bfm-instruction"></a><a class="link" href="#arm-bfm-instruction">25.4.2.3. ARM BFM instruction</a></h5>
 <div class="paragraph">
 <p>TODO: explain. Similar to <a href="#arm-ubfm-instruction">UBFM</a> but leave untouched bits unmodified.</p>
 </div>
 <div class="sect5">
-<h6 id="arm-bfi-instruction"><a class="anchor" href="#arm-bfi-instruction"></a><a class="link" href="#arm-bfi-instruction">24.4.2.3.1. ARM BFI instruction</a></h6>
+<h6 id="arm-bfi-instruction"><a class="anchor" href="#arm-bfi-instruction"></a><a class="link" href="#arm-bfi-instruction">25.4.2.3.1. ARM BFI instruction</a></h6>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -36885,12 +36934,12 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-mov-instruction"><a class="anchor" href="#arm-mov-instruction"></a><a class="link" href="#arm-mov-instruction">24.4.3. ARM MOV instruction</a></h4>
+<h4 id="arm-mov-instruction"><a class="anchor" href="#arm-mov-instruction"></a><a class="link" href="#arm-mov-instruction">25.4.3. ARM MOV instruction</a></h4>
 <div class="paragraph">
 <p>Move an immediate to a register, or a register to another register.</p>
 </div>
 <div class="paragraph">
-<p>Cannot load from or to memory, since only the LDR and STR instruction families can do that in ARM as mentioned at: <a href="#arm-load-and-store-instructions">Section 24.3, &#8220;ARM load and store instructions&#8221;</a>.</p>
+<p>Cannot load from or to memory, since only the LDR and STR instruction families can do that in ARM as mentioned at: <a href="#arm-load-and-store-instructions">Section 25.3, &#8220;ARM load and store instructions&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/mov.S">userland/arch/arm/mov.S</a></p>
@@ -36951,7 +37000,7 @@ ldmia sp!, reglist</pre>
 <p>Assemblers however support magic memory allocations which may hide what is truly going on: <a href="https://stackoverflow.com/questions/14046686/why-use-ldr-over-mov-or-vice-versa-in-arm-assembly" class="bare">https://stackoverflow.com/questions/14046686/why-use-ldr-over-mov-or-vice-versa-in-arm-assembly</a> Always ask your friendly disassembly for a good confirmation.</p>
 </div>
 <div class="sect4">
-<h5 id="arm-movw-and-movt-instructions"><a class="anchor" href="#arm-movw-and-movt-instructions"></a><a class="link" href="#arm-movw-and-movt-instructions">24.4.3.1. ARM movw and movt instructions</a></h5>
+<h5 id="arm-movw-and-movt-instructions"><a class="anchor" href="#arm-movw-and-movt-instructions"></a><a class="link" href="#arm-movw-and-movt-instructions">25.4.3.1. ARM movw and movt instructions</a></h5>
 <div class="paragraph">
 <p>Set the higher or lower 16 bits of a register to an immediate in one go.</p>
 </div>
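 <div class="paragraph">
 <p>The effect of the two instructions on a 32-bit register can be modelled in C as follows. This is a sketch of the semantics only, and the function names simply reuse the mnemonics.</p>
 </div>

```c
#include <stdint.h>

/* movw rd, #imm16: write imm16 to the low half and zero the high half. */
uint32_t movw(uint16_t imm16) {
    return imm16;
}

/* movt rd, #imm16: write imm16 to the high half, keep the low half. */
uint32_t movt(uint32_t rd, uint16_t imm16) {
    return (rd & 0x0000FFFFu) | ((uint32_t)imm16 << 16);
}
```

 <div class="paragraph">
 <p>Chaining them builds an arbitrary 32-bit constant: <code>movt(movw(0x5678), 0x1234)</code> gives <code>0x12345678</code>.</p>
 </div>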
@@ -36963,7 +37012,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-movk-instruction"><a class="anchor" href="#armv8-aarch64-movk-instruction"></a><a class="link" href="#armv8-aarch64-movk-instruction">24.4.3.2. ARMv8 aarch64 movk instruction</a></h5>
+<h5 id="armv8-aarch64-movk-instruction"><a class="anchor" href="#armv8-aarch64-movk-instruction"></a><a class="link" href="#armv8-aarch64-movk-instruction">25.4.3.2. ARMv8 aarch64 movk instruction</a></h5>
 <div class="paragraph">
 <p>Fill a 64-bit register 16 bits at a time, using up to four instructions that each set one 16-bit chunk and leave the other bits unchanged.</p>
 </div>
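 <div class="paragraph">
 <p>A C model of a single <code>movk xd, #imm16, lsl #shift</code>, given as a sketch only. It assumes <code>shift</code> is one of 0, 16, 32 or 48, and the function name just reuses the mnemonic.</p>
 </div>

```c
#include <stdint.h>

/* Sketch of `movk xd, #imm16, lsl #shift`: replace one 16-bit chunk of
 * xd with imm16 and keep the other 48 bits.
 * Assumes shift is 0, 16, 32 or 48. */
uint64_t movk(uint64_t xd, uint16_t imm16, unsigned shift) {
    uint64_t mask = (uint64_t)0xFFFF << shift;
    return (xd & ~mask) | ((uint64_t)imm16 << shift);
}
```

 <div class="paragraph">
 <p>Four calls with shifts 0, 16, 32 and 48 are enough to set every bit of the register.</p>
 </div>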
@@ -36978,7 +37027,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-movn-instruction"><a class="anchor" href="#armv8-aarch64-movn-instruction"></a><a class="link" href="#armv8-aarch64-movn-instruction">24.4.3.3. ARMv8 aarch64 movn instruction</a></h5>
+<h5 id="armv8-aarch64-movn-instruction"><a class="anchor" href="#armv8-aarch64-movn-instruction"></a><a class="link" href="#armv8-aarch64-movn-instruction">25.4.3.3. ARMv8 aarch64 movn instruction</a></h5>
 <div class="paragraph">
 <p>Set 16 bits to the bitwise negation of an immediate and all the other bits to <code>1</code>.</p>
 </div>
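 <div class="paragraph">
 <p>In C terms the result is the bitwise NOT of the shifted immediate. The sketch below models the 64-bit form and assumes <code>shift</code> is one of 0, 16, 32 or 48. The function name just reuses the mnemonic.</p>
 </div>

```c
#include <stdint.h>

/* Sketch of `movn xd, #imm16, lsl #shift`: the register receives the
 * bitwise inverse of the shifted immediate, so the selected 16-bit
 * chunk holds ~imm16 and every other bit is 1.
 * Assumes shift is 0, 16, 32 or 48. */
uint64_t movn(uint16_t imm16, unsigned shift) {
    return ~((uint64_t)imm16 << shift);
}
```

 <div class="paragraph">
 <p>In particular, <code>movn(0, 0)</code> gives all ones, which is -1 in two's complement.</p>
 </div>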
@@ -36988,9 +37037,9 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-data-processing-instruction-suffixes"><a class="anchor" href="#arm-data-processing-instruction-suffixes"></a><a class="link" href="#arm-data-processing-instruction-suffixes">24.4.4. ARM data processing instruction suffixes</a></h4>
+<h4 id="arm-data-processing-instruction-suffixes"><a class="anchor" href="#arm-data-processing-instruction-suffixes"></a><a class="link" href="#arm-data-processing-instruction-suffixes">25.4.4. ARM data processing instruction suffixes</a></h4>
 <div class="sect4">
-<h5 id="arm-shift-suffixes"><a class="anchor" href="#arm-shift-suffixes"></a><a class="link" href="#arm-shift-suffixes">24.4.4.1. ARM shift suffixes</a></h5>
+<h5 id="arm-shift-suffixes"><a class="anchor" href="#arm-shift-suffixes"></a><a class="link" href="#arm-shift-suffixes">25.4.4.1. ARM shift suffixes</a></h5>
 <div class="paragraph">
 <p>Most data processing instructions can also optionally shift the second register operand.</p>
 </div>
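 <div class="paragraph">
 <p>As an illustration, an instruction of the form <code>add r0, r1, r2, lsl #2</code> can be modelled in C as below. This is a sketch with a hypothetical function name <code>add_lsl</code>, and it assumes a left shift amount between 0 and 31.</p>
 </div>

```c
#include <stdint.h>

/* Sketch of `add rd, rn, rm, lsl #shift`: the second register operand
 * is shifted left before the addition. Assumes 0 <= shift <= 31.
 * Unsigned arithmetic wraps around modulo 2^32, like the register. */
uint32_t add_lsl(uint32_t rn, uint32_t rm, unsigned shift) {
    return rn + (rm << shift);
}
```

 <div class="paragraph">
 <p>For example, <code>add_lsl(1, 3, 2)</code> computes 1 + (3 &lt;&lt; 2) = 13 in a single step.</p>
 </div>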
@@ -37018,7 +37067,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-s-suffix"><a class="anchor" href="#arm-s-suffix"></a><a class="link" href="#arm-s-suffix">24.4.4.2. ARM S suffix</a></h5>
+<h5 id="arm-s-suffix"><a class="anchor" href="#arm-s-suffix"></a><a class="link" href="#arm-s-suffix">25.4.4.2. ARM S suffix</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/s_suffix.S">userland/arch/arm/s_suffix.S</a></p>
 </div>
@@ -37034,7 +37083,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-adr-instruction"><a class="anchor" href="#arm-adr-instruction"></a><a class="link" href="#arm-adr-instruction">24.4.5. ARM ADR instruction</a></h4>
+<h4 id="arm-adr-instruction"><a class="anchor" href="#arm-adr-instruction"></a><a class="link" href="#arm-adr-instruction">25.4.5. ARM ADR instruction</a></h4>
 <div class="paragraph">
 <p>Similar rationale to the <a href="#arm-ldr-pseudo-instruction">ARM LDR pseudo-instruction</a>: it lets us store a PC-relative reachable address into a register in one go, working around the 4-byte fixed instruction size.</p>
 </div>
@@ -37058,19 +37107,19 @@ ldmia sp!, reglist</pre>
 <p>More details: <a href="https://stackoverflow.com/questions/41906688/what-are-the-semantics-of-adrp-and-adrl-instructions-in-arm-assembly/54042899#54042899" class="bare">https://stackoverflow.com/questions/41906688/what-are-the-semantics-of-adrp-and-adrl-instructions-in-arm-assembly/54042899#54042899</a></p>
 </div>
 <div class="sect4">
-<h5 id="arm-adrl-instruction"><a class="anchor" href="#arm-adrl-instruction"></a><a class="link" href="#arm-adrl-instruction">24.4.5.1. ARM ADRL instruction</a></h5>
+<h5 id="arm-adrl-instruction"><a class="anchor" href="#arm-adrl-instruction"></a><a class="link" href="#arm-adrl-instruction">25.4.5.1. ARM ADRL instruction</a></h5>
 <div class="paragraph">
-<p>See: <a href="#arm-adr-instruction">Section 24.4.5, &#8220;ARM ADR instruction&#8221;</a>.</p>
+<p>See: <a href="#arm-adr-instruction">Section 25.4.5, &#8220;ARM ADR instruction&#8221;</a>.</p>
 </div>
 </div>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-miscellaneous-instructions"><a class="anchor" href="#arm-miscellaneous-instructions"></a><a class="link" href="#arm-miscellaneous-instructions">24.5. ARM miscellaneous instructions</a></h3>
+<h3 id="arm-miscellaneous-instructions"><a class="anchor" href="#arm-miscellaneous-instructions"></a><a class="link" href="#arm-miscellaneous-instructions">25.5. ARM miscellaneous instructions</a></h3>
 <div class="sect3">
-<h4 id="arm-nop-instruction"><a class="anchor" href="#arm-nop-instruction"></a><a class="link" href="#arm-nop-instruction">24.5.1. ARM NOP instruction</a></h4>
+<h4 id="arm-nop-instruction"><a class="anchor" href="#arm-nop-instruction"></a><a class="link" href="#arm-nop-instruction">25.5.1. ARM NOP instruction</a></h4>
 <div class="paragraph">
-<p>Parent section: <a href="#nop-instructions">Section 22.10, &#8220;NOP instructions&#8221;</a></p>
+<p>Parent section: <a href="#nop-instructions">Section 23.10, &#8220;NOP instructions&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>There are a few different ways to encode NOP, notably MOV a register into itself, and a dedicated miscellaneous instruction.</p>
@@ -37091,7 +37140,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-udf-instruction"><a class="anchor" href="#arm-udf-instruction"></a><a class="link" href="#arm-udf-instruction">24.5.2. ARM UDF instruction</a></h4>
+<h4 id="arm-udf-instruction"><a class="anchor" href="#arm-udf-instruction"></a><a class="link" href="#arm-udf-instruction">25.5.2. ARM UDF instruction</a></h4>
 <div class="paragraph">
 <p>Guaranteed undefined! It therefore raises an illegal instruction signal. Used by GCC <code>__builtin_trap</code> apparently: <a href="https://stackoverflow.com/questions/16081618/programmatically-cause-undefined-instruction-exception" class="bare">https://stackoverflow.com/questions/16081618/programmatically-cause-undefined-instruction-exception</a></p>
 </div>
@@ -37110,7 +37159,7 @@ ldmia sp!, reglist</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-system-register-instructions"><a class="anchor" href="#arm-system-register-instructions"></a><a class="link" href="#arm-system-register-instructions">24.5.3. ARM system register instructions</a></h4>
+<h4 id="arm-system-register-instructions"><a class="anchor" href="#arm-system-register-instructions"></a><a class="link" href="#arm-system-register-instructions">25.5.3. ARM system register instructions</a></h4>
 <div class="paragraph">
 <p>Examples of using them can be found at: <a href="#dump-regs">dump_regs</a></p>
 </div>
@@ -37217,7 +37266,7 @@ dc isw</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-system-register-encodings"><a class="anchor" href="#arm-system-register-encodings"></a><a class="link" href="#arm-system-register-encodings">24.5.3.1. ARM system register encodings</a></h5>
+<h5 id="arm-system-register-encodings"><a class="anchor" href="#arm-system-register-encodings"></a><a class="link" href="#arm-system-register-encodings">25.5.3.1. ARM system register encodings</a></h5>
 <div class="paragraph">
 <p>Each aarch64 system register is specified in the encoding of <a href="#arm-system-register-instructions">ARM system register instructions</a> by 5 integer numbers:</p>
 </div>
@@ -37263,12 +37312,12 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-simd"><a class="anchor" href="#arm-simd"></a><a class="link" href="#arm-simd">24.6. ARM SIMD</a></h3>
+<h3 id="arm-simd"><a class="anchor" href="#arm-simd"></a><a class="link" href="#arm-simd">25.6. ARM SIMD</a></h3>
 <div class="paragraph">
-<p>Parent section: <a href="#simd-assembly">Section 22.3, &#8220;SIMD assembly&#8221;</a></p>
+<p>Parent section: <a href="#simd-assembly">Section 23.3, &#8220;SIMD assembly&#8221;</a></p>
 </div>
 <div class="sect3">
-<h4 id="arm-vfp"><a class="anchor" href="#arm-vfp"></a><a class="link" href="#arm-vfp">24.6.1. ARM VFP</a></h4>
+<h4 id="arm-vfp"><a class="anchor" href="#arm-vfp"></a><a class="link" href="#arm-vfp">25.6.1. ARM VFP</a></h4>
 <div class="paragraph">
 <p>The name for the ARMv7 and AArch32 floating point and SIMD instructions / registers.</p>
 </div>
@@ -37314,7 +37363,7 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </ul>
 </div>
 <div class="sect4">
-<h5 id="arm-vfp-registers"><a class="anchor" href="#arm-vfp-registers"></a><a class="link" href="#arm-vfp-registers">24.6.1.1. ARM VFP registers</a></h5>
+<h5 id="arm-vfp-registers"><a class="anchor" href="#arm-vfp-registers"></a><a class="link" href="#arm-vfp-registers">25.6.1.1. ARM VFP registers</a></h5>
 <div class="paragraph">
 <p>TODO example</p>
 </div>
@@ -37350,20 +37399,20 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-vadd-instruction"><a class="anchor" href="#arm-vadd-instruction"></a><a class="link" href="#arm-vadd-instruction">24.6.1.2. ARM VADD instruction</a></h5>
+<h5 id="arm-vadd-instruction"><a class="anchor" href="#arm-vadd-instruction"></a><a class="link" href="#arm-vadd-instruction">25.6.1.2. ARM VADD instruction</a></h5>
 <div class="ulist">
 <ul>
 <li>
-<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vadd_scalar.S">userland/arch/arm/vadd_scalar.S</a>: see also: <a href="#floating-point-assembly">Section 22.2, &#8220;Floating point assembly&#8221;</a></p>
+<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vadd_scalar.S">userland/arch/arm/vadd_scalar.S</a>: see also: <a href="#floating-point-assembly">Section 23.2, &#8220;Floating point assembly&#8221;</a></p>
 </li>
 <li>
-<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vadd_vector.S">userland/arch/arm/vadd_vector.S</a>: see also: <a href="#simd-assembly">Section 22.3, &#8220;SIMD assembly&#8221;</a></p>
+<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vadd_vector.S">userland/arch/arm/vadd_vector.S</a>: see also: <a href="#simd-assembly">Section 23.3, &#8220;SIMD assembly&#8221;</a></p>
 </li>
 </ul>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-vcvt-instruction"><a class="anchor" href="#arm-vcvt-instruction"></a><a class="link" href="#arm-vcvt-instruction">24.6.1.3. ARM VCVT instruction</a></h5>
+<h5 id="arm-vcvt-instruction"><a class="anchor" href="#arm-vcvt-instruction"></a><a class="link" href="#arm-vcvt-instruction">25.6.1.3. ARM VCVT instruction</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vcvt.S">userland/arch/arm/vcvt.S</a></p>
 </div>
@@ -37392,7 +37441,7 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </div>
 </div>
 <div class="sect5">
-<h6 id="arm-vcvtr-instruction"><a class="anchor" href="#arm-vcvtr-instruction"></a><a class="link" href="#arm-vcvtr-instruction">24.6.1.3.1. ARM VCVTR instruction</a></h6>
+<h6 id="arm-vcvtr-instruction"><a class="anchor" href="#arm-vcvtr-instruction"></a><a class="link" href="#arm-vcvtr-instruction">25.6.1.3.1. ARM VCVTR instruction</a></h6>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vcvtr.S">userland/arch/arm/vcvtr.S</a></p>
 </div>
@@ -37410,7 +37459,7 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </div>
 </div>
 <div class="sect5">
-<h6 id="armv8-aarch32-vcvta-instruction"><a class="anchor" href="#armv8-aarch32-vcvta-instruction"></a><a class="link" href="#armv8-aarch32-vcvta-instruction">24.6.1.3.2. ARMv8 AArch32 VCVTA instruction</a></h6>
+<h6 id="armv8-aarch32-vcvta-instruction"><a class="anchor" href="#armv8-aarch32-vcvta-instruction"></a><a class="link" href="#armv8-aarch32-vcvta-instruction">25.6.1.3.2. ARMv8 AArch32 VCVTA instruction</a></h6>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/vcvt.S">userland/arch/arm/vcvt.S</a></p>
 </div>
@@ -37430,7 +37479,7 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 </div>
 </div>
 <div class="sect3">
-<h4 id="armv8-advanced-simd-and-floating-point-support"><a class="anchor" href="#armv8-advanced-simd-and-floating-point-support"></a><a class="link" href="#armv8-advanced-simd-and-floating-point-support">24.6.2. ARMv8 Advanced SIMD and floating-point support</a></h4>
+<h4 id="armv8-advanced-simd-and-floating-point-support"><a class="anchor" href="#armv8-advanced-simd-and-floating-point-support"></a><a class="link" href="#armv8-advanced-simd-and-floating-point-support">25.6.2. ARMv8 Advanced SIMD and floating-point support</a></h4>
 <div class="paragraph">
 <p>The <a href="#armarm8">ARMv8 architecture reference manual</a> specifies floating point and SIMD support in the main architecture at A1.5 "Advanced SIMD and floating-point support".</p>
 </div>
@@ -37438,13 +37487,13 @@ LKMC_DUMP_SYSTEM_REGS_PRINTF("ID_ISAR6_EL1 0x%" PRIX32 "\n", id_isar6_el1);</pre
 <p>The feature is often referred to simply as "SIMD&amp;FP" throughout the manual.</p>
 </div>
 <div class="paragraph">
-<p>The Linux kernel shows <code>/proc/cpuinfo</code> compatibility as <code>neon</code>, which is yet another intermediate name that came up at some point, see: <a href="#arm-neon">Section 24.6.2.2, &#8220;ARM NEON&#8221;</a>.</p>
+<p>The Linux kernel shows <code>/proc/cpuinfo</code> compatibility as <code>neon</code>, which is yet another intermediate name that came up at some point, see: <a href="#arm-neon">Section 25.6.2.2, &#8220;ARM NEON&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Vs <a href="#arm-vfp">ARM VFP</a>: <a href="https://stackoverflow.com/questions/4097034/arm-cortex-a8-whats-the-difference-between-vfp-and-neon" class="bare">https://stackoverflow.com/questions/4097034/arm-cortex-a8-whats-the-difference-between-vfp-and-neon</a></p>
 </div>
 <div class="sect4">
-<h5 id="armv8-floating-point-availability"><a class="anchor" href="#armv8-floating-point-availability"></a><a class="link" href="#armv8-floating-point-availability">24.6.2.1. ARMv8 floating point availability</a></h5>
+<h5 id="armv8-floating-point-availability"><a class="anchor" href="#armv8-floating-point-availability"></a><a class="link" href="#armv8-floating-point-availability">25.6.2.1. ARMv8 floating point availability</a></h5>
 <div class="paragraph">
 <p>Support is semi-mandatory. <a href="#armarm8">ARMv8 architecture reference manual</a> A1.5 "Advanced SIMD and floating-point support":</p>
 </div>
@@ -37481,7 +37530,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-neon"><a class="anchor" href="#arm-neon"></a><a class="link" href="#arm-neon">24.6.2.2. ARM NEON</a></h5>
+<h5 id="arm-neon"><a class="anchor" href="#arm-neon"></a><a class="link" href="#arm-neon">25.6.2.2. ARM NEON</a></h5>
 <div class="paragraph">
 <p>Just an informal name for the "Advanced SIMD instructions"? Very confusing.</p>
 </div>
@@ -37508,7 +37557,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="armv8-aarch64-floating-point-registers"><a class="anchor" href="#armv8-aarch64-floating-point-registers"></a><a class="link" href="#armv8-aarch64-floating-point-registers">24.6.3. ARMv8 AArch64 floating point registers</a></h4>
+<h4 id="armv8-aarch64-floating-point-registers"><a class="anchor" href="#armv8-aarch64-floating-point-registers"></a><a class="link" href="#armv8-aarch64-floating-point-registers">25.6.3. ARMv8 AArch64 floating point registers</a></h4>
 <div class="paragraph">
 <p>TODO example.</p>
 </div>
@@ -37563,7 +37612,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-add-vector-instruction"><a class="anchor" href="#armv8-aarch64-add-vector-instruction"></a><a class="link" href="#armv8-aarch64-add-vector-instruction">24.6.3.1. ARMv8 aarch64 add vector instruction</a></h5>
+<h5 id="armv8-aarch64-add-vector-instruction"><a class="anchor" href="#armv8-aarch64-add-vector-instruction"></a><a class="link" href="#armv8-aarch64-add-vector-instruction">25.6.3.1. ARMv8 aarch64 add vector instruction</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/add_vector.S">userland/arch/aarch64/add_vector.S</a></p>
 </div>
@@ -37572,21 +37621,21 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-fadd-instruction"><a class="anchor" href="#armv8-aarch64-fadd-instruction"></a><a class="link" href="#armv8-aarch64-fadd-instruction">24.6.3.2. ARMv8 aarch64 FADD instruction</a></h5>
+<h5 id="armv8-aarch64-fadd-instruction"><a class="anchor" href="#armv8-aarch64-fadd-instruction"></a><a class="link" href="#armv8-aarch64-fadd-instruction">25.6.3.2. ARMv8 aarch64 FADD instruction</a></h5>
 <div class="ulist">
 <ul>
 <li>
-<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/fadd_vector.S">userland/arch/aarch64/fadd_vector.S</a>: see also: <a href="#simd-assembly">Section 22.3, &#8220;SIMD assembly&#8221;</a></p>
+<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/fadd_vector.S">userland/arch/aarch64/fadd_vector.S</a>: see also: <a href="#simd-assembly">Section 23.3, &#8220;SIMD assembly&#8221;</a></p>
 </li>
 <li>
-<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/fadd_scalar.S">userland/arch/aarch64/fadd_scalar.S</a>: see also: <a href="#floating-point-assembly">Section 22.2, &#8220;Floating point assembly&#8221;</a></p>
+<p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/fadd_scalar.S">userland/arch/aarch64/fadd_scalar.S</a>: see also: <a href="#floating-point-assembly">Section 23.2, &#8220;Floating point assembly&#8221;</a></p>
 </li>
 </ul>
 </div>
 <div class="sect5">
-<h6 id="arm-fadd-vs-vadd"><a class="anchor" href="#arm-fadd-vs-vadd"></a><a class="link" href="#arm-fadd-vs-vadd">24.6.3.2.1. ARM FADD vs VADD</a></h6>
+<h6 id="arm-fadd-vs-vadd"><a class="anchor" href="#arm-fadd-vs-vadd"></a><a class="link" href="#arm-fadd-vs-vadd">25.6.3.2.1. ARM FADD vs VADD</a></h6>
 <div class="paragraph">
-<p>It is very confusing, but FADDS and FADDD in Aarch32 are <a href="#gnu-gas-assembler-arm-unified-syntax">pre-UAL</a> for <code>vadd.f32</code> and <code>vadd.f64</code> which we use in this tutorial, see: <a href="#arm-vadd-instruction">Section 24.6.1.2, &#8220;ARM VADD instruction&#8221;</a></p>
+<p>It is very confusing, but FADDS and FADDD in Aarch32 are <a href="#gnu-gas-assembler-arm-unified-syntax">pre-UAL</a> for <code>vadd.f32</code> and <code>vadd.f64</code> which we use in this tutorial, see: <a href="#arm-vadd-instruction">Section 25.6.1.2, &#8220;ARM VADD instruction&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>The same goes for most ARMv7 mnemonics: <code>f*</code> is old, and <code>v*</code> is the newer better syntax.</p>
@@ -37598,12 +37647,12 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 <p>Also keep in mind that fused multiply add is FMADD.</p>
 </div>
 <div class="paragraph">
-<p>Examples at: <a href="#simd-assembly">Section 22.3, &#8220;SIMD assembly&#8221;</a></p>
+<p>Examples at: <a href="#simd-assembly">Section 23.3, &#8220;SIMD assembly&#8221;</a></p>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-aarch64-ld2-instruction"><a class="anchor" href="#armv8-aarch64-ld2-instruction"></a><a class="link" href="#armv8-aarch64-ld2-instruction">24.6.3.3. ARMv8 aarch64 LD2 instruction</a></h5>
+<h5 id="armv8-aarch64-ld2-instruction"><a class="anchor" href="#armv8-aarch64-ld2-instruction"></a><a class="link" href="#armv8-aarch64-ld2-instruction">25.6.3.3. ARMv8 aarch64 LD2 instruction</a></h5>
 <div class="paragraph">
 <p>Example: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/aarch64/ld2.S">userland/arch/aarch64/ld2.S</a></p>
 </div>
@@ -37619,7 +37668,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-simd-bibliography"><a class="anchor" href="#arm-simd-bibliography"></a><a class="link" href="#arm-simd-bibliography">24.6.4. ARM SIMD bibliography</a></h4>
+<h4 id="arm-simd-bibliography"><a class="anchor" href="#arm-simd-bibliography"></a><a class="link" href="#arm-simd-bibliography">25.6.4. ARM SIMD bibliography</a></h4>
 <div class="ulist">
 <ul>
 <li>
@@ -37642,7 +37691,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-sve"><a class="anchor" href="#arm-sve"></a><a class="link" href="#arm-sve">24.6.5. ARM SVE</a></h4>
+<h4 id="arm-sve"><a class="anchor" href="#arm-sve"></a><a class="link" href="#arm-sve">25.6.5. ARM SVE</a></h4>
 <div class="paragraph">
 <p>Scalable Vector Extension.</p>
 </div>
@@ -37697,7 +37746,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 <p>Using SVE normally requires setting the CPACR_EL1.FPEN and ZEN bits, which as of lkmc 29fd625f3fda79f5e0ee6cac43517ba74340d513 + 1 we also enable in our <a href="#baremetal-bootloaders">Baremetal bootloaders</a>, see also: <a href="#aarch64-baremetal-neon-setup">aarch64 baremetal NEON setup</a>.</p>
 </div>
 <div class="sect4">
-<h5 id="arm-sve-vaddl-instruction"><a class="anchor" href="#arm-sve-vaddl-instruction"></a><a class="link" href="#arm-sve-vaddl-instruction">24.6.5.1. ARM SVE VADDL instruction</a></h5>
+<h5 id="arm-sve-vaddl-instruction"><a class="anchor" href="#arm-sve-vaddl-instruction"></a><a class="link" href="#arm-sve-vaddl-instruction">25.6.5.1. ARM SVE VADDL instruction</a></h5>
 <div class="paragraph">
 <p>Get the SVE vector length. The following programs do that and print it to stdout:</p>
 </div>
@@ -37713,7 +37762,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="change-arm-sve-vector-length-in-emulators"><a class="anchor" href="#change-arm-sve-vector-length-in-emulators"></a><a class="link" href="#change-arm-sve-vector-length-in-emulators">24.6.5.2. Change ARM SVE vector length in emulators</a></h5>
+<h5 id="change-arm-sve-vector-length-in-emulators"><a class="anchor" href="#change-arm-sve-vector-length-in-emulators"></a><a class="link" href="#change-arm-sve-vector-length-in-emulators">25.6.5.2. Change ARM SVE vector length in emulators</a></h5>
 <div class="paragraph">
 <p>gem5 covered at: <a href="https://stackoverflow.com/questions/57692765/how-to-change-the-gem5-arm-sve-vector-length" class="bare">https://stackoverflow.com/questions/57692765/how-to-change-the-gem5-arm-sve-vector-length</a></p>
 </div>
@@ -37750,7 +37799,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="sve-bibliography"><a class="anchor" href="#sve-bibliography"></a><a class="link" href="#sve-bibliography">24.6.5.3. SVE bibliography</a></h5>
+<h5 id="sve-bibliography"><a class="anchor" href="#sve-bibliography"></a><a class="link" href="#sve-bibliography">25.6.5.3. SVE bibliography</a></h5>
 <div class="ulist">
 <ul>
 <li>
@@ -37765,7 +37814,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </ul>
 </div>
 <div class="sect5">
-<h6 id="sve-spec"><a class="anchor" href="#sve-spec"></a><a class="link" href="#sve-spec">24.6.5.3.1. SVE spec</a></h6>
+<h6 id="sve-spec"><a class="anchor" href="#sve-spec"></a><a class="link" href="#sve-spec">25.6.5.3.1. SVE spec</a></h6>
 <div class="paragraph">
 <p><a href="#armarm8">ARMv8 architecture reference manual</a> A1.7 "ARMv8 architecture extensions" says:</p>
 </div>
@@ -37790,12 +37839,12 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-thread-synchronization-primitives"><a class="anchor" href="#arm-thread-synchronization-primitives"></a><a class="link" href="#arm-thread-synchronization-primitives">24.7. ARM thread synchronization primitives</a></h3>
+<h3 id="arm-thread-synchronization-primitives"><a class="anchor" href="#arm-thread-synchronization-primitives"></a><a class="link" href="#arm-thread-synchronization-primitives">25.7. ARM thread synchronization primitives</a></h3>
 <div class="paragraph">
 <p>Parent section: <a href="#userland-multithreading">Userland multithreading</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="arm-ldxr-and-stxr-instructions"><a class="anchor" href="#arm-ldxr-and-stxr-instructions"></a><a class="link" href="#arm-ldxr-and-stxr-instructions">24.7.1. ARM LDXR and STXR instructions</a></h4>
+<h4 id="arm-ldxr-and-stxr-instructions"><a class="anchor" href="#arm-ldxr-and-stxr-instructions"></a><a class="link" href="#arm-ldxr-and-stxr-instructions">25.7.1. ARM LDXR and STXR instructions</a></h4>
 <div class="paragraph">
 <p>Parent section: <a href="#atomic-cpp">atomic.cpp</a></p>
 </div>
@@ -37845,7 +37894,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-lse"><a class="anchor" href="#arm-lse"></a><a class="link" href="#arm-lse">24.7.2. ARM Large System Extensions (LSE)</a></h4>
+<h4 id="arm-lse"><a class="anchor" href="#arm-lse"></a><a class="link" href="#arm-lse">25.7.2. ARM Large System Extensions (LSE)</a></h4>
 <div class="paragraph">
 <p>Set of atomic and synchronization primitives added in <a href="#armv8-1-architecture-extension">ARMv8.1 architecture extension</a>.</p>
 </div>
@@ -37872,9 +37921,9 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="armv8-architecture-extensions"><a class="anchor" href="#armv8-architecture-extensions"></a><a class="link" href="#armv8-architecture-extensions">24.8. ARMv8 architecture extensions</a></h3>
+<h3 id="armv8-architecture-extensions"><a class="anchor" href="#armv8-architecture-extensions"></a><a class="link" href="#armv8-architecture-extensions">25.8. ARMv8 architecture extensions</a></h3>
 <div class="sect3">
-<h4 id="armv8-1-architecture-extension"><a class="anchor" href="#armv8-1-architecture-extension"></a><a class="link" href="#armv8-1-architecture-extension">24.8.1. ARMv8.1 architecture extension</a></h4>
+<h4 id="armv8-1-architecture-extension"><a class="anchor" href="#armv8-1-architecture-extension"></a><a class="link" href="#armv8-1-architecture-extension">25.8.1. ARMv8.1 architecture extension</a></h4>
 <div class="paragraph">
 <p><a href="#armarm8-db">ARMv8 architecture reference manual db</a> A1.7.3 "The ARMv8.1 architecture extension"</p>
 </div>
@@ -37888,9 +37937,9 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-assembly-bibliography"><a class="anchor" href="#arm-assembly-bibliography"></a><a class="link" href="#arm-assembly-bibliography">24.9. ARM assembly bibliography</a></h3>
+<h3 id="arm-assembly-bibliography"><a class="anchor" href="#arm-assembly-bibliography"></a><a class="link" href="#arm-assembly-bibliography">25.9. ARM assembly bibliography</a></h3>
 <div class="sect3">
-<h4 id="arm-non-official-bibliography"><a class="anchor" href="#arm-non-official-bibliography"></a><a class="link" href="#arm-non-official-bibliography">24.9.1. ARM non-official bibliography</a></h4>
+<h4 id="arm-non-official-bibliography"><a class="anchor" href="#arm-non-official-bibliography"></a><a class="link" href="#arm-non-official-bibliography">25.9.1. ARM non-official bibliography</a></h4>
 <div class="paragraph">
 <p>Good getting started tutorials:</p>
 </div>
@@ -37912,7 +37961,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-official-bibliography"><a class="anchor" href="#arm-official-bibliography"></a><a class="link" href="#arm-official-bibliography">24.9.2. ARM official bibliography</a></h4>
+<h4 id="arm-official-bibliography"><a class="anchor" href="#arm-official-bibliography"></a><a class="link" href="#arm-official-bibliography">25.9.2. ARM official bibliography</a></h4>
 <div class="paragraph">
 <p>The official manuals were stored in <a href="http://infocenter.arm.com" class="bare">http://infocenter.arm.com</a> but as of 2017 they started to slowly move to <a href="https://developer.arm.com" class="bare">https://developer.arm.com</a>.</p>
 </div>
@@ -37926,7 +37975,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 <p>Bibliography: <a href="https://www.quora.com/Where-can-I-find-the-official-documentation-of-ARM-instruction-set-architectures-ISAs" class="bare">https://www.quora.com/Where-can-I-find-the-official-documentation-of-ARM-instruction-set-architectures-ISAs</a></p>
 </div>
 <div class="sect4">
-<h5 id="armarm7"><a class="anchor" href="#armarm7"></a><a class="link" href="#armarm7">24.9.2.1. ARMv7 architecture reference manual</a></h5>
+<h5 id="armarm7"><a class="anchor" href="#armarm7"></a><a class="link" href="#armarm7">25.9.2.1. ARMv7 architecture reference manual</a></h5>
 <div class="paragraph">
 <p><a href="https://developer.arm.com/products/architecture/a-profile/docs/ddi0406/latest/arm-architecture-reference-manual-armv7-a-and-armv7-r-edition" class="bare">https://developer.arm.com/products/architecture/a-profile/docs/ddi0406/latest/arm-architecture-reference-manual-armv7-a-and-armv7-r-edition</a></p>
 </div>
@@ -37938,7 +37987,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armarm8"><a class="anchor" href="#armarm8"></a><a class="link" href="#armarm8">24.9.2.2. ARMv8 architecture reference manual</a></h5>
+<h5 id="armarm8"><a class="anchor" href="#armarm8"></a><a class="link" href="#armarm8">25.9.2.2. ARMv8 architecture reference manual</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/ddi0487/ca/DDI0487C_a_armv8_arm.pdf" class="bare">https://static.docs.arm.com/ddi0487/ca/DDI0487C_a_armv8_arm.pdf</a></p>
 </div>
@@ -37994,19 +38043,19 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armarm8-db"><a class="anchor" href="#armarm8-db"></a><a class="link" href="#armarm8-db">24.9.2.3. ARMv8 architecture reference manual db</a></h5>
+<h5 id="armarm8-db"><a class="anchor" href="#armarm8-db"></a><a class="link" href="#armarm8-db">25.9.2.3. ARMv8 architecture reference manual db</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/ddi0487/db/DDI0487D_b_armv8_arm.pdf" class="bare">https://static.docs.arm.com/ddi0487/db/DDI0487D_b_armv8_arm.pdf</a></p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armarm8-fa"><a class="anchor" href="#armarm8-fa"></a><a class="link" href="#armarm8-fa">24.9.2.4. ARMv8 architecture reference manual db</a></h5>
+<h5 id="armarm8-fa"><a class="anchor" href="#armarm8-fa"></a><a class="link" href="#armarm8-fa">25.9.2.4. ARMv8 architecture reference manual fa</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/ddi0487/fa/DDI0487F_a_armv8_arm.pdf" class="bare">https://static.docs.arm.com/ddi0487/fa/DDI0487F_a_armv8_arm.pdf</a></p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="armv8-programmers-guide"><a class="anchor" href="#armv8-programmers-guide"></a><a class="link" href="#armv8-programmers-guide">24.9.2.5. Programmer&#8217;s Guide for ARMv8-A</a></h5>
+<h5 id="armv8-programmers-guide"><a class="anchor" href="#armv8-programmers-guide"></a><a class="link" href="#armv8-programmers-guide">25.9.2.5. Programmer&#8217;s Guide for ARMv8-A</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/den0024/a/DEN0024A_v8_architecture_PG.pdf" class="bare">https://static.docs.arm.com/den0024/a/DEN0024A_v8_architecture_PG.pdf</a></p>
 </div>
@@ -38021,7 +38070,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation"><a class="anchor" href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation"></a><a class="link" href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation">24.9.2.6. Arm A64 Instruction Set Architecture: Future Architecture Technologies in the A architecture profile Documentation</a></h5>
+<h5 id="arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation"><a class="anchor" href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation"></a><a class="link" href="#arm-a64-instruction-set-architecture-future-architecture-technologies-in-the-a-architecture-profile-documentation">25.9.2.6. Arm A64 Instruction Set Architecture: Future Architecture Technologies in the A architecture profile Documentation</a></h5>
 <div class="paragraph">
 <p><a href="https://developer.arm.com/docs/ddi0602/b" class="bare">https://developer.arm.com/docs/ddi0602/b</a></p>
 </div>
@@ -38030,7 +38079,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-processor-documentation"><a class="anchor" href="#arm-processor-documentation"></a><a class="link" href="#arm-processor-documentation">24.9.2.7. ARM processor documentation</a></h5>
+<h5 id="arm-processor-documentation"><a class="anchor" href="#arm-processor-documentation"></a><a class="link" href="#arm-processor-documentation">25.9.2.7. ARM processor documentation</a></h5>
 <div class="paragraph">
 <p>ARM also releases documentation specific to each given processor.</p>
 </div>
@@ -38054,7 +38103,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </ul>
 </div>
 <div class="sect5">
-<h6 id="arm-cortex15-trm"><a class="anchor" href="#arm-cortex15-trm"></a><a class="link" href="#arm-cortex15-trm">24.9.2.7.1. ARM Cortex-A15 MPCore Processor Technical Reference Manual r4p0</a></h6>
+<h6 id="arm-cortex15-trm"><a class="anchor" href="#arm-cortex15-trm"></a><a class="link" href="#arm-cortex15-trm">25.9.2.7.1. ARM Cortex-A15 MPCore Processor Technical Reference Manual r4p0</a></h6>
 <div class="paragraph">
 <p><a href="http://infocenter.arm.com/help/topic/com.arm.doc.ddi0438i/DDI0438I_cortex_a15_r4p0_trm.pdf" class="bare">http://infocenter.arm.com/help/topic/com.arm.doc.ddi0438i/DDI0438I_cortex_a15_r4p0_trm.pdf</a></p>
 </div>
@@ -38064,13 +38113,13 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-cortex-a77-trm"><a class="anchor" href="#arm-cortex-a77-trm"></a><a class="link" href="#arm-cortex-a77-trm">24.9.2.8. Arm Cortex‑A77 Technical Reference Manual r1p1</a></h5>
+<h5 id="arm-cortex-a77-trm"><a class="anchor" href="#arm-cortex-a77-trm"></a><a class="link" href="#arm-cortex-a77-trm">25.9.2.8. Arm Cortex‑A77 Technical Reference Manual r1p1</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/101111/0101/arm_cortex_a77_trm_101111_0101_04_en.pdf" class="bare">https://static.docs.arm.com/101111/0101/arm_cortex_a77_trm_101111_0101_04_en.pdf</a></p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-cortex-a77-sog"><a class="anchor" href="#arm-cortex-a77-sog"></a><a class="link" href="#arm-cortex-a77-sog">24.9.2.9. Arm Cortex‑A77 Software Optimization Guide r1p1</a></h5>
+<h5 id="arm-cortex-a77-sog"><a class="anchor" href="#arm-cortex-a77-sog"></a><a class="link" href="#arm-cortex-a77-sog">25.9.2.9. Arm Cortex‑A77 Software Optimization Guide r1p1</a></h5>
 <div class="paragraph">
 <p><a href="https://static.docs.arm.com/swog011050/c/Arm_Cortex-A77_Software_Optimization_Guide.pdf" class="bare">https://static.docs.arm.com/swog011050/c/Arm_Cortex-A77_Software_Optimization_Guide.pdf</a></p>
 </div>
@@ -38080,7 +38129,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect1">
-<h2 id="elf"><a class="anchor" href="#elf"></a><a class="link" href="#elf">25. ELF</a></h2>
+<h2 id="elf"><a class="anchor" href="#elf"></a><a class="link" href="#elf">26. ELF</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Executable_and_Linkable_Format" class="bare">https://en.wikipedia.org/wiki/Executable_and_Linkable_Format</a></p>
@@ -38094,7 +38143,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect1">
-<h2 id="ieee-754"><a class="anchor" href="#ieee-754"></a><a class="link" href="#ieee-754">26. IEEE 754</a></h2>
+<h2 id="ieee-754"><a class="anchor" href="#ieee-754"></a><a class="link" href="#ieee-754">27. IEEE 754</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/IEEE_754" class="bare">https://en.wikipedia.org/wiki/IEEE_754</a></p>
@@ -38124,13 +38173,13 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect1">
-<h2 id="baremetal"><a class="anchor" href="#baremetal"></a><a class="link" href="#baremetal">27. Baremetal</a></h2>
+<h2 id="baremetal"><a class="anchor" href="#baremetal"></a><a class="link" href="#baremetal">28. Baremetal</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>Getting started at: <a href="#baremetal-setup">Section 1.9, &#8220;Baremetal setup&#8221;</a></p>
 </div>
 <div class="sect2">
-<h3 id="baremetal-gdb-step-debug"><a class="anchor" href="#baremetal-gdb-step-debug"></a><a class="link" href="#baremetal-gdb-step-debug">27.1. Baremetal GDB step debug</a></h3>
+<h3 id="baremetal-gdb-step-debug"><a class="anchor" href="#baremetal-gdb-step-debug"></a><a class="link" href="#baremetal-gdb-step-debug">28.1. Baremetal GDB step debug</a></h3>
 <div class="paragraph">
 <p>GDB step debug works on baremetal exactly as it does on the Linux kernel, which is described at: <a href="#gdb">Section 2, &#8220;GDB step debug&#8221;</a>.</p>
 </div>
@@ -38201,7 +38250,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-bootloaders"><a class="anchor" href="#baremetal-bootloaders"></a><a class="link" href="#baremetal-bootloaders">27.2. Baremetal bootloaders</a></h3>
+<h3 id="baremetal-bootloaders"><a class="anchor" href="#baremetal-bootloaders"></a><a class="link" href="#baremetal-bootloaders">28.2. Baremetal bootloaders</a></h3>
 <div class="paragraph">
 <p>As can be seen from <a href="#baremetal-gdb-step-debug">Baremetal GDB step debug</a>, all examples under <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/baremetal/">baremetal/</a>, with the exception of <code>baremetal/arch/&lt;arch&gt;/no_bootloader</code>, start from our tiny bootloaders:</p>
 </div>
@@ -38237,7 +38286,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 <p>the stack pointer</p>
 </li>
 <li>
-<p>NEON: <a href="#aarch64-baremetal-neon-setup">Section 27.11.2, &#8220;aarch64 baremetal NEON setup&#8221;</a></p>
+<p>NEON: <a href="#aarch64-baremetal-neon-setup">Section 28.11.2, &#8220;aarch64 baremetal NEON setup&#8221;</a></p>
 </li>
 <li>
 <p>TODO: we don&#8217;t do this currently but maybe we should set up BSS</p>
@@ -38265,7 +38314,7 @@ AArch64, see Procedure Call Standard for the ARM 64-bit Architecture.</p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-linker-script"><a class="anchor" href="#baremetal-linker-script"></a><a class="link" href="#baremetal-linker-script">27.3. Baremetal linker script</a></h3>
+<h3 id="baremetal-linker-script"><a class="anchor" href="#baremetal-linker-script"></a><a class="link" href="#baremetal-linker-script">28.3. Baremetal linker script</a></h3>
 <div class="paragraph">
 <p>For things to work in baremetal, we often have to lay out memory in specific ways.</p>
 </div>
@@ -38294,7 +38343,7 @@ lkmc_heap_top = .;</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-command-line-arguments"><a class="anchor" href="#baremetal-command-line-arguments"></a><a class="link" href="#baremetal-command-line-arguments">27.4. Baremetal command line arguments</a></h3>
+<h3 id="baremetal-command-line-arguments"><a class="anchor" href="#baremetal-command-line-arguments"></a><a class="link" href="#baremetal-command-line-arguments">28.4. Baremetal command line arguments</a></h3>
 <div class="paragraph">
 <p>QEMU and gem5 currently support baremetal CLI arguments!</p>
 </div>
@@ -38343,7 +38392,7 @@ cc</pre>
 <p>It is worth noting that e.g. ARM has a <a href="#semihosting">Semihosting</a> mechanism for loading CLI arguments through <code>SYS_GET_CMDLINE</code>, but our mechanism works in principle for any ISA.</p>
 </div>
 <div class="sect3">
-<h4 id="gem5-baremetal-arm-cli-args"><a class="anchor" href="#gem5-baremetal-arm-cli-args"></a><a class="link" href="#gem5-baremetal-arm-cli-args">27.4.1. gem5 baremetal arm CLI args</a></h4>
+<h4 id="gem5-baremetal-arm-cli-args"><a class="anchor" href="#gem5-baremetal-arm-cli-args"></a><a class="link" href="#gem5-baremetal-arm-cli-args">28.4.1. gem5 baremetal arm CLI args</a></h4>
 <div class="paragraph">
 <p>Currently not supported, so we just hardcode argc 0 on the <a href="#baremetal-bootloaders">arm baremetal bootloader</a>.</p>
 </div>
@@ -38353,7 +38402,7 @@ cc</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="semihosting"><a class="anchor" href="#semihosting"></a><a class="link" href="#semihosting">27.5. Semihosting</a></h3>
+<h3 id="semihosting"><a class="anchor" href="#semihosting"></a><a class="link" href="#semihosting">28.5. Semihosting</a></h3>
 <div class="paragraph">
 <p>Semihosting is a publicly documented interface specified by ARM Holdings that allows us to do some magic operations very useful in development, such as writing to the terminal or reading and writing host files.</p>
 </div>
@@ -38471,9 +38520,9 @@ svc 0x00123456</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="gem5-semihosting"><a class="anchor" href="#gem5-semihosting"></a><a class="link" href="#gem5-semihosting">27.5.1. gem5 semihosting</a></h4>
+<h4 id="gem5-semihosting"><a class="anchor" href="#gem5-semihosting"></a><a class="link" href="#gem5-semihosting">28.5.1. gem5 semihosting</a></h4>
 <div class="paragraph">
-<p>For gem5, you need:</p>
+<p>For gem5, you need <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/patches/manual/gem5-semihost.patch">patches/manual/gem5-semihost.patch</a>:</p>
 </div>
 <div class="literalblock">
 <div class="content">
@@ -38486,7 +38535,7 @@ svc 0x00123456</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="gem5-baremetal-carriage-return"><a class="anchor" href="#gem5-baremetal-carriage-return"></a><a class="link" href="#gem5-baremetal-carriage-return">27.6. gem5 baremetal carriage return</a></h3>
+<h3 id="gem5-baremetal-carriage-return"><a class="anchor" href="#gem5-baremetal-carriage-return"></a><a class="link" href="#gem5-baremetal-carriage-return">28.6. gem5 baremetal carriage return</a></h3>
 <div class="paragraph">
 <p>TODO: our example is printing newlines without automatic carriage return <code>\r</code> as in:</p>
 </div>
@@ -38509,7 +38558,7 @@ svc 0x00123456</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-host-packaged-toolchain"><a class="anchor" href="#baremetal-host-packaged-toolchain"></a><a class="link" href="#baremetal-host-packaged-toolchain">27.7. Baremetal host packaged toolchain</a></h3>
+<h3 id="baremetal-host-packaged-toolchain"><a class="anchor" href="#baremetal-host-packaged-toolchain"></a><a class="link" href="#baremetal-host-packaged-toolchain">28.7. Baremetal host packaged toolchain</a></h3>
 <div class="paragraph">
 <p>For <code>arm</code>, some baremetal examples compile fine with:</p>
 </div>
@@ -38545,13 +38594,13 @@ collect2: error: ld returned 1 exit status</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-cpp"><a class="anchor" href="#baremetal-cpp"></a><a class="link" href="#baremetal-cpp">27.8. Baremetal C++</a></h3>
+<h3 id="baremetal-cpp"><a class="anchor" href="#baremetal-cpp"></a><a class="link" href="#baremetal-cpp">28.8. Baremetal C++</a></h3>
 <div class="paragraph">
 <p>Didn&#8217;t get it working, tracking at: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/issues/119" class="bare">https://github.com/cirosantilli/linux-kernel-module-cheat/issues/119</a></p>
 </div>
 </div>
 <div class="sect2">
-<h3 id="gdb-builtin-cpu-simulator"><a class="anchor" href="#gdb-builtin-cpu-simulator"></a><a class="link" href="#gdb-builtin-cpu-simulator">27.9. GDB builtin CPU simulator</a></h3>
+<h3 id="gdb-builtin-cpu-simulator"><a class="anchor" href="#gdb-builtin-cpu-simulator"></a><a class="link" href="#gdb-builtin-cpu-simulator">28.9. GDB builtin CPU simulator</a></h3>
 <div class="paragraph">
 <p>It is incredible, but GDB also has a CPU simulator inside of it as documented at: <a href="https://sourceware.org/gdb/onlinedocs/gdb/Target-Commands.html" class="bare">https://sourceware.org/gdb/onlinedocs/gdb/Target-Commands.html</a></p>
 </div>
@@ -38611,7 +38660,7 @@ starti</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="gdb-builtin-cpu-simulator-userland"><a class="anchor" href="#gdb-builtin-cpu-simulator-userland"></a><a class="link" href="#gdb-builtin-cpu-simulator-userland">27.9.1. GDB builtin CPU simulator userland</a></h4>
+<h4 id="gdb-builtin-cpu-simulator-userland"><a class="anchor" href="#gdb-builtin-cpu-simulator-userland"></a><a class="link" href="#gdb-builtin-cpu-simulator-userland">28.9.1. GDB builtin CPU simulator userland</a></h4>
 <div class="paragraph">
 <p>Since I had this compiled, I also decided to try it out on userland.</p>
 </div>
@@ -38646,7 +38695,7 @@ starti</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-baremetal"><a class="anchor" href="#arm-baremetal"></a><a class="link" href="#arm-baremetal">27.10. ARM baremetal</a></h3>
+<h3 id="arm-baremetal"><a class="anchor" href="#arm-baremetal"></a><a class="link" href="#arm-baremetal">28.10. ARM baremetal</a></h3>
 <div class="paragraph">
 <p>In this section we will focus on learning ARM architecture concepts that can only be learnt on baremetal setups.</p>
 </div>
@@ -38654,7 +38703,7 @@ starti</pre>
 <p>Userland information can be found at: <a href="https://github.com/cirosantilli/arm-assembly-cheat" class="bare">https://github.com/cirosantilli/arm-assembly-cheat</a></p>
 </div>
 <div class="sect3">
-<h4 id="arm-exception-levels"><a class="anchor" href="#arm-exception-levels"></a><a class="link" href="#arm-exception-levels">27.10.1. ARM exception levels</a></h4>
+<h4 id="arm-exception-levels"><a class="anchor" href="#arm-exception-levels"></a><a class="link" href="#arm-exception-levels">28.10.1. ARM exception levels</a></h4>
 <div class="paragraph">
 <p>ARM exception levels are analogous to x86 <a href="#ring0">rings</a>.</p>
 </div>
@@ -38783,13 +38832,13 @@ CurrentEL.EL 0x3</pre>
 <p>According to <a href="#armarm7">ARMv7 architecture reference manual</a>, access to that register is controlled by other registers <code>NSACR.{CP11, CP10}</code> and <code>HCPTR</code> so those must be turned off, but I&#8217;m too lazy to investigate now; even just trying to dump those registers in <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/arch/arm/dump_regs.c">userland/arch/arm/dump_regs.c</a> also leads to exceptions&#8230;&#8203;</p>
 </div>
 <div class="sect4">
-<h5 id="arm-change-exception-level"><a class="anchor" href="#arm-change-exception-level"></a><a class="link" href="#arm-change-exception-level">27.10.1.1. ARM change exception level</a></h5>
+<h5 id="arm-change-exception-level"><a class="anchor" href="#arm-change-exception-level"></a><a class="link" href="#arm-change-exception-level">28.10.1.1. ARM change exception level</a></h5>
 <div class="paragraph">
 <p>TODO. Create a minimal runnable example of going into EL0 and jumping to EL1.</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-sp0-vs-spx"><a class="anchor" href="#arm-sp0-vs-spx"></a><a class="link" href="#arm-sp0-vs-spx">27.10.1.2. ARM SP0 vs SPx</a></h5>
+<h5 id="arm-sp0-vs-spx"><a class="anchor" href="#arm-sp0-vs-spx"></a><a class="link" href="#arm-sp0-vs-spx">28.10.1.2. ARM SP0 vs SPx</a></h5>
 <div class="paragraph">
 <p>See <a href="#armarm8-db">ARMv8 architecture reference manual db</a> D1.6.2 "The stack pointer registers".</p>
 </div>
@@ -38808,7 +38857,7 @@ CurrentEL.EL 0x3</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-svc-instruction"><a class="anchor" href="#arm-svc-instruction"></a><a class="link" href="#arm-svc-instruction">27.10.2. ARM SVC instruction</a></h4>
+<h4 id="arm-svc-instruction"><a class="anchor" href="#arm-svc-instruction"></a><a class="link" href="#arm-svc-instruction">28.10.2. ARM SVC instruction</a></h4>
 <div class="paragraph">
 <p>This is the most basic example of exception handling we have.</p>
 </div>
@@ -39157,7 +39206,7 @@ IN: main
 </ul>
 </div>
 <div class="sect4">
-<h5 id="armv8-exception-vector-table-format"><a class="anchor" href="#armv8-exception-vector-table-format"></a><a class="link" href="#armv8-exception-vector-table-format">27.10.2.1. ARMv8 exception vector table format</a></h5>
+<h5 id="armv8-exception-vector-table-format"><a class="anchor" href="#armv8-exception-vector-table-format"></a><a class="link" href="#armv8-exception-vector-table-format">28.10.2.1. ARMv8 exception vector table format</a></h5>
 <div class="paragraph">
 <p>The vector table format is described on <a href="#armarm8">ARMv8 architecture reference manual</a> Table D1-7 "Vector offsets from vector table base address".</p>
 </div>
@@ -39297,29 +39346,29 @@ IN: main
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-esr-register"><a class="anchor" href="#arm-esr-register"></a><a class="link" href="#arm-esr-register">27.10.2.2. ARM ESR register</a></h5>
+<h5 id="arm-esr-register"><a class="anchor" href="#arm-esr-register"></a><a class="link" href="#arm-esr-register">28.10.2.2. ARM ESR register</a></h5>
 <div class="paragraph">
 <p>Exception Syndrome Register.</p>
 </div>
 <div class="paragraph">
-<p>See example at: <a href="#arm-svc-instruction">Section 27.10.2, &#8220;ARM SVC instruction&#8221;</a></p>
+<p>See example at: <a href="#arm-svc-instruction">Section 28.10.2, &#8220;ARM SVC instruction&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>Documentation: <a href="#armarm8-db">ARMv8 architecture reference manual db</a> D12.2.36 "ESR_EL1, Exception Syndrome Register (EL1)".</p>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-elr-register"><a class="anchor" href="#arm-elr-register"></a><a class="link" href="#arm-elr-register">27.10.2.3. ARM ELR register</a></h5>
+<h5 id="arm-elr-register"><a class="anchor" href="#arm-elr-register"></a><a class="link" href="#arm-elr-register">28.10.2.3. ARM ELR register</a></h5>
 <div class="paragraph">
 <p>Exception Link Register.</p>
 </div>
 <div class="paragraph">
-<p>See the example at: <a href="#arm-svc-instruction">Section 27.10.2, &#8220;ARM SVC instruction&#8221;</a></p>
+<p>See the example at: <a href="#arm-svc-instruction">Section 28.10.2, &#8220;ARM SVC instruction&#8221;</a></p>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-baremetal-multicore"><a class="anchor" href="#arm-baremetal-multicore"></a><a class="link" href="#arm-baremetal-multicore">27.10.3. ARM baremetal multicore</a></h4>
+<h4 id="arm-baremetal-multicore"><a class="anchor" href="#arm-baremetal-multicore"></a><a class="link" href="#arm-baremetal-multicore">28.10.3. ARM baremetal multicore</a></h4>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -39398,7 +39447,7 @@ IN: main
 <p>Bibliography: <a href="https://stackoverflow.com/questions/980999/what-does-multicore-assembly-language-look-like/33651438#33651438" class="bare">https://stackoverflow.com/questions/980999/what-does-multicore-assembly-language-look-like/33651438#33651438</a></p>
 </div>
 <div class="sect4">
-<h5 id="arm-wfe-and-sev-instructions"><a class="anchor" href="#arm-wfe-and-sev-instructions"></a><a class="link" href="#arm-wfe-and-sev-instructions">27.10.3.1. ARM WFE and SEV instructions</a></h5>
+<h5 id="arm-wfe-and-sev-instructions"><a class="anchor" href="#arm-wfe-and-sev-instructions"></a><a class="link" href="#arm-wfe-and-sev-instructions">28.10.3.1. ARM WFE and SEV instructions</a></h5>
 <div class="paragraph">
 <p>The WFE and SEV instructions are just hints: a compliant implementation can treat them as NOPs.</p>
 </div>
@@ -39551,7 +39600,7 @@ IN: main
 <p>For how userland spinlocks and mutexes are implemented see <a href="#userland-mutex-implementation">Userland mutex implementation</a>.</p>
 </div>
 <div class="sect5">
-<h6 id="arm-wfe-global-monitor-events"><a class="anchor" href="#arm-wfe-global-monitor-events"></a><a class="link" href="#arm-wfe-global-monitor-events">27.10.3.1.1. ARM WFE global monitor events</a></h6>
+<h6 id="arm-wfe-global-monitor-events"><a class="anchor" href="#arm-wfe-global-monitor-events"></a><a class="link" href="#arm-wfe-global-monitor-events">28.10.3.1.1. ARM WFE global monitor events</a></h6>
 <div class="paragraph">
 <p>Examples:</p>
 </div>
@@ -39591,7 +39640,7 @@ IN: main
 </div>
 </div>
 <div class="sect5">
-<h6 id="wfe-from-userland"><a class="anchor" href="#wfe-from-userland"></a><a class="link" href="#wfe-from-userland">27.10.3.1.2. WFE from userland</a></h6>
+<h6 id="wfe-from-userland"><a class="anchor" href="#wfe-from-userland"></a><a class="link" href="#wfe-from-userland">28.10.3.1.2. WFE from userland</a></h6>
 <div class="paragraph">
 <p>WFE and SEV are usable from userland, and are part of an efficient spinlock implementation. Userland should arguably stay away from spinlocks, however, and rather use the <a href="#futex-system-call">futex system call</a>, which allows for non-busy sleep, or just stick to mutexes.</p>
 </div>
@@ -39698,7 +39747,7 @@ IN: main
 </div>
 </div>
 <div class="sect5">
-<h6 id="armv8-spinlock-pattern"><a class="anchor" href="#armv8-spinlock-pattern"></a><a class="link" href="#armv8-spinlock-pattern">27.10.3.1.3. ARMv8 spinlock pattern</a></h6>
+<h6 id="armv8-spinlock-pattern"><a class="anchor" href="#armv8-spinlock-pattern"></a><a class="link" href="#armv8-spinlock-pattern">28.10.3.1.3. ARMv8 spinlock pattern</a></h6>
 <div class="paragraph">
 <p><a href="http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.faqs/ka16277.html" class="bare">http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.faqs/ka16277.html</a></p>
 </div>
@@ -39717,7 +39766,7 @@ IN: main
 </div>
 </div>
 <div class="sect5">
-<h6 id="gem5-arm-wfe"><a class="anchor" href="#gem5-arm-wfe"></a><a class="link" href="#gem5-arm-wfe">27.10.3.1.4. gem5 ARM WFE</a></h6>
+<h6 id="gem5-arm-wfe"><a class="anchor" href="#gem5-arm-wfe"></a><a class="link" href="#gem5-arm-wfe">28.10.3.1.4. gem5 ARM WFE</a></h6>
 <div class="paragraph">
 <p>gem5 390a74f59934b85d91489f8a563450d8321b602d does not sleep on the first WFE on either syscall emulation or full system, because the code does:</p>
 </div>
@@ -39759,14 +39808,14 @@ IN: main
 </div>
 </div>
 <div class="sect5">
-<h6 id="arm-yield-instruction"><a class="anchor" href="#arm-yield-instruction"></a><a class="link" href="#arm-yield-instruction">27.10.3.1.5. ARM YIELD instruction</a></h6>
+<h6 id="arm-yield-instruction"><a class="anchor" href="#arm-yield-instruction"></a><a class="link" href="#arm-yield-instruction">28.10.3.1.5. ARM YIELD instruction</a></h6>
 <div class="paragraph">
 <p><a href="https://stackoverflow.com/questions/59311066/how-does-the-arm-yield-instruction-inform-other-threads-that-they-could-start-a" class="bare">https://stackoverflow.com/questions/59311066/how-does-the-arm-yield-instruction-inform-other-threads-that-they-could-start-a</a></p>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-ldaxr-and-stlxr-instructions"><a class="anchor" href="#arm-ldaxr-and-stlxr-instructions"></a><a class="link" href="#arm-ldaxr-and-stlxr-instructions">27.10.3.2. ARM LDAXR and STLXR instructions</a></h5>
+<h5 id="arm-ldaxr-and-stlxr-instructions"><a class="anchor" href="#arm-ldaxr-and-stlxr-instructions"></a><a class="link" href="#arm-ldaxr-and-stlxr-instructions">28.10.3.2. ARM LDAXR and STLXR instructions</a></h5>
 <div class="paragraph">
 <p>Can be used to implement atomic variables, see also:</p>
 </div>
@@ -39785,7 +39834,7 @@ IN: main
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-psci"><a class="anchor" href="#arm-psci"></a><a class="link" href="#arm-psci">27.10.3.3. ARM PSCI</a></h5>
+<h5 id="arm-psci"><a class="anchor" href="#arm-psci"></a><a class="link" href="#arm-psci">28.10.3.3. ARM PSCI</a></h5>
 <div class="paragraph">
 <p>In QEMU, CPU 1 starts in a halted state. This can be observed from GDB, where:</p>
 </div>
@@ -39835,14 +39884,14 @@ IN: main
 </div>
 </div>
 <div class="sect4">
-<h5 id="arm-dmb-instruction"><a class="anchor" href="#arm-dmb-instruction"></a><a class="link" href="#arm-dmb-instruction">27.10.3.4. ARM DMB instruction</a></h5>
+<h5 id="arm-dmb-instruction"><a class="anchor" href="#arm-dmb-instruction"></a><a class="link" href="#arm-dmb-instruction">28.10.3.4. ARM DMB instruction</a></h5>
 <div class="paragraph">
 <p>TODO: create and study a minimal example in gem5 where the DMB instruction leads to fewer cycles: <a href="https://stackoverflow.com/questions/15491751/real-life-use-cases-of-barriers-dsb-dmb-isb-in-arm" class="bare">https://stackoverflow.com/questions/15491751/real-life-use-cases-of-barriers-dsb-dmb-isb-in-arm</a></p>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-timer"><a class="anchor" href="#arm-timer"></a><a class="link" href="#arm-timer">27.10.4. ARM timer</a></h4>
+<h4 id="arm-timer"><a class="anchor" href="#arm-timer"></a><a class="link" href="#arm-timer">28.10.4. ARM timer</a></h4>
 <div class="paragraph">
 <p>The ARM timer is the simplest way to generate hardware interrupts periodically, and therefore serves as the simplest example of <a href="#arm-gic">ARM GIC</a> usage.</p>
 </div>
@@ -39995,7 +40044,7 @@ cntvct_el0 0x3CF516F</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-gic"><a class="anchor" href="#arm-gic"></a><a class="link" href="#arm-gic">27.10.5. ARM GIC</a></h4>
+<h4 id="arm-gic"><a class="anchor" href="#arm-gic"></a><a class="link" href="#arm-gic">28.10.5. ARM GIC</a></h4>
 <div class="paragraph">
 <p>Generic Interrupt Controller.</p>
 </div>
@@ -40037,7 +40086,7 @@ cntvct_el0 0x3CF516F</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-paging"><a class="anchor" href="#arm-paging"></a><a class="link" href="#arm-paging">27.10.6. ARM paging</a></h4>
+<h4 id="arm-paging"><a class="anchor" href="#arm-paging"></a><a class="link" href="#arm-paging">28.10.6. ARM paging</a></h4>
 <div class="paragraph">
 <p>TODO create a minimal working aarch64 example analogous to the x86 one at: <a href="https://github.com/cirosantilli/x86-bare-metal-examples/blob/6dc9a73830fc05358d8d66128f740ef9906f7677/paging.S" class="bare">https://github.com/cirosantilli/x86-bare-metal-examples/blob/6dc9a73830fc05358d8d66128f740ef9906f7677/paging.S</a></p>
 </div>
@@ -40067,9 +40116,9 @@ cntvct_el0 0x3CF516F</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="arm-baremetal-bibliography"><a class="anchor" href="#arm-baremetal-bibliography"></a><a class="link" href="#arm-baremetal-bibliography">27.10.7. ARM baremetal bibliography</a></h4>
+<h4 id="arm-baremetal-bibliography"><a class="anchor" href="#arm-baremetal-bibliography"></a><a class="link" href="#arm-baremetal-bibliography">28.10.7. ARM baremetal bibliography</a></h4>
 <div class="paragraph">
-<p>First, also consider the userland bibliography: <a href="#arm-assembly-bibliography">Section 24.9, &#8220;ARM assembly bibliography&#8221;</a>.</p>
+<p>First, also consider the userland bibliography: <a href="#arm-assembly-bibliography">Section 25.9, &#8220;ARM assembly bibliography&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>The most useful ARM baremetal example sets we&#8217;ve seen so far are:</p>
@@ -40094,7 +40143,7 @@ cntvct_el0 0x3CF516F</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="nienfengyaoarmv8-bare-metal"><a class="anchor" href="#nienfengyaoarmv8-bare-metal"></a><a class="link" href="#nienfengyaoarmv8-bare-metal">27.10.7.1. NienfengYao/armv8-bare-metal</a></h5>
+<h5 id="nienfengyaoarmv8-bare-metal"><a class="anchor" href="#nienfengyaoarmv8-bare-metal"></a><a class="link" href="#nienfengyaoarmv8-bare-metal">28.10.7.1. NienfengYao/armv8-bare-metal</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/NienfengYao/armv8-bare-metal" class="bare">https://github.com/NienfengYao/armv8-bare-metal</a></p>
 </div>
@@ -40153,7 +40202,7 @@ cntvct_el0 0x3CF516F</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="tukl-msdgem5-bare-metal"><a class="anchor" href="#tukl-msdgem5-bare-metal"></a><a class="link" href="#tukl-msdgem5-bare-metal">27.10.7.2. tukl-msd/gem5.bare-metal</a></h5>
+<h5 id="tukl-msdgem5-bare-metal"><a class="anchor" href="#tukl-msdgem5-bare-metal"></a><a class="link" href="#tukl-msdgem5-bare-metal">28.10.7.2. tukl-msd/gem5.bare-metal</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/tukl-msd/gem5.bare-metal" class="bare">https://github.com/tukl-msd/gem5.bare-metal</a></p>
 </div>
@@ -40195,7 +40244,7 @@ make CROSS_COMPILE_DIR=/usr/bin
 </div>
 </div>
 <div class="sect2">
-<h3 id="how-we-got-some-baremetal-stuff-to-work"><a class="anchor" href="#how-we-got-some-baremetal-stuff-to-work"></a><a class="link" href="#how-we-got-some-baremetal-stuff-to-work">27.11. How we got some baremetal stuff to work</a></h3>
+<h3 id="how-we-got-some-baremetal-stuff-to-work"><a class="anchor" href="#how-we-got-some-baremetal-stuff-to-work"></a><a class="link" href="#how-we-got-some-baremetal-stuff-to-work">28.11. How we got some baremetal stuff to work</a></h3>
 <div class="paragraph">
 <p>It is nice when things just work.</p>
 </div>
@@ -40203,7 +40252,7 @@ make CROSS_COMPILE_DIR=/usr/bin
 <p>But you can also learn a thing or two from how I actually made them work in the first place.</p>
 </div>
 <div class="sect3">
-<h4 id="find-the-uart-address"><a class="anchor" href="#find-the-uart-address"></a><a class="link" href="#find-the-uart-address">27.11.1. Find the UART address</a></h4>
+<h4 id="find-the-uart-address"><a class="anchor" href="#find-the-uart-address"></a><a class="link" href="#find-the-uart-address">28.11.1. Find the UART address</a></h4>
 <div class="paragraph">
 <p>Enter the QEMU console:</p>
 </div>
@@ -40239,7 +40288,7 @@ make CROSS_COMPILE_DIR=/usr/bin
 </div>
 </div>
 <div class="sect3">
-<h4 id="aarch64-baremetal-neon-setup"><a class="anchor" href="#aarch64-baremetal-neon-setup"></a><a class="link" href="#aarch64-baremetal-neon-setup">27.11.2. aarch64 baremetal NEON setup</a></h4>
+<h4 id="aarch64-baremetal-neon-setup"><a class="anchor" href="#aarch64-baremetal-neon-setup"></a><a class="link" href="#aarch64-baremetal-neon-setup">28.11.2. aarch64 baremetal NEON setup</a></h4>
 <div class="paragraph">
 <p>Inside <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/baremetal/lib/aarch64.S">baremetal/lib/aarch64.S</a> there is a chunk of code that enables floating point operations:</p>
 </div>
@@ -40363,7 +40412,7 @@ ISB</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="baremetal-tests"><a class="anchor" href="#baremetal-tests"></a><a class="link" href="#baremetal-tests">27.12. Baremetal tests</a></h3>
+<h3 id="baremetal-tests"><a class="anchor" href="#baremetal-tests"></a><a class="link" href="#baremetal-tests">28.12. Baremetal tests</a></h3>
 <div class="paragraph">
 <p>Baremetal tests work exactly like <a href="#user-mode-tests">User mode tests</a>, except that you have to add the <code>--mode baremetal</code> option, for example:</p>
 </div>
@@ -40376,13 +40425,13 @@ ISB</pre>
 <p>In baremetal, we detect if tests failed by parsing logs for the <a href="#magic-failure-string">Magic failure string</a>.</p>
 </div>
 <div class="paragraph">
-<p>See: <a href="#test-this-repo">Section 33.16, &#8220;Test this repo&#8221;</a> for more useful testing tips.</p>
+<p>See: <a href="#test-this-repo">Section 34.16, &#8220;Test this repo&#8221;</a> for more useful testing tips.</p>
 </div>
 </div>
 </div>
 </div>
 <div class="sect1">
-<h2 id="android"><a class="anchor" href="#android"></a><a class="link" href="#android">28. Android</a></h2>
+<h2 id="android"><a class="anchor" href="#android"></a><a class="link" href="#android">29. Android</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>Remember: Android AOSP is a huge undocumented piece of bloatware. Its integration into this repo will likely never be super good. See also: <a href="https://cirosantilli.com#android" class="bare">https://cirosantilli.com#android</a></p>
@@ -40430,7 +40479,7 @@ ISB</pre>
 <p>Tested on: <code>8.1.0_r60</code>.</p>
 </div>
 <div class="sect2">
-<h3 id="android-image-structure"><a class="anchor" href="#android-image-structure"></a><a class="link" href="#android-image-structure">28.1. Android image structure</a></h3>
+<h3 id="android-image-structure"><a class="anchor" href="#android-image-structure"></a><a class="link" href="#android-image-structure">29.1. Android image structure</a></h3>
 <div class="paragraph">
 <p><a href="https://source.android.com/devices/bootloader/partitions-images" class="bare">https://source.android.com/devices/bootloader/partitions-images</a></p>
 </div>
@@ -40514,7 +40563,7 @@ vendor-qemu</pre>
 <p>Tested on: <code>8.1.0_r60</code>.</p>
 </div>
 <div class="sect3">
-<h4 id="android-images-read-only"><a class="anchor" href="#android-images-read-only"></a><a class="link" href="#android-images-read-only">28.1.1. Android images read-only</a></h4>
+<h4 id="android-images-read-only"><a class="anchor" href="#android-images-read-only"></a><a class="link" href="#android-images-read-only">29.1.1. Android images read-only</a></h4>
 <div class="paragraph">
 <p>From <code>mount</code>, we can see that some of the mounted images are <code>ro</code>.</p>
 </div>
@@ -40587,7 +40636,7 @@ date &gt;/system/a</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="android-data-partition"><a class="anchor" href="#android-data-partition"></a><a class="link" href="#android-data-partition">28.1.2. Android /data partition</a></h4>
+<h4 id="android-data-partition"><a class="anchor" href="#android-data-partition"></a><a class="link" href="#android-data-partition">29.1.2. Android /data partition</a></h4>
 <div class="paragraph">
 <p>When I install an app like F-Droid, it goes under <code>/data</code> according to:</p>
 </div>
@@ -40648,7 +40697,7 @@ date &gt;/system/a</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="install-android-apps"><a class="anchor" href="#install-android-apps"></a><a class="link" href="#install-android-apps">28.2. Install Android apps</a></h3>
+<h3 id="install-android-apps"><a class="anchor" href="#install-android-apps"></a><a class="link" href="#install-android-apps">29.2. Install Android apps</a></h3>
 <div class="paragraph">
 <p>I don&#8217;t know how to download files from the web on vanilla Android: the default browser does not download anything, and there is no <code>wget</code>:</p>
 </div>
@@ -40698,7 +40747,7 @@ date &gt;/system/a</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="android-init"><a class="anchor" href="#android-init"></a><a class="link" href="#android-init">28.3. Android init</a></h3>
+<h3 id="android-init"><a class="anchor" href="#android-init"></a><a class="link" href="#android-init">29.3. Android init</a></h3>
 <div class="paragraph">
 <p>For Linux in general, see: <a href="#init">Section 6, &#8220;init&#8221;</a>.</p>
 </div>
@@ -40747,7 +40796,7 @@ import /init.${ro.zygote}.rc</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="benchmark-this-repo"><a class="anchor" href="#benchmark-this-repo"></a><a class="link" href="#benchmark-this-repo">29. Benchmark this repo</a></h2>
+<h2 id="benchmark-this-repo"><a class="anchor" href="#benchmark-this-repo"></a><a class="link" href="#benchmark-this-repo">30. Benchmark this repo</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>TODO: didn&#8217;t fully port during refactor after 3b0a343647bed577586989fb702b760bd280844a. Reimplementing should not be hard.</p>
@@ -40776,7 +40825,7 @@ cd -
 </div>
 </div>
 <div class="sect2">
-<h3 id="continuous-integration"><a class="anchor" href="#continuous-integration"></a><a class="link" href="#continuous-integration">29.1. Continuous integration</a></h3>
+<h3 id="continuous-integration"><a class="anchor" href="#continuous-integration"></a><a class="link" href="#continuous-integration">30.1. Continuous integration</a></h3>
 <div class="paragraph">
 <p>We have explored a few Continuous integration solutions.</p>
 </div>
@@ -40784,13 +40833,13 @@ cd -
 <p>We haven&#8217;t set up any of them yet.</p>
 </div>
 <div class="sect3">
-<h4 id="travis"><a class="anchor" href="#travis"></a><a class="link" href="#travis">29.1.1. Travis</a></h4>
+<h4 id="travis"><a class="anchor" href="#travis"></a><a class="link" href="#travis">30.1.1. Travis</a></h4>
 <div class="paragraph">
 <p>We tried to automate it on Travis with <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/.travis.yml">.travis.yml</a> but it hits the current 50 minute job timeout: <a href="https://travis-ci.org/cirosantilli/linux-kernel-module-cheat/builds/296454523" class="bare">https://travis-ci.org/cirosantilli/linux-kernel-module-cheat/builds/296454523</a> And I bet it would likely hit a disk maxout either way if it went on.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="circleci"><a class="anchor" href="#circleci"></a><a class="link" href="#circleci">29.1.2. CircleCI</a></h4>
+<h4 id="circleci"><a class="anchor" href="#circleci"></a><a class="link" href="#circleci">30.1.2. CircleCI</a></h4>
 <div class="paragraph">
 <p>This setup successfully built gem5 on every commit: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/.circleci/config.yml">.circleci/config.yml</a></p>
 </div>
@@ -40819,9 +40868,9 @@ cd -
 </div>
 </div>
 <div class="sect2">
-<h3 id="benchmark-this-repo-benchmarks"><a class="anchor" href="#benchmark-this-repo-benchmarks"></a><a class="link" href="#benchmark-this-repo-benchmarks">29.2. Benchmark this repo benchmarks</a></h3>
+<h3 id="benchmark-this-repo-benchmarks"><a class="anchor" href="#benchmark-this-repo-benchmarks"></a><a class="link" href="#benchmark-this-repo-benchmarks">30.2. Benchmark this repo benchmarks</a></h3>
 <div class="sect3">
-<h4 id="benchmark-linux-kernel-boot"><a class="anchor" href="#benchmark-linux-kernel-boot"></a><a class="link" href="#benchmark-linux-kernel-boot">29.2.1. Benchmark Linux kernel boot</a></h4>
+<h4 id="benchmark-linux-kernel-boot"><a class="anchor" href="#benchmark-linux-kernel-boot"></a><a class="link" href="#benchmark-linux-kernel-boot">30.2.1. Benchmark Linux kernel boot</a></h4>
 <div class="paragraph">
 <p>Run all kernel boot benchmarks for one arch:</p>
 </div>
@@ -40930,7 +40979,7 @@ instructions 124346081</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-arm-hpi-boot-takes-much-longer-than-aarch64"><a class="anchor" href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64"></a><a class="link" href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64">29.2.1.1. gem5 arm HPI boot takes much longer than aarch64</a></h5>
+<h5 id="gem5-arm-hpi-boot-takes-much-longer-than-aarch64"><a class="anchor" href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64"></a><a class="link" href="#gem5-arm-hpi-boot-takes-much-longer-than-aarch64">30.2.1.1. gem5 arm HPI boot takes much longer than aarch64</a></h5>
 <div class="paragraph">
 <p>TODO: the boot takes 33 minutes to finish at 62f6870e4e0b384c4bd2d514116247e81b241251:</p>
 </div>
@@ -40956,7 +41005,7 @@ instructions 124346081</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-x86_64-derivo3cpu-boot-panics"><a class="anchor" href="#gem5-x86_64-derivo3cpu-boot-panics"></a><a class="link" href="#gem5-x86_64-derivo3cpu-boot-panics">29.2.1.2. gem5 x86_64 DerivO3CPU boot panics</a></h5>
+<h5 id="gem5-x86_64-derivo3cpu-boot-panics"><a class="anchor" href="#gem5-x86_64-derivo3cpu-boot-panics"></a><a class="link" href="#gem5-x86_64-derivo3cpu-boot-panics">30.2.1.2. gem5 x86_64 DerivO3CPU boot panics</a></h5>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli-work/gem5-issues/issues/2" class="bare">https://github.com/cirosantilli-work/gem5-issues/issues/2</a></p>
 </div>
@@ -40968,7 +41017,7 @@ instructions 124346081</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="benchmark-emulators-on-userland-executables"><a class="anchor" href="#benchmark-emulators-on-userland-executables"></a><a class="link" href="#benchmark-emulators-on-userland-executables">29.2.2. Benchmark emulators on userland executables</a></h4>
+<h4 id="benchmark-emulators-on-userland-executables"><a class="anchor" href="#benchmark-emulators-on-userland-executables"></a><a class="link" href="#benchmark-emulators-on-userland-executables">30.2.2. Benchmark emulators on userland executables</a></h4>
 <div class="paragraph">
 <p>Let&#8217;s see how fast our simulators are running some well known or easy to understand userland benchmarks!</p>
 </div>
@@ -41287,7 +41336,7 @@ instructions 124346081</pre>
 <p>so ~ 110 million instructions / 100 seconds makes ~ 1 MIPS (million instructions per second).</p>
 </div>
 <div class="paragraph">
-<p>This experiment also suggests that each loop is about 11 instructions long (110M instructions / 10M loops), which we confirm at <a href="#c-busy-loop">Section 31.2, &#8220;C busy loop&#8221;</a>, bingo!</p>
+<p>This experiment also suggests that each loop is about 11 instructions long (110M instructions / 10M loops), which we confirm at <a href="#c-busy-loop">Section 32.2, &#8220;C busy loop&#8221;</a>, bingo!</p>
 </div>
 <div class="paragraph">
 <p>Then for QEMU, we experimentally turn the number of loops up to 10^10 (<code>100000 100000</code>), which contains an expected 11 * 10^10 instructions, and the runtime is 00:01:08, so we have 1.1 * 10^11 instructions / 68 seconds ~ 1.6 * 10^9 instructions per second, or roughly 2000 MIPS!</p>
@@ -41296,7 +41345,7 @@ instructions 124346081</pre>
 <p>We can then repeat the experiment for other gem5 CPUs to see how they compare.</p>
 </div>
 <div class="sect4">
-<h5 id="user-mode-vs-full-system-benchmark"><a class="anchor" href="#user-mode-vs-full-system-benchmark"></a><a class="link" href="#user-mode-vs-full-system-benchmark">29.2.2.1. User mode vs full system benchmark</a></h5>
+<h5 id="user-mode-vs-full-system-benchmark"><a class="anchor" href="#user-mode-vs-full-system-benchmark"></a><a class="link" href="#user-mode-vs-full-system-benchmark">30.2.2.1. User mode vs full system benchmark</a></h5>
 <div class="paragraph">
 <p>Let&#8217;s see if user mode runs considerably faster than full system or not, ignoring the kernel boot.</p>
 </div>
@@ -41304,7 +41353,7 @@ instructions 124346081</pre>
 <p>First we build <a href="#dhrystone">Dhrystone</a> manually statically since dynamic linking is broken in gem5 as explained at: <a href="#gem5-syscall-emulation-mode">Section 10.7, &#8220;gem5 syscall emulation mode&#8221;</a>.</p>
 </div>
 <div class="paragraph">
-<p>TODO: move this section to our new custom dhrystone setup: <a href="#dhrystone">Section 21.8.2.1, &#8220;Dhrystone&#8221;</a>.</p>
+<p>TODO: move this section to our new custom dhrystone setup: <a href="#dhrystone">Section 22.8.2.1, &#8220;Dhrystone&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>gem5 user mode:</p>
@@ -41383,7 +41432,7 @@ time \
 </div>
 </div>
 <div class="sect3">
-<h4 id="benchmark-builds"><a class="anchor" href="#benchmark-builds"></a><a class="link" href="#benchmark-builds">29.2.3. Benchmark builds</a></h4>
+<h4 id="benchmark-builds"><a class="anchor" href="#benchmark-builds"></a><a class="link" href="#benchmark-builds">30.2.3. Benchmark builds</a></h4>
 <div class="paragraph">
 <p>The build times are calculated after doing <code>./configure</code> and <a href="https://buildroot.org/downloads/manual/manual.html#_offline_builds"><code>make source</code></a>, which downloads the sources, and basically benchmarks the <a href="#benchmark-internets">Internet</a>.</p>
 </div>
@@ -41408,7 +41457,7 @@ cat ../linux-kernel-module-cheat-regression/*/build-time.log</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="find-which-buildroot-packages-are-making-the-build-slow-and-big"><a class="anchor" href="#find-which-buildroot-packages-are-making-the-build-slow-and-big"></a><a class="link" href="#find-which-buildroot-packages-are-making-the-build-slow-and-big">29.2.3.1. Find which Buildroot packages are making the build slow and big</a></h5>
+<h5 id="find-which-buildroot-packages-are-making-the-build-slow-and-big"><a class="anchor" href="#find-which-buildroot-packages-are-making-the-build-slow-and-big"></a><a class="link" href="#find-which-buildroot-packages-are-making-the-build-slow-and-big">30.2.3.1. Find which Buildroot packages are making the build slow and big</a></h5>
 <div class="literalblock">
 <div class="content">
 <pre>./build-buildroot -- graph-build graph-size graph-depends
@@ -41419,14 +41468,14 @@ xdg-open graph-size.pdf</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="prebuilt-toolchain"><a class="anchor" href="#prebuilt-toolchain"></a><a class="link" href="#prebuilt-toolchain">29.2.3.1.1. Buildroot use prebuilt host toolchain</a></h6>
+<h6 id="prebuilt-toolchain"><a class="anchor" href="#prebuilt-toolchain"></a><a class="link" href="#prebuilt-toolchain">30.2.3.1.1. Buildroot use prebuilt host toolchain</a></h6>
 <div class="paragraph">
 <p>The biggest build time hog is always GCC, and it does not look like we can use a precompiled one: <a href="https://stackoverflow.com/questions/10833672/buildroot-environment-with-host-toolchain" class="bare">https://stackoverflow.com/questions/10833672/buildroot-environment-with-host-toolchain</a></p>
 </div>
 </div>
 </div>
 <div class="sect4">
-<h5 id="benchmark-buildroot-build-baseline"><a class="anchor" href="#benchmark-buildroot-build-baseline"></a><a class="link" href="#benchmark-buildroot-build-baseline">29.2.3.2. Benchmark Buildroot build baseline</a></h5>
+<h5 id="benchmark-buildroot-build-baseline"><a class="anchor" href="#benchmark-buildroot-build-baseline"></a><a class="link" href="#benchmark-buildroot-build-baseline">30.2.3.2. Benchmark Buildroot build baseline</a></h5>
 <div class="paragraph">
 <p>This is the minimal build we could expect to get away with.</p>
 </div>
@@ -41494,7 +41543,7 @@ xdg-open graph-size.pdf</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="benchmark-gem5-build"><a class="anchor" href="#benchmark-gem5-build"></a><a class="link" href="#benchmark-gem5-build">29.2.3.3. Benchmark gem5 build</a></h5>
+<h5 id="benchmark-gem5-build"><a class="anchor" href="#benchmark-gem5-build"></a><a class="link" href="#benchmark-gem5-build">30.2.3.3. Benchmark gem5 build</a></h5>
 <div class="paragraph">
 <p>How long it takes to build gem5 itself.</p>
 </div>
@@ -41526,7 +41575,7 @@ tail -n+1 ../linux-kernel-module-cheat-regression/*/gem5-bench-build-*.txt</pre>
 <p>A profiling of the build has been done at: <a href="https://gem5.atlassian.net/browse/GEM5-277" class="bare">https://gem5.atlassian.net/browse/GEM5-277</a> Analysis there showed that d7d9bc240615625141cd6feddbadd392457e49eb (2018-06-17) is also composed of 50% pybind11 and with no obvious time sinks.</p>
 </div>
 <div class="sect5">
-<h6 id="pybind11-accounts-for-50-of-gem5-build-time"><a class="anchor" href="#pybind11-accounts-for-50-of-gem5-build-time"></a><a class="link" href="#pybind11-accounts-for-50-of-gem5-build-time">29.2.3.3.1. pybind11 accounts for 50% of gem5 build time</a></h6>
+<h6 id="pybind11-accounts-for-50-of-gem5-build-time"><a class="anchor" href="#pybind11-accounts-for-50-of-gem5-build-time"></a><a class="link" href="#pybind11-accounts-for-50-of-gem5-build-time">30.2.3.3.1. pybind11 accounts for 50% of gem5 build time</a></h6>
 <div class="paragraph">
 <p><a href="https://gem5.atlassian.net/browse/GEM5-366" class="bare">https://gem5.atlassian.net/browse/GEM5-366</a></p>
 </div>
@@ -41538,7 +41587,7 @@ tail -n+1 ../linux-kernel-module-cheat-regression/*/gem5-bench-build-*.txt</pre>
 </div>
 </div>
 <div class="sect5">
-<h6 id="benchmark-gem5-single-file-change-rebuild-time"><a class="anchor" href="#benchmark-gem5-single-file-change-rebuild-time"></a><a class="link" href="#benchmark-gem5-single-file-change-rebuild-time">29.2.3.3.2. Benchmark gem5 single file change rebuild time</a></h6>
+<h6 id="benchmark-gem5-single-file-change-rebuild-time"><a class="anchor" href="#benchmark-gem5-single-file-change-rebuild-time"></a><a class="link" href="#benchmark-gem5-single-file-change-rebuild-time">30.2.3.3.2. Benchmark gem5 single file change rebuild time</a></h6>
 <div class="paragraph">
 <p>This is the critical development parameter, and is dominated by the link time of huge binaries.</p>
 </div>
@@ -41615,9 +41664,9 @@ tail -n+1 ../linux-kernel-module-cheat-regression/*/gem5-bench-build-*.txt</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="benchmark-machines"><a class="anchor" href="#benchmark-machines"></a><a class="link" href="#benchmark-machines">29.3. Benchmark machines</a></h3>
+<h3 id="benchmark-machines"><a class="anchor" href="#benchmark-machines"></a><a class="link" href="#benchmark-machines">30.3. Benchmark machines</a></h3>
 <div class="sect3">
-<h4 id="p51"><a class="anchor" href="#p51"></a><a class="link" href="#p51">29.3.1. 2017 Lenovo ThinkPad P51</a></h4>
+<h4 id="p51"><a class="anchor" href="#p51"></a><a class="link" href="#p51">30.3.1. 2017 Lenovo ThinkPad P51</a></h4>
 <div class="paragraph">
 <p>Serial number: TYPE 20HH-CTO1WW S/N PF-0V5V5N 17/11</p>
 </div>
@@ -41723,7 +41772,7 @@ tail -n+1 ../linux-kernel-module-cheat-regression/*/gem5-bench-build-*.txt</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="p51-benchmarks"><a class="anchor" href="#p51-benchmarks"></a><a class="link" href="#p51-benchmarks">29.3.1.1. P51 benchmarks</a></h5>
+<h5 id="p51-benchmarks"><a class="anchor" href="#p51-benchmarks"></a><a class="link" href="#p51-benchmarks">30.3.1.1. P51 benchmarks</a></h5>
 <div class="paragraph">
 <p><a href="#dhrystone">Dhrystone</a> on Ubuntu 20.04 results at <a href="#dhrystone">Dhrystone</a>.</p>
 </div>
@@ -41731,7 +41780,7 @@ tail -n+1 ../linux-kernel-module-cheat-regression/*/gem5-bench-build-*.txt</pre>
 <p><a href="#stream-benchmark">STREAM benchmark</a> on Ubuntu 20.04 results at <a href="#stream-benchmark">STREAM benchmark</a>.</p>
 </div>
 <div class="sect5">
-<h6 id="p51-coremark-pro"><a class="anchor" href="#p51-coremark-pro"></a><a class="link" href="#p51-coremark-pro">29.3.1.1.1. P51 CoreMark-Pro</a></h6>
+<h6 id="p51-coremark-pro"><a class="anchor" href="#p51-coremark-pro"></a><a class="link" href="#p51-coremark-pro">30.3.1.1.1. P51 CoreMark-Pro</a></h6>
 <div class="paragraph">
 <p><a href="#coremark">CoreMark-Pro</a> d5b4f2ba7ba31e37a5aa93423831e7d5eb933868 on Ubuntu 20.04 with <code>XCMD="-c$(nproc)"</code>:</p>
 </div>
@@ -41760,7 +41809,7 @@ CoreMark-PRO                                      25016.00    6079.70       4.11
 </div>
 </div>
 <div class="sect4">
-<h5 id="p51-maintenance-history"><a class="anchor" href="#p51-maintenance-history"></a><a class="link" href="#p51-maintenance-history">29.3.1.2. P51 maintenance history</a></h5>
+<h5 id="p51-maintenance-history"><a class="anchor" href="#p51-maintenance-history"></a><a class="link" href="#p51-maintenance-history">30.3.1.2. P51 maintenance history</a></h5>
 <div class="paragraph">
 <p>Bought: 2017 for approximately 2400 pounds.</p>
 </div>
@@ -41818,7 +41867,7 @@ CoreMark-PRO                                      25016.00    6079.70       4.11
 </div>
 </div>
 <div class="sect4">
-<h5 id="intel-core-i7-7820hq-cpu"><a class="anchor" href="#intel-core-i7-7820hq-cpu"></a><a class="link" href="#intel-core-i7-7820hq-cpu">29.3.1.3. Intel Core i7-7820HQ CPU</a></h5>
+<h5 id="intel-core-i7-7820hq-cpu"><a class="anchor" href="#intel-core-i7-7820hq-cpu"></a><a class="link" href="#intel-core-i7-7820hq-cpu">30.3.1.3. Intel Core i7-7820HQ CPU</a></h5>
 <div class="paragraph">
 <p><a href="https://ark.intel.com/products/97496/Intel-Core-i7-7820HQ-Processor-8M-Cache-up-to-3-90-GHz-" class="bare">https://ark.intel.com/products/97496/Intel-Core-i7-7820HQ-Processor-8M-Cache-up-to-3-90-GHz-</a> (<a href="http://web.archive.org/web/20181224203737/https://ark.intel.com/products/97496/Intel-Core-i7-7820HQ-Processor-8M-Cache-up-to-3-90-GHz-">archive</a>).</p>
 </div>
@@ -41900,7 +41949,7 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="samsung-m471a2k43bb1-crc-16gb-dram"><a class="anchor" href="#samsung-m471a2k43bb1-crc-16gb-dram"></a><a class="link" href="#samsung-m471a2k43bb1-crc-16gb-dram">29.3.1.4. Samsung M471A2K43BB1-CRC 16GB DRAM</a></h5>
+<h5 id="samsung-m471a2k43bb1-crc-16gb-dram"><a class="anchor" href="#samsung-m471a2k43bb1-crc-16gb-dram"></a><a class="link" href="#samsung-m471a2k43bb1-crc-16gb-dram">30.3.1.4. Samsung M471A2K43BB1-CRC 16GB DRAM</a></h5>
 <div class="paragraph">
 <p>Nominal speed: 2400 Mbps</p>
 </div>
@@ -41915,7 +41964,7 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="samsung-mzvlb512hajq-000l7-512gb-ssd"><a class="anchor" href="#samsung-mzvlb512hajq-000l7-512gb-ssd"></a><a class="link" href="#samsung-mzvlb512hajq-000l7-512gb-ssd">29.3.1.5. Samsung MZVLB512HAJQ-000L7 512GB SSD</a></h5>
+<h5 id="samsung-mzvlb512hajq-000l7-512gb-ssd"><a class="anchor" href="#samsung-mzvlb512hajq-000l7-512gb-ssd"></a><a class="link" href="#samsung-mzvlb512hajq-000l7-512gb-ssd">30.3.1.5. Samsung MZVLB512HAJQ-000L7 512GB SSD</a></h5>
 <div class="paragraph">
 <p>PCIe TLC OPAL2.</p>
 </div>
@@ -41940,7 +41989,7 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="seagate-st1000lm035-1rk1-1tb-hard-disk"><a class="anchor" href="#seagate-st1000lm035-1rk1-1tb-hard-disk"></a><a class="link" href="#seagate-st1000lm035-1rk1-1tb-hard-disk">29.3.1.6. Seagate ST1000LM035-1RK1 1TB hard disk</a></h5>
+<h5 id="seagate-st1000lm035-1rk1-1tb-hard-disk"><a class="anchor" href="#seagate-st1000lm035-1rk1-1tb-hard-disk"></a><a class="link" href="#seagate-st1000lm035-1rk1-1tb-hard-disk">30.3.1.6. Seagate ST1000LM035-1RK1 1TB hard disk</a></h5>
 <div class="paragraph">
 <p>1TB.</p>
 </div>
@@ -41964,15 +42013,15 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="nvidia-quadro-m1200-4gb-gddr5-gpu"><a class="anchor" href="#nvidia-quadro-m1200-4gb-gddr5-gpu"></a><a class="link" href="#nvidia-quadro-m1200-4gb-gddr5-gpu">29.3.1.7. NVIDIA Quadro M1200 4GB GDDR5 GPU</a></h5>
+<h5 id="nvidia-quadro-m1200-4gb-gddr5-gpu"><a class="anchor" href="#nvidia-quadro-m1200-4gb-gddr5-gpu"></a><a class="link" href="#nvidia-quadro-m1200-4gb-gddr5-gpu">30.3.1.7. NVIDIA Quadro M1200 4GB GDDR5 GPU</a></h5>
 
 </div>
 </div>
 </div>
 <div class="sect2">
-<h3 id="benchmark-internets"><a class="anchor" href="#benchmark-internets"></a><a class="link" href="#benchmark-internets">29.4. Benchmark Internets</a></h3>
+<h3 id="benchmark-internets"><a class="anchor" href="#benchmark-internets"></a><a class="link" href="#benchmark-internets">30.4. Benchmark Internets</a></h3>
 <div class="sect3">
-<h4 id="38mbps-internet"><a class="anchor" href="#38mbps-internet"></a><a class="link" href="#38mbps-internet">29.4.1. 38Mbps internet</a></h4>
+<h4 id="38mbps-internet"><a class="anchor" href="#38mbps-internet"></a><a class="link" href="#38mbps-internet">30.4.1. 38Mbps internet</a></h4>
 <div class="paragraph">
 <p>2c12b21b304178a81c9912817b782ead0286d282:</p>
 </div>
@@ -41992,7 +42041,7 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="benchmark-this-repo-bibliography"><a class="anchor" href="#benchmark-this-repo-bibliography"></a><a class="link" href="#benchmark-this-repo-bibliography">29.5. Benchmark this repo bibliography</a></h3>
+<h3 id="benchmark-this-repo-bibliography"><a class="anchor" href="#benchmark-this-repo-bibliography"></a><a class="link" href="#benchmark-this-repo-bibliography">30.5. Benchmark this repo bibliography</a></h3>
 <div class="paragraph">
 <p>gem5:</p>
 </div>
@@ -42020,10 +42069,10 @@ LEVEL4_CACHE_LINESIZE              0</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="rtos"><a class="anchor" href="#rtos"></a><a class="link" href="#rtos">30. RTOS</a></h2>
+<h2 id="rtos"><a class="anchor" href="#rtos"></a><a class="link" href="#rtos">31. RTOS</a></h2>
 <div class="sectionbody">
 <div class="sect2">
-<h3 id="zephyr"><a class="anchor" href="#zephyr"></a><a class="link" href="#zephyr">30.1. Zephyr</a></h3>
+<h3 id="zephyr"><a class="anchor" href="#zephyr"></a><a class="link" href="#zephyr">31.1. Zephyr</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Zephyr_(operating_system)" class="bare">https://en.wikipedia.org/wiki/Zephyr_(operating_system)</a></p>
 </div>
@@ -42066,7 +42115,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="arm-mbed"><a class="anchor" href="#arm-mbed"></a><a class="link" href="#arm-mbed">30.2. ARM Mbed</a></h3>
+<h3 id="arm-mbed"><a class="anchor" href="#arm-mbed"></a><a class="link" href="#arm-mbed">31.2. ARM Mbed</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Mbed" class="bare">https://en.wikipedia.org/wiki/Mbed</a></p>
 </div>
@@ -42077,13 +42126,13 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="compilers"><a class="anchor" href="#compilers"></a><a class="link" href="#compilers">31. Compilers</a></h2>
+<h2 id="compilers"><a class="anchor" href="#compilers"></a><a class="link" href="#compilers">32. Compilers</a></h2>
 <div class="sectionbody">
 <div class="paragraph">
 <p>Argh, compilers are boring, let&#8217;s learn a bit about them.</p>
 </div>
 <div class="sect2">
-<h3 id="prevent-statement-reordering"><a class="anchor" href="#prevent-statement-reordering"></a><a class="link" href="#prevent-statement-reordering">31.1. Prevent statement reordering</a></h3>
+<h3 id="prevent-statement-reordering"><a class="anchor" href="#prevent-statement-reordering"></a><a class="link" href="#prevent-statement-reordering">32.1. Prevent statement reordering</a></h3>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/gcc/prevent_reorder.cpp">userland/gcc/prevent_reorder.cpp</a></p>
 </div>
@@ -42095,7 +42144,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="c-busy-loop"><a class="anchor" href="#c-busy-loop"></a><a class="link" href="#c-busy-loop">31.2. C busy loop</a></h3>
+<h3 id="c-busy-loop"><a class="anchor" href="#c-busy-loop"></a><a class="link" href="#c-busy-loop">32.2. C busy loop</a></h3>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/userland/gcc/busy_loop.c">userland/gcc/busy_loop.c</a></p>
 </div>
@@ -42179,10 +42228,10 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect1">
-<h2 id="computer-architecture"><a class="anchor" href="#computer-architecture"></a><a class="link" href="#computer-architecture">32. Computer architecture</a></h2>
+<h2 id="computer-architecture"><a class="anchor" href="#computer-architecture"></a><a class="link" href="#computer-architecture">33. Computer architecture</a></h2>
 <div class="sectionbody">
 <div class="sect2">
-<h3 id="instruction-pipelining"><a class="anchor" href="#instruction-pipelining"></a><a class="link" href="#instruction-pipelining">32.1. Instruction pipelining</a></h3>
+<h3 id="instruction-pipelining"><a class="anchor" href="#instruction-pipelining"></a><a class="link" href="#instruction-pipelining">33.1. Instruction pipelining</a></h3>
 <div class="paragraph">
 <p>In gem5, can be seen on:</p>
 </div>
@@ -42197,7 +42246,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="classic-risc-pipeline"><a class="anchor" href="#classic-risc-pipeline"></a><a class="link" href="#classic-risc-pipeline">32.1.1. Classic RISC pipeline</a></h4>
+<h4 id="classic-risc-pipeline"><a class="anchor" href="#classic-risc-pipeline"></a><a class="link" href="#classic-risc-pipeline">33.1.1. Classic RISC pipeline</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Classic_RISC_pipeline" class="bare">https://en.wikipedia.org/wiki/Classic_RISC_pipeline</a></p>
 </div>
@@ -42207,7 +42256,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="superscalar-processor"><a class="anchor" href="#superscalar-processor"></a><a class="link" href="#superscalar-processor">32.2. Superscalar processor</a></h3>
+<h3 id="superscalar-processor"><a class="anchor" href="#superscalar-processor"></a><a class="link" href="#superscalar-processor">33.2. Superscalar processor</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Superscalar_processor" class="bare">https://en.wikipedia.org/wiki/Superscalar_processor</a></p>
 </div>
@@ -42234,7 +42283,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="execution-unit"><a class="anchor" href="#execution-unit"></a><a class="link" href="#execution-unit">32.2.1. Execution unit</a></h4>
+<h4 id="execution-unit"><a class="anchor" href="#execution-unit"></a><a class="link" href="#execution-unit">33.2.1. Execution unit</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Execution_unit" class="bare">https://en.wikipedia.org/wiki/Execution_unit</a></p>
 </div>
@@ -42247,7 +42296,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="out-of-order-execution"><a class="anchor" href="#out-of-order-execution"></a><a class="link" href="#out-of-order-execution">32.3. Out-of-order execution</a></h3>
+<h3 id="out-of-order-execution"><a class="anchor" href="#out-of-order-execution"></a><a class="link" href="#out-of-order-execution">33.3. Out-of-order execution</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Out-of-order_execution" class="bare">https://en.wikipedia.org/wiki/Out-of-order_execution</a></p>
 </div>
@@ -42264,7 +42313,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 <p>As mentioned at: <a href="https://stackoverflow.com/questions/10074831/what-is-general-difference-between-superscalar-and-ooo-execution" class="bare">https://stackoverflow.com/questions/10074831/what-is-general-difference-between-superscalar-and-ooo-execution</a> it is in theory possible for an out-of-order CPU to not be a <a href="#superscalar-processor">Superscalar processor</a>, but the combination is so natural (since you can look ahead, you might as well run it!) that a non-superscalar out-of-order CPU is not super common.</p>
 </div>
 <div class="sect3">
-<h4 id="speculative-execution"><a class="anchor" href="#speculative-execution"></a><a class="link" href="#speculative-execution">32.3.1. Speculative execution</a></h4>
+<h4 id="speculative-execution"><a class="anchor" href="#speculative-execution"></a><a class="link" href="#speculative-execution">33.3.1. Speculative execution</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Speculative_execution" class="bare">https://en.wikipedia.org/wiki/Speculative_execution</a></p>
 </div>
@@ -42282,7 +42331,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </ul>
 </div>
 <div class="sect4">
-<h5 id="branch-predictor"><a class="anchor" href="#branch-predictor"></a><a class="link" href="#branch-predictor">32.3.1.1. Branch predictor</a></h5>
+<h5 id="branch-predictor"><a class="anchor" href="#branch-predictor"></a><a class="link" href="#branch-predictor">33.3.1.1. Branch predictor</a></h5>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Branch_predictor" class="bare">https://en.wikipedia.org/wiki/Branch_predictor</a></p>
 </div>
@@ -42295,20 +42344,20 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="re-order-buffer"><a class="anchor" href="#re-order-buffer"></a><a class="link" href="#re-order-buffer">32.3.2. Re-order buffer</a></h4>
+<h4 id="re-order-buffer"><a class="anchor" href="#re-order-buffer"></a><a class="link" href="#re-order-buffer">33.3.2. Re-order buffer</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Re-order_buffer" class="bare">https://en.wikipedia.org/wiki/Re-order_buffer</a></p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="register-renaming"><a class="anchor" href="#register-renaming"></a><a class="link" href="#register-renaming">32.3.3. Register renaming</a></h4>
+<h4 id="register-renaming"><a class="anchor" href="#register-renaming"></a><a class="link" href="#register-renaming">33.3.3. Register renaming</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Register_renaming" class="bare">https://en.wikipedia.org/wiki/Register_renaming</a></p>
 </div>
 </div>
 </div>
 <div class="sect2">
-<h3 id="instruction-level-parallelism"><a class="anchor" href="#instruction-level-parallelism"></a><a class="link" href="#instruction-level-parallelism">32.4. Instruction level parallelism</a></h3>
+<h3 id="instruction-level-parallelism"><a class="anchor" href="#instruction-level-parallelism"></a><a class="link" href="#instruction-level-parallelism">33.4. Instruction level parallelism</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Instruction-level_parallelism" class="bare">https://en.wikipedia.org/wiki/Instruction-level_parallelism</a></p>
 </div>
@@ -42327,7 +42376,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="hardware-threads"><a class="anchor" href="#hardware-threads"></a><a class="link" href="#hardware-threads">32.5. Hardware threads</a></h3>
+<h3 id="hardware-threads"><a class="anchor" href="#hardware-threads"></a><a class="link" href="#hardware-threads">33.5. Hardware threads</a></h3>
 <div class="paragraph">
 <p>Intel name: "Hyperthreading"</p>
 </div>
@@ -42377,7 +42426,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="cache-coherence"><a class="anchor" href="#cache-coherence"></a><a class="link" href="#cache-coherence">32.6. Cache coherence</a></h3>
+<h3 id="cache-coherence"><a class="anchor" href="#cache-coherence"></a><a class="link" href="#cache-coherence">33.6. Cache coherence</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Cache_coherence" class="bare">https://en.wikipedia.org/wiki/Cache_coherence</a></p>
 </div>
@@ -42419,7 +42468,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 <p>Even if caches are coherent, this is still not enough to avoid data race conditions, because coherence does not enforce atomicity of read-modify-write sequences. This is for example shown at: <a href="#detailed-gem5-analysis-of-how-data-races-happen">Detailed gem5 analysis of how data races happen</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="memory-consistency"><a class="anchor" href="#memory-consistency"></a><a class="link" href="#memory-consistency">32.6.1. Memory consistency</a></h4>
+<h4 id="memory-consistency"><a class="anchor" href="#memory-consistency"></a><a class="link" href="#memory-consistency">33.6.1. Memory consistency</a></h4>
 <div class="paragraph">
 <p>According to <a href="http://www.inf.ed.ac.uk/teaching/courses/pa/Notes/lecture07-sc.pdf" class="bare">http://www.inf.ed.ac.uk/teaching/courses/pa/Notes/lecture07-sc.pdf</a> "memory consistency" is about ordering requirements of different memory addresses.</p>
 </div>
@@ -42427,14 +42476,14 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 <p>This is represented explicitly in C++, for example by <a href="#cpp-memory-order">C++ std::memory_order</a>.</p>
 </div>
 <div class="sect4">
-<h5 id="sequential-consistency"><a class="anchor" href="#sequential-consistency"></a><a class="link" href="#sequential-consistency">32.6.1.1. Sequential Consistency</a></h5>
+<h5 id="sequential-consistency"><a class="anchor" href="#sequential-consistency"></a><a class="link" href="#sequential-consistency">33.6.1.1. Sequential Consistency</a></h5>
 <div class="paragraph">
 <p>According to <a href="http://www.inf.ed.ac.uk/teaching/courses/pa/Notes/lecture07-sc.pdf" class="bare">http://www.inf.ed.ac.uk/teaching/courses/pa/Notes/lecture07-sc.pdf</a>, the strongest possible consistency, everything nicely ordered as you&#8217;d expect.</p>
 </div>
 </div>
 </div>
 <div class="sect3">
-<h4 id="can-caches-snoop-data-from-other-caches"><a class="anchor" href="#can-caches-snoop-data-from-other-caches"></a><a class="link" href="#can-caches-snoop-data-from-other-caches">32.6.2. Can caches snoop data from other caches?</a></h4>
+<h4 id="can-caches-snoop-data-from-other-caches"><a class="anchor" href="#can-caches-snoop-data-from-other-caches"></a><a class="link" href="#can-caches-snoop-data-from-other-caches">33.6.2. Can caches snoop data from other caches?</a></h4>
 <div class="paragraph">
 <p>Either they can snoop only control, or both control and data can be snooped.</p>
 </div>
@@ -42449,7 +42498,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="vi-cache-coherence-protocol"><a class="anchor" href="#vi-cache-coherence-protocol"></a><a class="link" href="#vi-cache-coherence-protocol">32.6.3. VI cache coherence protocol</a></h4>
+<h4 id="vi-cache-coherence-protocol"><a class="anchor" href="#vi-cache-coherence-protocol"></a><a class="link" href="#vi-cache-coherence-protocol">33.6.3. VI cache coherence protocol</a></h4>
 <div class="paragraph">
 <p>Mentioned at:</p>
 </div>
@@ -42696,7 +42745,7 @@ west build -b qemu_aarch64 samples/hello_world</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="msi-cache-coherence-protocol"><a class="anchor" href="#msi-cache-coherence-protocol"></a><a class="link" href="#msi-cache-coherence-protocol">32.6.4. MSI cache coherence protocol</a></h4>
+<h4 id="msi-cache-coherence-protocol"><a class="anchor" href="#msi-cache-coherence-protocol"></a><a class="link" href="#msi-cache-coherence-protocol">33.6.4. MSI cache coherence protocol</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/MSI_protocol" class="bare">https://en.wikipedia.org/wiki/MSI_protocol</a></p>
 </div>
@@ -43008,7 +43057,7 @@ CACHE2 S nyy
 <p>TODO gem5 concrete example.</p>
 </div>
 <div class="sect4">
-<h5 id="msi-cache-coherence-protocol-with-transient-states"><a class="anchor" href="#msi-cache-coherence-protocol-with-transient-states"></a><a class="link" href="#msi-cache-coherence-protocol-with-transient-states">32.6.4.1. MSI cache coherence protocol with transient states</a></h5>
+<h5 id="msi-cache-coherence-protocol-with-transient-states"><a class="anchor" href="#msi-cache-coherence-protocol-with-transient-states"></a><a class="link" href="#msi-cache-coherence-protocol-with-transient-states">33.6.4.1. MSI cache coherence protocol with transient states</a></h5>
 <div class="paragraph">
 <p>TODO understand well why those are needed.</p>
 </div>
@@ -43028,7 +43077,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect3">
-<h4 id="mesi-cache-coherence-protocol"><a class="anchor" href="#mesi-cache-coherence-protocol"></a><a class="link" href="#mesi-cache-coherence-protocol">32.6.5. MESI cache coherence protocol</a></h4>
+<h4 id="mesi-cache-coherence-protocol"><a class="anchor" href="#mesi-cache-coherence-protocol"></a><a class="link" href="#mesi-cache-coherence-protocol">33.6.5. MESI cache coherence protocol</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/MESI_protocol" class="bare">https://en.wikipedia.org/wiki/MESI_protocol</a></p>
 </div>
@@ -43088,7 +43137,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect3">
-<h4 id="mosi-cache-coherence-protocol"><a class="anchor" href="#mosi-cache-coherence-protocol"></a><a class="link" href="#mosi-cache-coherence-protocol">32.6.6. MOSI cache coherence protocol</a></h4>
+<h4 id="mosi-cache-coherence-protocol"><a class="anchor" href="#mosi-cache-coherence-protocol"></a><a class="link" href="#mosi-cache-coherence-protocol">33.6.6. MOSI cache coherence protocol</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/MOSI_protocol" class="bare">https://en.wikipedia.org/wiki/MOSI_protocol</a> The critical MSI vs MOSI section was a bit bogus though: <a href="https://en.wikipedia.org/w/index.php?title=MOSI_protocol&amp;oldid=895443023" class="bare">https://en.wikipedia.org/w/index.php?title=MOSI_protocol&amp;oldid=895443023</a> but I edited it :-)</p>
 </div>
@@ -43148,7 +43197,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect3">
-<h4 id="moesi"><a class="anchor" href="#moesi"></a><a class="link" href="#moesi">32.6.7. MOESI cache coherence protocol</a></h4>
+<h4 id="moesi"><a class="anchor" href="#moesi"></a><a class="link" href="#moesi">33.6.7. MOESI cache coherence protocol</a></h4>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/MOESI_protocol" class="bare">https://en.wikipedia.org/wiki/MOESI_protocol</a></p>
 </div>
@@ -43156,10 +43205,10 @@ CACHE2 S nyy
 <p><a href="#mesi-cache-coherence-protocol">MESI cache coherence protocol</a> + <a href="#mosi-cache-coherence-protocol">MOSI cache coherence protocol</a>, not much else to it!</p>
 </div>
 <div class="paragraph">
-<p>In gem5 9fc9c67b4242c03f165951775be5cd0812f2a705, MOESI is the default cache coherency protocol of the <a href="#gem5-ruby-build">classic memory system</a> as shown at <a href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">Section 19.20.4.3.1, &#8220;What is the coherency protocol implemented by the classic cache system in gem5?&#8221;</a>.</p>
+<p>In gem5 9fc9c67b4242c03f165951775be5cd0812f2a705, MOESI is the default cache coherency protocol of the <a href="#gem5-ruby-build">classic memory system</a> as shown at <a href="#what-is-the-coherency-protocol-implemented-by-the-classic-cache-system-in-gem5">Section 19.21.4.3.1, &#8220;What is the coherency protocol implemented by the classic cache system in gem5?&#8221;</a>.</p>
 </div>
 <div class="paragraph">
-<p>A good an simple example showing several MOESI transitions in the classic memory model can be seen at: <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">Section 19.20.4.4, &#8220;gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs&#8221;</a>.</p>
+<p>A good and simple example showing several MOESI transitions in the classic memory model can be seen at: <a href="#gem5-event-queue-atomicsimplecpu-syscall-emulation-freestanding-example-analysis-with-caches-and-multiple-cpus">Section 19.21.4.4, &#8220;gem5 event queue AtomicSimpleCPU syscall emulation freestanding example analysis with caches and multiple CPUs&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>gem5 12c917de54145d2d50260035ba7fa614e25317a3 has several <a href="#gem5-ruby-build">Ruby</a> MOESI models implemented: <code>MOESI_AMD_Base</code>, <code>MOESI_CMP_directory</code>, <code>MOESI_CMP_token</code> and <code>MOESI_hammer</code>.</p>
@@ -43169,10 +43218,10 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect1">
-<h2 id="about-this-repo"><a class="anchor" href="#about-this-repo"></a><a class="link" href="#about-this-repo">33. About this repo</a></h2>
+<h2 id="about-this-repo"><a class="anchor" href="#about-this-repo"></a><a class="link" href="#about-this-repo">34. About this repo</a></h2>
 <div class="sectionbody">
 <div class="sect2">
-<h3 id="supported-hosts"><a class="anchor" href="#supported-hosts"></a><a class="link" href="#supported-hosts">33.1. Supported hosts</a></h3>
+<h3 id="supported-hosts"><a class="anchor" href="#supported-hosts"></a><a class="link" href="#supported-hosts">34.1. Supported hosts</a></h3>
 <div class="paragraph">
 <p>The host requirements depend a lot on which examples you want to run.</p>
 </div>
@@ -43221,9 +43270,9 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect2">
-<h3 id="common-build-issues"><a class="anchor" href="#common-build-issues"></a><a class="link" href="#common-build-issues">33.2. Common build issues</a></h3>
+<h3 id="common-build-issues"><a class="anchor" href="#common-build-issues"></a><a class="link" href="#common-build-issues">34.2. Common build issues</a></h3>
 <div class="sect3">
-<h4 id="put-source-uris-in-sources"><a class="anchor" href="#put-source-uris-in-sources"></a><a class="link" href="#put-source-uris-in-sources">33.2.1. You must put some 'source' URIs in your sources.list</a></h4>
+<h4 id="put-source-uris-in-sources"><a class="anchor" href="#put-source-uris-in-sources"></a><a class="link" href="#put-source-uris-in-sources">34.2.1. You must put some 'source' URIs in your sources.list</a></h4>
 <div class="paragraph">
 <p>If <code>./build --download-dependencies</code> fails with:</p>
 </div>
@@ -43237,7 +43286,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect3">
-<h4 id="build-from-downloaded-source-zip-files"><a class="anchor" href="#build-from-downloaded-source-zip-files"></a><a class="link" href="#build-from-downloaded-source-zip-files">33.2.2. Build from downloaded source zip files</a></h4>
+<h4 id="build-from-downloaded-source-zip-files"><a class="anchor" href="#build-from-downloaded-source-zip-files"></a><a class="link" href="#build-from-downloaded-source-zip-files">34.2.2. Build from downloaded source zip files</a></h4>
 <div class="paragraph">
 <p>It does not work if you just download the <code>.zip</code> with the sources for this repository from GitHub because we use <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/.gitmodules">Git submodules</a>: you must clone this repo instead.</p>
 </div>
@@ -43247,7 +43296,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect2">
-<h3 id="run-command-after-boot"><a class="anchor" href="#run-command-after-boot"></a><a class="link" href="#run-command-after-boot">33.3. Run command after boot</a></h3>
+<h3 id="run-command-after-boot"><a class="anchor" href="#run-command-after-boot"></a><a class="link" href="#run-command-after-boot">34.3. Run command after boot</a></h3>
 <div class="paragraph">
 <p>If you just want to run a command after boot ends without thinking much about it, just use the <code>--eval-after</code> option, e.g.:</p>
 </div>
@@ -43264,7 +43313,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect2">
-<h3 id="default-command-line-arguments"><a class="anchor" href="#default-command-line-arguments"></a><a class="link" href="#default-command-line-arguments">33.4. Default command line arguments</a></h3>
+<h3 id="default-command-line-arguments"><a class="anchor" href="#default-command-line-arguments"></a><a class="link" href="#default-command-line-arguments">34.4. Default command line arguments</a></h3>
 <div class="paragraph">
 <p>It gets annoying to retype <code>--arch aarch64</code> for every single command, or to remember <code>--config</code> setups.</p>
 </div>
@@ -43309,12 +43358,12 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect2">
-<h3 id="documentation"><a class="anchor" href="#documentation"></a><a class="link" href="#documentation">33.5. Documentation</a></h3>
+<h3 id="documentation"><a class="anchor" href="#documentation"></a><a class="link" href="#documentation">34.5. Documentation</a></h3>
 <div class="paragraph">
 <p>To learn how to build the documentation see: <a href="#build-the-documentation">Section 1.10, &#8220;Build the documentation&#8221;</a>.</p>
 </div>
 <div class="sect3">
-<h4 id="documentation-verification"><a class="anchor" href="#documentation-verification"></a><a class="link" href="#documentation-verification">33.5.1. Documentation verification</a></h4>
+<h4 id="documentation-verification"><a class="anchor" href="#documentation-verification"></a><a class="link" href="#documentation-verification">34.5.1. Documentation verification</a></h4>
 <div class="paragraph">
 <p>When running <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/build-doc">build-doc</a>, we do the following checks:</p>
 </div>
@@ -43335,7 +43384,7 @@ CACHE2 S nyy
 <p>The script prints what you have to fix and exits with an error status if there are any errors.</p>
 </div>
 <div class="sect4">
-<h5 id="asciidoctor-extract-link-targets"><a class="anchor" href="#asciidoctor-extract-link-targets"></a><a class="link" href="#asciidoctor-extract-link-targets">33.5.1.1. asciidoctor/extract-link-targets</a></h5>
+<h5 id="asciidoctor-extract-link-targets"><a class="anchor" href="#asciidoctor-extract-link-targets"></a><a class="link" href="#asciidoctor-extract-link-targets">34.5.1.1. asciidoctor/extract-link-targets</a></h5>
 <div class="paragraph">
 <p>Documentation for <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/asciidoctor/extract-link-targets">asciidoctor/extract-link-targets</a></p>
 </div>
@@ -43358,7 +43407,7 @@ CACHE2 S nyy
 </div>
 </div>
 <div class="sect4">
-<h5 id="asciidoctor-extract-header-ids"><a class="anchor" href="#asciidoctor-extract-header-ids"></a><a class="link" href="#asciidoctor-extract-header-ids">33.5.1.2. asciidoctor/extract-header-ids</a></h5>
+<h5 id="asciidoctor-extract-header-ids"><a class="anchor" href="#asciidoctor-extract-header-ids"></a><a class="link" href="#asciidoctor-extract-header-ids">34.5.1.2. asciidoctor/extract-header-ids</a></h5>
 <div class="paragraph">
 <p>Documentation for <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/asciidoctor/extract-header-ids">asciidoctor/extract-header-ids</a></p>
 </div>
@@ -43403,7 +43452,7 @@ explicitly-given</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="asciidoctor-link-target-up-rb"><a class="anchor" href="#asciidoctor-link-target-up-rb"></a><a class="link" href="#asciidoctor-link-target-up-rb">33.6. asciidoctor/link-target-up.rb</a></h3>
+<h3 id="asciidoctor-link-target-up-rb"><a class="anchor" href="#asciidoctor-link-target-up-rb"></a><a class="link" href="#asciidoctor-link-target-up-rb">34.6. asciidoctor/link-target-up.rb</a></h3>
 <div class="paragraph">
 <p>The Asciidoctor extension scripts:</p>
 </div>
@@ -43431,7 +43480,7 @@ explicitly-given</pre>
 </ul>
 </div>
 <div class="sect3">
-<h4 id="github-pages"><a class="anchor" href="#github-pages"></a><a class="link" href="#github-pages">33.6.1. GitHub pages</a></h4>
+<h4 id="github-pages"><a class="anchor" href="#github-pages"></a><a class="link" href="#github-pages">34.6.1. GitHub pages</a></h4>
 <div class="paragraph">
 <p>As mentioned before the TOC, we have to push this README to GitHub pages due to: <a href="https://github.com/isaacs/github/issues/1610" class="bare">https://github.com/isaacs/github/issues/1610</a></p>
 </div>
@@ -43481,7 +43530,7 @@ explicitly-given</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="clean-the-build"><a class="anchor" href="#clean-the-build"></a><a class="link" href="#clean-the-build">33.7. Clean the build</a></h3>
+<h3 id="clean-the-build"><a class="anchor" href="#clean-the-build"></a><a class="link" href="#clean-the-build">34.7. Clean the build</a></h3>
 <div class="paragraph">
 <p>You did something crazy, and nothing seems to work anymore?</p>
 </div>
@@ -43545,7 +43594,7 @@ ls "$(./getvar buildroot_build_dir)"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="custom-build-directory"><a class="anchor" href="#custom-build-directory"></a><a class="link" href="#custom-build-directory">33.8. Custom build directory</a></h3>
+<h3 id="custom-build-directory"><a class="anchor" href="#custom-build-directory"></a><a class="link" href="#custom-build-directory">34.8. Custom build directory</a></h3>
 <div class="paragraph">
 <p>For now there is no way to change the build directory from <code>out/</code> (resp. <code>out.docker</code> for Docker) to something else.</p>
 </div>
@@ -43560,7 +43609,7 @@ ln -s out /mnt/hd/linux-kernel-module-cheat-out</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="ccache"><a class="anchor" href="#ccache"></a><a class="link" href="#ccache">33.9. ccache</a></h3>
+<h3 id="ccache"><a class="anchor" href="#ccache"></a><a class="link" href="#ccache">34.9. ccache</a></h3>
 <div class="paragraph">
 <p><a href="https://en.wikipedia.org/wiki/Ccache">ccache</a> <a href="#benchmark-builds">might</a> save you a lot of re-build when you decide to <a href="#clean-the-build">Clean the build</a> or create a new <a href="#build-variants">build variant</a>.</p>
 </div>
@@ -43640,7 +43689,7 @@ export CCACHE_MAXSIZE="20G"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="getvar"><a class="anchor" href="#getvar"></a><a class="link" href="#getvar">33.10. getvar</a></h3>
+<h3 id="getvar"><a class="anchor" href="#getvar"></a><a class="link" href="#getvar">34.10. getvar</a></h3>
 <div class="paragraph">
 <p>The <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/getvar">getvar</a> helper script can print the values of internal LKMC variables.</p>
 </div>
@@ -43678,7 +43727,7 @@ export CCACHE_MAXSIZE="20G"</pre>
 <p>For this reason, we use it in particular often in this README to reduce the need for refactoring.</p>
 </div>
 <div class="sect3">
-<h4 id="run-toolchain"><a class="anchor" href="#run-toolchain"></a><a class="link" href="#run-toolchain">33.10.1. run-toolchain</a></h4>
+<h4 id="run-toolchain"><a class="anchor" href="#run-toolchain"></a><a class="link" href="#run-toolchain">34.10.1. run-toolchain</a></h4>
 <div class="paragraph">
 <p>While you could just manually find/learn the path to toolchain tools, e.g. in LKMC b15a0e455d691afa49f3b813ad9b09394dfb02b7 they are:</p>
 </div>
@@ -43725,7 +43774,7 @@ export CCACHE_MAXSIZE="20G"</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="disas"><a class="anchor" href="#disas"></a><a class="link" href="#disas">33.10.1.1. disas</a></h5>
+<h5 id="disas"><a class="anchor" href="#disas"></a><a class="link" href="#disas">34.10.1.1. disas</a></h5>
 <div class="paragraph">
 <p>Since disassembly of a single function of an LKMC executable with GDB is such a common use case for <a href="#run-toolchain">run-toolchain</a> via <a href="https://stackoverflow.com/questions/22769246/how-to-disassemble-one-single-function-using-objdump" class="bare">https://stackoverflow.com/questions/22769246/how-to-disassemble-one-single-function-using-objdump</a>, we have this shortcut for it.</p>
 </div>
@@ -43757,7 +43806,7 @@ export CCACHE_MAXSIZE="20G"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="rebuild-buildroot-while-running"><a class="anchor" href="#rebuild-buildroot-while-running"></a><a class="link" href="#rebuild-buildroot-while-running">33.11. Rebuild Buildroot while running</a></h3>
+<h3 id="rebuild-buildroot-while-running"><a class="anchor" href="#rebuild-buildroot-while-running"></a><a class="link" href="#rebuild-buildroot-while-running">34.11. Rebuild Buildroot while running</a></h3>
 <div class="paragraph">
 <p>It is not possible to rebuild the root filesystem while running QEMU because QEMU holds the qcow2 file:</p>
 </div>
@@ -43768,7 +43817,7 @@ export CCACHE_MAXSIZE="20G"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="simultaneous-runs"><a class="anchor" href="#simultaneous-runs"></a><a class="link" href="#simultaneous-runs">33.12. Simultaneous runs</a></h3>
+<h3 id="simultaneous-runs"><a class="anchor" href="#simultaneous-runs"></a><a class="link" href="#simultaneous-runs">34.12. Simultaneous runs</a></h3>
 <div class="paragraph">
 <p>When doing long simulations sweeping across multiple system parameters, it becomes fundamental to do multiple simulations in parallel.</p>
 </div>
@@ -43864,7 +43913,7 @@ less "$(./getvar --arch aarch64 --emulator gem5 --run-id 1 termout_file)"</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>To run multiple gem5 checkouts, see: <a href="#gem5-worktree">Section 33.13.3.1, &#8220;gem5 worktree&#8221;</a>.</p>
+<p>To run multiple gem5 checkouts, see: <a href="#gem5-worktree">Section 34.13.3.1, &#8220;gem5 worktree&#8221;</a>.</p>
 </div>
 <div class="paragraph">
 <p>Implementation note: we create multiple namespaces for two things:</p>
@@ -43903,7 +43952,7 @@ less "$(./getvar --arch aarch64 --emulator gem5 --run-id 1 termout_file)"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="build-variants"><a class="anchor" href="#build-variants"></a><a class="link" href="#build-variants">33.13. Build variants</a></h3>
+<h3 id="build-variants"><a class="anchor" href="#build-variants"></a><a class="link" href="#build-variants">34.13. Build variants</a></h3>
 <div class="paragraph">
 <p>It often happens that you are comparing two versions of the build, a good and a bad one, and trying to figure out why the bad one is bad.</p>
 </div>
@@ -43911,7 +43960,7 @@ less "$(./getvar --arch aarch64 --emulator gem5 --run-id 1 termout_file)"</pre>
 <p>Our build variants system allows you to keep multiple built versions of all major components, so that you can easily switch between running one or the other.</p>
 </div>
 <div class="sect3">
-<h4 id="linux-kernel-build-variants"><a class="anchor" href="#linux-kernel-build-variants"></a><a class="link" href="#linux-kernel-build-variants">33.13.1. Linux kernel build variants</a></h4>
+<h4 id="linux-kernel-build-variants"><a class="anchor" href="#linux-kernel-build-variants"></a><a class="link" href="#linux-kernel-build-variants">34.13.1. Linux kernel build variants</a></h4>
 <div class="paragraph">
 <p>If you want to keep two builds around, one for the latest Linux version, and the other for Linux <code>v4.16</code>:</p>
 </div>
@@ -43947,11 +43996,11 @@ git -C "$(./getvar linux_source_dir)" checkout -
 </div>
 </div>
 <div class="paragraph">
-<p>To run both kernels simultaneously, one on each QEMU instance, see: <a href="#simultaneous-runs">Section 33.12, &#8220;Simultaneous runs&#8221;</a>.</p>
+<p>To run both kernels simultaneously, one on each QEMU instance, see: <a href="#simultaneous-runs">Section 34.12, &#8220;Simultaneous runs&#8221;</a>.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="qemu-build-variants"><a class="anchor" href="#qemu-build-variants"></a><a class="link" href="#qemu-build-variants">33.13.2. QEMU build variants</a></h4>
+<h4 id="qemu-build-variants"><a class="anchor" href="#qemu-build-variants"></a><a class="link" href="#qemu-build-variants">34.13.2. QEMU build variants</a></h4>
 <div class="paragraph">
 <p>Analogous to the <a href="#linux-kernel-build-variants">Linux kernel build variants</a> but with the <code>--qemu-build-id</code> option instead:</p>
 </div>
@@ -43967,7 +44016,7 @@ git -C "$(./getvar qemu_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="gem5-build-variants"><a class="anchor" href="#gem5-build-variants"></a><a class="link" href="#gem5-build-variants">33.13.3. gem5 build variants</a></h4>
+<h4 id="gem5-build-variants"><a class="anchor" href="#gem5-build-variants"></a><a class="link" href="#gem5-build-variants">34.13.3. gem5 build variants</a></h4>
 <div class="paragraph">
 <p>Analogous to the <a href="#linux-kernel-build-variants">Linux kernel build variants</a> but with the <code>--gem5-build-id</code> option instead:</p>
 </div>
@@ -43998,7 +44047,7 @@ git -C "$(./getvar gem5_source_dir)" checkout some-branch
 <p>Therefore, you must not forget to check out the sources that correspond to the build before running, unless you explicitly tell gem5 to use a non-default source tree with <a href="#gem5-worktree">gem5 worktree</a>. The latter becomes unavoidable when you want to launch multiple simultaneous runs at different checkouts.</p>
 </div>
 <div class="sect4">
-<h5 id="gem5-worktree"><a class="anchor" href="#gem5-worktree"></a><a class="link" href="#gem5-worktree">33.13.3.1. gem5 worktree</a></h5>
+<h5 id="gem5-worktree"><a class="anchor" href="#gem5-worktree"></a><a class="link" href="#gem5-worktree">34.13.3.1. gem5 worktree</a></h5>
 <div class="paragraph">
 <p><a href="#gem5-build-variants"><code>--gem5-build-id</code></a> goes a long way, but if you want to seamlessly switch between two gem5 trees without checking out multiple times, then <code>--gem5-worktree</code> is for you.</p>
 </div>
@@ -44051,7 +44100,7 @@ cd -
 </div>
 </div>
 <div class="sect4">
-<h5 id="gem5-private-source-trees"><a class="anchor" href="#gem5-private-source-trees"></a><a class="link" href="#gem5-private-source-trees">33.13.3.2. gem5 private source trees</a></h5>
+<h5 id="gem5-private-source-trees"><a class="anchor" href="#gem5-private-source-trees"></a><a class="link" href="#gem5-private-source-trees">34.13.3.2. gem5 private source trees</a></h5>
 <div class="paragraph">
 <p>Suppose that you are working on a private fork of gem5, but you want to use this repository to develop it as well.</p>
 </div>
@@ -44095,7 +44144,7 @@ gem5_internal="$(pwd)/gem5-internal"</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="buildroot-build-variants"><a class="anchor" href="#buildroot-build-variants"></a><a class="link" href="#buildroot-build-variants">33.13.4. Buildroot build variants</a></h4>
+<h4 id="buildroot-build-variants"><a class="anchor" href="#buildroot-build-variants"></a><a class="link" href="#buildroot-build-variants">34.13.4. Buildroot build variants</a></h4>
 <div class="paragraph">
 <p>Allows you to have multiple versions of the GCC toolchain or root filesystem.</p>
 </div>
@@ -44115,7 +44164,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect2">
-<h3 id="optimization-level-of-a-build"><a class="anchor" href="#optimization-level-of-a-build"></a><a class="link" href="#optimization-level-of-a-build">33.14. Optimization level of a build</a></h3>
+<h3 id="optimization-level-of-a-build"><a class="anchor" href="#optimization-level-of-a-build"></a><a class="link" href="#optimization-level-of-a-build">34.14. Optimization level of a build</a></h3>
 <div class="paragraph">
 <p>The <code>--optimization-level</code> option is available on all build scripts and sets the given GCC <code>-O</code> optimization level where it has been implemented for guest binaries.</p>
 </div>
@@ -44142,9 +44191,9 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect2">
-<h3 id="directory-structure"><a class="anchor" href="#directory-structure"></a><a class="link" href="#directory-structure">33.15. Directory structure</a></h3>
+<h3 id="directory-structure"><a class="anchor" href="#directory-structure"></a><a class="link" href="#directory-structure">34.15. Directory structure</a></h3>
 <div class="sect3">
-<h4 id="lkmc-directory"><a class="anchor" href="#lkmc-directory"></a><a class="link" href="#lkmc-directory">33.15.1. lkmc directory</a></h4>
+<h4 id="lkmc-directory"><a class="anchor" href="#lkmc-directory"></a><a class="link" href="#lkmc-directory">34.15.1. lkmc directory</a></h4>
 <div class="paragraph">
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/lkmc/">lkmc/</a> contains sources and headers that are shared across kernel modules, userland and baremetal examples.</p>
 </div>
@@ -44155,7 +44204,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 <p>Another option would have been to name it as <code>includes/lkmc</code>, but that would make paths longer, and we might want to store source code in that directory as well in the future.</p>
 </div>
 <div class="sect4">
-<h5 id="userland-objects-vs-header-only"><a class="anchor" href="#userland-objects-vs-header-only"></a><a class="link" href="#userland-objects-vs-header-only">33.15.1.1. Userland objects vs header-only</a></h5>
+<h5 id="userland-objects-vs-header-only"><a class="anchor" href="#userland-objects-vs-header-only"></a><a class="link" href="#userland-objects-vs-header-only">34.15.1.1. Userland objects vs header-only</a></h5>
 <div class="paragraph">
 <p>When factoring out functionality across userland examples, there are two main options:</p>
 </div>
@@ -44214,7 +44263,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="buildroot_packages-directory"><a class="anchor" href="#buildroot_packages-directory"></a><a class="link" href="#buildroot_packages-directory">33.15.2. buildroot_packages directory</a></h4>
+<h4 id="buildroot_packages-directory"><a class="anchor" href="#buildroot_packages-directory"></a><a class="link" href="#buildroot_packages-directory">34.15.2. buildroot_packages directory</a></h4>
 <div class="paragraph">
 <p>Source: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/buildroot_packages/">buildroot_packages/</a>.</p>
 </div>
@@ -44263,7 +44312,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 <p>A custom build script can give you more flexibility: e.g. the package can be made work with other root filesystems more easily, have better <a href="#9p">9P</a> support, and rebuild faster as it evades some Buildroot boilerplate.</p>
 </div>
 <div class="sect4">
-<h5 id="kernel-modules-buildroot-package"><a class="anchor" href="#kernel-modules-buildroot-package"></a><a class="link" href="#kernel-modules-buildroot-package">33.15.2.1. kernel_modules buildroot package</a></h5>
+<h5 id="kernel-modules-buildroot-package"><a class="anchor" href="#kernel-modules-buildroot-package"></a><a class="link" href="#kernel-modules-buildroot-package">34.15.2.1. kernel_modules buildroot package</a></h5>
 <div class="paragraph">
 <p>Source: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/buildroot_packages/kernel_modules/">buildroot_packages/kernel_modules/</a></p>
 </div>
@@ -44310,9 +44359,9 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="patches-directory"><a class="anchor" href="#patches-directory"></a><a class="link" href="#patches-directory">33.15.3. patches directory</a></h4>
+<h4 id="patches-directory"><a class="anchor" href="#patches-directory"></a><a class="link" href="#patches-directory">34.15.3. patches directory</a></h4>
 <div class="sect4">
-<h5 id="patches-global-directory"><a class="anchor" href="#patches-global-directory"></a><a class="link" href="#patches-global-directory">33.15.3.1. patches/global directory</a></h5>
+<h5 id="patches-global-directory"><a class="anchor" href="#patches-global-directory"></a><a class="link" href="#patches-global-directory">34.15.3.1. patches/global directory</a></h5>
 <div class="paragraph">
 <p>Has the following structure:</p>
 </div>
@@ -44329,7 +44378,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect4">
-<h5 id="patches-manual-directory"><a class="anchor" href="#patches-manual-directory"></a><a class="link" href="#patches-manual-directory">33.15.3.2. patches/manual directory</a></h5>
+<h5 id="patches-manual-directory"><a class="anchor" href="#patches-manual-directory"></a><a class="link" href="#patches-manual-directory">34.15.3.2. patches/manual directory</a></h5>
 <div class="paragraph">
 <p>Patches in this directory are never applied automatically: it is up to users to manually apply them before usage following the instructions in this documentation.</p>
 </div>
@@ -44339,7 +44388,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="rootfs_overlay"><a class="anchor" href="#rootfs_overlay"></a><a class="link" href="#rootfs_overlay">33.15.4. rootfs_overlay</a></h4>
+<h4 id="rootfs_overlay"><a class="anchor" href="#rootfs_overlay"></a><a class="link" href="#rootfs_overlay">34.15.4. rootfs_overlay</a></h4>
 <div class="paragraph">
 <p>Source: <a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/rootfs_overlay">rootfs_overlay</a>.</p>
 </div>
@@ -44386,7 +44435,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 <p>This way you can just hack away the scripts and try them out immediately without any further operations.</p>
 </div>
 <div class="sect4">
-<h5 id="out_rootfs_overlay_dir"><a class="anchor" href="#out_rootfs_overlay_dir"></a><a class="link" href="#out_rootfs_overlay_dir">33.15.4.1. out_rootfs_overlay_dir</a></h5>
+<h5 id="out_rootfs_overlay_dir"><a class="anchor" href="#out_rootfs_overlay_dir"></a><a class="link" href="#out_rootfs_overlay_dir">34.15.4.1. out_rootfs_overlay_dir</a></h5>
 <div class="paragraph">
 <p>This path can be found with:</p>
 </div>
@@ -44420,7 +44469,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="lkmc-c"><a class="anchor" href="#lkmc-c"></a><a class="link" href="#lkmc-c">33.15.5. lkmc.c</a></h4>
+<h4 id="lkmc-c"><a class="anchor" href="#lkmc-c"></a><a class="link" href="#lkmc-c">34.15.5. lkmc.c</a></h4>
 <div class="paragraph">
 <p>The files:</p>
 </div>
@@ -44450,7 +44499,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="lkmc_home"><a class="anchor" href="#lkmc_home"></a><a class="link" href="#lkmc_home">33.15.6. lkmc_home</a></h4>
+<h4 id="lkmc_home"><a class="anchor" href="#lkmc_home"></a><a class="link" href="#lkmc_home">34.15.6. lkmc_home</a></h4>
 <div class="paragraph">
 <p><code>lkmc_home</code> refers to the target base directory in which we put all our custom built stuff, such as <a href="#userland-setup">userland executables</a> and <a href="#your-first-kernel-module-hack">kernel modules</a>.</p>
 </div>
@@ -44483,7 +44532,7 @@ git -C "$(./getvar buildroot_source_dir)" checkout -
 </div>
 </div>
 <div class="sect3">
-<h4 id="path-properties"><a class="anchor" href="#path-properties"></a><a class="link" href="#path-properties">33.15.7. path_properties.py</a></h4>
+<h4 id="path-properties"><a class="anchor" href="#path-properties"></a><a class="link" href="#path-properties">34.15.7. path_properties.py</a></h4>
 <div class="paragraph">
 <p>In order to build and run each userland and <a href="#baremetal-setup">baremetal</a> example properly, we need per-file metadata such as compiler flags and required number of cores.</p>
 </div>
@@ -44546,7 +44595,7 @@ baremetal=True</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="rand_check-out"><a class="anchor" href="#rand_check-out"></a><a class="link" href="#rand_check-out">33.15.8. rand_check.out</a></h4>
+<h4 id="rand_check-out"><a class="anchor" href="#rand_check-out"></a><a class="link" href="#rand_check-out">34.15.8. rand_check.out</a></h4>
 <div class="paragraph">
 <p>Print out several parameters that normally change randomly from boot to boot:</p>
 </div>
@@ -44574,9 +44623,9 @@ baremetal=True</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="test-this-repo"><a class="anchor" href="#test-this-repo"></a><a class="link" href="#test-this-repo">33.16. Test this repo</a></h3>
+<h3 id="test-this-repo"><a class="anchor" href="#test-this-repo"></a><a class="link" href="#test-this-repo">34.16. Test this repo</a></h3>
 <div class="sect3">
-<h4 id="automated-tests"><a class="anchor" href="#automated-tests"></a><a class="link" href="#automated-tests">33.16.1. Automated tests</a></h4>
+<h4 id="automated-tests"><a class="anchor" href="#automated-tests"></a><a class="link" href="#automated-tests">34.16.1. Automated tests</a></h4>
 <div class="paragraph">
 <p>Run almost all tests:</p>
 </div>
@@ -44632,7 +44681,7 @@ echo $?</pre>
 <p><a href="https://github.com/cirosantilli/linux-kernel-module-cheat/blob/master/test">test</a> does not run all possible tests, because there are too many possible variations and that would take forever. The rationale is the same as for <code>./build all</code> and is explained in <code>./build --help</code>.</p>
 </div>
 <div class="sect4">
-<h5 id="test-arch-and-emulator-selection"><a class="anchor" href="#test-arch-and-emulator-selection"></a><a class="link" href="#test-arch-and-emulator-selection">33.16.1.1. Test arch and emulator selection</a></h5>
+<h5 id="test-arch-and-emulator-selection"><a class="anchor" href="#test-arch-and-emulator-selection"></a><a class="link" href="#test-arch-and-emulator-selection">34.16.1.1. Test arch and emulator selection</a></h5>
 <div class="paragraph">
 <p>You can select multiple archs and emulators of interest, as for any other command, with:</p>
 </div>
@@ -44665,7 +44714,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="quit-on-fail"><a class="anchor" href="#quit-on-fail"></a><a class="link" href="#quit-on-fail">33.16.1.2. Quit on fail</a></h5>
+<h5 id="quit-on-fail"><a class="anchor" href="#quit-on-fail"></a><a class="link" href="#quit-on-fail">34.16.1.2. Quit on fail</a></h5>
 <div class="paragraph">
 <p>By default, tests continue running even after the first failure happens, and then show a summary at the end.</p>
 </div>
@@ -44679,7 +44728,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="test-userland-in-full-system"><a class="anchor" href="#test-userland-in-full-system"></a><a class="link" href="#test-userland-in-full-system">33.16.1.3. Test userland in full system</a></h5>
+<h5 id="test-userland-in-full-system"><a class="anchor" href="#test-userland-in-full-system"></a><a class="link" href="#test-userland-in-full-system">34.16.1.3. Test userland in full system</a></h5>
 <div class="paragraph">
 <p>TODO: we really need a mechanism to automatically generate the test list, e.g. based on <a href="#path-properties">path_properties.py</a>. Currently there are many tests missing, and we have to add everything manually, which is very annoying.</p>
 </div>
@@ -44708,7 +44757,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="gdb-tests"><a class="anchor" href="#gdb-tests"></a><a class="link" href="#gdb-tests">33.16.1.4. GDB tests</a></h5>
+<h5 id="gdb-tests"><a class="anchor" href="#gdb-tests"></a><a class="link" href="#gdb-tests">34.16.1.4. GDB tests</a></h5>
 <div class="paragraph">
 <p>We have some <a href="https://github.com/pexpect/pexpect">pexpect</a> automated tests for GDB for both userland and baremetal programs!</p>
 </div>
@@ -44781,7 +44830,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="magic-failure-string"><a class="anchor" href="#magic-failure-string"></a><a class="link" href="#magic-failure-string">33.16.1.5. Magic failure string</a></h5>
+<h5 id="magic-failure-string"><a class="anchor" href="#magic-failure-string"></a><a class="link" href="#magic-failure-string">34.16.1.5. Magic failure string</a></h5>
 <div class="paragraph">
 <p>We do not know of any way to set the emulator exit status in QEMU arm full system.</p>
 </div>
@@ -44884,9 +44933,9 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect3">
-<h4 id="non-automated-tests"><a class="anchor" href="#non-automated-tests"></a><a class="link" href="#non-automated-tests">33.16.2. Non-automated tests</a></h4>
+<h4 id="non-automated-tests"><a class="anchor" href="#non-automated-tests"></a><a class="link" href="#non-automated-tests">34.16.2. Non-automated tests</a></h4>
 <div class="sect4">
-<h5 id="test-gdb-linux-kernel"><a class="anchor" href="#test-gdb-linux-kernel"></a><a class="link" href="#test-gdb-linux-kernel">33.16.2.1. Test GDB Linux kernel</a></h5>
+<h5 id="test-gdb-linux-kernel"><a class="anchor" href="#test-gdb-linux-kernel"></a><a class="link" href="#test-gdb-linux-kernel">34.16.2.1. Test GDB Linux kernel</a></h5>
 <div class="paragraph">
 <p>For the Linux kernel, do the following manual tests for now.</p>
 </div>
@@ -44924,7 +44973,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="test-the-internet"><a class="anchor" href="#test-the-internet"></a><a class="link" href="#test-the-internet">33.16.2.2. Test the Internet</a></h5>
+<h5 id="test-the-internet"><a class="anchor" href="#test-the-internet"></a><a class="link" href="#test-the-internet">34.16.2.2. Test the Internet</a></h5>
 <div class="paragraph">
 <p>You should also test that the Internet works:</p>
 </div>
@@ -44935,7 +44984,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect4">
-<h5 id="cli-script-tests"><a class="anchor" href="#cli-script-tests"></a><a class="link" href="#cli-script-tests">33.16.2.3. CLI script tests</a></h5>
+<h5 id="cli-script-tests"><a class="anchor" href="#cli-script-tests"></a><a class="link" href="#cli-script-tests">34.16.2.3. CLI script tests</a></h5>
 <div class="paragraph">
 <p><code>build-userland</code> and <code>test-executables</code> have a wide variety of target selection modes, and it was hard to keep them all working without some tests:</p>
 </div>
@@ -44953,7 +45002,7 @@ echo $?</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="bisection"><a class="anchor" href="#bisection"></a><a class="link" href="#bisection">33.17. Bisection</a></h3>
+<h3 id="bisection"><a class="anchor" href="#bisection"></a><a class="link" href="#bisection">34.17. Bisection</a></h3>
 <div class="paragraph">
 <p>When updating the Linux kernel, QEMU and gem5, things sometimes break.</p>
 </div>
@@ -45009,7 +45058,7 @@ git submodule update
 </div>
 </div>
 <div class="sect2">
-<h3 id="update-a-forked-submodule"><a class="anchor" href="#update-a-forked-submodule"></a><a class="link" href="#update-a-forked-submodule">33.18. Update a forked submodule</a></h3>
+<h3 id="update-a-forked-submodule"><a class="anchor" href="#update-a-forked-submodule"></a><a class="link" href="#update-a-forked-submodule">34.18. Update a forked submodule</a></h3>
 <div class="paragraph">
 <p>This is a template update procedure for submodules for which we have some patches on top of mainline.</p>
 </div>
@@ -45038,9 +45087,9 @@ git commit -m "linux: update to ${next_mainline_revision}"</pre>
 </div>
 </div>
 <div class="sect2">
-<h3 id="release"><a class="anchor" href="#release"></a><a class="link" href="#release">33.19. Release</a></h3>
+<h3 id="release"><a class="anchor" href="#release"></a><a class="link" href="#release">34.19. Release</a></h3>
 <div class="sect3">
-<h4 id="release-procedure"><a class="anchor" href="#release-procedure"></a><a class="link" href="#release-procedure">33.19.1. Release procedure</a></h4>
+<h4 id="release-procedure"><a class="anchor" href="#release-procedure"></a><a class="link" href="#release-procedure">34.19.1. Release procedure</a></h4>
 <div class="paragraph">
 <p>Ensure that the <a href="#automated-tests">Automated tests</a> are passing on a clean build:</p>
 </div>
@@ -45051,7 +45100,7 @@ git commit -m "linux: update to ${next_mainline_revision}"</pre>
 </div>
 </div>
 <div class="paragraph">
-<p>The <code>./build-test</code> command builds a superset of what will be downloaded which also tests other things we would like to be working on the release. For the minimal build to generate the files to be uploaded, see: <a href="#release-zip">Section 33.19.2, &#8220;release-zip&#8221;</a></p>
+<p>The <code>./build-test</code> command builds a superset of what will be downloaded which also tests other things we would like to be working on the release. For the minimal build to generate the files to be uploaded, see: <a href="#release-zip">Section 34.19.2, &#8220;release-zip&#8221;</a></p>
 </div>
 <div class="paragraph">
 <p>The clean build is necessary as it generates clean images since <a href="#remove-buildroot-packages">it is not possible to remove Buildroot packages</a></p>
@@ -45121,7 +45170,7 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect3">
-<h4 id="release-zip"><a class="anchor" href="#release-zip"></a><a class="link" href="#release-zip">33.19.2. release-zip</a></h4>
+<h4 id="release-zip"><a class="anchor" href="#release-zip"></a><a class="link" href="#release-zip">34.19.2. release-zip</a></h4>
 <div class="paragraph">
 <p>Create a zip containing all files required for <a href="#prebuilt">Prebuilt setup</a>:</p>
 </div>
@@ -45146,7 +45195,7 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect3">
-<h4 id="release-upload"><a class="anchor" href="#release-upload"></a><a class="link" href="#release-upload">33.19.3. release-upload</a></h4>
+<h4 id="release-upload"><a class="anchor" href="#release-upload"></a><a class="link" href="#release-upload">34.19.3. release-upload</a></h4>
 <div class="paragraph">
 <p>After:</p>
 </div>
@@ -45194,9 +45243,9 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect2">
-<h3 id="design-rationale"><a class="anchor" href="#design-rationale"></a><a class="link" href="#design-rationale">33.20. Design rationale</a></h3>
+<h3 id="design-rationale"><a class="anchor" href="#design-rationale"></a><a class="link" href="#design-rationale">34.20. Design rationale</a></h3>
 <div class="sect3">
-<h4 id="design-goals"><a class="anchor" href="#design-goals"></a><a class="link" href="#design-goals">33.20.1. Design goals</a></h4>
+<h4 id="design-goals"><a class="anchor" href="#design-goals"></a><a class="link" href="#design-goals">34.20.1. Design goals</a></h4>
 <div class="paragraph">
 <p>This project was created to help me understand, modify and test low level system components by using system simulators.</p>
 </div>
@@ -45272,7 +45321,7 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect3">
-<h4 id="setup-trade-offs"><a class="anchor" href="#setup-trade-offs"></a><a class="link" href="#setup-trade-offs">33.20.2. Setup trade-offs</a></h4>
+<h4 id="setup-trade-offs"><a class="anchor" href="#setup-trade-offs"></a><a class="link" href="#setup-trade-offs">34.20.2. Setup trade-offs</a></h4>
 <div class="paragraph">
 <p>The trade-offs between the different <a href="#getting-started">setups</a> are basically a balance between:</p>
 </div>
@@ -45297,13 +45346,13 @@ git push --follow-tags
 <p>compatibility: how likely it is that all the components will work well together: emulator, compiler, kernel, standard library, &#8230;&#8203;</p>
 </li>
 <li>
-<p>guest software availability: how wide is your choice of easily installed guest software packages? See also: <a href="#linux-distro-choice">Section 33.20.4, &#8220;Linux distro choice&#8221;</a></p>
+<p>guest software availability: how wide is your choice of easily installed guest software packages? See also: <a href="#linux-distro-choice">Section 34.20.4, &#8220;Linux distro choice&#8221;</a></p>
 </li>
 </ul>
 </div>
 </div>
 <div class="sect3">
-<h4 id="resource-tradeoff-guidelines"><a class="anchor" href="#resource-tradeoff-guidelines"></a><a class="link" href="#resource-tradeoff-guidelines">33.20.3. Resource tradeoff guidelines</a></h4>
+<h4 id="resource-tradeoff-guidelines"><a class="anchor" href="#resource-tradeoff-guidelines"></a><a class="link" href="#resource-tradeoff-guidelines">34.20.3. Resource tradeoff guidelines</a></h4>
 <div class="paragraph">
 <p>Choosing which features go into our default builds means making tradeoffs. Here are our guidelines:</p>
 </div>
@@ -45344,11 +45393,11 @@ git push --follow-tags
 </ul>
 </div>
 <div class="paragraph">
-<p>In order to learn how to measure some of those aspects, see: <a href="#benchmark-this-repo">Section 29, &#8220;Benchmark this repo&#8221;</a>.</p>
+<p>In order to learn how to measure some of those aspects, see: <a href="#benchmark-this-repo">Section 30, &#8220;Benchmark this repo&#8221;</a>.</p>
 </div>
 </div>
 <div class="sect3">
-<h4 id="linux-distro-choice"><a class="anchor" href="#linux-distro-choice"></a><a class="link" href="#linux-distro-choice">33.20.4. Linux distro choice</a></h4>
+<h4 id="linux-distro-choice"><a class="anchor" href="#linux-distro-choice"></a><a class="link" href="#linux-distro-choice">34.20.4. Linux distro choice</a></h4>
 <div class="paragraph">
 <p>We haven&#8217;t found the ultimate distro yet. Here is a summary table of trade-offs that we care about: <a href="#table-lkmc-linux-distro-comparison">Table 8, &#8220;Comparison of Linux distros for usage in this repository&#8221;</a>.</p>
 </div>
@@ -45451,9 +45500,9 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect2">
-<h3 id="soft-topics"><a class="anchor" href="#soft-topics"></a><a class="link" href="#soft-topics">33.21. Soft topics</a></h3>
+<h3 id="soft-topics"><a class="anchor" href="#soft-topics"></a><a class="link" href="#soft-topics">34.21. Soft topics</a></h3>
 <div class="sect3">
-<h4 id="fairy-tale"><a class="anchor" href="#fairy-tale"></a><a class="link" href="#fairy-tale">33.21.1. Fairy tale</a></h4>
+<h4 id="fairy-tale"><a class="anchor" href="#fairy-tale"></a><a class="link" href="#fairy-tale">34.21.1. Fairy tale</a></h4>
 <div class="quoteblock">
 <blockquote>
 <div class="paragraph">
@@ -45491,7 +45540,7 @@ git push --follow-tags
 </div>
 </div>
 <div class="sect2">
-<h3 id="bibliography"><a class="anchor" href="#bibliography"></a><a class="link" href="#bibliography">33.22. Bibliography</a></h3>
+<h3 id="bibliography"><a class="anchor" href="#bibliography"></a><a class="link" href="#bibliography">34.22. Bibliography</a></h3>
 <div class="paragraph">
 <p>Runnable stuff:</p>
 </div>