benchmark gem5 --trace ExecAll vs without on SE.py

This commit is contained in:
Ciro Santilli 六四事件 法轮功
2020-01-17 00:00:00 +00:00
parent 452f30c016
commit 911cd11ab9

View File

@@ -19724,8 +19724,9 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
.Busy loop MIPS for different simulator setups
[options="header"]
|===
|LKMC |Benchmark build |Emulator command |Loops |Time (s) |Instruction count |Approximate MIPS
|Comment |LKMC |Benchmark build |Emulator command |Loops |Time (s) |Instruction count |Approximate MIPS
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`qemu --arch aarch64`
@@ -19734,6 +19735,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.1 * 10^11 (approx)
|2000
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`gem5 --arch aarch64`
@@ -19742,6 +19744,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.10018162 * 10^8
|1
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`gem5 --arch aarch64 --gem5-build-id debug`
@@ -19750,6 +19753,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|2.528728 * 10^6
|0.08
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`+gem5 --arch aarch64 -- --cpu-type MinorCPU --caches+`
@@ -19758,6 +19762,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.1018152 * 10^7
|0.4
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`+gem5 --arch aarch64 -- --cpu-type DerivO3CPU --caches+`
@@ -19766,6 +19771,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.1018128 * 10^7
|0.2
|
|a18f28e263c91362519ef550150b5c9d75fa3679 + 1
|link:userland/gcc/busy_loop.c[] `-O0`
|`+gem5 --arch aarch64 --gem5-build-id MOESI_CMP_directory -- --cpu-type DerivO3CPU --caches --ruby+`
@@ -19774,6 +19780,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.1005150 * 10^7
|0.2
|
|605448f07e6380634b1aa7e9732d111759f69fd
|<<dhrystone>> `-O3`
|`gem5 --arch aarch64`
@@ -19782,22 +19789,25 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|9.2034139 * 10^7
|1.6
|
|5d233f2664a78789f9907d27e2a40e86cefad595
|<<stream-benchmark>> `-O3`
|`gem5 --arch aarch64 --userland-args 300000 2`
|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --trace ExecAll`
|3 * 10^5 * 2
|64
|9.9674773 * 10^7
|1.6
|glibc C pre-main effects
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|glibc C pre-main link:userland/c/m5ops.c[] `-O0`
|link:userland/c/m5ops.c[] `-O0`
|`gem5 --arch aarch64 --userland-args e`
|1
|2
|1.26479 * 10^5
|0.05
|
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|glibc C pre-main link:userland/c/m5ops.c[] `-O0`
|`gem5 --arch aarch64 --userland-args e --gem5-build-type debug`
@@ -19806,6 +19816,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|1.26479 * 10^5
|0.05
|
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|glibc C++ pre-main link:userland/cpp/m5ops.cpp[] `-O0`
|`gem5 --arch aarch64 --userland-args e`
@@ -19814,6 +19825,7 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|2.385012 * 10^6
|1
|
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|glibc C++ pre-main link:userland/cpp/m5ops.cpp[] `-O0`
|`gem5 --arch aarch64 --userland-args e --gem5-build-type debug`
@@ -19822,21 +19834,41 @@ Summary of manually collected results on <<p51>> at LKMC a18f28e263c91362519ef55
|2.385012 * 10^6
|0.1
|gem5 optimized build immediate exit on first instruction to benchmark the simulator startup time
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|immediate exit link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0`
|`gem5 --arch aarch64`
|1
|1
|
|1
|
|same as above but debug build
|ab6f7331406b22f8ab6e2df5f8b8e464fb35b611
|immediate exit link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0`
|link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0`
|`gem5 --arch aarch64 --gem5-build-type debug`
|1
|1
|1
|
|
|Check the effect of an ExecAll log (log every instruction) on execution time, compare to analogous run without it. `trace.txt` size: 3.5GB. 5x slowdown observed with output to a hard disk.
|d29a07ddad499f273cc90dd66e40f8474b5dfc40
|link:userland/gcc/busy_loop.c[] `-O0`
|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --gem5-worktree master --trace ExecAll`
|10^6
|2.4106774 * 10^7
|136
|0.2
|Same as above but with run command manually hacked to output to a ramfs. Slightly faster, but the bulk was still just in log format operations!
|d29a07ddad499f273cc90dd66e40f8474b5dfc40
|link:userland/gcc/busy_loop.c[] `-O0`
|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --gem5-worktree master --trace ExecAll`
|10^6
|2.4106774 * 10^7
|107
|0.2
|===
@@ -20108,14 +20140,9 @@ Tested at: d4b3e064adeeace3c3e7d106801f95c14637c12f + 1.
==== P51
Lenovo ThinkPad https://www3.lenovo.com/gb/en/laptops/thinkpad/p-series/P51/p/22TP2WPWP51[P51 laptop]:
Lenovo ThinkPad https://www3.lenovo.com/gb/en/laptops/thinkpad/p-series/P51/p/22TP2WPWP51[P51 laptop] with the Latest stable Ubuntu.
* 2500 USD in 2018 (high end)
* Intel Core i7-7820HQ Processor (8MB Cache, up to 3.90GHz) (4 cores 8 threads)
* 32GB(16+16) DDR4 2400MHz SODIMM
* 512GB SSD PCIe TLC OPAL2
* NVIDIA Quadro M1200 Mobile, latest Ubuntu supported proprietary driver
* Latest Ubuntu
Full specs and benchmark scores will be maintained at the latest version of: https://github.com/cirosantilli/notes/blob/0c038b0e430d0017f12d028c6a0e7c0b99ec957f/my-hardware.adoc#thinkpad-p51
=== Benchmark Internets