diff --git a/README.adoc b/README.adoc index 95211ad..7167199 100644 --- a/README.adoc +++ b/README.adoc @@ -19724,8 +19724,9 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 .Busy loop MIPS for different simulator setups [options="header"] |=== -|LKMC |Benchmark build |Emulator command |Loops |Time (s) |Instruction count |Approximate MIPS +|Comment |LKMC |Benchmark build |Emulator command |Loops |Time (s) |Instruction count |Approximate MIPS +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`qemu --arch aarch64` @@ -19734,6 +19735,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.1 * 10^11 (approx) |2000 +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`gem5 --arch aarch64` @@ -19742,6 +19744,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.10018162 * 10^8 |1 +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`gem5 --arch aarch64 --gem5-build-id debug` @@ -19750,6 +19753,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |2.528728 * 10^6 |0.08 +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`+gem5 --arch aarch64 -- --cpu-type MinorCPU --caches+` @@ -19758,6 +19762,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.1018152 * 10^7 |0.4 +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`+gem5 --arch aarch64 -- --cpu-type DerivO3CPU --caches+` @@ -19766,6 +19771,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.1018128 * 10^7 |0.2 +| |a18f28e263c91362519ef550150b5c9d75fa3679 + 1 |link:userland/gcc/busy_loop.c[] `-O0` |`+gem5 --arch aarch64 --gem5-build-id MOESI_CMP_directory -- --cpu-type DerivO3CPU --caches --ruby+` @@ -19774,6 +19780,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.1005150 * 10^7 |0.2 +| |605448f07e6380634b1aa7e9732d111759f69fd |<> `-O3` |`gem5 --arch aarch64` @@ -19782,22 +19789,25 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |9.2034139 * 10^7 |1.6 +| |5d233f2664a78789f9907d27e2a40e86cefad595 |<> `-O3` -|`gem5 --arch aarch64 --userland-args 300000 2` +|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --trace ExecAll` |3 * 10^5 * 2 |64 |9.9674773 * 10^7 |1.6 +|glibc C pre-main effects |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 -|glibc C pre-main link:userland/c/m5ops.c[] `-O0` +|link:userland/c/m5ops.c[] `-O0` |`gem5 --arch aarch64 --userland-args e` |1 |2 |1.26479 * 10^5 |0.05 +| |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 |glibc C pre-main link:userland/c/m5ops.c[] `-O0` |`gem5 --arch aarch64 --userland-args e --gem5-build-type debug` @@ -19806,6 +19816,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |1.26479 * 10^5 |0.05 +| |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 |glibc C++ pre-main link:userland/cpp/m5ops.cpp[] `-O0` |`gem5 --arch aarch64 --userland-args e` @@ -19814,6 +19825,7 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |2.385012 * 10^6 |1 +| |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 |glibc C++ pre-main link:userland/cpp/m5ops.cpp[] `-O0` |`gem5 --arch aarch64 --userland-args e --gem5-build-type debug` @@ -19822,21 +19834,41 @@ Summary of manually collected results on <> at LKMC a18f28e263c91362519ef55 |2.385012 * 10^6 |0.1 +|gem5 optimized build immediate exit on first instruction to benchmark the simulator startup time |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 |immediate exit link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0` |`gem5 --arch aarch64` |1 |1 -| +|1 | +|same as above but debug build |ab6f7331406b22f8ab6e2df5f8b8e464fb35b611 -|immediate exit link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0` +|link:userland/arch/aarch64/freestanding/linux/gem5_exit.S[] `-O0` |`gem5 --arch aarch64 --gem5-build-type debug` |1 |1 +|1 | -| + +|Check the effect of an ExecAll log (log every instruction) on execution time, compare to analogous run without it. `trace.txt` size: 3.5GB. 5x slowdown observed with output to a hard disk. +|d29a07ddad499f273cc90dd66e40f8474b5dfc40 +|link:userland/gcc/busy_loop.c[] `-O0` +|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --gem5-worktree master --trace ExecAll` +|10^6 +|2.4106774 * 10^7 +|136 +|0.2 + +|Same as above but with run command manually hacked to output to a ramfs. Slightly faster, but the bulk was still just in log format operations! +|d29a07ddad499f273cc90dd66e40f8474b5dfc40 +|link:userland/gcc/busy_loop.c[] `-O0` +|`./run --arch aarch64 --emulator gem5 --userland userland/gcc/busy_loop.c --userland-args 1000000 --gem5-worktree master --trace ExecAll` +|10^6 +|2.4106774 * 10^7 +|107 +|0.2 |=== @@ -20108,14 +20140,9 @@ Tested at: d4b3e064adeeace3c3e7d106801f95c14637c12f + 1. ==== P51 -Lenovo ThinkPad https://www3.lenovo.com/gb/en/laptops/thinkpad/p-series/P51/p/22TP2WPWP51[P51 laptop]: +Lenovo ThinkPad https://www3.lenovo.com/gb/en/laptops/thinkpad/p-series/P51/p/22TP2WPWP51[P51 laptop] with the Latest stable Ubuntu. -* 2500 USD in 2018 (high end) -* Intel Core i7-7820HQ Processor (8MB Cache, up to 3.90GHz) (4 cores 8 threads) -* 32GB(16+16) DDR4 2400MHz SODIMM -* 512GB SSD PCIe TLC OPAL2 -* NVIDIA Quadro M1200 Mobile, latest Ubuntu supported proprietary driver -* Latest Ubuntu +Full specs and benchmark scores will be maintained at the latest version of: https://github.com/cirosantilli/notes/blob/0c038b0e430d0017f12d028c6a0e7c0b99ec957f/my-hardware.adoc#thinkpad-p51 === Benchmark Internets