Computer Specs

Computer 1 (Brownie)

Here you can see the specs for Brownie, one of the computers used in the benchmarking tests.

Processor

Key Value
Manufacturer Intel
Product Core i5-8300H

Relevant details viewable on WikiChip.

Of specific note are the 4 available cores (8 threads via hyperthreading). The processor runs at a base frequency of 2.3 GHz with boost scaling as such:

Turbo Freqency [GHz] Cores
4.0 1
4.0 2
3.9 3
3.9 4

The L1D (D for data) cache is 32 KiB which is shared by 2 threads per core and the unified L2 cache has 256 KiB available per core. Lastly, the L3 has up to 2 MiB available per core wich can be shared between all cores.

Below you can see the latency pyramid for fastest load-to-use (simple pointer accesses):

Memory Latency [cycles]
L1D 4
L2 12
L3 42

Compared to RAM, data is loaded from L1D via 64 byte cache lines. This leads to the following theoretical L1D bandwidths:

Memory Bandwidth Base Frequency [GB/s] Bandwidth Turbo Frequency [GB/s] #Active Cores
L1D 147.2 256 1
L1D 294.4 512 2
L1D 441.6 748.8 3
L1D 588 998 4

While the AVX2 instruction set is supported, AVX512 is not. The chip integrates Intel's UHD Graphics 630 but in our benchmarks we usually forego this integrated GPU.

Graphics

Key Value
Manufacturer NVIDIA
Product GeForce GTX 1050 Ti with Max-Q Design

It features 4 GB of dedicated GPU memory. The memory modules have a frequency of 1752 MHz. Each GDDR5 cycle transfers 4 bytes which leads to an effective datarate of 7.008 GB/s per pin. As the memory bus is 128 bit wide, we have therefore a theoretical bandwidth of 112.128 GB/s. But this theoretical bandwidth is only useful if the memory already is on the device. For host-device transfers data goes through a PCI-Express 3.0 x16 interface, which has a theoretical bandwidth of 15.5 GB/s.

Running NVIDIA's publicly available bandwidthtest.cu bandwidth measuring program, yields the following results:

Device 0: NVIDIA GeForce GTX 1050 Ti with Max-Q Design
Quick Mode

Host to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes)        Bandwidth(GB/s)
32000000                     12.6

Device to Host Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes)        Bandwidth(GB/s)
32000000                     11.8

Device to Device Bandwidth, 1 Device(s)
PINNED Memory Transfers
Transfer Size (Bytes)        Bandwidth(GB/s)
32000000                     94.9

These values are reasonably close to the aforementioned limits. The theoretical FP64:FP32 FLOP rate is 1:32 for this consumer graphics card.

Memory

Key Value
Manufacturer Micron
Product 8ATF1G64HZ-2G6E1

Obtained via:

$ wmic memorychip list full

The system is outfitted with 2x 8 GB RAM sticks for a joint memory of 16 GB. Given a clock speed of 1333MHz (which translates to 2666 MT/s of 64 bit values) we get a theoretical bandwidth of 21.328 GB/s per RAM stick. This is the highest bandwidth memory the Core i5-8300H supports. The CPU supports two of these, so the overall theoretical maximum bandwidth is twice the amount, topping at 42.656 GB/s.

Storage

Slot 0

Key Value
Manufacturer Western Digital
Product SN720
Size (Gross) 512 GB

Slot 1

Key Value
Manufacturer Crucial
Product P3
Size (Gross) 1 TB