PCMark 10 Storage Bench - Real-World Access Traces

Publish date: 2024-06-15

There are a number of storage benchmarks that can subject a device to artificial access traces by varying the mix of reads and writes, the access block sizes, and the queue depth / number of outstanding data requests. We saw results from two popular ones - ATTO, and CrystalDiskMark - in a previous section. More serious benchmarks, however, actually replicate access traces from real-world workloads to determine the suitability of a particular device for a particular workload. Real-world access traces may be used for simulating the behavior of computing activities that are limited by storage performance. Examples include booting an operating system or loading a particular game from the disk.

PCMark 10's storage bench (introduced in v2.1.2153) includes four storage benchmarks that use relevant real-world traces from popular applications and common tasks to fully test the performance of the latest modern drives:

Despite the data drive benchmark appearing most suitable for testing direct-attached storage, we opt to run the full system drive benchmark as part of our evaluation flow. Many of us use portable flash drives as boot drives and storage for Steam games. These types of use-cases are addressed only in the full system drive benchmark.

The Full System Drive Benchmark comprises of 23 different traces. For the purpose of presenting results, we classify them under five different categories:

PCMark 10 also generates an overall score, bandwidth, and average latency number for quick comparison of different drives. The sub-sections in the rest of the page reference the access traces specified in the PCMark 10 Technical Guide.

Booting Windows 10

The read-write bandwidth recorded for each drive in the boo access trace is presented below.

Windows 10 Boot

It is surprising to note that enabling write caching on the storage device has a significant effect on the boot performance. While the workload is definitely read-heavy, the ability of the PSSD to absorb small random writes quickly can result in a significant boost. The raw bandwidth available doesn't seem to have much of an effect, as we see the device performing better when connected via the 10Gbps path compared to the 40Gbps one.

Creative Workloads

The read-write bandwidth recorded for each drive in the sacr, saft, sill, spre, slig, sps, aft, exc, ill, ind, psh, and psl access traces are presented below.

Startup - Adobe Acrobat

The Thunderbolt 3 PSSD with write caching on is a top performer in creative workloads, as one might expect from the SanDisk Professional brand. There are a few components where the Kingston XS2000 with its Gen 2x2 interface sneaks ahead. By and large, though, the PRO-G40 delivers results that are expected from it. The default Thunderbolt 3 PSSD setting of write caching off is a different story - in some components, it performs worse than the PRO-G40 connected via a 10 Gbps interface.

Office Workloads

The read-write bandwidth recorded for each drive in the exc and pow access traces are presented below.

Usage - Microsoft Excel

The write caching setting again plays a notable role in the Thunderbolt mode. Under default conditions, the Thunderbolt mode actually performs a bit worse than the 10Gbps mode.

Gaming Workloads

The read-write bandwidth recorded for each drive in the bf, cod, and ow access traces are presented below.

Startup - Battlefield V

These are read-intensive workloads with token writes that the write caching setting can easily handle to provide top-of-the-line performance. The distribution of transfers is such that the 10Gbps connection again provides better performance compared to default Thunderbolt mode for certain games.

Files Transfer Workloads

The read-write bandwidth recorded for each drive in the cp1, cp2, cp3, cps1, cps2, and cps3 access traces are presented below.

Duplicating ISOs (Read-Write)

File transfer workloads deal mainly with large-sized writes, and it is no surprise then that Thunderbolt mode with write caching on is a top performer across the board. The PSSDs with a 10Gbps connection make up the bottom half of the graphs, but that is no surprise based on the size distribution of the transfers.

Overall Scores

PCMark 10 reports an overall score based on the observed bandwidth and access times for the full workload set. The score, bandwidth, and average access latency for each of the drives are presented below.

Full System Drive Benchmark Bandwidth (MBps)

The overall scores are along expected lines - the Thunderbolt 3 mode with write caching on performs on par with a premium PCIe Gen 3.0 x4 internal SSD, while the other PSSD configurations / host connections follow well behind in terms of both bandwidth and latency.

ncG1vNJzZmivp6x7orrAp5utnZOde6S7zGiqoaenZH54gZhyZqyZnpm2tLeMqamonpWowKq7zZqjZqiipLR1fIytn66mlJq%2Fo7vLrWSuq5Jisbaty6amnZ1dpbyzwMCbo55lo6ixbr7Er6Cer19p