Skip to content

SYSTEM Cited by 3 sources

NVMe SSD

What it is

NVMe (Non-Volatile Memory express) is the host-controller interface specification that exposes an SSD to the CPU over PCIe instead of over older storage buses (SATA, SAS) designed for spinning disks. An NVMe SSD is a solid-state drive that speaks NVMe natively.

Why it matters: SATA was designed assuming hundreds of IOPS (HDD scale). NVMe was designed for hundreds-of-thousands of IOPS (SSD scale), with multi-queue parallelism, short command sets, and direct PCIe lane access — no HBA translation.

Dicken's positioning

"NVMe SSDs are a type of solid state disk that use the non-volatile memory host controller interface specification for blazing-fast IO speed and great bandwidth."

"A round trip from the CPU to a locally-attached NVMe SSD takes about 50,000 nanoseconds (50 microseconds)."

(Source: sources/2025-03-13-planetscale-io-devices-and-latency)

Performance profile

Metric Typical NVMe SSD
Round-trip latency ~50 μs
Random-read IOPS 100k–1M+
Sequential throughput 3–14 GB/s (PCIe 4.0–5.0)
Internal organisation Targets → blocks → pages (concepts/nand-flash-page-block-erasure)
Parallelism Multi-target (multi-lane) within the drive (concepts/ssd-parallelism-via-targets)

Compared to SATA SSD (~100 μs, 100k IOPS max, 550 MB/s sequential) NVMe is a ~2× latency win and ~10× throughput win from the interface change alone, on top of whatever the underlying NAND generations contribute.

Why NVMe matters to databases

  • Fewer IOs are on the critical path. NVMe's latency floor is close to the NAND's — there's less protocol overhead to subtract from the budget.
  • Multi-queue submission matches multi-threaded database engines. A database with many concurrent transactions can keep the drive's parallel targets busy.
  • Durability semantics are identical to SATA SSD — power-loss protection, write-ordering guarantees, TRIM/DISCARD all apply.

Hidden performance issues

The same physics issues all SSDs have still apply to NVMe:

NVMe in the cloud

NVMe SSDs appear in cloud deployments in two shapes:

Direct-attached (instance store)

The drive is physically in the hypervisor host. Bandwidth and latency approach bare-metal figures. Caveat: the drive does not survive instance termination. Common on AWS i3 / i4 / i7 / im4gn / metal instance families.

Used by:

Network-attached (EBS-class, backed by NVMe)

The customer's logical volume is served by a remote fleet of NVMe-backed storage servers over a datacenter network. Adds the concepts/network-attached-storage-latency-penalty|5× latency hop but survives instance loss. See systems/aws-ebs (and AWS's custom systems/aws-nitro-ssd, which is the NVMe drive they built specifically for the EBS workload).

Seen in

Last updated · 319 distilled / 1,201 read