Arm's answer: launch Neoverse as a dedicated infrastructure roadmap in October 2018, re-characterising Cortex-A76 for servers and committing to an annual cadence.
Mirrors how Arm served mobile with Cortex-A/R/M. A cloud platform wants dozens of N-cores per socket for scale-out; an HPC system wants fewer V-cores with wider SIMD; a 5G baseband or DPU wants E-cores at lowest power.
Delivered every year since 2019 — N1 (2019), V1/E1 (2020-21), N2 (2021), V2 (2022), N3/V3 (2024). Faster than x86, driven by shared microarchitecture with Cortex-A flagships.
HPC and analytics codes vectorise well; Neoverse V1's 2 × 256-bit SVE delivers ~2.5 × the per-core FP throughput of N1 on BLAS-like kernels, with comparable area to a desktop X1.
x86 has run 2-way SMT since Pentium 4. Neoverse V1 brings the same trick to Arm — DB-heavy and in-memory KV workloads see 15-30% throughput lift.
A DPU terminating millions of packets per second doesn't want huge OoO machinery; it wants many in-order SMT threads that can keep a 400 GbE pipe full. N/V-class cores would waste area and power on branch predictors the workload can't use.
Ethos-N NPUs and Mali GPUs often sit on the same chip — Arm's "infrastructure CSS" reference designs bundle Neoverse-E with Ethos for combined edge-ML + packet boxes.
N2's sweet spot is 64-128 cores per socket at ~2.5-3.5 GHz, DDR5 + CXL 2.0 memory. Beats x86 Zen 4 on cloud-native (nginx, Redis, Java) by 25-45% perf/watt.
Paired with N2: derived from Cortex-X3. 4 × 128-bit SVE2 (narrower than V1's 2×256 but same FLOPs). Shipped in NVIDIA Grace CPU (72 cores per chiplet, 2 chiplets = 144, 2024).
CSS is the "server-on-a-chip starter kit" — customers integrate chiplets on top, saving 1-2 years of IP integration.
| Generation | Codename | Based on | SVE | Year | Canonical silicon |
|---|---|---|---|---|---|
| N1 | Ares | Cortex-A76 | — | 2019 | Graviton 2, Ampere Altra, Yitian 710 (early) |
| E1 | Helios | Cortex-A65AE | — | 2020 | Marvell Octeon 10, 5G basebands |
| V1 | Zeus | Cortex-X1 | 2 × 256-bit | 2021 | Graviton 3 / 3E, SiPearl Rhea1 |
| N2 | Perseus | Cortex-A710 (v9-A) | 4 × 128-bit SVE2 | 2021 | Graviton 4 (partial), Cobalt 100, Yitian 710 |
| V2 | Demeter | Cortex-X3 (v9-A) | 4 × 128-bit SVE2 | 2022 | NVIDIA Grace (72/144), HPE, planned HPC |
| N3 | — | A720-class (v9.2-A) | 4 × 128-bit SVE2 | 2024 | Azure Cobalt 200 (reported) |
| V3 | — | X4-class (v9.2-A) | 4 × 128-bit SVE2 | 2024 | Graviton 5 (reported), NVIDIA Grace next-gen |
Not shown: custom-core Neoverse "cousins" like Ampere AmpereOne (A192 — 192 custom cores, Armv8.6-A) and Microsoft Cobalt 100 (N2 integration). The V-series trades core count for width; the N-series goes for socket density.
AWS reports Graviton now accounts for >50% of new EC2 capacity. Customers see 20-40% price/perf improvement over Intel/AMD. Hard to argue with the datacentre P&L.
Arm backed Linaro + TF-A + EDK2 + Tianocore to make sure the full server stack was permissively licensed. Removed a big barrier for hyperscalers.
Integrating a full Neoverse mesh is hard — physical design, RTL signoff, ISO 26262/RAS. Only hyperscalers with 100+ engineer silicon teams could afford it. CSS democratises access to Neoverse for companies with much smaller teams.
CSS maps cleanly onto a single chiplet, making Neoverse the "compute" side of UCIe-based multi-chiplet SoCs. Memory / IO / AI accelerators live on complementary chiplets.
Typical pattern: Cortex-X/-A unveiled at Computex/May. Matching Neoverse unveiled at Arm Neoverse Tech Day (Oct-Feb) 6-12 months later. That's the validation + RAS window.
Phone cores aim for Geekbench ST. Neoverse aims for SPECrate, STREAM, DGEMM, DB/nginx/Kafka rps. Different prefetcher tuning per target.
Arm Ltd. — Neoverse TRMs (N1, V1, N2, V2, N3, V3) — freely downloadable on developer.arm.com
Arm Ltd. — Neoverse Tech Day 2022 / 2024 keynotes and whitepapers
Arm Ltd. — Arm Compute Sub-Systems (CSS) product briefs
AWS — Graviton 2 / 3 / 4 performance whitepapers — aws.amazon.com/ec2/graviton
NVIDIA — NVIDIA Grace CPU superchip architecture whitepaper
ServeTheHome / Phoronix / Anandtech — independent Neoverse benchmark reviews (2019-2024)
Chipsandcheese.com — microarchitecture deep-dives on Neoverse N1/V1/V2
SiPearl / Jupiter EuroHPC — Rhea1 architecture papers
Presentation built with Reveal.js 4.6 · Playfair Display + DM Sans + JetBrains Mono
Educational use.