Three Truths About Hard Drives and SSDs

An examination of the claim that flash will replace hard drives in the data center.

“Hard drives will soon be a thing of the past.”

“All-flash arrays will soon replace disks and hybrid arrays in the data center.”

“The data center of the future is all-flash.”

Welcome to the latest installment of the perennial hard drive extinction saga. The debate—with highlights sampled above—has spanned more than a decade now. The predictions foretelling hard drives’ demise, uttered by a few vocal and, shall we say, optimistic proponents of flash-only technology, have not aged well over the years. But they seem to get increasingly brazen with time. 

Without question, flash storage is well-suited to support applications that require high performance and speed. And flash revenue is growing, as is all-flash-array (AFA) revenue. But not at the expense of hard drives. The premise underlying speculation around the death of hard drives is deeply flawed.

We are living in an era where the ubiquity of the cloud and the emergence of AI use cases have driven up the value of massive data sets. Hard drives, which today store by far the majority of the world’s exabytes (EB), are more indispensable to data center operators than ever.

Even in recent years, when flash prices temporarily dropped to all-time lows, solid state drives (SSDs) did not displace hard drives in workloads requiring mass-data storage. 

Industry analysts expect hard drives to be the primary beneficiary of continued EB growth. The chart below shows that enterprise and large-scale cloud data centers, where the vast majority of the world’s data sets reside, will be a key location of this growth. In absolute terms, hard drive storage is projected to grow by 6,996EB between 2022 and 2027, whereas SSDs will grow by 1,363EB.[1]

It’s not a zero-sum game. In data centers, hard drives and flash have always worked in synergy, deployed in support of different services. They each have their own unique benefits and value proposition. In fact, in the era of generative AI, compute clusters closely coupled with flash technology indirectly fuel the downstream need for more hard drive EBs, since the generated content needs to be economically stored.

This storage media synergy is alive and well, while the conjecture around hard drives’ obsolescence lacks credibility and will not ultimately pan out. 

Let’s take a closer look at three key myths underlying this conjecture—and the third-party data-driven reasons why hard drives will remain central to data storage architectures for the foreseeable future.

Truth #1 – Pricing Disparity

Myth: SSD pricing will soon match the pricing of hard drives.

Reality: SSD and hard drive pricing will not converge at any point in the next decade.

The data is clear. Hard drives hold a firm cost-per-terabyte (TB) advantage over SSDs, which positions them as the unquestionable cornerstone of data center storage infrastructure. 

Even though NAND flash pricing remains highly volatile and hit a low in 2023 due to weak demand and oversupply, analyst firm Forward Insights forecasts a price resurgence for SSDs starting in 2024 and continuing through 2025. After facing precipitous price declines, SSD vendors will welcome this turnaround, having struggled to reduce aging inventory and cut capital expenditure to align supply with demand. We have already started to see price increases for NAND-based solutions.

Seagate’s analysis of research by IDC, TRENDFOCUS, and Forward Insights confirms that hard drives will remain the most cost-effective option for most enterprise tasks. The price-per-TB difference between enterprise SSDs and enterprise hard drives is projected to remain at or above a 6 to 1 premium through at least 2027. 


This price-per-TB differential is particularly evident in the data center, where device acquisition cost is by far the dominant component of total cost of ownership (TCO). Taking all storage system costs into consideration (device acquisition, power, networking, and compute), hard drive-based systems deliver a far superior TCO on a per-TB basis.


In an attempt to circumvent these unassailable TCO and price disparities, some AFA OEMs have begun designing their own custom high-density NAND devices with capacity points into the hundreds of TBs—claiming theoretical TCO advantages that extend beyond device economics to the system level. The problem with this logic is that adding dramatically higher levels of NAND density to a single device or system still doesn’t alter the stark cost-per-TB differential of the raw media.

Another tactic used to distract from the cost-per-TB disadvantage has to do with the so-called “TBe” or “effective terabytes.” The assertion is that, thanks to data reduction techniques (e.g., data compression), an SSD can offer substantially more storage space than its raw capacity implies. However, in large deployments, data reduction occurs higher up in the stack, rendering it irrelevant at the storage level. In addition, given the increased focus on protecting data and the prevalence of encryption, data compression often isn’t feasible in enterprise and cloud use cases. When data is encrypted, it can’t be meaningfully compressed because its entropy is so high that there is no pattern to simplify.
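The entropy point can be illustrated with a small sketch. It uses Python’s standard `zlib` module and stands in for ciphertext with random bytes from `os.urandom`. That substitution is an assumption made for illustration (well-encrypted data is statistically close to random), and the sample log line and sizes are arbitrary, not taken from any real workload.

```python
import os
import zlib


def compression_ratio(data: bytes) -> float:
    """Return original size divided by zlib-compressed size (level 9)."""
    return len(data) / len(zlib.compress(data, 9))


# Highly repetitive plaintext, loosely resembling log or record data.
plaintext = b"timestamp=2024-01-01 status=OK bytes=4096\n" * 10_000

# Stand-in for encrypted data: random bytes of the same length (assumption).
ciphertext_like = os.urandom(len(plaintext))

print(f"plaintext ratio:       {compression_ratio(plaintext):.1f}x")
print(f"ciphertext-like ratio: {compression_ratio(ciphertext_like):.3f}x")
```

The repetitive input shrinks dramatically, while the random input does not shrink at all, which is why an “effective terabytes” figure depends heavily on what is being stored.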

Bottom line: While flash excels at performing specific and high-performance tasks, hard drives will continue to be the primary destination for data center EBs, offering a reliable, cost-effective, and widely adopted solution for the foreseeable future.

Truth #2 – Manufacturing Scale

Myth: Supply of NAND can ramp to replace all hard drive capacity.

Reality: Entirely replacing hard drives with NAND would require untenable CapEx investments.

The notion that the NAND industry would or could rapidly increase its supply to replace all hard drive capacity isn’t just optimistic—such an attempt would lead to financial ruin. Transitioning from hard drive to NAND isn’t just about producing more units. It’s a financial and logistical behemoth to execute, let alone at a price that is competitive with hard drives.

According to the Q4 2023 NAND Market Monitor report from industry analyst Yole Intelligence, the entire NAND industry shipped 3.1 zettabytes (ZB) from 2015 to 2023, while having to invest a staggering $208 billion in CapEx—approximately 47% of their combined revenue.

In contrast, the hard drive industry addresses the vast majority (almost 90%) of data center storage needs in a highly capital-efficient manner. To help crystallize this, let’s use Seagate Technology as a proxy for the hard drive industry. Between 2015 and 2023, Seagate shipped 3.5ZB of storage. Seagate’s capital investments over that eight-year period totaled $4.3 billion, or only around 5% of Seagate’s total hard drive revenue. This equals approximately $67 billion per ZB for the NAND industry versus about $1 billion per ZB for hard drive production (as represented by Seagate). The hard drive industry is far more efficient at delivering ZBs to the data center. Seagate’s analysis of forecasts from IDC for hard drives and Forward Insights for SSDs shows that in 2024, hard drive EB production will be almost three times that of SSDs. In that same year, in enterprise and data center markets, hard drive EB production will be six times that of SSDs.
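The CapEx-per-ZB comparison above is simple division, and a short sketch makes it easy to check. The inputs are the figures quoted in the text; the variable and function names are illustrative only.

```python
# Figures quoted in the text for 2015-2023.
nand_capex_busd = 208.0   # NAND industry CapEx, $ billions
nand_shipped_zb = 3.1     # NAND industry shipments, ZB
hdd_capex_busd = 4.3      # Seagate CapEx, $ billions (proxy for the HDD industry)
hdd_shipped_zb = 3.5      # Seagate shipments, ZB


def capex_per_zb(capex_busd: float, shipped_zb: float) -> float:
    """Capital spent ($ billions) per zettabyte shipped."""
    return capex_busd / shipped_zb


nand_per_zb = capex_per_zb(nand_capex_busd, nand_shipped_zb)
hdd_per_zb = capex_per_zb(hdd_capex_busd, hdd_shipped_zb)

print(f"NAND: ${nand_per_zb:.1f}B per ZB")
print(f"HDD:  ${hdd_per_zb:.2f}B per ZB")
print(f"NAND spends roughly {nand_per_zb / hdd_per_zb:.0f}x more capital per ZB")
```

On these inputs the hard drive figure comes out closer to $1.2 billion per ZB, which the text rounds to “about $1 billion.”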

Recently, some AFA vendors have claimed that the flash industry could fully replace the entire hard drive industry’s capacity output by 2028. Let’s look at what kind of investment would be needed by the NAND industry to achieve this.

The Yole Intelligence report cited above indicates that from 2025 to 2027, the NAND industry will invest about $73 billion, which is estimated to yield 963EB of output for enterprise SSDs as well as other NAND products for tablets and phones. This translates to an investment of about $76 per TB of flash storage output. Applying that same capital price per bit, it would require a staggering $206 billion in additional investment to support the 2.723ZB of hard drive capacity forecast to ship in 2027. In total, that’s nearly $279 billion of investment for a total addressable market of approximately $25 billion.
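The chain of arithmetic in the paragraph above can be reproduced in a few lines. This is a sketch using only the figures quoted in the text and decimal storage units (1 EB = 1,000,000 TB); the variable names are illustrative.

```python
TB_PER_EB = 1_000_000   # decimal units: 1 EB = 10^6 TB
EB_PER_ZB = 1_000       # 1 ZB = 10^3 EB

planned_capex_usd = 73e9    # NAND CapEx, 2025-2027
planned_output_eb = 963     # NAND output expected from that CapEx
hdd_2027_zb = 2.723         # hard drive capacity forecast to ship in 2027
tam_usd = 25e9              # approximate total addressable market

# Capital cost per TB of NAND output implied by the planned investment.
capex_per_tb = planned_capex_usd / (planned_output_eb * TB_PER_EB)

# Extra capital needed to also cover the 2027 hard drive volume at that rate.
additional_capex_usd = capex_per_tb * hdd_2027_zb * EB_PER_ZB * TB_PER_EB
total_capex_usd = planned_capex_usd + additional_capex_usd

print(f"CapEx per TB:      ${capex_per_tb:.0f}")
print(f"Additional CapEx:  ${additional_capex_usd / 1e9:.0f}B")
print(f"Total CapEx:       ${total_capex_usd / 1e9:.0f}B")
print(f"CapEx to TAM:      {total_capex_usd / tam_usd:.1f} to 1")
```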

It’s clear that this level of investment is unlikely for an industry facing uncertain returns, especially after losing money throughout 2023.

The latest NAND Flash Platinum Datasheet from TrendForce shows there are about 28 operating NAND fabrication plants (fabs) worldwide in 2024. If we use Kioxia’s Fab7 Phase 1, opened in October 2022, as an example, building a single green-field NAND fab costs about $6.8 billion. Thus, the $206 billion incremental CapEx needed by the NAND industry would roughly equal 30 new fabs. These facilities would need to be built, scaled, tested, qualified, and brought online to full production in the next three to four years, doubling the number of NAND fabs worldwide in less than four years. 
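As a quick check on the fab count, the sketch below divides the incremental CapEx by the per-fab cost quoted above. It assumes every new fab costs the same as the Kioxia example, which is a simplification for illustration.

```python
incremental_capex_busd = 206.0   # additional NAND CapEx from the text, $ billions
cost_per_fab_busd = 6.8          # single green-field fab cost from the text
existing_fabs = 28               # operating NAND fabs in 2024, per the text

new_fabs = incremental_capex_busd / cost_per_fab_busd
growth_factor = (existing_fabs + new_fabs) / existing_fabs

print(f"New fabs required: about {new_fabs:.0f}")
print(f"Worldwide fab count would grow {growth_factor:.2f}x")
```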

Additionally, IDC’s 2023 StorageSphere report[2] shows that in 2023, the ratio of existing hard drive to SSD installed capacity in cloud and non-cloud data centers is 7 to 1. IDC forecasts that this HDD-dominant ratio will stay at around 6 to 7 times for the foreseeable future, with a 26% compound annual growth rate (CAGR) leading to an installed HDD capacity of as much as 10ZB in 2027. Therefore, besides replacing all future annual production of new hard drive installations year by year as previously described, the NAND industry would also need to invest to replace the aging portion of this 10ZB installed base of data center hard drives when they reach the end of their life cycles. That would be an incremental investment well above the $206 billion needed just to replace the 2.723ZB of hard drive capacity expected to be delivered in 2027.

NAND solutions serve specific data center workloads efficiently, but the idea that data centers will fully rely on them is littered with pitfalls. Beyond the risks and the implausibility of the NAND industry replacing the hard drive supply, volatile pricing adds another layer of uncertainty for businesses seeking supply stability and the best TCO for their storage.

The idea that NAND could completely replace hard drives in the foreseeable future is highly improbable, if not impossible. The industry would have to overcome formidable financial and logistical obstacles while investing a large amount of capital and technology in a market that isn’t prepared for a change that would upend current data center architecture.

Truth #3 – Workload Profiles

Myth: Only AFAs can meet the performance requirements of modern enterprise workloads.

Reality: Enterprise storage architecture usually mixes media types, using disk or hybrid arrays, flash, and tape to optimize for the cost, capacity, and performance needs of specific workloads.

At issue here is a false dichotomy. All-flash vendors advise enterprises to “simplify” and “future-proof” by going all-in on flash for high performance. Otherwise, they posit, enterprises risk finding themselves unable to keep pace with the performance demands of modern workloads. This zero-sum logic fails for three reasons:

  1. The vast majority of modern workloads do not require the performance advantage offered by flash.
  2. Enterprises under budget constraints and with rapidly growing data sets must balance capacity and cost, as well as performance.
  3. The purported simplicity of a single tier storage architecture is a solution in search of a problem.

Let’s address these one by one.

First, most of the world’s data resides in the cloud and large data centers. In these environments, workloads follow a Pareto rule: only a small percentage of the workload requires a significant percentage of the performance. This is why, according to IDC,[3] hard drives have amounted to almost 90% of the storage installed base in cloud service providers and hyperscale data centers over the last five years.

Take a look at the chart below, derived from IDC’s Global DataSphere 2023 research. Most of the world’s data is part of workloads that need nominal data transfer time for general-purpose use cases.

 

In some cases, all-flash systems are not even required at all as part of the highest performance solutions. There are hybrid storage systems that perform as well as or faster than all-flash. At a device level, the differences in performance are obvious; however, at scale in data center racks, hard drive performance benefits from extremely parallel access, resulting in a performance level that is more than sufficient for most workloads, including AI and machine learning. Just as important, any significantly incremental performance advantages afforded by flash can often be constrained by other infrastructure decisions, like network capacity or quality. 

Second, as established earlier in this article, TCO considerations are key to most data center infrastructure decisions. This forces a balance of cost, capacity, and performance. Optimal TCO is achieved by aligning the most cost-effective media—hard drive, flash, or tape—to the workload requirement. Hard drives and hybrid arrays (built from hard drives and SSDs) are a great fit for most enterprise and cloud storage and application use cases.

Of course, one might elect to use SSDs or AFAs for workloads that are best suited for hard drives—like file services, object storage, document management systems, or web hosting. But cost-wise, the higher the capacity, the more strangely illogical such a decision would be. It’s like using your car, parked in a garage, to store your clothes. Doable? Sure, if that’s what you want to do with a car. But cost-effective? Not at all.

While flash storage excels in read-intensive scenarios, its endurance diminishes with increased write activity. Manufacturers address this with error correction and overprovisioning: extra, unseen storage used to replace worn cells. However, these solutions come with extra costs. Overprovisioning greatly increases the embedded product cost, and constant power is needed to avoid data loss, which is accelerated at high temperatures. This poses challenges for environments like edge data centers, or any setting where continuous operation isn’t guaranteed.

Additionally, while technologies like triple-level cell (TLC) and quad-level cell (QLC) allow flash to handle data-heavy workloads as hard drives do, the economic rationale weakens for larger data sets or long-term retention. In these cases, disk drives, with their growing areal density, offer a more cost-effective solution. In hyperscale environments, leveraging thousands of hard drives in parallel achieves performance that complements flash, illustrating their collaborative role in modern data centers.

Consequently, while QLC flash is taking over a sizable percentage of the TLC market—much like how TLC replaced multi-level cell (MLC) NAND storage—it is not eroding hard drive market share due to cost, availability, and workload factors explored in this article.

The third and related point is the claim that AFAs are superior to hybrid arrays or hard drive storage systems. Flash proponents claim that using one type of storage is “simpler” than adopting a mix of media types and storage tiers. Not so fast. 

Many hybrid storage systems employ a well-proven and finely tuned software-defined architecture that seamlessly integrates and harnesses the strengths of diverse media types into singular units. In scale-out private or public cloud data center architectures, file systems or software-defined storage are used to manage the data storage workloads across data center locations and regions. They offer more than adequate flexibility, allowing businesses to adjust their storage composition based on ever-changing needs.

AFAs and SSDs are a great fit for high-performance, read-intensive workloads. But it’s a mistake to extrapolate from niche use cases or small-scale deployments to the mass market and hyperscale, where AFAs provide an unnecessarily expensive way to do what hard drives already deliver at a much lower TCO.

Cloud, hyperscale, and large enterprise storage architectures select storage that optimizes cost, capacity, and performance. Hard drives serve workloads that flash should not. Flash serves workloads that hard drives should not. Both storage media will coexist in the data center, with hard drives continuing to dominate in terms of EBs stored for the foreseeable future.

Speaking of EBs, it’s common to point to increasing SSD unit volumes in conjunction with declining hard drive unit shipments as proof of a turning point in the storage market. But this argument is a red herring, failing to recognize step-ups in hard drive capacity and total hard drive EB shipments, which are trending up faster than ever. Case in point: thanks to HAMR-enabled areal density innovation, Seagate’s new Mozaic™ platform will double maximum unit capacity over the next four years, whereas traditional perpendicular magnetic recording (PMR) technology took nine years to achieve the doubling of capacity.
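To put the two doubling periods on a common footing, the sketch below converts each into an equivalent annual growth rate. It assumes smooth compound growth, which is a simplification: real capacity points arrive in product steps, so treat the output as a rough comparison rather than a roadmap.

```python
def annual_growth_for_doubling(years: float) -> float:
    """Compound annual growth rate that doubles capacity in `years` years."""
    return 2 ** (1 / years) - 1


hamr_rate = annual_growth_for_doubling(4)   # doubling over four years (from the text)
pmr_rate = annual_growth_for_doubling(9)    # doubling over nine years (from the text)

print(f"Four-year doubling: about {hamr_rate:.1%} per year")
print(f"Nine-year doubling: about {pmr_rate:.1%} per year")
```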

Instead of counting unit volume, what matters when it comes to accurately measuring growth are EB shipments. Analysts predict hard drive EB shipments will continue increasing at an unprecedented rate. And although flash storage will also see growth, it won’t approach hard drives in terms of installed capacity. 

Seagate analysis of data from IDC and TRENDFOCUS predicts an almost 250% increase in EB outlook for hard drives by 2028. Extrapolating further out in time, that ratio holds well into the next decade. Take a look: 

Conclusion - Here to Stay

The supposed obsolescence of hard drives has been a topic of discussion in the technology industry for more than a decade. But the various predictions have not aged well. We do not expect the most recent round to pan out either.

Almost invariably, all-flash absolutists attempt to substantiate their arguments with logical fallacies, often extrapolating from a small subset of use cases to scale—which is where their conclusions don’t hold.

It is creative marketing at best.

In reality: 

  • NAND and hard drive pricing will not converge anytime soon, especially with accelerated hard drive areal density gains afforded by the volume ramp of the Seagate Mozaic 3+ platform.
  • Contrary to what some have suggested, NAND producers will not be able to scale manufacturing capacity to replace the existing and new hard drive EB demand. It is impossible for AFA vendors to offer both enough supply and cheaper storage than hard drives because of the level of investment required. The AFA manufacturers are unlikely to find hundreds of billions of dollars to invest at a 10 to 1 loss to create enough NAND to replace hard drives. 
  • Large cloud and enterprise data center operators are pragmatic and understand that scale-out storage architectures require a mix of media—optimized to meet the budget, capacity, and performance needs of their workloads.

Of course, there are other myths that contribute to the “creative marketing” that predicts the demise of hard drives, including myths around sustainability, power, and reliability, among other areas. Stay tuned: we will address them in coming posts. However, the three myths discussed above strike us as the most relevant.

Any serious interrogation of the data laid out in this article leads to the conclusion that hard drives are here to stay. They will continue to store the vast majority of the world’s data far into the future.

To suggest otherwise is pure delusion.

  1. IDC, Worldwide Global StorageSphere Forecast, 2023-2027. Doc #US50851423, June 2023.

  2. Ibid.

  3. IDC, Multi-Client Study, Cloud Infrastructure Index 2023: Compute and Storage Consumption by 100 Service Providers, November 2023.
