A great deal has been written about RAID hardware, software, and a myriad of related topics since the early 1990s when hardware RAID first appeared on the scene. This month I’ll present the topic of RAID from a slightly different perspective than what you’ve likely previously seen written. Types of RAID I separate RAID devices […]
  A great deal has been written about RAID hardware, software, and a myriad of related topics since the early 1990s when hardware RAID first appeared on the scene. This month I’ll present the topic of RAID from a slightly different perspective than what you’ve likely previously seen written.
I separate RAID devices into two types. They are:
Often these two types of technology are called “enterprise RAID” (cache-centric) and “mid-range RAID” (storage-centric). Whatever you choose to call these devices, there are major structural differences between the two types, even though both are called RAID and perform many of the same functions.
I use the term cache-centric because RAID devices in this category depend significantly on data residing in cache to achieve good performance. Most cache-centric devices generally have feature sets such as:
Some of the cache-centric RAIDs vendors and devices include:
All of these products can run both on UNIX servers and on IBM mainframes. They are designed to support environments where reliability is an issue and, in some cases, where customers are willing to trade off some performance for reliability. For these RAID products to have great performance they need to have a high number of cache hits. These RAID systems allow huge amounts of storage behind a single controller for simplification of management of the storage.
Page 2: Storage-centric RAIDs
I use the term storage-centric because these devices primarily depend not on the cache but rather on the underlying storage and architecture. These devices have the following features:
Examples of storage-centric RAID vendors and products include:
There are many vendors in this market area as it is much easier to develop products than for the cache-centric devices given the lower reliability requirement. Additionally, vendors do not have to create mainframe interfaces and the market has a lower expectation for software. Most vendors in this market space are now looking at options that use ATA drives instead of SCSI drives. They are doing this to provide greater data density and lower price points than are available from SCSI drives. Of course, you are trading reliability as bit error rates for Fibre Channel or SCSI drives are an order of magnitude greater than those for ATA drives (according to the Seagate website). RAID devices of this type are far more prevalent and usually have two or more times more bandwidth to disk than to the front-end servers.
Page 3: So Which Is Better?
As with many things in computing, “It depends on many factors, issues, and requirements.” About a year and a half ago, my company was asked to work on a project where the customer wanted to use enterprise storage for capturing high-speed data streams. Internally, we called this project BFASP— “blood from a stone project.” We knew, as did many of the other technical people involved, that the customer could not use an enterprise storage box for high-speed, full duplex I/O (writing files and reading other files at the same time).
The enterprise storage vendors benchmark team said they could do this type of I/O, but our technical team thought this was absurd. After spending 36 hours with them, the benchmark team finally came to this conclusion: “Oh, we did not understand you needed full duplex I/O at these speeds; we cannot do this.” The customer ended up purchasing a storage-centric RAID, and the enterprise storage vendor left with their tails between their legs, as they should have, given the time and money that was wasted.
On the other hand, the same enterprise storage box will far exceed the performance of the storage-centric device for a database where the index files are used often and fit into the cache. A few new vendors are entering the market and claiming to combine the best of both products. Vendors such as EMC with its new DMX2000, 3ParData, and others will be seeking to prove their products can ably perform both functions.
The decision on what to buy is quite often far above your pay grade, but assuming that you do have some say, or at least have some input, you need to review your requirements and your operational usage. Here are just a few of the major issues that you should consider:
It comes down to how much downtime you can afford for the box. If you have many attached hosts, you cannot afford any downtime. This is often called the 9 count, or how many “9s” of reliability are supported. Here’s a chart showing several different combinations of “9s” and their corresponding downtime:
| 
 Percentage  | 
 Downtime  | 
 Downtime Per Year in Seconds  | 
 Downtime in Minutes Per Year  | 
 Downtime in Hours Per Year  | 
 Downtime in Days Per Year  | 
| 
 96.0000%  | 
 103680.00  | 
 1244160.00  | 
 20736.00  | 
 345.60  | 
 14.40  | 
| 
 97.0000%  | 
 77760.00  | 
 933120.00  | 
 15552.00  | 
 259.20  | 
 10.80  | 
| 
 98.0000%  | 
 51840.00  | 
 622080.00  | 
 10368.00  | 
 172.80  | 
 7.20  | 
| 
 99.0000%  | 
 25920.00  | 
 311040.00  | 
 5184.00  | 
 86.40  | 
 3.60  | 
| 
 99.5000%  | 
 12960.00  | 
 155520.00  | 
 2592.00  | 
 43.20  | 
 1.80  | 
| 
 99.9000%  | 
 2592.00  | 
 31104.00  | 
 518.40  | 
 8.64  | 
 0.36  | 
| 
 99.9500%  | 
 1296.00  | 
 15552.00  | 
 259.20  | 
 4.32  | 
 0.18  | 
| 
 99.9900%  | 
 259.20  | 
 3110.40  | 
 51.84  | 
 0.86  | 
 0.04  | 
| 
 99.9950%  | 
 129.60  | 
 1555.20  | 
 25.92  | 
 0.43  | 
 0.02  | 
| 
 99.9990%  | 
 25.92  | 
 311.04  | 
 5.18  | 
 0.09  | 
 0.00  | 
| 
 99.9995%  | 
 12.96  | 
 155.52  | 
 2.59  | 
 0.04  | 
 0.00  | 
| 
 99.9999%  | 
 2.59  | 
 31.10  | 
 0.52  | 
 0.01  | 
 0.00  | 
Knowing the reliability requirements and the number of attached hosts will help determine the type of equipment needed.
Page 4: Backup
Enterprise boxes have the ability to create a mirror, break the mirror for a backup, and then reattach the mirror and update the box. Most mid-range boxes do not have this feature, although it is starting to become more common. Backup is becoming more difficult given the total amount to storage that needs to be kept. More and more often, I am seeing sites move to hierarchical storage management (HSM) techniques rather than backup, given the immense data volumes. See my article on tapesfor additional information on this topic. The issues for backup will be addressed in a few months.
You need to determine what RAID level you are going to need to use. There are many different RAID levels, but the two used most often are RAID-1 and RAID-5. The RAID level depends on:
So, if you are making small requests (especially if they are random), RAID-1 will be much faster than RAID-5. With RAID-1 each device is mirrored, but with RAID-5 you create a LUN (Logical Unit Number) with a parity drive so that the LUN can be rebuilt if a device fails. With RAID-1 you write far more data (as each disk is mirrored) than you write with RAID-5, resulting in far more backend bandwidth for sequential I/O.
In today’s world, the likelihood of everything fitting into the RAID cache is very low; in fact, why would you even buy a RAID if that was the case, as you could simply purchase an SSD (solid state storage). The real questions you need to ask yourself are how are these files accessed and can the data be reused. This leads to the next important question — would a large cache help or not make any real difference?
The choice between cache-centric RAID devices and storage-centric RAID devices likely will be made by budgetary constraints and other issues rather than the performance of the devices. The operational environments of the two types are often vastly different, with issues to consider including:
What it really comes down to is something I personally always say it comes back to: “The requirements.” Purchases should be made in terms of the requirements and budget so you get the best value for the available funds.
In the next article, we will dig deeper into RAID and discuss how the layout of RAID and file systems should be architected.
  Henry Newman has been a contributor to TechnologyAdvice websites for more than 20 years. His career in high-performance computing, storage and security dates to the early 1980s, when Cray was the name of a supercomputing company rather than an entry in Urban Dictionary. After nearly four decades of architecting IT systems, he recently retired as CTO of a storage company’s Federal group, but he rather quickly lost a bet that he wouldn't be able to stay retired by taking a consulting gig in his first month of retirement.
  Enterprise Storage Forum offers practical information on data storage and protection from several different perspectives: hardware, software, on-premises services and cloud services. It also includes storage security and deep looks into various storage technologies, including object storage and modern parallel file systems. ESF is an ideal website for enterprise storage admins, CTOs and storage architects to reference in order to stay informed about the latest products, services and trends in the storage industry.
Property of TechnologyAdvice. © 2025 TechnologyAdvice. All Rights Reserved
Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.