SimplyNAS Guide to Understanding RAID

The Ultimate Guide for RAID

Data loss for anyone whether it is a business or individual ultimately means lost business or shattered memories respectively. Data is the most valuable asset of any business, and this can’t be understated, protection of business assets must always command the highest priority.

Backing up data will not be enough if you have no protection against online disk failure. Adding RAID to your storage configuration is the most cost effective and simplest ways to maintain access and data protection.

RAID

Whilst a number of companies offer RAID, not all RAID implementation are created equal. As an example there is Hardware RAID which is determined by a specific RAID controller which will probably have Cache memory as well to boost performance and possible battery back-up in case the controller fails. Hardware RAID has history and is very mature. Software RAID on the other hand is not totally new but the quality and reliability is dictated by the quality of the components used in the integration of the RAID components and the quality of software code driving the RAID implementation.

To choose the RAID level that's right for you, begin by considering the factors below. Each one of these factors becomes a trade-off for another:

  • Cost of disk storage
  • Data protection (low, medium, high)
  • Data Availability needed (low, medium, high)
  • Performance Requirements (low, medium, high)

The Cost basically comes down to the trade-off between disk capacity and added data availability or performance. For example, RAID 1,10 and small disk counts of RAID 6 are costly in terms of lost disk space (50%), but high in data availability.

Performance also depends on the access pattern (random/sequential, read/write, long/short) and the numbers of users. This guide is intended to give an overview on the performance and availability of various RAID levels in general and may not be accurate in all user scenarios. This is meant only as a guide through the RAID jungle.

The following are commonly used RAID levels (click each heading for more information):

RAID Min # Drives Data Protection Read Speed Write Speed Read Speed (degraded) Write Speed (degraded) Capacity Utilization Typical Applications Click arrow to expand
RAID 0 2 No Protection High High N/A N/A 100% High End Workstations, data logging, real-time rendering, very transitory data
RAID 0

Definition

Striping without parity, improved performance, additional storage, no fault tolerance

Advantages

  • I/O performance is greatly improved by spreading the I/O load across many channels and drives (best performance is achieved when data is striped across multiple channels with only one drive per channel)
  • No parity calculation overhead is involved
  • Very simple design
  • Easy to implement

Disadvantages

  • Not a "True" RAID because the failure of just one drive will result in all data in a virtual disk being lost
  • Should not be used for critical data
RAID 1 2 Single-drive failure High Medium Medium High 50% Operating System, transaction databases
RAID 1

Definition

Mirroring without parity, fault tolerance for disk errors and single disk failures

Advantages

  • High performance up to twice the read transaction rate of single disks, and the same write transaction rate as single disks
  • 100 percent redundancy of data means no rebuild of data is necessary in case of disk failure, just a copy to the replacement disk
  • Typically supports hot-swap disks
  • Simplest RAID storage subsystem design

Disadvantages

  • Highest disk overhead of all RAID types (100 percent) results in inefficient use of drive capacity
  • Limited capacity since the virtual disk can only include two disk drives
RAID 5 3 Single-drive failure High Low Low Low 67% - 94% Data warehousing, web serving, archiving
RAID 5

Definition

Striping with distributed parity, improved performance, fault tolerance for disk errors and single disk failures

Advantages

  • Most efficient use of drive capacity of all the redundant RAID configurations
  • High read transaction rate
  • Medium-to-high write transaction rate

Disadvantages

  • Disk failure has a medium impact on throughput
  • Most complex controller design
  • Retrieval of parity information after a drive failure takes longer than with mirroring
RAID 6 4 Two-drive failure High Low Low Low 50% - 88% High End Workstations, data logging, real-time rendering, very transitory data
RAID 6

Definition

Striping with dual parity, fault tolerance for dual drive failures

Advantages

  • Can survive the loss of two disks without losing data
  • Data redundancy, high read rates, and good performance

Disadvantages

  • Requires two sets of parity data for each write operation, resulting in significant decrease in write performance
  • Additional costs because of the extra capacity required by using two parity blocks per stripe
  • Retrieval of parity information after a drive failure takes longer than with mirroring
RAID 10 4 Up to one disk failure in each sub-array High Medium High High 50% Fast databases, application servers
RAID 10

Definition

Implements a 1+0 RAID Level, enabling block level and mirroring combined with striping, better performance, fault tolerance for disk errors and multiple drive failure (one drive failure per mirror set)

Advantages

  • RAID 10 has the same redundancy as RAID level 1
  • High I/O rates are achieved by striping RAID 1 segments

Disadvantages

  • Most expensive RAID solution
  • Requires 2n where n > 1 disks
  • Very limited scalability at a very high inherent cost
RAID 50 6 Up to one disk failure in each sub-array High Medium Medium Medium 67% - 94% Large databases, file servers, application servers
RAID 50

Definition

Combines multiple RAID 5 sets with striping, improved performance, fault disk errors and multiple drive failures (one drive failure per span)

Advantages

  • Allows creation of largest RAID groups, up to 256 drives (theoretical)
  • High read transaction rate
  • Higher degree of fault tolerance due to parity calculation being done for each RAID 5 subset
  • Potential for faster read transaction rates over large RAID 5 virtual disks
  • Medium-to-high write transaction rate

Disadvantages

  • Potential for faster read transaction rates over large RAID 5 virtual disks
  • One of the more complex RAID implementations
  • Less space efficient than RAID 5 since separate parity calculations are done for each RAID 5 subset
  • Retrieval of parity information after a drive failure takes longer than using a mirrored solution
RAID 60 8 Up to two disk failures in each sub-array High Medium Medium Low 50% - 88% Data archive, backup to disk, high availability solutions, servers with large capacity requirements
RAID 60

Definition

Combines multiple RAID 6 sets with striping, improved performance, fault disk errors and multiple drive failures (two drive failures per span)

Advantages

  • Allows creation of largest RAID groups, up to 256 drives (theoretical)
  • High degree of fault tolerance due to 2 parity calculations being done for each RAID 6 subset
  • Medium-to-high write transaction rate

Disadvantages

  • One of the more complex RAID implementations
  • Less space efficient than RAID 6 since separate parity calculations are done for each RAID 6 subset
  • Retrieval of parity information after a drive failure takes longer than using a mirrored solution

RAID Calculator

Enter the number and capacity of your drives below and select a raid level to calculate the total array size and useable space of your system.
PLEASE BE AWARE THAT YOUR TOTAL CAPACITY WILL BE REDUCED BY APPROXIMATELY 1.2TB PER DRIVE DUE TO FILE SYSTEM STORAGE CALCULATIONS.

Number of Disks:
Drive Capacity (GB): Please input as thousands (3TB = 3000)
Raid Level: Raid 0
Raid 1
Raid 5
Raid 6
Raid 10
Raid 50
Raid 60
Total Array Size (GB):
Total Useable disk space (GB):


Depending on how you implement RAID, the benefits include one or both of the following:

Faster Performance In RAID 0, 10, 50, or 60 virtual disks, the host system can access simultaneously. This improves performance because each disk in an virtual disk has to handle of the request. For example, in a two-disk virtual disk, each disk needs to provide only its requested data.
Data Protection In RAID 1, 10, 5, 6, 50, and 60 virtual disks, the data is backed up on disk (mirror). In the RAID 5, 50, 6, or 60 virtual disks, the data is parity protected on a single multiple disks. RAID 10, 50, and 60 also allow the host to access disks simultaneously.
Just added to your wishlist:
My Wishlist
You've just added this product to the cart:
Go to cart page