HOME

TheInfoList



OR:

In
computer data storage Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers. The central processing unit (CPU) of a comput ...
, data striping is the technique of segmenting logically sequential data, such as a file, so that consecutive segments are stored on different physical storage devices. Striping is useful when a processing device requests data more quickly than a single storage device can provide it. By spreading segments across multiple devices which can be accessed concurrently, total data throughput is increased. It is also a useful method for balancing I/O load across an array of disks. Striping is used across disk drives in
redundant array of independent disks Raid, RAID or Raids may refer to: Attack * Raid (military), a sudden attack behind the enemy's lines without the intention of holding ground * Corporate raid, a type of hostile takeover in business * Panty raid, a prankish raid by male college ...
(RAID) storage,
network interface controller A network interface controller (NIC, also known as a network interface card, network adapter, LAN adapter or physical network interface, and by similar terms) is a computer hardware component that connects a computer to a computer network. Ear ...
s, disk arrays, different computers in
clustered file system A clustered file system is a file system which is shared by being simultaneously mounted on multiple servers. There are several approaches to clustering, most of which do not employ a clustered file system (only direct attached storage for ...
s and
grid-oriented storage Grid-oriented Storage (GOS) was a term used for data storage by a university project during the era when the term grid computing was popular. Description GOS was a successor of the term network-attached storage (NAS). GOS systems contained ha ...
, and
RAM Ram, ram, or RAM may refer to: Animals * A male sheep * Ram cichlid, a freshwater tropical fish People * Ram (given name) * Ram (surname) * Ram (director) (Ramsubramaniam), an Indian Tamil film director * RAM (musician) (born 1974), Dutch * ...
in some systems.


Method

One method of striping is done by interleaving sequential segments on storage devices in a round-robin fashion from the beginning of the data sequence. This works well for streaming data, but subsequent random accesses will require knowledge of which device contains the data. If the data is stored such that the physical address of each data segment is assigned a one-to-one mapping to a particular device, the device to access each segment requested can be calculated from the address without knowing the offset of the data within the full sequence. Other methods might be employed in which sequential segments are not stored on sequential devices. Such non-sequential interleaving can have benefits in some
error correction In information theory and coding theory with applications in computer science and telecommunication, error detection and correction (EDAC) or error control are techniques that enable reliable delivery of digital data over unreliable communi ...
schemes.


Advantages and disadvantages

Advantages of striping include performance and throughput. Sequential time interleaving of data accesses allows the lesser data access throughput of each storage devices to be cumulatively multiplied by the number of storage devices employed. Increased throughput allows the data processing device to continue its work without interruption, and thereby finish its procedures more quickly. This is manifested in improved performance of the data processing. Because different segments of data are kept on different storage devices, the failure of one device causes the corruption of the full data sequence. In effect, the
failure rate Failure rate is the frequency with which an engineered system or component fails, expressed in failures per unit of time. It is usually denoted by the Greek letter λ (lambda) and is often used in reliability engineering. The failure rate of a ...
of the array of storage devices is equal to the sum of the failure rate of each storage device. This disadvantage of striping can be overcome by the storage of redundant information, such as
parity Parity may refer to: * Parity (computing) ** Parity bit in computing, sets the parity of data for the purpose of error detection ** Parity flag in computing, indicates if the number of set bits is odd or even in the binary representation of the ...
, for the purpose of error correction. In such a system, the disadvantage is overcome at the cost of requiring extra storage.


Terminology

The segments of sequential data written to or read from a disk before the operation continues on the next disk are usually called ''chunks'', ''strides'' or ''stripe units'', while their logical groups forming single striped operations are called ''strips'' or ''stripes''. The amount of data in one chunk (stripe unit), often denominated in bytes, is variously referred to as the ''chunk size'', ''stride size'', ''stripe size'', ''stripe depth'' or ''stripe length''. The number of data disks in the array is sometimes called the ''stripe width'', but it may also refer to the amount of data within a stripe. The amount of data in one stride multiplied by the number of data disks in the array (i.e., ''stripe depth'' times ''stripe width'', which in the geometrical analogy would yield an area) is sometimes called the ''stripe size'' or ''stripe width''."Stripe depth is the size of the stripe, sometimes called stripe unit. Stripe width is the product of the stripe depth and the number of drives in the striped set."
/ref> ''Wide striping'' occurs when chunks of data are spread across multiple arrays, possibly all the drives in the system. ''Narrow striping'' occurs when the chunks of data are spread across the drives in a single array.


Applications

Data striping is used in some
database In computing, a database is an organized collection of data stored and accessed electronically. Small databases can be stored on a file system, while large databases are hosted on computer clusters or cloud storage. The design of databases s ...
s, such as Sybase, and in certain RAID devices under software or hardware control, such as IBM's 9394 RAMAC Array subsystem. File systems of clusters also use striping. Oracle
Automatic Storage Management Automatic Storage Management (ASM) is a feature provided by Oracle Corporation within the Oracle Database from release Oracle 10g (revision 1) onwards. ASM aims to simplify the management of database datafiles, control files and log files. To do s ...
allows ASM files to be either coarse or fine striped. ; RAID : In some RAID configurations, such as
RAID 0 In computer storage, the standard RAID levels comprise a basic set of RAID ("redundant array of independent disks" or "redundant array of inexpensive disks") configurations that employ the techniques of striping, mirroring, or parity to create ...
, failure of a single member drive of the RAID array causes all stored data to be lost. In other RAID configurations, such as a
RAID 5 In computer storage, the standard RAID levels comprise a basic set of RAID ("redundant array of independent disks" or "redundant array of inexpensive disks") configurations that employ the techniques of striping, mirroring, or parity to create ...
that contains distributed parity and provides redundancy, if one member drive fails the data can be restored using the other drives in the array. ; LVM2 : Data striping can also be achieved with Linux's Logical Volume Management (LVM). The LVM system allows for the adjustment of coarseness of the striping pattern. LVM tools will allow implementation of data striping in conjunction with mirroring. LVM offers the added benefit of read and write caching on
NVM Express NVM Express (NVMe) or Non-Volatile Memory Host Controller Interface Specification (NVMHCIS) is an open, logical-device interface specification for accessing a computer's non-volatile storage media usually attached via PCI Express (PCIe) bus. The ...
for slow spinning storage. LVM has other advantages that are not directly related to data striping (like snapshots, dynamic resizing, etc). ; Btrfs and ZFS : Have RAID like features but with the security of chunk integrity to detect bad blocks, and the added flexibility of adding arbitrary numbers of extra drives. They also have other advantages that are not directly related to data striping (copy on write, etc).


See also

*
Partition alignment Partition alignment is the proper alignment of partitions to the boundaries available in a data storage device. Examples include the following: * 4 KB sector alignment with hard disk drives supporting Advanced Format (AF) * Track partition a ...
*
Link aggregation In computer networking, link aggregation is the combining ( aggregating) of multiple network connections in parallel by any of several methods, in order to increase throughput beyond what a single connection could sustain, to provide redundan ...


References

{{DEFAULTSORT:Data striping Data partitioning RAID