Data degradation is the gradual corruption of computer data due to an accumulation of non-critical failures in a data storage device. The phenomenon is also known as data decay, data rot or bit rot.
Example
Below are several digital images illustrating data degradation, each consisting of 326,272 bits. The original photo is displayed first. In the next image, a single bit was changed from 0 to 1. In the following two images, two and three bits were flipped, respectively. On Linux systems, the binary difference between two files can be revealed with a byte-by-byte comparison command such as cmp.
Image: Bitrot in JPEG files, 0 bits flipped (original)
Image: Bitrot in JPEG files, 1 bit flipped
Image: Bitrot in JPEG files, 2 bits flipped
Image: Bitrot in JPEG files, 3 bits flipped
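The effect described above can be reproduced on any byte buffer. The following is a minimal sketch, not the procedure used to produce the images: it flips one bit in a placeholder buffer of 326,272 bits and then lists the differing bytes, much as a byte-by-byte file comparison would. The function names, the all-zero buffer and the bit position are assumptions chosen for illustration.

```python
def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with one bit inverted (bit 0 is the MSB of byte 0)."""
    byte_index, offset = divmod(bit_index, 8)
    corrupted = bytearray(data)
    corrupted[byte_index] ^= 0x80 >> offset
    return bytes(corrupted)


def differing_bytes(a: bytes, b: bytes) -> list[tuple[int, int, int]]:
    """List (offset, byte_in_a, byte_in_b) for every position where a and b differ."""
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Placeholder data: 326,272 bits (40,784 bytes) of zeros, not a real JPEG.
original = bytes(326_272 // 8)
# Hypothetical position of the flipped bit.
damaged = flip_bit(original, 12_345)
print(differing_bytes(original, damaged))
```

In a compressed format such as JPEG, a single changed byte can alter how everything after it is decoded, which is why one flipped bit can visibly damage a large part of an image.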
Primary storage
Data degradation in dynamic random-access memory (DRAM) can occur when the electric charge of a bit in DRAM disperses, possibly altering program code or stored data. DRAM may also be altered by cosmic rays or other high-energy particles. Such data degradation is known as a soft error. ECC memory can be used to mitigate this type of data degradation.
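The principle behind error-correcting memory can be illustrated with a toy Hamming(7,4) code, which stores 4 data bits in 7 bits and can correct any single flipped bit. This is a sketch for illustration only: ECC memory modules generally protect wider words with stronger codes, and the function names here are hypothetical.

```python
def hamming74_encode(nibble: int) -> list[int]:
    """Encode 4 data bits into a 7-bit codeword (positions 1..7, parity at 1, 2, 4)."""
    d = [(nibble >> i) & 1 for i in range(4)]  # d[0] is the least significant bit
    p1 = d[0] ^ d[1] ^ d[3]  # covers positions 1, 3, 5, 7
    p2 = d[0] ^ d[2] ^ d[3]  # covers positions 2, 3, 6, 7
    p4 = d[1] ^ d[2] ^ d[3]  # covers positions 4, 5, 6, 7
    return [p1, p2, d[0], p4, d[1], d[2], d[3]]


def hamming74_decode(word: list[int]) -> int:
    """Correct at most one flipped bit and return the 4 data bits as an integer."""
    w = list(word)
    syndrome = 0
    for position, bit in enumerate(w, start=1):
        if bit:
            syndrome ^= position  # XOR of the positions of all set bits
    if syndrome:
        w[syndrome - 1] ^= 1  # a non-zero syndrome is the position of the bad bit
    d = [w[2], w[4], w[5], w[6]]
    return sum(bit << i for i, bit in enumerate(d))


codeword = hamming74_encode(0b1011)
codeword[4] ^= 1  # simulate a soft error in one stored bit
assert hamming74_decode(codeword) == 0b1011
```

A code of this kind corrects one flipped bit per codeword; two flips in the same codeword would be miscorrected, which is one reason practical schemes add an extra parity bit to detect double errors.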
Secondary storage
Data degradation results from the gradual decay of storage media over the course of years or longer. Causes vary by medium:
* ''Solid-state media'', such as EPROMs, flash memory and solid-state drives, store data using electrical charges, which can slowly leak away due to imperfect insulation. The chip itself is not affected by this, so reprogramming it approximately once per decade prevents decay. An undamaged copy of the master data is required for the reprogramming.
* ''Magnetic media'', such as hard disk drives, floppy disks and magnetic tapes, may experience data decay as bits lose their magnetic orientation. Periodic refreshing by rewriting the data can alleviate this problem. In warm or humid conditions, these media, especially those poorly protected against ambient air, are also prone to physical decomposition of the storage medium.
* ''Optical media'', such as CD-R, DVD-R and BD-R, may experience data decay from the breakdown of the storage medium. This can be mitigated by storing discs in a dark, cool, low-humidity location. "Archival quality" discs are available with an extended lifetime, but are still not permanent. Data integrity scanning that measures the rates of various types of errors can, however, predict data decay on optical media well before uncorrectable data loss occurs.
* ''Paper media'', such as punched cards and punched tape, may literally rot. Mylar punched tape is another approach that does not rely on electromagnetic stability.
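The periodic refresh mentioned for solid-state and magnetic media can be sketched as a small routine that rewrites a working copy from a master after first confirming that the master itself is undamaged. This is an illustrative sketch under stated assumptions, not a description of any particular tool: the function names are hypothetical, and it assumes a SHA-256 digest of the master was recorded while the data was known to be good.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def refresh_copy(master: Path, copy: Path, expected_sha256: str) -> bool:
    """Rewrite copy from master. Return True if copy had decayed before the rewrite."""
    # An undamaged master is a precondition; never propagate a corrupt one.
    if sha256_of(master) != expected_sha256:
        raise ValueError("master copy is damaged; refusing to propagate it")
    decayed = not copy.exists() or sha256_of(copy) != expected_sha256
    shutil.copyfile(master, copy)  # rewriting the data is the refresh itself
    return decayed
```

The copy is rewritten even when its checksum still matches, because the point of a refresh is to renew the stored charge or magnetization before errors appear rather than after.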
Hardware failures
Most disk, disk controller and higher-level systems are subject to a slight chance of unrecoverable failure. As disk capacities, file sizes and the amount of data stored on a disk grow, so does the likelihood of data decay and other forms of uncorrected and undetected data corruption.
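The relationship between data volume and risk can be made concrete with a simple model. If each bit read independently has a small probability p of an unrecoverable error, the chance of at least one such error in n bits is 1 - (1 - p)^n, which grows with n. The sketch below evaluates this formula; the per-bit rate is a hypothetical figure chosen only for illustration, and real devices do not necessarily fail independently per bit.

```python
import math


def p_at_least_one_error(bits_read: float, per_bit_error_rate: float) -> float:
    """Return 1 - (1 - p)^n, computed in a numerically stable way for tiny p."""
    return -math.expm1(bits_read * math.log1p(-per_bit_error_rate))


ASSUMED_RATE = 1e-15  # hypothetical per-bit rate, for illustration only
for terabytes in (1, 10, 100):
    bits = terabytes * 8e12  # 1 TB taken as 8 * 10^12 bits
    print(f"{terabytes:>4} TB read: {p_at_least_one_error(bits, ASSUMED_RATE):.3%}")
```

Under this model, multiplying the amount of data read by ten multiplies the risk by roughly ten for as long as the probability remains small.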
Low-level disk controllers typically employ error correction codes (ECC) to correct erroneous data.
Higher-level software systems may be employed to mitigate the risk of such underlying failures by increasing redundancy and implementing integrity checking, error correction codes and self-repairing algorithms. The ZFS file system was designed to address many of these data corruption issues. The Btrfs file system also includes data protection and recovery mechanisms, as does ReFS.
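The combination of redundancy, integrity checking and self-repair can be sketched with a toy in-memory block store. It keeps two copies of each block and a checksum of the content; a read verifies each copy and overwrites any copy that fails verification with a good one. This is an illustrative sketch only and does not reproduce how ZFS, Btrfs or ReFS are implemented; the class name, the choice of CRC-32 and the two-copy layout are all assumptions.

```python
import zlib


class MirroredBlockStore:
    """Toy store keeping two copies of each block plus a CRC-32 of the content."""

    def __init__(self) -> None:
        self.copies: list[dict[int, bytearray]] = [{}, {}]
        self.checksums: dict[int, int] = {}

    def write(self, block_id: int, data: bytes) -> None:
        """Record the checksum and store the data in every copy."""
        self.checksums[block_id] = zlib.crc32(data)
        for copy in self.copies:
            copy[block_id] = bytearray(data)

    def read(self, block_id: int) -> bytes:
        """Return verified data, repairing any copy that fails its checksum."""
        expected = self.checksums[block_id]
        good = next(
            (bytes(c[block_id]) for c in self.copies
             if zlib.crc32(c[block_id]) == expected),
            None,
        )
        if good is None:
            raise IOError(f"block {block_id}: all copies are corrupt")
        for copy in self.copies:
            if zlib.crc32(copy[block_id]) != expected:
                copy[block_id] = bytearray(good)  # self-repair from the good copy
        return good
```

Keeping the checksum apart from the data it describes is what lets the store tell which copy is the damaged one; a plain mirror without checksums can notice that two copies differ but cannot decide which to trust.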
See also
* Checksum
* Database integrity
* Data curation
* Data preservation
* Data scrubbing
* Digital permanence
* Digital preservation
* Disc rot
* Error detection and correction
* Link rot
* Media preservation
* RAR archive file format has optional recovery
* PAR2 recovery file format