Data degradation

Data degradation is the gradual corruption of computer data due to an accumulation of non-critical failures in a data storage device. The phenomenon is also known as data decay, data rot or bit rot.


Example

Below are several digital images illustrating data degradation, each consisting of 326,272 bits. The original photo is displayed first. In the next image, a single bit was changed from 0 to 1. In the following two images, two and three bits were flipped, respectively. On Linux systems, the binary difference between files can be revealed with a byte-wise comparison command such as cmp.

[Image: Bitrot in JPEG files, 0 bits flipped]
[Image: Bitrot in JPEG files, 1 bit flipped]
[Image: Bitrot in JPEG files, 2 bits flipped]
[Image: Bitrot in JPEG files, 3 bits flipped]
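The single-bit change described above can be reproduced on any byte buffer. The following is a minimal sketch, not the procedure used to produce the images: the helper names `flip_bit` and `differing_bytes` are hypothetical, and the 16-byte buffer merely stands in for a real file.

```python
def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with one bit inverted (bit 0 is the MSB of byte 0)."""
    byte_index, offset = divmod(bit_index, 8)
    corrupted = bytearray(data)
    corrupted[byte_index] ^= 0x80 >> offset
    return bytes(corrupted)


def differing_bytes(a: bytes, b: bytes) -> list[tuple[int, int, int]]:
    """List (offset, byte_in_a, byte_in_b) for every position where a and b differ."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Stand-in for file contents; a real test would read the bytes from disk.
original = bytes(range(16))
decayed = flip_bit(original, 37)          # simulate one decayed bit
diffs = differing_bytes(original, decayed)
print(diffs)                              # [(4, 4, 0)]
```

A single inverted bit changes exactly one byte, which is why a byte-wise comparison is enough to locate it even when the visible damage to a compressed image is much larger.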


Primary storage

Data degradation in dynamic random-access memory (DRAM) can occur when the electric charge of a bit in DRAM disperses, possibly altering program code or stored data. DRAM may be altered by cosmic rays or other high-energy particles. Such data degradation is known as a soft error. ECC memory can be used to mitigate this type of data degradation.
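To illustrate how an error-correcting code can undo a single flipped bit, here is a sketch of the classic Hamming(7,4) code. It is an assumption chosen for brevity: ECC memory generally protects much wider words and can usually also detect double-bit errors, so this shows the principle rather than what any particular memory module implements. The function names are hypothetical.

```python
def hamming74_encode(nibble: list[int]) -> list[int]:
    """Encode 4 data bits into a 7-bit codeword (positions 1..7 = p1 p2 d1 p3 d2 d3 d4)."""
    d1, d2, d3, d4 = nibble
    p1 = d1 ^ d2 ^ d4   # covers positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4   # covers positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4   # covers positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_decode(word: list[int]) -> tuple[list[int], int]:
    """Correct at most one flipped bit; return (data bits, 1-based error position or 0)."""
    w = list(word)
    s1 = w[0] ^ w[2] ^ w[4] ^ w[6]
    s2 = w[1] ^ w[2] ^ w[5] ^ w[6]
    s3 = w[3] ^ w[4] ^ w[5] ^ w[6]
    syndrome = s1 + 2 * s2 + 4 * s3   # equals the position of the bad bit, if any
    if syndrome:
        w[syndrome - 1] ^= 1
    return [w[2], w[4], w[5], w[6]], syndrome


codeword = hamming74_encode([1, 0, 1, 1])
codeword[4] ^= 1                          # simulate a soft error at position 5
data, position = hamming74_decode(codeword)
print(data, position)                     # [1, 0, 1, 1] 5
```

The three parity bits form a syndrome that names the position of the flipped bit, so the decoder can repair the word without a second copy of the data.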


Secondary storage

Data degradation results from the gradual decay of storage media over the course of years or longer. Causes vary by medium:

* Solid-state media, such as EPROMs, flash memory and other solid-state drives, store data using electrical charges, which can slowly leak away due to imperfect insulation. The chip itself is not affected by this, so reprogramming it approximately once per decade prevents decay. An undamaged copy of the master data is required for the reprogramming.
* Magnetic media, such as hard disk drives, floppy disks and magnetic tapes, may experience data decay as bits lose their magnetic orientation. Periodic refreshing by rewriting the data can alleviate this problem. In warm/humid conditions these media, especially those poorly protected against ambient air, are prone to the physical decomposition of the storage medium.
* Optical media, such as CD-R, DVD-R and BD-R, may experience data decay from the breakdown of the storage medium. This can be mitigated by storing discs in a dark, cool, low-humidity location. "Archival quality" discs are available with an extended lifetime, but are still not permanent. However, data integrity scanning that measures the rates of various types of errors is able to predict data decay on optical media well ahead of uncorrectable data loss occurring.
* Paper media, such as punched cards and punched tape, may literally rot. Mylar punched tape is another approach that does not rely on electromagnetic stability.
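The refresh strategy described for solid-state and magnetic media depends on knowing which data has decayed and on having an undamaged master copy. One way this could look in software is a periodic scan against recorded digests. The sketch below is an assumption for illustration: the in-memory dictionaries stand in for storage media, and `digest`, `scrub` and `refresh` are hypothetical helper names.

```python
import hashlib


def digest(data: bytes) -> str:
    """SHA-256 digest used as the recorded fingerprint of an item."""
    return hashlib.sha256(data).hexdigest()


def scrub(stored: dict[str, bytes], manifest: dict[str, str]) -> list[str]:
    """Return the names whose stored contents no longer match the manifest."""
    return sorted(
        name for name, expected in manifest.items()
        if name not in stored or digest(stored[name]) != expected
    )


def refresh(stored: dict[str, bytes], master: dict[str, bytes],
            manifest: dict[str, str]) -> list[str]:
    """Rewrite decayed items from the master copy; return the repaired names."""
    repaired = []
    for name in scrub(stored, manifest):
        if digest(master[name]) != manifest[name]:
            raise ValueError(f"master copy of {name!r} is damaged as well")
        stored[name] = master[name]
        repaired.append(name)
    return repaired


master = {"a.bin": b"archival payload", "b.bin": b"second record"}
manifest = {name: digest(data) for name, data in master.items()}
stored = dict(master)
stored["b.bin"] = b"second recorc"        # simulated decay on the working copy
repaired = refresh(stored, master, manifest)
print(repaired)                           # ['b.bin']
```

The master copy is verified before it is used, because rewriting from a master that has itself decayed would silently replace one corruption with another.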


Hardware failures

Most disk, disk controller and higher-level systems are subject to a slight chance of unrecoverable failure. With ever-growing disk capacities, file sizes, and increases in the amount of data stored on a disk, the likelihood of the occurrence of data decay and other forms of uncorrected and undetected data corruption increases. Low-level disk controllers typically employ error correction codes (ECC) to correct erroneous data. Higher-level software systems may be employed to mitigate the risk of such underlying failures by increasing redundancy and implementing integrity checking, error correction codes and self-repairing algorithms. The ZFS file system was designed to address many of these data corruption issues. The Btrfs file system also includes data protection and recovery mechanisms, as does ReFS.
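The combination of redundancy, integrity checking and self-repair mentioned above can be sketched with a toy stripe that keeps a checksum per block plus one XOR parity block. This is an illustrative assumption only and does not describe how ZFS, Btrfs or ReFS are implemented; the class `ParityStripe` and its methods are hypothetical, and the parity block itself is left unprotected to keep the example short.

```python
import zlib
from functools import reduce


def xor_blocks(blocks: list[bytes]) -> bytes:
    """Byte-wise XOR of equal-length blocks."""
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


class ParityStripe:
    """Equal-length data blocks with per-block CRC-32 and a single XOR parity block."""

    def __init__(self, blocks: list[bytes]):
        if len({len(block) for block in blocks}) != 1:
            raise ValueError("all blocks must have the same length")
        self.blocks = [bytearray(block) for block in blocks]
        self.checksums = [zlib.crc32(block) for block in blocks]
        self.parity = xor_blocks(blocks)

    def read(self, index: int) -> bytes:
        """Return a verified block, rebuilding it from parity if its checksum fails."""
        block = bytes(self.blocks[index])
        if zlib.crc32(block) == self.checksums[index]:
            return block
        others = [bytes(b) for i, b in enumerate(self.blocks) if i != index]
        rebuilt = xor_blocks(others + [self.parity])
        if zlib.crc32(rebuilt) != self.checksums[index]:
            raise OSError("block is unrecoverable: more than one copy is damaged")
        self.blocks[index][:] = rebuilt   # self-repair: write the good data back
        return rebuilt


stripe = ParityStripe([b"AAAA", b"BBBB", b"CCCC"])
stripe.blocks[1][2] ^= 0x10               # simulate silent corruption in block 1
recovered = stripe.read(1)
print(recovered)                          # b'BBBB'
```

The checksum identifies which block is wrong, which plain parity cannot do on its own, and the parity then supplies the missing information to rebuild it.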


See also

* Checksum
* Database integrity
* Data curation
* Data preservation
* Data scrubbing
* Digital permanence
* Digital preservation
* Disc rot
* Error detection and correction
* Link rot
* Media preservation
* RAR archive file format has optional recovery
* PAR2 recovery file format

