In
computing, data recovery is a process of retrieving deleted, inaccessible, lost, corrupted, damaged, or formatted data from
secondary storage
Computer data storage is a technology consisting of computer components and recording media that are used to retain digital data. It is a core function and fundamental component of computers.
The central processing unit (CPU) of a computer ...
,
removable media or
files
File or filing may refer to:
Mechanical tools and processes
* File (tool), a tool used to ''remove'' fine amounts of material from a workpiece
**Filing (metalworking), a material removal process in manufacturing
** Nail file, a tool used to gent ...
, when the data stored in them cannot be accessed in a usual way. The data is most often salvaged from storage media such as internal or external
hard disk drives (HDDs),
solid-state drives (SSDs),
USB flash drives,
magnetic tapes,
CDs,
DVDs,
RAID subsystems, and other
electronic devices. Recovery may be required due to physical damage to the storage devices or logical damage to the
file system
In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one larg ...
that prevents it from being
mounted by the host
operating system (OS).
Logical failures occur when the hard drive devices are functional but the user or automated-OS cannot retrieve or access date stored in it. It can occur due to corrupt engineering chip, lost partitions, deleted data, firmware failure, failed formatting/re-installation.
Data recovery can be a very simple or technical challenge. This is why there are specific software companies specialized in this field.
About
The most common data recovery scenarios involve an operating system failure, malfunction of a storage device, logical failure of storage devices, accidental damage or deletion, etc. (typically, on a single-drive, single-
partition, single-OS system), in which case the ultimate goal is simply to copy all important files from the damaged media to another new drive. This can be accomplished using a
Live CD, or DVD by booting directly from a
ROM or a USB drive instead of the corrupted drive in question. Many Live CDs or DVDs provide a means to mount the system drive and backup drives or removable media, and to move the files from the system drive to the backup media with a
file manager or
optical disc authoring software. Such cases can often be mitigated by
disk partitioning and consistently storing valuable data files (or copies of them) on a different partition from the replaceable OS system files.
Another scenario involves a drive-level failure, such as a compromised
file system
In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one larg ...
or drive partition, or a
hard disk drive failure
A hard disk drive failure occurs when a hard disk drive malfunctions and the stored information cannot be accessed with a properly configured computer.
A hard disk failure may occur in the course of normal operation, or due to an external factor ...
. In any of these cases, the data is not easily read from the media devices. Depending on the situation, solutions involve repairing the logical file system, partition table, or
master boot record
A master boot record (MBR) is a special type of boot sector at the very beginning of partitioned computer mass storage devices like fixed disks or removable drives intended for use with IBM PC-compatible systems and beyond. The concept of MBR ...
, or updating the
firmware
In computing, firmware is a specific class of computer software that provides the low-level control for a device's specific hardware. Firmware, such as the BIOS of a personal computer, may contain basic functions of a device, and may provide h ...
or drive recovery techniques ranging from software-based recovery of corrupted data, to hardware- and software-based recovery of damaged service areas (also known as the hard disk drive's "firmware"), to hardware replacement on a physically damaged drive which allows for the extraction of data to a new drive. If a drive recovery is necessary, the drive itself has typically failed permanently, and the focus is rather on a one-time recovery, salvaging whatever data can be read.
In a third scenario, files have been accidentally "
deleted" from a storage medium by the users. Typically, the contents of deleted files are not removed immediately from the physical drive; instead, references to them in the directory structure are removed, and thereafter space the deleted data occupy is made available for later data overwriting. In the mind of
end user
In product development, an end user (sometimes end-user) is a person who ultimately uses or is intended to ultimately use a product. The end user stands in contrast to users who support or maintain the product, such as sysops, system administrat ...
s, deleted files cannot be discoverable through a standard file manager, but the deleted data still technically exists on the physical drive. In the meantime, the original file contents remain, often several disconnected
fragments, and may be recoverable if not overwritten by other data files.
The term "data recovery" is also used in the context of
forensic applications or
espionage, where data which have been
encrypted, hidden, or deleted, rather than damaged, are recovered. Sometimes data present in the computer gets encrypted or hidden due to reasons like virus attacks which can only be recovered by some computer forensic experts.
Physical damage
A wide variety of failures can cause physical damage to storage media, which may result from human errors and natural disasters.
CD-ROM
A CD-ROM (, compact disc read-only memory) is a type of read-only memory consisting of a pre-pressed optical compact disc that contains data. Computers can read—but not write or erase—CD-ROMs. Some CDs, called enhanced CDs, hold both comput ...
s can have their metallic substrate or dye layer scratched off; hard disks can suffer from a multitude of mechanical failures, such as
head crash
A head crash is a hard-disk failure that occurs when a read–write head of a hard disk drive makes contact with its rotating platter, slashing its surface and permanently damaging its magnetic media. It is most often caused by a sudden seve ...
es, PCB failure, and failed motors;
tapes can simply break.
Physical damage to a hard drive, even in cases where a head crash has occurred, does not necessarily mean there will be a permanent loss of data. The techniques employed by many professional data recovery companies can typically salvage most, if not all, of the data that had been lost when the failure occurred.
Of course, there are exceptions to this, such as cases where severe damage to the hard drive
platters may have occurred. However, if the hard drive can be repaired and a full image or clone created, then the logical file structure can be rebuilt in most instances.
Most physical damage cannot be repaired by end users. For example, opening a hard disk drive in a normal environment can allow airborne dust to settle on the platter and become caught between the platter and the
read/write head
A disk read-and-write head is the small part of a disk drive which moves above the disk platter and transforms the platter's magnetic field into electrical current (reads the disk) or, vice versa, transforms electrical current into magnetic fi ...
. During normal operation, read/write heads float 3 to 6
nanometers above the platter surface, and the average dust particles found in a normal environment are typically around 30,000 nanometers in diameter. When these dust particles get caught between the read/write heads and the platter, they can cause new head crashes that further damage the platter and thus compromise the recovery process. Furthermore, end users generally do not have the hardware or technical expertise required to make these repairs. Consequently, data recovery companies are often employed to salvage important data with the more reputable ones using
class 100 dust- and static-free
cleanrooms.
Recovery techniques
Recovering data from physically damaged hardware can involve multiple techniques. Some damage can be repaired by replacing parts in the hard disk. This alone may make the disk usable, but there may still be logical damage. A specialized disk-imaging procedure is used to recover every readable bit from the surface. Once this image is acquired and saved on a reliable medium, the image can be safely analyzed for logical damage and will possibly allow much of the original file system to be reconstructed.
Hardware repair
A common misconception is that a damaged
printed circuit board
A printed circuit board (PCB; also printed wiring board or PWB) is a medium used in Electrical engineering, electrical and electronic engineering to connect electronic components to one another in a controlled manner. It takes the form of a L ...
(PCB) may be simply replaced during recovery procedures by an identical PCB from a healthy drive. While this may work in rare circumstances on hard disk drives manufactured before 2003, it will not work on newer drives. Electronics boards of modern drives usually contain drive-specific
adaptation data
A hard disk drive failure occurs when a hard disk drive malfunctions and the stored information cannot be accessed with a properly configured computer.
A hard disk failure may occur in the course of normal operation, or due to an external factor ...
(generally a map of bad sectors and tuning parameters) and other information required to properly access data on the drive. Replacement boards often need this information to effectively recover all of the data. The replacement board may need to be reprogrammed. Some manufacturers (Seagate, for example) store this information on a serial
EEPROM
EEPROM (also called E2PROM) stands for electrically erasable programmable read-only memory and is a type of non-volatile memory used in computers, usually integrated in microcontrollers such as smart cards and remote keyless systems, or as a ...
chip, which can be removed and transferred to the replacement board.
Each hard disk drive has what is called a ''system area'' or ''service area''; this portion of the drive, which is not directly accessible to the end user, usually contains drive's firmware and adaptive data that helps the drive operate within normal parameters. One function of the system area is to log defective sectors within the drive; essentially telling the drive where it can and cannot write data.
The sector lists are also stored on various chips attached to the PCB, and they are unique to each hard disk drive. If the data on the PCB do not match what is stored on the platter, then the drive will not calibrate properly. In most cases the drive heads will click because they are unable to find the data matching what is stored on the PCB.
Logical damage
![Data loss of image file](https://upload.wikimedia.org/wikipedia/commons/6/63/Data_loss_of_image_file.JPG)
The term "logical damage" refers to situations in which the error is not a problem in the hardware and requires software-level solutions.
Corrupt partitions and file systems, media errors
In some cases, data on a hard disk drive can be unreadable due to damage to the
partition table or
file system
In computing, file system or filesystem (often abbreviated to fs) is a method and data structure that the operating system uses to control how data is stored and retrieved. Without a file system, data placed in a storage medium would be one larg ...
, or to (intermittent) media errors. In the majority of these cases, at least a portion of the original data can be recovered by repairing the damaged partition table or file system using specialized data recovery software such as
Testdisk; software like
ddrescue can image media despite intermittent errors, and image raw data when there is partition table or file system damage. This type of data recovery can be performed by people without expertise in drive hardware as it requires no special physical equipment or access to platters.
Sometimes data can be recovered using relatively simple methods and tools; more serious cases can require expert intervention, particularly if parts of files are irrecoverable.
Data carving File carving is the process of reassembling computer files from fragments in the absence of
filesystem metadata.
Introduction and basic principles
All filesystems contain some metadata that describes the actual file system. At a minimum, this inc ...
is the recovery of parts of damaged files using knowledge of their structure.
Overwritten data
After data has been physically overwritten on a hard disk drive, it is generally assumed that the previous data are no longer possible to recover. In 1996,
Peter Gutmann, a computer scientist, presented a paper that suggested overwritten data could be recovered through the use of
magnetic force microscopy
Magnetic force microscopy (MFM) is a variety of atomic force microscopy, in which a sharp magnetized tip scans a magnetic sample; the tip-sample magnetic interactions are detected and used to reconstruct the magnetic structure of the sample surf ...
. In 2001, he presented another paper on a similar topic. To guard against this type of data recovery, Gutmann and Colin Plumb designed a method of irreversibly scrubbing data, known as the
Gutmann method and used by several disk-scrubbing software packages.
Substantial criticism has followed, primarily dealing with the lack of any concrete examples of significant amounts of overwritten data being recovered. Although Gutmann's theory may be correct, there is no practical evidence that overwritten data can be recovered, while research has shown to support that overwritten data cannot be recovered.
Solid-state drives (SSD) overwrite data differently from hard disk drives (HDD) which makes at least some of their data easier to recover. Most SSDs use
flash memory
Flash memory is an electronic non-volatile computer memory storage medium that can be electrically erased and reprogrammed. The two main types of flash memory, NOR flash and NAND flash, are named for the NOR and NAND logic gates. Both us ...
to store data in pages and blocks, referenced by
logical block addresses (LBA) which are managed by the
flash translation layer (FTL). When the FTL modifies a sector it writes the new data to another location and updates the map so the new data appear at the target LBA. This leaves the pre-modification data in place, with possibly many generations, and recoverable by data recovery software.
Lost, deleted, and formatted data
Sometimes, data present in the physical drives (Internal/External Hard disk,
Pen Drive, etc.) gets lost, deleted and formatted due to circumstances like virus attack, accidental deletion or accidental use of SHIFT+DELETE. In these cases, data recovery software is used to recover/restore the data files.
Logical bad sector
In the list of logical failures of hard disks, a logical bad sector is the most common fault leading data not to be readable. Sometimes it is possible to sidestep error detection even in software, and perhaps with repeated reading and statistical analysis recover at least some of the underlying stored data. Sometimes prior knowledge of the data stored and the error detection and correction codes can be used to recover even erroneous data. However, if the underlying physical drive is degraded badly enough, at least the hardware surrounding the data must be replaced, or it might even be necessary to apply laboratory techniques to the physical recording medium. Each of the approaches is progressively more expensive, and as such progressively more rarely sought.
Eventually, if the final, physical storage medium has indeed been disturbed badly enough, recovery will not be possible using any means; the information has irreversibly been lost.
Remote data recovery
Recovery experts do not always need to have physical access to the damaged hardware. When the lost data can be recovered by software techniques, they can often perform the recovery using remote access software over the Internet, LAN or other connection to the physical location of the damaged media. The process is essentially no different from what the end user could perform by themselves.
Remote recovery requires a stable connection with an adequate bandwidth. However, it is not applicable where access to the hardware is required, as in cases of physical damage.
Four phases of data recovery
Usually, there are four phases when it comes to successful data recovery, though that can vary depending on the type of data corruption and recovery required.
; Phase 1: Repair the hard disk drive
: The hard drive is repaired in order to get it running in some form, or at least in a state suitable for reading the data from it. For example, if heads are bad they need to be changed; if the PCB is faulty then it needs to be fixed or replaced; if the spindle motor is bad the platters and heads should be moved to a new drive.
; Phase 2: Image the drive to a new drive or a disk image file
: When a hard disk drive fails, the importance of getting the data off the drive is the top priority. The longer a faulty drive is used, the more likely further data loss is to occur. Creating an image of the drive will ensure that there is a secondary copy of the data on another device, on which it is safe to perform testing and recovery procedures without harming the source.
; Phase 3: Logical recovery of files, partition, MBR and filesystem structures
: After the drive has been cloned to a new drive, it is suitable to attempt the retrieval of lost data. If the drive has failed logically, there are a number of reasons for that. Using the clone it may be possible to repair the partition table or
master boot record
A master boot record (MBR) is a special type of boot sector at the very beginning of partitioned computer mass storage devices like fixed disks or removable drives intended for use with IBM PC-compatible systems and beyond. The concept of MBR ...
(MBR) in order to read the file system's data structure and retrieve stored data.
; Phase 4: Repair damaged files that were retrieved
: Data damage can be caused when, for example, a file is written to a sector on the drive that has been damaged. This is the most common cause in a failing drive, meaning that data needs to be reconstructed to become readable. Corrupted documents can be recovered by several software methods or by manually reconstructing the document using a hex editor.
Restore disk
The
Windows operating system can be reinstalled on a computer that is already licensed for it. The reinstallation can be done by downloading the operating system or by using a "restore disk" provided by the computer manufacturer. Eric Lundgren was fined and sentenced to U.S. federal prison in April 2018 for producing 28,000 restore disks and intending to distribute them for about 25 cents each as a convenience to computer repair shops.
List of data recovery software
Bootable
Data recovery cannot always be done on a running system. As a result, a
boot disk,
live CD,
live USB, or any other type of live distro contains a minimal operating system.
*
BartPE
BartPE (Bart's Preinstalled Environment) is a discontinued tool that customizes Windows XP or Windows Server 2003 into a lightweight environment, similar to Windows Preinstallation Environment, which could be run from a Live CD or Live USB drive. ...
: a lightweight variant of
Microsoft Windows XP or
Windows Server 2003 32-bit operating systems, similar to a Windows Preinstallation Environment, which can be run from a live CD or live USB drive. Discontinued.
*
Finnix: a
Debian
Debian (), also known as Debian GNU/Linux, is a Linux distribution composed of free and open-source software, developed by the community-supported Debian Project, which was established by Ian Murdock on August 16, 1993. The first version of D ...
-based Live CD with a focus on being small and fast, useful for computer and data rescue
*
Disk Drill Basic: capable of creating bootable
Mac OS X USB drives for data recovery
*
Knoppix
KNOPPIX ( ) is an operating system based on Debian designed to be run directly from a CD / DVD (Live CD) or a USB flash drive (Live USB), one of the first live operating system distributions (just after Yggdrasil Linux). Knoppix was developed b ...
: contains utilities for data recovery under Linux
*
SpinRite
SpinRite is a computer program for scanning RAS Random Access Storage devices such as hard disks, reading and rewriting data using proprietary programming methods to resolve and retrieve data that is unreadable by DOS or Windows. The first vers ...
: a
FreeDOS-based data recovery tool for hard disks and magnetic storage devices
*
SystemRescueCD: an
Arch Linux based live CD, useful for repairing unbootable computer systems and retrieving data after a system crash
*
Windows Preinstallation Environment (WinPE): A customizable Windows Boot DVD (made by Microsoft and distributed for free). Can be modified to boot to any of the programs listed.
Consistency checkers
*
CHKDSK: a consistency checker for
DOS and Windows systems
*
Disk First Aid: a consistency checker for
Mac OS 9
Mac OS 9 is the ninth major release of Apple Inc., Apple's classic Mac OS operating system which was succeeded by macOS, Mac OS X (renamed to OS X in 2011 and macOS in 2016) in 2001. Introduced on October 23, 1999, it was promoted by Apple as "T ...
*
Disk Utility: a consistency checker for
Mac OS X
*
fsck: a consistency checker for UNIX
*
gparted
GParted (acronym of GNOME Partition Editor) is a GTK front-end to GNU Parted and an official GNOME partition-editing application (alongside Disks). GParted is used for creating, deleting, resizing, moving, checking, and copying disk partitions ...
: a GUI for
GNU parted, the GNU partition editor, capable of calling fsck
File recovery
*
CDRoller: recovers data from
optical disc
*
EaseUS Data Recovery Wizard
EaseUS Data Recovery Wizard is proprietary data recovery software that aims to recover deleted files from a partition on a digital storage device such as a hard disk or solid state drive,and recover deleted or formatted partitions and corrupted ...
: Windows and Mac file recovery utilities by EaseUS
*
Disk Drill Basic: data recovery application for Mac OS X and Windows
*
dvdisaster: generates error-correction data for optical discs
* from
Nero AG: "Data Reliability" feature generates redundant and checksum data in the remaining space of optical discs.
*
GetDataBack
GetDataBack is a data recovery software developed by Runtime Software. It can be used to recover data from external and internal hard disks, flash cards, USB drives, etc. with the FAT, ExFAT, NTFS, Ext, HFS+ and APFS file system
In computing, ...
: a Windows recovery program
*
Hetman Partition Recovery: data drive recovery solution
*
IsoBuster
IsoBuster is a data recovery computer program by Smart Projects, a Belgian company founded in 1995 by Peter Van Hove. As of version 3.0, it can recover data from damaged file systems or physically damaged disks including optical discs, hard di ...
: recovers data from optical discs, USB sticks, flash drives and hard drives
*
Mac Data Recovery Guru
Mac Data Recovery Guru is a data recovery application, for macOS. It was designed to recover deleted files from hard disk drives, USB flash drives, memory cards of cameras and portable devices, MP3 players, PlayStations, X-Boxes, Wii's, pal ...
: Mac OS X data recovery program which works on USB sticks, optical media, and hard drives
*
MiniTool Partition Wizard
MiniTool Partition Wizard is a partition management program for hard disk drives developed by MiniTool Solution.
The 'free' version cannot save any of the data that the software may find.
From version 12 all free features have been removed, e ...
: for Windows 7 and later; includes data recovery
*
Norton Utilities: a suite of utilities that has a file recovery component
*
PhotoRec: advanced multi-platform program with
text-based user interface used to recover files
*
Recover My Files: proprietary software for Windows 2000 and later—FAT, NTFS and HFS
*
Recovery Toolbox: freeware and shareware tools plus online services for various Windows 2000 and later programs
*
Recuva: proprietary software for Windows 2000 and later—FAT and NTFS
*
Stellar Data Recovery for Mac: data recovery utility for Mac OS
*
Stellar Data Recovery for Windows: data recovery utility for Windows
*
Stellar Photo Recovery: photo recovery utility for Mac OS and Windows
*
TestDisk: free, open source, multi-platform. recover files and lost
partitions
*
TuneUp Utilities
AVG TuneUp, previously called AVG PC Tuneup, and TuneUp Utilities, is a utility software suite for Microsoft Windows designed to help manage, maintain, optimize, configure, and troubleshoot a computer system. It was produced and developed by Tu ...
: a suite of utilities that has a file recovery component for Windows XP and later
*
Windows File Recovery
Windows File Recovery is a command-line software utility from Microsoft to recover deleted files. It is freely available for Windows 10 version 2004 (May 2020 Update) and later from the Microsoft Store.
Windows File Recovery can recover file ...
: a command-line utility from Microsoft to recover deleted files for Windows 10 version 2004 and later
Forensics
*
Foremost: an open-source
command-line file recovery program, originally developed by the
U.S. Air Force Office of Special Investigations and
NPS Center for Information Systems Security Studies and Research
*
Forensic Toolkit: by AccessData, used by law enforcement
*
Open Computer Forensics Architecture: An open-source program for Linux
*
The Coroner's Toolkit: a suite of utilities for assisting in forensic analysis of a UNIX system after a break-in
*
The Sleuth Kit: also known as TSK, a suite of forensic analysis tools developed by Brian Carrier for UNIX, Linux and Windows systems. TSK includes the Autopsy forensic browser.
Imaging tools
*
Clonezilla: a free disk cloning, disk imaging, data recovery, and deployment boot disk
*
dd: common byte-to-byte cloning tool found on Unix-like systems
*
ddrescue: an open-source tool similar to dd but with the ability to skip over and subsequently retry bad blocks on failing storage devices
*
Team Win Recovery Project: a free and open-source recovery system for Android devices
See also
*
Backup
In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "back up", w ...
*
Cleanroom
*
Comparison of file systems
*
Computer forensics
Computer forensics (also known as computer forensic science) is a branch of digital forensic science pertaining to evidence found in computers and digital storage media. The goal of computer forensics is to examine digital media in a forensical ...
*
Continuous data protection
*
Crypto-shredding
*
Data archaeology
*
Data curation
*
Data preservation
*
Data loss
*
Error detection and correction
In information theory and coding theory with applications in computer science and telecommunication, error detection and correction (EDAC) or error control are techniques that enable reliable delivery of digital data over unreliable communi ...
*
File carving File carving is the process of reassembling computer files from fragments in the absence of
filesystem metadata.
Introduction and basic principles
All filesystems contain some metadata that describes the actual file system. At a minimum, this inc ...
*
Hidden file and hidden directory
*
Undeletion
*
List of data recovery software
*
List of data-erasing software
References
Further reading
* Tanenbaum, A. & Woodhull, A. S. (1997). ''Operating Systems: Design And Implementation,'' 2nd ed. New York: Prentice Hall.
*
{{Authority control
Computer data
Data management
Transaction processing
*
Recovery