A disk compression
software utility increases the amount of information that can be stored on a
hard disk
A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating hard disk drive platter, pla ...
drive of given size. Unlike a
file compression utility, which compresses only specified files—and which requires the
user to designate the files to be compressed—an on-the-fly disk compression utility works automatically through resident software without the user needing to be aware of its existence. On-the-fly disk compression is therefore also known as transparent, real-time or online disk compression.
When information needs to be stored to the hard disk, the utility
compresses the information. When information needs to be read, the utility decompresses the information. A disk compression utility overrides the standard
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
routines. Since all
software application
Application software is any computer program that is intended for end-user use not computer operator, operating, system administration, administering or computer programming, programming the computer. An application (app, application program, sof ...
s access the hard disk using these routines, they continue to work after disk compression has been installed.
Disk compression utilities were popular especially in the early 1990s, when
microcomputer hard disks were still relatively small (20 to 80
megabyte
The megabyte is a multiple of the unit byte for digital information. Its recommended unit symbol is MB. The unit prefix ''mega'' is a multiplier of (106) in the International System of Units (SI). Therefore, one megabyte is one million bytes ...
s). Hard drives were also rather expensive at the time, costing roughly 10
USD
The United States dollar (symbol: $; currency code: USD) is the official currency of the United States and several other countries. The Coinage Act of 1792 introduced the U.S. dollar at par with the Spanish silver dollar, divided it int ...
per megabyte. For the users who bought disk compression applications, the software proved to be in the short term a more economic means of acquiring more disk space as opposed to replacing their current drive with a larger one. A good disk compression utility could, on average, double the available space with negligible speed loss. Disk compression fell into disuse by the late 1990s, as advances in hard drive technology and manufacturing led to increased capacities and lower prices.
Common disk compression solutions
Standalone hardware
Some of the initial disk compression solutions were hardware-assisted and utilized add-on compressor/decompressor
coprocessor cards in addition to a software driver. Known solutions include:
* Stacker XT/8 and Stacker AT/16 from
Stac Electronics
* Expanz! and Expanz! Plus from
InfoChip Systems
* DiskDoubler dd2000 from
Datran Corporation
*
MRCI (Microsoft Real-Time Compression Interface) from
Microsoft
Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
Standalone software
With increasing PC processor power software-only solutions began to reach or even outperform the performance of hardware-assisted solutions in most scenarios. These compression utilities were sold independently. A user had to specifically choose to install and configure the software.
* Squish
and Squish Plus from
Sundog Software
* Stacker from
Stac Electronics
* XtraDrive from
Integrated Information Technology (IIT)
* SuperStor and SuperStor Pro from
AddStor
* DoubleDisk and DoubleDisk Gold from
Vertisoft Systems
* JAM (JAM.SYS) from JAM Software, Kiev, Ukraine
*
DiskDoubler from
Salient Software
* Double Density from
Data Becker
Bundled software
The idea of
bundling disk compression into new machines appealed to resellers and users. Resellers liked that they could claim more storage space; users liked that they did not have to configure the software. Bundled utilities included (in chronological order):
*
DR DOS 6.0 (1991), from
Digital Research
Digital Research, Inc. (DR or DRI) was a privately held American software company created by Gary Kildall to market and develop his CP/M operating system and related 8-bit, 16-bit and 32-bit systems like MP/M, Concurrent DOS, FlexOS, Multiuser ...
, included a custom version of
AddStor's ''SuperStor''.
*
PalmDOS 1.0 (1992), from
Novell
Novell, Inc. () was an American software and services company headquartered in Provo, Utah, that existed from 1980 until 2014. Its most significant product was the multi-platform network operating system known as NetWare. Novell technolog ...
(after acquiring Digital Research), included a
DPMS-enabled version of ''SuperStor''.
*
MS-DOS 6.0 (1993), from
Microsoft
Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
, included ''
DoubleSpace'',
which was based in part on
Vertisoft Systems' ''DoubleDisk''. MS-DOS 6.0 introduced the so-called
DOS preload API, an undocumented
challenge–response type interface which allowed the disk compression software (
DBLSPACE.BIN) to be loaded automatically even before
CONFIG.SYS was processed.
*
PC DOS 6.1 (1993), from
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
, while the original release did not include a disk compressor, a custom version of ''SuperStor/DS'' (after acquiring AddStor) was included in a later release, also introducing the preload API in the PC DOS line.
*
MS-DOS 6.2 (1993), from
Microsoft
Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
, included an improved version of ''DoubleSpace''.
*
Novell DOS 7 (1993), from Novell, included a DPMS-enabled version of ''
Stacker 3.12'' (
STACKER.BIN) from
Stac Electronics, which also supported compressed data exchange with remote
NetWare and
PNW servers also running Stacker in order to avoid unnecessary recompression when copying files.
It also introduced support for the DOS 6 preload API in order to maintain compatibility with Microsoft's DoubleSpace in multi-boot scenarios.
A similar mechanism was used to load the optional single- or multi-user
SECURITY component.
*
PTS-DOS Extended 6.4 (1994), from
PhysTechSoft
PTS-DOS (aka PTS/DOS) is a disk operating system, a DOS clone, developed in Russia by PhysTechSoft and Paragon Technology Systems.
History and versions
PhysTechSoft was formed in 1991 in Moscow, Russia by graduates and members of MIPT, in ...
, included a disk compression component named ''Folder'' (
FOLDER.SYS)
*
PC DOS 6.3 (1994), from IBM, included a version of ''SuperStor/DS''.
*
MS-DOS 6.22 (1994), from Microsoft, included ''
DriveSpace'', which was ''DoubleSpace'' with a different
compression algorithm (
DRVSPACE.BIN), after
MS-DOS 6.21 had to ship without any disk compression software at all for legal reasons.
*
PC DOS 7.0 (1995), from IBM, included a DPMS-enabled version of ''
Stacker 4.02'' from Stac Electronics.
*
Windows 95
Windows 95 is a consumer-oriented operating system developed by Microsoft and the first of its Windows 9x family of operating systems, released to manufacturing on July 14, 1995, and generally to retail on August 24, 1995. Windows 95 merged ...
(1995), from Microsoft, included an updated version of ''
DriveSpace 2''.
*
Windows 95 OSR2 (1996), from Microsoft, included a stripped-down version of ''
DriveSpace 3''.
*
Windows 95 Plus! (1996), from Microsoft, included the full version of ''DriveSpace 3''.
*
OpenDOS 7.01 (1997), from
Caldera
A caldera ( ) is a large cauldron-like hollow that forms shortly after the emptying of a magma chamber in a volcanic eruption. An eruption that ejects large volumes of magma over a short period of time can cause significant detriment to the str ...
(after acquiring Novell DOS), included a DPMS-enabled version of ''Stacker 3.12''.
*
DR-DOS 7.02 (1998), from Caldera, included a DPMS-enabled version of ''Stacker 3.12''.
DR-DOS 7.02 also added support for Microsoft's ''DriveSpace''.
*
PC DOS 2000 (1998), from IBM, included a DPMS-enabled version of ''Stacker 4.02''.
*
Windows 98
Windows 98 is a consumer-oriented operating system developed by Microsoft as part of its Windows 9x family of Microsoft Windows operating systems. It was the second operating system in the 9x line, as the successor to Windows 95. It was Software ...
(1998), from Microsoft, included the full version of ''DriveSpace 3''.
*
DR-DOS 7.03 (1999), from Caldera, included a DPMS-enabled version of ''Stacker 3.12''.
*
Windows 98 SE (1999), from Microsoft, included the full version of ''DriveSpace 3''.
*
PTS-DOS Pro 2000 (1999), from
Paragon Technology Software, included a disk compression component named ''DATACOMP''.
Other solutions
* Squeeze, a resident DOS file compressor for
Lotus 1-2-3
Lotus 1-2-3 is a discontinued spreadsheet program from Lotus Software (later part of IBM). It was the first killer application of the IBM PC, was hugely popular in the 1980s, and significantly contributed to the success of IBM PC-compatibles ...
and
Lotus Symphony files from Turner Hall Publishing
* Squish Plus ROM from
Sundog Software, but distributed exclusively through
Personalized Software (now
Thaddeus Computing), was a special
ROM
Rom, or ROM may refer to:
Biomechanics and medicine
* Risk of mortality, a medical classification to estimate the likelihood of death for a patient
* Rupture of membranes, a term used during pregnancy to describe a rupture of the amniotic sac
* ...
chip containing a custom version of Squish Plus with only 36 KB footprint adapted for the
HP Portable Plus.
*
Double Tools for DoubleSpace from
Addstor was an add-on product, enhancing the functionality of the
DoubleSpace bundled with
MS-DOS 6.0.
* Multimedia Stacker from
Helix Software was a bundle of Stac's
DPMS-enabled Stacker 4.01 combined with
Helix Cloaking and utilities, utilizing Cloaking's built-in DPMS server to relocate and run in
protected mode
In computing, protected mode, also called protected virtual address mode, is an operational mode of x86-compatible central processing units (CPUs). It allows system software to use features such as Memory_segmentation, segmentation, virtual mem ...
.
* THS filesystem (1994–1996 by Thomas Scheuermann), a compressed file system driver for
Linux
Linux ( ) is a family of open source Unix-like operating systems based on the Linux kernel, an kernel (operating system), operating system kernel first released on September 17, 1991, by Linus Torvalds. Linux is typically package manager, pac ...
, which could read some
CVFs.
* DMSDOS (1996), a Linux kernel driver,
to support the reading and writing of DoubleSpace, DriveSpace 2 & 3, as well as Stacker 3 & 4 volumes.
While
Windows XP
Windows XP is a major release of Microsoft's Windows NT operating system. It was released to manufacturing on August 24, 2001, and later to retail on October 25, 2001. It is a direct successor to Windows 2000 for high-end and business users a ...
, from Microsoft, included both a native support and a
command
Command may refer to:
Computing
* Command (computing), a statement in a computer language
* command (Unix), a Unix command
* COMMAND.COM, the default operating system shell and command-line interpreter for DOS
* Command key, a modifier key on A ...
named that compresses files on
NTFS
NT File System (NTFS) (commonly called ''New Technology File System'') is a proprietary journaling file system developed by Microsoft in the 1990s.
It was developed to overcome scalability, security and other limitations with File Allocation Tabl ...
systems, that is not implemented as a separate "compressed drive" like those above.
Operation
Disk compression usually creates a single large file, which becomes a
virtual hard drive. This is similar to how a single physical hard drive can be
partitioned into multiple virtual drives. The compressed drive is accessed via a
device driver
In the context of an operating system, a device driver is a computer program that operates or controls a particular type of device that is attached to a computer or automaton. A driver provides a software interface to hardware devices, enabli ...
.
Compressing existing drives
All drives would initially be empty. The utility to create a drive would usually offer to "compress a current drive". This meant the utility would:
* Create an empty compressed drive, stored on the existing drive.
* Transfer existing files on the old drive to the new compressed drive.
* Increase the size of the new compressed drive as necessary to accommodate more files and allow empty space when done.
* When all files were transferred, the drive letters would be swapped.
Usually certain
system files would not be transferred. For example, OS
swap files would remain only on the host drive.
Compressing the boot drive
A device driver had to be loaded to access the compressed drive. A compressed drive C: required changes to the
boot process as follows:
* The
System BIOS loads sector 0 of the first physical hard drive (
Master Boot Record
A master boot record (MBR) is a type of boot sector in the first block of disk partitioning, partitioned computer mass storage devices like fixed disks or removable drives intended for use with IBM PC-compatible systems and beyond. The concept ...
).
* The partition sector loads sector 0 of the bootable partition (
Volume Boot Record). In this case, it's the host drive.
* Host drive sector 0 loads (in the case of
DOS) the DOS BIOS (
IO.SYS,
IBMBIO.COM etc.) Depending on the version, only the first few sectors of the file or the whole file may be loaded into memory. In the first case, the loaded stub contains another loader loading the remainder of the file from disk. In the case of DR-DOS, the BIOS is packed and will decompress itself through an embedded decompression routine.
* The DOS BIOS then loads the DOS kernel (
MSDOS.SYS,
IBMDOS.COM etc.)
* If the DOS supports the
preload API, the DOS BIOS attempts to load a preloadable compression driver
(
DBLSPACE.BIN,
DRVSPACE.BIN or
STACKER.BIN) from the root of the host drive and attempts to communicate with it through the DOS preload API.
If the driver responds as expected, the DOS BIOS will keep the driver, if not, it will get discarded.
If kept and configured accordingly, drive letters may get swapped, so that the compressed drive becomes C:.
* The DOS BIOS starts
CONFIG.SYS processing by looking for the file in the root of the boot drive. In the case of DR-DOS,
DCONFIG.SYS takes precedence over CONFIG.SYS, if present.
* Compression drivers not loaded through the preload API may be loaded through
DEVICE statements.
* Depending on configuration and if this hasn't happened already, drive letters may get swapped, so that the compressed drive becomes C:.
* Processing continues from compressed drive.
Performance impact
On systems with slower hard drives, disk compression could actually increase system performance. This was accomplished two ways:
* Once compressed, there was less data to be stored.
* Disk accesses would often be batched together for efficiency.
If the system had to wait frequently for hard drive access to complete (
I/O bound) converting the hard drive to compressed drives could speed up the system significantly. Compression and decompression of the data increases the CPU utilization. If the system was already
CPU bound
A central processing unit (CPU), also called a central processor, main processor, or just processor, is the primary Processor (computing), processor in a given computer. Its electronic circuitry executes Instruction (computing), instructions ...
, disk compression decreases overall performance.
Drawbacks
Some common drawbacks to using disk compression:
* Not all compression utilities would confirm the absence of errors in the
FAT file system before compressing a disk in place. Some errors, such as
crosslinked files, could result in additional data loss during the transfer process.
* The compressed drive is only visible if the device driver is loaded and the compressed drive is mounted. A
boot disk, for example, might not contain the driver. (This was not normally an issue for solutions that came bundled with the operation systems unless booting foreign operating systems, as system utilities like
SYS were modified to copy the compression drivers to boot disks by default,
and the corresponding operation systems would also fall back to look for them on drives other than the boot disk if they couldn't be found there.)
* Uninstallation was problematic if the expanded size of data on the compressed disk was larger than the normal disk capacity.
* Users did not always realize that the large file on the host drive contained the compressed drive. While it was usually "hidden" by default,
users who did find the large file curious or suspicious were able to delete it. This would normally result in data loss.
See also
*
RAM compression
*
Data compression
In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
*
Disk fragmentation
*
Executable compression
*
Self-extracting archive
*
cramfs
*
e2compr
*
SquashFS
Squashfs is a compressed read-only file system for Linux. Squashfs compresses files, inodes and directories, and supports block sizes from 4 KiB up to 1 MiB for greater compression. Several compression algorithms are supported. Squashfs is ...
*
MDFAT (DoubleSpace)
*
BitFAT (DoubleSpace)
*
MDBPB (DoubleSpace)
Notes
[In crosslinked files, two files are storing at least part of their data in the same location. At least part of one file (the "bad" file) is always lost in this instance. However, if the "bad" file is copied and then deleted, part of the "good" file is deleted as well. Microsoft SCANDISK was created, in part, to perform a better check of the file system prior to compression than the ]MS-DOS
MS-DOS ( ; acronym for Microsoft Disk Operating System, also known as Microsoft DOS) is an operating system for x86-based personal computers mostly developed by Microsoft. Collectively, MS-DOS, its rebranding as IBM PC DOS, and a few op ...
CHKDSK
In computing, CHKDSK (short for "check disk") is a system software, system tool and command (computing), command in DOS and Microsoft Windows (and related operating systems), as well as Digital Research FlexOS, IBM/Toshiba 4690 Operating System, 4 ...
utility.
[For example, DOS associated up to four attributes with files: System, Hidden, Read-Only, and Archive. Files with the System or Hidden attributes are often not displayed by default. Files with the System or Read-Only attribute cannot be deleted with the ERASE or ]DEL
Del, or nabla, is an operator used in mathematics (particularly in vector calculus) as a vector differential operator, usually represented by the nabla symbol ∇. When applied to a function defined on a one-dimensional domain, it denotes ...
DOS command. Most compression utilities would mark the drive file with at least one or more of the System, Hidden, and Read-Only attributes (many would use all three). However, files marked with such attributes can be viewed and deleted by other means. In addition, the user can also remove attributes.
References
External links
*
* ff.
*
* {{cite journal , journal=
Smart Computing , url=http://www.smartcomputing.com/editorial/article.asp?article=articles/archive/l0504/9834/9834.asp&guid= , title=How To... Compress Hard Drives , date=April 1999
Compression
Compression
Utility software types
Compression file systems