HOME

TheInfoList



OR:

A binary prefix is a unit prefix for multiples of units. It is most often used in data processing, data transmission, and digital information, principally in association with the bit and the
byte The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable uni ...
, to indicate multiplication by a power of 2. As shown in the table to the right there are two sets of symbols for binary prefixes, one set established by International Electrotechnical Commission (IEC) and several other standards and trade organizations using two-letter symbols, e.g. ''Mi'' indicating with a second set established by semiconductor industry convention using one-letter symbols, e.g., ''M'' also indicating . In most contexts, industry uses the multipliers ''kilo'' (''k''), ''mega'' (''M''), ''giga'' (''G''), etc., in a manner consistent with their meaning in the International System of Units (SI), namely as powers of 1000. For example, a 500-gigabyte hard disk holds bytes, and a 1 Gbit/s (gigabit per second)
Ethernet Ethernet () is a family of wired computer networking technologies commonly used in local area networks (LAN), metropolitan area networks (MAN) and wide area networks (WAN). It was commercially introduced in 1980 and first standardized in 1 ...
connection transfers data at nominal speed of bit/s. In contrast with the ''binary prefix'' usage, this use is described as a ''decimal prefix'', as 1000 is a
power of 10 A power of 10 is any of the integer powers of the number ten; in other words, ten multiplied by itself a certain number of times (when the power is a positive integer). By definition, the number one is a power (the zeroth power) of ten. The fi ...
(103). The computer industry has historically in citations of main memory ( RAM) capacity used the units ''kilobyte'', ''megabyte'', and ''gigabyte'', and the corresponding symbols ''KB'', ''MB'', and ''GB'', in a binary sense: ''gigabyte'' customarily means bytes. As this is a power of 1024, and 1024 is a
power of two A power of two is a number of the form where is an integer, that is, the result of exponentiation with number two as the base and integer  as the exponent. In a context where only integers are considered, is restricted to non-negat ...
(210), these three usages are referred to as a binary prefixes; they were defined by the
Joint Electron Device Engineering Council A joint or articulation (or articular surface) is the connection made between bones, ossicles, or other hard structures in the body which link an animal's skeletal system into a functional whole.Saladin, Ken. Anatomy & Physiology. 7th ed. McGraw ...
(JEDEC) for
semiconductor A semiconductor is a material which has an electrical conductivity value falling between that of a conductor, such as copper, and an insulator, such as glass. Its resistivity falls as its temperature rises; metals behave in the opposite way ...
s and are used by some
operating system An operating system (OS) is system software that manages computer hardware, software resources, and provides common daemon (computing), services for computer programs. Time-sharing operating systems scheduler (computing), schedule tasks for ef ...
s. The use of the same unit prefixes with two different meanings has caused confusion. Starting around 1998, the IEC and several other standards and trade organizations attempted to address the ambiguity by publishing standards and recommendations for a set of binary prefixes that refer exclusively to powers of 1024. Accordingly, the US
National Institute of Standards and Technology The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into physical s ...
(NIST) requires that SI prefixes be used only in the decimal sense: kilobyte and megabyte denote one thousand bytes and one million bytes respectively (consistent with SI), while new terms such as kibibyte, mebibyte, and gibibyte, having the symbols KiB, MiB, and GiB, denote 1024 bytes, bytes, and bytes, respectively. In 2008, the IEC prefixes were incorporated into the ISO/IEC 80000 standard alongside the decimal prefixes of the international standard system of units. In response to litigation over the use of metric prefixes, the United States District Court for the Northern District of California includes a judicial notice that "the U.S. Congress has deemed the decimal definition of gigabyte to be the 'preferred' one for the purposes of 'U.S. trade and commerce.


History


Early prefixes

The original metric system adopted by France in 1795 included two binary prefixes named ''
double A double is a look-alike or doppelgänger; one person or being that resembles another. Double, The Double or Dubble may also refer to: Film and television * Double (filmmaking), someone who substitutes for the credited actor of a character * ...
-'' (2×) and ''
demi Demi is a feminine given name with Greek and Latin roots. It was originally a nickname of Demetria, the feminine form of the masculine name Demetrius, which is itself the Latin and English spelling of the Greek name Demetrios. It is also an Alb ...
-'' (×). However, these were not retained when the SI prefixes were internationally adopted by the 11th CGPM conference in 1960.


Main memory

Early computers used one of two addressing methods to access the system memory; binary (base 2) or decimal (base 10). For example, the IBM 701 (1952) used binary and could address 2048 words of 36 bits each, while the IBM 702 (1953) used decimal and could address ten thousand 7-bit words. By the mid-1960s, binary addressing had become the standard architecture in most computer designs, and main memory sizes were most commonly powers of two. This is the most natural configuration for memory, as all combinations of their address lines map to a valid address, allowing easy aggregation into a larger block of memory with contiguous addresses. Early computer system documentation would specify the memory size with an exact number such as 4096, 8192, or words of storage. These are all
powers of two A power of two is a number of the form where is an integer, that is, the result of exponentiation with number two as the base and integer  as the exponent. In a context where only integers are considered, is restricted to non-negati ...
, and furthermore are small multiples of 210, or 1024. As storage capacities increased, several different methods were developed to abbreviate these quantities. The method most commonly used today uses prefixes such as kilo, mega, giga, and corresponding symbols K, M, and G, which the computer industry originally adopted from the metric system. The prefixes ''kilo-'' and ''mega-'', meaning 1000 and respectively, were commonly used in the electronics industry before World War II. Along with ''giga-'' or G-, meaning , they are now known as SI prefixes after the International System of Units (SI), introduced in 1960 to formalize aspects of the metric system. The International System of Units does not define units for digital information but notes that the SI prefixes may be applied outside the contexts where base units or derived units would be used. But as computer main memory in a binary-addressed system is manufactured in sizes that were easily expressed as multiples of 1024, ''kilobyte'', when applied to computer memory, came to be used to mean 1024 bytes instead of 1000. This usage is not consistent with the SI. Compliance with the SI requires that the prefixes take their 1000-based meaning, and that they are not to be used as placeholders for other numbers, like 1024. The use of K in the binary sense as in a "32K core" meaning words, i.e., words, can be found as early as 1959. Note: the IBM 704 core memory units had 4096 36-bit words. Up to words could be installed "The 8K core stores were getting fairly common in this country in 1954. The 32K store started mass production in 1956; it is the standard now for large machines and at least 200 machines of the size (or its equivalent in the character addressable machines) are in existence today (and at least 100 were in existence in mid-1959)." Note: The IBM 1401 was a character addressable computer. Gene Amdahl's seminal 1964 article on IBM System/360 used "1K" to mean 1024. Figure 1 gives storage (memory) capacity ranges of the various models in "Capacity 8-bit bytes, 1 K = 1024" This style was used by other computer vendors, the CDC 7600 ''System Description'' (1968) made extensive use of K as 1024. Thus the first binary prefix was born. Another style was to truncate the last three digits and append K, essentially using K as a decimal prefix similar to SI, but always truncating to the next lower whole number instead of rounding to the nearest. The exact values words, words and words would then be described as "32K", "65K" and "131K". (If these values had been rounded to nearest they would have become 33K, 66K, and 131K, respectively.) This style was used from about 1965 to 1975. These two styles (K = 1024 and truncation) were used loosely around the same time, sometimes by the same company. In discussions of binary-addressed memories, the exact size was evident from context. (For memory sizes of "41K" and below, there is no difference between the two styles.) The HP 21MX real-time computer (1974) denoted (which is 192×1024) as "196K" and as "1M", while the HP 3000 business computer (1973) could have "64K", "96K", or "128K" bytes of memory. The "truncation" method gradually waned. Capitalization of the letter K became the ''de facto'' standard for binary notation, although this could not be extended to higher powers, and use of the lowercase k did persist. Nevertheless, the practice of using the SI-inspired "kilo" to indicate 1024 was later extended to "megabyte" meaning 10242 () bytes, and later "gigabyte" for 10243 () bytes. For example, a "512 megabyte" RAM module is bytes (512 × , or ), rather than . The symbols Kbit, Kbyte, Mbit and Mbyte started to be used as "binary units"—"bit" or "byte" with a multiplier that is a power of 1024—in the early 1970s. For a time, memory capacities were often expressed in K, even when M could have been used: The IBM System/370 Model 158 brochure (1972) had the following: "Real storage capacity is available in 512K increments ranging from 512K to 2,048K bytes." Megabyte was used to describe the 22-bit addressing of DEC PDP-11/70 (1975) and gigabyte the 30-bit addressing DEC VAX-11/780 (1977). In 1998, the International Electrotechnical Commission IEC introduced the binary prefixes kibi, mebi, gibi, etc., to mean 1024, 10242, 10243 etc., so that 1048576 bytes could be referred to unambiguously as 1 mebibyte. The IEC prefixes were defined for use alongside the International System of Quantities (ISQ) in 2009.


Disk drives

The disk drive industry has followed a different pattern. Disk drive capacity is generally specified with unit prefixes with decimal meaning, in accordance to SI practices. Unlike computer main memory, disk architecture or construction does not mandate or make it convenient to use binary multiples. Drives can have any practical number of platters or surfaces, and the count of tracks, as well as the count of sectors per track may vary greatly between designs. The first commercially sold disk drive, the IBM 350, had fifty physical disk platters containing a total of sectors of 100 characters each, for a total quoted capacity of 5 million characters. It was introduced in September 1956. In the 1960s most disk drives used IBM's variable block length format, called
Count Key Data Count key data (CKD) is a direct-access storage device (DASD) data recording format introduced in 1964, by IBM with its IBM System/360 and still being emulated on IBM mainframes. It is a self-defining format with each data record represented by ...
(CKD). Any block size could be specified up to the maximum track length. Since the block headers occupied space, the usable capacity of the drive was dependent on the block size. Blocks ("records" in IBM's terminology) of 88, 96, 880 and 960 were often used because they related to the fixed block size of 80- and 96-character punch cards. The drive capacity was usually stated under conditions of full track record blocking. For example, the 100-megabyte 3336 disk pack only achieved that capacity with a full track block size of bytes. Floppy disks for the IBM PC and compatibles quickly standardized on 512-byte sectors, so two sectors were easily referred to as "1K". The 3.5-inch "360 KB" and "720 KB" had 720 (single-sided) and 1440 sectors (double-sided) respectively. When the High Density "1.44 MB" floppies came along, with 2880 of these 512-byte sectors, that terminology represented a hybrid binary-decimal definition of "1 MB" = 210 × 103 = 1 024 000 bytes. In contrast, hard disk drive manufacturers used ''megabytes'' or ''MB'', meaning 106 bytes, to characterize their products as early as 1974. By 1977, in its first edition, Disk/Trend, a leading hard disk drive industry marketing consultancy segmented the industry according to MBs (decimal sense) of capacity. One of the earliest hard disk drives in personal computing history, the Seagate
ST-412 The ST-506 and ST-412 (sometimes written ST506 and ST412) were early hard disk drive products introduced by Seagate in 1980 and 1981 respectively, that later became construed as hard disk drive interfaces: the ST-506 disk interface and the ST- ...
, was specified as ''Formatted: 10.0 Megabytes''. The drive contains four heads and active surfaces (tracks per cylinder), 306 cylinders. When formatted with a sector size of 256 bytes and 32 sectors/track it has a capacity of . This drive was one of several types installed in the IBM PC/XT and extensively advertised and reported as a "10 MB" (formatted) hard disk drive. The cylinder count of 306 is not conveniently close to any power of 1024; operating systems and programs using the customary binary prefixes show this as 9.5625 MB. Many later drives in the personal computer market used 17 sectors per track; still later,
zone bit recording In computer storage, zone bit recording (ZBR) is a method used by disk drives to optimise the tracks for increased data capacity. It does this by placing more sectors per zone on outer tracks than on inner tracks. This contrasts with other approa ...
was introduced, causing the number of sectors per track to vary from the outer track to the inner. The hard drive industry continues to use decimal prefixes for drive capacity, as well as for transfer rate. For example, a "300 GB" hard drive offers slightly more than , or , bytes, not (which would be about ). Operating systems such as
Microsoft Windows Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for ...
that display hard drive sizes using the customary binary prefix "GB" (as it is used for RAM) would display this as "279.4 GB" (meaning bytes, or ). On the other hand,
macOS macOS (; previously OS X and originally Mac OS X) is a Unix operating system developed and marketed by Apple Inc. since 2001. It is the primary operating system for Apple's Mac computers. Within the market of desktop and la ...
has since version 10.6 shown hard drive size using decimal prefixes (thus matching the drive makers' packaging). (Previous versions of Mac OS X used binary prefixes.) Disk drive manufacturers sometimes use ''both'' IEC and SI prefixes with their standardized meanings. Seagate has specified data transfer rates in select manuals of some hard drives with both units, with the conversion between units clearly shown and the numeric values adjusted accordingly.Seagate Savvio 10K.5 SAS Product Manual
100628561, Rev D, March 2011, sec 5.2.3, p. 10 (18th page of the pdf), states the drive's sustained transfer speed as "89 to 160
MiB The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable unit ...
/s" on one line, and "93 to 168 MB/s" on the next line.
"Advanced Format" drives uses the term "4K sectors", which it defines as having size of "4096 (4K) bytes".


Information transfer and clock rates

Computer
clock A clock or a timepiece is a device used to measure and indicate time. The clock is one of the oldest human inventions, meeting the need to measure intervals of time shorter than the natural units such as the day, the lunar month and ...
frequencies are always quoted using SI prefixes in their decimal sense. For example, the internal clock frequency of the original IBM PC was 4.77 MHz, that is . Similarly, digital information transfer rates are quoted using decimal prefixes: * The ATA-100 disk interface refers to bytes per second * A "56K" modem refers to bits per second * SATA-2 has a raw bit rate of 3 Gbit/s = bits per second * PC2-6400 RAM transfers bytes per second * Firewire 800 has a raw rate of bits per second * In 2011, Seagate specified the sustained transfer rate of some hard disk drive models with both decimal and IEC binary prefixes.


Historical standardization of dual definitions

By the mid-1970s it was common to see K meaning 1024 and the occasional M meaning for words or bytes of main memory (RAM) while K and M were commonly used with their decimal meaning for disk storage. In the 1980s, as capacities of both types of devices increased, the SI prefix G, with SI meaning, was commonly applied to disk storage, while M in its binary meaning, became common for computer memory. In the 1990s, the prefix G, in its binary meaning, became commonly used for computer memory capacity. The first terabyte (SI prefix, bytes) hard disk drive was introduced in 2007. The dual usage of the kilo (K), mega (M), and giga (G) prefixes as both powers of 1000 and powers of 1024 has been recorded in standards and dictionaries. For example, the obsolete standard ANSI/IEEE Std 1084-1986 defined dual uses for kilo and mega. The binary units Kbyte and Mbyte were formally defined in the obsolete standard ANSI/IEEE Std 1212-1991. Many dictionaries have noted the practice of using customary prefixes to indicate binary multiples. Oxford online dictionary historically defined, for example, megabyte as: "Computing: a unit of information equal to one million or (strictly) bytes." The units Kbyte, Mbyte, and Gbyte were found in the trade press and in IEEE journals. Gigabyte was formally defined in obsolete standard IEEE Std 610.10-1994 as either or 230 bytes. Kilobyte, Kbyte, and KB were all defined in the obsolete standard IEEE 100–2000. "kB See kilobyte." "Kbyte Kilobyte. Indicates 210 bytes." "Kilobyte Either 1000 or 210 or 1024 bytes." The standard also defines megabyte and gigabyte with a note that an alternative notation for base 2 is under development. The hardware industry measures system memory (RAM) using the binary meaning while magnetic disk storage uses the SI definition. However, many exceptions exist. Labeling of one type of diskette uses the megabyte to denote bytes. In the optical disks market,
compact disc The compact disc (CD) is a digital optical disc data storage format that was co-developed by Philips and Sony to store and play digital audio recordings. In August 1982, the first compact disc was manufactured. It was then released in O ...
s use ''MB'' to mean 10242 bytes while DVDs use ''GB'' to mean 10003 bytes.


Inconsistent use of units


Deviation between powers of 1024 and powers of 1000

Computer storage has become cheaper per unit and thereby larger, by many orders of magnitude since "K" was first used to mean 1024. Because both the SI and "binary" meanings of kilo, mega, etc., are based on powers of 1000 or 1024 rather than simple multiples, the difference between 1M "binary" and 1M "decimal" is proportionally larger than that between 1K "binary" and 1k "decimal", and so on up the scale. The relative difference between the values in the binary and decimal interpretations increases, when using the SI prefixes as the base, from 2.4% for kilo to nearly 27% for the quetta prefix. Although the prefixes ronna and quetta have been defined, as of 2022 no names have been officially assigned to the corresponding binary prefixes.


Consumer confusion

In the early days of computers (roughly, prior to the advent of personal computers) there was little or no consumer confusion because of the technical sophistication of the buyers and their familiarity with the products. In addition, it was common for computer manufacturers to specify their products with capacities in full precision. In the personal computing era, one source of consumer confusion is the difference in the way many operating systems display hard drive sizes, compared to the way hard drive manufacturers describe them.
Hard drives A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magn ...
are specified and sold using "GB" and "TB" in their decimal meaning: one billion and one trillion bytes. Many operating systems and other software, however, display hard drive and file sizes using "MB", "GB" or other SI-looking prefixes in their binary sense, just as they do for displays of RAM capacity. For example, many such systems display a hard drive marketed as "1 TB" as "931 GB". The earliest known presentation of hard disk drive capacity by an operating system using "KB" or "MB" in a binary sense is 1984; earlier operating systems generally presented the hard disk drive capacity as an exact number of bytes, with no prefix of any sort, for example, in the output of the DOS command '
CHKDSK In computing, CHKDSK (short for "check disk") is a system tool and command in DOS, Digital Research FlexOS, IBM/ Toshiba 4690 OS, IBM OS/2, Microsoft Windows and related operating systems. It verifies the file system integrity of a volu ...
'.


Legal disputes

The different interpretations of disk size prefixes has led to class action lawsuits against digital storage manufacturers. These cases involved both flash memory and hard disk drives.


Early cases

Early cases (2004–2007) were settled prior to any court ruling with the manufacturers admitting no wrongdoing but agreeing to clarify the storage capacity of their products on the consumer packaging. Accordingly, many flash memory and hard disk manufacturers have disclosures on their packaging and web sites clarifying the formatted capacity of the devices or defining MB as 1 million bytes and 1 GB as 1 billion bytes.


''Willem Vroegh v. Eastman Kodak Company''

On 20 February 2004, Willem Vroegh filed a lawsuit against Lexar Media, Dane–Elec Memory, Fuji Photo Film USA, Eastman Kodak Company, Kingston Technology Company, Inc., Memorex Products, Inc.; PNY Technologies Inc., SanDisk Corporation, Verbatim Corporation, and
Viking Interworks Sanmina Corporation is an American electronics manufacturing services (EMS) provider headquartered in San Jose, California that serves original equipment manufacturers in communications and computer hardware fields. The firm has nearly 80 manufac ...
alleging that their descriptions of the capacity of their flash memory cards were false and misleading. Vroegh claimed that a 256 MB Flash Memory Device had only 244 MB of accessible memory. "Plaintiffs allege that Defendants marketed the memory capacity of their products by assuming that one megabyte equals one million bytes and one gigabyte equals one billion bytes." The plaintiffs wanted the defendants to use the customary values of 10242 for megabyte and 10243 for gigabyte. The plaintiffs acknowledged that the IEC and IEEE standards define a MB as one million bytes but stated that the industry has largely ignored the IEC standards. The parties agreed that manufacturers could continue to use the decimal definition so long as the definition was added to the packaging and web sites. The consumers could apply for "a discount of ten percent off a future online purchase from Defendants' Online Stores Flash Memory Device".


''Orin Safier v. Western Digital Corporation''

On 7 July 2005, an action entitled ''Orin Safier v. Western Digital Corporation, et al.'' was filed in the Superior Court for the City and County of San Francisco, Case No. CGC-05-442812. The case was subsequently moved to the Northern District of California, Case No. 05-03353 BZ. Although Western Digital maintained that their usage of units is consistent with "the indisputably correct industry standard for measuring and describing storage capacity", and that they "cannot be expected to reform the software industry", they agreed to settle in March 2006 with 14 June 2006 as the Final Approval hearing date. Western Digital offered to compensate customers with a free download of backup and recovery software valued at US$30. They also paid $ in fees and expenses to San Francisco lawyers Adam Gutride and Seth Safier, who filed the suit. The settlement called for Western Digital to add a disclaimer to their later packaging and advertising.


''Cho v. Seagate Technology (US) Holdings, Inc.''

A lawsuit (''Cho v. Seagate Technology (US) Holdings, Inc.'', San Francisco Superior Court, Case No. CGC-06-453195) was filed against Seagate Technology, alleging that Seagate overrepresented the amount of usable storage by 7% on hard drives sold between 22 March 2001 and 26 September 2007. The case was settled without Seagate admitting wrongdoing, but agreeing to supply those purchasers with free backup software or a 5% refund on the cost of the drives.


''Dinan et al. v. SanDisk LLC''

On 22 January 2020, the district court of the Northern District of California ruled in favor of the defendant, SanDisk, upholding its use of "GB" to mean .


Unique binary prefixes


Early suggestions

While early computer scientists typically used k to mean 1000, some recognized the convenience that would result from working with multiples of 1024 and the confusion that resulted from using the same prefixes for two different meanings. Several proposals for unique binary prefixesA binary prefix is a prefix that denotes a power of 1024. For example, in the computer industry's customary practice, one "megabyte" of RAM is 10242 bytes of RAM, one "gigabyte" of RAM is 10243 bytes of RAM, and so on. In the IEC system, these would be expressed as one "mebibyte" and one "gibibyte", respectively. Both are "binary prefixes" in these usages. were made in 1968. Donald Morrison proposed to use the Greek letter kappa ( κ) to denote 1024, κ2 to denote 10242, and so on. (At the time, memory size was small, and only K was in widespread use.)
Wallace Givens James Wallace Givens, Jr. (December 14, 1910 – March 5, 1993) was a mathematician and a pioneer in computer science. He is the eponym of the well-known Givens rotations. Born the son of two teachers in Alberene, Virginia (a small town near ...
responded with a proposal to use bK as an abbreviation for 1024 and bK2 or bK2 for 10242, though he noted that neither the Greek letter nor lowercase letter b would be easy to reproduce on computer printers of the day. Bruce Alan Martin of Brookhaven National Laboratory further proposed that the prefixes be abandoned altogether, and the letter B be used for base-2 exponents, similar to E in
decimal scientific notation Scientific notation is a way of expressing real numbers, numbers that are too large or too small (usually would result in a long string of digits) to be conveniently written in decimal form. It may be referred to as scientific form or standard ...
, to create shorthands like 3B20 for , a convention still used on some calculators to present binary floating point-numbers today. None of these gained much acceptance, and capitalization of the letter K became the ''de facto'' standard for indicating a factor of 1024 instead of 1000, although this could not be extended to higher powers. As the discrepancy between the two systems increased in the higher-order powers, more proposals for unique prefixes were made. In 1996, Markus Kuhn proposed a system with ''di'' prefixes, like the "dikilobyte" (K2B or K2B). Donald Knuth, who uses decimal notation like 1 MB = 1000 kB,
The Art of Computer Programming
'' Volume 1, Donald Knuth, pp. 24 and 94
proposed that the powers of 1024 be designated as "large kilobytes" and "large megabytes", with abbreviations KKB and MMB. Double prefixes were already abolished from SI, however, having a multiplicative meaning ("MMB" would be equivalent to "TB"), and this proposed usage never gained any traction.


IEC prefixes

The set of binary prefixes that were eventually adopted, now referred to as the "IEC prefixes",The term IEC binary prefix or IEC prefix refers to the prefixes such as kibi, mebi, gibi, etc., or their corresponding symbols Ki, Mi, Gi, etc., first adopted by the International Electrotechnical Commission (IEC). Such prefixes are commonly used with the units bits or bytes (or less commonly, compound units derived from them such as bytes/second) and always denote powers of 1024; that is, they are always used as binary prefixes. Thus 1 mebibyte of RAM is 10242 bytes of RAM, one gibibyte or 1 GiB of RAM is 10243 bytes, and so on. were first proposed by the
International Union of Pure and Applied Chemistry The International Union of Pure and Applied Chemistry (IUPAC ) is an international federation of National Adhering Organizations working for the advancement of the chemical sciences, especially by developing nomenclature and terminology. It is ...
's (IUPAC) Interdivisional Committee on Nomenclature and Symbols (IDCNS) in 1995. At that time, it was proposed that the terms kilobyte and megabyte be used only for 103 bytes and 106 bytes, respectively. The new prefixes ''kibi'' (kilobinary), ''mebi'' (megabinary), ''gibi'' (gigabinary) and ''tebi'' (terabinary) were also proposed at the time, and the proposed symbols for the prefixes were kb, Mb, Gb and Tb respectively, rather than Ki, Mi, Gi and Ti. The proposal was not accepted at the time. The Institute of Electrical and Electronics Engineers (IEEE) began to collaborate with the
International Organization for Standardization The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in A ...
(ISO) and International Electrotechnical Commission (IEC) to find acceptable names for binary prefixes. IEC proposed ''kibi'', ''mebi'', ''gibi'' and ''tebi'', with the symbols Ki, Mi, Gi and Ti respectively, in 1996. The names for the new prefixes are derived from the original SI prefixes combined with the term ''binary'', but contracted, by taking the first two letters of the SI prefix and "bi" from binary. The first letter of each such prefix is identical to the corresponding SI prefix, except that "k" is capitalised to "K". The IEEE decided that their standards would use the prefixes ''kilo'', etc. with their metric definitions, but allowed the binary definitions to be used in an interim period as long as such usage was explicitly pointed out on a case-by-case basis.


Adoption by IEC, NIST and ISO

In January 1999, the IEC published the first international standard ( IEC 60027-2 Amendment 2) with the new prefixes, extended up to ''pebi'' (Pi) and ''exbi'' (Ei). The IEC 60027-2 Amendment 2 also states that the IEC position is the same as that of BIPM (the body that regulates the SI system); the SI prefixes retain their definitions in powers of 1000 and are never used to mean a power of 1024. In usage, products and concepts typically described using powers of 1024 would continue to be, but with the new IEC prefixes. For example, a memory module of bytes () would be referred to as 512 MiB or 512 mebibytes instead of 512 MB or 512 megabytes. Conversely, since hard drives have historically been marketed using the SI convention that "giga" means , a "500 GB" hard drive would still be labeled as such. According to these recommendations, operating systems and other software would also use binary and SI prefixes in the same way, so the purchaser of a "500 GB" hard drive would find the operating system reporting either "500 GB" or "466 GiB", while bytes of RAM would be displayed as "512 MiB". The second edition of the standard, published in 2000,IEC 60027-2 (2000-11) Ed. 2.0 defined them only up to ''exbi'', but in 2005, the third edition added prefixes ''zebi'' and ''yobi'', thus matching all then-defined SI prefixes with binary counterparts. The harmonized ISO/
IEC The International Electrotechnical Commission (IEC; in French: ''Commission électrotechnique internationale'') is an international standards organization that prepares and publishes international standards for all electrical, electronic and r ...
IEC 80000-13:2008 standard cancels and replaces subclauses 3.8 and 3.9 of IEC 60027-2:2005 (those defining prefixes for binary multiples). The only significant change is the addition of explicit definitions for some quantities. In 2009, the prefixes kibi-, mebi-, etc. were defined by
ISO 80000-1 ISO 80000 or IEC 80000 is an international standard introducing the International System of Quantities (ISQ). It was developed and promulgated jointly by the International Organization for Standardization (ISO) and the International Electrotech ...
in their own right, independently of the kibibyte, mebibyte, and so on. The BIPM standard JCGM 200:2012 "International vocabulary of metrology – Basic and general concepts and associated terms (VIM), 3rd edition" lists the IEC binary prefixes and states "SI prefixes refer strictly to powers of 10, and should not be used for powers of 2. For example, 1 kilobit should not be used to represent bits (210 bits), which is 1 kibibit." The additional decimal prefixes '' ronna-'' for 10009 and ''
quetta- A metric prefix is a unit prefix that precedes a basic unit of measure to indicate a multiple or submultiple of the unit. All metric prefixes used today are decadic. Each prefix has a unique symbol that is prepended to any unit symbol. The pre ...
'' for 100010 were adopted by the
International Bureau of Weights and Measures The International Bureau of Weights and Measures (french: Bureau international des poids et mesures, BIPM) is an intergovernmental organisation, through which its 59 member-states act together on measurement standards in four areas: chemistry ...
(BIPM) in 2022. The natural binary counterparts to ''ronna-'' and ''quetta-'' were suggested in a consultation paper of the International Committee for Weights and Measures' Consultative Committee for Units (CCU) as ''robi-'' (Ri, 10249) and ''quebi-'' (Qi, 102410). , no corresponding binary prefixes have been adopted by the IEC and ISO.


Other standards bodies and organizations

The IEC standard binary prefixes are now supported by other standardization bodies and technical organizations. The United States
National Institute of Standards and Technology The National Institute of Standards and Technology (NIST) is an agency of the United States Department of Commerce whose mission is to promote American innovation and industrial competitiveness. NIST's activities are organized into physical s ...
(NIST) supports the ISO/IEC standards for "Prefixes for binary multiples" and has a web page documenting them, describing and justifying their use. NIST suggests that in English, the first syllable of the name of the binary-multiple prefix should be pronounced in the same way as the first syllable of the name of the corresponding SI prefix, and that the second syllable should be pronounced as ''bee''. NIST has stated the SI prefixes "refer strictly to powers of 10" and that the binary definitions "should not be used" for them. The microelectronics industry standards body JEDEC describes the IEC prefixes in its online dictionary with a note: "The definitions of kilo, giga, and mega based on powers of two are included only to reflect common usage." The JEDEC standards for semiconductor memory use the customary prefix symbols K, M and G in the binary sense. Free registration required to download the standard. On 19 March 2005, the IEEE standard
IEEE 1541-2002 IEEE 1541-2002 is a standard issued in 2002 by the Institute of Electrical and Electronics Engineers (IEEE) concerning the use of prefixes for binary multiples of units of measurement related to digital electronics and computing. While the In ...
("Prefixes for Binary Multiples") was elevated to a full-use standard by the IEEE Standards Association after a two-year trial period. However, , the IEEE Publications division does not require the use of IEC prefixes in its major magazines such as ''Spectrum'' ''"A lot can happen in a decade. You can hold the Nokia N800 in your hand, yet it's a near-exact match for a high-end desktop PC from 10 years ago. It has a 320-megahertz processor, 128 megabytes of RAM, and a few gigabytes of available mass storage."'' or ''Computer''. "The processor has a memory subsystem with separate first-level 32-Kbyte instruction and data caches, and a 512-Kbyte unified second-level cache." Authors are with IBM. The
International Bureau of Weights and Measures The International Bureau of Weights and Measures (french: Bureau international des poids et mesures, BIPM) is an intergovernmental organisation, through which its 59 member-states act together on measurement standards in four areas: chemistry ...
(BIPM), which maintains the International System of Units (SI), expressly prohibits the use of SI prefixes to denote binary multiples, and recommends the use of the IEC prefixes as an alternative since units of information are not included in SI. The Society of Automotive Engineers (SAE) prohibits the use of SI prefixes with anything but a power-of-1000 meaning, but does not recommend or otherwise cite the IEC binary prefixes. The European Committee for Electrotechnical Standardization ( CENELEC) adopted the IEC-recommended binary prefixes via the harmonization document HD 60027-2:2003-03. The European Union (EU) has required the use of the IEC binary prefixes since 2007.


Current practice in information technology

Most computer hardware uses SI prefixes to state capacity and define other performance parameters such as data rate. Main and cache memories are notable exceptions. Capacities of main memory and cache memory are usually expressed with customary binary prefixesAs used in this article, the term customary binary prefix or similar refers to prefixes such as kilo, mega, giga, etc., borrowed from the similarly named SI prefixes but used to denote a power of 1024. On the other hand, flash memory, like that found in solid state drives, mostly uses SI prefixes to state capacity. Some operating systems and other software continue to use the customary binary prefixes in displays of memory, disk storage capacity, and file size, but SI prefixesThe term SI prefix or similar refers to prefixes such as kilo, mega, giga, etc., defined by the SI system of units and ''always'' used to denote a power of 1000; in other words, always as decimal prefixes. in other areas such as network communication speeds and processor speeds. In the following subsections, unless otherwise noted, examples are first given using the common prefixes used in each case, and then followed by interpretation using other notation where appropriate.


Operating systems

Prior to the release of
Macintosh System Software Mac OS (originally System Software; retronym: Classic Mac OS) is the series of operating systems developed for the Macintosh family of personal computers by Apple Computer from 1984 to 2001, starting with System 1 and ending with Mac OS 9. Th ...
(1984), file sizes were typically reported by the operating system without any prefixes. Today, most operating systems report file sizes with prefixes. * The Linux kernel uses standards-compliant decimal and binary prefixes when booting up. However, many
Unix-like A Unix-like (sometimes referred to as UN*X or *nix) operating system is one that behaves in a manner similar to a Unix system, although not necessarily conforming to or being certified to any version of the Single UNIX Specification. A Unix-li ...
system utilities, such as the ls command, use powers of 1024 indicated as K/M (customary binary prefixes) if called with the "" option. They give the exact value in bytes otherwise. The GNU versions will also use powers of 10 indicated with k/M if called with the "" option. ** The Ubuntu Linux distribution uses the IEC prefixes for base-2 numbers as of the 10.10 release. *
Microsoft Windows Windows is a group of several proprietary graphical operating system families developed and marketed by Microsoft. Each family caters to a certain sector of the computing industry. For example, Windows NT for consumers, Windows Server for ...
reports file sizes and disk device capacities using the customary binary prefixes or, in a "Properties" dialog, using the exact value in bytes. * iOS 10 and earlier, Mac OS X Leopard and earlier and watchOS use the binary system (1 GB = ). Apple product specifications, iOS and macOS (including Mac OS X Snow Leopard: version 10.6) now report sizes using SI decimal prefixes (1 GB = bytes).


Software

, most software did not distinguish symbols for binary and decimal prefixes.A decimal prefix is a prefix that denotes a power of 1000. For example, "kilo" denotes 1000, "mega" denotes 10002 or one million, "giga" denotes 10003 or one billion, and so on. SI prefixes are decimal prefixes. The
IEC The International Electrotechnical Commission (IEC; in French: ''Commission électrotechnique internationale'') is an international standards organization that prepares and publishes international standards for all electrical, electronic and r ...
binary naming convention is not yet used universally. One of the stated goals of the introduction of the IEC prefixes was "to preserve the SI prefixes as unambiguous decimal multipliers". Programs such as
fdisk In computing, the fdisk command-line utility provides disk-partitioning functions, preparatory to defining file systems. fdisk features in the DOS, DR FlexOS, IBM OS/2, and Microsoft Windows operating systems, and in certain ports of ...
/
cfdisk cfdisk is a Linux partition editor, similar to fdisk, but with a different, curses-based user interface. It is part of the util-linux package of Linux utility programs. The current cfdisk implementation utilizes the libfdisk library
,
parted GNU Parted (the name being the conjunction of the two words PARTition and EDitor) is a free partition editor, used for creating and deleting partitions. This is useful for creating space for new operating systems, reorganising hard disk usage, ...
, and
apt-get Advanced package tool, or APT, is a free-software user interface that works with core libraries to handle the installation and removal of software on Debian, and Debian-based Linux distributions. APT simplifies the process of managing softwa ...
use SI prefixes with their decimal meaning. Image:GParted 0.3.5 -- 2008, 06.png, GNOME's partition editor uses IEC prefixes to display partition sizes. The total capacity of the 120×109-byte disk is displayed as "111.79  GiB" Image:GNOME System Monitor memory size and network rate.png, GNOME's system monitor uses IEC prefixes to show memory size and networking data rate. Image:Bittornado screenshot showing use of IEC and SI prefixes.png, BitTornado uses standard SI prefixes for data rates and IEC prefixes for file sizes Image:Deluge_using_Si_prefix_for_wiki_CD.png, Deluge (BitTorrent client) uses IEC prefixes for data rates as well as file sizes Example of the use of IEC binary prefixes in the
Linux Linux ( or ) is a family of open-source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991, by Linus Torvalds. Linux is typically packaged as a Linux distribution, whi ...
operating system displaying traffic volume on a network interface in kibibytes (KiB) and mebibytes (MiB), as obtained with the ifconfig utility:
eth0      Link encap:Ethernet  ..          RX packets:254804 errors:0 dropped:0 overruns:0 frame:0
          TX packets:756 errors:0 dropped:0 overruns:0 carrier:0
           ..          RX bytes:18613795 (17.7 MiB)  TX bytes:45708 (44.6 KiB)
Software that uses IEC binary prefixes for powers of 1024 ''and'' uses standard SI prefixes for powers of 1000 includes: * GNU Core Utilities * GParted * FreeDOS-32 * ifconfig * GNOME Network * SLIB *
Cygwin/X Cygwin/X is an implementation of the X Window System that runs under Microsoft Windows. It is part of the Cygwin project, and is installed using Cygwin's standard setup system. Cygwin/X is free software, licensed under the X11 License. Cygwin/ ...
* HTTrack * Pidgin (IM client) * Deluge * yafc * tnftp * WinSCP * MediaInfo Software that uses standard SI prefixes for powers of 1000, and does not use any prefixes for powers of 1024, includes: * Mac OS X v10.6 and later for hard drive and file sizes Software that supports decimal prefixes for powers of 1000 ''and'' binary prefixes for powers of 1024 (but does not follow SI or IEC nomenclature for this) includes: * 4DOS (uses lowercase letters as decimal and uppercase letters as binary prefixes)


Computer hardware

Hardware types that use powers-of-1024 multipliers, such as memory, continue to be marketed with customary binary prefixes.


Computer memory

Measurements of most types of electronic
memory Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remember ...
such as RAM and ROM are given using customary binary prefixes (kilo, mega, and giga). This includes some flash memory, like EEPROMs. For example, a "512-megabyte" memory module is bytes (512 × , or ). JEDEC Solid State Technology Association, the semiconductor engineering standardization body of the Electronic Industries Alliance (EIA), continues to include the customary binary definitions of kilo, mega and giga in their ''Terms, Definitions, and Letter Symbols'' document, (Requires free registration and login.) and uses those definitions in later memory standards (See also
JEDEC memory standards The JEDEC memory standards are the specifications for semiconductor memory circuits and similar storage devices promulgated by the Joint Electron Device Engineering Council (JEDEC) Solid State Technology Association, a semiconductor trade and eng ...
.) Many computer programming tasks reference memory in terms of
powers of two A power of two is a number of the form where is an integer, that is, the result of exponentiation with number two as the base and integer  as the exponent. In a context where only integers are considered, is restricted to non-negati ...
because of the inherent binary design of current hardware addressing systems. For example, a 16-bit processor register can reference at most items (bytes, words, or other objects); this is conveniently expressed as "64K" items. An operating system might map memory as 4096-byte
pages Page most commonly refers to: * Page (paper), one side of a leaf of paper, as in a book Page, PAGE, pages, or paging may also refer to: Roles * Page (assistance occupation), a professional occupation * Page (servant), traditionally a young mal ...
, in which case exactly 8192 pages could be allocated within bytes of memory: "8K" (8192) pages of "4 kilobytes" (4096 bytes) each within "32 megabytes" (32 MiB) of memory.


Hard disk drives

All hard disk drive manufacturers state capacity using SI prefixes.


Flash drives

USB flash drives, flash-based memory cards like
CompactFlash CompactFlash (CF) is a flash memory mass storage device used mainly in portable electronic devices. The format was specified and the devices were first manufactured by SanDisk in 1994. CompactFlash became one of the most successful of the e ...
or
Secure Digital Secure Digital, officially abbreviated as SD, is a proprietary non-volatile flash memory card format developed by the SD Association (SDA) for use in portable devices. The standard was introduced in August 1999 by joint efforts between SanDis ...
, and flash-based solid-state drives (SSDs) use SI prefixes; for example, a "256 MB" flash card provides at least 256 million bytes (), not 256×1024×1024 (). The flash memory chips inside these devices contain considerably more than the quoted capacities, but much like a traditional hard drive, some space is reserved for internal functions of the flash drive. These include wear leveling, error correction, sparing, and metadata needed by the device's internal firmware.


Floppy drives

Floppy disks have existed in numerous physical and logical formats, and have been sized inconsistently. In part, this is because the end user capacity of a particular disk is a function of the controller hardware, so that the same disk could be formatted to a variety of capacities. In many cases, the media are marketed without any indication of the end user capacity, as for example, DSDD, meaning double-sided double-density. The last widely adopted diskette was the 3.5-inch high density. This has a formatted capacity of bytes or 1440 KB (1440 × 1024, using "KB" in the customary binary sense). These are marketed as "HD", or "1.44 MB" or both. This usage creates a third definition of "megabyte" as 1000×1024 bytes. When these disks were commonplace, most operating systems displayed the capacity using "MB" in the customary binary sense, resulting in a display of "1.4 MB" (). Some users have noticed the missing 0.04 MB and both Apple and Microsoft have support bulletins referring to them as 1.4 MB. "The 1.44-megabyte (MB) value associated with the 3.5-inch disk format does not represent the actual size or free space of these disks. Although its size has been popularly called 1.44 MB, the correct size is actually 1.40 MB." The earlier "1200 KB" ( bytes) 5.25-inch diskette sold with the IBM PC AT was marketed as "1.2 MB" (). The largest 8-inch diskette formats could contain more than a megabyte, and the capacities of those devices were often irregularly specified in megabytes, also without controversy. Older and smaller diskette formats were usually identified as an accurate number of (binary) KB, for example the Apple Disk II described as "140KB" had a -byte capacity, and the original "360KB" double sided, double density disk drive used on the IBM PC had a -byte capacity. In many cases diskette hardware was marketed based on unformatted capacity, and the overhead required to format sectors on the media would reduce the nominal capacity as well (and this overhead typically varied based on the size of the formatted sectors), leading to more irregularities.


Optical discs

The capacities of most
optical disc In computing and optical disc recording technologies, an optical disc (OD) is a flat, usually circular disc that encodes binary data ( bits) in the form of pits and lands on a special material, often aluminum, on one of its flat surface ...
storage media like DVD, Blu-ray Disc, HD DVD and magneto-optical (MO) are given using SI decimal prefixes. A "4.7 GB" DVD has a nominal capacity of about 4.38  GiB.Understanding Recordable and Rewritable DVD
However, CD capacities are always given using customary binary prefixes. Thus a "700-MB" (or "80-minute") CD has a nominal capacity of about 700 
MiB The byte is a unit of digital information that most commonly consists of eight bits. Historically, the byte was the number of bits used to encode a single character of text in a computer and for this reason it is the smallest addressable unit ...
(approximately 730 MB).


Tape drives and media

Tape drive and media manufacturers use SI decimal prefixes to identify capacity.


Data transmission and clock rates

Certain units are always used with SI decimal prefixes even in computing contexts. Two examples are
hertz The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), equivalent to one event (or cycle) per second. The hertz is an SI derived unit whose expression in terms of SI base units is s−1, meaning that o ...
(Hz), which is used to measure the clock rates of electronic components, and to bit/s and B/s, which are used to measure data transmission speed. * A 1 GHz processor receives clock ticks per second. * A sound file sampled at has samples per second. * A MP3 stream consumes bits (16 kilobytes, ) per second. * A Internet connection can transfer bits per second ( bytes per second ≈ , assuming an 8-bit byte and no overhead) * A Ethernet connection can transfer at nominal speed of bits per second ( bytes per second ≈ , assuming an 8-bit byte and no overhead) * A
56k A modulator-demodulator or modem is a computer hardware device that converts data from a digital format into a format suitable for an analog transmission medium such as telephone or radio. A modem transmits data by modulating one or more carr ...
modem transfers bits per second ≈ . Bus clock speeds and therefore bandwidths are both quoted using SI decimal prefixes. * PC3200 memory on a double data rate bus, transferring 8 bytes per cycle with a clock speed of ( cycles per second) has a bandwidth of = B/s = (about ). * A PCI-X bus at ( cycles per second), 64 bits per transfer, has a bandwidth of transfers per second × 64 bits per transfer = bit/s, or B/s, usually quoted as (about ).


Use by industry

IEC prefixes are used by Toshiba, IBM, HP to advertise or describe some of their products. According to one HP brochure

" reduce confusion, vendors are pursuing one of two remedies: they are changing SI prefixes to the new binary prefixes, or they are recalculating the numbers as powers of ten." The IBM Data Center also uses IEC prefixes to reduce confusion. The IBM Style Guide reads
To help avoid inaccuracy (especially with the larger prefixes) and potential ambiguity, the International Electrotechnical Commission (IEC) in 2000 adopted a set of prefixes specifically for binary multipliers (See IEC 60027-2). Their use is now supported by the United States National Institute of Standards and Technology (NIST) and incorporated into ISO 80000. They are also required by EU law and in certain contexts in the US. However, most documentation and products in the industry continue to use SI prefixes when referring to binary multipliers. In product documentation, follow the same standard that is used in the product itself (for example, in the interface or firmware). Whether you choose to use IEC prefixes for powers of 2 and SI prefixes for powers of 10, or use SI prefixes for a dual purpose ... be consistent in your usage and explain to the user your adopted system.


Other uses

The international standard
ISO 80000-1 ISO 80000 or IEC 80000 is an international standard introducing the International System of Quantities (ISQ). It was developed and promulgated jointly by the International Organization for Standardization (ISO) and the International Electrotech ...
defines the prefixes kibi-, mebi-, gibi- ... without limiting their application to
information technology Information technology (IT) is the use of computers to create, process, store, retrieve, and exchange all kinds of data . and information. IT forms part of information and communications technology (ICT). An information technology syste ...
. Uses of binary prefixes for quantities other than bits or bytes include their use to indicate binary multiples of the frequency unit
hertz The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), equivalent to one event (or cycle) per second. The hertz is an SI derived unit whose expression in terms of SI base units is s−1, meaning that o ...
(Hz), for example the kibihertz (symbol KiHz) is 1024 Hz.
(22 pages)


See also

* Binary engineering notation *
B notation (scientific notation) Scientific notation is a way of expressing numbers that are too large or too small (usually would result in a long string of digits) to be conveniently written in decimal form. It may be referred to as scientific form or standard index form, o ...
* ISO/IEC 80000 * Nibble *
Octet Octet may refer to: Music * Octet (music), ensemble consisting of eight instruments or voices, or composition written for such an ensemble ** String octet, a piece of music written for eight string instruments *** Octet (Mendelssohn), 1825 com ...


Definitions


References


Further reading

* – An introduction to binary prefixes * * * —a 1996–1999 paper on bits, bytes, prefixes and symbols * * —Another description of binary prefixes * —White-paper on the controversy over drive capacities


External links


A plea for sanity

A summary of the organizations, software, and so on that have implemented the new binary prefixes



SI/Binary Prefix Converter

Storage Capacity Measurement Standards
{{DEFAULTSORT:Binary Prefix Measurement Naming conventions Units of information Numeral systems