Elias
code or Elias gamma code is a
universal code encoding positive integers developed by
Peter Elias.
It is used most commonly when coding integers whose upper bound cannot be determined beforehand.
Encoding
To code a
number
A number is a mathematical object used to count, measure, and label. The most basic examples are the natural numbers 1, 2, 3, 4, and so forth. Numbers can be represented in language with number words. More universally, individual numbers can ...
''x'' ≥ 1:
# Let
be the highest power of 2 it contains, so 2
''N'' ≤ ''x'' < 2
''N''+1.
# Write out
zero bits, then
# Append the
binary form of
, an
-bit binary number.
An equivalent way to express the same process:
# Encode
in
unary; that is, as
zeroes followed by a one.
# Append the remaining
binary digits of
to this representation of
.
To represent a number
, Elias gamma (γ) uses
bits.
The code begins (the
implied probability distribution for the code is added for clarity):
Decoding
To decode an Elias gamma-coded integer:
#Read and count 0s from the stream until you reach the first 1. Call this count of zeroes ''N''.
#Considering the one that was reached to be the first digit of the integer, with a value of 2
''N'', read the remaining ''N'' digits of the integer.
Uses
Gamma coding is used in applications where the largest encoded value is not known ahead of time, or to
compress data in which small values are much more frequent than large values.
Gamma coding can be more size efficient in those situations. For example, note that, in the table above, if a fixed 8-bit size is chosen to store a small number like the number 5, the resulting binary would be
00000101
, while the γ-encoding variable-bit version would be
00 1 01
, needing 3 bits less. On the contrary, bigger values, like 254 stored in fixed 8-bit size, would be
11111110
while the γ-encoding variable-bit version would be
0000000 1 1111110
, needing 7 extra bits.
Gamma coding is a building block in the
Elias delta code.
Generalizations
Gamma coding does not code zero or negative integers.
One way of handling zero is to add 1 before coding and then subtract 1 after decoding.
Another way is to prefix each nonzero code with a 1 and then code zero as a single 0.
One way to code all integers is to set up a
bijection
In mathematics, a bijection, bijective function, or one-to-one correspondence is a function between two sets such that each element of the second set (the codomain) is the image of exactly one element of the first set (the domain). Equival ...
, mapping integers (0, −1, 1, −2, 2, −3, 3, ...) to (1, 2, 3, 4, 5, 6, 7, ...) before coding. In software, this is most easily done by mapping non-negative inputs to odd outputs, and negative inputs to even outputs, so the least-significant bit becomes an inverted
sign bit:
Exponential-Golomb coding generalizes the gamma code to integers with a "flatter" power-law distribution, just as
Golomb coding
Golomb coding is a lossless data compression method using a family of data compression codes invented by Solomon W. Golomb in the 1960s. Alphabets following a geometric distribution will have a Golomb code as an optimal prefix code, making ...
generalizes the unary code.
It involves dividing the number by a positive divisor, commonly a power of 2, writing the gamma code for one more than the quotient, and writing out the remainder in an ordinary binary code.
See also
*
*
*
References
Further reading
*
{{DEFAULTSORT:Elias Gamma Coding
Entropy coding
Numeral systems
Data compression