Aztec Code
   HOME

TheInfoList



OR:

The Aztec Code is a matrix code invented by
Andrew Longacre, Jr. Andrew is the English form of a given name common in many countries. In the 1990s, it was among the top ten most popular names given to boys in English-speaking countries. "Andrew" is frequently shortened to "Andy" or "Drew". The word is derive ...
and Robert Hussey in 1995.* The code was published by AIM, Inc. in 1997. Although the Aztec Code was patented, that patent was officially made public domain. Click "images" then "correction" to see dedication to the public domain. The Aztec Code is also published as ISO/IEC 24778:2008 standard. Named after the resemblance of the central finder pattern to an Aztec pyramid, Aztec Code has the potential to use less space than other matrix barcodes because it does not require a surrounding blank "quiet zone".


Structure

The symbol is built on a square grid with a bulls-eye pattern at its centre for locating the code. Data is
encode The Encyclopedia of DNA Elements (ENCODE) is a public research project which aims to identify functional elements in the human genome. ENCODE also supports further biomedical research by "generating community resources of genomics data, software ...
d in concentric square rings around the bulls-eye pattern. The central bulls-eye is 9×9 or 13×13
pixel In digital imaging, a pixel (abbreviated px), pel, or picture element is the smallest addressable element in a raster image, or the smallest point in an all points addressable display device. In most digital display devices, pixels are the ...
s, and one row of pixels around that encodes basic coding parameters, producing a "core" of 11×11 or 15×15 squares. Data is added in "layers", each one containing two rings of pixels, giving total sizes of 15×15, 19×19, 23×23, etc. The corners of the core include orientation marks, allowing the code to be read if rotated or reflected. Decoding begins at the corner with three black pixels, and proceeds clockwise to the corners with two, one, and zero black pixels. The variable pixels in the central core encode the size, so it is not necessary to mark the boundary of the code with a blank "quiet zone", although some bar code readers require one. File:Aztec Code with desc.png, The core of the compact Aztec code (red ascending diagonal hatching), showing the central bulls-eye, the four orientation marks (blue diagonal cross-hatching), and space for 28 bits (7 bits per side) of coding information (green horizontal hatching). The first ring of data begins outside that (grey descending diagonal hatching). File:Aztec code full core.png, The core of the full Aztec code. 40 bits are available between the orientation marks for encoding parameters. File:Aztec Encoding.svg, Message data is placed in a spiral pattern around the core. The mode message begins "01011100", indicating 01₂+1 = 2 layers, and 011100₂+1 = 29 data codewords (of 6 bits each). The compact Aztec code core may be surrounded by 1 to 4 layers, producing symbols from 15×15 (room for 13 digits or 12 letters) through 27×27. There is additionally a special 11×11 "rune" that encodes one byte of information. The full core supports up to 32 layers, 151×151 pixels, which can encode 3832 digits, 3067 letters, or 1914 bytes of data. Whatever part of the symbol is not used for the basic data is used for
Reed–Solomon error correction Reed–Solomon codes are a group of error-correcting codes that were introduced by Irving S. Reed and Gustave Solomon in 1960. They have many applications, the most prominent of which include consumer technologies such as MiniDiscs, CDs, DVDs, ...
, and the split is completely configurable, between limits of 1 data word, and 3 check words. The recommended number of check words is 23% of symbol capacity plus 3 codewords. Aztec Code is supposed to produce readable codes with various printer technologies. It is also well suited for displays of cell phones and other mobile devices.


Encoding

The encoding process consists of the following steps: # Converting the source message to a string of bits # Computing the necessary symbol size and mode message, which determines the Reed–Solomon codeword size # Bit-stuffing the message into Reed–Solomon codewords # Padding the message to a codeword boundary # Appending check codewords # Arranging the complete message in a spiral around the core All conversion between bits strings and other forms is performed according to the
big-endian In computing, endianness, also known as byte sex, is the order or sequence of bytes of a word of digital data in computer memory. Endianness is primarily expressed as big-endian (BE) or little-endian (LE). A big-endian system stores the most si ...
(most significant bit first) convention.


Character set

All 8-bit values can be encoded, plus two escape codes: * FNC1, an escape symbol used to mark the presence of an application identifier, in the same way as in the
GS1-128 Code 128 is a high-density linear barcode symbology defined in ISO/IEC 15417:2007. It is used for alphanumeric or numeric-only barcodes. It can encode all 128 characters of ASCII and, by use of an extension symbol (FNC4), the Latin-1 characters ...
standard. * ECI, an escape followed by a 6-digit
Extended Channel Interpretation Extended Channel Interpretation (ECI) is an extension to the communication protocol that is used to transmit data from a bar code reader to a host when a bar code symbol is scanned. It enables the application software to receive additional inform ...
code, which specifies the character set used to interpret following bytes. By default, codes 0–127 are interpreted according to ANSI X3.4 (
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because ...
), and 128–255 are interpreted according to
ISO/IEC 8859-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in ...
: Latin Alphabet No. 1. This corresponds to ECI 000003. Bytes are translated into 4- and 5-bit codes, based on a current decoding mode, with shift and latch codes for changing modes. Byte values not available this way may be encoded using a general "binary shift" code, which is followed by a length and a number of 8-bit codes. For changing modes, a ''shift'' affects only the interpretation of the single following code, while a ''latch'' affects all following codes. Most modes use 5-bit codes, but Digit mode uses 4-bit codes. * Initial mode is "Upper" * x/S = Shift to mode x for one character; B/S = shift to 8-bit binary * x/L = Latch to mode x for following characters * Punct codes 2–5 encode two bytes each * The table lists ASCII characters, but it is the byte values that are encoded, even if a non-ASCII character set it in use B/S (binary shift) is followed by a 5-bit length. If non-zero, this indicates that 1–31 8-bit bytes follow. If zero, 11 additional length bits encode the number of following bytes less 31. (Note that for 32–62 bytes, two 5-bit byte shift sequences are more compact than one 11-bit.) At the end of the binary sequence, the previous mode is resumed. FLG(''n'') is followed by a 3-bit ''n'' value. ''n''=0 encodes FNC1. ''n''=1–6 is followed by 1–6 digits (in digit mode) which are zero-padded to make a 6-bit ECI identifier. ''n''=7 is reserved and currently illegal.


Mode message

The mode message encodes the number of layers (''L'' layers encoded as the integer ''L''−1), and the number of data codewords (''D'' codewords, encoded as the integer ''D''−1) in the message. All remaining codewords are used as check codewords. For compact Aztec codes, the number of layers is encoded as a 2-bit value, and the number of data codewords as a 6-bit value, resulting in an 8-bit mode word. For full Aztec codes, the number of layers is encoded in 5 bits, and the number of data codewords is encoded in 11 bits, making a 16-bit mode word. The mode word is broken into 2 or 4 4-bit codewords in GF(16), and 5 or 6 Reed–Solomon check words are appended, making a 28- or 40-bit mode message, which is wrapped in a 1-pixel layer around the core. Because an L+1-layer compact Aztec code can hold more data than an L-layer full code, full codes with less than 4 layers are rarely used. Most importantly, the number of layers determines the size of the Reed–Solomon codewords used. This varies from 6 to 12 bits: The codeword size ''b'' is the smallest even number which ensures that the total number of codewords in the symbol is less than the limit of 2''b''−1 which can be corrected by a Reed–Solomon code. As mentioned above, it is recommended that at least 23% of the available codewords, plus 3, are reserved for correction, and a symbol size is chosen such that the message will fit into the available space.


Bit stuffing

The data bits are broken into codewords, with the first bit corresponding to the most significant coefficient. While doing this, code words of all-zero and all-ones are avoided by
bit stuffing In data transmission and telecommunication, bit stuffing (also known—uncommonly—as positive justification) is the insertion of non-information bits into data. Stuffed bits should not be confused with overhead bits. Bit stuffing is used for ...
: if the first ''b''−1 bits of a code word have the same value, an extra bit with the complementary value is inserted into the data stream. This insertion takes place whether or not the last bit of the code word would have had the same value or not. Also note that this only applies to strings of ''b''−1 bits ''at the beginning of a code word''. Longer strings of identical bits are permitted as long as they straddle a code word boundary. When decoding, a code word of all zero or all one may be assumed to be an
erasure Erasure () is an English synth-pop duo formed in London in 1985, consisting of lead vocalist and songwriter Andy Bell with songwriter, producer and keyboardist Vince Clarke, previously known as co-founder of the band Depeche Mode and a membe ...
, and corrected more efficiently than a general error. This process makes the message longer, and the final number of data codewords recorded in the mode message is not known until it is complete. In rare cases it may be necessary to jump to the next-largest symbol and begin the process all over again to maintain the minimum fraction of check words.


Padding

After bit stuffing, the data string is padded to the next codeword boundary by appending 1 bits. If this would result in a code word of all ones, the last bit is changed to zero (and will be ignored by the decoder as a bit-stuffing bit). On decoding, the padding bits may be decoded as shift and latch codes, but that will not affect the message content. The reader must accept and ignore a partial code at the end of the message, as long as it is all-ones. Additionally, if the total number of data bits available in the symbol is not a multiple of the codeword size, the data string is prefixed with an appropriate number of 0 bits to occupy the extra space. These bits are not included in the check word computation.


Check codewords

Both the mode word, and the data, must have check words appended to fill out the available space. This is computed by appending ''K'' check words such that the entire message is a multiple of the Reed–Solomon polynomial (''x''−2)(''x''−4)...(''x''−2''K''). Note that check words are ''not'' subject to bit stuffing, and may be all-zero or all-one. Thus, it is not possible to detect erasure of a check word.


Laying out the message

A full Aztec code symbol has, in addition to the core, a "reference grid" of alternating black and white pixels occupying every 16th row and column. These known pixels allow a reader to maintain alignment with the pixel grid over large symbols. For up to 4 layers (31×31 pixels), this consists only of single lines extending outward from the core, continuing the alternating pattern. Inside the 5th layer, however, additional rows and columns of alternating pixels are inserted ±16 pixels from the center, so the 5th layer is located ±17 and ±18 pixels from the center, and a 5-layer symbol is 37×37 pixels. Likewise, additional reference grid rows and columns are inserted ±32 pixels from the center, making a 12-layer symbol 67×67 pixels. In this case, the 12th layer occupies rings ±31 and ±33 pixels from the center. The pattern continues indefinitely outward, with 15-pixel blocks of data separated by rows and columns of the reference grid. One way to construct the symbol is to delete the reference grid entirely, and begin with a 14×14-pixel core centered on a 2×2 pixel white square. Then break it into 15×15 pixel blocks and insert the reference grid between them. The mode message begins at the top-left corner of the core, and wraps around it clockwise in a 1-bit thick layer. It begins with the most significant bit of the number of layers, and ends with the check words. For a compact Aztec code, it is broken into four 7-bit pieces to leave room for the orientation marks. For a full Aztec code, it is broken into four 10-bit pieces, and those pieces are each divided in half by the reference grid. In some cases, the total capacity of the matrix does not divide evenly by full code words. In such cases the main message is padded with 0 bits in the beginning. These bits are not included in the check word calculation and should be skipped during decoding. The total matrix capacity for a full symbol can be calculated as (112+16*L)*L for a full Aztec code and (88+16*L)*L for a compact Aztec code, where L is the symbol size in layers. As an example, the total matrix capacity of a compact Aztec code with 1 layer is 104 bits. Since code words are six bits, this gives 17 code words and two extra bits. Two zero bits are prepended to the message as padding and must be skipped during decoding. The padded main message begins at the outer top-left of the entire symbol, and spirals around it ''counterclockwise'' in a 2-bit thick layer, ending directly above the top-left corner of the core. This places the bit-stuffed data words, for which erasures can be detected, in the outermost layers of the symbol, which is most prone to erasures. The check words are stored closer to the core. The last check word ends just above the top left corner of the bull's eye. With the core in its standard orientation, the first bit of the first data word is placed in the upper-left corner, with additional bits placed in a 2-bit-wide column left-to-right and top-to-bottom. This continues until 2 rows from the bottom of the symbol when the pattern rotates 90 degrees counterclockwise, and continues in a 2-bit high row, bottom-to-top and left-to-right. After 4 equal-sized quarter layers, the spiral continues with the top-left corner of the next-inner layer, finally ending one pixel above the top-left corner of the core. Finally, 1 bits are printed as black squares, and 0 bits are printed as white squares.


Usage


Transport

Aztec codes are widely used for transport ticketing. The Aztec Code has been selected by the airline industry ( IATA's BCBP standard) for electronic boarding passes. Several airlines send Aztec Codes to passengers' mobile phones to act as boarding passes. These are often integrated with apps on passengers' phones, including
Apple Wallet Apple Wallet, or simply Wallet and formerly known as Passbook, is a digital wallet developed by Apple Inc. and included with iOS and watchOS that allows users to store Wallet passes such as coupons, boarding passes, student ID cards, govern ...
. Aztec codes are also used in rail, including by
Tehran Metro The Tehran Metro ( fa, مترو تهران, translit=Metro-ye Tehrān) is a rapid transit system serving Tehran, the capital of Iran. It is the largest metro system in the Middle East. The system is owned and operated by Tehran Urban and Subur ...
,
Eurostar Eurostar is an international high-speed rail service connecting the United Kingdom with France, Belgium and the Netherlands. Most Eurostar trains travel through the Channel Tunnel between the United Kingdom and France, owned and operate ...
,
Deutsche Bahn The (; abbreviated as DB or DB AG) is the national railway company of Germany. Headquartered in the Bahntower in Berlin, it is a joint-stock company ( AG). The Federal Republic of Germany is its single shareholder. describes itself as the ...
,
TCDD Taşımacılık TCDD Taşımacılık A.Ş. ( en, TCDD Transport, reporting mark TCDDT) is a government-owned railway company responsible for the operations of most passenger and freight rail in Turkey. The company was formed on 14 June 2016, splitting off from th ...
, DSB, SJ, České dráhy, Slovak Railways,
Slovenian Railways Slovenian Railways ( sl, Slovenske železnice, ''SŽ'') is the state railway company of Slovenia, created in 1991. Slovenia is a member of the International Union of Railways (UIC). The UIC Country Code for Slovenia is 79. History What is n ...
,
Croatian Railways Croatian Railways ( hr, Hrvatske željeznice; abbreviated as HŽ) is the national railway company of Croatia. Croatia is a member of the International Union of Railways (UIC). The UIC Country Code for Croatia is 78. The Croatian rail network car ...
,
Trenitalia Trenitalia is the primary train operator in Italy. A subsidiary of Ferrovie dello Stato Italiane, itself owned by the Italian government, the company was established in 2000 following a European Union directive on the deregulation of rail transp ...
,
Nederlandse Spoorwegen Nederlandse Spoorwegen (NS; ; en, "Dutch Railways") is the principal passenger railway operator in the Netherlands. It is a Dutch state-owned company founded in 1938. The Dutch rail network is one of the busiest in the European Union, and t ...
,
PKP Intercity PKP Intercity is a company of PKP Group responsible for long-distance passenger transport. It runs about 350 trains daily, connecting mainly large agglomerations and smaller towns in Poland. The company also provides most international trains t ...
,
VR Group VR-Group Plc ( fi, VR-Yhtymä Oyj, sv, VR-Group Abp), commonly known as VR, is a government-owned railway company in Finland. VR's most important function is the operation of Finland's passenger rail services with 250 long-distance and 800 co ...
,
National Rail National Rail (NR) is the trading name licensed for use by the Rail Delivery Group, an unincorporated association whose membership consists of the passenger train operating companies (TOCs) of England, Scotland, and Wales. The TOCs run the ...
,
Via Rail Via Rail Canada Inc. (), operating as Via Rail or Via, is a Canadian Crown corporation that is mandated to operate intercity passenger rail service in Canada. It receives an annual subsidy from Transport Canada to offset the cost of operating ...
,
Swiss Federal Railways Swiss Federal Railways (german: link=no, Schweizerische Bundesbahnen, ''SBB''; french: link=no, Chemins de fer fédéraux suisses, ''CFF''; it, Ferrovie federali svizzere, ''FFS'') is the national railway company of Switzerland. It is usuall ...
,
SNCB french: Société nationale des chemins de fer belgesgerman: Nationale Gesellschaft der Belgischen Eisenbahnen , type = Statutory corporation , industry = Rail Transport , foundation = 1926 , founder = Government of ...
and
SNCF The Société nationale des chemins de fer français (; abbreviated as SNCF ; French for "National society of French railroads") is France's national state-owned railway company. Founded in 1938, it operates the country's national rail traffic ...
for tickets sold online and printed out by customers or displayed on mobile phone screens. The Aztec code is scanned by a handheld scanner by on-train staff or at the turnstile to validate the ticket.


Governmental

Car registration documents in
Poland Poland, officially the Republic of Poland, is a country in Central Europe. It is divided into 16 administrative provinces called voivodeships, covering an area of . Poland has a population of over 38 million and is the fifth-most populou ...
bear a summary, compressed by NRV2E algorithm, encoded as Aztec Code. Works are underway to enable car insurance companies to automatically fill in the relevant information based on digital photographs of the document as the first step of closing a new insurance contract. Federal Tax Service in
Russia Russia (, , ), or the Russian Federation, is a transcontinental country spanning Eastern Europe and Northern Asia. It is the largest country in the world, with its internationally recognised territory covering , and encompassing one-ei ...
encodes payment information in tax notices as Aztec Code.


Commercial

Many
bill Bill(s) may refer to: Common meanings * Banknote, paper cash (especially in the United States) * Bill (law), a proposed law put before a legislature * Invoice, commercial document issued by a seller to a buyer * Bill, a bird or animal's beak Pla ...
s in Canada are now using this technology as well, including
EastLink (company) Eastlink Inc. is a Canadian cable television and telecommunications company. The privately held company was founded in Nova Scotia in 1969 by the Bragg family, and has grown since through the amalgamation of several telecommunications companies ...
,
Shaw Cable Shaw Communications Inc. is a Canadian telecommunications company which provides telephone, Internet, television, and mobile services. Headquartered in Calgary, Alberta, Shaw provides home telecommunications services primarily in Alberta and Br ...
,
Bell Aliant Bell Aliant is a brand name used by Bell Canada for telecommunications services in Atlantic Canada. Prior to 2015, Bell Aliant Inc. (formerly Aliant Inc.) was a separate company providing telecom services in the Atlantic provinces and a few othe ...
and many more.


References


External links


Aztec code specification (ISO/IEC 24778:2008)
{{Barcodes Barcodes