HOME

TheInfoList



OR:

Code page 37 (
CCSID A CCSID (coded character set identifier) is a 16-bit number that represents a particular encoding of a specific code page. For example, Unicode is a code page that has several encoding (so called "transformation") forms, like UTF-8, UTF-16 and UT ...
37; label ), known as "USA/Canada - CECP", is an
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding ...
code page used on IBM mainframes. It encodes the ISO/IEC 8859-1 repertoire of graphic characters. Code page 37 is one of the most-used and best-supported EBCDIC code pages. It is used as the default
z/OS z/OS is a 64-bit operating system for IBM z/Architecture mainframes, introduced by IBM in October 2000. It derives from and is the successor to OS/390, which in turn was preceded by a string of MVS versions.Starting with the earliest: * ...
code page in the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country primarily located in North America. It consists of 50 states, a federal district, five major unincorporated territo ...
and other English speaking countries. It is considered the "required" EBCDIC code page for the United States, and also used in Australia,
New Zealand New Zealand ( mi, Aotearoa ) is an island country in the southwestern Pacific Ocean. It consists of two main landmasses—the North Island () and the South Island ()—and over 700 smaller islands. It is the sixth-largest island country ...
, the
Netherlands ) , anthem = ( en, "William of Nassau") , image_map = , map_caption = , subdivision_type = Sovereign state , subdivision_name = Kingdom of the Netherlands , established_title = Before independence , established_date = Spanish Neth ...
,
Portugal Portugal, officially the Portuguese Republic ( pt, República Portuguesa, links=yes ), is a country whose mainland is located on the Iberian Peninsula of Southwestern Europe, and whose territory also includes the Atlantic archipelagos of the ...
and
Brazil Brazil ( pt, Brasil; ), officially the Federative Republic of Brazil (Portuguese: ), is the largest country in both South America and Latin America. At and with over 217 million people, Brazil is the world's fifth-largest country by area ...
, and on
ESA/390 The IBM System/390 is a discontinued mainframe product family implementing the ESA/390, the fifth generation of the System/360 instruction set architecture. The first computers to use the ESA/390 were the Enterprise System/9000 (ES/9000) ...
systems in
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world ...
, but not on Canadian
AS/400 The IBM AS/400 (Application System/400) is a family of midrange computers from IBM announced in June 1988 and released in August 1988. It was the successor to the System/36 and System/38 platforms, and ran the OS/400 operating system. Lower-cos ...
systems, which use Code page 500 instead. It is one of four EBCDIC code pages (alongside 500, 875 and 1026) with mapping data supplied by
Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washi ...
to the
Unicode Consortium The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California. Its primary purpose is to maintain and publish the Unicode Standard which was developed with the intentio ...
, and one of seven (alongside 273, 424, 500, 875, 1026 and 1140) supported by
Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pr ...
as standard.


Character set

Code page 37 exists in two versions: a "base character set" or "DP94" version (GCSGID 101 with CPGID 37, or CCSID 8229), containing only 94 graphical characters, and a "CECP" (Country Extended Code Page) version (GCSGID 697 with CPGID 37, or CCSID 37) with expansions to support the entire ISO/IEC 8859-1 repertoire.


Code page translation

Since CP 037 contains all of the standard Latin-1 characters, it is possible to translate the character codes from the CP 037 charset to
ISO 8859-1 ISO/IEC 8859-1:1998, ''Information technology — 8-bit single-byte coded graphic character sets — Part 1: Latin alphabet No. 1'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1 ...
character codes, so that translation back to the CP 037 charset is an exact value-preserving round-trip conversion. Likewise, half of the
control character In computing and telecommunication, a control character or non-printing character (NPC) is a code point (a number) in a character set, that does not represent a written symbol. They are used as in-band signaling to cause effects other than the ...
codes can be translated into their exact
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because o ...
equivalents. If the remaining
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding ...
-only control characters are translated (arbitrarily) into the remaining unused ASCII codes points (hex 80 to 9F) as well, the resulting translation covers all of the 256 character code points. Such a translation table is shown below: :::


Variants and relation to other code pages

Several other code pages exist, which amount to code page 37 with a few characters swapped. In particular, code page 37 encodes the hard brackets () at BA and BB, while other variants use other encodings for the hard brackets. For example, code page 1047 is a variant used by the
MVS Multiple Virtual Storage, more commonly called MVS, was the most commonly used operating system on the System/370 and System/390 IBM mainframe computers. IBM developed MVS, along with OS/VS1 and SVS, as a successor to OS/360. It is unrelated ...
Open Systems compiler. It differs from code page 37 in six positions: and switch places, as do and , placing the hard brackets at AD and BD; the caret and not sign also switch places This encoding of the brackets was inherited from the 3270 display system. Although IBM intends code page 1047 to be used to fulfil specific requirements only, and not as a replacement for code pages 37 and 500, the subset of code page 1047 encoding the
ASCII ASCII ( ), abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because o ...
repertoire was used for the single-byte graphical codes in
UTF-EBCDIC UTF-EBCDIC is a character encoding capable of encoding all 1,112,064 valid character code points in Unicode using one to five one-byte (8-bit) code units (in contrast to a maximum of four for UTF-8). It is meant to be EBCDIC-friendly, so that lega ...
. A hybrid of code pages 37 and 1047 used in combination with
code page 310 The programming language APL uses a number of symbols, rather than words from natural language, to identify operations, similarly to mathematical symbols. Prior to the wide adoption of Unicode, a number of special-purpose EBCDIC and non-EBCDIC co ...
on the 3279 system has been acknowledged by SHARE, but not officially designated as a code page by IBM, and is sometimes referred to as code page 37-2 by
transcoding Transcoding is the direct digital-to-digital conversion of one encoding to another, such as for video data files, audio files (e.g., MP3, WAV), or character encoding (e.g., UTF-8, ISO/IEC 8859). This is usually done in cases where a target de ...
software. It mostly matches code page 1047, except that the caret and the not sign are inverted to match code page 37. Code page 500, known as "International EBCDIC", "International Latin-1" or "International Number 5", is the other major EBCDIC encoding for the ISO/IEC 8859-1 repertoire. It is used in
Belgium Belgium, ; french: Belgique ; german: Belgien officially the Kingdom of Belgium, is a country in Northwestern Europe. The country is bordered by the Netherlands to the north, Germany to the east, Luxembourg to the southeast, France to the ...
,
Switzerland ). Swiss law does not designate a ''capital'' as such, but the federal parliament and government are installed in Bern, while other federal institutions, such as the federal courts, are in other cities (Bellinzona, Lausanne, Luzern, Neuchâtel ...
and on
AS/400 The IBM AS/400 (Application System/400) is a family of midrange computers from IBM announced in June 1988 and released in August 1988. It was the successor to the System/36 and System/38 platforms, and ran the OS/400 operating system. Lower-cos ...
systems in
Canada Canada is a country in North America. Its Provinces and territories of Canada, ten provinces and three territories extend from the Atlantic Ocean to the Pacific Ocean and northward into the Arctic Ocean, covering over , making it the world ...
. It is related to code page 37 and has the same repertoire, but differs in seven positions; in particular, it encodes and at 4A and 5A respectively, which are used for the
cent sign The cent is a monetary unit of many national currencies that equals of the basic monetary unit. Etymologically, the word 'cent' derives from the Latin word meaning hundred. The cent sign is commonly a simple minuscule (lower case) letter . ...
() and
exclamation point The exclamation mark, , or exclamation point (American English), is a punctuation mark usually used after an interjection or exclamation to indicate strong feelings or to show emphasis. The exclamation mark often marks the end of a sentence, f ...
() in code page 37. The caret () is also encoded at 5F, similarly to code page 1047. The is encoded at B0, the at BA, the at 4F and the pipe character (, ) at BB. Code page 500 is in turn related to the German code page 273, which swaps several character pairs in a manner corresponding to the differences between DRV8 and ISO/IEC 8859-1, resulting in placement of the hard brackets at 63 and FC. A variant of code page 37 with the
euro sign The euro sign () is the currency sign used for the euro, the official currency of the eurozone and unilaterally adopted by Kosovo and Montenegro. The design was presented to the public by the European Commission on 12 December 1996. It cons ...
() character added at the 9Fhex position, replacing the universal currency sign () is code page 1140. Similarly, code page 1148 makes the same change to code page 500, and code page 1141 does so to code page 273. An EBCDIC code page with the
ISO/IEC 8859-15 ISO/IEC 8859-15:1999, ''Information technology — 8-bit single-byte coded graphic character sets — Part 15: Latin alphabet No. 9'', is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1999. ...
repertoire, including the euro sign, is code page 924; this is related to code page 1047 but with the ISO/IEC 8859-1 characters replaced with their ISO/IEC 8859-15 replacements, with ¢ and ¬ moved to their code page 500 locations at B0hex and BAhex respectively, and with Ý moved to 4Ahex.


See also

*
EBCDIC Extended Binary Coded Decimal Interchange Code (EBCDIC; ) is an eight- bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding ...


References


External links


Longpela ASCII - EBCDIC 37 code converter
{{Character encoding IBM mainframe operating systems EBCDIC code pages