Macintosh Character Set

Mac OS Roman is a character encoding created by Apple Computer, Inc. for use by Macintosh computers. It is suitable for representing text in English and several other Western languages. Mac OS Roman encodes 256 characters, the first 128 of which are identical to ASCII, with the remaining characters including mathematical symbols, diacritics, and additional punctuation marks. Mac OS Roman is an extension of the original Macintosh character set, which encoded only 217 characters. Full support for Mac OS Roman first appeared in System 6.0.4, released in 1989, and the encoding is still supported in current versions of macOS, though the standard character encodings are now UTF-8 or UTF-16. Apple modified Mac OS Roman in 1998 with the release of Mac OS 8.5 by replacing the currency sign at position 0xDB with the euro sign, but otherwise the encoding has been unchanged since its release.
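
The ASCII overlap and the euro sign position can be checked with a short Python sketch. It assumes Python's built-in mac_roman codec, which appears to follow the post-1998 mapping (euro sign rather than currency sign at 0xDB); the variable names are illustrative only.

```python
# Minimal sketch: inspect Mac OS Roman through Python's built-in "mac_roman" codec.
# Assumption: the codec reflects the post-1998 table with the euro sign at 0xDB.

ascii_bytes = bytes(range(128))       # code points 0x00-0x7F
upper_bytes = bytes(range(128, 256))  # code points 0x80-0xFF

# The lower half should decode exactly as ASCII does.
ascii_matches = ascii_bytes.decode("mac_roman") == ascii_bytes.decode("ascii")

# The upper half holds the accented letters, symbols, and punctuation.
upper_text = upper_bytes.decode("mac_roman")

# Position 0xDB is the slot Apple reassigned to the euro sign.
euro = bytes([0xDB]).decode("mac_roman")

print(ascii_matches, len(upper_text), euro)
```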


Character set

The following table shows how characters are encoded in Mac OS Roman. The row and column headings give the first and second digit of the hexadecimal code for each character in the table.
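
The row and column lookup described above can be sketched in Python as follows. The helper name table_cell is hypothetical, and the decoding again relies on Python's built-in mac_roman codec rather than on any Apple-supplied table.

```python
def table_cell(byte_value: int) -> tuple[str, str, str]:
    """Return (row, column, character) for one Mac OS Roman byte.

    The row is the first hexadecimal digit of the code and the column
    is the second, matching the headings of the character table.
    """
    if not 0 <= byte_value <= 0xFF:
        raise ValueError("Mac OS Roman codes are single bytes (0x00-0xFF)")
    row = format(byte_value >> 4, "X")       # high nibble
    column = format(byte_value & 0x0F, "X")  # low nibble
    character = bytes([byte_value]).decode("mac_roman")
    return row, column, character


print(table_cell(0x41))  # row "4", column "1": the letter A
print(table_cell(0xDB))  # row "D", column "B": the euro sign
```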


Technical notes

The Internet Assigned Numbers Authority (IANA) identifies this encoding using the string "macintosh". The MIME Content-Type for this encoding is therefore "text/plain; charset=macintosh". The Microsoft Windows code page number is 10000. IBM uses code page/CCSID 1275. With the release of Mac OS X, Mac OS Roman and all other "scripts" (as classic Mac OS called them) were replaced by UTF-8 as the standard character encoding for the Macintosh operating system. However, the default character encoding in Java for Mac OS X remained MacRoman, and the keyboard layout, with its combination of control, option, and dead keys, still maps to the original characters in MacRoman. The default character encoding for Java can be changed to UTF-8 by adding a line such as export JAVA_TOOL_OPTIONS=-Dfile.encoding=UTF-8 to .profile. With Java 18 and later, the default character encoding is UTF-8 across platforms, including macOS.
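
As an illustration of the "macintosh" label in practice, the following Python sketch decodes a message body according to its Content-Type header. The function name decode_body is hypothetical; the sketch assumes Python's standard library, where "macintosh" is registered as an alias of the mac_roman codec.

```python
import codecs
from email.message import Message


def decode_body(body: bytes, content_type: str) -> str:
    """Decode raw bytes using the charset named in a Content-Type value."""
    msg = Message()
    msg["Content-Type"] = content_type
    # Fall back to US-ASCII when the header carries no charset parameter.
    charset = msg.get_content_charset("us-ascii")
    return body.decode(charset)


# "macintosh" and "mac_roman" should resolve to the same codec.
same_codec = codecs.lookup("macintosh").name == codecs.lookup("mac_roman").name

# 0x8E is e-acute in Mac OS Roman.
text = decode_body(b"caf\x8e", "text/plain; charset=macintosh")

# Reading UTF-8 bytes as Mac OS Roman yields the familiar garbled output.
mojibake = "\u00e9".encode("utf-8").decode("macintosh")

print(same_codec, text, mojibake)
```

The last line shows why the encoding label matters: the two UTF-8 bytes of a single accented letter come out as two unrelated Mac OS Roman characters when the wrong charset is assumed.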


See also

* Western Latin character sets (computing)

