The Microsoft text-to-speech voices are
speech synthesizers provided for use with applications that use the
Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform.
There are client, server, and mobile versions of Microsoft text-to-speech voices. Client voices are shipped with Windows operating systems; server voices are available for download for use with server applications such as Speech Server, Lync etc. for both Windows client and server platforms, and mobile voices are often shipped with more recent versions.
Voices
Windows 2000 and Windows XP
Microsoft Sam is the default text-to-speech male voice in Microsoft
Windows 2000
Windows 2000 is a major release of the Windows NT operating system developed by Microsoft, targeting the server and business markets. It is the direct successor to Windows NT 4.0, and was Software release life cycle#Release to manufacturing (RT ...
and
Windows XP
Windows XP is a major release of Microsoft's Windows NT operating system. It was released to manufacturing on August 24, 2001, and later to retail on October 25, 2001. It is a direct successor to Windows 2000 for high-end and business users a ...
. It is used by
Narrator
Narration is the use of a written or spoken commentary to convey a story to an audience. Narration is conveyed by a narrator: a specific person, or unspecified literary voice, developed by the creator of the story to deliver information to the ...
, the
screen reader
A screen reader is a form of assistive technology (AT) that renders text and image content as speech or braille output. Screen readers are essential to blindness, blind people, and are useful to visually impaired people, Illiteracy, illiterate, ...
program built into the operating system.
Microsoft Mike and Microsoft Mary are optional male and female voices respectively, available for download from the Microsoft website. Michael and Michelle are also optional male and female voices licensed by Microsoft from
Lernout & Hauspie, and are available through
Microsoft Office XP and
Microsoft Office 2003 or
Microsoft Reader
Microsoft Reader is a discontinued Microsoft application for reading e-books, first released in August 2000, that used its own .LIT format. It was available for Windows computers and Pocket PC PDAs. The name was also used later for an unrelated ...
. The SAPI 5.1 SDK also includes an additional voice for testing purposes known as "Sample TTS Voice", which utilizes recorded voice samples for use with the text-to-speech engine using a predefined set of words.
[Speech SDK 5.1](_blank)
/ref>
There have been both SAPI 4 and SAPI 5 versions of these text-to-speech voices that were released. These two versions are different from each other in terms of the speech patterns, pronunciation of certain words, and changes to some words spoken by the text-to-speech engine. SAPI 4 voices are only available on Windows 2000 and later Windows NT-based operating systems. Redistributable versions of the SAPI 4 voices were available for download on Windows 9x
Windows 9x is a generic term referring to a line of discontinued Microsoft Windows operating systems released from 1995 to 2000 and supported until 2006, which were based on the kernel introduced in Windows 95 and modified in succeeding version ...
operating systems, however they are no longer offered from the Microsoft website. While the SAPI 5 versions of Microsoft Mike and Microsoft Mary are only downloadable as a Merge Module, the installable versions may be installed on end users' systems by speech applications such as Microsoft Reader.
The SAPI 4 versions of Microsoft Sam, Microsoft Mike and Microsoft Mary can be used on Windows XP, Windows Vista
Windows Vista is a major release of the Windows NT operating system developed by Microsoft. It was the direct successor to Windows XP, released five years earlier, which was then the longest time span between successive releases of Microsoft W ...
, and later with a third-party program (like Speakonia and TTSReader) installed on the machine that supports these operating systems. In addition, the SAPI 4 versions of the Michael and Michelle soundalikes from Lernout & Hauspie (with different dialects) can also be used on Windows Vista and later by downloading the respective British English pack and then using it with a third-party program like Speakonia (Conversely, said voices are also compatible with XP and prior as well).
The SAPI 5 versions of Microsoft Sam, Microsoft Mike and Microsoft Mary (as well as the "Sample TTS Voice" test voice) can also be used on Windows Vista and later by installing the SAPI 5.1 SDK, which can also be installed in versions of Windows prior to XP beginning with Windows NT 4.0 SP6a and Windows 98
Windows 98 is a consumer-oriented operating system developed by Microsoft as part of its Windows 9x family of Microsoft Windows operating systems. It was the second operating system in the 9x line, as the successor to Windows 95. It was Software ...
. These voices (apart from the "Sample TTS Voice" test voice) can also be installed separately via a manually-defined batch script. Furthermore, the SAPI 5 versions of Michael and Michelle from Lernout & Hauspie can also be installed via programs such as Microsoft Office XP or Microsoft Office 2003, however the voices cannot be chosen under normal means.
Windows Vista and Windows 7
Beginning with Windows Vista
Windows Vista is a major release of the Windows NT operating system developed by Microsoft. It was the direct successor to Windows XP, released five years earlier, which was then the longest time span between successive releases of Microsoft W ...
and Windows 7
Windows 7 is a major release of the Windows NT operating system developed by Microsoft. It was Software release life cycle#Release to manufacturing (RTM), released to manufacturing on July 22, 2009, and became generally available on October 22, ...
, ''Microsoft Anna'' is the default English voice. It is a SAPI 5-only female voice and is designed to sound more natural than Microsoft Sam. Microsoft Streets & Trips 2006 and later install the Microsoft Anna voice on Windows XP systems for the voice-prompt direction feature. There are no male voices shipping with Windows Vista and Windows 7, and neither Microsoft Mike or Mary will work on Windows 7.
A female voice called ''Microsoft Lili'' that replaces the earlier male SAPI 5 voice "Microsoft Simplified Chinese" is available in Chinese versions of Windows Vista and Windows 7. It can also be obtained in non-Chinese versions of Windows 7 or Vista by installing the Chinese language pack.
In 2010, Microsoft released the newer Speech Platform compatible voices for Speech Recognition and Text-to-Speech for use with client and server applications. These voices are available in 26 languages and can be installed on Windows client and server operating systems. Speech Platform voices, unlike SAPI 5 voices, are female-only; no male voices were ever released.
Windows 8 and Windows 8.1
In Windows 8
Windows 8 is a major release of the Windows NT operating system developed by Microsoft. It was Software release life cycle#Release to manufacturing (RTM), released to manufacturing on August 1, 2012, made available for download via Microsoft ...
, there are three new client (desktop) voices - Microsoft David (US male), Hazel (UK female) and Zira (US female) which are intended to sound more natural than Microsoft Anna. The server versions of these voices are available via the above-mentioned Speech Platform for operating systems earlier than Windows 8. Other voices are available for specific language versions of either Windows 8 or Windows 8.1.
Unlike Windows 7 or Vista, one cannot use any third-party program for Microsoft Anna because there is no official Anna Voice API for download (especially because Microsoft Anna was only available in SAPI 5 and no SAPI 4 version of the voice exists).
Windows 10
In Windows 10
Windows 10 is a major release of Microsoft's Windows NT operating system. The successor to Windows 8.1, it was Software release cycle#Release to manufacturing (RTM), released to manufacturing on July 15, 2015, and later to retail on July 2 ...
, Microsoft Hazel was removed from the US English Language Pack and the Microsoft voices for Mobile (Phone/tablet) are available (Microsoft Mark and Microsoft Zira). These are the same voices found on Windows Phone 8, Windows Phone 8.1 and Windows 10 Mobile.
Also with these voices language packs are also available for a variety of voices similar to that of Windows 8 and 8.1. None of these voices match the Cortana text-to-speech voice which can be found on Windows Phone 8.1, Windows 10, and Windows 10 Mobile.
In an attempt to unify its software with Windows 10
Windows 10 is a major release of Microsoft's Windows NT operating system. The successor to Windows 8.1, it was Software release cycle#Release to manufacturing (RTM), released to manufacturing on July 15, 2015, and later to retail on July 2 ...
, all of Microsoft's current platforms use the same text-to-speech voices except for Microsoft David and a few others.
Mobile
Every mobile voice package has the combination of male/female, while most of the desktop voice packages have only female voices. All mobile voices have been made universal and any user who downloads the language pack of that choice will have one extra male and female voice per that package.
A hidden text-to-speech voice in Windows 10 called Microsoft Eva Mobile is present within the system. Users can download a pre-packaged registry file from the windowsreport.com website. Microsoft Eva is believed to be the early voice for Cortana until Microsoft replaced her with the voice of Jen Taylor in most areas.
These voices are updated with Windows to sound more natural than in the original version as seen in updated retail builds of Windows 10.
Windows 11
Windows 11
Windows 11 is a version of Microsoft's Windows NT operating system, released on October 5, 2021, as the successor to Windows 10 (2015). It is available as a free upgrade for devices running Windows 10 that meet the #System requirements, Windo ...
introduced three new "natural voices" borrowed from Microsoft's Azure cloud computing
Cloud computing is "a paradigm for enabling network access to a scalable and elastic pool of shareable physical or virtual resources with self-service provisioning and administration on-demand," according to International Organization for ...
platform starting with version 22H2: Microsoft Aria, Jenny, and Guy. These natural voices are intended to sound more natural than previous text-to-speech voices. It is exclusively available in Narrator and cannot be used in any other applications outside of it, including all first-party and third-party applications .
The voices from Windows 10 are now reclassified as "legacy voices", however Microsoft David was still used as the default for the desktop client.
See also
*Speech synthesis
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal langua ...
* Comparison of speech synthesizers
References
External links
Vista Watch: New Chinese features in Windows Vista
{{DEFAULTSORT:Microsoft Text-To-Speech Voices
Speech synthesis software
Text-to-speech voices
Voice technology