SpeechWeb

A SpeechWeb is a collection of hyperlinked speech applications, accessed remotely by speech browsers running on end-user devices. Links are activated through spoken commands. The idea of surfing the
web
by voice dates back to at least the work of Hemphill and Thrift in 1995 (Hemphill, C. T. and Thrift, P. R. "Surfing the Web by Voice." Proceedings of the Third ACM International Multimedia Conference (San Francisco 1995), 1995, pp. 215–222),
who developed a system in which
HTML pages were downloaded and processed on client-side computers, enabling voice access to web page content and activation of hyperlinks through spoken commands. Also in the mid-1990s, researchers at AT&T
were discussing the development of a new
markup language
that would enable the web to be accessed through regular phones. From 1995 to 1999,
AT&T, Lucent, Motorola, and IBM all developed their own versions of phone and speech markup languages. These companies created the
VoiceXML Forum
and jointly designed the Voice Markup Language, VXML, which was accepted by the W3C in 2000. VXML is typically used to create hyperlinked speech applications (Lucas, B. "VoiceXML for Web-based distributed conversational applications." Commun. ACM 43, 9, 2000, pp. 53–57).
VXML pages include commands for prompting user speech input, invoking recognition grammars, outputting synthesized voice, iterating through blocks of code, calling local JavaScript, and hyperlinking to other remote
VXML
pages downloaded in a manner similar to the linking of HTML pages in the conventional Web. Around the same time as the emergence of
VXML, a research group at the University of Windsor
in Canada was developing an alternative approach, in which speech applications deployed on the web are accessed by client-side speech browsers. The browser provides the speech-recognition capability, which is tailored to each application by downloading an application-specific recognition grammar from the remote speech application's web site. Input recognized by the client-side browser is sent to the remote server, which processes it and returns a text result to the browser for output as synthesized voice. The term SpeechWeb was used in 1999 to describe the collection of hyperlinked speech applications in this architecture (Frost, R. A. and Chitte, S. "A New Approach for Providing Natural-Language Speech Access to Large Knowledge Bases." Proc. of PACLING '99, The Conference of the Pacific Association for Computational Linguistics, University of Waterloo, Ontario, Canada, 1999, pp. 82–90). The first SpeechWeb browser was demonstrated at the AAAI Sixteenth National Conference on Artificial Intelligence (Frost, R. A. "A Natural-Language Speech Interface Constructed Entirely as a Set of Executable Specifications." Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, Orlando, Florida, USA, 1999, pp. 908–909).

The term "speechweb" has also been used since the 1990s, in a different context, to describe a web-based network of information on speech, language, and speech-language pathology. That network was also intended to provide a meeting place for professionals and for people affected by communication disorders. The term "speechWeb" has been trademarked by the company PipeBeach, which is now owned by HP, and refers to a software product which bridges telephone networks and conventional web servers.

In 2005, it was recognized that very few voice applications were available to the public through the
Internet
, despite the maturity of VXML at that time. It was also observed that nearly all
VXML
applications that were available had been constructed by people working in commerce and industry. This was in stark contrast to the huge growth of the conventional web, and the huge involvement of the public in the development of regular web pages, only a few years after the development of
HTML. This observation led to the call for a Public-Domain SpeechWeb (Frost, R. A. "Call for a public-domain SpeechWeb." Commun. ACM 48, 11, 2005, pp. 45–49), which is accessible to the public through existing web browsers (with speech plugins) and which contains hyperlinked speech applications that are created and deployed by the public in a manner analogous to the creation and deployment of HTML pages on the conventional web.
A browser for the Public-Domain SpeechWeb was demonstrated at the 16th International World Wide Web Conference, held in Banff, Canada in 2007 (Frost, R. A., Ma, X. and Shi, Y. "A browser for a public-domain SpeechWeb." World Wide Web Conference, Banff, Canada, 2007, pp. 1307–1308).
The browser is a small X+V page which is executed by the freely available Opera browser
with the free IBM speech-recognition plugin.

Two research groups are developing software to facilitate the construction and deployment of SpeechWeb applications by non-experts:
* The "MySpeechWeb" research group at the University of Windsor has developed documentation and software to assist people who want to access or create SpeechWeb applications. The group has also created a prototype Public-Domain SpeechWeb containing example applications, which are available through a portal.
* The "w3voice skeleton" research group at the Auditory Media Laboratory, Wakayama University in Japan has created software that facilitates the construction and deployment of speech applications for the Japanese language.
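
The SpeechWeb architecture described above has three steps: the browser downloads an application-specific grammar, recognizes input on the client, and sends the recognized text to the remote application, which returns a text result. The following Python code is a minimal sketch of that flow, not the Windsor group's implementation. It makes several simplifying assumptions: speech input and synthesized output are stood in for by plain strings, a "grammar" is just a set of accepted phrases, and all names (`SpeechApplication`, `SpeechBrowser`, `build_demo_web`, and the two demo applications) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, FrozenSet, Optional


@dataclass
class SpeechApplication:
    """Hypothetical stand-in for a remote speech application."""
    name: str
    grammar: FrozenSet[str]            # phrases the application accepts
    answer: Callable[[str], str]       # server side: recognized text -> text reply
    links: Dict[str, str] = field(default_factory=dict)  # spoken command -> app name


class SpeechBrowser:
    """Hypothetical client-side browser; recognition is simulated with strings."""

    def __init__(self, web: Dict[str, SpeechApplication], start: str) -> None:
        self.web = web
        self.open(start)

    def open(self, name: str) -> None:
        # "Download" the application's grammar so that recognition is
        # tailored to the application currently being visited.
        self.app = self.web[name]
        self.active_phrases = set(self.app.grammar) | set(self.app.links)

    def recognize(self, utterance: str) -> Optional[str]:
        # Simulated recognizer: accept only phrases in the active grammar.
        phrase = " ".join(utterance.lower().split())
        return phrase if phrase in self.active_phrases else None

    def hear(self, utterance: str) -> str:
        phrase = self.recognize(utterance)
        if phrase is None:
            return "Sorry, I did not understand."
        if phrase in self.app.links:
            # A spoken command activates a hyperlink to another application.
            self.open(self.app.links[phrase])
            return f"Now talking to {self.app.name}."
        # Recognized text goes to the remote application, which replies in text.
        return self.app.answer(phrase)


def build_demo_web() -> Dict[str, SpeechApplication]:
    greeter = SpeechApplication(
        name="greeter",
        grammar=frozenset({"hello", "what can you do"}),
        answer=lambda p: "Hello." if p == "hello" else "I can greet you.",
        links={"go to counter": "counter"},
    )
    counter = SpeechApplication(
        name="counter",
        grammar=frozenset({"how many words is this"}),
        answer=lambda p: f"That was {len(p.split())} words.",
        links={"go back": "greeter"},
    )
    return {app.name: app for app in (greeter, counter)}


if __name__ == "__main__":
    browser = SpeechBrowser(build_demo_web(), "greeter")
    for said in ["Hello", "how many words is this", "go to counter",
                 "how many words is this"]:
        print(f"> {said}")
        print(browser.hear(said))
```

In this sketch the same utterance is rejected while the browser is visiting one application and accepted after following a link to another, because the active grammar changes with each application visited.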


External links


* MySpeechWeb, research group at the University of Windsor
* Video demonstration of Public Domain SpeechWeb