Semantic Interpretation for Speech Recognition
   HOME

TheInfoList



OR:

Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification (SRGS). Since 5 April 2007, it is a
World Wide Web Consortium The World Wide Web Consortium (W3C) is the main international standards organization for the World Wide Web. Founded in 1994 by Tim Berners-Lee, the consortium is made up of member organizations that maintain full-time staff working together in ...
recommendation.Semantic Interpretation for Speech Recognition (SISR) Version 1.0
/ref> By building upon SRGS grammars, it allows
voice browser {{Short description, Interactive voice user interface A voice browser is a Application software, software application that presents an interactive voice user interface to the user in a manner analogous to the functioning of a web browser interpretin ...
s via
ECMAScript ECMAScript (; ES) is a standard for scripting languages, including JavaScript, JScript, and ActionScript. It is best known as a JavaScript standard intended to ensure the interoperability of web pages across different web browsers. It is stan ...
to semantically interpret complex grammars and provide the information back to the application. For example, it allows utterances like "I would like a Coca-cola and three large pizzas with pepperoni and mushrooms." to be interpreted into an object that can be understood by an application. For example, the utterance could produce the following object named : If used against this grammar that includes SISR markup in addition to the standard SRGS grammar in XML format: I would like a out.drink = new Object(); out.drink.liquid=rules.drink.type; out.drink.drinksize=rules.drink.drinksize; and out.pizza=rules.pizza; coke pepsi coca colaout="coke"; out="medium"; smallout="small"; medium largeout="large"; regularout="medium"; out=new Array; out.push(rules.top); and out.push(rules.top); anchovies pepperoni mushroomout="mushrooms"; mushrooms out.drinksize=rules.foodsize; out.type=rules.kindofdrink; out.pizzasize=rules.foodsize; out.number=rules.number; pizzas with out.topping=rules.tops; out=1; a one twoout=2; threeout=3;


See also

*
VoiceXML VoiceXML (VXML) is a digital document standard for specifying interactive media and voice dialogs between humans and computers. It is used for developing audio and voice response applications, such as banking systems and automated customer service ...
*
SRGS Speech Recognition Grammar Specification (SRGS) is a W3C standard for how ''speech recognition grammars'' are specified. A speech recognition grammar is a set of word patterns, and tells a speech recognition system what to expect a human to say. ...


References


External links


SRGS Specification (W3C Recommendation)

SISR Specification (W3C Recommendation)

W3C's Voice Browser Working Group
{{W3C Standards World Wide Web Consortium standards XML-based standards