Semantic Interpretation for Speech Recognition
Encyclopedia
Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification
(SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation.
By building upon SRGS grammars it allows Voice browser
s via ECMAScript
to semantically interpret complex grammars and provide the information back to the application. For example, it allows utterances like "I would like a coca cola and three large pizzas with pepperoni and mushrooms." to be interpreted into an object that can be understood by an application. For example, the utterance could produce the following object named order:
{
drink: {
liquid:"coke",
drinksize:"medium"},
pizza: {
number: "3",
pizzasize: "large",
topping: [ "pepperoni", "mushrooms" ]
}
}
If used against this grammar that includes SISR markup in addition to the standard SRGS grammar in XML format:
Speech Recognition Grammar Specification
Speech Recognition Grammar Specification is a W3C standard for how speech recognition grammars are specified. A speech recognition grammar is a set of word patterns, and tells a speech recognition system what to expect a human to say...
(SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation.
By building upon SRGS grammars it allows Voice browser
Voice browser
A voice browser is a web browser that presents an interactive voice user interface to the user. In addition, it typically provides an interface to the PSTN or a PBX. Just as a visual web browser works with HTML pages, a voice browser operates on pages that specify voice dialogues...
s via ECMAScript
ECMAScript
ECMAScript is the scripting language standardized by Ecma International in the ECMA-262 specification and ISO/IEC 16262. The language is widely used for client-side scripting on the web, in the form of several well-known dialects such as JavaScript, JScript, and ActionScript.- History :JavaScript...
to semantically interpret complex grammars and provide the information back to the application. For example, it allows utterances like "I would like a coca cola and three large pizzas with pepperoni and mushrooms." to be interpreted into an object that can be understood by an application. For example, the utterance could produce the following object named order:
{
drink: {
liquid:"coke",
drinksize:"medium"},
pizza: {
number: "3",
pizzasize: "large",
topping: [ "pepperoni", "mushrooms" ]
}
}
If used against this grammar that includes SISR markup in addition to the standard SRGS grammar in XML format: