Semantic Interpretation for Speech Recognition

Semantic Interpretation for Speech Recognition (SISR) defines the syntax and semantics of annotations to grammar rules in the Speech Recognition Grammar Specification

Speech Recognition Grammar Specification

Speech Recognition Grammar Specification is a W3C standard for how speech recognition grammars are specified. A speech recognition grammar is a set of word patterns, and tells a speech recognition system what to expect a human to say...

(SRGS). Since 5 April 2007 it is currently a World Wide Web Consortium Recommendation.

By building upon SRGS grammars it allows Voice browser

Voice browser

A voice browser is a web browser that presents an interactive voice user interface to the user. In addition, it typically provides an interface to the PSTN or a PBX. Just as a visual web browser works with HTML pages, a voice browser operates on pages that specify voice dialogues...

s via ECMAScript

ECMAScript

ECMAScript is the scripting language standardized by Ecma International in the ECMA-262 specification and ISO/IEC 16262. The language is widely used for client-side scripting on the web, in the form of several well-known dialects such as JavaScript, JScript, and ActionScript.- History :JavaScript...

to semantically interpret complex grammars and provide the information back to the application. For example, it allows utterances like "I would like a coca cola and three large pizzas with pepperoni and mushrooms." to be interpreted into an object that can be understood by an application. For example, the utterance could produce the following object named order:

{
drink: {
liquid:"coke",
drinksize:"medium"},
pizza: {
number: "3",
pizzasize: "large",
topping: [ "pepperoni", "mushrooms" ]
}
}

If used against this grammar that includes SISR markup in addition to the standard SRGS grammar in XML format:

"http://www.w3.org/TR/speech-grammar/grammar.dtd">
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.w3.org/2001/06/grammar
http://www.w3.org/TR/speech-grammar/grammar.xsd"
version="1.0" mode="voice" tag-format="semantics/1.0" root="order">

I would like a

out.drink = new Object; out.drink.liquid=rules.drink.type;
out.drink.drinksize=rules.drink.drinksize;
and

out.pizza=rules.pizza;

coke
pepsi
coca colaout="coke";

out="medium";

smallout="small";
medium
largeout="large";
regularout="medium";

out=new Array;

out.push(rules.top);

and

out.push(rules.top);

anchovies
pepperoni
mushroomout="mushrooms";
mushrooms

out.drinksize=rules.foodsize; out.type=rules.kindofdrink;

out.pizzasize=rules.foodsize; out.number=rules.number;
pizzas with

out.topping=rules.tops;

out=1;

a
one

twoout=2;
threeout=3;

External links

The source of this article is wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.