NaturallySpeaking
Encyclopedia
Dragon NaturallySpeaking is a speech recognition
software package developed and sold by Nuance Communications
for Windows
personal computer
s. The most recent package is version 11.5, which supports 32-bit and 64-bit editions of Windows XP
, Vista
and 7. The Mac OS version is called Dragon Dictate.
as they are spoken (though there is an option to set this feature so it is not displayed to increase speed), and when the speaker pauses, the program transcribes
the words into the active window
at the location of the cursor (Dragon does not support dictating to background windows). The software has three primary areas of functionality: dictation, text-to-speech and command input. The user is able to dictate and have speech transcribed as written text, have a document synthesized as an audio stream, or issue commands that are recognized as such by the program. In addition, voice profiles can be accessed through different computers in a networked environment, although the audio hardware and configuration must be identical on both machines. The Professional version allows creation of custom commands to control programs or functions not built into NaturallySpeaking.
was first released for DOS
, and utilized hidden Markov models, which is a statistical method for the recognition of speech. At the time, the hardware was insufficiently powerful to address the problem of word segmentation, and DragonDictate was unable to determine the boundaries of words during continuous speech input. Users were forced to pronounce one word at a time, each clearly separated by a small pause. DragonDictate was based on a trigram
model, and is known as a discrete utterance speech recognition engine.
Dragon Systems released NaturallySpeaking 1.0 as their first continuous dictation product in 1997. The company was then purchased in June 2000 by Lernout & Hauspie
, a corporation that had been involved in financial scandals as reported by the New York Times. Following the bankruptcy of Lernout & Hauspie, the rights to the Dragon product line were acquired by ScanSoft. In 2005, ScanSoft launched a de facto acquisition of Nuance Communications
, and rebranded itself as Nuance
.
Speech recognition
Speech recognition converts spoken words to text. The term "voice recognition" is sometimes used to refer to recognition systems that must be trained to a particular speaker—as is the case for most desktop recognition software...
software package developed and sold by Nuance Communications
Nuance Communications
Nuance Communications is a multinational computer software technology corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications...
for Windows
Microsoft Windows
Microsoft Windows is a series of operating systems produced by Microsoft.Microsoft introduced an operating environment named Windows on November 20, 1985 as an add-on to MS-DOS in response to the growing interest in graphical user interfaces . Microsoft Windows came to dominate the world's personal...
personal computer
Personal computer
A personal computer is any general-purpose computer whose size, capabilities, and original sales price make it useful for individuals, and which is intended to be operated directly by an end-user with no intervening computer operator...
s. The most recent package is version 11.5, which supports 32-bit and 64-bit editions of Windows XP
Windows XP
Windows XP is an operating system produced by Microsoft for use on personal computers, including home and business desktops, laptops and media centers. First released to computer manufacturers on August 24, 2001, it is the second most popular version of Windows, based on installed user base...
, Vista
Windows Vista
Windows Vista is an operating system released in several variations developed by Microsoft for use on personal computers, including home and business desktops, laptops, tablet PCs, and media center PCs...
and 7. The Mac OS version is called Dragon Dictate.
Features
NaturallySpeaking utilizes a minimal user interface. As an example, dictated words appear in a floating tooltipTooltip
The tooltip or infotip is a common graphical user interface element. It is used in conjunction with a cursor, usually a mouse pointer. The user hovers the cursor over an item, without clicking it, and a tooltip may appear—a small "hover box" with information about the item being hovered...
as they are spoken (though there is an option to set this feature so it is not displayed to increase speed), and when the speaker pauses, the program transcribes
Transcription (linguistics)
Transcription in the linguistic sense is the systematic representation of language in written form. The source can either be utterances or preexisting text in another writing system, although some linguists only consider the former as transcription.Transcription should not be confused with...
the words into the active window
Active window
An active window is the currently focused window in the current window manager or explorer. Different window managers indicate the currently-active window in different ways and allow the user to switch between windows in different ways. For example, in Microsoft Windows, if both Notepad and...
at the location of the cursor (Dragon does not support dictating to background windows). The software has three primary areas of functionality: dictation, text-to-speech and command input. The user is able to dictate and have speech transcribed as written text, have a document synthesized as an audio stream, or issue commands that are recognized as such by the program. In addition, voice profiles can be accessed through different computers in a networked environment, although the audio hardware and configuration must be identical on both machines. The Professional version allows creation of custom commands to control programs or functions not built into NaturallySpeaking.
History
Drs. James and Janet Baker founded Dragon Systems in 1982 to release products centered around their voice recognition prototype. DragonDictateDragonDictate
DragonDictate and Dragon Dictate are proprietary speech recognition software. The older program, DragonDictate, was originally developed by Dragon Systems for Microsoft Windows. It has now been replaced by Dragon NaturallySpeaking for Windows, developed by Nuance Communications...
was first released for DOS
DOS
DOS, short for "Disk Operating System", is an acronym for several closely related operating systems that dominated the IBM PC compatible market between 1981 and 1995, or until about 2000 if one includes the partially DOS-based Microsoft Windows versions 95, 98, and Millennium Edition.Related...
, and utilized hidden Markov models, which is a statistical method for the recognition of speech. At the time, the hardware was insufficiently powerful to address the problem of word segmentation, and DragonDictate was unable to determine the boundaries of words during continuous speech input. Users were forced to pronounce one word at a time, each clearly separated by a small pause. DragonDictate was based on a trigram
Trigram
Trigrams are a special case of the N-gram, where N is 3. They are often used in natural language processing for doing statistical analysis of texts.-Frequency:The 16 most common trigrams in English are:-Examples:...
model, and is known as a discrete utterance speech recognition engine.
Dragon Systems released NaturallySpeaking 1.0 as their first continuous dictation product in 1997. The company was then purchased in June 2000 by Lernout & Hauspie
Lernout & Hauspie
Lernout & Hauspie Speech Products, or L&H, was a leading Belgium-based speech recognition technology company, founded by Jo Lernout and Pol Hauspie, that went bankrupt in 2001...
, a corporation that had been involved in financial scandals as reported by the New York Times. Following the bankruptcy of Lernout & Hauspie, the rights to the Dragon product line were acquired by ScanSoft. In 2005, ScanSoft launched a de facto acquisition of Nuance Communications
Nuance Communications
Nuance Communications is a multinational computer software technology corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications...
, and rebranded itself as Nuance
Nuance Communications
Nuance Communications is a multinational computer software technology corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications...
.
Versions
Version | Release date | Editions | Operating Systems Supported |
---|---|---|---|
1.0 | June 1997 | Personal | Windows 95, NT 4.0. |
2.0 | November 1997 | Standard, Preferred, Deluxe | Windows 95, NT 4.0. |
3.0 | October 1998 | Point & Speak, Standard, Preferred, Professional (with optional Legal and Medical add-on products) | Windows 95, 98, NT 4.0. |
4.0 | August 4, 1999 | Essentials, Standard, Preferred, Professional, Legal, Medical, Mobile | Windows 95, 98, NT 4.0 SP3+. |
5.0 | August 2000 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98, Me, NT 4.0 SP6+, 2000. |
6.0 | November 15, 2001 | Essentials, Standard, Preferred, Professional, Legal, Medical | |
7.0 | March 2003 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 98SE, Me, NT4 SP6+, 2000, XP. |
8.0 | November 2004 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows Me (Only Standard and Preferred editions), Windows 2000 SP4+, Windows XP SP1+. |
9.0 | July 2006 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server, | Windows 2000 SP4+, XP SP1+. |
9.5 | January 2007 | Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server | Windows 2000 SP4+, XP SP1+, Vista (32-bit). |
10.0 | August 7, 2008 | Essentials, Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit and 64-bit), Windows 7 (32 and 64-bit). Server 2003. |
10.1 | March 2009 | Standard, Preferred, Professional, Legal, Medical | Windows 2000 SP4+, XP SP2+ (32-bit), Vista (32-bit and 64-bit), Windows 7 (32 and 64-bit). Server 2003. |
11.0 | August 2010 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |
11.5 | June 2011 | Home, Premium, Professional, Legal | Windows XP SP2+ (32-bit), Vista SP1+ (32-bit and 64-bit), 7 (32 and 64-bit). Server 2003, 2008. |