Abstract of speech recognition

Introduction

Physiological Characteristics

Behavioral Characteristic

Biometrics are automated methods of recognizing a person based on a physiological or behavioral characteristic.

Physiological characteristics are related with the shape of the body.

Behavioral charcteristics are related with behavior of a person included but not limited to voice recognition.

IQBALReg # 9952MBA(M) – Section A

Speech Recognition Simply is the process of

converting spoken input to text. It is also known as Speech-to-Text and Voice

Recognition. Technically Speech recognition is the process of

converting an acoustic signal, captured by a microphone or a telephone, to a set of words.

Dragon Naturally Speaking developed and acquired by Dragon Systems and Nuance Communications respectively.

Microsoft Speech Recognition by Microsoft.

Via Voice by IBM

NUANCE COMMUNICATIONS:-

This Nuance Communications is a multinational computer software technology

corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications.

http://en.wikipedia.org/wiki/Multinational_corporation

http://en.wikipedia.org/wiki/Computer

http://en.wikipedia.org/wiki/Software

Current business products focus on server & embedded speech recognition, telephone call steering systems, automated telephone directory services, medical transcription software & systems, optical character recognition software, and desktop imaging software.

ScanSoft and Nuance merged in October 2005; before the merger, the two companies competed in the commercial large scale speech application business.

Nuance was founded in 1994 as a spinoff of SRI International's Speech Technology and Research (STAR) Laboratory to commercialise the speaker-independent speech recognition technology developed for the US government at SRI.

Based in Menlo Park, California, Nuance deployed their first commercial large-scale speech application in 1996.

1994 – Nuance spun off from SRI's STAR Lab.

1996 – Nuance deployed its first commercial speech application.

2000 April 13 – Nuance files initial public offering on the Nasdaq under the symbol NUANE

Dragon speech recognition software is a Naturally Speaking Language.

This software has three primary features of functionality.

Dictation Text-To-Speech Command Input

Dictation As user dictates the words it will converts it

into text and it displays. Text-To-Speech And as text what is present or selected can be

converted to speech. Command Input User can control the operations by means of

his voice without using keyboard by just giving commands.

TRANSLATION It cannot translate from one language

to another language here comes translation problem.

UNTRAINED It cannot work without training ,training

is required,dynamic acceptance is not present.

PLATFORM DEPENDENT It cannot work on another platforms

other than windows like mac o.s,ubuntu etc.

• To develop a translation feature in near future to spread the availabilty of product to all type of users.

• To make the system platform independent.

• Home AutomationThere is a lot of interest in the use of SR in domestic appliances such as ovens, refrigerators, dishwashers and washing machines.

• Wearable ComputersThe most futuristic application is in the use and functionality of wearable computers.

The most futuristic application is in the use and functionality of wearable computers. These would allow people to go about their everyday lives, but still store information (thoughts, notes, to-do lists) verbally, or communicate via email, phone or videophone, through wearable devices. Crucially, this would be done without having to interact with the device, or even remember that it is there; the user would just speak, the device would know what to do with the speech, and would carry out the appropriate task.

• People with DisabilitiesSpeech recognition technology helps people with disabilities interact with computers more easily. People with motor limitations, who cannot use a standard keyboard and mouse, can use their voices to navigate the computer and create documents.

• Dyslexic PeopleSpeech Recognition Technology is helpful for people with learning disabilities, who experience difficulty with spelling and writing.

Speech to text module

Command Input module Input predefined executecommand commands

command define command |

Sound Cards soundcard with the cleanest A/D (Analog

to Digital) conversions are recommended. Microphone The best choice for microphone is the

headset style.

Computers / Processors The more the speed the better Speech

Recognition would work. For good Speech Recognition you should be having 1 GHz processor and 1 GB of RAM.

Windows Operating System(NT,XP,7,8).

Audio Driver Software

As for a bussiness like online shopping,organisations like amazon etc have separate dept for replying to customers in that place of replying e-mails this can be used to minimisation of time.

Cost required for developing the product is more.

Time required for developing the product is medium.

• Speech recognition will revolutionize the way people conduct business over the Web and will, ultimately, differentiate world-class e-businesses. VoiceXML ties speech recognition and telephony together and provides the technology with which businesses can develop and deploy voice-enabled Web solutions TODAY!

These solutions can greatly expand the accessibility of Web-based self-service transactions to customers who would otherwise not have access, and, at the same time, leverage a business’ existing Web investments.

Speech recognition and VoiceXML clearly represent the next wave of the Web. In near future people will be using their home and business computers by speech not by keyboard or mouse. Home automation will be completely based on speech recognition system.

Abstract of speech recognition

Technology

Transcript of Abstract of speech recognition