Abstract of speech recognition
-
Upload
vinay-jaisriram -
Category
Technology
-
view
1.448 -
download
1
description
Transcript of Abstract of speech recognition
![Page 1: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/1.jpg)
![Page 2: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/2.jpg)
Introduction
Physiological Characteristics
Behavioral Characteristic
![Page 3: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/3.jpg)
Biometrics are automated methods of recognizing a person based on a physiological or behavioral characteristic.
Physiological characteristics are related with the shape of the body.
Behavioral charcteristics are related with behavior of a person included but not limited to voice recognition.
![Page 4: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/4.jpg)
![Page 5: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/5.jpg)
IQBALReg # 9952MBA(M) – Section A
![Page 6: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/6.jpg)
![Page 7: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/7.jpg)
Speech Recognition Simply is the process of
converting spoken input to text. It is also known as Speech-to-Text and Voice
Recognition. Technically Speech recognition is the process of
converting an acoustic signal, captured by a microphone or a telephone, to a set of words.
![Page 8: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/8.jpg)
Dragon Naturally Speaking developed and acquired by Dragon Systems and Nuance Communications respectively.
![Page 9: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/9.jpg)
Microsoft Speech Recognition by Microsoft.
Via Voice by IBM
![Page 10: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/10.jpg)
NUANCE COMMUNICATIONS:-
This Nuance Communications is a multinational computer software technology
corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications.
![Page 11: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/11.jpg)
Current business products focus on server & embedded speech recognition, telephone call steering systems, automated telephone directory services, medical transcription software & systems, optical character recognition software, and desktop imaging software.
ScanSoft and Nuance merged in October 2005; before the merger, the two companies competed in the commercial large scale speech application business.
![Page 12: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/12.jpg)
Nuance was founded in 1994 as a spinoff of SRI International's Speech Technology and Research (STAR) Laboratory to commercialise the speaker-independent speech recognition technology developed for the US government at SRI.
Based in Menlo Park, California, Nuance deployed their first commercial large-scale speech application in 1996.
![Page 13: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/13.jpg)
1994 – Nuance spun off from SRI's STAR Lab.
1996 – Nuance deployed its first commercial speech application.
2000 April 13 – Nuance files initial public offering on the Nasdaq under the symbol NUANE
![Page 14: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/14.jpg)
Dragon speech recognition software is a Naturally Speaking Language.
This software has three primary features of functionality.
Dictation Text-To-Speech Command Input
![Page 15: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/15.jpg)
Dictation As user dictates the words it will converts it
into text and it displays. Text-To-Speech And as text what is present or selected can be
converted to speech. Command Input User can control the operations by means of
his voice without using keyboard by just giving commands.
![Page 16: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/16.jpg)
TRANSLATION It cannot translate from one language
to another language here comes translation problem.
UNTRAINED It cannot work without training ,training
is required,dynamic acceptance is not present.
![Page 17: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/17.jpg)
PLATFORM DEPENDENT It cannot work on another platforms
other than windows like mac o.s,ubuntu etc.
![Page 18: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/18.jpg)
• To develop a translation feature in near future to spread the availabilty of product to all type of users.
• To make the system platform independent.
![Page 19: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/19.jpg)
• Home AutomationThere is a lot of interest in the use of SR in domestic appliances such as ovens, refrigerators, dishwashers and washing machines.
• Wearable ComputersThe most futuristic application is in the use and functionality of wearable computers.
![Page 20: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/20.jpg)
The most futuristic application is in the use and functionality of wearable computers. These would allow people to go about their everyday lives, but still store information (thoughts, notes, to-do lists) verbally, or communicate via email, phone or videophone, through wearable devices. Crucially, this would be done without having to interact with the device, or even remember that it is there; the user would just speak, the device would know what to do with the speech, and would carry out the appropriate task.
![Page 21: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/21.jpg)
• People with DisabilitiesSpeech recognition technology helps people with disabilities interact with computers more easily. People with motor limitations, who cannot use a standard keyboard and mouse, can use their voices to navigate the computer and create documents.
• Dyslexic PeopleSpeech Recognition Technology is helpful for people with learning disabilities, who experience difficulty with spelling and writing.
![Page 22: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/22.jpg)
Speech to text module
![Page 23: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/23.jpg)
Command Input module Input predefined executecommand commands
command define command |
![Page 24: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/24.jpg)
Sound Cards soundcard with the cleanest A/D (Analog
to Digital) conversions are recommended. Microphone The best choice for microphone is the
headset style.
![Page 25: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/25.jpg)
Computers / Processors The more the speed the better Speech
Recognition would work. For good Speech Recognition you should be having 1 GHz processor and 1 GB of RAM.
![Page 26: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/26.jpg)
Windows Operating System(NT,XP,7,8).
Audio Driver Software
![Page 27: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/27.jpg)
As for a bussiness like online shopping,organisations like amazon etc have separate dept for replying to customers in that place of replying e-mails this can be used to minimisation of time.
Cost required for developing the product is more.
Time required for developing the product is medium.
![Page 28: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/28.jpg)
• Speech recognition will revolutionize the way people conduct business over the Web and will, ultimately, differentiate world-class e-businesses. VoiceXML ties speech recognition and telephony together and provides the technology with which businesses can develop and deploy voice-enabled Web solutions TODAY!
![Page 29: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/29.jpg)
These solutions can greatly expand the accessibility of Web-based self-service transactions to customers who would otherwise not have access, and, at the same time, leverage a business’ existing Web investments.
![Page 30: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/30.jpg)
Speech recognition and VoiceXML clearly represent the next wave of the Web. In near future people will be using their home and business computers by speech not by keyboard or mouse. Home automation will be completely based on speech recognition system.
![Page 31: Abstract of speech recognition](https://reader035.fdocuments.net/reader035/viewer/2022081414/54bde70e4a79596f188b4583/html5/thumbnails/31.jpg)