rospeex: a cloud-based speech communication toolkit for ROS


Komei Sugiura
National Institute of Information and Communications Technology, Japan
komei.sugiura@nict.go.jp

2013/12/13

ROS (Robot Operating System)

• ROS: middleware for robots
– Version 1.0 released in 2010
– Global de facto standard
– From driver and package management to learning and visualization

Speech communication toolkit for ROS

• ROS compatible
• Speech recognition using VoiceTra engine
• Other functionalities
– Noise reduction, non-monologue speech synthesis

                              Conventional packages                             rospeex
Speech recognition/synthesis  Sphinx, Festival, Julius (or commercial tools)    VoiceTra engine (or third-party engines)
Engine                        Stand-alone                                       Cloud-based
Language                      Single language                                   ja, en, zh, ko

Position in Cloud Robotics

• Cloud robotics [James Kuffner@Google, 2011]
– Manipulation using Google Goggles [Kehoe+ 2013]
– Knowledge sharing based on RoboEarth [Tenorth+ 2012]
– Speech communication for robots: rospeex

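[Figure: positioning of speech toolkits by deployment type and robot middleware compatibility]

             Robot middleware incompatible                               Robot middleware compatible
Cloud-based  Commercial systems (Nuance, ToSpeak, AmiVoice Cloud, ...)   rospeex
Stand-alone  Many                                                        OpenHRI, HARK, PocketSphinx, Festival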

Quadrilingual communication using rospeex

rospeex provides speech recognition and synthesis; the user constructs the dialogue processing

[Figure: system architecture. Blocks: speech input; speech module; noise reduction; speech recognition; dialogue processing; task manager; speech synthesis; speech output; speech recognition & synthesis servers; input from other modules (sensors, recognized objects, etc.); output to other modules (actuators, learning, etc.). Legend: provided by the user; provided by rospeex; provided by third parties.]
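The division of labor on this slide (rospeex handles recognition and synthesis, the user writes the dialogue processing) can be illustrated with a small sketch. Everything here is hypothetical: `process_utterance` and `run_dialogue` are illustrative names, the rules are toy examples, and `recognize` and `synthesize` are caller-supplied stand-ins for the speech functions, not the rospeex API.

```python
import datetime


def process_utterance(text, now=None):
    """Map recognized text to a reply string, or None if there is nothing to say.

    Hypothetical rule-based dialogue logic written by the user. A real system
    could also use input from other modules (sensors, recognized objects, etc.).
    """
    now = now or datetime.datetime.now()
    normalized = text.strip().lower()
    if not normalized:
        return None
    if "what time" in normalized:
        return "It is {:02d}:{:02d}.".format(now.hour, now.minute)
    if normalized in ("hello", "hi"):
        return "Hello."
    return "Sorry, I did not understand."


def run_dialogue(recognize, synthesize, audio):
    """Run one turn: speech input -> recognition -> dialogue -> synthesis.

    `recognize` and `synthesize` are placeholders (assumptions) for whatever
    speech recognition and synthesis functions the toolkit exposes.
    """
    reply = process_utterance(recognize(audio))
    return synthesize(reply) if reply is not None else None
```

Keeping the dialogue logic as a plain function of text makes it testable without a microphone, a network connection, or a running ROS system.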

Non-monologue speech synthesis for robots

• Reading-style robot voice
– Monotonous, unnatural and unfriendly
– Hard to tell that the robot is asking a question
• Conventional text-to-speech (TTS) systems are not optimized for communication

Voice talent

XIMERA 3 (Text reading)

Demo: http://komeisugiura.jp/software/nm_tts.html

Using speech recognition/synthesis without ROS

• Send JSON file to the server
– Recognition
– Synthesis
• Sample code (JavaScript, Python, C++) is available

Synthesis (non-monologue speech synthesis)

http://rospeex.ucri.jgn-x.jp/nauth_json/jsServices/VoiceTraSS

    { "method": "speak",
      "params": ["ja", "こんにちは", "*", "audio/x-wav"] }

Recognition

http://rospeex.ucri.jgn-x.jp/nauth_json/jsServices/VoiceTraSR

    { "method": "recognize",
      "params": ["ja",
                 { "audio": "base64-encoded wav",
                   "audioType": "audio/x-wav",
                   "voiceType": "*" }] }
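As a rough illustration of the two requests on this slide, the sketch below builds the JSON bodies in Python. The helper names (`build_speak_request`, `build_recognize_request`, `post_json`) are hypothetical. The use of an HTTP POST with a JSON content type is an assumption, and the response format is not shown on the slide, so the raw bytes are returned unparsed. The endpoints are copied from the slide and may no longer be reachable.

```python
import base64
import json
import urllib.request

# Endpoints as printed on the slide; availability is not guaranteed.
SYNTHESIS_URL = "http://rospeex.ucri.jgn-x.jp/nauth_json/jsServices/VoiceTraSS"
RECOGNITION_URL = "http://rospeex.ucri.jgn-x.jp/nauth_json/jsServices/VoiceTraSR"

# Language codes listed for rospeex in the comparison table.
LANGUAGES = ("ja", "en", "zh", "ko")


def build_speak_request(text, language="ja"):
    """Return the JSON body for a synthesis ("speak") request."""
    if language not in LANGUAGES:
        raise ValueError("unsupported language: " + language)
    return {"method": "speak", "params": [language, text, "*", "audio/x-wav"]}


def build_recognize_request(wav_bytes, language="ja"):
    """Return the JSON body for a recognition ("recognize") request."""
    if language not in LANGUAGES:
        raise ValueError("unsupported language: " + language)
    audio = base64.b64encode(wav_bytes).decode("ascii")
    return {
        "method": "recognize",
        "params": [
            language,
            {"audio": audio, "audioType": "audio/x-wav", "voiceType": "*"},
        ],
    }


def post_json(url, body, timeout=10.0):
    """POST a JSON body and return the raw response bytes.

    Assumption: the server accepts a plain HTTP POST with a JSON payload.
    """
    data = json.dumps(body).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

A synthesis call would then look like `post_json(SYNTHESIS_URL, build_speak_request("こんにちは"))`, and a recognition call would pass the bytes of a WAV file to `build_recognize_request`.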
