SpeeG - A Multimodal Speech- and Gesture-based Text Input Solution

Lode Hoste, Bruno Dumas and Beat Signer

SpeeG - Lode Hoste, Vrije Universiteit Brussel

Text-input for set-top boxes

- Dasher
- 8Pen
- SwiftKey
- Speech Dasher
- SpeeG
- EdgeWriter
- Chatpad Controller
- Virtual Keyboard for Xbox
- 1D Keyboard for Kinect

Dasher

- Continuous input
- Joystick / Gaze / ...
- Open vocabulary
- Allows imprecise navigation

Goals: controller-free text input without training

Used technologies: Kinect, CMU Sphinx, Dasher

SpeeG Architecture

GUI (JDasher)

Speech Recogniser (CMU Sphinx 4)

Hand Tracking (Microsoft Kinect and NITE)

Evaluation

Input methods: SpeeG, Speech-only, Virtual Keyboard, Kinect Keyboard

Participants: 7 users (all male), aged 23-31

Sentences 1-3 (DARPA's TIMIT): “this was easy for us”, “he will allow a rare lie”, “did you eat yet”

Sentences 4-6 (MacKenzie and Soukoreff): “my watch fell in the water”, “the world is a stage”, “peek out the window”

Performed a quantitative (words per minute and number of errors) and qualitative (feedback and preference) evaluation
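The slides report words per minute (WPM) and number of errors but do not say how either was computed. The sketch below assumes the common text-entry convention of five characters per word and counts errors as a word-level edit distance between the target sentence and the entered text. Both function names and the timing value in the usage note are hypothetical, and the study may have used a different error measure.

```python
def words_per_minute(transcribed: str, seconds: float) -> float:
    """WPM under the assumed convention of 5 characters per word."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # The first character carries no timing information, hence length - 1.
    chars = max(len(transcribed) - 1, 0)
    return (chars / 5.0) * (60.0 / seconds)


def word_errors(target: str, transcribed: str) -> int:
    """Word-level Levenshtein distance (insertions, deletions, substitutions)."""
    ref, hyp = target.split(), transcribed.split()
    prev = list(range(len(hyp) + 1))  # distance from empty target prefix
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]
```

For instance, entering “my watch fell in the water” (26 characters) in a hypothetical 30 seconds gives `words_per_minute(...)` of 10.0, and entering “the word is a stage” for the target “the world is a stage” gives one word error.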


Virtual keyboard: 6.3 WPM (chart of Users 1-7 across sentences S1-S6)

Kinect Keyboard: 1.83 WPM (chart of Users 1-7 across sentences S1-S6)

Speech-only: 11 WPM (chart of Users 1-7 across sentences S1-S6)

5.8 WPM (chart of Users 1-7 across sentences S1-S6)

2.6 7.8 WPM (chart of Users 1-7 across sentences S1-S6)

Mean WPM per sentence and input device (chart across sentences S1-S6; series: Controller, Speech only, Kinect only)

Errors per sentence and input device (chart across sentences S1-S6; series: Controller, Speech only, Kinect only, SpeeG)

Future work

- Other visualisations
- Smaller gestures
- Dedicated commands (gesture / voice)

Kinect

- Controller-free text input
- Real-time correction
- Dasher, zoomable interface
- probabilities
- alphabetic order
- character-level
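The summary describes Dasher as a zoomable, character-level interface driven by probabilities and laid out in alphabetic order. A minimal sketch of that idea follows: each candidate character receives a slice of a vertical interval proportional to its probability, slices are stacked alphabetically, and a pointer position selects the slice it falls into. The function names, the probability values in the usage note, and the mapping of a tracked hand to a single normalised coordinate are illustrative assumptions, not the JDasher API.

```python
from typing import Dict, Optional, Tuple

Interval = Tuple[float, float]


def allocate_intervals(probs: Dict[str, float],
                       low: float = 0.0,
                       high: float = 1.0) -> Dict[str, Interval]:
    """Split [low, high) among characters, alphabetically, sized by probability."""
    total = sum(probs.values())
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    intervals: Dict[str, Interval] = {}
    start = low
    for ch in sorted(probs):  # alphabetic order
        width = (high - low) * probs[ch] / total
        intervals[ch] = (start, start + width)
        start += width
    return intervals


def pick(intervals: Dict[str, Interval], position: float) -> Optional[str]:
    """Return the character whose slice contains the pointer position."""
    for ch, (lo, hi) in intervals.items():
        if lo <= position < hi:
            return ch
    return None
```

With hypothetical probabilities `{"a": 0.5, "b": 0.25, "c": 0.25}`, "a" covers the first half of the interval, so likely characters are larger targets and tolerate imprecise pointing. Zooming into a chosen slice amounts to calling `allocate_intervals` again with that slice as the new `low` and `high`.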

Speech

- Non-native speakers
- Untrained voice recogniser
- 6-12 WPM
- Perceived fastest
- Game-like character
- Novice and experts

Special thanks to Jorn De Baerdemaeker and Keith Vertanen
