A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A...

82
Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school, 05 Aug 2016 Tsung-Hsien (Shawn) Wen

Transcript of A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A...

Page 1: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Dialogue Systems Group

A Network-based End-to-End Trainable Task-oriented Dialogue System

Deep Learning Summer school, 05 Aug 2016

Tsung-Hsien (Shawn) Wen

Page 2: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

2

Page 3: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

3

Page 4: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Page 5: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Help the user to accomplish domain tasks

Page 6: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Help the user to accomplish domain tasks

Domain specific, hard to collect data

Page 7: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Help the user to accomplish domain tasks

Domain specific, hard to collect data

Current Systems

Modular, highly handcrafted, restricted ability

Page 8: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Help the user to accomplish domain tasks

Domain specific, hard to collect data

Current Systems

Modular, highly handcrafted, restricted ability

Can we train a dialogue system on a small dataset w/ a minimal amount of handcrafting?

Page 9: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The two paradigms4

Goal-oriented Dialogue Systems

Help the user to accomplish domain tasks

Domain specific, hard to collect data

Current Systems

Modular, highly handcrafted, restricted ability

Can we train a dialogue system on a small dataset w/ a minimal amount of handcrafting?

How can we collect data to train this model?

Page 10: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

5

Page 11: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Traditional Dialogue Systems6

Language Understanding

DialogueManager

DB

Ontology

Dialogue System

Language Generation

text

text

Page 12: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Neural Dialogue Systems7

DB

Ontology

Neural Dialogue System

text

text

Page 13: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Can I have <v.food>

Korean 0.7British 0.2French 0.1

Belief Tracker

0 0 0 … 0 1

MySQL query:“Select * where food=Korean”

Database Operator

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

Policy NetworkCopy field

Database

Seven d

ays

Cu

rry Prin

ce

Nirala

Ro

yal Stand

ard

Little Seuo

l

DB pointerxt

zt

pt

qt

Page 14: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

zt

Seq2Seq

Page 15: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Can I have <v.food>

Korean 0.7British 0.2French 0.1

Belief Tracker

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

zt

pt

Language Grounding

Page 16: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Can I have <v.food>

Korean 0.7British 0.2French 0.1

Belief Tracker

0 0 0 … 0 1

MySQL query:“Select * where food=Korean”

Database Operator

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

Database

Seven d

ays

Cu

rry Prin

ce

Nirala

Ro

yal Stand

ard

Little Seuo

l

DB pointerxt

zt

pt

qt

Database Accessing

Page 17: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Can I have <v.food>

Korean 0.7British 0.2French 0.1

Belief Tracker

0 0 0 … 0 1

MySQL query:“Select * where food=Korean”

Database Operator

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

Policy Network

Database

Seven d

ays

Cu

rry Prin

ce

Nirala

Ro

yal Stand

ard

Little Seuo

l

DB pointerxt

zt

pt

qt

Decision Making

Page 18: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Can I have <v.food>

Korean 0.7British 0.2French 0.1

Belief Tracker

0 0 0 … 0 1

MySQL query:“Select * where food=Korean”

Database Operator

Intent Network

Can I have <v.food>

Generation Network

<v.name> serves great <v.food> .

Policy NetworkCopy field

Database

Seven d

ays

Cu

rry Prin

ce

Nirala

Ro

yal Stand

ard

Little Seuo

l

DB pointerxt

zt

pt

qt

Page 19: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

14

Page 20: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Page 21: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Randomly hire a worker to be user/wizard.

Page 22: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Randomly hire a worker to be user/wizard.

Task: Enter an appropriate response for ONE TURN.

Page 23: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Randomly hire a worker to be user/wizard.

Task: Enter an appropriate response for ONE TURN.

Repeat the process until all dialogues are finished.

Page 24: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Randomly hire a worker to be user/wizard.

Task: Enter an appropriate response for ONE TURN.

Repeat the process until all dialogues are finished.

Advantage:

Page 25: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection15

Online parallel version of WOZ on MTurk

Randomly hire a worker to be user/wizard.

Task: Enter an appropriate response for ONE TURN.

Repeat the process until all dialogues are finished.

Advantage:

Avoid latency, parallelisable, cheap

Page 26: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

Page 27: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

Hi, I want a cheap Korean restaurant.

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

Page 28: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

Hi, I want a cheap Korean restaurant.

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

1

Page 29: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

Hi, I want a cheap Korean restaurant.

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

1

What user wants?

Food Korean

Price Cheap

Area N/A

Page 30: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

Hi, I want a cheap Korean restaurant.

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

1

What user wants?

Food Korean

Price Cheap

Area N/A

Search Table

Little Seoul …

Best Korea …

Page 31: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection16

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

1

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

1

What user wants?

Food Korean

Price Cheap

Area N/A

Search Table

Little Seoul …

Best Korea …

Page 32: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection17

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

2

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

Page 33: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection17

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

2

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

2

Page 34: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection17

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

2

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

2

What user wants?

Food Korean

Price Cheap

Area North

Page 35: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection17

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

2

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

2

What user wants?

Food Korean

Price Cheap

Area North

Search Table

Little Seoul …

Page 36: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection17

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

Little Seoul is nice one in the north.2

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

2

What user wants?

Food Korean

Price Cheap

Area North

Search Table

Little Seoul …

Page 37: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection18

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

Little Seoul is nice one in the north.

Its phone number is 01223456789.

What is the phone number?

Thank you very much, good bye.

Thank you for using the system.

4

Task: Find a restaurant, cheap, Korean, NorthAsk phone number

4

What user wants?

Food Korean

Price Cheap

Area North

Search Table

Little Seoul

Page 38: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Wizard of Oz Data Collection19

Hi, I want a cheap Korean restaurant.

What area are you looking for ?

Somewhere in the north.

Little Seoul is nice one in the north.

Its phone number is 01223456789.

What is the phone number?

Thank you very much, good bye.

Thank you for using the system.

What user wants?

Food Korean

Price Cheap

Area North

Page 39: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Page 40: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

Page 41: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

3 informable slots: area, price range, food type

3 requestable slots: address, phone, postcode

Page 42: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

3 informable slots: area, price range, food type

3 requestable slots: address, phone, postcode

Dataset

Page 43: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

3 informable slots: area, price range, food type

3 requestable slots: address, phone, postcode

Dataset

676 dialogues, ~2750 turns

Page 44: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

3 informable slots: area, price range, food type

3 requestable slots: address, phone, postcode

Dataset

676 dialogues, ~2750 turns

3000 HITS, takes 3 days, costs ~400 USD

Page 45: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Data Statistics20

Ontology:

Cambridge restaurant domain, 99 venues.

3 informable slots: area, price range, food type

3 requestable slots: address, phone, postcode

Dataset

676 dialogues, ~2750 turns

3000 HITS, takes 3 days, costs ~400 USD

Data cleaning takes 2-3 days for one person

Page 46: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

21

Page 47: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Experiments22

Experimental details

Train/valid/test: 3/1/1

SGD, l2 regularisation, early stopping, gradient clip=1

Hidden size = 50, Vocab size: ~500

Page 48: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Experiments22

Experimental details

Train/valid/test: 3/1/1

SGD, l2 regularisation, early stopping, gradient clip=1

Hidden size = 50, Vocab size: ~500

Two stage training:

Training trackers with label cross entropy

Training other parts with response cross entropy

Page 49: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Experiments22

Experimental details

Train/valid/test: 3/1/1

SGD, l2 regularisation, early stopping, gradient clip=1

Hidden size = 50, Vocab size: ~500

Two stage training:

Training trackers with label cross entropy

Training other parts with response cross entropy

Decoding

Beam search w/ beam width 10

Decode with average word likelihood

Page 50: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Human evaluation23

System Comparison

Page 51: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Example dialogues24

Page 52: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Example dialogues24

Page 53: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Example dialogues24

Page 54: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Visualising action embedding25

Page 55: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Outline

Intro

Neural Dialogue System

Wizard-of-Oz Data Collection

Experiments

Conclusion

26

Page 56: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Conclusion

An end-to-end trainable task-oriented dialogue system architecture is proposed.

27

Page 57: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Conclusion

An end-to-end trainable task-oriented dialogue system architecture is proposed.

A complementary WOZ data collection is also proposed (no latency, parallel, cheap).

27

Page 58: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Conclusion

An end-to-end trainable task-oriented dialogue system architecture is proposed.

A complementary WOZ data collection is also proposed (no latency, parallel, cheap).

Results show that it can learn from human-human conversations and help users to complete tasks.

27

Page 59: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

The paper

Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic,Lina M.R. Barahona, Pei-Hao Su, Stefan Ultes, and SteveYoung. A Network-based End-to-End Trainable Task-orientedDialogue System. arXiv preprint: 1604.04562 2016.

28

Page 60: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Dialogue Systems Group

Thank you! Questions?

Tsung-Hsien Wen is supported by a studentship funded by Toshiba Research Europe Ltd, Cambridge Research Laboratory

Page 61: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Response Generation Task30

Model Match (%) Success (%) BLEU

Seq2Seq [Sutskever et al, 2014] - - 0.1718

HRED [Serban et al, 2015] - - 0.1861

Our model w/o req. trackers 89.70 30.60 0.1799

Our full model 86.34 75.16 0.2313

Our full model + attention 90.88 80.02 0.2388

Page 62: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Example dialogues31

Page 63: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Example dialogues31

Page 64: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

User turn t System turn t-1

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

Jordan RNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

32

Page 65: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

<nil>

I

want

Korean

food

<nil>

Jordan RNN-CNN belief trackers

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

British French Korean … Chinese

33

Page 66: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Page 67: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Memorisethe delex. position

Page 68: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv.

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Page 69: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv.

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Pad zeros to have the

same length

Page 70: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv.

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Page 71: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv.

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Page 72: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

British French Korean … Chinese

33

Page 73: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

33

Page 74: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3

33

Page 75: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3

33

Slot-specific delex. ngram

feature

Page 76: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3

33

Slot-specific delex. ngram

feature

Value-specific delex. ngramplaceholder

Page 77: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3 2.3

33

Page 78: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3 2.3 9.7

33

Page 79: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3 2.3 9.7

33

Value-specific delex. ngram

feature

Page 80: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese

1.3 2.3 9.7 1.2

33

Page 81: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese.01 .02 .85 .01

33

Page 82: A Network-based End-to-End Trainable Task-oriented ...€¦ · Dialogue Systems Group A Network-based End-to-End Trainable Task-oriented Dialogue System Deep Learning Summer school,

Jordan RNN-CNN belief trackers

1st conv. 2nd conv. 3rd conv. max-pool avg-pool

Turn tInput layer

Output layer

Hidden layer

Delexicalised CNN

<nil>

I

want

v.food

s.food

<nil>

sentence representation

British French Korean … Chinese.01 .02 .85 .01

33

[Henderson et al, 2014]