New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end...

1

Transcript of New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end...

Page 1: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-

Sequence Architectures

Wenqiang Lei

Page 2: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Traditional Pipeline Designs for Task-oriented Dialogue System

• Intent classifier– Booking restaurants etc.

• Belief tracker• Policy maker• Dialogue generator

Page 3: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

• Complex belief trackers

• Fragility

• Templated response

Problems of Traditional Pipeline Designs

3

Page 4: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

An End-to-end Solution• Intent classifier

– Booking restaurants etc.

• Belief tracker• Policy maker• Response generator

An End-to-endTrainable Dialogue

System (NDM) (Wen et al., 2017b)

Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young. 2017b. A network-based end-to-end trainable task-oriented dialogue system. EACL .

Page 5: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

• Complex belief trackers

• Pre-trained Belief Tracker

• Fragility

• Templated response

Some Problems Still Remains in NDM

5

Page 6: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Complex Belief Tracker In NDM

Food style Price range Open hour …

Chinese food Expensive Before 11:00 pm …

Japanese food Cheap … …

French food … … …

… … … …

… ... … …

Requiring address? Requiring phone number? Requiring name? …

Yes Yes Yes …

No No Know …

• Informable slots

• Requestable slots

Page 7: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf>

<Req>Address</Req>

Page 8: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf>

<Req>Address</Req>

Page 9: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf>

<Req>Address</Req>

Page 10: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf>

<Req>Address</Req>

Page 11: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf>

<Req>Address</Req>

Page 12: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Solution• Belief span

– <Inf>Italian;Cheap</Inf><Req>Address</Req>

• Notation– Bt: belief span– Ut: user utterance– Rt: machine response

Source sequence

Target sequence

Page 13: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Illustration

Page 14: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Illustration

Page 15: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Illustration

Page 16: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Formalization• Source-target pair

– {B0R0U1, B1R1}, {B1R1U2, B2R2}, …, {Bt-1Rt-1Ut, BtRt}

• Two stage decoding– Bt=Seq2seq(Bt-1Rt-1Ut)– Rt=Seq2seq(Bt-1Rt-1Ut|Bt, KB search results)

Page 17: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Sequicity Instantiation: A Two-stage CopyNet (TSCP)

• Two-stage copyNet– Encoding: Bt-1Rt-1Ut

– Decoding Bt : copying from Bt-1Rt-1Ut

– Decoding Rt : copying from Bt-1Rt-1Ut and Bt

• Conditioning KB search results for Rt generation– A three-dimension vector [multiple match, single match, no

match]

Page 18: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Optimization• Joint log-likelihood

– Short coming: treating each word equally

– E.g., The closest Italian restaurant is at <addr_slot>

• Reinforcement learning– Action: decoding a word

– State: hidden vectors generated by RNNs

– Reward: decoding a correct placeholder +1, decoding each word -0.1

Page 19: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Experiments: Datasets

Page 20: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Experiments: Evaluation Metrics

• Evaluation metrics– Match Rate

– BLEU

– Success F1

– Training time

Page 21: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Experiment Results

Page 22: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Time Expenses on Belief Trackers

Page 23: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

RL Helps with BLEU and Succ. F1

Page 24: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Removing CopyNets

Page 25: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Discussions: OOV Experiments

Synthesized OOV data: I would like some Chinese food. I would like some Chinese_unk food.

Page 26: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Discussion: Parameter Scales

Page 27: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Discussion: Parameter Scales

Page 28: New Sequicity: Simplifying Task-oriented Dialogue Systems with … · 2020. 5. 6. · An End-to-end Trainable Dialogue System (NDM) (Wen et al., 2017b) Tsung-Hsien Wen, David Vandyke,

Conclusion• Sequicity provides another direction for task-

oriented dialogue systems.• It is more light-weighted, can handle OOV

requests.• It learns dialogue action directly from data with

less human interventions– Requires more training data.

5/22/2018 28