CS388: Natural Language Processing Lecture 21:...

CS388:NaturalLanguageProcessingLecture21:Dialogue

GregDurrett

Administrivia‣ Proposalduetodayat5pm

‣ ProposalsreturnedASAP,thenProject2

Recall:ExtracGveSummarizaGon‣ Countnumberofdocumentseachbigramoccursintomeasureimportance

GillickandFavre(2009)

score(massiveearthquake)=3score(Iraqicapital)=1score(sixkilled)=2score(magnitude7.3)=2

‣ ILPformulaGon:candsareindicatorvariablesindexedoverconcepts(bigrams)andsentences,respecGvely

“setcito1iffsomesentencethatcontainsitisincluded”

sumofincludedsentences’lengthscan’texceedL

‣ Findsummarythatmaximizesthescoreofbigramsitcovers

Recall:Compression

‣ Nowsjvariablesarenodesorsetsofnodesintheparsetree

Atleastsixpeoplewerekilledandmanyothersinjured

‣ Newconstraint:s2≤s1“s1isaprerequisitefors2”

Recall:Pointer-Generator

Seeetal.(2017)

Recall:Seq2seqSummarizaGon

Seeetal.(2017)

‣ SoluGons:copymechanism,coverage,justlikeinMT…

‣ ThingsmightsGllgowrong,nowayofprevenGngthis…

ThisLecture‣ Chatbotdialoguesystems

‣ Task-orienteddialogue

‣ OtherdialogueapplicaGons

Chatbots

TuringTest(1950)‣ ImitaGongame:AandBarelockedinroomsandanswerC’squesGonsviatypewriter.BotharetryingtoactlikeB

trainedjudgeC trainedjudge

OriginalInterpretaGon: StandardInterpretaGon:

‣ Thetestisnot“doesthiscomputerseemhuman-liketorandompeoplewithawebbrowser?”

ELIZA‣ Created1964-1966atMIT,heavilyscripted

‣ DOCTORscriptwasmostsuccessful:repeatsuser’sinput,asksinanequesGons

Weizenbaum(1966)

(.*)you(.*)me

WhydoyouthinkI$2you?

‣ IdenGfykeyword,idenGfycontext,applytransformaGonrule

‣ Verylijleneedtogeneratenewcontent,butcanonlyhaveonetypeofconversaGon

My(.)(.*)

Whatelsecomestomindwhenyou thinkofyour$1?

Cleverbot‣ Carpenter(1986),onlinesystembuiltin2006

‣ “Nearestneighbors”:humansaysstatementA,findahumanresponseinhuman-humanorhuman-computerchatstostatementA,repeatthat

User:Hitherecleverbot.Cleverbot:Howareyou?User:Doingokay.whatplansdoyouhavetoday?Cleverbot:Idorevision.User:Whatareyourevisingfor?Cleverbot:Mathshistory.User:Howoldareyou?Cleverbot:That'snoneofyourbusiness.User:Whatismathshistory?Cleverbot:Idon’tknow,farmingmaybe

‣ Canooengivesensibleanswers,butthebotdoesn’treallyimposehigh-level discoursestructure

Data-DrivenApproaches‣ CantreatasamachinetranslaGonproblem:“translate”fromcurrentujerancetonextone

‣ Filterthedata,usestaGsGcalmeasurestopruneextractedphrasestogetbejerperformance

Rijeretal.(2011)

Data-DrivenApproaches

Rijeretal.(2011)

Seq2seqmodels

Whatareyoudoing

am going home [STOP]

‣ JustlikeconvenGonalMT,cantrainseq2seqmodelsforthistask

‣Whymightthismodelperformpoorly?Whatmightitbebadat?

‣ Hardtoevaluate:

LackofDiversity

Lietal.(2016)

‣ Trainingtomaximizelikelihoodgivesasystemthatpreferscommonresponses:

LackofDiversity

Lietal.(2016)

‣ SoluGon:mutualinformaGoncriterion;responseRshouldbepredicGveofuserujeranceUaswell

‣MutualinformaGon:

‣ StandardcondiGonallikelihood: logP (R|U)

P (R,U)

P (R)P (U)

= logP (R|U)� logP (R)

‣ logP(R)canreflectprobabiliGesunderalanguagemodel

LackofDiversity

Lietal.(2016)

‣ OpenSubGtlesdata

Futureofchatbots

‣ XiaoIce:MicrosoochatbotinChinese,20Musers,averageuserinteracts60Gmes/month

‣ Peopledoseemtoliketalkingtothem…?

‣ HowdeepcanaconversaGonbewithoutmoresemanGcgrounding?Basicfactsaren’tevenconsistent…

‣ Canforcechatbotstogiveconsistentanswers,butsGllprobablynotveryinteresGng

Lietal.(2016)Persona…

Task-OrientedDialogue

Google,what’sthemostvaluable

Americancompany?

WhoisitsCEO?

TimCook

‣ QuesGonanswering/search:

Siri,findmeagoodsushi restaurantinChelsea

SushiSekiChelseaisasushirestaurantinChelseawith4.4stars

onGoogle

‣ Personalassistants/APIfront-ends:

Howexpensiveisit?

Entreesarearound$30each

Findmesomethingcheaper

HeyAlexa,whyisn’tmyAmazon orderhere?

Letmeretrieveyourorder. Yourorderwasscheduledtoarrive

at4pmtoday.

‣ Personalassistants/APIfront-ends:

Itnevercame

Okay,Icanputyouthroughtocustomerservice.

AirTravelInformaGonService(ATIS)‣ Givenanujerance,predictadomain-specificsemanGcinterpretaGon

DARPA(early1990s),FigurefromTuretal.(2010)

‣ CanformulateassemanGcparsing,butsimpleslot-fillingsoluGons(classifiers)workwelltoo

FullDialogueTask‣ Parsing/languageunderstandingisjustonepieceofasystem

Youngetal.(2013)

‣ Dialoguestate:reflectsanyinformaGonabouttheconversaGon(e.g.,searchhistory)

‣ Userujerance->updatedialoguestate->takeacGon(e.g.,querytherestaurantdatabase)->saysomething

‣Muchmorecomplexthanchatbots!

FullDialogueTask

FindmeagoodsushirestaurantinChelsea

restaurant_type <- sushi

location <- Chelsea

SushiSekiChelseaisasushirestaurantinChelseawith4.4starsonGoogle

curr_result <- execute_search()

Howexpensiveisit?get_value(cost, curr_result)

POMDP-basedDialogueSystems

Youngetal.(2013)

‣ Dialoguemodel:canlooklikeaparseroranykindofencodermodel

‣ POMDP:useristhe“environment,”anujeranceisanoisysignalofstate

‣ Generator:usetemplatesorseq2seqmodel

‣Wheredorewardscomefrom?

RewardforcompleGngtask?

location <- Chelsea

make_reservation(curr_result)

Howexpensiveisit?

…OkaymakemeareservaGon!

Veryindirectsignal ofwhatshould happenuphere

Usergivesreward?

location <- Chelsea

Howexpensiveisit?get_value(cost, curr_result)

Howdoestheuserknowtherightsearchhappened?

Wizard-of-Oz

Kelley(early1980s),FordandSmith(1982)

‣ LearningfromdemonstraGons:“wizard”pullstheleversandmakesthedialoguesystemupdateitsstateandtakeacGons

FullDialogueTaskFindmeagoodsushirestaurantinChelsea

location <- Chelsea

curr_result <- execute_search(){wizardenters

SushiSekiChelseaisasushirestaurantinChelseawith4.4starsonGoogle{wizardtypesthis

outorinvokes templates

‣Wizardcanbeatrainedexpertandknowexactlywhatthedialoguesystemsissupposedtodo

LearningfromStaGcTraces

Bordesetal.(2017)

‣ Usingeitherwizard-of-OzorotherannotaGons,cancollectstaGctracesandtrainfromthese

FullDialogueTaskFindmeagoodsushirestaurantinChelsea

location <- Chelsea

‣ Useraskedfora“good”restaurant—doesthatmeanweshouldfilterbystarraGng?Whatdoes“good”mean?

‣ HardtochangesystembehavioriftrainingfromstaGctraces,especiallyifsystemcapabiliGesordesiredbehaviorchange

stars <- 4+

Goal-orientedDialogue

‣ BigCompanies:AppleSiri(VocalIQ),GoogleAllo,AmazonAlexa,MicrosooCortana,FacebookM,SamsungBixby,TencentWeChat

‣ Startups:

‣ Lotsofcoolworkthat’snotpublicyet

‣ Tonsofindustryinterest!

OtherDialogueApplicaGons

Search/QAasDialogue

‣ “HasChrisPrajwonanOscar?”/“HashewonanOscar”

QAasDialogue‣ DialogueisaverynaturalwaytofindinformaGonfromasearchengineoraQAsystem

Iyyeretal.(2017)

‣ QAishardenoughonitsown

‣ Usersmovethegoalposts

‣ Challenges:

QAasDialogue‣ UWQuACdataset:QuesGonAnsweringinContext

Choietal.(2018)

SearchasDialogue

‣ Googlecandealwithmisspellings,somoremisspellingshappen—Googlehastodomore!

DialogueMissionCreep

System

Erroranalysis

Bejermodel

‣ FixeddistribuGon(e.g.,naturallanguagesentences),errorrate->0

‣ Errorrate->???;“missioncreep”fromHCIelement

HarderData

MostNLPtasks

System

Erroranalysis

Bejermodel

Dialogue/Search/QA

DialogueMissionCreep

‣ Highvisibility—yourproducthastoworkreallywell!

Takeaways‣ Somedecentchatbots,butunclearhowtomakethesemoresophisGcatedthantheyarerightnow

‣ Task-orienteddialoguesystemsaregrowinginscopeandcomplexity—reallyexciGngsystemsontheway

‣Moreandmoreproblemsarebeingformulatedasdialogue—interesGngapplicaGonsbutchallengingtogetworkingwell

CS388: Natural Language Processing Lecture 21:...

Documents

Transcript of CS388: Natural Language Processing Lecture 21:...

A Conversation With Cleverbot

geometerjustin.comgeometerjustin.com/teaching/eg/fa2018/syllabus/M621_text.pdf · Contents Contents i Preface 1 1 Euclid and Hilbert 3 1.1 Euclid - The Formal Beginning . . . . .

Grandezas e unidades - qa.ff.up.ptqa.ff.up.pt/fa2018/pdf/fa-t02.pdf · –Comprimento, massa , tempo, volume, força, velocidade, etc. Medidas ... Sistemas de unidades Quando se define

Administrivia CS388: Natural Language Processing Lecture ...gdurrett/courses/fa2019/lectures/lec25-4pp.pdf · ‣Many languages used all over the world have much richer morphology

CS388: Natural Language Processing Lecture 26: Wrapup + Ethics

Lecture 6: HMM algorithms - University Of Illinois › cs447 › fa2018 › Slides › ...CS498JH: Introduction to NLP Dynamic programming algorithms for HMMs I. Likelihood of the

Cats - College of Charlestonmath.cofc.edu/exams/math101/m101-fa2018-finalanswers.pdf · [6] Solve each inequality and write your answer using interval notation.12. 14x < 5 12. 31,

Xkcd.com/208 Regex comic Cleverbot video .

CS388: Natural Language Processing

Administrivia CS388: Natural Language Processing Lecture ...

CS388: Natural Language Processing Lecture 24 ... · This Lecture ‣ Morphology: eﬀects and challenges ‣ Cross-lingual tagging and parsing ‣ Morphology tasks: analysis, inﬂec9on,

CS388: Natural Language Processing Lecture 16: Informa;on ...

CS388: Natural Language Processinggdurrett/courses/fa2019/...CS388: Natural Language Processing Greg Durre8 Lecture 5: CRFs Administrivia ‣ Project 1 is out, sample writeups on website

CS388: Natural Language Processing Lecture 6: Neural Networks · 2019. 9. 17. · Recall: CRFs ‣ Naive Bayes : logisGc regression :: HMMs : CRFs local vs. global normalizaGon

CS388: Natural Language Processing Lecture 5: Named En ...gdurrett/courses/sp2021/...CS388: Natural Language Processing Greg Durre8 Lecture 5: Named En=ty Recogni=on, CRFs Administrivia

Distributional Semantics - Department of Computer Sciencemooney/cs388/slides/dist-sem-intro-NLP... · same as Principal Component Analysis, PCA) I Some alternatives: Independent Component

FISICA APLICADA - qa.ff.up.ptqa.ff.up.pt/fa2018/fa-t/lab.pdf · FISICA APLICADA Exemplificação dos cálculos dos trabalhos laboratoriais Obs: Os valores utilizados nos cálculos

CS388: Natural Language Processing Lecture 10: Syntax Igdurrett/courses/fa2018/lectures/lec10-4pp.pdf‣Lexicon consists of “preterminals” (POS tags) rewriLng as terminals (words)

Administrivia CS388: Natural Language Processing Lecture 7 ...gdurrett/courses/fa2019/lectures/lec7-4pp.pdf · CS388: Natural Language Processing Greg Durre8 Lecture 7: Word Embeddings

CS388: Natural Language Processing Lecture 14: Seman