Curious Characters in Multiuser Games: A Study in Motivated Reinforcement Learning for Creative Behavior Policies*
Mary Lou Maher, University of Sydney
AAAI AI and Fun Workshop, July 2010
* Based on Merrick, K. and Maher, M.L. (2009) Motivated Reinforcement Learning: Curious Characters for Multiuser Games, Springer.
Outline
- Curiosity and Fun
- Motivation
- Motivated Reinforcement Learning
- An Agent Model of a Curious Character
- Evaluation of Behavior Policies
Can AI model Fun?
Claim: An agent motivated by curiosity to learn patterns is a model of fun.
Games try to achieve flow: a function of the player's skill and performance.
J. Chen, Flow in games (and everything else). Communications of the ACM 50(4):31-34, 2007
Why Motivated Reinforcement Learning?
- More efficient learning: complement the external reward with an internal reward (see the sketch after this list)
- External reward not known at design time: design tasks; real-world scenarios such as robotics; virtual-world scenarios such as NPCs in computer games
- More autonomy in determining learning tasks, for robotics and for NPCs in computer games
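Below is a minimal sketch of the idea in Python: a tabular Q-learning update driven by a reward that combines an external reward with an internal, curiosity-based reward. The additive combination, the function name and the toy states are illustrative assumptions, not the formulation from the book.

def motivated_q_update(Q, state, action, next_state, actions,
                       external_reward, internal_reward,
                       alpha=0.1, gamma=0.9):
    """One Q-learning step using a combined external + internal reward."""
    reward = external_reward + internal_reward        # assumed additive combination
    best_next = max(Q.get((next_state, a), 0.0) for a in actions)
    old = Q.get((state, action), 0.0)
    Q[(state, action)] = old + alpha * (reward + gamma * best_next - old)

# When no external reward is available at design time, the internal reward
# alone can still drive learning:
Q = {}
motivated_q_update(Q, "forest", "chop", "forest_with_timber",
                   actions=["chop", "move"], external_reward=0.0,
                   internal_reward=0.8)
print(Q)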
Models of Motivation
- Cognitive: interest, competency, challenge
- Biological: stasis variables such as energy and blood pressure
- Social: conformity, peer pressure
MRL Agent Model
Motivation as Interesting Events
[Figure: Wundt curve. Interest/pleasantness (from -1 to 1) as a function of stimulus intensity/novelty (from 0 to 2), formed by combining a positive feedback curve F+ with a negative feedback curve F-.]
An event is a change in observations: O(t) - O(t') = (Δ(o1(t), o1(t')), Δ(o2(t), o2(t')), ..., Δ(oL(t), oL(t')), ...)
D.E. Berlyne, Exploration and Curiosity, Science 153:24-33, 1966
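A hedged Python sketch of the ideas on this slide: an event as an element-wise change in observations, and a Wundt-curve-style interest function that peaks at moderate novelty by subtracting a negative feedback curve F- from a positive feedback curve F+. The logistic form and all parameter values are illustrative assumptions, not the exact functions used in the study.

import math

def event(obs_before, obs_after):
    """An event as a change in observations: element-wise differences."""
    return [o2 - o1 for o1, o2 in zip(obs_before, obs_after)]

def interest(novelty, f_max=1.0, rho=10.0, turn_pos=0.5, turn_neg=1.5):
    f_pos = f_max / (1.0 + math.exp(-rho * (novelty - turn_pos)))   # F+
    f_neg = f_max / (1.0 + math.exp(-rho * (novelty - turn_neg)))   # F-
    return f_pos - f_neg

# Very familiar and overwhelmingly novel events are uninteresting;
# moderately novel events are the most interesting.
for n in (0.0, 1.0, 2.0):
    print(n, round(interest(n), 3))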
Sensed States: Context-Free Grammar (CFG)
CFG = (VS, ΓS, ΨS, S) where:
- VS is a set of variables or syntactic categories,
- ΓS is a finite set of terminals such that VS ∩ ΓS = {},
- ΨS is a set of productions V -> v, where V is a variable and v is a string of terminals and variables,
- S is the start symbol.
Thus, the general form of a sensed state is:
S -> <sensations>
<sensations> -> <PiSensations><sensations> | ε
<PiSensations> -> <sL><PiSensations> | ε
<sL> -> <number> | <string>
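As a small illustration (an assumed Python representation, not the book's code), the grammar allows any number of sensations, so a sensed state can be held as a variable-length list whose elements are numbers or strings, matching the <sL> -> <number> | <string> production. Agents in different situations can therefore sense different numbers of variables.

def is_valid_sensed_state(state):
    """Check each sensation is a terminal of the grammar: a number or a string."""
    return all(isinstance(s, (int, float, str)) for s in state)

sensed_state_a = [3, "axe"]                     # few sensations
sensed_state_b = [1, "pick", "forge", "iron"]   # richer situation, more sensations

for state in (sensed_state_a, sensed_state_b):
    print(len(state), "sensations, valid:", is_valid_sensed_state(state))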
MRL for Non-Player Characters
[Figure: example game world containing an agent, four locations (a mine, a forest, a smithy and a carpenter's shop) and objects such as a pick, a forge, a lathe and an axe.]
W -> <agent><environment>
<agent> -> <location><inventory>
<location> -> <mine> | <smithy> | <forest> | <carpenter>
<mine> -> 1
<smithy> -> 2
<forest> -> 3
<carpenter> -> 4
<inventory> -> <objects>
<environment> -> <objects>
<objects> -> <object><objects> | <object>
<object> -> <pick> | <forge> | <iron> | <weapons> | <axe> | <lathe> | <timber> | <furniture>
<pick> -> 1
<forge> -> 1
<iron> -> 1
<weapons> -> 1
<axe> -> 1
<lathe> -> 1
<timber> -> 1
<furniture> -> 1
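For example, one sensed state this grammar can generate describes an agent at the mine holding a pick, with a forge and some iron in the environment. The nested-dictionary Python form below is purely for readability and is an assumption, not the representation used in the game.

world_state = {
    "agent": {
        "location": 1,               # <mine> -> 1
        "inventory": {"pick": 1},    # <pick> -> 1
    },
    "environment": {"forge": 1, "iron": 1},
}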
Habituated Self-Organizing Map
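In this agent model the habituated self-organizing map acts as a novelty detector: an SOM clusters observed events, each neuron carries a habituation value that decays whenever that neuron wins, and the novelty of an event is the winning neuron's current habituation. The Python sketch below follows a common Stanley-style habituation update; the constants, class structure and the omission of a neighbourhood update are illustrative assumptions rather than the exact configuration used in the study.

import random

class HSOM:
    def __init__(self, n_neurons, dim, lr=0.5, tau=0.33, alpha=1.05):
        self.weights = [[random.random() for _ in range(dim)]
                        for _ in range(n_neurons)]
        self.habituation = [1.0] * n_neurons   # 1.0 means fully novel
        self.lr, self.tau, self.alpha = lr, tau, alpha

    def _winner(self, event):
        # Index of the neuron closest to the event (squared Euclidean distance).
        dists = [sum((w - e) ** 2 for w, e in zip(neuron, event))
                 for neuron in self.weights]
        return dists.index(min(dists))

    def novelty(self, event):
        """Cluster the event, habituate the winning neuron, return its novelty."""
        k = self._winner(event)
        # SOM update: pull the winner towards the event (no neighbourhood here).
        self.weights[k] = [w + self.lr * (e - w)
                           for w, e in zip(self.weights[k], event)]
        n = self.habituation[k]
        # Habituation: repeated wins drive the value towards a low equilibrium.
        self.habituation[k] += self.tau * (self.alpha * (1.0 - n) - 1.0)
        return n

hsom = HSOM(n_neurons=4, dim=3)
repeated_event = [1.0, 0.0, 1.0]
# Novelty of the same event drops as the map habituates to it.
print([round(hsom.novelty(repeated_event), 3) for _ in range(5)])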
Behavioral Variety
Behavioural variety measures the number of events for which a near-optimal policy is learned.
We characterise the level of optimality of a policy learned to achieve the event E(t) in terms of its structural stability.
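A hedged Python sketch of this measure: count the events whose learned policy has stopped changing over a recent window of snapshots, i.e. has become structurally stable. The window length and the dictionary-of-policy-snapshots input format are illustrative assumptions.

def behavioural_variety(policy_history, window=3):
    """policy_history: {event: [policy_at_t1, policy_at_t2, ...]},
    where each policy is a hashable snapshot (e.g. a tuple of greedy actions)."""
    variety = 0
    for event, snapshots in policy_history.items():
        recent = snapshots[-window:]
        if len(recent) == window and len(set(recent)) == 1:
            variety += 1   # policy for this event is stable, so count it as learned
    return variety

history = {
    "make_weapon": [("mine", "smith"), ("mine", "smith"), ("mine", "smith")],
    "make_chair":  [("chop", "lathe"), ("lathe", "chop"), ("chop", "lathe")],
}
print(behavioural_variety(history))   # 1: only the weapon-making policy is stable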
[Figure: behavioural variety over time. Variety (0 to 20) plotted against time (0 to 20,000 time steps).]
Behavioral Complexity
The complexity of a policy can be measured by averaging the mean number of actions ā_E(t) required to repeat E(t) at any time when the current behaviour is stable.
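A hedged Python sketch of this measure: for each event whose behaviour is stable, take the mean number of actions needed to repeat the event, then average across events. The input format of recorded run lengths is an illustrative assumption.

def behavioural_complexity(actions_to_repeat):
    """actions_to_repeat: {event: [n_actions_run1, n_actions_run2, ...]},
    recorded only while the behaviour for that event is stable."""
    if not actions_to_repeat:
        return 0.0
    means = [sum(runs) / len(runs) for runs in actions_to_repeat.values()]
    return sum(means) / len(means)

runs = {"make_weapon": [4, 4, 5], "make_furniture": [6, 7]}
print(round(behavioural_complexity(runs), 2))   # average policy length across events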
[Figure: behavioural complexity (0 to 8) for each learned behaviour, by behaviour number (0 to 20).]
Research Directions
- Scalability and dynamics: different RL approaches, such as decision trees and neural-network function approximation
- Motivation functions: competence, optimal challenge, social models
Relevance to AI and Fun
Is it more fun to play with a curious NPC?
Can a curious agent play a game to test how fun a game is?