
CS 4100/5100: Foundations of AI
Adversarial Search (i.e. game playing)

Instructor: Rob Platt (r.platt@neu.edu)

College of Computer and Information Science, Northeastern University

September 5, 2013

Games

Kinds of games:

- deterministic vs stochastic

- fully observable vs partially observable

- most of the games we consider are zero-sum

Formally, a game is defined by:

- initial state

- transition function and description of feasible actions

- terminal test and utility function
- description of how players take turns

  - this is the only thing specific to games

Game tree

The only difference between game search and regular search is that you need to take into account the actions of an opponent. How to do that?

- assume in advance that the opponent always makes the best possible move available to him/her.

- Both players are assumed to play optimally. An optimal strategy is as good as you can do assuming you are playing an infallible opponent.

- This is basically the same as AND-OR search. Why?

Minimax search

Minimax search is AND-OR search for game trees.

- recursively explore the tree (DFS)
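The recursion can be written in a few lines. The following is a sketch, not from the slides: the nested-tuple encoding of the game tree is an assumption made for illustration.

```python
# Minimax by recursive DFS (a sketch; the nested-tuple encoding of
# the game tree is an assumption, not from the slides). A leaf is a
# number (the terminal utility); an internal node is a tuple of
# children. Levels alternate MAX/MIN, with MAX at the root.

def minimax(node, maximizing=True):
    if not isinstance(node, tuple):   # terminal test
        return node                   # utility
    vals = [minimax(child, not maximizing) for child in node]
    return max(vals) if maximizing else min(vals)

# A 2-ply example: the MIN values of the three subtrees are 3, 2,
# and 2, so the minimax value at the MAX root is 3.
tree = ((3, 12, 8), (2, 4, 6), (14, 5, 2))
```

Note that the recursion expands the whole tree; nothing is shared or pruned yet.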


Properties of minimax

- optimal

- complete

- time complexity?

- space complexity?

Chess: b ≈ 35, m ≈ 100.

Minimax example

There are six spaces on a board that must be filled in order. Player 1 (MAX) uses x, and Player 2 (MIN) uses o. Each player can place at most 3 symbols per turn. Whoever places their symbol in the last slot on the board wins. Draw the minimax search tree.
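One way to check the tree you drew is to compute the minimax value directly: since symbols are placed in order, a state is just the number of empty spaces left. The code below is a sketch for checking your work, not part of the slides.

```python
# The exercise's game reduced to its essentials: `spaces` empty
# slots remain, a move fills 1-3 of them, and whoever fills the
# last slot wins. Utility is +1 if MAX (Player 1) wins, -1 if MIN
# wins. (A sketch to check the hand-drawn tree, not from the slides.)

def nim_minimax(spaces, max_to_move=True):
    if spaces == 0:
        # The player to move faces a full board, so the *other*
        # player filled the last slot and won.
        return -1 if max_to_move else +1
    vals = [nim_minimax(spaces - k, not max_to_move)
            for k in (1, 2, 3) if k <= spaces]
    return max(vals) if max_to_move else min(vals)
```

With six spaces the value at the root is +1: Player 1 can win by placing two symbols, leaving four spaces, and then mirroring so the opponent always faces a multiple of four.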

Alpha-beta search

Idea: prune parts of the tree that you know won't change your strategy.

A player is in the process of unrolling the search tree in order to decide what to do...

- suppose that the best move that the player has found so far has value v

- suppose that you already know that the node you are currently expanding has a maximum value of w < v (how can this happen?)

- then, you don't need to explore that node any more and can move on to the next one...
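The bullets above can be sketched in code. This is a sketch over a nested-tuple game tree (an assumed encoding: leaves are utilities, tuples are internal nodes), not a definitive implementation.

```python
import math

# Alpha-beta pruning (a sketch; the nested-tuple tree encoding is
# an assumption). alpha = best value MAX can already guarantee on
# the current path; beta = best value MIN can already guarantee.
# Once a node's value provably falls outside (alpha, beta), its
# remaining children cannot change the decision and are skipped.

def alphabeta(node, maximizing=True, alpha=-math.inf, beta=math.inf):
    if not isinstance(node, tuple):
        return node
    if maximizing:
        v = -math.inf
        for child in node:
            v = max(v, alphabeta(child, False, alpha, beta))
            if v >= beta:        # MIN above would never let play reach here
                return v         # prune the remaining children
            alpha = max(alpha, v)
        return v
    else:
        v = math.inf
        for child in node:
            v = min(v, alphabeta(child, True, alpha, beta))
            if v <= alpha:       # MAX above already has something better
                return v
            beta = min(beta, v)
        return v
```

Alpha-beta returns the same value as plain minimax; only the amount of work changes.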


Move ordering in Alpha-beta search

Notice that the effectiveness of AB depends on the order in which states are examined.

- minimax search complexity: O(b^m)

- AB search w/ random move ordering: O(b^(0.75m))

- AB search w/ "optimal" move ordering: O(b^(m/2)) (killer move heuristic)

But, how do you get a good move ordering?

- a heuristic

- iterative deepening...

Avoiding repeated search:

- transposition table: a hash table that stores the values for previously explored states.
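In Python, a transposition table can be as simple as memoizing the search on the state. This is a sketch: `functools.lru_cache` stands in for the hash table, and the last-slot-wins counting game (fill 1-3 of the remaining spaces; whoever fills the last one wins) is an assumed toy example.

```python
from functools import lru_cache

# A transposition table is a hash table keyed on the state; here
# lru_cache plays that role (a sketch, assuming states are
# hashable). Game: `spaces` slots remain, a move fills 1-3 of
# them, whoever fills the last slot wins (+1 for MAX, -1 for MIN).

@lru_cache(maxsize=None)
def value(spaces, max_to_move=True):
    if spaces == 0:
        return -1 if max_to_move else +1
    vals = [value(spaces - k, not max_to_move)
            for k in (1, 2, 3) if k <= spaces]
    return max(vals) if max_to_move else min(vals)
```

Without the table the tree has roughly 3^m nodes; with it, each (spaces, player) pair is evaluated once, so even value(100) returns instantly.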

Doing better

Usually, AB isn't enough. AB is complete, but completing the search all the way to the terminal states often just isn't feasible.

- alternative: cut off search early w/ an approximated value for that state.

Evaluation function: an estimate of the expected utility of a given state.

- for example, in Chess: assign each board position an approximate value in terms of the number of pawns, bishops, knights, etc.

- cut off search after a certain depth and just use the evaluation function
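The Chess example above might look like the following sketch. The conventional piece values and the board-as-string encoding (upper case for MAX's pieces) are assumptions made for illustration.

```python
# A material-count evaluation for Chess (a sketch; the piece
# values are the conventional ones, and the board-as-string
# encoding, upper case = MAX's pieces, is an assumption).

PIECE_VALUE = {'p': 1, 'n': 3, 'b': 3, 'r': 5, 'q': 9, 'k': 0}

def evaluate(board):
    """Estimated utility of `board` from MAX's point of view."""
    score = 0
    for ch in board:
        if ch.lower() in PIECE_VALUE:
            score += PIECE_VALUE[ch.lower()] * (1 if ch.isupper() else -1)
    return score
```

In a depth-limited search, the recursion simply returns `evaluate(state)` at depth 0 instead of expanding further.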

Doing better (specifically)

But how do you decide how to set the evaluation function?

Think of it as dividing up the state space into equivalence classes:

- all board states w/ 5 pawns, 2 bishops, etc. belong in a single equiv class

- the value of each equiv class is equal to the probability of winning from that state (i.e. the expected value in general).

- but, how do you calculate that? learn from experience?

Cutting off search

A simple way to cut off search is to use iterative deepening and just return the best current answer when the time's up.

- ID also helps w/ move ordering

Cutting off search

Don't want to cut off search prematurely. Sometimes a given state is likely to suffer a large change in value in the near future...

- only apply cutoff to states that are quiescent

Forward pruning

Look at the evaluation function before expanding nodes. Just don't expand low-scoring nodes:

- for example: on each move, only do search on the top n moves.

- alternative (ProbCut): estimate how likely a given node is to be outside the (α, β) window based on past experience, without searching the entire tree.
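The top-n variant can be sketched as a beam over the same kind of nested-tuple game tree used earlier in these notes. Both the tree encoding and the crude average-of-leaves heuristic are assumptions for illustration.

```python
# Top-n forward pruning over a nested-tuple game tree (a sketch;
# the encoding and the average-of-leaves heuristic are
# assumptions). Only the n children ranked most promising by the
# heuristic are actually searched; the rest are discarded unseen.

def leaves(node):
    if not isinstance(node, tuple):
        yield node
    else:
        for child in node:
            yield from leaves(child)

def estimate(node):
    vals = list(leaves(node))
    return sum(vals) / len(vals)   # crude stand-in for an eval function

def beam_minimax(node, maximizing=True, n=2):
    if not isinstance(node, tuple):
        return node
    ranked = sorted(node, key=estimate, reverse=maximizing)[:n]
    vals = [beam_minimax(child, not maximizing, n) for child in ranked]
    return max(vals) if maximizing else min(vals)
```

Unlike alpha-beta, this can prune the best move: discarded children are never examined, so the answer is only as good as the heuristic's ranking.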

Minimax variations

Sometimes, strict minimax seems to be the wrong thing to do:

- alternative is to associate the evaluation function w/ some measure of variance. Then, calculate the probability that the node is actually better.