Monte-Carlo Search Algorithms

Chang Liu, Andrew Tremblay

Outline

• Chess and Go
  o Large Game Trees

• Current Monte-Carlo Algorithms

• Current Codebases & Our Additions
  o Fuego
  o Gomba

Chess and Go

Chess
• Long established in AI
• Large Branching Factor
  o Fixed board size (8x8)
• Games converge to win or loss

Go
• Used in AI More Recently
• Very Large Branching Factor
  o Varying Board Size (9x9 to 19x19)
• Games take much longer to converge

Game Trees

• Consist of States and Actions
• Picking an Action leads to a new State
• Find a winning state, choose the action leading to it
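The idea of finding a winning state and choosing the action that leads to it can be sketched on a toy tree. The tree, its state names, and `find_winning_path` are hypothetical and only for illustration; the sketch also ignores the opponent's choices, which a real game search would have to account for.

```python
from typing import Dict, List, Optional

# Hypothetical toy tree: each state maps to {action: next_state}.
TREE: Dict[str, Dict[str, str]] = {
    "root": {"a": "s1", "b": "s2"},
    "s1": {"c": "loss", "d": "draw"},
    "s2": {"e": "win"},
}
WINNING = {"win"}


def find_winning_path(state: str) -> Optional[List[str]]:
    """Depth-first search for a sequence of actions that reaches a winning state."""
    if state in WINNING:
        return []  # already winning, no further action needed
    for action, next_state in TREE.get(state, {}).items():
        path = find_winning_path(next_state)
        if path is not None:
            return [action] + path
    return None  # no winning state reachable from here


print(find_winning_path("root"))
```

Exhaustive search like this is only feasible because the toy tree is tiny.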

Large Game Trees

• Also consist of States and Actions
• How do you find the best action?
• How do you find one close to best?

• Upper Confidence Trees

• “Multi-Armed Bandit”
  o Simulated Regret

• Property of “Upper Confidence Bounds”
  o Balances Exploration Through Tree

• What this means
  o Always exploring the current best move
  o Converges towards optimal choice
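A minimal sketch of the bandit-style selection rule behind Upper Confidence Trees, assuming the standard UCB1 formula (mean reward plus an exploration bonus). The function name `ucb1_select` and the default exploration constant are assumptions for illustration, not taken from Fuego or Gomba.

```python
import math
from typing import List


def ucb1_select(wins: List[float], visits: List[int], c: float = math.sqrt(2)) -> int:
    """Return the index of the child maximizing mean + c * sqrt(ln(N) / n)."""
    total = sum(visits)
    best_index, best_score = -1, float("-inf")
    for i, (w, n) in enumerate(zip(wins, visits)):
        if n == 0:
            return i  # try every move at least once before comparing scores
        score = w / n + c * math.sqrt(math.log(total) / n)
        if score > best_score:
            best_index, best_score = i, score
    return best_index


# A rarely visited move gets a large bonus, so it is picked despite a lower mean.
print(ucb1_select([60.0, 0.0], [100, 1]))
```

The bonus shrinks as a move accumulates visits, so selection gradually concentrates on the moves with the best observed results.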

UCT-RAVE

• Rapid Action Value Estimation (RAVE)
• Extension of basic UCT
• Updates the values across multiple states, rather than maintaining the value on a per-action-state basis.
• AMAF (all moves as first) is a general name for this type of heuristic.
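The AMAF idea can be sketched as follows: after each simulation, every move the player made later in the playout is credited as if it had been played first. The class `RaveNode`, the blending schedule `beta = sqrt(k / (3n + k))`, the constant `k`, and the 0.5 prior for unseen moves are assumptions for illustration (one commonly used schedule), not necessarily what Fuego implements.

```python
import math
from collections import defaultdict


class RaveNode:
    """Per-state statistics with an extra AMAF table (illustrative sketch)."""

    def __init__(self, k: float = 1000.0):
        self.k = k  # assumed equivalence parameter controlling the blend
        self.visits = defaultdict(int)
        self.wins = defaultdict(float)
        self.amaf_visits = defaultdict(int)
        self.amaf_wins = defaultdict(float)

    def update(self, first_move, later_moves, reward):
        """Record one simulation; later_moves are the same player's later moves."""
        self.visits[first_move] += 1
        self.wins[first_move] += reward
        # AMAF: credit each distinct move once, as if it had been played first.
        for move in {first_move, *later_moves}:
            self.amaf_visits[move] += 1
            self.amaf_wins[move] += reward

    def value(self, move) -> float:
        n, an = self.visits[move], self.amaf_visits[move]
        if n == 0 and an == 0:
            return 0.5  # assumed neutral prior for moves never seen
        q = self.wins[move] / n if n else 0.0
        q_amaf = self.amaf_wins[move] / an if an else 0.0
        beta = math.sqrt(self.k / (3 * n + self.k)) if an else 0.0
        return (1 - beta) * q + beta * q_amaf


node = RaveNode()
node.update("A", ["B", "C"], reward=1.0)
print(node.value("B"))  # B was never tried first, yet it already has an estimate
```

With few direct visits the AMAF estimate dominates; as direct visits grow, `beta` falls and the ordinary per-move mean takes over.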

UCT-RAVE

Implementation - Fuego

• Pre-Existing Library of Go Algorithms

• UCT-RAVE already implemented

• Uses Go Text Protocol (GTP) to play against
  o Other Go Programs (GnuGo, MoGo)
  o Humans (via GoGui)

• First program to defeat a 7 Dan Go player in 9x9 Go.
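GTP is a line-based text protocol: a controller sends commands such as `boardsize 9` or `genmove b`, and the engine answers with a line starting with `=` (success) or `?` (failure), followed by a blank line. The helpers below are a hypothetical sketch of that framing only; they are not part of Fuego or GoGui.

```python
from typing import Optional, Tuple


def format_command(name: str, *args, cmd_id: Optional[int] = None) -> str:
    """Build one GTP command line, with an optional numeric id prefix."""
    parts = ([str(cmd_id)] if cmd_id is not None else []) + [name, *map(str, args)]
    return " ".join(parts) + "\n"


def parse_response(text: str) -> Tuple[bool, Optional[int], str]:
    """Split a GTP response into (success, id, payload)."""
    body = text.strip()
    if not body or body[0] not in "=?":
        raise ValueError("not a GTP response")
    ok = body[0] == "="
    rest = body[1:]
    i = 0
    while i < len(rest) and rest[i].isdigit():
        i += 1  # digits directly after '=' or '?' are the echoed command id
    cmd_id = int(rest[:i]) if i else None
    return ok, cmd_id, rest[i:].strip()


print(format_command("genmove", "b", cmd_id=3), end="")
print(parse_response("=3 D4\n\n"))
```

In practice these strings would be written to and read from the engine's standard input and output.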

Fuego - Framework

• Large Codebase

• Only One External Library

• Multi-Threaded

• Open Source

• Extendable

Fuego - Experiments

Disadvantages

• Fuego et al. take a very long time to test.
  o Play against a separate algorithm for many games
  o Change parameters
  o Play again

• Effectiveness of a new algorithm is difficult to prove.
• Parameters are difficult to optimize.

• Cluster Jobs / Parallel Testing shorten the turnaround time for results
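One way to see why so many games are needed: a rough confidence interval on the measured win rate. This sketch assumes independent games and a normal approximation; the function name and the 95% level (`z = 1.96`) are illustrative choices, not the authors' evaluation method.

```python
import math
from typing import Tuple


def win_rate_interval(wins: int, games: int, z: float = 1.96) -> Tuple[float, float]:
    """Approximate confidence interval for a win rate (normal approximation)."""
    p = wins / games
    half_width = z * math.sqrt(p * (1 - p) / games)
    return p - half_width, p + half_width


# The same 55% win rate is inconclusive over 100 games but not over 1000.
print(win_rate_interval(55, 100))
print(win_rate_interval(550, 1000))
```

If the interval still contains 0.5, the match has not shown that either program is stronger.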

There is an alternative.

Artificial Game Tree

• Faster speed
• Better heuristic
• Easily tweaked parameters (branching factor, depth, etc.)

• Developed by Daniel Bjorge and John Schaeffer in 2010
• Mini-max Artificial Game Tree
• Lazy State Expansion
• Fast Action Simulation
• Fast Termination Evaluation
• Fast Heuristic Evaluation
• ... ...
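A minimal sketch of an artificial mini-max game tree with lazy state expansion: children are created only when first requested, and leaf outcomes are derived deterministically from a seed and the path. The class `LazyNode`, its parameters, and the seeding scheme are assumptions for illustration and do not reflect Gomba's actual design.

```python
import random
from typing import Dict, Tuple


class LazyNode:
    """Node of an artificial game tree whose children are built on demand."""

    def __init__(self, path: Tuple[int, ...], branching: int, depth: int, seed: int = 0):
        self.path, self.branching, self.depth, self.seed = path, branching, depth, seed
        self._children: Dict[int, "LazyNode"] = {}

    def is_terminal(self) -> bool:
        return len(self.path) >= self.depth

    def child(self, action: int) -> "LazyNode":
        if self.is_terminal() or not 0 <= action < self.branching:
            raise ValueError("illegal action")
        if action not in self._children:  # lazy expansion happens here
            self._children[action] = LazyNode(
                self.path + (action,), self.branching, self.depth, self.seed
            )
        return self._children[action]

    def terminal_value(self) -> int:
        """Reproducible pseudo-random outcome: 1 = win, 0 = loss for the first player."""
        rng = random.Random(repr((self.seed, self.path)))
        return rng.randint(0, 1)


def minimax(node: LazyNode, maximizing: bool = True) -> int:
    if node.is_terminal():
        return node.terminal_value()
    values = [minimax(node.child(a), not maximizing) for a in range(node.branching)]
    return max(values) if maximizing else min(values)


root = LazyNode((), branching=3, depth=4, seed=1)
print(minimax(root))
```

Because the exact mini-max value of such a tree can be computed, a search algorithm's chosen move can be checked against the true best move without playing full games.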

Problem with Gomba

• AMAF algorithms - UCT RAVE algorithm
  o transposition table
  o global knowledge of the moves

• Structure of Gomba cannot support this type of algorithm

Our improvements

• Modified the tree structure to better support AMAF algorithms

• Add global knowledge of the moves
• Better correlation and consistency among the moves
• Modification to the lazy state expansion

Correlations among the moves

Adjust to the tree structure

Improvements

Correlations

Conclusions & Future Work

• Improved Gomba testing framework
  o Better support of AMAF algorithms (UCT RAVE)

• Implemented RAVE-Pool in Fuego
  o With Variations

Thank you!

Advisors
• Levente Kocsis
• Gábor Sárközy
• Stanley Selkow

Daniel Bjorge & John Schaeffer
MTA-SZTAKI

All SZTAKI Colleagues

Questions?
