SteerBench: a benchmark suite for evaluating steering behaviors Authors: Singh, Kapadia, Faloutsos, Reinman Presented by: Jessica Siewert
Date posted: 19-Dec-2015

Page 1

SteerBench: a benchmark suite for evaluating steering behaviors

Authors: Singh, Kapadia, Faloutsos, Reinman

Presented by: Jessica Siewert

Page 2

Content of presentation

• Introduction
• Previous work
• The Method
• Assessment

Page 3

Introduction – Context and motivation

– Steering of agents
– Objective comparison
– Standard?
– Test cases and scoring, user evaluation
– Metric scoring
– Demonstration

Page 4

Introduction – Previous work

There is not really anything like it yet (Nov ‘08)

Page 5

Introduction – Promises

• Evaluate objectively
• Help researchers
• Working towards a standard for evaluation
• Take into account:
  – Cognitive decisions
  – Situation-specific aspects

Page 6

The test cases

– Simple validation scenarios
– Basic one-on-one interactions
– Agent interactions including obstacles
– Group interactions
– Large-scale scenarios

Page 7

The user’s opinion

• Rank on overall score across test cases (comparing)
• Rank algorithms based on
  – a single case, or
  – one agent’s behavior
• Pass/fail
• Visually inspect results
• Examine detailed metrics of the performance
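The two ranking modes above can be sketched in a few lines. This is only an illustration, not the SteerBench implementation: the `scores` layout, the function names, and the `lower_is_better` flag are all assumptions, since the slide does not say how scores are stored or which direction is better.

```python
from typing import Dict, List

# Hypothetical layout: algorithm name -> {test case name -> score}.
Scores = Dict[str, Dict[str, float]]

def rank_overall(scores: Scores, lower_is_better: bool = True) -> List[str]:
    """Rank algorithms by the sum of their scores across all test cases."""
    totals = {alg: sum(cases.values()) for alg, cases in scores.items()}
    return sorted(totals, key=totals.get, reverse=not lower_is_better)

def rank_single_case(scores: Scores, case: str,
                     lower_is_better: bool = True) -> List[str]:
    """Rank algorithms by their score on one named test case."""
    return sorted(scores, key=lambda alg: scores[alg][case],
                  reverse=not lower_is_better)

# Made-up numbers for two hypothetical algorithms.
example = {
    "A": {"crossing": 3.0, "hallway": 9.0},
    "B": {"crossing": 5.0, "hallway": 4.0},
}
print(rank_overall(example))                  # ranking over both cases
print(rank_single_case(example, "crossing"))  # ranking on one case only
```

With these made-up numbers the overall ranking and the single-case ranking disagree, which is why the choice of ranking mode matters.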

Page 8

The metric

• Number of collisions
• Time efficiency
• Effort efficiency
• Penalties?
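To make the three metrics concrete, here is a minimal sketch of how they might be combined into one number. Everything specific in it is an assumption for illustration: the `AgentRun` record, measuring time as seconds to reach the goal, approximating effort as squared speed summed over time steps, and the weights (the slide itself leaves the penalties open).

```python
from dataclasses import dataclass
from typing import List

@dataclass
class AgentRun:
    """Hypothetical per-agent record of one simulated test case."""
    collisions: int        # number of collision events
    time_to_goal: float    # seconds until the agent reached its goal
    speeds: List[float]    # speed sampled at each time step
    dt: float = 0.1        # assumed time-step length in seconds

def effort(run: AgentRun) -> float:
    """Assumed effort proxy: squared speed accumulated over the run."""
    return sum(v * v for v in run.speeds) * run.dt

def score(run: AgentRun, w_collision: float = 50.0,
          w_time: float = 1.0, w_effort: float = 1.0) -> float:
    """Weighted sum of the three metrics; lower is better in this sketch.

    The weights are placeholders, not values taken from SteerBench.
    """
    return (w_collision * run.collisions
            + w_time * run.time_to_goal
            + w_effort * effort(run))

run = AgentRun(collisions=1, time_to_goal=10.0, speeds=[1.0, 2.0], dt=0.5)
print(score(run))
```

A large collision weight acts as the penalty: an agent cannot buy a better time or effort score by walking through others.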

Page 9

Movies…

Page 10

Developments since then

• Ioannis Karamouzas, Peter Heil, Pascal Beek, Mark H. Overmars, A Predictive Collision Avoidance Model for Pedestrian Simulation, Proceedings of the 2nd International Workshop on Motion in Games, November 21-24, 2009, Zeist, The Netherlands

• Shawn Singh, Mubbasir Kapadia, Billy Hewlett, Glenn Reinman, Petros Faloutsos, A modular framework for adaptive agent-based steering, Symposium on Interactive 3D Graphics and Games, February 18-20, 2011, San Francisco, California

• Suiping Zhou, Dan Chen, Wentong Cai, Linbo Luo, Malcolm Yoke Hean Low, Feng Tian, Victor Su-Han Tay, Darren Wee Sze Ong, Benjamin D. Hamilton, Crowd modeling and simulation technologies, ACM Transactions on Modeling and Computer Simulation (TOMACS), v.20 n.4, p.1-35, October 2010

Page 11

Experiments – Claim recall

• Evaluate objectively
• Help researchers
• Working towards a standard for evaluation

Page 12

Assessment – good things

• All the measured variables seem logical (Too?)
• Extensive variable set, with option to expand
• Customized evaluation
• Cheating not allowed
  – collision penalties
  – fail constraint
  – goal constraint
• Layered set of test cases

Page 13

Assessment

• The measurements all seem to be approximately the same
• User test makes the difference?
• Who are these users?
• Examine, inspect: all vague terms
• What about the objective of objectiveness?

Page 14

Assessment

• How good is it to be general?
• How general/specific is this method?
• Time efficiency vs. effort efficiency
• Should it be blind to the algorithm itself?
• Penalties, fail and goal constraints not specified!

Page 15

Assessment – scoring (1/2)

• The test cases are clearly specified. But it is not specified HOW a GOOD agent SHOULD react, though they say there is such a specification

• How can you get cognitive decisions out of only position, direction and a goal?

Page 16

Assessment – scoring (2/2)

• “Scoring not intended to be a proof of an algorithm’s effectiveness.”

• How do you interpret scores, and who wins?
  – “B is slightly better on average, but A has the highest scores.”
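The quoted situation is easy to reproduce with made-up numbers. The sketch below assumes, purely for illustration, that higher scores are better; the algorithm names and values are invented and do not come from the paper.

```python
from statistics import mean

# Invented scores for two hypothetical algorithms on the same four test cases.
scores = {
    "A": [9.0, 8.5, 2.0, 3.0],
    "B": [6.0, 6.0, 5.5, 5.5],
}

def summarize(values):
    """Return the two summaries the quote compares: average and peak."""
    return {"mean": mean(values), "max": max(values)}

summary = {alg: summarize(vals) for alg, vals in scores.items()}

# Pick a "winner" under each summary; the two choices need not agree.
best_mean = max(summary, key=lambda alg: summary[alg]["mean"])
best_peak = max(summary, key=lambda alg: summary[alg]["max"])

print(summary)
print("best on average:", best_mean)
print("highest single score:", best_peak)
```

Here B wins on the average while A holds the highest individual scores, so the answer to "who wins?" depends entirely on which summary the evaluator picks.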

Page 17

Assessment – final questions

• Can this method become a standard?
• What if someone claims to be so innovative this standard does not apply to them?
• Nice first try, though!
