Multiagent learning and argumentation (Br 2008)

40
Argumentation & Learning in Multiagent Social Choice Enric Plaza (IIIA-CSIC)

description

Multiagent learning and argumentation. Porto Alegre (BR)

Transcript of Multiagent learning and argumentation (Br 2008)

Page 1: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Argumentation & Learning in Multiagent Social Choice

Enric Plaza (IIIA-CSIC)

Page 2: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Outline

•CBR vs. Learning vs. Agents

•Introduction to Social Choice in MAS

•Justified Predictions in MAS

•Arguments and Counterexamples

•Argumentation Process

•Experimental Evaluation

•Conclusions & Future Work

Page 3: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Learning, RL, and Agents

Why “always” Reinforcement Learning in MAS?It’s Online Learning!

Page 4: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

CBR is Online Learning

Argumentation

Page 5: Multiagent learning and argumentation (Br 2008)

CommitteeInput

DeliberationAggregation

Output

Committee: (1)A group of people officially delegated (elected or appointed) to perform a function, such as investigating, considering, reporting, or acting on a matter.Team: A number of persons associated in some joint action. A group organized to work together.Coalition: A combination or alliance, esp. a temporary one between persons, factions, states, etc. An alliance for combined action, especially a temporary alliance of political parties.

Page 6: Multiagent learning and argumentation (Br 2008)

Ensemble Effect

Page 7: Multiagent learning and argumentation (Br 2008)

Committee of agents

Page 8: Multiagent learning and argumentation (Br 2008)

Deliberation + Voting

Page 9: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Argumentation in Multi-Agent

Learning•An argumentation framework for

learning agents

•Justified Predictions as arguments

•Individual policies for agents to

•generate arguments and

•generate counterarguments

•select counterexamples

Page 10: Multiagent learning and argumentation (Br 2008)

Justified PredictionJustification: A symbolic description with the

information relevant to determine a specific prediction

Page 11: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Justified Prediction

•A justified prediction is a tuple

•where agent J.A considers J.S to be the correct solution for problem J.P

•and this is justified by a symbolic description J.D such that

Page 12: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Examination (1)

•Agent A1 sends a problem to agent A2

•Agent A2 sends a justification with a symbolic description D of why P1 has solution S2 according to its experience

Page 13: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Justification example

The predicted solution is hadromerida because the smooth form of the megascleres of the spiculate skeleton of the sponge is of type tylostyle, the spikulate skeleton of the sponge has not uniform length, and there are no gemmules in the external features of the sponge.

Page 14: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Examination (2)

Set of cases subsumed by

justification J.D

Page 15: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Examination (2)

Set of cases subsumed by

justification J.D

Counterexamples form theRefutation set

Page 16: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Counterargument generation

Page 17: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

•Justified Prediction: An argument α endorsing a individual prediction

•Counterargument: An argument β offered in opposition to an argument α

•Counterexample: A case c contradicting an argument α

Argument types

Page 18: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Case-based Confidence

Page 19: Multiagent learning and argumentation (Br 2008)

Preference on Arguments

Confidence on an

argument based on cases

Page 20: Multiagent learning and argumentation (Br 2008)

Preference on Arguments(2)

Joint Confidence

Preferred

Page 21: Multiagent learning and argumentation (Br 2008)

Relations between arguments

ConsistentIncomparable Counterargument

Page 22: Multiagent learning and argumentation (Br 2008)

Relations between cases and justified

predictions

Endorsing case Irrelevant case Counterexample

Page 23: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Argument Generation

•Generation of a Justified Prediction

•LID generates a description α.D subsuming P

•Generation of a Counterargument

• if no Counterargument can be generated then:

•Selection of a Counterexample

Page 24: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Counterargument Generation

• Counterarguments are generated based on the specificity criterion

• LID generates a description β.D subsuming P and subsumed by α.D

Page 25: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Selection of a Counterexample

• Select a case c subsumed by α.D and endorsing a different solution class.

Page 26: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

AMAL protocol

Justified prediction asserted in the next round

Assertions of n agents at round t

Agent states a counterargument ß

Set of contradicting arguments for agent Ai at round t (those predicting adifferent solution)

Page 27: Multiagent learning and argumentation (Br 2008)
Page 28: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Argument Generation

Page 29: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Confidence-weighted Voting

•Each argument in Ht is a vote for an alternative

•Each vote is weighted by the joint confidence measure of that argument

•Weighted voting: alternative with higher aggregated confidence wins

Page 30: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Experiments•Designed to validate 2 hypotheses

•average of 5 10-fold cross validation runs

•(H1) that argumentation is a useful framework for joint deliberation and can improve over other typical methods such as voting; and

•(H2) that learning from communication improves the individual performance of a learning agent participating in an argumentation process.

Page 31: Multiagent learning and argumentation (Br 2008)

#Agents

Acc

ura

cy

Sponges Data set

Accuracy after Deliberation

Page 32: Multiagent learning and argumentation (Br 2008)

Accuracy after Deliberation

#Agents

Acc

ura

cy

Soybean Data set

Page 33: Multiagent learning and argumentation (Br 2008)

Individual Learning from

Communication

%cases

Acc

ura

cy

Sponges Data set

Page 34: Multiagent learning and argumentation (Br 2008)

Individual Learning from

Communication

%cases

Acc

ura

cy

Soybean Data set

Page 35: Multiagent learning and argumentation (Br 2008)

Case Base Size#

case

s

23.4%20%

31.58%20.02%

Page 36: Multiagent learning and argumentation (Br 2008)

#ca

ses

23.4%20%

31.58%20.02%

Page 37: Multiagent learning and argumentation (Br 2008)

Learning from a few good cases while

arguing

%cases

Acc

ura

cy

Soybean Data set

Page 38: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Conclusions•An argumentation framework for

learning agents

•a case-based preference relation over arguments,

• by computing a confidence estimation of arguments

•a case-based policy to generate counter-arguments and select counterexamples

•an argumentation-based approach for learning from communication

Page 39: Multiagent learning and argumentation (Br 2008)

QuickTime™ and aGIF decompressor

are needed to see this picture.

Future Work•Framework for agents using induction• better policies for generating CA• collaborative search in generalization space

• Deliberative Agreement

• for social choice and collective judgment models

• aggregation procedures of sets of interconnected judgments

• deliberation over sets of interconnected judgments

Page 40: Multiagent learning and argumentation (Br 2008)

AMAL vs Centralized

#Agents

Acc

ura

cy

Sponges Data set

QuickTime™ and aGIF decompressor

are needed to see this picture.