Research Seminar on Aggregation Operators in …...Aggregation Operators in Fuzzy Control Andrea Zem...
Transcript of Research Seminar on Aggregation Operators in …...Aggregation Operators in Fuzzy Control Andrea Zem...
Research Seminar on
Aggregation Operators in Fuzzy Control
Andrea Zemankova
Aggregation operators - what they are?
Which car to choose? It should be fast, with low consumption,
nice design and cheap.
Usually no car on the market match all the criteria.
Speed Consumption Design PriceCar 1 0.3 0.4 0.9 0.8Car 2 0.9 0.5 0.5 0.4Car 3 0.2 0.9 0.8 0.3
How to decide which one is the best?
Which car is the best?
There are many way how to decide which one of these cars is thebest. Everything depends on the user preferences.
Example 1 If all criteria are equally important we can use the sum:
Car 1 0.3 + 0.4 + 0.9 + 0.8 = 2.4Car 2 0.9 + 0.5 + 0.5 + 0.4 = 2.3Car 3 0.2 + 0.9 + 0.8 + 0.3 = 2.2
or the product (in this case car which has middle score in all criteriais preferred to one which has half of the criteria with high score andhalf with low score)
Car 1 0.3 · 0.4 · 0.9 · 0.8 = 0.0864Car 2 0.9 · 0.5 · 0.5 · 0.4 = 0.09Car 3 0.2 · 0.9 · 0.8 · 0.3 = 0.0432
Example 2 If Consumption is the most important criterion, Designis the second most important and Speed and Price are less important,we can use weighted mean
Car 1 18 · 0.3 + 1
2 · 0.4 + 14 · 0.9 + 1
8 · 0.8 = 0.5625
Car 2 18 · 0.9 + 1
2 · 0.5 + 14 · 0.5 + 1
8 · 0.4 = 0.5375
Car 3 18 · 0.2 + 1
2 · 0.9 + 14 · 0.8 + 1
8 · 0.3 = 0.7125
Summary Every candidate is best in some criterion, but we need todecide which one is generally the best for us and order the cars fromthe best to the worst.
In other words we need aggregate scores for all criteria and assign toeach car just one number. Then we can see immediately which car isthe best.
Aggregation operators - what they are?
Definition
• n-ary aggregation operator is a non-decreasing function
A : [0,1]n −→ [0,1] such that
A(0, . . . ,0︸ ︷︷ ︸n−times
) = 0 and A(1, . . . ,1︸ ︷︷ ︸n−times
) = 1
• aggregation operator is a function A :⋃
n∈N[0,1]n −→ [0,1] such that
A(x) = x and A|[0,1]n is an n-ary aggregation operator for all n ∈ N.
Outline
• T-norms
• T-conorms, Uninorms
• Related operations - Implications, Negations
• Other types of aggregation operators (QAM,QWM,OWA,...)
Triangular norms
Notion ”t-norm” appeared first in 1942 in the paper of Karl Menger
and it was used in statistical metric spaces.
Later Schweizer and Sklar changed slightly the definition of a t-norm
and since then people find plenty of applications for t-norms in fuzzy
control, fuzzy measures and integrals, decision making, expert sys-
tems, . . .
In fuzzy control, t-norms are usually used when we need to find the
intersection of fuzzy sets, or when we need to model a conjunction
(AND connective).
Classical logic {0,1} Many-valued logics⇒ ÃLukasiewicz {0, 12,1}
1 0 1
0 0 0
0 1
1 0 12 1
12 0 ? 1
2
0 0 0 0
0 12 1
Example There are two sets: set of young people and set of tallpeople defined in ÃLukasiewicz three valued logic, i.e., every one eitheris definitely tall, or is definitely not tall or is something in between -i.e., he is tall to the degree 1
2 (similarly for the set of young people).
If somebody is young to degree 12 and is tall also to degree 1
2, whatis the truth value of the statement that he is young and tall?
In other words, to what degree does he belong to the intersection ofthe set of young people and the set of tall people?
Triangular norms
Fuzzy logic - truth values - [0,1] - AND operator - triangular norm
Intersection of fuzzy sets
Crisp set x ∈ A ∩B iff (x ∈ A&x ∈ B)
µA(x) =
1 if x ∈ A
0 elseµB(x) =
1 if x ∈ B
0 else
µA∩B(x) = AND(µA(x), µB(x)) =
1 if x ∈ A&x ∈ B
0 else
Fuzzy set µA∩B(x) = T (µA(x), µB(x))
Triangular norms
Function T : [0,1]2 −→ [0,1] is called a t-norm if it is
• Commutative, i.e., T (x, y) = T (y, x)
• Non-decreasing, i.e., T (x, y) ≤ T (x, z) whenever y ≤ z
• 1 is neutral element, i.e., T (1, x) = x
• Associative, i.e., T (x, T (y, z)) = T (T (x, y), z)
Basic t-norms
TM(x, y) = min(x, y)
TP (x, y) = x · y
TL(x, y) = max(0, x + y − 1)
TD(x, y) = 0 (except the boundary, where TD(1, x) = x = TD(x,1))
TD ≤ TL ≤ TP ≤ TM
0.20.4
0.60.8
1
0
0.5
0.2
0.4
0.6
0.8
1
TM
0.20.4
0.60.8
1
0.2
0.4
0.6
0.8
0.20.4
0.60.8
1
0
0.5
0.2
0.4
0.6
0.8
1
TP
0.20.4
0.60.8
1
0.2
0.4
0.6
0.8
0.20.4
0.60.8
1
0
0.5
0.2
0.4
0.6
0.8
1
TL
0.20.4
0.60.8
1
0.2
0.4
0.6
0.8
0.20.4
0.60.8
1
0
0.5
0.2
0.4
0.6
0.8
1
TD
0.20.4
0.60.8
1
0.2
0.4
0.6
0.8
Car example
First car
TM(0.3,0.4,0.9,0.8) = 0.3
TP(0.3,0.4,0.9,0.8) = 0.0864
TL(0.3,0.4,0.9,0.8) = 0
TD(0.3,0.4,0.9,0.8) = 0
Majority of non-continuous t-norms
is not suitable for fuzzy control
applications.
Class of continuous t-norms can be divided to two main groups.Archimedean and non-Archimedean. Archimedean are those t-normswhich except of 0 and 1 has no idempotent element.
If for x we have T (x, x) = x then x is called an idempotent elementof T.
For Archimedean T and x ∈ ]0,1[ there is T (x, x) < x
Ordinal sum Additive generator
T1
T2
T3
TM
T (x, y) = t−1(min(t(0), t(x)+ t(y)))
Isomorphism of t-norms
ϕ−1(T (ϕ(x), ϕ(y)))
Additive generator
Additive generator t(x) = − ln(x) then
T (x, y) = t−1(min(t(0), t(x) + t(y))) = e−(min(∞,− ln(x)+(− ln(y)))) =
e−(− ln(x·y)) = eln(x·y) = x · y
or if t(x) = 1− x then
T (x, y) = t−1(min(t(0), t(x) + t(y))) = 1− (min(1,1− x + (1− y))) =
1−min(1,2− x− y) = max(0,1− (2− x− y)) = max(0, x + y − 1)
Isomorphism
Isomorphism ϕ(x) = x2 then
Tϕ(x, y) = ϕ−1(T (ϕ(x), ϕ(y))) = (T (x2, y2))12
if T = TL we have TLϕ(x, y) = (max(0, x2+y2−1))12 - Schweizer-Sklar
t-norm with parameter p = 2
if T = TP we have TPϕ(x, y) = (x2 · y2)12 = x · y
Archimedean t-norms are isomorphic either to product TP (strict)
or to ÃLukasiewicz t-norm TL (nilpotent).
Ordinal sum
T = (〈TL,0, 12〉, 〈TP, 3
4,1〉)
T =
max(0, x + y − 12) if (x, y) ∈
]0, 1
2
[2,
4(x− 34)(y − 3
4) + 34 if (x, y) ∈
]34,1
[2,
min(x, y) else.
00.2
0.40.6
0.810.5
0.2
0.4
0.6
0.8
1
0.20.4
0.60.8
1
0.2
0.4
0.6
0.8
Every continuous t-norm is an ordinal sum of Archimedean t-norms.
• conjunction in fuzzy logic, intersection of fuzzy sets
• aggregation
• fuzzy relations (f.e., T -equivalency (similarities), T -E-ordering, T -partition)
• addition of distribution functions, fuzzy numbers
• fuzzy integrals (f.e., Sugeno, Shilkret, Sugeno-Weber)
• copulas (1-Lipschitz t-norms) modelling of stochastic dependenceof random vectors
• we can derive dual t-conorm, residual implication, negation
t-conorm commutative, associative, non-decreasing, 0 is neutral ele-ment
t-conorm is used (besides other applications) to model OR in fuzzylogic
uninorm commutative, associative, non-decreasing, there is a neutralelement e ∈ ]0,1[
Aggregation operator is called compensatory if low score in one ofthe criteria (one small input) can be compensated by high score inanother of criteria (high input).
T-norms are non-compensatory operations since T ≤ min and thusone low input causes that the result will never be greater than thisinput.
T-conorms are fully compensatory operations since C ≥ max and thusone high input causes that the result will never be smaller than thisinput.
Aggregation operator A is
• conjunctive if A ≤ min (t-norm,x1x22x3
3 · · ·xnn)
• disjunctive if A ≥ max (t-conorm,x1x122x
133 · · ·x
1nn)
Uninorm U(x, y)
T
Cmin ≤ U ≤ max
min ≤ U ≤ max
e
e
dual t-conorm C(x, y) = 1− T (1− x,1− y)
CM(x, y) = max(x, y), CP (x, y) = x + y − xy, CL = min(x + y,1)
residual implication x →T y = sup{z ∈ [0,1] | T (x, z) ≤ y}
x →T y = 1 iff x ≤ y and otherwise
x →M y = y, x →P y = yx, x →L y = y − x + 1
negation nT (x) = x →T 0
nL(x) = 1− x, nP (x) = nM(x) = 0 for x > 0 and 1 for x = 0.
Note that residual implication and negation are important operatorsused in fuzzy logic, but they are not aggregation operators!
Idempotent (averaging) aggregation operators
Arithmetic mean x1+...+xnn f(x) = x
Geometric mean n√
x1 · · ·xn f(x) = ln(x)
Harmonic mean n1x1
+...+ 1xn
f(x) = 1x
Quasi-arithmetic mean f−1(f(x1)+...+f(xn)n )
f : [0,1] −→ R, f is continuous and monotone (f : S −→ R)
Exercise
What kind of mean is generated by f(x) = x2 for x ∈ [0,1]?
What kind of mean is generated by f(x) = 3x2 + 2 for x ∈ [0,1]?
Properties of quasi-arithmetic means
For (x, . . . , x) we have
Af(x, . . . , x) = f−1(f(x)+...+f(x)n ) = f−1(f(x)) = x
Thus quasi-arithmetic means are idempotent
For an idempotent aggregation operator A always
min ≤ A ≤ max
If g(x) = a · f(x) + c then g−1(x) = f−1(x−ca )
Ag(x1, . . . , xn) = g−1(g(x1)+...+g(xn)n ) = f−1(
(a·f(x1)+c+...+a·f(xn)+c
n )−ca ) =
f−1(f(x1)+...+f(xn)n ) = Af(x1, . . . , xn)
Quasi-arithmetic means are invariant wrt. offsets and scaling of f.
Exercise
There are 7 numbers to be aggregated by quadratic mean. However,
first three of them are lost, but we remember that quadratic mean of
these three numbers was 0.5. Compute the quadratic mean of all 7
numbers, where last four are 0.2,0.3,0.4,0.6.
Properties of quasi-arithmetic means
A(x1, . . . , xn) = A(x1, . . . , xk, xk+1, . . . , xn) = A(m, . . . , m︸ ︷︷ ︸k−times
, xk+1, . . . , xn),
where m = A(x1, . . . , xk)
Subsets of elements can be aggregated a priori, without altering the
mean, given that the multiplicity of elements is maintained (quasi-
arithmetic means are decomposable).
Weighted quasi-arithmetic means
Weighted mean w1x1+...+wnxnw1+...+wn
Weighted geometric mean (w1+...+wn)√
xw11 · · ·xwn
n
((xw1
1 · · ·xwnn )
1w1+...+wn
)
Weighted quasi-aritmetic mean f−1(w1f(x1)+...+wnf(xn)w1+...+wn
)
w1, . . . , wn ∈ [0,1]
Weighted means are used in TSK systems.
Exercise There are three students in the course. Their scores inMaths and Physics are
Maths PhysicsStudent A 20 1Student B 9 9Student C 0 22
Try to find a weighted mean W (x, y) = w1x + w2y with w1 + w2 = 1such that the overall ranking of students will be
1. A > B > C2. A > C > B3. C > B > A4. C > A > B5. B > A > C6. B > C > A
OWA operators
Ordered weighted average (OWA)w1xσ(1)+...+wnxσ(n)
w1+...+wn
where σ is a permutation such that xσ(1) ≤ · · · ≤ xσ(n)
Example If w1 = 1 and wi = 0 for i 6= 1 then
A(x1, . . . , xn) =w1xσ(1)+...+wnxσ(n)
w1+...+wn=
1·xσ(1)1 =
xσ(1) = min(x1, . . . , xn)
What operator do we get by setting wn = 1 and wi = 0 for i 6= n?
Example If n = 3, w1 = 0, w2 = 1, w3 = 0 then
A(x1, x2, x3) = xσ(2) = Med(x1, x2, x3)
is so-called Median (middle value).
We can obtain n-ary Median by putting wn+12
= 1 and wi = 0 for
i 6= n+12 in the case that n is odd and wn
2= 1
2, wn2+1 = 1
2 and wi = 0
for i 6= n2, n
2 + 1 in the case that n is even.
Choquet integral
Let σ be a permutation such that xσ(1) ≤ · · · ≤ xσ(n)
Let m : P({1, . . . , n}) −→ [0,1] be a fuzzy measure (monotone set
function with m({1, . . . , n}) = 1 and m(∅) = 0)
Then the Choquet integral for finite space X = {1, . . . , n} is defined
by
Ch(x1, . . . , xn) =n∑
i=1(xσ(i)−xσ(i−1))m(A(i)) =
n∑i=1
xσ(i)(m(A(i))−m(A(i+1)))
where xσ(0) = 0 and A(i) = {σ(i), . . . , σ(n)} with m(A(n+1)) = 0.
Choquet integral
A fuzzy measure m is said to be additive (on a finite space X) if
m(A) =∑
i∈A
m({i})
for all sets A ∈ X.
A fuzzy measure is said to be cardinal if it depends only on the
cardinality of sets, i.e., m(A) = m(B) whenever |A| = |B|.
Choquet integral
For additive fuzzy measure m(A(i)) = m({σ(i)})+ · · ·+m({σ(n)}) andthus
(m(A(i))−m(A(i+1))) = m({σ(i)})
Therefore Choquet integral is equal to
Ch(x1, . . . , xn) =n∑
i=1xσ(i)(m(A(i))−m(A(i+1))) =
n∑i=1
xσ(i)m({σ(i)})
Thus for an additive fuzzy measure Choquet integral is equal to a
weighted meann∑
i=1wixi with weights wi = m({i}).
Choquet integral
For cardinal fuzzy measure let us denote by m(k) measure of the set
with k elements. Then Choquet integral is equal to
Ch(x1, . . . , xn) =n∑
i=1xσ(i)(m(A(i))−m(A(i+1))) =
xσ(1)(1−m(n−1))+xσ(2)(m(n−1)−m(n−2))+ · · ·+xσ(n)(m(1)−0)
Thus for a cardinal fuzzy measure Choquet integral is equal to an
OWA operatorn∑
i=1wixσ(i) with weights wi = m(n− i + 1)−m(n− i).
Choquet integral
For infinite space X and f : X −→ [0,1] the Choquet integral is given
by
Ch(f) =∫ 1
0m({x ∈ X | f(x) ≥ t})dt
Ordinal scales
Sometimes we need to work on an ordinal scale, where weightedmeans (Choquet integral) cannot be used. Like for example scaleK = {weak,medium, excellent}.
In such a case addition and multiplication is replaced by maximumand minimum and thus instead of weighted mean here the weightedmaximum is used.
On [0,1] scale, weighted maximum operator is given by∨
i
wi ∧ xi,
where wi ∈ [0,1] are weights with∨i
wi = 1.
Weighted maximum is used in Mamdani systems.
Example On K = {bad,weak, fair,good, excellent} for n = 3 let
Design Consumption PriceCar 1 weak excellent fairCar 2 bad fair excellentImportance of the criteria excellent good fair
i.e., for Car 1 x1 = weak, x2 = excellent, x3 = fair with weightsw1 = excellent, w2 = good, w3 = fair. Then weighted maximum isgiven by
∨
i
wi ∧ xi = (w1 ∧ x1)∨
(w2 ∧ x2)∨
(w3 ∧ x3) =
(excellent ∧weak)∨
(good ∧ excellent)∨
(fair ∧ fair) =
weak ∨ good ∨ fair = good
Similarly weighted maximum for Car 2 is fair.
Sugeno integral
Let σ be a permutation such that xσ(1) ≤ · · · ≤ xσ(n)
Let m : {1, . . . , n} −→ [0,1] be a fuzzy measure (monotone set functionwith m({1, . . . , n}) = 1 and m(∅) = 0)
Then the Sugeno integral for finite space X = {1, . . . , n} is defined by
S(x1, . . . , xn) =n∨
i=1
xσ(i) ∧m(A(i))
where A(i) = {σ(i), . . . , σ(n)}
A fuzzy measure m is said to be maxitive (on a finite space X) if
m(A) =∨
i∈A
m({i})
for all sets A ∈ X.
For maxitive fuzzy measure, Sugeno integral is equal to
S(x1, . . . , xn) =n∨
i=1xσ(i) ∧m(A(i)) =
n∨i=1
xσ(i) ∧ (∨
j≥σ(i)m({j}))
After some computation we get
n∨i=1
xσ(i) ∧ (∨
j≥σ(i)m({j})) =
n∨i=1
xσ(i) ∧ (m({σ(i)}))
Thus for a maxitive fuzzy measure Sugeno integral is equal to a
weighted maximumn∨
i=1wi ∧ xi with weights wi = m({i}).
Sugeno integral
For cardinal fuzzy measure, Sugeno integral is equal to ordered weighted
maximumn∨
i=1wixσ(i) with weights wi = m(n− i + 1).
For infinite space X and f : X −→ [0,1] the Sugeno integral is givenby
S(f) =∨
t∈[0,1]
t ∧m({f(x) ≥ t})