Brasil 96

8/12/2019 Brasil 96

1/6

A Methodology for Implementing Hybrid Expert Systems

L.M. Brasil, F.M. de Azevedo, R.Garcia and J.M. Barreto.Dept. of Electrical Engineering,

Federal University of Santa Catarina

Biomedical Engineering Research Group (GPEB)

University Campus, Florianpolis, Brazil, 88040-900

Phone (48)231-9594 Fax(48) 231-9770 E-Mail: [email protected]

Abstract - This paper deals with a new methodology for

developing an Expert System (ES). It has the ability of learning

to extract knowledge from a poor knowledge base using the

learning by example paradigms. The choice of a poor knowledge

base was motivated by the fact that in this case it is easier to

have consistence in putting together the several pieces of

knowledge. So, the problems attached to knowledge elicitation

are simplified. The implementation leads to a Hybrid Expert

System (HES). This system consists of a Neural Network based

Expert System (NNES) and a Rule Based Expert System(RBES). The main idea is that if the knowledge engineer has

conditions to obtain some basic rules, and a set of examples,

from the domain expert then it is possible to define the basic

structure of the NNES using those basic rules. Then the NNES

can be refined using the set of examples. In this stage structural

changes of the network are allowed by the learning algorithm.

Rules can be deduced after this refinement. Then it can be used

to form a RBES and an Explanatory Expert System (EES). The

methodology developed to HES is intended to be used in

implementing Decision Support Systems in the Medical Area.

I -INTRODUCTION

At present, one of the most known products of the

Artificial Intelligence (AI) is ES. It has proved its efficiency

independently of the implementation paradigm adopted:

symbolic manipulation or connectionist.

There are several problems in building an ES. One of them

is the process of Knowledge Acquisition (KA), which

comprehend knowledge extraction from a domain expert and

the choice of the model for the knowledge representation to

code human reasoning [1]. Knowledge elicitation stage

consumes time mainly because normally human beings, even

knowing how to solve a problem, have difficulty in

explaining how they reached the solution of a specific

problem by themself. Moreover, there is the problem of

knowledge representation of a domain expert. There are

many forms to represent it. When symbolic manipulation is

used, production rules are one of the most common

knowledge representation schemes used. It is simple and

direct, but it relies on a rich knowledge base. Moreover, it is

necessary to update it constantly. In case of connecionist

paradigm, the problems have been basically the choice of

data to be fed to the input of neural network (NN), the

number of neurons in the hidden layer and the kind of

topology of the NN used. Finally, only in some special cases,

there are difficulties in obtaining the explanation on how the

network arrived to a conclusion.

So, a HES is proposed to deal with the problems

mentioned above. This system consists of a NNES, a RBES

and, an EES [2].

The paper presents the architecture and operation of the

HES, the system methodology, the learning algorithm, and

discusses the proposed approach.

II- PROPOSED SYSTEM

The proposed architecture for ES includes a NNES, a

RBES, and an EES.

NNES has been used to implement ES as an alternative

manner to RBES. Artificial Neural Networks (ANN) are

made through a big number of units. These units own some

properties like natural neurons. Therefore, every unit presents

several inputs, some excitatories and others inhibitories.

Moreover, these units take values of each input and generate

an output that is function of the inputs. So, a network ischaracterized by the units (neurons) and by the way the

neurons are connected (topology). Moreover, algorithms are

used to change weights of the connections (learning rules).

Thus, these three aspects constitute the connectionist

paradigm of the artificial intelligence [3]. The

implementation of ES this way are called NNES. These

systems are generally developed using a static network with

feedforward topology trained by a backpropagation-like

learning algorithm [4]. So, while the basic network represents

relations among concepts and connections by way of

inferring something through it, the set of examples will refine

NNES. In this last process, the algorithm provides

modifications not only in the weights of connections, but alsoin the network structure. It uses this topology and it generates

and/or eliminates connections that had not been in basic

rules. Moreover, it can also occasionally generate more

concepts that were not in the basic rules. Therefore, the

system translates as rules the basic rules that the expert was

not able to extract. The basic rules after extraction suffers a

treatment due to the kind of variables applied as input of the

network, where they represent different types of concepts, as

quantitative, linguistic, boolean or a combination of them

8/12/2019 Brasil 96

2/6

[2,4,5]. Moreover, a structural modification of the ANN

consists in determining the number of neurons of the hidden

layer using a Genetic Algorithm (GA) [6].

RBES broadly depends on formal logic as a way of

explicit knowledge representation. Our system has two kinds

of data: basic rules and set of examples. The basic rules are

used to create the initial NNES which is refined through the

examples. The refined NNES is then translated in a RBES

from which explanations can be obtained. The model

proposed is through fuzzy logic. The theory of fuzzy logic

provides a good mathematical framework to represent

imprecise knowledge as in our case [2,5,7].

Finally, the EES of the HES is derived from RBES. It

compares the answers given by the NNES and by the RBES.

If the two answers are equals the EES is triggered and it will

give an explanation. Otherwise, it will state the impossibility

of reaching the goal and it will suggest how to obtain a

suitable solution [2].

III -METHODOLOGY

One of the difficulties in eliciting the knowledge of domain

experts is when one wishes to obtain an adequate set of rules.

Firstly, in many fields experts are not capable of realizing or

articulating which knowledge they use in solving their

problems. Secondly, different experts have often different

explanations to their decisions and sometimes even their

decisions disagree. [3].

On the other hand, a NNES needs only a set of examples to

represent the problem considered. However, as the

knowledge is often distributed in the network connections,

explaining how the NNES reached a conclusion is very

difficulty except if the knowledge representation is localized.Or, explaining the ES reasoning is of capital importance,

mainly when the user disagrees with the ES suggestion and

also during the tuning phase. This is extremely important in

our case, where physicians are the potential users. So, the

proposed methodology deals with a hybrid scheme, which

explores the intrinsic suitable characteristics of each

approach (Fig.1).

KNOWLEDGE ACQUISITION

INTEGRATED SYSTEM

EXPLANATORY SYSTEM

RBES

REFINED NNES

NNES

SET OF

EXAMPLES

BASIC

RULES

Fig.1 - Block diagram of the general system.

A.Knowledge Acquisition

The KA task consists on extracting knowledge of the

domain expert (i.e., a physician expert). In our case, the main

goal is to minimize the intrinsic difficulties of the KA

process. To do so, we try to obtain all possible rules from the

domain expert in a short time and also a set of examples.

The basic rules can be improved capturing the

uncertainties associated with the human cognitive processes.

The model proposed uses fuzzy logic [5].

Basic rules are translated by AND/OR graphs, so that they

define the NN basic structure of the NNES. In other words,

an AND/OR graph, which represent concepts and

connections, indicates the neurons number in the input and

output layers. The graph also shows the existence of

intermediate concepts and their connections which they

translate in the intermediate layer of the NN, according to

Fig.2.

R6

R6: If Q Then X.

R5: If P Then X.

R4: If G And H Then Q.

R3: If E And F Then Q.

R2: If C And D Then P.

R1: If A And B Then P.

Basic Rules:

R4R3

R5

R2R1

X

QP

HGFEDCBA

Fig.2- Organization of the rules base as a network.

Basic rules consist of an if-part, that indicates the

antecedent and it expresses, in our case, the symptoms of a

disease, while a then-part deals with the consequent and it

expresses the possible diagnostics [8]. Moreover, each of

these rules presents a membership degree. It corresponds tothe value that will be put in the inputs of the basic NN after

the treatment of the semantic variables.

B.Neural Model

A NNES structure that has conditions to receive several

kinds of semantic variables, i.e., boolean, linguistic and,

quantitative inputs, is considered (Fig.3). These inputs have

been treated as a unique kind of variable of fuzzy type. It is

believed that this sort of structure deals with modeling a

structure by a possible simpler way and also consuming less

learning time than a traditional structure.

Quantitative

Boolean

Linguistic

STATEMENTS

Neural

Inputs

(Neural

Outputs)

DecisionsNeural

Network

Learning

Algorithm

Interface

to Fuzzy

Fig.3 - State Variable Kinds

8/12/2019 Brasil 96

3/6

The mathematics model of the neuron is given by,

X(t)= n-dimensional input vector

X(t) = [x1(t), x2(t),..., xi(t),...,xn(t)]T n (1)

y(t)= scalar output of each neuron

y(t) 1

N: nonlinear mapping function, X Y, x(t) |y(t)

where: X:

+

n

and Y:

+

(2)

This mapping can be noted as N and so:

y(t) = N [X(t) n] 1 (3)

Mathematically, the neural nonlinear mapping function N

can be divided into two parts: a function called confluence

[5] and a nonlinear activation operation. The confluence

function is the name given to a general function having as

arguments the synaptic weights and inputs. A particular case

widely use is the inner product.

This mapping yields a scalar output u(t) which is a

measure of the similarity between the neural input vector X(t)

and the knowledge stored in the synaptic weight vector W(t).So, u(t) and W(t) are given by,

W(t) = [w0(t), w1(t),..., wi(t),..., wn(t)]T n+1 (4)

u(t) 1

Redefining X(t) to include the bias x0(t) we have:

u(t) = X(t) W(t) (5)

The nonlinear activation function maps the confluence

value u(t) [-,] to a bounded neural output. Then, the nonlinear

activation operator transforms the signal u(t) into a bounded

neural output y(t), that is,

y(t) = [u(t)] (6)

y(t) = [W(t) X(t)] 1 (7)

Applying the equations (1), (4), (5), and (7) to a multilayer

NN (e.g.,three layers), we have:

Y(t) = N3[N2[N1[X(t)n]]] m (8)

Y(t) = 3[W3(t) 2[W2(t) 1[W1(t) X(t)]]] (9)

Where i is non-linear activation operator, is the

confluence operator, and W1(t), W2(t), and W3(t) are the

synaptic weight vectors for the input, hidden and output

layers, respectively.

If we express the neural input signals in terms of their

membership functions each over the interval [0,1], rather

than in their absolute amplitudes, then we can perform

mathematical operation on these signals using logical

operations such as AND/OR, according to [5].

Let us express the inputs x1 and x2 over [0,1]. Then we

define the generalized AND (T-norm) as a T mapping

function and generalized OR (T-conorm) as a S mapping

function [7]:

T: [0,1] x [0,1] [0,1]

S: [0,1] x [0,1] [0,1], given by:

y1= [x1AND x2] [x1T x2] = T[x1, x2] (10)

y2= [x1OR x2] [x1S x2] = S[x1, x2] (11)

Then, in (5), by replacing the -operation by the T-

operation, and the -operation by the S-operation, we get

u(t) = Si

n

=0[wi(t) T xi(t)] [0,1] (12)

and

y(t) = [u(t)] [0, 1] (13)

C. Other Stages of the HES

After the basic NNES is obtained, the set of examples

serves to validate the neural network structure. In worst case,

the network does not represent the knowledge of the

problem. Therefore, it becomes evident that the basic rulesextracted from the expert are not sufficient, as expected. So,

these same examples are used by the learning algorithm to

refine the NN. This algorithm can change, generate and/or

eliminate connections, or it can still generate or eliminate

neurons in the hidden layer. After the refinement of the

network, a new discussion is made with the domain expert to

validate the modifications in the basic structure of the

network. Thus, a new set of examples is obtained to test

again the network. In the case that it has well performed, it is

supposed the network represents the proposed goal.

After the NNES is refined, a reverse process is followed

toward the inferring of the if-then rules together with their

membership degrees. So, a RBES is implemented. Following,

the RBES serves as basis for developing other system, the

EES, that is supposed able to explain why the NNES reached

a conclusion.

IV- LEARNING ALGORITHM

The learning algorithm developed for NNES is inspired on

the traditional backpropagation algorithm. Nevertheless, it

presents some differences:

- Optimization of the hidden layer is supported by GA.

- Incorporation of logic operators AND/OR in place of theweighted sum.

Therefore, the conventional backpropagation learning

algorithm is altered in function of these modifications. Some

works that integrate these topics are [2,4,5,6,8,9,10].

A.Learning Algorithm Stages

Summarizing, the first stage in the implementation of the

learning algorithm is considered as the assembly of the NN

basic structure. It is based in function of a AND/OR graph, in

8/12/2019 Brasil 96

4/6

other words, the hidden layer consists of AND nodes and the

output layer of OR nodes. So, the logic operators AND/ OR

are incorporated in place of the weighted sum in the network

neurons [5].

In next stage, the learning algorithm uses examples, which

are extracted from the domain expert, such as modifying, not

only the weight of each connection, but also the network

structure. In this last case, connections are included or

excluded among neurons, as well as neurons can be included

or excluded of the hidden layer by the application of the GA

[6].

B. Genetic Algorithm

GA are based on the work of Holland [11] where he was

inspired by the evolution of a population subject to

reproduction, mutations and crossover in a selective

environment. So, following this idea, we choose GA to

optimize [6] the size of the hidden layer and determining

weights to be set to zero. This can be justified by the

following main facts: it allows to avoid local optimum andprovides near-global optimization solutions and they are easy

to implement. Nevertheless, when it is applied with this goal,

in hidden layer of a NN, we must take care of respecting a

maximum and a minimum number of neurons of this layer. In

fact, too many neurons generally have as effect a decrease of

the generalization capabilities of the network and implies a

long learning phase. On the other hand, too few neurons can

be unable to learn, with the desired precision, the task. So,

there is an intermediate number of neurons that must be put

in the hidden layer, to avoid the problems mentioned above.

Then, the network must be sufficiently rich to solve a

problem and as it must also be adequately simple to solve a

problem well as it must not consume longer time of training.The parameter called as momentum, which it has the

objective to give higher speed of training to network, can still

be added in this same algorithm [12].

V- RESULTS AND DISCUSSIONS

An HES, including a RBES and a NNES, is discussed

under the aspects of KA, where the treatment of imprecision

is a possibility of explaining the reasoning to attain a

conclusion. The KA phase is based on learning by example

paradigms and the intrinsic capabilities of the NNES are

exploited. Soon afterwards, that knowledge can be

transferred to the RBES that uses fuzzy logic to deal with

imprecision. This methodology has proved, in the

preliminary studies performed, very promising by leading to

an easier KA phase than expected if the KA was performed

using symbolic techniques alone. The hybridism, on the other

hand, allows to complement the NNES with explanation of

reasoning facilities, that in most cases are difficult to obtain

with a NNES.

In future we intend to validate the approach with a

concrete example. This example will be probably the

construction of a Medical Decision Support System to

classify epileptic crises. This system will have about 15 to 30

rules combined with their membership degrees. These data

will be elicited by physician experts, mainly at the University

Hospital of Federal University of Santa Catarina.

ACKNOWLEDGMENT

The first author acknowledges the CAPES (Coordination

Foundation of High Level Personnel Improving) and the last

author the CNPq (National Counsil for Scientific and

Technological Development) - RHAE Program - for the

material support in the development of this work.

REFERENCES

[1] L.M. Brasil, "Aquisio de Conhecimento Aplicada aoDiagnstico de Epilepsia", M.Sc.Thesis in Electrical

Engineering, Federal University of Santa Catarina,

Florianpolis, UFSC, 1994.

[2] L. M. Brasil, F.M. de Azevedo, R. Garcia and J.M. Barreto

Cooperation of Symbolic and Connectionist Expert System

Techniques to Overcome Difficulties. In: Proceedings of 2nd

Neural Networks Brazilian Congress, Brazil: Curitiba,

pp.177-182, 1995.

[3] F.M. de Azevedo, "Contribution to the Study of Neural

Networks in Dynamical Expert Systems", Ph.D. Thesis,

Institut d'Informatique, FUNDP, Belgium, 1993.

[4] S. Mitra and S.K. Pal, "Logical operation based fuzzy MLP

for classification and rule generation", Neural Networks,vol.7, no.2, pp.353-373, 1994.

[5] M.M. Gupta and D. H. Rao, "On the principles of fuzzy neural

networks", Fuzzy Sets and Systems, vol.61, no.1, pp.1-18,

1994.

[6] R. G. Ojeda, F.M. Azevedo and J.M. Barreto, "Genetic

algorithms in the optimal choice of neural networks for signal

processing", in 38th. Midwest Symposium on Circuits and

Systems, Rio de Janeiro, Brazil. (In Press), 1995.

[7] H.J. Zimmermann, Fuzzy Set Theory - and Its Applications,

Massachusetts, USA: Kluwer Academic Publishes, 1991.

[8] L.M. Fu, "Knowledge-based connectionism for revising domain

theories", IEEE Transactions on Systems, Man, and

Cybernetics, vol.23, no.1, pp.173-182, 1993.[9] H. Ishibuchi, R. Fujioka and H. Tanaka, "Neural networks that

learn from fuzzy if-then rules", IEEE Transactions on Fuzzy

Systems, vol.1, no.2, pp.85-97, 1993.

[10] R. Sun, "Rule-based structure identification in an adaptive-

network-based fuzzy inference system", IEEE Transactions on

Fuzzy Systems, vol.2, no.1, pp.64-73, 1994.

[11] J.H. Holland, Adaptation in Natural and Artificial Systems,

Cambridge, Massachusetts: MIT Press, 1992.

[12] J.M. Zurada, Introduction to Artificial Neural Systems, St.

Paul, MN: West Publishing Company, 1992.

8/12/2019 Brasil 96

5/6

8/12/2019 Brasil 96

6/6

Brasil 96

Documents

Transcript of Brasil 96