ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of...

20
R. Subhashini*et al. /International Journal of Pharmacy & Technology IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6563 ISSN: 0975-766X CODEN: IJPTFI Available Online through Review Article www.ijptonline.com PERFORMANCE ANALYSIS OF DIFFERENT CLASSIFICATION TECHNIQUES FOR THE PREDICTION OF CHRONIC KIDNEY DISEASE R. Subhashini *1 , M.K.Jeyakumar 2 *1 Research Scholar, Department of Computer Science, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180. 2 Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180. Email: [email protected] Received on 28-10-2017 Accepted on: 25-11-2017 Abstract The performance degradation of renal of human being is portrayed as chronic kidney disease. Itdisappearthe human life by causing severe health issues thus its prediction becomes a central attraction in medical and research field. This paperanalyses number of existing literatures which are utilisingdiverse techniques principally classification algorithms for predicting chronic kidney disease. The performance analysis is carried out for the artificial neural network, naive Bayes, support vector machine, decision tree, k-nearest neighbour and fuzzy classifiers. The performance of these algorithms is evaluated using fiveclassification parameters. The classification methodologies analysed in this paper are derived from three major strategies they are the neural network, statistical methods and machine learning. Therefore by analysing the six techniques, this paper outlines the deep study of different classification strategies with its performance. Finally performance analysis is evaluated for the medical data set and comparison is graphically shown with the MATLAB simulation tool. Keywords: Chronic kidney disease; classification; artificial neural network; k-nearest neighbour; decision tree; naive Bayes. 1. Introduction Chronic kidney disease is the term for heterogeneous disorders which affect the functions and structure of kidney by doing damage or decreasing the functions. In developed country, this disease has associated with diabetes, old age, obesity, and hypertension, etc. So the exact diagnosis is tough. The abnormal presence of red and white blood cells causes the disease. The stage of the disease is based on the glomerular filtration rate (GFR). In normal the genetic factor can lead to the disease. If it increases, it causes damage to the kidney, after

Transcript of ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of...

Page 1: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6563

ISSN: 0975-766X

CODEN: IJPTFI

Available Online through Review Article

www.ijptonline.com PERFORMANCE ANALYSIS OF DIFFERENT CLASSIFICATION TECHNIQUES

FOR THE PREDICTION OF CHRONIC KIDNEY DISEASE

R. Subhashini*1

, M.K.Jeyakumar2

*1Research Scholar, Department of Computer Science, Noorul Islam Center for Higher Education,

Kanyakumari, Tamil Nadu, India 629180. 2Additional Controller of Examinations, Noorul Islam Center for Higher Education,

Kanyakumari, Tamil Nadu, India 629180.

Email: [email protected]

Received on 28-10-2017 Accepted on: 25-11-2017

Abstract

The performance degradation of renal of human being is portrayed as chronic kidney disease. Itdisappearthe

human life by causing severe health issues thus its prediction becomes a central attraction in medical and

research field. This paperanalyses number of existing literatures which are utilisingdiverse techniques

principally classification algorithms for predicting chronic kidney disease. The performance analysis is carried

out for the artificial neural network, naive Bayes, support vector machine, decision tree, k-nearest neighbour

and fuzzy classifiers. The performance of these algorithms is evaluated using fiveclassification parameters. The

classification methodologies analysed in this paper are derived from three major strategies they are the neural

network, statistical methods and machine learning. Therefore by analysing the six techniques, this paper

outlines the deep study of different classification strategies with its performance. Finally performance analysis

is evaluated for the medical data set and comparison is graphically shown with the MATLAB simulation tool.

Keywords: Chronic kidney disease; classification; artificial neural network; k-nearest neighbour; decision tree;

naive Bayes.

1. Introduction

Chronic kidney disease is the term for heterogeneous disorders which affect the functions and structure of

kidney by doing damage or decreasing the functions. In developed country, this disease has associated with

diabetes, old age, obesity, and hypertension, etc. So the exact diagnosis is tough. The abnormal presence of red

and white blood cells causes the disease. The stage of the disease is based on the glomerular filtration rate

(GFR). In normal the genetic factor can lead to the disease. If it increases, it causes damage to the kidney, after

Page 2: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6564

that the glomerular filtration rate is defined at a higher rate, which causes the kidney failure. This can be

diagnosed by the dialysis or transplantation technology. While providing this treatment the patients‟ health may

be improved or else the disease gets worse and leads to death. [1-3].

The injuries and burden of disease measurement are the input of human health policy. The entire group of

chronic disease can be degraded by diminishing the heart disease and diabetes. The common health concern is

the excess of body weight [4-5]. The enzyme method is presented to estimate the glomerular filtration rate. The

metabolism of vitamin D is regulated by the growth factor of fibroblast 23 (FGF23). The binding of co-

receptors can yield the glands of parathyroid cellular specificity in the kidney which improves the affinity of

growth factor to express the FGF receptors. The maintenance of phosphate balance in the disease affected

peoples with the compensatory response is the FGF level of 23 [6-7].

The chronic kidney disease can be predicted by several methods among this the abbreviated Modification of

diet in renal disease study equations which is modified by the coefficients of Japanese is the one method to

catch the disease. Thought out this technique the GFR values are calculated from the value of serum creatinine.

However, the renal function never decreased with a lower value of GFR [8]. Then the model of consecutive

series is developed to the disease with several variables which are independent of kidney failure. The model can

use the data's of sex and age, adding clinical and candidate variables. The hazard ratio is fitter for successive

models, but it is poor. The disease of CKD with patients may lose their energy and protein. The term of

cachexia represents a lot of protein- energy wastage [9].

CKD predicts one of the risk factors of the cardiovascular event. There are no equations established to prevent

the risk in CKD patients. The predicted scores are required to beat the risk by the evaluation of power [10]. The

CKD is based on the GFR, albuminuria, and GFR. The disease can be avoided by stopping the drug usage,

better treatment, test and intervention [11]. The level of higher metabolite levels depends on the decreased risk

of CKD. In early stages disease never provides the information about the injury and the cause of disease. The

patients will intake the optimal amount of energy [12-13]. The pathway of wnt/β-catenin and redox signalling

are used to avoid disease and five urine metabolites are correlated with GFR [14].

The cardiac surgery can treat the kidney injury, and the intraoperative variables can improve the current clinical

score. The prediction tools are used to handle the surveillance and Reno-protective efforts [15-16]. The risk

factors are in the disease in which the proteinuria is the important one, and it can be avoided by changing the

Page 3: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6565

intraglomerular hemodynamic. This will be independent of human blood pressure. The CV (cardiovascular)

risks are not only focusing on the clinical decision making but also make the health care. The biomarkers are

important to clear the pathway and to determine the injury in the kidney [17-18].

The most of the patients with chronic kidney disease are associated with the successive problems such as

diabetes, hypertension and cardiovascular disease [19-20]. Therefore the early prediction of chronic kidney

disease may save the human life by controlling the effects of disease and cost of treatment. The major

contribution of this review is analysing CKD prediction techniques with their accuracy of detection.

The recently growing health hindrance across the universe is the Chronic Kidney Disease (CKD). Table 1

defines the growth rate of different countries [21].

Table 1. Disease growth rate of countries.

No. Country Growth (out of 100 people)

1 The United States of America 10 or 15

2 Australia 11.2

3 Singapore 10.1

4 Japan 18.7

5 Iran 8.3 to 18.9

6 India 16

Table 2. Comparison of KNN performance.

Author(s) Technique Advantage Accuracy

(%)

Parul Sinha and

Poonam Sinha

[42]

KNN

SVM

It provides a new decision-

making system for

predicting CKD.

78

73

Ani R et al. [43] neural network based

back propagation (BPN)

probability based Naive

Bayes

Helps to improve the

accuracy and ease of

diagnosis.

81.5

78

76

Page 4: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6566

LDA classifier

BasmaBoukenze,

et al. [44]

C4.5

KNN

NB

MLP

Supports health care

industry to take decisions

for diagnosing kidney

failure.

63

58.25

57.5

62.25

Table 3. Attributes.

age - age

bp - blood pressure

sg - specific gravity

al - albumin

su - sugar

rbc - red blood cells

pc - pus cell

pcc - pus cell clumps

ba - bacteria

bgr - blood glucose random

bu - blood urea

sc - serum creatinine

sod - sodium

pot - potassium

hemo - haemoglobin

pcv - packed cell volume

wc - white blood cell count

rc - Red blood cell count

htn - hypertension

dm - diabetes mellitus

Page 5: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6567

cad - coronary artery

disease

appet - appetite

pe - pedal edema

ane - anemia

The morbidity and mortality have been an increasing problem in the medical research, which is greatly caused

by CKD [22]. The major contribution of this review is analysing CKD prediction techniques with their

accuracy of detection.

This paper is further organized as follows: Section 2, Section 3 and Section 4 presents the motivation of the

work, Methodology involved in the prediction of CKD, result and discussion with performance metrics.

Furthermore, Section 5 includes the conclusion, and finally references are added.

2. Methodology involved in the prediction of CKD

The healthcare sector in each country handles a large amount of data about the health condition of people.

However, the information from the data is not well used for the analysis and prediction of diseases. In the past

literature works these kinds of datasets are processed with some techniques, and the accurate prediction of the

disease has attained. The techniques have its pros and cons. The five most used methods in the prediction are

evaluated in this work. Figure 1 describes the flow of the work.

Figure 1.Classification of Chronic kidney disease.

Page 6: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6568

3.1. Dataset Collection

The dataset for the analysis is collected from the UCI Machine Learning Repository. The dataset of original

version has multiple different entries without some of the deleted or unfilled data. The collection has 400

instances for 24 attributes in which 11 attributes have numerical values, and 13 have nominal values. These

instances are divided into 250 identified as chronic kidney diseases and 150 as not chronic kidney diseases.

3.2. Classification Techniques

The regression, classification, clustering and association based techniques have been utilized for the prediction

of the diseases [23]. In this analysis, there are five classification techniques discussed for the prediction

accuracy. They are an artificial neural network (ANN), K-nearest neighbour (KNN), Support Vector Machine

(SVM), Naive Bayes, Decision Tree and fuzzy.

3.2.1. Artificial Neural network based classification

Nowadays the neural network is employed in the field of medical as a classifier to classify cancer [24],

diagnosis of Parkinson [25] disease and heart disease. Figure 3 demonstrates the ANN based classification.

The widespread usage of the neural network as a pattern classifier increases, because it learns the input data and

provides the desired response based on that data [26]. The error correction based training relates the preferred

output and input data as,

nYnDnE iii (1)

In equation (1) i represents the neuron, n is the number of iteration, nE i is the error response, nDi is the

preferred output and nY i is the estimated response.

In a neural network for every number of input attributes the comparison is the main task in the output side [27]

[28]. SherifE.Husseinet al. [29] confess iridology validation for estimating the kidney aberrations. This was

validated through the classification by combined wavelet and neural network analysis. Chongjian Wang et al.

[30] had proposed ANN as an efficient classification approach for the prediction of type 2 diabetes mellitus

affected grown-ups.

The processing way carried this methodology without using any of the biochemical parameters.

JamshidNorouziet al. [19] acknowledged a Neuro-fuzzy interference system for predicting renal failure based

on the real-time data.

Page 7: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6569

3.2.2. K-Nearest Neighbour based classification

The K- Nearest neighbour is the straight forward classifier with greater accuracy. The classification in this

algorithm depends upon the similarity measure. The continuity of attributes is very much important in this

process [31]. The progression stages in the nearest neighbour algorithm are shown in figure 2.

Figure 2. Flow of prediction methodology.

Ani R et al. [32] were utilizes the neural network based back propagation (BPN), probability based Naive

Bayes, LDA classifier techniques for the prediction with 81.5, 78 and 76% accuracy respectively.

BasmaBoukenze, et al. [33] was used C4.5, KNN, NB and MLP with 63, 58.25, 57.5 and 62.25% accuracy

respectively.

3.2.3. Support vector machine based classification

The SVM has considered as a machine learning algorithm for the classification of two-class problems. The

training of support vector machines with the positive and negative types of data is known as one-against-all or

one-against-rest. The two-class type SVM is combined for creating a multi-class support vector machine. The

classification by this algorithm is represented as in the equation [34] (2).

bxF j

n

jjj ,sgn

1 (2)

In (2) j is the Lagrange Multiplier, jj xand are the dimensional vectors of the two associated classes and

j , is the Kernel function. The numerical difficulties of this method are eliminated using Kernel function.

Page 8: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6570

Davar Givekiet al. [35] investigated feature weighted support vector machines and modified cuckoo search

based technique for evaluating the diabetes disease. It was processed with UCI dataset and provided an

accuracy value of 93.58%. Abdullah A. Aljumahet al. [36] had provided an experimental verification for

diabetes cure analysis regression tree based data mining techniques such as support vector machine. The

multiple ensemble based classifier analysis of cardiovascular disease had discussed in [37] by Jae-Hong Eomet

al. By analyzing the urinary biomarkers of patients, Wolfram Gronwald, et al. [38] had predicted the kidney

function let-down. In this analysis, the SVM classifier was used along with multidimensional-multinuclear

nuclear magnetic resonance spectroscopy.

3.2.4. Naive Bayes based classification

Consider n number of classes nNNN ,........, 21 . The each single sample is constructed with a vector S of dimension

with i number of attributes. The naive Bayesian classifier uses the probability count for predicting the class of S

By this theorem,

SP

NPNSP

SNPii

i

(3)

For evaluating the numerator term, the attributes of the dimensional vector is considered as independent values

of each other. This is denoted as in (4).

p

PiPi NSPNSP

1

(4)

Ashok Kumar Dwivedi [39] had presented an analysis of six techniques for the early prediction of Diabetes

including naïve Bayes. The objective of the work was developed by Daria Prilutskyet al. in[40] is the

classification of infectious diseases by the data mining techniques with the lower accuracy of naïve Bayes

technique. Abeer Y. Al-Hyariet al. [41] had discussed a new clinical decision support system with classifiers

like naïve Bayes for envisaging the evolution of chronic renal failure with high accuracy. Asif Salekinet al. [42]

had addressed the CKD prediction with machine learning approaches 24 predictive parameters. The approaches

proposed in the paper were well designed for the diminution of noise data.

3.2.5. Decision tree based classification

This technique is a branching methodology which provides the outputs which are possibly related to the

decision. For the generation of proper decision tree approach uses greedy search. The classification tree

Page 9: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6571

analysis was discussed by Paola Rucciet al. [43] for the estimation of clinical and demographic features of

patients who are holding differential annual decline in glomerular filtration rate to predict the chronic kidney

disease. The decision support methodology for the reduction of treatment cost and welfare of the person having

unwanted health behaviour affected with the problem of haemodialysis was presented by Jinn-Yi Yehet al.

[44]. Naoki Hiramatsuet al. [45] had used the decision tree analysis for identifying the anaemia disease. The

data was collected from the hepatitis C patients for the evaluation. The collection of data mining techniques

including decision trees is analysed for the accurate prediction of kidney diseases which helps the health

industry was reviewed by Ms. AsthaAmeta and Ms.Kalpana Jain in [46].

3.2.6. Fuzzy based classification

The fuzzy classifier can classify the object which holds multiple values. It provides a low value of

computational cost at the same it yields higher classification accuracy. In the fuzzy based classification

methodology, the data is converted into a practical form using the principal component analysis scenario [47]. It

provides some merits which are shown below [48].

1) Describing the knowledge using linguistic terms such as „„high” instead of hard terms.

2) Acting as simulating the actual situations.

N. Pavithra and Dr R. Shanmugavadivu have proposed a paper in which the data mining techniques including

fuzzy was analysed for the prediction of kidney disorder in the earlier stage of the disease. The methodology

was provided with the advantage of classification accuracy increment while increasing the attributes [49].

Jamshid Norouziet al. [19] were addressed intelligent fuzzy expert system for the prediction of chronic kidney

disease. The renal failure progression was identified in this paper as a major theme. This suggests the accurate

discovery over time is necessary for the reduction of cost and mortality rate. Mohammad SaberIraji [50] was

presented the fuzzy technique which combined with the neural network for the prediction of chronic kidney

disease. There are 24 features were considered for the evaluation.

4. Result and Discussion

The analyses of considered methods in the review are validated using Matlab R2016a installed in the system

having the configuration of Windows 10. The five kinds of methodologies namely ANN, KNN, Decision tree,

Naive Bayes, SVM and fuzzy are validated and tested for the classification accuracy of the prediction of

Page 10: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6572

chronic kidney diseases. The parameters used for the validation are accuracy, precision, recall, specificity and

F-Measure.

4.1. Accuracy

The accuracy of the classifiers is measured by the equation (5).

negativefalsenegativetruepositivefalsepositivetrue

negativetruepositivetrueAccuracy

(5)

Figure 3. Neural network classification.

Figure 3 provides the classification accuracy of the techniques analysed here. Based on the accuracy analysis,

the fuzzy is highlighted by its performance. It provides a classification accuracy of 90%. The descending order

of performance is Fuzzy, Decision tree, KNN, Naïve Bayes, SVM and ANN. Even though the decision tree

technique exhibits larger searching time for the correct decision, it provides accurate output than the Naïve

Bayes and supports vector machine classifier.

4.2. Precision

Precision is the measure defined by the ratio of true positive to the addition of true positive and false positive. It

can be estimated using (6) and shown in figure 4.

positivefalsepositivetrue

positivetrueecision

Pr (6)

Page 11: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6573

The fuzzy classifier provides the precision of 0.75. The ANN classifier provides the accuracy of 75% with a

precision of 0.37. The precision for the nearest neighbour for the accuracy of 85.5% is 0.37. The decision tree

method holds a precision value of 0.25 with accuracy 88.12%. The naïve Bayes and SVM has an accuracy of 85

and 82.5% with precision .451 and 0.412 respectively.

Figure 4. KNN classification.

4.3. Recall

The recall is another measure for the performance evaluation of classifiers, and it is otherwise known as

sensitivity. It is evaluated as by below equation (7).

negativefalsepositivetrue

positivetrueySensitivitcall

/Re (7)

In (7) true positive is the indication of the positive sample which is identified properly by the classification

technique used.The false negative is evaluated by positive sample mistakenly identified as the negative sample

by the classifier. The below figure 5 shows the performance of six classifiers for the parameter Recall.

Figure 5. Decision tree classification.

Page 12: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6574

4.4. Specificity

Here we are taking specificity as the final measure for the analysis of the performance of the five classifiers

used in this work. The specificity can be evaluated using the equation (8).

positivefalsenegativetrue

negativetrueySpecificit

(8)

In (8) true negative is the identification of negative sample equally by the classification technique used.The

false positive is evaluated by negative sample mistakenly identified as the positive sample.

Figure 6. Accuracy of classifiers.

From figure 6 specificity value achieved by classifier support vector machine is 0.4125, and the naïve Bayes

technique is 0.456. The other classifiers ANN, KNN and decision tree used in the analysis provide the value of

0.6, 0.37 and 0.72 respectively. The specificity of the fuzzy classifier is 0.82.

4.5. F-measure

The F-measure is a parameter related to precision and sensitivity and for the proposed classifiers it is

represented in figure 7. The F-measure can be defined as,

senitivityprecision

senitivityprecisionmeasureF

2 (9)

Figure 7. Precision of classifiers.

Page 13: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6575

Figure 8. Recall of classifiers.

Figure 9. Specificity of classifiers.

Figure10. MCC of classifiers.

Figure 11. NPV of classifiers.

Page 14: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6576

Figure 12. F-measure of classifiers.

Figure 13. FDR of classifiers.

The F-Measure of the classifier is nearest to 1 means it is the highest performance classification methodology,

and the value is nearest to zero it indicates the stumpy performance. The F-measure of the fuzzy is 0.71, and the

decision tree classifier is 0.14 from the evaluation.

So the analysis for the prediction of chronic kidney diseases with classification techniques is validated in the

result part. From the analysis of performance, it concludes that fuzzy has the highest accuracy of prediction

over other technologies. Moreover, the each technique have own drawback and advantages. Even the artificial

neural network has the high performance next to fuzzy classifier in the prediction it is slower because of the

learning method. The performance of the K nearest neighbour has reduced when the dataset carries larger

instances.

The need of memory is very large in the support vector machines. The precision value has converted into

descending value while data was reduced in naive Bayes. However, according to the analysis of performance

based on the five parameters, it is concluded that fuzzy shows better classification performance for the

predictionof chronic kidney disease.

Page 15: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6577

5. Conclusion

This paper concentrates on the analysis of classification methodologies for the accurate prediction of chronic

kidney disease. However many techniques are ascended to meet the challenges of classification, it is a

challenging issue in the medical field. In this work, six classification methodologies are weighed with five

parametric comparisons in MATLAB simulation tool. The techniques used in the analysis are an artificial

neural network, K-nearest neighbour, decision tree, naive Bayes, support vector machine and fuzzy. Each

technique has verified with medical dataset having 400 instances. From the evaluation of parameters, the fuzzy

has discovered as the best technique for the classification of chronic kidney disease. Finally, in the result part,

the techniques are discussed for the drawbacks associated with them. As this literature concludes with a future

extension that based on the need of work the better performance classification methodology is expanded and

this is based on the modification and enhancement of the technique with required accuracy of prediction.

Acknowledgements

I would like to thank my co-author Mr. M.K.Jeyakumar for guiding me to complete this research and also am

very thankful to my institution „Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India‟

for giving me full support to complete this work

Conflict of interest:

R. Subhashini, M.K.Jeyakumar, A state that there are no conflicts of interest. Patients‟ rights and animal

protection statements: This research article does not contain any studies with human or animal subjects.

Ethical statements

Animal and human subjects were not used in this study.

Funding

This Research did not receive any specific grant from funding agencies in the public, commercial or not-for-

profit sectors.

References

1. S. Levey, and J. Coresh, “Chronic kidney disease,” The Lancet, vol. 379, no. 9811, pp. 165-180, 2012.

2. C. Delles, and R. Vanholder, “Chronic kidney disease,” Clinical Science, vol. 131, no. 3, pp. 225-226,

2017.

Page 16: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6578

3. S. S. Lim, T.Vos,A. D. Flaxman, G.Danaei, K. Shibuya, H. Adair-Rohani, M. A.AlMazroa, M.Amann, H.

R. Anderson, K. G. Andrews,M.Aryee, C. Atkinson,L. J. Bacchus, A. N.Bahalim, K.Balakrishnan,

J.Balmes, S. Barker-Collo, and A. Baxter, “comparative risk assessment of burden of disease and injury

attributable to 67 risk factors and risk factor clusters in 21 regions,” a systematic analysis for the Global

Burden of Disease Study, The lancet, vol. 380, no. 9859, pp. 2224-2260, 2013.

4. D. O.Abegunde,C. D.Mathers, Taghreed Adam, Monica Ortegon, and Kathleen Strong, “The burden and

costs of chronic diseases in low-income and middle-income countries,” The Lancet, vol. 370, no. 9603,

pp. 1929-1938 , 2007.

5. A.S. Levey, R. Atkins, J. Coresh, E.P. Cohen, A.J. Collins, K.-U. Eckardt, M.E. Nahas, B.L. Jaber, M.

Jadoul, A. Levin, N.R. Powe, J. Rossert, D.C. Wheeler, N. Lameire, and G. Eknoyan, “Chronic kidney

disease as a global public health problem: approaches and initiatives–a position statement from Kidney

Disease Improving Global Outcomes,” International journal, vol. 72, no. 3 pp. 247-259, 2007.

6. E. Imai, M.Horio, K. Nitta, K. Yamagata, K. Iseki, S. Hara, N.Ura, Y.Kiyohara, H. Hirakata, T.

Watanabe, T. Moriyama, Y. Ando, D.Inaguma, I. Narita, and H.Iso, “Estimation of glomerular filtration

rate by the MDRD study equation modified for Japanese patients with chronic kidney disease,” Clinical

and experimental nephrology, vol. 11, no. 1, pp. 41-50, 2007.

7. T.Isakova, H.Xie, and W. Yang, “Fibroblast growth factor 23 and risks of mortality and end-stage renal

disease in patients with chronic kidney disease,” vol. 305, no. 23, pp. 2432-2439, 2011.

8. D. E. Weiner, H.Tighiouart, E. F. Elsayed, J. L. Griffith, D. N. Salem, A. S. Levey, and M. J. Sarnak,

“The Framingham predictive instrument in chronic kidney disease,” Journal of the American College of

Cardiology, vol. 50, no. 3, pp. 217-224, 2007.

9. N.Tangri, L. A. Stevens, J. Griffith,H.Tighiouart, OgnjenkaDjurdjev, David Naimark, Adeera Levin, and

Andrew S. Levey, “A predictive model for progression of chronic kidney disease to kidney failure,” vol.

305, no. 15, pp. 1553-1559, 2011.

10. S. C. Mora, M.Goicoechea, E. Torres, U.Verdalles, A. P. de Jose, E. Verde, S.García de Vinuesa, and

J.Luno, “Cardiovascular risk prediction in chronic kidney disease patients,” no. 1, p. 8, 2017.

11. G. M. McMahon, S-J. Hwang, C. B. Clish, A. Tin, Q. Yang, M. G. Larson, E. P. Rhee, M. Li, CKDGen

Consortium Daniel Levy, and Christopher J. Donnell, “Urinary metabolites along with common and rare

Page 17: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6579

genetic variations are associated with incident chronic kidney disease,” vol. 91, no. 6, pp. 1426–1435,

2017.

12. M-E. Chen, S-J. Hwang, H-C. Chen, C-C. Hung, H-C. Hung, S-C. Liu, T-J. Wu, and M-C. Huang,

“Correlations of dietary energy and protein intakes with renal function impairment in chronic kidney

disease patients with or without diabetes,” The Kaohsiung Journal of Medical Sciences, vol. 33, no. 5, pp.

252-259, 2017.

13. J.Crosina, J. Lerner, J.Ho, N.Tangri, P.Komenda, B.Hiebert, N. Choi, R. C. Arora, and C.Rigatto,

“Improving the Prediction of Cardiac Surgery–Associated Acute Kidney Injury,” Kidney International

Reports, Vol. 2, no. 2, pp. 172–179, 2017.

14. M.Vervloet, and M.Cozzolino, “Vascular calcification in chronic kidney disease: different bricks in the

wall,” Kidney International, vol. 91, no. 4, pp. 808-817, 2016.

15. K. Soto, P. Campos, I. Pinto, B. Rodrigues, F.Frade, A. L.Papoila, and P.Devarajan, “The risk of chronic

kidney disease and mortality are increased after community-acquired acute kidney injury,” Kidney

International, vol. 90, no. 5, pp. 1090-1099, 2016

16. D.Collister, T. Ferguson, P.Komenda, and N.Tangri, “The patterns, risk factors, and prediction of

progression in chronic kidney disease,” Seminars in Nephrology, vol. 36, no. 4, pp. 273-282, 2016.

17. O. Alebiosu, and O. E. Ayodele, "The global burden of chronic kidney disease and the way forward,”

Ethnicity and Disease, vol. 15, no. 3, p.418, 2005.

18. R. L. Perlman,F. O. Finkelstein, L. Liu, E.Roys, M. Kiser, G.Eisele, S. Burrows-Hudson, J. M. Messana,

N. Levin, S.Rajagopalan, F. K. Port, R. A. Wolfe, and R. Saran, "Quality of life in chronic kidney disease

(CKD): a cross-sectional analysis in the Renal Research Institute-CKD study", American journal of

kidney diseases, vol. 45, no. 4, pp.658-666, 2005.

19. J.Norouzi,A.Yadollahpour,S. A.Mirbagheri,M. M.Mazdeh,and S. A.Hosseini, “Predicting renal failure

progression in chronic kidney disease using integrated intelligent fuzzy expert system,” Computational

and mathematical methods in medicine, vol. 2016, no. 1, p. 9, 2016.

20. M.Mahdavi-Mazdeh, “Why do we need chronic kidney disease screening and which way to go,” Iranian

Journal of Kidney Diseases, vol. 4, no. 4, pp. 275–281, 2010.

Page 18: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6580

21. V.Kunwar, K.Chandel, A. S.Sabitha, and A. Bansal “Chronic kidney disease analysis using data mining

classification techniques,” In proceedings of Cloud System and Big Data Engineering, no. 1, pp. 300-305,

2016.

22. H.P. Hu, Z.J. Niu, Y.P. Bai, and X.H. Tan, “Cancer classification based on gene expression using neural

networks,” Genetics and Molecular Research, vol. 14, no. 4, pp. 17605-17611, 2015.

23. E.Orhan, C.Onursal,B. M. Serdar, and T.Feyzullah, “A Comparative Study on Parkinson's disease

Diagnosis Using Neural Networks and Artificial Immune System,” Journal of Medical Imaging and

Health Informatics, vol. 6, no. 1, pp. 264-268, 2016.

24. A.Khemphila, and V.Boonjing, “Heart disease classification using artificial neural networks,” in

proceedings of Systems Engineering, no. 1, pp. 406-409, 2011.

25. X.e Wang, H. Chen,C. Chang,M. Jiang, X. Wang, and L. Xu, “Study the therapeutic mechanism of

Amomumcompactum in gentamicin-induced acute kidney injury rat based on a back propagation neural

network algorithm,” Journal of Chromatography B, vol. 1040,no. 1, pp. 81–88, 2017.

26. T. D.Noia, V. C.Ostuni, F.Pesce, G.Binetti, D.Naso, F. P.Schena, and E. D. Sciascio, “An end stage

kidney disease predictor based on an artificial neural networks ensemble,” Expert Systems with

Applications, Vol. 40, no. 11, pp. 4438–4445, 2013.

27. S. E. Hussein, O. A. Hassan, and M. H. Granat, “Assessment of the potential iridology for diagnosing

kidney disease using wavelet analysis and neural networks,” Biomedical Signal Processing and Control,

Vol. 8, no. 6, pp. 534–541, 2013.

28. C. Wang, L. Li, L. Wang, Z. Ping, M. T. Flory, G. Wang, Y. Xi, and W. Li, “Evaluating the risk of type 2

diabetes mellitus using artificial neural network: An effective classification approach,” Diabetes Research

and Clinical Practice, Vol. 100, no. 1, pp. 111–118, 2013.

29. M.A.jabbar, B.L.Deekshatulu, and P. Chandra, “Classification of Heart Disease Using K- Nearest

Neighbor and Genetic Algorithm,” In proceedings of Computational Intelligence: Modeling Techniques

and Applications, vol. 10, no. 1, pp. 85 – 94, 2013.

30. P. Sinha and P. Sinha, "Comparative study of chronic kidney disease prediction using KNN and SVM. In

International Journal of Engineering Research and Technology", vol. 4, no. 12, 2015.

Page 19: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6581

31. R. Ani, G. Sasi, U. R. Sankar, and O. S. Deepa, “Decision Support system for diagnosis and prediction of

Chronic Renal Failure using Random Subspace Classification,” In proceedings of advances in Computing,

Communications and Informatics, no. 1, 1287-1292, 2016.

32. B.Boukenze, A.Haqiq and H.Mousannif, “Predicting Chronic Kidney Failure Disease Using Data Mining

Techniques”, In the proceedings of Advances in Ubiquitous Networking 2, vol. 397, no. 1, pp. 701-712,

2017.

33. X. Tang,L.Zhuang, J.Cai, and C. Li, "Multi-fault classification based on support vector machine trained

by chaos particle swarm optimization", Knowledge-Based Systems, vol. 23, no. 5, pp. 486-490, 2010.

34. D.Giveki, H.Salimi, G. R.Bahmanyar and Y.Khademian, "Automatic detection of diabetes diagnosis

using feature weighted support vector machines based on mutual information and modified cuckoo

search", arXiv preprint arXiv:1201.2173, 2012.

35. A. A. Aljumah, M. G.Ahamad and M. K. Siddiqui, "Application of data mining: Diabetes health care in

young and old patients", Journal of King Saud University-Computer and Information Sciences, vol. 25,

no. 2, pp. 127-136, 2013.

36. J-H.Eom, S-C. Kim and B-T. Zhang, “AptaCDSS-E: A classifier ensemble-based clinical decision support

system for cardiovascular disease level prediction”, Expert Systems with Applications, vol. 34, no. 4, pp.

2465-2479, 2008.

37. W.Gronwald, M. S. Klein, R.Zeltner, B-D. Schulze, S. W. Reinhold, M.Deutschmann, A-K.Immervoll, C.

A. Boger, B.Banas, K-U.Eckardt and P. J. Oefner, “Detection of autosomal dominant polycystic kidney

disease by NMR spectroscopic fingerprinting of urine”, Kidney international, vol. 79, no. 11, pp. 1244-

1253, 2011.

38. X. Liu,R. Lu,J. Ma,L. Chen and B. Qin, “Privacy-preserving patient-centric clinical decision support

system on naive Bayesian classification”, IEEE journal of biomedical and health informatics, vol. 20, no.

2, pp. 655-668, 2016.

39. A. K.Dwivedi, "Analysis of computational intelligence techniques for diabetes mellitus prediction",

Neural Computing and Applications, pp. 1-9, 2017.

Page 20: ISSN: 0975 -766X CODEN: IJPTFI Available Online through ... · 2Additional Controller of Examinations, Noorul Islam Center for Higher Education, Kanyakumari, Tamil Nadu, India 629180

R. Subhashini*et al. /International Journal of Pharmacy & Technology

IJPT| Dec-2017| Vol. 9 | Issue No.4 | 6563-6582 Page 6582

40. D.Prilutsky, B.Rogachev, R. S. Marks, L.Lobel and M. Las, Classification of infectious diseases based on

chemiluminescent signatures of phagocytes in whole blood”, Artificial intelligence in medicine, vol. 52,

no. 3, pp. 153-163, 2011.

41. A. Y. Al-Hyari, A. M. Al-Taee and M. A. Al-Taee, “Clinical decision support system for diagnosis and

management of chronic renal failure”, In the proceedings of Applied Electrical Engineering and

Computing Technologies, no. 1, pp. 1-6, 2013.

42. A.Salekin, J. Stankovic, “Detection of Chronic Kidney Disease and Selecting Important Predictive

Attributes”, In the proceedings of IEEE Healthcare Informatics, pp. 262-270, 2016.

43. P.Rucci, M.Mandreoli, D.Gibertoni, A.Zuccalà, M. P.Fantini, J.Lenzi and A. Santoro, “A clinical

stratification tool for chronic kidney disease progression rate based on classification tree analysis”,

Nephrology Dialysis Transplantation, vol. 29, no. 3, pp. 603-610, 2014.

44. J-Y.Yeh, T-H. Wu, C-W.Tsao, “Using data mining techniques to predict hospitalization of hemodialysis

patients”, Decision Support Systems, vol. 50, no. 2, pp. 439-448, 2011.

45. N.Hiramatsu, M. Kurosaki, N. Sakamoto, M. Iwasaki, M. Sakamoto, Y. Suzuki, F.Sugauchi, A.Tamori,

S.Kakinnuma, K. Matsuura and N. Izumi, “Pretreatment prediction of anemia progression by pegylated

interferon alpha-2b plus ribavirin combination therapy in chronic hepatitis C infection: decision-tree

analysis”, Journal of gastroenterology, vol. 46, no. 9, pp. 1111-1119, 2011.

46. A.Ameta, K. Jain, “Data Mining Techniques for the Prediction of Kidney Diseases and Treatment: A

Review”, International Journal of Engineering and Computer Science, vol. 6, no. 1, pp. 20376-20378,

2017.

47. P.Luukka, "Classification based on fuzzy robust PCA algorithms and similarity classifier", Expert

Systems with Applications, vol. 36, no. 4, pp. 7463-7468, 2009.

48. S. Das, P. K. Ghosh, S.Kar,"Hypertension diagnosis: a comparative study using fuzzy expert system and

neuro fuzzy system", In the proceedings of Fuzzy Systems, pp. 1-7, 2013.

49. N.Pavithra and R. Shanmugavadivu,"Efficient Early Risk Factor Analysis of Kidney Disorder Using Data

mining Technique", International Journal of Innovative Research in Computer and Communication

Engineering", vol. 5, no. 1, 2017.