Post on 19-Jan-2015
description
DATA MINING IS....
"A process used by companies to turn raw data into useful information.
Get an idea...
In vitro fertilizationGiven : Embryos described by 60 features Problem : Selection of embryos that will survive Data : Historical records of embryos and outcome
Cow culling Given : Cows described by 700 featuresProblem : Selection of cows that should be culledData : Historical records and farmers’ decisions
How it works?
salary
: paid
:unpaid
Example
loan
loan
salary
: paid
:unpaidIf salary < p..then loan : unpaid
p
At a glance....
Methodology of Data Mining
SIMPLE EXAMPLE
Outlook Temperature Humidity Play
Sunny Hot High No
Sunny Hot High No
Rainy Mild High Yes
Rainy Cool Normal Yes
Overcast Cool Normal Yes
Sunny Mild High No
Rainy Mild Normal Yes
Sunny Mild Normal Yes
Overcast Hot Normal Yes
Rainy Mild High No
WEATHER DATA1.1
We can generalise the data as We can generalise the data as followsfollowsIf outlook = sunny and humidity = high then play = noIf outlook = overcastIf humidity = normal
then play = yesthen play = yesthen play = yesIf none of the above
OUTLOOK
Rainy
Yes
Outlook
Sunny
Temperature
Mild Hot
HumidityYes
Normal
Yes No
High
DECISION TREE FOR WEATHER DATA
DATAMINING IS IMPORTANT ......
"We are overwhelmed by large amount of data today.."
"Data mining make it easier to handle it !! "