R & Data mining in action
-
Upload
kasia-mrowca -
Category
Technology
-
view
184 -
download
2
description
Transcript of R & Data mining in action
![Page 1: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/1.jpg)
R & data mining in action
Katarzyna Mrowca
![Page 2: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/2.jpg)
Sztuka czytania między wierszami
czyli język R i Data Mining w akcji
![Page 3: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/3.jpg)
Katarzyna Mrowca
<me>
</me>
![Page 4: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/4.jpg)
![Page 5: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/5.jpg)
The deal
![Page 6: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/6.jpg)
Agenda
• Quick glance on theory - Data mining• Exercises on… paper• Quick glance on tool – R console• Exercises – became friend with R• …
![Page 7: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/7.jpg)
Agenda
• Quick glance on theory - Data mining• Exercises on… paper• Quick glance on tool – R console• Exercises – became friend with R• …
ExerciseTheory
![Page 8: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/8.jpg)
Agenda
• Quick glance on theory - Data preparation• Exercises • Regression• Time series• Decision trees• Cluser analysis• Text mining• …
ExerciseTheory
![Page 9: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/9.jpg)
Quick glance on theory!
![Page 10: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/10.jpg)
What data mining is?
![Page 11: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/11.jpg)
What „google” says?
![Page 12: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/12.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science,
![Page 13: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/13.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 14: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/14.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 15: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/15.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 16: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/16.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 17: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/17.jpg)
What „google” says?
Data mining (the analysis step of the "Knowledge Discovery in Databases" process, or KDD), an interdisciplinary subfield of computer science, is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics.
![Page 18: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/18.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 19: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/19.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 20: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/20.jpg)
What „google” says?
The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
![Page 21: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/21.jpg)
What „google” says?
Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.
Source: wikipedia
![Page 22: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/22.jpg)
Data mining – what is „inside”
• Predictive• Regression• Classification• Collaborative Filtering
• Descriptive• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 23: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/23.jpg)
Data mining – what is „inside”
• Predictive:• Regression• Classification• Collaborative Filtering
• Descriptive:• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 24: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/24.jpg)
Data mining – what is „inside”
• Predictive:• Regression• Classification• Collaborative Filtering
• Descriptive:• Clustering / similarity matching• Association rules and variants• Deviation detection
![Page 25: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/25.jpg)
What data mining is not?
![Page 26: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/26.jpg)
Why Data Mining is so popular?
![Page 27: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/27.jpg)
What is a difference between statistics and data mining?
![Page 28: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/28.jpg)
Data preparation
![Page 29: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/29.jpg)
Variables
![Page 30: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/30.jpg)
Qualitative & Quantitative
![Page 31: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/31.jpg)
Tame R console!
![Page 32: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/32.jpg)
NetBeans + R
Source: https://blogs.oracle.com/geertjan/entry/r_plugin_for_netbeans_ide
![Page 34: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/34.jpg)
Revolution Analytics <- R + Hadoop + EnterpriseFind out more: http://www.revolutionanalytics.com
![Page 35: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/35.jpg)
Take a break
![Page 36: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/36.jpg)
Regression
![Page 37: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/37.jpg)
Time series
![Page 38: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/38.jpg)
Decision trees
![Page 39: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/39.jpg)
Regression trees
![Page 40: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/40.jpg)
Classification trees
![Page 41: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/41.jpg)
K means
![Page 42: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/42.jpg)
Text mining
![Page 43: R & Data mining in action](https://reader036.fdocuments.net/reader036/viewer/2022081413/54971b30ac7959412e8b521e/html5/thumbnails/43.jpg)
Thank you!