CS 523 Database Topics
description
Transcript of CS 523 Database Topics
Exam Reviews
1
Class Introduction• Definitions• Introduction to OLAP processing• Introduction to Data Mining Processing
Data Warehousing• Differences between a data warehouse and
a transactional database• Data organization – the star schema• Bit Map Indexes
2
Introduction to ETL• Data Preprocessing Activities• SQL Commands to copy data
Extraction/transformation/load• Sqlldr• Externally stored tables• Dblinks
OLAP Introduction OLAP in SQL
• Rollups and Cubes
3
More OLAP in SQL• Partitioning• Analytical functions (cume_dist, ntile, etc.)
Introduction to data mining• No prediction – time series, clustering• Predict one thing – decision trees, value
prediction• Predict everything – association, sequential
patterns.
4
Class Introduction• Basic Definitions
Introduction to ETL• Data Preprocessing Activities• SQL Commands to copy data• Extraction/transformation/load• Sqlldr• Externally stored tables• Dblinks
5
6
Data Warehousing• Definitions• Star Schema – Dimensions and facts.• Snowflakes, Constellations• Bit-Map Index
7
Decision Support• Data Warehousing
Definitions, Tools• OLAP
Roll up, drill down, slice and dice SQL: Partitions, Rank, N-Tiles, Distributions,
etc.• Data Mining
Preprocessing techniques Clustering, Time sequences, Classification,
Value Prediction, Association, Sequential Patterns
Data Mining Agorithms• Association – a priori• Association – fp growth• Decision Trees – ID3
8
9
DON’T FORGET TO BRING A
CALCULATOR!!!