
Datawarehousing and Analytics: Introduction to Assignments

Holger Schwarz, Universität Stuttgart

Winter Term 2014/2015


Assignment 1

• Data Mining: Discuss options for applying data mining techniques to the data of a given scenario in order to provide relevant information that supports management in decision making.

• OLAP: Provide various SQL statements that return data helping to answer typical OLAP-type information requests. For the SQL statements, you may use the OLAP features of SQL that you learned about in the lecture.
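As an illustration of what an OLAP-type SQL statement can look like, here is a minimal sketch. It assumes a hypothetical `sales` table (the table, its columns and all figures are invented for this example and are not part of the assignment scenario) and uses Python's built-in `sqlite3` module as a stand-in for DB2. SQLite has no `GROUP BY ROLLUP`, so the roll-up is emulated with `UNION ALL`; the window function requires SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical sales fact table; names and figures are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, year INTEGER, revenue INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("North", 2013, 100), ("North", 2014, 150),
     ("South", 2013, 80), ("South", 2014, 120)],
)

# Roll-up: revenue per (region, year), a subtotal per region, and a grand total.
# A database with ROLLUP support could express this as
# GROUP BY ROLLUP(region, year); here the three levels are combined manually.
rollup = conn.execute("""
    SELECT region, year, SUM(revenue) FROM sales GROUP BY region, year
    UNION ALL
    SELECT region, NULL, SUM(revenue) FROM sales GROUP BY region
    UNION ALL
    SELECT NULL, NULL, SUM(revenue) FROM sales
""").fetchall()

# Window function: rank the regions by revenue within each year.
ranking = conn.execute("""
    SELECT year, region, revenue,
           RANK() OVER (PARTITION BY year ORDER BY revenue DESC) AS rnk
    FROM sales
    ORDER BY year, rnk
""").fetchall()

for row in ranking:
    print(row)
```

The roll-up answers "how much revenue at each level of aggregation", while the ranking answers "which region leads in each year"; both are typical shapes of OLAP information requests.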


Rules for Assignment 1

• Work on this assignment in teams of four students.
• Prepare a result document (PDF) with your solutions for all the tasks.
• Send this document to [email protected] no later than October 31, 10:00 am. Your email must also include your name, your team number, your matriculation number and your study programme.
• Contact Holger Schwarz with any further questions.


Assignment 2: Goals

• Hands-on experience with tools supporting data warehouse processes
• Focus on analytics
• Set of prepared and guided exercises (a document describes what to do)
• Additional tasks extending the prepared exercises

• Each team will have access to a virtual machine with IBM InfoSphere Warehouse and Cognos installed.
• You will run through a tutorial that explains some basic tasks of using these tools:
– Explore data and data models
– Prepare, deploy and visualize data cubes
– Create mining flows
– Prepare a customer segmentation
– Prepare a revenue forecast
• Extend the reports and data mining models of the tutorial.


Rules for Assignment 2

• You participate in this hands-on lab in teams of four students.
• Each team gets its own virtual machine for this exercise.
• Each team has to complete all tasks by December 1st. Please send a PDF to [email protected] covering the following:
– Names, study programmes and matriculation numbers of all team members
– For each exercise, the names of the team members who are ready to present the results
– Specific difficulties with the hands-on lab. Are there any aspects where the description does not match what you found on the server?
• Each team has to make an appointment with Holger Schwarz to present the results (15 to 30 minutes per team).
• Each group member has to actively participate in this result presentation. You will be asked to present selected results.
• Result presentations are scheduled for the end of November and December. Appointments will be organized by e-mail/Doodle.


InfoSphere Warehouse Architecture

[Figure: InfoSphere Warehouse architecture with the components DB2 and Cognos BI.]

• Admin Console: control and monitor flows and cubes on the production system; manages deployment; workload management
• Design Studio: design, develop and optimize warehouse and analytics; deployment
• Workflow steps shown in the figure, labeled exercise 1, exercise 2, exercise 3 and exercise 5: explore data model, deploy data model, import data, run report, dimensional data modeling, import cube model, deploy cube model, prepare customer segmentation, deploy segmentation model, display segmentation, create mining flow, deploy and run mining flow, display mining results (forecast)

Lab Overview

• Exercise 1: Using the InfoSphere Warehouse Model Pack for Customer Insight to deploy a basic analytical schema with appropriate Cognos reports

– Import the physical data model
– Work with Data Architect tooling (part of Design Studio)
– Import predefined Cognos reports


• Exercise 2: Multidimensional modeling with InfoSphere Warehouse and OLAP analysis using Cognos

– Data Architect
– SQL Warehousing
– Cubing Services
– Cognos Reporting


• Exercise 3: Descriptively prepare your data and perform customer segmentation using InfoSphere Warehouse

– Descriptive Data Preparation
– Data Mining Solution Plan for Customer Segmentation
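To give a feel for what customer segmentation does, here is a minimal sketch using plain k-means clustering. The slides do not say which clustering algorithm InfoSphere Warehouse applies, so k-means is an assumption chosen for illustration; the customer data (age, yearly spend) and the function name `kmeans` are likewise hypothetical.

```python
import math

def kmeans(points, initial_centroids, iterations=10):
    """Plain k-means: assign points to the nearest centroid, then recompute."""
    centroids = list(initial_centroids)  # work on a copy of the start values
    assignment = []
    for _ in range(iterations):
        # Assignment step: index of the closest centroid for every point.
        assignment = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its assigned points.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return assignment, centroids

# Hypothetical customers as (age, yearly spend) pairs.
customers = [(25, 20), (27, 22), (30, 25), (55, 80), (60, 85), (58, 90)]
segments, centers = kmeans(customers, [customers[0], customers[3]])
print(segments)
print(centers)
```

Each customer ends up with a segment index, and each segment is summarized by its center, which is the kind of result a segmentation report would then display.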


• Exercise 5: Revenue Forecasting with InfoSphere Warehouse
– Data Mining Preparation
– Time Series Forecasting
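The idea behind a revenue forecast can be sketched with a simple linear trend fitted by least squares. This is an assumption made for illustration only: the slides do not state which time series algorithm the tool uses, and the function name `linear_trend_forecast` and the revenue figures are hypothetical.

```python
def linear_trend_forecast(series, horizon):
    """Fit y = a + b*t by least squares and extrapolate `horizon` steps ahead."""
    n = len(series)
    t_mean = (n - 1) / 2          # mean of the time indices 0 .. n-1
    y_mean = sum(series) / n
    slope = sum(
        (t - t_mean) * (y - y_mean) for t, y in enumerate(series)
    ) / sum((t - t_mean) ** 2 for t in range(n))
    intercept = y_mean - slope * t_mean
    # Forecast the periods that follow the observed series.
    return [intercept + slope * (n + h) for h in range(horizon)]

# Hypothetical quarterly revenue figures.
revenue = [100.0, 110.0, 120.0, 130.0]
forecast = linear_trend_forecast(revenue, 2)
print(forecast)
```

A linear trend ignores seasonality, so it is only a baseline; dedicated time series methods typically model seasonal patterns as well.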


Documents

• assignment2_v01.pdf (available in approx. 10 days)
– Describes details on how to access your VM
– Each group has its private VM with all the necessary software installed.
– To access your server …
  - You may use any computer/laptop connected to the network of the faculty of computer science (Universitätsstraße 38).
  - See the PDF for details on the requirements
• HandsOnLab.pdf
– Describes five exercises
– Each team has to complete exercises 1, 2, 3 and 5


How to organize the teams?

• Go to: http://goo.gl/vp14sR
• Download the file template.txt
• Fill in the data for your team of (exactly!) three students
• Teams of two students might be combined (depending on the overall number of participants)
• Upload the file as Lastname1Lastname2Lastname3.txt
• Data must be available on Monday, October 6, 2014, 10 am
• Change requests are possible until Thursday, October 9, 2014, 8 am


Template.txt:

Team member 1:
last name:
first name:
study programme:
email:

Team member 2:
last name:
first name:
study programme:
email:

Team member 3:
last name:
first name:
study programme:
email:
…
Comments:
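The naming rule and the template fields can be sketched as a small helper. This is only an illustration: the function names, the dictionary keys and the example students are hypothetical, the exact layout of the real template.txt may differ, and whatever the "…" in the template stands for is not covered.

```python
def team_filename(last_names):
    """Build the upload name Lastname1Lastname2Lastname3.txt."""
    if len(last_names) != 3:
        raise ValueError("a team consists of exactly three students")
    return "".join(last_names) + ".txt"

def render_template(members, comments=""):
    """Fill in the per-member fields listed in the template."""
    lines = []
    for i, m in enumerate(members, start=1):
        lines += [
            f"Team member {i}:",
            f"last name: {m['last_name']}",
            f"first name: {m['first_name']}",
            f"study programme: {m['study_programme']}",
            f"email: {m['email']}",
            "",
        ]
    lines.append(f"Comments: {comments}")
    return "\n".join(lines)

# Hypothetical team used only to demonstrate the helpers.
team = [
    {"last_name": "Meier", "first_name": "Anna",
     "study_programme": "INFOTECH", "email": "anna@example.org"},
    {"last_name": "Schmidt", "first_name": "Ben",
     "study_programme": "INFOTECH", "email": "ben@example.org"},
    {"last_name": "Weber", "first_name": "Clara",
     "study_programme": "INFOTECH", "email": "clara@example.org"},
]
filename = team_filename([m["last_name"] for m in team])
content = render_template(team)
print(filename)
```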