해외 저널리즘 스쿨 운영 현황 연구download.kpf.or.kr/MediaPds/HOBYSFTTIHCJAWI.pdf · 프랑스 저널리즘 교육기관의 구분 65 프랑스의
Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer...
Transcript of Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer...
![Page 1: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/1.jpg)
hci+d lab.
Joonhwan Leehuman-computer interaction + design lab.
Week 04 • 데이터 저널리즘
Data Processing
![Page 2: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/2.jpg)
hci+d lab.
• Data Processing Process• CSV import• Fix Data Type• Understand Data through Exploration• Data Filtering• Add Key(Column) to the Data
오늘 다룰 내용
![Page 3: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/3.jpg)
hci+d lab.
Data Processing
![Page 4: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/4.jpg)
hci+d lab.
Data Analysis Process
!4
Question Wrangling Explore Predict Communication
![Page 5: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/5.jpg)
hci+d lab.
Data Analysis Process
✦ Question Phase✦ Characteristics of students who finish MOOC lectures
✦ Age and gender distribution of people who spend money in Gangnam area
!5
![Page 6: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/6.jpg)
hci+d lab.
Data Analysis Process
✦ Wrangling Phase✦ Data acquisition - where to get data to answer the
questions
✦ Data cleaning - (in most case) data need to be cleaned - we spend most of our time for this…(80~90%)
!6
![Page 7: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/7.jpg)
hci+d lab.
Data Analysis Process
✦ Explore Phase✦ Build intuition by exploratory data analysis
✦ information visualization
✦ find patterns
!7
![Page 8: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/8.jpg)
hci+d lab.
Data Analysis Process
✦ Prediction Phase✦ Predict results of out question
✦ eg. Age and gender distribution of people who spend money in Gangnam area => According to our data analysis, 20-30 women spend more money in this area. => marketing insights
✦ Usually requires statistics or machine learning
!8
![Page 9: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/9.jpg)
hci+d lab.
Data Analysis Process
✦ Communication Phase✦ Data Journalisms
✦ Blog Posts
✦ Data Visualizations
✦ Papers
!9
![Page 10: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/10.jpg)
hci+d lab.
Data Analysis Process
!10
Question Wrangling Explore Predict Communication
![Page 11: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/11.jpg)
hci+d lab.
Data Acquisition
✦ Downloading files ✦ Accessing an API✦ Scraping a web page
!11
➝ will do these later
![Page 12: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/12.jpg)
hci+d lab.
Data Format
✦ CSV: Comma Separated Values✦ data column separated by comma
✦ text file format (xls is binary format) ➝ can read from text editors
!12
![Page 13: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/13.jpg)
hci+d lab.
Data Format
✦ CSV: Comma Separated Values
!13
![Page 14: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/14.jpg)
hci+d lab.
What are we going to do today?
✦ CSV import✦ Fix Data Type✦ Understand Data through Exploration✦ Data Filtering✦ Add Key(Column) to the Data
!14
![Page 15: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/15.jpg)
hci+d lab.
Data & Code
✦ Modified from “Introduction to Data Analysis” course at Udacity.
✦ Using their login data.✦ Data description included.
!15
![Page 16: Week 0 Data Processing - hcid-courses.github.io · hci+d lab. Joonhwan Lee human-computer interaction + design lab. Week 04 • 데이터 저널리즘 Data Processing](https://reader034.fdocuments.net/reader034/viewer/2022042220/5ec63b0f8e0f702c25347117/html5/thumbnails/16.jpg)
hci+d lab.
Questions?