Learning Analytics for the Social Media Age · Learning Analytics for the Social Media Age Studying...

65
Learning Analytics for the Social Media Age Studying Online Interaction using Social Network Analysis Anatoliy Gruzd [email protected] @gruzd Canada Research Chair in Social Media Data Stewardship Associate Professor, Ted Rogers School of Management Director, Social Media Lab Ryerson University Open University of Hong Kong July 7, 2016 Twitter: @gruzd ANATOLIY GRUZD 1

Transcript of Learning Analytics for the Social Media Age · Learning Analytics for the Social Media Age Studying...

Learning Analytics for the Social Media Age

Studying Online Interaction using Social Network Analysis

Anatoliy [email protected]@gruzd

Canada Research Chair in Social Media Data Stewardship Associate Professor, Ted Rogers School of ManagementDirector, Social Media LabRyerson University

Open University of Hong Kong

July 7, 2016

Twitter: @gruzd ANATOLIY GRUZD 1

Research at the Social Media Lab • Social Media Analytics

• Social Media Data Stewardship

• Measuring Influence on Social Media

• Online Political Engagement

• Learning Analytics

• Social Media & Health

Learning Objectives 1. Social Media Data Collection

2. Basics of Social Network Analysis

3. Visualization and Analysis of Communication Networks

4. Practice using Netlytic: Studying Collaborative Learning in cMOOC

Twitter: @gruzd ANATOLIY GRUZD 3

Twitter: @gruzd

ANATOLIY GRUZD

Social Media sites have become

an integral part of our daily lives!

Growth of Social Media Data

Facebook

1.5B users

Instagram

400M users

Twitter

300M users

Decision Making

in domains such as Education, Politics, Health Care

Twitter: @gruzd ANATOLIY GRUZD 5

How to Make Sense of Social Media Data?

Self-collected/reported

Public APIs

Data Resellers

How to Make Sense of Social Media Data?Big Data Technology

Twitter: @gruzd ANATOLIY GRUZD 6

Cloud & Distributed Computing

Data & Information Organization

Analytics Visualization

Twitter: @gruzd ANATOLIY GRUZD 7

Social Media Analytics Toolshttp://socialmedialab.ca/apps/social-media-toolkit/

Data -> Visualizations -> Understanding

How to Make Sense of Social Media Data?

Twitter: @gruzd ANATOLIY GRUZD 8

How to Make Sense of Social Media Data?Example: Geo-based Analysis

Twitter: @gruzd ANATOLIY GRUZD 9

Social Network Analysis (SNA)

• Nodes = People

• Edges /Ties (lines) = Relations/

“Who retweeted/ replied/

mentioned whom”

How to Make Sense of Social Media Data?

Twitter: @gruzd ANATOLIY GRUZD 10

Makes it much easier to understand what is going on

in a group

Advantages of

Social Network Analysis

Once the network is discovered, we can find

out:

• How do people interact with each other,

• Who are the most/least active members,

• Who is influential in a group,

• Who is susceptible to being influenced,

etc…

Twitter: @gruzdANATOLIY GRUZD

11

Liberal

ConservativeSpam

Unknown &

Undecided

NDP

Left

Green

Bloc

Other

Gruzd, A. and Roy, J (2014). Political Polarization on Social Media: Do

Birds of a Feather Flock Together on Twitter? Policy & Internet.

Common approach for collecting social network data:

• Self-reported social network data may not be available/accurate

• Surveys or interviews

Problems with surveys or interviews

• Time-consuming

• Questions can be too sensitive

• Answers are subjective or incomplete

• Participant can forget people and

interactions

• Different people perceive events and

relationships differently

How Do We Collect Information About Online Social Networks?

Twitter: @gruzd ANATOLIY GRUZD 12

Studying Online Social Networks

http://www.visualcomplexity.com/vc

Forum networks

Blog networks

Friends’ networks (Facebook,

Twitter, Google+, etc…)

Networks of like-minded people

(YouTube, Flickr, etc…)

Twitter: @gruzd ANATOLIY GRUZD 14

Goal: Automated Networks Discovery

Challenge: Figuring out what content-based features of online interactions can help to uncover nodes and ties between group members

How Do We Collect Information About Online Social Networks?

Twitter: @gruzd ANATOLIY GRUZD 15

Automated Discovery of Social Networks

Emails

Nick

Rick

Dick

• Nodes = People

• Ties = “Who talks to whom”

• Tie strength = The number of

messages exchanged between

individuals

Twitter: @gruzd ANATOLIY GRUZD 16

Automated Discovery of Social Networks

“Many to Many” Communication

ChatMailing listservForum Comments

Twitter: @gruzd ANATOLIY GRUZD 17

@John

@Peter

@Paul • Nodes = People

• Ties = “Who retweeted/

replied/mentioned whom”

• Tie strength = The number of

retweets, replies or mentions

Automated Discovery of Social NetworksTwitter Networks

Twitter: @gruzd ANATOLIY GRUZD 18

Automated Discovery of Social Networks

Twitter Data Examples

Network Ties

@Cheeflo -> @JoeProf@Cheeflo -> @VMosco@JoeProf -> @VMosco

Twitter: @gruzd ANATOLIY GRUZD 19

Network Tie

@Gruzd -> @SidneyEve

Connection type: Mention

Connection type: Reply

Sample Twitter Searches

#ELECTION2016 #HONGKONG

Twitter: @gruzd ANATOLIY GRUZD 20

3557 records (Dec 3, 2015)1394 records (Oct 29, 2015)

Sample Twitter Searches

#ELECTION2016 #HONGKONG

Twitter: @gruzd ANATOLIY GRUZD 21

3557 records (Dec 3, 2015)1394 records (Oct 29, 2015)

Sample Twitter Searches

#ELECTION2016 #HONGKONG

Twitter: @gruzd ANATOLIY GRUZD 22

3557 records (Dec 3, 2015)1394 records (Oct 29, 2015)

What do these visualizations tell us?

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

Macro-level

Density

Diameter

Reciprocity

Centralization

Modularity

ANATOLIY GRUZD 23Twitter: @gruzd

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

ANATOLIY GRUZD 24

In-degree suggests “prestige” highlighting the most mentioned or replied Twitter users

Twitter: @gruzd

In-degree centrality#HongKong Twitter network

Twitter: @gruzd ANATOLIY GRUZD 25

SEVENTEEN or SVT is

a S.Korean boy group formed

by Pledis Entertainment

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

ANATOLIY GRUZD 26

Out-degree reveals active Twitter users with a good awareness of others in the network

Twitter: @gruzd

Out-degree centrality#HongKong Twitter network

Twitter: @gruzd ANATOLIY GRUZD 27

Note: A music fan (many

retweets & replies to others)

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

ANATOLIY GRUZD 28

Betweenness shows actors who are located on the most number of information paths and who often connect different groups of users in the network

Twitter: @gruzd

Betweenness centrality#HongKong Twitter network

Twitter: @gruzd ANATOLIY GRUZD 29

Note: A fan (retweets/replies to messages

from two different fan communities/sites)

Sample Twitter Searches

#ELECTION2016 #HONGKONG

Twitter: @gruzd ANATOLIY GRUZD 30

3557 records (Dec 3, 2015)1394 records (Oct 29, 2015)

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Density indicates the overall connectivity in the network (the total number of connections divided by the total number of possible connections).

It is equal to 1 when everyone is connected to everyone.

ANATOLIY GRUZD 31Twitter: @gruzd

User1 User3

User2Density = 1

#Election2016 #HongKong

Nodes 491 2570

Edges 1075 2447

Density 0.005 (0.5%) 0.0004 (0.04%)

Diameter

Reciprocity

Centralization

Modularity

ANATOLIY GRUZD 32Twitter: @gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Diameter gives a general idea of how “wide” the network is; the longest of the shortest paths between any two nodes in the network.

ANATOLIY GRUZD 33Twitter: @gruzd

#1

User1User3

User2

User4

Diameter = 3

#2

#3

#Election2016 #HongKong

Nodes 491 2570

Edges 1075 2447

Density 0.005 (0.5%) 0.0004 (0.04%)

Diameter 28 14

Reciprocity

Centralization

Modularity

ANATOLIY GRUZD 34Twitter: @gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Reciprocity shows how many online participants are having two-way conversations.

In a scenario when everyone replies to everyone, the reciprocity value will be 1.

ANATOLIY GRUZD 35Twitter: @gruzd

User2

User1User3

User4 Reciprocity=1

#Election2016 #HongKong

Nodes 491 2570

Edges 1075 2447

Density 0.005 (0.5%) 0.0004 (0.04%)

Diameter 28 14

Reciprocity 0.006 (0.6%) 0.003 (0.3%)

Centralization

Modularity

ANATOLIY GRUZD 36Twitter: @gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Centralization indicates whether a network is dominated by few central participants (values are closer to 1),

or whether more people are contributing to discussion and information dissemination (values are closer to 0).

ANATOLIY GRUZD 37Twitter: @gruzd

User2

User1User3

User4 Centralization=1

#Election2016 #HongKong

Nodes 491 2570

Edges 1075 2447

Density 0.005 (0.5%) 0.0004 (0.04%)

Diameter 28 14

Reciprocity 0.006 (0.6%) 0.003 (0.3%)

Centralization 0.05 0.11

Modularity

ANATOLIY GRUZD 38Twitter: @gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Modularity provides an estimate of whether a network consists of one coherent group of participants who are engaged in the same conversation and who are paying attention to each other (values closer to 0);

or whether a network consists of different conversations and communities with a weak overlap (values closer to 1).

ANATOLIY GRUZD 39Twitter: @gruzd

#Election2016 #HongKong

Nodes 491 2570

Edges 1075 2447

Density 0.005 (0.5%) 0.0004 (0.04%)

Diameter 28 14

Reciprocity 0.006 (0.6%) 0.003 (0.3%)

Centralization 0.05 0.11

Modularity 0.42 0.92

ANATOLIY GRUZD 40Twitter: @gruzd

Learning Objectives 1. Social Media Data Collection

2. Basics of Social Network Analysis

3. Visualization and Analysis of Communication Networks

4. Practice using Netlytic: Studying Collaborative Learning in cMOOC

Twitter: @gruzd ANATOLIY GRUZD 41

Practice with Netlytic.orgStudying Collaborative Learning in cMOOC

Tutorial Steps :

https://netlytic.org/home/?p=470

Also see Gruzd, A., Paulin, D., & Haythornthwaite, C. (forthcoming). Analyzing Social Media and Learning Through Content and Social Network Analysis: A Faceted Methodological Approach. Journal of Learning Analytics.

Twitter: @gruzd ANATOLIY GRUZD 42

43

cMOOC Dataset

Athabasca University Course◦ CCK11: Connectivism and Connective Knowledge

Not restricted to any one

platform

“Throughout this ‘course’

participants will use a variety of

technologies, for example, blogs,

Second Life, RSS Readers,

UStream, etc.”

ANATOLIY GRUZD @gruzd

44

CCK11 MOOC Dataset: Sample course page

ANATOLIY GRUZD @gruzd

45

CCK11 MOOC Dataset

#Posts Avg charactercount

Avg word count

Blogs 812 332.3 54.2

DiscussionThreads

68 541.8 90.8

Comments 306 837.6 138.9

Twitter posts 1722 113.9 14.4

ANATOLIY GRUZD @gruzd

Today’s

focus

Question 1 - #LAK14 #pLASMAWhat does this visualization tell us from an instructor’s perspective?

ANATOLIY GRUZD 46@gruzd

What does this visualization tell us from a student’s perspective?

ANATOLIY GRUZD 47@gruzd

What does this visualization not tell us?

ANATOLIY GRUZD 48@gruzd

SNA Measures (recap)Micro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

Macro-level

Density

Diameter

Reciprocity

Centralization

Modularity

ANATOLIY GRUZD 49@gruzd

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

ANATOLIY GRUZD 50

In-degree suggests “prestige” highlighting the most mentioned or replied Twitter users

@gruzd

51

#CCK11 Class Tweets (Node size=“Indegree”)

ANATOLIY GRUZD @gruzd

SNA MeasuresMicro-level

In-degree centrality

Out-degree centrality

Betweenness centrality

Other centrality measures (e.g., closeness, eigenvector)

ANATOLIY GRUZD 52

Out-degree reveals active Twitter users with a good awareness of others in the network

@gruzd

53

#CCK11 Class Tweets (Node size=“Outdegree”)

ANATOLIY GRUZD @gruzd

Top 10 Twitter users in the Twitter network based on centrality measures

ANATOLIY GRUZD 54@gruzd

IN-DEGREE OUT-DEGREE

participant1(m) cck11feeds

participant2(f) participant8(m)

gsiemens web20education

downes participant9(f)

guestLecturer3(m) participant6(m)

web20education participant1(m)

participant4(f) participant10(f)

participant5(m) participant4(f)

participant6(m) participant7(m)

participant7(m) participant11(f)

participant8(m)

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Density indicates the overall connectivity in the network (the total number of connections divided by the total number of possible connections).

It is equal to 1 when everyone is connected to everyone.

ANATOLIY GRUZD 55@gruzd

Student

InstructorStudent

Density = 1

#CCK11 Class #Election2016 #HongKong

Nodes 498 491 2570

Edges 761 1075 2447

Density 0.003 (0.3%) 0.005 (0.5%) 0.0004 (0.04%)

ANATOLIY GRUZD 56@gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Diameter gives a general idea of how “wide” the network is; the longest of the shortest paths between any two nodes in the network.

ANATOLIY GRUZD 57@gruzd

#1

Student A Instructor

Student B

Student C

Diameter = 3

#2

#3

#CCK11 Class #Election2016 #HongKong

Nodes 498 491 2570

Edges 761 1075 2447

Density 0.003 (0.3%) 0.005 (0.5%) 0.0004 (0.04%)

Diameter 38 28 14

ANATOLIY GRUZD 58@gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Reciprocity shows how many online participants are having two-way conversations.

In a scenario when everyone replies to everyone, the reciprocity value will be 1.

ANATOLIY GRUZD 59@gruzd

Student

InstructorStudent

Student Reciprocity=1

#CCK11 Class #Election2016 #HongKong

Nodes 498 491 2570

Edges 761 1075 2447

Density 0.003 (0.3%) 0.005 (0.5%) 0.0004 (0.04%)

Diameter 38 28 14

Reciprocity 0.09 (9%) 0.006 (0.6%) 0.003 (0.3%)

ANATOLIY GRUZD 60@gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Centralization indicates whether a network is dominated by few central participants (values are closer to 1),

or whether more people are contributing to discussion and information dissemination (values are closer to 0).

ANATOLIY GRUZD 61@gruzd

Student

InstructorStudent

Student Centralization=1

#CCK11 Class #Election2016 #HongKong

Nodes 498 491 2570

Edges 761 1075 2447

Density 0.003 (0.3%) 0.005 (0.5%) 0.0004 (0.04%)

Diameter 38 28 14

Reciprocity 0.09 (9%) 0.006 (0.6%) 0.003 (0.3%)

Centralization 0.07 0.05 0.11

ANATOLIY GRUZD 62@gruzd

SNA MeasuresMacro-level

Density

Diameter

Reciprocity

Centralization

Modularity

Modularity provides an estimate of whether a network consists of one coherent group of participants who are engaged in the same conversation and who are paying attention to each other (values closer to 0);

or whether a network consists of different conversations and communities with a weak overlap (values closer to 1).

ANATOLIY GRUZD 63@gruzd

#CCK11 Class #Election2016 #HongKong

Nodes 498 491 2570

Edges 761 1075 2447

Density 0.003 (0.3%) 0.005 (0.5%) 0.0004 (0.04%)

Diameter 38 28 14

Reciprocity 0.09 (9%) 0.006 (0.6%) 0.003 (0.3%)

Centralization 0.07 0.05 0.11

Modularity 0.67 0.42 0.92

ANATOLIY GRUZD 64@gruzd

Learning Objectives 1. Social Media Data Collection

2. Basics of Social Network Analysis

3. Visualization and Analysis of Communication Networks

4. Practice using Netlytic: Studying Collaborative Learning in cMOOC

Twitter: @gruzd ANATOLIY GRUZD 65

Learning Analytics for the Social Media Age

Studying Online Interaction using Social Network Analysis

Anatoliy [email protected]@gruzd

Canada Research Chair in Social Media Data Stewardship Associate Professor, Ted Rogers School of ManagementDirector, Social Media LabRyerson University

Open University of Hong Kong

July 7, 2016

Twitter: @gruzd ANATOLIY GRUZD 66