Analysing Public Science Debates through Blogs and Online News Sources

20
Analysing Public Science Debates through Blogs and Online News Sources Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK

description

Analysing Public Science Debates through Blogs and Online News Sources. Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton, UK. Contents. Background Blogs Oline news sources RSS Tracking public science debates Detecting public science debates. Background. - PowerPoint PPT Presentation

Transcript of Analysing Public Science Debates through Blogs and Online News Sources

Page 1: Analysing Public Science Debates through Blogs and Online News Sources

Analysing Public Science Debates through Blogs and Online News Sources

Mike ThelwallStatistical Cybermetrics Research GroupUniversity of Wolverhampton, UK

Page 2: Analysing Public Science Debates through Blogs and Online News Sources

Contents

Background Blogs Oline news sources RSS

Tracking public science debatesDetecting public science debates

Page 3: Analysing Public Science Debates through Blogs and Online News Sources

Background

Blogs, public opinion, online news, RSS

Page 4: Analysing Public Science Debates through Blogs and Online News Sources

Background

There are millions of bloggersBloggers are almost normal human beingsAutomatically tracking bloggers’ postings may give insights into public opinion

Page 5: Analysing Public Science Debates through Blogs and Online News Sources

Blog tracking companies

IBM WebFountain

Intelliseek BlogPulse “Monitor, measure and leverage

consumer-generated media”

Others growing…

Page 6: Analysing Public Science Debates through Blogs and Online News Sources

RSS Format

Rich Site Syndication/Really Simple Syndication XML technology Used for frequently updated information

sources (blogs, news, academic journals)

RSS Readers Users subscribe to the RSS feeds of

favourite blogs/sites/journals/searches Notified when updates available User-controlled ‘push’ technology

Page 7: Analysing Public Science Debates through Blogs and Online News Sources

Tracking Public Science Debates

Page 8: Analysing Public Science Debates through Blogs and Online News Sources

Blog keyword searches

Technorati “Searches weblogs by keyword and for links” Stem cell research

Blogdigger stem cell research

IceRocket Allows Advanced searches Allows genuine date range search (Google only

allows “last updated” date range searches)

Page 9: Analysing Public Science Debates through Blogs and Online News Sources

Track evolution over time

What is changing about interest in Stem cell research/GM food?Are experts good at identifying changes in public interest?How can experts be sure/can they be supported with quantitative information?Can blogs be used to generate time series reflecting changes in “public interest”?

Page 10: Analysing Public Science Debates through Blogs and Online News Sources

Free science debate graphs

Solves the trend identification problem?Blogpulse Offers free automatic blog searches and keyword-generated click-search graphs Stem cell research GM food Mobile phone radiation

Page 11: Analysing Public Science Debates through Blogs and Online News Sources

Research graphs

Time-consuming to collect dataGive control over the data source

Page 12: Analysing Public Science Debates through Blogs and Online News Sources

Detecting Public Science Debates

Page 13: Analysing Public Science Debates through Blogs and Online News Sources

How to detect a new debate?

Heuristic methods E.g. Read papers, scan relevant blogs

Automatic methods E.g. look for sudden increase in usage

of science-related words in blogs?

Page 14: Analysing Public Science Debates through Blogs and Online News Sources

Free hot topic searches

Blog keyword search (sort by date) Technorati “Searches weblogs by keyword and for

links” Stem cell research

Blogdigger blog search

Hot topic searches Blogdex – top contagious information Bloglines – today’s hot topics (most popular links)

Searches find the really big science debates?

Page 15: Analysing Public Science Debates through Blogs and Online News Sources

Specialist research tools

Commercial software Intelliseek/IBM

Mozdeh RSS monitor Generates sub-collections Generates word time series Allows keyword searches Identifies hot topics

Page 16: Analysing Public Science Debates through Blogs and Online News Sources

Mozdeh Science Concern Corpus

A collection of blog postings containing a fear word AND a science wordTrend detection used to identify hot “science fear” topicsData cleaning to remove spamNeed manual scanning of list of words experiencing biggest usage increase

Page 17: Analysing Public Science Debates through Blogs and Online News Sources

Science concern hot topics (7%)

0 20 40 60 80

Fear of Science

Information

Progress

Threat Prediction

Other

Duplicate

Temporal Descriptor

Random

Hot science fear words

Page 18: Analysing Public Science Debates through Blogs and Online News Sources

Unexpected results?

Social science research Sudden burst of discussion over fears of the

economic theories of Karl Rove, an influential advisor to George Bush

Computer security Concern over spyware features in a

software vendor’s products Research showing that consumers’ pin

numbers could be revealed by poor printing

Page 19: Analysing Public Science Debates through Blogs and Online News Sources

Conclusions

Many free tools support exploration of Consumer Generated MediaAlso room for specialist research tools

Page 20: Analysing Public Science Debates through Blogs and Online News Sources

References

http://www.blogpulse.com/http://www.blogpulse.com/www2006-workshop/http://www.creen.org/

Thelwall, M., Prabowo, R. & Fairclough, R. (2006, to appear). Are raw RSS feeds suitable for broad issue scanning? A science concern case study. Journal of the American Society for Information Science and Technology.