Semantic Web and Content Strategy

Post on 08-Sep-2014

24.719 views 1 download

Tags:

description

A presentation I gave at the Content Strategy Forum 2010, in Paris. For those who couldn't make it to Paris, I gave this presentation again in Chicago in June, at Web Content 2010. This is the (slightly) updated Chicago version.

Transcript of Semantic Web and Content Strategy

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

THERE’S NO SEMANTIC WEB WITHOUT CONTENT AND DATAWEB CONTENT 2010 – 8 JUNE 2010

RACHEL LOVINGER

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

“Language is magic, and computers are still dumb."

- Aaron Straup Cope (flickr.com)

2

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

BLACKBERRY

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

BLACKBERRY

Photo by enrique dans

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

BLACKBERRY

Photo by Rob MacEwen

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

AGENDA

‣ What is the semantic web?‣ The key ingredients‣ How it’s being used now‣ What it means for Content Strategy

6

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

WHAT IS THE SEMANTIC WEB?

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

TRANSLATE THAT INTO COMPUTER-ESE

The underlying strategy of the Semantic Web is to create data and websites that are “machine-readable.”

If machines comprehend the meaning of data and content, they can: ‣ manipulate data in more meaningful ways‣ provide precisely the information that the user wants

8

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IS THERE A STARBUCKS NEARBY?

9

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

A FRENCH RESTAURANT?

10

© 2010 Razorfish. All rights reserved. Confidential and proprietary.11

GIFT FOR YOUR SUPERHERO NIECE?

?

? ??

??

?

?

Photo by Brendan Riley

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

FIND A HAIR APPOINTMENTSearch for specific criteria:• Highly-rated salon• Near the office• Available time that fits

your busy schedule

12

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SOLVING FOR COMPLEXITY

Machines are good at complex things that people do poorly

• Computing or recalling long strings of numbers• Comparing large sets of data• Searching through millions of pages or data records for a

specific item

13Image by Eric Dobbs

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SOLVING FOR COMPLEXITY

People are good at some complex things that machines don’t handle well

14

Equivalence 6:00pm and 18:00

Lumping similar things 6:00pm and 8:23am

Splitting different things 6:07:10 and 060710

Semantic systems are designed to capture the logic that will allow them to understand these types of relationships within data and use them to create new facts about the data.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

THE KEY INGREDIENTS

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

HOW DO MACHINES KNOW WHAT DATA MEANS?

Identity + Definition + Structure

16

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

17

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDs‣ Machines need a unique, consistent way to identify a thing or concept. ‣ People can usually tell by context, but a machine needs a unique identifier to

be able to make connections or distinctions.

IDENTITY + DEFINITION + STRUCTURE

18

Bill Clinton = President William Jefferson Clinton

President Bush(George H. W.)

President Bush (George W.)

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY: STANDARDS

Standard identifiers

ISBN: International Standard Book Number

ISMN: MusicISAN: Audiovisual works

19

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY: OPEN SOURCE

MusicBrainz: database of music metadata, licensed by BBC to augment web pages

The Police MBID: 9e0e2b01-41db-4008-bd8b-988977d6019a

20

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

OntologyDefine classifications, properties, relationships, and logic

Blackberry1 is a type of FruitA Fruit is an Edible Thing

Blackberry2 is a type of Wireless E-mail DeviceA Wireless E-mail Device is a Mobile Electronic Device

Properties of Edible Things:Seasonal – Yes/NoCalories – #Ingredients (optional) – other Edible Things

A Mobile Electronic Device can never be an Edible Thing.

22

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

23

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

Some non-standard ways to express semantics‣ MicroFormats – uses XHTML & HTML markup to embed meaning in a webpage

‣ hCard for contact information‣ hCalendar for events

‣ Machine Tags – definition added to simple user tagging (“folksonomy”)‣ flora:tree=coniferous‣ upcoming:event=81334

24

<span class="vevent"> <span class="summary">This presentation was given</span>on <span class="dtstart">2010-04-16</span>at the Content Strategy Forumin <span class="location">Paris, France</span>.

</span>

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

25

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

26

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

New Web StandardsDeveloped specifically for expressing metadata and metadata relationships‣ Dublin Core – an ISO standard defining 15 common metadata elements‣ RDF – a model for expressing metadata as triples (subject-predicate-object)‣ OWL – adds semantic meaning‣ SKOS – expresses structured controlled vocabularies, taxonomies

27

Subject

Object

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

Blackberry1

Fruit

BerryPie

EdibleThing

Blackberry1

28

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

IDENTITY + DEFINITION + STRUCTURE

Note: Blackberry2 can’t be an ingredient of BerryPie, because it’s not an EdibleThing and all ingredients of EdibleThings must also be EdibleThings

Blackberry1

Fruit

BerryPie

EdibleThing

xyzabc 123In

gred

ient

Of

29

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

LINKED DATA: A DISTRIBUTED APPROACH

A Web of Data

30

Image by Richard Cyganiak and Anja Jentzsch

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

LINKED DATA: A DISTRIBUTED APPROACH

One page per concept ‣ URL is a type of ID‣ “topic pages” – a powerful tool

and reference point‣ high SEO value‣ aggregate content‣ contain related data & IDs

31

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

HOW IT’S BEING USED NOW

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

WALL STREET JOURNAL MOVIE REVIEWS

33

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ENRICHED SEARCH RESULTS

34

Google Rich SnippetsYahoo! SearchMonkey

+

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ENRICHED VANITY SEARCH

35

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

GETGLUE – RATINGS AND RECOMMENDATIONS

36

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

GETGLUE – RATINGS AND RECOMMENDATIONS

37

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

THE NEW YORK TIMES – ALUMNI IN THE NEWS

38

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

BBC MUSIC BETA – ARTISTS PAGES

39

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

BBC PROGRAMME PAGES

40

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

DATA.GOV

“The purpose of Data.gov is to increase public access to high value, machine readable datasets generated by the Executive Branch of the Federal Government.”

41

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

FLUVIEWNational Flu Activity Map – a widget by CDC.gov

42

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

DATA.GOV.UK

43

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

DATA.GOV.UK APPS

Help you find things‣ A post box‣ A school‣ An affordable place to live‣ A job‣ A volunteering opportunity‣ A dentist‣ A pharmacy‣ A bike route‣ A hospital ‣ A parking spot‣ A care home

44

Cyclestreets.net

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

PARKOPEDIA

45

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

DATA.GOV.UK APPS

Get information on ‣ How taxes are spent‣ Technology investments‣ Crime stats‣ The geological makeup of your area‣ Geographical details‣ Local issues ‣ Local government‣ Health‣ Obesity‣ Real Estate

46

‣ Renewable energy projects‣ Planning Alerts‣ Anti-social behavior in the area‣ Hazardous street conditions

fillthathole.org.uk

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ASBOROMETER

47

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

WHAT IT MEANS FOR CONTENT STRATEGY

© 2010 Razorfish. All rights reserved. Confidential and proprietary.49Photo by Jon Higgins

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC CAPABILITIES

Content Strategists should get familiar with these new kinds of tools and services‣ Related Content Services‣ Advanced Media Monitoring‣ Semantic Publishing Tools‣ Semantic Ad Targeting‣ Rich Data Services‣ Machine-Assisted Tagging‣ Semantic SEO

50

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

RELATED CONTENT SERVICES

51

‣ Enhance existing pages‣ Identify key concepts‣ Place assets and information

on the page or link to relevant offsite content

‣ Video, images, user-generated reviews, tweets, Wikipedia entries, etc.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

RELATED CONTENT SERVICES

Example Services

52

Apture Provides additional contextual information in multimedia pop-ups, drawn from places such as Wikipedia, YouTube and Flickr.

Evri Allows readers to browse articles, images, and videos related to the topic of an article or content element, and provides widgets for sidebars, posts and popovers.

Headup Provides contextually relevant material from social networks and web services.

NewsCred Augments content with related stories from 6000 top news sources, as well as topic pages and license-free photos.

Zemanta Suggests related content and pictures that editors can embed in articles or blog posts.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ADVANCED MEDIA MONITORING

‣ Track Twitter, social networks, blogs, discussion boards, content sites

‣ Track a brand, industry, domain or topic

‣ With semantic capabilities:‣ more accurate relevance‣ sentiment analysis

‣ Track ongoing stories and audience reaction

53Screenshot © 2010 Phase 2 Technology

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ADVANCED MEDIA MONITORING

Example Services

54

Imooty Tracks keywords and mentions of a brand, using a simple dashboard or by creating alerts, widgets, or RSS feeds.

Inbenta Follow the topics that people in your business are following.

Lexalytics Scans what’s being said in blogs, tweets and social media to provide sentiment analysis about companies, topics and current events.

Tattler Mines news, websites, blogs, multimedia sites, and social media to find mentions of topics or issues of interest to you.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC PUBLISHING TOOLS

‣ Content management tools that incorporate a wide range of structure and metadata capabilities

‣ Create and publish content encoded with semantic markup and meaningful metadata

‣ Not necessary to understand all the underlying code

‣ Streamlines the publishing process‣ Makes it faster, easier, and

cheaper to bring new content products to market

55Screenshot © 2010 Thomson Reuters

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC PUBLISHING TOOLS

Example Services

56

OpenPublish A version of Drupal with OpenCalais machine assisted tagging and RDFa formatting built in.

Jiglu Insight Finds hidden relationships to other content you’ve published and automatically creates links.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC AD TARGETING

57

‣ Analyzing content pages for message, context, or mood, and inserts relevant ads

‣ Creates highly desirable ad inventory‣ Audience targeting, without the

privacy concerns of behavioral targeting

‣ Brand protection against unfortunate term-matching

An example of non-semantic contextual ads

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC AD TARGETING

Example Services

58

ad pepper Provides ad placement, lead generation and brand protection through semantic analysis of page content and user behavior.

Peer39 Understands the meaning and sentiment of web pages so that ads can be targeted to appropriate audiences, and also protects advertisers from having their campaigns placed on negative or objectionable content. Identifies hot topics on the fly, and quickly adapts to create new “premium” inventory.

Proximic Performs real-time content analysis to accurately target ads, builds user profiles for better audience targeting, and includes brand protection measures.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

RICH DATA SERVICES

59

‣ Enhance content with linked data

‣ Import additional information, assets, services, and user-generated content

‣ Improve SEO‣ Obtain additional data and

content for application development

‣ Data set may already include map to other desirable data and services

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

RICH DATA SERVICES

Example Services

60

Factual An open data platform providing tools to enable anyone to contribute and use sources of structured data.

Freebase An open, semantically enhanced database of information, similar to Wikipedia, but with structured data on millions of topics in dozens of domains.

iGlue A community editable database containing images, video, individuals, institutions, and geographic locations.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

MACHINE-ASSISTED TAGGING

‣ Streamlines the process of tagging content by extracting concepts on a page‣ Suggests a set of consistent tags for each piece of content‣ Content producer approves or rejects each suggested tag

61Screenshot © 2010 Thomson Reuters

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

MACHINE-ASSISTED TAGGING

Example Services

62

OpenCalais Automatically tags people, places, companies, facts and events found in the content.

TextWise Generates weighted, relevant metadata based on key concepts found in the text of a document or web page.

Tagaroo An OpenCalais plug-in for WordPress.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC SEO

‣ Adds semantic markup to the content, or validates existing markup

‣ Submits it to search engines‣ Boosts search rankings‣ Makes pages more accessible for

visually impaired users‣ Displays additional business data,

content, or product information directly in search results

63Screenshot © 2010 Dapper

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

SEMANTIC SEO

Example Services

64

Google Rich Snippets Testing Tool

Tests webpage markup to ensure that Google’s Rich Snippets feature can interpret it correctly.

Inbenta Assists in the creation of content using the terminology of popular search queries.

Semantify(by Dapper)

Provides automated semantic enhancement of a site without changing its pages. Search engines see the site with RDFa tagging embedded in the page.

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

ADDITIONAL RESOURCES

‣ Sindice (http://sindice.com) – The semantic web index‣ SchemaWeb (http://www.schemaweb.info) – A directory of RDF schemas‣ Semantic Universe (http://www.semanticuniverse.com) – Educating the World

About Semantic Technologies and Applications) ‣ Semanticweb.org – A wiki for the semantic community‣ ReadWriteWeb: Semantic Web Archives

(http://www.readwriteweb.com/archives/semantic-web/) – All the Semantic Web articles on this leading information technology blog

‣ LinkedData.org – Resources from across the Linked Data community‣ Nimble: A Razorfish report on publishing in the digital age – Available now

at http://nimble.razorfish.com

65

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

CONCLUSION

‣ Content Strategy will still be needed to help implement and use these tools‣ Related Content Services, Semantic Ad Targeting, Rich Data Services,

Semantic SEO, Taxonomy/Ontology/Controlled Vocabularies‣ Establish business rules‣ Help configure the tools‣ Periodically monitor the results‣ Make adjustments as needed

‣ Advanced Media Monitoring, Semantic Publishing Tools, Machine-Assisted Tagging‣ Ongoing interaction by insightful, skilled users‣ CS might be the primary user‣ CS might train others to get the best results from their use

66

© 2010 Razorfish. All rights reserved. Confidential and proprietary.

QUESTIONS?

Rachel.Lovinger@razorfish.comTwitter: @rlovinger

http://scattergather.razorfish.comhttp://nimble.razorfish.com

Thank you!

67