If youre-not-writing-content-for-machines-print

120
Write Content for Machines or Leave Money on the Table

Transcript of If youre-not-writing-content-for-machines-print

Write Content for Machinesor Leave Money on the Table

©2017

Did you know that when you see this?...

©2017

Machines see this?...

JPEG Text Text Text Text Text Text

JPEG

JPEGJPEGJPEGJPEG

JPEGJPEGJPEG

Big Headline

Smaller Headline Smaller HeadlineSmaller Headline

Smaller Headline

Even Smaller Headline Even Smaller HeadlineEven Smaller Headline

Smaller Headline

Even Smaller Headline Even Smaller Headline Even Smaller Headline Even Smaller HeadlineWidget

©2017

MACHINES??

©2017

Machines!

©2017

Machines!

©2017

Machines!

…and the computer systemsthat run them!

©2017

Machines!

©2017

A VAGUE, BUT EXCITING IDEA

©2017

1989

March

©2017

1989

March

This is for everyone!

©2017

HTML Documents

©2017

Image Documents

©2017

Other Documents

©2017

Web of Documents

By mid-1994, there were 2,738 websites.

©2017

How Search Used to Work

©2017

How Search Used to Work• Users of the Web would use comma separated

keywords, and would employ quotation marks, and symbols like * and + to “trick” the search engines into delivering the results we needed.

• Search engines would match strings and return (many) pages of links for us to cull through.

Because… Keywords!

©2017

Keyword-only Approach is Problematic

mole

©2017

mole, n.

©2017

©2017

That. Doesn’t. Scale.

http://www.internetlivestats.com/total-number-of-websites/

©2017

THE WEB AND SEARCH HAVE CHANGED

©2017

How We Search Has Changed

Today…

• We ask questions and search engines answer.

©2017

©2017

©2017

©2017

WHERE WE SEARCH HAS CHANGED

©2017

We Search In-Page

©2017

We Search In-Site

©2017

We Search In-Network

©2017

We Search In-Network

©2017

We Search Within Search Results

©2017

We Ask Questions Verbally

©2017

We Ask Questions Verbally

The movie starts at 4:00.

©2017

SO, SEARCH IS GETTING BETTER

©2017

TheBig

But

©2017

©2017

©2017

Still true that when you see this...

©2017

Machines (including search engines) see this...

JPEG Text Text Text Text Text Text

JPEG

JPEGJPEGJPEGJPEG

JPEGJPEGJPEG

Big Headline

Smaller Headline Smaller HeadlineSmaller Headline

Smaller Headline

Even Smaller Headline Even Smaller HeadlineEven Smaller Headline

Smaller Headline

Even Smaller Headline Even Smaller Headline Even Smaller Headline Even Smaller HeadlineWidget

©2017

We Need to Add Some Extra Code

• to connect DATA

• to make information interpretable by machines

©2017

MACHINE INTERPRETATION

©2017

Web 1.0 – Linking Documents

The Web does NOThave “versions!!”

©2017

Web 1.0 – Linking Documents

©2017

Web 1.0

©2017

Web 1.0

“I see: characters + formatting + images”--my Computer

©2017

Documents connected by…

URI Links!!

http://www.example.com/document.filetype

©2017

Web 1.0 – Linking DocumentsWeb 2.0 – Linking People

©2017

Linking People• Our Profiles

• Our Likes

• Our Dislikes

• Our Interests

• Our Friends, Followers, Connections, and Communities

• Our Opinions, Thoughts, Comments, and Reviews

All connected by…

©2017

URI Links!!

https://www.linkedin.com/in/ericfranzon

©2017

Web 2.0

“I see: characters + formatting + images”--my Computer

©2017

Web 2.0

“I see: characters + formatting + images”--my Computer

©2017

It’s hard to interpret meaning when all you see are characters,

images, and formatting.

Context is critical.

©2017

Web 1.0 – Linking DocumentsWeb 2.0 – Linking PeopleWeb 3.0 – Linking Data

©2017

Web 3.0 – Linking DataAlbum Title

Price (in USD)

Media Format

AlbumCover

Band Name

©2017

Web 3.0 – Linking DataAlbum Title

Price (in USD)

Media Format

AlbumCover

Band Name“I see: things + relationships. This is about a collection of music.”

©2017

Data connected by…

URI Links!!

http://www.example.com/document.filetype#thing

©2017

CONTENT FOR HUMANSCONTENT FOR MACHINES

©2017

What a webpage without machine-interpretable code looks like…

©2017

©2017

What a webpage with machine-interpretable code looks like…

©2017

©2017

©2017

We’ve seen this dynamic before!

Human Readable

Machine Interpretable

©2017

We’ve seen this dynamic before!

Human Readable

Machine Interpretable

©2017

What Kind of Data Are We Adding?

• Semantic Data – communicate meaning• Structured Data – follow a formal structure• Linked Data – linked by URIs

Smart Data!!

©2017

WHO’S USING THESE WEB STANDARDS?

©2017

• Healthcare / Life Sciences• Financial Services• Manufacturing / Retail• Marketing, Advertising• SEO/SEM• Libraries• Archives• Museums • Governments• Enterprise Software Vendors

Who’s Using Them?

©2017

Who’s Using Them?

©2017

Who’s Using Them?

©2017

Who’s Using Them?

©2017

2010April21

What it Looks Like

©2017

©2017

• Activities• Businesses• Groups• Organizations• People• Places• Products and Entertainment• Websites

OGP is used to Describe…

©2017

Who’s Using Them?

©2017

2011June

2

©2017

What is schema.org?

“…A collection of schemas, i.e., html tags, that webmasters can use to markup their pages in ways recognized by major search providers.”

©2017

e.g. Product Markup

©2017

What it looks like

©2017

Based on a sample of 12 billion web pages:

• ~5 million domains (6% of domains)

• 15 billion entities (i.e. “things”)

• 65 billion semantic statements

• 2.5 billion pages (~21% of pages)

-Reported in an August 2014 SemTechBiz Keynote by R. V. Guha, Google Fellow

Schema.org Adoption

31% of pages (Dec. 2015)

©2017

What is schema.org?

“…A collection of schemas, i.e., html tags, that webmasters can use to markup their pages in ways recognized by major search providers.”

©2017

Connecting THINGS not STRINGS!

©2017

2012May16

©2017

What it looks like

©2017

e.g. TV Episode Markup

©2017

What it looks like

©2017

What it looks like

©2017

What it looks like

©2017

CAN A SMALL BUSINESS BENEFIT?

©2017

LorisLabradors.com

©2017

LorisLabradors.com – Before

©2017

LorisLabradors.com – After

MapContact InformationReviewsSocial Media Profiles

©2017

LorisLabradors.com – After

©2017

PremierConstructionIL.com

©2017

PremierConstructionIL.com – Before

©2017

PremierConstructionIL.com – After

©2017

PremierConstructionIL.com – Projects

©2017

PremierConstructionIL.com – Projects

©2017

PremierConstructionIL.com – Projects

©2017

DATAVERSITY.net – Articles and Events

©2017

DATAVERSITY.net – Articles and Events

©2017

DATAVERSITY.net – Articles and Events

©2017

PITFALLS

©2017

Growing Pains

• Immature tools available

• Lack of understanding/misinformation

• Meaning is difficult to automate

• Vocabularies change.

©2017

• Global companies showing as local

• Old data conflicts

• Entities mismatched to concepts

Feeling the Pain

Incorrect signals are being sent to machines.

©2017

SCHEMA.ORG IS NOT THE ONLY ONE

©2017

Linking Open Data ProjectMay, 2007

©2017 July 2009

©2017

September 2011

©2017

August 2014

©2017

Wikidata is a project of the Wikimedia Foundation: a free, collaborative, multilingual, secondary database, collecting structured data to provide support for Wikipedia, Wikimedia Commons, the other Wikimedia projects, and well beyond that.

©2017

Data from these trusted sourcesis available for you

to use in your applications TODAY.

Data you can LINK to.

©2017

Semantic Data that is machine READABLE.

…and machine INTERPRETABLE!

©2017

FUTURE PROOFING

©2017

2009Feb18

©2017

Consuming the Data

2009Feb18

©2017

Consuming the Data

2012

May

©2017

Questions? Operators are standing by.

THANK YOU!

[email protected]@EricAxelhttp://linkedin.com/in/ericfranzon

Are youbeing seenin the Web?

©2017

Resourceshttps://flic.kr/p/6krdsMhttps://flic.kr/p/p9jiDKhttps://flic.kr/p/3q8afLhttps://flic.kr/p/brJs4Ghttps://flic.kr/p/78rsTchttps://flic.kr/p/bpSeR2https://flic.kr/p/pQcWQthttps://flic.kr/p/daKwMLhttps://flic.kr/p/8bpMhFhttp://www.flickr.com/photos/dawnmanser/3532853278/http://www.flickr.com/photos/artolog/3983764041/http://www.flickr.com/photos/97964364@N00/59780745/https://flic.kr/p/p1FYTdhttps://www.flickr.com/photos/andrikoolme/32123136165https://www.flickr.com/photos/bjtechnewsphotolibrary/31231185724https://www.flickr.com/photos/corrafig/30320040075/