A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

41
#pubcon A Technical Look at Content Presented by: Patrick Stox @patrickstox

Transcript of A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

Page 1: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

A Technical Look at Content

Presented by:Patrick Stox@patrickstox

Page 2: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Normal On-Page SEO• Title tag• Meta Description• Canonical• Header Tags• Image name and alt attributes• Keyword in URL• Speed

• HTTPS• Pagination• HREFLANG• Mobile Friendly• Content visible• Internal links• Indexable

Page 3: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

It’s All Been Done Before Right?

Page 4: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Query IntentWhat’s the query trying to address?

Page 5: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

We’ve All Seen This• Informational• Navigational• Transactional

Page 6: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Google’s Quality Raters Guidelines Has

• Know query, some of which are Know Simple queries

• Do query, some of which are Device Action queries• Website query, when the user is looking for a

specific website or webpage• Visit-in-person query, some of which are looking

for a specific business or organization, some of which are looking for a category of businesses

Page 7: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Website FeaturesWhat would you expect to see when visiting a website?

Physical Store: Address, Phone #, Hours of operationE-Commerce: Pricing, Reviews, Return Policy, Contact

Some niches have things like certification numbers

Page 8: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

I Need You To Write Quality Content

Page 9: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Is Quality Content?

Page 10: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Google Tells You Things Not To Do• Automatically generated content• Participating in link schemes• Creating pages with little or no original

content• Cloaking• Sneaky redirects• Hidden text or links• Doorway pages• Creating pages with malicious behavior,

such as phishing or installing viruses, trojans or other badware

• Scraped content• Participating in affiliate

programs without adding sufficient value

• Loading pages with irrelevant keywords

• Abusing rich snippets markup

• Sending automated queries to Google

Page 11: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

But Google Is Vague On What To Do• Make pages primarily for users, not for search

engines.• Don’t deceive your users.• Avoid tricks intended to improve search engine

rankings. • Think about what makes your website unique,

valuable or engaging. Make your website stand out from others in your field.

Page 12: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

The Good Practices Listed• Monitoring your site for hacking and removing

hacked content as soon as it appears• Preventing and removing user-generated spam

on your site

Page 14: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Are These?• Topical relevance to the query (“Does it

address the query?”)• Content Quality (as measured by Authority,

Utility, and Presentation), and• Context (“Is the query about a recent topic?”,

“What’s the user’s physical location?” etc…)

Page 15: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Google Has More In Webmaster Academy

• Useful and informative• More valuable and useful than other sites• Credible• High-quality• Engaging

Page 16: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

There’s More!• Readability• Spelling• Grammar• Broken Links• Facts or Incorrect Information

Page 17: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

How Deep Down The Rabbit Hole Do We Want to Go? -> Readability

• Flesch Kincaid Reading Ease• Flesch Kincaid Grade Level• Gunning Fog Score• Coleman Liau Index• Automated Readability Index (ARI)• SMOG (Simple Measure of Gobbledygook)

• Fog Index• Lix formula• Spache Index• Dale-Chall Index• Dale-Chall Grade

Page 18: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

But Wait, There’s More!• Position of content. Hidden/visible, font size,

styling• Who the author is• What website the content is on• Duplicate/uniqueness, different take, etc.• Semantically related

Page 19: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Looking At Content Is The Fun Part• Keyword density - times keyword appears on

page / total words on page, expressed as %• LSI (Latent Semantic Indexing) - looks for

closely related words, synonyms, variants

Page 20: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Sprinkle Some Keywords

Page 21: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Use Any Of The Following As Guides

Page 22: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

LSALatent Semantic Analysis

Bag of words. Count based models.

It finds words mentioned but not really the meaning. So we might see Hogwarts related to Harry Potter, but not see it as a school for higher learning.

Page 23: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

TF-IDFTerm Frequency – Inverse Document

FrequencyFrequency of a term within a document divided by its frequency in the entire corpus

How important a word is in a document or collection of documents.

Page 24: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

WDF*IDFWithin Document Frequency - Inverse Document Frequency

This is basically keyword density 2.0 with a correction value and weighted across a set of documents.

Page 25: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

BM25Like TF-IDF but takes into account document length.

Used by Common Search (building a nonprofit search engine) https://about.commonsearch.org/

Page 26: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

N-gramsUnigram, bigram, trigram, four-gram, five-gram.

Basically co-occurring words and phrases.

Page 27: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Word2VecPredictive instead of count based.

Tries to predict source context-words from the target words. One word predicts a nearby word.

Page 28: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Can You Do With Word2Vec?• Measure the similarity between words or

documents.• Find most similar words to a word or phrase. • Add and subtract words from each other to find

interesting results.• Visualize the relationship between words in a

document.

Page 29: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Word2Vec

Page 30: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Word2Vec

Page 31: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Word2Vec Vector Space

Page 32: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Page 33: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

RankBrain = Word2VecProbably

Page 34: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

It might be more…Doc2vec correlates labels and words, rather than words with other words.

LDA predicts a word from a global context.

Lda2vec tries to build both word and document topics.

Page 35: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Else Can We Look At?

Page 36: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Concepts And EntitiesUsed for understanding and context.

Page 37: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Autosuggested PhrasesShows what other people are searching for around a topic.

Page 38: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Other Terms Top Pages Rank For

Shows what it says.

Page 39: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

What Questions Are People Asking?

Page 40: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Remember That These Are All Guides, Not Absolutes!

Page 41: A Technical Look at Content - PUBCON SFIMA 2017 - Patrick Stox

#pubcon

Thank You!

Patrick Stox@patrickstox