SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be Enough: How To Avoid...

Knowing ranking factors won’t be enough

How to avoid losing your job to a robot

@willcritchlow

I’m going to tell you about a robot that understands ranking factors

better than any of you

...but before I get to that, let’s look at a bit of history...

The other day I searched:

Unsurprisingly, I got an answer

But it got me thinking about how, in 2009, the results would have looked more like this.

In 2009, it would have looked more like this.

With every title containing the keyphrase.

In 2009, it would have looked more like this.

With every title containing the keyphrase.

Most at the beginning.

OK. Maybe wikipedia would have been #1.

We used to have a pretty good understanding of ranking factors

My mental model for c. 2009 ranking factors had three different modes:

My mental model for ~2009 ranking factors had three different modes:

One in the hyper-competitive

One in the

competitive

mid-tail

...and o

ne in th

long-t

Tons of perfectly on-topic pages to choose from

So pick only perfectly-on-topic pages

...and rank by authority (*)

(*) Page authority, but the domain inevitably factors into that calculation. This is why

so many homepages ranked

This resulted in a mix of homepages of mid-size sites, and inner pages on huge sites

But the general way to move up was through increased authority

Kind of search result

Pages ranking To move up...

Head Homepages of mid-size sites and inner pages of massive sites. All perfectly-targeted.

Improve authority.

Mid-tail

Long-tail

One in the

competitive

mid-tail

Wealth of ROUGHLY on-topic pages to choose from

One in the

competitive

mid-tail

PERFECTLY on-topic could do well even on a relatively weak site

One in the

competitive

mid-tail

Rank the roughly on-topic pages by

authority x “on-topicness”

One in the

competitive

mid-tail

Move up with better targeting or more authority

One in the

competitive

mid-tail

Improve authority.

Mid-tail Perfectly on-topic pages on relatively weak sites plus roughly on-topic on bigger sites.

Improve targeting or authority.

Long-tail

One in the

competitive

mid-tail

...and o

ne in th

long-t

In the long-tail, a site of arbitrary weakness could rank if it was the most relevant

...and o

ne in th

long-t

Otherwise, massive sites rank with off-topic pages that mention something similar

...and o

ne in th

long-t

Generally, move up with better targeting

...and o

ne in th

long-t

Improve authority.

Long-tail Arbitrarily-weak on-topic pages and roughly-targeted deep pages on massive sites.

Improve targeting.

Improve authority.

Long-tail Arbitrarily-weak on-topic pages and roughly-targeted deep pages on massive sites.

Improve targeting.

So that was

It’s not so simple any more.Google is harder to understand these days.

PageRank(the first algorithm to use the link structure

of the web)

We know how we got to ~2009...

Information retrieval

PageRank

PageRankOriginal research

TWEAKS

...with growing complexity in subsequent years

When Amit left Google, there was a fascinating thread on Hacker News in discussion of this article

Particularly this comment from a user called Kevin Lacker (@lacker):

I was thinking about it like it was a math puzzle and if I just thought

really hard it would all make sense.

-- Kevin Lacker (@lacker)

Hey why don't you take the square root?

-- Amit Singhal according to Kevin Lacker (@lacker)

oh... am I allowed to write code that doesn't make any sense?

-- Kevin Lacker (@lacker)

Multiply by 2 if it helps, add 5, whatever, just make things work and we can make it make sense

later.

-- Amit Singhal according to Kevin Lacker (@lacker)

Why does this make the algorithm so hard to understand?

High-dimension

Non-linear

Discontinuous

3 big reasons:

High-dimension

Non-linear

Discontinuous

High-dimension

Non-linear

Discontinuous

High-dimension

Non-linear

Discontinuous

You might know what any one of the levers does, but they can

interact with each other in complex ways

This is what a high-dimensional function looks like

High-dimension

Non-linear

Discontinuous

We sell custom cigar humidors. Our custom cigar humidors are handmade. If you’re thinking of buying a custom cigar

humidor, please contact our custom cigar humidor specialists at

custom.cigar.humidors@example.com

What this needs is another mention of [cigar humidors]

With no mentions of [cigar] or [humidor] this page would be unlikely to rank

And yet you can clearly go too far, and have the effect turn negative.

This is called nonlinearity.

The cigar example is taken directly from Google’s quality guidelines.

High-dimension

Non-linear

Discontinuous

Discontinuities are steps in the function

Think about so-called “over-optimization” tipping points

Let’s put all this togetherinto a practical example:

Think about category pages:Do you recommend removing “SEO text”?

We’ve tested it, so we know the answer.

If you said “yes”, congratulations(+3.1% organic sessions in a split-test)

Unless you’re responsible for this siteNo effect / possible negative effect

No, but I’m still pretty good at this

You’re thinking this to yourself right now.

I promised to tell you about a robot that is better than even

experienced SEOs...

Well. It turns out all we needed was a coin to flip. You’re all fired.

It’s only going to get worse under Sundar Pichai

Who knows who this is?(This is the only CC-licensed photo of him on the internet)

ENHANCEWhat about now?

John Giannandrea - Google’s head of searchSundar’s choice to lead search after Amit. Previously running machine learning.

...and of course Jeff Dean is doing Jeff Dean things(c.f. Chuck Norris)

Jeff Dean puts his pants on one leg at a time, but if he had more legs,

you would see that his approach is O(log n).

Source: Jeff Dean facts

Once, in early 2002, when the search back-ends went down, Jeff

Dean answered user queries manually for two hours.

Result quality improved markedly during this time

When Jeff Dean goes on vacation, production services across Google mysteriously stop working within a

few days.

This was reportedly actually true

The original Google Translate was the result of the work of hundreds of engineers over 10 years.

Director of Translate, Macduff Hughes said that it sounded to him as if maybe they could pull off a neural-network-based replacement in three years.

Jeff Dean said “we can do it by the end of the year, if we put our minds to it”.

Hughes: “I’m not going to be the one to say Jeff Dean can’t deliver speed.”

A month later, the work of a team of 3 engineers was tested against the existing system. The improvement was roughly equivalent to the improvement of the old system over the previous 10 years.

Hughes sent his team an email. All projects on the old system were to be suspended immediately.

[Read the whole story ]

Background reading: (backchannel, bloomberg)

How to avoid losing your job to a robot

This is what you promised, Will.

Let’s start by understanding

some robot weaknesses

What’s this?

Ooh. Ooh.

I know this one.

-- robot

“It’s a leopard. I’m like 99% sure.”

Computers are better than humans at classification, but struggle with adversaries

Read more about this here -- Cheetah, Leopard, Jaguar

Lesson:

We expect adversarial abilities to take a step backwards

They will remain good at classifying bad links but will be likely to fall

prey to weird outcomes in adversarial situations

Example:

Remember Tay, the Microsoft chatbot that Twitter taught to be

racist and sexist in less than a day?

SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be Enough: How To Avoid...

Marketing

Transcript of SearchLove San Diego 2017 | Will Critchlow | Knowing Ranking Factors Won't Be Enough: How To Avoid...

Halloween by Sue Critchlow - pagbnews.co.uk

Will Critchlow - The Future of Link Building

Rock your Brand - Melanie Spring - SearchLove 2014

SearchLove London 2011

Searchlove Day 2 roundup presentation

Tom Critchlow - Universal Search Optimisation

Critchlow Arnold DWI police report

SearchLove Boston 2013_Wil Reynolds_How we Get Unstuck

Searchlove 2013 - Getting Unstuck

SearchLove Boston 2013_Ross Hudgens_Actionable Content Marketing Tips

We Won't Forget, We Won't Give up

SearchLove Boston 2013_Kate MorrisInternational seo

Will Critchlow - The Future of Search

Data Feed SEO for Affiliates by Will Critchlow

Will Critchlow The Inbounder

Pers Soc Psychol Bull Critchlow 258 74

SearchLove London | Hannah Smith, Existential Crisis Management

Existential Crisis Management - SearchLove 2014

2016 Searchlove Frameworks for Content Success

I Won't Say I Will But I Won't Say I Won't