How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Transcript of How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Page 1: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

@badams

Crawl Optimisation

Barry Adams

Page 2: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Barry Adams

• Doing SEO since 1998

• Founder of Polemic Digital

• Senior Editor at State of Digital

Page 3: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Optimisation

Page 4: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

What is Crawl Optimisation?

Ensuring search engine spiders waste as little time as possible on the wrong URLs, so they spend it crawling the right URLs on your site.

Page 5: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Why is Crawl Optimisation important?

If you waste crawl budget, the right pages are less likely to be crawled & indexed.

Page 6: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Budget

Page 7: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Google’s Crawl Sources

• Site crawl

• XML Sitemaps

• Inbound links

• DNS records

• Domain registrations

• Browsing data

Page 8: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Identifying Crawl Waste

Page 9: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Identifying Crawl Waste

Page 10: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Identifying Crawl Waste

Page 11: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Bogus URLs in XML Sitemap

Page 12: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Optimise XML Sitemaps

• Ensure your sitemap contains final URLs only

• Minimise 301-redirects or other non-200 status codes

• Use multiple sitemaps to identify crawl waste in GSC
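The "final URLs only" check can be sketched in a few lines of Python. This is a minimal illustration, not a tool from the talk: the function names are made up for the example, and `fetch_status` is an assumed callable that returns the HTTP status of a URL without following redirects (in practice it might wrap a HEAD request).

```python
import xml.etree.ElementTree as ET
from typing import Callable, Dict, List

# Namespace used by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(sitemap_xml: str) -> List[str]:
    """Return every <loc> value from a <urlset> sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def non_final_urls(
    sitemap_xml: str, fetch_status: Callable[[str], int]
) -> Dict[str, int]:
    """Map each sitemap URL that does not answer 200 to the status it returned."""
    flagged = {}
    for url in sitemap_urls(sitemap_xml):
        status = fetch_status(url)
        if status != 200:
            flagged[url] = status
    return flagged


# Illustrative data only: a two-URL sitemap and canned status codes.
EXAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://website.com/</loc></url>
  <url><loc>http://website.com/old-page</loc></url>
</urlset>"""
CANNED_STATUSES = {"http://website.com/": 200, "http://website.com/old-page": 301}

print(non_final_urls(EXAMPLE_SITEMAP, CANNED_STATUSES.get))
# prints {'http://website.com/old-page': 301}
```

Any URL the function flags is a candidate for removal from the sitemap or replacement with its final destination.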

Page 13: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Paginated Listings

• Faceted Navigation

http://website.com/jewellery/?page=2&cat=5&color=silver&style=glass&collection=autumn&sort=a&…
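To see why URLs like this one waste crawl budget, it helps to count how many distinct URLs a single listing can generate. The sketch below is plain arithmetic. The facet names come from the example URL, but every number (options per facet, page count, sort orders) is a made-up assumption for illustration.

```python
from math import prod
from typing import Dict


def crawlable_variants(
    facet_options: Dict[str, int], pages: int, sort_orders: int
) -> int:
    """Count the URL variants one listing can produce.

    Each facet is either left unset or set to one of its values,
    so a facet with n values contributes (n + 1) choices.
    """
    facet_combinations = prod(n + 1 for n in facet_options.values())
    return facet_combinations * pages * sort_orders


# Hypothetical figures for one category listing.
facets = {"cat": 10, "color": 8, "style": 6, "collection": 4}
print(crawlable_variants(facets, pages=20, sort_orders=4))
# prints 277200
```

Even with these modest assumed numbers, one category page fans out into hundreds of thousands of crawlable URLs.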

Page 14: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Optimise Paginated Listings

• List more items on a single page

• Implement rel=prev/next pagination link elements

• Block sorting parameters in robots.txt

Disallow: /*?sort=*
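A wildcard rule is easy to get subtly wrong, so it is worth testing against real URLs before deploying it. The sketch below assumes Google-style matching, where `*` matches any run of characters, a trailing `$` anchors the end, and rules match from the start of the path. The helper names are invented for this example. Python's built-in `urllib.robotparser` is not used because it does not evaluate `*` wildcards.

```python
import re


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal text, then let each '*' stand for any run of characters.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_blocked(pattern: str, path_and_query: str) -> bool:
    """True if the pattern matches from the start of the path and query."""
    return robots_pattern_to_regex(pattern).match(path_and_query) is not None


# 'sort' as the first parameter: the rule from the slide matches.
print(is_blocked("/*?sort=*", "/jewellery/?sort=a"))         # prints True
# 'sort' after another parameter: '?sort=' never occurs, so no match.
print(is_blocked("/*?sort=*", "/jewellery/?page=2&sort=a"))  # prints False
# Adding a second rule for '&sort=' covers that case.
print(is_blocked("/*&sort=", "/jewellery/?page=2&sort=a"))   # prints True
```

Under these matching assumptions, `/*?sort=*` only catches URLs where `sort` is the first parameter, so a companion rule such as `Disallow: /*&sort=` may be needed when it appears later in the query string.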

Page 15: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Optimise Faceted Navigation

• Decide which facets have SEO value; build static pages for these

• All other facets: disallow in robots.txt and add ‘rel=nofollow’ to facet links

Page 16: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Internal Site Search Results

Page 17: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Block Internal Site Search Pages

• Block in robots.txt

User-agent: *
Disallow: /SearchResults.aspx
Disallow: /*query=*
Disallow: /*s=*
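Short parameter names make for broad patterns, so it can help to run a sample of site URLs through the rules and see which rule blocks what. The following is a rough sketch under the same Google-style wildcard assumptions (`*` matches any characters, rules match from the start of the path, matching is case-sensitive). The sample URLs and helper names are invented for illustration.

```python
import re
from typing import List

# The three Disallow patterns from the slide.
RULES = ["/SearchResults.aspx", "/*query=*", "/*s=*"]


def rule_matches(rule: str, path_and_query: str) -> bool:
    """Prefix match where each '*' in the rule matches any run of characters."""
    regex = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.match(regex, path_and_query) is not None


def blocking_rules(path_and_query: str) -> List[str]:
    """Return every rule that would block the given path and query."""
    return [rule for rule in RULES if rule_matches(rule, path_and_query)]


for url in [
    "/SearchResults.aspx?q=rings",
    "/search?query=rings",
    "/?s=rings",
    "/jewellery/?items=24",  # not a search page, but contains 's='
    "/jewellery/",
]:
    print(url, blocking_rules(url))
```

In this sample, `/jewellery/?items=24` is blocked by `/*s=*` because `items=` ends in `s=`. If that is not intended, narrower rules such as `/*?s=` and `/*&s=` are one option.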

Page 18: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Internal redirects

Page 19: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Minimise Internal Redirects

• Find redirects with Screaming Frog

• Internal links should all be 200 OK

• Flat site structure
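Once a crawler has exported status codes and redirect targets, finding the internal links to fix is a small data exercise. The sketch below assumes a simplified, hypothetical export format (a dict of URL to status and redirect target, plus a dict of page to outgoing links). It does not reflect the actual export layout of Screaming Frog or any other tool.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical crawl export: URL -> (status code, redirect target or None).
Crawl = Dict[str, Tuple[int, Optional[str]]]

REDIRECT_CODES = (301, 302, 303, 307, 308)


def redirect_chain(start: str, crawl: Crawl, max_hops: int = 10) -> List[str]:
    """Follow redirects from `start`, returning every URL visited in order."""
    chain = [start]
    while len(chain) <= max_hops:
        status, target = crawl.get(chain[-1], (0, None))
        if status not in REDIRECT_CODES or target is None:
            break
        looped = target in chain
        chain.append(target)
        if looped:  # stop as soon as the chain revisits a URL
            break
    return chain


def links_to_fix(
    internal_links: Dict[str, List[str]], crawl: Crawl
) -> List[Tuple[str, str, str]]:
    """List (source page, linked URL, final URL) for each redirecting link."""
    fixes = []
    for source, targets in internal_links.items():
        for linked in targets:
            chain = redirect_chain(linked, crawl)
            if len(chain) > 1:
                fixes.append((source, linked, chain[-1]))
    return fixes


crawl_data: Crawl = {
    "/home": (200, None),
    "/a": (301, "/b"),
    "/b": (301, "/c"),
    "/c": (200, None),
}
print(links_to_fix({"/home": ["/a", "/c"]}, crawl_data))
# prints [('/home', '/a', '/c')]
```

Each tuple says which page to edit, which link to change, and the 200 OK URL it should point to instead.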

Page 20: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Canonicalised Pages

Page 21: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Use Canonicals Wisely

• “rel=canonical” is primarily for index issues

It is not a fix for crawl waste

Search engines need to see the canonical tag before they can act on it

Ergo, pages need to be crawled before rel=canonical has any effect

Ditto with meta noindex tags
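The point that a page must be crawled before its canonical or noindex hint can be seen is visible in code: both hints live inside the HTML, so reading them requires the downloaded page as input. This is a minimal sketch using Python's standard `html.parser`. The class and function names are invented for the example, and it ignores hints sent via HTTP headers.

```python
from html.parser import HTMLParser
from typing import Optional, Tuple


class IndexingHints(HTMLParser):
    """Collect the rel=canonical target and any meta robots noindex."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        rel_values = (attributes.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel_values:
            self.canonical = attributes.get("href")
        if tag == "meta" and (attributes.get("name") or "").lower() == "robots":
            content = (attributes.get("content") or "").lower()
            directives = [part.strip() for part in content.split(",")]
            self.noindex = self.noindex or "noindex" in directives


def indexing_hints(html: str) -> Tuple[Optional[str], bool]:
    """Return (canonical URL or None, noindex flag) for a fetched page."""
    parser = IndexingHints()
    parser.feed(html)  # the HTML must already have been downloaded
    return parser.canonical, parser.noindex


page = (
    '<html><head>'
    '<link rel="canonical" href="http://website.com/jewellery/">'
    '<meta name="robots" content="noindex, follow">'
    '</head><body></body></html>'
)
print(indexing_hints(page))
# prints ('http://website.com/jewellery/', True)
```

A robots.txt rule, by contrast, is evaluated against the URL alone, before any request for the page is made.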

Page 22: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

DON’T use Canonicals for…

• Faceted navigation

• Pagination & sorting

• Site Search pages

Page 23: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

OK to use Canonicals for…

• Separate mobile URLs

• Session-specific URL parameters

• Content syndication

• Unavoidable content duplication

Page 24: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Waste

• Slow loading pages

Page 25: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Optimise Load Speed

• Time to First Byte

• Lightweight pages

• Caching

• Compression
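Time to First Byte can be approximated with the Python standard library alone. The sketch below is a rough measurement, not a replacement for a tool like WebPageTest.org: the function name is made up, the timer includes connection setup, and it stops when the status line and headers have arrived rather than at the literal first byte on the wire.

```python
import http.client
import time
from typing import Tuple
from urllib.parse import urlsplit


def time_to_first_byte(url: str, timeout: float = 10.0) -> Tuple[int, float]:
    """Return (HTTP status, seconds until the response headers arrived)."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        connection = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        connection = http.client.HTTPConnection(parts.netloc, timeout=timeout)

    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query

    start = time.perf_counter()
    try:
        # request() opens the connection, so setup time is part of the result.
        connection.request("GET", target)
        # getresponse() returns once the status line and headers are read.
        response = connection.getresponse()
        elapsed = time.perf_counter() - start
        return response.status, elapsed
    finally:
        connection.close()


# Example call (URL is a placeholder):
# status, seconds = time_to_first_byte("http://website.com/jewellery/")
```

Running it several times and comparing slow templates against fast ones gives a first indication of where server response time is costing crawl budget.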

Page 26: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Crawl Optimisation Summarised

• Don’t let search engines do the hard work

• Tools at your disposal:
– DeepCrawl
– Google Search Console
– Screaming Frog SEO Crawler
– WebPageTest.org

• Solutions:
– XML Sitemaps
– robots.txt
– rel=nofollow
– rel=prev / rel=next
– Load speed

Page 27: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

The End Goal

Page 28: How to Find & Fix Crawl Optimisation Issues - #BrightonSEO

Thank you

@polemicdigital

@badams