How to Avoid Shooting Yourself in the SEO

Fifteen Tips to Improve SEO on Sitecore Sites - June 23, 2013

Page 1: How to Avoid Shooting Yourself in the SEO

How to Avoid Shooting Yourself in the SEO

Fifteen Tips to Improve SEO on Sitecore Sites
June 23, 2013

Page 2: How to Avoid Shooting Yourself in the SEO

Sitecore Enterprise Web CMS

• Popular .NET Web Content Management System for mid-to-large organizations

• Provides friendly tools for non-technical Content Authors and Marketers

• Can be customized and extended to meet almost any conceivable business need


Page 3: How to Avoid Shooting Yourself in the SEO

Why SEO is Important

Search Engine Optimization (SEO)
• Getting listed on Search Engine Results Pages (SERPs)
• Getting more traffic from listings on search engines
• Getting your content recognized as the authority
  • What search engines consider most relevant to users
  • Search engines rank authority pages higher

Why Search Engine Optimization is Done
• Search engines need to determine relevance
  • What the page is about
  • How the page would be useful
• SEO helps the engines understand content context

Page 4: How to Avoid Shooting Yourself in the SEO

Things that Hurt SEO

• Signs of a neglected or abandoned site
  • Having too many broken links on a page
  • Having too many HTML errors
  • Missing alt text for images
• Substantially similar content on the same site
• Having a country code top level domain
  • Helps site rank for that particular country
  • Limits the site’s ability to rank globally

Page 5: How to Avoid Shooting Yourself in the SEO

Static URLs are Better for SEO

URLs can be static or dynamic
• Dynamic URLs contain a question mark
• Static - http://example.com/en/content
• Dynamic - http://example.com/content?sclang=en

Use static URLs over dynamic URLs for SEO
• Static URLs typically rank better in search engines
• Search engines are slower to index dynamic pages
• Static URLs also appear more friendly to users
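The static/dynamic distinction above can be checked mechanically. The following is a minimal Python sketch, not part of the original deck; the function name `is_dynamic_url` is hypothetical, and it assumes "dynamic" simply means "has a non-empty query string after the question mark".

```python
from urllib.parse import urlsplit


def is_dynamic_url(url: str) -> bool:
    """Return True when the URL carries a query string (the part after '?')."""
    return bool(urlsplit(url).query)


if __name__ == "__main__":
    for candidate in ("http://example.com/en/content",
                      "http://example.com/content?sclang=en"):
        kind = "dynamic" if is_dynamic_url(candidate) else "static"
        print(f"{candidate} -> {kind}")
```

A check like this could be run over a list of internal links exported from a crawl to find pages that are still linked with query strings.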


Page 6: How to Avoid Shooting Yourself in the SEO

Using Sitecore Wildcard Nodes

Wildcard Node
• A node in the Content Tree that matches all requests for the given level if the item is not found on that level
• URL parameters can be passed without query strings
• Used to create friendly URLs for external content
  • Content is not in the Content Tree
  • Query strings do not have to be used to access it

How to create Wildcard Nodes
• Place a * node below a node
• Node accepts all URLs for items not found at that level
• Parse the URL for the data
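The "parse the URL for the data" step might look roughly like the sketch below. It is written in Python purely for illustration (a Sitecore site would do this in .NET), the function name `wildcard_value` and the `/products` parent path are hypothetical, and it assumes the * node matches exactly one path segment directly under its parent. It does not model the rule that a real item at that level takes precedence over the wildcard.

```python
from typing import Optional
from urllib.parse import unquote, urlsplit


def wildcard_value(url: str, parent_path: str) -> Optional[str]:
    """Return the path segment a * node under parent_path would receive, or None."""
    segments = [s for s in urlsplit(url).path.split("/") if s]
    parent = [s for s in parent_path.split("/") if s]
    # The request must sit under the parent node (compared case-insensitively).
    under_parent = [s.lower() for s in segments[:len(parent)]] == [p.lower() for p in parent]
    # Assumption: the wildcard covers exactly one level below the parent.
    if under_parent and len(segments) == len(parent) + 1:
        return unquote(segments[-1])
    return None


if __name__ == "__main__":
    print(wildcard_value("http://example.com/products/blue-widget", "/products"))
```

The returned segment would then be used as the key for looking up the external content, in place of a query string parameter.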


Page 7: How to Avoid Shooting Yourself in the SEO

Handling URL Parameters for SEO

Google provides a Parameter Handling Tool
• Used to tell Google how to handle URL parameters
• Used to reduce impact of duplicate content on SEO
• Useful when query strings cannot be avoided
• Available on the Webmaster Tools Dashboard

Page 8: How to Avoid Shooting Yourself in the SEO

Why Duplicate Content is Bad

Duplicate content
• Identical or nearly identical content on…
  • Multiple pages of a site
  • Pages on different sites
  • URL variations pointing to the same page

Impact of duplicate content
• Search engines try to index pages with distinct info
• Links to the page contain multiple URLs
• Popularity of the page can get divided across URLs
• Page ranking of all the pages can be reduced
• This can hinder pages' performance in SERPs

Page 9: How to Avoid Shooting Yourself in the SEO

How Google Handles Duplicates

When Google detects duplicate content
• Google groups the duplicate URLs into a cluster
• Google selects one URL to represent the cluster
• Google consolidates the link popularity to that URL

When Google doesn’t detect duplicate content
• Google cannot consolidate the link popularity
• This can dilute the ranking by splitting it across URLs

Page 10: How to Avoid Shooting Yourself in the SEO

How Duplicate Content Occurs

Sitecore Content Tree items
• Content items are copied with little or no editing
• Cloned items are created and used
• Alias items are created and used
• Campaign items are created and used for tracking
• Multiple layouts are used to display the same content
  • Web, print, and mobile layouts, etc.

URLs
• Multiple domain names are bound to the same site
• Links to a page contain formatting variations

Page 11: How to Avoid Shooting Yourself in the SEO

URL Variations

Variations that can occur in links to a page
• Lowercase vs. mixed case
• www vs. no www
• Trailing slash vs. no trailing slash
• Language embedding vs. no embedding (localization)
• URL parameters vs. no parameters (campaign tracking)

URL variations are treated as duplicate content
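To illustrate how several of these variations collapse to one address, here is a small Python sketch. It is only an illustration under stated assumptions: the function name `normalize_url` is hypothetical, `www.example.com` stands in for the preferred host, the preferred form is lowercase with a trailing slash, and all URL parameters are dropped (which is only safe when they are pure tracking parameters). Language embedding is not handled.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str, preferred_host: str = "www.example.com") -> str:
    """Collapse case, www, trailing-slash and parameter variations into one URL."""
    scheme, host, path, _query, _fragment = urlsplit(url)
    host = host.lower()
    # Treat the www and non-www forms of the preferred host as the same site.
    bare = preferred_host[4:] if preferred_host.startswith("www.") else preferred_host
    if host in (bare, "www." + bare):
        host = preferred_host
    # Lowercase the path and force exactly one trailing slash.
    path = path.lower().rstrip("/") + "/"
    # Assumption: query string and fragment carry no content and can be dropped.
    return urlunsplit((scheme, host, path, "", ""))


if __name__ == "__main__":
    print(normalize_url("HTTP://Example.com/About/Team?sc_camp=123"))
```

If two links normalize to the same string but differ as written, they are candidates for the duplicate-content problem described on this slide.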


Page 12: How to Avoid Shooting Yourself in the SEO

What is a Canonical Page?

Canonical pages
• Identify a page as the primary page over others
• Useful when a site contains duplicate content
  • Multiple pages with (nearly) identical content
  • Multiple layouts used for the same content
  • Multiple URLs used to load the same page

Page 13: How to Avoid Shooting Yourself in the SEO

What is a Canonical Tag?

Canonical tags
• Used when pages contain duplicate content
• Specifies the canonical page
• Format: <link rel="canonical" href="http://primary-url">
• Add to primary as well as secondary pages
• Keeps the choice of primary page from being left to the search engines
• Supports duplicate content within a single domain
• Supports pages across different sub-domain URLs
• Support for pages in different domains varies by search engine (Google has honored cross-domain canonical tags since late 2009)
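As a rough illustration of the tag format, the Python sketch below builds the element for a given primary URL. The helper name `canonical_tag` is hypothetical and the example URL is a placeholder; the only real technique shown is HTML-escaping the URL so characters such as `&` do not break the attribute.

```python
from html import escape


def canonical_tag(primary_url: str) -> str:
    """Build a canonical link element for the <head> of a page."""
    # escape() protects the attribute value if the URL contains & or quotes.
    return f'<link rel="canonical" href="{escape(primary_url, quote=True)}">'


if __name__ == "__main__":
    print(canonical_tag("http://www.example.com/en/content/"))
```

The same tag, pointing at the same primary URL, would be emitted on the primary page and on every secondary page that duplicates it.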

Page 14: How to Avoid Shooting Yourself in the SEO

DNS Mistakes that Hurt SEO

DNS domain name mistakes
• Don’t bind multiple generic domains to the primary site
• Don’t bind your primary site using a wildcard for all names
• DNS Admins can get requests to add new names for a site
• Each name that connects to the site can dilute the SEO for all

Content-related mistakes
• Avoid creating multiple pages with duplicate content
• Avoid varying the format of URLs on a site
• Don’t let the search engines decide which page is primary

Page 15: How to Avoid Shooting Yourself in the SEO

Other DNS Mistakes for SEO

Development sites for developers
• Can contain an out-of-date copy of site content
• Should not be accessible over the Internet
• Should not have a binding to a public DNS name

Content management sites for content authors
• Can contain unapproved and unpublished content
• Can have a separate IP address
• Can be mapped to the master database for website
• Should not have a binding to a public DNS name

Page 16: How to Avoid Shooting Yourself in the SEO

How to Set Up DNS for SEO

Using multiple DNS domain names for a live site
• Bind a single generic domain name to the live site
• Bind top-level country domain names to the live site
• Create a second empty IIS site
• Bind all other generic domain names to second site
• Use the HTTP Redirect in IIS to set a 301 redirect
  • Select the second empty site in IIS
  • Open the HTTP Redirect tool in the IIS section
  • Add the primary URL in the Redirect text box
  • Set the status code to 301

Page 17: How to Avoid Shooting Yourself in the SEO

Setting Up DNS for Localization

Try to use top level domains for localization
• Get country specific domain names for each language version
• Use www.example.fr for French, not fr.example.com
• Use www.example.de for German, not www.example.com/de

Sites in top-level domains have distinct SERPs
• Search engines use an initial link for the root of the site to crawl the site in a particular language
• No session is needed to remember the language
• Each page has a unique URL in the different languages because the domain name is unique

Page 18: How to Avoid Shooting Yourself in the SEO

Geotargeting for SEO

For country-coded top-level domains
• Sites are already associated with a geographic region

For generic top-level domains (.com, .org, etc.)
• Use Webmaster Tools to set geographic targets
• You can target users in a particular geographic location
• Targets help determine how site appears on SERPs

Page 19: How to Avoid Shooting Yourself in the SEO

What to Ask Your DNS Admin

Identify DNS names that point to your live site
• Find the ones with no binding to your site
• Add a binding for each one to a secondary site that redirects
• Alternatively, request the DNS name be removed

Identify DNS names that don’t point to your live site
• Find the ones that display duplicate content
• Create a 301 redirect to the real site or
• Use a robots.txt file to prevent indexing of the site
  • Old links to the site will continue to work
  • Search engines will list only the real site

Page 20: How to Avoid Shooting Yourself in the SEO

Staging Mistakes that Reduce SEO

Duplicate content on a staging site
• A separate staging site can exist for Sitecore sites
• The staging site points to a copy of the web database
• A DNS Address record can exist for the staging site
• This DNS Address record may not be restricted
• The binding for the staging site uses the DNS record
• Search bots view the two sites as duplicate content
• There is no robots.txt file at the root of the staging site
• SEO can be reduced for the primary Live site

Page 21: How to Avoid Shooting Yourself in the SEO

Setting up Staging Sites for SEO

There are good reasons to have a staging site
• Authors can test content changes before going live
• Admins can troubleshoot publishing issues

Set up your staging site without hurting SEO
• Do one or more of the following…
  • Add a Robots.txt file to the root of the staging site
  • Restrict access with NTFS and Windows Authentication
  • Restrict access with IP Address restrictions
  • Allow DNS name resolution only on corporate network

Page 22: How to Avoid Shooting Yourself in the SEO

How SEO Mistakes Can Occur

A company decides to revamp their web site
• Old site continues to run
• New site is created over time with no robots.txt file
• There are public DNS records for both sites

A company wants to test a version upgrade
• Old site is copied as-is with configuration settings
• Upgrade is applied to copy of site and tested over time

A company wants to keep their old site around
• Robots.txt file is not added to old site
• Default redirects in IIS (302s) are used instead of 301s

<Your scenario here>

Page 23: How to Avoid Shooting Yourself in the SEO

Using Robots.txt to Help SEO

Robots.txt
• Plain text file
• Add to the root folder of the site
• Reduces indexing of duplicate content by search bots
• Can prevent indexing of specific folders or paths
• Can prevent indexing of entire site

How to prevent indexing of a site
User-agent: *
Disallow: /
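One way to confirm that a robots.txt file blocks what you expect is to run it through a parser before deploying it. The sketch below uses `urllib.robotparser` from the Python standard library; the helper name `is_crawl_allowed` and the `staging.example.com` host are hypothetical, and real crawlers may interpret edge cases differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# The two-line file from the slide: every bot, every path.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""


def is_crawl_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_crawl_allowed(BLOCK_ALL, "Googlebot", "http://staging.example.com/any/page"))
```

The same helper can be pointed at a file that blocks only specific folders, to check that public pages stay crawlable.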

Page 24: How to Avoid Shooting Yourself in the SEO

Using a Sitemap to Improve SEO

What is a Sitemap?
• Lists a hierarchy of the site pages you want indexed
• Helps search engines discover canonical pages

Guidelines for Sitemaps
• Add a Sitemaps file to the root of your site
• Periodically submit a Sitemap to all search engines
• Use the Sitemap protocol so all search engines can use it
• Specify your preferred URL for pages in your Sitemap
• Identify the location of your Sitemap file in robots.txt
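A minimal Sitemap in the sitemaps.org protocol format can be generated with a few lines of code. The following Python sketch is an assumption-laden illustration: the function names `build_sitemap` and `robots_sitemap_line` are hypothetical, the example.com URLs are placeholders, and only the required `<loc>` element is written (optional elements such as `<lastmod>` are omitted).

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a Sitemap protocol document listing the preferred URL of each page."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        # Each page gets a <url> entry with its preferred (canonical) address.
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


def robots_sitemap_line(sitemap_url):
    """Return the robots.txt line that tells crawlers where the Sitemap lives."""
    return f"Sitemap: {sitemap_url}"


if __name__ == "__main__":
    print(build_sitemap(["http://www.example.com/", "http://www.example.com/en/content/"]))
    print(robots_sitemap_line("http://www.example.com/sitemap.xml"))
```

Listing only the preferred form of each URL is what lets the Sitemap act as a canonicalization hint.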


Page 25: How to Avoid Shooting Yourself in the SEO

Redirect Mistakes that Reduce SEO

Types of redirects
• 301 indicates “Moved Permanently”
• 302 indicates “Found” (formerly “Moved Temporarily”)
• 307 indicates “Temporary Redirect”

The three biggest redirect mistakes for SEO
• 302 Redirects are used
• 307 Redirects are used
• 301 Redirects are not used

Our next slide explains why…

Page 26: How to Avoid Shooting Yourself in the SEO

Link Juice

Marketing term for “online ranking power”
• The quantity and quality of inbound links to a page
• One of the primary factors that determines rankings

How Link Juice is affected by redirects
• 302 redirects forward almost no link juice
• 307 redirects forward almost no link juice
• 301 redirects forward an estimated 90-99% of link juice
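The redirect guidance can be turned into a simple audit helper. This Python sketch is illustrative only: the function name `review_redirect` and the verdict wording are made up for the example, and it deliberately covers just the three status codes discussed in this deck.

```python
from http import HTTPStatus


def review_redirect(status_code: int) -> str:
    """Describe a redirect status code and say whether it suits a permanent move."""
    status = HTTPStatus(status_code)
    if status not in (301, 302, 307):
        raise ValueError(f"{status_code} is not one of the redirects covered here")
    # Per the slides, only the permanent redirect forwards most link juice.
    if status == HTTPStatus.MOVED_PERMANENTLY:
        verdict = "forwards link juice"
    else:
        verdict = "replace with a 301"
    return f"{status.value} {status.phrase}: {verdict}"


if __name__ == "__main__":
    for code in (301, 302, 307):
        print(review_redirect(code))
```

Feeding it the status codes returned by old URLs gives a quick list of redirects worth changing.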


Page 27: How to Avoid Shooting Yourself in the SEO

When to Create a Redirect

Redirects are useful when…
• A page of content is moved on your site
• You want an old link to work on your new site
• You want the link juice from an old link to be transferred to the new location of the content
• You want the link juice from multiple domain names transferred to a single site name
• You don’t mind if visitors type abc.com and xyz.com is displayed in their address bar

Page 28: How to Avoid Shooting Yourself in the SEO

The URL Rewrite Tool

Microsoft add-on for IIS 7
• Permits Web admins to replace and redirect URLs
• Rules are configured in IIS and saved to web.config
• URLs can be more search engine friendly
• Supports many types of redirects, including 301s
• Supports regular expressions in addition to wildcards
• Hint: Use regular expressions (regex) over wildcards

Page 29: How to Avoid Shooting Yourself in the SEO

Benefits of the URL Rewrite Tool

The URL Rewrite tool is very powerful
• Should be installed on the “Live” Sitecore server
• Enables admins & webmasters to support redirects
• Removes the need to bind non-primary generic domains to a secondary site that issues 301 redirects (see slide 16)
• Create a “Use Canonical Name” rule in the URL Rewrite tool instead (see next slide for details)

Page 30: How to Avoid Shooting Yourself in the SEO

The ‘Use Canonical Name’ Rule

A rule created in the URL Rewrite tool
• Install the URL Rewrite Tool add-on for IIS 7
• Bind all domain names and wildcards to the main site
• Create a rule named “Use Canonical Name”
• Add all secondary generic domain names as conditions
• Set the Logical Grouping to “Match Any”
• Create a permanent redirect to your preferred URL
• See examples on the next three slides

Page 31: How to Avoid Shooting Yourself in the SEO

‘Use Canonical Name’ Example
Example 1 – Top of URL Rewrite Window

[Screenshot not included in transcript]

Page 32: How to Avoid Shooting Yourself in the SEO

‘Use Canonical Name’ Example
Example 2 – Bottom of URL Rewrite Window

[Screenshot not included in transcript]

Page 33: How to Avoid Shooting Yourself in the SEO

‘Use Canonical Name’ Example
Example 3 – Resulting Rule in Web.config File

<rule name="Use Canonical Name" stopProcessing="true">
  <match url="(.*)" />
  <conditions logicalGrouping="MatchAny" trackAllCaptures="false">
    <add input="{HTTP_HOST}" pattern="(.*)sitecorenet\.com" />
    <add input="{HTTP_HOST}" pattern="(.*)sitecorepedia\.com" />
  </conditions>
  <action type="Redirect" url="http://www.theinsidecorner.com/{tolower:{R:1}}" />
</rule>
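To make the rule's behavior easier to reason about, the Python sketch below imitates it outside IIS. It is a simulation under assumptions, not the URL Rewrite engine: the function name `canonical_redirect` is hypothetical, host patterns are matched case-insensitively with "Match Any" semantics, and the requested path is lowercased in the redirect target as `{tolower:{R:1}}` does.

```python
import re

# Host patterns and target copied from the web.config rule on this slide.
SECONDARY_HOSTS = [r"(.*)sitecorenet\.com", r"(.*)sitecorepedia\.com"]
PRIMARY = "http://www.theinsidecorner.com/"


def canonical_redirect(host: str, path: str):
    """Return (301, location) when host is a secondary domain, otherwise None."""
    # "Match Any": one matching condition is enough to trigger the redirect.
    if any(re.search(pattern, host, re.IGNORECASE) for pattern in SECONDARY_HOSTS):
        return 301, PRIMARY + path.lstrip("/").lower()
    return None


if __name__ == "__main__":
    print(canonical_redirect("www.sitecorenet.com", "/About/Team"))
    print(canonical_redirect("www.theinsidecorner.com", "/About/Team"))
```

Requests that already use the primary host fall through untouched, which is what lets all domain names be bound to the same site without creating duplicate URLs.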


Page 34: How to Avoid Shooting Yourself in the SEO

Improving SEO with Internal Links

SEO guidelines for internal links on Sitecore sites
• Use consistent URL formats for all internal links
• Use the toolbar to configure links in Rich Text
• Avoid adding markup for URLs manually
• Use General Links over Rich Text to create links
• Use Clones when displaying duplicate content
• Review Broken Links Report and fix broken links

Page 35: How to Avoid Shooting Yourself in the SEO

Improving SEO with Link Manager

Sitecore’s Link Manager settings
• Located in the web.config file or a custom config file

Guidelines for Link Manager settings
• Set Lowercase URLs to True
• Set Add Aspx Extension to False
• Set Always Include Server URL to True
• Set Language Embedding to Always for multi-lingual sites
• Set Language Location to File Path for multi-lingual sites
• Set Shorten URLs to True for Content Trees with multiple sites

Additional guidelines for SEO
• Configure IIS to pass all requests to ASP.NET
• Add a trailing slash to the URL when appropriate

Page 36: How to Avoid Shooting Yourself in the SEO

Use File Paths to Embed Languages

Link Manager supports two types of embedding
• Query Strings
  • Parameter sets the language in a session variable
  • Search engines cannot read session variables
• File Paths
  • A session-less solution
  • Allows multiple languages in the same domain

Use file paths over query strings

Page 37: How to Avoid Shooting Yourself in the SEO

Sitecore Shared Source for SEO

SEO-Friendly URL module
• Enables SEO-friendly URLs for Sitecore items
• Implements a custom link provider & item resolver

Jungle Torch module
• An SEO reporting and inbound marketing tool
• Useful for social network analysis

Both are available from http://marketplace.sitecore.net

Page 38: How to Avoid Shooting Yourself in the SEO

MOZ Tools for SEO

MozBar
• A free SEO toolbar for Firefox or Chrome
• Displays relevant SEO metrics as you surf the Web
• Available from http://moz.com/tools/seo-toolbar

Open Site Explorer
• A free tool that uses the Mozscape web index to analyze links
• Supports analyzing up to 10k links to any site or page
• Available from http://www.opensiteexplorer.org/

Page 39: How to Avoid Shooting Yourself in the SEO

Tools To Locate Duplicate Content

Duplicate Content Tool
• Performs a series of SEO checks on submitted URLs
• http://www.virante.org/seo-tools/duplicate-content

Xenu
• Scans site links and returns a table of all available URLs
• http://home.snafu.de/tilman/xenulink.html

Google Webmaster Tools
• Reports site’s duplicate titles and meta descriptions
• https://www.google.com/webmasters/tools/home

Page 40: How to Avoid Shooting Yourself in the SEO

Duplicate Content Tool Checks

• WWW/NonWWW Header checks if site returns a 301 redirect from www to non-www or vice versa
• Google Cache checks if site has issues with www vs. non-www in Google’s Cache
• Similarity checks if site has pages omitted from Google for being too similar in the top 1000 results
• Default Page checks if site appears to have redirected or standardized on either the / or /index.{ext}
• 404 checks if site correctly returns 404 errors for pages that do not exist
• PageRank Dispersion compares page rankings for the www and non-www versions of the site to verify they are the same

Page 41: How to Avoid Shooting Yourself in the SEO

In Summary
1. Use consistent URL formats and minimize similar content
2. Minimize the use of cloned items and alias items in Sitecore
3. Use static URLs over dynamic URLs and avoid query strings
4. Clean up broken links and HTML validation errors
5. Add canonical tags to pages with duplicate content
6. Specify canonical URLs in a Sitemap and submit it to search engines
7. Use Google Webmaster Tools to specify your preferred domain
8. Analyze your site bindings and compare them with your DNS records
9. Bind only your primary DNS domain(s) to your live site
10. Bind generic DNS domains to a second site that redirects to live site
11. Use 301 redirects instead of 302 or 307 redirects
12. Use top-level domains for localization, not subdomains or folders
13. Block public DNS resolution for all development and staging sites
14. Add a robots.txt file to all development and staging sites
15. Configure your Sitecore Link Manager settings for SEO

Page 42: How to Avoid Shooting Yourself in the SEO

Caveat Emptor

There are many ways to manage SEO for Sitecore sites
• Skilled developers can provide other solutions
• Review your SEO needs with your Sitecore partner
• Review proposed changes with all stakeholders
• Implement new configurations on a test system first

This presentation is intended to be academic
• To enhance public knowledge of SEO issues
• To promote a better understanding of SEO

Author cannot guarantee accuracy or fitness for your site
• Author makes no representations or warranties
• Author disclaims any liability if you act on this information

Page 43: How to Avoid Shooting Yourself in the SEO

More Topics from TIC

All Sitecore-related SlideShares from TIC
http://www.slideshare.net/cwnelson/presentations

How to Assure Success for a New Sitecore Site (full slide deck)
http://www.slideshare.net/cwnelson/creating-new-sitecore-sites-full

Implementing Routine Sitecore Maintenance
http://www.slideshare.net/cwnelson/sitecore-maintenance-tips

Performance Tips for Sitecore Mobile Users
http://www.slideshare.net/cwnelson/sitecore-mobile-performance-tips

Sitecore 7 – New Features for Users
http://www.slideshare.net/cwnelson/sitecore-7-new-features

Sitecore Performance Tips for Visitor Engagement
http://www.slideshare.net/cwnelson/sitecore-performance-tips

Troubleshooting Sitecore Performance
http://www.slideshare.net/cwnelson/troubleshooting-sitecore-performance

Page 44: How to Avoid Shooting Yourself in the SEO

The Inside Corner, Inc.

• The Inside Corner, Inc. (TIC) is owned by Craig Nelson
• TIC is a Sitecore partner & web consultancy
• A U.S. point of contact for teams of Sitecore specialists
• We create new Sitecore sites & enhance existing ones

Page 45: How to Avoid Shooting Yourself in the SEO

Primary Developer - Craig Nelson

• Sitecore Specialist certified for Sitecore 7
• 10 years of system administration experience
• 10 years as a Microsoft Certified Trainer
• 5 years of ASP.NET web development experience
• 4 years of Sitecore development experience
• 3 years as a Sitecore technical instructor
• Sitecore architecture & configuration expert
• Performance & SEO optimization specialist
• Workflow configuration specialist

Page 46: How to Avoid Shooting Yourself in the SEO

Get Help with your Sitecore Site

Follow TIC on LinkedIn
• http://www.linkedin.com/company/the-inside-corner-inc.

Add TIC to your Circles on Google Plus
• http://google.com/+Theinsidecorner

Follow TIC on Twitter
• https://twitter.com/TheInsideCorner

Contact Craig Directly
• Craig Nelson - Certified 7.0 Developer and Sitecore Partner
• cwnelson[at]ticdevs.com
• Mobile - 707.319.7446

Visit the TIC website
• http://ticdevs.com

Page 48: How to Avoid Shooting Yourself in the SEO

Thank you for taking the time to view this presentation on

How to Avoid Shooting Yourself in the SEO

Please share with others and add a comment if you think it is useful

Thanks!

June 23, 2013
