SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you...

41
SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know that behind every SharePoint installation is Microsoft Sql Server! Being document centric, the SharePoint databases grow 100 times faster than standard SQL Server databases. By Tom Bizannes Microsoft Certified Professional MacroView Business Technology Level 12, 171 Clarence St, Sydney, Australia +612 9249 2700

Transcript of SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you...

Page 1: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

SharePoint for Sql Guys

With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know that behind every SharePoint installation is Microsoft Sql Server!

Being document centric, the SharePoint databases grow 100 times faster than standard SQL Server databases.

By Tom BizannesMicrosoft Certified ProfessionalMacroView Business TechnologyLevel 12, 171 Clarence St, Sydney, Australia+612 9249 2700

Page 2: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

What do you know?

• Sql Administrators?• Sql Developers?• Any .Net Developers?• Any SharePoint Administrators?• Who has never seen or used SharePoint?

Page 3: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

What we will cover

• Brief Overview of components• Optimizing – the need for speed!• Backing up• Extras

– Integrating Reporting Services 2008 on IIS7– Reporting Services and SharePoint Lists

Page 4: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Why

• Because Microsoft brought out a whitepaper optimizing SharePoint late last year!

• Also, working for an Australian Company that has the leading document management add-on for SharePoint (WISDOM DMF), we see first hand how quickly SharePoint can grow and how complex they can get

Page 5: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

What is SharePoint?Office SharePoint Server 2007 (MOSS 2007)

Excel ServicesInfoPath Forms ServicesBusiness Data catalogExtensible Enterprise SearchLOB single sign-onContent management

Windows SharePoint Services (WSS v3)ASP.NET 2.0 integrationWorkflow supportContent types and meta dataAuditingFeature & solution deployment

Office SharePoint Server

Windows SharePoint Services

ASP.NETSQL

Page 6: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Office SharePoint Server 2007Business productivity server

Business Intelligence Collaboration

Portals

ECM

Business Process

and Forms

Search

RSS

Blogs

Wikis

Workspaces

Web services

Composition

Role based

Multi-browser

Page 7: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

The Big Picture

Sql Server

Indexing

Sharepoint

Page 8: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Need to work togetherThe Sql Administrators need to work with the SharePoint

Administators

Page 9: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

What do you want to know?

• What to get your SharePoint Administrator to do

• What you need to do as a Sql Administrator• What Tools you can use to help you

Page 10: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Why the fuss?

Let’s move all the files off the file store into SharePoint.

= Better searching, sharing etc= Easier to Share with external clients etc= 3 x the space on your SAN (For the initial

migration)– E.g. 400G to 1.2 Terabytes!

= A lot more work for the Sql Guy! But he can handle it

Page 11: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

The search indexer

• This is your biggest problem!• You can put the focus on your SharePoint guy,

but this is the animal you need to tame.• Anything you can do to make it work faster

and better will reduce the CPU hit on the server.

Page 12: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Optimise your Search Indexer• There are many factors involved in the SharePoint crawling process that can impact indexing performance. There are also some

steps you can take to improve that. Here are the common causes and their resolution:• Indexing Performace is set at reduced - common mistake on the configuration screen for the index service. See Central Administration

> Operations > Services on Server > Office SharePoint Server Search Service Settings and set to Maximum. • Number of Connections - by default the indexer will run a limited number of simultaneous threads (6 usually). This can be increased

manually by adding specific Crawler Impact Rules for each host. You can really improve speed by setting a large file server to 32 connections. But watch your network for bottlenecks.

• Crawled systems are slow or hosted on remote networks. - not a lot to be done here, except by moving those file closer. • Overlapping Crawls - SharePoint gives priority to the first running crawl so that if you already are indexing one system it will hold up

the indexing of a second and increase crawl times. – Solutions: Schedule your crawl times so there is no overlap. Full crawls will take the longest so run those exclusively.

• IFilter Issues - the Adobe PDF IFilter can only filter one file at a time and that will slow crawls down. – Using a PDF filter from pdflib.com or Foxit

• Not enought Memory Allocated to Filter Process - you can increase the memory allocation by adjusting the following registry keys – HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager: set

DedicatedFilterProcessMemoryQuota = 200000000 Decimal – HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager: set

FilterProcessMemoryQuota = 200000000 Decimal• Bad File Retries - there is a setting in the registry that controls the number of times a file is retried on error. This will severly slow down

incremental crawls as the default is 100. This retry count can be adjust by this key: – HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Global\Gathering Manager: set DeleteOnErrorInterval

= 4 Decimal• General Architecture Issues - Ensure that you have at least 2 Gig of free memory available before your crawl even starts and that you

have at least 2 real processors available • Disk Health - the nature of the indexing process causes extensive fragmentation of the file system for both the index server and the

database server. Schedule defrags routinely and after all full crawls. Ensure you have enough diskspace always.• Run 64 bit OS - school is still out on this one, i personally haven't seen must difference as long as there is enough memory and the

same processor types, but MS recommends this for large deployments.

Page 13: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

More questions for the SharePoint Administrators

• Versioning– This impacts on the size, but makes restoring less of an issue– Can delete minor versions and do all sorts of other things if the

database gets too big.• Auditing

– This can grow out of control• Have you set indexed columns on large document libraries for

performance?• How many site collections do you have and how big will they

grow?– Can set a separate database for each one– E.g. might have historical data on one etc– Maybe a site collection for clients A to M and another for clients N to Z

Page 14: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

The Databases

• Main Database is Wss_content• If running MOSS 2007 backup the config database

at the same point in time. If you don’t back this up, you will need to run some scripts to restore

• Know what service packs and hot fixes have been applied if you want to restore!

• Set a 100G limit per site collection? (This is as per the Microsoft Whitepaper )

• Optimize the search indexer

Page 15: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Always backup your Content Store

• The main database is wss_content• You can create a content store per site

collection. E.g. wss_content_hr, wss_content_admin etc

• You can get your SharePoint Administrator to set up an alert via SharePoint Administration when it gets close to the capacity you set.

Page 16: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Other networking things

• Enable IIS Compression• As a general best practice, the SharePoint

Operations team verified that IIS compression for static content was enabled on front-end servers. Enabling static compression is especially helpful for serving content to users over slower links

• Get fast Network cards on all servers (1Gb)

Page 17: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Service packs and hot fixes

• Searching had a few bugs that were fixed with Service Pack 1– The crawler never finished if indexes were rebuilt

• Always note what service pack and hot fix you are up to as you will need these if you ever restore or the schemas will cause issues etc

Page 18: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Optimising the databases

• Set a fill factor on all indexes to 70%Q: Where do you set a default fill factor?

• Use indexed columns on large document libraries for performance (For your SharePoint Guy)

• Create different site collections and/or document libraries on different content databases

• Separate Databases – see following

Page 19: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Separate Tempdb first• Office SharePoint Server 2007 farm performance can be

significantly impeded by insufficient disk I/O for the tempdb. To avoid this issue, allocate dedicated disks for the tempdb. If a high workload is projected or monitored — that is, the average read operation or the average write operation requires more than 20 milliseconds (ms) — you might need to ease the bottleneck by either separating the files across disks, or by replacing your disks with faster disks.

• For best performance, place the tempdb on a RAID 10 array. The number of tempdb data files should equal the number of core CPUs, and the tempdb data files should be set at an equal size. Count dual core processors as two CPUs for this purpose. Count each processor that supports hyper-threading as a single CPU.

Page 20: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Separate and prioritize your data among disks

• When prioritizing data among faster disks, use the following ranking: – Tempdb data and transaction logs– Database transaction log files– Search database– Database data files

• In a heavily read-oriented portal site, prioritize data over logs.• The use of multiple data files for databases other than content databases

and the SSP search database is not supported. • The use of SQL Server partitioning is not supported for SharePoint

databases. Use only simple data files. (Note: We have clients who opted for partitioning and their searches still work etc The partition though was split into 100G blocks)

• Ideally, place the tempdb, content databases, and SQL Server 2005 transaction logs on separate physical hard disks.

Page 21: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Separate and prioritize your data among disks - continued

• Only create files in the primary filegroup for the database. • Distribute the files across separate disks. • Create data files of equal size. • Separate database data and transaction log file across different

disks. If files must share disks because the files are too small to warrant a whole disk or stripe or you have a shortage of disk space, put files that have different usage patterns on the same disk to minimize simultaneous access requests.

• Consult your storage hardware vendor for information about how to configure all logs and the search databases for write optimization for your particular storage solution.

• Allocate dedicated spindles for the search database.

Page 22: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Partitioning or Not?

The Notes clearly state that Microsoft doesn’t support partitioning. Companies will use it though as it is easier for huge implementations.* 100G per site collection is just a guideline!

Page 24: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

The tools Microsoft talks about• Event Viewer This tool is especially useful for understanding the underlying behavior by evaluating application errors and

warnings, or investigating system events that occur before, during, and after a performance incident. • Dump file analysis Analyzing dump files is an advanced troubleshooting and analysis approach that provides low-level

information about critical system errors and memory dumps. It enables the SharePoint Operations team to examine the data in memory and analyze the possible causes of such issues as memory leaks and invalid pointers.

• System Monitor The SharePoint Operations team uses tools such as Event Viewer and dump file analysis to investigate specific incidents and performance issues. The team uses System Monitor in the Windows Server® 2003 operating system (called Performance Monitor in Windows Server 2008) for establishing a performance baseline, tracking trends, and compiling data on resulting performance after making changes.

• SQL Server Profiler This tool is a graphical user interface to SQL Trace for monitoring an instance of SQL Server Database Engine or SQL Server Analysis Services. Microsoft IT and other teams use this tool to evaluate SQL Server performance aspects such as query times, stored procedure run times, and deadlocks. This tool is especially useful for analyzing the underlying calls to SQL Server databases that are housed on the storage area network (SAN).

• Custom tool for client-based URL ping The SharePoint Operations team created a custom tool that recorded the time to first byte for URLs hosted on SharePoint servers. This is one of the most useful tools because it enables the comparison of statistics before and after implementing configuration changes to the environment.

• Log Parser The SharePoint Operations team uses logging extensively when determining root causes of issues, including SharePoint trace logs and IIS and Unified Logging Service (ULS) application and service logs. Microsoft IT uses Log Parser as one of the tools to monitor traffic, determine traffic sources distribution, and establish performance baselines. This free tool parses IIS logs, event logs, and many other kinds of structured data by using syntax similar to Structured Query Language (SQL). For more information about Log Parser, refer to the Script Center resource at http://www.microsoft.com/technet/scriptcenter/tools/logparser/default.mspx.

• Fiddler This tool is helpful for measuring caching, page sizes, authentication, and general performance issues. For more information, visit the Fiddler Web site at http://www.fiddler2.com/fiddler2/.

Page 25: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Backup and restore

Two Types of Restore1. Restoring from a nasty corruption or hardware

failure2. Restoring a single or small set of files that have

been overwritten e.g. user error(Item level restore)

* Item level can save a lot of time and effort* The recycle bin won’t help in case 2, but

versioning might

Page 26: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Backing Up

For restoring from failure or corruption• Sql Server• SharePoint Admistration• Windows and stsadmin..via scripts….• SharePoint Designer• Microsoft Data Protection Management• Quest – just very fast sql backup….for huge databasesFor item level restore• Doc Avenue• Idera

Page 27: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Normal Sql Server and Windows

• If MOSS 2007 then backup the config database as well.– There is a fix if you forgot to backup the config…

but you will pull your hair out first!• Know what service pack and hot fix you are up

to with SharePoint (Schema related)• With the recycle bin and versioning, you may

not need a more granular backup..

Page 28: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Windows and stsadmin..

• Can get clever with scripts to backup various sites.– More granular if you have lots of scripts against

different sites.• SharePoint Designer creates the same sort of

backup as stsadmin but needs to be run manually.

Page 29: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Third party tools

• Microsoft Data Protection Management– Can do incremental backups every 15 minutes etc

• Doc Avenue– Great for restoring a site or document library– Very granular

• Idera

Page 30: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Microsoft’s Conclusions

Best practices for the front end include:• Run IIS version 7.0 on 64-bit servers Memory and CPU are common performance optimization factors

for SharePoint Server. Using 64-bit hardware increases the amount of usable memory, which helps to maintain a healthy system state for worker processes.

• Use a front-end and back-end NIC configuration for IIS During peak load times, as many people access SharePoint sites, the NIC traffic increases. Using dedicated NICs for connections to the SQL Server back end and the clients provides better load distribution. Using dedicated NICs also provides more-accurate statistics and helps with troubleshooting traffic congestion issues by segregating the front-end and back-end traffic.

• Load balance client traffic The SharePoint Operations team uses NLB for balancing client traffic. It is a best practice to load balance incoming traffic for optimal user experience and server utilization.

• Use IIS compression for static content The SharePoint Operations team ensures that static compression is enabled to conserve traffic and server resources.

• Enable caching Page output caching on front-end servers reduces CPU utilization on front-end servers by storing compiled ASP.NET pages in RAM. Enabling this setting resulted in performance gains for the SharePoint Operations team. BLOB caching helps to relieve load on back-end servers by caching static content and not accessing databases when it is requested.

Page 31: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Microsoft’s Conclusions p2

Best practices for the back end include:• Limit database size to enhance manageability When databases grow, they can become less manageable for

backup and restore operations, or for troubleshooting. The SharePoint Operations team uses a 100-GB limit.• Allocate storage for versioning and the recycle bin When designing the environment, an organization

should consider business needs, such as versioning, and ensure that adequate disk space and I/O are available to accommodate them.

• Use quota templates to manage storage Microsoft IT uses standardized configuration templates in all possible and practical scenarios, including quotas. Using quota templates helps preserve a standard environment, which reduces administrative overhead.

• Manage large lists for performance Having large lists by itself is not necessarily a performance issue. When SharePoint Server renders the many items in those lists, that can cause spikes in render times and database blocking. One way to mitigate large lists is to use subfolders and create a hierarchical structure where each folder or subfolder has no more than 3,000 items.

• Separate and prioritize data among disks and create disk groups for specific data Because available disk I/O throughput is so important for optimal SQL Server performance, identifying the read/write patterns of services and dedicating SAN LUNs to them results in better performance than using many service types with the same disk group. The SharePoint Operations team takes this idea a step farther and uses dedicated partitions for data.

Page 32: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Summary

• Map out what is happening• Check the fill factor• Split databases for performance (Tempdb, Logs,

Wss_Content, etc)• Have good benchmarking / reporting in place• Check the growth and cpu usage• Have it out with your SharePoint Admins• Plan and keep on planning• Check out some tools to make life easier

Page 33: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

References• SharePoint Performance Optimization: How Microsoft IT Increases Availability and Decreases Rendering

Time of SharePoint Sites, which discusses how Microsoft IT discovered opportunities to enhance SharePoint optimization.

• Manage lists and libraries with many items

• The Company I work for

• My Blog

Page 34: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Idera – SharePoint Tools

Point backup - backup & recovery for SharePoint

sqltools.com.au

Page 35: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Point backup Highlights

Restore to Same Site or Different Site

Self-service item level recovery for admins

Provides Reliable Backups of SharePoint Sites & Site Collections

Easy to use Interface

Up to 95% compression

Item Level restores

Quickly Search Archives

Central Management

Automated backup scheduling

sqltools.com.au

Page 36: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Point backup – is easy to install

SUPPORTED SHAREPOINT SERVERS Windows SharePoint Services (WSS) 3.0 Microsoft Office SharePoint Services (MOSS) 2007

TCP Ports:

7484 - Management Service7485 - WFE Backup Service

sqltools.com.au

Page 37: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

Point admin toolset

11 Tools to Simplify SharePoint® Management

sqltools.com.au

Page 38: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

© AvePoint, Inc.AvePoint confidential and proprietary. This document may not be distributed in any form without the prior written

consent from AvePoint, Inc.

Industry’s most comprehensive, integrated infrastructure management solution for Microsoft SharePoint

DocAve Enterprise Solution

Page 39: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

© AvePoint, Inc.AvePoint confidential and proprietary. This document may not be distributed in any form without the prior written

consent from AvePoint, Inc.

Major Category Functional Areas

Full Spectrum Item-level & full fidelity SharePoint Backup/RestoreIndustry’s first true granular backup and full-fidelity restore along with intelligent, business process-aware and SLA-driven data protection

1

Administration/Discovery & Replication with ArchivingAllows administrators to easily view, search, manage, report, and replicate configuration, security and/or content across all SharePoint assets.

2

3Industry Compliance RegulationsStore and monitor SharePoint data in a structured, auditable way to comply with stringent legal and regulatory requirements. Track and record all SharePoint interactions, including usage, search, and security changes

4Seamless SharePoint MigrationMigrate from any SharePoint versions, file systems, Exchange Public Folders, Lotus Notes/QuickPlace/Quickr, Documentum eRoom to Microsoft SharePoint 2007

Page 40: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

© AvePoint, Inc.AvePoint confidential and proprietary. This document may not be distributed in any form without the prior written

consent from AvePoint, Inc.

DocAve v5: Release Themes

Intelligent SharePoint Backup and RecoveryRevolutionary “business process-aware” and SLA-driven data protection, anchored by the Business Criticality Matrix

1

Groundbreaking innovations and powerful new modulesSolution-wide enhancements and new modules such as Lotus Notes Migrator and Deployment Manager round out comprehensive suite

2

3

Dynamic, vibrant user interactionRich internet application provides engaging experience, improved usability and user satisfaction, as well as increased productivity

4

Fully distributed solution architecture delivers enterprise “-ilities”Multiple service redundancy and optimized load-balancing for increased reliability and efficiency

Page 41: SharePoint for Sql Guys With over 100 million users and counting, SharePoint 2007 is everywhere you look, yet some network administrators don't even know.

© AvePoint, Inc.AvePoint confidential and proprietary. This document may not be distributed in any form without the prior written

consent from AvePoint, Inc.

Comprehensive Data Protection

Backup and Recovery• Item Level Backup/Restore of

SharePoint Content in SQL• Total Farm Backup with granular item-

level full fidelity recovery

High Availability• Continuous Replication of Content to

Standby or Offline Database

WFEs

APP Servers

SQLSQL