TSM Oxford Symposium Disk Tuning with TSM Dave Canan, IBM...
Transcript of TSM Oxford Symposium Disk Tuning with TSM Dave Canan, IBM...
IBM Software Group
TSM Oxford SymposiumDisk Tuning with TSMDave Canan, IBM Advanced Technical SupportDave Canan, IBM Advanced Technical [email protected]
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
Agenda
Cost vs. Performance: Considerations when sizing the TSM environment
TSM Disk Tuning BasicsTSM Disk Tuning Basics
How TSM does I/O
Utilities that can be used for examining disk IO with TSMUtilities that can be used for examining disk IO with TSM
TSM Disk Tuning and AIX
R d ti f TSM d V i Di k T h l i /S b tRecommendations for TSM and Various Disk Technologies/Subsystems
Issues still needing further study
A di ATS Di k S dAppendix – ATS Disk Study
Oxford University TSM Symposium 2007 © 2007 IBM Corporation2
IBM Software Group
Cost vs. Performance: Considerations When Sizing the TSM Environment
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
Cost vs. Performance: Considerations When Sizing the TSM Environment
Customer needs to consider pros and cons of cost vs. performance
Any backup application does heavy I/O. TSM is no exception. If this is not planned for you may have performance issuesplanned for, you may have performance issues.
Better performance comes at an additional cost.
Trend in industry is to have larger capacity disks Customer must decideTrend in industry is to have larger capacity disks. Customer must decide whether they should purchase more disks of a smaller size vs. fewer disks of a larger size. This comes with a trade-off in performance
B t ti i t i l t TSM t h ibl f hBest practice is to isolate TSM components as much as possible from each other.
In most cases, some portion of the disk must be left vacant to achieve i fmaximum performance
For larger TSM environments, we recommend that customers dedicate a complete disk subsystem to TSM
Oxford University TSM Symposium 2007 © 2007 IBM Corporation4
IBM Software Group | Tivoli software
Cost vs. Performance: Considerations when sizing the TSM environment
Data management software (TSM) should not be a last minute “add-on” for a project. Proper capacity and performance sizing should be a priority. This p j p p y p g p ywill save money in the long run.
If capacity and performance planning is done by a department other than the TSM group that organization should be involved with the TSM planning fromTSM group, that organization should be involved with the TSM planning from the beginning of the project.
Don’t let the storage management group dictate what you have to use. You g g g p ydon’t just need space, you need performance.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation5
IBM Software Group
TSM Disk Tuning Basics
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
TSM Disk Tuning BasicsPoor disk setup WILL impact performance !!
• Advanced functionality of some devices should reduce RAID impact
DIO (direct I/O) now the default in TSM 5 3 + for disk storage pools (not forDIO (direct I/O) now the default in TSM 5.3 + for disk storage pools (not for Linux)
• Filesystem selection is now less important (exception Solaris and UFS)y p ( p )• With the advent of DIO, file system I/O is now more efficient
Sharing disk with other apps WILL impact performance.
Be careful when multiple TSM servers are sharing a disk subsystem• Treat them as separate applications
For some disk subsystems, cache settings can be set separately for read and write
Disk monitoring tools are available for performance tuning
Oxford University TSM Symposium 2007 © 2007 IBM Corporation7
Disk monitoring tools are available for performance tuning
IBM Software Group | Tivoli software
TSM Disk Tuning Basics
Disk not the only component to consider. All I/O factors must be considered together:
I/O h i i f li kl d• I/O characteristics of client workload• Disk drive capacity and % utilized• Disk and/or filesystem fragmentation• Disk drive speed (rpm) (example, SATA vs. FC)• Processor speed of disk subsystem• Disk subsystem cache size and settingsDisk subsystem cache size and settings• Number and type of disk adapters• RAID type
Data la o t• Data layout• How the disks are “wired” • Type of Filesystem used (JFS, JFS2, RLV, etc.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation8
IBM Software Group
How TSM does I/O
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
The Nature of I/O Behavior with TSM
Three separate components of TSM architecture
• TSM database• TSM database
• TSM Recovery Log
• TSM Storage Pools
Need to be considered separately for setup and tuning
• Each has a different I/O access behavior
• Each has different cache requirements
Oxford University TSM Symposium 2007 © 2007 IBM Corporation10
IBM Software Group | Tivoli software
I/O Behavior for the TSM Database (1 of 2)Access pattern for DB is random during most operations Because ofAccess pattern for DB is random during most operations. Because of random IO, use the fastest disks you have for the DB.
Access pattern is sequential during DB backup
• DB volumes read one at a time, from beginning to end of DB volume• Volumes read for backup in the order in which they were defined to
TSMTSM
DB volumes will initially be filled to EOV before going to next volume
DB reorganization moves data to fewer volumes (may not be desirable for performance)
With DS4800, RAID5 is good for DB. Pay attention to cache settings for DB
Oxford University TSM Symposium 2007 © 2007 IBM Corporation11
IBM Software Group | Tivoli software
I/O Behavior for the TSM Database (2 of 2)
Look at IOPS for how the TSM DB is performing. If IOPS are not being met, you should consider adding additional volumes for the database
• Other FC disks can handle roughly 150 IOPS before queuing occurs • Iostat or filemon can be used to determine the number of IOPS occurring against
volumes• Server instrumentation DB threads DB also give count of actual I/Os
If possible, sacrifice space and use more disks for best performance
Create between 4 and 16 DB volumes for use with TSM
Blocksize for I/O varies, I/O to DB does NOT use Direct I/O
• 4KB for all operations except during TSM database backup• 256KB reads done during TSM database backup
Oxford University TSM Symposium 2007 © 2007 IBM Corporation12
IBM Software Group | Tivoli software
I/O Behavior for the TSM Recovery Log
Access to recovery log is always sequential
I/O blocksize used is always 4KB, I/O to log does NOT use direct I/O
Good candidate for RAID0 (if using TSM mirroring). If not using TSM mirroring, then you could mirror with RAID1
Sacrifice disk space for best performance If you can’t sacrifice a volume forSacrifice disk space for best performance. If you can’t sacrifice a volume for the log, it is better to place with storage pools rather than with DB.
Number of log volumes not important; may want to have 2 for easy of i tmaintenance.
TSM recovery log normally only written to except at TSM initialization.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation13
IBM Software Group | Tivoli software
I/O Behavior for the TSM Storage Pools
TSM storage pools are generally written to in the order in which they are defined to TSM. This becomes irrelevant once many tasks are executing.
Understand the disk technologies when designing the storage pools SomeUnderstand the disk technologies when designing the storage pools. Some SATA-1 drives do not support NCQ, SATA-IO (also known as SATA-3.0) drives do.
Two “rules of thumb” for how many storage pool volumes to define on diskTwo “rules-of-thumb” for how many storage pool volumes to define on disk. In order of priority, consider the following:
1. For any given logical volume (LV), you should have no more storage pool f ( fvolumes defined within that LV than you have hdisks in the LV. (Example: if you
had 6 hdisks striped together in a logical volume, you would define no more than 6 storage pool volumes within that LV.)
2. Try to have as many storage pool volumes defined in a storage pool as you have simultaneous backup sessions to that storage pool
Oxford University TSM Symposium 2007 © 2007 IBM Corporation14
IBM Software Group | Tivoli software
I/O Behavior for the TSM Storage Pools
TSM storage pools always read/written using 256KB blocksize
TSM storage pools should be sized (# of physical disks required) based on throughput requirements.
• Generally, FCS adapter is limiting factor (200MB/sec per adapter)
With TSM 5.3, Direct I/O is enabled by default for disk storage pool volumes (If desired, DB and Log volumes can be mounted with DIO option under AIX)
• TSM 5.3 removed this option from documentation (AIXDIRECTIO YES|NO)
• Because of direct I/O now being used, vmtune/vmo/ioo tuning is no longer g , g gnecessary for storage pool volumes. It still may provide some minor benefit for TSM DB database backup
Oxford University TSM Symposium 2007 © 2007 IBM Corporation15
IBM Software Group | Tivoli software
I/O Behavior for the TSM Storage Pools
TSM storage pools are written to sequentially (in 256KB blocks)
• Try and keep all pieces of I/O path (sector size segment size PP size• Try and keep all pieces of I/O path (sector size, segment size, PP size, etc.) at least this big or larger.
Consider how many storage pool volumes you have per physicalConsider how many storage pool volumes you have per physical disk.
• If there are too many, then the sequential I/O in effect becomes random because of disk head movement (thrashing)( g)
• Command queuing helps here.
• Some research from hardware storage indicates more than 3 sequential streams to a disk causes the I/O to be more random insequential streams to a disk causes the I/O to be more random in nature.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation16
IBM Software Group
Utilities that can be used for Examining Disk I/O with TSM
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
TSM Disk Tuning Basics – Utilities to Measure Performance (TSM Server side)
For TSM Server, you can use:• Nmon utility (AIX/Linux)
• Iostat (UNIX) (look for disks > 20% busy)• Iostat (UNIX) (look for disks > 20% busy)
• Filemon command (provides detailed perf info at a file level) (AIX)
• Fileplace command (provides fragmentation information) (AIX)
• Server instrumentation trace – average disk response time, InstTput rate
• Client Instrumentation trace – high % of DataVerb and EndTxn time might indicate an disk issue on the server
• Tops/monitor utility
• Expire inventory, backup DB processesExpire inventory, backup DB processes
• Perfmon (Windows)
Oxford University TSM Symposium 2007 © 2007 IBM Corporation18
IBM Software Group | Tivoli software
TSM Disk Tuning Basics – Utilities to Measure Performance (Client side)
For TSM Client, you can use:
• Nmon utility (AIX/Linux)
• Iostat (UNIX) (look for disks > 20% busy)
• Filemon command (provides detailed perf info at a file level) (AIX)
• Fileplace command (provides fragmentation information) (AIX)• Fileplace command (provides fragmentation information) (AIX)
• Client instrumentation trace (high % File I/O or Process Dirs = Red Flag)
T / i ili• Tops/monitor utility
• Perfmon (Windows)
Oxford University TSM Symposium 2007 © 2007 IBM Corporation19
IBM Software Group | Tivoli software
TSM Disk Tuning Basics – The NMON “Weighted Average” Snapshot % Busy
1 02 03 0
Disk was inactive
The Average disk utilization during the 12 snapshots was :
3 04 05 25%
inactive (25+35+90+80+35)/12 = 22%
Th W i ht d A di k6 35%7 90%8 80%
Disk was active
The Weighted Average disk utilization for the time the disk was active was:
(25+35+90+80+35)/5 = 53%8 80%9 35%10 0
Disk was
(25+35+90+80+35)/5 = 53%
Max = > 90%
11 012 0
Disk was inactive
Avg = 22%
WAvg = 53%
Oxford University TSM Symposium 2007 © 2007 IBM Corporation20
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities – NMON Example #1 (Some disks having high I/O)
Disk %Busy bs1 (1st 50) 6/4/2004
Avg. WAvg. Max.
Weighted Avg for disk. This is the critical thing to l k f
708090
100 look for.
304050600
01020
50 16 49 01 17 55 69 74 27 30 60 02 k0 13 11 k1 14 38 32 41 20 27 03 16 18 04 13 29 17 59 12 19 54 28 73 28 66 29 88 86 15 24 43 18 83 44 85 12 84 87
hdis
khd
isk1
hdis
khd
isk1
hdis
k1hd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk1
hdis
hdis
k1hd
isk1
hdis
hdis
khd
isk
hdis
khd
isk
hdis
k1hd
isk1
hdis
k1hd
isk
hdis
k1hd
isk1
hdis
khd
isk1
hdis
khd
isk
hdis
k1hd
isk1
hdis
khd
isk1
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
hdis
khd
isk
Oxford University TSM Symposium 2007 © 2007 IBM Corporation21
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities – NMON Example #2 (Heavily Loaded System)
Disk %Busy atsr4p2 1/9/2006
Avg. WAvg. Max.
708090
100
3040506070
0102030
k2 k0 k1 d0 d1 0 1 2 3 4 5 6 7 8 9 20 21 22 23 24 k3 k4 k5 k6 k7 k8 k9
hdis
k
hdis
k
hdis
k cd cd
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k1
hdis
k 2
hdis
k2
hdis
k2
hdis
k2
hdis
k2
hdis
k
hdis
k
hdis
k
hdis
k
hdis
k
hdis
k
hdis
k
Oxford University TSM Symposium 2007 © 2007 IBM Corporation22
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities – TSM Selects for DB backup and Expiration Performance
select activity cast((end time)as date) as "Date" -select activity,cast((end_time)as date) as Date , (examined/cast((end_time-start_time) seconds as decimal(18,13))*3600) “4KB Pages Backed Up/Hr" -
from summary where activity='FULL_DBBACKUP' and -days(end_time)-days(start_time)=0
select activity,cast((end_time)as date) as "Date", -(examined/cast((end time-start time) seconds as(examined/cast((end_time start_time) seconds as decimal(18,13))*3600) "Objects Examined/Hr" -
from summary where activity='EXPIRATION' and -days(end_time)-days(start_time)=0
Oxford University TSM Symposium 2007 © 2007 IBM Corporation23
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities – Select Example for DB Backup Performance
ACTIVITY Date 4KB Pages Backed Up/Hr------------------ ---------- ---------------------------------FULL_DBBACKUP 2007-02-26 12726000FULL DBBACKUP 2007-03-05 10299600_FULL_DBBACKUP 2007-03-12 9691200FULL_DBBACKUP 2007-03-19 12330000
Rule of Thumb – 8MB/sec is “acceptable” base for DB backup performance. (8MB/sec = 28 GB/hr)
Last column is in # of 4KB pages per hour, we are looking for this number to be greater than ~ 2000000
If rule of thumb not being met this may indicate a problemIf rule of thumb not being met, this may indicate a problem
Oxford University TSM Symposium 2007 © 2007 IBM Corporation24
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities – Select for Expiration Performance Example
ACTIVITY Date Objects Examined/Hr------------------ --------- ----------------------EXPIRATION 2007-02-22 1828800EXPIRATION 2007-02-23 1674000EXPIRATION 2007-02-24 1972800
O 200 02 2 1 2 600EXPIRATION 2007-02-25 1425600EXPIRATION 2007-02-26 2084400EXPIRATION 2007-02-27 1782000EXPIRATION 2007-02-28 2926800EXPIRATION 2007-03-01 2829600EXPIRATION 2007-03-02 2613600EXPIRATION 2007-03-03 2336400EXPIRATION 2007-03-04 2052000EXPIRATION 2007-03-05 2138400EXPIRATION 2007-03-06 1584000EXPIRATION 2007-03-07 2314800EXPIRATION 2007-03-08 2584800EXPIRATION 2007-03-09 2466000EXPIRATION 2007-03-10 2311200EXPIRATION 2007-03-11 2617200EXPIRATION 2007-03-12 2156400EXPIRATION 2007-03-13 1692000
Rule of Thumb – 3,800,000 Objects Examined / Hr
Note: this can indicate disk tuning may be needed or that the DB is fragmented or that additional hardware may be needed for the database to perform properly. .
Oxford University TSM Symposium 2007 © 2007 IBM Corporation25
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities - Server Instrumentation Traces 42 ms for avg disk
read!! (stgpool -------------------------------- Disk Migration Example -------------------
Thread 56 DiskServerThread parent=0 (AIX TID 1798311) 13:47:34.766-->14:03:20.744/dev/rdiskpool_lv3
volume)
6MB/sec!! _Operation Count Tottime Avgtime Mintime Maxtime InstTput Total KB------------------------------------------------------------------------------Disk Read 13835 584.982 0.042 0.000 3.332 6012.2 3517004 Disk Write 14 0.087 0.006 0.000 0.010 32162.6 2804Thread Wait 13784 360.45 0.026 0.000 73.961Unknown 0.459
(Look here 1st)
--------------------------------------------------------------------------------Total 945.978
3720.8 3519808
---------------------------------- Backup TSM DB Example -------------
33 ms for avg read!! (DB V l )Thread 1 LvmDiskServer parent=0 (AIX TID 2064489) 11:51:21.126-->12:05:18.174
Operation Count Tottime Avgtime Mintime Maxtime InstTput Total KB-------------------------------------------------------------------------------Disk Read 24093 793.630 0.033 0.000 2.622 6732.1 5342796Disk Write 125 0.279 0.002 0.000 0.050 1788.7 500Thread Wait 23983 42 133 0 002 0 000 22 656
Volume)
Thread Wait 23983 42.133 0.002 0.000 22.656Unknown 1.004--------------------------------------------------------------------------------Total 837.047
6383.5 5343296
Count/Tottime yields 30 IOPS
Oxford University TSM Symposium 2007 © 2007 IBM Corporation26
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities - Client Instrumentation Trace Example
94% f iFinal Detailed Instrumentation statistics
Elapsed time: 318.117 sec
S i l i ( ) i ( ) d
94% of time spent here!!
Section Total Time(sec) Average Time(msec) Frequency used
------------------------------------------------------------------Client Setup 5.297 5297.0 1Process Dirs 0.050 6.3 8Solve Tree 0.000 0.0 1Compute 0.280 0.0 16396Transaction 0.411 0.0 49219BeginTxn Verb 0.000 0.0 3File I/O 11.885 0.7 16406Compression 0.000 0.0 0Encryption 0.000 0.0 0
Client side
ypCRC 0.000 0.0 0Delta 0.000 0.0 0Data Verb 298.322 18.2 16396Confirm Verb 0.250 35.7 7EndTxn Verb 0.551 183.7 3
Server/NW Side
------------------------------------------------------------------
Oxford University TSM Symposium 2007 © 2007 IBM Corporation27
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities - Filemon Example 1 Busiest logical volumes disks reported during
Most Active Logical Volumes-----------------------------------------------------------------util #rblk #wblk KB/s volume description
reported during run period
p-----------------------------------------------------------------0.96 0 123723608 82139.1 /dev/lv-stg-c /stg-c0.90 0 123722072 82138.0 /dev/lv-stg-d /stg-d
Most Active Physical VolumesMost Active Physical Volumes-----------------------------------------------------------------util #rblk #wblk KB/s volume description
-----------------------------------------------------------------0.65 0 61860520 41068.7 /dev/hdisk8 N/A/ / /0.65 0 61861672 41069.4 /dev/hdisk5 N/A0.59 0 61861552 41069.4 /dev/hdisk7 N/A0.55 0 61861936 41069.6 /dev/hdisk6 N/A
Busiest physicalBusiest physical volumes disks reported during run period
Oxford University TSM Symposium 2007 © 2007 IBM Corporation28
p
IBM Software Group | Tivoli software
TSM Disk Tuning Utilities - Filemon Example 2
BlksizeVOLUME: /dev/hdisk8 description: N/Awrites: 241743 (0 errs)write sizes (blks): avg 255.9 min max 256 sdev 4.5
Blksize of write
gwrite times (msec): avg 2.023 min 0.237 max 1545.563 sdev 13.474write sequences: 241743write seq. lengths: avg 255.9 min 8 max 256 sdev 4 54.5
seeks: 241743 (100.0%)seek dist (blks): init 34114816,
avg 6316189.4 min 1008 max 81789184 sdev 13621796.3seek dist (%tot blks):init 24.64729,
avg 4.56332 min 0.00073 max 59.09109 sdev 9.84148time to next req(msec): avg 2.911 min 0.260 max 1553.078 sdev
19.307th h t 41068 7 KB/throughput: 41068.7 KB/secutilization: 0.65 How far for
avg seek% Busy
Oxford University TSM Symposium 2007 © 2007 IBM Corporation29
IBM Software Group
TSM Disk Tuning and AIX
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
The AIX I/O Model and TSM
Raw
LVR
aw dis
Logical File System
At top is application layer. TSM uses cache buffers to avoid DB I/O
TSM App
BUFPOOLSIZE
Local FSJFS/JFS2
Remote FS NFS
Vssks
Direct I/O being used with TSM 5.3 (no VMM) for stgpool volumes
AIX DIRECT IOVMM
LVM
Queues exist for HBAs and hdisks•Queue_depth is # of outstanding IO requests that can be done at a disk l l
AIX DIRECT IO
(DIO)
Disk
Device Driver (s)Disk Subsystem (optional)
level•Lg_term_dma is a memory area on the FC adapter used to store IO commands and data
Physical diskQueue _depthLg_term_dma
Cache
Disk
Write Cache - ack sent back to application
Subsystem R/W CacheDisk memory to Store commands
Physical disk
Oxford University TSM Symposium 2007 © 2007 IBM Corporation31
IBM Software Group | Tivoli software
TSM Disk Tuning and AIX
When designing the Storage Pools, try to spread them out across as many disks as you can. Two ways you can do this when using TSM Server on AIX: (See diagrams on following 2 pages)TSM Server on AIX: (See diagrams on following 2 pages)
1. Use TSM to do the I/O spreading across the disks.
2. Use AIX LVM to do the I/O spreading across the disks.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation32
IBM Software Group | Tivoli software
TSM Storage Pools with AIX – “Spreading the I/O” Using TSM
A B C D E F
A B C D E F
Drawer 1
Drawer 2A B C D E F
A B C D E F
Drawer 2
Drawer 3
D 4A B C D E F
A B C D E F
Drawer 4
Drawer 5
Technique #1 – using TSM to “spread the I/O”. In this diagram, we have:
6 LUNs of 4+1 RAID5 (LUNS A-F)6 LUNs of 4+1 RAID5 (LUNS A F)
Each LUN has 1 AIX Logical Volume and 1 Stgpool volume
Each storagepool volume spread across 5 volumes
Oxford University TSM Symposium 2007 © 2007 IBM Corporation33
IBM Software Group | Tivoli software
TSM Storage Pools with AIX – “Spreading the I/O” Using AIX LVM
A B C D E F
A B C D E F
Drawer 1
Drawer 2A B C D E F
A B C D E F
Drawer 2
Drawer 3
D 4A B C D E F
A B C D E F
Drawer 4
Drawer 5
Technique #2 – using AIX to “spread the I/O”. In this diagram, we have:
6 LUNs of 4+1 RAID5 – 6 Hdisks to AIX (LUNS A-F)
LV t d ith LVM t i i i d hdi kLVs created with LVM striping, maximum spread across hdisks
LV has 6 hdisks volumes and 6 TSM Stgpool volume defined in it
Total of 6 storage pool volumes spread across 30 volumes
Oxford University TSM Symposium 2007 © 2007 IBM Corporation34
g p p
IBM Software Group
Recommendations for TSM and Various Disk Technologies/Subsystems
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
Recommendations TSM and Disk Subsystems
Raid types and use with TSM
DS4xxx RecommendationsDS4xxx Recommendations
SATA Recommendations
SVC (San Volume Controller) and Storage Pools Recommendations
SVC (San Volume Controller) and TSM DB/Log Recommendations
Oxford University TSM Symposium 2007 © 2007 IBM Corporation36
IBM Software Group | Tivoli software
Mirroring / RAID 1 / RAID 5 / RAID 10TSM mirroring recommended over hardware mirroringTSM mirroring recommended over hardware mirroring
• Partial writes can potentially corrupt TSM DB
Hardware mirroring is usually faster• Hardware mirroring is usually faster
• Hardware mirroring comes at an additional cost.
RAID0 is ideal for the recovery log (when used with TSM Mirroring)
Buy more cache, this helps offset the extra reads and writes that occur with RAID5 Buy as much cache as you can affordRAID5. Buy as much cache as you can afford.
RAID5 vs. RAID10: RAID10 provides better read performance, at a much higher cost.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation37
IBM Software Group | Tivoli software
Cache Recommendations for using TSM with the DS4xxx
Consider disabling write cache mirroring (WCM) on DS4xxx – understand implications of doing this
Start/stop cache flushing should be set to 50/50
TSM Database
• Write cache enabledR d h di bl d ( t t bl it f DB b k )• Read cache disabled (may want to enable it for DB backup purposes)
TSM Recovery Log
• Write cache enabled• Read cache disabled• Read cache disabled
TSM Storage Pools
• Read/Write cache enabled (pre-fetch or cache read-ahead = 1 for DS4xxx firmware level of 6.1 or higher)
Oxford University TSM Symposium 2007 © 2007 IBM Corporation38
IBM Software Group | Tivoli software
Segment Size Recommendations for using TSM with the DS4xxx
TSM database
• 64KB segment sizeg
TSM recovery log
• 64KB segment size
TSM storage poolsTSM storage pools
• Set the segment size equal to the stripe size
Oxford University TSM Symposium 2007 © 2007 IBM Corporation39
IBM Software Group | Tivoli software
Recommendations for using TSM with the DS4xxx (Parameters)
Because of storage pools perform heavy sequential IO, you should:
• lg_term_dma attribute of the fcs adapter should be set to 0x800000
• Note this was moved to slide 42 and this slide will be removed• Note – this was moved to slide 42, and this slide will be removed.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation40
IBM Software Group | Tivoli software
Recommendations for using TSM with AIX 5.3
AIX tuning parameters (no outage or reboot required).• Set lru_file_repage=0 via vmo
• Set j2 maxPageReadAhead=128 via iooSet j2_maxPageReadAhead 128 via ioo
• Set maxclient=maxperm=80% (default) via vmo
• Set strict_maxclient=1 (default) via vmo
• Set minfree=max(960, 120*#lcpus/#mempools) via vmo
• Set maxfree=minfree + max page ahead x #lcpus/#mempools via vmo
• Set lru poll interval=8 via vmoSet lru_poll_interval 8 via vmo.
Oxford University TSM Symposium 2007 © 2007 IBM Corporation41
IBM Software Group | Tivoli software
Recommendations for using TSM with the DS4xxx (AIX)
Change HDISK queue depth (for both FC and SATA disks) -Change HDISK queue depth (for both FC and SATA disks) Requires Reboot.
• Queue_depth=2048/(number of LUNS from the DS4800 controller)
• Command to change the hdisk queue depth is:• Command to change the hdisk queue depth is:chdev -l <hdisk#> -a queue_depth=<new value> -P
Change Fibre Channel Adapter parameters Requires RebootChange Fibre Channel Adapter parameters - Requires Reboot
• For FCs for disknum cmd elems = 512num_cmd_elems = 512max_xfer_size = 0x200000lg_term_dma= 0x800000
command is: (should be all one line)chdev -l <fcs#> -a num_cmd_elems=512 -a max xfer size=0x200000 P
Oxford University TSM Symposium 2007 © 2007 IBM Corporation42
max_xfer_size=0x200000 -P
IBM Software Group | Tivoli software
Overall Findings from DS4800 StudyConfiguration using Vertical LUNs out performs Horizontal LUNsConfiguration using Vertical LUNs out performs Horizontal LUNs
Configuration using striped LUNS Across LV used with DEVCLASS=FILE sequential volumes outperforms DEVCLASS=DISK random volumes
Mirror write consistency impacts performance
Mixing TSM components impacts performance of backupsMixing TSM components impacts performance of backups
Segment size for storage pool LUNs should be equal to stripe size
Sequential files out performs random disk
DS4800 cache settings are different depending on how the LUN is used
Oxford University TSM Symposium 2007 © 2007 IBM Corporation43
IBM Software Group | Tivoli software
SATA disk
Recommend using fastest disks available for use with DB and Recovery Log
• SATA-1 disks are generally 7200 RPM and can handle fewer IOPS
SATA technology is still improving.
Earlier generations had limitations
•NCQ – (Native Command Queuing) optimizes order of I/O requests to diskdisk
•Adapter Compatibility
Oxford University TSM Symposium 2007 © 2007 IBM Corporation44
IBM Software Group | Tivoli software
SATA disk
Check your throughput requirements when evaluating SATA technology for use with storage pools.
Good for less frequently accessed TSM type storage (HSM for example) or Tier-2 type storage.
Design the storage pools so that you attempt to keep storage pool IODesign the storage pools so that you attempt to keep storage pool IO sequential all the way through to the hard disk drive
• Keep a 1-1 relationship if possible between LUN-Logical Volume-File p p p gSystem-Storagepool volume
Oxford University TSM Symposium 2007 © 2007 IBM Corporation45
IBM Software Group | Tivoli software
SVC (San Volume Controller)
Still need more research in this area – some experience through performance related PMRs.
Strive again for keeping the sequential throughput performance for storagepool volumes and high IOPS for database recovery log volumes
Oxford University TSM Symposium 2007 © 2007 IBM Corporation46
IBM Software Group | Tivoli software
SVC (San Volume Controller) Overview
MDISK Group
DS4800 subsystem
Oxford University TSM Symposium 2007 © 2007 IBM Corporation47
IBM Software Group | Tivoli software
SVC (San Volume Controller) Overview
OS hdisk
Oxford University TSM Symposium 2007 © 2007 IBM Corporation48
IBM Software Group | Tivoli software
SVC + DS4xxx And StoragePools (SATA drives)
Goal is to maximize the opportunity to maintain sequential performance and read ahead capability.
• No more than 2 LUNS per SATA array
• For 2 arrays, this would mean 4 mdisks
DS4xxx - Stripe size / segment size: 256KDS4xxx - Read cache = YESDS4xxx - Write Cache=YESDS4xxx Write Cache mirror=NODS4xxx - Write Cache mirror=NODS4xxx - Read Prefetch=YES. (value > 1) SVC – Read Cache = YESSVC – Write Cache = YES
Oxford University TSM Symposium 2007 © 2007 IBM Corporation49
IBM Software Group | Tivoli software
SVC And Storage Pools (SATA drives) Sample customer
Lun1 = Mdisk1 Lun2 = Mdisk2Step3: These 2 LUNS are
presented to the SVC as 2 mdisks
3
Step2: This array then has 2 LUNs carved from it, each 1TB in size)
2
Step1: Create 4+P 500GB
each 1TB in size)
1 pRAID5 Array (~2TB)
1
Oxford University TSM Symposium 2007 © 2007 IBM Corporation50
IBM Software Group | Tivoli software
SVC And StoragePools (SATA drives) Sample customer
Mdisk1 Mdisk2X1A X2A
VDisk1 VDisk2X1A X2A “Image Mode” VDisks
t dX1BX1C
X1D
X2BX2CX2D
X1BX1C
X1D
X2BX2CX2D
created
1 Volume Group – 2 PVs
2 LVs striped across the 2
X1EX1FX1G
X2EX2FX2G
X1EX1FX1G
X2EX2FX2G
PVs
1 Filesystem Per LV
Pre-allocate Storagepool X1GX1HX1I
X2GX2HX2I
X1GX1HX1I
X2GX2HX2I
g pvolumes across the FS
Oxford University TSM Symposium 2007 © 2007 IBM Corporation51
IBM Software Group | Tivoli software
SVC And Database/Log (Fibre drives) Sample customer (100GB Database)
Lun1 = Mdisk1 Lun2 = Mdisk2Step3: These 2 LUNS are
presented to the SVC as 2 mdisks
3
Step2: This array then has 2 LUNs carved from it, each 400GB in size)
2
Step1: Create 2 3+P
each 400GB in size)
1 p146GB RAID5 Arrays (~800GB)
1
Oxford University TSM Symposium 2007 © 2007 IBM Corporation52
IBM Software Group | Tivoli software
SVC And Database/Log (Fibre drives) Sample customer
Mdisk1 Mdisk2X1A X2AX1BX1C
X1D
X2BX2CX2D
VDisk1X1AX2A
VDisk2X1DX2D
VDisk3X1GX2G
X1EX1FX1G
X2EX2FX2G
X1BX2BX1C
X1EX2EX1F
X1HX2HX1IX1G
X1HX1I
X2GX2HX2I
X2C X2F X2I
VG1 – TSM DB VG2 – Log
2 LVs, 2 DBvols / LV 1 LV, 2 Log volumes
Oxford University TSM Symposium 2007 © 2007 IBM Corporation53
IBM Software Group | Tivoli software
SVC And Database/Log (Fibre drives)
Again, goal is to maximize the IOPS to the fibre drives.
• Consider wasting space on fastest drives to gain benefit of performanceg g
• Strive to isolate the components if possible or use extra space for install images or temp space
DS4xxx - Stripe size / segment size: 64KDS4xxx - Read cache = NODS4xxx - Write Cache=YESDS4xxx Write Cache mirror=NO (Use TSM mirroring)DS4xxx - Write Cache mirror=NO (Use TSM mirroring)DS4xxx - Read Prefetch=NO. (value = 0) SVC – Read Cache = NOSVC – Write Cache = YES
Oxford University TSM Symposium 2007 © 2007 IBM Corporation54
IBM Software Group
Issues Still Needing Further Study
Oxford University TSM Symposium 2007 © 2007 IBM Corporation
IBM Software Group | Tivoli software
Issues Not Addressed YetRAW vs JFS2 for Storage PoolsRAW vs. JFS2 for Storage Pools
• DIO might give better performance on backup, but may be hindrance on migration
– Migration not studied in detail during study due to time
Analysis of performance impact on using TSM mirroring
Analysis of impact on using TSM with other applications
• Has always been recommendation to isolate TSM on disk subsystem for DRy y
Analysis of impact on sharing disks with other TSM Servers
E t i t ti ith diff t iExtensive testing with different size arrays
Need more testing with SATA-IO for use as Storage Pools
Oxford University TSM Symposium 2007 © 2007 IBM Corporation56
IBM Software Group | Tivoli software
Questions?
Oxford University TSM Symposium 2007 © 2007 IBM Corporation57
IBM Software Group | Tivoli software
TSM Disk Tuning Reference Information
Ti li St M P f d T i G idTivoli Storage Manager Performance and Tuning Guide
• http://publib.boulder.ibm.com/infocenter/tivihelp/v1r1/index.jsp?toc=/com.ibm.itstorage.doc/toc.xml
AIX 5L Performance Tools Handbook – SG24-6039
• http://www.redbooks.ibm.com/redbooks/pdfs/sg246039.pdf
NMON Tool (internal use only)
• http://www.ibm.com/collaboration/wiki/display/WikiPtype/nmon
Oxford University TSM Symposium 2007 © 2007 IBM Corporation58