
Chapter 3. Analyzing performance bottlenecks  85
Draft Document for Review May 4, 2007 11:35 am 4285ch03.fm
Disk I/O can take a relatively long time and disk queues will become full, so the CPUs will 
be idle or have low utilization because they wait long periods of time before processing the 
next request.
The disk subsystem is perhaps the most challenging subsystem to properly configure. 
Besides looking at raw disk interface speed and disk capacity, it is key to also understand the 
workload: Is disk access random or sequential? Is there large I/O or small I/O? Answering 
these questions provides the necessary information to make sure the disk subsystem is 
adequately tuned.
Disk manufacturers tend to showcase the upper limits of their drive technology’s throughput. 
However, taking the time to understand the throughput of your workload will help you 
understand what true expectations to have of your underlying disk subsystem.
Table 3-2   Exercise showing true throughput for 8 KB I/Os for different drive speeds
Random read/write workloads usually require several disks to scale. The bus bandwidths of 
SCSI or Fibre Channel are of lesser concern. Larger databases with random access 
workload will benefit from having more disks. Larger SMP servers will scale better with more 
disks. Given the I/O profile of 70% reads and 30% writes of the average commercial 
workload, a RAID-10 implementation will perform 50% to 60% better than a RAID-5.
Sequential workloads tend to stress the bus bandwidth of disk subsystems. Pay special 
attention to the number of SCSI buses and Fibre Channel controllers when maximum 
throughput is desired. Given the same number of drives in an array, RAID-10, RAID-0, and 
RAID-5 all have similar streaming read and write throughput.
There are two ways to approach disk bottleneck analysis: real-time monitoring and tracing. 
Real-time monitoring must be done while the problem is occurring. This may not be 
practical in cases where system workload is dynamic and the problem is not repeatable. 
However, if the problem is repeatable, this method is flexible because of the ability to add 
objects and counters as the problem becomes well understood.
Tracing is the collecting of performance data over time to diagnose a problem. This is a 
good way to perform remote performance analysis. Some of the drawbacks include the 
potential for having to analyze large files when performance problems are not repeatable, 
and the potential for not having all key objects and parameters in the trace and having to 
wait for the next time the problem occurs for the additional data.
vmstat commandOne way to track disk usage on a Linux system is by using the vmstat tool. The columns of 
interest in vmstat with respect to I/O are the bi and bo fields. These fields monitor the 
movement of blocks in and out of the disk subsystem. Having a baseline is key to being able 
to identify any changes over time.
Disk speed Latency Seek 
time
Total random 
access timea
a. Assuming that the handling of the command + data transfer < 1 ms, total random
access time = latency + seek time + 1 ms.
I/Os per 
second 
per diskb
b. Calculated as 1/total random access time.
Throughput 
given 8 KB I/O
15 000 RPM 2.0 ms 3.8 ms 6.8 ms 147 1.15 MBps 
10 000 RPM 3.0 ms 4.9 ms 8.9 ms 112 900 KBps
7 200 RPM 4.2 ms 9 ms 13.2 ms 75 600 KBps