Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 31 Mar 2003 22:01:25 -0500
From:      "Michael C. Brenner" <mbrenner@kaibren.com>
To:        Jason Andresen <jandrese@mitre.org>, FreeBSD Stable List <freebsd-stable@freebsd.org>
Subject:   Re: vinum performance
Message-ID:  <5.2.0.9.2.20030331214429.02677a90@gw.kaibren.com>
In-Reply-To: <3E88B601.90802@mitre.org>
References:  <3E88AECD.10607@liwing.de> <20030330125138.K23911@leelou.in.tern> <3E870CC7.5000204@mac.com> <20030330175605.E23911@leelou.in.tern> <3E87204C.5060304@ludd.luth.se> <3E88524A.1060600@mitre.org> <3E88AECD.10607@liwing.de>

next in thread | previous in thread | raw e-mail | index | archive | help
At 04:41 PM 3/31/2003, Jason Andresen wrote:
>>>>>Ok. But I still don't understand why RAID 5 write performance is _so_ bad.
>>>>>The CPU is not the bottle neck, it's rather bored. And I don't understand
>>>>>why RAID 0 doesn't give a big boost at all. Is the ahc driver known to be
>>>>>slow?
>>>>
>>>>(Both of these were on previously untouched files to prevent any 
>>>>caching, and the "write" test is on a new file, not rewriting an old one)
>Write speed:
>81920000 bytes transferred in 3.761307 secs (21779663 bytes/sec)
>Read speed:
>81920000 bytes transferred in 3.488978 secs (23479655 bytes/sec)
>
>But on the RAID5:
>Write speed:
>81920000 bytes transferred in 17.651300 secs (4641018 bytes/sec)
>Read speed:
>81920000 bytes transferred in 4.304083 secs (19033090 bytes/sec)

Writing to a RAID5 stripe set requires that all disks in the array 
successfully report completion before the RAID5 controller's buffer can be 
released back to the cache. (Applies to either software or hardware raid.) 
If you are doing a large block write (like dd) you can easily fill the 
cache on most controllers. Once the cache is full, the controller slows 
each write to the LONGEST completion time of each spindle in the array. ECC 
calculation becomes part of the latency also. In a 5 drive system (other 
than one where the cache is larger than the largest file being written as 
in a large EMC array) the writes are always about 4-5 times longer than the 
reads. Tuning stripes and blocking factors can speed up a specific transfer 
but RAID5 has always been slow to write large data and best for read mostly 
data.

Read operations benefit from RAID5 or mirrors. Now the shortest completion 
time of the minimal drive set is the gating event. The first set of drives 
to deliver the data block ends the operation. This makes a 2 to 1 
difference into a 4 to one difference.

MB



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?5.2.0.9.2.20030331214429.02677a90>