Date: Sun, 24 Apr 2005 22:20:20 -0600
From: Scott Long <scottl@samsco.org>
To: Julian Elischer <julian@elischer.org>
Cc: Kris Kennaway <kris@obsecurity.org>
Subject: I/O Benchmarking [Re: FreeBSD 6 is coming too fast]
Message-ID: <426C7004.8020003@samsco.org>
In-Reply-To: <426C6B1D.3040704@elischer.org>
References: <20050424175543.71041.qmail@web51805.mail.yahoo.com>
 <20050424151517.O68772@lexi.siliconlandmark.com>
 <3822.216.177.243.38.1114385370.localmail@webmail.dnswatch.com>
 <20050425000459.GA28667@xor.obsecurity.org>
 <6.2.1.2.0.20050424204611.072105a0@64.7.153.2>
 <20050425010242.GA44110@xor.obsecurity.org>
 <6.2.1.2.0.20050424210422.03d22990@64.7.153.2>
 <20050425014453.GA59981@xor.obsecurity.org>
 <426C6B1D.3040704@elischer.org>
Julian Elischer wrote:
> Kris Kennaway wrote:
>
>> Measuring disk device performance (i.e. running a benchmark against
>> the bare device) and filesystem performance (writing to a filesystem
>> on the device) are very different things.
>
> I wish people would stop trying to deny that we have serious work in
> front of us to get the VFS and disk IO figures back to where they were
> before.
>
> There ARE slowdowns, and I have seen them both with tests on the basic
> hardware and through the filesystems. I don't know why this surprises
> people, because we still have a lot of work to do in the interrupt
> latency field, for example, and I doubt that even PHK would say that
> there is no work left to do in geom.
> Where we are now is closing in on "feature complete". Now we need to
> profile and optimise.

You are absolutely right. However, Kris is also absolutely right that
I/O is hard to profile. These two statements are not mutually exclusive.
What I don't want is for people to run around quoting how fast bonnie
benchmarks their memory controller; that does nothing to characterize
the I/O subsystem. Here is what I want to see happen:

Step 1: Get the easy crap out of the way

Step 1a: Figure out a reliable way to measure sequential read/write
performance of just the driver and the hardware. Prove that no needless
copy operations are going on.

Step 1b: Measure sequential read/write through the buffer cache. Prove
that no needless copy operations are going on.

Step 1c: Measure sequential read/write through the syscall layer. Prove
that no needless copy operations are going on.

The only thing that should affect sequential speed is the speed of the
hardware, buses, and memory controller. If anything in the OS is
standing in the way, we need to weed it out. Then we need to forget
about sequential performance; that's the realm of unscrupulous IDE
marketeers and linnex kiddies. I don't really care how fast we can copy
20 bazillion terabytes of 0's; I want to know how many transactions per
second our databases can do, how scalable our mail servers are, etc.
None of that has anything to do with sequential I/O performance.

Step 2: Do the real work

Step 2a: Measure transaction latency and throughput through the driver
and hardware. Profile lock contention. Measure interrupt latency.

Step 2b: Measure VFS latency in both specfs and ufs. Profile lock
contention and usage. Compare contention against the driver.

Step 2c: Profile the buffer cache. Are pages being cached effectively?
Are they being cleaned efficiently? Can we fix the "lemming syncer"?
Measure lock contention.

I imagine that this is where the real work is. And lots of it.

Step 3: Beat the benchmarks

Step 3a: Figure out what is being tested and how. How do the data
sources and sinks work? Are other OS's beating the benchmarks by
cheating here? Can we create our own high-speed sources and sinks?

Step 3b: ???

This is important even if it does sound slimy. We could have a real
kick-ass I/O subsystem, but be beaten by a linux rig that fools the
benchmarks. Kinda like how Intel has benchmarks that show that their
Pentium4 line is the fastest thing on the planet, when in reality it's
just smoke and mirrors. Gotta win the PR race, here.

Scott
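[A minimal sketch of what a step 1a measurement might look like, assuming a
FreeBSD disk device node; the /dev/ad0 path, the 128 KB read size, and the
512 MB total are placeholders for whatever hardware is under test. Reads
against the character device node go through GEOM and the driver but bypass
the filesystem and buffer cache, so the figure reflects the driver/hardware
path that step 1a is after.]

#include <sys/types.h>
#include <sys/time.h>
#include <err.h>
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <unistd.h>

#define BLOCKSIZE	(128 * 1024)		/* bytes per read(); placeholder */
#define TOTAL		(512 * 1024 * 1024)	/* total bytes to read; placeholder */

int
main(int argc, char **argv)
{
	/* device node is a placeholder; point it at the disk under test */
	const char *dev = (argc > 1) ? argv[1] : "/dev/ad0";
	static char buf[BLOCKSIZE];
	struct timeval start, end;
	off_t done = 0;
	ssize_t n;
	double secs;
	int fd;

	if ((fd = open(dev, O_RDONLY)) < 0)
		err(1, "open %s", dev);

	gettimeofday(&start, NULL);
	while (done < TOTAL) {
		/* sequential reads straight off the device node */
		if ((n = read(fd, buf, BLOCKSIZE)) <= 0)
			err(1, "read");
		done += n;
	}
	gettimeofday(&end, NULL);

	secs = (end.tv_sec - start.tv_sec) +
	    (end.tv_usec - start.tv_usec) / 1e6;
	printf("%jd bytes in %.3f s = %.1f MB/s\n",
	    (intmax_t)done, secs, done / secs / (1024.0 * 1024.0));

	close(fd);
	return (0);
}

[For step 2a-style numbers, the same skeleton can issue small reads at
random sector-aligned offsets (lseek() before each read()) and record the
per-call time deltas instead of one aggregate rate, giving a crude
transaction-latency distribution rather than a throughput figure.]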