Date:      Sat, 23 Dec 1995 20:34:18 -0800
From:      David Greenman <davidg@Root.COM>
To:        Torbjorn Granlund <tege@matematik.su.se>
Cc:        freebsd-hackers@freebsd.org
Subject:   Re: Pentium bcopy 
Message-ID:  <199512240434.UAA00489@corbin.Root.COM>
In-Reply-To: Your message of "Sat, 23 Dec 95 19:57:23 PST." <199512240357.TAA00460@corbin.Root.COM> 

>>   Anyway, your optimization looks interesting and I do intend to try it out.
>>Thanks for your efforts and please don't get too discouraged.
>
>   I need to do some more testing, but a quick test shows that for copying
>page-sized amounts, it's about 5% faster than bcopy on a 150MHz P6 (Orion)
>and about 25% faster on a 90MHz Pentium (Triton, PB cache).
>   ...not 5 times faster, but definitely an improvement. Thanks!

   Whoops! ...that test was a bit too quick. Okay, so I was off by 12288 bytes
:-) (SIZE was 4096 longwords in the test, which put the copy size at 16K
bytes). For 4096 bytes (1024 longwords):

150MHz P6:
[corbin:davidg] time ./copytest
copy 1170
bcopy 1675
2.847u 0.007s 0:02.86 99.3% 37+204k 0+0io 0pf+0w

90MHz P5:
[implode:davidg] time ./copytest
copy 1836
bcopy 5204
7.041u 0.007s 0:07.05 99.8% 36+205k 0+0io 0pf+0w
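
The copytest source isn't included in this message. As a rough sketch only, a harness of this general shape could produce figures like those above. Everything in it is an assumption: the names copy_words, copy_libc and time_copies_ms are hypothetical, the plain C loop merely stands in for the optimized routine under test, and memcpy stands in for bcopy. It also shows the sizing slip described earlier: a count of 4096 longwords is 16K bytes, while a 4096-byte page is 1024 longwords.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#include <time.h>

#define COPY_BYTES 4096                              /* one page */
#define COPY_WORDS (COPY_BYTES / sizeof(uint32_t))   /* 1024 longwords */

typedef void (*copy_fn)(uint32_t *, const uint32_t *, size_t);

/* Hypothetical stand-in for the routine under test: copies nwords
 * 32-bit longwords, one at a time. */
static void copy_words(uint32_t *dst, const uint32_t *src, size_t nwords)
{
    assert(dst != NULL && src != NULL);
    for (size_t i = 0; i < nwords; i++)
        dst[i] = src[i];
}

/* Baseline: the library copy (memcpy used here in place of bcopy). */
static void copy_libc(uint32_t *dst, const uint32_t *src, size_t nwords)
{
    memcpy(dst, src, nwords * sizeof(uint32_t));
}

/* CPU time, in milliseconds, spent making `iters` copies with `fn`. */
static long time_copies_ms(copy_fn fn, uint32_t *dst, const uint32_t *src,
                           size_t nwords, long iters)
{
    clock_t start = clock();
    for (long i = 0; i < iters; i++)
        fn(dst, src, nwords);
    return (long)((double)(clock() - start) * 1000.0 / CLOCKS_PER_SEC);
}
```

A caller would time both routines over the same buffers and iteration count and print the two results. An optimizing compiler may hoist or remove the repeated copies, so a real test would need to guard against that.
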

   So for the P6 it's about 43% faster (1675/1170, or about 1.43 times the
speed), and for the P5 it's about 183% faster (5204/1836, or about 2.83 times
the speed).
The good numbers require that the thing being copied fits in the L1 cache, so
it will be interesting to see how much it improves more 'real world' sorts of
things (like paging performance and filesystem cache reads).
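
For reference, the comparison can be sketched as a small calculation. This assumes the two figures printed by copytest are elapsed times where lower is better (on each machine they sum to roughly the reported user time in milliseconds); the helper name percent_faster is hypothetical.

```c
#include <assert.h>

/* Percentage by which the new routine is faster than the baseline,
 * given both times in the same units: (baseline / improved - 1) * 100. */
static double percent_faster(long baseline, long improved)
{
    assert(baseline > 0 && improved > 0);
    return 100.0 * (double)(baseline - improved) / (double)improved;
}
```

With the timings above, percent_faster(1675, 1170) comes to roughly 43 and percent_faster(5204, 1836) to roughly 183, which correspond to speed ratios of about 1.43 and 2.83.
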
   Anyway, thanks again for the code.

-DG


