Date:      Sat, 16 Dec 2006 14:54:06 +0100
From:      Stefan Ehmann <shoesoft@gmx.net>
To:        obrien@freebsd.org
Cc:        Peter Jeremy <peterjeremy@optushome.com.au>, freebsd-current@freebsd.org, Steve Kargl <sgk@troutmask.apl.washington.edu>
Subject:   Re: Let's use gcc-4.2, not 4.1 -- OpenMP
Message-ID:  <200612161454.08351.shoesoft@gmx.net>
In-Reply-To: <20061215205138.GB55276@dragon.NUXI.org>
References:  <20061213192150.CF83D16A417@hub.freebsd.org> <200612151914.53705.shoesoft@gmx.net> <20061215205138.GB55276@dragon.NUXI.org>

On Friday 15 December 2006 21:51, David O'Brien wrote:
> On Fri, Dec 15, 2006 at 07:14:53PM +0100, Stefan Ehmann wrote:
> > > CPU: AMD Athlon(TM) XP 2700+ (2166.44-MHz 686-class CPU)
>
> ..
>
> > Settings/Compiler           | gcc-3.4 | gcc-4.1 | gcc-4.2
> > ----------------------------+---------+---------+---------
> > -O2                         |   6.46s |   6.67s |   6.38s
> > -O2 -funroll-loops          |   4.44s |   4.16s |   4.02s
> > -O2 -march=athlon-xp -fun.. |   4.39s |   4.38s |   4.26s
> > -O3                         |   6.14s |   5.23s |   5.16s
> > -O3 -funroll-loops          |   4.24s |   4.87s |   4.95s
> > -O3 -march=athlon-xp -fun.. |   4.19s |   4.90s |   5.07s
>
> A fine example that -O3 isn't always better than -O2.
> I wonder if you're blowing the L2 cache.  IIRC, all Athlon XP 2700+
> are the Thoroughbred core, which has only 256KB L2.
Yes, only 256KB L2 cache here.

Results on a Pentium M with 2MB L2 cache were quite similar: with loop
unrolling, -O2 was still faster than -O3, though -O3 was not as much slower
as on the Athlon XP.
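
To put numbers on the -O2 vs. -O3 gap with loop unrolling, here is a small sketch that works out the relative difference from the Athlon XP timings in the quoted table. The dictionary layout and the function name o3_vs_o2_percent are just illustrative choices; only the timings come from the table.

```python
# Timings in seconds, copied from the quoted table (Athlon XP 2700+).
timings = {
    "gcc-3.4": {"-O2 -funroll-loops": 4.44, "-O3 -funroll-loops": 4.24},
    "gcc-4.1": {"-O2 -funroll-loops": 4.16, "-O3 -funroll-loops": 4.87},
    "gcc-4.2": {"-O2 -funroll-loops": 4.02, "-O3 -funroll-loops": 4.95},
}


def o3_vs_o2_percent(row):
    """Return how much slower (+) or faster (-) -O3 is than -O2, in percent."""
    o2 = row["-O2 -funroll-loops"]
    o3 = row["-O3 -funroll-loops"]
    return (o3 - o2) / o2 * 100.0


for compiler, row in timings.items():
    print(f"{compiler}: {o3_vs_o2_percent(row):+.1f}%")
```

On these figures, -O3 with unrolling comes out roughly 17% slower than -O2 for gcc-4.1 and about 23% slower for gcc-4.2, while for gcc-3.4 it is about 4.5% faster.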

As a side note: the (stripped) gcc42 binaries were up to twice the size of
the gcc34 binaries.


