Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 10 Nov 1996 12:48:45 -0800
From:      Amancio Hasty <hasty@rah.star-gate.com>
To:        hackers@freebsd.org
Subject:   PPRO optimizations?
Message-ID:  <199611102048.MAA05053@rah.star-gate.com>

next in thread | raw e-mail | index | archive | help

Tnks,
	Amancio


Subject: 
             PPro optimiz. (multiple accumulators)
       Date: 
             Sun, 10 Nov 1996 11:56:34 -0700
       From: 
             "H.W. Stockman" <hwstock@swcp.com>
Organization: 
             Southwest Cyberport
 Newsgroups: 
             comp.lang.asm.x86, comp.sys.intel


I heard that Intel has a document with
examples of "multiple accumulator" programming
for the PPro.  One example is supposedly
like saxpy() or daxpy(), and ostensibly
the speedup for the multi-accumulator
approach was something like 150 MFLOPs
vs. 60 MFLOPs.

Has anyone seen this example... and could you
tell me where I might get the document? I've
see Terje Mathisen's excellent sqrt2(1/x)
example, and am looking to increase my bag
of tricks...




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199611102048.MAA05053>