Date: Sun, 10 Nov 1996 12:48:45 -0800
From: Amancio Hasty <hasty@rah.star-gate.com>
To: hackers@freebsd.org
Subject: PPRO optimizations?
Message-ID: <199611102048.MAA05053@rah.star-gate.com>
Thanks,
Amancio

Subject: PPro optimiz. (multiple accumulators)
Date: Sun, 10 Nov 1996 11:56:34 -0700
From: "H.W. Stockman" <hwstock@swcp.com>
Organization: Southwest Cyberport
Newsgroups: comp.lang.asm.x86, comp.sys.intel

I heard that Intel has a document with examples of "multiple accumulator" programming for the PPro. One example is supposedly like saxpy() or daxpy(), and ostensibly the speedup for the multi-accumulator approach was something like 150 MFLOPs vs. 60 MFLOPs.

Has anyone seen this example, and could you tell me where I might get the document? I've seen Terje Mathisen's excellent sqrt2(1/x) example, and am looking to increase my bag of tricks...
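For readers unfamiliar with the term, here is a minimal sketch of what "multiple accumulator" code usually looks like. It is not the example from the Intel document, which the post only describes secondhand. It uses a dot product rather than saxpy()/daxpy(), because a reduction makes the accumulator dependency easy to see. The function names dot_single and dot_multi and the choice of four accumulators are assumptions made for illustration only.

```c
#include <assert.h>
#include <stddef.h>

/* Baseline: every add depends on the previous one through `sum`,
 * so the additions form one long serial dependency chain. */
double dot_single(const double *x, const double *y, size_t n)
{
    assert(n == 0 || (x != NULL && y != NULL));
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        sum += x[i] * y[i];
    return sum;
}

/* Hypothetical multi-accumulator variant: four independent partial
 * sums, so consecutive adds do not wait on each other. The unroll
 * factor of 4 is an illustrative choice, not a tuned value. */
double dot_multi(const double *x, const double *y, size_t n)
{
    assert(n == 0 || (x != NULL && y != NULL));
    double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
    size_t i = 0;

    /* Main loop: each accumulator handles every fourth element. */
    for (; i + 4 <= n; i += 4) {
        s0 += x[i]     * y[i];
        s1 += x[i + 1] * y[i + 1];
        s2 += x[i + 2] * y[i + 2];
        s3 += x[i + 3] * y[i + 3];
    }

    /* Remainder loop: up to three leftover elements. */
    for (; i < n; i++)
        s0 += x[i] * y[i];

    /* Combine the partial sums once, at the end. */
    return (s0 + s1) + (s2 + s3);
}
```

Because the variant reorders the floating-point additions, its result can differ from the baseline in the last few bits for general inputs. Whether it runs faster on a given processor and compiler has to be measured; the figures quoted in the post are the poster's recollection, not something this sketch demonstrates.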
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199611102048.MAA05053>