From owner-freebsd-numerics@FreeBSD.ORG Sun Aug 12 23:10:42 2012 Return-Path: Delivered-To: freebsd-numerics@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 8C1021065674 for ; Sun, 12 Aug 2012 23:10:42 +0000 (UTC) (envelope-from peter@rulingia.com) Received: from vps.rulingia.com (host-122-100-2-194.octopus.com.au [122.100.2.194]) by mx1.freebsd.org (Postfix) with ESMTP id 1188A8FC0C for ; Sun, 12 Aug 2012 23:10:41 +0000 (UTC) Received: from server.rulingia.com (c220-239-249-137.belrs5.nsw.optusnet.com.au [220.239.249.137]) by vps.rulingia.com (8.14.5/8.14.5) with ESMTP id q7CNAfjr075831 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK) for ; Mon, 13 Aug 2012 09:10:42 +1000 (EST) (envelope-from peter@rulingia.com) X-Bogosity: Ham, spamicity=0.000000 Received: from server.rulingia.com (localhost.rulingia.com [127.0.0.1]) by server.rulingia.com (8.14.5/8.14.5) with ESMTP id q7CNAZ7W021643 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Mon, 13 Aug 2012 09:10:35 +1000 (EST) (envelope-from peter@server.rulingia.com) Received: (from peter@localhost) by server.rulingia.com (8.14.5/8.14.5/Submit) id q7CNAZHw021642 for freebsd-numerics@freebsd.org; Mon, 13 Aug 2012 09:10:35 +1000 (EST) (envelope-from peter) Resent-From: Peter Jeremy Resent-Date: Mon, 13 Aug 2012 09:10:35 +1000 Resent-Message-ID: <20120812231035.GZ20453@server.rulingia.com> Resent-To: freebsd-numerics@freebsd.org Received: from vps.rulingia.com (host-122-100-2-194.octopus.com.au [122.100.2.194]) by server.rulingia.com (8.14.5/8.14.5) with ESMTP id q6MKDm4g007043 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK) for ; Mon, 23 Jul 2012 06:13:48 +1000 (EST) (envelope-from stephen@missouri.edu) Received: from wilberforce.math.missouri.edu (wilberforce.math.missouri.edu [128.206.184.213]) by vps.rulingia.com (8.14.5/8.14.5) with ESMTP id q6MKDjlF009627 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Mon, 23 Jul 2012 06:13:48 +1000 (EST) (envelope-from stephen@missouri.edu) Received: from [127.0.0.1] (wilberforce.math.missouri.edu [128.206.184.213]) by wilberforce.math.missouri.edu (8.14.5/8.14.5) with ESMTP id q6MKDOBd072644; Sun, 22 Jul 2012 15:13:25 -0500 (CDT) (envelope-from stephen@missouri.edu) Message-ID: <500C5EE5.4090602@missouri.edu> From: Stephen Montgomery-Smith Mail-Followup-To: freebsd-numerics@freebsd.org User-Agent: Mozilla/5.0 (X11; Linux i686; rv:14.0) Gecko/20120714 Thunderbird/14.0 MIME-Version: 1.0 To: Bruce Evans References: <20120714120432.GA70706@server.rulingia.com> <20120717084457.U3890@besplex.bde.org> <5004A5C7.1040405@missouri.edu> <5004DEA9.1050001@missouri.edu> <20120717200931.U6624@besplex.bde.org> <5006D13D.2080702@missouri.edu> <20120718205625.GA409@troutmask.apl.washington.edu> <500725F2.7060603@missouri.edu> <20120719025345.GA1376@troutmask.apl.washington.edu> <50077987.1080307@missouri.edu> <20120719032706.GA1558@troutmask.apl.washington.edu> <5007826D.7060806@missouri.edu> <5007AD41.9070000@missouri.edu> <20120719205347.T2601@besplex.bde.org> <50084322.7020401@missouri.edu> <20120720035001.W4053@besplex.bde.org> <50085441.4090305@missouri.edu> <20120720162953.N2162@besplex.bde.org> <20120720184114.B2790@besplex.bde.org> <50095CDE.4050507@missouri.edu> <20120723044308.X6145@besplex.bde.org> In-Reply-To: <20120723044308.X6145@besplex.bde.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: Diane Bruce , John Baldwin , David Chisnall , Bruce Evans , Steve Kargl , David Schultz , Peter Jeremy , Warner Losh Subject: Re: Use of C99 extra long double math functions after r236148 X-BeenThere: freebsd-numerics@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: "Discussions of high quality implementation of libm functions." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Date: Sun, 12 Aug 2012 23:10:42 -0000 X-Original-Date: Sun, 22 Jul 2012 15:13:25 -0500 X-List-Received-Date: Sun, 12 Aug 2012 23:10:42 -0000 But I will say that your latest version of clog doesn't do as well as mine with this input: x = unur_sample_cont(gen); y = unur_sample_cont(gen); h = hypot(x,y); x = x/h; y = y/h; I was able to get ULPs less than 2. Your program gets ULPs more like up to 4000. I have to say that I consider a ULP of 4000 under these very extreme circumstances to be acceptable. Definitely acceptable if the code goes a whole lot faster than code that has a ULP of less than 2. On 07/22/2012 02:29 PM, Bruce Evans wrote: > Replying again to this... > > On Fri, 20 Jul 2012, Stephen Montgomery-Smith wrote: > >> I worked quite a bit on my clog function last night. Could you look >> at this one? > > I ended up deleting most of your changes in this one. > >> The trick for when hypot(x,y) is close to 1 can only work so well, and >> you are testing it out of its range of applicability. But for the >> special case x is close to 1, and y is close to zero, I have a much >> better formula. I can produce three other formula so that I can >> handle |x| close to 1, y small, and |y| close to 1 and x small. >> >> After fixing it up, could you send it back as an attachment? That >> will make it easier for me to put it back into my system, and work >> more on it. > > It will be painful for everyone to understand and merge. This time as > an attachment as well as (partly) inline with commentary. > > % #include > % #include > % #include > % % #include "math_private.h" > % % static inline void > % xonorm(double *a, double *b) > % { > % double w; > % % if (fabs(*a) < fabs(*b)) { > % w = *a; > % *a = *b; > % *b = w; > % } > % STRICT_ASSIGN(double, w, *a + *b); > % *b = (*a - w) + *b; > % *a = w; > % } > % % #define xnorm(a, b) xonorm(&(a), &(b)) > % % #define xspadd(a, b, c) do { \ > % double __tmp; \ > % \ > % __tmp = (c); \ > % xonorm(&__tmp, &(a)); \ > % (b) += (a); \ > % (a) = __tmp; \ > % } while (0) > % % static inline void > % xonormf(float *a, float *b) > % { > % float w; > % % if (fabsf(*a) < fabsf(*b)) { > % w = *a; > % *a = *b; > % *b = w; > % } > % STRICT_ASSIGN(float, w, *a + *b); > % *b = (*a - w) + *b; > % *a = w; > % } > % % #define xnormf(a, b) xonormf(&(a), &(b)) > % % #define xspaddf(a, b, c) do { \ > % float __tmp; \ > % \ > % __tmp = (c); \ > % xonormf(&__tmp, &(a)); \ > % (b) += (a); \ > % (a) = __tmp; \ > % } while (0) > > Above are my standard extra-precision macros from my math_private.h, > cut down for use here and named with an x. Then expanded and pessimized > to swap the args. Optimal callers ensure that they don't need swapping. > See s_fma.c for a fuller algorithm that doesn't need swapping but does > more operations. I started to copy that, but s_fma.c doesn't seem to > have anything as convenient as xspaddf(). Further expanded and > pessimized() to do STRICT_ASSIGN(). Optimal callers use float_t and > double_t so that STRICT_ASSIGN() is unnecessary. Compiler bugs break > algorithms like the above on i386 unless float_t and double_t, or > STRICT_ASSIGN() are used. Fixing the bugs would give the same slowness > as STRICT_ASSIGN(), but globally by doing it for all assignments, so > even I now consider these bugs to be features and C standards to be > broken for not allowing them. I first discussed fixing them with gcc > maintainers over 20 years ago. > > % % double complex > % clog(double complex z) > % { > % double x, y, h, t1, t2, t3; > % double ax, ay, x0, y0, x1, y1; > % % x = creal(z); > % y = cimag(z); > % % /* Handle NaNs using the general formula to mix them right. */ > % if (x != x || y != y) > % return (cpack(log(hypot(x, y)), atan2(y, x))); > > I replaced all my messy ifdefs for this by the function call. Also > changes isnan() to a not-so-magic test. Though isnan() is about the > only FP classification macro that I trust the compiler for. > > % % ax = fabs(x); > % ay = fabs(y); > % if (ax < ay) { > % t1 = ax; > % ax = ay; > % ay = t1; > % } > > I got tired of repeating fabs()'s, and need to know which arg is larger. > > % % /* > % * To avoid unnecessary overflow, if x or y are very large, divide x > % * and y by M_E, and then add 1 to the logarithm. This depends on > % * M_E being larger than sqrt(2). > % * > % * XXX bugs to fix: > % * - underflow if one of x or y is tiny. e_hypot.c avoids this > % * problem, and optimizes for the case that the ratio of the > % * args is very large, by returning the absolute value of > % * the largest arg in this case. > % * - not very accurate. Could divide by 2 and add log(2) in extra > % * precision. A general scaling step that divides by 2**k and > % * adds k*log(2) in extra precision might be good for reducing > % * the range so that we don't have to worry about overflow or > % * underflow in the general steps. This needs the previous step > % * of eliminating large ratios of args so that the args can be > % * scaled on the same scale. > % * - s/are/is/ in comment. > % */ > % if (ax > 1e308) > % return (cpack(log(hypot(x / M_E, y / M_E)) + 1, atan2(y, x))); > > No need to avoid infinities here. No need to test y now that we know > ax is largest. > > % % if (ax == 1) { > % if (ay < 1e-150) > % return (cpack((ay * 0.5) * ay, atan2(y, x))); > % return (cpack(log1p(ay * ay) * 0.5, atan2(y, x))); > % } > > Special case mainly for when (ay * ay) is rounded down to the smallest > denormal. > > % % /* > % * Because atan2 and hypot conform to C99, this also covers all the > % * edge cases when x or y are 0 or infinite. > % */ > % if (ax < 1e-50 || ay < 1e-50 || ax > 1e50 || ay > 1e50) > % return (cpack(log(hypot(x, y)), atan2(y, x))); > > Not quite right. It is the ratio that matters more than these magic > magnitudes. > > % % /* We don't need to worry about overflow in x*x+y*y. */ > % % /* > % * Take extra care so that ULP of real part is small if h is > % * moderately close to 1. If one only cares about the relative error > % * of the whole result (real and imaginary part taken together), this > % * algorithm is overkill. > % * > % * This algorithm does a rather good job if |h-1| >= 1e-5. The only > % * algorithm that I can think of that would work for any h close to > % * one would require hypot(x,y) being computed using double double > % * precision precision (i.e. double as many bits in the mantissa as > % * double precision). > % * > % * x0 and y0 are good approximations to x and y, but have their bits > % * trimmed so that double precision floating point is capable of > % * calculating x0*x0 + y0*y0 - 1 exactly. > % */ > > Comments not all updated. This one especially out of date. > > % x0 = ax; > % SET_LOW_WORD(x0, 0); > % x1 = ax - x0; > % y0 = ay; > % SET_LOW_WORD(y0, 0); > % y1 = ay - y0; > > Sloppy decomposition with only 21 bits in hi part. Since we are short > of bits, we shouldn't burn 5 like this for efficency. In float precision, > all the multiplications are exact since 24 splits exactly. > > % /* Notice that mathematically, h = t1*(1+t3). */ > % #if 0 > > Old version. Still drops bits unnecessary, although I added several > full hi/lo decomposition steps to it. > > % t1 = x0 * x0; /* Exact. */ > % t2 = y0 * y0; /* Exact. */ > > Comments not quite right. All of the muliplications are as exact as > possible. They would all be exact if we could split 53 in half, and > did so. > > % STRICT_ASSIGN(double, t3, t1 + t2); > % t2 = (t1 - t3) + t2; > % t1 = t3; /* Now t1+t2 is hi+lo for x0*x0+y0*y0.*/ > % t2 += 2 * x0 * x1; > % STRICT_ASSIGN(double, t3, t1 + t2); > % t2 = (t1 - t3) + t2; > % t1 = t3; > % t2 += 2 * y0 * y1; > % STRICT_ASSIGN(double, t3, t1 + t2); > % t2 = (t1 - t3) + t2; > % t1 = t3; > % t2 += x1 * x1 + y1 * y1; > % STRICT_ASSIGN(double, t3, t1 + t2); > % t2 = (t1 - t3) + t2; > % t1 = t3; /* Now t1+t2 is hi+lo for x*x+y*y.*/ > % #else > % t1 = x1 * x1; > % t2 = y1 * y1; > % xnorm(t1, t2); > % xspadd(t1, t2, 2 * y0 * y1); > % xspadd(t1, t2, 2 * x0 * x1); > % xspadd(t1, t2, y0 * y0); > % xspadd(t1, t2, x0 * x0); > % xnorm(t1, t2); > > It was too hard to turn the above into this without using the macros. > Now all the multiplications are as exact as possible, and extra precision > is used for all the additions (this mattered even for the first 2 terms). > Terms should be added from the smallest to the highest. This happens in > most cases and some bits are lost when it isn't. > > % #endif > % t3 = t2 / t1; > % /* > % * |t3| ~< 2**-22 since we work with 24 extra bits of precision, so > % * log1p(t3) can be evaluated with about 13 extra bits of precision > % * using 2 terms of its power series. But there are complexities > % * to avoid underflow. > % */ > > Complexities to avoid underflow incomplete and not here yet. > > Comment otherwise anachronistic/anaspac(sp?)istic. 22 and 13 are for the > float version. The final xnorm() step (to maximize accuracy) ensures that > |t3| < 2**-24 precisely for floats (half an ulp). 2**-53 for doubles. > > % return (cpack((t3 - t3*0.5*t3 + log(t1)) * 0.5, atan2(y, x))); > > The second term for log1p(t3) is probably nonsense . We lose by > inaccuracies in t3 itself. > > % } > % % float complex > % clogf(float complex z) > % { > > This is a routine translation. It duplicates too many comments. Only > this has been tested much with the latest accuracty fixes. > > % float x, y, h, t1, t2, t3; > % float ax, ay, x0, y0, x1, y1; > % uint32_t hx, hy; > % % x = crealf(z); > % y = cimagf(z); > % % /* Handle NaNs using the general formula to mix them right. */ > % if (x != x || y != y) > % return (cpack(log(hypot(x, y)), atan2(y, x))); > > Oops, copied too much -- forgot to add f's. > > Bruce