643657 – mpfr_fma bug

RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 643657 - mpfr_fma bug

Summary: mpfr_fma bug

Keywords:
Status:	CLOSED WONTFIX
Alias:	None
Product:	Red Hat Enterprise Linux 6
Classification:	Red Hat
Component:	mpfr
Sub Component:
Version:	6.0
Hardware:	x86_64
OS:	Linux
Priority:	low
Severity:	medium
Target Milestone:	rc
Target Release:	---
Assignee:	Frantisek Kluknavsky
QA Contact:	BaseOS QE - Apps
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2010-10-16 20:24 UTC by Jakub Jelinek
Modified:	2017-12-06 12:32 UTC (History)
CC List:	1 user (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-12-06 12:32:20 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
mpfr-rh643657.patch (21.79 KB, patch) 2010-10-19 16:59 UTC, Jakub Jelinek	no flags	Details \| Diff
View All

Description Jakub Jelinek 2010-10-16 20:24:42 UTC

This source should show a bug, where mpfr_fma (IMNSHO) gives incorrect result (1ulp smaller), which causes also __builtin_fmal to misbehave.
The testcase comment explains why I believe the right result is
0xc.0bffffffffffffep-3942L (== 0x3.02fffffffffffff8p-3940L)
instead of
0xc.0bffffffffffffdp-3942L (== 0x3.02fffffffffffff4p-3940L)
FMA is supposed to be computed with infinite precision and only rounded at the end.  When stepping through mpfr_fma, it seems u is computed correctly:
(gdb) p mpfr_printf ("%Ra\n", u)
0x3.02fffffffffffffap-3940
$1 = 27
so the problem seems to be in the following mpfr_add (s, u, z, rnd_mode);
where u has precision 128, but s and z have precision 64.
The textcase is for extended 80-bit long double, so ?86/x86_64/ia64.

#include <math.h>
#include <gmp.h>
#include <stdio.h>
#include <mpfr.h>

/*

80-bit extended long double

fmal (0x8.07fffffffffffffp+7418L, 0xcp-11363L, -0xf.ff80f8p-4070L)

  0x8.07fffffffffffffp+7418L
* 0xcp-11363L
= 0xc.0bffffffffffffe8p-3942L

  0xc.0bffffffffffffe8p-3942L
- 0x0.0000000000000000000000000000000fff80f8p-3942L
= 0xc.0bffffffffffffe7fffffffffffffff0007f08p-3942L

RNDN:
= 0xc.0bffffffffffffep-3942L instead of 0xc.0bffffffffffffdp-3942L

__builtin_fmal uses mpfr_fma internally in the compiler.
 */

volatile long double zero;

int
main (void)
{
  mpz_t a, b, c;
  mpz_init (a); mpz_init (b); mpz_init (c);
  mpz_set_str (a, "807fffffffffffff", 16);
  mpz_set_str (b, "c", 16);
  mpz_mul (c, a, b);
  mpz_mul_si (c, c, 2);
  mpz_out_str (stdout, 16, c);
  printf ("\n");
  mpz_set_str (a, "c0bffffffffffffe80000000000000000000000", 16);
  mpz_set_str (b, "00000000000000000000000000000000fff80f8", 16);
  mpz_sub (c, a, b);
  mpz_out_str (stdout, 16, c);
  printf ("\n");
  mpz_clear (a); mpz_clear (b); mpz_clear (c);
  printf ("mul            %La\n__builtin_fmal %La\nfmal           %La\n",
          0x8.07fffffffffffffp+7418L * 0xcp-11363L,
          __builtin_fmal (0x8.07fffffffffffffp+7418L, 0xcp-11363L,
                          -0xf.ff80f8p-4070L),
          fmal (0x8.07fffffffffffffp+7418L, 0xcp-11363L,
                -0xf.ff80f8p-4070L + zero));
  mpfr_set_default_prec (64);
  mpfr_set_emin (-16384 -64 +4);
  mpfr_set_emax (16384);
  mpfr_t xx, xy, xz, xr, xm;
  mpfr_init (xx); mpfr_set_ld (xx, 0x8.07fffffffffffffp+7418L, GMP_RNDN);
  mpfr_init (xy); mpfr_set_ld (xy, 0xcp-11363L, GMP_RNDN);
  mpfr_init (xz); mpfr_set_ld (xz, -0xf.ff80f8p-4070L, GMP_RNDN);
  mpfr_init (xr); mpfr_init (xm);
  int i = mpfr_mul (xm, xx, xy, GMP_RNDN);
  mpfr_subnormalize (xm, i, GMP_RNDN);
  i = mpfr_fma (xr, xx, xy, xz, GMP_RNDN);
  mpfr_subnormalize (xr, i, GMP_RNDN);
  mpfr_printf ("mpfr_mul       %Ra\nmpfr_fma       %Ra\n", xm, xr);
  mpfr_clear (xx); mpfr_clear (xy); mpfr_clear (xz);
  mpfr_clear (xm); mpfr_clear (xr);
  return 0;
}

Comment 1 Jakub Jelinek 2010-10-17 19:33:17 UTC

mpfr_sub1 is called, where b is 2 limbs long (128-bit), while a and c is just one limb long (64-bit).
As c is small, but not small enough (difference in exponents exactly 128), we reach the:

for (k = 0; (bn > 0) || (cn > 0); k = 1)

loop in sub1.c, with sh == 0.  In the first iteration bb is equal to MPFR_LIMB_HIGHBIT and cc is 0 (it is before the highest limb in c), and we reach the special case:

      /* the case rounding to nearest with sh=0 is special since one couldn't
         subtract above 1/2 ulp in the trailing limb of the result */
      if ((rnd_mode == GMP_RNDN) && sh == 0 && k == 0)
        {
          mp_limb_t half = MPFR_LIMB_HIGHBIT;

          is_exact = (bb == cc);

          /* add one ulp if bb > cc + half
             truncate if cc - half < bb < cc + half
             sub one ulp if bb < cc - half
          */
...
        }

The comments seems to cover most of the cases, just not two important ones, where either bb is equal to cc + half (this case) or bb is equal to cc - half.
In that case after this bb is equal to cc, eventhough they weren't equal before.  In this case with round to nearest we really can't decide right away, need to look at next (or following limbs) to decide if exactly 0.5ulp was added resp. subtracted, or if tiny bit above it or below it, but that decision should take into account that the special case above with bb == cc + half or bb == cc - half happened.  In the code it doesn't, so we fall through to the next limb - 0 in bb (as it is after lowest limb) and 0xfff80f8000000000 in cc.  k is non-zero, so the special case isn't done now, bb is smaller than cc, so it sets down and reaches the
420                   else if (down && sh == 0)
421                     goto sub_one_ulp;
case, so it incorrectly subtracts 1ulp instead of just raising inexact and truncating.

Comment 2 Paul Zimmermann 2010-10-18 19:55:19 UTC

fixed upstream in revision 7212. Jakub, please could you check it fixes the
problem, so that we can close the bug on our side? Thank you for reporting it
and analyzing the reason.

Paul Zimmermann

PS: I guess Vincent Lefevre will post a patch for MPFR 3.0.0

Comment 3 Jakub Jelinek 2010-10-19 16:59:01 UTC

Created attachment 454393 [details]
mpfr-rh643657.patch

Backported patch from upstream (the differences are just changing MPFR_RND* back to GMP_RND* and undoing the move of sub1.c to src/sub1.c).
So far my fma tester runs with this patch for 5 hours on each of -DDO_FLT, -DDO_DBL and -DDO_LDBL on x86_64-linux and no problems were reported.

Comment 4 RHEL Program Management 2011-01-07 15:38:02 UTC

This request was evaluated by Red Hat Product Management for
inclusion in the current release of Red Hat Enterprise Linux.
Because the affected component is not scheduled to be updated
in the current release, Red Hat is unfortunately unable to
address this request at this time. Red Hat invites you to
ask your support representative to propose this request, if
appropriate and relevant, in the next release of Red Hat
Enterprise Linux. If you would like it considered as an
exception in the current release, please ask your support
representative.

Comment 5 Jan Kurik 2017-12-06 12:32:20 UTC

Red Hat Enterprise Linux 6 is in the Production 3 Phase. During the Production 3 Phase, Critical impact Security Advisories (RHSAs) and selected Urgent Priority Bug Fix Advisories (RHBAs) may be released as they become available.

The official life cycle policy can be reviewed here:

http://redhat.com/rhel/lifecycle

This issue does not meet the inclusion criteria for the Production 3 Phase and will be marked as CLOSED/WONTFIX. If this remains a critical requirement, please contact Red Hat Customer Support to request a re-evaluation of the issue, citing a clear business justification. Note that a strong business justification will be required for re-evaluation. Red Hat Customer Support can be contacted via the Red Hat Customer Portal at the following URL:

https://access.redhat.com/

Note You need to log in before you can comment on or make changes to this bug.