This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] New numbers in the benchtests.

From: Joseph Myers <joseph at codesourcery dot com>
To: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
Cc: <libc-alpha at sourceware dot org>, <patrick dot mcgehearty at oracle dot com>
Date: Tue, 19 Dec 2017 18:54:26 +0000
Subject: Re: [PATCH] New numbers in the benchtests.
Authentication-results: sourceware.org; auth=none
References: <20171215042058.23705-1-woodard@redhat.com> <alpine.DEB.2.20.1712151603550.13157@digraph.polyomino.org.uk> <f9b87bdf-0d71-088a-99ee-4b034deca9b5@gotplt.org> <3f2393c9-c853-999b-b0c0-c07dcb30e434@oracle.com> <alpine.DEB.2.20.1712181718240.26421@digraph.polyomino.org.uk> <05bbfd67-b8c0-d999-7e99-c67abd4d67e5@oracle.com> <alpine.DEB.2.20.1712191728050.27055@digraph.polyomino.org.uk> <9a4d7043-65c6-7715-3fb4-fc36eb57d9fb@linaro.org> <alpine.DEB.2.20.1712191808110.27055@digraph.polyomino.org.uk>

Here is a possible cause for the failures, not verified as such:

The exp implementation was using get_rounding_mode.  But tgamma uses 
SET_RESTORE_ROUND and calls exp within the scope of SET_RESTORE_ROUND.  
SET_RESTORE_ROUND, for x86_64, only sets the SSE rounding mode, but 
get_rounding_mode gets the x87 rounding mode.  The effect would have been 
that the code in exp found that the x87 rounding mode was not to-nearest, 
then redundantly set the SSE rounding mode to to-nearest, then ended up 
trying to restore the rounding mode and actually setting the SSE rounding 
mode to the not-to-nearest value of the x87 rounding modes.

If this hypothesis is correct, I advise resubmitting the patch in a form 
that just uses SET_RESTORE_ROUND (FE_TONEAREST) like other libm functions 
- the performance improvement was sufficient that presumably even this 
form would still perform better than the existing code.  Then, once that 
is properly validated and checked in, it would be possible to restore the 
optimization that the use of get_rounding_mode and separate code paths 
were intended to achieve.  To do that, you'd need libc_fegetround{,f,l} 
which by default just use get_rounding_mode, but on x86 __SSE2_MATH__ do 
something different for the float and double versions to get the SSE 
rounding mode instead; exp would use libc_fegetround instead of using 
get_rounding_mode directly.

-- 
Joseph S. Myers
joseph@codesourcery.com

Follow-Ups:
- Re: [PATCH] New numbers in the benchtests.
  - From: Patrick McGehearty

References:
- [PATCH] New numbers in the benchtests.
  - From: Ben Woodard
- Re: [PATCH] New numbers in the benchtests.
  - From: Joseph Myers
- Re: [PATCH] New numbers in the benchtests.
  - From: Siddhesh Poyarekar
- Re: [PATCH] New numbers in the benchtests.
  - From: Patrick McGehearty
- Re: [PATCH] New numbers in the benchtests.
  - From: Joseph Myers
- Re: [PATCH] New numbers in the benchtests.
  - From: Patrick McGehearty
- Re: [PATCH] New numbers in the benchtests.
  - From: Joseph Myers
- Re: [PATCH] New numbers in the benchtests.
  - From: Adhemerval Zanella
- Re: [PATCH] New numbers in the benchtests.
  - From: Joseph Myers

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]