This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Sparc exp(), expf() performance improvement

From: Patrick McGehearty <patrick dot mcgehearty at oracle dot com>
To: libc-alpha at sourceware dot org
Date: Thu, 3 Aug 2017 11:50:53 -0500
Subject: Re: [PATCH] Sparc exp(), expf() performance improvement
Authentication-results: sourceware.org; auth=none
References: <20170731.142137.2230165415917491707.davem@davemloft.net> <11ace70c-ca7b-b213-bc63-ac7f252ef8ba@oracle.com> <6835b045-e169-84a1-f4d6-479502a90d4b@oracle.com> <20170802.161958.1521864822969671626.davem@davemloft.net>

On 8/2/2017 6:19 PM, David Miller wrote:

From: Patrick McGehearty <patrick.mcgehearty@oracle.com>
Date: Wed, 2 Aug 2017 14:53:19 -0500

I concur that it would be beneficial to use -mcpu=niagara4 for
selected libm functions, but do not believe it should be default on
without more extensive testing (and likely more compiler tuning).

This is precisely what has been suggested, have an niagara4 IFUNC
vector for math functions that benefit greatly from the fp<-->int
register move instructions.

I'm new to linux glibc development and have been using existing examplesto suggesthow I should do things like use ifunc. If there is a different approachthat is recommended,

I'm willing to use it but will need some hints/instruction on what is.

My current approach of using
sparc_libm_ifunc (exp, hwcap & HWCAP_SPARC_CRYPTO ?
                  __exp_niagara4 : __ieee754_exp);

in sysdeps/sparc/sparc64/fpu/multiarch/e_exp-niagara4.c matches theapproach used in

sysdeps/sparc/sparc64/fpu/multiarch/s_ceil.c
sysdeps/sparc/sparc64/fpu/multiarch/s_ceilf.c
sysdeps/sparc/sparc64/fpu/multiarch/s_floor.c
sysdeps/sparc/sparc64/fpu/multiarch/s_floorf.c
sysdeps/sparc/sparc64/fpu/multiarch/s_fma.c
sysdeps/sparc/sparc64/fpu/multiarch/s_fmaf.c
sysdeps/sparc/sparc64/fpu/multiarch/s_trunc.c
sysdeps/sparc/sparc64/fpu/multiarch/s_truncf.c

I'll note that simply using -mcpu=niagara4 on the existing ieee754 exp()code onlyprovides a 5% performance gain. Due to other assignments, I will nothave time/resources

to redo other libm functions at this time.

- patrick

Follow-Ups:
- Re: [PATCH] Sparc exp(), expf() performance improvement
  - From: Adhemerval Zanella

References:
- Re: [PATCH] Sparc exp(), expf() performance improvement
  - From: Patrick McGehearty
- Re: [PATCH] Sparc exp(), expf() performance improvement
  - From: Patrick McGehearty
- Re: [PATCH] Sparc exp(), expf() performance improvement
  - From: David Miller

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]