This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[PATCH] aarch64: Implement math acceleration via builtins


This patch converts asm statements into builtin and neon intrinsics for AArch64.
As an example for the file sysdeps/aarch64/fpu/_s_ceil.c, we convert the function from

double
__ceil (double x)
{
  double result;
  asm ("frintp\t%d0, %d1" :
       "=w" (result) : "w" (x) );
  return result;
}

into

double
__ceil (double x)
{
  return __builtin_ceil (x);
}

Tested on aarch64-linux-gnu with gcc-4.9.4 and gcc-6. Okay for trunk?

2017-10-13  Michael Collison  <michael.collison@arm.com>

	* sysdeps/aarch64/fpu/e_sqrt.c (ieee754_sqrt): Replace
	asm statements with __builtin_sqrt.
	* sysdeps/aarch64/fpu/e_sqrtf.c (ieee754_sqrtf): Replace
	asm statements with __builtin_sqrtf.
	* sysdeps/aarch64/fpu/s_ceil.c (__ceil): Replace
	asm statements with __builtin_ceil.
	* sysdeps/aarch64/fpu/s_ceilf.c (__ceilf): Replace
	asm statements with __builtin_ceilf.
	* sysdeps/aarch64/fpu/s_floor.c (__floor): Replace
	asm statements with __builtin_floor.
	* sysdeps/aarch64/fpu/s_floorf.c (__floorf): Replace
	asm statements with __builtin_floorf.
	* sysdeps/aarch64/fpu/s_fma.c (__fma): Replace
	asm statements with __builtin_fma.
	* sysdeps/aarch64/fpu/s_fmaf.c (__fmaf): Replace
	asm statements with __builtin_fmaf.
	* sysdeps/aarch64/fpu/s_fmax.c (__fmax): Replace
	asm statements with __builtin_fmax.
	* sysdeps/aarch64/fpu/s_fmaxf.c (__fmaxf): Replace
	asm statements with __builtin_fmaxf.
	* sysdeps/aarch64/fpu/s_fmin.c (__fmin): Replace
	asm statements with __builtin_fmin.
	* sysdeps/aarch64/fpu/s_fminf.c (__fminf): Replace
	asm statements with __builtin_fminf.
	* sysdeps/aarch64/fpu/s_frint.c: Delete file.
	* sysdeps/aarch64/fpu/s_frintf.c: Delete file.
	* sysdeps/aarch64/fpu/s_llrint.c (__llrint): Replace
	asm statements with builtin and neon intrinsic.
	* sysdeps/aarch64/fpu/s_llrintf.c (__llrintf): Ditto.
	* sysdeps/aarch64/fpu/s_llround.c (__llround): Replace
	asm statements with neon intrinsic.
	* sysdeps/aarch64/fpu/s_llroundf.c (__llroundf): Ditto.
	* sysdeps/aarch64/fpu/s_lrint.c (__lrint): Replace
	asm statements with builtin and neon intrinsic.
	* sysdeps/aarch64/fpu/s_lrintf.c (__lrintf): Ditto.
	* sysdeps/aarch64/fpu/s_lround.c (__lround): Replace
	asm statements with neon intrinsic vcvtad_s64_f64.
	* sysdeps/aarch64/fpu/s_lroundf.c (__lroundf): Replace
	asm statements with neon intrinsic vcvtas_s32_f32.
	* sysdeps/aarch64/fpu/s_nearbyint.c (__nearbyint): Replace
	asm statements with __builtin_nearbyint.
	* sysdeps/aarch64/fpu/s_nearbyintf.c (__nearbyintf): Replace
	asm statements with __builtin_nearbyintf.
	* sysdeps/aarch64/fpu/s_rint.c (__rint): Replace
	asm statements with __builtin_rint.
	* sysdeps/aarch64/fpu/s_rintf.c (__rintf): Replace
	asm statements with __builtin_rintf.
	* sysdeps/aarch64/fpu/s_round.c (__round): Replace
	asm statements with __builtin_round.
	* sysdeps/aarch64/fpu/s_roundf.c (__roundf): Replace
	asm statements with __builtin_roundf.
	* sysdeps/aarch64/fpu/s_trunc.c (__trunc): Replace
	asm statements with __builtin_trunc.
	* sysdeps/aarch64/fpu/s_truncf.c (__truncf): Replace
	asm statements with __builtin_truncf.
	* sysdeps/aarch64/fpu/Makefile: Build e_sqrt[f].c with
	-fno-math-errno.

Attachment: gnutools-1925-rebase-v12.patch
Description: gnutools-1925-rebase-v12.patch


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]