This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH 1/N] x86_64 vectorization support: vectorized math functions addition to Glibc
- From: OndÅej BÃlka <neleai at seznam dot cz>
- To: Andi Kleen <andi at firstfloor dot org>
- Cc: Rich Felker <dalias at libc dot org>, Matthew Fortune <Matthew dot Fortune at imgtec dot com>, "Joseph S. Myers" <joseph at codesourcery dot com>, Andrew Senkevich <andrew dot n dot senkevich at gmail dot com>, libc-alpha <libc-alpha at sourceware dot org>, "igor dot zamyatin at intel dot com" <igor dot zamyatin at intel dot com>, "Melik-Adamyan, Areg" <areg dot melik-adamyan at intel dot com>, "jakub at redhat dot com" <jakub at redhat dot com>
- Date: Fri, 12 Sep 2014 09:18:38 +0200
- Subject: Re: [PATCH 1/N] x86_64 vectorization support: vectorized math functions addition to Glibc
- Authentication-results: sourceware.org; auth=none
- References: <CAMXFM3t=ppndDUBzHzSus7xyuF5hTaLFZ5b273jD39NtddSvsw at mail dot gmail dot com> <Pine dot LNX dot 4 dot 64 dot 1409101549490 dot 12853 at digraph dot polyomino dot org dot uk> <6D39441BF12EF246A7ABCE6654B0235320F09D65 at LEMAIL01 dot le dot imgtec dot org> <20140911210246 dot GN23797 at brightrain dot aerifal dot cx> <87a9655rnu dot fsf at tassilo dot jf dot intel dot com>
On Thu, Sep 11, 2014 at 10:33:41PM -0700, Andi Kleen wrote:
> Rich Felker <dalias@libc.org> writes:
> >
> > This really seems like something the compiler should be doing --
> > translating parallelizable calls to the standard math functions into
> > calls to special simd versions (
>
> Of course gcc already supports that. Even in two different flavours.
>
> Not sure why the patch doesn't implement one of those ABIs though.
>
> -mveclibabi=type
> Specifies the ABI type to use for vectorizing intrinsics
> using an external library.
> Supported values for type are svml for the Intel short vector
> math library and acml for
> the AMD math core library. To use this option, both
> -ftree-vectorize and
> -funsafe-math-optimizations have to be enabled, and an SVML
> or ACML ABI-compatible
> library must be specified at link time.
>
> GCC currently emits calls to "vmldExp2", "vmldLn2",
Which has problem when one want to support both users with svml, amcl or
nothing package maintainers for some reason do not want create three
versions of same package.
What about doing runtime detection what is present? With ifunc one could
make use logic like
int vectorized;
function_ifunc ()
{
if (!(svml = dlopen ("svml.so")))
{
if (!(amcl = dlopen ("amcl.so")))
return function;
vec_exp = dlsym (amcl, "__vrd2_exp");
return function;
}
else
{
vec_exp = dlsym (svml, "vmldExp2");
return function;
}
}
when vectorized loop could look like
if (size < 4 || !vec_exp)
goto simple_loop;
else
goto vector_loop;
That would also preserve compatibility and allow to add avx versions
with detection if processor supports them.