This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.



Re: [Patch] aarch64: ThunderX2 specific memcpy and memmove


Hi Saikat,

Is this the patch you intended to post? I'm asking because neither
memcpy nor memmove passes any tests (it's obvious from the patch that
memmove does not handle the overlap case at all), the code is
absolutely humongous (17KB!!!) and it is very inefficient due to
its use of inline assembler.
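
To illustrate the overlap case mentioned above, here is a minimal C sketch
of the check a memmove needs. It is not the glibc or ThunderX2 code; the
names sketch_memmove and overlap_demo are hypothetical, and the byte loops
stand in for whatever optimised copy would really be used.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical illustration only, not glibc's implementation. */
static void *sketch_memmove(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;
    uintptr_t da = (uintptr_t)d;
    uintptr_t sa = (uintptr_t)s;

    if (da == sa || n == 0)
        return dst;

    if (da < sa || da - sa >= n) {
        /* dst does not start inside [src, src + n), so copying forward
           never overwrites source bytes that are still to be read. */
        for (size_t i = 0; i < n; i++)
            d[i] = s[i];
    } else {
        /* dst starts inside the source range: copy backward so the tail
           of the source is read before it is overwritten. */
        for (size_t i = n; i > 0; i--)
            d[i - 1] = s[i - 1];
    }
    return dst;
}

/* Returns 1 if an overlapping move (dst above src) gives the right result. */
static int overlap_demo(void)
{
    char buf[] = "abcdefgh";
    sketch_memmove(buf + 2, buf, 6);
    return memcmp(buf, "ababcdef", 8) == 0;
}
```

A plain forward copy in overlap_demo would smear "ab" across the buffer
instead, which is the kind of failure the memmove tests catch.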

For any resubmission I'd recommend you first run all the glibc tests and
benchmarks, and do a SPEC comparison run against the generic memcpy.

Due to the huge size and large number of branches I would expect it
to be significantly slower in the real world, even if it seems faster in
some microbenchmarks. glibc/benchtests/bench-memcpy-random is a
quick way to verify this as it is based on the memcpy size distribution
in SPEC.

Cheers,
Wilco

