This is the mail archive of the newlib@sourceware.org mailing list for the newlib project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

RE: [PATCH, AARCH64] Optimized memcpy


> Marcus Shawcroft wrote:
> On 9 July 2015 at 16:03, Marcus Shawcroft <marcus.shawcroft@gmail.com> wrote:
> > On 8 July 2015 at 16:05, Wilco Dijkstra <wdijkstr@arm.com> wrote:
> >> This is an optimized memcpy for AArch64. Copies are split into 3 main cases: small copies
> of up to
> >> 16 bytes, medium copies of 17..96 bytes which are fully unrolled. Large copies of more than
> 96 bytes
> >> align the destination and use an unrolled loop processing 64 bytes per iteration. In order
> to share
> >> code with memmove, small and medium copies read all data before writing, allowing any kind
> of
> >> overlap. On a random copy test memcpy is 40.8% faster on A57 and 28.4% on A53.
> >>
> >> ChangeLog:
> >> 2015-07-08  Wilco Dijkstra  <wdijkstr@arm.com>
> >>
> >>         * newlib/libc/machine/aarch64/memcpy.S (memcpy):
> >>         Rewrite of optimized memcpy.
> >>
> >> OK for commit?
> >
> > OK /Marcus
> 
> Drop the unrelated white space changes in the boiler plate first please.
> /Marcus

Done, see attached.

Wilco

Attachment: 0001-Optimized-memcpy.txt
Description: Text document


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]