This is the mail archive of the newlib@sourceware.org mailing list for the newlib project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [AArch64] Optimized memcmp

From: Corinna Vinschen <vinschen at redhat dot com>
To: newlib at sourceware dot org
Date: Thu, 29 Jun 2017 20:42:33 +0200
Subject: Re: [AArch64] Optimized memcmp
Authentication-results: sourceware.org; auth=none
Authentication-results: ext-mx08.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com
Authentication-results: ext-mx08.extmail.prod.ext.phx2.redhat.com; spf=pass smtp.mailfrom=vinschen at redhat dot com
Dkim-filter: OpenDKIM Filter v2.11.0 mx1.redhat.com BD4AFC058EAE
Dmarc-filter: OpenDMARC Filter v1.3.2 mx1.redhat.com BD4AFC058EAE
References: <AM5PR0802MB261024C52F84B3B42279AFF883D20@AM5PR0802MB2610.eurprd08.prod.outlook.com>
Reply-to: newlib at sourceware dot org

On Jun 29 14:32, Wilco Dijkstra wrote:
> This is an optimized memcmp for AArch64.  This is a complete rewrite
> using a different algorithm.  The previous version split into cases
> where both inputs were aligned, the inputs were mutually aligned and
> unaligned using a byte loop.  The new version combines all these cases,
> while small inputs of less than 8 bytes are handled separately.
> 
> This allows the main code to be sped up using unaligned loads since
> there are now at least 8 bytes to be compared.  After the first 8 bytes,
> align the first input.  This ensures each iteration does at most one
> unaligned access and mutually aligned inputs behave as aligned.
> After the main loop, process the last 8 bytes using unaligned accesses.
> 
> This improves performance of (mutually) aligned cases by 25% and 
> unaligned by >500% (yes >6 times faster) on large inputs.
> 
> ChangeLog:
> 2017-06-28  Wilco Dijkstra  <wdijkstr@arm.com>
> 
>         * newlib/libc/machine/aarch64/memcmp.S (memcmp): 
>         Rewrite of optimized memcmp.

Pushed.


Thanks,
Corinna

-- 
Corinna Vinschen
Cygwin Maintainer
Red Hat

Attachment: signature.asc
Description: PGP signature

References:
- [AArch64] Optimized memcmp
  - From: Wilco Dijkstra

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]