This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

RE: [RFC] Clean up SSE variable shifts

From: "Lu, Hongjiu" <hongjiu dot lu at intel dot com>
To: Ulrich Drepper <drepper at redhat dot com>
Cc: "libc-alpha at sourceware dot org" <libc-alpha at sourceware dot org>, Richard Henderson<rth at twiddle dot net>
Date: Tue, 24 Aug 2010 06:40:46 -0700
Subject: RE: [RFC] Clean up SSE variable shifts
References: <8EA2C2C4116BF44AB370468FBF85A7770179ACAA50@orsmsx504.amr.corp.intel.com><373984966.1014151282626273242.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>

[hjl@gnu-6 strcspn]$ size sse4-strcspn-vshft1.o sse4-strcspn-vshft2.o sse4-strcspn-vshft3.o sse4-strcspn-vshft4.o sse4-strcspn-vshft5.o varshift?.o
   text	   data	    bss	    dec	    hex	filename
    684	      0	      0	    684	    2ac	sse4-strcspn-vshft1.o
    591	      0	      0	    591	    24f	sse4-strcspn-vshft2.o
    335	      0	      0	    335	    14f	sse4-strcspn-vshft3.o
    335	      0	      0	    335	    14f	sse4-strcspn-vshft4.o
    324	      0	      0	    324	    144	sse4-strcspn-vshft5.o
    174	      0	      0	    174	     ae	varshift3.o
    256	      0	      0	    256	    100	varshift4.o
     31	      0	      0	     31	     1f	varshift5.o

The order of size with the smallest first is

Replace palignr with unaligned load, replace intrinsic with pshufb + unaligned load
Replace palignr with unaligned load, replace intrinsic with a function call
Replace palignr with unaligned load, replace intrinsic with pshufb
Replace palignr with unaligned load, replace intrinsic with asm statement
Replace palignr with unaligned load

H.J.


> -----Original Message-----
> From: Ulrich Drepper [mailto:drepper@redhat.com]
> Sent: Monday, August 23, 2010 10:05 PM
> To: Lu, Hongjiu
> Cc: libc-alpha@sourceware.org; Richard Henderson
> Subject: Re: [RFC] Clean up SSE variable shifts
> 
> ----- "Hongjiu Lu" <hongjiu.lu@intel.com> wrote:
> > Here are TSC deltas between different implementations. It is
> > hard to tell which one is faster.
> 
> Agreed.  This is, though, the best result.  It means the
> implementations really don't differ at all in micro-benchmarks.
> THerefore we can use the one with has the minimum resource use, in
> code and data.  Can you post that data, too?  I mean the 'size' for
> the various sections.
> 
> --
> â Ulrich Drepper â Red Hat, Inc. â 444 Castro St â Mountain View, CA â

Follow-Ups:
- Re: [RFC] Clean up SSE variable shifts
  - From: Ulrich Drepper

References:
- RE: [RFC] Clean up SSE variable shifts
  - From: Lu, Hongjiu
- Re: [RFC] Clean up SSE variable shifts
  - From: Ulrich Drepper

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]