This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Support separate benchmark outputs

From: OndÅej BÃlka <neleai at seznam dot cz>
To: Siddhesh Poyarekar <siddhesh dot poyarekar at gmail dot com>
Cc: Siddhesh Poyarekar <siddhesh at redhat dot com>, GNU C Library <libc-alpha at sourceware dot org>
Date: Tue, 16 Apr 2013 20:57:20 +0200
Subject: Re: [PATCH] Support separate benchmark outputs
References: <20130416122544 dot GH3063 at spoyarek dot pnq dot redhat dot com> <20130416132838 dot GA29626 at domone dot kolej dot mff dot cuni dot cz> <20130416140355 dot GI3063 at spoyarek dot pnq dot redhat dot com> <20130416155032 dot GA30216 at domone dot kolej dot mff dot cuni dot cz> <CAAHN_R1U_8_OMuzVKJ2q2w8i_XWYrx0mYmvHodBLxmNLiznSkQ at mail dot gmail dot com>

On Tue, Apr 16, 2013 at 11:09:46PM +0530, Siddhesh Poyarekar wrote:
> On 16 April 2013 21:20, OndÅej BÃlka <neleai@seznam.cz> wrote:
> > That is my point that you must measure relative performance. However
> > code above does not measure performance. In simple test
> 
> No, your idea of relative performance is different from mine -
> actually I can't call it mine since these are already existing tests
> (written by Jakub I think) that I'm only copying over.  

How do you want to use data that you get from these benchmarks?
Could you say that if I come with new implementation if firefox will run
faster or slower?

> From your
> description it seems like your definition of relative is the original
> memcpy vs the modified memcpy.  Here 'relative' implies comparison
> between multiple implementations of functions, i.e. the sse3, sse4,
> avx, etc. and then with the generic implementation and finally the
> simple byte copy/move/write, etc.

I   measure several implementations (glibc, modified, byte, qwords)
You measure several implementations (sse3,  sse4    , byte, avx)

Where is difference?

> 
> > According to sequential glibc implementation is better than my by 15%.
> > However when I sample randomly my implementation becomes 33% better than
> > glibc one.
> 
> There's a do_random_tests in memcpy (and possibly in others too, I
> haven't checked) that is there just for correctness tests.  It can be
> trivially modified to measure the cost of the calls and make into a
> reasonable random sampling benchmark.
> 
You can but not easily. You must avoid several pitfalls. See file 
tests/rand.c at my microbenchmark.

Follow-Ups:
- Re: [PATCH] Support separate benchmark outputs
  - From: Siddhesh Poyarekar

References:
- [PATCH] Support separate benchmark outputs
  - From: Siddhesh Poyarekar
- Re: [PATCH] Support separate benchmark outputs
  - From: OndÅej BÃlka
- Re: [PATCH] Support separate benchmark outputs
  - From: Siddhesh Poyarekar
- Re: [PATCH] Support separate benchmark outputs
  - From: OndÅej BÃlka
- Re: [PATCH] Support separate benchmark outputs
  - From: Siddhesh Poyarekar

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]