This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH RFC] Imporve 64bit memcpy performance for Haswell CPU with AVX instruction


On Wed, Jun 25, 2014 at 7:45 AM, Ling Ma <ling.ma.program@gmail.com> wrote:
> By modifying test suite, we re-test 403.gcc in two parts: one is below
> 256bytes,
> the other is over 256bytes, The results as gzipped attachment shows
> (compared with pending sse2 memcpy):
> 1. when copy size is below 256 bytes, avx memcpy get almost the same
> performance because its instructions also use 16bytes registers.
>
> 2. when copy size is over 256bytes avx memcpy improve performance from
> 4.9% to 33% because its instructions use 32bytes registers.
>
> So avx memcpy avoid regression for small size and improve performance
> for big size.
>
> Thanks
> Ling
>

I'd like to get it in.  Any more feedbacks?

Thanks.


-- 
H.J.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]