This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction
- From: Ling Ma <ling dot ma dot program at gmail dot com>
- To: Liubov Dmitrieva <liubov dot dmitrieva at gmail dot com>
- Cc: GNU C Library <libc-alpha at sourceware dot org>, Ondrej Bilka <neleai at seznam dot cz>, Ma Ling <ling dot ml at alibaba-inc dot com>
- Date: Fri, 12 Jul 2013 22:03:27 +0800
- Subject: Re: [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction
- References: <1373547096-8095-1-git-send-email-ling dot ma dot program at gmail dot com> <CAHjhQ91fVakxKNkEniz0AL-Srn3kNtLf+5AaB+VHozy5_z5zeA at mail dot gmail dot com>
2013/7/11, Liubov Dmitrieva <liubov.dmitrieva@gmail.com>:
> We need to check performance for core i7 with AVX before install this.
> As far as I understood you checked on Haswell only? But AVX works for
> more architectures than AVX2.
Ling:Sandy Bridge load & store 16bytes per cycle, the code is for
haswell platform 32bytes per cycle. Our experiment shows haswell also
enhanced non-temporary buffer, it can help us to pre-fetch data but
the same operation will hurt SandyBridge.
> You missed to fix Copyright: s/2010/2013
Ling: Ok
Thanks
Ling