This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction

From: Ling Ma <ling dot ma dot program at gmail dot com>
To: Liubov Dmitrieva <liubov dot dmitrieva at gmail dot com>
Cc: GNU C Library <libc-alpha at sourceware dot org>, Ondrej Bilka <neleai at seznam dot cz>, Ma Ling <ling dot ml at alibaba-inc dot com>
Date: Fri, 12 Jul 2013 22:03:27 +0800
Subject: Re: [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction
References: <1373547096-8095-1-git-send-email-ling dot ma dot program at gmail dot com> <CAHjhQ91fVakxKNkEniz0AL-Srn3kNtLf+5AaB+VHozy5_z5zeA at mail dot gmail dot com>

2013/7/11, Liubov Dmitrieva <liubov.dmitrieva@gmail.com>:
> We need to check performance for core i7 with AVX before install this.
> As far as I understood you checked on Haswell only? But AVX works for
> more architectures than AVX2.
Ling:Sandy Bridge load & store 16bytes per cycle, the code is  for
haswell platform 32bytes per cycle. Our experiment shows haswell also
enhanced non-temporary buffer, it can help us to pre-fetch data but
the same operation will hurt SandyBridge.

> You missed to fix Copyright: s/2010/2013
Ling: Ok

Thanks
Ling

References:
- [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction
  - From: ling . ma . program
- Re: [PATCH RFC V2] Improve 64bit memcpy/memove for Corei7 with unaligned avx instruction
  - From: Liubov Dmitrieva

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]