This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug string/19583] SSSE3_Fast_Copy_Backward flag needs to be enabled for AMD Excavator core


https://sourceware.org/bugzilla/show_bug.cgi?id=19583

--- Comment #4 from cvs-commit at gcc dot gnu.org <cvs-commit at gcc dot gnu.org> ---
This is an automated email from the git hooks/post-receive script. It was
generated because a ref change was pushed to the repository containing
the project "GNU C Library master sources".

The branch, master has been updated
       via  e41b395523040fcb58c7d378475720c2836d280c (commit)
      from  b66d837bb5398795c6b0f651bd5a5d66091d8577 (commit)

Those revisions listed above that are new to this repository have
not appeared on any other notification email; so we list those
revisions in full, below.

- Log -----------------------------------------------------------------
https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=e41b395523040fcb58c7d378475720c2836d280c

commit e41b395523040fcb58c7d378475720c2836d280c
Author: H.J. Lu <hjl.tools@gmail.com>
Date:   Mon Mar 28 04:39:48 2016 -0700

    [x86] Add a feature bit: Fast_Unaligned_Copy

    On AMD processors, memcpy optimized with unaligned SSE load is
    slower than emcpy optimized with aligned SSSE3 while other string
    functions are faster with unaligned SSE load.  A feature bit,
    Fast_Unaligned_Copy, is added to select memcpy optimized with
    unaligned SSE load.

        [BZ #19583]
        * sysdeps/x86/cpu-features.c (init_cpu_features): Set
        Fast_Unaligned_Copy with Fast_Unaligned_Load for Intel
        processors.  Set Fast_Copy_Backward for AMD Excavator
        processors.
        * sysdeps/x86/cpu-features.h (bit_arch_Fast_Unaligned_Copy):
        New.
        (index_arch_Fast_Unaligned_Copy): Likewise.
        * sysdeps/x86_64/multiarch/memcpy.S (__new_memcpy): Check
        Fast_Unaligned_Copy instead of Fast_Unaligned_Load.

-----------------------------------------------------------------------

Summary of changes:
 ChangeLog                         |   14 ++++++++++++++
 sysdeps/x86/cpu-features.c        |   14 +++++++++++++-
 sysdeps/x86/cpu-features.h        |    3 +++
 sysdeps/x86_64/multiarch/memcpy.S |    2 +-
 4 files changed, 31 insertions(+), 2 deletions(-)

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]