This is the mail archive of the glibc-cvs@sourceware.org mailing list for the glibc project.
GNU C Library master sources branch hjl/avx512f-mem/prefetcht1 created. glibc-2.22-677-g28abfee
- From: hjl at sourceware dot org
- To: glibc-cvs at sourceware dot org
- Date: 15 Jan 2016 20:58:52 -0000
- Subject: GNU C Library master sources branch hjl/avx512f-mem/prefetcht1 created. glibc-2.22-677-g28abfee
This is an automated email from the git hooks/post-receive script. It was
generated because a ref change was pushed to the repository containing
the project "GNU C Library master sources".
The branch, hjl/avx512f-mem/prefetcht1 has been created
at 28abfeeeea44baab0fc19da56ce005f14c6c98c4 (commit)
- Log -----------------------------------------------------------------
http://sourceware.org/git/gitweb.cgi?p=glibc.git;a=commitdiff;h=28abfeeeea44baab0fc19da56ce005f14c6c98c4
commit 28abfeeeea44baab0fc19da56ce005f14c6c98c4
Author: H.J. Lu <hjl.tools@gmail.com>
Date: Fri Jan 15 12:58:16 2016 -0800
Use prefetcht1 with non-temporal stores
diff --git a/sysdeps/x86_64/multiarch/memcpy-avx512-no-vzeroupper.S b/sysdeps/x86_64/multiarch/memcpy-avx512-no-vzeroupper.S
index 56e727b..754fce1 100644
--- a/sysdeps/x86_64/multiarch/memcpy-avx512-no-vzeroupper.S
+++ b/sysdeps/x86_64/multiarch/memcpy-avx512-no-vzeroupper.S
@@ -349,10 +349,10 @@ L(gobble_256bytes_nt_loop):
prefetcht1 0x240(%rsi)
prefetcht1 0x280(%rsi)
prefetcht1 0x2C0(%rsi)
- prefetchnta 0x300(%rsi)
- prefetchnta 0x340(%rsi)
- prefetchnta 0x380(%rsi)
- prefetchnta 0x3C0(%rsi)
+ prefetcht1 0x300(%rsi)
+ prefetcht1 0x340(%rsi)
+ prefetcht1 0x380(%rsi)
+ prefetcht1 0x3C0(%rsi)
vmovdqu64 (%rsi), %zmm0
vmovdqu64 0x40(%rsi), %zmm1
vmovdqu64 0x80(%rsi), %zmm2
@@ -387,10 +387,10 @@ L(gobble_256bytes_nt_loop_bkw):
prefetcht1 -0x2C0(%rcx)
prefetcht1 -0x280(%rcx)
prefetcht1 -0x240(%rcx)
- prefetchnta -0x400(%rcx)
- prefetchnta -0x3C0(%rcx)
- prefetchnta -0x380(%rcx)
- prefetchnta -0x340(%rcx)
+ prefetcht1 -0x400(%rcx)
+ prefetcht1 -0x3C0(%rcx)
+ prefetcht1 -0x380(%rcx)
+ prefetcht1 -0x340(%rcx)
vmovdqu64 -0x100(%rcx), %zmm0
vmovdqu64 -0xC0(%rcx), %zmm1
vmovdqu64 -0x80(%rcx), %zmm2
-----------------------------------------------------------------------
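For readers less familiar with the instructions in the diff above, the pattern it touches (software prefetch of the source with the T1 locality hint, followed by non-temporal stores to the destination) can be sketched in C with SSE2 intrinsics. This is only an illustration, not the glibc implementation: the function name copy_nt_prefetch_t1 and the PREFETCH_AHEAD constant are hypothetical, 16-byte SSE2 vectors stand in for the 64-byte zmm registers, and the bounds check on the prefetch exists only to keep the C pointer arithmetic well defined (the assembly in the diff prefetches unconditionally).

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2: _mm_loadu_si128, _mm_stream_si128 */
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#include <xmmintrin.h>  /* _mm_prefetch, _MM_HINT_T1, _mm_sfence */

/* Assumption for illustration: prefetch 0x300 bytes ahead of the
   current load position, mirroring one of the offsets in the diff. */
#define PREFETCH_AHEAD 0x300

/* Hypothetical helper, not part of glibc. Copies n bytes from src to
   dst using T1 prefetches on the source and non-temporal stores to the
   destination. dst must be 16-byte aligned; src may be unaligned. */
static void copy_nt_prefetch_t1(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;
    size_t i = 0;

    /* _mm_stream_si128 requires a 16-byte aligned destination. */
    assert(((uintptr_t)d & 15) == 0);

    for (; i + 64 <= n; i += 64) {
        /* T1 hint (prefetcht1) instead of the non-temporal hint
           (prefetchnta). The guard keeps s + i + PREFETCH_AHEAD inside
           the source buffer. */
        if (i + PREFETCH_AHEAD < n)
            _mm_prefetch((const char *)(s + i + PREFETCH_AHEAD), _MM_HINT_T1);

        /* Unaligned loads of one 64-byte block from the source. */
        __m128i a = _mm_loadu_si128((const __m128i *)(s + i));
        __m128i b = _mm_loadu_si128((const __m128i *)(s + i + 16));
        __m128i c = _mm_loadu_si128((const __m128i *)(s + i + 32));
        __m128i e = _mm_loadu_si128((const __m128i *)(s + i + 48));

        /* Non-temporal stores write the block to the destination with
           a hint to bypass the cache hierarchy. */
        _mm_stream_si128((__m128i *)(d + i), a);
        _mm_stream_si128((__m128i *)(d + i + 16), b);
        _mm_stream_si128((__m128i *)(d + i + 32), c);
        _mm_stream_si128((__m128i *)(d + i + 48), e);
    }

    /* Order the non-temporal stores before any later stores. */
    _mm_sfence();

    /* Copy the remaining tail (fewer than 64 bytes) normally. */
    memcpy(d + i, s + i, n - i);
}
```

A caller would pass a 16-byte aligned destination, for example a buffer declared with _Alignas(16) or obtained from aligned_alloc. Whether the T1 hint helps or hurts relative to the non-temporal hint depends on the microarchitecture and copy size; the commit itself gives no measurements, so none are implied here.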
hooks/post-receive
--
GNU C Library master sources