diff options
author | Eric Biggers <ebiggers@google.com> | 2024-04-06 03:26:08 +0300 |
---|---|---|
committer | Herbert Xu <herbert@gondor.apana.org.au> | 2024-04-12 10:07:52 +0300 |
commit | 4ad096cca942959871d8ff73826d30f81f856f6e (patch) | |
tree | 967879ffa1f7a1e11897fddee63ee57e176c89dd /arch/x86/crypto/nh-avx2-x86_64.S | |
parent | 8f0e0cf74ccef41b383daddcf5447bba655031b3 (diff) | |
download | linux-4ad096cca942959871d8ff73826d30f81f856f6e.tar.xz |
crypto: x86/nh-avx2 - add missing vzeroupper
Since nh_avx2() uses ymm registers, execute vzeroupper before returning
from it. This is necessary to avoid reducing the performance of SSE
code.
Fixes: 0f961f9f670e ("crypto: x86/nhpoly1305 - add AVX2 accelerated NHPoly1305")
Signed-off-by: Eric Biggers <ebiggers@google.com>
Acked-by: Tim Chen <tim.c.chen@linux.intel.com>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Diffstat (limited to 'arch/x86/crypto/nh-avx2-x86_64.S')
-rw-r--r-- | arch/x86/crypto/nh-avx2-x86_64.S | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/arch/x86/crypto/nh-avx2-x86_64.S b/arch/x86/crypto/nh-avx2-x86_64.S index ef73a3ab8726..791386d9a83a 100644 --- a/arch/x86/crypto/nh-avx2-x86_64.S +++ b/arch/x86/crypto/nh-avx2-x86_64.S @@ -154,5 +154,6 @@ SYM_TYPED_FUNC_START(nh_avx2) vpaddq T1, T0, T0 vpaddq T4, T0, T0 vmovdqu T0, (HASH) + vzeroupper RET SYM_FUNC_END(nh_avx2) |