unofficial mirror of libc-alpha@sourceware.org
 help / color / mirror / Atom feed
* [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S
@ 2022-11-18  6:37 Noah Goldstein via Libc-alpha
  2022-11-18  6:37 ` [PATCH v1 2/3] x86/fpu: Reformat svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: Noah Goldstein via Libc-alpha @ 2022-11-18  6:37 UTC (permalink / raw)
  To: libc-alpha; +Cc: goldstein.w.n, hjl.tools, carlos

The files are assembler so having the proper file extension is
convenient.  This doesn't change libm.so or libmvec.so.
---
 sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S          | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S          | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S          | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S          | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S          | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S       | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S        | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S        | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S        | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S        | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S     | 2 +-
 sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S        | 2 +-
 sysdeps/x86_64/fpu/svml_d_acos2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_acos4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_acos8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_acosh2_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_acosh4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_acosh8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_asin2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_asin4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_asin8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_asinh2_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_asinh4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_asinh8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan22_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan24_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan28_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_atan8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_atanh2_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atanh4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_atanh8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_cbrt2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_cbrt4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_cbrt8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_cos2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_cos4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_cos8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_cosh2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_cosh4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_cosh8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_erf2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_erf4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_erf8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_erfc2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_erfc4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_erfc8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp102_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp104_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp108_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp22_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp24_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp28_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_exp8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_expm12_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_expm14_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_expm18_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_hypot2_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_hypot4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_hypot8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log102_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log104_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log104_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_log108_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log1p2_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log1p4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_d_log1p8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_d_log22_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_log24_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_log24_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_log28_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_log2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_log4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_log4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_log8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_pow2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_pow4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_pow8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_sin2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_sin4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_sin8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_sincos2_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_d_sincos4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_d_sincos8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_d_sinh2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_sinh4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_sinh8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_tan2_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_tan4_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_d_tan8_core.S                           | 2 +-
 sysdeps/x86_64/fpu/svml_d_tanh2_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_tanh4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_d_tanh8_core.S                          | 2 +-
 .../fpu/{svml_d_wrapper_impl.h => svml_d_wrapper_impl.h.S}      | 0
 sysdeps/x86_64/fpu/svml_s_acosf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_acosf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_acosf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_acoshf16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_acoshf4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_acoshf8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinhf16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinhf4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinhf8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_atan2f16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_atan2f4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_atan2f8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanhf16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanhf4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanhf8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_cosf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_cosf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_cosf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_coshf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_coshf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_coshf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_erfcf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_erfcf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_erfcf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_erff16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_erff4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_erff8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp10f16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp10f4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp10f8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp2f16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp2f4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp2f8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_expf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_expf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_expf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_expm1f16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_expm1f4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_expm1f8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_hypotf16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_hypotf4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_hypotf8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_log10f16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_log10f4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_log10f8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_log1pf16_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_log1pf4_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_log1pf8_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S                    | 2 +-
 sysdeps/x86_64/fpu/svml_s_log2f16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_log2f4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_log2f8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_logf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_logf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_logf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_powf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_powf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_powf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_sincosf16_core.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_sincosf4_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_sincosf8_core.S                       | 2 +-
 sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S                   | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinhf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinhf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinhf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S                     | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanf16_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanf4_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanf8_core.S                          | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S                      | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanhf16_core.S                        | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanhf4_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanhf8_core.S                         | 2 +-
 sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S                     | 2 +-
 .../fpu/{svml_s_wrapper_impl.h => svml_s_wrapper_impl.h.S}      | 0
 230 files changed, 228 insertions(+), 228 deletions(-)
 rename sysdeps/x86_64/fpu/{svml_d_wrapper_impl.h => svml_d_wrapper_impl.h.S} (100%)
 rename sysdeps/x86_64/fpu/{svml_s_wrapper_impl.h => svml_s_wrapper_impl.h.S} (100%)

diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
index 86543d50d3..b2675d9647 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_trig_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_cos_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
index bccc56858d..98f2bc2e41 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_exp_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_exp_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
index 230ebe8ceb..20d4d967e0 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_log_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_log_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
index 963fbe3662..a4c6c27147 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_pow_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 /* ALGORITHM DESCRIPTION:
 
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
index 54bd8679d1..f3de17b413 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_trig_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_sin_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
index f6297e0024..a05a900ede 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_d_trig_data.h"
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 /*
    ALGORITHM DESCRIPTION:
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
index b0612c9357..5929fc0203 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_trig_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_cosf_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
index 599b67f83b..71ed573a6e 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_expf_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_expf_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
index 0070b0c419..b0062a1eb5 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_logf_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_logf_knl)
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
index 306895838b..c05960bc31 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_powf_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 /*
    ALGORITHM DESCRIPTION:
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
index 9a2aebb023..9ab6af69be 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_trig_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 /*
    ALGORITHM DESCRIPTION:
diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
index 33b210fac0..0c037fd663 100644
--- a/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
+++ b/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
@@ -18,7 +18,7 @@
 
 #include <sysdep.h>
 #include "svml_s_trig_data.h"
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY(_ZGVeN16v_sinf_knl)
diff --git a/sysdeps/x86_64/fpu/svml_d_acos2_core.S b/sysdeps/x86_64/fpu/svml_d_acos2_core.S
index 8c573fcff1..1022d7e4e7 100644
--- a/sysdeps/x86_64/fpu/svml_d_acos2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acos2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_acos)
diff --git a/sysdeps/x86_64/fpu/svml_d_acos4_core.S b/sysdeps/x86_64/fpu/svml_d_acos4_core.S
index ef2253f3dd..b4a8990f5a 100644
--- a/sysdeps/x86_64/fpu/svml_d_acos4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acos4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_acos)
diff --git a/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
index cc5377850c..48e610f4ef 100644
--- a/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_acos)
diff --git a/sysdeps/x86_64/fpu/svml_d_acos8_core.S b/sysdeps/x86_64/fpu/svml_d_acos8_core.S
index 3bf8d8dcdf..8f8017a72d 100644
--- a/sysdeps/x86_64/fpu/svml_d_acos8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acos8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_acos)
diff --git a/sysdeps/x86_64/fpu/svml_d_acosh2_core.S b/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
index ab92795d13..23d22de0fe 100644
--- a/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_acosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_acosh4_core.S b/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
index 91ff3018c1..5bef8065f7 100644
--- a/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_acosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
index 84cbe5871c..95dd9ed670 100644
--- a/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_acosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_acosh8_core.S b/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
index 0b4521ce64..6277eebc1e 100644
--- a/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_acosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_asin2_core.S b/sysdeps/x86_64/fpu/svml_d_asin2_core.S
index 911625030f..13c186ac08 100644
--- a/sysdeps/x86_64/fpu/svml_d_asin2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asin2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_asin)
diff --git a/sysdeps/x86_64/fpu/svml_d_asin4_core.S b/sysdeps/x86_64/fpu/svml_d_asin4_core.S
index 222a0b8f73..50ea5a4207 100644
--- a/sysdeps/x86_64/fpu/svml_d_asin4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asin4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_asin)
diff --git a/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
index 04ff322ea6..6de73c97cf 100644
--- a/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_asin)
diff --git a/sysdeps/x86_64/fpu/svml_d_asin8_core.S b/sysdeps/x86_64/fpu/svml_d_asin8_core.S
index bea94b954c..aaf93dc8e9 100644
--- a/sysdeps/x86_64/fpu/svml_d_asin8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asin8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_asin)
diff --git a/sysdeps/x86_64/fpu/svml_d_asinh2_core.S b/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
index a621fae3fb..00723458e4 100644
--- a/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_asinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_asinh4_core.S b/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
index a0332eca57..f2e13c3a0a 100644
--- a/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_asinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
index bc365d7dd7..51324b697f 100644
--- a/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_asinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_asinh8_core.S b/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
index 2271466d99..3c32741b39 100644
--- a/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_asinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan22_core.S b/sysdeps/x86_64/fpu/svml_d_atan22_core.S
index 7f40f1be6f..6b3cef4269 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan22_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan22_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2vv_atan2)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan24_core.S b/sysdeps/x86_64/fpu/svml_d_atan24_core.S
index afb8296d80..7f76bf166c 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan24_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan24_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4vv_atan2)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
index 81db9cd297..7e8fead6e0 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4vv_atan2)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan28_core.S b/sysdeps/x86_64/fpu/svml_d_atan28_core.S
index 9c25de34b6..2c710eadac 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan28_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan28_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8vv_atan2)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan2_core.S b/sysdeps/x86_64/fpu/svml_d_atan2_core.S
index bd4020ab69..7ea434fec5 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_atan)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan4_core.S b/sysdeps/x86_64/fpu/svml_d_atan4_core.S
index 89737790f4..06e3e2a870 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_atan)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
index 8abe407abe..73a8ab8ebb 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_atan)
diff --git a/sysdeps/x86_64/fpu/svml_d_atan8_core.S b/sysdeps/x86_64/fpu/svml_d_atan8_core.S
index cea8caf797..dfb6f788bc 100644
--- a/sysdeps/x86_64/fpu/svml_d_atan8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atan8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_atan)
diff --git a/sysdeps/x86_64/fpu/svml_d_atanh2_core.S b/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
index 8c300fbc09..f112f43793 100644
--- a/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_atanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_atanh4_core.S b/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
index df63d2d655..ba74bcfeb5 100644
--- a/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_atanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
index 0bb51b2d60..9d1fa7f8c7 100644
--- a/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_atanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_atanh8_core.S b/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
index 956230e978..24011dad24 100644
--- a/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_atanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
index 5ca1129c7f..3a3152000b 100644
--- a/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_cbrt)
diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
index c3948ec480..31e0bc46db 100644
--- a/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_cbrt)
diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
index b64825e876..8b4122c7ad 100644
--- a/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_cbrt)
diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
index 312f015706..0aac584981 100644
--- a/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_cbrt)
diff --git a/sysdeps/x86_64/fpu/svml_d_cos2_core.S b/sysdeps/x86_64/fpu/svml_d_cos2_core.S
index 02a6583776..d2ea460b50 100644
--- a/sysdeps/x86_64/fpu/svml_d_cos2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cos2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_cos)
diff --git a/sysdeps/x86_64/fpu/svml_d_cos4_core.S b/sysdeps/x86_64/fpu/svml_d_cos4_core.S
index f066ceaf4c..1f09e366c2 100644
--- a/sysdeps/x86_64/fpu/svml_d_cos4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cos4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_cos)
diff --git a/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
index 5212078c9c..e96205ec00 100644
--- a/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_cos)
diff --git a/sysdeps/x86_64/fpu/svml_d_cos8_core.S b/sysdeps/x86_64/fpu/svml_d_cos8_core.S
index 2ac8e4687a..ef3c7a0a71 100644
--- a/sysdeps/x86_64/fpu/svml_d_cos8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cos8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_cos)
diff --git a/sysdeps/x86_64/fpu/svml_d_cosh2_core.S b/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
index 6aa76d40ed..55b4ae267b 100644
--- a/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_cosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_cosh4_core.S b/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
index d3a7bec3f4..9a99bd3fab 100644
--- a/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_cosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
index 73612e33bc..8a514c8790 100644
--- a/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_cosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_cosh8_core.S b/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
index e89d2c23fb..35ab7f6ee9 100644
--- a/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_cosh)
diff --git a/sysdeps/x86_64/fpu/svml_d_erf2_core.S b/sysdeps/x86_64/fpu/svml_d_erf2_core.S
index 828dd59cb3..0622c3caac 100644
--- a/sysdeps/x86_64/fpu/svml_d_erf2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erf2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_erf)
diff --git a/sysdeps/x86_64/fpu/svml_d_erf4_core.S b/sysdeps/x86_64/fpu/svml_d_erf4_core.S
index 9432824d8a..6f95c5feef 100644
--- a/sysdeps/x86_64/fpu/svml_d_erf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_erf)
diff --git a/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
index e3f73f9044..4815f8468a 100644
--- a/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_erf)
diff --git a/sysdeps/x86_64/fpu/svml_d_erf8_core.S b/sysdeps/x86_64/fpu/svml_d_erf8_core.S
index fde688f5dd..079f423284 100644
--- a/sysdeps/x86_64/fpu/svml_d_erf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_erf)
diff --git a/sysdeps/x86_64/fpu/svml_d_erfc2_core.S b/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
index 530d358b8b..cb239f82e8 100644
--- a/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_erfc)
diff --git a/sysdeps/x86_64/fpu/svml_d_erfc4_core.S b/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
index c6b3e2c367..2b42b3af83 100644
--- a/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_erfc)
diff --git a/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
index e81ca48897..9e16bf2ec5 100644
--- a/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_erfc)
diff --git a/sysdeps/x86_64/fpu/svml_d_erfc8_core.S b/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
index b3d31fec27..f781a7af9a 100644
--- a/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_erfc)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp102_core.S b/sysdeps/x86_64/fpu/svml_d_exp102_core.S
index 8e6131d717..d6d7da6fec 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp102_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp102_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_exp10)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp104_core.S b/sysdeps/x86_64/fpu/svml_d_exp104_core.S
index e226bb88bb..e0c651f904 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp104_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp104_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_exp10)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
index 8758d35449..ca3931eb8b 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_exp10)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp108_core.S b/sysdeps/x86_64/fpu/svml_d_exp108_core.S
index 745d3c7ec7..ad2ba994c0 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp108_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp108_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_exp10)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp22_core.S b/sysdeps/x86_64/fpu/svml_d_exp22_core.S
index 3639892544..4205e5510d 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp22_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp22_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_exp2)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp24_core.S b/sysdeps/x86_64/fpu/svml_d_exp24_core.S
index aea3afc3e3..4be1ee4e48 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp24_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp24_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_exp2)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
index 68f28b8f63..b09da67af1 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_exp2)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp28_core.S b/sysdeps/x86_64/fpu/svml_d_exp28_core.S
index dc0b3e77d0..0e8e027713 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp28_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp28_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_exp2)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp2_core.S b/sysdeps/x86_64/fpu/svml_d_exp2_core.S
index 14e41b9707..15109ef829 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_exp)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp4_core.S b/sysdeps/x86_64/fpu/svml_d_exp4_core.S
index cf91c08ab4..de35ab001f 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_exp)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
index b1ed2bc0ae..452558041c 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_exp)
diff --git a/sysdeps/x86_64/fpu/svml_d_exp8_core.S b/sysdeps/x86_64/fpu/svml_d_exp8_core.S
index ef7d62c1ec..88e663fa9b 100644
--- a/sysdeps/x86_64/fpu/svml_d_exp8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_exp8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_exp)
diff --git a/sysdeps/x86_64/fpu/svml_d_expm12_core.S b/sysdeps/x86_64/fpu/svml_d_expm12_core.S
index b5bfd598a8..315d12e2e9 100644
--- a/sysdeps/x86_64/fpu/svml_d_expm12_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_expm12_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_expm1)
diff --git a/sysdeps/x86_64/fpu/svml_d_expm14_core.S b/sysdeps/x86_64/fpu/svml_d_expm14_core.S
index 1986f086ff..902a85bd4a 100644
--- a/sysdeps/x86_64/fpu/svml_d_expm14_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_expm14_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_expm1)
diff --git a/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S b/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
index 333a7011ae..f8f7d15759 100644
--- a/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_expm1)
diff --git a/sysdeps/x86_64/fpu/svml_d_expm18_core.S b/sysdeps/x86_64/fpu/svml_d_expm18_core.S
index d008c4273d..513688ebf5 100644
--- a/sysdeps/x86_64/fpu/svml_d_expm18_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_expm18_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_expm1)
diff --git a/sysdeps/x86_64/fpu/svml_d_hypot2_core.S b/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
index 6532e46a8b..c746560128 100644
--- a/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2vv_hypot)
diff --git a/sysdeps/x86_64/fpu/svml_d_hypot4_core.S b/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
index 1383ac5304..aa1e2b330b 100644
--- a/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4vv_hypot)
diff --git a/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
index 6b1b165a2c..7028c55eab 100644
--- a/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4vv_hypot)
diff --git a/sysdeps/x86_64/fpu/svml_d_hypot8_core.S b/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
index ec73ba6184..10c831fd08 100644
--- a/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8vv_hypot)
diff --git a/sysdeps/x86_64/fpu/svml_d_log102_core.S b/sysdeps/x86_64/fpu/svml_d_log102_core.S
index 9e0aa00f7e..a63e82bdbc 100644
--- a/sysdeps/x86_64/fpu/svml_d_log102_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log102_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_log10)
diff --git a/sysdeps/x86_64/fpu/svml_d_log104_core.S b/sysdeps/x86_64/fpu/svml_d_log104_core.S
index cc5e285dee..051ad8284a 100644
--- a/sysdeps/x86_64/fpu/svml_d_log104_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log104_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_log10)
diff --git a/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
index ea1cb93a9d..f08b78e35e 100644
--- a/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_log10)
diff --git a/sysdeps/x86_64/fpu/svml_d_log108_core.S b/sysdeps/x86_64/fpu/svml_d_log108_core.S
index 59d5835cc7..8b5a59cf00 100644
--- a/sysdeps/x86_64/fpu/svml_d_log108_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log108_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_log10)
diff --git a/sysdeps/x86_64/fpu/svml_d_log1p2_core.S b/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
index b40e238e11..058e6d47da 100644
--- a/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_log1p)
diff --git a/sysdeps/x86_64/fpu/svml_d_log1p4_core.S b/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
index 2c76849ca8..7c27ab9a5c 100644
--- a/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_log1p)
diff --git a/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
index 42ace645e7..8b0b20f018 100644
--- a/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_log1p)
diff --git a/sysdeps/x86_64/fpu/svml_d_log1p8_core.S b/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
index cca39f1847..1c6be998d7 100644
--- a/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_log1p)
diff --git a/sysdeps/x86_64/fpu/svml_d_log22_core.S b/sysdeps/x86_64/fpu/svml_d_log22_core.S
index 8c8464b2f8..5e998a38e1 100644
--- a/sysdeps/x86_64/fpu/svml_d_log22_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log22_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_log2)
diff --git a/sysdeps/x86_64/fpu/svml_d_log24_core.S b/sysdeps/x86_64/fpu/svml_d_log24_core.S
index b75936f785..b2c3c4a727 100644
--- a/sysdeps/x86_64/fpu/svml_d_log24_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log24_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_log2)
diff --git a/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
index f4a946033f..eaeee60be5 100644
--- a/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_log2)
diff --git a/sysdeps/x86_64/fpu/svml_d_log28_core.S b/sysdeps/x86_64/fpu/svml_d_log28_core.S
index e547d8431b..e81a0d27d7 100644
--- a/sysdeps/x86_64/fpu/svml_d_log28_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log28_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_log2)
diff --git a/sysdeps/x86_64/fpu/svml_d_log2_core.S b/sysdeps/x86_64/fpu/svml_d_log2_core.S
index 1e9790d78c..e38d58d465 100644
--- a/sysdeps/x86_64/fpu/svml_d_log2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_log)
diff --git a/sysdeps/x86_64/fpu/svml_d_log4_core.S b/sysdeps/x86_64/fpu/svml_d_log4_core.S
index cac601bb62..ab2801d343 100644
--- a/sysdeps/x86_64/fpu/svml_d_log4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_log)
diff --git a/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
index f769844917..ace1ac535a 100644
--- a/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_log)
diff --git a/sysdeps/x86_64/fpu/svml_d_log8_core.S b/sysdeps/x86_64/fpu/svml_d_log8_core.S
index 56284de286..37c0bcce11 100644
--- a/sysdeps/x86_64/fpu/svml_d_log8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_log8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_log)
diff --git a/sysdeps/x86_64/fpu/svml_d_pow2_core.S b/sysdeps/x86_64/fpu/svml_d_pow2_core.S
index f8cf580f76..7142d6ece4 100644
--- a/sysdeps/x86_64/fpu/svml_d_pow2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_pow2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2vv_pow)
diff --git a/sysdeps/x86_64/fpu/svml_d_pow4_core.S b/sysdeps/x86_64/fpu/svml_d_pow4_core.S
index 4cf403230d..572ba50519 100644
--- a/sysdeps/x86_64/fpu/svml_d_pow4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_pow4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4vv_pow)
diff --git a/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
index f72e9532f3..e8f7c56d82 100644
--- a/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4vv_pow)
diff --git a/sysdeps/x86_64/fpu/svml_d_pow8_core.S b/sysdeps/x86_64/fpu/svml_d_pow8_core.S
index 0b99cf6570..3f7d63cad0 100644
--- a/sysdeps/x86_64/fpu/svml_d_pow8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_pow8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8vv_pow)
diff --git a/sysdeps/x86_64/fpu/svml_d_sin2_core.S b/sysdeps/x86_64/fpu/svml_d_sin2_core.S
index 40f23bf473..c101822478 100644
--- a/sysdeps/x86_64/fpu/svml_d_sin2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sin2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_sin)
diff --git a/sysdeps/x86_64/fpu/svml_d_sin4_core.S b/sysdeps/x86_64/fpu/svml_d_sin4_core.S
index f7c6e5de88..53464f52e0 100644
--- a/sysdeps/x86_64/fpu/svml_d_sin4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sin4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_sin)
diff --git a/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
index 694ac1977a..d291d020dc 100644
--- a/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_sin)
diff --git a/sysdeps/x86_64/fpu/svml_d_sin8_core.S b/sysdeps/x86_64/fpu/svml_d_sin8_core.S
index c3c978a843..03dbf25620 100644
--- a/sysdeps/x86_64/fpu/svml_d_sin8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sin8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_sin)
diff --git a/sysdeps/x86_64/fpu/svml_d_sincos2_core.S b/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
index 617e206870..3e9f604ead 100644
--- a/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2vl8l8_sincos)
diff --git a/sysdeps/x86_64/fpu/svml_d_sincos4_core.S b/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
index 53d491102e..b20361129b 100644
--- a/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4vl8l8_sincos)
diff --git a/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
index ef798926f3..242c812c7c 100644
--- a/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4vl8l8_sincos)
diff --git a/sysdeps/x86_64/fpu/svml_d_sincos8_core.S b/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
index d9ae0bedd0..6a39b7dbb4 100644
--- a/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8vl8l8_sincos)
diff --git a/sysdeps/x86_64/fpu/svml_d_sinh2_core.S b/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
index 205ad3b582..ab96636d0e 100644
--- a/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_sinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_sinh4_core.S b/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
index 8f8bc794d6..9017582a06 100644
--- a/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_sinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
index 771fb2cd78..71aeb8d84d 100644
--- a/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_sinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_sinh8_core.S b/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
index 0279264b30..1dd49a9dbb 100644
--- a/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_sinh)
diff --git a/sysdeps/x86_64/fpu/svml_d_tan2_core.S b/sysdeps/x86_64/fpu/svml_d_tan2_core.S
index 6fdc334a40..f127f7819c 100644
--- a/sysdeps/x86_64/fpu/svml_d_tan2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tan2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_tan)
diff --git a/sysdeps/x86_64/fpu/svml_d_tan4_core.S b/sysdeps/x86_64/fpu/svml_d_tan4_core.S
index 50e5331768..c1325c0984 100644
--- a/sysdeps/x86_64/fpu/svml_d_tan4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tan4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_tan)
diff --git a/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
index e4b1f1ed93..0088977a95 100644
--- a/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_tan)
diff --git a/sysdeps/x86_64/fpu/svml_d_tan8_core.S b/sysdeps/x86_64/fpu/svml_d_tan8_core.S
index fd26488470..b92659d894 100644
--- a/sysdeps/x86_64/fpu/svml_d_tan8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tan8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_tan)
diff --git a/sysdeps/x86_64/fpu/svml_d_tanh2_core.S b/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
index 1970e4622c..867adfaeef 100644
--- a/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN2v_tanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_tanh4_core.S b/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
index 1205efdec8..720539fa07 100644
--- a/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN4v_tanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
index c42f63c865..270a76e495 100644
--- a/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVcN4v_tanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_tanh8_core.S b/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
index fe7c82370d..0ec89c2636 100644
--- a/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
+++ b/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_d_wrapper_impl.h"
+#include "svml_d_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN8v_tanh)
diff --git a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
similarity index 100%
rename from sysdeps/x86_64/fpu/svml_d_wrapper_impl.h
rename to sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
diff --git a/sysdeps/x86_64/fpu/svml_s_acosf16_core.S b/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
index 8dbe96d505..054713e328 100644
--- a/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_acosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acosf4_core.S b/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
index aeb82221f8..4160c34fb4 100644
--- a/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_acosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acosf8_core.S b/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
index 189f81005e..8e2efc1146 100644
--- a/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_acosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
index fb432639ab..f0ee911cbe 100644
--- a/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_acosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
index e5ece41ec6..391f1af733 100644
--- a/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_acoshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
index d728e9f54f..ecb8705328 100644
--- a/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_acoshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
index 1f8352077e..cbac2627b7 100644
--- a/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_acoshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
index b9fc52c289..ead2bf3f2b 100644
--- a/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_acoshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinf16_core.S b/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
index 54e947edd1..717d340b24 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_asinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinf4_core.S b/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
index 10bddc352b..6b94efd9d3 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_asinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinf8_core.S b/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
index 77f4d83a2c..4e490cbd5f 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_asinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
index 6202b4d4cf..cde5528197 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_asinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
index 16de57456e..ffb49184f7 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_asinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
index 07dacc599f..ad4acbbd66 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_asinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
index b874f4256b..41a1527162 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_asinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
index b00a52e005..44607cd889 100644
--- a/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_asinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
index 345792d376..612b5af828 100644
--- a/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16vv_atan2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
index 536b739f04..a3fb092911 100644
--- a/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4vv_atan2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
index 67376ca528..2f694b6486 100644
--- a/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8vv_atan2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
index 1c670d627f..06c856191b 100644
--- a/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8vv_atan2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanf16_core.S b/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
index b1471451a0..59a0267aa0 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_atanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanf4_core.S b/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
index ea118a7984..2567206463 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_atanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanf8_core.S b/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
index 8fad748544..b7961c7dbc 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_atanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
index 1216b8a057..bedb807e1f 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_atanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
index 96dce010ec..240d943d6c 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_atanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
index 1c1ec2bd0f..af4f58944f 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_atanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
index d92825330e..d23d62a18b 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_atanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
index 54d0b77b08..e5ec0c8eac 100644
--- a/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_atanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
index efe9149807..726631e734 100644
--- a/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_cbrtf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
index 687c549d7f..44e4f9f678 100644
--- a/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_cbrtf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
index 2bb6fe8e78..e202f42255 100644
--- a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_cbrtf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
index 592f2cd419..b5f0f8ebdb 100644
--- a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_cbrtf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cosf16_core.S b/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
index 8daeffc289..8775f8da9c 100644
--- a/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_cosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cosf4_core.S b/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
index c6c3a4a622..5d9806898b 100644
--- a/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_cosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cosf8_core.S b/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
index a4033da6de..5730e4265d 100644
--- a/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_cosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
index 12c6fddfa8..ab49929b56 100644
--- a/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_cosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_coshf16_core.S b/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
index 97667390aa..8668f5bc86 100644
--- a/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_coshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_coshf4_core.S b/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
index 8803c5e70b..82b893cd76 100644
--- a/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_coshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_coshf8_core.S b/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
index 85b144938e..1527b69e28 100644
--- a/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_coshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
index 8e4e1c19bb..f4ccf65eb9 100644
--- a/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_coshf)
diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
index 0a14a1a4da..ca5a3cae8a 100644
--- a/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_erfcf)
diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
index 0a7e3b79ee..ee98c24e3e 100644
--- a/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_erfcf)
diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
index 8122963d73..ef1f77458b 100644
--- a/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_erfcf)
diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
index 4f347c4d3c..803c9533c2 100644
--- a/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_erfcf)
diff --git a/sysdeps/x86_64/fpu/svml_s_erff16_core.S b/sysdeps/x86_64/fpu/svml_s_erff16_core.S
index d544a6742d..95be88c16d 100644
--- a/sysdeps/x86_64/fpu/svml_s_erff16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erff16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_erff)
diff --git a/sysdeps/x86_64/fpu/svml_s_erff4_core.S b/sysdeps/x86_64/fpu/svml_s_erff4_core.S
index 4a83c8b6cc..3f9aace76b 100644
--- a/sysdeps/x86_64/fpu/svml_s_erff4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erff4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_erff)
diff --git a/sysdeps/x86_64/fpu/svml_s_erff8_core.S b/sysdeps/x86_64/fpu/svml_s_erff8_core.S
index d91b5ad0c2..eb018cde99 100644
--- a/sysdeps/x86_64/fpu/svml_s_erff8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_erff8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_erff)
diff --git a/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
index b2a8e7a152..87ded07bf1 100644
--- a/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_erff)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
index 6e59aa2d45..64bfc69929 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_exp10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
index 3ebc7ed06d..652231cacf 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_exp10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
index 338caf3fc2..61a9b908ca 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_exp10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
index e0b6202662..f436570d36 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_exp10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
index 2a73400a0b..0d3d6c0c9d 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_exp2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
index ef6d68974c..05975db34f 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_exp2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
index f7efe3736c..0dbeb46254 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_exp2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
index b394654549..77e67248a7 100644
--- a/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_exp2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_expf16_core.S b/sysdeps/x86_64/fpu/svml_s_expf16_core.S
index cb3fc1f333..b99898deb3 100644
--- a/sysdeps/x86_64/fpu/svml_s_expf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_expf)
diff --git a/sysdeps/x86_64/fpu/svml_s_expf4_core.S b/sysdeps/x86_64/fpu/svml_s_expf4_core.S
index 587e90bf33..2aaefa163e 100644
--- a/sysdeps/x86_64/fpu/svml_s_expf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expf4_core.S
@@ -18,7 +18,7 @@
 
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_expf)
diff --git a/sysdeps/x86_64/fpu/svml_s_expf8_core.S b/sysdeps/x86_64/fpu/svml_s_expf8_core.S
index 01ca10da37..6f14117d74 100644
--- a/sysdeps/x86_64/fpu/svml_s_expf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_expf)
diff --git a/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
index a3ede96599..6e2cd7b8af 100644
--- a/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8v_expf)
diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
index 082cf1e6e5..7f3d9826ae 100644
--- a/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_expm1f)
diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
index c1b1860728..13f5ccdf3a 100644
--- a/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_expm1f)
diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
index a39538dfd7..3f69069585 100644
--- a/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_expm1f)
diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
index eaaca17ce7..dc6a7a4973 100644
--- a/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_expm1f)
diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
index 34ca783049..672f05269c 100644
--- a/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16vv_hypotf)
diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
index e1d8ad19a6..dd09fad865 100644
--- a/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4vv_hypotf)
diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
index f5aa9f1e10..6d87708238 100644
--- a/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8vv_hypotf)
diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
index 707e91d7d3..7edb0d3cf2 100644
--- a/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8vv_hypotf)
diff --git a/sysdeps/x86_64/fpu/svml_s_log10f16_core.S b/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
index 8544bfb78c..e9df420be5 100644
--- a/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_log10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log10f4_core.S b/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
index 82188bedf4..7ac8512459 100644
--- a/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_log10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log10f8_core.S b/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
index 03eea3415c..9f6006d2d1 100644
--- a/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_log10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
index 9943a86eb5..b811d10876 100644
--- a/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_log10f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
index 9e70217b1e..20c838b4de 100644
--- a/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_log1pf)
diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
index 4de7f84564..2e179b91e6 100644
--- a/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_log1pf)
diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
index edf35f29ca..9a24ad132f 100644
--- a/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_log1pf)
diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
index 1acd26070f..3055ce8d96 100644
--- a/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_log1pf)
diff --git a/sysdeps/x86_64/fpu/svml_s_log2f16_core.S b/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
index 1a21fe6996..9305f1fc2d 100644
--- a/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_log2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log2f4_core.S b/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
index fd28c9e43b..4bcc88b349 100644
--- a/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_log2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log2f8_core.S b/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
index 8f6254f7d2..db51a650be 100644
--- a/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_log2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
index 6a1832faa4..743fcd3d91 100644
--- a/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_log2f)
diff --git a/sysdeps/x86_64/fpu/svml_s_logf16_core.S b/sysdeps/x86_64/fpu/svml_s_logf16_core.S
index d9df2536d2..2de06af230 100644
--- a/sysdeps/x86_64/fpu/svml_s_logf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_logf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_logf)
diff --git a/sysdeps/x86_64/fpu/svml_s_logf4_core.S b/sysdeps/x86_64/fpu/svml_s_logf4_core.S
index 8300d4464e..6edb8f6cec 100644
--- a/sysdeps/x86_64/fpu/svml_s_logf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_logf4_core.S
@@ -18,7 +18,7 @@
 
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_logf)
diff --git a/sysdeps/x86_64/fpu/svml_s_logf8_core.S b/sysdeps/x86_64/fpu/svml_s_logf8_core.S
index 2d779866f4..df57322134 100644
--- a/sysdeps/x86_64/fpu/svml_s_logf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_logf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_logf)
diff --git a/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
index 8e77a9ea2d..a78dca2ce5 100644
--- a/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8v_logf)
diff --git a/sysdeps/x86_64/fpu/svml_s_powf16_core.S b/sysdeps/x86_64/fpu/svml_s_powf16_core.S
index 117d907007..69fef6a25d 100644
--- a/sysdeps/x86_64/fpu/svml_s_powf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_powf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16vv_powf)
diff --git a/sysdeps/x86_64/fpu/svml_s_powf4_core.S b/sysdeps/x86_64/fpu/svml_s_powf4_core.S
index d328436a0a..f28a9ed9d5 100644
--- a/sysdeps/x86_64/fpu/svml_s_powf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_powf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4vv_powf)
diff --git a/sysdeps/x86_64/fpu/svml_s_powf8_core.S b/sysdeps/x86_64/fpu/svml_s_powf8_core.S
index 881c841eb5..ace548b93e 100644
--- a/sysdeps/x86_64/fpu/svml_s_powf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_powf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8vv_powf)
diff --git a/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
index 7d71c56106..d269163620 100644
--- a/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8vv_powf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
index c46f34cc33..6b13904f32 100644
--- a/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16vl4l4_sincosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
index a2c38364fb..5e8e68cb46 100644
--- a/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4vl4l4_sincosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
index c6e9d544a2..62d4ed96a0 100644
--- a/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8vl4l4_sincosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
index fe5fbd929d..39d3b6135c 100644
--- a/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8vl4l4_sincosf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinf16_core.S b/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
index d456e801b6..32a89294f6 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_sinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinf4_core.S b/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
index 5b89563158..ad7e12f8da 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
@@ -18,7 +18,7 @@
 
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_sinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinf8_core.S b/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
index f1bf44fa6f..bb80a43360 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_sinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
index 14de999a89..a296e1a233 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY(_ZGVcN8v_sinf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
index da1269c156..666270e804 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_sinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
index c233bf1ac6..50b36b81a6 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_sinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
index 9e301d8896..1d2913eb88 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_sinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
index f21eb1e26c..555ca75a04 100644
--- a/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_sinhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanf16_core.S b/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
index f8ce7880fb..1e66c2c186 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_tanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanf4_core.S b/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
index 3b6c0603c8..868a6ab327 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_tanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanf8_core.S b/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
index 084f2b4c95..92a2e583c0 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_tanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
index 8d42d3bec4..0b506755d8 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_tanf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
index c25c5925a4..b7166b480a 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVeN16v_tanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
index a6cf56cbe4..5fe52830b1 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVbN4v_tanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
index 610c4e0759..9ea6b9bc8c 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
 	.text
 ENTRY (_ZGVdN8v_tanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
index 5e9eca577f..9230746e01 100644
--- a/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
+++ b/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
@@ -17,7 +17,7 @@
    <https://www.gnu.org/licenses/>.  */
 
 #include <sysdep.h>
-#include "svml_s_wrapper_impl.h"
+#include "svml_s_wrapper_impl.h.S"
 
         .text
 ENTRY (_ZGVcN8v_tanhf)
diff --git a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
similarity index 100%
rename from sysdeps/x86_64/fpu/svml_s_wrapper_impl.h
rename to sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
-- 
2.34.1


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* [PATCH v1 2/3] x86/fpu: Reformat svml_{s|d}_wrapper_impl.h.S
  2022-11-18  6:37 [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
@ 2022-11-18  6:37 ` Noah Goldstein via Libc-alpha
  2022-11-18  6:37 ` [PATCH v1 3/3] x86/fpu: Cleanup code in svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
  2022-11-18 15:49 ` [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S H.J. Lu via Libc-alpha
  2 siblings, 0 replies; 4+ messages in thread
From: Noah Goldstein via Libc-alpha @ 2022-11-18  6:37 UTC (permalink / raw)
  To: libc-alpha; +Cc: goldstein.w.n, hjl.tools, carlos

Just reformat with the style convention used in other x86 assembler
files.  This doesn't change libm.so or libmvec.so.
---
 sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S | 474 +++++++++---------
 sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S | 546 ++++++++++-----------
 2 files changed, 510 insertions(+), 510 deletions(-)

diff --git a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
index 2334713015..b03a2122b9 100644
--- a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
+++ b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
@@ -18,273 +18,273 @@
 
 /* SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2 callee
-        subq      $40, %rsp
-        cfi_adjust_cfa_offset(40)
-        movaps    %xmm0, (%rsp)
-        call      JUMPTARGET(\callee)
-        movsd     %xmm0, 16(%rsp)
-        movsd     8(%rsp), %xmm0
-        call      JUMPTARGET(\callee)
-        movsd     16(%rsp), %xmm1
-        movsd     %xmm0, 24(%rsp)
-        unpcklpd  %xmm0, %xmm1
-        movaps    %xmm1, %xmm0
-        addq      $40, %rsp
-        cfi_adjust_cfa_offset(-40)
-        ret
+	subq	$40, %rsp
+	cfi_adjust_cfa_offset (40)
+	movaps	%xmm0, (%rsp)
+	call	JUMPTARGET(\callee)
+	movsd	%xmm0, 16(%rsp)
+	movsd	8(%rsp), %xmm0
+	call	JUMPTARGET(\callee)
+	movsd	16(%rsp), %xmm1
+	movsd	%xmm0, 24(%rsp)
+	unpcklpd %xmm0, %xmm1
+	movaps	%xmm1, %xmm0
+	addq	$40, %rsp
+	cfi_adjust_cfa_offset (-40)
+	ret
 .endm
 
 /* 2 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_ff callee
-        subq      $56, %rsp
-        cfi_adjust_cfa_offset(56)
-        movaps    %xmm0, (%rsp)
-        movaps    %xmm1, 16(%rsp)
-        call      JUMPTARGET(\callee)
-        movsd     %xmm0, 32(%rsp)
-        movsd     8(%rsp), %xmm0
-        movsd     24(%rsp), %xmm1
-        call      JUMPTARGET(\callee)
-        movsd     32(%rsp), %xmm1
-        movsd     %xmm0, 40(%rsp)
-        unpcklpd  %xmm0, %xmm1
-        movaps    %xmm1, %xmm0
-        addq      $56, %rsp
-        cfi_adjust_cfa_offset(-56)
-        ret
+	subq	$56, %rsp
+	cfi_adjust_cfa_offset (56)
+	movaps	%xmm0, (%rsp)
+	movaps	%xmm1, 16(%rsp)
+	call	JUMPTARGET(\callee)
+	movsd	%xmm0, 32(%rsp)
+	movsd	8(%rsp), %xmm0
+	movsd	24(%rsp), %xmm1
+	call	JUMPTARGET(\callee)
+	movsd	32(%rsp), %xmm1
+	movsd	%xmm0, 40(%rsp)
+	unpcklpd %xmm0, %xmm1
+	movaps	%xmm1, %xmm0
+	addq	$56, %rsp
+	cfi_adjust_cfa_offset (-56)
+	ret
 .endm
 
 /* 3 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_fFF callee
-        pushq   %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        pushq   %rbx
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbx, 0)
-        movq    %rdi, %rbp
-        movq    %rsi, %rbx
-        subq    $40, %rsp
-        cfi_adjust_cfa_offset(40)
-        leaq    16(%rsp), %rsi
-        leaq    24(%rsp), %rdi
-        movaps  %xmm0, (%rsp)
-        call    JUMPTARGET(\callee)
-        leaq    16(%rsp), %rsi
-        leaq    24(%rsp), %rdi
-        movsd   24(%rsp), %xmm0
-        movapd  (%rsp), %xmm1
-        movsd   %xmm0, 0(%rbp)
-        unpckhpd        %xmm1, %xmm1
-        movsd   16(%rsp), %xmm0
-        movsd   %xmm0, (%rbx)
-        movapd  %xmm1, %xmm0
-        call    JUMPTARGET(\callee)
-        movsd   24(%rsp), %xmm0
-        movsd   %xmm0, 8(%rbp)
-        movsd   16(%rsp), %xmm0
-        movsd   %xmm0, 8(%rbx)
-        addq    $40, %rsp
-        cfi_adjust_cfa_offset(-40)
-        popq    %rbx
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbx)
-        popq    %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	pushq	%rbx
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbx, 0)
+	movq	%rdi, %rbp
+	movq	%rsi, %rbx
+	subq	$40, %rsp
+	cfi_adjust_cfa_offset (40)
+	leaq	16(%rsp), %rsi
+	leaq	24(%rsp), %rdi
+	movaps	%xmm0, (%rsp)
+	call	JUMPTARGET(\callee)
+	leaq	16(%rsp), %rsi
+	leaq	24(%rsp), %rdi
+	movsd	24(%rsp), %xmm0
+	movapd	(%rsp), %xmm1
+	movsd	%xmm0, 0(%rbp)
+	unpckhpd %xmm1, %xmm1
+	movsd	16(%rsp), %xmm0
+	movsd	%xmm0, (%rbx)
+	movapd	%xmm1, %xmm0
+	call	JUMPTARGET(\callee)
+	movsd	24(%rsp), %xmm0
+	movsd	%xmm0, 8(%rbp)
+	movsd	16(%rsp), %xmm0
+	movsd	%xmm0, 8(%rbx)
+	addq	$40, %rsp
+	cfi_adjust_cfa_offset (-40)
+	popq	%rbx
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbx)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX callee
-        pushq		%rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq		%rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq		$-32, %rsp
-        subq		$32, %rsp
-        vextractf128	$1, %ymm0, (%rsp)
-        vzeroupper
-        call		HIDDEN_JUMPTARGET(\callee)
-        vmovapd		%xmm0, 16(%rsp)
-        vmovaps		(%rsp), %xmm0
-        call		HIDDEN_JUMPTARGET(\callee)
-        vmovapd		%xmm0, %xmm1
-        vmovapd		16(%rsp), %xmm0
-        vinsertf128	$1, %xmm1, %ymm0, %ymm0
-        movq		%rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq		%rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$32, %rsp
+	vextractf128 $1, %ymm0, (%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovapd	%xmm0, 16(%rsp)
+	vmovaps	(%rsp), %xmm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovapd	%xmm0, %xmm1
+	vmovapd	16(%rsp), %xmm0
+	vinsertf128 $1, %xmm1, %ymm0, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 2 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX_ff callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-32, %rsp
-        subq      $64, %rsp
-        vextractf128 $1, %ymm0, 16(%rsp)
-        vextractf128 $1, %ymm1, (%rsp)
-        vzeroupper
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   %xmm0, 32(%rsp)
-        vmovaps   16(%rsp), %xmm0
-        vmovaps   (%rsp), %xmm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   %xmm0, %xmm1
-        vmovaps   32(%rsp), %xmm0
-        vinsertf128 $1, %xmm1, %ymm0, %ymm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$64, %rsp
+	vextractf128 $1, %ymm0, 16(%rsp)
+	vextractf128 $1, %ymm1, (%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, 32(%rsp)
+	vmovaps	16(%rsp), %xmm0
+	vmovaps	(%rsp), %xmm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, %xmm1
+	vmovaps	32(%rsp), %xmm0
+	vinsertf128 $1, %xmm1, %ymm0, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 3 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX_fFF callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-32, %rsp
-        pushq     %r13
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r13, 0)
-        pushq     %r14
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r14, 0)
-        subq      $48, %rsp
-        movq      %rsi, %r14
-        movq      %rdi, %r13
-        vextractf128 $1, %ymm0, 32(%rsp)
-        vzeroupper
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   32(%rsp), %xmm0
-        lea       (%rsp), %rdi
-        lea       16(%rsp), %rsi
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovapd   (%rsp), %xmm0
-        vmovapd   16(%rsp), %xmm1
-        vmovapd   %xmm0, 16(%r13)
-        vmovapd   %xmm1, 16(%r14)
-        addq      $48, %rsp
-        popq      %r14
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r14)
-        popq      %r13
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r13)
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	pushq	%r13
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r13, 0)
+	pushq	%r14
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r14, 0)
+	subq	$48, %rsp
+	movq	%rsi, %r14
+	movq	%rdi, %r13
+	vextractf128 $1, %ymm0, 32(%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	32(%rsp), %xmm0
+	lea	(%rsp), %rdi
+	lea	16(%rsp), %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovapd	(%rsp), %xmm0
+	vmovapd	16(%rsp), %xmm1
+	vmovapd	%xmm0, 16(%r13)
+	vmovapd	%xmm1, 16(%r14)
+	addq	$48, %rsp
+	popq	%r14
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r14)
+	popq	%r13
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r13)
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512 callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        subq      $128, %rsp
-        vmovups   %zmm0, (%rsp)
-        vmovupd   (%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 64(%rsp)
-        vmovupd   32(%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 96(%rsp)
-        vmovups   64(%rsp), %zmm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$128, %rsp
+	vmovups	%zmm0, (%rsp)
+	vmovupd	(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 64(%rsp)
+	vmovupd	32(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 96(%rsp)
+	vmovups	64(%rsp), %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 2 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512_ff callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        subq      $192, %rsp
-        vmovups   %zmm0, (%rsp)
-        vmovups   %zmm1, 64(%rsp)
-        vmovupd   (%rsp), %ymm0
-        vmovupd   64(%rsp), %ymm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 128(%rsp)
-        vmovupd   32(%rsp), %ymm0
-        vmovupd   96(%rsp), %ymm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 160(%rsp)
-        vmovups   128(%rsp), %zmm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$192, %rsp
+	vmovups	%zmm0, (%rsp)
+	vmovups	%zmm1, 64(%rsp)
+	vmovupd	(%rsp), %ymm0
+	vmovupd	64(%rsp), %ymm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 128(%rsp)
+	vmovupd	32(%rsp), %ymm0
+	vmovupd	96(%rsp), %ymm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 160(%rsp)
+	vmovups	128(%rsp), %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 3 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512_fFF callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        pushq     %r12
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r12, 0)
-        pushq     %r13
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r13, 0)
-        subq      $176, %rsp
-        movq      %rsi, %r13
-        vmovups   %zmm0, (%rsp)
-        movq    %rdi, %r12
-        vmovupd (%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   32(%rsp), %ymm0
-        lea       64(%rsp), %rdi
-        lea       96(%rsp), %rsi
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   64(%rsp), %ymm0
-        vmovupd   96(%rsp), %ymm1
-        vmovupd   %ymm0, 32(%r12)
-        vmovupd   %ymm1, 32(%r13)
-        vzeroupper
-        addq      $176, %rsp
-        popq      %r13
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r13)
-        popq      %r12
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r12)
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	pushq	%r12
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r12, 0)
+	pushq	%r13
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r13, 0)
+	subq	$176, %rsp
+	movq	%rsi, %r13
+	vmovups	%zmm0, (%rsp)
+	movq	%rdi, %r12
+	vmovupd	(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	32(%rsp), %ymm0
+	lea	64(%rsp), %rdi
+	lea	96(%rsp), %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	64(%rsp), %ymm0
+	vmovupd	96(%rsp), %ymm1
+	vmovupd	%ymm0, 32(%r12)
+	vmovupd	%ymm1, 32(%r13)
+	vzeroupper
+	addq	$176, %rsp
+	popq	%r13
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r13)
+	popq	%r12
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r12)
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
diff --git a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
index c23da7ec83..cecf6c8384 100644
--- a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
+++ b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
@@ -18,309 +18,309 @@
 
 /* SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2 callee
-        subq      $40, %rsp
-        cfi_adjust_cfa_offset(40)
-        movaps    %xmm0, (%rsp)
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 16(%rsp)
-        movss     4(%rsp), %xmm0
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 20(%rsp)
-        movss     8(%rsp), %xmm0
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 24(%rsp)
-        movss     12(%rsp), %xmm0
-        call      JUMPTARGET(\callee)
-        movss     16(%rsp), %xmm3
-        movss     20(%rsp), %xmm2
-        movss     24(%rsp), %xmm1
-        movss     %xmm0, 28(%rsp)
-        unpcklps  %xmm1, %xmm3
-        unpcklps  %xmm0, %xmm2
-        unpcklps  %xmm2, %xmm3
-        movaps    %xmm3, %xmm0
-        addq      $40, %rsp
-        cfi_adjust_cfa_offset(-40)
-        ret
+	subq	$40, %rsp
+	cfi_adjust_cfa_offset (40)
+	movaps	%xmm0, (%rsp)
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 16(%rsp)
+	movss	4(%rsp), %xmm0
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 20(%rsp)
+	movss	8(%rsp), %xmm0
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 24(%rsp)
+	movss	12(%rsp), %xmm0
+	call	JUMPTARGET(\callee)
+	movss	16(%rsp), %xmm3
+	movss	20(%rsp), %xmm2
+	movss	24(%rsp), %xmm1
+	movss	%xmm0, 28(%rsp)
+	unpcklps %xmm1, %xmm3
+	unpcklps %xmm0, %xmm2
+	unpcklps %xmm2, %xmm3
+	movaps	%xmm3, %xmm0
+	addq	$40, %rsp
+	cfi_adjust_cfa_offset (-40)
+	ret
 .endm
 
 /* 2 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_ff callee
-        subq      $56, %rsp
-        cfi_adjust_cfa_offset(56)
-        movaps    %xmm0, (%rsp)
-        movaps    %xmm1, 16(%rsp)
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 32(%rsp)
-        movss     4(%rsp), %xmm0
-        movss     20(%rsp), %xmm1
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 36(%rsp)
-        movss     8(%rsp), %xmm0
-        movss     24(%rsp), %xmm1
-        call      JUMPTARGET(\callee)
-        movss     %xmm0, 40(%rsp)
-        movss     12(%rsp), %xmm0
-        movss     28(%rsp), %xmm1
-        call      JUMPTARGET(\callee)
-        movss     32(%rsp), %xmm3
-        movss     36(%rsp), %xmm2
-        movss     40(%rsp), %xmm1
-        movss     %xmm0, 44(%rsp)
-        unpcklps  %xmm1, %xmm3
-        unpcklps  %xmm0, %xmm2
-        unpcklps  %xmm2, %xmm3
-        movaps    %xmm3, %xmm0
-        addq      $56, %rsp
-        cfi_adjust_cfa_offset(-56)
-        ret
+	subq	$56, %rsp
+	cfi_adjust_cfa_offset (56)
+	movaps	%xmm0, (%rsp)
+	movaps	%xmm1, 16(%rsp)
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 32(%rsp)
+	movss	4(%rsp), %xmm0
+	movss	20(%rsp), %xmm1
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 36(%rsp)
+	movss	8(%rsp), %xmm0
+	movss	24(%rsp), %xmm1
+	call	JUMPTARGET(\callee)
+	movss	%xmm0, 40(%rsp)
+	movss	12(%rsp), %xmm0
+	movss	28(%rsp), %xmm1
+	call	JUMPTARGET(\callee)
+	movss	32(%rsp), %xmm3
+	movss	36(%rsp), %xmm2
+	movss	40(%rsp), %xmm1
+	movss	%xmm0, 44(%rsp)
+	unpcklps %xmm1, %xmm3
+	unpcklps %xmm0, %xmm2
+	unpcklps %xmm2, %xmm3
+	movaps	%xmm3, %xmm0
+	addq	$56, %rsp
+	cfi_adjust_cfa_offset (-56)
+	ret
 .endm
 
 /* 3 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_fFF callee
-        pushq   %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        pushq   %rbx
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbx, 0)
-        movq    %rdi, %rbp
-        movq    %rsi, %rbx
-        subq    $40, %rsp
-        cfi_adjust_cfa_offset(40)
-        leaq    24(%rsp), %rsi
-        leaq    28(%rsp), %rdi
-        movaps  %xmm0, (%rsp)
-        call    JUMPTARGET(\callee)
-        leaq    24(%rsp), %rsi
-        leaq    28(%rsp), %rdi
-        movss   28(%rsp), %xmm0
-        movss   %xmm0, 0(%rbp)
-        movaps  (%rsp), %xmm1
-        movss   24(%rsp), %xmm0
-        movss   %xmm0, (%rbx)
-        movaps  %xmm1, %xmm0
-        shufps  $85, %xmm1, %xmm0
-        call    JUMPTARGET(\callee)
-        movss   28(%rsp), %xmm0
-        leaq    24(%rsp), %rsi
-        movss   %xmm0, 4(%rbp)
-        leaq    28(%rsp), %rdi
-        movaps  (%rsp), %xmm1
-        movss   24(%rsp), %xmm0
-        movss   %xmm0, 4(%rbx)
-        movaps  %xmm1, %xmm0
-        unpckhps        %xmm1, %xmm0
-        call    JUMPTARGET(\callee)
-        movaps  (%rsp), %xmm1
-        leaq    24(%rsp), %rsi
-        leaq    28(%rsp), %rdi
-        movss   28(%rsp), %xmm0
-        shufps  $255, %xmm1, %xmm1
-        movss   %xmm0, 8(%rbp)
-        movss   24(%rsp), %xmm0
-        movss   %xmm0, 8(%rbx)
-        movaps  %xmm1, %xmm0
-        call    JUMPTARGET(\callee)
-        movss   28(%rsp), %xmm0
-        movss   %xmm0, 12(%rbp)
-        movss   24(%rsp), %xmm0
-        movss   %xmm0, 12(%rbx)
-        addq    $40, %rsp
-        cfi_adjust_cfa_offset(-40)
-        popq    %rbx
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbx)
-        popq    %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	pushq	%rbx
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbx, 0)
+	movq	%rdi, %rbp
+	movq	%rsi, %rbx
+	subq	$40, %rsp
+	cfi_adjust_cfa_offset (40)
+	leaq	24(%rsp), %rsi
+	leaq	28(%rsp), %rdi
+	movaps	%xmm0, (%rsp)
+	call	JUMPTARGET(\callee)
+	leaq	24(%rsp), %rsi
+	leaq	28(%rsp), %rdi
+	movss	28(%rsp), %xmm0
+	movss	%xmm0, 0(%rbp)
+	movaps	(%rsp), %xmm1
+	movss	24(%rsp), %xmm0
+	movss	%xmm0, (%rbx)
+	movaps	%xmm1, %xmm0
+	shufps	$85, %xmm1, %xmm0
+	call	JUMPTARGET(\callee)
+	movss	28(%rsp), %xmm0
+	leaq	24(%rsp), %rsi
+	movss	%xmm0, 4(%rbp)
+	leaq	28(%rsp), %rdi
+	movaps	(%rsp), %xmm1
+	movss	24(%rsp), %xmm0
+	movss	%xmm0, 4(%rbx)
+	movaps	%xmm1, %xmm0
+	unpckhps %xmm1, %xmm0
+	call	JUMPTARGET(\callee)
+	movaps	(%rsp), %xmm1
+	leaq	24(%rsp), %rsi
+	leaq	28(%rsp), %rdi
+	movss	28(%rsp), %xmm0
+	shufps	$255, %xmm1, %xmm1
+	movss	%xmm0, 8(%rbp)
+	movss	24(%rsp), %xmm0
+	movss	%xmm0, 8(%rbx)
+	movaps	%xmm1, %xmm0
+	call	JUMPTARGET(\callee)
+	movss	28(%rsp), %xmm0
+	movss	%xmm0, 12(%rbp)
+	movss	24(%rsp), %xmm0
+	movss	%xmm0, 12(%rbx)
+	addq	$40, %rsp
+	cfi_adjust_cfa_offset (-40)
+	popq	%rbx
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbx)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX callee
-        pushq     	%rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      	%rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      	$-32, %rsp
-        subq      	$32, %rsp
-        vextractf128 	$1, %ymm0, (%rsp)
-        vzeroupper
-        call      	HIDDEN_JUMPTARGET(\callee)
-        vmovaps   	%xmm0, 16(%rsp)
-        vmovaps   	(%rsp), %xmm0
-        call      	HIDDEN_JUMPTARGET(\callee)
-        vmovaps   	%xmm0, %xmm1
-        vmovaps   	16(%rsp), %xmm0
-        vinsertf128 	$1, %xmm1, %ymm0, %ymm0
-        movq      	%rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      	%rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$32, %rsp
+	vextractf128 $1, %ymm0, (%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, 16(%rsp)
+	vmovaps	(%rsp), %xmm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, %xmm1
+	vmovaps	16(%rsp), %xmm0
+	vinsertf128 $1, %xmm1, %ymm0, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 2 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX_ff callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-32, %rsp
-        subq      $64, %rsp
-        vextractf128 $1, %ymm0, 16(%rsp)
-        vextractf128 $1, %ymm1, (%rsp)
-        vzeroupper
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   %xmm0, 32(%rsp)
-        vmovaps   16(%rsp), %xmm0
-        vmovaps   (%rsp), %xmm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   %xmm0, %xmm1
-        vmovaps   32(%rsp), %xmm0
-        vinsertf128 $1, %xmm1, %ymm0, %ymm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$64, %rsp
+	vextractf128 $1, %ymm0, 16(%rsp)
+	vextractf128 $1, %ymm1, (%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, 32(%rsp)
+	vmovaps	16(%rsp), %xmm0
+	vmovaps	(%rsp), %xmm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, %xmm1
+	vmovaps	32(%rsp), %xmm0
+	vinsertf128 $1, %xmm1, %ymm0, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 3 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
 .macro WRAPPER_IMPL_AVX_fFF callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-32, %rsp
-        pushq     %r13
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r13, 0)
-        pushq     %r14
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%r14, 0)
-        subq      $48, %rsp
-        movq      %rsi, %r14
-        vmovaps   %ymm0, (%rsp)
-        movq      %rdi, %r13
-        vmovaps   16(%rsp), %xmm1
-        vmovaps   %xmm1, 32(%rsp)
-        vzeroupper
-        vmovaps   (%rsp), %xmm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   32(%rsp), %xmm0
-        lea       (%rsp), %rdi
-        lea       16(%rsp), %rsi
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   (%rsp), %xmm0
-        vmovaps   16(%rsp), %xmm1
-        vmovaps   %xmm0, 16(%r13)
-        vmovaps   %xmm1, 16(%r14)
-        addq      $48, %rsp
-        popq      %r14
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r14)
-        popq      %r13
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%r13)
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	pushq	%r13
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r13, 0)
+	pushq	%r14
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%r14, 0)
+	subq	$48, %rsp
+	movq	%rsi, %r14
+	vmovaps	%ymm0, (%rsp)
+	movq	%rdi, %r13
+	vmovaps	16(%rsp), %xmm1
+	vmovaps	%xmm1, 32(%rsp)
+	vzeroupper
+	vmovaps	(%rsp), %xmm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	32(%rsp), %xmm0
+	lea	(%rsp), %rdi
+	lea	16(%rsp), %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	(%rsp), %xmm0
+	vmovaps	16(%rsp), %xmm1
+	vmovaps	%xmm0, 16(%r13)
+	vmovaps	%xmm1, 16(%r14)
+	addq	$48, %rsp
+	popq	%r14
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r14)
+	popq	%r13
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%r13)
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512 callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        subq      $128, %rsp
-        vmovups   %zmm0, (%rsp)
-        vmovupd   (%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 64(%rsp)
-        vmovupd   32(%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovupd   %ymm0, 96(%rsp)
-        vmovups   64(%rsp), %zmm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$128, %rsp
+	vmovups	%zmm0, (%rsp)
+	vmovupd	(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 64(%rsp)
+	vmovupd	32(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, 96(%rsp)
+	vmovups	64(%rsp), %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 2 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512_ff callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq      %rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        subq      $192, %rsp
-        vmovups   %zmm0, (%rsp)
-        vmovups   %zmm1, 64(%rsp)
-        vmovups   (%rsp), %ymm0
-        vmovups   64(%rsp), %ymm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovups   %ymm0, 128(%rsp)
-        vmovups   32(%rsp), %ymm0
-        vmovups   96(%rsp), %ymm1
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovups   %ymm0, 160(%rsp)
-        vmovups   128(%rsp), %zmm0
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq      %rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$192, %rsp
+	vmovups	%zmm0, (%rsp)
+	vmovups	%zmm1, 64(%rsp)
+	vmovups	(%rsp), %ymm0
+	vmovups	64(%rsp), %ymm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovups	%ymm0, 128(%rsp)
+	vmovups	32(%rsp), %ymm0
+	vmovups	96(%rsp), %ymm1
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovups	%ymm0, 160(%rsp)
+	vmovups	128(%rsp), %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
 
 /* 3 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
 .macro WRAPPER_IMPL_AVX512_fFF callee
-        pushq     %rbp
-        cfi_adjust_cfa_offset (8)
-        cfi_rel_offset (%rbp, 0)
-        movq	%rsp, %rbp
-        cfi_def_cfa_register (%rbp)
-        andq      $-64, %rsp
-        pushq     %r12
-        pushq     %r13
-        subq      $176, %rsp
-        movq      %rsi, %r13
-        vmovaps   %zmm0, (%rsp)
-        movq      %rdi, %r12
-        vmovaps   (%rsp), %ymm0
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   32(%rsp), %ymm0
-        lea       64(%rsp), %rdi
-        lea       96(%rsp), %rsi
-        call      HIDDEN_JUMPTARGET(\callee)
-        vmovaps   64(%rsp), %ymm0
-        vmovaps   96(%rsp), %ymm1
-        vmovaps   %ymm0, 32(%r12)
-        vmovaps   %ymm1, 32(%r13)
-        addq      $176, %rsp
-        popq      %r13
-        popq      %r12
-        movq      %rbp, %rsp
-        cfi_def_cfa_register (%rsp)
-        popq	%rbp
-        cfi_adjust_cfa_offset (-8)
-        cfi_restore (%rbp)
-        ret
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	pushq	%r12
+	pushq	%r13
+	subq	$176, %rsp
+	movq	%rsi, %r13
+	vmovaps	%zmm0, (%rsp)
+	movq	%rdi, %r12
+	vmovaps	(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	32(%rsp), %ymm0
+	lea	64(%rsp), %rdi
+	lea	96(%rsp), %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	64(%rsp), %ymm0
+	vmovaps	96(%rsp), %ymm1
+	vmovaps	%ymm0, 32(%r12)
+	vmovaps	%ymm1, 32(%r13)
+	addq	$176, %rsp
+	popq	%r13
+	popq	%r12
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
 .endm
-- 
2.34.1


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* [PATCH v1 3/3] x86/fpu: Cleanup code in svml_{s|d}_wrapper_impl.h.S
  2022-11-18  6:37 [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
  2022-11-18  6:37 ` [PATCH v1 2/3] x86/fpu: Reformat svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
@ 2022-11-18  6:37 ` Noah Goldstein via Libc-alpha
  2022-11-18 15:49 ` [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S H.J. Lu via Libc-alpha
  2 siblings, 0 replies; 4+ messages in thread
From: Noah Goldstein via Libc-alpha @ 2022-11-18  6:37 UTC (permalink / raw)
  To: libc-alpha; +Cc: goldstein.w.n, hjl.tools, carlos

1. Remove unnecessary spills.
2. Split the shared avx/avx512 wrappers to a new file.
3. Fix some small nit missed optimizations.

All math and mathvec tests pass on x86.
---
 sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S  | 253 ++--------------
 sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S  | 309 ++++----------------
 sysdeps/x86_64/fpu/svml_sd_wrapper_impl.h.S | 186 ++++++++++++
 3 files changed, 262 insertions(+), 486 deletions(-)
 create mode 100644 sysdeps/x86_64/fpu/svml_sd_wrapper_impl.h.S

diff --git a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
index b03a2122b9..e54c16ea6e 100644
--- a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
+++ b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
@@ -18,39 +18,38 @@
 
 /* SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2 callee
-	subq	$40, %rsp
-	cfi_adjust_cfa_offset (40)
+	subq	$24, %rsp
+	cfi_adjust_cfa_offset (24)
 	movaps	%xmm0, (%rsp)
 	call	JUMPTARGET(\callee)
-	movsd	%xmm0, 16(%rsp)
+	movsd	%xmm0, (%rsp)
 	movsd	8(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movsd	16(%rsp), %xmm1
-	movsd	%xmm0, 24(%rsp)
+	movsd	(%rsp), %xmm1
 	unpcklpd %xmm0, %xmm1
 	movaps	%xmm1, %xmm0
-	addq	$40, %rsp
-	cfi_adjust_cfa_offset (-40)
+	addq	$24, %rsp
+	cfi_adjust_cfa_offset (-24)
 	ret
 .endm
 
+
 /* 2 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_ff callee
-	subq	$56, %rsp
-	cfi_adjust_cfa_offset (56)
+	subq	$40, %rsp
+	cfi_adjust_cfa_offset (40)
 	movaps	%xmm0, (%rsp)
 	movaps	%xmm1, 16(%rsp)
 	call	JUMPTARGET(\callee)
-	movsd	%xmm0, 32(%rsp)
+	movsd	%xmm0, (%rsp)
 	movsd	8(%rsp), %xmm0
 	movsd	24(%rsp), %xmm1
 	call	JUMPTARGET(\callee)
-	movsd	32(%rsp), %xmm1
-	movsd	%xmm0, 40(%rsp)
+	movsd	(%rsp), %xmm1
 	unpcklpd %xmm0, %xmm1
 	movaps	%xmm1, %xmm0
-	addq	$56, %rsp
-	cfi_adjust_cfa_offset (-56)
+	addq	$40, %rsp
+	cfi_adjust_cfa_offset (-40)
 	ret
 .endm
 
@@ -62,229 +61,25 @@
 	pushq	%rbx
 	cfi_adjust_cfa_offset (8)
 	cfi_rel_offset (%rbx, 0)
+	subq	$24, %rsp
+	cfi_adjust_cfa_offset (24)
+	movaps	%xmm0, (%rsp)
 	movq	%rdi, %rbp
 	movq	%rsi, %rbx
-	subq	$40, %rsp
-	cfi_adjust_cfa_offset (40)
-	leaq	16(%rsp), %rsi
-	leaq	24(%rsp), %rdi
-	movaps	%xmm0, (%rsp)
 	call	JUMPTARGET(\callee)
-	leaq	16(%rsp), %rsi
-	leaq	24(%rsp), %rdi
-	movsd	24(%rsp), %xmm0
-	movapd	(%rsp), %xmm1
-	movsd	%xmm0, 0(%rbp)
-	unpckhpd %xmm1, %xmm1
-	movsd	16(%rsp), %xmm0
-	movsd	%xmm0, (%rbx)
-	movapd	%xmm1, %xmm0
+	movsd	8(%rsp), %xmm0
+	leaq	8(%rbp), %rdi
+	leaq	8(%rbx), %rsi
 	call	JUMPTARGET(\callee)
-	movsd	24(%rsp), %xmm0
-	movsd	%xmm0, 8(%rbp)
-	movsd	16(%rsp), %xmm0
-	movsd	%xmm0, 8(%rbx)
-	addq	$40, %rsp
-	cfi_adjust_cfa_offset (-40)
-	popq	%rbx
+	addq	$24, %rsp
+	cfi_adjust_cfa_offset (-24)
+	pop	%rbx
 	cfi_adjust_cfa_offset (-8)
 	cfi_restore (%rbx)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	subq	$32, %rsp
-	vextractf128 $1, %ymm0, (%rsp)
-	vzeroupper
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovapd	%xmm0, 16(%rsp)
-	vmovaps	(%rsp), %xmm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovapd	%xmm0, %xmm1
-	vmovapd	16(%rsp), %xmm0
-	vinsertf128 $1, %xmm1, %ymm0, %ymm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 2 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX_ff callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	subq	$64, %rsp
-	vextractf128 $1, %ymm0, 16(%rsp)
-	vextractf128 $1, %ymm1, (%rsp)
-	vzeroupper
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, 32(%rsp)
-	vmovaps	16(%rsp), %xmm0
-	vmovaps	(%rsp), %xmm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, %xmm1
-	vmovaps	32(%rsp), %xmm0
-	vinsertf128 $1, %xmm1, %ymm0, %ymm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
+	pop	%rbp
 	cfi_adjust_cfa_offset (-8)
 	cfi_restore (%rbp)
 	ret
 .endm
 
-/* 3 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX_fFF callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	pushq	%r13
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r13, 0)
-	pushq	%r14
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r14, 0)
-	subq	$48, %rsp
-	movq	%rsi, %r14
-	movq	%rdi, %r13
-	vextractf128 $1, %ymm0, 32(%rsp)
-	vzeroupper
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	32(%rsp), %xmm0
-	lea	(%rsp), %rdi
-	lea	16(%rsp), %rsi
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovapd	(%rsp), %xmm0
-	vmovapd	16(%rsp), %xmm1
-	vmovapd	%xmm0, 16(%r13)
-	vmovapd	%xmm1, 16(%r14)
-	addq	$48, %rsp
-	popq	%r14
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r14)
-	popq	%r13
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r13)
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512 callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	subq	$128, %rsp
-	vmovups	%zmm0, (%rsp)
-	vmovupd	(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 64(%rsp)
-	vmovupd	32(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 96(%rsp)
-	vmovups	64(%rsp), %zmm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 2 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512_ff callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	subq	$192, %rsp
-	vmovups	%zmm0, (%rsp)
-	vmovups	%zmm1, 64(%rsp)
-	vmovupd	(%rsp), %ymm0
-	vmovupd	64(%rsp), %ymm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 128(%rsp)
-	vmovupd	32(%rsp), %ymm0
-	vmovupd	96(%rsp), %ymm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 160(%rsp)
-	vmovups	128(%rsp), %zmm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 3 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512_fFF callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	pushq	%r12
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r12, 0)
-	pushq	%r13
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r13, 0)
-	subq	$176, %rsp
-	movq	%rsi, %r13
-	vmovups	%zmm0, (%rsp)
-	movq	%rdi, %r12
-	vmovupd	(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	32(%rsp), %ymm0
-	lea	64(%rsp), %rdi
-	lea	96(%rsp), %rsi
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	64(%rsp), %ymm0
-	vmovupd	96(%rsp), %ymm1
-	vmovupd	%ymm0, 32(%r12)
-	vmovupd	%ymm1, 32(%r13)
-	vzeroupper
-	addq	$176, %rsp
-	popq	%r13
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r13)
-	popq	%r12
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r12)
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
+#include "svml_sd_wrapper_impl.h.S"
diff --git a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
index cecf6c8384..958d1be243 100644
--- a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
+++ b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
@@ -18,61 +18,66 @@
 
 /* SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2 callee
-	subq	$40, %rsp
-	cfi_adjust_cfa_offset (40)
+	push	%rbx
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbx, 0)
+	subq	$16, %rsp
+	cfi_adjust_cfa_offset (16)
 	movaps	%xmm0, (%rsp)
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 16(%rsp)
+	movss	%xmm0, (%rsp)
 	movss	4(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 20(%rsp)
+	movss	%xmm0, 4(%rsp)
 	movss	8(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 24(%rsp)
+	movd	%xmm0, %ebx
 	movss	12(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	16(%rsp), %xmm3
-	movss	20(%rsp), %xmm2
-	movss	24(%rsp), %xmm1
-	movss	%xmm0, 28(%rsp)
-	unpcklps %xmm1, %xmm3
-	unpcklps %xmm0, %xmm2
-	unpcklps %xmm2, %xmm3
-	movaps	%xmm3, %xmm0
-	addq	$40, %rsp
-	cfi_adjust_cfa_offset (-40)
+	movd	%ebx, %xmm1
+	unpcklps %xmm0, %xmm1
+	movsd	(%rsp), %xmm0
+	unpcklpd %xmm1, %xmm0
+	addq	$16, %rsp
+	cfi_adjust_cfa_offset (-16)
+	pop	%rbx
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbx)
 	ret
 .endm
 
 /* 2 argument SSE2 ISA version as wrapper to scalar.  */
 .macro WRAPPER_IMPL_SSE2_ff callee
-	subq	$56, %rsp
-	cfi_adjust_cfa_offset (56)
+	push	%rbx
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbx, 0)
+	subq	$32, %rsp
+	cfi_adjust_cfa_offset (40)
 	movaps	%xmm0, (%rsp)
 	movaps	%xmm1, 16(%rsp)
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 32(%rsp)
-	movss	4(%rsp), %xmm0
 	movss	20(%rsp), %xmm1
+	movss	%xmm0, 0(%rsp)
+	movss	4(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 36(%rsp)
-	movss	8(%rsp), %xmm0
 	movss	24(%rsp), %xmm1
+	movss	%xmm0, 4(%rsp)
+	movss	8(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	%xmm0, 40(%rsp)
-	movss	12(%rsp), %xmm0
 	movss	28(%rsp), %xmm1
+	movd	%xmm0, %ebx
+	movss	12(%rsp), %xmm0
 	call	JUMPTARGET(\callee)
-	movss	32(%rsp), %xmm3
-	movss	36(%rsp), %xmm2
-	movss	40(%rsp), %xmm1
-	movss	%xmm0, 44(%rsp)
-	unpcklps %xmm1, %xmm3
-	unpcklps %xmm0, %xmm2
-	unpcklps %xmm2, %xmm3
-	movaps	%xmm3, %xmm0
-	addq	$56, %rsp
-	cfi_adjust_cfa_offset (-56)
+	/* merge 4x results into xmm0.  */
+	movd	%ebx, %xmm1
+	unpcklps %xmm0, %xmm1
+	movsd	(%rsp), %xmm0
+	unpcklpd %xmm1, %xmm0
+	addq	$32, %rsp
+	cfi_adjust_cfa_offset (-32)
+	pop	%rbx
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbx)
 	ret
 .endm
 
@@ -86,48 +91,24 @@
 	cfi_rel_offset (%rbx, 0)
 	movq	%rdi, %rbp
 	movq	%rsi, %rbx
-	subq	$40, %rsp
-	cfi_adjust_cfa_offset (40)
-	leaq	24(%rsp), %rsi
-	leaq	28(%rsp), %rdi
+	subq	$24, %rsp
+	cfi_adjust_cfa_offset (24)
 	movaps	%xmm0, (%rsp)
 	call	JUMPTARGET(\callee)
-	leaq	24(%rsp), %rsi
-	leaq	28(%rsp), %rdi
-	movss	28(%rsp), %xmm0
-	movss	%xmm0, 0(%rbp)
-	movaps	(%rsp), %xmm1
-	movss	24(%rsp), %xmm0
-	movss	%xmm0, (%rbx)
-	movaps	%xmm1, %xmm0
-	shufps	$85, %xmm1, %xmm0
+	movss	4(%rsp), %xmm0
+	leaq	4(%rbp), %rdi
+	leaq	4(%rbx), %rsi
 	call	JUMPTARGET(\callee)
-	movss	28(%rsp), %xmm0
-	leaq	24(%rsp), %rsi
-	movss	%xmm0, 4(%rbp)
-	leaq	28(%rsp), %rdi
-	movaps	(%rsp), %xmm1
-	movss	24(%rsp), %xmm0
-	movss	%xmm0, 4(%rbx)
-	movaps	%xmm1, %xmm0
-	unpckhps %xmm1, %xmm0
+	movss	8(%rsp), %xmm0
+	leaq	8(%rbp), %rdi
+	leaq	8(%rbx), %rsi
 	call	JUMPTARGET(\callee)
-	movaps	(%rsp), %xmm1
-	leaq	24(%rsp), %rsi
-	leaq	28(%rsp), %rdi
-	movss	28(%rsp), %xmm0
-	shufps	$255, %xmm1, %xmm1
-	movss	%xmm0, 8(%rbp)
-	movss	24(%rsp), %xmm0
-	movss	%xmm0, 8(%rbx)
-	movaps	%xmm1, %xmm0
+	movss	12(%rsp), %xmm0
+	leaq	12(%rbp), %rdi
+	leaq	12(%rbx), %rsi
 	call	JUMPTARGET(\callee)
-	movss	28(%rsp), %xmm0
-	movss	%xmm0, 12(%rbp)
-	movss	24(%rsp), %xmm0
-	movss	%xmm0, 12(%rbx)
-	addq	$40, %rsp
-	cfi_adjust_cfa_offset (-40)
+	addq	$24, %rsp
+	cfi_adjust_cfa_offset (-24)
 	popq	%rbx
 	cfi_adjust_cfa_offset (-8)
 	cfi_restore (%rbx)
@@ -137,190 +118,4 @@
 	ret
 .endm
 
-/* AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	subq	$32, %rsp
-	vextractf128 $1, %ymm0, (%rsp)
-	vzeroupper
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, 16(%rsp)
-	vmovaps	(%rsp), %xmm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, %xmm1
-	vmovaps	16(%rsp), %xmm0
-	vinsertf128 $1, %xmm1, %ymm0, %ymm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 2 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX_ff callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	subq	$64, %rsp
-	vextractf128 $1, %ymm0, 16(%rsp)
-	vextractf128 $1, %ymm1, (%rsp)
-	vzeroupper
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, 32(%rsp)
-	vmovaps	16(%rsp), %xmm0
-	vmovaps	(%rsp), %xmm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	%xmm0, %xmm1
-	vmovaps	32(%rsp), %xmm0
-	vinsertf128 $1, %xmm1, %ymm0, %ymm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 3 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
-.macro WRAPPER_IMPL_AVX_fFF callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-32, %rsp
-	pushq	%r13
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r13, 0)
-	pushq	%r14
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%r14, 0)
-	subq	$48, %rsp
-	movq	%rsi, %r14
-	vmovaps	%ymm0, (%rsp)
-	movq	%rdi, %r13
-	vmovaps	16(%rsp), %xmm1
-	vmovaps	%xmm1, 32(%rsp)
-	vzeroupper
-	vmovaps	(%rsp), %xmm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	32(%rsp), %xmm0
-	lea	(%rsp), %rdi
-	lea	16(%rsp), %rsi
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	(%rsp), %xmm0
-	vmovaps	16(%rsp), %xmm1
-	vmovaps	%xmm0, 16(%r13)
-	vmovaps	%xmm1, 16(%r14)
-	addq	$48, %rsp
-	popq	%r14
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r14)
-	popq	%r13
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%r13)
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512 callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	subq	$128, %rsp
-	vmovups	%zmm0, (%rsp)
-	vmovupd	(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 64(%rsp)
-	vmovupd	32(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovupd	%ymm0, 96(%rsp)
-	vmovups	64(%rsp), %zmm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 2 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512_ff callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	subq	$192, %rsp
-	vmovups	%zmm0, (%rsp)
-	vmovups	%zmm1, 64(%rsp)
-	vmovups	(%rsp), %ymm0
-	vmovups	64(%rsp), %ymm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovups	%ymm0, 128(%rsp)
-	vmovups	32(%rsp), %ymm0
-	vmovups	96(%rsp), %ymm1
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovups	%ymm0, 160(%rsp)
-	vmovups	128(%rsp), %zmm0
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
-
-/* 3 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
-.macro WRAPPER_IMPL_AVX512_fFF callee
-	pushq	%rbp
-	cfi_adjust_cfa_offset (8)
-	cfi_rel_offset (%rbp, 0)
-	movq	%rsp, %rbp
-	cfi_def_cfa_register (%rbp)
-	andq	$-64, %rsp
-	pushq	%r12
-	pushq	%r13
-	subq	$176, %rsp
-	movq	%rsi, %r13
-	vmovaps	%zmm0, (%rsp)
-	movq	%rdi, %r12
-	vmovaps	(%rsp), %ymm0
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	32(%rsp), %ymm0
-	lea	64(%rsp), %rdi
-	lea	96(%rsp), %rsi
-	call	HIDDEN_JUMPTARGET(\callee)
-	vmovaps	64(%rsp), %ymm0
-	vmovaps	96(%rsp), %ymm1
-	vmovaps	%ymm0, 32(%r12)
-	vmovaps	%ymm1, 32(%r13)
-	addq	$176, %rsp
-	popq	%r13
-	popq	%r12
-	movq	%rbp, %rsp
-	cfi_def_cfa_register (%rsp)
-	popq	%rbp
-	cfi_adjust_cfa_offset (-8)
-	cfi_restore (%rbp)
-	ret
-.endm
+#include "svml_sd_wrapper_impl.h.S"
diff --git a/sysdeps/x86_64/fpu/svml_sd_wrapper_impl.h.S b/sysdeps/x86_64/fpu/svml_sd_wrapper_impl.h.S
new file mode 100644
index 0000000000..ba5b2932af
--- /dev/null
+++ b/sysdeps/x86_64/fpu/svml_sd_wrapper_impl.h.S
@@ -0,0 +1,186 @@
+/* Common float/double wrapper implementations of vector math
+   functions.
+   Copyright (C) 2022 Free Software Foundation, Inc.
+   This file is part of the GNU C Library.
+
+   The GNU C Library is free software; you can redistribute it and/or
+   modify it under the terms of the GNU Lesser General Public
+   License as published by the Free Software Foundation; either
+   version 2.1 of the License, or (at your option) any later version.
+
+   The GNU C Library is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+   Lesser General Public License for more details.
+
+   You should have received a copy of the GNU Lesser General Public
+   License along with the GNU C Library; if not, see
+   <https://www.gnu.org/licenses/>.  */
+
+/* AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
+.macro WRAPPER_IMPL_AVX callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$32, %rsp
+	vmovaps	%ymm0, (%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	%xmm0, (%rsp)
+	vmovaps	16(%rsp), %xmm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	/* combine xmm0 (return of second call) with result of first
+	   call (saved on stack). Might be worth exploring logic that
+	   uses `vpblend` and reads in ymm1 using -16(rsp).  */
+	vmovaps	(%rsp), %xmm1
+	vinsertf128 $1, %xmm0, %ymm1, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
+
+/* 2 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
+.macro WRAPPER_IMPL_AVX_ff callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-32, %rsp
+	subq	$64, %rsp
+	vmovaps	%ymm0, (%rsp)
+	vmovaps	%ymm1, 32(%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	48(%rsp), %xmm1
+	vmovaps	%xmm0, (%rsp)
+	vmovaps	16(%rsp), %xmm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	/* combine xmm0 (return of second call) with result of first
+	   call (saved on stack). Might be worth exploring logic that
+	   uses `vpblend` and reads in ymm1 using -16(rsp).  */
+	vmovaps	(%rsp), %xmm1
+	vinsertf128 $1, %xmm0, %ymm1, %ymm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
+
+/* 3 argument AVX/AVX2 ISA version as wrapper to SSE ISA version.  */
+.macro WRAPPER_IMPL_AVX_fFF callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	andq	$-32, %rsp
+	subq	$48, %rsp
+	vmovaps	%ymm0, 16(%rsp)
+	movq	%rdi, (%rsp)
+	movq	%rsi, 8(%rsp)
+	vzeroupper
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	32(%rsp), %xmm0
+	movq	(%rsp), %rdi
+	movq	8(%rsp), %rsi
+	addq	$16, %rdi
+	addq	$16, %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
+
+/* AVX512 ISA version as wrapper to AVX2 ISA version.  */
+.macro WRAPPER_IMPL_AVX512 callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$64, %rsp
+	vmovups	%zmm0, (%rsp)
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovupd	%ymm0, (%rsp)
+	vmovupd	32(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	/* combine ymm0 (return of second call) with result of first
+	   call (saved on stack).  */
+	vmovaps	(%rsp), %ymm1
+	vinserti64x4 $0x1, %ymm0, %zmm1, %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
+
+/* 2 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
+.macro WRAPPER_IMPL_AVX512_ff callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	addq	$-128, %rsp
+	vmovups	%zmm0, (%rsp)
+	vmovups	%zmm1, 64(%rsp)
+	/* ymm0 and ymm1 are already set.  */
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovups	96(%rsp), %ymm1
+	vmovaps	%ymm0, (%rsp)
+	vmovups	32(%rsp), %ymm0
+	call	HIDDEN_JUMPTARGET(\callee)
+	/* combine ymm0 (return of second call) with result of first
+	   call (saved on stack).  */
+	vmovaps	(%rsp), %ymm1
+	vinserti64x4 $0x1, %ymm0, %zmm1, %zmm0
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
+
+/* 3 argument AVX512 ISA version as wrapper to AVX2 ISA version.  */
+.macro WRAPPER_IMPL_AVX512_fFF callee
+	pushq	%rbp
+	cfi_adjust_cfa_offset (8)
+	cfi_rel_offset (%rbp, 0)
+	movq	%rsp, %rbp
+	cfi_def_cfa_register (%rbp)
+	andq	$-64, %rsp
+	subq	$80, %rsp
+	vmovaps	%zmm0, 16(%rsp)
+	movq	%rdi, (%rsp)
+	movq	%rsi, 8(%rsp)
+	/* ymm0 is already set.  */
+	call	HIDDEN_JUMPTARGET(\callee)
+	vmovaps	48(%rsp), %ymm0
+	movq	(%rsp), %rdi
+	movq	8(%rsp), %rsi
+	addq	$32, %rdi
+	addq	$32, %rsi
+	call	HIDDEN_JUMPTARGET(\callee)
+	movq	%rbp, %rsp
+	cfi_def_cfa_register (%rsp)
+	popq	%rbp
+	cfi_adjust_cfa_offset (-8)
+	cfi_restore (%rbp)
+	ret
+.endm
-- 
2.34.1


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S
  2022-11-18  6:37 [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
  2022-11-18  6:37 ` [PATCH v1 2/3] x86/fpu: Reformat svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
  2022-11-18  6:37 ` [PATCH v1 3/3] x86/fpu: Cleanup code in svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
@ 2022-11-18 15:49 ` H.J. Lu via Libc-alpha
  2 siblings, 0 replies; 4+ messages in thread
From: H.J. Lu via Libc-alpha @ 2022-11-18 15:49 UTC (permalink / raw)
  To: Noah Goldstein; +Cc: libc-alpha, carlos

On Thu, Nov 17, 2022 at 10:37 PM Noah Goldstein <goldstein.w.n@gmail.com> wrote:
>
> The files are assembler so having the proper file extension is
> convenient.  This doesn't change libm.so or libmvec.so.
> ---
>  sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S          | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S          | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S          | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S          | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S          | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S       | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S        | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S        | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S        | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S        | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S     | 2 +-
>  sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S        | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acos2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acos4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acos8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acosh2_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acosh4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_acosh8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asin2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asin4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asin8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asinh2_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asinh4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_asinh8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan22_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan24_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan28_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atan8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atanh2_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atanh4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_atanh8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cbrt2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cbrt4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cbrt8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cos2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cos4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cos8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cosh2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cosh4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_cosh8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erf2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erf4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erf8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erfc2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erfc4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_erfc8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp102_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp104_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp108_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp22_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp24_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp28_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_exp8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_expm12_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_expm14_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_expm18_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_hypot2_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_hypot4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_hypot8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log102_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log104_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log104_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log108_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log1p2_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log1p4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log1p8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log22_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log24_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log24_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log28_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_log8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_pow2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_pow4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_pow8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sin2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sin4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sin8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sincos2_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sincos4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sincos8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sinh2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sinh4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_sinh8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tan2_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tan4_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tan8_core.S                           | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tanh2_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tanh4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_d_tanh8_core.S                          | 2 +-
>  .../fpu/{svml_d_wrapper_impl.h => svml_d_wrapper_impl.h.S}      | 0
>  sysdeps/x86_64/fpu/svml_s_acosf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acosf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acosf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acoshf16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acoshf4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acoshf8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinhf16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinhf4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinhf8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atan2f16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atan2f4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atan2f8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanhf16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanhf4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanhf8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cosf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cosf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cosf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_coshf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_coshf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_coshf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erfcf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erfcf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erfcf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erff16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erff4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erff8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp10f16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp10f4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp10f8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp2f16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp2f4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp2f8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expm1f16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expm1f4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expm1f8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_hypotf16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_hypotf4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_hypotf8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log10f16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log10f4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log10f8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log1pf16_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log1pf4_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log1pf8_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S                    | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log2f16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log2f4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log2f8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_logf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_logf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_logf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_powf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_powf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_powf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sincosf16_core.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sincosf4_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sincosf8_core.S                       | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S                   | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinhf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinhf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinhf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S                     | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanf16_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanf4_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanf8_core.S                          | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S                      | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanhf16_core.S                        | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanhf4_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanhf8_core.S                         | 2 +-
>  sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S                     | 2 +-
>  .../fpu/{svml_s_wrapper_impl.h => svml_s_wrapper_impl.h.S}      | 0
>  230 files changed, 228 insertions(+), 228 deletions(-)
>  rename sysdeps/x86_64/fpu/{svml_d_wrapper_impl.h => svml_d_wrapper_impl.h.S} (100%)
>  rename sysdeps/x86_64/fpu/{svml_s_wrapper_impl.h => svml_s_wrapper_impl.h.S} (100%)
>
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
> index 86543d50d3..b2675d9647 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_cos8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_trig_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_cos_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
> index bccc56858d..98f2bc2e41 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_exp8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_exp_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_exp_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
> index 230ebe8ceb..20d4d967e0 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_log8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_log_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_log_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
> index 963fbe3662..a4c6c27147 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_pow8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_pow_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>  /* ALGORITHM DESCRIPTION:
>
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
> index 54bd8679d1..f3de17b413 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_sin8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_trig_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_sin_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
> index f6297e0024..a05a900ede 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_d_sincos8_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_d_trig_data.h"
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>  /*
>     ALGORITHM DESCRIPTION:
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
> index b0612c9357..5929fc0203 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_trig_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_cosf_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
> index 599b67f83b..71ed573a6e 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_expf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_expf_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_expf_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
> index 0070b0c419..b0062a1eb5 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_logf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_logf_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_logf_knl)
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
> index 306895838b..c05960bc31 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_powf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_powf_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>  /*
>     ALGORITHM DESCRIPTION:
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
> index 9a2aebb023..9ab6af69be 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_sincosf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_trig_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>  /*
>     ALGORITHM DESCRIPTION:
> diff --git a/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S b/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
> index 33b210fac0..0c037fd663 100644
> --- a/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
> +++ b/sysdeps/x86_64/fpu/multiarch/svml_s_sinf16_core_avx512.S
> @@ -18,7 +18,7 @@
>
>  #include <sysdep.h>
>  #include "svml_s_trig_data.h"
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY(_ZGVeN16v_sinf_knl)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acos2_core.S b/sysdeps/x86_64/fpu/svml_d_acos2_core.S
> index 8c573fcff1..1022d7e4e7 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acos2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acos2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_acos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acos4_core.S b/sysdeps/x86_64/fpu/svml_d_acos4_core.S
> index ef2253f3dd..b4a8990f5a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acos4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acos4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_acos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
> index cc5377850c..48e610f4ef 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acos4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_acos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acos8_core.S b/sysdeps/x86_64/fpu/svml_d_acos8_core.S
> index 3bf8d8dcdf..8f8017a72d 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acos8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acos8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_acos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acosh2_core.S b/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
> index ab92795d13..23d22de0fe 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acosh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_acosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acosh4_core.S b/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
> index 91ff3018c1..5bef8065f7 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acosh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_acosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
> index 84cbe5871c..95dd9ed670 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acosh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_acosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_acosh8_core.S b/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
> index 0b4521ce64..6277eebc1e 100644
> --- a/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_acosh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_acosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asin2_core.S b/sysdeps/x86_64/fpu/svml_d_asin2_core.S
> index 911625030f..13c186ac08 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asin2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asin2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_asin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asin4_core.S b/sysdeps/x86_64/fpu/svml_d_asin4_core.S
> index 222a0b8f73..50ea5a4207 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asin4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asin4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_asin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
> index 04ff322ea6..6de73c97cf 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asin4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_asin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asin8_core.S b/sysdeps/x86_64/fpu/svml_d_asin8_core.S
> index bea94b954c..aaf93dc8e9 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asin8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asin8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_asin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asinh2_core.S b/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
> index a621fae3fb..00723458e4 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asinh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_asinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asinh4_core.S b/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
> index a0332eca57..f2e13c3a0a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asinh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_asinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
> index bc365d7dd7..51324b697f 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asinh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_asinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_asinh8_core.S b/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
> index 2271466d99..3c32741b39 100644
> --- a/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_asinh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_asinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan22_core.S b/sysdeps/x86_64/fpu/svml_d_atan22_core.S
> index 7f40f1be6f..6b3cef4269 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan22_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan22_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2vv_atan2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan24_core.S b/sysdeps/x86_64/fpu/svml_d_atan24_core.S
> index afb8296d80..7f76bf166c 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan24_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan24_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4vv_atan2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
> index 81db9cd297..7e8fead6e0 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4vv_atan2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan28_core.S b/sysdeps/x86_64/fpu/svml_d_atan28_core.S
> index 9c25de34b6..2c710eadac 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan28_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan28_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8vv_atan2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan2_core.S b/sysdeps/x86_64/fpu/svml_d_atan2_core.S
> index bd4020ab69..7ea434fec5 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_atan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan4_core.S b/sysdeps/x86_64/fpu/svml_d_atan4_core.S
> index 89737790f4..06e3e2a870 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_atan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
> index 8abe407abe..73a8ab8ebb 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_atan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atan8_core.S b/sysdeps/x86_64/fpu/svml_d_atan8_core.S
> index cea8caf797..dfb6f788bc 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atan8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atan8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_atan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atanh2_core.S b/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
> index 8c300fbc09..f112f43793 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atanh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_atanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atanh4_core.S b/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
> index df63d2d655..ba74bcfeb5 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atanh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_atanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
> index 0bb51b2d60..9d1fa7f8c7 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atanh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_atanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_atanh8_core.S b/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
> index 956230e978..24011dad24 100644
> --- a/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_atanh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_atanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
> index 5ca1129c7f..3a3152000b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cbrt2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_cbrt)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
> index c3948ec480..31e0bc46db 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cbrt4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_cbrt)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
> index b64825e876..8b4122c7ad 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cbrt4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_cbrt)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S b/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
> index 312f015706..0aac584981 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cbrt8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_cbrt)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cos2_core.S b/sysdeps/x86_64/fpu/svml_d_cos2_core.S
> index 02a6583776..d2ea460b50 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cos2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cos2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_cos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cos4_core.S b/sysdeps/x86_64/fpu/svml_d_cos4_core.S
> index f066ceaf4c..1f09e366c2 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cos4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cos4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_cos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
> index 5212078c9c..e96205ec00 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cos4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_cos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cos8_core.S b/sysdeps/x86_64/fpu/svml_d_cos8_core.S
> index 2ac8e4687a..ef3c7a0a71 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cos8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cos8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_cos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cosh2_core.S b/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
> index 6aa76d40ed..55b4ae267b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cosh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_cosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cosh4_core.S b/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
> index d3a7bec3f4..9a99bd3fab 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cosh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_cosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
> index 73612e33bc..8a514c8790 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cosh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_cosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_cosh8_core.S b/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
> index e89d2c23fb..35ab7f6ee9 100644
> --- a/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_cosh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_cosh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erf2_core.S b/sysdeps/x86_64/fpu/svml_d_erf2_core.S
> index 828dd59cb3..0622c3caac 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erf2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erf2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_erf)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erf4_core.S b/sysdeps/x86_64/fpu/svml_d_erf4_core.S
> index 9432824d8a..6f95c5feef 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_erf)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
> index e3f73f9044..4815f8468a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erf4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_erf)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erf8_core.S b/sysdeps/x86_64/fpu/svml_d_erf8_core.S
> index fde688f5dd..079f423284 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_erf)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erfc2_core.S b/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
> index 530d358b8b..cb239f82e8 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erfc2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_erfc)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erfc4_core.S b/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
> index c6b3e2c367..2b42b3af83 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erfc4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_erfc)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
> index e81ca48897..9e16bf2ec5 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erfc4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_erfc)
> diff --git a/sysdeps/x86_64/fpu/svml_d_erfc8_core.S b/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
> index b3d31fec27..f781a7af9a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_erfc8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_erfc)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp102_core.S b/sysdeps/x86_64/fpu/svml_d_exp102_core.S
> index 8e6131d717..d6d7da6fec 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp102_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp102_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_exp10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp104_core.S b/sysdeps/x86_64/fpu/svml_d_exp104_core.S
> index e226bb88bb..e0c651f904 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp104_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp104_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_exp10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
> index 8758d35449..ca3931eb8b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp104_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_exp10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp108_core.S b/sysdeps/x86_64/fpu/svml_d_exp108_core.S
> index 745d3c7ec7..ad2ba994c0 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp108_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp108_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_exp10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp22_core.S b/sysdeps/x86_64/fpu/svml_d_exp22_core.S
> index 3639892544..4205e5510d 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp22_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp22_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_exp2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp24_core.S b/sysdeps/x86_64/fpu/svml_d_exp24_core.S
> index aea3afc3e3..4be1ee4e48 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp24_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp24_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_exp2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
> index 68f28b8f63..b09da67af1 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp24_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_exp2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp28_core.S b/sysdeps/x86_64/fpu/svml_d_exp28_core.S
> index dc0b3e77d0..0e8e027713 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp28_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp28_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_exp2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp2_core.S b/sysdeps/x86_64/fpu/svml_d_exp2_core.S
> index 14e41b9707..15109ef829 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_exp)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp4_core.S b/sysdeps/x86_64/fpu/svml_d_exp4_core.S
> index cf91c08ab4..de35ab001f 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_exp)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
> index b1ed2bc0ae..452558041c 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_exp)
> diff --git a/sysdeps/x86_64/fpu/svml_d_exp8_core.S b/sysdeps/x86_64/fpu/svml_d_exp8_core.S
> index ef7d62c1ec..88e663fa9b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_exp8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_exp8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_exp)
> diff --git a/sysdeps/x86_64/fpu/svml_d_expm12_core.S b/sysdeps/x86_64/fpu/svml_d_expm12_core.S
> index b5bfd598a8..315d12e2e9 100644
> --- a/sysdeps/x86_64/fpu/svml_d_expm12_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_expm12_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_expm1)
> diff --git a/sysdeps/x86_64/fpu/svml_d_expm14_core.S b/sysdeps/x86_64/fpu/svml_d_expm14_core.S
> index 1986f086ff..902a85bd4a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_expm14_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_expm14_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_expm1)
> diff --git a/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S b/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
> index 333a7011ae..f8f7d15759 100644
> --- a/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_expm14_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_expm1)
> diff --git a/sysdeps/x86_64/fpu/svml_d_expm18_core.S b/sysdeps/x86_64/fpu/svml_d_expm18_core.S
> index d008c4273d..513688ebf5 100644
> --- a/sysdeps/x86_64/fpu/svml_d_expm18_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_expm18_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_expm1)
> diff --git a/sysdeps/x86_64/fpu/svml_d_hypot2_core.S b/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
> index 6532e46a8b..c746560128 100644
> --- a/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_hypot2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2vv_hypot)
> diff --git a/sysdeps/x86_64/fpu/svml_d_hypot4_core.S b/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
> index 1383ac5304..aa1e2b330b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_hypot4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4vv_hypot)
> diff --git a/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
> index 6b1b165a2c..7028c55eab 100644
> --- a/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4vv_hypot)
> diff --git a/sysdeps/x86_64/fpu/svml_d_hypot8_core.S b/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
> index ec73ba6184..10c831fd08 100644
> --- a/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_hypot8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8vv_hypot)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log102_core.S b/sysdeps/x86_64/fpu/svml_d_log102_core.S
> index 9e0aa00f7e..a63e82bdbc 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log102_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log102_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_log10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log104_core.S b/sysdeps/x86_64/fpu/svml_d_log104_core.S
> index cc5e285dee..051ad8284a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log104_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log104_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_log10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
> index ea1cb93a9d..f08b78e35e 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log104_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_log10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log108_core.S b/sysdeps/x86_64/fpu/svml_d_log108_core.S
> index 59d5835cc7..8b5a59cf00 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log108_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log108_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_log10)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log1p2_core.S b/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
> index b40e238e11..058e6d47da 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log1p2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_log1p)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log1p4_core.S b/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
> index 2c76849ca8..7c27ab9a5c 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log1p4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_log1p)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
> index 42ace645e7..8b0b20f018 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log1p4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_log1p)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log1p8_core.S b/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
> index cca39f1847..1c6be998d7 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log1p8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_log1p)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log22_core.S b/sysdeps/x86_64/fpu/svml_d_log22_core.S
> index 8c8464b2f8..5e998a38e1 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log22_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log22_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_log2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log24_core.S b/sysdeps/x86_64/fpu/svml_d_log24_core.S
> index b75936f785..b2c3c4a727 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log24_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log24_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_log2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
> index f4a946033f..eaeee60be5 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log24_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_log2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log28_core.S b/sysdeps/x86_64/fpu/svml_d_log28_core.S
> index e547d8431b..e81a0d27d7 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log28_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log28_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_log2)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log2_core.S b/sysdeps/x86_64/fpu/svml_d_log2_core.S
> index 1e9790d78c..e38d58d465 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_log)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log4_core.S b/sysdeps/x86_64/fpu/svml_d_log4_core.S
> index cac601bb62..ab2801d343 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_log)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
> index f769844917..ace1ac535a 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_log)
> diff --git a/sysdeps/x86_64/fpu/svml_d_log8_core.S b/sysdeps/x86_64/fpu/svml_d_log8_core.S
> index 56284de286..37c0bcce11 100644
> --- a/sysdeps/x86_64/fpu/svml_d_log8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_log8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_log)
> diff --git a/sysdeps/x86_64/fpu/svml_d_pow2_core.S b/sysdeps/x86_64/fpu/svml_d_pow2_core.S
> index f8cf580f76..7142d6ece4 100644
> --- a/sysdeps/x86_64/fpu/svml_d_pow2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_pow2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2vv_pow)
> diff --git a/sysdeps/x86_64/fpu/svml_d_pow4_core.S b/sysdeps/x86_64/fpu/svml_d_pow4_core.S
> index 4cf403230d..572ba50519 100644
> --- a/sysdeps/x86_64/fpu/svml_d_pow4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_pow4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4vv_pow)
> diff --git a/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
> index f72e9532f3..e8f7c56d82 100644
> --- a/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_pow4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4vv_pow)
> diff --git a/sysdeps/x86_64/fpu/svml_d_pow8_core.S b/sysdeps/x86_64/fpu/svml_d_pow8_core.S
> index 0b99cf6570..3f7d63cad0 100644
> --- a/sysdeps/x86_64/fpu/svml_d_pow8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_pow8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8vv_pow)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sin2_core.S b/sysdeps/x86_64/fpu/svml_d_sin2_core.S
> index 40f23bf473..c101822478 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sin2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sin2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_sin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sin4_core.S b/sysdeps/x86_64/fpu/svml_d_sin4_core.S
> index f7c6e5de88..53464f52e0 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sin4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sin4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_sin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
> index 694ac1977a..d291d020dc 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sin4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_sin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sin8_core.S b/sysdeps/x86_64/fpu/svml_d_sin8_core.S
> index c3c978a843..03dbf25620 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sin8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sin8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_sin)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sincos2_core.S b/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
> index 617e206870..3e9f604ead 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sincos2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2vl8l8_sincos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sincos4_core.S b/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
> index 53d491102e..b20361129b 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sincos4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4vl8l8_sincos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
> index ef798926f3..242c812c7c 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sincos4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4vl8l8_sincos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sincos8_core.S b/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
> index d9ae0bedd0..6a39b7dbb4 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sincos8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8vl8l8_sincos)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sinh2_core.S b/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
> index 205ad3b582..ab96636d0e 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sinh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_sinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sinh4_core.S b/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
> index 8f8bc794d6..9017582a06 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sinh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_sinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
> index 771fb2cd78..71aeb8d84d 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sinh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_sinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_sinh8_core.S b/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
> index 0279264b30..1dd49a9dbb 100644
> --- a/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_sinh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_sinh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tan2_core.S b/sysdeps/x86_64/fpu/svml_d_tan2_core.S
> index 6fdc334a40..f127f7819c 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tan2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tan2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_tan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tan4_core.S b/sysdeps/x86_64/fpu/svml_d_tan4_core.S
> index 50e5331768..c1325c0984 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tan4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tan4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_tan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
> index e4b1f1ed93..0088977a95 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tan4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_tan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tan8_core.S b/sysdeps/x86_64/fpu/svml_d_tan8_core.S
> index fd26488470..b92659d894 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tan8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tan8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_tan)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tanh2_core.S b/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
> index 1970e4622c..867adfaeef 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tanh2_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN2v_tanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tanh4_core.S b/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
> index 1205efdec8..720539fa07 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tanh4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN4v_tanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S b/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
> index c42f63c865..270a76e495 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tanh4_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVcN4v_tanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_tanh8_core.S b/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
> index fe7c82370d..0ec89c2636 100644
> --- a/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_d_tanh8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_d_wrapper_impl.h"
> +#include "svml_d_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN8v_tanh)
> diff --git a/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h b/sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
> similarity index 100%
> rename from sysdeps/x86_64/fpu/svml_d_wrapper_impl.h
> rename to sysdeps/x86_64/fpu/svml_d_wrapper_impl.h.S
> diff --git a/sysdeps/x86_64/fpu/svml_s_acosf16_core.S b/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
> index 8dbe96d505..054713e328 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acosf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_acosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acosf4_core.S b/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
> index aeb82221f8..4160c34fb4 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acosf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_acosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acosf8_core.S b/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
> index 189f81005e..8e2efc1146 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acosf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_acosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
> index fb432639ab..f0ee911cbe 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acosf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_acosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
> index e5ece41ec6..391f1af733 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acoshf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_acoshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
> index d728e9f54f..ecb8705328 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acoshf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_acoshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S b/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
> index 1f8352077e..cbac2627b7 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acoshf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_acoshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
> index b9fc52c289..ead2bf3f2b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_acoshf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_acoshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinf16_core.S b/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
> index 54e947edd1..717d340b24 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_asinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinf4_core.S b/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
> index 10bddc352b..6b94efd9d3 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_asinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinf8_core.S b/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
> index 77f4d83a2c..4e490cbd5f 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_asinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
> index 6202b4d4cf..cde5528197 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_asinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
> index 16de57456e..ffb49184f7 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinhf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_asinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
> index 07dacc599f..ad4acbbd66 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinhf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_asinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S b/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
> index b874f4256b..41a1527162 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinhf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_asinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
> index b00a52e005..44607cd889 100644
> --- a/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_asinhf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_asinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
> index 345792d376..612b5af828 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16vv_atan2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
> index 536b739f04..a3fb092911 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4vv_atan2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S b/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
> index 67376ca528..2f694b6486 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8vv_atan2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
> index 1c670d627f..06c856191b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8vv_atan2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanf16_core.S b/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
> index b1471451a0..59a0267aa0 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_atanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanf4_core.S b/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
> index ea118a7984..2567206463 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_atanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanf8_core.S b/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
> index 8fad748544..b7961c7dbc 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_atanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
> index 1216b8a057..bedb807e1f 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_atanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
> index 96dce010ec..240d943d6c 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanhf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_atanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
> index 1c1ec2bd0f..af4f58944f 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanhf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_atanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S b/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
> index d92825330e..d23d62a18b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanhf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_atanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
> index 54d0b77b08..e5ec0c8eac 100644
> --- a/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_atanhf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_atanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
> index efe9149807..726631e734 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cbrtf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_cbrtf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
> index 687c549d7f..44e4f9f678 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cbrtf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_cbrtf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
> index 2bb6fe8e78..e202f42255 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_cbrtf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
> index 592f2cd419..b5f0f8ebdb 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cbrtf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_cbrtf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cosf16_core.S b/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
> index 8daeffc289..8775f8da9c 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cosf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_cosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cosf4_core.S b/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
> index c6c3a4a622..5d9806898b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cosf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_cosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cosf8_core.S b/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
> index a4033da6de..5730e4265d 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cosf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_cosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
> index 12c6fddfa8..ab49929b56 100644
> --- a/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_cosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_coshf16_core.S b/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
> index 97667390aa..8668f5bc86 100644
> --- a/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_coshf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_coshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_coshf4_core.S b/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
> index 8803c5e70b..82b893cd76 100644
> --- a/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_coshf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_coshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_coshf8_core.S b/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
> index 85b144938e..1527b69e28 100644
> --- a/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_coshf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_coshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
> index 8e4e1c19bb..f4ccf65eb9 100644
> --- a/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_coshf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_coshf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
> index 0a14a1a4da..ca5a3cae8a 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erfcf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_erfcf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
> index 0a7e3b79ee..ee98c24e3e 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erfcf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_erfcf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S b/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
> index 8122963d73..ef1f77458b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erfcf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_erfcf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
> index 4f347c4d3c..803c9533c2 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erfcf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_erfcf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erff16_core.S b/sysdeps/x86_64/fpu/svml_s_erff16_core.S
> index d544a6742d..95be88c16d 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erff16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erff16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_erff)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erff4_core.S b/sysdeps/x86_64/fpu/svml_s_erff4_core.S
> index 4a83c8b6cc..3f9aace76b 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erff4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erff4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_erff)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erff8_core.S b/sysdeps/x86_64/fpu/svml_s_erff8_core.S
> index d91b5ad0c2..eb018cde99 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erff8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erff8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_erff)
> diff --git a/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
> index b2a8e7a152..87ded07bf1 100644
> --- a/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_erff8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_erff)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
> index 6e59aa2d45..64bfc69929 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp10f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_exp10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
> index 3ebc7ed06d..652231cacf 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp10f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_exp10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S b/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
> index 338caf3fc2..61a9b908ca 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp10f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_exp10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
> index e0b6202662..f436570d36 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp10f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_exp10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
> index 2a73400a0b..0d3d6c0c9d 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp2f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_exp2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
> index ef6d68974c..05975db34f 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp2f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_exp2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S b/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
> index f7efe3736c..0dbeb46254 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp2f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_exp2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
> index b394654549..77e67248a7 100644
> --- a/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_exp2f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_exp2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expf16_core.S b/sysdeps/x86_64/fpu/svml_s_expf16_core.S
> index cb3fc1f333..b99898deb3 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_expf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expf4_core.S b/sysdeps/x86_64/fpu/svml_s_expf4_core.S
> index 587e90bf33..2aaefa163e 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expf4_core.S
> @@ -18,7 +18,7 @@
>
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_expf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expf8_core.S b/sysdeps/x86_64/fpu/svml_s_expf8_core.S
> index 01ca10da37..6f14117d74 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_expf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
> index a3ede96599..6e2cd7b8af 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8v_expf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
> index 082cf1e6e5..7f3d9826ae 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expm1f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_expm1f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
> index c1b1860728..13f5ccdf3a 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expm1f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_expm1f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S b/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
> index a39538dfd7..3f69069585 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expm1f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_expm1f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
> index eaaca17ce7..dc6a7a4973 100644
> --- a/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_expm1f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_expm1f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
> index 34ca783049..672f05269c 100644
> --- a/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16vv_hypotf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
> index e1d8ad19a6..dd09fad865 100644
> --- a/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4vv_hypotf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S b/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
> index f5aa9f1e10..6d87708238 100644
> --- a/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8vv_hypotf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
> index 707e91d7d3..7edb0d3cf2 100644
> --- a/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8vv_hypotf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log10f16_core.S b/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
> index 8544bfb78c..e9df420be5 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log10f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_log10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log10f4_core.S b/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
> index 82188bedf4..7ac8512459 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log10f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_log10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log10f8_core.S b/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
> index 03eea3415c..9f6006d2d1 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log10f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_log10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
> index 9943a86eb5..b811d10876 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log10f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_log10f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
> index 9e70217b1e..20c838b4de 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log1pf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_log1pf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
> index 4de7f84564..2e179b91e6 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log1pf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_log1pf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S b/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
> index edf35f29ca..9a24ad132f 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log1pf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_log1pf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
> index 1acd26070f..3055ce8d96 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log1pf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_log1pf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log2f16_core.S b/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
> index 1a21fe6996..9305f1fc2d 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log2f16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_log2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log2f4_core.S b/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
> index fd28c9e43b..4bcc88b349 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log2f4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_log2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log2f8_core.S b/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
> index 8f6254f7d2..db51a650be 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log2f8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_log2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
> index 6a1832faa4..743fcd3d91 100644
> --- a/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_log2f8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_log2f)
> diff --git a/sysdeps/x86_64/fpu/svml_s_logf16_core.S b/sysdeps/x86_64/fpu/svml_s_logf16_core.S
> index d9df2536d2..2de06af230 100644
> --- a/sysdeps/x86_64/fpu/svml_s_logf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_logf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_logf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_logf4_core.S b/sysdeps/x86_64/fpu/svml_s_logf4_core.S
> index 8300d4464e..6edb8f6cec 100644
> --- a/sysdeps/x86_64/fpu/svml_s_logf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_logf4_core.S
> @@ -18,7 +18,7 @@
>
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_logf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_logf8_core.S b/sysdeps/x86_64/fpu/svml_s_logf8_core.S
> index 2d779866f4..df57322134 100644
> --- a/sysdeps/x86_64/fpu/svml_s_logf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_logf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_logf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
> index 8e77a9ea2d..a78dca2ce5 100644
> --- a/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_logf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8v_logf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_powf16_core.S b/sysdeps/x86_64/fpu/svml_s_powf16_core.S
> index 117d907007..69fef6a25d 100644
> --- a/sysdeps/x86_64/fpu/svml_s_powf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_powf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16vv_powf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_powf4_core.S b/sysdeps/x86_64/fpu/svml_s_powf4_core.S
> index d328436a0a..f28a9ed9d5 100644
> --- a/sysdeps/x86_64/fpu/svml_s_powf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_powf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4vv_powf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_powf8_core.S b/sysdeps/x86_64/fpu/svml_s_powf8_core.S
> index 881c841eb5..ace548b93e 100644
> --- a/sysdeps/x86_64/fpu/svml_s_powf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_powf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8vv_powf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
> index 7d71c56106..d269163620 100644
> --- a/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_powf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8vv_powf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
> index c46f34cc33..6b13904f32 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sincosf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16vl4l4_sincosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
> index a2c38364fb..5e8e68cb46 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sincosf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4vl4l4_sincosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S b/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
> index c6e9d544a2..62d4ed96a0 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sincosf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8vl4l4_sincosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
> index fe5fbd929d..39d3b6135c 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sincosf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8vl4l4_sincosf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinf16_core.S b/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
> index d456e801b6..32a89294f6 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_sinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinf4_core.S b/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
> index 5b89563158..ad7e12f8da 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinf4_core.S
> @@ -18,7 +18,7 @@
>
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_sinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinf8_core.S b/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
> index f1bf44fa6f..bb80a43360 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_sinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
> index 14de999a89..a296e1a233 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY(_ZGVcN8v_sinf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
> index da1269c156..666270e804 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinhf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_sinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
> index c233bf1ac6..50b36b81a6 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinhf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_sinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S b/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
> index 9e301d8896..1d2913eb88 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinhf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_sinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
> index f21eb1e26c..555ca75a04 100644
> --- a/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_sinhf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_sinhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanf16_core.S b/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
> index f8ce7880fb..1e66c2c186 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_tanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanf4_core.S b/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
> index 3b6c0603c8..868a6ab327 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_tanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanf8_core.S b/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
> index 084f2b4c95..92a2e583c0 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_tanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
> index 8d42d3bec4..0b506755d8 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_tanf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
> index c25c5925a4..b7166b480a 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanhf16_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVeN16v_tanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
> index a6cf56cbe4..5fe52830b1 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanhf4_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVbN4v_tanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S b/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
> index 610c4e0759..9ea6b9bc8c 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanhf8_core.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>         .text
>  ENTRY (_ZGVdN8v_tanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S b/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
> index 5e9eca577f..9230746e01 100644
> --- a/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
> +++ b/sysdeps/x86_64/fpu/svml_s_tanhf8_core_avx.S
> @@ -17,7 +17,7 @@
>     <https://www.gnu.org/licenses/>.  */
>
>  #include <sysdep.h>
> -#include "svml_s_wrapper_impl.h"
> +#include "svml_s_wrapper_impl.h.S"
>
>          .text
>  ENTRY (_ZGVcN8v_tanhf)
> diff --git a/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h b/sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
> similarity index 100%
> rename from sysdeps/x86_64/fpu/svml_s_wrapper_impl.h
> rename to sysdeps/x86_64/fpu/svml_s_wrapper_impl.h.S
> --
> 2.34.1

I don't think this change is needed.

-- 
H.J.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2022-11-18 15:51 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-11-18  6:37 [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
2022-11-18  6:37 ` [PATCH v1 2/3] x86/fpu: Reformat svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
2022-11-18  6:37 ` [PATCH v1 3/3] x86/fpu: Cleanup code in svml_{s|d}_wrapper_impl.h.S Noah Goldstein via Libc-alpha
2022-11-18 15:49 ` [PATCH v1 1/3] x86/fpu: Move svml_{s|d}_wrapper_impl.h -> svml_{s|d}_wrapper_impl.h.S H.J. Lu via Libc-alpha

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).