From: Adhemerval Zanella Netto <adhemerval.zanella@linaro.org>
To: "H.J. Lu" <hjl.tools@gmail.com>,
GNU C Library <libc-alpha@sourceware.org>
Subject: Re: [PATCH v2 03/10] i386: Use generic fmod
Date: Thu, 28 Mar 2024 15:22:08 -0300 [thread overview]
Message-ID: <1c27a80e-30f9-48fb-b608-8f2387621f5e@linaro.org> (raw)
In-Reply-To: <CAMe9rOpCHzLk=kHn_AKOc6aiVGRNKu==x5U7on74BrOfPV96XQ@mail.gmail.com>
On 28/03/24 13:00, H.J. Lu wrote:
> On Thu, Mar 28, 2024 at 8:57 AM Adhemerval Zanella Netto
> <adhemerval.zanella@linaro.org> wrote:
>>
>>
>>
>> On 28/03/24 12:55, H.J. Lu wrote:
>>> On Thu, Mar 28, 2024 at 8:48 AM Adhemerval Zanella Netto
>>> <adhemerval.zanella@linaro.org> wrote:
>>>>
>>>>
>>>>
>>>> On 28/03/24 12:42, H.J. Lu wrote:
>>>>> On Thu, Mar 28, 2024 at 8:14 AM Adhemerval Zanella Netto
>>>>> <adhemerval.zanella@linaro.org> wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>> On 28/03/24 11:51, H.J. Lu wrote:
>>>>>>> On Thu, Mar 28, 2024 at 7:11 AM Adhemerval Zanella Netto
>>>>>>> <adhemerval.zanella@linaro.org> wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On 27/03/24 18:38, H.J. Lu wrote:
>>>>>>>>> On Wed, Mar 27, 2024 at 1:37 PM Adhemerval Zanella Netto
>>>>>>>>> <adhemerval.zanella@linaro.org> wrote:
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On 27/03/24 16:55, H.J. Lu wrote:
>>>>>>>>>>> On Wed, Mar 27, 2024 at 12:40 PM Adhemerval Zanella
>>>>>>>>>>> <adhemerval.zanella@linaro.org> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>> The benchtest results shows a slight improvement (Ryzen 5900, gcc
>>>>>>>>>>>> 13.2.1):
>>>>>>>>>>>>
>>>>>>>>>>>> * sysdeps/i386/fpu/e_fmod.S:
>>>>>>>>>>>> "fmod": {
>>>>>>>>>>>> "subnormals": {
>>>>>>>>>>>> "duration": 3.68855e+09,
>>>>>>>>>>>> "iterations": 2.12608e+08,
>>>>>>>>>>>> "max": 62.012,
>>>>>>>>>>>> "min": 16.798,
>>>>>>>>>>>> "mean": 17.349
>>>>>>>>>>>> },
>>>>>>>>>>>> "normal": {
>>>>>>>>>>>> "duration": 3.88459e+09,
>>>>>>>>>>>> "iterations": 7.168e+06,
>>>>>>>>>>>> "max": 2879.12,
>>>>>>>>>>>> "min": 16.909,
>>>>>>>>>>>> "mean": 541.934
>>>>>>>>>>>> },
>>>>>>>>>>>> "close-exponents": {
>>>>>>>>>>>> "duration": 3.692e+09,
>>>>>>>>>>>> "iterations": 1.96608e+08,
>>>>>>>>>>>> "max": 66.452,
>>>>>>>>>>>> "min": 16.835,
>>>>>>>>>>>> "mean": 18.7785
>>>>>>>>>>>> }
>>>>>>>>>>>> }
>>>>>>>>>>>>
>>>>>>>>>>>> * generic
>>>>>>>>>>>> "fmod": {
>>>>>>>>>>>> "subnormals": {
>>>>>>>>>>>> "duration": 3.68645e+09,
>>>>>>>>>>>> "iterations": 2.2848e+08,
>>>>>>>>>>>> "max": 66.896,
>>>>>>>>>>>> "min": 15.91,
>>>>>>>>>>>> "mean": 16.1347
>>>>>>>>>>>> },
>>>>>>>>>>>> "normal": {
>>>>>>>>>>>> "duration": 4.1455e+09,
>>>>>>>>>>>> "iterations": 8.192e+06,
>>>>>>>>>>>> "max": 3376.18,
>>>>>>>>>>>> "min": 15.873,
>>>>>>>>>>>> "mean": 506.043
>>>>>>>>>>>> },
>>>>>>>>>>>> "close-exponents": {
>>>>>>>>>>>> "duration": 3.70197e+09,
>>>>>>>>>>>> "iterations": 2.08896e+08,
>>>>>>>>>>>> "max": 69.597,
>>>>>>>>>>>> "min": 15.947,
>>>>>>>>>>>> "mean": 17.7216
>>>>>>>>>>>> }
>>>>>>>>>>>> }
>>>>>>>>>>>> ---
>>>>>>>>>>>> sysdeps/i386/fpu/Versions | 4 ++++
>>>>>>>>>>>> sysdeps/i386/fpu/e_fmod.S | 18 ------------------
>>>>>>>>>>>> sysdeps/i386/fpu/e_fmod.c | 2 ++
>>>>>>>>>>>> sysdeps/i386/fpu/math_err.c | 1 -
>>>>>>>>>>>> sysdeps/i386/fpu/w_fmod_compat.c | 15 ---------------
>>>>>>>>>>>> sysdeps/ieee754/dbl-64/e_fmod.c | 5 ++++-
>>>>>>>>>>>> sysdeps/mach/hurd/i386/libm.abilist | 1 +
>>>>>>>>>>>> sysdeps/unix/sysv/linux/i386/libm.abilist | 1 +
>>>>>>>>>>>> 8 files changed, 12 insertions(+), 35 deletions(-)
>>>>>>>>>>>> delete mode 100644 sysdeps/i386/fpu/e_fmod.S
>>>>>>>>>>>> create mode 100644 sysdeps/i386/fpu/e_fmod.c
>>>>>>>>>>>> delete mode 100644 sysdeps/i386/fpu/math_err.c
>>>>>>>>>>>> delete mode 100644 sysdeps/i386/fpu/w_fmod_compat.c
>>>>>>>>>>>>
>>>>>>>>>>>> diff --git a/sysdeps/i386/fpu/Versions b/sysdeps/i386/fpu/Versions
>>>>>>>>>>>> index a2eec371f1..d37bc1eae6 100644
>>>>>>>>>>>> --- a/sysdeps/i386/fpu/Versions
>>>>>>>>>>>> +++ b/sysdeps/i386/fpu/Versions
>>>>>>>>>>>> @@ -3,4 +3,8 @@ libm {
>>>>>>>>>>>> # functions used in inline functions or macros
>>>>>>>>>>>> __expl; __expm1l;
>>>>>>>>>>>> }
>>>>>>>>>>>> + GLIBC_2.40 {
>>>>>>>>>>>> + # No SVID compatible error handling.
>>>>>>>>>>>> + fmod;
>>>>>>>>>>>> + }
>>>>>>>>>>>
>>>>>>>>>>> This changes the ABI. I assume that it fixes a real bug. Is there a bug
>>>>>>>>>>> report open for this?
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> The new version is the way to provide the system without the SVID compat
>>>>>>>>>> support, which we for all ABIs but i386 on 2.38. For instance:
>>>>>>>>>>
>>>>>>>>>> find . -iname libm.abilist | xargs grep -w fmod
>>>>>>>>>> ./sysdeps/unix/sysv/linux/sparc/sparc32/libm.abilist:GLIBC_2.0 fmod F
>>>>>>>>>> ./sysdeps/unix/sysv/linux/sparc/sparc32/libm.abilist:GLIBC_2.38 fmod F
>>>>>>>>>> [...]
>>>>>>>>>>
>>>>>>>>>> For i386 specifically, the old SVID symbol will be kept as fmod@GLIBC_2.0.
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Does it fix a run-time test which fails without the fix?
>>>>>>>>>
>>>>>>>>
>>>>>>>> Not really, but it is one less assembly implementation in favor a generic one
>>>>>>>> (which also shows a slight improvement on recent chips) and it sync i386
>>>>>>>> with generic code (so less possible issues, such as the static lib in this
>>>>>>>> patchset).
>>>>>>>
>>>>>>> Why do we need a new symbol?
>>>>>>
>>>>>> Because the new fmod@GLIBC_2.40 for i386 won't have the SVID handling,
>>>>>> similar to what has been done for other architectures with
>>>>>> 16439f419b270184ec501c531bf20d83b6745fb0;
>>>>>
>>>>> Does it change i386 fmod behavior? If yes, we need a testcase to verify it.
>>>>> If not, why is it needed?
>>>>>
>>>>
>>>> It is not strictly required, but it makes i386 has one less assembly optimization
>>>> that do not follow the rest of the code and it optimizes it slight because. Since
>>>> we do actually have check for SVID, the default math tests already check the
>>>> required symbol semantic.
>>>
>>> fmod@GLIBC_2.40 is added because of the SVID handling. But there is no
>>> user visible behavior change. Is this correct?
>>
>> The user visible is the missing SVID handling (which I think noone actually uses
>> it). That's the main reason we need the compat dance and this extra complexity.
>> Maybe one day we just can drop this for good...
>
> If we want to provide the SVID compatibility, 2 testcases are needed:
>
> 1. A testcase to show that the new implementation is incompatible with SVID.
> 2. A testcase to show that the compat symbol provides the SVID compatibility.
We don't really have SVID compatibility tests for any other optimization/simplification,
and although I don't really oppose on adding I also thinking that this is making this
change even more complicated than it would require.
I can drop the i386 changes to use generic implementations if you think it would
simplify this patchset.
next prev parent reply other threads:[~2024-03-28 18:22 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-03-27 19:40 [PATCH v2 00/10] Fix some libm static issues Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 01/10] math: Add support for auto static math tests Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 02/10] math: Fix i386 and m68k fmod/fmodf on static build (BZ 31488) Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 03/10] i386: Use generic fmod Adhemerval Zanella
2024-03-27 19:55 ` H.J. Lu
2024-03-27 20:37 ` Adhemerval Zanella Netto
2024-03-27 21:38 ` H.J. Lu
2024-03-28 14:11 ` Adhemerval Zanella Netto
2024-03-28 14:51 ` H.J. Lu
2024-03-28 15:14 ` Adhemerval Zanella Netto
[not found] ` <CAMe9rOqhQDA-zk=+oTvdoPpq=rGEhtan0couaZ3Z_fxeFpa=7A@mail.gmail.com>
[not found] ` <9bf7af32-afa3-439c-84c0-76e76b220e44@linaro.org>
[not found] ` <CAMe9rOqTcmRc9mvguQnDRFb=BTjZJ5CptxjtPjtBGoB-a5mc6Q@mail.gmail.com>
[not found] ` <0e4733a3-d569-4a73-b2d3-001cc6c3d751@linaro.org>
2024-03-28 16:00 ` H.J. Lu
2024-03-28 18:22 ` Adhemerval Zanella Netto [this message]
2024-03-28 18:38 ` Joseph Myers
2024-03-28 19:37 ` Adhemerval Zanella Netto
2024-03-28 19:57 ` H.J. Lu
2024-03-27 19:40 ` [PATCH v2 04/10] i386: Use generic fmodf Adhemerval Zanella
2024-03-27 19:55 ` H.J. Lu
2024-03-27 19:40 ` [PATCH v2 05/10] math: Fix i386 and m68k exp10 on static build Adhemerval Zanella
2024-03-27 19:57 ` H.J. Lu
2024-03-27 20:39 ` Adhemerval Zanella Netto
2024-03-27 20:55 ` Joseph Myers
2024-03-27 19:40 ` [PATCH v2 06/10] i386: Use generic exp10 Adhemerval Zanella
2024-03-27 20:14 ` H.J. Lu
2024-03-27 19:40 ` [PATCH v2 07/10] math: Fix isnanf128 static build Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 08/10] math: Provided copysignf128 for static libm on alpha, s390, and sparcv9 Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 09/10] math: Provide frexpf128 " Adhemerval Zanella
2024-03-27 19:40 ` [PATCH v2 10/10] math: Provide modf128 " Adhemerval Zanella
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.gnu.org/software/libc/involved.html
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1c27a80e-30f9-48fb-b608-8f2387621f5e@linaro.org \
--to=adhemerval.zanella@linaro.org \
--cc=hjl.tools@gmail.com \
--cc=libc-alpha@sourceware.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).