unofficial mirror of libc-alpha@sourceware.org
 help / color / mirror / Atom feed
From: Szabolcs Nagy via Libc-alpha <libc-alpha@sourceware.org>
To: Wilco Dijkstra <Wilco.Dijkstra@arm.com>
Cc: 'GNU C Library' <libc-alpha@sourceware.org>
Subject: Re: [PATCH v3 1/5] AArch64: Improve A64FX memset
Date: Mon, 2 Aug 2021 15:50:03 +0100	[thread overview]
Message-ID: <20210802145003.GH14854@arm.com> (raw)
In-Reply-To: <VE1PR08MB5599AC44FC079BAA03DE9C5383EF9@VE1PR08MB5599.eurprd08.prod.outlook.com>

The 08/02/2021 14:38, Wilco Dijkstra via Libc-alpha wrote:
> > We discussed how should be defined BTI_C macro before, at that time conclusion
> > was "NOP" rather than empty unless HAVE_AARCH64_BTI.
> > Now the above code defines BTI_C as empty unconditionally.
> > A64FX doesn't support BTI, so this code is OK.
> > But I'm just interested in the reason why it is changed.
> 
> We changed to NOP in the generic code, so that works for all string functions.
> In this specific case removing the initial NOP as well allows all performance critical
> code for <= 512 bytes to be perfectly aligned to 16-byte fetch blocks.

yes, this makes sense:

originally BTI_C was always hint 34, but since that can be
slow it was changed for !HAVE_AARCH64_BTI. We don't want the
layout of asm code to change based on toolchain configuration
so BTI_C is defined as a place holder nop then.

but in a64fx specific code bti is never needed so we also
don't need the place holder nop, BTI_C can be unconditionally
empty.

  reply	other threads:[~2021-08-02 14:50 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-07-22 15:59 [PATCH v3 1/5] AArch64: Improve A64FX memset Wilco Dijkstra via Libc-alpha
2021-07-28  8:10 ` naohirot--- via Libc-alpha
2021-08-02 13:53   ` naohirot--- via Libc-alpha
2021-08-02 14:38     ` Wilco Dijkstra via Libc-alpha
2021-08-02 14:50       ` Szabolcs Nagy via Libc-alpha [this message]
2021-08-03  2:57         ` naohirot--- via Libc-alpha
2021-08-03  8:01           ` Szabolcs Nagy via Libc-alpha
2021-09-24  7:56             ` naohirot--- via Libc-alpha
2021-08-03  2:56       ` naohirot--- via Libc-alpha

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.gnu.org/software/libc/involved.html

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20210802145003.GH14854@arm.com \
    --to=libc-alpha@sourceware.org \
    --cc=Wilco.Dijkstra@arm.com \
    --cc=szabolcs.nagy@arm.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).