From: Johannes Sixt <email@example.com>
To: "Ævar Arnfjörð Bjarmason" <firstname.lastname@example.org>
Cc: "Marc-André Lureau" <email@example.com>,
"Junio C Hamano" <firstname.lastname@example.org>,
Subject: Re: [PATCH] userdiff: two simplifications of patterns for rust
Date: Thu, 30 May 2019 22:32:56 +0200 [thread overview]
Message-ID: <email@example.com> (raw)
Am 30.05.19 um 20:59 schrieb Ævar Arnfjörð Bjarmason:
> On Thu, May 30 2019, Johannes Sixt wrote:
>> - Do not enforce (but assume) syntactic correctness of language
>> constructs that go into hunk headers: we only want to ensure that
>> the keywords actually are words and not just the initial part of
>> some identifier.
>> - In the word regex, match numbers only when they begin with a digit,
>> but then be liberal in what follows, assuming that the text that is
>> matched is syntactially correct.
> I don't know if this is possible for Rust (but very much suspect so...),
> but I think that in general we should aim to be more forgiving than not
> with these patterns.
The C/C++ pattern is actually very forgiving in the hunk header pattern:
It takes every line that begins with an un-indented letter. That works
very well in in C because C does not have nested functions and it is
typical that the function definition lines are not indented. But that
breaks down with C++: indented function definitions are very common;
they happen inside class and namespace definitions. Such functions are
not picked up, and we live with that so far (at least, I do).
> Because, as the history of userdiff.c shows, new keywords get introduced
> into these languages, and old git versions survive for a long time. If
> the syntax is otherwise fairly regular perhaps we don't need to hardcode
> the list of existing keywords?
We are talking about (1) hunk header lines (not something really
important) and (2) programming languages: new keywords don't pop up
every month. Granted, inventing new languages is en vogue these days.
But really, I mean, WTH?
Having available keywords to recognize hunk header candidates helps a
lot. I thought long about a possible pattern for C++, but I gave up,
because the language is so rich and there are no suitable keywords.
prev parent reply other threads:[~2019-05-30 20:33 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-05-20 17:04 [PATCH v3] userdiff: add built-in pattern for rust marcandre.lureau
2019-05-20 19:52 ` Johannes Sixt
2019-05-21 10:57 ` Marc-André Lureau
2019-05-28 16:34 ` Junio C Hamano
2019-05-28 20:31 ` Johannes Sixt
2019-05-28 21:01 ` Marc-André Lureau
2019-05-30 16:44 ` [PATCH] userdiff: two simplifications of patterns " Johannes Sixt
2019-05-30 18:59 ` Ævar Arnfjörð Bjarmason
2019-05-30 20:32 ` Johannes Sixt [this message]
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
List information: http://vger.kernel.org/majordomo-info.html
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).