From: Phillip Wood <phillip.wood123@gmail.com>
To: Johannes Schindelin <Johannes.Schindelin@gmx.de>
Cc: "Johannes Schindelin via GitGitGadget" <gitgitgadget@gmail.com>,
git@vger.kernel.org, "Taylor Blau" <me@ttaylorr.com>,
"Phillip Wood" <phillip.wood123@gmail.com>,
"Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Subject: Re: [PATCH v2] range-diff: support reading mbox files
Date: Tue, 22 Nov 2022 14:20:08 +0000
Message-ID: <db3baf6d-02b8-f562-70dd-13d16149e9be@dunelm.org.uk>
In-Reply-To: <rr7sp534-43o9-7n1o-5700-369n5rprq75p@tzk.qr>
Hi Dscho
On 22/11/2022 07:40, Johannes Schindelin wrote:
> [...]
>> I'm a bit confused by this sed command, I've annotated it with my probably
>> flawed understanding.
>>
>>> ++ sed -e "/^From: .*/{
>>> ++ h
>>
>> This stores the From: header in the hold space
>
> 👍
>
>>> ++ s/.*/From: Bugs Bunny <bugs@bun.ny>/
>>
>> This changes the From: header in the pattern space
>
> 👍
>
>>> ++ :1
>>> ++ N
>>> ++ /[ -z]$/b1
>>
>> We loop until we find a line that does not end with a space, letter or number
>> adding the lines to the hold space
>
> I would have _loved_ to match on an empty line, i.e. `/[^\n]$/b1`. But
> that construct is not understood by the `sed` on macOS.
I don't think it matters much, but can't you match an empty line with
/^$/? Then you could loop on non-empty lines with /^$/!b1
> I even went so far as to search for the source code of a BSD `sed` (and I
> found it, and modified the code so that it builds on Linux, see
> https://github.com/dscho/sed-bsd) to try a couple of things, but could not
> make it work with any variation of `\n`. Therefore, I settled on expecting
> all the lines in the commit header to end in printable ASCII characters
> between the space and the `z`.
>
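On the /^$/ idea, here is a small check of how that address behaves once `N` has joined lines. It is a sketch only, written with GNU sed in mind. `^` and `$` anchor to the whole pattern space rather than to the last line held in it, so /^$/ cannot match while earlier header lines are still in the pattern space. The variable names and sample input are made up for illustration.

```shell
# Input is the line "header" followed by an empty line.
# After N the pattern space is "header<newline>", which is not empty,
# so /^$/ does not match and nothing is printed.
joined=$(printf 'header\n\n' | sed -n 'N;/^$/p')

# Without N, the empty line is a pattern space of its own, so /^$/
# matches it; "=" prints the line number of the match.
alone=$(printf 'header\n\n' | sed -n '/^$/=')

printf 'joined=[%s] alone=[%s]\n' "$joined" "$alone"
```

So /^$/!b1 placed after `N` would keep looping past the blank line that ends the header block, which suggests the loop condition needs some other way to look at the last line only.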
>>> ++ G
>>
>> This appends the hold space to the pattern space, then the pattern space is
>> printed.
>
> 👍
>
>> Doesn't this mean we end up with two From: headers? Is the in-body From:
>> line already present?
>
> There is no in-body `From:` header because the patch author matches the
> `GIT_AUTHOR_IDENT` that is in effect while running the `format-patch`
> command.
>
> Let me show you what this `sed` call deals with. In the local test run, it
> modified an `mbox` starting with this:
Thanks for taking the trouble to show the mbox files; I didn't have time
to run the tests myself yesterday. The processed mbox file looks good.
> From 4d39cb329d3ef4c8e69b43859c2e11adb83f8613 Mon Sep 17 00:00:00 2001
> From: Thomas Rast <trast@inf.ethz.ch>
> Date: Mon, 22 Jul 2013 11:23:44 +0200
> Subject: [PATCH 1/3] s/4/A/ + add other-file
>
> ---
> file | 2 +-
> other-file | 0
> [...]
>
> to a modified `mbox` that starts with this:
>
> From 4d39cb329d3ef4c8e69b43859c2e11adb83f8613 Mon Sep 17 00:00:00 2001
> From: Bugs Bunny <bugs@bun.ny>
> Date: Mon, 22 Jul 2013 11:23:44 +0200
> Subject: [PATCH 1/3] s/4/A/ + add other-file
>
> From: Thomas Rast <trast@inf.ethz.ch>
> ---
> file | 2 +-
> other-file | 0
> [...]
>
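For anyone who wants to reproduce the before/after shown above outside the test suite, the following is a minimal sketch. It feeds a trimmed copy of the quoted mbox through the same sed script. The scratch directory and the file names `mbox` and `mbox.from` inside it are placeholders, and the sample is cut off after the first diffstat line.

```shell
# Scratch directory; the location is arbitrary.
tmp=$(mktemp -d)

# Trimmed copy of the mbox quoted in this thread.
cat >"$tmp/mbox" <<\EOF
From 4d39cb329d3ef4c8e69b43859c2e11adb83f8613 Mon Sep 17 00:00:00 2001
From: Thomas Rast <trast@inf.ethz.ch>
Date: Mon, 22 Jul 2013 11:23:44 +0200
Subject: [PATCH 1/3] s/4/A/ + add other-file

---
 file | 2 +-
EOF

# h      save the original From: header in the hold space
# s      replace the header in the pattern space
# :1/N   append following lines to the pattern space while the
#        pattern space still ends in a character between space and z
# G      append the saved header after the blank line ending the headers
sed -e '/^From: .*/{
h
s/.*/From: Bugs Bunny <bugs@bun.ny>/
:1
N
/[ -z]$/b1
G
}' <"$tmp/mbox" >"$tmp/mbox.from"

cat "$tmp/mbox.from"
```

The `From ` separator on the first line is left alone because it has no colon, so only the real `From:` header triggers the block.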
>>
>>> ++ }" <mbox >mbox.from &&
>>> ++ git range-diff mode-only-change..topic mbox:./mbox.from
>>> >actual.from &&
>>> ++ test_cmp expect actual.from &&
>>> ++
>>> ++ append_cr <mbox >mbox.cr &&
>>> ++ test_must_fail git range-diff \
>>> ++ mode-only-change..topic mbox:./mbox.cr 2>err &&
>>> ++ grep CR/LF err &&
>>
>> Thanks for adding that
>
> Thank you for the suggestion to add it!
>
> And thank you again for modeling how to perform actionable, helpful and
> productive code reviews,
Thanks for your kind words, I'm glad you found my comments helpful.
Best Wishes
Phillip