From: Johannes Altmanninger <aclopte@gmail.com>
To: Elijah Newren via GitGitGadget <gitgitgadget@gmail.com>
Cc: git@vger.kernel.org, Elijah Newren <newren@gmail.com>
Subject: Re: [PATCH] sequencer: fix environment that 'exec' commands run under
Date: Sun, 14 Nov 2021 21:21:23 +0100 [thread overview]
Message-ID: <20211114202123.b2zzcso4ne7hwjvi@gmail.com> (raw)
In-Reply-To: <pull.1134.git.git.1636738973089.gitgitgadget@gmail.com>
On Fri, Nov 12, 2021 at 05:42:52PM +0000, Elijah Newren via GitGitGadget wrote:
> From: Elijah Newren <newren@gmail.com>
>
> Commands executed from `git rebase --exec` can give different behavior
> from within that environment than they would outside of it, due to the
> fact that sequencer.c exports both GIT_DIR and GIT_WORK_TREE. For
> example, if the relevant script calls something like
>
> git -C ../otherdir log --format=%H --no-walk
>
> the user may be surprised to find that the command above does not show a
> commit hash from ../otherdir, because $GIT_DIR prevents automatic gitdir
> detection and makes the -C option useless.
Yep. I've had a case where "git rebase -x 'make test'" would fail because
"make test" tries to run some Git commands in a temporary repo. The workaround
was to unset all GIT_* environment variables, just like t/test-lib.sh does.
I had the same problem when testing shell completion because git prepends
$(git --exec-path) to $PATH. I don't see a good reason why "git rebase -x
cmd" passes a modified $PATH (and $GIT_EXEC_PATH) to cmd. The user is back in
control, so I'd expect the same environment as for the parent rebase process.
AFAICT, the main purpose of changing $PATH is to ease (cross-language)
implementation, I don't think the user is meant to notice.
Of course, exporting GIT_EXEC_PATH is desirable for some commands like gc
that delegate to a bunch of git processes without user interaction, to make
sure all children are from the same build. c90d565a46 (Propagate --exec-path
setting to external commands via GIT_EXEC_PATH, 2009-03-21). But
I don't think the same applies for rebase -x.
>
> This is a regression in behavior from the original legacy
> implemented-in-shell rebase. It is perhaps rare for it to cause
> problems in practice, especially since most small problems that were
> caused by this area of bugs has been fixed-up in the past in a way that
> masked the particular bug observed without fixing the real underlying
> problem.
Really interesting that we added multiple workarounds, when we just needed
to stop polluting the environment. The fact that our test suite
unsets most GIT_* variables also masks potential problems.
>
> How we arrived at the current situation is perhaps merited. The setting
This sounds weirdly penitent if read the wrong way. I guess
"An explanation how we arrived ... "?
> of GIT_DIR and GIT_WORK_TREE done by sequencer.c arose from a sequence
> of historical accidents:
>
> * When rebase was implemented as a shell command, it would call
> git-sh-setup, which among other things would set GIT_DIR -- but not
> export it. This meant that when rebase --exec commands were run via
> /bin/sh -c "$COMMAND"
> they would not inherit the GIT_DIR setting. The fact that GIT_DIR
> was not set in the run $COMMAND is the behavior we'd like to restore.
>
> * When the rebase--helper builtin was introduced to allow incrementally
> replacing shell with C code, we were in an implementation that was
More correct to say "we were in an (implementation) state" or better:
"we had an implementation"?
> half shell, half C. In particular, commit 18633e1a22 ("rebase -i: use
> the rebase--helper builtin", 2017-02-09) added calls to
> exec git rebase--helper ...
Ok I guess this was a good reason for adding the exec-path to the environment
of exec steps.
> which caused rebase--helper to inherit the GIT_DIR environment
> variable from the shell. git's setup would change the environment
> variable from an absolute path to a relative one (".git"), but would
> leave it set. This meant that when rebase --exec commands were run
> via
> run_command_v_opt(...)
> they would inherit the GIT_DIR setting.
>
> * In commit 09d7b6c6fa ("sequencer: pass absolute GIT_DIR to exec
> commands", 2017-10-31), it was noted that the GIT_DIR caused problems
> with some commands; e.g.
> git rebase --exec 'cd subdir && git describe' ...
> would have GIT_DIR=.git which was invalid due to the change to the
> subdirectory. Instead of questioning why GIT_DIR was set, that commit
> instead made sequencer change GIT_DIR to be an absolute path and
> explicitly export it via
> argv_array_pushf(&child_env, "GIT_DIR=%s", absolute_path(get_git_dir()));
> run_command_v_opt_cd_env(..., child_env.argv)
>
> * In commit ab5e67d751 ("sequencer: pass absolute GIT_WORK_TREE to exec
> commands", 2018-07-14), it was noted that when GIT_DIR is set but
> GIT_WORK_TREE is not, that we do not discover GIT_WORK_TREE but just
> assume it is '.'. That is incorrect if trying to run commands from a
> subdirectory. However, rather than question why GIT_DIR was set, that
> commit instead also added GIT_WORK_TREE to the list of things to
> export.
>
> Each of the above problems would have been fixed automatically when
> git-rebase become a full builtin, had it not been for the fact that
s/become/became/
> sequencer.c started exporting GIT_DIR and GIT_WORK_TREE in the interim.
> Stop exporting them now.
>
> Signed-off-by: Elijah Newren <newren@gmail.com>
> Acked-by: Johannes Schindelin <johannes.schindelin@gmx.de>
> ---
> sequencer: fix environment that 'exec' commands run under
Maybe: sequencer: do not export GIT_DIR and GIT_WORK_TREE to exec
(assuming that $PATH also needs fixing)
>
> I'm not sure if the added regression tests make sense
They are simple and correct.
We still pass on values from --git-dir= and GIT_DIR=. git.
Those are /probably/ right.
> or if t3409 is the best place to put them (taking over a recently
> removed t3409 that was used for the deprecated preserve merges option).
Looks like there is no t/t34*-rebase-exec.sh yet. Most tests of --exec are
in t/t3404-rebase-interactive.sh, but we don't need the interactive bit here.
>
> Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-git-1134%2Fnewren%2Ffix-rebase-exec-environ-v1
> Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-git-1134/newren/fix-rebase-exec-environ-v1
> Pull-Request: https://github.com/git/git/pull/1134
>
> sequencer.c | 9 +--------
> t/t3409-rebase-environ.sh | 23 +++++++++++++++++++++++
> 2 files changed, 24 insertions(+), 8 deletions(-)
> create mode 100755 t/t3409-rebase-environ.sh
>
> diff --git a/sequencer.c b/sequencer.c
> index ea96837cde3..9afdbe3e3d1 100644
> --- a/sequencer.c
> +++ b/sequencer.c
> @@ -3495,17 +3495,12 @@ static int error_failed_squash(struct repository *r,
>
> static int do_exec(struct repository *r, const char *command_line)
> {
> - struct strvec child_env = STRVEC_INIT;
> const char *child_argv[] = { NULL, NULL };
> int dirty, status;
>
> fprintf(stderr, _("Executing: %s\n"), command_line);
> child_argv[0] = command_line;
> - strvec_pushf(&child_env, "GIT_DIR=%s", absolute_path(get_git_dir()));
> - strvec_pushf(&child_env, "GIT_WORK_TREE=%s",
> - absolute_path(get_git_work_tree()));
> - status = run_command_v_opt_cd_env(child_argv, RUN_USING_SHELL, NULL,
> - child_env.v);
> + status = run_command_v_opt(child_argv, RUN_USING_SHELL);
>
> /* force re-reading of the cache */
> if (discard_index(r->index) < 0 || repo_read_index(r) < 0)
> @@ -3535,8 +3530,6 @@ static int do_exec(struct repository *r, const char *command_line)
> status = 1;
> }
>
> - strvec_clear(&child_env);
> -
> return status;
> }
>
> diff --git a/t/t3409-rebase-environ.sh b/t/t3409-rebase-environ.sh
> new file mode 100755
> index 00000000000..83ffb39d9ff
> --- /dev/null
> +++ b/t/t3409-rebase-environ.sh
> @@ -0,0 +1,23 @@
> +#!/bin/sh
> +
> +test_description='git rebase interactive environment'
maybe test_description='git rebase --exec environment'
> +
> +. ./test-lib.sh
> +
> +test_expect_success 'setup' '
> + test_commit one &&
> + test_commit two &&
> + test_commit three
> +'
> +
> +test_expect_success 'rebase --exec does not muck with GIT_DIR' '
> + git rebase --exec "printf %s \$GIT_DIR >environ" HEAD~1 &&
> + test_must_be_empty environ
> +'
> +
> +test_expect_success 'rebase --exec does not muck with GIT_WORK_TREE' '
> + git rebase --exec "printf %s \$GIT_WORK_TREE >environ" HEAD~1 &&
> + test_must_be_empty environ
> +'
I guess we could add a test for existing GIT_DIR in the environment, even
if it's not affected
test_expect_success 'already exported GIT_DIR is passed on to rebase --exec commands' '
GIT_DIR=.git GIT_WORK_TREE=. git rebase HEAD~1 --exec \
"printf %s\\\\n \"\$GIT_DIR\" \"\$GIT_WORK_TREE\" >actual" &&
cat >expect <<-EOF &&
.git
.
EOF
test_cmp expect actual &&
cat actual
'
(I tried to use non-default values lik ./.git and ./. but the weird thing
is that git canonicalizes the worktree but not the git dir, so meh)
I also wasn't sure about the behavior of --git-dir= Should it be the same as GIT_DIR=?
I think it's also conceivable that --git-dir= does *not* cause GIT_DIR to
be exported to exec commands, though that might break existing
scripts. Something like
git --work-tree=../other-worktree --git-dir=../other-worktree/.git \
rebase --exec "make generate-documentation && git commit -a --amend --no-edit"
(needless to say that in this case "git -C ../other-worktree" is probably
what the user wants)
next prev parent reply other threads:[~2021-11-14 20:21 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-11-12 17:42 [PATCH] sequencer: fix environment that 'exec' commands run under Elijah Newren via GitGitGadget
2021-11-14 20:21 ` Johannes Altmanninger [this message]
2021-11-23 17:48 ` Elijah Newren
2021-12-05 8:45 ` Johannes Altmanninger
2021-11-16 5:53 ` [PATCH v2] sequencer: do not export GIT_DIR and GIT_WORK_TREE for 'exec' Elijah Newren via GitGitGadget
2021-11-16 6:06 ` Johannes Altmanninger
2021-11-16 9:59 ` Phillip Wood
2021-12-04 5:36 ` [PATCH v3] " Elijah Newren via GitGitGadget
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: http://vger.kernel.org/majordomo-info.html
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20211114202123.b2zzcso4ne7hwjvi@gmail.com \
--to=aclopte@gmail.com \
--cc=git@vger.kernel.org \
--cc=gitgitgadget@gmail.com \
--cc=newren@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://80x24.org/mirrors/git.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).