git@vger.kernel.org mailing list mirror (one of many)
 help / color / mirror / code / Atom feed
* [PATCH] submodule: explicitly specify on-demand upon push
@ 2022-11-08  0:25 Jonathan Tan
  2022-11-08  0:31 ` Taylor Blau
  2022-11-14 21:37 ` [PATCH v2] Doc: document push.recurseSubmodules=only Jonathan Tan
  0 siblings, 2 replies; 10+ messages in thread
From: Jonathan Tan @ 2022-11-08  0:25 UTC (permalink / raw)
  To: git; +Cc: Jonathan Tan

When running "git push --recurse-submodules=on-demand" (or "=only") on a
superproject with nested submodules and assuming default configurations,
only the top-level submodules are pushed. This is because recursion
into the top-level submodules is performed with "git push" (without any
additional relevant options), and the default configuration is to not
recurse to nested submodules.

Therefore, instead of recursing with "git push" without any additional
relevant options, recurse with "--recurse-submodules=on-demand".

This now means that any push.recurseSubmodules configuration in any
submodule is no longer respected: only the configuration (or CLI
argument to override it) of the superproject is used. Update the
documentation accordingly.

(As a side effect of making the documentation of both the CLI argument
and the config variable consistent, the config variable is now
documented to support "only". This has been the case for a while, and I
have also included a test to show that.)

Signed-off-by: Jonathan Tan <jonathantanmy@google.com>
---
 Documentation/config/push.txt  | 18 ++++------
 Documentation/git-push.txt     |  2 +-
 submodule.c                    |  9 +++++
 t/t5531-deep-submodule-push.sh | 66 ++++++++++++++++++++++++++++++++++
 4 files changed, 82 insertions(+), 13 deletions(-)

diff --git a/Documentation/config/push.txt b/Documentation/config/push.txt
index 7386fea225..9960afef84 100644
--- a/Documentation/config/push.txt
+++ b/Documentation/config/push.txt
@@ -110,20 +110,14 @@ This will result in only b (a and c are cleared).
 ----
 
 push.recurseSubmodules::
-	Make sure all submodule commits used by the revisions to be pushed
-	are available on a remote-tracking branch. If the value is 'check'
-	then Git will verify that all submodule commits that changed in the
-	revisions to be pushed are available on at least one remote of the
-	submodule. If any commits are missing, the push will be aborted and
-	exit with non-zero status. If the value is 'on-demand' then all
-	submodules that changed in the revisions to be pushed will be
-	pushed. If on-demand was not able to push all necessary revisions
-	it will also be aborted and exit with non-zero status. If the value
-	is 'no' then default behavior of ignoring submodules when pushing
-	is retained. You may override this configuration at time of push by
-	specifying '--recurse-submodules=check|on-demand|no'.
+	May be "check", "on-demand", "only", or "no", with the same behavior
+	as that of "push --recurse-submodules".
 	If not set, 'no' is used by default, unless 'submodule.recurse' is
 	set (in which case a 'true' value means 'on-demand').
++
+Whenever "git push" is executed, only the configuration of the
+repository in which the command was run is used, not any of its
+submodules (even if any such submodule contains its own submodules).
 
 push.useForceIfIncludes::
 	If set to "true", it is equivalent to specifying
diff --git a/Documentation/git-push.txt b/Documentation/git-push.txt
index def7657ef9..c63a4c186b 100644
--- a/Documentation/git-push.txt
+++ b/Documentation/git-push.txt
@@ -407,7 +407,7 @@ Specifying `--no-force-if-includes` disables this behavior.
 	remote of the submodule. If any commits are missing the push will
 	be aborted and exit with non-zero status. If 'on-demand' is used
 	all submodules that changed in the revisions to be pushed will be
-	pushed. If on-demand was not able to push all necessary revisions it will
+	recursively pushed. If on-demand was not able to push all necessary revisions it will
 	also be aborted and exit with non-zero status. If 'only' is used all
 	submodules will be recursively pushed while the superproject is left
 	unpushed. A value of 'no' or using `--no-recurse-submodules` can be used
diff --git a/submodule.c b/submodule.c
index bf7a2c7918..06ee74a282 100644
--- a/submodule.c
+++ b/submodule.c
@@ -1130,6 +1130,15 @@ static int push_submodule(const char *path,
 	if (for_each_remote_ref_submodule(path, has_remote, NULL) > 0) {
 		struct child_process cp = CHILD_PROCESS_INIT;
 		strvec_push(&cp.args, "push");
+		/*
+		 * If this function is called, it is because the user
+		 * requested recursion through
+		 * --recurse-submodules=on-demand or
+		 * --recurse-submodules=only (or the equivalent
+		 * config). In both cases, we should fully recurse into
+		 * all submodules and their descendants.
+		 */
+		strvec_push(&cp.args, "--recurse-submodules=on-demand");
 		if (dry_run)
 			strvec_push(&cp.args, "--dry-run");
 
diff --git a/t/t5531-deep-submodule-push.sh b/t/t5531-deep-submodule-push.sh
index 3f58b515ce..b9daf262a9 100755
--- a/t/t5531-deep-submodule-push.sh
+++ b/t/t5531-deep-submodule-push.sh
@@ -512,6 +512,72 @@ test_expect_success 'push only unpushed submodules recursively' '
 	test_cmp expected_pub actual_pub
 '
 
+setup_subsub () {
+	git init upstream &&
+	git init upstream/sub &&
+	git init upstream/sub/deepsub &&
+	test_commit -C upstream/sub/deepsub innermost &&
+	git -C upstream/sub submodule add ./deepsub deepsub &&
+	git -C upstream/sub commit -m middle &&
+	git -C upstream submodule add ./sub sub &&
+	git -C upstream commit -m outermost &&
+
+	git -c protocol.file.allow=always clone --recurse-submodules upstream downstream &&
+	git -C downstream/sub/deepsub checkout -b downstream-branch &&
+	git -C downstream/sub checkout -b downstream-branch &&
+	git -C downstream checkout -b downstream-branch
+}
+
+new_downstream_commits () {
+	test_commit -C downstream/sub/deepsub new-innermost &&
+	git -C downstream/sub add deepsub &&
+	git -C downstream/sub commit -m new-middle &&
+	git -C downstream add sub &&
+	git -C downstream commit -m new-outermost
+}
+
+test_expect_success 'push recurses into submodule and sub-submodule' '
+	test_when_finished rm -rf upstream downstream &&
+	setup_subsub &&
+	new_downstream_commits &&
+	git -C downstream push --recurse-submodules=only origin downstream-branch &&
+
+	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch
+'
+
+test_expect_success 'recursive push ignores submodule config' '
+	test_when_finished rm -rf upstream downstream &&
+	setup_subsub &&
+	new_downstream_commits &&
+	git -C downstream config push.recurseSubmodules no &&
+	git -C downstream/sub config push.recurseSubmodules no &&
+	git -C downstream/sub/deepsub config push.recurseSubmodules no &&
+	git -C downstream push --recurse-submodules=on-demand origin downstream-branch &&
+
+	# pushes still happen, despite config
+	git -C upstream rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch
+'
+
+test_expect_success 'push with push.recurseSubmodules=only' '
+	test_when_finished rm -rf upstream downstream &&
+	setup_subsub &&
+	new_downstream_commits &&
+	git -C downstream config push.recurseSubmodules only &&
+	git -C downstream/sub config push.recurseSubmodules only &&
+	git -C downstream/sub/deepsub config push.recurseSubmodules only &&
+	git -C downstream push origin downstream-branch &&
+
+	# all pushes happen except superproject (the intermediate
+	# "only" does not apply because only the superproject config is honored)
+	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch
+'
+
 test_expect_success 'push propagating the remotes name to a submodule' '
 	git -C work remote add origin ../pub.git &&
 	git -C work remote add pub ../pub.git &&
-- 
2.38.1.431.g37b22c650d-goog


^ permalink raw reply related	[flat|nested] 10+ messages in thread

* Re: [PATCH] submodule: explicitly specify on-demand upon push
  2022-11-08  0:25 [PATCH] submodule: explicitly specify on-demand upon push Jonathan Tan
@ 2022-11-08  0:31 ` Taylor Blau
  2022-11-08 21:43   ` Jonathan Tan
  2022-11-14 21:37 ` [PATCH v2] Doc: document push.recurseSubmodules=only Jonathan Tan
  1 sibling, 1 reply; 10+ messages in thread
From: Taylor Blau @ 2022-11-08  0:31 UTC (permalink / raw)
  To: Jonathan Tan; +Cc: git

On Mon, Nov 07, 2022 at 04:25:52PM -0800, Jonathan Tan wrote:
> This now means that any push.recurseSubmodules configuration in any
> submodule is no longer respected: only the configuration (or CLI
> argument to override it) of the superproject is used. Update the
> documentation accordingly.

Hmm. Is that a desired outcome or an unfortunate side-effect of the
implementation below?

Not having thought about this a lot, the behavior I might expect is
something along the lines of recursively pushing throughout the
submodule tree, stopping the recursion as soon as we get to a nested
submodule which says "don't push any of my children".

On the other hand, I could sympathize with a compelling argument that
the superproject alone should be in charge of determining what gets
pushed.

Though TBH, it seems like the former is more convincing. If I depend on
an external repository through a submodule, and that repository itself
has submodules, it would be nice to configure (once) that I don't want
to even try and push any of that repository's children.

Thanks,
Taylor

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH] submodule: explicitly specify on-demand upon push
  2022-11-08  0:31 ` Taylor Blau
@ 2022-11-08 21:43   ` Jonathan Tan
  2022-11-09 18:31     ` Glen Choo
  0 siblings, 1 reply; 10+ messages in thread
From: Jonathan Tan @ 2022-11-08 21:43 UTC (permalink / raw)
  To: Taylor Blau; +Cc: Jonathan Tan, git

Taylor Blau <me@ttaylorr.com> writes:
> On Mon, Nov 07, 2022 at 04:25:52PM -0800, Jonathan Tan wrote:
> > This now means that any push.recurseSubmodules configuration in any
> > submodule is no longer respected: only the configuration (or CLI
> > argument to override it) of the superproject is used. Update the
> > documentation accordingly.
> 
> Hmm. Is that a desired outcome or an unfortunate side-effect of the
> implementation below?

What started this effort was investigating whether push.recurseSubmodules
could be set to "only", since that is something that would be useful at
$DAYJOB. It turns out that push.recurseSubmodules is not documented to
take "only", but it is supported, with the caveat that if a repository has
push.recurseSubmodules=only, you cannot recursively push in its superproject
because there is a check that submodules have been pushed, and with this
setting, that submodule is not pushed (since only nested submodules are
pushed).

In addition, there is the usage of the word "recursively" when describing
"--recurse-submodules=only"...

  If only is used all submodules will be recursively pushed while the
  superproject is left unpushed.

...even though this is not true with the default configuration. (Having said
that, there is no "recursively" with --recurse-submodules=on-demand, so maybe
it is the non-recursive meaning that is intended.)

So all this led me to think that it's best if only the top-level configuration
is used, but I'm open to other ideas.

> Not having thought about this a lot, the behavior I might expect is
> something along the lines of recursively pushing throughout the
> submodule tree, stopping the recursion as soon as we get to a nested
> submodule which says "don't push any of my children".
> 
> On the other hand, I could sympathize with a compelling argument that
> the superproject alone should be in charge of determining what gets
> pushed.
> 
> Though TBH, it seems like the former is more convincing. If I depend on
> an external repository through a submodule, and that repository itself
> has submodules, it would be nice to configure (once) that I don't want
> to even try and push any of that repository's children.

I just tried this and the behavior is reasonable except possibly for when
push.recurseSubmodules=only is configured in a top-level submodule. Let's see
if other people have something to say. For me, this would also be fine since we
can just make sure that we don't configure "only" in top-level submodules that
have their own nested submodules.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH] submodule: explicitly specify on-demand upon push
  2022-11-08 21:43   ` Jonathan Tan
@ 2022-11-09 18:31     ` Glen Choo
  0 siblings, 0 replies; 10+ messages in thread
From: Glen Choo @ 2022-11-09 18:31 UTC (permalink / raw)
  To: Jonathan Tan, Taylor Blau; +Cc: Jonathan Tan, git

Jonathan Tan <jonathantanmy@google.com> writes:

>> Not having thought about this a lot, the behavior I might expect is
>> something along the lines of recursively pushing throughout the
>> submodule tree, stopping the recursion as soon as we get to a nested
>> submodule which says "don't push any of my children".
>> 
>> On the other hand, I could sympathize with a compelling argument that
>> the superproject alone should be in charge of determining what gets
>> pushed.
>> 
>> Though TBH, it seems like the former is more convincing. If I depend on
>> an external repository through a submodule, and that repository itself
>> has submodules, it would be nice to configure (once) that I don't want
>> to even try and push any of that repository's children.
>
> I just tried this and the behavior is reasonable except possibly for when
> push.recurseSubmodules=only is configured in a top-level submodule. Let's see
> if other people have something to say. For me, this would also be fine since we
> can just make sure that we don't configure "only" in top-level submodules that
> have their own nested submodules.

For this patch, I think what Jonathan has proposed is preferable because
that's close to what "git fetch" [1] does, i.e. when a superproject has
parsed a setting for "recurse into submodules" (from either the CLI or
config), it passes that value to its submodules via
"--recurse-submodules", overriding the submodule's config.

In the longer term though, I think neither "git fetch" nor "git push"
(as of this patch) gets it right. I think it should be closer to:

- If the user has passed a "--recurse-submodules" flag via the CLI,
  respect that in the superproject and all submodules regardless of
  nesting. (Make sure not to conflate config values with the CLI option
  in the way "git fetch" does today.)
- Otherwise, each submodule should respect their own config setting if
  it is set.
- Otherwise, each submodule should respect their superproject's config
  setting if it is set.

[1] "git fetch" is somewhat more complicated than this, since it also
  reads values from .gitmodules and passes a "default" value, neither of
  which is relevant here I think. See this ML thread for some previous
  discussion.
  https://lore.kernel.org/git/kl6lbkwa8h5n.fsf@chooglen-macbookpro.roam.corp.google.com/

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-08  0:25 [PATCH] submodule: explicitly specify on-demand upon push Jonathan Tan
  2022-11-08  0:31 ` Taylor Blau
@ 2022-11-14 21:37 ` Jonathan Tan
  2022-11-14 21:57   ` Taylor Blau
  2022-11-15  0:59   ` Glen Choo
  1 sibling, 2 replies; 10+ messages in thread
From: Jonathan Tan @ 2022-11-14 21:37 UTC (permalink / raw)
  To: git; +Cc: Jonathan Tan, chooglen, me

Git learned pushing submodules without pushing the superproject by
the user specifying --recurse-submodules=only through 6c656c3fe4
("submodules: add RECURSE_SUBMODULES_ONLY value", 2016-12-20) and
225e8bf778 ("push: add option to push only submodules", 2016-12-20).
For users who use this feature regularly, it is desirable to have an
equivalent configuration.

It turns out that such a configuration (push.recurseSubmodules=only) is
already supported, even though it is neither documented nor mentioned
in the commit messages, due to the way the --recurse-submodules=only
feature was implemented (a function used to parse --recurse-submodules
was updated to support "only", but that same function is used to parse
push.recurseSubmodules too). What is left is to document it and test it,
which is what this commit does.

There is a possible point of confusion when recursing into a submodule
that itself has the push.recurseSubmodules=only configuration, because
if a repository has only its submodules pushed and not itself, its
superproject can never be pushed. Therefore, treat such configurations
as being "on-demand", and print a warning message.

Signed-off-by: Jonathan Tan <jonathantanmy@google.com>
---
Thanks once again, Taylor and Glen for your comments. To try to
summarize:

- For me, the most important thing is to explicitly support (with
  documentation) the push.recurseSubmodules=only configuration.
- Between the submodule being able to decide for itself whether
  recursion continues, and the superproject deciding everything, Taylor
  thinks the former is more convincing.
  - The former is also the status quo.
- Glen makes a further point that either scheme (the status quo and the
  behavior in v1) would not be the ideal behavior, but that a CLI
  argument overrides all config, and in the absence of a CLI argument,
  a submodule should be able to decide for itself whether recursion
  should continue (except when there is no config present, in which case
  it should defer to the superproject).

If we were to implement v1 and then the ideal behavior later, we
would need to revert some of the v1-introduced behavior: if there is
configuration in both the superproject and a submodule, the status
quo is that the submodule's configuration takes precedence, v1 has the
superproject's configuration taking precedence, and with ideal behavior,
the submodule's configuration takes precedence.

So it makes sense to stick to the status quo for this patch. It is
possible to support push.recurseSubmodules=only with all existing
behavior preserved (in fact, push.recurseSubmodules=only is already
supported, just undocumented) so I have decided to go with that
approach.
---
 Documentation/config/push.txt  | 14 ++--------
 Documentation/git-push.txt     |  6 +++-
 builtin/push.c                 | 12 ++++++--
 submodule.c                    |  6 ++++
 t/t5531-deep-submodule-push.sh | 50 ++++++++++++++++++++++++++++++++++
 5 files changed, 73 insertions(+), 15 deletions(-)

diff --git a/Documentation/config/push.txt b/Documentation/config/push.txt
index 7386fea225..43338b65e8 100644
--- a/Documentation/config/push.txt
+++ b/Documentation/config/push.txt
@@ -110,18 +110,8 @@ This will result in only b (a and c are cleared).
 ----
 
 push.recurseSubmodules::
-	Make sure all submodule commits used by the revisions to be pushed
-	are available on a remote-tracking branch. If the value is 'check'
-	then Git will verify that all submodule commits that changed in the
-	revisions to be pushed are available on at least one remote of the
-	submodule. If any commits are missing, the push will be aborted and
-	exit with non-zero status. If the value is 'on-demand' then all
-	submodules that changed in the revisions to be pushed will be
-	pushed. If on-demand was not able to push all necessary revisions
-	it will also be aborted and exit with non-zero status. If the value
-	is 'no' then default behavior of ignoring submodules when pushing
-	is retained. You may override this configuration at time of push by
-	specifying '--recurse-submodules=check|on-demand|no'.
+	May be "check", "on-demand", "only", or "no", with the same behavior
+	as that of "push --recurse-submodules".
 	If not set, 'no' is used by default, unless 'submodule.recurse' is
 	set (in which case a 'true' value means 'on-demand').
 
diff --git a/Documentation/git-push.txt b/Documentation/git-push.txt
index def7657ef9..5bb1d5aae2 100644
--- a/Documentation/git-push.txt
+++ b/Documentation/git-push.txt
@@ -409,10 +409,14 @@ Specifying `--no-force-if-includes` disables this behavior.
 	all submodules that changed in the revisions to be pushed will be
 	pushed. If on-demand was not able to push all necessary revisions it will
 	also be aborted and exit with non-zero status. If 'only' is used all
-	submodules will be recursively pushed while the superproject is left
+	submodules will be pushed while the superproject is left
 	unpushed. A value of 'no' or using `--no-recurse-submodules` can be used
 	to override the push.recurseSubmodules configuration variable when no
 	submodule recursion is required.
++
+When using 'on-demand' or 'only', if a submodule has a
+"push.recurseSubmodules={on-demand,only}" or "submodule.recurse" configuration,
+further recursion will occur. In this case, "only" is treated as "on-demand".
 
 --[no-]verify::
 	Toggle the pre-push hook (see linkgit:githooks[5]).  The
diff --git a/builtin/push.c b/builtin/push.c
index f0329c62a2..60ac8017e5 100644
--- a/builtin/push.c
+++ b/builtin/push.c
@@ -466,8 +466,16 @@ static int option_parse_recurse_submodules(const struct option *opt,
 
 	if (unset)
 		*recurse_submodules = RECURSE_SUBMODULES_OFF;
-	else
-		*recurse_submodules = parse_push_recurse_submodules_arg(opt->long_name, arg);
+	else {
+		if (!strcmp(arg, "only-is-on-demand")) {
+			if (*recurse_submodules == RECURSE_SUBMODULES_ONLY) {
+				warning(_("recursing into submodule with push.recurseSubmodules=only; using on-demand instead"));
+				*recurse_submodules = RECURSE_SUBMODULES_ON_DEMAND;
+			}
+		} else {
+			*recurse_submodules = parse_push_recurse_submodules_arg(opt->long_name, arg);
+		}
+	}
 
 	return 0;
 }
diff --git a/submodule.c b/submodule.c
index bf7a2c7918..5cd21252d8 100644
--- a/submodule.c
+++ b/submodule.c
@@ -1130,6 +1130,12 @@ static int push_submodule(const char *path,
 	if (for_each_remote_ref_submodule(path, has_remote, NULL) > 0) {
 		struct child_process cp = CHILD_PROCESS_INIT;
 		strvec_push(&cp.args, "push");
+		/*
+		 * When recursing into a submodule, treat any "only" configurations as "on-
+		 * demand", since "only" would not work (we need all submodules to be pushed
+		 * in order to be able to push the superproject).
+		 */
+		strvec_push(&cp.args, "--recurse-submodules=only-is-on-demand");
 		if (dry_run)
 			strvec_push(&cp.args, "--dry-run");
 
diff --git a/t/t5531-deep-submodule-push.sh b/t/t5531-deep-submodule-push.sh
index 3f58b515ce..302e4cbdba 100755
--- a/t/t5531-deep-submodule-push.sh
+++ b/t/t5531-deep-submodule-push.sh
@@ -512,6 +512,56 @@ test_expect_success 'push only unpushed submodules recursively' '
 	test_cmp expected_pub actual_pub
 '
 
+setup_subsub () {
+	git init upstream &&
+	git init upstream/sub &&
+	git init upstream/sub/deepsub &&
+	test_commit -C upstream/sub/deepsub innermost &&
+	git -C upstream/sub submodule add ./deepsub deepsub &&
+	git -C upstream/sub commit -m middle &&
+	git -C upstream submodule add ./sub sub &&
+	git -C upstream commit -m outermost &&
+
+	git -c protocol.file.allow=always clone --recurse-submodules upstream downstream &&
+	git -C downstream/sub/deepsub checkout -b downstream-branch &&
+	git -C downstream/sub checkout -b downstream-branch &&
+	git -C downstream checkout -b downstream-branch
+}
+
+new_downstream_commits () {
+	test_commit -C downstream/sub/deepsub new-innermost &&
+	git -C downstream/sub add deepsub &&
+	git -C downstream/sub commit -m new-middle &&
+	git -C downstream add sub &&
+	git -C downstream commit -m new-outermost
+}
+
+test_expect_success 'push with push.recurseSubmodules=only on superproject' '
+	test_when_finished rm -rf upstream downstream &&
+	setup_subsub &&
+	new_downstream_commits &&
+	git -C downstream config push.recurseSubmodules only &&
+	git -C downstream push origin downstream-branch &&
+
+	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
+	test_must_fail git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch
+'
+
+test_expect_success 'push with push.recurseSubmodules=only on superproject and top-level submodule' '
+	test_when_finished rm -rf upstream downstream &&
+	setup_subsub &&
+	new_downstream_commits &&
+	git -C downstream config push.recurseSubmodules only &&
+	git -C downstream/sub config push.recurseSubmodules only &&
+	git -C downstream push origin downstream-branch 2> err &&
+
+	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
+	git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch &&
+	grep "recursing into submodule with push.recurseSubmodules=only; using on-demand instead" err
+'
+
 test_expect_success 'push propagating the remotes name to a submodule' '
 	git -C work remote add origin ../pub.git &&
 	git -C work remote add pub ../pub.git &&
-- 
2.38.1.493.g58b659f92b-goog


^ permalink raw reply related	[flat|nested] 10+ messages in thread

* Re: [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-14 21:37 ` [PATCH v2] Doc: document push.recurseSubmodules=only Jonathan Tan
@ 2022-11-14 21:57   ` Taylor Blau
  2022-11-15  0:59   ` Glen Choo
  1 sibling, 0 replies; 10+ messages in thread
From: Taylor Blau @ 2022-11-14 21:57 UTC (permalink / raw)
  To: Jonathan Tan; +Cc: git, chooglen, me

On Mon, Nov 14, 2022 at 01:37:12PM -0800, Jonathan Tan wrote:
>  Documentation/config/push.txt  | 14 ++--------
>  Documentation/git-push.txt     |  6 +++-
>  builtin/push.c                 | 12 ++++++--
>  submodule.c                    |  6 ++++
>  t/t5531-deep-submodule-push.sh | 50 ++++++++++++++++++++++++++++++++++
>  5 files changed, 73 insertions(+), 15 deletions(-)

Seems reasonable to me. Glen?

Thanks,
Taylor

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-14 21:37 ` [PATCH v2] Doc: document push.recurseSubmodules=only Jonathan Tan
  2022-11-14 21:57   ` Taylor Blau
@ 2022-11-15  0:59   ` Glen Choo
  2022-11-15 18:47     ` Jonathan Tan
  1 sibling, 1 reply; 10+ messages in thread
From: Glen Choo @ 2022-11-15  0:59 UTC (permalink / raw)
  To: Jonathan Tan, git; +Cc: Jonathan Tan, me

Jonathan Tan <jonathantanmy@google.com> writes:

> - For me, the most important thing is to explicitly support (with
>   documentation) the push.recurseSubmodules=only configuration.
> - Between the submodule being able to decide for itself whether
>   recursion continues, and the superproject deciding everything, Taylor
>   thinks the former is more convincing.
>   - The former is also the status quo.
> - Glen makes a further point that either scheme (the status quo and the
>   behavior in v1) would not be the ideal behavior, but that a CLI
>   argument overrides all config, and in the absence of a CLI argument,
>   a submodule should be able to decide for itself whether recursion
>   should continue (except when there is no config present, in which case
>   it should defer to the superproject).

Ok. I find the status quo argument convincing, so I'm okay with the
approach in this patch.

That said.. I find the status quo very surprising and unergonomic. As
demonstrated in a test below, if a submodule has push.recurseSubmodules
unset, recursion doesn't occur. At the very least, I would have expected
the submodule to respect the superproject's config since that's still a
statement of intent.

I expect that we will have to change this behavior at some point, and
hopefully my "ideal" behavior makes sense to users.

> diff --git a/Documentation/config/push.txt b/Documentation/config/push.txt
> index 7386fea225..43338b65e8 100644
> --- a/Documentation/config/push.txt
> +++ b/Documentation/config/push.txt
> @@ -110,18 +110,8 @@ This will result in only b (a and c are cleared).
>  ----
>  
>  push.recurseSubmodules::
> -	Make sure all submodule commits used by the revisions to be pushed
> -	are available on a remote-tracking branch. If the value is 'check'
> -	then Git will verify that all submodule commits that changed in the
> -	revisions to be pushed are available on at least one remote of the
> -	submodule. If any commits are missing, the push will be aborted and
> -	exit with non-zero status. If the value is 'on-demand' then all
> -	submodules that changed in the revisions to be pushed will be
> -	pushed. If on-demand was not able to push all necessary revisions
> -	it will also be aborted and exit with non-zero status. If the value
> -	is 'no' then default behavior of ignoring submodules when pushing
> -	is retained. You may override this configuration at time of push by
> -	specifying '--recurse-submodules=check|on-demand|no'.
> +	May be "check", "on-demand", "only", or "no", with the same behavior
> +	as that of "push --recurse-submodules".
>  	If not set, 'no' is used by default, unless 'submodule.recurse' is
>  	set (in which case a 'true' value means 'on-demand').
>  
> diff --git a/Documentation/git-push.txt b/Documentation/git-push.txt
> index def7657ef9..5bb1d5aae2 100644
> --- a/Documentation/git-push.txt
> +++ b/Documentation/git-push.txt
> @@ -409,10 +409,14 @@ Specifying `--no-force-if-includes` disables this behavior.
>  	all submodules that changed in the revisions to be pushed will be
>  	pushed. If on-demand was not able to push all necessary revisions it will
>  	also be aborted and exit with non-zero status. If 'only' is used all
> -	submodules will be recursively pushed while the superproject is left
> +	submodules will be pushed while the superproject is left
>  	unpushed. A value of 'no' or using `--no-recurse-submodules` can be used
>  	to override the push.recurseSubmodules configuration variable when no
>  	submodule recursion is required.
> ++
> +When using 'on-demand' or 'only', if a submodule has a
> +"push.recurseSubmodules={on-demand,only}" or "submodule.recurse" configuration,
> +further recursion will occur. In this case, "only" is treated as "on-demand".
>  
>  --[no-]verify::
>  	Toggle the pre-push hook (see linkgit:githooks[5]).  The

Everything up to here looks good.

> diff --git a/builtin/push.c b/builtin/push.c
> index f0329c62a2..60ac8017e5 100644
> --- a/builtin/push.c
> +++ b/builtin/push.c
> @@ -466,8 +466,16 @@ static int option_parse_recurse_submodules(const struct option *opt,
>  
>  	if (unset)
>  		*recurse_submodules = RECURSE_SUBMODULES_OFF;
> -	else
> -		*recurse_submodules = parse_push_recurse_submodules_arg(opt->long_name, arg);
> +	else {
> +		if (!strcmp(arg, "only-is-on-demand")) {
> +			if (*recurse_submodules == RECURSE_SUBMODULES_ONLY) {
> +				warning(_("recursing into submodule with push.recurseSubmodules=only; using on-demand instead"));
> +				*recurse_submodules = RECURSE_SUBMODULES_ON_DEMAND;
> +			}
> +		} else {
> +			*recurse_submodules = parse_push_recurse_submodules_arg(opt->long_name, arg);
> +		}
> +	}
>  
>  	return 0;
>  }
> diff --git a/submodule.c b/submodule.c
> index bf7a2c7918..5cd21252d8 100644
> --- a/submodule.c
> +++ b/submodule.c
> @@ -1130,6 +1130,12 @@ static int push_submodule(const char *path,
>  	if (for_each_remote_ref_submodule(path, has_remote, NULL) > 0) {
>  		struct child_process cp = CHILD_PROCESS_INIT;
>  		strvec_push(&cp.args, "push");
> +		/*
> +		 * When recursing into a submodule, treat any "only" configurations as "on-
> +		 * demand", since "only" would not work (we need all submodules to be pushed
> +		 * in order to be able to push the superproject).
> +		 */
> +		strvec_push(&cp.args, "--recurse-submodules=only-is-on-demand");
>  		if (dry_run)
>  			strvec_push(&cp.args, "--dry-run");

When we pass this magic, undocumented value, "git push" will warn about
about "only" and override it with "on-demand". We always pass it when we
recurse into submodules, and we assume that no user will pass it, thus
we get the warning iff we are recursing into submodules.

In that case, it sounds like "--recurse-submodules=only-is-on-demand" is
a synonym for "this is a submodule that is being recursed into". In that
case, wouldn't it be more self-documenting to have a hidden CLI flag
that expresses exactly that ? e.g. we could add a PARSE_OPT_HIDDEN flag
called "--is-recursing" and check that boolean value. This seems clearer
to me at least.

>  
> diff --git a/t/t5531-deep-submodule-push.sh b/t/t5531-deep-submodule-push.sh
> index 3f58b515ce..302e4cbdba 100755
> --- a/t/t5531-deep-submodule-push.sh
> +++ b/t/t5531-deep-submodule-push.sh
> @@ -512,6 +512,56 @@ test_expect_success 'push only unpushed submodules recursively' '
>  	test_cmp expected_pub actual_pub
>  '
>  
> +setup_subsub () {
> +	git init upstream &&
> +	git init upstream/sub &&
> +	git init upstream/sub/deepsub &&
> +	test_commit -C upstream/sub/deepsub innermost &&
> +	git -C upstream/sub submodule add ./deepsub deepsub &&
> +	git -C upstream/sub commit -m middle &&
> +	git -C upstream submodule add ./sub sub &&
> +	git -C upstream commit -m outermost &&
> +
> +	git -c protocol.file.allow=always clone --recurse-submodules upstream downstream &&
> +	git -C downstream/sub/deepsub checkout -b downstream-branch &&
> +	git -C downstream/sub checkout -b downstream-branch &&
> +	git -C downstream checkout -b downstream-branch
> +}
> +
> +new_downstream_commits () {
> +	test_commit -C downstream/sub/deepsub new-innermost &&
> +	git -C downstream/sub add deepsub &&
> +	git -C downstream/sub commit -m new-middle &&
> +	git -C downstream add sub &&
> +	git -C downstream commit -m new-outermost
> +}
> +
> +test_expect_success 'push with push.recurseSubmodules=only on superproject' '
> +	test_when_finished rm -rf upstream downstream &&
> +	setup_subsub &&
> +	new_downstream_commits &&
> +	git -C downstream config push.recurseSubmodules only &&
> +	git -C downstream push origin downstream-branch &&
> +
> +	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
> +	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
> +	test_must_fail git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch
> +'

This test demonstrates the behavior I find surprising.

> +
> +test_expect_success 'push with push.recurseSubmodules=only on superproject and top-level submodule' '
> +	test_when_finished rm -rf upstream downstream &&
> +	setup_subsub &&
> +	new_downstream_commits &&
> +	git -C downstream config push.recurseSubmodules only &&
> +	git -C downstream/sub config push.recurseSubmodules only &&
> +	git -C downstream push origin downstream-branch 2> err &&
> +
> +	test_must_fail git -C upstream rev-parse refs/heads/downstream-branch &&
> +	git -C upstream/sub rev-parse refs/heads/downstream-branch &&
> +	git -C upstream/sub/deepsub rev-parse refs/heads/downstream-branch &&
> +	grep "recursing into submodule with push.recurseSubmodules=only; using on-demand instead" err
> +'
> +
>  test_expect_success 'push propagating the remotes name to a submodule' '
>  	git -C work remote add origin ../pub.git &&
>  	git -C work remote add pub ../pub.git &&

The tests look good, and are quite readable.

> -- 
> 2.38.1.493.g58b659f92b-goog

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-15  0:59   ` Glen Choo
@ 2022-11-15 18:47     ` Jonathan Tan
  2022-11-16 21:21       ` Glen Choo
  0 siblings, 1 reply; 10+ messages in thread
From: Jonathan Tan @ 2022-11-15 18:47 UTC (permalink / raw)
  To: Glen Choo; +Cc: Jonathan Tan, git, me

First of all, thanks for taking a look at this.

Glen Choo <chooglen@google.com> writes:
> That said.. I find the status quo very surprising and unergonomic. As
> demonstrated in a test below, if a submodule has push.recurseSubmodules
> unset, recursion doesn't occur. At the very least, I would have expected
> the submodule to respect the superproject's config since that's still a
> statement of intent.
> 
> I expect that we will have to change this behavior at some point, and
> hopefully my "ideal" behavior makes sense to users.

Well, your "ideal" behavior makes sense to me :-)

> When we pass this magic, undocumented value, "git push" will warn about
> about "only" and override it with "on-demand". We always pass it when we
> recurse into submodules, and we assume that no user will pass it, thus
> we get the warning iff we are recursing into submodules.
> 
> In that case, it sounds like "--recurse-submodules=only-is-on-demand" is
> a synonym for "this is a submodule that is being recursed into". In that
> case, wouldn't it be more self-documenting to have a hidden CLI flag
> that expresses exactly that ? e.g. we could add a PARSE_OPT_HIDDEN flag
> called "--is-recursing" and check that boolean value. This seems clearer
> to me at least.

Hmm...--recurse-submodules=only-is-on-demand is hidden too, right? One
advantage of doing this instead of a separate arg is that neither the
caller nor "git push" needs to think about what happens if both --recurse-
submodules=something and --is-recursing are both specified.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-15 18:47     ` Jonathan Tan
@ 2022-11-16 21:21       ` Glen Choo
  2022-11-16 22:35         ` Taylor Blau
  0 siblings, 1 reply; 10+ messages in thread
From: Glen Choo @ 2022-11-16 21:21 UTC (permalink / raw)
  To: Jonathan Tan; +Cc: Jonathan Tan, git, me

Jonathan Tan <jonathantanmy@google.com> writes:

>> When we pass this magic, undocumented value, "git push" will warn about
>> about "only" and override it with "on-demand". We always pass it when we
>> recurse into submodules, and we assume that no user will pass it, thus
>> we get the warning iff we are recursing into submodules.
>> 
>> In that case, it sounds like "--recurse-submodules=only-is-on-demand" is
>> a synonym for "this is a submodule that is being recursed into". In that
>> case, wouldn't it be more self-documenting to have a hidden CLI flag
>> that expresses exactly that ? e.g. we could add a PARSE_OPT_HIDDEN flag
>> called "--is-recursing" and check that boolean value. This seems clearer
>> to me at least.
>
> Hmm...--recurse-submodules=only-is-on-demand is hidden too, right?

I suppose so, but as a matter of personal taste, when encountering a
hidden CLI option, I'd prefer to see the option's documentation when I
grep:

  OPT_HIDDEN_BOOL(0, "is-recursing", &is_recursing, "internal only,
    override recurseSubmodules = only")

than:

  if (!strcmp(arg, "only-is-on-demand")) {
    /* etc */
  }

>                                                                    One
> advantage of doing this instead of a separate arg is that neither the
> caller nor "git push" needs to think about what happens if both --recurse-
> submodules=something and --is-recursing are both specified.

That's true, and I suppose it prevents the temptation to reuse
--is-recursing for things it wasn't meant for. But in that case,
couldn't we say the same thing if we renamed "--is-recursing" to
"--recurse-only-is-on-demand"?

At any rate, do treat this as just a suggestion. I won't block this if
both you and Taylor find this clear enough :)

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v2] Doc: document push.recurseSubmodules=only
  2022-11-16 21:21       ` Glen Choo
@ 2022-11-16 22:35         ` Taylor Blau
  0 siblings, 0 replies; 10+ messages in thread
From: Taylor Blau @ 2022-11-16 22:35 UTC (permalink / raw)
  To: Glen Choo; +Cc: Jonathan Tan, git, me

On Wed, Nov 16, 2022 at 01:21:03PM -0800, Glen Choo wrote:
> At any rate, do treat this as just a suggestion. I won't block this if
> both you and Taylor find this clear enough :)

I admit that I do not care too much either way ;-).

Thanks,
Taylor

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2022-11-16 22:35 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-11-08  0:25 [PATCH] submodule: explicitly specify on-demand upon push Jonathan Tan
2022-11-08  0:31 ` Taylor Blau
2022-11-08 21:43   ` Jonathan Tan
2022-11-09 18:31     ` Glen Choo
2022-11-14 21:37 ` [PATCH v2] Doc: document push.recurseSubmodules=only Jonathan Tan
2022-11-14 21:57   ` Taylor Blau
2022-11-15  0:59   ` Glen Choo
2022-11-15 18:47     ` Jonathan Tan
2022-11-16 21:21       ` Glen Choo
2022-11-16 22:35         ` Taylor Blau

Code repositories for project(s) associated with this public inbox

	https://80x24.org/mirrors/git.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).