From: Junio C Hamano <gitster@pobox.com>
To: Jonathan Tan <jonathantanmy@google.com>
Cc: git@vger.kernel.org
Subject: Re: [PATCH] connected: always use partial clone optimization
Date: Fri, 20 Mar 2020 15:54:41 -0700
Message-ID: <xmqq1rpmbpse.fsf@gitster.c.googlers.com>
In-Reply-To: <20200320220045.258462-1-jonathantanmy@google.com> (Jonathan Tan's message of "Fri, 20 Mar 2020 15:00:45 -0700")
Jonathan Tan <jonathantanmy@google.com> writes:
> The addition of the fast path might cause performance reductions in
> these cases:
>
> - If a partial clone or a fetch into a partial clone fails, Git will
> fruitlessly run rev-list (it is expected that everything fetched
> would go into promisor packs, so if that didn't happen, it is most
> likely that rev-list will fail too).
I agree that it is reasonable not to optimize the system for the
failure case---it is pointless to fail as quickly as possible ;-)
> - Any connectivity checks done by receive-pack, in the (in my opinion,
> unlikely) event that a partial clone serves receive-pack.
Meaning "I created this repository by partially cloning, and now
I want to update it by pushing into it"? I am not sure how rare
such a use case is, so I won't be a good judge on this one.
> diff --git a/builtin/clone.c b/builtin/clone.c
> index 1ad26f4d8c..4b2b14ff61 100644
> --- a/builtin/clone.c
> +++ b/builtin/clone.c
> @@ -672,8 +672,7 @@ static void update_remote_refs(const struct ref *refs,
> const char *branch_top,
> const char *msg,
> struct transport *transport,
> - int check_connectivity,
> - int check_refs_are_promisor_objects_only)
> + int check_connectivity)
> {
> const struct ref *rm = mapped_refs;
>
> @@ -682,8 +681,6 @@ static void update_remote_refs(const struct ref *refs,
>
> opt.transport = transport;
> opt.progress = transport->progress;
> - opt.check_refs_are_promisor_objects_only =
> - !!check_refs_are_promisor_objects_only;
>
> if (check_connected(iterate_ref_map, &rm, &opt))
> die(_("remote did not send all necessary objects"));
> @@ -1275,7 +1272,7 @@ int cmd_clone(int argc, const char **argv, const char *prefix)
>
> update_remote_refs(refs, mapped_refs, remote_head_points_at,
> branch_top.buf, reflog_msg.buf, transport,
> - !is_local, filter_options.choice);
> + !is_local);
>
> update_head(our_head_points_at, remote_head, reflog_msg.buf);
>
> diff --git a/builtin/fetch.c b/builtin/fetch.c
> index bf6bab80fa..1097e1e512 100644
> --- a/builtin/fetch.c
> +++ b/builtin/fetch.c
> @@ -908,13 +908,6 @@ static int store_updated_refs(const char *raw_url, const char *remote_name,
> if (!connectivity_checked) {
> struct check_connected_options opt = CHECK_CONNECTED_INIT;
>
> - if (filter_options.choice)
> - /*
> - * Since a filter is specified, objects indirectly
> - * referenced by refs are allowed to be absent.
> - */
> - opt.check_refs_are_promisor_objects_only = 1;
> -
> rm = ref_map;
> if (check_connected(iterate_ref_map, &rm, &opt)) {
> rc = error(_("%s did not send all necessary objects\n"), url);
> diff --git a/connected.c b/connected.c
> index 7e9bd1bc62..846f2e4eef 100644
> --- a/connected.c
> +++ b/connected.c
> @@ -52,7 +52,7 @@ int check_connected(oid_iterate_fn fn, void *cb_data,
> strbuf_release(&idx_file);
> }
>
> - if (opt->check_refs_are_promisor_objects_only) {
> + if (has_promisor_remote()) {
Earlier we would have this bit on only when filter_options.choice
was set, but this allows us to take the branch as long as the
repository is lazily populated. Makes sense.
> @@ -71,13 +71,18 @@ int check_connected(oid_iterate_fn fn, void *cb_data,
> if (find_pack_entry_one(oid.hash, p))
> goto promisor_pack_found;
> }
> - return 1;
> + /*
> + * Fallback to rev-list with oid and the rest of the
> + * object IDs provided by fn.
> + */
> + goto no_promisor_pack_found;
OK. This is the "fallback" thing you mentioned in the log message.
Makes sense.
> promisor_pack_found:
> ;
> } while (!fn(cb_data, &oid));
> return 0;
> }
>
> +no_promisor_pack_found:
> if (opt->shallow_file) {
> argv_array_push(&rev_list.args, "--shallow-file");
> argv_array_push(&rev_list.args, opt->shallow_file);
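
To illustrate the control flow after this change, here is a minimal,
self-contained sketch. It is not the code in connected.c: toy_oid,
struct toy_repo, toy_slow_check() and toy_check_connected() are
hypothetical stand-ins, the promisor-pack lookup is modeled as a
string search, and the rev-list invocation is modeled as a plain
function. It only shows the shape of "try the fast path, and on the
first miss hand that object ID and the rest to the slow check".

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for an object ID (real Git uses struct object_id). */
typedef const char *toy_oid;

/* Hypothetical model of a repository, for illustration only. */
struct toy_repo {
	const toy_oid *promisor_objects;  /* objects found in promisor packs */
	size_t nr_promisor;
	const toy_oid *connected_objects; /* objects the slow walk would accept */
	size_t nr_connected;
	int slow_checks_run;              /* how often the slow path was taken */
};

static int contains(const toy_oid *set, size_t nr, toy_oid oid)
{
	for (size_t i = 0; i < nr; i++)
		if (!strcmp(set[i], oid))
			return 1;
	return 0;
}

/* Stands in for running rev-list; returns 0 when everything is connected. */
static int toy_slow_check(struct toy_repo *r, const toy_oid *oids, size_t nr)
{
	r->slow_checks_run++;
	for (size_t i = 0; i < nr; i++)
		if (!contains(r->connected_objects, r->nr_connected, oids[i]))
			return 1;
	return 0;
}

/* Returns 0 when all oids pass the check, nonzero otherwise. */
static int toy_check_connected(struct toy_repo *r, int has_promisor_remote,
			       const toy_oid *oids, size_t nr)
{
	size_t i = 0;

	assert(oids || !nr);

	if (has_promisor_remote) {
		/* Fast path: every tip must be found in a promisor pack. */
		for (; i < nr; i++)
			if (!contains(r->promisor_objects, r->nr_promisor,
				      oids[i]))
				break;
		if (i == nr)
			return 0;
		/* Otherwise fall back with oids[i] and the rest. */
	}
	return toy_slow_check(r, oids + i, nr - i);
}
```

In this model, a list of tips that all live in promisor packs never
reaches toy_slow_check(), while a single miss costs one slow check
over the remaining tips instead of an immediate failure.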