From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS3215 2.6.0.0/16 X-Spam-Status: No, score=-3.4 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from out1.vger.email (out1.vger.email [IPv6:2620:137:e000::1:20]) by dcvr.yhbt.net (Postfix) with ESMTP id 7CEE71F66E for ; Wed, 4 May 2022 15:27:04 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1352787AbiEDP3g (ORCPT ); Wed, 4 May 2022 11:29:36 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53760 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1352572AbiEDP3H (ORCPT ); Wed, 4 May 2022 11:29:07 -0400 Received: from mail-wr1-x432.google.com (mail-wr1-x432.google.com [IPv6:2a00:1450:4864:20::432]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 27CAE62E1 for ; Wed, 4 May 2022 08:25:23 -0700 (PDT) Received: by mail-wr1-x432.google.com with SMTP id q23so2530446wra.1 for ; Wed, 04 May 2022 08:25:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=message-id:in-reply-to:references:from:date:subject:fcc :content-transfer-encoding:mime-version:to:cc; bh=oOYpatQC8nQgMBUC9EelipVmm8KOOwdxVR9fKpNgu4E=; b=AFwdZZnPWY094IUl0xoQYkAJfKL+UhUASquQaPCtR8TD1OTncPTanRd2VoES3hqsrn ARVsMmzJfbOLwxgww/zFp/mnpnmbtb2kJnGxy+whh1St1J/Fl19+2PwfLdowVCzj3wwE bbxEVeYsxWQY1BPFSSvjOBEwJVmkjTNFMOMKHWMw9x/pKlKIUh8J8OrDkVp5h0nnujFv UENGeZYIOu1ZUOk6AqnrGorRGD36eb3eIVM6pjuaLCP32DrYltvAvJW+v0Eu1Wph1egl 6AhlYGaG4QO1K+pVb6V4e/ZR8VEh2eWb02yEXugJQUYprjFbR1lP+1WTCsZgLDv4lQEg uHJQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:in-reply-to:references:from:date :subject:fcc:content-transfer-encoding:mime-version:to:cc; bh=oOYpatQC8nQgMBUC9EelipVmm8KOOwdxVR9fKpNgu4E=; b=2FBhqeL7abi3echuxI0/Ev3BuwjvUj4ms3XnSsAk6l/tY/BYW9eH1Xr7y+fMnUHvUU VJm5dh8LRuLv8223YuZQ6UGNBNFbQ0lY6I4JhQPmI2Lu8KnvoDKVdfH2zBomsyJ1hE0f 2KtWtMPYVe7c2lpNzbonzRUwCoxO4VXFwAkl5swtOU/C+jEeppCaT9tBmdUH9fWCopEr gv8FCoc85ZqNtONkwriniMKxP17Hriqk42ZvLWbCTKbOHI7gpcybisoWUlJLevYSpbQn YOCENkASObN3MvStxD/sjF1HLkM+pQWkDGbtuqgrSLhS95OVUDQEUMFm8z7NEIbltu9R HAhg== X-Gm-Message-State: AOAM533Vt7poW34cZvKATzt5xZP+4aVHF9rhHDHXV8S5nN6seTcJqWo9 RjfB9qIIg4Saz6qbGl6BHeRkBDi/9AQ= X-Google-Smtp-Source: ABdhPJyH9KaSawI65VaUHWBsqj5vkpWgs7sTVpt4NOgXtqckvS6VkxpzOie7EH3xrHhPZWirsFsKeQ== X-Received: by 2002:a5d:53cb:0:b0:20a:d769:e14e with SMTP id a11-20020a5d53cb000000b0020ad769e14emr17314990wrw.603.1651677921111; Wed, 04 May 2022 08:25:21 -0700 (PDT) Received: from [127.0.0.1] ([13.74.141.28]) by smtp.gmail.com with ESMTPSA id c10-20020adfa70a000000b0020c5253d91esm11802307wrd.106.2022.05.04.08.25.19 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 04 May 2022 08:25:20 -0700 (PDT) Message-Id: In-Reply-To: References: From: "Johannes Schindelin via GitGitGadget" Date: Wed, 04 May 2022 15:25:12 +0000 Subject: [PATCH v3 0/7] scalar: implement the subcommand "diagnose" Fcc: Sent Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit MIME-Version: 1.0 To: git@vger.kernel.org Cc: =?UTF-8?Q?Ren=C3=A9?= Scharfe , Taylor Blau , Derrick Stolee , Elijah Newren , Johannes Schindelin Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org Over the course of the years, we developed a sub-command that gathers diagnostic data into a .zip file that can then be attached to bug reports. This sub-command turned out to be very useful in helping Scalar developers identify and fix issues. Changes since v2: * Clarified in the commit message what the biggest benefit of --add-file-with-content is. * The part of the -add-file-with-content argument can now contain colons. To do this, the path needs to start and end in double-quote characters (which are stripped), and the backslash serves as escape character in that case (to allow the path to contain both colons and double-quotes). * Fixed incorrect grammar. * Instead of strcmp(), we now say !strcmp(). * The help text for --add-file-with-content was improved a tiny bit. * Adjusted the commit message that still talked about spawning plenty of processes and about a throw-away repository for the sake of generating a .zip file. * Simplified the code that shows the diagnostics and adds them to the .zip file. * The final message that reports that the archive is complete is now printed to stderr instead of stdout. Changes since v1: * Instead of creating a throw-away repository, staging the contents of the .zip file and then using git write-tree and git archive to write the .zip file, the patch series now introduces a new option to git archive and uses write_archive() directly (avoiding any separate process). * Since the command avoids separate processes, it is now blazing fast on Windows, and I dropped the spinner() function because it's no longer needed. * While reworking the test case, I noticed that scalar [...] failed to verify that the specified directory exists, and would happily "traverse to its parent directory" on its quest to find a Scalar enlistment. That is of course incorrect, and has been fixed as a "while at it" sort of preparatory commit. * I had forgotten to sign off on all the commits, which has been fixed. * Instead of some "home-grown" readdir()-based function, the code now uses for_each_file_in_pack_dir() to look through the pack directories. * If any alternates are configured, their pack directories are now included in the output. * The commit message that might be interpreted to promise information about large loose files has been corrected to no longer promise that. * The test cases have been adjusted to test a little bit more (e.g. verifying that specific paths are mentioned in the output, instead of merely verifying that the output is non-empty). Johannes Schindelin (5): archive: optionally add "virtual" files archive --add-file-with-contents: allow paths containing colons scalar: validate the optional enlistment argument Implement `scalar diagnose` scalar diagnose: include disk space information Matthew John Cheetham (2): scalar: teach `diagnose` to gather packfile info scalar: teach `diagnose` to gather loose objects information Documentation/git-archive.txt | 16 ++ archive.c | 75 +++++++- contrib/scalar/scalar.c | 289 ++++++++++++++++++++++++++++++- contrib/scalar/scalar.txt | 12 ++ contrib/scalar/t/t9099-scalar.sh | 27 +++ t/t5003-archive-zip.sh | 20 +++ 6 files changed, 429 insertions(+), 10 deletions(-) base-commit: ddc35d833dd6f9e8946b09cecd3311b8aa18d295 Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-1128%2Fdscho%2Fscalar-diagnose-v3 Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-1128/dscho/scalar-diagnose-v3 Pull-Request: https://github.com/gitgitgadget/git/pull/1128 Range-diff vs v2: 1: 49ff3c1f2b3 ! 1: 45662cf582a archive: optionally add "virtual" files @@ Commit message archive` now supports use cases where relatively trivial files need to be added that do not exist on disk. + This will allow us to generate `.zip` files with generated content, + without having to add said content to the object database and without + having to write it out to disk. + Signed-off-by: Johannes Schindelin ## Documentation/git-archive.txt ## @@ Documentation/git-archive.txt: OPTIONS + basename of . ++ +The `` cannot contain any colon, the file mode is limited to -+a regular file, and the option may be subject platform-dependent ++a regular file, and the option may be subject to platform-dependent +command-line limits. For non-trivial cases, write an untracked file +and use `--add-file` instead. + @@ archive.c: static int add_file_cb(const struct option *opt, const char *arg, int - if (!S_ISREG(info->stat.st_mode)) - die(_("Not a regular file: %s"), path); + -+ if (strcmp(opt->long_name, "add-file-with-content")) { ++ if (!strcmp(opt->long_name, "add-file")) { + path = prefix_filename(args->prefix, arg); + if (stat(path, &info->stat)) + die(_("File not found: %s"), path); @@ archive.c: static int parse_archive_args(int argc, const char **argv, N_("add untracked file to archive"), 0, add_file_cb, (intptr_t)&base }, + { OPTION_CALLBACK, 0, "add-file-with-content", args, -+ N_("file"), N_("add untracked file to archive"), 0, ++ N_("path:content"), N_("add untracked file to archive"), 0, + add_file_cb, (intptr_t)&base }, OPT_STRING('o', "output", &output, N_("file"), N_("write the archive to this file")), -: ----------- > 2: ce4b1b680c9 archive --add-file-with-contents: allow paths containing colons 2: 600da8d465e = 3: 5a3eeb55409 scalar: validate the optional enlistment argument 3: 0d570137bb6 ! 4: dfe821d10fe Implement `scalar diagnose` @@ Commit message we had the luxury of a comprehensive standard library that includes basic functionality such as writing a `.zip` file. In the C version, we lack such a commodity. Rather than introducing a dependency on, say, - libzip, we slightly abuse Git's `archive` command: Instead of writing - the `.zip` file directly, we stage the file contents in a Git index of a - temporary, bare repository, only to let `git archive` have at it, and - finally removing the temporary repository. - - Also note: Due to the frequently-spawned `git hash-object` processes, - this command is quite a bit slow on Windows. Should it turn out to be a - big problem, the lack of a batch mode of the `hash-object` command could - potentially be worked around via using `git fast-import` with a crafted - `stdin`. + libzip, we slightly abuse Git's `archive` machinery: we write out a + `.zip` of the empty try, augmented by a couple files that are added via + the `--add-file*` options. We are careful trying not to modify the + current repository in any way lest the very circumstances that required + `scalar diagnose` to be run are changed by the `diagnose` run itself. Signed-off-by: Johannes Schindelin @@ contrib/scalar/scalar.c: cleanup: + time_t now = time(NULL); + struct tm tm; + struct strbuf path = STRBUF_INIT, buf = STRBUF_INIT; -+ size_t off; + int res = 0; + + argc = parse_options(argc, argv, NULL, options, @@ contrib/scalar/scalar.c: cleanup: + strvec_pushl(&archiver_args, "scalar-diagnose", "--format=zip", NULL); + + strbuf_reset(&buf); -+ strbuf_addstr(&buf, -+ "--add-file-with-content=diagnostics.log:" -+ "Collecting diagnostic info\n\n"); ++ strbuf_addstr(&buf, "Collecting diagnostic info\n\n"); + get_version_info(&buf, 1); + + strbuf_addf(&buf, "Enlistment root: %s\n", the_repository->worktree); -+ off = strchr(buf.buf, ':') + 1 - buf.buf; -+ write_or_die(stdout_fd, buf.buf + off, buf.len - off); -+ strvec_push(&archiver_args, buf.buf); ++ write_or_die(stdout_fd, buf.buf, buf.len); ++ strvec_pushf(&archiver_args, ++ "--add-file-with-content=diagnostics.log:%.*s", ++ (int)buf.len, buf.buf); + + if ((res = add_directory_to_archiver(&archiver_args, ".git", 0)) || + (res = add_directory_to_archiver(&archiver_args, ".git/hooks", 0)) || @@ contrib/scalar/scalar.c: cleanup: + } + + if (!res) -+ printf("\n" ++ fprintf(stderr, "\n" + "Diagnostics complete.\n" + "All of the gathered info is captured in '%s'\n", + zip_path.buf); 4: 938e38b5a09 ! 5: bb162abd383 scalar diagnose: include disk space information @@ contrib/scalar/scalar.c: static int cmd_diagnose(int argc, const char **argv) strbuf_addf(&buf, "Enlistment root: %s\n", the_repository->worktree); + get_disk_info(&buf); - off = strchr(buf.buf, ':') + 1 - buf.buf; - write_or_die(stdout_fd, buf.buf + off, buf.len - off); - strvec_push(&archiver_args, buf.buf); + write_or_die(stdout_fd, buf.buf, buf.len); + strvec_pushf(&archiver_args, + "--add-file-with-content=diagnostics.log:%.*s", ## contrib/scalar/t/t9099-scalar.sh ## @@ contrib/scalar/t/t9099-scalar.sh: SQ="'" 5: bd9428919fa ! 6: 32aaad7cce1 scalar: teach `diagnose` to gather packfile info @@ contrib/scalar/scalar.c: cleanup: { struct option options[] = { @@ contrib/scalar/scalar.c: static int cmd_diagnose(int argc, const char **argv) - write_or_die(stdout_fd, buf.buf + off, buf.len - off); - strvec_push(&archiver_args, buf.buf); + "--add-file-with-content=diagnostics.log:%.*s", + (int)buf.len, buf.buf); + strbuf_reset(&buf); + strbuf_addstr(&buf, "--add-file-with-content=packs-local.txt:"); 6: 7a8875be425 = 7: 322932f0bb8 scalar: teach `diagnose` to gather loose objects information -- gitgitgadget