From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS53758 23.128.96.0/24 X-Spam-Status: No, score=-4.1 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RCVD_IN_DNSWL_LOW, SPF_HELO_PASS,SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by dcvr.yhbt.net (Postfix) with ESMTP id 921841F8C6 for ; Wed, 23 Jun 2021 20:04:26 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229818AbhFWUGm (ORCPT ); Wed, 23 Jun 2021 16:06:42 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50514 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229660AbhFWUGl (ORCPT ); Wed, 23 Jun 2021 16:06:41 -0400 Received: from mail-ed1-x52f.google.com (mail-ed1-x52f.google.com [IPv6:2a00:1450:4864:20::52f]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6A965C061574 for ; Wed, 23 Jun 2021 13:04:22 -0700 (PDT) Received: by mail-ed1-x52f.google.com with SMTP id q14so5182239eds.5 for ; Wed, 23 Jun 2021 13:04:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:references:user-agent:in-reply-to :message-id:mime-version:content-transfer-encoding; bh=FKF5YzS0NZSMNoN1JnrZqwj1ZPydsGDLkOWMj+8zWSc=; b=ocbmQD3nInjznZZrKSgv8nwrrxepZfq0JR0tqXyqO+EnIj6Q6vSAEvYWZ4SqtG8JTw oUZhGOqfEzUo5kp/uHzIGPmc0I/hEywZxoC693f/RPR9dxtJmdL6nV0zrlmYmH7MSEJs 3oDSCg3t9dfax8lrRvlWIdI6KJKli2zHQoPaCpOhxtduEQ3M01OzSeGsS4PKxysC1fwW floocfedqu5LZ8glLwy24QsKlSVsvG3QwyAeyfyFWDRsssU6fYDYQQlJYy96kgCCOxBG 12FlTCq07b6+2r5ATd2bf/pYc0fQdKJFOqmf9JT3aOlJ/ixnuxOGBrTvbZCrQaVan6tw YvGg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:references:user-agent :in-reply-to:message-id:mime-version:content-transfer-encoding; bh=FKF5YzS0NZSMNoN1JnrZqwj1ZPydsGDLkOWMj+8zWSc=; b=T+ZTAS+UytFxKDBE/szt3/cmK73OOJg4QT4iRm8UXjx5koK+RjeuLik2Peqe0OmVfz VcctEie9gtftj7aSMbnQPx+ho/YC9EaqhMrsydVJ59XMkiHADgLyLs7abEAKD52CiMv9 cUJXXIWhV/3TpoVMJynLM8n5rITjkzoI0J1EZbBsMDThesGm3K4uM8maSlA5sNsg/WxH RvCx0bwywTxeVBJRAuWQtjgBzPk862mL2XwBRozXy9FyF2DJlWpWXT/PHS+eFhKBkg4Y MmSTNjNJJ4MYqTp92JEU0YJgDNxBh/KNQBcRrQc9YZMkp+qDP20dvlxgX8wm9gN2v2U5 JSYA== X-Gm-Message-State: AOAM532xj7eyT/t/qD85DekmGA5XAy8pe2SYoTw2cds1Yj/jhSfItvl5 J/+EbbzsqXzaM4mrBxQsYy0= X-Google-Smtp-Source: ABdhPJz9Q5RCGU+ouQ4kx0hI8LIaBDkw/oumPAnKHf+tsQZ6hbsI624Nx39SDwkDWiYuwjf22nliTA== X-Received: by 2002:a05:6402:886:: with SMTP id e6mr2036801edy.33.1624478660846; Wed, 23 Jun 2021 13:04:20 -0700 (PDT) Received: from evledraar (j57224.upc-j.chello.nl. [24.132.57.224]) by smtp.gmail.com with ESMTPSA id lu21sm300594ejb.31.2021.06.23.13.04.20 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 23 Jun 2021 13:04:20 -0700 (PDT) From: =?utf-8?B?w4Z2YXIgQXJuZmrDtnLDsA==?= Bjarmason To: "Randall S. Becker" Cc: git@vger.kernel.org, 'Junio C Hamano' , 'SZEDER =?utf-8?Q?G=C3=A1bor'?= , =?utf-8?Q?'Ren=C3=A9?= Scharfe' , 'Taylor Blau' Subject: Re: [PATCH 00/25] progress.c: various fixes + SZEDER's RFC code Date: Wed, 23 Jun 2021 22:01:55 +0200 References: <00fb01d76859$8a6ebc50$9f4c34f0$@nexbridge.com> User-agent: Debian GNU/Linux 11 (bullseye); Emacs 27.1; mu4e 1.5.12 In-reply-to: <00fb01d76859$8a6ebc50$9f4c34f0$@nexbridge.com> Message-ID: <87lf701h6k.fsf@evledraar.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org On Wed, Jun 23 2021, Randall S. Becker wrote: > On June 23, 2021 1:48 PM, =C3=86var Arnfj=C3=B6r=C3=B0 Bjarmason wrote: >>> On Mon, Jun 21, 2021 at 02:59:53AM +0200, =C3=86var Arnfj=C3=B6r=C3=B0 = Bjarmason wrote: >>>> >>>> On Sun, Jun 20 2021, SZEDER G=C3=A1bor wrote: >>>> >>>> > Splitting off from: >>>> > >>>> > >>>> > https://public-inbox.org/git/cover-0.2-0000000000-20210607T144206Z- >>>> > avarab@gmail.com/T/#me5d3176914d4268fd9f2a96fc63f4e41beb26bd6 >>>> > >>>> > On Tue, Jun 08, 2021 at 06:14:42PM +0200, Ren=C3=A9 Scharfe wrote: >>>> >> I wonder (only in a semi-curious way, though) if we can detect >>>> >> off-by-one errors by adding an assertion to display_progress() >>>> >> that requires the first update to have the value 0, and in >>>> >> stop_progress() one that requires the previous display_progress() >>>> >> call to have a value equal to the total number of work items. Not >>>> >> sure it'd be worth the hassle.. >>>> > >>>> > I fixed and reported a number of bogus progress lines in the past, >>>> > the last one during v2.31.0-rc phase, so I've looked into whether >>>> > progress counters could be automatically validated in our tests, >>>> > and came up with these patches a few months ago. It turned out >>>> > that progress counters can be checked easily and transparently in >>>> > case of progress lines that are shown in the tests, i.e. that are >>>> > shown even when stderr is not a terminal or are forced with >>>> > '--progress'. (In other cases it's still fairly easy but not quite >>>> > transparent, as I think we need changes to the progress API; more >>>> > on that later in a separate >>>> > series.) >>>> >>>> I've also been working on some progress.[ch] patches that are mostly >>>> finished, and I'm some 20 patches in at the moment. I wasn't sure >>>> about whether to send an alternate 20-patch "let's do this (mostly) in= stead?" >>>> series, hence this message. >>>> >>>> Much of what you're doing here becomes easier after that series, e.g. >>>> your global process struct in 2/7 is something I ended up >>>> implementing as part of a general feature to allow progress to be >>>> driven by either display_progress() *or* the signal handler itself. >>> >>> It's difficult to know who should rebase onto who without seeing one >>> half of the patches. >> >>I was sort of hoping he'd take me word for it, but here it is. Don't say = I didn't warn you :) >> >>> I couldn't find a link to them anywhere (even if they are only >>> available in your fork in a pre-polished state) despite looking, but >>> my apologies if they are available and I'm just missing them. >> >>FWIW it's avar-szeder/progress-bar-assertions in https://github.com/avar/= git.git, that repo contains various functioning and not-so- >>functioning code. >> >>https://github.com/avar/git/tree/meta/ is my version of the crappy script= s we probably all have some version of for building my own git, >>things that are uncommented in series.conf is what I build my own git fro= m. >> >>> In general, I think that these patches are clear and are helpful in >>> pinning down issues with the progress API (which I have made a hadnful >>> of times in the past), so I would be happy to see them picked up. >> >>Here's all 25 patches (well, around 20 before) that I had queued up local= ly and fixed up a bit. >> >>The 01/25 is something I submitted already as https://lore.kernel.org/git= /patch-1.1-cba5d88ca35-20210621T070114Z-avarab@gmail.com; >>hoping to get this in incrementally. >> >>The 12/25 is my own version of that "global progress struct, 11/25 is the= first of many bugs SZEDER missed in his :) >> >>18/25 is the first step of the UI I was going for, the signal handler can= now drive the progress bar, so e.g. during "git gc" we show (at least >>for me, on git.git), a "stalled" message just before we start the actual = count of "Enumerating Objects". >> >>After that was in I was planning on adding config-driven support to show = a "spinner" when we stalled in that way, config-driven because >>you could just scrape e.g. https://github.com/sindresorhus/cli-spinners/b= lob/main/spinners.json >>into your own config. See >>https://jsfiddle.net/sindresorhus/2eLtsbey/embedded/result/ :) >> >>19-23/25 is my grabbing of SZEDER's patches that I'm comfortable labeling= as "PATCH", I think they work, but no BUG() assertions yet. I >>left out the GIT_TEST_CHECK_PROGRESS parts, since my earlier works set th= ings up to do any BUG() we trust by default. >> >>22/25 is what I think we should do instead of SZEDER's 6/7 >>(http://lore.kernel.org/git/20210620200303.2328957-7-szeder.dev@gmail.com) >>I don't think this "our total doesn't match at the end" is something we s= hould always BUG() on, for reasons explained there. >> >>I am sympathetic to doing it by default though, hence the >>stop_progress_early() API, that's there to allow select callers to bypass= his BUG(...) assertion. >> >>24/25 and 25/25 are "RFC" and a rebased+modified version of SZEDER's >>BUG(...) assertions. >> >>His series passes the test suite, but actually severely break things thin= gs. It'll make e.g. "git commit-graph write" BUG(...) out. The reason >>the tests don't catch it is because we have a blind spot in the tests. >> >>Namely, that most things that use the progress bar API use isatty() to ch= eck if they should start_progress(). If you run the tests as e.g. >>(better ways to do this, especially in parallel, most welcome): >> >> for t in t[0-9]*.sh; do if ! ./$t -vixd; then echo $t bad; break; fi;= done >> >>You can discover various things that his series BUG()'s on, I fixed a cou= ple of those myself, it's an early part of this series. >> >>But we'll still have various untested for BUG()'s even then, this is beca= use you *also* have to have the test actually emit a "naked" >>progress bar on stderr, if the test itself e.g. pipes fd 2 to a file it w= on't work. >> >>I created a shitty-and-mostly-broken throwaway change to search-replace a= ll the guards of "start_progress(...)" to run unconditionally, and >>convert all the "delayed" to the non-delayed version. That'll find even m= ore BUG()'s where SZEDER's series still needs to be fixed (and also >>some unrelated segfaults, I gave up on it soon after). >> >>Even if we fix that I wouldn't trust it, because a lot of the progress ba= rs we have depend on the size and shape of the data we're >>processing, e.g. the bug I fixed in 11/25. If people find this BUG() appr= oach worth pursuing I think it would be better to make it an opt-in >>flag we convert one caller at a time to. >> >>For some it's really clear that we could assert it, for others such as th= e commit-graph it's much more subtle, we're in some callback after >>setting a "total", that callback does a "break", "continue" etc. in vario= us places, all depending on repository data. >> >>It's not easy to reason about that and be certain that we can hold to the= estimate. If we get it wrong someone's repo in the wild won't fully >>GC because of the overly eager BUG(). >> >>If SZEDER wants to pursue it I think it'll be easier on top of this serie= s, but personally I really don't see the point of spending effort on it. >> >>We should really be going in the other direction, of having more fuzzy ET= As, not less. >> >>E.g. we often have enough data at the start of "Enumerating Objects" >>to give a good-enough target value, that it's 5-10% off isn't really the = point, but that the user looking at it sees something better than a >>dumb count-up, and can instead see that they'll probably be looking at it= for about a minute. Now our API is to give no ETA/target if we're >>not 100% sure, it's not good UX. >> >>So trying to get the current exact count/exact percentage right seems lik= e a distraction to me in the longer term. If anything we should >>just be rounding those numbers, showing fuzzy ETAs instead of percentages= if we can etc. >> >>SZEDER G=C3=A1bor (4): >> commit-graph: fix bogus counter in "Scanning merged commits" progress >> line >> entry: show finer-grained counter in "Filtering content" progress line >> progress: assert last update in stop_progress() >> progress: assert counting upwards in display() >> >>=C3=86var Arnfj=C3=B6r=C3=B0 Bjarmason (21): >> progress.c tests: fix breakage with COLUMNS !=3D 80 >> progress.c tests: make start/stop verbs on stdin >> progress.c tests: test some invalid usage >> progress.c tests: add a "signal" verb >> progress.c: move signal handler functions lower >> progress.c: call progress_interval() from progress_test_force_update() >> progress.c: stop eagerly fflush(stderr) when not a terminal >> progress.c: add temporary variable from progress struct >> midx perf: add a perf test for multi-pack-index >> progress.c: remove the "sparse" mode nano-optimization >> pack-bitmap-write.c: add a missing stop_progress() >> progress.c: add & assert a "global_progress" variable >> progress.[ch]: move the "struct progress" to the header >> progress.[ch]: move test-only code away from "extern" variables >> progress.c: pass "is done?" (again) to display() >> progress.[ch]: convert "title" to "struct strbuf" >> progress.c: refactor display() for less confusion, and fix bug >> progress.c: emit progress on first signal, show "stalled" >> midx: don't provide a total for QSORT() progress >> progress.c: add a stop_progress_early() function >> entry: deal with unexpected "Filtering content" total >> >> cache.h | 1 - >> commit-graph.c | 2 +- >> csum-file.h | 2 - >> entry.c | 12 +- >> midx.c | 25 +- >> pack-bitmap-write.c | 1 + >> pack.h | 1 - >> parallel-checkout.h | 1 - >> progress.c | 391 ++++++++++++++++++------------- >> progress.h | 50 +++- >> reachable.h | 1 - >> t/helper/test-progress.c | 54 +++-- >> t/perf/p5319-multi-pack-index.sh | 21 ++ >> t/t0500-progress-display.sh | 247 ++++++++++++++----- >> 14 files changed, 537 insertions(+), 272 deletions(-) create mode 10075= 5 t/perf/p5319-multi-pack-index.sh > > Is there provision for disabling progress on a per-command basis? My > use case is specifically in a CI/CD script, being able to suppress > progress handling. The current Jenkins plugin does not appear to have > provision for hooking into a mechanism, which makes things get a bit > wonky when a job runs with a pseudo-tty (as provided by Jenkins > through SSH/RMI). > -Randall There isn't, some commands support --no-progress, but it's hit and miss. You can then set the undocumented GIT_PROGRESS_DELAY=3D99999999 (or some really big number) to suppress more of them. We could just add it as a top-level "git --no-progress" option I suppose... Probably better would be to detect such not-a-terminals somehow, I think at some point our own gc.log was a victim of this.