RE: git bisect enhancement request

git@vger.kernel.org mailing list mirror (one of many)
 help / color / mirror / code / Atom feed

From: "Randall S. Becker" <rsbecker@nexbridge.com>
To: <jim.cromie@gmail.com>
Cc: "'brian m. carlson'" <sandals@crustytoothpaste.net>,
	<git@vger.kernel.org>
Subject: RE: git bisect enhancement request
Date: Sat, 15 Aug 2020 18:18:41 -0400	[thread overview]
Message-ID: <019701d67352$0ba8c250$22fa46f0$@nexbridge.com> (raw)
In-Reply-To: <CAJfuBxxqxKAFd_3F5ZKWjfmWA0v+kKX6mNqQA=sGYaP-NYXDSQ@mail.gmail.com>

On August 15, 2020 9:05 AM, jim.cromie@gmail.com wrote:
> To: Randall S. Becker <rsbecker@nexbridge.com>
> Cc: brian m. carlson <sandals@crustytoothpaste.net>; git@vger.kernel.org
> Subject: Re: git bisect enhancement request
> 
> good questions, thank you both
> 
> On Fri, Aug 14, 2020 at 4:25 PM Randall S. Becker
> <rsbecker@nexbridge.com> wrote:
> >
> > On August 14, 2020 6:08 PM, brian m. carlson wrote:
> > > On 2020-08-14 at 21:11:19, jim.cromie@gmail.com wrote:
> > > > please teach git bisect how to use 2 separate worktrees for the
> > > > bisection, toggling between them as bisection progresses, so that
> > > > the end result is 2 compiled kernels, one broken, one good.
> > >
> > > I'm not sure how such a thing would be implemented.  Git doesn't
> > > know until after it's checked out the tree whether a revision is
> > > good or bad, since usually the user needs to tell it (or use git
> > > bisect run).  Even if Git alternated between the two worktrees in
> > > order, that doesn't mean that one of them will end up being good,
> > > since Git may find the last good revision early on, then continue to bisect
> and find many bad revisions until it determines the right one.
> > >
> > > Can you tell us more about the algorithm you'd like to see here?
> >
> > I'm wondering more about the requirements on this. Does the bisect
> manipulate both worktrees at once or separately? Are these worktrees
> variants on a theme but on different branches (so synchronizing the commits
> would probably be impractical) - or bisect both workspaces but using
> different commits as start and end (this should be scriptable)? Does it flip
> back and forth between the two worktrees doing a bisect in one, then the
> other (this should be scriptable)? Or is this just to teach git to bisect a
> worktree in a distinct manner - which, correct me if I'm wrong, I think it
> already supports.
> >
> > Or is this more, I have two worktrees on basically the same branch. When
> bisect happens, one worktree is bisected, tested. If the new code succeeds,
> and the other worktree is in a failed state, don't bisect the other worktree in
> the same direction - that's not right, but I think I know your end-state goal:
> keep bisecting both worktrees until a state change. What that is leading to is
> really something different, which is that once bisect is done, you know which
> commit introduced the bug, so set one worktree to the working commit and
> the other to the broken commit. If that's the case, it's not a toggle, but an
> end-state operation to set two worktrees to adjacent commits essentially
> surrounding the introduction point. Is that what you want?
> >
> 
> heh - reading paragraph 1, I thought "yeah thats it"
> but then paragraph 2 ...
> 
> its more about the end-state as Ive thought about it, and I casually leaped to
> it being useful at every iteration.
> Having 2 worktrees converging on good/bad yes/no old/new does seem
> generically appealing, but I dont have a solid use case.
> 
> My best generic argument is that determining good/bad on a build can be
> hard, and having the previous build(s) around could be useful.
> 
> Broadening, having --last=N wktree-g<sha8>s, in a grove/orchard would be
> simple to explain (N=0 currently, wo naming games) Accepting a 'wktree'
> prefix, and adding suffixes like 5.8.0-v1-00025-g8bfb9456e727-dirty
> (with a strftime-like format "%v-%u-%5N-%14g-%s" ideally) would sidestep
> any 'policy/methodology' implied by wktree names.
> having -00025 etc would give cheap look at bisect convergence.
> 
> and having 'planted' a grove of bisection points, one could leave it around,
> for subsequent use as a "re-bisection cache" to retest against a iteratively
> refined bisect run script
> 
> Im now speculating hard, I hope it makes some sense.
> 
> My fever dream is to have a gdb-mi script/program talking to 2 separate
> targets, and "bisecting breakpoints" back from where the bad one panics, to
> the point where the gdb trace diverges between the 2 targets.
> 
> and I want to run rr record vmlinux inside qemu, with quick deterministic
> replay, and a tight "breakpoint bisection" loop.  But I digress. And a pony.

I think that you might have answered this yourself-ish. Assume that the code is broken and you are looking for the commit where that happened. Bisect will get you there in each target independently and answer which commit works and which commit breaks on system X. The same can be said for system Y in a separate worktree. You then can check whether the commits are the same or not. If not, you have established your system-X/Y delta. If they are the same, you can then check the gdb trace. If they are the same, conceptually anyway, both systems are behaving similarly. Otherwise, you can reset bisect and start over on system Y (or X, I suppose depending on the commit parentage) using the gdb trace as a criteria for good/bad for bisect purposes. Obviously I'm simplifying, but bisect is agnostic to your pass/fail criteria so perhaps some variant of this, where you use a multi-variate concept of "it works" and "the trace matches". The same concept is used for performance testing, where bisect can be very useful for determining where a performance degradation was introduced (or where things got better), by using various performance metrics as good/bad criteria instead of whether the code functionally works. Sadly (a nit of mine) is that I don't see performance regression testing used enough in DevOps pipelines although it is somewhat difficult to quantify those metrics in a multi-target pipeline).

My $0.03
Randall

-- Brief whoami:
 NonStop developer since approximately 211288444200000000
 UNIX developer since approximately 421664400
-- In my real life, I talk too much.

     prev parent reply	other threads:[~2020-08-15 22:19 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-08-14 21:11 git bisect enhancement request jim.cromie
2020-08-14 22:08 ` brian m. carlson
2020-08-14 22:25   ` Randall S. Becker
2020-08-15 13:04     ` jim.cromie
2020-08-15 22:18       ` Randall S. Becker [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: http://vger.kernel.org/majordomo-info.html

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='019701d67352$0ba8c250$22fa46f0$@nexbridge.com' \
    --to=rsbecker@nexbridge.com \
    --cc=git@vger.kernel.org \
    --cc=jim.cromie@gmail.com \
    --cc=sandals@crustytoothpaste.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

Code repositories for project(s) associated with this public inbox

	https://80x24.org/mirrors/git.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).