On 2020-02-23 at 21:57:59, Jeff King wrote: > On Sat, Feb 22, 2020 at 08:17:26PM +0000, brian m. carlson wrote: > > > Avoid hard-coding a hash size, instead preferring to use the_hash_algo. > > [...] > > - hashwrite(out, base_sha1, 20); > > + hashwrite(out, base_sha1, the_hash_algo->rawsz); > > Yeah, looks obviously correct (and I think this is new from the > pack-reuse patches of mine that Christian sent recently). I believe it is, which is why I CC'd you on it. > The name "base_sha1" is clearly not great either. It could be changed > trivially, but the real sin is that it comes from > nth_packed_object_sha1(). It could be replaced with > nth_packed_object_oid() at the cost of an extra hash copy, which isn't > too bad. I probably should send a series getting rid of the rest of the "sha1" places in our code, because there are a lot of them, but I just haven't gotten around to it yet. And yeah, as you mentioned, there are still a lot of places using nth_packed_object_sha1. > It would be nice to get rid of that function entirely. In most cases, > it's either free to do so (we end up copying the result into an oid > anyway) or we pay for one extra hashcpy (out of the mmap into a local > struct). But the one in pack-check.c:verify_packfile() is tricky; we > store a pointer per object into the index mmap, and we'd have to switch > to storing an oid per object. Given that the code isn't commonly run > (and other operations like _generating_ the index in the first place are > clearly OK with the extra memory hit), I think I'd be OK with that in > the name of cleanliness. Yeah, that's why I hadn't switched it out earlier. > I think it would also be correct to cast the mmap'd bytes to a "struct > object_id" given that the struct contains the hash bytes as the first > member. I worry a little that we'd get no compiler warning of the > breakage if that assumption changes, but it also seems unlikely to do > so. In the future, struct object_id will get a new member (an algorithm value), but I think it's fine to make the assumption that the hash bytes are first. -- brian m. carlson: Houston, Texas, US OpenPGP: https://keybase.io/bk2204