From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-Status: No, score=-4.0 required=3.0 tests=ALL_TRUSTED,BAYES_00 shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id 55A561F66E; Wed, 2 Sep 2020 19:05:25 +0000 (UTC) Date: Wed, 2 Sep 2020 19:05:25 +0000 From: Eric Wong To: meta@public-inbox.org Subject: message bloat over time... Message-ID: <20200902190525.GA11126@dcvr> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline List-Id: I've been indexing and reindexing a local mirror of https://lore.kernel.org/lkml a bit, and it's kinda depressing to see newer messages being more and more bloated even on a plain-text-only mailing list :< The first column ("$X.git" is the epoch number, older epochs are lower-numbered: "0.git" is oldest, "8.git" (not shown) is the newest. 8.git is omitted since it's still in-progress, each epoch is capped at roughly ~1.1G of packed git storage. The last column is the number of messages in that epoch, so fewer messages fit in each epoch: 7.git counting 17d7e25e3e862d5d99182557bb723374230a8497 ... 312754 6.git counting bc9b3c196d0fc92a520e9ad4f92c4d3c1db1943f ... 346017 5.git counting 31ed379430c456f90bdd172b223020c0e6d7cb8d ... 379561 4.git counting 88294f6d487193f5984791ee81213a25130d0559 ... 416015 3.git counting 93d9eace2721494d8457c7f5f6de803c0d648172 ... 453851 2.git counting d48078ceeec1f51313253a56ed3ba0eae7fde909 ... 455366 1.git counting 6b67b9f5e0cd82d3c734e6cdc44c1f722ab6fb6a ... 475671 0.git counting b67bf7f62c8125d67461cc6e7d1736ddc8844a18 ... 570488 So yeah, old epochs could fit more messages because messages were smaller back then... /me goes back to yelling at the sky...