public-inbox.git - an "archives first" approach to mailing lists

Date	Commit message (Collapse)
2022-10-05	www_coderepo: an alternative to cgit
	This will allow it to easily map a single coderepo to multiple inboxes (or multiple coderepos to any number of inboxes). For now, this is just a summary, but $REPO/$OID/s/ support will be added, along with archive downloads. Indexing of coderepos will probably be supported via -extindex, only.
2022-09-12	www: viewdiff: fix UTF-8 names inside mbox attachments
	This avoids `Wide character in print' warnings and ensures the UTF-8 characters in `Signed-off-by' trailers are properly rendered in HTML even when attempting to decode and display application/octet-stream mbox attachments as HTML. Linkification and reconstruction for coderepos is probably still broken, but that is a much bigger task to fix, I think. Fixes: ab9c03ff4aa369b3 ("www: use PerlIO::scalar (zfh) for buffering")
2022-09-10	xt: fold perf-obfuscate into perf-msgview, future-proof
	perf-obfuscate was close enough to perf-msgview that it only required setting the `obfuscate' field of the inbox. Then update perf-msgview to account for upcoming internal changes. The current use of {obuf} and concat ops results in excessive scratchpad space and I may be able to even get speedups by avoiding concat ops.
2022-09-04	prepare HTML rendering maintainer tests for upcoming changes
	There'll be a number of upcoming changes to HTML rendering of messages to hopefully reduce memory usage and speedups by writing out to the gzip buffer earlier. Update the tests now so it'll be easier to test before and after results.
2022-08-21	public-inbox 1.9.0 v1.9.0

2022-08-19	tests: add some basic "lei reindex" tests
	This is a bit hard-to-test, but at least we must ensure volatile-metadata is preserved.
2022-08-19	lei reindex: new command to reindex lei/store

2022-08-11	examples: add systemd files for -netd
	It's important show that a single systemd service and socket file can replace all other read-only daemons for ease-of-management.
2022-08-11	examples: consolidate systemd socket examples
	systemd.socket(5) files can actually contain multiple listen sockets, so shave down inode overhead and simplify config file management by consolidating all applicable ports into a single file for each daemon.
2022-08-11	doc: drop ancient Apache and WEBrick examples
	Having old, unmaintained docs for other HTTP servers is likely harmful at this point. public-inbox-httpd is specifically designed to handle git repos on slow storage and stream giant mbox.gz files fairly to slow clients.
2022-08-11	doc: add WIP 1.9 release notes
	I think "lei reindex" and some doc cleanups will be the last things before 1.9...
2022-08-02	daemon: support per-listener env, .psgi, out, err
	This allows memory savings by allowing multiple, completely unrelated-PSGI apps to run within the same process as IMAP, NNTP, and POP3.
2022-07-30	tests: maintainer test for using mpop
	This ought to be a good stress test to ensure our POP3 implementation works against the POP3 client I've found.
2022-07-24	add xt/mem-nntpd-tls maintainer test
	This ensures memory usage is reasonable when DEFLATE and TLS are enabled. It's also our only coverage for NNTP COMPRESS since Net::NNTP has yet to implement compression support: https://rt.cpan.org/Public/Bug/Display.html?id=129967
2022-07-23	imap+nntp: share COMPRESS implementation
	Their code was nearly identical to begin with, so save some memory in -netd and disk space for all of our tarball/distro users, at least. And I seem to have used multiple inheritance successfully, here, maybe...
2022-07-20	public-inbox-pop3d - a mostly read-only POP3 server
	Old account expiry has not been implemented, but it seems to work well with both mpop(1) and getmail(1). The strictness of mpop was particularly helpful in ironing out bugs in our implementation of (dreaded) message sequence numbers. "EXPIRE 0" (RFC 2449) can theoretically save numerous "DELE" commands, but that's untested by real-world clients. mpop supports PIPELINING which is effective in hiding latency, and the core networking functionality is already well-tested from our NNTP and IMAP implementations. Configuration requires "publicinbox.pop3state" to point to a directory writable by the otherwise read-only daemon. See public-inbox-pop3d(1) manpage for more usage details.
2022-07-20	lei: avoid deadlock on inotify/EVFILT_VNODE wakeups
	Enqueuing "note-event" requests from the DS event loop must not wait on workers being able to drain the queue quickly enough. Thus we make the SOCK_SEQPACKET writes nonblocking and rely on the lei-daemon event loop to enqueue writes. This is a unique problem for "note-event" since it reuses workers in between commands, while most lei commands currently fork off new workers.
2022-06-21	search: do not index base-85 binary patches
	Base-85 binary patches generated by git lead to many false positives, so skip over gibberish words which may occur in them. To avoid regressions in search results, continue to allow searching for exact size matches (via "literal $SIZE") and the phrase "GIT binary patch" for the mere presence of a binary patch.
2022-06-11	Add EditorConfig file
	This allows several editors to automatically use the correct settings when editing public-inbox files. [ew: add to MANIFEST, too]
2022-05-05	public-inbox-netd: a multi-protocol superserver
	Since we'll be adding POP3 support as our 4th network protocol; asking admins to run yet another daemon on top of existing -httpd, -nntpd, -imapd is a maintenance burden and a waste of memory. The goal of public-inbox-netd is to be able to replace all existing read-only daemons with a single process to save memory and reduce administrative overhead; hopefully encouraging more users to self-host their own mirrors. It's barely-tested at the moment. Eventually, multiple PI_CONFIG and HOME directories will be supported, as are per-listener .psgi config files.
2022-04-23	public-inbox 1.8.0 v1.8.0

2022-04-01	doc: add WIP release notes for 1.8
	1.8 will be a minor release, soon (I initially expected to release it in December, but was side-tracked). Major features will be for 1.9.
2022-01-31	rewrite Linux nodatacow use in pure Perl w/o system
	btrfs is Linux-only at the moment (and likely to remain that way for practical purposes). So rely on Linux ABI stability and use the `syscall' and `ioctl' perlops rather than relying on Inline::C. Inline::C (and gcc\|\|clang) are monstrous dependencies which we can't expect users to have. This makes supporting new architectures more difficult, but new architectures come along rarely and this reduces the burden for the majority of Linux users on popular architectures (while still avoiding the distribution of pre-built binaries). Link: https://public-inbox.org/meta/YbCPWGaJEkV6eWfo@codewreck.org/
2021-11-06	MANIFEST: update for non-fatal "make check" message
	Oops :x
2021-11-04	doc: add more 3rd-party refs, use Debian manpages for xapian
	curl, torsocks, and gitglossary manpages are all newly referenced, so make sure they're linkified properly in HTML. We'll be using Debian's manpages as an ad-free, Tor-accessible host for manpages as a fallback since hosting manpages for all 3rd-party projects we reference doesn't scale.
2021-11-03	doc: lei-q: document SEARCH TERMS prefixes
	The new Documentation/common.perl file will be used for all manpages in the future.
2021-10-31	doc: add lei-mail-sync-overview manpage
	Mostly illustrating how clunky the process is :p We'll also tweak some things in existing man pages around mail synchronization.
2021-10-22	lei: use RENAME_NOREPLACE on Linux 3.15+
	One syscall is better than two for atomicity in Maildirs. This means there's no window where another process can see both the old and new file at the same time (link && unlink), nor a window where we might inadvertantly clobber an existing file if we were to do `stat && rename'.
2021-10-18	v2: mirrors don't clobber msgs w/ reused Message-IDs
	For odd messages with reused Message-IDs, the second message showing up in a mirror (via git-fetch + -index) should never clobber an entry with a different blob in over. This is noticeable only if the messages arrive in-between indexing runs. Fixes: 4441a38481ed ("v2: index forwards (via `git log --reverse')")
2021-10-16	MANIFEST: regenerate with: git ls-files >MANIFEST

2021-10-16	doc: lei: add manpages for remaining commands
	At this point all of the current lei commands, aside from -help and -sucks, should be covered.
2021-10-15	lei: TSTP affects all curl and related subprocesses
	By relying more on pgroups for remaining remaining processes, this lets us pause all curl+tail subprocesses with a single kill(2) to avoid cluttering stderr. We won't bother pausing the pigz/gzip/bzip2/xz compressor process not cat-file processes, though, since those don't write to the terminal and they idle soon after the workers react to SIGSTOP. AutoReap is hoisted out from TestCommon.pm. CLONE_SKIP is gone since we won't be using Perl threads any time soon (they're discouraged by the maintainers of Perl).
2021-10-05	index: --reindex w/ --{since,until,before,after}
	This lets administrators reindex specific time ranges according to git "approxidate" formats. These arguments are passed directly to underlying git-log(1) invocations and may still reach into old epochs. Since these options rely on git committer dates (which we infer from the most recent Received: header), they are not guaranteed to be strictly tied to git history and it's possible to over/under-reindex some messages. It's probably not a major problem in practice, though; reindexing a few extra messages is generally harmless aside from some extra device wear. Since this currently relies on git-log, these options do not affect -extindex, yet.
2021-10-02	lei mail-diff: diagnostic command to diff mail contents
	This is useful in finding the cause of deduplication bugs, and possibly the cause of missing threads reported by Konstantin in <20211001130527.z7eivotlgqbgetzz@meerkat.local> usage: u=https://yhbt.net/lore/all/87czop5j33.fsf@tynnyri.adurom.net/raw lei mail-diff $u
2021-10-02	lei inspect: fix "mid:" prefix, expand to Xapian
	This fixes inspect for uninitialized instances, and adds Xapian ("xdoc") output if available. Reported-by: Konstantin Ryabitsev <konstantin@linuxfoundation.org> Message-ID: <20211001204943.l4yl6xvc45c5eapz@meerkat.local>
2021-10-02	doc: lei-daemon: new manpage
	In case users see "lei-daemon" in ps(1) or syslog and wonder. Helped-by: Kyle Meyer <kyle@kyleam.com>
2021-09-25	lei ls-external: split into separate file
	This was written before we had auto-loading and rarely used.
2021-09-25	lei add-external: split into separate file
	Also was written before we had auto-loading and rarely used.
2021-09-25	lei forget-external: split into separate file
	This was written before we had auto-loading, and forget-external should be a rarely-used command that's not worth loading at startup. Do some golfing while we're in the area, too.
2021-09-25	doc: lei: manpages for export-kw and refresh-mail-sync
	Something is better than nothing.
2021-09-19	lei config --edit: use controlling terminal
	As with "lei edit-search", "lei config --edit" may spawn an interactive editor which works best from the terminal running script/lei. So implement LeiConfig as a superclass of LeiEditSearch so the two commands can share the same verification hooks and retry logic.
2021-09-19	xt: add fsck script over over.sqlite3
	I'm not sure what caused it, but I've noticed two missing messages that failed from "lei up" on an https:// external; and I've also seen some duplicates in the past (which I think I fixed...).
2021-09-17	doc: add lei-security(7) manpage
	It seems like a good idea to have a manpage where somebody can quickly look up and address their concerns as to what to put on encrypted device/filesystem. And I probably would've designed lei around make(1) for parallelization if I didn't have to keep credentials off the FS :P
2021-09-17	lei refresh-mail-sync: replace prune-mail-sync
	Merely pruning mail synchronization information was insufficient for Maildir: renames are common in Maildir and we need to detect them after-the-fact when lei-daemon isn't running. Running this command could make "lei index" far more useful... v2: close R/O mail_sync.sqlite3 dbh before fork Keeping the DB file handle open across fork can cause bad things to happen even if we don't use it since sqlite3 itself still knows about it (but doesn't know Perl code doesn't know about it).
2021-09-15	multi_git: hoist out common epoch/alternates handling
	IMHO, this greatly improves code sharing and organization between v2, extindex, and lei/store. Common git-related logic for these is lightly-refactored and easier to reason about. The impetus for this big change was to ensure inboxes created+managed by public-inbox-{clone,fetch} could have alternates and configs setup properly without depending on SQLite (via V2Writable). This change does that while making old code shorter and better factored.
2021-09-12	new public-inbox-{clone,fetch} commands
	Setting up and maintaining git-only mirrors of v2 inboxes is complex since multiple commands are required to clone and fetch into epochs. Unlike grokmirror, these commands do not require any configuration. Instead, they rely on existing git config files and work like "git clone --mirror" and "git fetch", respectively. Like grokmirror, they use manifest.js.gz, but only on a per-inbox basis so users won't have to clone every inbox of a large instance nor edit config files to include/exclude inboxes they're interested in.
2021-09-10	doc: lei-index manpage
	It's a pretty incomplete command, so it's important to document its incompleteness.
2021-09-10	lei add-external --mirror: deduce paths for PSGI mount prefixes
	The current manifest.js.gz generation in WWW doesn't account for PSGI mount prefixes (and grokmirror 1.x appears to work fine). In other words, <https://yhbt.net/lore/lkml/manifest.js.gz> currently has keys like "/lkml/git/0.git" and not "/lore/lkml/git/0.git" where "/lore" is the PSGI mount prefix. This works fine with the prefix accounted for in my grokmirror (1.x) repos.conf like this: site = https://yhbt.net/lore/ manifest = https://yhbt.net/lore/manifest.js.gz Adding the PSGI mount prefix in manifest.js.gz is probably not desirable since it would force the prefix into the locally cloned path by grokmirror, and all the cloned directories would have the remote PSGI mount prefix prepended to the toplevel. So, "lei add-external --mirror" needs to account for PSGI mount prefixes by deducing the prefix based on available keys in the manifest.js.gz hash table.
2021-09-09	lei-rm: add man page, support LeiInput args
	-F/--in-format and --lock=TYPE(S) are easily supported by all classes using LeiInput.
2021-09-03	lei up --all: avoid double-close on shared STDOUT
	This is merely to avoid perl setting errors internally which were not user visible. The double-close wasn't a problem in practice since we open a new file hanlde for the mbox or mbox.gz anyways, so the new t/lei-up.t test case shows no regressions nor fixes.