public-inbox.git - an "archives first" approach to mailing lists

Date	Commit message (Collapse)
2017-06-26	watch: avoid potential race condition while quitting
	We must not trigger future activity when initializing a -watch shutdown.
2017-06-26	watch: commit changes to fast-import sooner
	We should make changes visible sooner, even during lengthy scans.
2017-06-26	watch: use "self-inotify-tempfile trick" for quit
	This should be more reliable and safer as it'll ensure existing fast-import instances are shut down properly.
2017-06-26	watch: improve fairness during full rescans
	We need to ensure new messages are being processed fairly during full rescans, so have the ->scan subroutine yield and reschedule itself. Additionally, having a long-running task inside the signal handler is dangerous and subject to reentrancy bugs. Due to the limitations of the Filesys::Notify::Simple interface, we cannot rely on multiplexing I/O interfaces (select, IO::Poll, Danga::Socket, etc...) for this. Forking a separate process was considered, but it is more expensive for a mostly-idle process. So, we use a variant of the "self-pipe trick" via inotify (or whatever Filesys::Notify::Simple gives us). Instead of writing to our own pipe, we write to a file in our own temporary directory watched by Filesys::Notify::Simple to trigger events in signal handlers.
2017-06-26	watch: ensure HUP causes the scanner to be reloaded
	Otherwise the old watcher may run indefinitely
2017-06-23	watchmaildir: deal with rejected (100) messages
	The RubyLang filter is strict about what messages it rejects, so the spam learning path will not auto-train or remove messages missing X-Mail-Count headers.
2017-06-22	add filter for RubyLang lists
	Unfortunately, it appears we have to reject this and instead add support filtering at View time(), due to DKIM signatures in messages from ruby-lang.org. () which may not be worth it
2017-05-09	watchmaildir: show $@ in warning message
	It should be helpful to know what error happened.
2017-04-04	watchmaildir: do not reject lowercase flags on Maildir files
	Dovecot uses 'a'..'z' (lowercase) to designate keywords in Maildir flags. This was preventing certain messages from being marked as spam. https://wiki2.dovecot.org/MailboxFormat/Maildir
2017-01-26	watchmaildir: allow arguments for filters
	We'll want to allow some degree of configuration for various mailing lists.
2017-01-19	watchmaildir: limit live importer processes
	We don't want to be triggering OOM or swapping on weaker systems when we have dozens of inboxes as potential targets.
2017-01-10	introduce PublicInbox::MIME wrapper class
	This should fix problems with multipart messages where text/plain parts lack a header. cf. git clone --mirror https://github.com/rjbs/Email-MIME.git refs/pull/28/head In the future, we may still introduce as streaming interface to reduce memory usage on large emails.
2017-01-02	watch: watchspam affects all configured inboxes
	If a message is spam in one mailbox, it is spam in all others a particular user/group will care about.
2016-09-01	watch: use "publicinboxwatch" namespace
	We'll keep supporting "publicinboxlearn" indefinitely, but "publicinboxwatch" is probably more appropriate at the moment. Noticed while writing documentation.
2016-08-12	watch: respect altid for incremental watch changes
	We need to pass the Inbox object to SearchIdx to get altid mappings properly for incremental imports. TODO: use the Inbox object in more places where it makes sense to do so.
2016-06-26	watch_maildir: warn on spam check failures
	It would be nice to know about spamcheck failures.
2016-06-24	watch_maildir: ignore Trash and Drafts, support Dovecot
	Trashed messages and drafts are probably not intended for importing, so do not import them. Dovecot uses extra flags via lowercase letters, so we must support those (as that's the server I use).
2016-06-24	watch_maildir: implement optional spam checking
	Mailing lists I watch and mirror may not have the best spam filtering, and an extra layer should not hurt.
2016-06-24	watch_maildir: rename _check_spam => _remove_spam
	We do not actually do spam checking, here; but will do spam checking before adding a message in the future.
2016-06-24	document Filesys::Notify::Simple dependency
	And improve documentation for existing dependencies, too.
2016-06-19	watch_maildir: tighten up path checks
	Only mark seen messages as spam, otherwise it could be too aggressive and cause problems or over training. We wouldn't want a wayward FIFO ruining our day, either :)
2016-06-19	watch_maildir: spam removal support
	We can support spam removal by watching a special "spam" Maildir, too. We can run public-inbox-learn as a separate step, and that command will be improved to support auto-learning, too.
2016-06-18	watch_maildir: add scan test
	This should be portable despite the intended use of this directory being non-portable.
2016-06-17	watch: quiet down rejected header matches
	People may use this directive because they prefer to merge several mailing lists into one local mailbox, so there may be many messages and we should not needlessly clutter logs for this.
2016-06-17	watch: introduce watch directive
	This will allow users to run importers off existing mail accounts where they may not have access to run -mda. Currently, we only support Maildirs, but IMAP ought to be doable.