From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: X-Spam-Status: No, score=-2.9 required=3.0 tests=ALL_TRUSTED,BAYES_00 shortcircuit=no autolearn=unavailable version=3.3.2 X-Original-To: meta@public-inbox.org Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id B8EA020469; Sat, 9 Apr 2016 01:33:52 +0000 (UTC) Date: Sat, 9 Apr 2016 01:33:52 +0000 From: Eric Wong To: meta@public-inbox.org Subject: [PATCH] learn: drop leading "From " line from mboxes Message-ID: <20160409013352.GA1758@dcvr.yhbt.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline List-Id: It can confuse Email::MIME if we have it. --- script/public-inbox-learn | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/script/public-inbox-learn b/script/public-inbox-learn index 0c7b419..81675d0 100755 --- a/script/public-inbox-learn +++ b/script/public-inbox-learn @@ -17,7 +17,12 @@ if ($train !~ /\A(?:ham|spam)\z/) { } my $pi_config = PublicInbox::Config->new; -my $mime = Email::MIME->new(eval { local $/; <> }); +my $mime = Email::MIME->new(eval { + local $/; + my $data = scalar ; + $data =~ s/\AFrom [^\r\n]*\r?\n//s; + $data +}); # get all recipients my %dests; -- EW