user/dev discussion of public-inbox itself
From: Eric Wong <e@80x24.org>
To: meta@public-inbox.org
Cc: Eric Wong <e@80x24.org>
Subject: [PATCH 1/2] view: account for filter bugs which leak HTML into the repo
Date: Thu, 13 Nov 2014 21:53:00 +0000
Message-ID: <1415915581-2522-1-git-send-email-e@80x24.org>

Ugh, apparently there's a (yet-to-be-fixed) bug in the Filter
code which let the HTML part of a multipart message leak into
the repo and get displayed on the web UI.  Account for that in
the view code and skip any such part when rendering.
---
 lib/PublicInbox/View.pm | 14 ++++++++++++--
 1 file changed, 12 insertions(+), 2 deletions(-)
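
The Content-Type test added in both hunks of the diff can be tried
in isolation.  This is only a minimal sketch: skip_part() is a
hypothetical helper name used for illustration (the patch inlines
the test inside the walk_parts callbacks), and the sample
Content-Type strings are made up.

```perl
use strict;
use warnings;

# Hypothetical helper wrapping the same pattern the patch uses.
# Returns 1 if the part should be skipped, 0 otherwise.
sub skip_part {
	my ($ct) = @_;
	return (defined $ct && $ct =~ m!\btext/[xh]+tml\b!i) ? 1 : 0;
}

# Made-up sample values, including a missing Content-Type (undef).
my @samples = (
	'text/html; charset=UTF-8',
	'TEXT/XHTML',
	'text/plain; charset=us-ascii',
	'text/xml',
	'application/xhtml+xml',
	undef,
);

for my $ct (@samples) {
	printf "%-32s %s\n", $ct // '(undef)',
		skip_part($ct) ? 'skip' : 'keep';
}
```

Under this pattern only the first two samples are skipped.  Note
that application/xhtml+xml is not matched, since the pattern
requires the "text/" prefix.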

diff --git a/lib/PublicInbox/View.pm b/lib/PublicInbox/View.pm
index e76d904..b09c3ba 100644
--- a/lib/PublicInbox/View.pm
+++ b/lib/PublicInbox/View.pm
@@ -104,7 +104,12 @@ sub index_entry {
 	$mime->walk_parts(sub {
 		my ($part) = @_;
 		return if $part->subparts; # walk_parts already recurses
-		my $enc = enc_for($part->content_type) || $enc_msg || $enc_utf8;
+		my $ct = $part->content_type;
+
+		# account for filter bugs...
+		return if defined $ct && $ct =~ m!\btext/[xh]+tml\b!i;
+
+		my $enc = enc_for($ct) || $enc_msg || $enc_utf8;
 
 		if ($part_nr > 0) {
 			my $fn = $part->filename;
@@ -178,7 +183,12 @@ sub multipart_text_as_html {
 	$mime->walk_parts(sub {
 		my ($part) = @_;
 		return if $part->subparts; # walk_parts already recurses
-		my $enc = enc_for($part->content_type) || $enc_msg || $enc_utf8;
+		my $ct = $part->content_type;
+
+		# account for filter bugs...
+		return if defined $ct && $ct =~ m!\btext/[xh]+tml\b!i;
+
+		my $enc = enc_for($ct) || $enc_msg || $enc_utf8;
 
 		if ($part_nr > 0) {
 			my $fn = $part->filename;
-- 
EW


Thread overview: 2+ messages
2014-11-13 21:53 Eric Wong [this message]
2014-11-13 21:53 ` [PATCH 2/2] -learn: nuke HTML portions when training as ham Eric Wong

Code repositories for project(s) associated with this public inbox

	https://80x24.org/public-inbox.git
