user/dev discussion of public-inbox itself
From: Eric Wong <e@80x24.org>
To: "Uwe Kleine-König" <u.kleine-koenig@pengutronix.de>
Cc: meta@public-inbox.org
Subject: Re: Setup woes
Date: Tue, 23 Feb 2021 20:01:24 +0000
Message-ID: <20210223200124.GA26348@dcvr>
In-Reply-To: <20210223154246.uso4tc2qf5bz6qym@pengutronix.de>

Uwe Kleine-König <u.kleine-koenig@pengutronix.de> wrote:
> Hello,
> 
> I'm currently (again) trying to set up a public-inbox instance and have
> two problems:
> 
>  1) Import of older mails
>     I used Konstantin Ryabitsev's script[1] to generate a directory of
>     mbox files. It's unclear to me how to easily import that into the
>     archive. Not knowing a better way, I'd put them all in a Maildir and
>     use public-inbox-watch to import them. Is there a nicer way?

scripts/import_vger_from_mbox is probably the best maintained
of the stuff in scripts for mass imports (it supports v2, unlike
the rest of scripts/).

I will work on making -watch faster (and modeled after lei
internals) once lei is farther along in public-inbox.git...

import_maildir is v1-only at the moment, but some of the stuff
developed for lei can make it easier to maintain going forward.

>  2) Apache configuration
>     I was able to set up Apache to serve individual archives but I fail
>     to get a listing of the lists at the root. That is, I can access the barebox
>     archive at http://lore.pengutronix.de/barebox/, but not the overview
>     of the archived lists at http://lore.pengutronix.de/.
> 
> 	$ curl -D - http://lore.pengutronix.de/
> 	HTTP/1.1 404 Not Found
> 	Date: Tue, 23 Feb 2021 15:31:54 GMT
> 	Server: Apache/2.4.38 (Debian)
> 	Expires: Fri, 01 Jan 1980 00:00:00 GMT
> 	Pragma: no-cache
> 	Cache-Control: no-cache, max-age=0, must-revalidate
> 	Content-Length: 9
> 	Content-Type: text/plain
> 
> 	Not Found

You might need to set publicinbox.wwwlisting to "all" or
"match=domain".  See publicinbox.wwwlisting in
public-inbox-config(5).

The default behavior is to 404 since the listing is a
fairly recent addition.
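
A minimal sketch of setting that option with git-config(1), assuming
the config file is at $PI_CONFIG, or else at $PI_DIR/config (which
would be /home/lore/.public-inbox/config with the PI_DIR from the
Apache config quoted below).  The value "all" is just one of the two
choices; adjust the path and value to your setup:

```shell
# Assumption: the config is at $PI_CONFIG, else $PI_DIR/config,
# else ~/.public-inbox/config.  Adjust to match your installation.
PI_CONFIG="${PI_CONFIG:-${PI_DIR:-$HOME/.public-inbox}/config}"
mkdir -p "$(dirname "$PI_CONFIG")"

# List every configured inbox at the root URL.
# "match=domain" is the other non-404 value.
git config -f "$PI_CONFIG" publicinbox.wwwlisting all

# Read the value back to confirm it was written.
git config -f "$PI_CONFIG" --get publicinbox.wwwlisting
```

The file is in git-config format, so editing it by hand and adding
"wwwlisting = all" under a [publicinbox] section should be equivalent.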

>     The Apache config looks as follows:
> 
> 	<VirtualHost *:80>
> 		ServerName lore.pengutronix.de
> 
> 		ServerAdmin webmaster@localhost
> 		DocumentRoot /srv/www/lore.pengutronix.de/root
> 
> 		<Directory "/srv/www/lore.pengutronix.de/root">
> 			AllowOverride None
> 			Require all granted
> 		</Directory>
> 
> 		LogLevel debug
> 
> 		ErrorLog ${APACHE_LOG_DIR}/error.log
> 		CustomLog ${APACHE_LOG_DIR}/access.log combined
> 
> 		Include conf-available/serve-cgi-bin.conf
> 		Include mods-available/rewrite.load
> 		SetEnv PI_DIR /home/lore/.public-inbox
> 		SetEnv NO_SCRIPT_NAME 1
> 
> 		Options +ExecCGI
> 		ScriptAlias /public-inbox.cgi "/usr/lib/cgi-bin/public-inbox.cgi"
> 
> 		DirectoryIndex public-inbox.cgi
> 		RewriteEngine On
> 		RewriteCond %{REQUEST_FILENAME} !-f
> 		RewriteCond %{REQUEST_FILENAME} !-d

Perhaps it's the -d check?  I haven't tested WwwListing with
Apache.

> 		RewriteRule ^.* /public-inbox.cgi/$0 [L,PT]
> 
> 	</VirtualHost>
> 
>      My guess is that the "SetEnv NO_SCRIPT_NAME 1" part doesn't work as
>      expected, but after some searching in the net I didn't find how to
>      do this properly.

Not sure, I haven't used Apache in ages and some of the configs
in examples/ are likely out-of-date and never tested with
WwwListing.

If you're not using public-inbox-httpd, then I strongly suggest
using at least mod_perl (and not plain CGI) since Perl startup
time is atrocious.

If you're dealing with large inboxes and supporting smart HTTP
clone/fetch, then use public-inbox-httpd (which works with
varnish, mod_proxy, or any other front-end proxy).

public-inbox-httpd can throttle CPU/memory intensive tasks
(e.g. git-http-backend, cgit) independently of other work
(the "limiter" stuff in public-inbox-config(5)).

As of public-inbox 1.6, -httpd also does git blob retrievals
around 20% faster on /T/, /t/ and t.mbox.gz requests and
decouples high-latency HDD from network ops.

> Any hints are welcome
> 
> Best regards and thanks in advance
> Uwe
> 
> [1] https://git.kernel.org/pub/scm/linux/kernel/git/mricon/korg-helpers.git/plain/list-archive-maker.py

Thread overview: 6+ messages
2021-02-23 15:42 Setup woes Uwe Kleine-König
2021-02-23 20:01 ` Eric Wong [this message]
2021-02-23 21:31   ` Uwe Kleine-König
2021-02-23 22:11     ` Eric Wong
2021-02-24 10:25       ` Uwe Kleine-König
2021-02-24 11:09         ` Eric Wong
