From: Konstantin Ryabitsev
Date: Tue, 5 Feb 2019 23:47:01 +0500
Subject: Re: [PATCH] TODO: add item for "scraper" importers
To: Eric Wong
Cc: meta@public-inbox.org
References: <20190205113224.13587-1-e@80x24.org>
In-Reply-To: <20190205113224.13587-1-e@80x24.org>

On Tue, 5 Feb 2019 at 16:32, Eric Wong wrote:
> The git-users mailing list is on Google Groups with obfuscated
> addresses and censored archives. We should allow users to
> import them soon, as obfuscated/censored archives are better
> than not having archives at all when Google decides to shut down
> yet-another-service.

I've written an importer script (in Python) for our purposes:

https://git.kernel.org/pub/scm/linux/kernel/git/mricon/korg-helpers.git/tree/list-archive-maker.py

It doesn't do Google at this time, but it can handle Mailman and
grabbing things from Gmane NNTP. Perhaps it'll come in handy to
identify some of the hoops I've already had to jump through.

-K