user/dev discussion of public-inbox itself
 help / color / mirror / code / Atom feed
* public-inbox-init -V2 for lkml
@ 2018-07-04  9:25 Eric W. Biederman
  2018-07-04 10:50 ` [PATCH] v2: fill alternates with old epochs on init from mirrors Eric Wong
  0 siblings, 1 reply; 3+ messages in thread
From: Eric W. Biederman @ 2018-07-04  9:25 UTC (permalink / raw)
  To: meta; +Cc: Eric Wong


I was playing with using git to get my own local copy of lkml and
unless I did something wrong I have found a bug in publc-inbox-init.

Baring some cheats (aka cloning from git.kernel.org) I followed
the directions up on lore.kernel.org, and the archives did not display.

#	git clone --mirror https://lore.kernel.org/lkml/0 lkml/git/0.git
#	git clone --mirror https://lore.kernel.org/lkml/1 lkml/git/1.git
#	git clone --mirror https://lore.kernel.org/lkml/2 lkml/git/2.git
#	git clone --mirror https://lore.kernel.org/lkml/3 lkml/git/3.git
#	git clone --mirror https://lore.kernel.org/lkml/4 lkml/git/4.git
#	git clone --mirror https://lore.kernel.org/lkml/5 lkml/git/5.git
#	git clone --mirror https://lore.kernel.org/lkml/6 lkml/git/6.git
#
#	public-inbox-init -V2 lkml lkml/ https://lore.kernel.org/lkml linux-kernel@vger.kernel.org linux-kernel@archiver.kernel.org
#
#	public-inbox-index lkml

I was able to solve my issue by applying the following patch to
objects/info/alternatives.

--- all.git/objects/info/alternates-old 2018-07-04 04:18:17.734755092 -0500
+++ all.git/objects/info/alternates     2018-07-04 04:18:11.874754952 -0500
@@ -1 +1,7 @@
+../../git/0.git/objects
+../../git/1.git/objects
+../../git/2.git/objects
+../../git/3.git/objects
+../../git/4.git/objects
+../../git/5.git/objects
 ../../git/6.git/objects

AKA adding the older git directories to all.git.

Is that the correct fix?  Am I write in thinking that public-inbox-init
is failing to do that?  Or am I missing something?

Eric




^ permalink raw reply	[flat|nested] 3+ messages in thread

* [PATCH] v2: fill alternates with old epochs on init from mirrors
  2018-07-04  9:25 public-inbox-init -V2 for lkml Eric W. Biederman
@ 2018-07-04 10:50 ` Eric Wong
  2018-07-05  5:18   ` Eric W. Biederman
  0 siblings, 1 reply; 3+ messages in thread
From: Eric Wong @ 2018-07-04 10:50 UTC (permalink / raw)
  To: Eric W. Biederman; +Cc: meta

"Eric W. Biederman" <ebiederm@xmission.com> wrote:
> Is that the correct fix?  Am I write in thinking that public-inbox-init
> is failing to do that?  Or am I missing something?

Yes, filling in alternates is the correct fix.  Sorry about that,
you've found a bug which I failed to test :x

Below patch should fix it, thanks for the report:

-------8<------
Subject: [PATCH] v2: fill alternates with old epochs on init from mirrors

For v2 repositories with multiple epochs, we must not forget
about earlier epochs in clones.  Ensure we update the alternates
file with all known epochs up to the current one.

Reported-by: Eric W. Biederman <ebiederm@xmission.com>
  https://public-inbox.org/meta/871scj2vzi.fsf@xmission.com/
---
 lib/PublicInbox/V2Writable.pm | 12 +++++++++---
 t/v2mirror.t                  | 32 ++++++++++++++++++++++++--------
 2 files changed, 33 insertions(+), 11 deletions(-)

diff --git a/lib/PublicInbox/V2Writable.pm b/lib/PublicInbox/V2Writable.pm
index d8d75ec..412eb6a 100644
--- a/lib/PublicInbox/V2Writable.pm
+++ b/lib/PublicInbox/V2Writable.pm
@@ -495,15 +495,21 @@ sub git_init {
 	PublicInbox::Import::run_die(\@cmd);
 
 	my $alt = "$all/objects/info/alternates";
-	my $new_obj_dir = "../../git/$epoch.git/objects";
 	my %alts;
+	my @add;
 	if (-e $alt) {
 		open(my $fh, '<', $alt) or die "open < $alt: $!\n";
 		%alts = map { chomp; $_ => 1 } (<$fh>);
 	}
-	return $git_dir if $alts{$new_obj_dir};
+	foreach my $i (0..$epoch) {
+		my $dir = "../../git/$i.git/objects";
+		push @add, $dir if !$alts{$dir} && -d "$pfx/$i.git";
+	}
+	return $git_dir unless @add;
 	open my $fh, '>>', $alt or die "open >> $alt: $!\n";
-	print $fh "$new_obj_dir\n" or die "print >> $alt: $!\n";
+	foreach my $dir (@add) {
+		print $fh "$dir\n" or die "print >> $alt: $!\n";
+	}
 	close $fh or die "close $alt: $!\n";
 	$git_dir
 }
diff --git a/t/v2mirror.t b/t/v2mirror.t
index 4cd369b..c0c329c 100644
--- a/t/v2mirror.t
+++ b/t/v2mirror.t
@@ -45,11 +45,16 @@ To: You <you@example.com>
 Subject: a
 Date: Thu, 01 Jan 1970 00:00:00 +0000
 
+my $old_rotate_bytes = $v2w->{rotate_bytes};
+$v2w->{rotate_bytes} = 500; # force rotating
 for my $i (1..9) {
 	$mime->header_set('Message-ID', "<$i\@example.com>");
 	$mime->header_set('Subject', "subject = $i");
 	ok($v2w->add($mime), "add msg $i OK");
 }
+
+my $epoch_max = $v2w->{epoch_max};
+ok($epoch_max > 0, "multiple epochs");
 $v2w->done;
 
 my %opts = (
@@ -79,11 +84,14 @@ ok(defined $pid, 'forked httpd process successfully');
 my ($host, $port) = ($sock->sockhost, $sock->sockport);
 $sock = undef;
 
-my @cmd = (qw(git clone --mirror -q), "http://$host:$port/v2/0",
-	"$tmpdir/m/git/0.git");
+my @cmd;
+foreach my $i (0..$epoch_max) {
+	@cmd = (qw(git clone --mirror -q), "http://$host:$port/v2/$i",
+		"$tmpdir/m/git/$i.git");
 
-is(system(@cmd), 0, 'cloned OK');
-ok(-d "$tmpdir/m/git/0.git", 'mirror OK');;
+	is(system(@cmd), 0, 'cloned OK');
+	ok(-d "$tmpdir/m/git/$i.git", 'mirror OK');
+}
 
 @cmd = ("$script-init", '-V2', 'm', "$tmpdir/m", 'http://example.com/m',
 	'alt@example.com');
@@ -94,14 +102,23 @@ my $mibx = { mainrepo => "$tmpdir/m", address => 'alt@example.com' };
 $mibx = PublicInbox::Inbox->new($mibx);
 is_deeply([$mibx->mm->minmax], [$ibx->mm->minmax], 'index synched minmax');
 
+$v2w->{rotate_bytes} = $old_rotate_bytes;
 for my $i (10..15) {
 	$mime->header_set('Message-ID', "<$i\@example.com>");
 	$mime->header_set('Subject', "subject = $i");
 	ok($v2w->add($mime), "add msg $i OK");
 }
 $v2w->barrier;
-is(system('git', "--git-dir=$tmpdir/m/git/0.git", 'fetch', '-q'), 0,
-	'fetch successful');
+
+sub fetch_each_epoch {
+	foreach my $i (0..$epoch_max) {
+		my $dir = "$tmpdir/m/git/$i.git";
+		is(system('git', "--git-dir=$dir", 'fetch', '-q'), 0,
+			'fetch successful');
+	}
+}
+
+fetch_each_epoch();
 
 my $mset = $mibx->search->reopen->query('m:15@example.com', {mset => 1});
 is(scalar($mset->items), 0, 'new message not found in mirror, yet');
@@ -131,8 +148,7 @@ like($to_purge, qr/\A[a-f0-9]{40,}\z/, 'read blob to be purged');
 $mset = $ibx->search->reopen->query('m:10@example.com', {mset => 1});
 is(scalar($mset->items), 0, 'purged message gone from origin');
 
-is(system('git', "--git-dir=$tmpdir/m/git/0.git", 'fetch', '-q'), 0,
-	'fetch successful');
+fetch_each_epoch();
 {
 	open my $err, '+>', "$tmpdir/index-err" or die "open: $!";
 	my $ipid = fork;
-- 
zero. one. infinity!

^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH] v2: fill alternates with old epochs on init from mirrors
  2018-07-04 10:50 ` [PATCH] v2: fill alternates with old epochs on init from mirrors Eric Wong
@ 2018-07-05  5:18   ` Eric W. Biederman
  0 siblings, 0 replies; 3+ messages in thread
From: Eric W. Biederman @ 2018-07-05  5:18 UTC (permalink / raw)
  To: Eric Wong; +Cc: meta

Eric Wong <e@80x24.org> writes:

> "Eric W. Biederman" <ebiederm@xmission.com> wrote:
>> Is that the correct fix?  Am I write in thinking that public-inbox-init
>> is failing to do that?  Or am I missing something?
>
> Yes, filling in alternates is the correct fix.  Sorry about that,
> you've found a bug which I failed to test :x
>
> Below patch should fix it, thanks for the report:
>
> -------8<------
> Subject: [PATCH] v2: fill alternates with old epochs on init from mirrors
>
> For v2 repositories with multiple epochs, we must not forget
> about earlier epochs in clones.  Ensure we update the alternates
> file with all known epochs up to the current one.
>
> Reported-by: Eric W. Biederman <ebiederm@xmission.com>
>   https://public-inbox.org/meta/871scj2vzi.fsf@xmission.com/

Tested-by: "Eric W. Biederman" <ebiederm@xmission.com>
Acked-by: "Eric W. Biederman" <ebiederm@xmission.com>


I just reran my test case and objects/info/alternatives now lists
all of the repos.

Thanky you for the quick fix.


> ---
>  lib/PublicInbox/V2Writable.pm | 12 +++++++++---
>  t/v2mirror.t                  | 32 ++++++++++++++++++++++++--------
>  2 files changed, 33 insertions(+), 11 deletions(-)
>
> diff --git a/lib/PublicInbox/V2Writable.pm b/lib/PublicInbox/V2Writable.pm
> index d8d75ec..412eb6a 100644
> --- a/lib/PublicInbox/V2Writable.pm
> +++ b/lib/PublicInbox/V2Writable.pm
> @@ -495,15 +495,21 @@ sub git_init {
>  	PublicInbox::Import::run_die(\@cmd);
>  
>  	my $alt = "$all/objects/info/alternates";
> -	my $new_obj_dir = "../../git/$epoch.git/objects";
>  	my %alts;
> +	my @add;
>  	if (-e $alt) {
>  		open(my $fh, '<', $alt) or die "open < $alt: $!\n";
>  		%alts = map { chomp; $_ => 1 } (<$fh>);
>  	}
> -	return $git_dir if $alts{$new_obj_dir};
> +	foreach my $i (0..$epoch) {
> +		my $dir = "../../git/$i.git/objects";
> +		push @add, $dir if !$alts{$dir} && -d "$pfx/$i.git";
> +	}
> +	return $git_dir unless @add;
>  	open my $fh, '>>', $alt or die "open >> $alt: $!\n";
> -	print $fh "$new_obj_dir\n" or die "print >> $alt: $!\n";
> +	foreach my $dir (@add) {
> +		print $fh "$dir\n" or die "print >> $alt: $!\n";
> +	}
>  	close $fh or die "close $alt: $!\n";
>  	$git_dir
>  }
> diff --git a/t/v2mirror.t b/t/v2mirror.t
> index 4cd369b..c0c329c 100644
> --- a/t/v2mirror.t
> +++ b/t/v2mirror.t
> @@ -45,11 +45,16 @@ To: You <you@example.com>
>  Subject: a
>  Date: Thu, 01 Jan 1970 00:00:00 +0000
>  
> +my $old_rotate_bytes = $v2w->{rotate_bytes};
> +$v2w->{rotate_bytes} = 500; # force rotating
>  for my $i (1..9) {
>  	$mime->header_set('Message-ID', "<$i\@example.com>");
>  	$mime->header_set('Subject', "subject = $i");
>  	ok($v2w->add($mime), "add msg $i OK");
>  }
> +
> +my $epoch_max = $v2w->{epoch_max};
> +ok($epoch_max > 0, "multiple epochs");
>  $v2w->done;
>  
>  my %opts = (
> @@ -79,11 +84,14 @@ ok(defined $pid, 'forked httpd process successfully');
>  my ($host, $port) = ($sock->sockhost, $sock->sockport);
>  $sock = undef;
>  
> -my @cmd = (qw(git clone --mirror -q), "http://$host:$port/v2/0",
> -	"$tmpdir/m/git/0.git");
> +my @cmd;
> +foreach my $i (0..$epoch_max) {
> +	@cmd = (qw(git clone --mirror -q), "http://$host:$port/v2/$i",
> +		"$tmpdir/m/git/$i.git");
>  
> -is(system(@cmd), 0, 'cloned OK');
> -ok(-d "$tmpdir/m/git/0.git", 'mirror OK');;
> +	is(system(@cmd), 0, 'cloned OK');
> +	ok(-d "$tmpdir/m/git/$i.git", 'mirror OK');
> +}
>  
>  @cmd = ("$script-init", '-V2', 'm', "$tmpdir/m", 'http://example.com/m',
>  	'alt@example.com');
> @@ -94,14 +102,23 @@ my $mibx = { mainrepo => "$tmpdir/m", address => 'alt@example.com' };
>  $mibx = PublicInbox::Inbox->new($mibx);
>  is_deeply([$mibx->mm->minmax], [$ibx->mm->minmax], 'index synched minmax');
>  
> +$v2w->{rotate_bytes} = $old_rotate_bytes;
>  for my $i (10..15) {
>  	$mime->header_set('Message-ID', "<$i\@example.com>");
>  	$mime->header_set('Subject', "subject = $i");
>  	ok($v2w->add($mime), "add msg $i OK");
>  }
>  $v2w->barrier;
> -is(system('git', "--git-dir=$tmpdir/m/git/0.git", 'fetch', '-q'), 0,
> -	'fetch successful');
> +
> +sub fetch_each_epoch {
> +	foreach my $i (0..$epoch_max) {
> +		my $dir = "$tmpdir/m/git/$i.git";
> +		is(system('git', "--git-dir=$dir", 'fetch', '-q'), 0,
> +			'fetch successful');
> +	}
> +}
> +
> +fetch_each_epoch();
>  
>  my $mset = $mibx->search->reopen->query('m:15@example.com', {mset => 1});
>  is(scalar($mset->items), 0, 'new message not found in mirror, yet');
> @@ -131,8 +148,7 @@ like($to_purge, qr/\A[a-f0-9]{40,}\z/, 'read blob to be purged');
>  $mset = $ibx->search->reopen->query('m:10@example.com', {mset => 1});
>  is(scalar($mset->items), 0, 'purged message gone from origin');
>  
> -is(system('git', "--git-dir=$tmpdir/m/git/0.git", 'fetch', '-q'), 0,
> -	'fetch successful');
> +fetch_each_epoch();
>  {
>  	open my $err, '+>', "$tmpdir/index-err" or die "open: $!";
>  	my $ipid = fork;

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-07-05  5:18 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-07-04  9:25 public-inbox-init -V2 for lkml Eric W. Biederman
2018-07-04 10:50 ` [PATCH] v2: fill alternates with old epochs on init from mirrors Eric Wong
2018-07-05  5:18   ` Eric W. Biederman

Code repositories for project(s) associated with this public inbox

	https://80x24.org/public-inbox.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).