From: Jonathan Nieder <jrnieder@gmail.com>
To: Ramkumar Ramachandra <artagnon@gmail.com>
Cc: Drew Northup <drew.northup@maine.edu>,
Junio C Hamano <gitster@pobox.com>,
Git List <git@vger.kernel.org>,
David Barr <david.barr@cordelta.com>,
Sverre Rabbelier <srabbelier@gmail.com>
Subject: Re: [PATCH 4/5] fast-export: Introduce --inline-blobs
Date: Sat, 22 Jan 2011 13:18:01 -0600 [thread overview]
Message-ID: <20110122191801.GB13023@burratino> (raw)
In-Reply-To: <20110122092416.GA7827@kytes>
Ramkumar Ramachandra wrote:
> Agreed. I wouldn't like to introduce an extra dependency either. I was
> talking about using it for prototyping- if the final version includes
> an extra dependency, it's unlikely to get merged into git.git :) The
> final design will probably use an in-memory B+ tree, but I haven't
> thought about that hard enough.
Immediate reaction:
Please no. There is a value to simplicity.
As you mention, the final form is a way off, so as long as people are
careful not to get locked into bad implementation decisions, I think
it is okay. I have refrained from nitpicking the implementation so
far because the design and interface are not obvious yet.
In this particular case, I am dreaming that we will discover a hidden
"mkdir -p" node-action in the dumpfile format so the list of
directories will not be needed. ;-)
Re Junio's critique:
is it possible to use
1) a table with callbacks? See the source code to unifdef for
inspiration.
2) separate code paths for different input states? fast-import.c
does this.
3) separate "parsing" and "acting" code? That can open the door to a
little paralellism, though not necessarily enough to matter.
parser actor
read a chunk
determine the first "thing to do"
pick up the first "thing to do"
do it
determine the next "thing to do"
pick up the next "thing to do"
do it
There is potential parallelism because the parser can keep
chugging along if the actor is blocked, say, writing its
output to the network.
Syntactically: the actor function (write_dump) calls the parser
function (next_command) to ask what to do next. If wanted, a later
refactoring could make that parser function just grab an action off
of a queue, while the parser proper runs in the background.
And of course if the "thing to do" data structure is simple enough,
this can also make the code easier to read.
I mention these ideas because some of them (especially #2) could make
the prototyping a lot easier as well as resulting in code that is
easier to review.
Re error handling:
Writing robust code (e.g., checking for errors) is also a lot easier
when done from the start. The svn-fe error handling is known to be a
problem (see BUGS in contrib/svn-fe/svn-fe.txt). So yes, I also
consider avoiding segfaults and deadlocks and catching parse errors to
be worthwhile things.
Thanks.
Jonathan
next prev parent reply other threads:[~2011-01-22 19:18 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-01-19 5:44 [RFC PATCH v2 0/5] Towards a Git-to-SVN bridge Ramkumar Ramachandra
2011-01-19 5:44 ` [PATCH 1/5] date: Expose the time_to_tm function Ramkumar Ramachandra
2011-01-19 5:44 ` [PATCH 2/5] vcs-svn: Start working on the dumpfile producer Ramkumar Ramachandra
2011-01-22 0:30 ` Junio C Hamano
2011-01-22 9:45 ` Ramkumar Ramachandra
2011-01-19 5:44 ` [PATCH 3/5] Build an svn-fi target in contrib/svn-fe Ramkumar Ramachandra
2011-01-19 5:44 ` [PATCH 4/5] fast-export: Introduce --inline-blobs Ramkumar Ramachandra
2011-01-19 19:50 ` Junio C Hamano
2011-01-19 21:48 ` Jonathan Nieder
2011-01-20 4:50 ` Ramkumar Ramachandra
2011-01-20 5:48 ` Jonathan Nieder
2011-01-20 6:28 ` Ramkumar Ramachandra
2011-01-20 13:53 ` Drew Northup
2011-01-22 9:24 ` Ramkumar Ramachandra
2011-01-22 19:18 ` Jonathan Nieder [this message]
2011-01-20 5:41 ` Jonathan Nieder
2011-01-22 0:30 ` Junio C Hamano
2011-01-19 5:44 ` [PATCH 5/5] vcs-svn: Add dir_cache for svnload Ramkumar Ramachandra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: http://vger.kernel.org/majordomo-info.html
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110122191801.GB13023@burratino \
--to=jrnieder@gmail.com \
--cc=artagnon@gmail.com \
--cc=david.barr@cordelta.com \
--cc=drew.northup@maine.edu \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=srabbelier@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://80x24.org/mirrors/git.git
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).