From: Shawn Pearce <spearce@spearce.org> To: Johannes Sixt <j6t@kdbg.org> Cc: "Jeff King" <peff@peff.net>, "Junio C Hamano" <gitster@pobox.com>, "Ævar Arnfjörð" <avarab@gmail.com>, "Git Mailing List" <git@vger.kernel.org> Subject: Re: upload-pack is slow with lots of refs Date: Mon, 8 Oct 2012 23:46:05 -0700 [thread overview] Message-ID: <CAJo=hJsJgqZqPxucRcSgYSa0N3pcw5seT9vcu2BE8WwfJVrvKQ@mail.gmail.com> (raw) In-Reply-To: <5072EBD1.40500@kdbg.org> On Mon, Oct 8, 2012 at 8:05 AM, Johannes Sixt <j6t@kdbg.org> wrote: > Am 05.10.2012 18:57, schrieb Shawn Pearce: >> On Thu, Oct 4, 2012 at 11:24 PM, Johannes Sixt <j.sixt@viscovery.net> wrote: >>> Upload-pack can just start >>> advertising refs in the "v1" way and announce a "v2" capability and listen >>> for response in parallel. A v2 capable client can start sending "wants" or >>> some other signal as soon as it sees the "v2" capability. Upload-pack, >>> which was listening for responses in parallel, can interrupt its >>> advertisements and continue with v2 protocol from here. >>> >>> This sounds so simple (not the implementation, of course) - I must be >>> missing something. >> >> Smart HTTP is not bidirectional. The client can't cut off the server. > > Smart HTTP does not need it: you already posted a better solution (I'm > refering to "&v=2"). Yes but then it diverges even further from the native bidirectional protocol. >> Its also more complex to code the server to listen for a stop command >> from the client at the same time the server is blasting out useless >> references to the client. > > At least the server side does not seem to be that complex. See below. > Of course, the server blasted out some refs, but I'm confident that in > practice the client will be able to signal v2 capability after a few packets > of advertisements. You can switch on TCP_NODELAY for the first line with > the capabilities to ensure it goes out on the wire ASAP. ... > +static int client_spoke(void) > +{ > + struct pollfd pfd; > + pfd.fd = 0; > + pfd.events = POLLIN; > + return poll(&pfd, 1, 0) > 0 && > + (pfd.revents & (POLLIN|POLLHUP)); Except doing this in Java is harder on an arbitrary InputStream type. I guess we really only care about basic TCP, in which case we can use NIO to implement an emulation of poll, and SSH, where MINA SSHD probably doesn't provide a way to see if the client has given us data without blocking. That makes supporting v2 really hard in e.g. Gerrit Code Review. You could argue that its improper to attempt to implement a network protocol in a language whose standard libraries have gone out of their way to prevent you from polling to see if data is immediately available, but I prefer to ignore such arguments. As it turns out we don't really have this problem with git://. Clients can bury a v2 request in the extended headers where the host line appears today. Its a bit tricky because of that \0 bug causing infinite looping, but IIRC using \0\0 is safe even against ancient servers. So git:// and http:// both have a way where the client can ask for v2 support before the server speaks, and have it transparently be ignored by ancient servers. The only place we have a problem is SSH. That exec of the remote binary is just super-strict. Its good to be paranoid, but its also locked out any chance we have at doing the upgrade over SSH without having to run two SSH commands in the worst case. I guess the best approach is to try the v1 protocol by default, have the remote advertise it supports v2, and remember this on a per-host basis in ~/.gitconfig for future requests. Users could always force a specific preference with remote.NAME.uploadpack variable or --uploadpack command line flag.
next prev parent reply other threads:[~2012-10-09 6:46 UTC|newest] Thread overview: 38+ messages / expand[flat|nested] mbox.gz Atom feed top 2012-10-03 12:36 Ævar Arnfjörð Bjarmason 2012-10-03 13:06 ` Nguyen Thai Ngoc Duy 2012-10-03 18:03 ` Jeff King 2012-10-03 18:53 ` Junio C Hamano 2012-10-03 18:55 ` Jeff King 2012-10-03 19:41 ` Shawn Pearce 2012-10-03 20:13 ` Jeff King 2012-10-04 21:52 ` Sascha Cunz 2012-10-05 0:20 ` Jeff King 2012-10-05 6:24 ` Johannes Sixt 2012-10-05 16:57 ` Shawn Pearce 2012-10-08 15:05 ` Johannes Sixt 2012-10-09 6:46 ` Shawn Pearce [this message] 2012-10-09 20:30 ` Johannes Sixt 2012-10-09 20:46 ` Johannes Sixt 2012-10-03 20:16 ` Ævar Arnfjörð Bjarmason 2012-10-03 21:20 ` Jeff King 2012-10-03 22:15 ` Ævar Arnfjörð Bjarmason 2012-10-03 23:15 ` Jeff King 2012-10-03 23:54 ` Ævar Arnfjörð Bjarmason 2012-10-04 7:56 ` [PATCH 0/4] optimizing upload-pack ref peeling Jeff King 2012-10-04 7:58 ` [PATCH 1/4] peel_ref: use faster deref_tag_noverify Jeff King 2012-10-04 18:24 ` Junio C Hamano 2012-10-04 8:00 ` [PATCH 2/4] peel_ref: do not return a null sha1 Jeff King 2012-10-04 18:32 ` Junio C Hamano 2012-10-04 8:02 ` [PATCH 3/4] peel_ref: check object type before loading Jeff King 2012-10-04 19:06 ` Junio C Hamano 2012-10-04 19:41 ` Jeff King 2012-10-04 20:41 ` Junio C Hamano 2012-10-04 21:59 ` Jeff King 2012-10-04 8:03 ` [PATCH 4/4] upload-pack: use peel_ref for ref advertisements Jeff King 2012-10-04 8:04 ` [PATCH 0/4] optimizing upload-pack ref peeling Jeff King 2012-10-04 9:01 ` Ævar Arnfjörð Bjarmason 2012-10-04 12:14 ` Nazri Ramliy 2012-10-03 22:32 ` upload-pack is slow with lots of refs Ævar Arnfjörð Bjarmason 2012-10-03 23:21 ` Jeff King 2012-10-03 23:47 ` Ævar Arnfjörð Bjarmason 2012-10-03 19:13 ` Junio C Hamano
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style List information: http://vger.kernel.org/majordomo-info.html * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to='CAJo=hJsJgqZqPxucRcSgYSa0N3pcw5seT9vcu2BE8WwfJVrvKQ@mail.gmail.com' \ --to=spearce@spearce.org \ --cc=avarab@gmail.com \ --cc=git@vger.kernel.org \ --cc=gitster@pobox.com \ --cc=j6t@kdbg.org \ --cc=peff@peff.net \ --subject='Re: upload-pack is slow with lots of refs' \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: link
Code repositories for project(s) associated with this inbox: https://80x24.org/mirrors/git.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).