From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Turner Subject: [PATCH v12 10/20] watchman: support watchman to reduce index refresh cost Date: Thu, 19 May 2016 17:45:47 -0400 Message-ID: <1463694357-6503-11-git-send-email-dturner@twopensource.com> References: <1463694357-6503-1-git-send-email-dturner@twopensource.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: QUOTED-PRINTABLE Cc: David Turner To: git@vger.kernel.org, pclouds@gmail.com X-From: git-owner@vger.kernel.org Thu May 19 23:47:02 2016 Return-path: Envelope-to: gcvg-git-2@plane.gmane.org Received: from vger.kernel.org ([209.132.180.67]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1b3VmX-0006fg-Cp for gcvg-git-2@plane.gmane.org; Thu, 19 May 2016 23:47:01 +0200 Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932471AbcESVqx convert rfc822-to-quoted-printable (ORCPT ); Thu, 19 May 2016 17:46:53 -0400 Received: from mail-qk0-f178.google.com ([209.85.220.178]:36189 "EHLO mail-qk0-f178.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932358AbcESVqt (ORCPT ); Thu, 19 May 2016 17:46:49 -0400 Received: by mail-qk0-f178.google.com with SMTP id x7so56505051qkd.3 for ; Thu, 19 May 2016 14:46:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=twopensource-com.20150623.gappssmtp.com; s=20150623; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=/7U9xDoQyiLgzeN7J1tERNGJ4Yv4GDqMHD3j0EMKlO0=; b=KMeQ81d1ZDNnfr5ZQooFq+bG0Pz2kl8rU9qYHVwmIIxfEJEEvfIHuXpIg0VR4k48Ef zXmYaT08x65uIhaBbG35N8JWYIqDMXlGEOWYkv76cxST1kDMpa0ZELD/A4QVAM+OgPMD fvHM6AYElgbJ3q3m0zHRFM2VUFljZ4Q0vPo+Sx2NabFKCQ9GkxVLg5z15wk4bVG5AOCc 4RxeSZWzeTl+/4/Pxq4XGfgzYIn5Wf89kAhp89M7bqcg9YtmWBLhrEE5iJ7i+pkg1rf9 IQl07FKelmiHL/heVqMk0Ssm4p7qnciV35KRWNYgaWex6c9ZtGzD+WJM+abEAM2qsp1o PCxA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=/7U9xDoQyiLgzeN7J1tERNGJ4Yv4GDqMHD3j0EMKlO0=; b=ajWiP1SR28l+OmhwNpp6G5AgJw1CyK5n2d+IJBpVnBlox2i7rd+sqDP/1oWCq+Rgxl F5spazEKrAMo6DiPZv0cL201HiNvQe38m3sAbZP1V+6CR9ezXhpB6s48x8+cFZxwzKQS T4EETdbAocssNgiGJdtPBhp417DeXATGmvVnp/1qhfdCUt5B+fbabrqjoEZPHlw2aHXm fR85raLoPrwbUkSN7m8MJ2JJ34H4Jo6kjs4tWDH5FLqrKz6KluVHQ+jkl8ji4RqGsjJ8 avPo3PZVFQpRW21pNl2IFymgRjQHVPUoKSj6oNxVFq9D63AygX8wn+f5ZzZ+YiItTEHq BNIg== X-Gm-Message-State: AOPr4FWHu8Snu8HftRTQIcyzf+2am83y45u2h4AelfsXW4C/ovzrZAMn7wHy6ftIV/wjow== X-Received: by 10.55.50.206 with SMTP id y197mr16693576qky.199.1463694408792; Thu, 19 May 2016 14:46:48 -0700 (PDT) Received: from twopensource.com (207-38-164-98.c3-0.43d-ubr2.qens-43d.ny.cable.rcn.com. [207.38.164.98]) by smtp.gmail.com with ESMTPSA id r124sm7424730qhr.48.2016.05.19.14.46.47 (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Thu, 19 May 2016 14:46:48 -0700 (PDT) X-Mailer: git-send-email 2.4.2.767.g62658d5-twtrsrc In-Reply-To: <1463694357-6503-1-git-send-email-dturner@twopensource.com> Sender: git-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org Archived-At: =46rom: Nguy=E1=BB=85n Th=C3=A1i Ng=E1=BB=8Dc Duy The previous patch has the logic to clear bits in 'WAMA' bitmap. This patch has logic to set bits as told by watchman. The missing bit, _using_ these bits, are not here yet. A lot of this code is written by David Turner originally, mostly from [1]. I'm just copying and polishing it a bit. [1] http://article.gmane.org/gmane.comp.version-control.git/248006 Signed-off-by: Nguy=E1=BB=85n Th=C3=A1i Ng=E1=BB=8Dc Duy Signed-off-by: David Turner --- Makefile | 12 +++++ cache.h | 1 + config.c | 5 ++ configure.ac | 8 ++++ environment.c | 3 ++ watchman-support.c | 135 +++++++++++++++++++++++++++++++++++++++++++++= ++++++++ watchman-support.h | 7 +++ 7 files changed, 171 insertions(+) create mode 100644 watchman-support.c create mode 100644 watchman-support.h diff --git a/Makefile b/Makefile index c8be0e7..65ab0f4 100644 --- a/Makefile +++ b/Makefile @@ -451,6 +451,7 @@ MSGFMT =3D msgfmt CURL_CONFIG =3D curl-config PTHREAD_LIBS =3D -lpthread PTHREAD_CFLAGS =3D +WATCHMAN_LIBS =3D GCOV =3D gcov =20 export TCL_PATH TCLTK_PATH @@ -1416,6 +1417,13 @@ else LIB_OBJS +=3D thread-utils.o endif =20 +ifdef USE_WATCHMAN + LIB_H +=3D watchman-support.h + LIB_OBJS +=3D watchman-support.o + WATCHMAN_LIBS =3D -lwatchman + BASIC_CFLAGS +=3D -DUSE_WATCHMAN +endif + ifdef HAVE_PATHS_H BASIC_CFLAGS +=3D -DHAVE_PATHS_H endif @@ -2025,6 +2033,9 @@ git-remote-testsvn$X: remote-testsvn.o GIT-LDFLAG= S $(GITLIBS) $(VCSSVN_LIB) $(QUIET_LINK)$(CC) $(ALL_CFLAGS) -o $@ $(ALL_LDFLAGS) $(filter %.o,$^= ) $(LIBS) \ $(VCSSVN_LIB) =20 +git-index-helper$X: index-helper.o GIT-LDFLAGS $(GITLIBS) + $(QUIET_LINK)$(CC) $(ALL_CFLAGS) -o $@ $(ALL_LDFLAGS) $(filter %.o,$^= ) $(LIBS) $(WATCHMAN_LIBS) + $(REMOTE_CURL_ALIASES): $(REMOTE_CURL_PRIMARY) $(QUIET_LNCP)$(RM) $@ && \ ln $< $@ 2>/dev/null || \ @@ -2164,6 +2175,7 @@ GIT-BUILD-OPTIONS: FORCE @echo NO_PYTHON=3D\''$(subst ','\'',$(subst ','\'',$(NO_PYTHON)))'\' = >>$@+ @echo NO_UNIX_SOCKETS=3D\''$(subst ','\'',$(subst ','\'',$(NO_UNIX_SO= CKETS)))'\' >>$@+ @echo NO_MMAP=3D\''$(subst ','\'',$(subst ','\'',$(NO_MMAP)))'\' >>$@= + + @echo USE_WATCHMAN=3D\''$(subst ','\'',$(subst ','\'',$(USE_WATCHMAN)= ))'\' >>$@+ ifdef TEST_OUTPUT_DIRECTORY @echo TEST_OUTPUT_DIRECTORY=3D\''$(subst ','\'',$(subst ','\'',$(TEST= _OUTPUT_DIRECTORY)))'\' >>$@+ endif diff --git a/cache.h b/cache.h index f10992d..452aea2 100644 --- a/cache.h +++ b/cache.h @@ -696,6 +696,7 @@ extern char *git_replace_ref_base; =20 extern int fsync_object_files; extern int core_preload_index; +extern int core_watchman_sync_timeout; extern int core_apply_sparse_checkout; extern int precomposed_unicode; extern int protect_hfs; diff --git a/config.c b/config.c index 9ba40bc..e6dc141 100644 --- a/config.c +++ b/config.c @@ -882,6 +882,11 @@ static int git_default_core_config(const char *var= , const char *value) return 0; } =20 + if (!strcmp(var, "core.watchmansynctimeout")) { + core_watchman_sync_timeout =3D git_config_int(var, value); + return 0; + } + if (!strcmp(var, "core.createobject")) { if (!strcmp(value, "rename")) object_creation_mode =3D OBJECT_CREATION_USES_RENAMES; diff --git a/configure.ac b/configure.ac index 0cd9f46..334d63b 100644 --- a/configure.ac +++ b/configure.ac @@ -1099,6 +1099,14 @@ AC_COMPILE_IFELSE([BSD_SYSCTL_SRC], HAVE_BSD_SYSCTL=3D]) GIT_CONF_SUBST([HAVE_BSD_SYSCTL]) =20 +# +# Check for watchman client library + +AC_CHECK_LIB([watchman], [watchman_connect], + [USE_WATCHMAN=3DYesPlease], + [USE_WATCHMAN=3D]) +GIT_CONF_SUBST([USE_WATCHMAN]) + ## Other checks. # Define USE_PIC if you need the main git objects to be built with -fP= IC # in order to build and link perl/Git.so. x86-64 seems to need this. diff --git a/environment.c b/environment.c index 6dec9d0..35e03c7 100644 --- a/environment.c +++ b/environment.c @@ -94,6 +94,9 @@ int core_preload_index =3D 1; */ int ignore_untracked_cache_config; =20 +int core_watchman_sync_timeout =3D 300; + + /* This is set by setup_git_dir_gently() and/or git_default_config() *= / char *git_work_tree_cfg; static char *work_tree; diff --git a/watchman-support.c b/watchman-support.c new file mode 100644 index 0000000..dc8cd51 --- /dev/null +++ b/watchman-support.c @@ -0,0 +1,135 @@ +#include "cache.h" +#include "watchman-support.h" +#include "strbuf.h" +#include "dir.h" +#include + +static struct watchman_query *make_query(const char *last_update) +{ + struct watchman_query *query =3D watchman_query(); + watchman_query_set_fields(query, WATCHMAN_FIELD_NAME | + WATCHMAN_FIELD_EXISTS | + WATCHMAN_FIELD_NEWER); + watchman_query_set_empty_on_fresh(query, 1); + query->sync_timeout =3D core_watchman_sync_timeout; + if (*last_update) + watchman_query_set_since_oclock(query, last_update); + return query; +} + +static struct watchman_query_result *query_watchman( + struct index_state *istate, struct watchman_connection *connection, + const char *fs_path, const char *last_update) +{ + struct watchman_error wm_error; + struct watchman_query *query; + struct watchman_expression *expr; + struct watchman_query_result *result; + + query =3D make_query(last_update); + expr =3D watchman_true_expression(); + result =3D watchman_do_query(connection, fs_path, query, expr, &wm_er= ror); + watchman_free_query(query); + watchman_free_expression(expr); + + if (!result) + warning("Watchman query error: %s (at %s)", + wm_error.message, + *last_update ? last_update : "the beginning"); + + return result; +} + +static void update_index(struct index_state *istate, + struct watchman_query_result *result) +{ + int i; + + if (result->is_fresh_instance) { + /* let refresh clear them later */ + for (i =3D 0; i < istate->cache_nr; i++) + istate->cache[i]->ce_flags |=3D CE_WATCHMAN_DIRTY; + goto done; + } + + for (i =3D 0; i < result->nr; i++) { + struct watchman_stat *wm =3D result->stats + i; + int pos; + + if (S_ISDIR(wm->mode) || + !strncmp(wm->name, ".git/", 5) || + strstr(wm->name, "/.git/")) + continue; + + pos =3D index_name_pos(istate, wm->name, strlen(wm->name)); + if (pos < 0) { + if (istate->untracked) { + char *name =3D xstrdup(wm->name); + char *dname =3D dirname(name); + + /* + * dirname() returns '.' for the root, + * but we call it ''. + */ + if (dname[0] =3D=3D '.' && dname[1] =3D=3D 0) + string_list_append(&istate->untracked->invalid_untracked, ""); + else + string_list_append(&istate->untracked->invalid_untracked, + dname); + free(name); + } + continue; + } + /* FIXME: ignore staged entries and gitlinks too? */ + + istate->cache[pos]->ce_flags |=3D CE_WATCHMAN_DIRTY; + } + +done: + free(istate->last_update); + istate->last_update =3D xstrdup(result->clock); + istate->cache_changed |=3D WATCHMAN_CHANGED; + if (istate->untracked) + string_list_remove_duplicates(&istate->untracked->invalid_untracked,= 0); +} + +int check_watchman(struct index_state *istate) +{ + struct watchman_error wm_error; + struct watchman_connection *connection; + struct watchman_query_result *result; + const char *fs_path; + struct timeval timeout; + /* + * Convert core_watchman_sync_timeout, in milliseconds, to + * struct timeval, in seconds and microseconds. + */ + + fs_path =3D get_git_work_tree(); + if (!fs_path) + return -1; + + timeout.tv_sec =3D core_watchman_sync_timeout / 1000; + timeout.tv_usec =3D (core_watchman_sync_timeout % 1000) * 1000; + connection =3D watchman_connect(timeout, &wm_error); + + if (!connection) { + warning("Watchman watch error: %s", wm_error.message); + return -1; + } + + if (watchman_watch(connection, fs_path, &wm_error)) { + warning("Watchman watch error: %s", wm_error.message); + watchman_connection_close(connection); + return -1; + } + + + result =3D query_watchman(istate, connection, fs_path, istate->last_u= pdate); + watchman_connection_close(connection); + if (!result) + return -1; + update_index(istate, result); + watchman_free_query_result(result); + return 0; +} diff --git a/watchman-support.h b/watchman-support.h new file mode 100644 index 0000000..ee1ef2c --- /dev/null +++ b/watchman-support.h @@ -0,0 +1,7 @@ +#ifndef WATCHMAN_SUPPORT_H +#define WATCHMAN_SUPPORT_H + +struct index_state; +int check_watchman(struct index_state *index); + +#endif /* WATCHMAN_SUPPORT_H */ --=20 2.4.2.767.g62658d5-twtrsrc