From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS31976 209.132.180.0/23 X-Spam-Status: No, score=-3.9 required=3.0 tests=AWL,BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_NONE shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by dcvr.yhbt.net (Postfix) with ESMTP id 6B4151F461 for ; Thu, 29 Aug 2019 18:32:15 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728924AbfH2ScN (ORCPT ); Thu, 29 Aug 2019 14:32:13 -0400 Received: from relay3-d.mail.gandi.net ([217.70.183.195]:52281 "EHLO relay3-d.mail.gandi.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728835AbfH2ScM (ORCPT ); Thu, 29 Aug 2019 14:32:12 -0400 X-Originating-IP: 1.186.12.40 Received: from localhost (unknown [1.186.12.40]) (Authenticated sender: me@yadavpratyush.com) by relay3-d.mail.gandi.net (Postfix) with ESMTPSA id 6AB7C60004; Thu, 29 Aug 2019 18:32:10 +0000 (UTC) Date: Fri, 30 Aug 2019 00:02:07 +0530 From: Pratyush Yadav To: Paul Mackerras Cc: git@vger.kernel.org Subject: Re: [PATCH v2] gitk: Make web links clickable Message-ID: <20190829183207.sy66tyxnnzgvjv35@yadavpratyush.com> References: <20190826221444.GB7402@blackberry> <20190829012702.GB3297@blackberry> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190829012702.GB3297@blackberry> User-Agent: NeoMutt/20180716 Sender: git-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org On 29/08/19 11:27AM, Paul Mackerras wrote: > This makes gitk look for http or https URLs in the commit description > and make the URLs clickable. Clicking on them will invoke an external > web browser with the URL. > > The web browser command is by default "xdg-open" on Linux, "open" on > MacOS, and "cmd /c start" on Windows. The command can be changed in > the preferences window, and it can include parameters as well as the > command name. If it is set to the empty string then URLs will no > longer be made clickable. > > Signed-off-by: Paul Mackerras > --- > v2: Match URLs anywhere, not just after [Bug]Link:. > > gitk | 51 ++++++++++++++++++++++++++++++++++++++++++++++++++- > 1 file changed, 50 insertions(+), 1 deletion(-) > > diff --git a/gitk b/gitk > index a14d7a1..2a0d00c 100755 > --- a/gitk > +++ b/gitk > @@ -7016,6 +7016,7 @@ proc commit_descriptor {p} { > > # append some text to the ctext widget, and make any SHA1 ID > # that we know about be a clickable link. > +# Also look for URLs of the form "http[s]://..." and make them web links. > proc appendwithlinks {text tags} { > global ctext linknum curview > > @@ -7032,6 +7033,18 @@ proc appendwithlinks {text tags} { > setlink $linkid link$linknum > incr linknum > } > + set wlinks [regexp -indices -all -inline -line \ > + {https?://[^[:space:]]+} $text] I know I suggested searching till the first non-whitespace character, but thinking more about, there are some problematic cases. Say someone has a commit message like: Foo bar baz (more details at https://example.com/hello) Or like: Check out https://foo.com, https://bar.com In the first example, the closing parenthesis gets included in the link, but shouldn't be. In the second, the comma after foo.com would be included in the link, but shouldn't be. So maybe use a more sophisticated regex? A quick Google search gives out the following options [0][1]. [0] gives the following regex: https?:\/\/(www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b([-a-zA-Z0-9()@:%_\+.~#?&//=]*) It is kind of ugly to look at, and I'm not even sure if there are any syntax differences with Tcl's regex library. [1] lists a bunch of regexes and which URLs they work on and which ones they don't. The smallest among them I found is: @^(https?|ftp)://[^\s/$.?#].[^\s]*$@iS Again, I'm not sure how well this would work with Tcl's regex library, or how commonly these URL patterns appear in actual commit messages. Just something to consider. [0] https://stackoverflow.com/questions/3809401/what-is-a-good-regular-expression-to-match-a-url [1] https://mathiasbynens.be/demo/url-regex [snip] -- Regards, Pratyush Yadav