[ruby-core:97018] [Ruby master Feature#16511] Staged warnings for keyword arguments

ruby-core@ruby-lang.org archive (unofficial mirror)
 help / color / mirror / Atom feed

From: eregontp@gmail.com
To: ruby-core@ruby-lang.org
Subject: [ruby-core:97018] [Ruby master Feature#16511] Staged warnings for keyword arguments
Date: Wed, 29 Jan 2020 10:42:55 +0000 (UTC)	[thread overview]
Message-ID: <redmine.journal-84107.20200129104254.c37413e05046cfa8@ruby-lang.org> (raw)
In-Reply-To: redmine.issue-16511.20200116043024@ruby-lang.org

Issue #16511 has been updated by Eregon (Benoit Daloze).

FWIW, I also discussed to keep `**empty_hash` in the case it's passed by the user in https://bugs.ruby-lang.org/issues/16519#note-5
Jeremy seems clearly against it, and I get that `**empty_hash` always passing nothing is simpler to understand.

I'm not sure what's the design of your proposal.
What do you change to keep *args-delegation working?

It seems unlikely Ruby core would accept to backport anything more complicated than #16463 to 2.7.

----------------------------------------
Feature #16511: Staged warnings for keyword arguments
https://bugs.ruby-lang.org/issues/16511#change-84107

* Author: Dan0042 (Daniel DeLorme)
* Status: Open
* Priority: Normal
* Assignee: 
* Target version: 
----------------------------------------
As an alternative to #16463 and #16494 I'd like to propose this approach, which I believe allows a **much** more flexible path for migration of keyword arguments.

The idea is to indicate for every Hash object if it's intended to represent a _keyword_ hash or a _data_ hash. This extra information is then used to generate more granular warnings depending on a user's compatibility needs.

The "keywordness" of a hash would be indicated by a **flag** on the Hash object; this is already implemented in 2.7 and is the approach favored by Matz. Let's call this flagged hash a "KwHash", and a non-flagged hash is just a "Hash". Note: this could also be implemented via a **subclass** of Hash (I personally favor this object-oriented approach) which was the original idea in this proposal.

I'll try to describe the idea in detail by breaking it down into figurative steps. (Skip to "Putting it all together" for the TL;DR version.) Imagine starting with ruby 2.6 and then:

### Step 1

When a double-splat or a brace-less hash is used, instead of a Hash it creates a KwHash.

```ruby
def foo(x) x end
foo(k:1).class      #=> KwHash
foo(**hash).class   #=> KwHash
[k:1].last.class    #=> KwHash
[**hash].last.class #=> KwHash
{**hash}.class      #=> Hash
```

At this point we haven't introduced any real change. Everything that worked before is still working the same way.
(With a minor exception if using the subclass approach: unusual code like `kw.class == Hash` would now return false.)

### Step 2

When there is ambiguity due to optional vs keyword argument, we rely on the last argument being Hash or KwHash to disambiguate.

```ruby
def foo(x=nil, **kw)
  [x,kw]
end
foo({k:1}) #=> [{k:1},{}]
foo(k:1)   #=> [nil,{k:1}]
```

This is the _minimum_ amount of incompatibility required to solve ALL bugs previously reported with keyword arguments. (#8040, #8316, #9898, #10856, #11236, #11967, #12104, #12717, #12821, #13336, #13647, #14130, etc.) 

The warnings for this would be about an impending _change of behavior_ in the _next ruby version_, where `foo({k:1})` is no longer interpreted as keyword argument.

### Step 3

Introduce additional incompatibility to improve clarity of design. Here we deprecate the automatic conversion of Hash to keyword argument; only KwHash is accepted. With a deprecation/warning phase, of course. The "automatic" promotion of a KwHash to a keyword argument follows the same rules as a Hash in 2.6; since the KwHash is conceptually intended to represent keyword arguments, this conversion makes sense in a way that a normal data Hash doesn't. We've taken the "last positional hash" concept and split it into "conceptually a hash" and "conceptually keyword arguments". _Most importantly_, all the changes required to silence these warnings are _compatible with 2.6_.

```ruby
def foo(x, **kw); end
foo(k:1)      # ArgumentError because x not specified
foo(1, {k:1}) # ArgumentError because too many arguments; Hash cannot be converted to KwHashs
opts = [k:1].first
foo(opts)     # opts is a KwHash therefore used as keyword argument; ArgumentError because x not specified
foo(1, opts)  # opts is a KwHash therefore used as keyword argument
```

The warnings for this would be about upcoming _errors_ for positional arguments: `foo(x:1)` will be "given 0, expected 1" and `foo(1,{x:2})` will be "given 2, expected 1". Such errors are useful when developing, but there is no new functionality per se, just a stricter syntax. So it's less important to escalate to an error and we can keep the warnings for longer than Step 2.

At this point we have achieved _almost-full_ **dynamic** keyword separation, as opposed to the current _almost-full_ **static** approach. I want to make the point here that, yes, keyword arguments **are** separated, it's just a different paradigm. With static separation, a keyword argument is defined lexically by a double-splat. With dynamic separation, a keyword argument is when the last argument is a KwHash. {{Note: I'm saying "almost-full" because KwHash is not promoted to keywords in `def foo(a,**kw);end;foo(x:1)` and because static keywords are auto-demoted to positional in `def foo(a);end;foo(x:1)`}}

Any form of delegation works with no change required. This preserves the behavior of 2.6 but only for KwHash objects. This is similar to having 2.7 with `ruby2_keywords` enabled by default. But also different in some ways; most notably it allows the case shown in #16494 to work by default:

```ruby
array = [x:1]
array.push(x:2)
array.map{ |x:| x } #=> [1,2]
[{x:3}].map{ |x:| x } #=> but this warns, as it should
```

The current approach does not allow this to work at all. The solution proposed in #16494 has all the same flaws as Hash-based keyword arguments; what happens to `each{ |x=nil,**kw| }` ? This solution allows a KwHash to be converted to... keywords. Very unsurprising.

Given that ruby is a dynamically-typed language I feel that dynamic typing of keywords if a more natural fit than static typing. But I realize that many disagree with that, which is why we continue to...

### Step 4

Introduce additional incompatibility to reach static/lexical separation of keyword arguments. Here we require that even a KwHash should be passed with a double-splat in order to qualify as a keyword argument.

```ruby
def bar(**kw)
end
def foo(**kw)
  bar(kw)   #=> error; KwHash passed without **
  bar(**kw) #=> ok
end
```

At this point we've reached the same behavior as 2.7. Delegation needs to be fixed, but as we know the changes required to silence these warnings are **not** compatible with 2.6 or 2.7. The warnings for this are _fundamentally not fixable_ as long as Step 2 has not been fixed. This is the core reason why `ruby2_keywords` is currently necessary in 2.7. So in the version after 2.7 we can enable these warnings by default since it's now possible to fix delegation to use static keywords. Except that gem authors who need to stay compatible with ≤2.7 cannot yet make these changes, so we introduce a way to _silence **only** these "Step 4" warnings_, for people who need to remain compatible with ≤2.7. And we keep them as warnings instead of errors until ruby 2.7 is EOL.

So instead of having to update a bunch of places with `ruby2_keywords` just to temporarily silence warnings, it's a single flag like `Warning[:ruby3_keywords]`. Once ruby 2.7 is EOL these become controlled by `Warning[:deprecated]` which tells people they **have** to fix their code. Which is just like the eventual deprecation of `ruby2_keywords`, just without the busy work of adding `ruby2_keywords` statements in the first place. But again, this introduces no new functionality, just a stricter syntax. So we can play nice and leave the warnings for a few years before changing to errors.

The question remains of how to handle #16494 here. Either disallow it entirely, but I think that would be a shame. Or just like #16494 suggests, allow hash unpacking in non-lambda Proc. Except that now it can be a KwHash instead of a Hash, which at least preserves dynamic keyword separation.

## Putting it all together (TL;DR)

The idea is _not_ to reimplement keyword argument separation; all that is needed is to implement the things above that are not in 2.7:
* Create a KwHash object for brace-less and double-splatted hashes.
* Differentiate the various types of warnings and allow to toggle on/off separately
  * Step 2 warnings _must_ be fixed now; cannot toggle off
  * Step 3 warnings _should_ be fixed now but you don't absolutely need to upgrade your gems just for that
  * Step 4 warnings _should_ be fixed in next version unless you need to support ≤2.7

I think that's all, really...

### Pros
* Cleaner way to solve #16494
* Better compatibility (at least until 2.6 is EOL)
   * delegation
   * storing an argument list that ends with a KwHash
   * destructuring iteration (#16494)
* We can avoid the "unfortunate corner case" as described in the [release notes](https://www.ruby-lang.org/en/news/2019/12/12/separation-of-positional-and-keyword-arguments-in-ruby-3-0/)
   * in 2.7 only do not output "Step 4" warnings, leave delegation like it was
   * in 2.8 the "Step 3" warnings have been fixed and a Hash will not be converted to keyword arguments
   * delegation can now safely be fixed to use the `**` syntax
* ruby2_keywords is not required, which is desirable because
   * it's a hidden flag _hack_
   * it requires to change the code now, and change it _again_ when ruby2_keywords is deprecated; twice the work; twice the gem upgrades
   * it was supposed to be used only for people who need to support 2.6 or below, but it's being misunderstood as an acceptable way to fix delegation in general
   * there's the non-zero risk that ruby2_keywords will never be removed, leaving us with a permanent "hack mode"
      * dynamic keywords are by far preferable to supporting ruby2_keywords forever
* Likely _better performance_, as the KwHash class can be optimized specifically for the characteristics of keyword arguments.
* More flexible migration
   * Allow more time to upgrade the hard stuff in Step 4
   * Can reach the _same_ goal as the current static approach
   * Larger "support zone" https://xkcd.com/2224/
   * Instead of wide-ranging incompatibilities all at once, there's the _possibility_ of making it finer-grained and more gradual
      * rubyists can _choose_ to migrate all at once or in smaller chunks
   * It hedges the risks by keeping more possibilities open for now.
   * It allows to cop-out at Step 3 if Step 4 turns out too hard because it breaks too much stuff

### Cons
* It allows to cop-out at Step 3 if Step 4 turns out too hard because it breaks too much stuff

-- 
https://bugs.ruby-lang.org/

next prev parent reply	other threads:[~2020-01-29 10:43 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <redmine.issue-16511.20200116043024@ruby-lang.org>
2020-01-16  4:30 ` [ruby-core:96878] [Ruby master Feature#16511] Subclass of Hash for keyword arguments daniel
2020-01-16  7:33 ` [ruby-core:96897] " eregontp
2020-01-16  8:47 ` [ruby-core:96906] " matz
2020-01-16  8:57 ` [ruby-core:96907] " shevegen
2020-01-16 15:55 ` [ruby-core:96912] " daniel
2020-01-16 17:36 ` [ruby-core:96913] " merch-redmine
2020-01-16 19:27 ` [ruby-core:96915] " daniel
2020-01-17  4:04 ` [ruby-core:96921] " daniel
2020-01-17 16:20 ` [ruby-core:96930] " daniel
2020-01-20  0:51 ` [ruby-core:96947] [Ruby master Feature#16511] Staged warnings " daniel
2020-01-29  3:37 ` [ruby-core:97015] " daniel
2020-01-29 10:42 ` eregontp [this message]
2020-01-29 14:16 ` [ruby-core:97019] " daniel
2020-01-29 16:57 ` [ruby-core:97021] " daniel

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.ruby-lang.org/en/community/mailing-lists/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=redmine.journal-84107.20200129104254.c37413e05046cfa8@ruby-lang.org \
    --to=ruby-core@ruby-lang.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).