ruby-core@ruby-lang.org archive (unofficial mirror)
 help / color / mirror / Atom feed
From: shevegen@gmail.com
To: ruby-core@ruby-lang.org
Subject: [ruby-core:90505] [Ruby trunk Feature#15408] Deprecate object_id and _id2ref
Date: Thu, 13 Dec 2018 11:54:38 +0000 (UTC)	[thread overview]
Message-ID: <redmine.journal-75663.20181213115436.2db600a58ff9485a@ruby-lang.org> (raw)
In-Reply-To: redmine.issue-15408.20181213005326@ruby-lang.org

Issue #15408 has been updated by shevegen (Robert A. Heiler).


I originally wanted to write a very long reply but I think it becomes too difficult
to keep track of what is being said, so just my opinion in a somewhat more condensed
form than before:

- I have no strong opinion on _id2ref. I think I used it only once or twice in 13 years.

- I am not convinced that .object_id should be removed. We would lose some introspection
here, wouldn't we?

I would recommend to postpone deprecation after ruby 3.x IF it is decided to remove 
object_id. But I am not convinced that it should be removed - it is not only a question
of implementation details, or how people use it "incorrectly", but it is whether we
should be able to query object ids, find them in ObjectSpace etc...

I would suggest this to ask matz in an upcoming developer meeting since part of the
question is the intent of how matz may suggest or think that ruby users may use
object_id. I don't have much code that relies on object_id, so an any change would
not affect me that much - but I am in disfavour of removing it eventually without
really knowing what the alternatives are. Or whether we would just lose functionality
without any real gain, with which I would disagree very strongly. So my own opinion
is much closer as to what spatulasnout wrote.

PS: It is a bit difficult to reason in pro/con when it comes to "incorrect" usage
of ruby code. Ultimately the parser allows something or does not; and how matz 
designed ruby + ruby's philosophy plays in, which of course determines what the
parser allows (compare to python requiring () for method calls and ruby not
"caring" that much, unless it is ambiguous). Anything aside from that is heavily
subject to a personal opinion, and this becomes difficult to reason about.

The world is not going to end if object_id is removed; but it is not going to
end when object_id remains, either.

For sake of completion, I would also like to point out that the pickaxe mentioned
object_id several times, so I assume quite some people know and have used object_id
(and probably not many used _id2ref); and the name used to be .id if I recall
correctly before it was changed to object_id. But anyway, I highly recommend to
have this be discussed some time in 2019 at a developer meeting.

PSS: Perhaps for alternatives, such as WeakMap, it could be tested extensively,
perhaps as a separate gem, if only to provide a proof-of-concept.

----------------------------------------
Feature #15408: Deprecate object_id and _id2ref
https://bugs.ruby-lang.org/issues/15408#change-75663

* Author: headius (Charles Nutter)
* Status: Open
* Priority: Normal
* Assignee: 
* Target version: 
----------------------------------------
Ruby currently provides the object_id method to get a "identifier" for a given object. According to the documentation, this ID is the same for every object_id call against a given object, and guaranteed not to be the same as any other active (i.e. alive) object. However, no guarantee is made about the ID being reused for a future object after the original has been garbage collected.

As a result, object_id can't be used to uniquely identify any object that might be garbage collected, since that ID may be associated with a completely different object in the future.

Ruby also provides a method to go from an object_id to the object reference itself: ObjectSpace._id2ref. This method has been in Ruby for decades and is often used to implement a weak hashmap from ID to reference, since holding the ID will not keep the object alive. However due to the problems with object_id not actually being unique, it's possible for _id2ref to return a different object than originally had that ID as object slots are reused in the heap.

The only way to implement object_id safely (with idempotency guarantees) would be to assign to all objects a monotonically-increasing ID. Alternatively, this ID could be assigned lazily only for those objects on which the code calls object_id. JRuby implements object_id in this way currently.

The only way to implement _id2ref safely would be to have a mapping in memory from those monotonically-increasing IDs to the actual objects. This would have to be a weak mapping to prevent the objects from being garbage collected. JRuby currently only supports _id2ref via a flag, since the additional overhead of weakly tracking every requested object_id is extremely high. An alternative for MRI would be to implement _id2ref as a heap scan, as it is implemented in Rubinius. This would make it entirely unpractical due to the cost of scanning the heap for every ID lookup.

I propose that both methods should immediately be deprecated for removal in Ruby 3.0.

* They do not do what people expect.
* They cannot reliably do what they claim to do.
* They eventually lead to difficult-to-diagnose bugs in every possible use case.

Put simply, both methods have always been broken in MRI and making them unbroken would render them useless.



-- 
https://bugs.ruby-lang.org/

  parent reply	other threads:[~2018-12-13 11:54 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <redmine.issue-15408.20181213005326@ruby-lang.org>
2018-12-13  0:53 ` [ruby-core:90464] [Ruby trunk Feature#15408] Deprecate object_id and _id2ref headius
2018-12-13  3:55   ` [ruby-core:90482] " Bill Kelly
2018-12-13  4:07     ` [ruby-core:90484] " Bill Kelly
2018-12-13  0:58 ` [ruby-core:90465] " headius
2018-12-13  1:22 ` [ruby-core:90469] " shyouhei
2018-12-13  1:36 ` [ruby-core:90472] " ko1
2018-12-13  1:43 ` [ruby-core:90473] " headius
2018-12-13  1:47 ` [ruby-core:90474] " headius
2018-12-13  1:52 ` [ruby-core:90475] " headius
2018-12-13  2:00 ` [ruby-core:90476] " headius
2018-12-13  2:11 ` [ruby-core:90477] " pdahorek
2018-12-13  2:17 ` [ruby-core:90478] " headius
2018-12-13  2:21 ` [ruby-core:90479] " headius
2018-12-13  4:29 ` [ruby-core:90485] " duerst
2018-12-13  5:00 ` [ruby-core:90487] " shyouhei
2018-12-13  5:36 ` [ruby-core:90490] " headius
2018-12-13  7:11 ` [ruby-core:90496] " ko1
2018-12-13 11:32 ` [ruby-core:90504] " eregontp
2018-12-13 11:54 ` shevegen [this message]
2018-12-13 13:32 ` [ruby-core:90507] " mame
2018-12-13 13:56 ` [ruby-core:90508] " headius
2018-12-13 13:59 ` [ruby-core:90509] " headius
2018-12-13 14:03 ` [ruby-core:90510] " headius
2018-12-13 14:08 ` [ruby-core:90511] " headius
2018-12-13 14:52 ` [ruby-core:90512] " muraken
2018-12-13 15:05 ` [ruby-core:90513] " headius
2018-12-13 15:23 ` [ruby-core:90514] " eregontp
2018-12-13 15:36 ` [ruby-core:90515] " muraken
2018-12-13 15:46 ` [ruby-core:90516] " headius
2018-12-13 16:01 ` [ruby-core:90517] " hanmac
2018-12-13 16:12 ` [ruby-core:90518] " headius
2018-12-15 14:29 ` [ruby-core:90550] " matz
2018-12-16 10:49 ` [ruby-core:90559] " eregontp
2018-12-16 10:57 ` [ruby-core:90561] " naruse
2018-12-16 11:00 ` [ruby-core:90563] " eregontp
2019-01-09 17:46 ` [ruby-core:90950] " eregontp
2019-01-10 23:20   ` [ruby-core:91007] " Charles Oliver Nutter
2019-01-14  7:35 ` [ruby-core:91072] " naruse

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-list from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.ruby-lang.org/en/community/mailing-lists/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=redmine.journal-75663.20181213115436.2db600a58ff9485a@ruby-lang.org \
    --to=ruby-core@ruby-lang.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).