From: Eric Wong <normalperson@yhbt.net>
To: ruby-core@ruby-lang.org
Subject: [ruby-core:81089] Re: [ruby-cvs:65407] normal:r58236 (trunk): thread.c: comments on M:N threading [ci skip]
Date: Wed, 10 May 2017 10:04:23 +0000 [thread overview]
Message-ID: <20170510100423.GA23016@starla> (raw)
In-Reply-To: <5c6c1af4-109f-ebd7-07aa-0f8dd77df9b8@atdot.net>
SASADA Koichi <ko1@atdot.net> wrote:
> Thank you.
>
> (quote it first)
> > Do you have any deadlines or priorities?
>
> No. I want to make clear your priority to avoid conflicts with me.
OK, good to know.
> On 2017/05/10 3:51, Eric Wong wrote:
> >> (1) lightweight fiber switching by pointer-exchange
> >> (w/o copying context).
> >
> > Out of all tasks here, I am least familiar with this (1).
> > This will be learning experience for me.
> >
> >> (2) auto-fiber swiching
> >> (2-1) implement with epoll/kqueue/select
> >> (2-2) design APIs to use it
> >
> > I think I will start on the select implementation first for
> > portability, but model our internal API around epoll(*).
> > I will probably implement epoll support last, since I am
> > most familiar with it.
> >
> > (*) with current GVL, I expect our kqueue+kevent implementation
> > will be faster than epoll in most cases (the API requires
> > fewer syscalls). select might be fastest with few FDs.
>
> (1) and (2) are independent so that we can do it parallel.
>
> Do you want to try (1) first or can I try (1)? Yes, Doing (1) is good to
> learn core internal, but maybe (1) affect many places in VMs. So I want
> to try. Anyway we should make new ticket and discuss on it.
You should do (1) first, you are the VM expert :)
> I guess (2) is not so easy to design APIs.
Lets keep changes to C-API internal and experiment, first.
First start with modifying rb_wait_for_single_fd() and
rb_waitpid() to be auto-Fiber-aware. They will register event
watcher and call Fiber.yield instead of releasing GVL to sleep
when waiting.
Later, we can modify rb_thread_fd_select() and rb_thread_sleep*()
and maybe others.
> * We need to survey other languages
I will study the GHC IO manager, I think they are similar to my
vision of using EV_ONESHOT/EPOLLONESHOT with multi-core support:
http://haskell.cs.yale.edu/wp-content/uploads/2013/08/hask035-voellmy.pdf
(and also similar to what I used for cmogstored)
I do not know the Haskell language, so I will need to study it some.
> * We need to define blocking operations:
> * blocking operation can switch Fibers automatically (I/O read, ...)
> * blocking operation can switch Fibers automatically
> (other than epoll/kqueue/select manage-able operations,
> extra-C exts providing blocking operations, ...)
Basically, I want auto-Fiber to behave like 1.8 green threads,
but without timer-based switching. Fiber switch should only
happen when operations cannot proceed (I/O, waitpid, sleep,
etc), or when user calls Fiber.yield.
> And how to provide such difference to users?
> * Idea: Documentation
> * example: POSIX singal safe functions
> * example: Java's thread-safety
> Generally, it is hard to use because users should
> them carefully (and usually people don't).
> * Idea: Provide new APIs which support auto-fibers
> (and other blocking operations don't support)
> * example: EventMachine, ... (other language example? Python?)
> * it is clear for users.
> * it is hard to import existing code
Exactly, new APIs will take more time to adopt. I don't think
it is necessary to introduce new IO APIs. Currently, users
expect Thread switch when doing blocking IO (GVL release);
it should be easy to understand auto-Fiber switch if IO
would block (like 1.8 Thread)
Also, there is a NeverBlock RubyGem which made Fibers automatic
(like 1.8 threads), but development stopped years ago.
Ideally, I want existing code to be able to use net/* in stdlib
(and similar) with minimal modification: s/Thread.new/auto-Fiber.new/
> * Idea: Provide a new TracePoint probe
> to know blocking operation which does not support auto-fibers.
> * This idea is for advanced user to check their scheduling
> * I think it is enough because
> * advanced user should be production maker.
> Automatic tools are preferable.
> * not advanced user don't care which operation can stop
> forever w/o auto-fiber switching
Yes, we can add this once the auto-switch is implemented :)
> * We need to define auto-Fiber constructor
Perhaps that is a job for matz :)
> * ...
>
> Ah, I remember that we have (2') providing epoll/kqueue like Ruby
> interface. (2) use them in scheduler internally and only auto-fibers use
> it. However, someone want to use them and want to write their own
> scheduler (like nodejs culture). I'm not sure we should expose such
> interface but it is valuable to consider. If we decide to provide such
> APIs, we need to share the implementation (or shouldn't?). Furthermore,
> it is more easy to provide such APIs compare with providing auto-fibers.
I don't think exposing new API is necessary, yet. I prefer we
focus on internal implementation changes, first, and expose
user-visible changes later.
<snip>
> >> (4) Re-implement Queue (some days ago you wrote)
> >
> > I already had some work-in-progress patches I can cleanup and
> > send out to redmine for review later (also ConditionVariable).
> > Last I remember, there was a small performance regression for
> > small Queue/Condvar waiter lists due to better locality on embed
> > structs. However, I think avoiding O(n) rb_ary_delete behavior
> > is more important for busy queues.
>
> OK.
Btw, that is [Feature #13552] - it might be ready.
> >> (please add your plan if you have others)
> >
> > I might break out thread.c and io.c into smaller files
> > (select/epoll/kqueue/timer_thread/copy_stream/...)
> > to make code organization easier.
>
> Not sure we can do it for io.c.
> Please ask someone else.
akr / nobu?
next prev parent reply other threads:[~2017-05-10 9:20 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20170402011414.AEA9B64CEE@svn.ruby-lang.org>
[not found] ` <8a2b82e3-dc07-1945-55f9-5a474e89130b@ruby-lang.org>
2017-04-02 2:35 ` [ruby-core:80531] Re: [ruby-cvs:65407] normal:r58236 (trunk): thread.c: comments on M:N threading [ci skip] Eric Wong
2017-04-02 3:05 ` [ruby-core:80532] " SASADA Koichi
2017-04-03 4:42 ` [ruby-core:80540] " Eric Wong
2017-05-08 0:33 ` [ruby-core:81027] " Eric Wong
2017-05-08 1:53 ` [ruby-core:81028] " SASADA Koichi
2017-05-08 2:16 ` [ruby-core:81029] " SASADA Koichi
2017-05-08 3:01 ` [ruby-core:81031] " Eric Wong
2017-05-08 3:42 ` [ruby-core:81033] " SASADA Koichi
2017-05-08 6:36 ` [ruby-core:81035] " Eric Wong
2017-05-09 2:18 ` [ruby-core:81042] " SASADA Koichi
2017-05-09 3:38 ` [ruby-core:81044] " Eric Wong
2017-05-09 4:11 ` [ruby-core:81045] " SASADA Koichi
2017-05-09 5:12 ` [ruby-core:81047] " Eric Wong
2017-05-09 5:47 ` [ruby-core:81049] " SASADA Koichi
2017-05-09 6:23 ` [ruby-core:81053] " Eric Wong
2017-05-09 6:44 ` [ruby-core:81054] " SASADA Koichi
2017-05-09 18:51 ` [ruby-core:81078] " Eric Wong
2017-05-10 3:24 ` [ruby-core:81083] " SASADA Koichi
2017-05-10 10:04 ` Eric Wong [this message]
2017-05-19 4:34 ` [ruby-core:81244] " Eric Wong
2017-06-20 19:16 ` [ruby-core:81733] " Eric Wong
2017-05-09 5:54 ` [ruby-core:81050] " SASADA Koichi
2017-05-09 6:15 ` [ruby-core:81052] " Eric Wong
2017-05-08 2:56 ` [ruby-core:81030] " Eric Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-list from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://www.ruby-lang.org/en/community/mailing-lists/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170510100423.GA23016@starla \
--to=ruby-core@ruby-lang.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).