From: "H.J. Lu via Libc-alpha"
Reply-To: "H.J. Lu"
To: Dave Martin
Cc: linux-arch, Len Brown, Tony Luck, GNU C Library, "Ravi V. Shankar",
 "Chang S. Bae", the arch/x86 maintainers, LKML, Dave Hansen,
 Andy Lutomirski, Linux API, Thomas Gleixner, Borislav Petkov, Ingo Molnar
Date: Tue, 6 Oct 2020 08:34:06 -0700
Subject: Re: [RFC PATCH 0/4] x86: Improve Minimum Alternate Stack Size
References: <20200929205746.6763-1-chang.seok.bae@intel.com>
 <20201005134534.GT6642@arm.com> <20201006092532.GU6642@arm.com>
 <20201006152553.GY6642@arm.com>
In-Reply-To: <20201006152553.GY6642@arm.com>
List-Id: Libc-alpha mailing list

On Tue, Oct 6, 2020 at 8:25 AM Dave Martin wrote:
>
> On Tue, Oct 06, 2020 at 05:12:29AM -0700, H.J. Lu wrote:
> > On Tue, Oct 6, 2020 at 2:25 AM Dave Martin wrote:
> > >
> > > On Mon, Oct 05, 2020 at 10:17:06PM +0100, H.J.
> > > Lu wrote:
> > > > On Mon, Oct 5, 2020 at 6:45 AM Dave Martin wrote:
> > > > >
> > > > > On Tue, Sep 29, 2020 at 01:57:42PM -0700, Chang S. Bae wrote:
> > > > > > During signal entry, the kernel pushes data onto the normal userspace
> > > > > > stack. On x86, the data pushed onto the user stack includes XSAVE state,
> > > > > > which has grown over time as new features and larger registers have been
> > > > > > added to the architecture.
> > > > > >
> > > > > > MINSIGSTKSZ is a constant provided in the kernel signal.h headers and
> > > > > > typically distributed in lib-dev(el) packages, e.g. [1]. Its value is
> > > > > > compiled into programs and is part of the user/kernel ABI. The MINSIGSTKSZ
> > > > > > constant indicates to userspace how much data the kernel expects to push on
> > > > > > the user stack, [2][3].
> > > > > >
> > > > > > However, this constant is much too small and does not reflect recent
> > > > > > additions to the architecture. For instance, when AVX-512 states are in
> > > > > > use, the signal frame size can be 3.5KB while MINSIGSTKSZ remains 2KB.
> > > > > >
> > > > > > The bug report [4] explains this as an ABI issue. The small MINSIGSTKSZ can
> > > > > > cause user stack overflow when delivering a signal.
> > > > > >
> > > > > > In this series, we suggest a couple of things:
> > > > > > 1. Provide a variable minimum stack size to userspace, as a similar
> > > > > >    approach to [5]
> > > > > > 2. Avoid using a too-small alternate stack
> > > > >
> > > > > I can't comment on the x86 specifics, but the approach followed in this
> > > > > series does seem consistent with the way arm64 populates
> > > > > AT_MINSIGSTKSZ.
> > > > >
> > > > > I need to dig up my glibc hacks for providing a sysconf interface to
> > > > > this...
> > > >
> > > > Here is my proposal for glibc:
> > > >
> > > > https://sourceware.org/pipermail/libc-alpha/2020-September/118098.html
> > >
> > > Thanks for the link.
> > >
> > > Are there patches yet?
> > > I already had some hacks in the works, but I can
> > > drop them if there's something already out there.
> >
> > I am working on it.
>
> OK. I may post something for discussion, but I'm happy for it to be
> superseded by someone (i.e., other than me) who actually knows what
> they're doing...

Please see my previous email for my glibc patch:

https://gitlab.com/x86-glibc/glibc/-/commits/users/hjl/AT_MINSIGSTKSZ

> > >
> > > > 1. Define SIGSTKSZ and MINSIGSTKSZ to 64KB.
> > >
> > > Can we do this? IIUC, this is an ABI break and carries the risk of
> > > buffer overruns.
> > >
> > > The reason for not simply increasing the kernel's MINSIGSTKSZ #define
> > > (apart from the fact that it is rarely used, due to glibc's shadowing
> > > definitions) was that userspace binaries will have baked in the old
> > > value of the constant and may be making assumptions about it.
> > >
> > > For example, the type (char [MINSIGSTKSZ]) changes if this #define
> > > changes. This could be a problem if a newly built library tries to
> > > memcpy() or dump such an object defined by an old binary.
> > > Bounds-checking and the stack sizes passed to things like sigaltstack()
> > > and makecontext() could similarly go wrong.
> >
> > With my original proposal:
> >
> > https://sourceware.org/pipermail/libc-alpha/2020-September/118028.html
> >
> > char [MINSIGSTKSZ] won't compile. The feedback is to increase the
> > constants:
> >
> > https://sourceware.org/pipermail/libc-alpha/2020-September/118092.html
>
> Ah, I see. But both are still API and ABI breaks; moreover, declaring an
> array with size based on (MIN)SIGSTKSZ is not just reasonable, but the
> obvious thing to do with this constant in many simple cases. Such usage
> is widespread, see:
>
> * https://codesearch.debian.net/search?q=%5BSIGSTKSZ%5D&literal=1
>
>
> Your two approaches seem to trade off two different sources of buffer
> overruns: undersized stacks versus ABI breaks across library boundaries.

We can't get everything we want.
> Since undersized stack is by far the more familiar problem and we at
> least have guard regions to help detect overruns, I'd vote to keep
> MINSIGSTKSZ and SIGSTKSZ as-is, at least for now.

Agree.

> Or are people reporting real stack overruns on x86 today?

I hope so.

>
> For arm64, we made large vectors on SVE opt-in, so that oversized signal
> frames are not seen by default. Would something similar be feasible on
> x86?
>
>
> > > > 2. Add _SC_RSVD_SIG_STACK_SIZE for signal stack size reserved by the kernel.
> > >
> > > How about "_SC_MINSIGSTKSZ"? This was my initial choice since only the
> > > discovery method is changing. The meaning of the value is exactly the
> > > same as before.
> > >
> > > If we are going to rename it though, it could make sense to go for
> > > something more directly descriptive, say, "_SC_SIGNAL_FRAME_SIZE".
> > >
> > > The trouble with including "STKSZ" is that it sounds like a
> > > recommendation for your stack size. While the signal frame size is
> > > relevant to picking a stack size, it's not the only thing to
> > > consider.
> >
> > The problem is that AT_MINSIGSTKSZ is the signal frame size used by
> > the kernel. The minimum stack size for a signal handler is more likely
> > AT_MINSIGSTKSZ + 1.5KB unless AT_MINSIGSTKSZ returns the signal
> > frame size used by the kernel + 6KB for the user application.
>
> Ack; to be correct, you also need to take into account which signals may
> be unmasked while running on this stack, and the stack requirements of
> all their handlers. Unfortunately, that's hard :(
>
> What's your view on my naming suggestions?

I used _SC_MINSIGSTKSZ:

https://gitlab.com/x86-glibc/glibc/-/commit/73ca53bfbc1c105bc579f55f15af011a07fcded9

>
> > > Also, do we need a _SC_SIGSTKSZ constant, or should the entire concept
> > > of a "recommended stack size" be abandoned?
> > > glibc can at least make a
> > > slightly more informed guess about suitable stack sizes than the kernel
> > > (and glibc already has to guess anyway, in order to determine the
> > > default thread stack size).
> >
> > Glibc should try to deduce the signal frame size if AT_MINSIGSTKSZ isn't
> > available.
>
> In my code, I generate _SC_SIGSTKSZ as the equivalent of
>
>     max(sysconf(_SC_MINSIGSTKSZ) * 4, SIGSTKSZ)
>
> which is >= the legacy value, and broadly representative of the
> relationship between MINSIGSTKSZ and SIGSTKSZ on most arches.
>
>
> What do you think?

sysconf(_SC_MINSIGSTKSZ) should be usable as-is for most cases.

>
> > > > 3. Deprecate SIGSTKSZ and MINSIGSTKSZ if _SC_RSVD_SIG_STACK_SIZE
> > > > is in use.
> > >
> > > Great if we can do it. I was concerned that this might be
> > > controversial.
> > >
> > > Would this just be a recommendation, or can we enforce it somehow?
> >
> > It is just an idea. We need to move away from constant SIGSTKSZ and
> > MINSIGSTKSZ.
>
> Totally agree with that.
>

With my glibc patch, building with -D_SC_MINSIGSTKSZ_SOURCE will fail to
compile if the source assumes a constant SIGSTKSZ or MINSIGSTKSZ.

-- 
H.J.