To: "naohirot@fujitsu.com", Noah Goldstein, "Lucas A. M. Magalhaes"
Subject: Re: [PATCH] benchtests: Add memset zero fill benchmark tests
Date: Tue, 20 Jul 2021 10:29:52 +0000
List-Id: Libc-alpha mailing list
From: Wilco Dijkstra via Libc-alpha
Reply-To: Wilco Dijkstra
Cc: GNU C Library

Hi Naohiro,

> In case of zero-over-zero, it's very interesting that memset_generic is faster than memset_a64fx.

That's because memset_generic uses DC ZVA for the full range rather than only for sizes > 8MB like in memset v2.

> On the other hand, however, in case of zero-over-one, memset_a64fx is faster than memset_generic.
> It may be due to SVE code.

Yes, you get more bandwidth with 512-bit SVE stores than with 128-bit Neon stores.

> Comparing the memset_generic performance, that is non SVE code, between zero-over-zero and
> zero-over-one, it seems that a64fx micro-architecture has some zero-over-zero optimization?

No, that does not prove that the previous value matters, just that storing zeroes is faster.

Cheers,
Wilco
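
For reference, the zero-over-zero and zero-over-one cases discussed above can be approximated with a small standalone C routine. This is only a minimal sketch and not the benchtests code from the patch: the names memset_fn, now_ns and time_zero_fill_ns are hypothetical, it times whichever memset the C library selects at run time (so it cannot compare memset_generic against memset_a64fx directly), and it assumes that timing each call with clock_gettime is accurate enough, which is only reasonable for larger sizes.

```c
#define _POSIX_C_SOURCE 200809L
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

/* Call memset through a volatile pointer so the compiler cannot
   remove or merge the stores (hypothetical name).  */
static void *(*volatile memset_fn) (void *, int, size_t) = memset;

/* Monotonic clock reading in nanoseconds (hypothetical helper).  */
static uint64_t
now_ns (void)
{
  struct timespec ts;
  clock_gettime (CLOCK_MONOTONIC, &ts);
  return (uint64_t) ts.tv_sec * 1000000000ull + (uint64_t) ts.tv_nsec;
}

/* Average time in ns of memset (buf, 0, size) when the buffer was
   previously filled with PREFILL (0: zero-over-zero, 1: zero-over-one).
   Returns -1.0 on invalid arguments, on allocation failure, or if the
   buffer is not all zero afterwards.  */
double
time_zero_fill_ns (size_t size, int prefill, int iters)
{
  if (size == 0 || iters <= 0)
    return -1.0;

  unsigned char *buf = malloc (size);
  if (buf == NULL)
    return -1.0;

  uint64_t total = 0;
  for (int i = 0; i < iters; i++)
    {
      /* Untimed: establish the previous contents of the buffer.  */
      memset_fn (buf, prefill, size);

      /* Timed: the zero fill itself.  */
      uint64_t start = now_ns ();
      memset_fn (buf, 0, size);
      total += now_ns () - start;
    }

  /* Sanity check that the zero fill really happened.  */
  int ok = 1;
  for (size_t i = 0; i < size; i++)
    if (buf[i] != 0)
      ok = 0;

  free (buf);
  return ok ? (double) total / iters : -1.0;
}
```

Calling time_zero_fill_ns (size, 0, iters) and time_zero_fill_ns (size, 1, iters) with the same size and iteration count gives the two cases. The untimed prefill leaves the buffer in the same cache state before each timed call, so any remaining difference comes from the previous contents rather than from cache warm-up.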