From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS17314 8.43.84.0/22 X-Spam-Status: No, score=-3.7 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, PDS_RDNS_DYNAMIC_FP,RCVD_IN_DNSWL_MED,RDNS_DYNAMIC,SPF_HELO_PASS, SPF_PASS shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id E83BC1F5AE for ; Tue, 20 Jul 2021 08:51:44 +0000 (UTC) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id EEEBA3988009 for ; Tue, 20 Jul 2021 08:51:43 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org EEEBA3988009 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1626771104; bh=//4+ALsPKW8JLoANDWoQZImbx+wFz743to43eAuxJ44=; h=To:Subject:Date:References:In-Reply-To:List-Id:List-Unsubscribe: List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc: From; b=Tk+feQY2nmC1yzCfWkNh1rn2pmirLea8C8ngFFLtFeOsSK6VP+kyNWxKnjOQI3P+U 7LO2KWWkkAxiJd7LIRwFeQvG4F33Ow0VQAT1AWDwwdxq2C8ma45yxXGybo8aRqjDT0 jHaNayMTnzYILwye8NmWRVjmzWDxQRRT1z1wSzXM= Received: from esa18.fujitsucc.c3s2.iphmx.com (esa18.fujitsucc.c3s2.iphmx.com [216.71.158.38]) by sourceware.org (Postfix) with ESMTPS id 8D3943861834 for ; Tue, 20 Jul 2021 08:51:22 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 8D3943861834 X-IronPort-AV: E=McAfee;i="6200,9189,10050"; a="35838121" X-IronPort-AV: E=Sophos;i="5.84,254,1620658800"; d="scan'208";a="35838121" Received: from mail-os2jpn01lp2051.outbound.protection.outlook.com (HELO JPN01-OS2-obe.outbound.protection.outlook.com) ([104.47.92.51]) by ob1.fujitsucc.c3s2.iphmx.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 20 Jul 2021 17:51:20 +0900 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=mWaq8tu+N2IxFmr//MViTjKaPGP1Gxe/S+QjTo+oH9vsje4IMWhQ1nHWaT5dcj83cVX+sPob8ADH3eVL28XhCkz8r4OMYUbbV0xZIz7AfKGx5VMNkgujgCYtg0WfTS0rEdG0sKINeTsUYp1KjrDEEctzhY0Ts9Niou8y+Py9J5OZ2bvf/zOTEdTomJMoIQVsZzj5t+8qWyvT0eTMrcy4zNkC+7MI+fXL8uRVEBM/VZPbgYLknPHnDnNwwnxJzDbqW9Ah0Vbbpl3WZUAPBvSW0p8rBopLjdJh7Uo7VAWnMnxsd/hqwjB8qhQxMdipgaYyAszXaxP4chwHSPKakONizQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=//4+ALsPKW8JLoANDWoQZImbx+wFz743to43eAuxJ44=; b=aLixivhIO39ZNhp5td3a+ujFzhl1rgFgg7DD8uFhGi8oqxl6LlPnJoSLWhhvwjDnD32ZuHsGTj+OCrAAwFh31ZuwnI0jLn3rgAGfrVb2dwXj5aNV9nSzo0CPV6NKBjUwrdem+FX0zeNpW06QW77rGYnhBMpxyRL683hNfu+MtnH1QM9I5H/TCpgIQ5jgWtcUavmk6IiBjUBQHIbvlqTWwfb5E56euphpObBFH/PpCdT8CSilGDKuQQiyN9L68IdYczmN4mNZmqfSgsvpTY1jAOQvUKur1ZDKcuvWFqbSjloPegPYUKcVUe8LcB/5Y/AepWeni1/lvAC5kAl7G4Pwpg== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=fujitsu.com; dmarc=pass action=none header.from=fujitsu.com; dkim=pass header.d=fujitsu.com; arc=none Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com (2603:1096:402:36::13) by TYYPR01MB6762.jpnprd01.prod.outlook.com (2603:1096:400:cd::5) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4331.22; Tue, 20 Jul 2021 08:51:17 +0000 Received: from TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::5816:45c1:5336:c108]) by TYAPR01MB6025.jpnprd01.prod.outlook.com ([fe80::5816:45c1:5336:c108%8]) with mapi id 15.20.4331.033; Tue, 20 Jul 2021 08:51:17 +0000 To: Noah Goldstein , Wilco Dijkstra , "Lucas A. M. Magalhaes" Subject: RE: [PATCH] benchtests: Add memset zero fill benchmark tests Thread-Topic: [PATCH] benchtests: Add memset zero fill benchmark tests Thread-Index: AQHXd/o506Ph+ApkDUqc+W5ZNjtTm6tBPwcAgAIYy42ACDX08A== Date: Tue, 20 Jul 2021 08:51:16 +0000 Message-ID: References: , In-Reply-To: Accept-Language: en-001, ja-JP, en-US Content-Language: ja-JP X-MS-Has-Attach: X-MS-TNEF-Correlator: x-securitypolicycheck: OK by SHieldMailChecker v2.6.3 x-shieldmailcheckermailid: 5921101fad334268a81e8813ba2ffe51 msip_labels: MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Enabled=True; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_SiteId=a19f121d-81e1-4858-a9d8-736e267fd4c7; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_SetDate=2021-07-15T08:15:31.439Z; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Name=FUJITSU-RESTRICTED; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_ContentBits=0; MSIP_Label_a7295cc1-d279-42ac-ab4d-3b0f4fece050_Method=Standard; x-ms-publictraffictype: Email x-ms-office365-filtering-correlation-id: 391a0964-5a30-4993-bad0-08d94b5b8978 x-ms-traffictypediagnostic: TYYPR01MB6762: x-ms-exchange-transport-forked: True x-microsoft-antispam-prvs: x-ms-oob-tlc-oobclassifiers: OLM:9508; x-ms-exchange-senderadcheck: 1 x-ms-exchange-antispam-relay: 0 x-microsoft-antispam: BCL:0; x-microsoft-antispam-message-info: UQtd/fA/pI4pB5PplXUPr7C4GiW+6KyxKEEgaCQ3foI22DiFqBGcBkhM8jZqyUxPQtvze8QjyVsWlc8IRXkevTpB5IXiUSmRZs5E4Z87dKhX288Hpl39G12hVvaey5gvWd9cvfLriO2+TpG4UK33MFEzag0Xl8rJ7nGKAWOUvPk5q7pPnQpVoi4he3TPsQv7L8u4r43VRcX9ntL2K+g1pir1R0I9ojioSfiX8hXUPVRyWnq+QcWyzm/+HgYYdABA3tozAFFVFQqkIxUSawE5hmn8opvV/Y3oNyQpJKupazhStcgpOLsp1q7uEF3VFyR68e4s2lF7SrqTTzc9L6x2DZG5KvMeEBT5G4bJDlwTAp++3TlTwFevpOFeHaRmg5y2IVv2ocSxYaaJ6NUk9pmN54UL0JiYbU3io8wk5xAM9YbnsYLfR7bom3+e8jNnGPywLHIXLALj6ls8wz5tgPoalrmsZeCtvJlN8yommfSEg02CxlwBnnTepD7oH40wDIsJmAQ3Q43iCq7Wi6g+SxIMhqsJHfJ46DNovq7CSqYIG1o1nJ60TGOh+d+5YWMOXGt+xjF0ynONIQ7txMoOa6FMMmpOVhTHJZpWNJqOign6P/ZFnySb+8El7mOH0iCKtnMhbx+LAzvjxWG2lolPD0SmWLXNa4AJSvryeivW5mzyuVNCan6g23AS4WCijIQ9et9JEPTBBiLTDeKTu3fGBCtmZQ1esah5HJDK/Vez4Wubq8UYsv4PA7HZVFJPtFGXkfLBp346Yrwh23VugdK04gkyXo6Sb7hoYkmypQ506LURf2ynrYqVDXrp+dbXGid6CcSg x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:TYAPR01MB6025.jpnprd01.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(39860400002)(136003)(366004)(396003)(346002)(376002)(2906002)(26005)(86362001)(8676002)(9686003)(66946007)(316002)(122000001)(5660300002)(55016002)(8936002)(38100700002)(83380400001)(478600001)(71200400001)(186003)(6506007)(7696005)(33656002)(66556008)(110136005)(66446008)(85182001)(66476007)(52536014)(4326008)(966005)(76116006)(64756008)(38070700004); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?iso-2022-jp?B?T2pGOWl2UGFpRFFvZmlyektnMmZPVEdlMFFQUlZMZ1V3bzJRbkcyRXU5?= =?iso-2022-jp?B?dnFMUmc2WUNZM05FK0xyQUlPM3Ezb2ozRHgyUXlYdmJjZGR3dStDY3pj?= =?iso-2022-jp?B?QzVhbWJDcjdjZ3dPdWk5d1BSSExJSDllZ1ljS1Q1cjU3SXZBL2tYWWc0?= =?iso-2022-jp?B?S3dyZ2VyMStDdmxHNWpqZ1FUL05Rd0dsQVVvdFdzcSt5MG1GVjdieTBD?= =?iso-2022-jp?B?aFB6ald3M0NPSWxVNjB4YkV1RmZnQUtDL1hKZHlmaFlIZ0JPc3FzQThi?= =?iso-2022-jp?B?WWJRWjFYQzd5L2NOY0RNQmdqNmdTSjNQSitmSjI1TkMwOTdndWlFQjQ4?= =?iso-2022-jp?B?NDMzYjQzZ3NhakZaNXhVUWVMTm1OQ1ZYcTFpSW9vdXdpSXpzK2drMEI3?= =?iso-2022-jp?B?cFgwSlRjWUVPR3Y2N3g1NUdTM0FGN3Vkbk1TOWs0Y0tKN28vTzNDeWlR?= =?iso-2022-jp?B?NnBYL2JkM28zRlQ3dE91VUFURE03N21KeUIraGlGcldHSkdla2JzNHhh?= =?iso-2022-jp?B?TG8xNXlrYU5jSzAyWkFwUDVGMVFxN2tsUlpvbXFONUJiL0pLN0hHYlV6?= =?iso-2022-jp?B?cHJ1ZGdhalY0TDFLNlpSZ3lMVmZQRVZReXZNWG1RS2ZrcjNXd0dMemxR?= =?iso-2022-jp?B?SEVhUnQyT2hjUi8xV1VnaVR4S1Nxb3lNSVFBN3hKNFIwK0tjVnNuWWVx?= =?iso-2022-jp?B?VFg1U25TWWVJYnZ3dDVrb1VJTENwUTd0OVk5aEVHYk43Nm9kVTI5ckxV?= =?iso-2022-jp?B?enRXOXZwdG1pZ0V4YStXMGt1OHF5cHE2ajBhTmdJVVMxaUl0Mk9kaFVn?= =?iso-2022-jp?B?c2Q1VmpjZTN6WUxKaTBEbjNaUTRudkQ2eUc0UWI1MlpQNTBQV2taaTA1?= =?iso-2022-jp?B?K21qcXo4S3plVW1DNlhEejZGVXB1QzRsUlp6Y3dWWGVHYzBwbUpOMEs4?= =?iso-2022-jp?B?cXJJTVVzTW9MWVdHSDRUa2JlRzd3NThPWTVNQlJyTWRnR21lTFl3aTEw?= =?iso-2022-jp?B?ZWFvejIxUDZqbjBMUzlSbU5TZjh6Sk8vWWM1OGpmTkQ5ZVNXMDIwQjBr?= =?iso-2022-jp?B?N2N3VkVJSEg0a0lmdURLeDNCd2NCS3pmcWJ4VnA3QURnUUZVeHNVYXF4?= =?iso-2022-jp?B?QXhTaHJiTDVqcTVoVVB5czJuc3Yxam9XREhuSWtUVTNSRm1zNXgrckg5?= =?iso-2022-jp?B?WVZXYkVZbG5kSEUvQmRlUTVtR0F0bm15WGJ4WEVabDMrQ2FKa0IxRWJJ?= =?iso-2022-jp?B?SjYxbnRmenpPK2VDbFBQNHFob1NHU0c1SjRRRlJTSU16eWE3Z096ZGtE?= =?iso-2022-jp?B?S3prVkViOGJoQkdGSUw4Y000T3o4UDJNcUlBTFhMaWUrSnNPbXAzcTlT?= =?iso-2022-jp?B?RWd3V0g1cnM3bERFUlhqV0loazhhUWNYNmV4ZCs2SW5wb0h4Y0RGSGhU?= =?iso-2022-jp?B?a3NLWkt5SXl1NmFkdkFKVE0vY3NQcXkvVVE2RVlQWkJYdWM2V1I2a2lX?= =?iso-2022-jp?B?ZHJiejlzZ3lQMWR4Ukd5c25zM0FVSFVSRForK1QzWFVkYWg5cngvSm5i?= =?iso-2022-jp?B?RFQxNlh3T1U4bm5qeHhNaVVGS3VaZlJXdTBWdktuRXAzSTNLTm5lY2NY?= =?iso-2022-jp?B?cC9UcnhEUlRrNHdsZHA5azRQcmZWNkRacFF1eUcrS3lKTGFpQTgrenNh?= =?iso-2022-jp?B?R2lhVld1ZWpEUjcvTWxDZTNwWkY1YWJicVNjZnRDSUc5UWJmVXdTOGZn?= =?iso-2022-jp?B?dytnM1ZaUlhLZi83T3VsZE0rOTlTWE5sNzhpSU9jNzlsWTNwZTdSR2Jw?= =?iso-2022-jp?B?Zi9XUVFYOEFYMlhmemlmTlZRQlFROG5jczlGQUx6YzdLbWNycEg1dmpR?= =?iso-2022-jp?B?STZTY3VoSWxJcnJlM2lzNWsvWTJZPQ==?= Content-Type: text/plain; charset="iso-2022-jp" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: fujitsu.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: TYAPR01MB6025.jpnprd01.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: 391a0964-5a30-4993-bad0-08d94b5b8978 X-MS-Exchange-CrossTenant-originalarrivaltime: 20 Jul 2021 08:51:16.9227 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: a19f121d-81e1-4858-a9d8-736e267fd4c7 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: ftI1WGnHDVMMtzMefWgIPSWOYzBKjubG5hIHTsvPGcr/FQltHl22Btts3r7mpWV+bJqYvieF5CjCjyQ4LH0Myg== X-MS-Exchange-Transport-CrossTenantHeadersStamped: TYYPR01MB6762 X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , From: naohirot--- via Libc-alpha Reply-To: "naohirot@fujitsu.com" Cc: GNU C Library Errors-To: libc-alpha-bounces+e=80x24.org@sourceware.org Sender: "Libc-alpha" Hi Lucas, Wilco, Noah, > From: Tamura, Naohiro/=1B$BEDB<=1B(B =1B$BD>9-=1B(B > Sent: Thursday, July 15, 2021 5:16 PM > > [1] https://travisdowns.github.io/blog/2020/05/13/intel-zero-opt.html >=20 > I throughly read the blog, it's very interesting. > It seems that Skylake-S microarchitcture has some zero-over-zero > optimization in the size range of L3 and RAM. >=20 > So why don't we proceed like this? > - create a new benchtest file, ex. memset-zerofill.c. > it measures zero-over-zero and zero-over-one from 32KB(L1), through > L2 and L3, to 64MB(RAM) > - create a script to extract some part of data such as zero-over-zero > or zero-over-one from JSON bench result to CSV in the first place in > order to create graph in spreadsheet > - update benchtests/scripts/plot_strings.py later so that it can draw > zero-over-zero and zero-over-one separately Here is a result of the memset zero fill patch v2 [1] using the source code= A64FX memset patch v2 [2]. The first graph [3] is the zero-over-zero result by typing the command: $ cat bench-memset-zerofill.out | jq -r '.functions.memset."bench-variant"= =3D"zerofill-0o0" | del(.functions.memset.results[] | select(.char1 !=3D 0)= )' | plot_strings.py -l -p thru -v - The second graph [4] is the zero-over-one result by typing the command: $ cat bench-memset-zerofill.out | jq -r '.functions.memset."bench-variant"= =3D"zerofill-0o1" | del(.functions.memset.results[] | select(.char1 !=3D 1)= )' | plot_strings.py -l -p thru -v - In case of zero-over-zero, it's very interesting that memset_generic is fas= ter than memset_a64fx. On the other hand, however, in case of zero-over-one, memset_a64fx is faste= r is faster than memset_generic. It may be due to SVE code. Comparing the memset_generic performance, that is non SVE code, between zer= o-over-zero and zero-over-one,=20 it seems that a64fx micro-architecture has some zero-over-zero optimization= ?. [1] https://sourceware.org/pipermail/libc-alpha/2021-July/129290.html [2] https://sourceware.org/pipermail/libc-alpha/2021-July/128857.html [3] https://drive.google.com/file/d/1iNlp-srAq-p9fr7PuRh62TRj-V2eJi0U/view?= usp=3Dsharing [4] https://drive.google.com/file/d/1UipGRHUWU8WJ1-EbV6YzoTcOtxtOhAy0/view?= usp=3Dsharing Thanks. Naohiro