From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS3215 2.6.0.0/16 X-Spam-Status: No, score=-4.2 required=3.0 tests=AWL,BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, RCVD_IN_DNSWL_HI,SPF_HELO_PASS,SPF_PASS,UNPARSEABLE_RELAY shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id C2FA61F8C6 for ; Mon, 9 Aug 2021 13:16:27 +0000 (UTC) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id F2C1F3894C2F for ; Mon, 9 Aug 2021 13:16:26 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org F2C1F3894C2F DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1628514987; bh=L/iVi1R3x6VCETU0akmC7RrR28vFuYdCceGOnotz/8U=; h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:Cc:From; b=kL8gvfLMR+qj6NtZgq26u48uTrBlzSQsUc8mk/tR8lQnkR1wEuVZsaZShNQ93PRJJ Itoc6CVjr0imLez0IL3AFdOjMpoki8a1ZqTQfCtw0IVwXGuvZhwItORJiYAA+RDWrW +pNAkcgDnn/uzUvjkwgRBHPoKebDaBVfVkQtTR3Y= Received: from EUR03-DB5-obe.outbound.protection.outlook.com (mail-eopbgr40047.outbound.protection.outlook.com [40.107.4.47]) by sourceware.org (Postfix) with ESMTPS id BE8F4385482F for ; Mon, 9 Aug 2021 13:16:04 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org BE8F4385482F Received: from AM6P195CA0035.EURP195.PROD.OUTLOOK.COM (2603:10a6:209:81::48) by VE1PR08MB5022.eurprd08.prod.outlook.com (2603:10a6:803:114::27) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.16; Mon, 9 Aug 2021 13:16:01 +0000 Received: from AM5EUR03FT012.eop-EUR03.prod.protection.outlook.com (2603:10a6:209:81:cafe::aa) by AM6P195CA0035.outlook.office365.com (2603:10a6:209:81::48) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.15 via Frontend Transport; Mon, 9 Aug 2021 13:16:01 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; sourceware.org; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;sourceware.org; dmarc=pass action=none header.from=arm.com; Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by AM5EUR03FT012.mail.protection.outlook.com (10.152.16.161) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.15 via Frontend Transport; Mon, 9 Aug 2021 13:16:00 +0000 Received: ("Tessian outbound ab45ca2b67bc:v101"); Mon, 09 Aug 2021 13:16:00 +0000 X-CheckRecipientChecked: true X-CR-MTA-CID: 5205adce920cea79 X-CR-MTA-TID: 64aa7808 Received: from 0368c1a59114.1 by 64aa7808-outbound-1.mta.getcheckrecipient.com id 21ABFDE9-2689-4145-BFCD-AB9C2670181E.1; Mon, 09 Aug 2021 13:15:54 +0000 Received: from EUR04-HE1-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id 0368c1a59114.1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Mon, 09 Aug 2021 13:15:54 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=i6Kjox7COIZIIineOO65oYzvUNk5RNe/Vega+H6T6yK/q+gUzSshnnZ8EVOXBISGXvfl91t17rhMm6B5PSAZAw+BUr4RtDYSSCiyyPvDWU0A6eplDRw8ahCR1BGw32zMHgA4eA9YojAO5xRzKlf1EhaSOzf5kQth1Vc9k7ABoJ1WdGdaEotlwZ0N9M2IoGB8JiQBzfLx0tbPvXiG0JoGQl730Jn78s0keB4Iim1z4/YXVswQ1POW5ExVk80/4HnV6AN+FoV6VEhOGiQlzVCTSw+LzZ6vO8i8vfJXrSxR7WUMqrb4+dl++nWsWQ4h8FgQIXXEEjISzHf1mgKRNEAwVA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=L/iVi1R3x6VCETU0akmC7RrR28vFuYdCceGOnotz/8U=; b=d/zhNxRRmENB3mO2z8Slc0ZMWbtQmWPAMd43meGNLQHeISUT0dwnZj2r2to+AsHLm+uclyP23AR7mCQUtQeVcxYYVdP4VI24PxkVFfDB0ecBWx1UJnI9UyFQtUz9VRrpTq2KpcKILbz204C2M7+dl6ql5k0bgYnksxeihxnDdYdzu6jtGpWjgh+Av4xlfDPufu/JWAwEBzVwNASZCkGSl2kkN+qrFxj7bp+AH97Z6F8X9aztwG3XUA6d1XTpzXc+xT0h3YE0/2QbqpuhrUp+m9lAUL99tDJMCnkUtggVt7C+N3WMk3qoBfiNq+7/m+4eBgYh8WHAfbtopSz4wgcXGQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none Received: from VE1PR08MB5599.eurprd08.prod.outlook.com (2603:10a6:800:1a1::12) by VI1PR08MB3392.eurprd08.prod.outlook.com (2603:10a6:803:7b::30) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4394.16; Mon, 9 Aug 2021 13:15:45 +0000 Received: from VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::c437:fa2b:33:c8ba]) by VE1PR08MB5599.eurprd08.prod.outlook.com ([fe80::c437:fa2b:33:c8ba%7]) with mapi id 15.20.4394.023; Mon, 9 Aug 2021 13:15:45 +0000 To: "naohirot@fujitsu.com" Subject: [PATCH v4 5/5] AArch64: Improve A64FX memset medium loops Thread-Topic: [PATCH v4 5/5] AArch64: Improve A64FX memset medium loops Thread-Index: AQHXjSBqQUk1uBbeZkiGnGpCiCyx6w== Date: Mon, 9 Aug 2021 13:15:45 +0000 Message-ID: Accept-Language: en-GB, en-US Content-Language: en-GB X-MS-Has-Attach: X-MS-TNEF-Correlator: Authentication-Results-Original: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; x-ms-publictraffictype: Email X-MS-Office365-Filtering-Correlation-Id: ffbe9b0d-22cd-4fd3-a362-08d95b37d552 x-ms-traffictypediagnostic: VI1PR08MB3392:|VE1PR08MB5022: X-Microsoft-Antispam-PRVS: x-checkrecipientrouted: true nodisclaimer: true x-ms-oob-tlc-oobclassifiers: OLM:4714;OLM:4714; X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: dc9lCfFWOZoHsqQr80AgIDjvUBbZvs0/cfpR27zVszyj4O5cNSu2hEDhrmZKbrma8K/mz6Md3thlzk8VGECKAYYwoDPnlPYSOBDNViZgU3hUSShfzRD4kFj2IYo8Zr8EbP96pEQE1FC+Zfv5Xrwx/gfQICcvbk3LRa6opIdGoh+S/z0wCl92ZIZVzudp9QA4lv+1mGxJ/MhkAcAsGP8aZIi7bMoKQt0vOXpDip0gJB3xYanwR3s2Y8v2BqPuTT8meZOBSj5kAt+gJGzEbUkB50Okc9E/1Q0azhaCNbui9mMXiA9Ei45Yk6Ial2MOkJkqm8PBhqM7ieRlCuiJdNUUCOkmGGLRk5/sLoqJMwxlLzmpnQul/HrLVjXLDFZJZw/5yq4AgAvfHFH65EPDB+n4NQ+BG5dhggQp3syKlaHkg/PSdzQuCuTP0CAnM3AIq6bZRc2bzcwC0B79YKCo1b7G8j6BIcfZrqYp0mBe3IKVNhb3sna3dDFgiyI81UkbC47+FbsksL9S3m1R5pzGNijxIJAHo0C9PpN4hlxVoOLQo0n+qnJOpfXPt3Su38YOmQitAFEYFNh5zC02HL4Cc1whaNKjmEvrn0jBfQX6hIc5JPdTQocg3Y2agkuRM1NFFj52WVZOXVDvfyHzwJS3pYczTjDjeN0tZQr4R+ZRtNxhAd5caLMLJtXUEiqdI+xAzAfXu6Ue7HIgmwagtFDLhAk3f3KJeMC96vtX2UgcaW40g6nESH0sk0xSl4/ttsik7dDV0ag/cNZxoAwZLdM3aId+kg== X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR08MB5599.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(346002)(396003)(39850400004)(136003)(376002)(55016002)(76116006)(9686003)(8936002)(8676002)(91956017)(38070700005)(5660300002)(6916009)(7696005)(66946007)(66476007)(122000001)(66446008)(38100700002)(52536014)(66556008)(64756008)(71200400001)(26005)(186003)(2906002)(86362001)(316002)(33656002)(6506007)(478600001)(4326008)(473944003)(357404004); DIR:OUT; SFP:1101; x-ms-exchange-antispam-messagedata-chunkcount: 1 x-ms-exchange-antispam-messagedata-0: =?iso-8859-1?Q?9qlI6ZIDrlQ5vqX7+5LQA5ZKiV+WLwyWTM5ZHYzVgITe1pYWnf9jK5QBl1?= =?iso-8859-1?Q?kJ8I7BBbRhBffJZV38ncfbOeo2JOaMl4ulbECEDQ3ztD9gduy/XqwaY9Hw?= =?iso-8859-1?Q?6x8UxYvBAO8L2aYAn4IWEDZCFXMcQzNtvnaDzV9eJHvpTWrpmqfctsNisA?= =?iso-8859-1?Q?VuXokrr3CKB15f0pYsOMUb91UkXFN5O/B4qgFLWkGmvNKVEOrSm3W7Kutq?= =?iso-8859-1?Q?NioSjBaUX1ug+tc202UVGczl8cBK0x91B0oiqfrRvTfH5eZYIChwL1G72u?= =?iso-8859-1?Q?Vcmu0/+2kyyl+zps0JWu5my0YCPfZnpcgCa/nFbvg+73eAyNbLqyloDDQ/?= =?iso-8859-1?Q?KOTLfNAyX9nO7PhPjqP6D03C+4UljobLZmNqiWs3MWXNFVddi2m21gIlll?= =?iso-8859-1?Q?ChM8HTgiXEbj4JFIlUsfnRF5a24UsNaXBtp31WmIvLhhhGQ1Hs0ZUHF0bO?= =?iso-8859-1?Q?mADgLWEd5oN89jkfXyOFt23R/t523JxMd/ycku/mnC4JKfrFqDQAWiyWV1?= =?iso-8859-1?Q?qrbuGabs/1PSQUbyb2omtT009zaKcGJjyVL0uFiOhCHtaLIFui6caXuJWn?= =?iso-8859-1?Q?SswmjVDUvszRJgu5zVArB+YQ2nWzXOsVzcAQeMp/SqQAQB5tPQHpv9r6D6?= =?iso-8859-1?Q?Y+YLh0i96ccR/2Ks5wZ22EoGMkq/LLlVFIEd6c/M61nA21lOz0tf5fxhwR?= =?iso-8859-1?Q?XK7gSzA4pbOqe0UhxAGHB86NE8iQNgoWOYQFrclxm/JqnWr/AQ3RM6Rlv4?= =?iso-8859-1?Q?iv5SHqrvAMstrAcWgeiqPm1q2ZpeLt8Kl7IksB9qA9sZ39Z9AxHP6SO098?= =?iso-8859-1?Q?AnUgRbQiglcFg++figvpE51mVQJV3mAAwrogrTQm3kJHhMhvHVW9aRgoOl?= =?iso-8859-1?Q?BuuJdOjYDBDhKG05S41Ym4Rd/zaJm6NDamM+lEwoZOXF8PILfwa8GnJswD?= =?iso-8859-1?Q?7Jt+gKBZEmqfgFyluhmf0vLNEfqsMHBRl/J2p9XFlYi3rxjDm94egmuQrr?= =?iso-8859-1?Q?pE9FBz2pJi7ZnaAF5+v8drf0BSDzi77RwdvaCxc/8zRyx9bcMZGIVV7G4u?= =?iso-8859-1?Q?Kov+jQegE9Q50id7DqcN4ebwPB3LY8CkhbBXJSDRn3kLswi+2o0mGNEbla?= =?iso-8859-1?Q?7N2p9jOso4f5NFw4DzOmRwwx3mMTh9Pev8gnmDhG+LfmOl/j49RThefUfF?= =?iso-8859-1?Q?9rEl0xfcYwKZe9jYf8PYFX9TffvG1uhyXdeeTewim7/3igzlJK1OhXjxdi?= =?iso-8859-1?Q?HG21M41A5WbP1Dznn0BC2fxGT2SMRRZdA/F0PG4yzdnV6d45em7du7GRLt?= =?iso-8859-1?Q?+soN3LvDg9qcy9EmVgQIerRVFcKaduyiAfzjbRQJ/4c8L8I=3D?= x-ms-exchange-transport-forked: True Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Exchange-Transport-CrossTenantHeadersStamped: VI1PR08MB3392 Original-Authentication-Results: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: AM5EUR03FT012.eop-EUR03.prod.protection.outlook.com X-MS-Office365-Filtering-Correlation-Id-Prvs: 4e156455-d941-45d9-5f77-08d95b37cc58 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: jNPEftvvBz9UP/vItRnWP8mVzp043pE+fb2vK8mDlL63tLjOOt6jR++KP3G2t2DnBI7WIdc5EN4jZbiSVkaiT5YhgJhdv1O29+as9oHt9ZOgasxr1UEDqZ1id04x5+Z+7meQHvwuVpUVrnVZNwq7e4nS3c7D77e+FlYN4OQBiSgptABlogLaEmTSCjsxOnngfJxqM+izyOiS+EMlgLI9xmETG9EDwXM7C2y+vpAKckJxmFvYeqS7rfPsUtqHuKiibyvEqu7s24E9yeIJIBXpW8XIdkEcb8oxTB3HqgRayZQI/ely/pRXKvkWOIsKm70JM0ruVtaXoMltQvE7Zj4lAi9lvPpbVZWVdkLQw+QIoAzuUZB2b12x/WmfaH9kuNGKKuAXOvMy2J1M9roDwR4n2ekZU+iV48CNrWrA8E37fSESHA0flGMbj25zzXfbGAgvHspKphTKH4qy/I8XmnSznm033OhMJQuE7nSHVQg1pRreM3ZOBaWh5Tx9P1flN8Xi6jSTNPU1YS2LPyLdMXmxTSRKJixR1Z7o9sa90P8f/MVD33XfFFeE2kl0mFhbBLCbsWl3DpA6FGS3es45T/7+I78GJ45AjJvHDDcgrcuO6/QMuNtobCGB7WHGy38zahUMPhL4bw7kcl6k0Gl0QspZPQGZxh3/e3rCpftT8eUD1VlxGzwY7awgByDy8ozpJjjDZeZVh30APuy3Ojhjql8Detq06HVWv5Pdvk+xq1tpGAPKifNQ3aaUpan7q0RacwTK X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(4636009)(376002)(396003)(346002)(136003)(39850400004)(46966006)(36840700001)(356005)(81166007)(82310400003)(8676002)(36860700001)(86362001)(8936002)(26005)(478600001)(2906002)(316002)(186003)(82740400003)(336012)(33656002)(52536014)(70586007)(70206006)(6862004)(9686003)(5660300002)(4326008)(6506007)(7696005)(47076005)(55016002)(473944003)(357404004); DIR:OUT; SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 09 Aug 2021 13:16:00.9383 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: ffbe9b0d-22cd-4fd3-a362-08d95b37d552 X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: AM5EUR03FT012.eop-EUR03.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: VE1PR08MB5022 X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , From: Wilco Dijkstra via Libc-alpha Reply-To: Wilco Dijkstra Cc: 'GNU C Library' Errors-To: libc-alpha-bounces+e=80x24.org@sourceware.org Sender: "Libc-alpha" v4: minor loop change=0A= =0A= Simplify the code for memsets smaller than L1. Improve the unroll8 and L1_p= refetch loops.=0A= =0A= ---=0A= =0A= diff --git a/sysdeps/aarch64/multiarch/memset_a64fx.S b/sysdeps/aarch64/mul= tiarch/memset_a64fx.S=0A= index 89dba912588c243e67a9527a56b4d3a44659d542..318c6350a31e0fad788b5f2139d= e645ddc51493f 100644=0A= --- a/sysdeps/aarch64/multiarch/memset_a64fx.S=0A= +++ b/sysdeps/aarch64/multiarch/memset_a64fx.S=0A= @@ -30,7 +30,6 @@=0A= #define L2_SIZE (8*1024*1024) // L2 8MB=0A= #define CACHE_LINE_SIZE 256=0A= #define PF_DIST_L1 (CACHE_LINE_SIZE * 16) // Prefetch distance L1=0A= -#define rest x2=0A= #define vector_length x9=0A= =0A= #if HAVE_AARCH64_SVE_ASM=0A= @@ -89,29 +88,19 @@ ENTRY (MEMSET)=0A= =0A= .p2align 4=0A= L(vl_agnostic): // VL Agnostic=0A= - mov rest, count=0A= mov dst, dstin=0A= - add dstend, dstin, count=0A= - // if rest >=3D L2_SIZE && vector_length =3D=3D 64 then L(L2)=0A= - mov tmp1, 64=0A= - cmp rest, L2_SIZE=0A= - ccmp vector_length, tmp1, 0, cs=0A= - b.eq L(L2)=0A= - // if rest >=3D L1_SIZE && vector_length =3D=3D 64 then L(L1_prefetch)=0A= - cmp rest, L1_SIZE=0A= - ccmp vector_length, tmp1, 0, cs=0A= - b.eq L(L1_prefetch)=0A= -=0A= + cmp count, L1_SIZE=0A= + b.hi L(L1_prefetch)=0A= =0A= + // count >=3D 8 * vector_length=0A= L(unroll8):=0A= - lsl tmp1, vector_length, 3=0A= - .p2align 3=0A= -1: cmp rest, tmp1=0A= - b.cc L(last)=0A= - st1b_unroll=0A= + sub count, count, tmp1=0A= + .p2align 4=0A= +1: st1b_unroll 0, 7=0A= add dst, dst, tmp1=0A= - sub rest, rest, tmp1=0A= - b 1b=0A= + subs count, count, tmp1=0A= + b.hi 1b=0A= + add count, count, tmp1=0A= =0A= L(last):=0A= cmp count, vector_length, lsl 1=0A= @@ -129,18 +118,22 @@ L(last):=0A= st1b z0.b, p0, [dstend, -1, mul vl]=0A= ret=0A= =0A= -L(L1_prefetch): // if rest >=3D L1_SIZE=0A= + // count >=3D L1_SIZE=0A= .p2align 3=0A= +L(L1_prefetch):=0A= + cmp count, L2_SIZE=0A= + b.hs L(L2)=0A= + cmp vector_length, 64=0A= + b.ne L(unroll8)=0A= 1: st1b_unroll 0, 3=0A= prfm pstl1keep, [dst, PF_DIST_L1]=0A= st1b_unroll 4, 7=0A= prfm pstl1keep, [dst, PF_DIST_L1 + CACHE_LINE_SIZE]=0A= add dst, dst, CACHE_LINE_SIZE * 2=0A= - sub rest, rest, CACHE_LINE_SIZE * 2=0A= - cmp rest, L1_SIZE=0A= - b.ge 1b=0A= - cbnz rest, L(unroll8)=0A= - ret=0A= + sub count, count, CACHE_LINE_SIZE * 2=0A= + cmp count, PF_DIST_L1=0A= + b.hs 1b=0A= + b L(unroll8)=0A= =0A= // count >=3D L2_SIZE=0A= .p2align 3=0A= =0A=