From: "Randall R. Stewart"
To: Perforce Change Reviews
Date: Fri, 7 Mar 2008 09:17:18 GMT
Message-Id: <200803070917.m279HISY020379@repoman.freebsd.org>
Subject: PERFORCE change 137066 for review
List-Id: p4 projects tree changes

http://perforce.freebsd.org/chv.cgi?CH=137066

Change 137066 by rrs@rrs-mips2-jnpr on 2008/03/07 09:17:12

	Switching fls() to a table-driven lookup seems to improve
	interrupt performance a bit. Also add fls64().

Affected files ...

.. //depot/projects/mips2-jnpr/src/sys/libkern/fls.c#2 edit

Differences ...

==== //depot/projects/mips2-jnpr/src/sys/libkern/fls.c#2 (text+ko) ====

@@ -35,14 +35,84 @@
 /*
  * Find Last Set bit
  */
+
+uint8_t fls_mask[256] = {
+	0x00, 0x01, 0x02, 0x02, 0x03, 0x03, 0x03, 0x03,
+	0x04, 0x04, 0x04, 0x04, 0x04, 0x04, 0x04, 0x04,
+	0x05, 0x05, 0x05, 0x05, 0x05, 0x05, 0x05, 0x05,
+	0x05, 0x05, 0x05, 0x05, 0x05, 0x05, 0x05, 0x05,
+	0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06,
+	0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06,
+	0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06,
+	0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06, 0x06,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07, 0x07,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08,
+	0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08, 0x08
+};
+
 int
-fls(int mask)
+fls(int m)
 {
-	int bit;
+	int val;
+	register uint32_t mask = m;
+	if (mask < 0x00000100) {
+		val = mask & 0x000000ff;
+		return (fls_mask[val]);
+	}
+	if (mask > 0x00ffffff) {
+		val = (mask >> 24) & 0x000000ff;
+		return (fls_mask[val] + 24);
+	}
+	if (mask > 0x0000ffff) {
+		val = (mask >> 16) & 0x000000ff;
+		return (fls_mask[val] + 16);
+	}
+	val = (mask >> 8) & 0x000000ff;
+	return (fls_mask[val] + 8);
+}
 
-	if (mask == 0)
-		return (0);
-	for (bit = 1; mask != 1; bit++)
-		mask = (unsigned int)mask >> 1;
-	return (bit);
+int
+fls64(uint64_t mask)
+{
+	int val;
+	if(mask <= 0x00000000ffffffff) {
+		val = mask & 0x00000000ffffffff;
+		return(fls(val));
+	}
+	if(mask > 0x00ffffffffffffff) {
+		val = (mask >> 56) & 0x000000ff;
+		return (fls_mask[val] + 56);
+	}
+	if (mask > 0x0000ffffffffffff) {
+		val = (mask >> 48) & 0x000000ff;
+		return (fls_mask[val] + 48);
+	}
+	if (mask > 0x000000ffffffffff) {
+		val = (mask >> 40) & 0x000000ff;
+		return (fls_mask[val] + 40);
+	}
+	val = (mask >> 32) & 0x000000ff;
+	return (fls_mask[val] + 32);
 }
+
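
For anyone reviewing the table by eye, the lookup idea in the diff above can be
cross-checked with a small userland sketch. This is not the code from the
change: the names byte_fls, byte_fls_init, fls64_loop, fls64_table and
fls_selfcheck are hypothetical, the table is computed at startup rather than
typed out, and the highest nonzero byte is found with a short loop instead of
the unrolled comparisons. It only assumes the usual fls() convention that bits
are numbered from 1 and that a zero argument yields 0.

```c
#include <assert.h>
#include <stdint.h>

/* All names below are hypothetical and chosen for this sketch only. */

/* byte_fls[v]: 1-based index of the highest set bit in byte v, 0 for v == 0. */
static uint8_t byte_fls[256];

/* Compute the 256-entry table instead of writing it out by hand. */
static void
byte_fls_init(void)
{
	int v, bit;

	for (v = 0; v < 256; v++) {
		/* Count how many right shifts it takes to empty v. */
		for (bit = 0; (v >> bit) != 0; bit++)
			;
		byte_fls[v] = (uint8_t)bit;
	}
	assert(byte_fls[0] == 0 && byte_fls[255] == 8);
}

/* Reference version: shift one bit at a time until nothing is left. */
static int
fls64_loop(uint64_t mask)
{
	int bit;

	for (bit = 0; mask != 0; bit++)
		mask >>= 1;
	return (bit);
}

/*
 * Table-driven version: find the highest nonzero byte, then do a
 * single table lookup and add that byte's bit offset.
 */
static int
fls64_table(uint64_t mask)
{
	int shift;

	for (shift = 56; shift > 0; shift -= 8) {
		/* All higher bytes are zero here, so the index fits in 8 bits. */
		if ((mask >> shift) != 0)
			return (byte_fls[mask >> shift] + shift);
	}
	return (byte_fls[mask & 0xff]);
}

/*
 * Compare both versions on zero, every single-bit value, and every
 * "all ones up to bit i" value. Returns the number of mismatches.
 */
static int
fls_selfcheck(void)
{
	int i, bad = 0;
	uint64_t one, ones;

	byte_fls_init();
	if (fls64_table(0) != fls64_loop(0))
		bad++;
	for (i = 0; i < 64; i++) {
		one = (uint64_t)1 << i;
		ones = one | (one - 1);
		if (fls64_table(one) != i + 1)
			bad++;
		if (fls64_table(ones) != fls64_loop(ones))
			bad++;
	}
	return (bad);
}
```

Calling fls_selfcheck() from a small test program and checking for a zero
return is enough to confirm the two versions agree on those patterns. The
generated byte_fls[] can also be printed and compared entry by entry against
fls_mask[] in the diff.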