From: Adam Vande More
To: Andriy Gapon
Cc: alc@freebsd.org, Maho NAKATA, alan.l.cox@gmail.com, freebsd-stable@freebsd.org, als@modulus.org
Date: Wed, 14 Apr 2010 12:47:31 -0500
Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

On Wed, Apr 14, 2010 at 11:51 AM, Andriy Gapon wrote:

> on 14/04/2010 19:45 Adam Vande More said the following:
> >
> > also if I run cpuset on the dgemm then the utilization is basically at
> > the theoretical max for one core so at least that part is working.
>
> You can also try procstat -t to find out thread IDs and cpuset -t to pin the
> threads to the cores.
>

It gets to around 90% doing that.

time : 103.617271 or 27.140992
Mflops : 47172.925449
n: 4100
time : 113.910669 or 30.520677
Mflops : 45174.496186
n: 4200
time : 121.880695 or 32.068070
Mflops : 46217.711013
n: 4300

Tried a couple of different thread orders but didn't seem to make a difference.

galacticdominator% procstat -t 1922
  PID    TID COMM             TDNAME           CPU  PRI STATE   WCHAN
 1922 100092 dgemm            initial thread     0  190 run     -
 1922 100268 dgemm            -                  1  190 run     -
 1922 100270 dgemm            -                  1  191 run     -
 1922 100272 dgemm            -                  3  190 run     -
 1922 100273 dgemm            -                  2  191 run     -
 1922 100274 dgemm            -                  2  191 run     -
 1922 100282 dgemm            -                  0  190 run     -
 1922 100283 dgemm            -                  3  190 run     -

galacticdominator% cpuset -t 100092 -l 0
galacticdominator% cpuset -t 100268 -l 1
galacticdominator% cpuset -t 100270 -l 2
galacticdominator% cpuset -t 100272 -l 3
galacticdominator% cpuset -t 100273 -l 0
galacticdominator% cpuset -t 100274 -l 1
galacticdominator% cpuset -t 100282 -l 2
galacticdominator% cpuset -t 100283 -l 3

galacticdominator% cpuset -t 100092 -l 0
galacticdominator% cpuset -t 100268 -l 0
galacticdominator% cpuset -t 100270 -l 1
galacticdominator% cpuset -t 100272 -l 1
galacticdominator% cpuset -t 100273 -l 2
galacticdominator% cpuset -t 100274 -l 2
galacticdominator% cpuset -t 100282 -l 3
galacticdominator% cpuset -t 100283 -l 3

This is from the second set:

time : 150.348850 or 40.488350
Mflops : 45022.951141
n: 4600
time : 161.968982 or 43.589618
Mflops : 44669.884500
n: 4700

Since this is a full-fledged desktop environment, 90% utilization seems pretty good. I'm no expert, Andriy, but it seems like if gotoblas implemented some of the FreeBSD optimizations then we'd be in the same ballpark.

-- 
Adam Vande More
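
For anyone repeating this, the first (round-robin) set of cpuset commands above could be generated rather than typed by hand. This is only a rough sketch: pin_plan is a made-up helper name, not a FreeBSD tool, and it only prints the commands so they can be checked before anything is pinned. The CPU count of 4 is taken from the CPU numbers 0-3 used above.

```shell
#!/bin/sh
# pin_plan is a hypothetical helper (not part of FreeBSD).
# It prints, but does not run, one cpuset command per thread ID,
# handing out CPU numbers round-robin: 0, 1, ..., NCPU-1, 0, 1, ...
# usage: pin_plan NCPU TID...
pin_plan() {
    ncpu=$1
    shift
    i=0
    for tid in "$@"; do
        echo "cpuset -t $tid -l $((i % ncpu))"
        i=$((i + 1))
    done
}

# Thread IDs copied from the procstat -t output in this message.
pin_plan 4 100092 100268 100270 100272 100273 100274 100282 100283
```

On a live process the thread IDs would presumably come from the second column of procstat -t, for example `procstat -t 1922 | awk 'NR > 1 { print $2 }'`, and the printed commands could be piped to sh once they look right.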