From: Zbigniew Bodek
To: Jia-Shiun Li
Cc: "freebsd-arm@freebsd.org"
Date: Fri, 6 Sep 2013 00:37:33 +0200
Subject: Re: stream benchmarking on RPi

2013/9/5 Jia-Shiun Li

> Hi all,
>
> just did a brief test using stream, the memory bandwidth benchmark, on
> RPi with Raspbian and FreeBSD. Share these info to see if someone
> might think of it useful.
>
> FreeBSD is faster at copying. I guess that must be attributed to
> recent VM and/or superpage commits. I remembered it to be under
> 300MB/s months before. On the other hand, scale, add, and triad are
> significantly slower. Anyone have clues or any wild guesses?
>
>
> Below the only compiler option given for cc/gcc is -O3.
>
> Raspbian 2013-07-26 (lk 3.6, gcc 4.6):
> -------------------------------------------------------------
> Function    Best Rate MB/s    Avg time    Min time    Max time
> Copy:                257.3    0.062703    0.062189    0.063824
> Scale:               205.1    0.079135    0.077993    0.082000
> Add:                 284.1    0.085253    0.084480    0.088597
> Triad:               274.3    0.087799    0.087501    0.087940
> -------------------------------------------------------------
>
> FreeBSD 10.0-CURRENT r255120 w/ cc(clang):
> -------------------------------------------------------------
> Function    Best Rate MB/s    Avg time    Min time    Max time
> Copy:                365.5    0.045321    0.043779    0.052929
> Scale:                31.2    0.531028    0.513082    0.550906
> Add:                  68.5    0.367295    0.350467    0.391310
> Triad:                26.9    0.902672    0.893316    0.908908
> -------------------------------------------------------------
>
> FreeBSD 10.0-CURRENT r255120 w/ gcc (4.2.1):
> -------------------------------------------------------------
> Function    Best Rate MB/s    Avg time    Min time    Max time
> Copy:                264.6    0.063977    0.060470    0.081000
> Scale:                31.0    0.535830    0.516093    0.551035
> Add:                  46.7    0.534768    0.514323    0.553834
> Triad:                23.1    1.047644    1.038968    1.066887
> -------------------------------------------------------------
>
>

Hello Jia-Shiun. Thanks for your effort in testing.
I am actually in the middle of superpage tests, so another benchmark
and set of results will be very helpful, especially for comparison.

Just for the record: did you enable superpages for your kernel?
SP are not yet enabled by default, so one needs to set
vm.pmap.sp_enabled to a non-zero value in loader.conf (if you are using
loader) or set this value in the source by editing
sys/arm/arm/pmap-v6.c -> sp_enabled.

Nevertheless, I've run short tests on Armada XP (clang). I used two
array sizes (default and 2 x default). I also made a few runs to ensure
that the results are steady.
Please check below (an improvement in copy can be seen, but from what
one can observe via sysctl vm.pmap.section, not many superpages are
"requested" during the test):

Array size = 10000000 (elements)
================================

sp disabled
-------------------------------------------------------------
Function    Best Rate MB/s    Avg time    Min time    Max time
Copy:               1311.9    0.124910    0.121956    0.126486
Scale:                64.2    2.546568    2.493977    2.570808
Add:                 112.1    2.163666    2.140962    2.205463
Triad:                51.3    4.683770    4.675176    4.689565

sp enabled
-------------------------------------------------------------
Function    Best Rate MB/s    Avg time    Min time    Max time
Copy:               1368.9    0.119831    0.116878    0.121894
Scale:                64.6    2.527607    2.476270    2.551667
Add:                 112.9    2.147966    2.125261    2.189840
Triad:                51.6    4.654865    4.647609    4.662289

Array size = 20000000 (elements)
================================

sp disabled
-------------------------------------------------------------
Function    Best Rate MB/s    Avg time    Min time    Max time
Copy:               1271.2    0.257928    0.251738    0.260991
Scale:                64.2    5.092455    4.987830    5.139630
Add:                 112.0    4.331419    4.287459    4.416701
Triad:                51.3    9.366274    9.349165    9.379344

sp enabled
-------------------------------------------------------------
Function    Best Rate MB/s    Avg time    Min time    Max time
Copy:               1333.3    0.250755    0.240014    0.253216
Scale:                64.5    5.065569    4.963166    5.114160
Add:                 112.4    4.312079    4.268610    4.395812
Triad:                51.6    9.325673    9.309094    9.338787

Best regards
Zbigniew Bodek
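
For anyone comparing the tables, the "Best Rate" column can be recomputed
from the "Min time" column. The following is only a minimal sketch and
not part of stream itself: it assumes 8-byte doubles and stream's usual
byte-counting convention (copy and scale touch two arrays, add and triad
touch three), and the helper name best_rate_mb_s is made up for
illustration.

```python
# Bytes moved per array element by each kernel, ASSUMING 8-byte doubles:
# copy and scale touch two arrays, add and triad touch three.
BYTES_PER_ELEMENT = {"Copy": 16, "Scale": 16, "Add": 24, "Triad": 24}


def best_rate_mb_s(kernel: str, n_elements: int, min_time_s: float) -> float:
    """Hypothetical helper: bandwidth in MB/s (1 MB = 10**6 bytes)."""
    return BYTES_PER_ELEMENT[kernel] * n_elements / min_time_s / 1e6


# Min times taken from the "sp disabled" run with 10000000 elements.
min_times = {
    "Copy": 0.121956,
    "Scale": 2.493977,
    "Add": 2.140962,
    "Triad": 4.675176,
}

for kernel, min_time in min_times.items():
    rate = best_rate_mb_s(kernel, 10_000_000, min_time)
    print(f"{kernel}: {rate:.1f} MB/s")
```

Under those assumptions the computed rates agree with the reported
"Best Rate" values for that run to within rounding, which suggests the
slow scale/add/triad numbers come from long kernel times rather than
from how the rate is reported.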