From owner-freebsd-stable@FreeBSD.ORG Sun Oct 14 14:42:13 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 2DA3D16A41A; Sun, 14 Oct 2007 14:42:13 +0000 (UTC) (envelope-from d_elbracht@ecngs.de) Received: from ecngs.de (mail.ecngs.de [217.73.144.50]) by mx1.freebsd.org (Postfix) with ESMTP id EAF7D13C457; Sun, 14 Oct 2007 14:42:11 +0000 (UTC) (envelope-from d_elbracht@ecngs.de) Received: from EC1a (ec1.elbracht.net [217.73.144.99]) by ecngs.de (SurgeMail 3.8f2) with ESMTP id 1773237-1922481 for multiple; Sun, 14 Oct 2007 16:42:32 +0200 From: "d_elbracht" To: , Date: Sun, 14 Oct 2007 16:42:06 +0200 Message-ID: <008e01c80e70$64c92910$639049d9@EC1a> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 11 Thread-Index: AcgOcGQ+rp1Mn0RqSFKuKB0DklKcNw== X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3138 Cc: Subject: newbuf, nbufkv, nbufbs X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 14 Oct 2007 14:42:13 -0000 We have 2 machines involved with this problem. machine1, SMP, i386, 4 GB RAM was recently upgraded from 5.4 to 6.2 cvsup'ed 2007-10-10 a partition of about 2.5 TB (gstripe -s 1048576) was newfs'ed with blocksize of 65536 and fragsize of 8192 On 5.4, this was running for months with no problem. On 6.2 after a few hours of high thruput (network tx and rx 400-500 Mbit each), it became unresponsible with top showing a lot of processes with waitmessage newbuf. So, reset, fsck etc and it run again, only after a few hours, it became unresponsible again, showing processes with nbufkv and nbufbs this time, I did newfs with blocksize of 32768 and fragsize of 4096 and it's running. Thruput is decreased to 300-400 Mbit Note, it did NEVER show the problem on 5.4 machine2, SMP, amd64, 16 GB RAM, 6.2 cvsup'ed 2007-10-09 20 partitions involving 51 disks, all gstripe -s 1048576, newfs -b 65536 -f 8192 1 partion of 12 GB, (da3s1a) newfs -b 65536 -f 8192 after a few hours, top shows newbuf and the machine is unresponsible. tps on da3s1a is > 500, the others are < 100 I did newfs -b 8192 -f 1024 /dev/da3s1a and it's running without the problem (yet) The problem seems to have to do with -b 65536 and lot's of IOPS ond 6.2 Any clue ? e.g. increase BKVASIZE to 131072 and kern.nbuf to 32768 ? Cheers, Dieter