From owner-freebsd-stable@FreeBSD.ORG Fri Jan 20 18:01:47 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 1A58C16A41F for ; Fri, 20 Jan 2006 18:01:47 +0000 (GMT) (envelope-from nike_d@cytexbg.com) Received: from office.suresupport.com (office.suresupport.com [213.145.98.15]) by mx1.FreeBSD.org (Postfix) with SMTP id 2256743D45 for ; Fri, 20 Jan 2006 18:01:44 +0000 (GMT) (envelope-from nike_d@cytexbg.com) Received: (qmail 74325 invoked by uid 1026); 20 Jan 2006 18:03:39 -0000 Received: from 213.145.98.14 by office.suresupport.com (envelope-from , uid 1004) with qmail-scanner-1.23 (f-prot: 4.4.2/3.14.11. Clear:RC:1(213.145.98.14):. Processed in 0.096744 secs); 20 Jan 2006 18:03:39 -0000 Received: from unknown (HELO 14.98.145.213.in-addr.arpa) (213.145.98.14) by office.suresupport.com with SMTP; 20 Jan 2006 18:03:39 -0000 From: Niki Denev To: freebsd-stable@freebsd.org Date: Fri, 20 Jan 2006 20:03:30 +0200 User-Agent: KMail/1.9.1 MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200601202003.30336.nike_d@cytexbg.com> Subject: diskio / filesystem related deadlock on SMP 6.0-STABLE machine. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 20 Jan 2006 18:01:47 -0000 Hello, I'm experiencing some problems with a 6.0-STABLE machine, cvsupped and rebuilt yesterday. I'm not sure that the problem is related to this last update, because this machine is not very loaded currently. The machine is with dual opteron mb from Supermicro, with two Opteron 244s and 4GB of DDR 400 ECC Registered memory, integrated Adaptec U320 SCSI controller (forced to U160 mode), and four Seagate 36G 10K rpm scsi drives. The kernel config is generic SMP with enabled QUOTA support, accounting_enable=YES in rc.conf and the root fs is a software Raid-10 running two striped mirrors using geom_mirror and geom_stripe with the help of a little /boot partition for loading the kernel and the required modules. Kernel conf, dmesg and loader.conf are available here : http://www.totalterror.net/freebsd/srv/ Yesterday i was able to deadlock the machine two times, doing exactly the same thing : I was doing rsync from another machine to this one. I was syncing one rather big imap(Maildir) folder, about 270K msgs(files), and at the same time i was syncing this folder contents via the bincimap imap server on a remote machine running Kmail. Then i run a "du -sh" on the folder in question.....and all my shells to it stopped working... I was able to ping the machine and connect to listening ports, but without getting banners from the daemons. There was also zero HDD activity at this time. Unfortunately i forgot to enter the debuger and get a trace...(but, will it show something meaningfull or just the keyboard interrupt handler?) After reseting the machine booted and rebuilt it's secondary components on the both mirrors ( maybe this is normal? it seems it's happeing everytime the machine is uncleanly restarted) This is a big problem for me because this machine will soon enter in production, and should be able to serve imap to a dozen of clients. I know that 270K msg in single imap folder is stupid, but our old imap server running FreeBSD 5.4-STABLE(quite old STABLE) on AMD 1800+ with 2G of ram and 80G IDE disk has no problems with it, except being very slow of course I hope this info is enough, if not i will gladly provide more. Any suggestions are welcome, Thanks. --niki