From: Andrew Thomson
To: freebsd-questions@freebsd.org
Subject: 4.4 mailserver dying
Date: 10 Oct 2002 13:42:58 +1000
Message-Id: <1034221378.95974.77.camel@athomson.prv.au.itouchnet.net>

I recently upgraded our mailserver from 4.0 to 4.4. It ran for days and days on 4.0, so I don't suspect the hardware. The only changes have been the addition of a RAID 5 array to store the mail, and some updated packages.

The symptoms I'm seeing are that it will run for about a day or two, and then users complain they can't access the mail server. If I try to log on, my ssh session gets 99% of the way through but never returns me to a prompt.
Trying to log on via the console doesn't help either.

The common theme I'm seeing is references to the RAID array/controller before it dies. Actually, looking at the logs again, it's _not_ moments before the death:

Oct 10 10:53:21 mx1 /kernel.MAIL.0: xl0: transmission error: 90
Oct 10 10:53:21 mx1 /kernel.MAIL.0: xl0: tx underrun, increasing tx start threshold to 360 bytes
Oct 10 12:54:00 mx1 /kernel.MAIL.0: amr0: bad slot 177 completed
Oct 10 13:19:06 mx1 /kernel.MAIL.0: Copyright (c) 1992-2001 The FreeBSD Project.
Oct 10 13:19:06 mx1 /kernel.MAIL.0: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Oct 10 13:19:06 mx1 /kernel.MAIL.0: The Regents of the University of California. All rights reserved.
Oct 10 13:19:06 mx1 /kernel.MAIL.0: FreeBSD 4.4-RELEASE-p15 #0: Wed Jul 17 22:19:32 SAST 2002

It's a fairly big ass RAID array just for mail, and there's another SCSI disk for everything else:

/dev/da0s1a      496M     42M    415M      9%    /
/dev/da0s1f      992M    6.8M    906M      1%    /tmp
/dev/da0s1g      4.9G    264M    4.3G      6%    /usr
/dev/da0s1e      992M     55M    858M      6%    /var
/dev/amrd0s1e     66G     32G     29G     52%    /var/mail
procfs           4.0K    4.0K      0B    100%    /proc

So in short, I'm not too sure what's screwing up, and there's probably not much to go on here!

Thoughts?

thanks,

ajt.