From owner-freebsd-questions@FreeBSD.ORG Fri Aug 22 12:01:03 2003 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 715D516A4BF for ; Fri, 22 Aug 2003 12:01:03 -0700 (PDT) Received: from lakemtao03.cox.net (lakemtao03.cox.net [68.1.17.242]) by mx1.FreeBSD.org (Postfix) with ESMTP id 71FDA43FE5 for ; Fri, 22 Aug 2003 12:01:02 -0700 (PDT) (envelope-from rjhjr@cox.net) Received: from kongemord.krig.net ([68.100.111.121]) by lakemtao03.cox.net (InterMail vM.5.01.06.04 201-253-122-130-104-20030726) with SMTP id <20030822190102.QRSB10977.lakemtao03.cox.net@kongemord.krig.net> for ; Fri, 22 Aug 2003 15:01:02 -0400 Received: by kongemord.krig.net (sSMTP sendmail emulation); Fri, 22 Aug 2003 15:01:01 -0400 From: "Bob Hall" Date: Fri, 22 Aug 2003 15:01:01 -0400 To: freebsd-questions@freebsd.org Message-ID: <20030822190101.GB8719@kongemord.krig.net> Mail-Followup-To: freebsd-questions@freebsd.org References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.1i Subject: Re: Something rotten in the OS? X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 22 Aug 2003 19:01:03 -0000 On Fri, Aug 22, 2003 at 10:13:30AM -0700, Steve Warwick wrote: > Hey All, > > I have just had my 2nd HD crash in a year - different machine, different > type of drive etc - the only consistency is the OS and the installed > software. > > It looks like this is some kind of overflow???? > > This problem seems to start after about a month and is indicated by there > being fragments of the kernel config data in the daily kernel log messages. > I asked about that on this list but people seemed to think it was just some > kind of log rotation. > > I caught the machine a couple of months ago with nearly all the swap used > (800meg out of 1 gig) and rebooted which kept the machine happy. > > I recently noticed the kernel log messages had the config fragments again > and was going to to a reboot - but alas too late. The machine is down, HD is > damaged and we are trying to get data off the drive... > > This is exactly the problem that occurred 6 months ago. That time I put it > down to lousy overheated hosting but now I don't have that excuse. > > Has anyone seen this before? > > Does FreeBSD have to be re-booted every month for safety? FBSD has a growing niche with businesses where a small number of admins manage large numbers of hosts under continuous and heavy use. So no, it usually doesn't have to be re-booted every month. And I doubt the problem is the OS. You haven't said what hardware you're using or what you are using FBSD for. FBSD seems to be more susceptible to having random bits reset by voltage fluctuations than other OSs. That fact that the problem recurred on another machine doesn't rule out hardware problems; I've seen lots of defective desktop machines. If you are moving from one lowest-bidder machine to another, you could easily be moving from one collection of incompatible parts to another. And you haven't said anything about what software you are running or what your are using it for. Some software is more prone to problems than others, your software may be misconfigured in some subtle way, or you may be using it in ways it doesn't like. And finally, every OS has its own unique set of weaknesses, so sometimes you can fix a problem by changing the OS. Whether the new OS is better or worse is irrelevant, as long as it isn't susceptible to the problems that are crashing the current OS. Bob Hall