From owner-freebsd-questions@FreeBSD.ORG Mon Feb 13 11:57:27 2006 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id C138516A422 for ; Mon, 13 Feb 2006 11:57:27 +0000 (GMT) (envelope-from freebsd@juiceless.net) Received: from smtp2.nblnetworks.fi (smtp2.nblnetworks.fi [217.30.182.231]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6194243D58 for ; Mon, 13 Feb 2006 11:57:22 +0000 (GMT) (envelope-from freebsd@juiceless.net) Received: from ssl.nebula.fi (webmail.nebula.fi [217.30.180.120]) by smtp2.nblnetworks.fi (8.13.1/8.12.8) with ESMTP id k1DC11kF013898 for ; Mon, 13 Feb 2006 14:01:02 +0200 Received: from 213.243.141.188 (SquirrelMail authenticated user juicelessnet3) by ssl.nebula.fi with HTTP; Mon, 13 Feb 2006 13:57:21 +0200 (EET) Message-ID: <60315.213.243.141.188.1139831841.squirrel@ssl.nebula.fi> Date: Mon, 13 Feb 2006 13:57:21 +0200 (EET) From: "Ville Lundberg" To: freebsd-questions@freebsd.org User-Agent: SquirrelMail/1.4.5 MIME-Version: 1.0 Content-Type: text/plain;charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal Subject: 6.0-release hanging without a clue (gmirror related?) X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 13 Feb 2006 11:57:27 -0000 Hi, I have a 6.0-release-p4 system that is hanging constantly after about a week of uptime. Nothing is printed to the logs - it just hangs, the HD light is stuck on. I don't know if anything is printed out on the console, as the system is at client's premises. The system is a Epox 4PDA3I mb (Intel ICH5 disc controller), Pentium 4 2,6GHz, 1Gb ram, 2 x WD Raptor 36,7Gb SATA harddrives on gmirror. It is very lightly stressed, as it's used for one database application only. Actually, I don't know if the system freezes completely, as it is used only thru Apache - these crashes are noticed by client when the app no longer responses. After a cold reboot, gmirror loses one of the hds (component broken, skipping). I have two theories: 1) gmirror (or fbsd sata stuff) is the cause for crash. The HD light thing is what makes me suspect this (hd action when freezing). And, when rebuilding the mirror, it failed with WRITE_DMA timeouts. I cleaned the first and last blocks of the failing hd, and then I was able to add it back to the mirror. Manufacturer disk diagnostics did not report any errors on either hds - so the cold reboot is the cause of dropping the hd from gmirror. 2) Apache 1.3.34, MySQL 4.1.16, mod_perl 1.29, Perl (don't remember exact version, but pkg_add -r perl from 6.0-release), is the fault. The system was upgraded recently when the new hds were installed, from FB 4.10-rel, Apache 1.3.19 (no mod_perl), MySQL 4.0.18 to the above mentioned. With 4.10, it was rock solid with nice uptimes like 176 days until maintenance had to reboot it... Anyone have ideas how to get to the bottom of the problem - to know why it freezes in the first place? Or know if any of the software versions mentioned above have some issues? I can provide dmesg and such if wanted. --Ville