Date: Mon, 23 Dec 2002 12:44:11 +0000 (GMT)
From: Gavin Atkinson
To: Jake Burkholder
Cc: freebsd-sparc@FreeBSD.ORG
Subject: Re: Hangs under load
In-Reply-To: <20021222144007.N61142-100000@ury.york.ac.uk>
Message-ID: <20021223122654.Q34530-100000@ury.york.ac.uk>
References: <20021210184226.W66997-100000@ury.york.ac.uk>
    <20021210141635.A84047@locore.ca>
    <20021216190450.N33658-100000@ury.york.ac.uk>
    <20021219030745.A4242@locore.ca>
    <20021222144007.N61142-100000@ury.york.ac.uk>

On Sun, 22 Dec 2002, Gavin Atkinson wrote:

> Sorry for not replying sooner, with the buildworld running on local disks,
> I still experience these lockups. I'm wondering if somehow the disk
> controller is getting wedged?

Indeed, this is what happens.
I left my machine running "make world" overnight, with top running over
the serial port, and came back to find the machine had hung. The last
messages on the serial console were:

ad0: READ command timeout tag=0 serv=0 - resetting
ata2: resetting devices...

And then it hung. Even processes that presumably do not access the disks
once loaded (e.g. top) had hung hard. The box does still respond to pings,
and prints "Power Failure Detected: Shutting down NOW." when the front
power button is pressed. Could it be that it never recovers from the ATA
issue and then never returns to userland from the kernel?

Extract from dmesg:

atapci0: port 0xc00020-0xc0002f,0xc00018-0xc0001b,0xc00010-0xc00017,0xc00008-0xc0000b,0xc00000-0xc00007 irq 32 at device 3.0 on pci2
ata2: at 0xc00000 on atapci0
ata3: at 0xc00010 on atapci0
ad0: 4103MB [8894/15/63] at ata2-master WDMA2

Where do I go from here?

Gavin
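
For anyone trying to confirm the same failure on their own box, a captured serial console log can be scanned for the two messages quoted above. The following is only a minimal sketch: the regular expressions are modelled on those two lines alone, and the function names (find_ata_events, ends_in_unfinished_reset) are hypothetical helpers, not part of any FreeBSD tool. Other driver versions may word the messages differently.

```python
import re

# Patterns modelled on the two console messages quoted in this report.
# Assumption: other ata driver versions may word these lines differently.
TIMEOUT_RE = re.compile(r"\b(ad\d+): (?:READ|WRITE) command timeout\b")
RESET_RE = re.compile(r"\b(ata\d+): resetting devices")


def find_ata_events(lines):
    """Return (line_number, kind, device) for each timeout or reset line."""
    events = []
    for number, line in enumerate(lines, start=1):
        match = TIMEOUT_RE.search(line)
        if match:
            events.append((number, "timeout", match.group(1)))
            continue
        match = RESET_RE.search(line)
        if match:
            events.append((number, "reset", match.group(1)))
    return events


def ends_in_unfinished_reset(lines):
    """True if the last non-blank line is a reset message, meaning
    nothing further was logged after the reset began."""
    non_blank = [line for line in lines if line.strip()]
    return bool(non_blank) and RESET_RE.search(non_blank[-1]) is not None


if __name__ == "__main__":
    sample = [
        "ad0: READ command timeout tag=0 serv=0 - resetting",
        "ata2: resetting devices...",
    ]
    for event in find_ata_events(sample):
        print(event)
    print("log ends during reset:", ends_in_unfinished_reset(sample))
```

A log that ends on the reset line, as the one in this report does, only shows that nothing more reached the console after the reset started. It does not by itself show where the kernel is stuck.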