Date: Fri, 20 Jul 2001 20:35:38 -0500
From: mikea
To: stable@FreeBSD.ORG
Subject: Re: system lookup
Message-ID: <20010720203538.A61352@mikea.ath.cx>
In-Reply-To: <42245809485.20010720194146@yahoo.com>

On Fri, Jul 20, 2001 at 07:41:46PM -0400, Peter Chiu wrote:
> Under heavy disk read on my maxtor 40GB ata66 HD, freebsd locks up and
> the following messages show up on the console.
>
> ad4: READ command timeout tag=0 serv=0 - resetting
> ata2: resetting devices...
>
> At this point, the HD light is on, and the whole system locked up. I
> have to do a hardware reset to reboot the system. This problem happen
> several times and my system uptime is around 3 - 10 days because of
> this.
>
> 1. Does anyone know what is wrong?
>
> 2. How to figure out where is the problem?
> Harddisk (Maxtor), motherboard (Abit BP6), freebsd or whatever.
>
>
> Info:-
> zeus:opt$ sysctl -a | grep ata
> kern.ipc.max_datalen: 136
> hw.ata.ata_dma: 1
> hw.ata.wc: 1
> hw.ata.tags: 0
> hw.ata.atapi_dma: 0
> hw.atamodes: dma,dma,dma,pio,dma,---,---,---,
>
>
> dmesg output:-
>
> CPU: Pentium II/Pentium II Xeon/Celeron (400.91-MHz 686-class CPU)
> Origin = "GenuineIntel" Id = 0x665 Stepping = 5
> Features=0x183fbff
> real memory = 268435456 (262144K bytes)
> avail memory = 258478080 (252420K bytes)
> Programming 24 pins in IOAPIC #0
> IOAPIC #0 intpin 2 -> irq 0
> FreeBSD/SMP: Multiprocessor motherboard
> cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
> cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
> io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
> Preloaded elf kernel "kernel" at 0xc02df000.
> Pentium Pro MTRR support enabled
> atapci0: port 0xf000-0xf00f at device 7.1 on pci0
> ata0: at 0x1f0 irq 14 on atapci0
> ata1: at 0x170 irq 15 on atapci0
> atapci1: port 0xd400-0xd4ff,0xd000-0xd003,0xcc00-0xcc07 irq 11 at device 19.0 on pci0
> ata2: at 0xcc00 on atapci1
> atapci2: port 0xe000-0xe0ff,0xdc00-0xdc03,0xd800-0xd807 irq 11 at device 19.1 on pci0
> ata3: at 0xd800 on atapci2
> ad0: 1625MB [3303/16/63] at ata0-master UDMA33
> ad1: 39082MB [79406/16/63] at ata0-slave UDMA33
> ad2: 1033MB [2100/16/63] at ata1-master WDMA2
> SMP: AP CPU #1 Launched!
> ad4: 39083MB [79408/16/63] at ata2-master UDMA66
> acd0: CDROM at ata1-slave using PIO3

[I've trimmed away all except CPU, controller, and drives.]

I gather that the lockup does not resolve after a reasonable time.

It appears that something is terribly wrong in the path to the data
on ad4. The message makes me think that ad4 itself failed to respond,
but I suppose it's also possible that ata2 dropped an interrupt or
other signal, and I think that would result in much the same symptom
set.

Have you tried setting ad4 to UDMA33 or PIO4 or PIO3, just to see if
it makes a difference?

Have you tried moving that disk to ata3, and leaving ata2 unused?

Have you tried changing any other variables to see what effect the
change(s) had?

-- 
Mike Andrews
mikea@mikea.ath.cx
Tired old sysadmin since 1964
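
When trying different transfer modes as suggested in this message, it may help to check which slot in the quoted hw.atamodes string belongs to ad4. The following is a minimal sketch, not part of the original thread. The function name decode_atamodes is hypothetical, and the slot ordering (ata0-master, ata0-slave, ata1-master, and so on) is an assumption that happens to agree with the quoted dmesg lines but is not confirmed anywhere in the thread.

```python
# Decode a hw.atamodes sysctl string into per-slot transfer modes.
# ASSUMPTION: entries are ordered ata0-master, ata0-slave, ata1-master, ...
# This matches the quoted dmesg output but is not confirmed by the thread.


def decode_atamodes(value):
    """Return a list of (channel, position, mode) tuples.

    mode is None for a slot reported as '---' (no device attached).
    """
    # The sysctl value ends with a trailing comma, so drop empty fields.
    modes = [m.strip() for m in value.split(",") if m.strip()]
    slots = []
    for index, mode in enumerate(modes):
        channel = "ata%d" % (index // 2)           # two slots per channel
        position = "master" if index % 2 == 0 else "slave"
        slots.append((channel, position, None if mode == "---" else mode))
    return slots


if __name__ == "__main__":
    # Value copied from the sysctl output quoted in the message.
    reported = "dma,dma,dma,pio,dma,---,---,---,"
    for channel, position, mode in decode_atamodes(reported):
        print("%s-%s: %s" % (channel, position, mode or "no device"))
```

With the quoted value, the ata2-master slot (where dmesg reports ad4) decodes as dma and the ata1-slave slot (acd0) as pio, which is consistent with the UDMA66 and PIO3 lines in the dmesg output. Under the same assumption, rerunning this after a mode change would show whether the ad4 slot actually switched.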