Date: Mon, 6 Dec 2004 19:51:56 -0500 From: Garance A Drosihn <drosih@rpi.edu> To: freebsd-current@freebsd.org Cc: =?iso-8859-1?Q?S=F8ren_Schmidt?= <sos@DeepCore.dk> Subject: Re: Another twist on WRITE_DMA issues <- ProblemFound Message-ID: <p06200751bddaace59ce2@[128.113.24.47]> In-Reply-To: <p06200749bdd9a8598b67@[128.113.24.47]> References: <p0620072dbdd5771efefe@[128.113.24.47]> <p06200749bdd9a8598b67@[128.113.24.47]>
next in thread | previous in thread | raw e-mail | index | archive | help
At 1:28 AM -0500 12/6/04, Garance A Drosihn wrote: >At 9:31 PM -0500 12/2/04, Garance A Drosihn wrote: >> >>I have now switched from that Western Digital drive to a Seagate >>Barracuda 7200.7 120-gig (ST3120026AS). The drive seems to be >>working fairly well, but now I sometimes see some combination >>like the following three lines: >> >>Dec 2 20:29:50 kernel: Interrupt storm detected on >> "irq20: atapci0"; throttling interrupt source >>Dec 2 20:29:54 kernel: ad4: TIMEOUT - WRITE_DMA retrying >> (2 retries left) LBA=20627679 >>Dec 2 20:29:54 kernel: ad4: FAILURE - WRITE_DMA timed out [skipping] >Just before I realized that SATA controller was the problem, I had >added: > hw.ata.ata_dma=0 >to /boot/loader.conf.local, ... >I removed that setting, rebooted, and I have now done a complete >buildworld/installworld cycle without seeing a single "interrupt >storm" or a single WRITE_DMA error. While the setting was still >there, I would always see at least a few of those warning messages >(and sometimes end up with a system panic). So, my hope is that >this has finally solved the last of my problems with this machine. That isn't it either. I think the hardware is just mocking me. I had zero problems for more than 24 hours. I then copied one set of partitions to another, booted up to that second set, and immediately I was back to having the above warnings/errors, and before long I had a system panic. And when I try to 'call doadump()', that fails with an error writing to the disk, so I can't get a core dump of it either. Maybe it's an overheating issue, or maybe it's something else. But whatever it is, I am going to assume it's the fault of something in my PC. -- Garance Alistair Drosehn = gad@gilead.netel.rpi.edu Senior Systems Programmer or gad@freebsd.org Rensselaer Polytechnic Institute or drosih@rpi.edu
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?p06200751bddaace59ce2>