Date: Wed, 08 Mar 2000 07:05:09 -0800 From: Cy Schubert - ITSD Open Systems Group <Cy.Schubert@uumail.gov.bc.ca> To: Bhishan Hemrajani <bhishan@cytosine.dhs.org> Cc: Chris Byrnes <chrisb@highstability.com>, Archie Cobbs <archie@whistle.com>, freebsd-stable@FreeBSD.ORG Subject: Re: Help: interrupt timeout Message-ID: <200003081505.HAA02774@cwsys.cwsent.com> In-Reply-To: Your message of "Mon, 06 Mar 2000 21:25:20 PST." <200003070525.VAA09958@cytosine.dhs.org>
next in thread | previous in thread | raw e-mail | index | archive | help
In message <200003070525.VAA09958@cytosine.dhs.org>, Bhishan Hemrajani writes: > I sometimes have this problem with my pc. > > Usually, a good kick will get it to boot. > > Sometimes on laptops and stuff the hard drive cable gets > loose or something. Or, it's a bad hard drive. > > Try giving is a shove. > > --bhishan It's quite possibly a hardware problem though I'm not entirely convinced that it's a fatal hardware problem. I have an IDE drive, wd2 below, wdc0 at 0x1f0-0x1f7 irq 14 flags 0xa0ffa0ff on isa wdc0: unit 0 (wd0): <WDC AC22100H>, DMA, 32-bit, multi-block-16 wd0: 2014MB (4124736 sectors), 4092 cyls, 16 heads, 63 S/T, 512 B/S wdc1 at 0x170-0x177 irq 15 flags 0xa0ffa0ff on isa wdc1: unit 0 (wd2): <WDC AC22500L>, DMA, 32-bit, multi-block-16 wd2: 2441MB (4999680 sectors), 4960 cyls, 16 heads, 63 S/T, 512 B/S that occasionally has these errors under heavy load conditions, like fscking a badly damaged filesystem. It's done this since it was brand new over four years ago. The Western Digital diagnostics (just checked last week) reported no grown defects, and the Western Digital firmware utility says it doesn't need a firmware upgrade. Interestingly enough, the errors weren't bad enough to halt the system or abort an fsck. Could it be a software problem? Maybe, maybe not. Notice wd0. It too is a Western Digital drive, just older. It's had no problems, ever. Having said all that, I had a couple of WD Caviar WDAC280 drives that would cause an error message once every two to four years. When they had errors, the Western Digital diagnostics would fix them up and they'd run happily for another few years. Each drive had a couple of grown defects. I retired them after 8 years of service. One is still running in a Windows 3.1 PC, 11 years after I purchased it and the other serves quite well as a paper weight and would probably run nicely if it was still in use. Regards, Phone: (250)387-8437 Cy Schubert Fax: (250)387-5766 Team Leader, Sun/DEC Team Internet: Cy.Schubert@osg.gov.bc.ca Open Systems Group, ITSD, ISTA Province of BC "COBOL IS A WASTE OF CARDS." > > > I've had the same problem. Most people have told me that I have to > > replace the hard drive. > > > > I never had the problem before 3.4. Maybe that's just a coincidence, > > though. > > > > > > -- > > Chris Byrnes (CB5820) > > Network Engineer, High Stability Internet Services > > http://www.highstability.com > > > > On Mon, 6 Mar 2000, Archie Cobbs wrote: > > > > > My laptop running 3.4-RELEASE decided it doesn't want to boot. > > > It was uncleanly shut down via the power switch by someone > > > who thought they were shutting down a different machine. > > > > > > Now when it boots, running fsck gives this result: > > > > > > > chip0: <Intel 82439TX System Controller (MTXC)> rev 0x01 on pci0.0.0 > > > > chip1: <Intel 82371AB PCI to ISA bridge> rev 0x02 on pci0.7.0 > > > > ide_pci0: <Intel PIIX4 Bus-master IDE controller> rev 0x01 on pci0.7.1 > > > > ... > > > > wdc0 at 0x1f0-0x3f7 irq 14 on isa > > > > wdc0: unit 0 (wd0): <TOSHIBA MK6409MAV> > > > > wd0: 6194MB (12685680 sectors), 13424 cyls, 15 heads, 63 S/T 512 B/S > > > > wdc1 at 0x170-0x177 irq 15 on isa > > > > wdc1: unit 0 (atapi): <MATSHITADVD-ROM SR-8171/058A>, removable, accel, > dma, iordis > > > > ... > > > > # fsck / > > > > *** /dev/rwd0s3a > > > > *** Last Mounted on / > > > > *** Root file system > > > > *** Phase 1 - Check Blocks and Sizes > > > > wd0: interrupt timeout (status 58<rdy,seekdone> error 0) > > > > wd0: wdtimeout DMA status 4 > > > > wd0: interrupt timeout (status 50<rdy,seekdone> error 1<no_dam>) > > > > wd0: wdtimeout DMA status 4 > > > > wd0: interrupt timeout (status 50<rdy,seekdone> error 1<no_dam>) > > > > wd0: wdtimeout DMA status 4 > > > > wd0: interrupt timeout (status 50<rdy,seekdone> error 1<no_dam>) > > > > wd0: wdtimeout DMA status 4 > > > > wd0: interrupt timeout (status 50<rdy,seekdone> error 1<no_dam>) > > > > wd0: wdtimeout DMA status 4 > > > > wd0: Last time I say: interrupt timeout. Probably a portable PC. (stat > us 50<rdy,seekdone> error 1<no_dam>) > > > > > > Well, yes in fact it is a portable PC :-) It just seems to hang > > > at this point, even though there seems to be disk activity (like > > > it's continuously retrying). > > > > > > This machine has run fine under this kernel since I installed > > > 3.4-REL a month ago or so. This same problem happens with the > > > 3.4-REL GENERIC kernel. > > > > > > Before this, it was running fine with a 3.0++ kernel and never > > > had this problem after many power cycles. > > > > > > Is there any hope in getting this machine to work again?? > > > Howabout disabling DMA? Is there some way to do that? > > > > > > Thanks for any pointers.. > > > -Archie > > > > > > _________________________________________________________________________ > __ > > > Archie Cobbs * Whistle Communications, Inc. * http://www.whistle.c > om > > > > > > > > > To Unsubscribe: send mail to majordomo@FreeBSD.org > > > with "unsubscribe freebsd-stable" in the body of the message > > > > > > > > > > > To Unsubscribe: send mail to majordomo@FreeBSD.org > > with "unsubscribe freebsd-hackers" in the body of the message > > > > > > To Unsubscribe: send mail to majordomo@FreeBSD.org > with "unsubscribe freebsd-stable" in the body of the message To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200003081505.HAA02774>