Date: Tue, 17 Sep 2002 15:14:56 -0500
From: "Jack L. Stone" <jackstone@sage-one.net>
To: freebsd-stable@freebsd.org
Subject: Server lockups/crashes
Message-ID: <3.0.5.32.20020917151456.0118ee90@mail.sage-one.net>

I am running FreeBSD 4.5-RELEASE. I posted this on "questions", but suspect it belongs on this list.

For some months I have been getting hard lockups (requiring reboots) when I try to back up the ad0 HD to the ad1 HD on a busy server. It happens with either tar or dump/restore. Funny thing is that a dd image copy of the entire ad0 to ad1 doesn't lock up (so far, though I have done dozens of dds). As long as I don't kick in ad1 for big backups, the server is very stable.

I could sure use some ideas on how to figure this out. I don't know if it is hardware or software or both. Maybe it is related to the ata driver bug(?), but I thought that popped up in 4.6+.

Unfortunately, I haven't configured for a core dump, but here's the error log recorded. If anything here gives a hint, any help would be really appreciated as this is NOT GOOD!

BTW, the BIOS doesn't even show the ad1 IDE drive on the first reboot, which takes a hard switch reset. It then takes a complete power down/power up to bring the ad1 HD back in the BIOS....OUCH! This is scary for a server which is not very old. It just locked up a few minutes ago and I did the above, PLUS, fsck found errors to fix.

12:23PM up 39 mins, 5 users, load averages: 0.18, 0.17, 0.10

ERROR LOG
####################################################################
Sep 16 10:38:23 sage-one /kernel: ad1: WRITE command timeout tag=0 serv=0 - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1: WRITE command timeout tag=0 serv=0 - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1s1f: hard error writing fsbn 23516351 of 5466688-5466943 (ad1s1 bn 23516351; cn 1463 tn 210 sn 26)ata0-slave: timeout waiting for command=ef s=01 e=04
Sep 16 10:40:07 sage-one /kernel: ad1: timeout waiting for DRQ - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1: timeout waiting for DRQ - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: swap_pager: indefinite wait buffer: device: #ad/0x20001, blkno: 392, size: 4096
####################################################################
LOCKED UP FROM HERE ON......

============================================================================
Best regards,
Jack L. Stone,
Administrator

SageOne Net
http://www.sage-one.net
jackstone@sage-one.net

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message