Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 17 Sep 2002 15:14:56 -0500
From:      "Jack L. Stone" <jackstone@sage-one.net>
To:        freebsd-stable@freebsd.org
Subject:   Server lockups/crashes
Message-ID:  <3.0.5.32.20020917151456.0118ee90@mail.sage-one.net>

next in thread | raw e-mail | index | archive | help
Am running FreeBSD 4.5-RELEASE

I posted this on "questions", bust suspect it should be on this list.

For some months I have been having hard lockups (requiring reboots) when I
try to do backups of the ad0 HD to ad1 HD on a busy server. It happens with
either tar or dump/restore. Funny thing is that a dd image copying of
entire ad0 to ad1 doesn't lock up (so far, but have done dozens of dds). As
long as I don't kick in the ad1 for big backups, the server is very stable.

I could sure use some ideas here on what to do to figure this out. Don't
know if it is hardware or software or both. Maybe related to the ata driver
bug(?), but thought that popped up in 4.6+.

Unfortunately, I haven't configured for a core dump, but, here's the error
log recorded. If anything here gives a hint, any help would be really
appreciated as this is NOT GOOD!

BTW, the BIOS doesn't even show the ad1 IDE on first reboot, which takes a
hard switch reset. Then a complete power down/power up to bring the ad1 HD
in the BIOS back....OUCH! Is scary for this server which is not very old.

Just locked a few minutes ago and did the above, PLUS, fsck found errors to
fix.

12:23PM  up 39 mins, 5 users, load averages: 0.18, 0.17, 0.10

ERROR LOG
####################################################################
Sep 16 10:38:23 sage-one /kernel: ad1: WRITE command timeout tag=0 serv=0 -
resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1: WRITE command timeout tag=0 serv=0 -
resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1s1f: hard error writing fsbn 23516351
of 5466688-5466943 (ad1s1 bn 23516351; cn 1463 tn 210 sn 26)ata0-slave:
timeout waiting for command=ef s=01 e=04
Sep 16 10:40:07 sage-one /kernel: ad1: timeout waiting for DRQ - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: ad1: timeout waiting for DRQ - resetting
Sep 16 10:40:07 sage-one /kernel: ata0: resetting devices .. done
Sep 16 10:40:07 sage-one /kernel: swap_pager: indefinite wait buffer:
device: #ad/0x20001, blkno: 392, size: 4096
####################################################################
LOCKED UP FROM HERE ON......
============================================================================

Best regards,
Jack L. Stone,
Administrator

SageOne Net
http://www.sage-one.net
jackstone@sage-one.net

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-stable" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3.0.5.32.20020917151456.0118ee90>