Date:      Mon, 4 Oct 2004 15:47:24 +0200
From:      r.c.ladan@student.tue.nl
To:        freebsd-hardware@freebsd.org, freebsd-current@freebsd.org
Subject:   harddisk dying?
Message-ID:  <20041004134724.GA20675@svstud.win.tue.nl>

Hi,

I have an issue with my /var partition; the others are all right.

When I boot into single-user mode, fsck_ufs /var reports no errors,
whether the partition is mounted or unmounted.
However, if I boot into multi-user mode, fsck_ufs reports errors,
although the file system is marked clean at boot time. The messages vary:

run 1:
-------
** /dev/ad0s1e (NO WRITE)
** Last Mounted on /var
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
UNREF FILE I=4241  OWNER=mysql MODE=100600
SIZE=2349 MTIME=Oct  4 11:38 2004 
CLEAR? no

UNREF FILE I=4257  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

UNREF FILE I=4272  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

** Phase 5 - Check Cyl groups
2443 files, 39110 used, 25297 free (1481 frags, 2977 blocks, 2.3% fragmentation)
------

run 2:
-------
** /dev/ad0s1e (NO WRITE)
** Last Mounted on /var
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
UNREF FILE I=4241  OWNER=mysql MODE=100600
SIZE=2349 MTIME=Oct  4 11:41 2004 
CLEAR? no

UNREF FILE I=4257  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

UNREF FILE I=4272  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

** Phase 5 - Check Cyl groups
2443 files, 39110 used, 25297 free (1481 frags, 2977 blocks, 2.3% fragmentation)
------

run 3:
------
** /dev/ad0s1e (NO WRITE)
** Last Mounted on /var
** Phase 1 - Check Blocks and Sizes
** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
UNREF FILE I=4241  OWNER=mysql MODE=100600
SIZE=2349 MTIME=Oct  4 11:50 2004 
CLEAR? no

UNREF FILE I=4257  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

UNREF FILE I=4272  OWNER=mysql MODE=100600
SIZE=0 MTIME=Oct  4 11:37 2004 
CLEAR? no

** Phase 5 - Check Cyl groups
2443 files, 39119 used, 25288 free (1480 frags, 2976 blocks, 2.3% fragmentation)
-------
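
The three runs above can also be compared mechanically rather than by eye.
What follows is only a minimal sketch in Python, not something from my
actual setup: the names unref_files and changed_inodes are made up for
illustration, and the regular expression assumes the UNREF FILE layout
exactly as fsck_ufs printed it above. The sample text is a shortened copy
of run 1, with run 2 derived from it by changing the one MTIME.

```python
import re

# Matches one "UNREF FILE" report in the layout shown in the runs above
# (assumption: the layout is stable; other fsck versions may differ).
UNREF_RE = re.compile(
    r"UNREF FILE I=(?P<inode>\d+)\s+OWNER=(?P<owner>\S+)\s+MODE=(?P<mode>\d+)\s+"
    r"SIZE=(?P<size>\d+)\s+MTIME=(?P<mtime>.+?)\s*\n"
)


def unref_files(fsck_output):
    """Return {inode: (owner, size, mtime)} for every UNREF FILE report."""
    return {
        int(m.group("inode")): (
            m.group("owner"),
            int(m.group("size")),
            m.group("mtime"),
        )
        for m in UNREF_RE.finditer(fsck_output)
    }


def changed_inodes(run_a, run_b):
    """Return the sorted inodes reported in both runs with differing details."""
    a, b = unref_files(run_a), unref_files(run_b)
    return sorted(i for i in a.keys() & b.keys() if a[i] != b[i])


# Shortened copy of run 1; run 2 differs only in the MTIME of inode 4241.
RUN1 = (
    "UNREF FILE I=4241  OWNER=mysql MODE=100600\n"
    "SIZE=2349 MTIME=Oct  4 11:38 2004\n"
    "CLEAR? no\n"
    "\n"
    "UNREF FILE I=4257  OWNER=mysql MODE=100600\n"
    "SIZE=0 MTIME=Oct  4 11:37 2004\n"
    "CLEAR? no\n"
)
RUN2 = RUN1.replace("11:38", "11:41")

if __name__ == "__main__":
    print(changed_inodes(RUN1, RUN2))
```

Feeding it the complete output of two runs would list the inodes whose
size or MTIME differs between them.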

The MTIME of inode 4241 changes between the runs (11:38, 11:41, 11:50),
and the used/free counts differ between run 2 and run 3.  The mysql user
comes from the databases/mysql40-{client,server} ports.  It probably hits
the bad spot.  But fsck_ufs continues to report errors if I turn mysql off:

** /dev/ad0s1e (NO WRITE)
** Last Mounted on /var
** Phase 1 - Check Blocks and Sizes
INCORRECT BLOCK COUNT I=39 (4 should be 0)
CORRECT? no

** Phase 2 - Check Pathnames
** Phase 3 - Check Connectivity
** Phase 4 - Check Reference Counts
** Phase 5 - Check Cyl groups
FREE BLK COUNT(S) WRONG IN SUPERBLK
SALVAGE? no

SUMMARY INFORMATION BAD
SALVAGE? no

BLK(S) MISSING IN BIT MAPS
SALVAGE? no

2439 files, 39163 used, 25243 free (1483 frags, 2970 blocks, 2.3% fragmentation)
--------

And sometimes /var seems to be ok.

I don't get ATA_READ / ATA_WRITE errors (yet).

Does this mean my hard disk is somehow dying?  It has been running
continuously for 2-3 years, and in use since September 2000.

I went to the notebook service centre of the university this afternoon.
The guy there said the disk was all right.  He performed a surface and
SMART scan from a CDROM.  On the other hand, I read in the questions@
archive that surface scans are useless on FreeBSD systems.  The NSC guy
recommended reinstalling the software, but that seems pointless to me.

My hard disk (from boot -v a few days ago, when everything was fine):

---
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX4 UDMA33 controller> port 0x1080-0x108f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 7.1 on pci0
atapci0: Reserved 0x10 bytes for rid 0x20 type 4 at 0x1080
ata0: channel #0 on atapci0
atapci0: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0
atapci0: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6
ata: ata0 already exists; skipping it
ata: ata1 already exists; skipping it
ata0-master: pio=0x0c wdma=0x22 udma=0x44 cable=80pin
ata0-master: setting PIO4 on Intel PIIX4 chip
ata0-master: setting UDMA33 on Intel PIIX4 chip
ad0: <FUJITSU MHK2060AT/D837> ATA-5 disk at ata0-master
ad0: 5729MB (11733120 sectors), 12416 C, 15 H, 63 S, 512 B
ad0: 16 secs/int, 1 depth queue, UDMA33
ATAPI_RESET time = 340us
GEOM: new disk ad0
[0] f:80 typ:165 s(CHS):0/1/1 e(CHS):1023/14/63 s:63 l:11733057
[1] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0
[2] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0
[3] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0
GEOM: Configure ad0s1, start 32256 length 6007325184 end 6007357439
GEOM: Configure ad0s1a, start 0 length 134217728 end 134217727
GEOM: Configure ad0s1b, start 134217728 length 402653184 end 536870911
GEOM: Configure ad0s1c, start 0 length 6007325184 end 6007325183
GEOM: Configure ad0s1d, start 536870912 length 33554432 end 570425343
GEOM: Configure ad0s1e, start 570425344 length 134217728 end 704643071
GEOM: Configure ad0s1f, start 704643072 length 5302682112 end 6007325183
Mounting root from ufs:/dev/ad0s1a
---
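
As a sanity check on the numbers in the boot -v output above, the
geometry and GEOM ranges can be verified with a little arithmetic. This
is only a sketch in Python: the helper names chs_sectors and geom_end
are made up for illustration, and the values are copied from the log.
It only shows that the reported sizes and partition boundaries are
self-consistent; it says nothing about the health of the disk.

```python
SECTOR_BYTES = 512  # "512 B" in the ad0 line


def chs_sectors(cylinders, heads, sectors_per_track):
    """Total sector count implied by a C/H/S geometry."""
    return cylinders * heads * sectors_per_track


def geom_end(start, length):
    """Last byte offset of a GEOM range, as printed in the 'end' field."""
    return start + length - 1


# (name, start, length, end) copied from the GEOM lines; ad0s1c is left
# out because it spans the whole slice.
PARTITIONS = [
    ("ad0s1a", 0, 134217728, 134217727),
    ("ad0s1b", 134217728, 402653184, 536870911),
    ("ad0s1d", 536870912, 33554432, 570425343),
    ("ad0s1e", 570425344, 134217728, 704643071),
    ("ad0s1f", 704643072, 5302682112, 6007325183),
]


def partitions_consistent(partitions):
    """True if every 'end' matches start+length-1 and ranges are contiguous."""
    expected_start = partitions[0][1]
    for _name, start, length, end in partitions:
        if start != expected_start or geom_end(start, length) != end:
            return False
        expected_start = end + 1
    return True


if __name__ == "__main__":
    total = chs_sectors(12416, 15, 63)
    print(total, "sectors,", total * SECTOR_BYTES // 2**20, "MB")
    print("/var (ad0s1e) size in MB:", 134217728 // 2**20)
    print("partition table consistent:", partitions_consistent(PARTITIONS))
```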

I don't have the PREEMPTION option in my kernel.  I'm running 5.3-BETA6.

Regards,
Rene


