From owner-freebsd-current@FreeBSD.ORG Mon Oct 4 13:47:30 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id D0C4216A4CE; Mon, 4 Oct 2004 13:47:30 +0000 (GMT) Received: from mailhost.tue.nl (mailhost.tue.nl [131.155.2.7]) by mx1.FreeBSD.org (Postfix) with ESMTP id 0A70D43D1F; Mon, 4 Oct 2004 13:47:30 +0000 (GMT) (envelope-from s492966@svstud.win.tue.nl) Received: by mailhost.tue.nl (Postfix, from userid 40) id C26E314BDDF; Mon, 4 Oct 2004 15:47:28 +0200 (CEST) Received: from svstud.win.tue.nl (svstud.win.tue.nl [131.155.69.100]) by mailhost.tue.nl (Postfix) with ESMTP id A95CD14BCF7; Mon, 4 Oct 2004 15:47:24 +0200 (CEST) Received: from svstud.win.tue.nl (localhost.win.tue.nl [127.0.0.1]) by svstud.win.tue.nl (8.12.8/8.12.6) with ESMTP id i94DlOvT020715; Mon, 4 Oct 2004 15:47:24 +0200 Received: (from s492966@localhost) by svstud.win.tue.nl (8.12.8/8.12.8/Submit) id i94DlObh020713; Mon, 4 Oct 2004 15:47:24 +0200 Date: Mon, 4 Oct 2004 15:47:24 +0200 From: r.c.ladan@student.tue.nl To: freebsd-hardware@freebsd.org, freebsd-current@freebsd.org Message-ID: <20041004134724.GA20675@svstud.win.tue.nl> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.4.1i X-Spam-Checker-Version: SpamAssassin 2.63 (2004-01-11) on mailhost.tue.nl X-Spam-DCC: dmv.com: mailhost.tue.nl 1181; Body=1 Fuz1=1 Fuz2=1 X-Spam-Status: No, hits=-4.7 required=6.3 tests=BAYES_00,NO_REAL_NAME autolearn=no version=2.63 X-Spam-Level: Subject: harddisk dying? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: r.c.ladan@student.tue.nl List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 04 Oct 2004 13:47:31 -0000 Hi, I have an issue with my /var partition, the others are allright. When I boot into single user, fsck_ufs /var gives no errors, both mounted and umounted. However, if I boot into multi user, fsck_ufs gives error messages, allthough at boot time the file system is clean. The messages vary: run 1: ------- ** /dev/ad0s1e (NO WRITE) ** Last Mounted on /var ** Phase 1 - Check Blocks and Sizes ** Phase 2 - Check Pathnames ** Phase 3 - Check Connectivity ** Phase 4 - Check Reference Counts UNREF FILE I=4241 OWNER=mysql MODE=100600 SIZE=2349 MTIME=Oct 4 11:38 2004 CLEAR? no UNREF FILE I=4257 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no UNREF FILE I=4272 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no ** Phase 5 - Check Cyl groups 2443 files, 39110 used, 25297 free (1481 frags, 2977 blocks, 2.3% fragmentation) ------ run 2: ------- ** /dev/ad0s1e (NO WRITE) ** Last Mounted on /var ** Phase 1 - Check Blocks and Sizes ** Phase 2 - Check Pathnames ** Phase 3 - Check Connectivity ** Phase 4 - Check Reference Counts UNREF FILE I=4241 OWNER=mysql MODE=100600 SIZE=2349 MTIME=Oct 4 11:41 2004 CLEAR? no UNREF FILE I=4257 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no UNREF FILE I=4272 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no ** Phase 5 - Check Cyl groups 2443 files, 39110 used, 25297 free (1481 frags, 2977 blocks, 2.3% fragmentation) ------ run 3: ------ ** /dev/ad0s1e (NO WRITE) ** Last Mounted on /var ** Phase 1 - Check Blocks and Sizes ** Phase 2 - Check Pathnames ** Phase 3 - Check Connectivity ** Phase 4 - Check Reference Counts UNREF FILE I=4241 OWNER=mysql MODE=100600 SIZE=2349 MTIME=Oct 4 11:50 2004 CLEAR? no UNREF FILE I=4257 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no UNREF FILE I=4272 OWNER=mysql MODE=100600 SIZE=0 MTIME=Oct 4 11:37 2004 CLEAR? no ** Phase 5 - Check Cyl groups 2443 files, 39119 used, 25288 free (1480 frags, 2976 blocks, 2.3% fragmentation) ------- Inode 4241 changed time between run 2 and run 3. The mysql user comes from the databases/mysql40-{client,server} ports. It probably hits the bad spot. But it continues to give errors if I turn mysql off : ** /dev/ad0s1e (NO WRITE) ** Last Mounted on /var ** Phase 1 - Check Blocks and Sizes INCORRECT BLOCK COUNT I=39 (4 should be 0) CORRECT? no ** Phase 2 - Check Pathnames ** Phase 3 - Check Connectivity ** Phase 4 - Check Reference Counts ** Phase 5 - Check Cyl groups FREE BLK COUNT(S) WRONG IN SUPERBLK SALVAGE? no SUMMARY INFORMATION BAD SALVAGE? no BLK(S) MISSING IN BIT MAPS SALVAGE? no 2439 files, 39163 used, 25243 free (1483 frags, 2970 blocks, 2.3% fragmentation) -------- And sometimes /var seems to be ok. I don't get ATA_READ / ATA_WRITE errors (yet). Does this mean my harddisk is somehow dying? It has been running continously for 2-3 years, and in use since September 2000. I went to the notebook service centre of the university this afternoon. The guy there said the disk was allright. He performed a surface and SMART scan from a CDROM. On the other hand, I read in the questions@ archive that surface scans are useless on FreeBSD systems. The NSC guy recommended to reinstall the software, but that seems pointless to me. My harddisk (from boot -v a few days ago, when everything was fine) --- isab0: at device 7.0 on pci0 isa0: on isab0 atapci0: port 0x1080-0x108f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 7.1 on pci0 atapci0: Reserved 0x10 bytes for rid 0x20 type 4 at 0x1080 ata0: channel #0 on atapci0 atapci0: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0 atapci0: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6 ata: ata0 already exists; skipping it ata: ata1 already exists; skipping it ata0-master: pio=0x0c wdma=0x22 udma=0x44 cable=80pin ata0-master: setting PIO4 on Intel PIIX4 chip ata0-master: setting UDMA33 on Intel PIIX4 chip ad0: ATA-5 disk at ata0-master ad0: 5729MB (11733120 sectors), 12416 C, 15 H, 63 S, 512 B ad0: 16 secs/int, 1 depth queue, UDMA33 ATAPI_RESET time = 340us GEOM: new disk ad0 [0] f:80 typ:165 s(CHS):0/1/1 e(CHS):1023/14/63 s:63 l:11733057 [1] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0 [2] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0 [3] f:00 typ:0 s(CHS):0/0/0 e(CHS):0/0/0 s:0 l:0 GEOM: Configure ad0s1, start 32256 length 6007325184 end 6007357439 GEOM: Configure ad0s1a, start 0 length 134217728 end 134217727 GEOM: Configure ad0s1b, start 134217728 length 402653184 end 536870911 GEOM: Configure ad0s1c, start 0 length 6007325184 end 6007325183 GEOM: Configure ad0s1d, start 536870912 length 33554432 end 570425343 GEOM: Configure ad0s1e, start 570425344 length 134217728 end 704643071 GEOM: Configure ad0s1f, start 704643072 length 5302682112 end 6007325183 Mounting root from ufs:/dev/ad0s1a --- I don't have PREEMPTION options in my kernel. I'm running 5.3BETA6 Regards, Rene