From owner-freebsd-questions@FreeBSD.ORG Tue May 4 12:50:44 2010 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id C8F151065679 for ; Tue, 4 May 2010 12:50:44 +0000 (UTC) (envelope-from bogdan@pgn.ro) Received: from mail-bw0-f228.google.com (mail-bw0-f228.google.com [209.85.218.228]) by mx1.freebsd.org (Postfix) with ESMTP id 6538F8FC19 for ; Tue, 4 May 2010 12:50:43 +0000 (UTC) Received: by bwz28 with SMTP id 28so1957324bwz.14 for ; Tue, 04 May 2010 05:50:38 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.42.6 with SMTP id q6mr46774bke.156.1272977438355; Tue, 04 May 2010 05:50:38 -0700 (PDT) Received: by 10.204.79.19 with HTTP; Tue, 4 May 2010 05:50:38 -0700 (PDT) Date: Tue, 4 May 2010 15:50:38 +0300 Message-ID: From: Bogdan Webb To: freebsd-questions@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: sata hdd issues, timeouts'n'failures X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 04 May 2010 12:50:44 -0000 Hi .. i'll be straight to the point, yesterday morning i've experienced some issues with my FreeBSD 7.2 p7 regarding HDD partition error messages. It all started a week ago when out of the bloom a few phpBB3 database tables got corupted and upon reading the messages in /var/log i've saw: May 3 09:34:36 pgn kernel: ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=143694719 May 3 09:34:40 pgn kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=234746399 May 3 09:35:20 pgn kernel: ad10: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=17996279 May 3 09:35:27 pgn kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=140791775 May 3 09:35:32 pgn kernel: ad10: TIMEOUT - WRITE_DMA retrying (1 retry left) LBA=14029855 ....a whole bunch of those about 1 minute apart..... May 3 09:47:09 pgn kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 88885, size: 32768 [................] May 3 09:48:46 pgn kernel: ad10: FAILURE - WRITE_DMA timed out LBA=14741375 May 3 09:48:46 pgn kernel: May 3 09:48:46 pgn kernel: g_vfs_done():ad10s1d[WRITE(offset=2715713536, length=16384)]error = 5 [.............] until the sistem became very slow and hard to use i've rebooted a few times, tryed to boot into single user mode and fun fsck but the issues still occur.. Now the GEOM_LABEL renames the ufsids timeouts still occur, and today ended up with May 4 15:26:24 pgn kernel: fsync: giving up on dirty May 4 15:26:24 pgn kernel: 0xffffff000395a7e0: tag devfs, type VCHR May 4 15:26:24 pgn kernel: usecount 1, writecount 0, refcount 934 mountedhere 0xffffff0003879c00 May 4 15:26:24 pgn kernel: flags () May 4 15:26:24 pgn kernel: v_object 0xffffff0003923e58 ref 0 pages 3725 May 4 15:26:24 pgn kernel: lock type devfs: EXCL (count 1) by thread 0xffffff00117e6370 (pid 1181) May 4 15:26:24 pgn kernel: dev ad10s1f May 4 15:26:40 pgn fsck: /dev/ad10s1f: CANNOT CREATE SNAPSHOT /usr/.snap/fsck_snapshot: Resource temporarily unavailable May 4 15:26:40 pgn fsck: May 4 15:26:40 pgn fsck: /dev/ad10s1f: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY. please advise it's pretty serious, i googled around but sincerly it's such a big issue that it can't wait :( thanks! p.s. whole /var/log/messages: http://pastebin.com/KcF3ziYu sistem info (df -h, uname, fstab, etc.): http://pastebin.com/dK8UKfhT