From owner-freebsd-current Wed Mar 5 9: 4: 3 2003 Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 4A79F37B401 for ; Wed, 5 Mar 2003 09:03:57 -0800 (PST) Received: from mail015.syd.optusnet.com.au (mail015.syd.optusnet.com.au [210.49.20.173]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6A52243F93 for ; Wed, 5 Mar 2003 09:03:55 -0800 (PST) (envelope-from phiggins1@mail.optushome.com.au) Received: from holly.mintern.st (c16947.rochd2.qld.optusnet.com.au [210.49.69.60]) by mail015.syd.optusnet.com.au (8.11.6/8.11.6) with ESMTP id h25H3rZ16130 for ; Thu, 6 Mar 2003 04:03:53 +1100 Received: from ace.mintern.st (ace.mintern.st [10.4.11.3]) by holly.mintern.st (8.12.7/8.12.7) with ESMTP id h25Gx5cn019246 for ; Thu, 6 Mar 2003 02:59:06 +1000 (EST) (envelope-from phil@mintern.st) Received: from phil by ace.mintern.st with local (Exim 3.36 #1 (Debian)) id 18qSrq-0002kJ-00 for ; Wed, 05 Mar 2003 16:59:06 +1000 Subject: Disk Read Errors under SMP on -current From: Philip Higgins To: freebsd-current@freebsd.org Content-Type: text/plain Content-Transfer-Encoding: 7bit Organization: Message-Id: <1046847545.19861.128.camel@ace.mintern.st> Mime-Version: 1.0 X-Mailer: Ximian Evolution 1.2.2 Date: 05 Mar 2003 16:59:06 +1000 Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Running -current updated less than a day ago (same problem existed a few days before), I am getting odd errors inserted in files (generally a string of five or so garbage characters randomly replacing file contents) when files are read. Doesn't seem to affected writes at all, but I didn't hang round long enough for exhaustive testing. Seems random as to which files are affected and when, and is causing plenty of programs to dump core (usually by seg fault, sometimes by illegal instruction). It's happening on all three of my disks, on two separate controllers. System is rock-solid stable with SMP disabled. Anyone got any ideas, or does this require further investigation and/or a proper problem report? (No SMP-incompatible options in my kernel or something stupid is there?) Motherboard is an Asus PCI/E-P54NP4, dmesg and kernel config follows. ******** Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.0-CURRENT #1: Wed Mar 5 23:40:24 EST 2003 root@hilly.mintern.st:/opt/obj/opt/src/sys/HILLY.smp Preloaded elf kernel "/boot/kernel/kernel" at 0xc0410000. Timecounter "i8254" frequency 1193182 Hz CPU: Pentium/P54C (133.33-MHz 586-class CPU) Origin = "GenuineIntel" Id = 0x52c Stepping = 12 Features=0x3bf real memory = 67108864 (64 MB) avail memory = 60743680 (57 MB) APIC_IO: MP table broken: 8259->APIC entry missing! Programming 16 pins in IOAPIC #0 EISA INTCONTROL = 0000c600 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000 cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000 Intel Pentium detected, installing workaround for F00F bug Allocating major#253 to "net" Allocating major#252 to "pci" npx0: on motherboard npx0: INT 16 interface pcibios: BIOS version 2.00 pcib0: at pcibus 0 on motherboard pci0: on pcib0 eisab0: at device 2.0 on pci0 eisa0: on eisab0 mainboard0: on eisa0 slot 0 ahc0: at 0x3c00-0x3cff, irq 14 (level) ahc0: on eisa0 slot 3 aic7770: Twin Channel, A SCSI Id=7, B SCSI Id=7, primary B, 4/253 SCBs bt0: at 0x330-0x333, 0x4c00-0x4cff bt0: irq 15 (level) on eisa0 slot 4 bt0: BT-742AH FW Rev. 3.31 Narrow SCSI Host Adapter, SCSI ID 7, 30 CCBs isa0: on eisab0 xl0: <3Com 3c905-TX Fast Etherlink XL> port 0xe800-0xe83f irq 10 at device 6.0 on pci0 xl0: Ethernet address: 00:10:4b:22:4c:b5 miibus0: on xl0 nsphy0: on miibus0 nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pci0: at device 7.0 (no driver attached) orm0: