From owner-freebsd-questions@FreeBSD.ORG Sat Mar 18 17:45:58 2006 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 59F2C16A41F for ; Sat, 18 Mar 2006 17:45:58 +0000 (UTC) (envelope-from matt@compar.com) Received: from daisy2.compar.com (mail.compar.com [199.243.196.35]) by mx1.FreeBSD.org (Postfix) with ESMTP id 391EF43D5D for ; Sat, 18 Mar 2006 17:45:51 +0000 (GMT) (envelope-from matt@compar.com) Received: from localhost (localhost.compar.com [127.0.0.1]) by daisy2.compar.com (Postfix) with ESMTP id D71B113C606 for ; Sat, 18 Mar 2006 12:44:10 -0500 (EST) Received: from unknown by localhost (amavisd-new, unix socket) id client-iNLMaq7x for ; Sat, 18 Mar 2006 12:44:05 -0500 (EST) Received: from hermes (CPE00062566c7bb-CM0011e6ede298.cpe.net.cable.rogers.com [70.28.254.189]) by daisy2.compar.com (Postfix) with SMTP id 7A4B413C5D8 for ; Sat, 18 Mar 2006 12:44:04 -0500 (EST) Message-ID: <002701c64ab3$f9adb4b0$1200a8c0@gsicomp.on.ca> From: "Matt Emmerton" To: Date: Sat, 18 Mar 2006 12:47:06 -0500 MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_NextPart_000_0024_01C64A8A.10332F50" X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2800.1506 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1506 X-Virus-Scanned: amavisd-new at compar.com Subject: 6.0-REL problems with ISA ed0, FFS corruption and ancient hardware X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 18 Mar 2006 17:45:58 -0000 This is a multi-part message in MIME format. ------=_NextPart_000_0024_01C64A8A.10332F50 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit I recently upgraded a 4.11-REL machine to 6.0-REL and have run into some snags. While the installation from CD went fine, after configuring and enabling my ed0 NIC, bad things start to happen. FWIW, this machine is an ancient (hardware circa 1991, BIOS circa 1994) dual-Pentium 133 MHz machine, with EISA/PCI and onboard SCSI. So far I can reliably reproduce two panics, one appears to be a ed driver bug (based on reports of similar panics with different NICs, notably nge) and one is a filesystem corruption problem. Here's the process that I go through to reliably reproduce both problems. 1) Boot machine in multi-user mode 2) After ifconfig ed0, machine panics with a trap 12 in ithread_loop. 3) In debugger, reset (or panic to get vmcore) 4) Reboot in multi-user mode, but set "hint.ed.0.disabled=1" in the boot loader (to avoid ifconifg panic) 5) Root filesystem is fsckd; all other filesystems are scheduled for background fsck 6) Encounter panic "ffs_valloc: dup alloc" 7) In debugger, reset (or panic to get vmcore) Attached is the full dmesg and stacktrace output from kgdb for the *second* panic, since I figure this is the more critical issue. -- Matt Emmerton ------=_NextPart_000_0024_01C64A8A.10332F50 Content-Type: text/plain; name="panic2.kgdb.txt" Content-Transfer-Encoding: quoted-printable Content-Disposition: attachment; filename="panic2.kgdb.txt" Script started on Sat Mar 18 12:58:13 2006=0A= root@gabby# kgdb /boot/kernel/kernel.debug vmcore.0 [GDB will not be able to debug user-mode threads: = /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] GNU gdb 6.1.1 [FreeBSD] Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you = are welcome to change it and/or distribute copies of it under certain = conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for = details. This GDB was configured as "i386-marcel-freebsd". Unread portion of the kernel message buffer: Copyright (c) 1992-2005 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 6.0-RELEASE #0: Sat Mar 18 12:00:50 EST 2006 root@gabby.gsicomp.on.ca:/usr2/obj/usr2/src/sys/GABBY.20060316.01 MPTable: Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Pentium/P54C (133.16-MHz 586-class CPU) Origin =3D "GenuineIntel" Id =3D 0x52c Stepping =3D 12 Features=3D0x3bf real memory =3D 50331648 (48 MB) avail memory =3D 43941888 (41 MB) Intel Pentium detected, installing workaround for F00F bug ioapic0: Changing APIC ID to 2 ioapic0 irqs 0-15 on motherboard npx0: [FAST] npx0: on motherboard npx0: INT 16 interface cpu0 on motherboard pcib0: pcibus 0 on motherboard pci0: on pcib0 eisab0: at device 2.0 on pci0 eisa0: on eisab0 mainboard0: on eisa0 slot 0 isa0: on eisab0 ahc0: port 0xf800-0xf8ff mem = 0xffbef000-0xffbeffff irq 11 at device 11.0 on pci0 ahc0: [GIANT-LOCKED] aic7870: Wide Channel A, SCSI Id=3D7, 16/253 SCBs orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xca7ff on isa0 atkbdc0: at port 0x60,0x64 on isa0 atkbd0: irq 1 on atkbdc0 atkbd0: [GIANT-LOCKED] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: model Generic PS/2 mouse, device ID 0 fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 = on isa0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppbus0: on ppc0 lpt0: on ppbus0 lpt0: Interrupt-driven port ppi0: on ppbus0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=3D0x300> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on = isa0 unknown: can't assign resources (irq) unknown: can't assign resources (port) unknown: can't assign resources (port) unknown: can't assign resources (port) unknown: can't assign resources (irq) unknown: can't assign resources (port) Timecounter "TSC" frequency 133160146 Hz quality 800 Timecounters tick every 1.000 msec Waiting 10 seconds for SCSI devices to settle cd0 at ahc0 bus 0 target 4 lun 0 cd0: Removable CD-ROM SCSI-2 device=20 cd0: 10.000MB/s transfers (10.000MHz, offset 15) cd0: Attempt to query device size failed: NOT READY, Medium not present da1 at ahc0 bus 0 target 5 lun 0 da1: Fixed Direct Access SCSI-2 device=20 da1: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing = Enabled da1: 2049MB (4197405 512 byte sectors: 64H 32S/T 2049C) da0 at ahc0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-2 device=20 da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing = Enabled da0: 2049MB (4197405 512 byte sectors: 64H 32S/T 2049C) Trying to mount root from ufs:/dev/da0s1a WARNING: / was not properly dismounted <118>Loading configuration files. <118>kernel dumps on /dev/da0s1b <118>Entropy harvesting: <118>. <118>swapon: adding /dev/da0s1b as swap device <118>Starting file system checks: <118>/dev/da0s1a: 1012 files, 21314 used, 52949 free (485 frags, 6558 = blocks, 0.7% fragmentation) <118>/dev/da0s1e: DEFER FOR BACKGROUND CHECKING <118>/dev/da0s1d: DEFER FOR BACKGROUND CHECKING <118>/dev/da1s1e: 147526 files, 1872872 used, 159266 free (754 frags, = 19814 blocks, 0.0% fragmentation) WARNING: /usr was not properly dismounted WARNING: /var was not properly dismounted mode =3D 040755, inum =3D 5, fs =3D /var panic: ffs_valloc: dup alloc KDB: enter: panic panic: from debugger Uptime: 1m52s Dumping 47 MB (2 chunks) chunk 0: 1MB (159 pages) ... ok chunk 1: 47MB (12032 pages) 32 16 #0 doadump () at pcpu.h:165 165 pcpu.h: No such file or directory. in pcpu.h (kgdb) where #0 doadump () at pcpu.h:165 #1 0xc04bdd1f in boot (howto=3D260) at = /usr2/src/sys/kern/kern_shutdown.c:399 #2 0xc04bdfe8 in panic (fmt=3D0xc05fd370 "from debugger") at /usr2/src/sys/kern/kern_shutdown.c:555 #3 0xc043d1a9 in db_panic (addr=3D-1068670697, have_addr=3D0, = count=3D-1,=20 modif=3D0xc52e2848 "") at /usr2/src/sys/ddb/db_command.c:438 #4 0xc043d140 in db_command (last_cmdp=3D0xc064bc24, cmd_table=3D0x0,=20 aux_cmd_tablep=3D0xc061d38c, aux_cmd_tablep_end=3D0xc061d390) at /usr2/src/sys/ddb/db_command.c:350 #5 0xc043d208 in db_command_loop () at = /usr2/src/sys/ddb/db_command.c:458 #6 0xc043ee15 in db_trap (type=3D3, code=3D0) at = /usr2/src/sys/ddb/db_main.c:221 #7 0xc04d6393 in kdb_trap (type=3D3, code=3D0, tf=3D0xc52e2988) at /usr2/src/sys/kern/subr_kdb.c:473 #8 0xc05e61f4 in trap (frame=3D {tf_fs =3D 8, tf_es =3D 40, tf_ds =3D 40, tf_edi =3D 1, tf_esi =3D = -1067380202, tf_ebp =3D -986830392, tf_isp =3D -986830412, tf_ebx =3D = -986830348, tf_edx =3D 0, tf_ecx =3D -1061072896, tf_eax =3D 18, = tf_trapno =3D 3, tf_err =3D 0, tf_eip =3D -1068670697, tf_cs =3D 32, = tf_eflags =3D 642, tf_esp =3D -986830360, tf_ss =3D -1068769417}) at /usr2/src/sys/i386/i386/trap.c:591 #9 0xc05d5cda in calltrap () at /usr2/src/sys/i386/i386/exception.s:139 #10 0xc04d6117 in kdb_enter (msg=3D0x12
) at cpufunc.h:60 #11 0xc04bdf77 in panic (fmt=3D0xc0611216 "ffs_valloc: dup alloc") at /usr2/src/sys/kern/kern_shutdown.c:539 #12 0xc0577db4 in ffs_valloc (pvp=3D0xc0e93dd0, mode=3D16877, = cred=3D0xc0d5be00,=20 vpp=3D0xc52e2a50) at /usr2/src/sys/ufs/ffs/ffs_alloc.c:933 #13 0xc0591234 in ufs_mkdir (ap=3D0xc52e2bb8) at /usr2/src/sys/ufs/ufs/ufs_vnops.c:1333 #14 0xc05ef828 in VOP_MKDIR_APV (vop=3D0x12, a=3D0xc52e2bb8) at = vnode_if.c:1251 #15 0xc051c4e5 in kern_mkdir (td=3D0xc0dc5a80,=20 path=3D0xbfbfef56
, = segflg=3DUIO_USERSPACE,=20 mode=3D511) at vnode_if.h:653 #16 0xc051c1c9 in mkdir (td=3D0xc0dc5a80, uap=3D0x12) at /usr2/src/sys/kern/vfs_syscalls.c:3301 #17 0xc05e6a67 in syscall (frame=3D {tf_fs =3D 59, tf_es =3D 59, tf_ds =3D 59, tf_edi =3D -1077940394, = tf_esi =3D 1, tf_ebp =3D -1077940632, tf_isp =3D -986829468, tf_ebx =3D = -1077940380, tf_edx =3D -1, tf_ecx =3D 672359652, tf_eax =3D 136, = tf_trapno =3D 12, tf_err =3D 2, tf_eip =3D 671833491, tf_cs =3D 51, = tf_eflags =3D 514, tf_esp =3D -1077940836, tf_ss =3D 59}) at /usr2/src/sys/i386/i386/trap.c:976 #18 0xc05d5d2f in Xint0x80_syscall () at /usr2/src/sys/i386/i386/exception.s:200 #19 0x00000033 in ?? () Previous frame inner to this frame (corrupt stack?) (kgdb) quit root@gabby# exit =0A= Script done on Sat Mar 18 12:58:43 2006=0A= ------=_NextPart_000_0024_01C64A8A.10332F50--