From owner-freebsd-current Sun Jan 16 2: 3:44 2000 Delivered-To: freebsd-current@freebsd.org Received: from verdi.nethelp.no (verdi.nethelp.no [158.36.41.162]) by hub.freebsd.org (Postfix) with SMTP id 980FA14D03 for ; Sun, 16 Jan 2000 02:02:48 -0800 (PST) (envelope-from sthaug@nethelp.no) Received: (qmail 39368 invoked by uid 1001); 16 Jan 2000 10:02:01 +0000 (GMT) To: Alexander@Leidinger.net Cc: current@freebsd.org Subject: Re: crash with ffs_softdep.c 1.52 From: sthaug@nethelp.no In-Reply-To: Your message of "Sat, 15 Jan 2000 14:45:29 +0100 (CET)" References: <200001151345.OAA00557@Magelan.Leidinger.net> X-Mailer: Mew version 1.05+ on Emacs 19.34.2 Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Date: Sun, 16 Jan 2000 11:02:01 +0100 Message-ID: <39366.948016921@verdi.nethelp.no> Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG > unfortunally no core dump, but I'm able to reproduce it (I just have to > enable softupdates). > > ---snip--- > panic: softdep_disk_write_complete: lock is held > Debugger("panic") > Stopped at Debugger+0x36: mocb $0,in_Debugger.354 > db> trace > Debugger(x027c6a3) at Debugger+0x36 > panic(c028a900, c33618c0, c0a54000, c3361a10, c02a4a34) at panic+0x70 > softdep_disk_write_complete(c3361a10) at softdep_disk_write_complete+0x1f > vfs_backgroundwritedone(c3361a10, c3361a10, c0a54000, c0aa3400, c0aa3400) at vfs_backgroundwritedone+0x45 > biodone(c3361a10, c0a54014, c3361a10) at biodone+0x5a > dadone(c0aa0780, c0aa3400, c073ba20, 40000200, ffffffff) at dadone+0x202 > camisr(c02d93b4, c02a4cec, c0241b30, 4000200, c0256486) at camisr+0x1ed > swi_cambio(40000200, c0256486, c0255f21, 40000200, c0a32800) at swi_cambio+0xd > splz_swi(c073bb20, 0, 10, c0240010, 10) at splz_swi+0x14 > Xresume9() at Xresume9+0x2b > --- interrupt, eip: 0xc024c6ce, esp = 0xc02a4d34, ebp = 0 --- > default_halt() at default_halt+0x2 > ---snip-- I have a core dump here. -current from 15. January, ffs_softdep.c 1.52. Got one crash during buildworld (but didn't have core dump enabled), so last night I started a "make buildworld; rm -rf /usr/obj/whatever" loop. Sure enough, it crashed again the third time through the loop: (kgdb) where #0 boot (howto=260) at ../../kern/kern_shutdown.c:304 #1 0xc014ec9d in panic (fmt=0xc02740f4 "from debugger") at ../../kern/kern_shutdown.c:554 #2 0xc013122d in db_panic (addr=-1071349235, have_addr=0, count=-1, modif=0xcbb2acf4 "") at ../../ddb/db_command.c:433 #3 0xc01311cd in db_command (last_cmdp=0xc02a3e1c, cmd_table=0xc02a3c7c, aux_cmd_tablep=0xc02f087c) at ../../ddb/db_command.c:333 #4 0xc0131292 in db_command_loop () at ../../ddb/db_command.c:455 #5 0xc013331b in db_trap (type=3, code=0) at ../../ddb/db_trap.c:71 #6 0xc0247f54 in kdb_trap (type=3, code=0, regs=0xcbb2adfc) at ../../i386/i386/db_interface.c:158 #7 0xc025a878 in trap (frame={tf_fs = 24, tf_es = -1047396336, tf_ds = 16, tf_edi = -972208896, tf_esi = 256, tf_ebp = -877482428, tf_isp = -877482456, tf_ebx = -1071090464, tf_edx = -1070539680, tf_ecx = 1, tf_eax = 18, tf_trapno = 3, tf_err = 0, tf_eip = -1071349235, tf_cs = 8, tf_eflags = 582, tf_esp = -1071031421, tf_ss = -1071154158}) at ../../i386/i386/trap.c:531 #8 0xc024820d in Debugger (msg=0xc0277c12 "panic") at machine/cpufunc.h:64 #9 0xc014ec94 in panic (fmt=0xc02874e0 "initiate_write_inodeblock: already started") at ../../kern/kern_shutdown.c:552 #10 0xc01fa297 in initiate_write_inodeblock (inodedep=0xc183b200, bp=0xc60d4500) at ../../ufs/ffs/ffs_softdep.c:2856 #11 0xc01fa0df in softdep_disk_io_initiation (bp=0xc60d4500) at ../../ufs/ffs/ffs_softdep.c:2743 #12 0xc018514b in spec_strategy (ap=0xcbb2aecc) at ../../miscfs/specfs/spec_vnops.c:412 #13 0xc01701a3 in bwrite (bp=0xc60d4500) at vnode_if.h:923 #14 0xc0175622 in vop_stdbwrite (ap=0xcbb2af24) at ../../kern/vfs_default.c:319 #15 0xc0175459 in vop_defaultop (ap=0xcbb2af24) at ../../kern/vfs_default.c:138 #16 0xc0171120 in vfs_bio_awrite (bp=0xc60d4500) at vnode_if.h:1172 #17 0xc0184fc2 in spec_fsync (ap=0xcbb2af7c) at ../../miscfs/specfs/spec_vnops.c:357 #18 0xc0177c93 in sched_sync () at vnode_if.h:537 #19 0xc0248974 in fork_trampoline () Cannot access memory at address 0xffe. (kgdb) up 10 #10 0xc01fa297 in initiate_write_inodeblock (inodedep=0xc183b200, bp=0xc60d4500) at ../../ufs/ffs/ffs_softdep.c:2856 2856 panic("initiate_write_inodeblock: already started"); (kgdb) l 2851 struct fs *fs; 2852 ufs_lbn_t prevlbn = 0; 2853 int i, deplist; 2854 2855 if (inodedep->id_state & IOSTARTED) 2856 panic("initiate_write_inodeblock: already started"); 2857 inodedep->id_state |= IOSTARTED; 2858 fs = inodedep->id_fs; 2859 dp = (struct dinode *)bp->b_data + 2860 ino_to_fsbo(fs, inodedep->id_ino); I had written down the routine names from the first crash - this is exactly the same crash as first time (but different from the one that Alexander Leidinger is seeing). Other details: - Abit BP6 motherboard, 2 x Celeron 366, no overclocking. IBM DPTA-372730 disk on UDMA-66 controller. Softupdates enabled on all file systems. - Kernel config file and boot messages included below. The core dump is right now being copied over a 64 kbps line, to a more accessible location, should be available in a couple of hours. If anybody would like a copy, please send me email. Steinar Haug, Nethelp consulting, sthaug@nethelp.no ---------------------------------------------------------------------- machine i386 cpu I686_CPU ident BP6_MP_ATA maxusers 32 makeoptions DEBUG=-g #Build kernel with gdb(1) debug symbols options INET #InterNETworking options FFS #Berkeley Fast Filesystem options FFS_ROOT #FFS usable as root device [keep this!] options NFS #Network Filesystem options NFS_ROOT #NFS usable as root device, "NFS" req'ed options MSDOSFS #MSDOS Filesystem options CD9660 #ISO 9660 Filesystem options CD9660_ROOT #CD-ROM usable as root. "CD9660" req'ed options PROCFS #Process filesystem options COMPAT_43 #Compatible with BSD 4.3 [KEEP THIS!] options SCSI_DELAY=3000 #Be pessimistic about Joe SCSI device options UCONSOLE #Allow users to grab the console options USERCONFIG #boot -c editor options VISUAL_USERCONFIG #visual boot -c editor options KTRACE #ktrace(1) syscall trace support options SYSVSHM #SYSV-style shared memory options SYSVMSG #SYSV-style message queues options SYSVSEM #SYSV-style semaphores options SOFTUPDATES options MSGBUF_SIZE=32768 options INCLUDE_CONFIG_FILE # Include this file in kernel options DDB options SMP # Symmetric MultiProcessor Kernel options APIC_IO # Symmetric (APIC) I/O device isa0 device pci0 device fdc0 at isa? port IO_FD1 irq 6 drq 2 device fd0 at fdc0 drive 0 device ata0 device atadisk0 # ATA disk drives device atapicd0 # ATAPI CDROM drives device sym0 # NCR/Symbios Logic (do not mix with ncr, it conflicts) device scbus0 # SCSI bus (required) device da0 # Direct Access (disks) device sa0 # Sequential Access (tape etc) device cd0 # CD device pass0 # Passthrough device (direct SCSI access) device atkbdc0 at isa? port IO_KBD device atkbd0 at atkbdc? irq 1 device psm0 at atkbdc? irq 12 device vga0 at isa? port ? conflicts device sc0 at isa? device npx0 at nexus? port IO_NPX irq 13 device apm0 at nexus? disable flags 0x31 # Advanced Power Management device sio0 at isa? port IO_COM1 flags 0x10 irq 4 device sio1 at isa? port IO_COM2 irq 3 device ppc0 at isa? port? flags 0x40 irq 7 device ppbus0 # Parallel port bus (required) device lpt0 # Printer device plip0 # TCP/IP over parallel device ppi0 # Parallel port interface device device pcm0 device smbus0 device iicbus0 device iicbb0 device intpm0 device smb0 at smbus? device de0 # DEC/Intel DC21x4x (``Tulip'') device fxp0 # Intel EtherExpress PRO/100B (82557, 82558) device ti0 # Alteon Tigon based cards (Netgear GA620 etc) pseudo-device loop # Network loopback pseudo-device ether # Ethernet support pseudo-device ppp 1 # Kernel PPP pseudo-device tun # Packet tunnel. pseudo-device pty # Pseudo-ttys (telnet etc) pseudo-device gzip # Exec gzipped a.out's pseudo-device bpf #Berkeley packet filter device uhci0 # UHCI PCI->USB interface device ohci0 # OHCI PCI->USB interface device usb0 # USB Bus (required) device ugen0 # Generic ---------------------------------------------------------------------- Copyright (c) 1992-2000 The FreeBSD Project. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 4.0-CURRENT #0: Sat Jan 15 20:26:38 CET 2000 sthaug@bp6.nethelp.no:/local/freebsd-current/src/sys/compile/BP6_MP_ATA Timecounter "i8254" frequency 1193182 Hz CPU: Pentium II/Celeron (367.50-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x665 Stepping = 5 Features=0x183fbff real memory = 268435456 (262144K bytes) avail memory = 256917504 (250896K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 IOAPIC #0 intpin 16 -> irq 12 IOAPIC #0 intpin 17 -> irq 5 IOAPIC #0 intpin 18 -> irq 11 IOAPIC #0 intpin 19 -> irq 10 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc0363000. Pentium Pro MTRR support enabled npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 pcib1: at device 1.0 on pci0 pci1: on pcib1 vga-pci0: port 0x9000-0x90ff mem 0xe6000000-0xe6000fff,0xe4000000-0xe4ffffff irq 12 at device 0.0 on pci1 isab0: at device 7.0 on pci0 isa0: on isab0 ata-pci0: port 0xf000-0xf00f at device 7.1 on pci0 ata-pci0: Busmastering DMA supported ata0 at 0x01f0 irq 14 on ata-pci0 uhci0: port 0xa000-0xa01f irq 10 at device 7.2 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered Timecounter "PIIX" frequency 3579545 Hz intpm0: port 0x5000-0x500f irq 9 at device 7.3 on pci0 intpm0: I/O mapped 5000 intpm0: intr IRQ 9 enabled revision 0 smbus0: on intsmb0 smb0: on smbus0 intpm0: PM I/O mapped 4000 pcm0: port 0xa400-0xa43f irq 10 at device 9.0 on pci0 ti0: mem 0xec200000-0xec203fff irq 11 at device 11.0 on pci0 ti0: Ethernet address: 00:a0:cc:73:32:dd sym0: <875> port 0xa800-0xa8ff mem 0xec206000-0xec206fff,0xec207000-0xec2070ff irq 5 at device 13.0 on pci0 sym0: Tekram NVRAM, ID 7, Fast-20, SE, parity checking fxp0: port 0xac00-0xac1f mem 0xec000000-0xec0fffff,0xec205000-0xec205fff irq 12 at device 15.0 on pci0 fxp0: Ethernet address 00:90:27:1b:45:e6 fxp1: port 0xb000-0xb01f mem 0xec100000-0xec1fffff,0xec204000-0xec204fff irq 10 at device 17.0 on pci0 fxp1: Ethernet address 00:a0:c9:b6:df:4b ata-pci1: port 0xbc00-0xbcff,0xb800-0xb803,0xb400-0xb407 irq 11 at device 19.0 on pci0 ata-pci1: Busmastering DMA supported ata2 at 0xb400 irq 11 on ata-pci1 ata-pci2: port 0xc800-0xc8ff,0xc400-0xc403,0xc000-0xc007 irq 11 at device 19.1 on pci0 ata-pci2: Busmastering DMA supported fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: at port 0x60-0x6f on isa0 atkbd0: irq 1 on atkbdc0 vga0: at port 0x3b0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: on isa0 sc0: VGA <16 virtual consoles, flags=0x200> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: at port 0x378-0x37f irq 7 flags 0x40 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode APIC_IO: Testing 8254 interrupt delivery APIC_IO: routing 8254 via IOAPIC #0 intpin 2 SMP: AP CPU #1 Launched! ad0: ATA-4 disk at ata0 as master ad0: 9671MB (19807200 sectors), 19650 cyls, 16 heads, 63 S/T, 512 B/S ad0: 16 secs/int, 32 depth queue, UDMA33 ad1: ATA-4 disk at ata2 as master ad1: 26105MB (53464320 sectors), 53040 cyls, 16 heads, 63 S/T, 512 B/S ad1: 16 secs/int, 32 depth queue, UDMA66 Waiting 3 seconds for SCSI devices to settle Mounting root from ufs:/dev/ad0s1d WARNING: / was not properly dismounted cd0 at sym0 bus 0 target 3 lun 0 cd0: Removable CD-ROM SCSI-2 device cd0: 20.000MB/s transfers (20.000MHz, offset 15) cd0: cd present [337438 x 2048 byte records] To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message