From owner-freebsd-hackers Wed Mar 22 19:56:50 2000 Delivered-To: freebsd-hackers@freebsd.org Received: from mailbox.reptiles.org (mailbox.reptiles.org [198.96.117.155]) by hub.freebsd.org (Postfix) with ESMTP id F0A4C37B96E for ; Wed, 22 Mar 2000 19:56:41 -0800 (PST) (envelope-from jim@reptiles.org) Received: from localhost (10092 bytes) by mailbox.reptiles.org via sendmail with P:stdio/R:bind_hosts/T:inet_zone_bind_smtp (sender: ) (ident using unix) id for ; Wed, 22 Mar 2000 22:56:39 -0500 (EST) (Smail-3.2.0.108 1999-Sep-19 #3 built 1999-Oct-27) Date: Wed, 22 Mar 2000 22:56:39 -0500 From: Jim Mercer To: hackers@freebsd.org Subject: possible bug in kernel/if_ether.c Message-ID: <20000322225639.T983@reptiles.org> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="2oS5YaxWCcQjTEyO" X-Mailer: Mutt 1.0i Sender: owner-freebsd-hackers@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG --2oS5YaxWCcQjTEyO Content-Type: text/plain; charset=us-ascii i have a new system, dual pentium 550's, 512meg ram, dual adaptec 29160, realtek ethernet. (dmesg.boot enclosed) it has been having intermittent reboots, with no syslog output. i thought maybe we were having heat issues, or power issues, but maybe it is a kernel bug. i enabled dumpdev and now i have dumps. (arg, and it just crashed while i was snarfing the attached info) both crashes occur in the same place in the kernel. i think maybe that indicates a potential kernel/driver bug. please find attached: dmesg.boot gdb -k info - crash 0 gdb -k info - crash 1 last second info: nuffy# ifconfig rl0 rl0: flags=8843 mtu 1500 inet 209.167.108.145 netmask 0xffffff80 broadcast 209.167.108.255 inet 209.167.108.146 netmask 0xffffffff broadcast 209.167.108.146 ether 00:00:21:ef:1d:a4 media: autoselect (none) status: active supported media: autoselect 100baseTX 100baseTX 10baseT/UTP 10baseT/UTP 100baseTX is it possible that the bug is being triggered because the driver has not selected a media type? i did notice that the ethernet performance sucked, but i wasn't sure if that was because the machine i was ftp'ing to sucked, or because this interface was not specifically set to a media. i elected not to set the media, because the machine is remote from me, and didn't want to have it lose the connection. i will physically visit the machine tomorrow. -- [ Jim Mercer jim@reptiles.org +1 416 506-0654 ] [ Reptilian Research -- Longer Life through Colder Blood ] [ Don't be fooled by cheap Finnish imitations; BSD is the One True Code. ] --2oS5YaxWCcQjTEyO Content-Type: text/plain; charset=us-ascii Content-Description: kgdb crash0 Content-Disposition: attachment; filename=snuffy1 ... SMP 2 cpus IdlePTD 3100672 initial pcb at 281960 panicstr: page fault panic messages: --- Fatal trap 12: page fault while in kernel mode mp_lock = 00000002; cpuid = 0; lapic.id = 00000000 fault virtual address = 0x8 fault code = supervisor read, page not present instruction pointer = 0x8:0xc01997c8 stack pointer = 0x10:0xff806fa4 frame pointer = 0x10:0xff806fac code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = Idle interrupt mask = <- SMP: XXX trap number = 12 panic: page fault mp_lock = 00000002; cpuid = 0; lapic.id = 00000000 boot() called on cpu#0 syncing disks... 11 11 ... #0 boot (howto=256) at ../../kern/kern_shutdown.c:304 304 dumppcb.pcb_cr3 = rcr3(); (kgdb) bt #0 boot (howto=256) at ../../kern/kern_shutdown.c:304 #1 0xc01549fc in poweroff_wait (junk=0xc0259c2f, howto=0) at ../../kern/kern_shutdown.c:554 #2 0xc022983b in trap_fatal (frame=0xff806f64, eva=8) at ../../i386/i386/trap.c:924 #3 0xc02294d1 in trap_pfault (frame=0xff806f64, usermode=0, eva=8) at ../../i386/i386/trap.c:817 #4 0xc02290a3 in trap (frame={tf_fs = 1107296280, tf_es = 16, tf_ds = -1056178160, tf_edi = -1, tf_esi = 0, tf_ebp = -8360020, tf_isp = -8360048, tf_ebx = 0, tf_edx = -1071016096, tf_ecx = 1, tf_eax = -1056143360, tf_trapno = 12, tf_err = 0, tf_eip = -1072064568, tf_cs = 8, tf_eflags = 66118, tf_esp = 0, tf_ss = 0}) at ../../i386/i386/trap.c:423 #5 0xc01997c8 in arpintr () at ../../netinet/if_ether.c:447 --2oS5YaxWCcQjTEyO Content-Type: text/plain; charset=us-ascii Content-Description: kgdb crash1 Content-Disposition: attachment; filename=snuffy2 SMP 2 cpus IdlePTD 3100672 initial pcb at 281960 panicstr: page fault panic messages: --- Fatal trap 12: page fault while in kernel mode mp_lock = 01000002; cpuid = 1; lapic.id = 01000000 fault virtual address = 0x8 fault code = supervisor read, page not present instruction pointer = 0x8:0xc01997c8 stack pointer = 0x10:0xff80dfa4 frame pointer = 0x10:0xff80dfac code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, def32 1, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = Idle interrupt mask = <- SMP: XXX trap number = 12 panic: page fault mp_lock = 01000002; cpuid = 1; lapic.id = 01000000 boot() called on cpu#1 syncing disks... 48 47 21 .... #0 boot (howto=256) at ../../kern/kern_shutdown.c:304 #1 0xc01549fc in poweroff_wait (junk=0xc0259c2f, howto=0) at ../../kern/kern_shutdown.c:554 #2 0xc022983b in trap_fatal (frame=0xff80df64, eva=8) at ../../i386/i386/trap.c:924 #3 0xc02294d1 in trap_pfault (frame=0xff80df64, usermode=0, eva=8) at ../../i386/i386/trap.c:817 #4 0xc02290a3 in trap (frame={tf_fs = 1107296280, tf_es = 16, tf_ds = -1056178160, tf_edi = -1, tf_esi = 1, tf_ebp = -8331348, tf_isp = -8331376, tf_ebx = 0, tf_edx = -1071016096, tf_ecx = 16777217, tf_eax = -1056133376, tf_trapno = 12, tf_err = 0, tf_eip = -1072064568, tf_cs = 8, tf_eflags = 66118, tf_esp = 0, tf_ss = 1}) at ../../i386/i386/trap.c:423 #5 0xc01997c8 in arpintr () at ../../netinet/if_ether.c:447 --2oS5YaxWCcQjTEyO Content-Type: text/plain; charset=us-ascii Content-Description: dmesg.boot Content-Disposition: attachment; filename="dmesg.boot" Copyright (c) 1992-2000 The FreeBSD Project. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 4.0-STABLE #2: Tue Mar 21 03:48:14 EST 2000 toor@snuffy.nag.ca:/usr/src/sys/compile/NAG-ORANGE Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon (551.25-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x673 Stepping = 3 Features=0x383fbff real memory = 536870912 (524288K bytes) avail memory = 517988352 (505848K bytes) Programming 24 pins in IOAPIC #0 IOAPIC #0 intpin 2 -> irq 0 FreeBSD/SMP: Multiprocessor motherboard cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Preloaded elf kernel "kernel" at 0xc02e3000. Pentium Pro MTRR support enabled md0: Malloc disk npx0: on motherboard npx0: INT 16 interface pcib0: on motherboard pci0: on pcib0 pcib1: at device 1.0 on pci0 pci1: on pcib1 pci1: at 0.0 irq 0 isab0: at device 7.0 on pci0 isa0: on isab0 pci0: at 7.1 pci0: at 7.2 Timecounter "PIIX" frequency 3579545 Hz chip1: port 0x440-0x44f at device 7.3 on pci0 ahc0: port 0xe800-0xe8ff mem 0xffaff000-0xffafffff irq 16 at device 16.0 on pci0 ahc0: aic7892 Wide Channel A, SCSI Id=7, 16/255 SCBs ahc1: port 0xe400-0xe4ff mem 0xffafe000-0xffafefff irq 17 at device 17.0 on pci0 ahc1: aic7892 Wide Channel A, SCSI Id=7, 16/255 SCBs rl0: port 0xe000-0xe0ff mem 0xffafdf00-0xffafdfff irq 18 at device 18.0 on pci0 rl0: Ethernet address: 00:00:21:ef:1d:a4 miibus0: on rl0 rlphy0: on miibus0 rlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: at port 0x60-0x6f on isa0 atkbd0: irq 1 on atkbdc0 vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: on isa0 sc0: VGA <16 virtual consoles, flags=0x200> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode ppi0: on ppbus0 lpt0: on ppbus0 lpt0: Interrupt-driven port plip0: on ppbus0 APIC_IO: Testing 8254 interrupt delivery APIC_IO: routing 8254 via IOAPIC #0 intpin 2 IP packet filtering initialized, divert disabled, rule-based forwarding enabled, default to accept, logging limited to 100 packets/entry by default DUMMYNET initialized (000106) Waiting 15 seconds for SCSI devices to settle SMP: AP CPU #1 Launched! da0 at ahc0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-2 device da0: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled da0: 8704MB (17827698 512 byte sectors: 255H 63S/T 1109C) da1 at ahc1 bus 0 target 0 lun 0 da1: Fixed Direct Access SCSI-2 device da1: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled da1: 34746MB (71161520 512 byte sectors: 255H 63S/T 4429C) da2 at ahc1 bus 0 target 1 lun 0 da2: Fixed Direct Access SCSI-2 device da2: 80.000MB/s transfers (40.000MHz, offset 63, 16bit), Tagged Queueing Enabled da2: 34746MB (71161520 512 byte sectors: 255H 63S/T 4429C) Mounting root from ufs:/dev/da0s2a WARNING: / was not properly dismounted --2oS5YaxWCcQjTEyO-- To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-hackers" in the body of the message