From owner-freebsd-current Wed Dec 10 09:24:43 1997 Return-Path: Received: (from root@localhost) by hub.freebsd.org (8.8.7/8.8.7) id JAA29713 for current-outgoing; Wed, 10 Dec 1997 09:24:43 -0800 (PST) (envelope-from owner-freebsd-current) Received: from home.dragondata.com (toasty@home.dragondata.com [204.137.237.2]) by hub.freebsd.org (8.8.7/8.8.7) with ESMTP id JAA29693 for ; Wed, 10 Dec 1997 09:24:24 -0800 (PST) (envelope-from toasty@home.dragondata.com) Received: (from toasty@localhost) by home.dragondata.com (8.8.5/8.8.5) id LAA19545; Wed, 10 Dec 1997 11:24:09 -0600 (CST) From: Kevin Day Message-Id: <199712101724.LAA19545@home.dragondata.com> Subject: Re: 971208 and the nfs rollback didn't fix it... In-Reply-To: <199712101626.LAA00918@dyson.iquest.net> from "John S. Dyson" at "Dec 10, 97 11:26:13 am" To: toor@dyson.iquest.net (John S. Dyson) Date: Wed, 10 Dec 1997 11:24:09 -0600 (CST) Cc: karl@Mcs.Net, freebsd-current@freebsd.org X-Mailer: ELM [version 2.4ME+ PL31 (25)] MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: owner-freebsd-current@freebsd.org X-Loop: FreeBSD.org Precedence: bulk > > > > > > Kevin > > > > Post a "dmesg" with the boot log in it. What ethernet card are you using? > > > I am not an NFS "expert", so you guys tell me exactly what the symptoms are, > and I will TRY to help work out the problems. > > -- > John > dyson@freebsd.org > jdyson@nc.com > I'm using a 3C509B... They've worked great, up until we started using NFS... One machine running 2.2 has been up more than 200 days, with a 509B, with no indication of a memory leak, with heavy net traffic.. Dec 10 05:03:11 shell /kernel: Copyright (c) 1992-1997 FreeBSD Inc. Dec 10 05:03:11 shell /kernel: Copyright (c) 1982, 1986, 1989, 1991, 1993 Dec 10 05:03:11 shell /kernel: The Regents of the University of California. All rights reserved. Dec 10 05:03:11 shell /kernel: FreeBSD 3.0-CURRENT #0: Mon Nov 24 19:26:11 CST 1997 Dec 10 05:03:11 shell /kernel: toasty@shell.dragondata.com:/usr/src/sys/compile/SHELL Dec 10 05:03:11 shell /kernel: CPU: Pentium (586-class CPU) Dec 10 05:03:11 shell /kernel: Origin = "GenuineIntel" Id = 0x52c Stepping=12 Dec 10 05:03:11 shell /kernel: Features=0x3bf Dec 10 05:03:11 shell /kernel: real memory = 83886080 (81920K bytes) Dec 10 05:03:11 shell /kernel: avail memory = 78573568 (76732K bytes) Dec 10 05:03:11 shell /kernel: FreeBSD/SMP: Multiprocessor motherboard Dec 10 05:03:11 shell /kernel: cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000 Dec 10 05:03:11 shell /kernel: cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000 Dec 10 05:03:11 shell /kernel: io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Dec 10 05:03:11 shell /kernel: Probing for devices on PCI bus 0: Dec 10 05:03:12 shell /kernel: chip0: rev 0x03 on pci0.0.0 Dec 10 05:03:12 shell /kernel: chip1: rev 0x01 on pci0.7.0 Dec 10 05:03:12 shell /kernel: ide_pci0: rev 0x00 on pci0.7.1 Dec 10 05:03:12 shell /kernel: Probing for devices on the ISA bus: Dec 10 05:03:12 shell /kernel: sc0 at 0x60-0x6f irq 1 on motherboard Dec 10 05:03:12 shell /kernel: sc0: VGA color <16 virtual consoles, flags=0x0> Dec 10 05:03:12 shell /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa Dec 10 05:03:12 shell /kernel: sio0: type 16550A Dec 10 05:03:12 shell /kernel: sio1 at 0x2f8-0x2ff irq 3 on isa Dec 10 05:03:12 shell /kernel: sio1: type 16550A Dec 10 05:03:12 shell /kernel: lpt0 at 0x378-0x37f irq 7 on isa Dec 10 05:03:12 shell /kernel: lpt0: Interrupt-driven port Dec 10 05:03:12 shell /kernel: lp0: TCP/IP capable interface Dec 10 05:03:13 shell /kernel: lpt1 not found Dec 10 05:03:13 shell /kernel: mse0 not found at 0x23c Dec 10 05:03:13 shell /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa Dec 10 05:03:13 shell /kernel: wdc0 at 0x1f0-0x1f7 irq 14 on isa Dec 10 05:03:13 shell /kernel: wdc0: unit 0 (wd0): Dec 10 05:03:13 shell /kernel: wd0: 2014MB (4124736 sectors), 4092 cyls, 16 heads, 63 S/T, 512 B/S Dec 10 05:03:13 shell /kernel: wdc1 not found at 0x170 Dec 10 05:03:13 shell /kernel: 1 3C5x9 board(s) on ISA found at 0x300 Dec 10 05:03:13 shell /kernel: ep0 at 0x300-0x30f irq 10 on isa Dec 10 05:03:13 shell /kernel: ep0: utp[*UTP*] address 00:60:97:6b:88:f4 Dec 10 05:03:13 shell /kernel: npx0 on motherboard Dec 10 05:03:13 shell /kernel: npx0: INT 16 interface Dec 10 05:03:13 shell /kernel: APIC_IO: routing 8254 via 8259 on pin 0 Dec 10 05:03:14 shell /kernel: SMP: AP CPU #1 Launched! Dec 10 05:03:14 shell /kernel: WARNING: / was not properly dismounted. Dec 10 05:03:14 shell named[80]: starting. named 4.9.6-REL Sat Nov 8 08:44:53 GMT 1997 root@make.ican.net:/usr/obj/usr/src/usr.sbin/named Dec 10 05:03:14 shell named[81]: Ready to answer queries. Dec 10 05:23:10 shell /kernel: Copyright (c) 1992-1997 FreeBSD Inc. Dec 10 05:23:10 shell /kernel: Copyright (c) 1982, 1986, 1989, 1991, 1993 Dec 10 05:23:10 shell /kernel: The Regents of the University of California. All rights reserved. Dec 10 05:23:10 shell /kernel: FreeBSD 3.0-CURRENT #0: Mon Nov 24 19:26:11 CST 1997 Dec 10 05:23:10 shell /kernel: toasty@shell.dragondata.com:/usr/src/sys/compile/SHELL Dec 10 05:23:10 shell /kernel: CPU: Pentium (586-class CPU) Dec 10 05:23:10 shell /kernel: Origin = "GenuineIntel" Id = 0x52c Stepping=12 Dec 10 05:23:10 shell /kernel: Features=0x3bf Dec 10 05:23:11 shell /kernel: real memory = 83886080 (81920K bytes) Dec 10 05:23:11 shell /kernel: avail memory = 78573568 (76732K bytes) Dec 10 05:23:11 shell /kernel: FreeBSD/SMP: Multiprocessor motherboard Dec 10 05:23:11 shell /kernel: cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000 Dec 10 05:23:11 shell /kernel: cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000 Dec 10 05:23:11 shell /kernel: io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 Dec 10 05:23:11 shell /kernel: Probing for devices on PCI bus 0: Dec 10 05:23:11 shell /kernel: chip0: rev 0x03 on pci0.0.0 Dec 10 05:23:11 shell /kernel: chip1: rev 0x01 on pci0.7.0 Dec 10 05:23:11 shell /kernel: ide_pci0: rev 0x00 on pci0.7.1 Dec 10 05:23:11 shell /kernel: Probing for devices on the ISA bus: Dec 10 05:23:11 shell /kernel: sc0 at 0x60-0x6f irq 1 on motherboard Dec 10 05:23:11 shell /kernel: sc0: VGA color <16 virtual consoles, flags=0x0> Dec 10 05:23:11 shell /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa Dec 10 05:23:12 shell /kernel: sio0: type 16550A Dec 10 05:23:12 shell /kernel: sio1 at 0x2f8-0x2ff irq 3 on isa Dec 10 05:23:12 shell /kernel: sio1: type 16550A Dec 10 05:23:12 shell /kernel: lpt0 at 0x378-0x37f irq 7 on isa Dec 10 05:23:12 shell /kernel: lpt0: Interrupt-driven port Dec 10 05:23:12 shell /kernel: lp0: TCP/IP capable interface Dec 10 05:23:12 shell /kernel: lpt1 not found Dec 10 05:23:12 shell /kernel: mse0 not found at 0x23c Dec 10 05:23:12 shell /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa Dec 10 05:23:12 shell /kernel: wdc0 at 0x1f0-0x1f7 irq 14 on isa Dec 10 05:23:12 shell /kernel: wdc0: unit 0 (wd0): Dec 10 05:23:12 shell /kernel: wd0: 2014MB (4124736 sectors), 4092 cyls, 16 heads, 63 S/T, 512 B/S Dec 10 05:23:13 shell /kernel: wdc1 not found at 0x170 Dec 10 05:23:13 shell /kernel: 1 3C5x9 board(s) on ISA found at 0x300 Dec 10 05:23:13 shell /kernel: ep0 at 0x300-0x30f irq 10 on isa Dec 10 05:23:13 shell /kernel: ep0: utp[*UTP*] address 00:60:97:6b:88:f4 Dec 10 05:23:13 shell /kernel: npx0 on motherboard Dec 10 05:23:13 shell /kernel: npx0: INT 16 interface Dec 10 05:23:13 shell /kernel: APIC_IO: routing 8254 via 8259 on pin 0 Dec 10 05:23:13 shell /kernel: SMP: AP CPU #1 Launched! Dec 10 05:23:13 shell /kernel: WARNING: / was not properly dismounted. (when it probes for wdc1, it sits there no less than 10 minutes. I really thought it was hung the first few times it did it... is that normal?) It leaves no log as to why it crashes, and the screen is just full of: I'm on CPU #1, waiting to be on CPU #0... (or whatever the message is) It scrolls so quickly I miss the panic message. (another idea: after a kernel panic, unblank the screen, and disable the screen saver.... the daemon screensaver will keep going, even after a nasty panic.... I walk by the machine, see chuck running around on there, and assume everything's ok) Here's a copy of the output from mptable: =============================================================================== MPTable, version 2.0.15 ------------------------------------------------------------------------------- MP Floating Pointer Structure: location: BIOS physical address: 0x000f5c50 signature: '_MP_' length: 16 bytes version: 1.1 checksum: 0xd4 mode: Virtual Wire ------------------------------------------------------------------------------- MP Config Table Header: physical address: 0x000f5c64 signature: 'PCMP' base table length: 396 version: 1.1 checksum: 0x3a OEM ID: 'OEM00000' Product ID: 'PROD00000000' OEM table pointer: 0x00000000 OEM table size: 0 entry count: 28 local APIC address: 0xfee00000 extended table length: 0 extended table checksum: 0 ------------------------------------------------------------------------------- MP Config Base Table Entries: -- Processors: APIC ID Version State Family Model Step Flags 0 0x11 BSP, usable 5 2 1 0x07bf 1 0x11 AP, usable 5 2 1 0x07bf -- Bus: Bus ID Type 0 PCI 1 ISA -- I/O APICs: APIC ID Version State Address 2 0x11 usable 0xfec00000 -- I/O Ints: Type Polarity Trigger Bus ID IRQ APIC ID PIN# ExtINT conforms conforms 1 0 2 0 INT conforms conforms 1 1 2 1 INT conforms conforms 1 0 2 2 INT conforms conforms 1 3 2 3 INT conforms conforms 1 4 2 4 INT conforms conforms 1 5 2 5 INT conforms conforms 1 6 2 6 INT conforms conforms 1 7 2 7 INT conforms conforms 1 8 2 8 INT conforms conforms 1 9 2 9 INT conforms conforms 1 10 2 10 INT conforms conforms 1 11 2 11 INT conforms conforms 1 12 2 12 INT conforms conforms 1 13 2 13 INT conforms conforms 1 14 2 14 INT conforms conforms 1 15 2 15 INT active-lo level 0 20:A 2 16 INT active-lo level 0 19:A 2 17 INT active-lo level 0 18:A 2 18 INT active-lo level 0 17:A 2 19 SMI conforms conforms 1 0 2 23 -- Local Ints: Type Polarity Trigger Bus ID IRQ APIC ID PIN# ExtINT conforms conforms 0 0:A 255 0 NMI conforms conforms 0 0:A 255 1 ------------------------------------------------------------------------------- # SMP kernel config file options: # Required: options SMP # Symmetric MultiProcessor Kernel options APIC_IO # Symmetric (APIC) I/O # Optional (built-in defaults will work in most cases): #options NCPU=2 # number of CPUs #options NBUS=2 # number of busses #options NAPIC=1 # number of IO APICs #options NINTR=24 # number of INTs =============================================================================== The machine crashes with SMP or without SMP, but SMP speeds it up greatly. I'm not sure what's going on, because it was up for 15+ days, then suddenly can't stay up for more than 20 minutes... I've changed nothing, and don't really notice any difference in what users are doing on it.. (it's a shell account machine) Kevin