Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 10 Dec 1997 11:42:26 -0600
From:      Karl Denninger  <karl@mcs.net>
To:        Kevin Day <toasty@home.dragondata.com>
Cc:        "John S. Dyson" <toor@dyson.iquest.net>, freebsd-current@freebsd.org
Subject:   Re: 971208 and the nfs rollback didn't fix it...
Message-ID:  <19971210114226.23033@mcs.net>
In-Reply-To: <199712101724.LAA19545@home.dragondata.com>; from Kevin Day on Wed, Dec 10, 1997 at 11:24:09AM -0600
References:  <199712101626.LAA00918@dyson.iquest.net> <199712101724.LAA19545@home.dragondata.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Do you have an SMC card around there?

Do this:

1)	If you can, replace the 509B with an SMC Etherpower 10/100.  That's
	a decent card, and is what we're using here without trouble.

2)	Back out nfs_bio.c to V1.41 with "cvs update -r1.41 nfs_bio.c" in
	the nfs directory of your kernel area.

Rebuild, and try it again.

If it STILL blows up, get rid of the SMP support and see if THAT is stable.
That ought to be, because that's what I'm running here and that *IS* stable.

--
-- 
Karl Denninger (karl@MCS.Net)| MCSNet - Serving Chicagoland and Wisconsin
http://www.mcs.net/          | T1's from $600 monthly to FULL DS-3 Service
			     | NEW! K56Flex support on ALL modems
Voice: [+1 312 803-MCS1 x219]| EXCLUSIVE NEW FEATURE ON ALL PERSONAL ACCOUNTS
Fax:   [+1 312 803-4929]     | *SPAMBLOCK* Technology now included at no cost

On Wed, Dec 10, 1997 at 11:24:09AM -0600, Kevin Day wrote:
> > > > 
> > > > Kevin
> > > 
> > > Post a "dmesg" with the boot log in it.  What ethernet card are you using?
> > > 
> > I am not an NFS "expert", so you guys tell me exactly what the symptoms are,
> > and I will TRY to help work out the problems.
> > 
> > -- 
> > John
> > dyson@freebsd.org
> > jdyson@nc.com
> > 
> 
> I'm using a 3C509B... They've worked great, up until we started using NFS...
> One machine running 2.2 has been up more than 200 days, with a 509B, with no
> indication of a memory leak, with heavy net traffic..
> 
> Dec 10 05:03:11 shell /kernel: Copyright (c) 1992-1997 FreeBSD Inc.
> Dec 10 05:03:11 shell /kernel: Copyright (c) 1982, 1986, 1989, 1991, 1993
> Dec 10 05:03:11 shell /kernel: 	The Regents of the University of California. All rights reserved.
> Dec 10 05:03:11 shell /kernel: FreeBSD 3.0-CURRENT #0: Mon Nov 24 19:26:11 CST 1997
> Dec 10 05:03:11 shell /kernel:     toasty@shell.dragondata.com:/usr/src/sys/compile/SHELL
> Dec 10 05:03:11 shell /kernel: CPU: Pentium (586-class CPU)
> Dec 10 05:03:11 shell /kernel:   Origin = "GenuineIntel"  Id = 0x52c  Stepping=12
> Dec 10 05:03:11 shell /kernel:   Features=0x3bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC>
> Dec 10 05:03:11 shell /kernel: real memory  = 83886080 (81920K bytes)
> Dec 10 05:03:11 shell /kernel: avail memory = 78573568 (76732K bytes)
> Dec 10 05:03:11 shell /kernel: FreeBSD/SMP: Multiprocessor motherboard
> Dec 10 05:03:11 shell /kernel:  cpu0 (BSP): apic id:  0, version: 0x00030010, at 0xfee00000
> Dec 10 05:03:11 shell /kernel:  cpu1 (AP):  apic id:  1, version: 0x00030010, at 0xfee00000
> Dec 10 05:03:11 shell /kernel:  io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec00000
> Dec 10 05:03:11 shell /kernel: Probing for devices on PCI bus 0:
> Dec 10 05:03:12 shell /kernel: chip0: <Intel 82439> rev 0x03 on pci0.0.0
> Dec 10 05:03:12 shell /kernel: chip1: <Intel 82371SB PCI to ISA bridge> rev 0x01 on pci0.7.0
> Dec 10 05:03:12 shell /kernel: ide_pci0: <Intel PIIX3 Bus-master IDE controller> rev 0x00 on pci0.7.1
> Dec 10 05:03:12 shell /kernel: Probing for devices on the ISA bus:
> Dec 10 05:03:12 shell /kernel: sc0 at 0x60-0x6f irq 1 on motherboard
> Dec 10 05:03:12 shell /kernel: sc0: VGA color <16 virtual consoles, flags=0x0>
> Dec 10 05:03:12 shell /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa
> Dec 10 05:03:12 shell /kernel: sio0: type 16550A
> Dec 10 05:03:12 shell /kernel: sio1 at 0x2f8-0x2ff irq 3 on isa
> Dec 10 05:03:12 shell /kernel: sio1: type 16550A
> Dec 10 05:03:12 shell /kernel: lpt0 at 0x378-0x37f irq 7 on isa
> Dec 10 05:03:12 shell /kernel: lpt0: Interrupt-driven port
> Dec 10 05:03:12 shell /kernel: lp0: TCP/IP capable interface
> Dec 10 05:03:13 shell /kernel: lpt1 not found
> Dec 10 05:03:13 shell /kernel: mse0 not found at 0x23c
> Dec 10 05:03:13 shell /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
> Dec 10 05:03:13 shell /kernel: wdc0 at 0x1f0-0x1f7 irq 14 on isa
> Dec 10 05:03:13 shell /kernel: wdc0: unit 0 (wd0): <QUANTUM FIREBALL_TM2110A>
> Dec 10 05:03:13 shell /kernel: wd0: 2014MB (4124736 sectors), 4092 cyls, 16 heads, 63 S/T, 512 B/S
> Dec 10 05:03:13 shell /kernel: wdc1 not found at 0x170
> Dec 10 05:03:13 shell /kernel: 1 3C5x9 board(s) on ISA found at 0x300
> Dec 10 05:03:13 shell /kernel: ep0 at 0x300-0x30f irq 10 on isa
> Dec 10 05:03:13 shell /kernel: ep0: utp[*UTP*] address 00:60:97:6b:88:f4
> Dec 10 05:03:13 shell /kernel: npx0 on motherboard
> Dec 10 05:03:13 shell /kernel: npx0: INT 16 interface
> Dec 10 05:03:13 shell /kernel: APIC_IO: routing 8254 via 8259 on pin 0
> Dec 10 05:03:14 shell /kernel: SMP: AP CPU #1 Launched!
> Dec 10 05:03:14 shell /kernel: WARNING: / was not properly dismounted.
> Dec 10 05:03:14 shell named[80]: starting.  named 4.9.6-REL Sat Nov  8 08:44:53 GMT 1997 	root@make.ican.net:/usr/obj/usr/src/usr.sbin/named
> Dec 10 05:03:14 shell named[81]: Ready to answer queries. 
> Dec 10 05:23:10 shell /kernel: Copyright (c) 1992-1997 FreeBSD Inc.
> Dec 10 05:23:10 shell /kernel: Copyright (c) 1982, 1986, 1989, 1991, 1993
> Dec 10 05:23:10 shell /kernel: 	The Regents of the University of California. All rights reserved.
> Dec 10 05:23:10 shell /kernel: FreeBSD 3.0-CURRENT #0: Mon Nov 24 19:26:11 CST 1997
> Dec 10 05:23:10 shell /kernel:     toasty@shell.dragondata.com:/usr/src/sys/compile/SHELL
> Dec 10 05:23:10 shell /kernel: CPU: Pentium (586-class CPU)
> Dec 10 05:23:10 shell /kernel:   Origin = "GenuineIntel"  Id = 0x52c  Stepping=12
> Dec 10 05:23:10 shell /kernel:   Features=0x3bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC>
> Dec 10 05:23:11 shell /kernel: real memory  = 83886080 (81920K bytes)
> Dec 10 05:23:11 shell /kernel: avail memory = 78573568 (76732K bytes)
> Dec 10 05:23:11 shell /kernel: FreeBSD/SMP: Multiprocessor motherboard
> Dec 10 05:23:11 shell /kernel:  cpu0 (BSP): apic id:  0, version: 0x00030010, at 0xfee00000
> Dec 10 05:23:11 shell /kernel:  cpu1 (AP):  apic id:  1, version: 0x00030010, at 0xfee00000
> Dec 10 05:23:11 shell /kernel:  io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec00000
> Dec 10 05:23:11 shell /kernel: Probing for devices on PCI bus 0:
> Dec 10 05:23:11 shell /kernel: chip0: <Intel 82439> rev 0x03 on pci0.0.0
> Dec 10 05:23:11 shell /kernel: chip1: <Intel 82371SB PCI to ISA bridge> rev 0x01 on pci0.7.0
> Dec 10 05:23:11 shell /kernel: ide_pci0: <Intel PIIX3 Bus-master IDE controller> rev 0x00 on pci0.7.1
> Dec 10 05:23:11 shell /kernel: Probing for devices on the ISA bus:
> Dec 10 05:23:11 shell /kernel: sc0 at 0x60-0x6f irq 1 on motherboard
> Dec 10 05:23:11 shell /kernel: sc0: VGA color <16 virtual consoles, flags=0x0>
> Dec 10 05:23:11 shell /kernel: sio0 at 0x3f8-0x3ff irq 4 on isa
> Dec 10 05:23:12 shell /kernel: sio0: type 16550A
> Dec 10 05:23:12 shell /kernel: sio1 at 0x2f8-0x2ff irq 3 on isa
> Dec 10 05:23:12 shell /kernel: sio1: type 16550A
> Dec 10 05:23:12 shell /kernel: lpt0 at 0x378-0x37f irq 7 on isa
> Dec 10 05:23:12 shell /kernel: lpt0: Interrupt-driven port
> Dec 10 05:23:12 shell /kernel: lp0: TCP/IP capable interface
> Dec 10 05:23:12 shell /kernel: lpt1 not found
> Dec 10 05:23:12 shell /kernel: mse0 not found at 0x23c
> Dec 10 05:23:12 shell /kernel: fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
> Dec 10 05:23:12 shell /kernel: wdc0 at 0x1f0-0x1f7 irq 14 on isa
> Dec 10 05:23:12 shell /kernel: wdc0: unit 0 (wd0): <QUANTUM FIREBALL_TM2110A>
> Dec 10 05:23:12 shell /kernel: wd0: 2014MB (4124736 sectors), 4092 cyls, 16 heads, 63 S/T, 512 B/S
> Dec 10 05:23:13 shell /kernel: wdc1 not found at 0x170
> Dec 10 05:23:13 shell /kernel: 1 3C5x9 board(s) on ISA found at 0x300
> Dec 10 05:23:13 shell /kernel: ep0 at 0x300-0x30f irq 10 on isa
> Dec 10 05:23:13 shell /kernel: ep0: utp[*UTP*] address 00:60:97:6b:88:f4
> Dec 10 05:23:13 shell /kernel: npx0 on motherboard
> Dec 10 05:23:13 shell /kernel: npx0: INT 16 interface
> Dec 10 05:23:13 shell /kernel: APIC_IO: routing 8254 via 8259 on pin 0
> Dec 10 05:23:13 shell /kernel: SMP: AP CPU #1 Launched!
> Dec 10 05:23:13 shell /kernel: WARNING: / was not properly dismounted.
> 
> (when it probes for wdc1, it sits there no less than 10 minutes. I really
> thought it was hung the first few times it did it... is that normal?)
> 
> It leaves no log as to why it crashes, and the screen is just full of:
> 
> I'm on CPU #1, waiting to be on CPU #0... (or whatever the message is)
> 
> It scrolls so quickly I miss the panic message.
> 
> (another idea: after a kernel panic, unblank the screen, and disable the
> screen saver.... the daemon screensaver will keep going, even after a nasty
> panic.... I walk by the machine, see chuck running around on there, and
> assume everything's ok)
> 
> 
> 
> Here's a copy of the output from mptable:
> 
> 
> 
> 
> ===============================================================================
> 
> MPTable, version 2.0.15
> 
> -------------------------------------------------------------------------------
> 
> MP Floating Pointer Structure:
> 
>   location:			BIOS
>   physical address:		0x000f5c50
>   signature:			'_MP_'
>   length:			16 bytes
>   version:			1.1
>   checksum:			0xd4
>   mode:				Virtual Wire
> 
> -------------------------------------------------------------------------------
> 
> MP Config Table Header:
> 
>   physical address:		0x000f5c64
>   signature:			'PCMP'
>   base table length:		396
>   version:			1.1
>   checksum:			0x3a
>   OEM ID:			'OEM00000'
>   Product ID:			'PROD00000000'
>   OEM table pointer:		0x00000000
>   OEM table size:		0
>   entry count:			28
>   local APIC address:		0xfee00000
>   extended table length:	0
>   extended table checksum:	0
> 
> -------------------------------------------------------------------------------
> 
> MP Config Base Table Entries:
> 
> --
> Processors:	APIC ID	Version	State		Family	Model	Step	Flags
> 		 0	 0x11	 BSP, usable	 5	 2	 1	 0x07bf
> 		 1	 0x11	 AP, usable	 5	 2	 1	 0x07bf
> --
> Bus:		Bus ID	Type
> 		 0	 PCI   
> 		 1	 ISA   
> --
> I/O APICs:	APIC ID	Version	State		Address
> 		 2	 0x11	 usable		 0xfec00000
> --
> I/O Ints:	Type	Polarity    Trigger	Bus ID	 IRQ	APIC ID	PIN#
> 		ExtINT	 conforms    conforms	     1	   0	      2	   0
> 		INT	 conforms    conforms	     1	   1	      2	   1
> 		INT	 conforms    conforms	     1	   0	      2	   2
> 		INT	 conforms    conforms	     1	   3	      2	   3
> 		INT	 conforms    conforms	     1	   4	      2	   4
> 		INT	 conforms    conforms	     1	   5	      2	   5
> 		INT	 conforms    conforms	     1	   6	      2	   6
> 		INT	 conforms    conforms	     1	   7	      2	   7
> 		INT	 conforms    conforms	     1	   8	      2	   8
> 		INT	 conforms    conforms	     1	   9	      2	   9
> 		INT	 conforms    conforms	     1	  10	      2	  10
> 		INT	 conforms    conforms	     1	  11	      2	  11
> 		INT	 conforms    conforms	     1	  12	      2	  12
> 		INT	 conforms    conforms	     1	  13	      2	  13
> 		INT	 conforms    conforms	     1	  14	      2	  14
> 		INT	 conforms    conforms	     1	  15	      2	  15
> 		INT	active-lo       level	     0	20:A	      2	  16
> 		INT	active-lo       level	     0	19:A	      2	  17
> 		INT	active-lo       level	     0	18:A	      2	  18
> 		INT	active-lo       level	     0	17:A	      2	  19
> 		SMI	 conforms    conforms	     1	   0	      2	  23
> --
> Local Ints:	Type	Polarity    Trigger	Bus ID	 IRQ	APIC ID	PIN#
> 		ExtINT	 conforms    conforms	     0	 0:A	    255	   0
> 		NMI	 conforms    conforms	     0	 0:A	    255	   1
> 
> -------------------------------------------------------------------------------
> 
> # SMP kernel config file options:
> 
> 
> # Required:
> options		SMP			# Symmetric MultiProcessor Kernel
> options		APIC_IO			# Symmetric (APIC) I/O
> 
> # Optional (built-in defaults will work in most cases):
> #options		NCPU=2			# number of CPUs
> #options		NBUS=2			# number of busses
> #options		NAPIC=1			# number of IO APICs
> #options		NINTR=24		# number of INTs
> 
> ===============================================================================
> 
> 
> 
> The machine crashes with SMP or without SMP, but SMP speeds it up greatly.
> I'm not sure what's going on, because it was up for 15+ days, then suddenly
> can't stay up for more than 20 minutes... I've changed nothing, and don't
> really notice any difference in what users are doing on it.. (it's a shell
> account machine)
> 
> Kevin



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?19971210114226.23033>