From owner-freebsd-stable Wed Jul 4 7:45:35 2001 Delivered-To: freebsd-stable@freebsd.org Received: from mta02-svc.ntlworld.com (mta02-svc.ntlworld.com [62.253.162.42]) by hub.freebsd.org (Postfix) with ESMTP id 4EE1F37B401; Wed, 4 Jul 2001 07:45:14 -0700 (PDT) (envelope-from james-list@ntlworld.com) Received: from eclipse.bloodflowers.org ([62.253.132.240]) by mta02-svc.ntlworld.com (InterMail vM.4.01.02.27 201-229-119-110) with SMTP id <20010704144511.OIUC294.mta02-svc.ntlworld.com@eclipse.bloodflowers.org>; Wed, 4 Jul 2001 15:45:11 +0100 Date: Wed, 4 Jul 2001 15:45:12 +0100 From: James Greenhalgh To: "Hartmann, O." Cc: freebsd-stable@freebsd.org, freebsd-questions@freebsd.org Subject: Re: HELP! Server crashes since last cvsupdate! Message-Id: <20010704154512.094f0fbd.james-list@ntlworld.com> In-Reply-To: References: <20010704041755.583fb511.james-list@ntlworld.com> X-Mailer: Sylpheed version 0.4.66 (GTK+ 1.2.10; i386--freebsd4.3) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Yes - mine seems to be behaving now with SMP support disabled. Fortunately I've got a full system disk clone from a few weeks ago so I'll downgrade when I get the opportunity to do so - so I guess the current 4.x stable should be considered to be non-stable on (some?) SMP machines. james On Wed, 4 Jul 2001 10:46:29 +0200 (CEST) "Hartmann, O." wrote: > > As described below, I suspected the Linuxator (due the linproc changes > last week). But eleminating all did not solve the problem. > > I switched all systems yesterday morning to UP kernel - and they > are still up right now. A third SMP machine is up since 17 days with > the old sources, also SCSI and a lot of similar configs like the other both > crashing SMP systems. > > Has anyone tried Tor's patch? I can not elaborate this before weekend, > because all SMP systems are heavily in use until Friday. > > > :>On Mon, 2 Jul 2001 10:54:43 +0200 (CEST) > :>"Hartmann, O." wrote: > :> > :>> Hello. > :>> > :>> Since our last update Friday, 29th June, both SMP machines run > :>> into a "stuck" condition after a while. This happened now two times > :>> and I do not know what happens. > :>> > :>> The machines do not respond on console inputs, keyboards seem > :>> to be dead. Over network, accessing the machines via ssh or telnet > :>> results in a short message, then they hung after typing passowrd (ssh) or > :>> trying to connect (telnet). Both machines have NFS running. > :>> A third server attached to the same network node (behind a switch), an UP > :>> machine, is still alive (acts also as a NFS server) and fully accessible. > :>> > :>> What's up with FreeBSD 4.2-STABLE? Has something important been changed in > :>networking?Oliver > :> > :>Perhaps it's something else. My desktop machine (rebuilt yesterday and again > :>tonight) is starting to hang in general use almost randomly. It was solid as > :>a rock before this point, running a 4.3-stable from early last month. I had > :>thought it was linux emu support or DRI causing it, but having eliminated both > :>of those it proceeded to do it again in console mode. My nonupgraded remote > :>server, similar hardware config but left at a month old setup is still happy. > :>Again an SMP machine, here's the dmesg: > :> > :>------- > :> > :>Copyright (c) 1992-2001 The FreeBSD Project. > :>Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > :> The Regents of the University of California. All rights reserved. > :>FreeBSD 4.3-STABLE #0: Wed Jul 4 03:16:56 BST 2001 > :> root@eclipse.bloodflowers.org:/usr/obj/usr/src/sys/ECLIPSE > :>Timecounter "i8254" frequency 1193182 Hz > :>CPU: Pentium II/Pentium II Xeon/Celeron (397.95-MHz 686-class CPU) > :> Origin = "GenuineIntel" Id = 0x652 Stepping = 2 > :> > :>Features=0x183fbff > :>real memory = 268369920 (262080K bytes) > :>config> di pcic0 > :>No such device: pcic0 > :>Invalid command or syntax. Type `?' for help. > :>config> di sn0 > :>No such device: sn0 > :>Invalid command or syntax. Type `?' for help. > :>config> di lnc0 > :>No such device: lnc0 > :>Invalid command or syntax. Type `?' for help. > :>config> di ie0 > :>No such device: ie0 > :>Invalid command or syntax. Type `?' for help. > :>config> di fe0 > :>No such device: fe0 > :>Invalid command or syntax. Type `?' for help. > :>config> di cs0 > :>No such device: cs0 > :>Invalid command or syntax. Type `?' for help. > :>config> di bt0 > :>No such device: bt0 > :>Invalid command or syntax. Type `?' for help. > :>config> di aic0 > :>No such device: aic0 > :>Invalid command or syntax. Type `?' for help. > :>config> di aha0 > :>No such device: aha0 > :>Invalid command or syntax. Type `?' for help. > :>config> di adv0 > :>No such device: adv0 > :>Invalid command or syntax. Type `?' for help. > :>config> en ed0 > :>config> po ed0 0x300 > :>config> ir ed0 10 > :>config> iom ed0 0xd8000 > :>config> f ed0 0 > :>config> q > :>avail memory = 257310720 (251280K bytes) > :>Programming 24 pins in IOAPIC #0 > :>IOAPIC #0 intpin 2 -> irq 0 > :>IOAPIC #0 intpin 16 -> irq 11 > :>IOAPIC #0 intpin 17 -> irq 3 > :>IOAPIC #0 intpin 18 -> irq 4 > :>IOAPIC #0 intpin 19 -> irq 7 > :>FreeBSD/SMP: Multiprocessor motherboard > :> cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000 > :> cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000 > :> io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000 > :>Preloaded elf kernel "kernel" at 0xc03ec000. > :>Preloaded userconfig_script "/boot/kernel.conf" at 0xc03ec09c. > :>Preloaded elf module "agp.ko" at 0xc03ec0ec. > :>ccd0-3: Concatenated disk drivers > :>VESA: v3.0, 16384k memory, flags:0x1, mode table:0xc036c977 (1000117) > :>VESA: 3dfx Interactive, Inc. > :>Pentium Pro MTRR support enabled > :>md0: Malloc disk > :>npx0: on motherboard > :>npx0: INT 16 interface > :>pcib0: on motherboard > :>pci0: on pcib0 > :>agp0: mem 0xe0000000-0xe3ffffff at > :>device 0.0 on pci0 > :>pcib1: at device 1.0 on pci0 > :>pci1: on pcib1 > :>pci1: <3Dfx Voodoo 3 graphics accelerator> at 0.0 irq 11 > :>isab0: at device 7.0 on pci0 > :>isa0: on isab0 > :>atapci0: port 0xf000-0xf00f at device 7.1 on > :>pci0 > :>ata0: at 0x1f0 irq 14 on atapci0 > :>ata1: at 0x170 irq 15 on atapci0 > :>uhci0: port 0xd000-0xd01f irq 7 at > :>device 7.2 on pci0 > :>usb0: on uhci0 > :>usb0: USB revision 1.0 > :>uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > :>uhub0: 2 ports with 2 removable, self powered > :>intpm0: port 0x5000-0x500f irq 9 > :>at device 7.3 on pci0 > :>intpm0: I/O mapped 5000 > :>intpm0: intr IRQ 9 enabled revision 0 > :>smbus0: on intsmb0 > :>intpm0: PM I/O mapped 4000 > :>amr0: port 0xd400-0xd47f irq 3 at device 9.0 on pci0 > :>amr0: Firmware C.80, BIOS 1.51, 4MB RAM > :>fxp0: port 0xd800-0xd83f mem > :>0xed000000-0xed01ffff,0xed023000-0xed023fff irq 4 at device 10.0 on pci0 > :>fxp0: Ethernet address 00:02:b3:2e:57:18 > :>inphy0: on miibus0 > :>inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto > :>bktr0: mem 0xed020000-0xed020fff irq 7 at device 11.0 on pci0 > :>iicbb0: on bti2c0 > :>iicbus0: on iicbb0 master-only > :>smbus1: on bti2c0 > :>bktr0: Hauppauge Model 44354 A321 > :>bktr0: Detected a MSP3415D-B3 at 0x80 > :>bktr0: Hauppauge WinCast/TV, Philips FR1216 PAL FM tuner, msp3400c stereo, > :>remote control. > :>pci0: (vendor=0x109e, dev=0x0878) at 11.1 irq 7 > :>ahc0: port 0xdc00-0xdcff mem > :>0xed022000-0xed022fff irq 11 at device 12.0 on pci0 > :>aic7895C: Wide Channel A, SCSI Id=7, 32/255 SCBs > :>ahc1: port 0xe000-0xe0ff mem > :>0xed024000-0xed024fff irq 11 at device 12.1 on pci0 > :>ahc1: Using left over BIOS settings > :>aic7895C: Wide Channel B, SCSI Id=7, 32/255 SCBs > :>isa0: too many dependant configs (8) > :>isa0: unexpected small tag 14 > :>orm0: