Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 18 Mar 2004 12:56:32 -0600
From:      "Douglas K. Rand" <rand@meridian-enviro.com>
To:        freebsd-hardware@freebsd.org
Subject:   System Freezing
Message-ID:  <87fzc6gf1b.wl@delta.meridian-enviro.com>

next in thread | raw e-mail | index | archive | help
--Multipart_Thu_Mar_18_12:56:32_2004-1
Content-Type: text/plain; charset=US-ASCII

I'm having what is probably a hardware problem on a system that just
hangs every 6-36 hours, and I'm wondering if anybody has any ideas for
things I could try.

Its a RELENG_4_8 system with DDB, DDB_UNATTENDED, and ALT_BREAK_TO_DEBUGGER
kernel options set. (Its on a serial console, thats why the
ALT_BREAK_TO_DEBUGGER option.) Its an Athlon 3200+ on a Gigabyte
GA-7N400-L mobo, with two 512MB PC3200 DDR DIMMs, and a 2 port 3ware
controller and 2 Deskstar 180 GXP disks. The power supply is an Antec
TruePower 380W.

The system ran perfectly for about 60 days, and then started having
this problem. In almost all cases the system will simply hang, there
is no response from the console or network, and the CR ~ ^B sequence
will not get me to the kernel debugger. (I've tested this when the
system is running fine and I do get the kernel debugger.) The only
solution is to reset or power cycle the system.

It has crashed 3 times with a Fatal trap 12: page fault while in
kernel mode panic, and one time it simply rebooted as if someone
pressed the reset button. But it has simply hung 18 times.

I've tried running with only one DIMM, and when the system died 3
times with that DIMM, I tried running with only the other DIMM, and it
still dies.

I've replaced the power supply with an Antec 400W, and the system
still dies. I even replaced the power cord.

I've tried both the stock 4.8 twe driver and 3ware's beta driver, both
still die.

I replaced the onboard NIC with an Intel Etherexpress Pro, and the
system still dies.

I don't think its temperature related, I've run the system with the
case open and on its side, and a continous mbmon output shows no
temperature increases just before the system hangs. (A representative
output from mbmon is:
  Temp.= 75.2, 113.0, 86.0; Rot.= 4821, 2636,    0
  Vcore = 1.70, 2.74; Volt. = 3.31, 4.14, 11.55,  -5.29, -2.05
I've got a ThermalTake Volcano 11+ cooler on the CPU.

I don't think the problems are load related, as it carries very high
loads with out hanging, and I've had it hang with fairly light loads.

I've attached the dmesg and kernel config files. If anybody has any
suggestions I'd be thrilled. I'm up to replacing either the CPU or the
mobo, neither of which I'm looking forward too. 


--Multipart_Thu_Mar_18_12:56:32_2004-1
Content-Type: application/octet-stream
Content-Disposition: attachment; filename="dmesg"
Content-Transfer-Encoding: quoted-printable

Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.8-RELEASE-p16 #6: Wed Mar 17 14:46:41 CST 2004
    rand@snow.meridian-enviro.com:/usr/obj/usr/src/sys/SNOW
Timecounter "i8254"  frequency 1193182 Hz
Timecounter "TSC"  frequency 2191242163 Hz
CPU: AMD Athlon(tm) XP 3200+ (2191.24-MHz 686-class CPU)
  Origin =3D "AuthenticAMD"  Id =3D 0x6a0  Stepping =3D 0
  Features=3D0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE=
,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
  AMD Features=3D0xc0400000<AMIE,DSP,3DNow!>
real memory  =3D 536805376 (524224K bytes)
avail memory =3D 519462912 (507288K bytes)
Preloaded elf kernel "kernel" at 0xc02db000.
Pentium Pro MTRR support enabled
Using $PIR table, 11 entries at 0xc00fcda0
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Host to PCI bridge> on motherboard
pci0: <PCI bus> on pcib0
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01eb) at 0.1
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ee) at 0.2
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ed) at 0.3
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ec) at 0.4
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x01ef) at 0.5
isab0: <PCI to ISA bridge (vendor=3D10de device=3D0060)> at device 1.0 on p=
ci0
isa0: <ISA bus> on isab0
pci0: <unknown card> (vendor=3D0x10de, dev=3D0x0064) at 1.1 irq 11
pcib1: <PCI to PCI bridge (vendor=3D10de device=3D006c)> at device 8.0 on p=
ci0
pci1: <PCI bus> on pcib1
pci1: <3Dfx Voodoo 3 graphics accelerator> at 6.0 irq 12
fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0xd400-0xd43f mem 0xe7800000-0=
xe781ffff,0xe7821000-0xe7821fff irq 10 at device 7.0 on pci1
fxp0: Ethernet address 00:02:b3:e7:ab:6e
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
twe0: <3ware Storage Controller> port 0xd800-0xd80f mem 0xe7000000-0xe77fff=
ff,0xe7820000-0xe782000f irq 11 at device 9.0 on pci1
twe0: 2 ports, Firmware FE7X 1.05.00.050, BIOS BE7X 1.08.00.046
atapci0: <Generic PCI ATA controller> port 0xf000-0xf00f at device 9.0 on p=
ci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pcib2: <PCI to PCI bridge (vendor=3D10de device=3D01e8)> at device 30.0 on =
pci0
pci2: <PCI bus> on pcib2
orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc97ff,0xca000-0xcaff=
f on isa0
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=3D0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
acd0: CDROM <TEAC CD-552E> at ata0-slave PIO4
twed0: <TwinStor, Normal> on twe0
twed0: 176699MB (361880032 sectors)
twe0: command interrupt
Mounting root from ufs:/dev/twed0s1a
WARNING: / was not properly dismounted

--Multipart_Thu_Mar_18_12:56:32_2004-1
Content-Type: text/plain; charset=US-ASCII



--Multipart_Thu_Mar_18_12:56:32_2004-1
Content-Type: application/octet-stream
Content-Disposition: attachment; filename="SNOW"
Content-Transfer-Encoding: quoted-printable

machine		i386
cpu		I686_CPU
ident		SNOW
maxusers	0

options 	INET
options 	FFS
options 	FFS_ROOT
options 	SOFTUPDATES
options 	UFS_DIRHASH
options 	NFS
options 	COMPAT_43
options 	INCLUDE_CONFIG_FILE
options 	ICMP_BANDLIM
options 	MAXDSIZ=3D"(1024*1024*1024)"
options 	DFLDSIZ=3D"(1024*1024*1024)"
options 	DDB
options 	DDB_UNATTENDED
options 	ALT_BREAK_TO_DEBUGGER

device		isa
device		pci

device		fdc0	at isa? port IO_FD1 irq 6 drq 2
device		fd0	at fdc0 drive 0

device		ata
device		atadisk
device		atapicd

device		twe

device		atkbdc0	at isa? port IO_KBD
device		atkbd0	at atkbdc? irq 1 flags 0x1
device		vga0	at isa?
device		sc0	at isa? flags 0x100

device		npx0	at nexus? port IO_NPX irq 13

device		sio0	at isa? port IO_COM1 flags 0x10 irq 4
device		sio1	at isa? port IO_COM2 irq 3

device		miibus
device		fxp
device		rl

pseudo-device	loop
pseudo-device	ether
pseudo-device	pty

--Multipart_Thu_Mar_18_12:56:32_2004-1
Content-Type: text/plain; charset=US-ASCII




--Multipart_Thu_Mar_18_12:56:32_2004-1--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?87fzc6gf1b.wl>