From owner-freebsd-stable@FreeBSD.ORG Thu Nov 1 11:58:52 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1781216A418 for ; Thu, 1 Nov 2007 11:58:52 +0000 (UTC) (envelope-from d.komaleev@konliga.ru) Received: from mail.konliga.ru (mail.konliga.ru [195.16.56.98]) by mx1.freebsd.org (Postfix) with ESMTP id 6D24813C4AC for ; Thu, 1 Nov 2007 11:58:50 +0000 (UTC) (envelope-from d.komaleev@konliga.ru) Received: from exch01.konliga.ru ([192.168.1.252]) by mail.konliga.ru with Microsoft SMTPSVC(6.0.3790.0); Thu, 1 Nov 2007 14:56:34 +0300 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="koi8-r" Content-Transfer-Encoding: quoted-printable X-MimeOLE: Produced By Microsoft Exchange V6.5 Date: Thu, 1 Nov 2007 14:56:53 +0300 Message-ID: <2335ED0A1B2A294FACC6EB01EF0965F72EE843@exch01.konliga.ru> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: System hangs up every day Thread-Index: AcgcLkNsaTQuc8SlRjuiDzkcI16UvwATGWzQ From: =?koi8-r?B?5M3J1NLJyiDrz83BzMXF1w==?= To: X-OriginalArrivalTime: 01 Nov 2007 11:56:34.0298 (UTC) FILETIME=[3FDD3DA0:01C81C7E] Subject: RE: System hangs up every day X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 01 Nov 2007 11:58:52 -0000 >=20 > A system failure of this sort (one which leaves no log entries of any > kind) is generally a hardware fault; memory stick failures tend to > cause kernel panics and easy repeatability. >=20 > I would suggest examining the hardware components, the motherboard > could have some faulty capacitors (burst, leaking, or swollen); the > fans on the processors could be failing causing a lockup, the power > supply fans could be failing causing an undervolt and lockup, but this > usually makes the system reset. >=20 > You get the idea, your symptoms are pointing to hardware=20 > issues in my opinion. I have written already, that I tried to plug a system HDD to another = sever with the same configuration; on the new platform the system = hanging didn't stop. The RAID controller remained the same, but it has = the own error log and it is clear. >=20 > On 10/31/07, =E4=CD=C9=D4=D2=C9=CA =EB=CF=CD=C1=CC=C5=C5=D7 = wrote: > > Hello everybody > > > > I have a big problem > > > > There is one FreeBSD server in our company. The server=20 > platform is: Supermicro SuperServer 6014V-T2B (2x Intel Xeon=20 > 2.8, 1Gb RAM, 3WARE 3W-8006-2LP RAID-Controller). > > The server works as: > > - a gateway between LAN and Internet > > - an Intranet web- and database server (Apache + MySQL + PHP) > > - a firewall (OpenBSD pf) > > - a transparent proxy server (Squid) > > A mounthly traffic through this server is about 100Gb.=20 > There is about 200 internet users in our conpany. > > Here is a part of my dmesg-listing: > > > > Copyright (c) 1992-2007 The FreeBSD Project. > > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991,=20 > 1992, 1993, 1994 > > The Regents of the University of California. All=20 > rights reserved. > > FreeBSD is a registered trademark of The FreeBSD Foundation. > > FreeBSD 6.2-RELEASE-p8 #2: Thu Oct 11 19:51:25 MSD 2007 > > sa@gateway.konliga.ru:/usr/obj/usr/src/sys/KERNEL01_NOSMP > > module_register: module pci/em already exists! > > Module pci/em failed to register: 17 > > ACPI APIC Table: > > Timecounter "i8254" frequency 1193182 Hz quality 0 > > CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2800.12-MHz 686-class CPU) > > Origin =3D "GenuineIntel" Id =3D 0xf43 Stepping =3D 3 > > =20 > Features=3D0xbfebfbff = SS,HTT,TM,PBE> > > Features2=3D0x641d> > > AMD Features=3D0x20000000 > > Logical CPUs per core: 2 > > real memory =3D 1073479680 (1023 MB) > > avail memory =3D 1041465344 (993 MB) > > ioapic0 irqs 0-23 on motherboard > > ioapic1 irqs 24-47 on motherboard > > ichwd module loaded > > kbd1 at kbdmux0 > > ath_hal: 0.9.17.2 (AR5210, AR5211, AR5212, RF5111, RF5112,=20 > RF2413, RF5413) > > acpi0: on motherboard > > acpi0: Power Button (fixed) > > Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 > > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0 > > cpu0: on acpi0 > > acpi_throttle0: on cpu0 > > pcib0: port 0xcf8-0xcff on acpi0 > > pci0: on pcib0 > > pcib1: irq 16 at device 2.0 on pci0 > > pci1: on pcib1 > > pcib2: irq 16 at device 3.0 on pci0 > > pci2: on pcib2 > > pcib3: at device 28.0 on pci0 > > pci3: on pcib3 > > twe0: <3ware Storage Controller. Driver version=20 > 1.50.01.002> port 0xbc00-0xbc0f mem=20 > 0xfc9ffc00-0xfc9ffc0f,0xfc000000-0xfc7fffff irq 24 at device=20 > 1.0 on pci3 > > twe0: [GIANT-LOCKED] > > twe0: 2 ports, Firmware FE8S 1.05.00.068, BIOS BE7X 1.08.00.048 > > em0: =20 > port 0xb800-0xb83f mem 0xfc9c0000-0xfc9dffff irq 26 at device=20 > 3.0 on pci3 > > em0: Ethernet address: 00:30:48:58:4d:2a > > em0: [FAST] > > em1: =20 > port 0xb400-0xb43f mem 0xfc9a0000-0xfc9bffff irq 27 at device=20 > 4.0 on pci3 > > em1: Ethernet address: 00:30:48:58:4d:2b > > em1: [FAST] > > uhci0: port 0xe800-0xe81f=20 > irq 16 at device 29.0 on pci0 > > uhci0: [GIANT-LOCKED] > > usb0: on uhci0 > > usb0: USB revision 1.0 > > uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > > uhub0: 2 ports with 2 removable, self powered > > uhci1: port 0xec00-0xec1f=20 > irq 19 at device 29.1 on pci0 > > uhci1: [GIANT-LOCKED] > > usb1: on uhci1 > > usb1: USB revision 1.0 > > uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 > > uhub1: 2 ports with 2 removable, self powered > > pci0: at device 29.4 (no driver attached) > > pci0: at device=20 > 29.5 (no driver attached) > > ehci0: mem=20 > 0xfebffc00-0xfebfffff irq 23 at device 29.7 on pci0 > > ehci0: [GIANT-LOCKED] > > usb2: EHCI version 1.0 > > usb2: companion controllers, 2 ports each: usb0 usb1 > > usb2: on ehci0 > > usb2: USB revision 2.0 > > uhub2: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 > > uhub2: 4 ports with 4 removable, self powered > > pcib4: at device 30.0 on pci0 > > pci4: on pcib4 > > pci4: at device 5.0 (no driver attached) > > isab0: at device 31.0 on pci0 > > isa0: on isab0 > > atapci0: port=20 > 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device=20 > 31.1 on pci0 > > ata0: on atapci0 > > ata1: on atapci0 > > pci0: at device 31.3 (no driver attached) > > acpi_button0: on acpi0 > > acpi_button1: on acpi0 > > sio0: configured irq 4 not in bitmap of probed irqs 0 > > sio0: port may not be enabled > > sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4=20 > flags 0x10 on acpi0 > > sio0: type 16550A > > sio1: configured irq 3 not in bitmap of probed irqs 0 > > sio1: port may not be enabled > > sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0 > > sio1: type 16550A > > fdc0: port=20 > 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 > > fdc0: [FAST] > > fd0: <1440-KB 3.5" drive> on fdc0 drive 0 > > ppc0: port=20 > 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0 > > ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode > > ppc0: FIFO with 16/16/9 bytes threshold > > ppbus0: on ppc0 > > plip0: on ppbus0 > > lpt0: on ppbus0 > > lpt0: Interrupt-driven port > > ppi0: on ppbus0 > > atkbdc0: port 0x60,0x64 irq 1 on acpi0 > > atkbd0: irq 1 on atkbdc0 > > kbd0 at atkbd0 > > atkbd0: [GIANT-LOCKED] > > psm0: irq 12 on atkbdc0 > > psm0: [GIANT-LOCKED] > > psm0: model IntelliMouse, device ID 3 > > ichwd0: on isa0 > > pmtimer0 on isa0 > > orm0: at iomem=20 > 0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9800-0xca7ff,0xca800-0xcb7f > f on isa0 > > sc0: at flags 0x100 on isa0 > > sc0: VGA <16 virtual consoles, flags=3D0x300> > > vga0: at port 0x3c0-0x3df iomem=20 > 0xa0000-0xbffff on isa0 > > Timecounter "TSC" frequency 2800118202 Hz quality 800 > > Timecounters tick every 1.000 msec > > acd0: CDROM at ata0-master UDMA33 > > twed0: on twe0 > > twed0: 152626MB (312579760 sectors) > > Trying to mount root from ufs:/dev/twed0s1a > > ext0: link state changed to UP > > int0: link state changed to UP > > vlan0: link state changed to UP > > > > This server hangs up every day without any messages in the=20 > log files and on the system console. A keyboard dosen't work=20 > too. I can make only hard reset and after restart coredump=20 > files are not appearing. > > Here is my kernel configuration file: > > > > include GENERIC > > ident KERNEL01_NOSMP > > device ichwd # Intel ICH watchdog timer > > #options SMP > > options ALTQ > > options ALTQ_CBQ > > options ALTQ_RED > > options ALTQ_RIO > > options ALTQ_HFSC > > options ALTQ_PRIQ > > #options ALTQ_NOPCC > > options SC_DISABLE_REBOOT > > options MP_WATCHDOG > > options SW_WATCHDOG > > > > If I make and install a kernel with SMP options the system=20 > under working load begins hang up every two hours. > > > > The two days "Memtest" gave no result. > > I tried to install the newest Intel ethernet adapter=20 > driver, but without any results. > > As an experiment I tried also to plug a system HDD to=20 > another sever platform (SuperServer 6015V-TB), but system=20 > hanging didn't stop. > > I think that it is not only hardware problem. > > Linux (Gentoo) and Windows server 2003 on this hardware=20 > were working fine. > > > > Please help me to find a solution and solve a problem. > > > > Your faithfully > > Dmitry Komaleev > > IT Manager > > "EDIPRESSE-KONLIGA" http://www.konliga.ru > > Russia, Moscow > > tel.: +7 (495) 775-14-35, ext. 169 > > fax: +7 (495) 775-14-34 > > > > P.S. I have written the Bug Report on my problem but have=20 > received only one advice to turn off ACPI-option. > > If I disable ACPI, then the RAID-controller and both of the=20 > ethernet controllers on my server recieve the same IRQ. I=20 > believe this is not good. > > _______________________________________________ > > freebsd-stable@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > > To unsubscribe, send any mail to=20 > "freebsd-stable-unsubscribe@freebsd.org" > > >=20