From owner-freebsd-scsi Tue May 28 15:14:58 1996 Return-Path: owner-freebsd-scsi Received: (from root@localhost) by freefall.freebsd.org (8.7.5/8.7.3) id PAA07407 for freebsd-scsi-outgoing; Tue, 28 May 1996 15:14:58 -0700 (PDT) Received: from who.cdrom.com (who.cdrom.com [204.216.27.3]) by freefall.freebsd.org (8.7.5/8.7.3) with SMTP id PAA07303; Tue, 28 May 1996 15:14:34 -0700 (PDT) Received: from uu.elvisti.kiev.ua ([193.125.28.132]) by who.cdrom.com (8.6.12/8.6.11) with ESMTP id JAA06345 ; Tue, 28 May 1996 09:48:54 -0700 Received: from office.elvisti.kiev.ua (office.elvisti.kiev.ua [193.125.28.129]) by uu.elvisti.kiev.ua (8.7.5/8.7.3) with ESMTP id TAA15426; Tue, 28 May 1996 19:49:18 +0300 (EET DST) Received: (from stesin@localhost) by office.elvisti.kiev.ua (8.6.12/8.ElVisti) id TAA07502; Tue, 28 May 1996 19:49:11 +0300 From: "Andrew V. Stesin" Message-Id: <199605281649.TAA07502@office.elvisti.kiev.ua> Subject: Mysterious crashes of FreeBSD gateway -- caugh it(?) To: questions@freebsd.org, scsi@freebsd.org Date: Tue, 28 May 1996 19:49:10 +0300 (EET DST) X-Mailer: ELM [version 2.4 PL24alpha5] Content-Type: text Sender: owner-freebsd-scsi@freebsd.org X-Loop: FreeBSD.org Precedence: bulk Hi people, I got some problems, and maybe someone can comment? A machine, our recently built firewall gateway to Internet, is: ATC-1425B mainboard, PCI, SiS 496/7 chipset; 16Mb RAM; AMD 5x133 CPU; NCR 53c810 SCSI; 1Gb Conner CFP1060S drive (recent, good one); two modems on the onboard COMs (SLIP lines to the world); 1 Ethernet card. OS: FreeBSD-stable as of late March. Add-ons: IPfilter 3.0.3+ (by Darren Reed) as in-kernel IP filtering facility, Squid 1.0beta7 WWW proxy cache daemon. The machine was experiencing spontaneous reboots from time to time. Either silent reboots, or prefaced with messages from NCR driver (like "NCR dead?"). When a Compex ReadyLink (DEC 21041-based) PCI ethernet was replaced by a random NE2000, the trouble almost gone -- the box was up for some days, just Ok. Today we were able to inspire a repeatable reboot with a misconfigured test machine on the same network. It has a gateway machine as a default router, and no route to another subnet, connected via FreeBSD-1.1.5-based host, was in the tables. So each packed from the test machine, destined to those subnet, was going to gateway first, then forwarded to 1.1.5 box, and ICMP redirect was sent to the test box about this from the gateway. When doing a massive TCP transfers to the 1.1.5-connected subnet, or even ping -f, a high network load was inspired on a gateway machine (receive a packet -- forvard it -- send redirect). NE2000 worked fine, but NCR driver started to through messages about I/O errors, "NCR dead", etc. Then gateway rebooted itself. What I want to ask. What might be the source of that trouble? Poor motherboard quality, when an overloaded (?! is it an overload?) either ISA or PCI bus forces NCR to go asleep? Or is it a bug in TCP/IP stack or IPfilter, or their interraction with NCR driver, tickled with the nessesity to process IP/ICMP packets at a very high rate? I strongly suspect a hardware problem, but maybe there are other opinions? -- With best regards -- Andrew Stesin. +380 (44) 2760188 +380 (44) 2713457 +380 (44) 2713560 "You may delegate authority, but not responsibility." Frank's Management Rule #1.