From owner-freebsd-scsi  Tue May 28 15:14:58 1996
Return-Path: owner-freebsd-scsi
Received: (from root@localhost)
          by freefall.freebsd.org (8.7.5/8.7.3) id PAA07407
          for freebsd-scsi-outgoing; Tue, 28 May 1996 15:14:58 -0700 (PDT)
Received: from who.cdrom.com (who.cdrom.com [204.216.27.3])
          by freefall.freebsd.org (8.7.5/8.7.3) with SMTP id PAA07303;
          Tue, 28 May 1996 15:14:34 -0700 (PDT)
Received: from uu.elvisti.kiev.ua ([193.125.28.132])
          by who.cdrom.com (8.6.12/8.6.11) with ESMTP id JAA06345
          ; Tue, 28 May 1996 09:48:54 -0700
Received: from office.elvisti.kiev.ua (office.elvisti.kiev.ua [193.125.28.129]) by uu.elvisti.kiev.ua (8.7.5/8.7.3) with ESMTP id TAA15426; Tue, 28 May 1996 19:49:18 +0300 (EET DST)
Received: (from stesin@localhost) by office.elvisti.kiev.ua (8.6.12/8.ElVisti) id TAA07502; Tue, 28 May 1996 19:49:11 +0300
From: "Andrew V. Stesin" <stesin@elvisti.kiev.ua>
Message-Id: <199605281649.TAA07502@office.elvisti.kiev.ua>
Subject: Mysterious crashes of FreeBSD gateway -- caugh it(?)
To: questions@freebsd.org, scsi@freebsd.org
Date: Tue, 28 May 1996 19:49:10 +0300 (EET DST)
X-Mailer: ELM [version 2.4 PL24alpha5]
Content-Type: text
Sender: owner-freebsd-scsi@freebsd.org
X-Loop: FreeBSD.org
Precedence: bulk

Hi people,

I got some problems, and maybe someone can comment?

A machine, our recently built firewall gateway to Internet,
is:
	ATC-1425B mainboard, PCI, SiS 496/7 chipset;
	16Mb RAM;
	AMD 5x133 CPU;
	NCR 53c810 SCSI;
	1Gb Conner CFP1060S drive (recent, good one);
	two modems on the onboard COMs (SLIP lines to the world);
	1 Ethernet card.

	OS:      FreeBSD-stable as of late March.
	Add-ons: IPfilter 3.0.3+ (by Darren Reed) as in-kernel IP filtering
		 facility, Squid 1.0beta7 WWW proxy cache daemon.

The machine was experiencing spontaneous reboots from time to time.
Either silent reboots, or prefaced with messages from NCR driver
(like "NCR dead?").

When a Compex ReadyLink (DEC 21041-based) PCI ethernet was replaced
by a random NE2000, the trouble almost gone -- the box was up for
some days, just Ok.

Today we were able to inspire a repeatable reboot with a misconfigured
test machine on the same network. It has a gateway machine as
a default router, and no route to another subnet, connected via
FreeBSD-1.1.5-based host, was in the tables.  So each packed from
the test machine, destined to those subnet, was going to gateway
first, then forwarded to 1.1.5 box, and ICMP redirect was sent to
the test box about this from the gateway.

When doing a massive TCP transfers to the 1.1.5-connected subnet,
or even ping -f, a high network load was inspired on a gateway
machine (receive a packet -- forvard it -- send redirect).
NE2000 worked fine, but NCR driver started to through messages
about I/O errors, "NCR dead", etc.  Then gateway rebooted itself.

	What I want to ask.

What might be the source of that trouble? Poor motherboard quality, when
an overloaded (?! is it an overload?) either ISA or PCI bus forces NCR
to go asleep?  Or is it a bug in TCP/IP stack or IPfilter, or
their interraction with NCR driver, tickled
with the nessesity to process IP/ICMP packets at a very high rate?

I strongly suspect a hardware problem, but maybe there are other
opinions?

-- 

	With best regards -- Andrew Stesin.

	+380 (44) 2760188	+380 (44) 2713457	+380 (44) 2713560

	"You may delegate authority, but not responsibility."
					Frank's Management Rule #1.