From owner-freebsd-stable Fri Aug 30 12:39: 6 2002 Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.FreeBSD.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 6A69A37B400; Fri, 30 Aug 2002 12:39:01 -0700 (PDT) Received: from apollo.backplane.com (apollo.backplane.com [216.240.41.2]) by mx1.FreeBSD.org (Postfix) with ESMTP id D669243E4A; Fri, 30 Aug 2002 12:39:00 -0700 (PDT) (envelope-from dillon@apollo.backplane.com) Received: from apollo.backplane.com (localhost [127.0.0.1]) by apollo.backplane.com (8.12.5/8.12.4) with ESMTP id g7UJd0PQ031373; Fri, 30 Aug 2002 12:39:00 -0700 (PDT) (envelope-from dillon@apollo.backplane.com) Received: (from dillon@localhost) by apollo.backplane.com (8.12.5/8.12.4/Submit) id g7UJd0Pb031372; Fri, 30 Aug 2002 12:39:00 -0700 (PDT) (envelope-from dillon) Date: Fri, 30 Aug 2002 12:39:00 -0700 (PDT) From: Matthew Dillon Message-Id: <200208301939.g7UJd0Pb031372@apollo.backplane.com> To: "Marc G. Fournier" Cc: Arnvid Karstad , , Subject: Re: Problems with FreeBSD - causing zalloc to return 0 ?! References: <20020830162006.J14642-100000@mail1.hub.org> Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG :Is there any definite way of determining this? From what I can find int :he man page, VNODES have to do with the file system and directory lists :... my last two crashes look to be related to the file system itself, so :have started to watch the VNODE numbers to, but is there some way of :determining what I should raise, and where? If you can get to a DDB> prompt on the crash (kernel config w/ DDB) do this: ddb> print *kernel_vm_end And tell me what you get. Also note the fault address. In order to really track down the cause I need a vmcore and kernel.debug to play with, but baring that you might be able to dump memory statistics to a file like once a second until the machine crashes. while (1) sleep 1 date >> stats.log vmstat -m >> stats.log vmstat -z >> stats.log netstat -m >> stats.log fsync stats.log end :I'm running 4Gig of RAM and Dual CPU over here, so having a swap device :large enough to 'dump core' is kinda out of the question, so I can't :provide any more infomration that i have so far in other messages :( If you can reproduce the crash with less memory you may be able to generate a core. To boot the machine with less memory add a line to your /boot/loader.conf file: hw.physmem="768m" I usually always keep such a line in my loader.conf file, commented out (e.g. #hw.physmem=...) until I need it, because I always forget the name of the variable :-) :And, also, should any 'server class' operating system be more graceful :about such things? Some sort of soft limit that triggers it to refuse new :processes or something when its hit, so that it doesn't actually crash? :Kinda like the NMBCLUSTERS warning/error message when its set too low? FreeBSD-current will be far more graceful, but FreeBSD-stable is still using algorithms based on circa 1990 system memory capacities. As memory capacities have grown larger then available KVM the algorithms have been less able to cope with the massive number of resources that can now be cached. -Matt To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message