From owner-freebsd-hackers@FreeBSD.ORG Mon Apr 23 16:48:55 2007 Return-Path: X-Original-To: freebsd-hackers@freebsd.org Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id E213216A401; Mon, 23 Apr 2007 16:48:55 +0000 (UTC) (envelope-from raggen@passagen.se) Received: from av11-1-sn2.hy.skanova.net (av11-1-sn2.hy.skanova.net [81.228.8.183]) by mx1.freebsd.org (Postfix) with ESMTP id 6A01E13C457; Mon, 23 Apr 2007 16:48:55 +0000 (UTC) (envelope-from raggen@passagen.se) Received: by av11-1-sn2.hy.skanova.net (Postfix, from userid 502) id DD97B385AB; Mon, 23 Apr 2007 18:28:07 +0200 (CEST) Received: from smtp4-1-sn2.hy.skanova.net (smtp4-1-sn2.hy.skanova.net [81.228.8.92]) by av11-1-sn2.hy.skanova.net (Postfix) with ESMTP id BD3F2385A8; Mon, 23 Apr 2007 18:28:07 +0200 (CEST) Received: from [192.168.1.6] (81-231-90-251-no41.tbcn.telia.com [81.231.90.251]) by smtp4-1-sn2.hy.skanova.net (Postfix) with ESMTP id 9B0DE37E48; Mon, 23 Apr 2007 18:28:07 +0200 (CEST) Message-ID: <462CDE7D.5020203@passagen.se> Date: Mon, 23 Apr 2007 18:27:41 +0200 From: Roger Olofsson User-Agent: Thunderbird 1.5.0.10 (Windows/20070221) MIME-Version: 1.0 To: Alejandro Pulver , freebsd-hackers@freebsd.org References: <20070414184719.110deaa2@deimos.mars.bsd> <46217486.6080801@u.washington.edu> <20070415161753.7c7a604d@deimos.mars.bsd> <462318CB.3030205@u.washington.edu> <20070422232633.2390b1e2@deimos.mars.bsd> In-Reply-To: <20070422232633.2390b1e2@deimos.mars.bsd> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: Subject: Re: High disk load +mount/atacontrol/NFS/SMBFS crashes the system X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Apr 2007 16:48:56 -0000 Alejandro Pulver skrev: > On Sun, 15 Apr 2007 23:33:47 -0700 > Garrett Cooper wrote: > >> Ale, >> I'm not sure what's going on exactly based on the information you >> provided, but I would try the following steps to isolate the issue: >> >> 1) See if you can upgrade the first machine to a later version of >> FreeBSD, say 6.2. I believe that there were related issues resolved in >> 6.2, but my memory could be incorrect. See if your problems occur after >> that. > > I did that. > >> 2) Try grabbing a different machine if possible and see if the same >> issue occurs when you put the new machine as server and client with one >> of the other machines. > > I used a Win XP machine as client / server. > >> 3) Try switching roles with the 2 machines. If machine 1 is usually >> server, let it play client and vice versa with machine 2. > > Also did this. > >> 4) Remove the new drive if possible, see if issue goes away. If it does, >> try acquiring a cheap(er) drive and put it >> > > It's the only drive it has, I meant the second machine is all new, not > just the disk. > >> Also, it appears that another FreeBSD team member had a similar issue >> (see: http://people.freebsd.org/~pho/stress/log/cons205.html and >> http://people.freebsd.org/~pho/stress/log/cons225.html). I dunno how but >> it showed up as one of the leading searches on Google. >> >> It looks like a (localized) filesystem issue, but I'm not sure what it >> is exactly. >> > > The fsync() problem seems to be related to that, but the rest could be > be a different thing. Also I only got it twice. Maybe the filesystem > issues were only derived from the crashes. > > I was unable to reproduce the problem in the first machine, maybe it > was fixed on FreeBSD 6.2 as you said. The only things I also did when > testing was unloading fuse.ko (unused) and linprocfs.ko (after > umounting it). However I will test it a few times more, and let you > know the results. > > The strange crash in the new 6.2 machine when using atacontrol is still > unexplained and I couldn't make it happen again (it now refuses to > switch to UDMA100 mode when it is SATA300, maybe they aren't supported > in SATA drives, but the other time it just crashed without advise). > > Thank you for your help with this. > > Best Regards, > Ale Dear Ale, I have experienced something similar as you described when this thread started. The solution for me was to exchange the NIC I had for one that worked better. I learned that using cheap nics with realtek chips causes crashes even on the most stable operating system in the world. When I browsed the source code for the driver of the realtek-based nic I regretted I hadn't done so earlier. The comments were _crystal_ clear about the design and performance of it. See /usr/src/sys/pci/if_rl.c. I particularly liked the following bit: /* * Here's a totally undocumented fact for you. When the * RealTek chip is in the process of copying a packet into * RAM for you, the length will be 0xfff0. If you spot a * packet header with this value, you need to stop. The * datasheet makes absolutely no mention of this and * RealTek should be shot for this. */ Hope you will solve the issue! Greetings /Roger