From owner-freebsd-current@FreeBSD.ORG Wed Jul 27 14:55:04 2005 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 2B5F816A41F for ; Wed, 27 Jul 2005 14:55:04 +0000 (GMT) (envelope-from kjelderg@gmail.com) Received: from rproxy.gmail.com (rproxy.gmail.com [64.233.170.202]) by mx1.FreeBSD.org (Postfix) with ESMTP id 2840E43D48 for ; Wed, 27 Jul 2005 14:55:03 +0000 (GMT) (envelope-from kjelderg@gmail.com) Received: by rproxy.gmail.com with SMTP id i8so300464rne for ; Wed, 27 Jul 2005 07:55:02 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=TueNvO4WLL8KSoX0D9U5lqsGfPRhnnarmDPDb4n/X6XS5usUaR5Eo2qD6i2uIXXffk7lX/ypK2Z2smmcxiNspooUawMorNiDFYeAsIC0YSVW6ozEbKw+lKUy3GlGbgvT9OS4tWrMjrnRVuKKQiaMeUSARqnjX+2xakUyLonZCGw= Received: by 10.38.208.13 with SMTP id f13mr543748rng; Wed, 27 Jul 2005 07:55:02 -0700 (PDT) Received: by 10.38.101.41 with HTTP; Wed, 27 Jul 2005 07:55:02 -0700 (PDT) Message-ID: Date: Wed, 27 Jul 2005 23:55:02 +0900 From: Eric Kjeldergaard To: Eric Anderson In-Reply-To: <42E78EDC.9020801@centtech.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline References: <42E58007.9030202@rogers.com> <20050726193324.GA4603@odin.ac.hmc.edu> <20050726200059.GA47478@freebie.xs4all.nl> <200507261853.19211.jkim@FreeBSD.org> <20050726233933.GA13679@odin.ac.hmc.edu> <42E78E1C.7030104@centtech.com> <42E78EDC.9020801@centtech.com> Cc: freebsd-current@freebsd.org Subject: Re: dhclient taking all cpu X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: Eric Kjeldergaard List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 27 Jul 2005 14:55:04 -0000 On 7/27/05, Eric Anderson wrote: > Eric Anderson wrote: > > Brooks Davis wrote: > >=20 > >> On Tue, Jul 26, 2005 at 06:53:17PM -0400, Jung-uk Kim wrote: > >> > >>> On Tuesday 26 July 2005 04:00 pm, Wilko Bulte wrote: > >>> > >>>> On Tue, Jul 26, 2005 at 12:33:24PM -0700, Brooks Davis wrote.. > >>>> > >>>> > >>>>> On Mon, Jul 25, 2005 at 10:39:09PM -0400, Mike Jakubik wrote: > >>>>> > >>>>>> On Mon, July 25, 2005 9:54 pm, Brooks Davis said: > >>>>>> > >>>>>>>>> Probably something wrong with your interface, but you > >>>>>>>>> havent't provided any useful information so who knows. At > >>>>>>>>> the very least, I need to know what interface you are > >>>>>>>>> running on, something about it's status, and if both > >>>>>>>>> dhclient processes are running. > >>>>>>>> > >>>>>>>> > >>>>>>>> The interface is xl0 (3Com 3c905C-TX Fast Etherlink XL), and > >>>>>>>> it worked in this machine fine for as long as i remember. > >>>>>>>> This seems to have happened since a recent cvsup and > >>>>>>>> buildworld from ~6-BETA to 7-CURRENT. I rebooted three > >>>>>>>> times, and the problem occured rougly a minute after bootup. > >>>>>>>> On the fourth time however, it seems to be ok so far. > >>>>>>> > >>>>>>> > >>>>>>> That sounds like a problem with the code that handles the > >>>>>>> link state notifications in the interface driver. The > >>>>>>> notifications are a reletivly new feature that we're only now > >>>>>>> starting to use heavily so there are going to be bumps in the > >>>>>>> road. It would be intresting to know if you see link state > >>>>>>> messages promptly if you plug and unplug the network cable. > >>>>>> > >>>>>> > >>>>>> It seems to be back at it again, this time it took longer to > >>>>>> kick in. Here is a "ps auxw|grep dhclient" : > >>>>>> > >>>>>> _dhcp 219 93.5 0.2 1484 1136 ?? Rs 8:49PM 5:06.00=20 > >>>>>> dhclient: xl0 (dhclient) > >>>>>> root 193 0.0 0.2 1484 1088 d0- S 8:49PM 0:00.02=20 > >>>>>> dhclient: xl0 [priv] (dhclient) > >>>>>> > >>>>>> top: > >>>>>> > >>>>>> PID USERNAME THR PRI NICE SIZE RES STATE TIME WCPU= =20 > >>>>>> COMMAND 219 _dhcp 1 129 0 1484K 1136K RUN 9:33= =20 > >>>>>> 94.24% dhclient > >>>>>> > >>>>>> Nothing in dmesg about link state changes on xl0. Unplugging > >>>>>> and replugging the network cable results in link state > >>>>>> notification within a couple seconds. > >>>>> > >>>>> > >>>>> Could you see what happens if you run dhclient in the foreground? > >>>>> Just running "dhclient -d xl0" should do it. I'd like to know > >>>>> what sort of output it's generating. > >>>> > >>>> > >>>> In my case it is not displaying anything: > >>>> > >>>> > >>>> chuck#dhclient -d ath0 > >>>> DHCPREQUEST on ath0 to 255.255.255.255 port 67 > >>>> DHCPACK from 192.168.5.254 > >>>> bound to 192.168.5.20 -- renewal in 21600 seconds. > >>>> > >>>> > >>>> > >>>> I can tell the phenomenon occurs when my laptop fan springs to > >>>> life: > >>>> > >>>> CPU states: 96.5% user, 0.0% nice, 2.7% system, 0.8% interrupt,= =20 > >>>> 0.0% idle > >>>> Mem: 48M Active, 28M Inact, 50M Wired, 680K Cache, 34M Buf, 115M > >>>> Free Swap: 257M Total, 257M Free > >>>> > >>>> PID USERNAME THR PRI NICE SIZE RES STATE TIME WCPU > >>>> COMMAND 719 _dhcp 1 129 0 1384K 1092K RUN 2:14 > >>>> 93.55% dhclient 607 root 1 98 0 34584K 21212K select =20 > >>>> 0:09 1.81% Xorg 663 wb 4 20 0 46712K 40224K kserel =20 > >>>> 0:27 0.00% mozilla-bin 503 root 1 8 0 1184K 796K > >>>> nanslp 0:07 0.00% powerd > >>>> > >>>> Took (best guess) approx 5-10 minutes for the effect to kick in. > >>> > >>> > >>> FYI, I have the same issues with bge(4) and ndis(4). > >> > >> > >> > >> I've seen it on ath and em interfaces now, but am not sure what's goin= g > >> on. and have no idea how to reproduce the problem. As also reported b= y > >> Bakul Shah, we seem to be getting into a state where receive_packet() = is > >> spinning. I'm not seeing an obvious way for this to be possible. > >=20 > >=20 > > It's the latest change to tables.c that breaks it. Reverting that=20 > > single line back, fixes it. >=20 > Sigh.. It ran for about 15 minutes, without problems. Of course, again,= =20 > as soon as I hit send, I see dhclient start chugging a moment later. I= =20 > suppose it took longer this time since my CPU is already maxed out doing= =20 > portupgrade. >=20 > So, nevermind. All I know, is that about 2 weeks ago, it worked just=20 > fine. >=20 > Eric I'm having the same problem, ath on -CURRENT the only new info that might be useful that I can add right now, is that it wasn't happening on my 3 - 5 (closer to 3, I think) day old build. --=20 If I write a signature, my emails will appear more personalised.