From owner-freebsd-net@FreeBSD.ORG Thu Jan 12 10:30:02 2012 Return-Path: Delivered-To: freebsd-net@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 41999106566B; Thu, 12 Jan 2012 10:30:02 +0000 (UTC) (envelope-from avg@FreeBSD.org) Received: from citadel.icyb.net.ua (citadel.icyb.net.ua [212.40.38.140]) by mx1.freebsd.org (Postfix) with ESMTP id E1E8C8FC13; Thu, 12 Jan 2012 10:30:00 +0000 (UTC) Received: from porto.starpoint.kiev.ua (porto-e.starpoint.kiev.ua [212.40.38.100]) by citadel.icyb.net.ua (8.8.8p3/ICyb-2.3exp) with ESMTP id MAA01957; Thu, 12 Jan 2012 12:29:59 +0200 (EET) (envelope-from avg@FreeBSD.org) Received: from localhost ([127.0.0.1]) by porto.starpoint.kiev.ua with esmtp (Exim 4.34 (FreeBSD)) id 1RlHuw-000E3v-Vi; Thu, 12 Jan 2012 12:29:59 +0200 Message-ID: <4F0EB625.3000905@FreeBSD.org> Date: Thu, 12 Jan 2012 12:29:57 +0200 From: Andriy Gapon User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:9.0) Gecko/20111222 Thunderbird/9.0 MIME-Version: 1.0 To: lev@FreeBSD.org References: <1379921442.20120112133112@serebryakov.spb.ru> <4F0EADE1.9070803@FreeBSD.org> <1699441022.20120112140525@serebryakov.spb.ru> In-Reply-To: <1699441022.20120112140525@serebryakov.spb.ru> X-Enigmail-Version: undefined Content-Type: text/plain; charset=windows-1251 Content-Transfer-Encoding: 8bit Cc: freebsd-net@FreeBSD.org, freebsd-current@FreeBSD.org, jhb@FreeBSD.org Subject: Re: SCHED_ULE / NetGraph interaction broken somwhere between r227874 and r229818 X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Jan 2012 10:30:02 -0000 on 12/01/2012 12:05 Lev Serebryakov said the following: > Hello, Andriy. > You wrote 12 января 2012 г., 13:54:41: > >>> Switching to 4BSD helps. 4BSD works as usual: all CPU time is >>> interrupts and network thread, system is responsive under heaviest load, >>> normal operations of DNS, DHCP and hostapd. >> How reproducible is this result? > 100% > >> In other words, have you definitely ruled out all other factors besides the >> scheduler? > > I have two almost-identical NanoBSD images which differs in one line in kernel > config -- option about scheduler. Worlds are exactly the same, only kernels were > rebuilt. > > Alexander Motin suggests, that switching scheduler could slightly > change stack consumption, which triggers switching to ng_queue > instead of direct calls. > > Really, here is diff between "md5" of all files of one and other > images: Well, I mostly meant things like uptime, load level and pattern, etc. But what mav says makes sense. Also I remember seeing some very old reports about some strange issues with SCHED_ULE and dummynet. Some links that I found: http://lists.freebsd.org/pipermail/freebsd-stable/2008-October/046332.html http://dadv.livejournal.com/139366.html#cutid1 Given the last link, I wonder if binding the ng_queue thread to a particular CPU would change anything. > blob# diff ~lev/bsd-image.md5sums ~lev/ule-image.md5sums > 74c74 > < MD5 (./boot/kernel/kernel) = 3bb0dd757628b5065d27ee5e7fc22eb3 > --- >> MD5 (./boot/kernel/kernel) = 5ba379d2c73e1277566f4bbcb618a9f2 > 618c618 > < MD5 (./conf/base/var/log/userlog) = a827af82c1f780687706b19c7d94b29e > --- >> MD5 (./conf/base/var/log/userlog) = fc289b66ae6cb23f9b24b694bf12157b > 15678c15678 > < MD5 (./var/log/userlog) = a827af82c1f780687706b19c7d94b29e > --- >> MD5 (./var/log/userlog) = fc289b66ae6cb23f9b24b694bf12157b > -- Andriy Gapon