Date: Mon, 16 Jun 2008 14:44:09 -0700 (PDT) From: arc_gabriel <m.pagulayan@auckland.ac.nz> To: freebsd-current@freebsd.org Subject: Re: FreeBSD 7, bridge, PF and syn flood = very bad performance Message-ID: <17873869.post@talk.nabble.com> In-Reply-To: <200801271549.52791.max@love2party.net> References: <479A2389.2000802@moneybookers.com> <200801262017.52091.max@love2party.net> <479B9F4F.5010705@moneybookers.com> <200801262227.36970.max@love2party.net> <479C4F31.7090804@moneybookers.com> <200801271422.23340.max@love2party.net> <479C953C.1010304@moneybookers.com> <200801271549.52791.max@love2party.net>
next in thread | previous in thread | raw e-mail | index | archive | help
Hi Guys, I am in the same boat as you guys are. I am using pf from 7.0-RELEASE FreeBSD 7.0-RELEASE Hardware Specs are IBM x3655 Interfaces: em0 and em1 (intel 1G cards) FW Setup: As Bridge I get very high load on CPU whenever em1 taskq and em0 taskq reaches it peaks. I noticed this when traffic goes to about more than 17MB/s or I can see drops as well on em1(external interface) when packets go over 14K packets/s. Which is a bit slow for this new hardware. But the other thing we setup with PF is Altq. Basically We have queues on both the em0(internal) and em1(externaL) interfaces. May this is another bottleneck? I was just wondering on how you remedied this problem. I am badly in need of solutions. Our internet has been crappy for the past week because when the CPU load goes high it drops heaps of packets. Best Regads, Mark Max Laier wrote: > > On Sunday 27 January 2008, Stefan Lambrev wrote: >> Greetings, >> >> Max Laier wrote: >> >> -cut- >> >> >> Well I think the interesting lines from this experiment are: >> >> max total wait_total count avg >> >> wait_avg cnt_hold cnt_lock name >> >> 39 25328476 70950955 9015860 2 7 >> >> 5854948 6309848 /usr/src/sys/contrib/pf/net/pf.c:6729 (sleep >> >> mutex:pf task mtx) >> >> 936935 10645209 350 50 212904 7 >> >> 110 47 /usr/src/sys/contrib/pf/net/pf.c:980 (sleep >> >> mutex:pf task mtx) >> > >> > Yeah, those two mostly are the culprit, but a quick fix is not really >> > available. You can try to "set timeout interval" to something bigger >> > (e.g. 60 seconds) which will decrease the average hold time of the >> > second lock instance at the cost of increased peak memory usage. >> >> I'll try and this. At least memory doesn't seems to be a problem :) >> >> > I have the ideas how to fix this, but it will take much much more >> > time than I currently have for FreeBSD :-\ In general this requires >> > a bottom up redesign of pf locking and some data structures involved >> > in the state tree handling. >> > >> > The first(=main) lock instance is also far from optimal (i.e. pf is a >> > congestion point in the bridge forwarding path). For this I have >> > also a plan how to make at least state table lookups run in parallel >> > to some extend, but again the lack of free time to spend coding >> > prevents me from doing it at the moment :-\ >> >> Well, now we know where the issue is. The same problem seems to affect >> synproxy state btw. >> Can I expect better performance with IPFW's dynamic rules? > > Not significantly better, I'd predict. IPFW's dynamic rules are also > protected by a single mutex leading to similar congestion problems as pf. > There should be a measureable constant improvement as IPFW does much less > sanity checks. i.e. better performance at the expense of less security. > It really depends on your needs which is better suited for your setup. > >> I wonder how one can protect himself on gigabit network and service >> more then 500pps. >> For example in my test lab I see incoming ~400k packets per second, but >> if I activate PF, >> I see only 130-140k packets per second. Is this expected behavior, if >> PF cannot handle so many packets? > > As you can see from the hwpmc trace starting this thread, we don't spend > that much time in pf. The culprit is the pf task mutext, which forces > serialization in pf congesting the whole forward path. Under different > circumstances pf can handle more pps. > >> The missing 250k+ are not listed as discarded or other errors, which is >> weird. > > As you slow down the forwarding protocols like TCP will automatically slow > down. Unless you have UDP bombs blasting at your network this is quite > usual behavior. > > -- > /"\ Best regards, | mlaier@freebsd.org > \ / Max Laier | ICQ #67774661 > X http://pf4freebsd.love2party.net/ | mlaier@EFnet > / \ ASCII Ribbon Campaign | Against HTML Mail and News > > > -- View this message in context: http://www.nabble.com/FreeBSD-7%2C-bridge%2C-PF-and-syn-flood-%3D-very-bad-performance-tp15093449p17873869.html Sent from the freebsd-current mailing list archive at Nabble.com.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?17873869.post>