From owner-svn-src-all@freebsd.org Mon Jun 6 06:06:58 2016 Return-Path: Delivered-To: svn-src-all@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id B8A30B6D160; Mon, 6 Jun 2016 06:06:58 +0000 (UTC) (envelope-from melifaro@ipfw.ru) Received: from forward1h.cmail.yandex.net (forward1h.cmail.yandex.net [IPv6:2a02:6b8:0:f35::11]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "forwards.mail.yandex.net", Issuer "Yandex CA" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 2D3581431; Mon, 6 Jun 2016 06:06:58 +0000 (UTC) (envelope-from melifaro@ipfw.ru) Received: from mxback10h.mail.yandex.net (mxback10h.mail.yandex.net [84.201.186.19]) by forward1h.cmail.yandex.net (Yandex) with ESMTP id E94AC20BB6; Mon, 6 Jun 2016 09:06:53 +0300 (MSK) Received: from web29h.yandex.ru (web29h.yandex.ru [84.201.187.163]) by mxback10h.mail.yandex.net (nwsmtp/Yandex) with ESMTP id i6TmWUc6As-6qFOKYYo; Mon, 06 Jun 2016 09:06:52 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ipfw.ru; s=mail; t=1465193212; bh=apsvOkuih3ttIpPvLaSjcUJ4+0Qi8e7FTigieJL4ilo=; h=X-Yandex-Sender-Uid:From:Envelope-From:To:Cc:In-Reply-To: References:Subject:MIME-Version:Message-Id:X-Mailer:Date: Content-Transfer-Encoding:Content-Type; b=sB0wCUHhS0RVJyupbIsWQxgjFFiU2MCao/8JeeGRFHN5GQ4CHtj5OE41nSmlVVUG0 we7XD6/O8burlAO0Eb8OihbIyTMrAoV655T3TTUb5GkEikW3k57iLUjovbTGRwHpIC tCGcWs5P3zMDXYwsZEqMISNrgafDJuo6aeA1lveM= Authentication-Results: mxback10h.mail.yandex.net; dkim=pass header.i=@ipfw.ru X-Yandex-Suid-Status: 1 0,1 0,1 0,1 0,1 0,1 1130000031510964 X-Yandex-Sender-Uid: 1130000014307927 Received: by web29h.yandex.ru with HTTP; Mon, 06 Jun 2016 09:06:52 +0300 From: Alexander V. Chernikov Envelope-From: melifaro@ipfw.ru To: George Neville-Neil Cc: Mike Karels , src-committers , svn-src-all , svn-src-head In-Reply-To: References: <201606021751.u52HpTrH090384@repo.freebsd.org> <3448221465067132@web17h.yandex.ru> Subject: Re: svn commit: r301217 - in head/sys: net netinet netinet6 MIME-Version: 1.0 Message-Id: <3505271465193212@web29h.yandex.ru> X-Mailer: Yamail [ http://yandex.ru ] 5.0 Date: Mon, 06 Jun 2016 09:06:52 +0300 Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset=utf-8 X-BeenThere: svn-src-all@freebsd.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: "SVN commit messages for the entire src tree \(except for " user" and " projects" \)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 06 Jun 2016 06:06:58 -0000 06.06.2016, 04:40, "George Neville-Neil" : > On 4 Jun 2016, at 15:05, Alexander V. Chernikov wrote: > >>  02.06.2016, 20:51, "George V. Neville-Neil" : >>>  Author: gnn >>>  Date: Thu Jun 2 17:51:29 2016 >>>  New Revision: 301217 >>>  URL: https://svnweb.freebsd.org/changeset/base/301217 >>> >>>  Log: >>>    This change re-adds L2 caching for TCP and UDP, as originally >>>  added in D4306 >>>    but removed due to other changes in the system. Restore the >>>  llentry pointer >>>    to the "struct route", and use it to cache the L2 lookup (ARP or >>>  ND6) as >>>    appropriate. >> >>  I have several comments regarding this commit. >> >>  1 Architecturally, there was quite a lot of efforts to eliminate >>  layering violation between lltable and other places in network stack. >>  It ended by committing D4102, which allowed both to cleanup lower >>  level, provide abstract “prepend” framework which could be used to >>  provide cached data to _otuput() functions. >>  This change brings these violations back in a really invasive way. >> >>  Additionally, implementing L2 PCB caching at the other subsystem >>  expense is really a bad idea. >>  If one wants caching in some subsystem, it should be implemented in >>  that subsystem not polluting other things. >>  Current implementation permits this by filling in “ro_prepend” / >>  ro_plen fields. >> >>  In general, this change looks more like a local hack and not like the >>  code that should be included in the tree. >> >>  2 There was no benchmarks proving the effectiveness of this change. >>  (For example, it is not obvious if it could significantly improve TCP >>  since we still have per-session TCP wlock + (typically) per-ring >>  mutex, so removing lltable rock might not help things here). Given >>  that the patch complicates existing code, there should be adequate >>  benefits to consider whether this change is worth implementing. >> >>  3 The “network” group was not included to the review despite the >>  fact that most of the changes were related to the L2 layer which is >>  not “transport”, so some people might have missed this review. >> >>  4 This change DOES NOT WORK. really. >>  (which raises questions on both review and benchmarking process). >> >>  The reason is that “plle” argument is filled only in “heavy” >>  lltable lookup functions (e.g. when we don’t have neighbour >>  adjacency). 99.9% of the time arpresolve/nd6_resolve() returns the >>  result w/o calling their heavy versions, and the returned “plle” >>  is NULL. >> >>  This can be easily verified by calling something like >>  dtrace -n 'fbt:kernel:ether_output:entry /arg3!=NULL&&((struct route >>  *)arg3)->ro_lle != NULL/ { stack(); }' >> >>  Given that, I kindly ask you to backout this change. > > Hi, > > I'm going to limit the scope of this reply to just you, me and Mike > Karels, from whom this originated. No, please keep the discussion open. The decision on having that particular L2 caching implementation (and L2 caching in general) is quite important, so it would be great if all technical arguments were saved so other people can (now or later) understand the decision details. Thanks for understanding. > > Best, > George