From owner-freebsd-net@FreeBSD.ORG Sun Oct 23 09:15:30 2011 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6427B106566B for ; Sun, 23 Oct 2011 09:15:30 +0000 (UTC) (envelope-from kostikbel@gmail.com) Received: from mail.zoral.com.ua (mx0.zoral.com.ua [91.193.166.200]) by mx1.freebsd.org (Postfix) with ESMTP id B26338FC12 for ; Sun, 23 Oct 2011 09:15:29 +0000 (UTC) Received: from alf.home (alf.kiev.zoral.com.ua [10.1.1.177]) by mail.zoral.com.ua (8.14.2/8.14.2) with ESMTP id p9N8ik5V047727 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Sun, 23 Oct 2011 11:44:46 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: from alf.home (kostik@localhost [127.0.0.1]) by alf.home (8.14.5/8.14.5) with ESMTP id p9N8ij5D083505; Sun, 23 Oct 2011 11:44:45 +0300 (EEST) (envelope-from kostikbel@gmail.com) Received: (from kostik@localhost) by alf.home (8.14.5/8.14.5/Submit) id p9N8ijEU083504; Sun, 23 Oct 2011 11:44:45 +0300 (EEST) (envelope-from kostikbel@gmail.com) X-Authentication-Warning: alf.home: kostik set sender to kostikbel@gmail.com using -f Date: Sun, 23 Oct 2011 11:44:45 +0300 From: Kostik Belousov To: Pawel Jakub Dawidek Message-ID: <20111023084445.GB50300@deviant.kiev.zoral.com.ua> References: <20111022084931.GD1697@garage.freebsd.pl> <4EA36F53.9050907@freebsd.org> <20111023061038.GE1697@garage.freebsd.pl> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="NP+fiIeNjDlB/E6h" Content-Disposition: inline In-Reply-To: <20111023061038.GE1697@garage.freebsd.pl> User-Agent: Mutt/1.4.2.3i X-Virus-Scanned: clamav-milter 0.95.2 at skuns.kiev.zoral.com.ua X-Virus-Status: Clean X-Spam-Status: No, score=-3.3 required=5.0 tests=ALL_TRUSTED,AWL,BAYES_00, DNS_FROM_OPENWHOIS autolearn=no version=3.2.5 X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on skuns.kiev.zoral.com.ua Cc: Lawrence Stewart , freebsd-current@freebsd.org, Andre Oppermann , freebsd-net@freebsd.org Subject: Re: 9.0-RC1 panic in tcp_input: negative winow. X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 23 Oct 2011 09:15:30 -0000 --NP+fiIeNjDlB/E6h Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sun, Oct 23, 2011 at 08:10:38AM +0200, Pawel Jakub Dawidek wrote: > On Sun, Oct 23, 2011 at 12:35:15PM +1100, Lawrence Stewart wrote: > > On 10/22/11 19:49, Pawel Jakub Dawidek wrote: > > > The panic message says: > > > > > > panic: tcp_input negative window: tp 0xfffffe007763e000 rcv_nxt 3718= 269252 rcv_adv 3718268291 > > > > > > I only have picture of the backtrace: > > > > > > http://people.freebsd.org/~pjd/misc/panic_negative_window.jpg > > > > >=20 > > ewww that is not good. Can you give us any more information about the= =20 > > machine and what it's doing? Is it terminating TCP connections from the= =20 > > internet at large or only local LAN (i.e. is there likely to be packet= =20 > > loss happening)? Are you doing TSO or LRO? Do you have any non-default= =20 > > tuning in place? >=20 > It is my local file server. It is doing NFS and AFP over LAN and also > downloads files from the internet. It is triggered after few hours. > I changed the KASSERT() into printf() and added printing 'win' variable > and this is what got logged during the night: >=20 > 05:16:24 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107827= 269 rcv_adv 1107826256 win=3D242 > 05:16:29 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107833= 451 rcv_adv 1107832977 win=3D880 > 05:16:41 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1107849= 563 rcv_adv 1107848860 win=3D639 > 05:20:02 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108108= 230 rcv_adv 1108107331 win=3D567 > 05:24:30 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108433= 302 rcv_adv 1108432272 win=3D974 > 05:24:46 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108450= 385 rcv_adv 1108450060 win=3D751 > 05:26:44 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108574= 818 rcv_adv 1108573851 win=3D71 > 05:28:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108654= 103 rcv_adv 1108653166 win=3D0 > 05:28:43 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108692= 396 rcv_adv 1108691451 win=3D0 > 05:30:06 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1108781= 258 rcv_adv 1108780372 win=3D235 > 05:35:05 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109067= 578 rcv_adv 1109067335 win=3D663 > 05:37:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109180= 403 rcv_adv 1109179411 win=3D0 > 05:41:03 tcp_input negative window: tp 0xfffffe0026772b70 rcv_nxt 1109428= 265 rcv_adv 1109427375 win=3D170 >=20 > And the systems seems to be fine. >=20 > I'm happy to test patches, but one round would take 24h. >=20 > My suggestion would be that if we won't be able to fix it before 9.0, > we should turn this assertion off, as the system seems to be able to > recover. Shipped kernels have all assertions turned off. --NP+fiIeNjDlB/E6h Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.18 (FreeBSD) iEYEARECAAYFAk6j0/0ACgkQC3+MBN1Mb4h5iwCfQa26LAP0gzvVdmSIiR9rLNvj 5UsAnRPP8tZxdYYn9jOXOHo2pvnPM0bJ =/lze -----END PGP SIGNATURE----- --NP+fiIeNjDlB/E6h--