From owner-freebsd-bugs@freebsd.org Wed Aug 26 16:35:06 2015 Return-Path: Delivered-To: freebsd-bugs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id AC1B49C3543 for ; Wed, 26 Aug 2015 16:35:06 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2001:1900:2254:206a::16:76]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 9085C1053 for ; Wed, 26 Aug 2015 16:35:06 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from bugs.freebsd.org ([127.0.1.118]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id t7QGZ6HL044333 for ; Wed, 26 Aug 2015 16:35:06 GMT (envelope-from bugzilla-noreply@freebsd.org) From: bugzilla-noreply@freebsd.org To: freebsd-bugs@FreeBSD.org Subject: [Bug 202680] Silent data corruption on em(4) interfaces Date: Wed, 26 Aug 2015 16:35:06 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 10.2-STABLE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Some People X-Bugzilla-Who: KOT@MATPOCKuH.Ru X-Bugzilla-Status: New X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: freebsd-bugs@FreeBSD.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version rep_platform op_sys bug_status bug_severity priority component assigned_to reporter Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 26 Aug 2015 16:35:06 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=202680 Bug ID: 202680 Summary: Silent data corruption on em(4) interfaces Product: Base System Version: 10.2-STABLE Hardware: Any OS: Any Status: New Severity: Affects Some People Priority: --- Component: kern Assignee: freebsd-bugs@FreeBSD.org Reporter: KOT@MATPOCKuH.Ru I got silent data corruption when transferring data via em(4) interfaces on 10.2-STABLE r286912. Immediately after reboot does not occur - after reboot I can transfer files without any errors. Problem occurs *only* after few hours of uptime. I checked on system with uptime > 1 day and got: 1. broken large file transferred via ftp (MD5 checksum mismatched); For example: Original MD5(eis-dvd.iso)= 3705908d84f3c0f9f922a00e6e9332d2 Downloaded MD5 (eis-dvd.iso) = 1f7de1fdc682ae9ca26ec0621b377e89 2. SSH disconnects when transferring large data with message: Corrupted MAC on input. Disconnecting: Packet corrupt I using Sun Fire X4100 server with 4 build in ethernet ports Intel 82546EB: em0@pci0:1:1:0: class=0x020000 card=0x10118086 chip=0x10108086 rev=0x03 hdr=0x00 vendor = 'Intel Corporation' device = '82546EB Gigabit Ethernet Controller (Copper)' class = network subclass = ethernet This server works under solaris without any problems. I tried to use: 1. lagg0.2 (lagg0 with vlan tag) over em0 and em2 interfaces: cloned_interfaces="lagg0 lagg0.2" ifconfig_em0="up" ifconfig_em2="up" create_args_lagg0="laggproto lacp laggport em0 laggport em2" 2. tagged em1.2 interface via manual ifconfig em1.2 create 192.168.X.Y And I got same problem in both cases. *One* time when transferring file I got this messages: em0: Interface stopped DISTRIBUTING, possible flapping em0: Watchdog timeout -- resetting em2: Interface stopped DISTRIBUTING, possible flapping em2: Watchdog timeout -- resetting But this problem occurred only one time. netstat -in does not see any problems on interfaces: Name Mtu Network Address Ipkts Ierrs Idrop Opkts Oerrs Coll em0 1500 00:14:4f:01:3f:7a 6689452 0 0 146720 0 0 em1 1500 00:14:4f:01:3f:7b 5732168 0 0 2865912 0 0 em2 1500 00:14:4f:01:3f:7c 501817 0 0 3392333 0 0 TCP_OFFLOAD was disabled in kernel's config. System is not in production use, and I can try to roll back to pre-r269196 code or try to collect any required additional diagnosis information, but I can't provide remote access to this system. -- You are receiving this mail because: You are the assignee for the bug.