From: Pete French
To: freebsd-stable@freebsd.org
Subject: Re: 11.1 running on HyperV hn interface hangs
Date: Wed, 6 Sep 2017 12:02:43 +0100
Message-ID: <3f96c7d0-4fbd-26cb-5c84-8868d12eb427@ingresso.co.uk>
In-Reply-To: <20170906193309.796c79ed@akips.com>

> We recently moved our software from 11.0-p9 to 11.1-p1, but looks like there
> is a regression in 11.1-p1 running on HyperV (Windows/HyperV 2012 R2) where
> the virtual hn0 interface hangs with the following kernel messages:
>
> hn0: on vmbus0
> hn0: Ethernet address: 00:15:5d:31:21:0f
> hn0: link state changed to UP
> ...
> hn0: RXBUF ack retry
> hn0: RXBUF ack failed
> last message repeated 571 times
>
> It requires a restart of the HyperV VM.
>
> This is a customer production server (remote customer ~4000km away) running
> fairly critical monitoring software, so we needed to roll it back to 11.0-p9.
> We only have two customers running our software in HyperV, vs lots in VMware
> and a handful on physical hardware.
>
> 11.0-p9 has been very stable. Has anyone seen this problem before with 11.1 ?

I don't run anything on local Hyper-V anymore, but I do run a lot of
stuff in Azure, and we haven't seen anything like this. I track STABLE
for things though, updating after reading the commits and testing
locally for a week or so, so the version I am running currently is
r320175, which was part of 11.1-BETA2.

I am going to upgrade to a more recent STABLE sometime this week or
next though. I will do that on a test machine and let you know how it
goes.

I seem to recall that there were some large changes to the hn code in
August to add virtual function support. When does 11.1-p1 date from?

-pete.