From owner-freebsd-stable@freebsd.org Sun Jul 12 09:59:29 2015 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 0F09B998747 for ; Sun, 12 Jul 2015 09:59:29 +0000 (UTC) (envelope-from h.schmalzbauer@omnilan.de) Received: from mx0.gentlemail.de (mx0.gentlemail.de [IPv6:2a00:e10:2800::a130]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 830A1E86; Sun, 12 Jul 2015 09:59:28 +0000 (UTC) (envelope-from h.schmalzbauer@omnilan.de) Received: from mh0.gentlemail.de (mh0.gentlemail.de [78.138.80.135]) by mx0.gentlemail.de (8.14.5/8.14.5) with ESMTP id t6C9xOi4074241; Sun, 12 Jul 2015 11:59:24 +0200 (CEST) (envelope-from h.schmalzbauer@omnilan.de) Received: from titan.inop.mo1.omnilan.net (titan.inop.mo1.omnilan.net [IPv6:2001:a60:f0bb:1::3:1]) (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mh0.gentlemail.de (Postfix) with ESMTPSA id 2AFEFF91; Sun, 12 Jul 2015 11:59:24 +0200 (CEST) Message-ID: <55A23A75.8050003@omnilan.de> Date: Sun, 12 Jul 2015 11:59:17 +0200 From: Harald Schmalzbauer Organization: OmniLAN User-Agent: Mozilla/5.0 (X11; U; FreeBSD i386; de-DE; rv:1.9.2.8) Gecko/20100906 Lightning/1.0b2 Thunderbird/3.1.2 MIME-Version: 1.0 To: =?UTF-8?B?RWR3YXJkIFRvbWFzeiBOYXBpZXJhxYJh?= , FreeBSD Stable , kib@freebsd.org Subject: Re: r284665 causes MSI problems -> ahcich2: Timeout in slot 11 port 0 References: <55A158E1.3000905@omnilan.de> <20150712094153.GA1549@brick> In-Reply-To: <20150712094153.GA1549@brick> X-Enigmail-Version: 1.1.2 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enig4C2550AF81F2E20B268F139D" X-Greylist: ACL 119 matched, not delayed by milter-greylist-4.2.7 (mx0.gentlemail.de [78.138.80.130]); Sun, 12 Jul 2015 11:59:24 +0200 (CEST) X-Milter: Spamilter (Reciever: mx0.gentlemail.de; Sender-ip: 78.138.80.135; Sender-helo: mh0.gentlemail.de; ) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 12 Jul 2015 09:59:29 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enig4C2550AF81F2E20B268F139D Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Bez=C3=BCglich Edward Tomasz Napiera=C5=82a's Nachricht vom 12.07.2015 1= 1:41 (localtime): > On 0711T1956, Harald Schmalzbauer wrote: >> Hello, >> >> r284665 causes ahci(4) to fail with timeouts when using MSI (the defau= lt). > What's the hardware? Thanks for your attention, it's Intel Cougar Point (C204, 2x SATA6G+4xSATAII), via PCIe-Passthrough in an ESXi guest. Several of these setups have been in production with 9.2 and 10.1 for 2 years+ without ahcich timeouts. >> 'hint.ahci.0.msi=3D0' is one way to make ahci(4) working with r284665,= but >> obviously not the desired solution, it just disables usage of an MSI. >> >> I can't find suspicious code in r282213 which could cause this strange= >> regression, but I verified carefully that problem arises with r284665.= >> Actually, r282901 >> (https://svnweb.freebsd.org/base?view=3Drevision&sortby=3Ddate&revisio= n=3D282901) >> is the real trigger, verified by putting >> nooptions RACCT >> nooptions RACCT_DEFAULT_TO_DISABLED >> nooptions RCTL >> into my kernel config -> problem vanishes! >> >> Setting "kern.racct.enable=3D1" doesn't make any difference, as soon a= s >> 'kern.features.racct' exists, there's the ahci(4)/ahcich2 timeout and >> machine doesn't finish booting. >> >> Unfortunately, I don't have any idea how to track this down to the >> actual culprit, but I hope the RACCT hackers do have ;-) >> >> Shall I open a bugzilla ticket? > That's... curious. I don't see how those two things could be related. > What's the FreeBSD version? How reproducible it is? Have you tried > compiling with and without those three lines a couple of times? Yes, I tried several times, and falsified that with r284665 the timeouts reproducably show up (which blocks the booting process, a major issue in my case). I also verified that several different revisions <284665 don't lead to that problem, and also that the changes in ahci code paths for the last year are not involved. I also can't see any relation, wich doesn't mean much since I don't have the kernel skills, but I'm sure the symptoms start with "options RACCT" Thanks, -Harry --------------enig4C2550AF81F2E20B268F139D Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.18 (FreeBSD) iEYEARECAAYFAlWiOnsACgkQLDqVQ9VXb8iuVQCgq3n1kyvOG7FeoO/2lw9WvA/x ywYAnj2sy0/C/IYNtUs/vf1vdIPvMImO =pbJQ -----END PGP SIGNATURE----- --------------enig4C2550AF81F2E20B268F139D--