Date: Thu, 9 Dec 2021 13:23:43 +0100
From: Peter
To: freebsd-stable@freebsd.org
Subject: [ahd driver] 12.3: kernel crash when stopping disks
List-Id: Production branch of FreeBSD source code
List-Archive: https://lists.freebsd.org/archives/freebsd-stable

> Dec 5 01:08:25 edge gstopd[64139]: Error received from stop unit command
> Dec 5 01:08:25 edge kernel: ahd0: Recovery Initiated - Card was not paused
> Dec 5 01:08:25 edge kernel: 
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
> Dec 5 01:08:25 edge kernel: ahd0: Dumping Card State at program address 0x7e Mode 0x22
> Dec 5 01:08:25 edge kernel: (pass0:ahd0:0:0:0): SCB 247 - timed out
> Dec 5 01:08:25 edge kernel: (pass0:ahd0:0:0:0): Queuing a BDR SCB
> Dec 5 01:08:25 edge kernel: (pass0:ahd0:0:0:0): Bus Device Reset Message Sent

Hija,

I had a closer look at this one:

There must be a timeout flaw in the driver logic. I tried to run the
STOP UNIT from camcontrol with "-t 30", but nevertheless these
controller errors appear after some 5 or 10 seconds. So wherever it
gets the timeout from, it is not the right one.

The kernel crash is then an occasional consequence of these strange
timeouts - it happened only once, while the erroneous timeouts happen
more often.

I now work around the issue: as STOP UNIT is the only command
concerned, I invoke it with the IMMED bit set, not waiting until the
disk finally stops. As this is only for saving the rain forests (and
my power bill), I don't care if or when the disks manage to stop.

No more problems or errors since then.

cheerio,
PMc
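
P.S. For anyone wanting to try the same workaround, here is a minimal sketch of what "STOP UNIT with the IMMED bit" looks like at the CDB level. It assumes the standard 6-byte START STOP UNIT layout (opcode 0x1B, IMMED in bit 0 of byte 1, START in bit 0 of byte 4). The helper names and the device name "da0" are made up for illustration, and passing the bytes through "camcontrol cmd ... -c" is only one possible way to send them, not necessarily how it is done on the machine above.

```python
START_STOP_UNIT = 0x1B  # SCSI opcode for START STOP UNIT


def build_stop_unit_cdb(immed: bool = True) -> bytes:
    """Build a 6-byte START STOP UNIT CDB that asks the disk to spin down.

    With immed=True the device should return status right away instead of
    waiting for the spindle to stop, so no long command timeout is involved.
    """
    byte1 = 0x01 if immed else 0x00  # bit 0 of byte 1 is IMMED
    byte4 = 0x00                     # START=0 (stop), LOEJ=0, power condition 0
    return bytes([START_STOP_UNIT, byte1, 0x00, 0x00, byte4, 0x00])


def camcontrol_args(device: str, cdb: bytes) -> list[str]:
    """Format the CDB as an argument list for 'camcontrol cmd' (assumed usage)."""
    hex_cdb = " ".join(f"{b:02X}" for b in cdb)
    return ["camcontrol", "cmd", device, "-c", hex_cdb]


if __name__ == "__main__":
    # "da0" is a hypothetical device name.
    print(" ".join(camcontrol_args("da0", build_stop_unit_cdb())))
```

The script only prints the command line and does not touch any device; whether the ahd timeout misbehaves with other commands as well is not covered by this.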