From owner-freebsd-hackers@freebsd.org Thu Nov 14 15:30:24 2019 Return-Path: Delivered-To: freebsd-hackers@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id C9DA61AB23B for ; Thu, 14 Nov 2019 15:30:24 +0000 (UTC) (envelope-from danny@cs.huji.ac.il) Received: from kabab.cs.huji.ac.il (kabab.cs.huji.ac.il [132.65.116.210]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 47DQR36xHQz40Y6 for ; Thu, 14 Nov 2019 15:30:23 +0000 (UTC) (envelope-from danny@cs.huji.ac.il) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=cs.huji.ac.il; s=57791128; h=To:References:Message-Id:Content-Transfer-Encoding:Cc:Date:In-Reply-To:From:Subject:Mime-Version:Content-Type; bh=SCS6bZwZKoicEVnUIe+fqfHAVsjW5ocoLt9Rjyp5hqM=; b=2R2kFcZrQal76zL0klMw6O1bcADXxOOxnt49oyRaC2dFBDCtS3ofwmmb+EFB75fMQG1XnpMnvDz0NcuDJDZR7QjVR3/4aIpDssM7bke+9F7NYytaCQys5cFYBO4/IgexbPzEc0DM1wcZL7tmf5I9R06+15N6H/Bmtn+Hs/ysz2PdzDOw/QFIggyxLa2mt//DFBR2WkZu7lEJPYsQ2sRdAHMx3VT9kSyV/aKlCgRO7f6uy8SG0SoaYC2n/3cVDRbdoYw3p3YQEM4vEo7dP2n+Gk+YtsViX0Mmp4xfE/QTRKb7VVZSfox7adHuyaBQN3ARxf7FfoHvfpkk7FB/xGwj4g==; Received: from macmini.bk.cs.huji.ac.il ([132.65.179.19]) by kabab.cs.huji.ac.il with esmtp id 1iVH4f-000D64-7l; Thu, 14 Nov 2019 17:30:21 +0200 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 13.0 \(3601.0.10\)) Subject: Re: can the hardware watchdog reboot a hung kernel? From: Daniel Braniss In-Reply-To: Date: Thu, 14 Nov 2019 17:30:20 +0200 Cc: freebsd-hackers Content-Transfer-Encoding: quoted-printable Message-Id: <33FF6A8A-C01A-4261-A35A-0E05C96FD04A@cs.huji.ac.il> References: To: Miroslav Lachman <000.fbsd@quip.cz> X-Mailer: Apple Mail (2.3601.0.10) X-Rspamd-Queue-Id: 47DQR36xHQz40Y6 X-Spamd-Bar: - Authentication-Results: mx1.freebsd.org; dkim=pass header.d=cs.huji.ac.il header.s=57791128 header.b=2R2kFcZr; dmarc=pass (policy=none) header.from=huji.ac.il; spf=none (mx1.freebsd.org: domain of danny@cs.huji.ac.il has no SPF policy when checking 132.65.116.210) smtp.mailfrom=danny@cs.huji.ac.il X-Spamd-Result: default: False [-1.99 / 15.00]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-0.99)[-0.994,0]; R_DKIM_ALLOW(-0.20)[cs.huji.ac.il:s=57791128]; FROM_HAS_DN(0.00)[]; MV_CASE(0.50)[]; NEURAL_HAM_LONG(-1.00)[-1.000,0]; MIME_GOOD(-0.10)[text/plain]; RCVD_TLS_LAST(0.00)[]; TO_MATCH_ENVRCPT_SOME(0.00)[]; TO_DN_ALL(0.00)[]; DKIM_TRACE(0.00)[cs.huji.ac.il:+]; RCPT_COUNT_TWO(0.00)[2]; RCVD_IN_DNSWL_NONE(0.00)[210.116.65.132.list.dnswl.org : 127.0.10.0]; DMARC_POLICY_ALLOW(-0.50)[huji.ac.il,none]; R_SPF_NA(0.00)[]; FROM_EQ_ENVFROM(0.00)[]; SUBJECT_ENDS_QUESTION(1.00)[]; MIME_TRACE(0.00)[0:+]; ASN(0.00)[asn:378, ipnet:132.64.0.0/13, country:IL]; MID_RHS_MATCH_FROM(0.00)[]; IP_SCORE(-0.70)[ip: (-1.40), ipnet: 132.64.0.0/13(-1.20), asn: 378(-0.96), country: IL(0.05)]; RCVD_COUNT_TWO(0.00)[2] X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 14 Nov 2019 15:30:24 -0000 > On 14 Nov 2019, at 17:24, Miroslav Lachman <000.fbsd@quip.cz> wrote: >=20 > Daniel Braniss wrote on 2019/11/14 15:52: >> hi, >> I have serveral hundred Nano-pi NEO running, and sometimes they hang, = since there is no console >> available, the only solution is to do a power cycle - not so easy = since they are distributed in three buildings :-) >> I am looking at the watchdog stuff, but it seems that what I want is = not supported, i.e. >> reboot the kernel when hung >> wishful thinking? >=20 > There is watchdog and watchdogd in base. I never tried it but there = are some solutions which need support in BIOS / board where watchdog is = communicating with HW and if OS freezes, HW don't get reply from OS and = issue reboot after timeout. > I don't know if Nano-pi has this support or not. >=20 > Miroslav Lachman the nano reports: aw_wdog0: mem 0x1c20ca0-0x1c20cbf irq = 26 on simplebus0 so there is something there :-)