Date: Fri, 28 Jan 2022 09:15:27 +0100
From: Emmanuel Vadot
To: Greg Balfour
Cc: stable@freebsd.org
Subject: Re: 12.3-RELEASE GPU Hangs
Message-Id: <20220128091527.6cff34c1450c5ccf6b077be3@bidouilliste.com>
List-Id: Production branch of FreeBSD source code
List-Archive: https://lists.freebsd.org/archives/freebsd-stable

On Thu, 27 Jan 2022 16:30:28 -0600
Greg Balfour wrote:

> Ever since upgrading to 12.3-RELEASE I've been experiencing GPU Hangs
> every 7 to 10 days. Almost always associated with Firefox (from
> ports) when either the Download or Print dialog windows are opened.
> I had it hang once (Jan 19) when running vlc.
>
> When it hangs the desktop is frozen. I am able to ssh into the
> machine and do a reboot when this happens.
>
> Any clues as to what the issue is and how to resolve it?
>
> Log messages and some config info below.
>
> Jan 2 09:55:42 desktop kernel: [drm] GPU HANG: ecode 6:0:0x8588cffc,
> in MainThread [100247], reason: Hang on rcs0, action: reset
> Jan 2 09:55:42 desktop kernel: drmn0: Resetting chip after gpu hang
> Jan 2 09:55:51 desktop syslogd: last message repeated 1 times
> Jan 2 09:55:56 desktop kernel: drmn0: i915_reset_device timed out,
> cancelling all in-flight rendering.
>
> Jan 12 13:40:03 desktop kernel: [drm] GPU HANG: ecode 6:0:0x8588fff8,
> in MainThread [100244], reason: Hang on rcs0, action: reset
> Jan 12 13:40:03 desktop kernel: drmn0: Resetting chip after gpu hang
> Jan 12 13:40:12 desktop syslogd: last message repeated 1 times
> Jan 12 13:40:17 desktop kernel: drmn0: i915_reset_device timed out,
> cancelling all in-flight rendering.
>
> Jan 19 12:34:07 desktop kernel: drmn0: Resetting chip after gpu hang
> Jan 19 12:34:12 desktop kernel: drmn0: i915_reset_device timed out,
> cancelling all in-flight rendering.
>
> Jan 27 15:54:58 desktop kernel: [drm] GPU HANG: ecode 6:0:0x8588fff8,
> in MainThread [100258], reason: Hang on rcs0, action: reset
> Jan 27 15:54:58 desktop kernel: drmn0: Resetting chip after gpu hang
> Jan 27 15:55:07 desktop syslogd: last message repeated 1 times
> Jan 27 15:55:12 desktop kernel: drmn0: i915_reset_device timed out,
> cancelling all in-flight rendering.
>
> ---<>---
> Copyright (c) 1992-2021 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> The Regents of the University of California. All rights reserved.
> FreeBSD is a registered trademark of The FreeBSD Foundation.
> FreeBSD 12.3-RELEASE r371126 GENERIC amd64
> FreeBSD clang version 10.0.1 (git@github.com:llvm/llvm-project.git
> llvmorg-10.0.1-0-gef32c611aa2)
> VT(vga): resolution 640x480
> CPU: Intel(R) Core(TM) i7-2600 CPU @ 3.40GHz (3392.37-MHz K8-class CPU)
> Origin="GenuineIntel" Id=0x206a7 Family=0x6 Model=0x2a Stepping=7
> Features=0xbfebfbff
> Features2=0x1fbae3ff
> AMD Features=0x28100800
> AMD Features2=0x1
> XSAVE Features=0x1
> VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
> TSC: P-state invariant, performance statistics
> real memory = 17179869184 (16384 MB)
> avail memory = 16503447552 (15738 MB)
> Event timer "LAPIC" quality 600
> ACPI APIC Table:
> FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs
> FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 hardware threads
> ...
> vgapci0: port 0x3000-0x303f mem
> 0xe0c00000-0xe0ffffff,0xd0000000-0xdfffffff irq 16 at device 2.0 on
> pci0
> vgapci0: Boot video device
> ...
> drmn0: on vgapci0
> vgapci0: child drmn0 requested pci_enable_io
> vgapci0: child drmn0 requested pci_enable_io
> [drm] Unable to create a private tmpfs mount, hugepage support will be
> disabled(-19).
> Failed to add WC MTRR for [0xd0000000-0xdfffffff]: -22; performance may suffer
> [drm] Got stolen memory base 0xcba00000, size 0x4000000
> [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
> [drm] Driver supports precise vblank timestamp query.
> [drm] Connector VGA-1: get mode from tunables:
> [drm] - kern.vt.fb.modes.VGA-1
> [drm] - kern.vt.fb.default_mode
> [drm] Connector HDMI-A-1: get mode from tunables:
> [drm] - kern.vt.fb.modes.HDMI-A-1
> [drm] - kern.vt.fb.default_mode
> [drm] Connector DP-1: get mode from tunables:
> [drm] - kern.vt.fb.modes.DP-1
> [drm] - kern.vt.fb.default_mode
> [drm] Initialized i915 1.6.0 20171222 for drmn0 on minor 0
> WARNING: Device "fb" is Giant locked and may be deleted before FreeBSD 14.0.
> VT: Replacing driver "vga" with new "fb".
> start FB_INFO:
> type=11 height=768 width=1024 depth=32
> cmsize=16 size=3145728
> pbase=0xd0061000 vbase=0xfffff800d0061000
> name=drmn0 flags=0x0 stride=4096 bpp=32
> cmap[0]=0 cmap[1]=7f0000 cmap[2]=7f00 cmap[3]=c4a000
> end FB_INFO
> drmn0: fb0: inteldrmfb frame buffer device
>
> And in /etc/rc.conf:
> kld_list="/boot/modules/drm.ko /boot/modules/i915kms.ko"
>

 I've seen those 'gpu hang' reports many times (they look like the
ones reported in
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=260890#c9), but I've
never managed to reproduce them, although I don't really use my
Sandy Bridge system.
 One point to add is that nothing changed between 12.2 and 12.3 wrt
drm.
 Would you mind testing 13.0 or CURRENT to see if that fixes your
issue?

 Cheers,

-- 
Emmanuel Vadot