From nobody Thu Apr 20 02:08:36 2023 X-Original-To: bugs@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4Q21Kc4gb4z465B0 for ; Thu, 20 Apr 2023 02:08:36 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from mxrelay.nyi.freebsd.org (mxrelay.nyi.freebsd.org [IPv6:2610:1c1:1:606c::19:3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "mxrelay.nyi.freebsd.org", Issuer "R3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4Q21Kc2T17z47CC for ; Thu, 20 Apr 2023 02:08:36 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1681956516; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=voyK34gwylj3ksf5YnEXZX7KL8Oatz70e5vcR4gipow=; b=tpzDkUOcuk7SlGsNDewgVQhQ/CeDT/G5MChYiOrVnlh0kT0fwDpZDO7hqnWA9Fog6frHxy UqOdLtkoiVBQS2I5Q+1Tkv5OSihll4UUJo4rFibEAujLPUJH4c/2pYrmqVIvn8ScyLwA/z +sMu9dbqzvUaudtnV8NjXzz80QLQWWrOkisNiu4gNBg2VOM+zASAPhs1BAuVIFxB5pQBZ/ 5FpgGcHtTvOksbvlwY5jUQEYAD/1i6oza8asdrD+5YnRjcOmt8S4tONux0heKaqm4oTVBq wLvRsanqSsrOXw7NJOamHR5nf8aBIBNSQbUoJ3tlVY6z+2NgJjm6dXv6en3j3A== ARC-Authentication-Results: i=1; mx1.freebsd.org; none ARC-Seal: i=1; s=dkim; d=freebsd.org; t=1681956516; a=rsa-sha256; cv=none; b=fvKZ/gUGwO3giw4ennIRDULhZl/0mWT48MrIWdZQREkLXQXOi85AnQOZc3fIqDtmlgdiEc 59LMPbczXm10w07wX+hhjmyVLCpKi2tB28/BwmEcfjLebtBECKGpbX/xs9O4pzeGSedyK+ CSarrCzYArDeWK1/tmkywKzFdQWpd49ZhbxAXaAr68qHdU4iMxCe9YH8n2ay7Z26YN+BF7 1fD2F1tSF/MDwmMvA0Ub6gO4BOjZmXQJVwZYMSSLyLOATnMg2gGIhPEPKue6fbR7RCOANF G91+HrL/fDh6BkYdXsYwTZFmS086VYWLs3jALartDaXl6kO8VTPhKSUfylh8eQ== Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2610:1c1:1:606c::50:1d]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mxrelay.nyi.freebsd.org (Postfix) with ESMTPS id 4Q21Kc1VmgzJmL for ; Thu, 20 Apr 2023 02:08:36 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org ([127.0.1.5]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id 33K28agc040056 for ; Thu, 20 Apr 2023 02:08:36 GMT (envelope-from bugzilla-noreply@freebsd.org) Received: (from www@localhost) by kenobi.freebsd.org (8.15.2/8.15.2/Submit) id 33K28aX9040055 for bugs@FreeBSD.org; Thu, 20 Apr 2023 02:08:36 GMT (envelope-from bugzilla-noreply@freebsd.org) X-Authentication-Warning: kenobi.freebsd.org: www set sender to bugzilla-noreply@freebsd.org using -f From: bugzilla-noreply@freebsd.org To: bugs@FreeBSD.org Subject: [Bug 270943] Complete system freeze on Asus dual socket AMD 7742 system Date: Thu, 20 Apr 2023 02:08:36 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: new X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: misc X-Bugzilla-Version: 13.2-RELEASE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Some People X-Bugzilla-Who: nb@synthcom.com X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: bugs@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_id short_desc product version rep_platform op_sys bug_status bug_severity priority component assigned_to reporter attachments.created Message-ID: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated List-Id: Bug reports List-Archive: https://lists.freebsd.org/archives/freebsd-bugs List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-bugs@freebsd.org MIME-Version: 1.0 X-ThisMailContainsUnwantedMimeParts: N https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D270943 Bug ID: 270943 Summary: Complete system freeze on Asus dual socket AMD 7742 system Product: Base System Version: 13.2-RELEASE Hardware: amd64 OS: Any Status: New Severity: Affects Some People Priority: --- Component: misc Assignee: bugs@FreeBSD.org Reporter: nb@synthcom.com Created attachment 241605 --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=3D241605&action= =3Dedit dmesg.boot For this system I have a dual socket 7742 system (128 total real cores, 128 threads) that w= ill completely lock up the system in under an hour if left idle. By "lock up", = this means: * Console unresponsive (no keyboard/USB/numlock) * Networking unresponsive (no pings, no arps, nothing) Like it's "jumping to self" with all interrupts disabled. The system needs = to be reset or power cycled. I have tried the following distributions over the last few months with the same results: FreeBSD 13.2-RELEASE releng/13.2-n254617-525ecfdad597 GENERIC amd64 FreeBSD 13.1 FreeBSD 13.0 FreeBSD 12.3 Several memstick images of 14.0 since December 2022 Other notes: * The lockup is guaranteed. I've never had it not lock up when left idle. Always locks up in <1 hour (usually in 10-20 minutes). * If I run a "stress" program, the system runs for days at a time without a= ny observed lockups. If there's any significant system activity, it appears to= not lock up. * At one point (on a 14.0 build) I was able to get the kernel debugger comp= iled in. When the system locked up, hitting the local USB keyboard sequence to g= et in to the kernel debugger worked. This also seemed to unlock the system, as after I exited the kernel debugger, the system was alive again. * I've installed the OSes on either 2GB M.2 Samsung SSDs *OR* on a Western Digital SN200 NVME disk. No changes in behavior. Storage does not appear to= be a factor. * I've halved the memory and swapped DIMMs entirely. No change. System specs: Motherboard : Asus rs700a-e11-rs12u-wocpu009z CPUs : Dual AMD 7742 CPUs BIOS Version : 0901 BMC Firmware model : RS700A-E11-RS12U BMC Firmware version: 1.2.15 Installed ECC memory: 512GB Storage : Two Samsung EVO 980 TB M.2 SSDs, and a WD SN200 7.68TB NVME U.2 disk Video is the ASpeed AST2500, which supplies video for the system.=20 I'd be happy to put this system on the internet and allow any and all interested parties access to it for troubleshooting/debugging. Thank you! --=20 You are receiving this mail because: You are the assignee for the bug.=