From: Nimrod Levy
Date: Wed, 17 Jan 2018 21:45:31 +0000
Subject: Re: Ryzen issues on FreeBSD ?
To: Mike Tancsa
Cc: Don Lewis, freebsd-stable@freebsd.org, Pete French
In-Reply-To: <795dbb79-3c18-d967-98b9-5d09a740dbfe@sentex.net>

I'm running 11-STABLE from 12/9. amdtemp works for me. It also has the
sysctl indicating that it has the shared page fix. I'm pretty sure I've
seen the lockups since then. I'll update to the latest STABLE and see
what happens.

One weird thing about my experience is that if I keep something running
continuously, like the distributed.net client on 6 of 12 possible
threads, the system stays up for MUCH longer than without it. This is a
home server and very lightly loaded (one could argue insanely
overpowered for the use case).

I'm glad to see that there has been some attention on this. I was a
little disappointed by the earlier thread.
I'm happy to help troubleshoot, but I'm not sure what information I can
gather from a hard-locked system that doesn't even show anything on the
console.

--
Nimrod

On Wed, Jan 17, 2018 at 4:01 PM Mike Tancsa wrote:

> On 1/17/2018 3:39 PM, Don Lewis wrote:
> > On 17 Jan, Mike Tancsa wrote:
> >> On 1/17/2018 8:43 AM, Pete French wrote:
> >>>
> >>> Are you running the latest STABLE ? There were some patches for
> >>> Ryzen which went in, I believe, and might affect the stability.
> >>> Specifically the changes to stop it locking up when executing
> >>> code in the top page ?
> >>
> >> Hi,
> >> I was testing with RELENG_11 as of 2 days ago. The fix seems to
> >> be there
> >>
> >> # sysctl -A hw.lower_amd64_sharedpage
> >> hw.lower_amd64_sharedpage: 1
> >>
> >> Would love to find a class of motherboard that pushes its "You
> >> don't need to dork around with any BIOS settings. It just works.
> >> Oh, and we have a hardware watchdog too".... ipmi would be stellar.
> >
> > The shared page change fixed the random lockup and silent reboot
> > problem for me. I've got a 1700X eight core CPU and a Gigabyte X370
> > Gaming 5. I did have to RMA my CPU (it was an early one) because it
> > had the problem with random segfaults that seemed to be triggered by
> > process migration between CPU cores. I still haven't switched over
> > to using it for package builds because I see more random fallout
> > than on my older package builder. I'm not blaming the hardware for
> > that at this point because I see a lot of the same issues on my
> > older machine, but less frequently.
> >
> > One thing to watch (though it should be less critical with a six
> > core CPU) is VRM cooling. I removed the stupid plastic shroud over
> > the VRM sink on my motherboard so that it gets some more airflow.
>
> Thanks! I will confirm the cooling. I tried just now looking at the
> CPU FAN control in the BIOS and up'd it to "turbo" from the default.
> Does amdtemp.ko work with your chipset ? Nothing on mine
> unfortunately, so I can't tell from the OS if it's running hot.
>
> Is there a way to see if your CPU is old and has that bug ? I haven't
> seen any segfaults on the few dozen buildworlds I have done. So far
> it's always been a total lockup and not a crash with RELENG_11.
>
> x86info v1.31pre
> Found 12 identical CPUs
> Extended Family: 8 Extended Model: 0 Family: 15 Model: 1 Stepping: 1
> CPU Model (x86info's best guess): AMD Zen Series Processor (ZP-B1)
> Processor name string (BIOS programmed): AMD Ryzen 5 1600 Six-Core Processor
>
> Monitor/Mwait: min/max line size 64/64, ecx bit 0 support, enumeration extension
> SVM: revision 1, 32768 ASIDs, np, lbrVirt, SVMLock, NRIPSave, TscRateMsr, VmcbClean, FlushByAsid, DecodeAssists, PauseFilter, PauseFilterThreshold
> Address Size: 48 bits virtual, 48 bits physical
> The physical package has 12 of 16 possible cores implemented.
> running at an estimated 3.20GHz
>
> ---Mike
>
> --
> -------------------
> Mike Tancsa, tel +1 519 651 3400
> Sentex Communications, mike@sentex.net
> Providing Internet services since 1994 www.sentex.net
> Cambridge, Ontario Canada http://www.tancsa.com/
> _______________________________________________
> freebsd-stable@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"
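
For anyone reading the x86info output quoted above: the raw "Family: 15" and "Extended Family: 8" fields are not two separate families. Under the usual CPUID convention they combine into the display family, which is how a Zen part ends up as family 17h. Below is a minimal sketch of that arithmetic, assuming the AMD rule that the extended fields only apply when the base family is 0xF. The function name display_family_model is made up for illustration and is not part of x86info or any FreeBSD tool.

```python
def display_family_model(base_family, ext_family, base_model, ext_model):
    """Combine raw CPUID leaf-1 fields into the display family and model.

    Assumes the AMD convention: the extended fields are only used when
    the base family field is saturated at 0xF.
    """
    if base_family == 0xF:
        family = base_family + ext_family
        model = (ext_model << 4) | base_model
    else:
        family = base_family
        model = base_model
    return family, model


# Values taken from the quoted x86info output:
# Extended Family: 8, Extended Model: 0, Family: 15, Model: 1
family, model = display_family_model(
    base_family=15, ext_family=8, base_model=1, ext_model=0
)
print(f"family {family:#x}, model {model:#x}")
```

This only identifies the CPU generation. It does not tell you whether a particular chip is one of the early units with the segfault problem Don mentions.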