From: "lokadamus@gmx.de"
To: shahzaib mushtaq
Cc: freebsd-questions@freebsd.org, galtsev@kicp.uchicago.edu
Subject: Re: FreeBSD Crashes Intermittently !!
Date: Sat, 23 Apr 2016 18:25:05 +0200
Message-ID: <571BA1E1.5020809@gmx.de>

Hi,

The temperature looks OK, but is the server working hard?
I was thinking about disabling cores 22 and 23, but I found your older
mails and see that different cores produce this error.

http://pastebin.com/baShWuMP <-- from now
http://pastebin.com/042SJ11c <-- 9th March
https://lists.freebsd.org/pipermail/freebsd-current/2016-January/059148.html <-- January

I'm confused.

Regards

On 04/19/16 13:56, shahzaib mushtaq wrote:
> Hi,
>
> Currently 2 x ffmpeg processes are running and the temp is:
>
> http://prntscr.com/au4wrm
>
> Well, it looks like a restart is not necessary for the microcode; you can
> simply start the update using the command "service microcode_update start".
>
> Regards.
>
> On Tue, Apr 19, 2016 at 4:50 PM, lokadamus@gmx.de wrote:
>
>> Yes, this is the port. I tested it on my old system.
>> After a reboot it will start /usr/local/etc/rc.d/microcode-update and show
>> a little message.
>> My system is too old for an update.
>>
>> I'm wondering. I've never heard that staying below 80 W protects against
>> overheating.
>> Can you test this?
>>
>> http://www.cyberciti.biz/faq/freebsd-determine-processor-cpu-temperature-command/
>>
>> On 04/19/16 13:24, shahzaib mushtaq wrote:
>>> Hi,
>>>
>>> Can we use the following FreeBSD guide to update the microcode?
>>>
>>> Install sysutils/devcpu-data,
>>> then add:
>>>
>>> microcode_update_enable="YES"
>>>
>>> =====================================================
>>>
>>> https://www.freebsd.org/doc/faq/compatibility-processors.html
>>>
>>> On Tue, Apr 19, 2016 at 1:40 PM, shahzaib mushtaq
>>> wrote:
>>>
>>>> Hi,
>>>>
>>>> We don't think it's related to heat because the L5640 only uses 60 W.
>>>> Can we update the microcode on FreeBSD? We ask because Intel does not
>>>> mention this OS in connection with microcode updates.
>>>>
>>>> Regards.
>>>>
>>>> On Tue, Apr 19, 2016 at 1:27 PM, lokadamus@gmx.de
>>>> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I am thinking about these error lines:
>>>>> Hardware event. This is not a software error.
>>>>> CPU 23 BANK 5
>>>>> MISC 0 ADDR 805613c60
>>>>> MCG status:MCIP
>>>>> STATUS be00000000800400 MCGSTATUS 4
>>>>> ....
>>>>> Hardware event. This is not a software error.
>>>>> CPU 22 BANK 5
>>>>>
>>>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>>>>
>>>>> It looks like a hardware problem with the second CPU.
>>>>> Things that can be done:
>>>>> - Is it possible to read CPU temperature info from the BIOS?
>>>>> - Disable HTT and see if the error comes back
>>>>> - Remove the second CPU and see if ...
>>>>> - Install microcode updates and hope that fixes it
>>>>>
>>>>> Intel offers a microcode update for many CPUs.
>>>>>
>>>>> https://downloadcenter.intel.com/download/25512/Linux-Processor-Microcode-Data-File?v=t
>>>>>
>>>>> Can you test a CPU in another system?
>>>>> https://www.freebsd.org/cgi/ports.cgi?query=cpuburn&stype=all
>>>>>
>>>>> Regards
>>>>>
>>>>> On 04/19/16 09:35, shahzaib mushtaq wrote:
>>>>>> Hi, sorry for the mistake, the CPUs are:
>>>>>>
>>>>>> 2 x Intel(R) Xeon(R) CPU L5640 @ 2.27GHz (12 cores, 24 threads)
>>>>>>
>>>>>> On Tue, Apr 19, 2016 at 12:32 PM, lokadamus@gmx.de
>>>>>> wrote:
>>>>>>
>>>>>>> On 04/18/16 16:28, shahzaib mushtaq wrote:
>>>>>>>> Hi again, got back after a long time. So yes, we've moved to new
>>>>>>>> Dell R510 hardware now. Here are the specs:
>>>>>>>>
>>>>>>>> DELL R510
>>>>>>>> 2 x L5520
>>>>>>>> 64GB RAM
>>>>>>>> 12x3TB RAID striping+mirroring (HBA LSI-9211, fw version 19.00)
>>>>>>>> FreeBSD cw009.tunefiles.com 10.2-RELEASE-p14 FreeBSD 10.2-RELEASE-p14 #0:
>>>>>>>> Wed Mar 16 20:46:12 UTC 2016
>>>>>>>> root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC
>>>>>>>> amd64
>>>>>>>>
>>>>>>>> After 9 days of uptime, the server crashed again with the following
>>>>>>>> error in the crash log:
>>>>>>>>
>>>>>>>> http://pastebin.com/baShWuMP
>>>>>>>>
>>>>>>>> I am very depressed now; there is a lot of pressure on me from my
>>>>>>>> company. Please help us resolve this crash issue. :(
>>>>>>> Which CPU model is installed? Is it one or more?
>>>>>>>
>>>>>>> There were some microcode updates for some models.
>>>>>>>
>>>>>>> Greetings
>>>>>>>
>>>>>> _______________________________________________
>>>>>> freebsd-questions@freebsd.org mailing list
>>>>>> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
>>>>>> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
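
For reference, the machine-check values quoted in this thread (STATUS be00000000800400, MCGSTATUS 4) can be split into their flag bits by hand. The following is only a minimal sketch: it assumes the architectural IA32_MCi_STATUS and IA32_MCG_STATUS bit layout from the Intel SDM, the function and table names are made up for illustration, and it does not replace a real decoder such as mcelog.

```python
# Sketch only: split the machine-check values quoted in the thread into
# their flag bits. Bit positions are assumed from the architectural
# IA32_MCi_STATUS / IA32_MCG_STATUS layout in the Intel SDM. All names
# below are hypothetical helpers, not part of any FreeBSD tool.

MCI_STATUS_FLAGS = {
    63: "VAL",    # register contains valid information
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # error was not corrected by hardware
    60: "EN",     # error reporting was enabled
    59: "MISCV",  # MISC register is valid
    58: "ADDRV",  # ADDR register is valid
    57: "PCC",    # processor context may be corrupt
}

MCG_STATUS_FLAGS = {
    2: "MCIP",  # machine check in progress
    1: "EIPV",  # error IP valid
    0: "RIPV",  # restart IP valid
}


def set_flags(value, table):
    """Return the names of all flags in `table` whose bit is set in `value`."""
    return [name for bit, name in sorted(table.items(), reverse=True)
            if (value >> bit) & 1]


def decode_mci_status(hex_string):
    """Split a 64-bit MCi_STATUS value (hex string) into flags and codes."""
    status = int(hex_string, 16)
    return {
        "flags": set_flags(status, MCI_STATUS_FLAGS),
        "mca_error_code": status & 0xFFFF,              # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


def decode_mcg_status(value):
    """Return the flag names set in an MCG_STATUS value."""
    return set_flags(value, MCG_STATUS_FLAGS)


if __name__ == "__main__":
    decoded = decode_mci_status("be00000000800400")
    print("MCi_STATUS flags:", ", ".join(decoded["flags"]))
    print("MCA error code: 0x%04x" % decoded["mca_error_code"])
    print("Model-specific code: 0x%04x" % decoded["model_specific_code"])
    print("MCG_STATUS flags:", ", ".join(decode_mcg_status(4)))
```

With the values from the log, UC and PCC are both set, which fits a crash rather than a corrected error, and MCGSTATUS 4 decodes to MCIP, matching the "MCG status:MCIP" line. The MCA error code 0x0400 appears in the SDM's simple error code table as an internal timer error, but please check that against the manual for the exact CPU model before drawing conclusions.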