Date: Sat, 23 Apr 2016 18:25:05 +0200
From: "lokadamus@gmx.de" <lokadamus@gmx.de>
To: shahzaib mushtaq <shahzaib.cb@gmail.com>
Cc: freebsd-questions@freebsd.org, galtsev@kicp.uchicago.edu
Subject: Re: FreeBSD Crashes Intermittently !!
Message-ID: <571BA1E1.5020809@gmx.de>
In-Reply-To: <CAD3xhrO%2BMbTr12SgcVU7-ftHF8c2Ge4nzxbzobvKOs1qXRLQEQ@mail.gmail.com>
References: <CAD3xhrMfKO8hVdpzR1xNqV=vwTMedPeTHR7v2=5W6RwC3F4V7A@mail.gmail.com>
 <56E2E9AC.1040902@gmx.de>
 <33444.128.135.52.6.1457712900.squirrel@cosmo.uchicago.edu>
 <56E2F586.9000108@gmx.de>
 <CAD3xhrM_Q=OjZzbJO1jY5a8Qhqne50ziQjJKSQ3kLO73FnJ0ag@mail.gmail.com>
 <CAD3xhrP=KYjzOEusoOwnysTqp=ZgWgH1ofrnnmuqaK4Z=7r_pA@mail.gmail.com>
 <5715DF0F.4090808@gmx.de>
 <CAD3xhrMTv_tOXTWX7kCim-mptYkXRZ-n_Mx%2BFX80OQkd-WMsPw@mail.gmail.com>
 <5715EBDA.10907@gmx.de>
 <CAD3xhrMtmgguc_De9de7EgBJ3cNqHnRwWQxut4g_eYWG1adgYw@mail.gmail.com>
 <CAD3xhrMNAU9Unamcny8N86FGdTD38V1qDE_GpHz=r-qygP7ymw@mail.gmail.com>
 <57161B95.9020802@gmx.de>
 <CAD3xhrO%2BMbTr12SgcVU7-ftHF8c2Ge4nzxbzobvKOs1qXRLQEQ@mail.gmail.com>
Hi,

The temperature looks OK, but is the server under heavy load?

I was thinking about disabling cores 22 and 23, but then I found your
older mails and saw that different cores produce this error.

http://pastebin.com/baShWuMP <-- from now
http://pastebin.com/042SJ11c <-- 9th March
https://lists.freebsd.org/pipermail/freebsd-current/2016-January/059148.html <-- January

I'm confused.

Regards

On 04/19/16 13:56, shahzaib mushtaq wrote:
> Hi,
>
> Currently 2 x ffmpeg processes are running and the temp is :
>
> http://prntscr.com/au4wrm
>
> Well, it looks like restart is not necessary for microcode, you can simply
> start microcode using command "service microcode_update start".
>
> Regards.
>
> On Tue, Apr 19, 2016 at 4:50 PM, lokadamus@gmx.de <lokadamus@gmx.de> wrote:
>
>> Yes, this is the port. I tested it on my old system.
>> After reboot it will start /usr/local/etc/rc.d/microcode-update and show
>> a little message.
>> My system is too old for an update.
>>
>> I'm wondering. i've never heard that lower 80w are protecting for
>> overheating.
>> Can you test this?
>>
>> http://www.cyberciti.biz/faq/freebsd-determine-processor-cpu-temperature-command/
>>
>> On 04/19/16 13:24, shahzaib mushtaq wrote:
>>> Hi,
>>>
>>> Can we use following freebsd guide to update microcode ? :
>>>
>>> Install sysutils/devcpu-data
>>> <http://www.freebsd.org/cgi/url.cgi?ports/sysutils/devcpu-data/pkg-descr
>>> ,
>>> then add:
>>>
>>> microcode_update_enable="YES"
>>>
>>>
>>> =====================================================
>>>
>>> https://www.freebsd.org/doc/faq/compatibility-processors.html
>>>
>>> On Tue, Apr 19, 2016 at 1:40 PM, shahzaib mushtaq <shahzaib.cb@gmail.com
>>>
>>> wrote:
>>>
>>>> Hi,
>>>>
>>>> We don't think its related to heat because L5640 only use 60W. Can we
>>>> update microcode on FreeBSD? Because intel has not stated this OS when
>>>> performing microcode update.
>>>>
>>>> Regards.
>>>>
>>>> On Tue, Apr 19, 2016 at 1:27 PM, lokadamus@gmx.de <lokadamus@gmx.de>
>>>> wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> I think about the error lines:
>>>>> Hardware event. This is not a software error.
>>>>> CPU 23 BANK 5
>>>>> MISC 0 ADDR 805613c60
>>>>> MCG status:MCIP
>>>>> STATUS be00000000800400 MCGSTATUS 4
>>>>> ....
>>>>> Hardware event. This is not a software error.
>>>>> CPU 22 BANK 5
>>>>>
>>>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>>>>
>>>>> Looks like a hardware problem from the second cpu.
>>>>> Thinks, what can be done:
>>>>> - Is it possible to read cpu heat infos from bios?
>>>>> - Disable HTT and look, if the error comes again
>>>>> - Remove the second cpu and look, if ...
>>>>> - Install microcode updates and hope, it will fix it
>>>>>
>>>>> Intel offers for many CPUs an microcode update.
>>>>>
>>>>>
>> https://downloadcenter.intel.com/download/25512/Linux-Processor-Microcode-Data-File?v=t
>>>>>
>>>>> Can you test a cpu in another system?
>>>>> https://www.freebsd.org/cgi/ports.cgi?query=cpuburn&stype=all
>>>>>
>>>>>
>>>>> Regards
>>>>>
>>>>> On 04/19/16 09:35, shahzaib mushtaq wrote:
>>>>>> Hi, sorry for the mistake, cpus are :
>>>>>>
>>>>>> 2 x Intel(R) Xeon(R) CPU L5640 @ 2.27GHz5640 (12 cores, 24 threads)
>>>>>>
>>>>>> On Tue, Apr 19, 2016 at 12:32 PM, lokadamus@gmx.de <lokadamus@gmx.de>
>>>>> wrote:
>>>>>>
>>>>>>> On 04/18/16 16:28, shahzaib mushtaq wrote:
>>>>>>>> Hi again, got back after a long time. So yes, we've move to new Dell
>>>>> R510
>>>>>>>> Hardware now.
>>>>>>>> Here is the specs :
>>>>>>>>
>>>>>>>> DELL R510
>>>>>>>> 2 x L5520
>>>>>>>> 64GB RAM
>>>>>>>> 12x3TB Raid stripping+mirroring (HBA LSI-9211-fw version 19.00)
>>>>>>>> FreeBSD cw009.tunefiles.com 10.2-RELEASE-p14 FreeBSD
>> 10.2-RELEASE-p14
>>>>>>> #0:
>>>>>>>> Wed Mar 16 20:46:12 UTC 2016
>>>>>>>> root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC
>>>>>>>> amd64
>>>>>>>>
>>>>>>>> After 9days of uptime, server again got crashed with following error
>>>>> in
>>>>>>>> crash log :
>>>>>>>>
>>>>>>>> http://pastebin.com/baShWuMP
>>>>>>>>
>>>>>>>> I am so much depressed now, there's much pressure on me from my
>>>>> company.
>>>>>>> Please
>>>>>>>> help us resolving this crash issue . :(
>>>>>>> Which CPU Model is installed? Is it one or more?
>>>>>>>
>>>>>>> There where some microcode updates for some models.
>>>>>>>
>>>>>>> Greeting
>>>>>>>
>>>>>> _______________________________________________
>>>>>> freebsd-questions@freebsd.org mailing list
>>>>>> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
>>>>>> To unsubscribe, send any mail to "
>>>>> freebsd-questions-unsubscribe@freebsd.org"
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>>
> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"
>
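
For anyone who wants to read the quoted machine-check lines by hand: below is a minimal Python sketch that splits the STATUS and MCGSTATUS values from the log (be00000000800400 and 4) into their flag bits. The bit positions are the IA32_MCi_STATUS / IA32_MCG_STATUS layout as I understand it from the Intel SDM. The function names and the small table of error codes are my own illustration, not part of mcelog or of any FreeBSD tool, and the table is far from complete.

```python
# Sketch only: decode the architectural bits of a machine-check status value.
# Bit layout assumed from the Intel SDM (IA32_MCi_STATUS, IA32_MCG_STATUS).
# SIMPLE_ERROR_CODES is a deliberately tiny, illustrative subset.
SIMPLE_ERROR_CODES = {
    0x0000: "no error",
    0x0001: "unclassified",
    0x0002: "microcode ROM parity error",
    0x0400: "internal timer error",
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit per-bank status value into its main fields."""

    def bit(n: int) -> bool:
        return bool((status >> n) & 1)

    code = status & 0xFFFF  # bits 15:0, MCA error code
    return {
        "valid": bit(63),            # VAL
        "overflow": bit(62),         # OVER
        "uncorrected": bit(61),      # UC
        "enabled": bit(60),          # EN
        "misc_valid": bit(59),       # MISCV
        "addr_valid": bit(58),       # ADDRV
        "context_corrupt": bit(57),  # PCC
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
        "mca_error_code": code,
        "description": SIMPLE_ERROR_CODES.get(code, "not in this small table"),
    }


def decode_mcg_status(mcg: int) -> dict:
    """Split the global machine-check status into RIPV, EIPV and MCIP."""
    return {
        "ripv": bool(mcg & 0x1),
        "eipv": bool(mcg & 0x2),
        "mcip": bool(mcg & 0x4),
    }


if __name__ == "__main__":
    # Values copied from the quoted log lines for CPU 23, BANK 5.
    for name, value in decode_mci_status(0xBE00000000800400).items():
        print(f"{name}: {value}")
    print(decode_mcg_status(0x4))
```

If the layout above is right, the logged value has UC and PCC set with MCA error code 0x0400, and MCGSTATUS 4 is the MCIP flag, which matches the "MCG status:MCIP" line in the log.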