Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 16 Nov 2015 10:43:34 +0100
From:      Julien Cigar <jcigar@ulb.ac.be>
To:        Gerhard Schmidt <schmidt@ze.tum.de>
Cc:        freebsd-questions@freebsd.org
Subject:   Re: Random Lockup with FreeBSD 10.2 on SuperMicro Boards
Message-ID:  <20151116094334.GS2604@mordor.lan>
In-Reply-To: <56498205.3060806@ze.tum.de>
References:  <56498205.3060806@ze.tum.de>

next in thread | previous in thread | raw e-mail | index | archive | help

--O2izrSG9ltmUPm45
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Mon, Nov 16, 2015 at 08:13:09AM +0100, Gerhard Schmidt wrote:
> Hi,

Hello,

>=20
> I'm running quiet a few FreeBSD servers on SuperMicro Boards. I'm in the
> process of upgrading from 10.1 to 10.2. On the machines running 10.2 I'm
> experiencing so random lockups.
>=20
> The server running fine bit sometimes (about 2-3 month apart the /var
> filesystem just locks. Other filesystems on the same drive (mirror-raid)
> still working, only when accessing anything on /var blocks the process.
>=20
> The same machines running with 10.1 don't have this Problem.
>=20
> All Filesystems are UFS journaled soft-updates.
>=20

try to disable SU+J (tunefs -j disable), I had random lockups with HP
Proliant servers too and problem. Problem went away when I turned off
SU+J.

> There is no message on the console or in the log files (as expected as
> the log files are on /var)
>=20
> Here is the output from mount.
>=20
> /dev/raid/r0p3 on / (ufs, local)
> devfs on /dev (devfs, local, multilabel)
> /dev/raid/r0p4 on /var (ufs, local, journaled soft-updates)
> /dev/raid/r0p5 on /usr (ufs, local, journaled soft-updates)
> /dev/raid/r0p6 on /data (ufs, local, journaled soft-updates)
> fdescfs on /dev/fd (fdescfs)
> procfs on /proc (procfs, local)
> /dev/md0 on /tmp (ufs, local)
>=20
> I've updated so far three server to 10.2. Two of them by using
> freebsd-update from 10.1 and one was fresh installed. All of them failed
> once or twice since update to 10.2 and never before (running 10.1).
>=20
> I've attached the dmesg.boot from the last server to fail (fresh installe=
d)
>=20
> Regards
>    Estartu
>=20
> --=20
> -------------------------------------------------
> Gerhard Schmidt       | E-Mail: schmidt@ze.tum.de
> TU-M=C3=BCnchen	      | Jabber: estartu@ze.tum.de
> WWW & Online Services |
> Tel: 089/289-25270    |
> Fax: 089/289-25257    | PGP-Publickey auf Anfrage
>=20

> Copyright (c) 1992-2015 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> 	The Regents of the University of California. All rights reserved.
> FreeBSD is a registered trademark of The FreeBSD Foundation.
> FreeBSD 10.2-RELEASE-p7 #0: Mon Nov  2 14:19:39 UTC 2015
>     root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64
> FreeBSD clang version 3.4.1 (tags/RELEASE_34/dot1-final 208032) 20140512
> CPU: Intel(R) Xeon(R) CPU E3-1240 v3 @ 3.40GHz (3392.22-MHz K8-class CPU)
>   Origin=3D"GenuineIntel"  Id=3D0x306c3  Family=3D0x6  Model=3D0x3c  Step=
ping=3D3
>   Features=3D0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,=
PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
>   Features2=3D0x7ffafbff<SSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2=
,SSSE3,<b11>,FMA,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,MOVBE,POPCNT,TSCD=
LT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
>   AMD Features=3D0x2c100800<SYSCALL,NX,Page1GB,RDTSCP,LM>
>   AMD Features2=3D0x21<LAHF,ABM>
>   Structured Extended Features=3D0x2fbb<FSGSBASE,TSCADJ,BMI1,HLE,AVX2,SME=
P,BMI2,ERMS,INVPCID,RTM,NFPUSG>
>   XSAVE Features=3D0x1<XSAVEOPT>
>   VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
>   TSC: P-state invariant, performance statistics
> real memory  =3D 34376515584 (32784 MB)
> avail memory =3D 33266585600 (31725 MB)
> Event timer "LAPIC" quality 600
> ACPI APIC Table: <SUPERM SMCI--MB>
> FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs
> FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 SMT threads
>  cpu0 (BSP): APIC ID:  0
>  cpu1 (AP): APIC ID:  1
>  cpu2 (AP): APIC ID:  2
>  cpu3 (AP): APIC ID:  3
>  cpu4 (AP): APIC ID:  4
>  cpu5 (AP): APIC ID:  5
>  cpu6 (AP): APIC ID:  6
>  cpu7 (AP): APIC ID:  7
> ioapic0 <Version 2.0> irqs 0-23 on motherboard
> random: <Software, Yarrow> initialized
> module_register_init: MOD_LOAD (vesa, 0xffffffff80db8e60, 0) error 19
> kbd1 at kbdmux0
> acpi0: <SUPERM SMCI--MB> on motherboard
> acpi0: Power Button (fixed)
> cpu0: <ACPI CPU> on acpi0
> cpu1: <ACPI CPU> on acpi0
> cpu2: <ACPI CPU> on acpi0
> cpu3: <ACPI CPU> on acpi0
> cpu4: <ACPI CPU> on acpi0
> cpu5: <ACPI CPU> on acpi0
> cpu6: <ACPI CPU> on acpi0
> cpu7: <ACPI CPU> on acpi0
> hpet0: <High Precision Event Timer> iomem 0xfed00000-0xfed003ff on acpi0
> Timecounter "HPET" frequency 14318180 Hz quality 950
> Event timer "HPET" frequency 14318180 Hz quality 550
> atrtc0: <AT realtime clock> port 0x70-0x77 irq 8 on acpi0
> atrtc0: Warning: Couldn't map I/O.
> Event timer "RTC" frequency 32768 Hz quality 0
> attimer0: <AT timer> port 0x40-0x43,0x50-0x53 irq 0 on acpi0
> Timecounter "i8254" frequency 1193182 Hz quality 0
> Event timer "i8254" frequency 1193182 Hz quality 100
> Timecounter "ACPI-fast" frequency 3579545 Hz quality 900
> acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1808-0x180b on acpi0
> pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
> pci0: <ACPI PCI bus> on pcib0
> pcib1: <ACPI PCI-PCI bridge> irq 16 at device 1.0 on pci0
> pci1: <ACPI PCI bus> on pcib1
> igb0: <Intel(R) PRO/1000 Network Connection version - 2.4.0> port 0xe020-=
0xe03f mem 0xf7180000-0xf71fffff,0xf7284000-0xf7287fff irq 16 at device 0.0=
 on pci1
> igb0: Using MSIX interrupts with 9 vectors
> igb0: Ethernet address: 00:25:90:c3:a5:3c
> igb0: Bound queue 0 to cpu 0
> igb0: Bound queue 1 to cpu 1
> igb0: Bound queue 2 to cpu 2
> igb0: Bound queue 3 to cpu 3
> igb0: Bound queue 4 to cpu 4
> igb0: Bound queue 5 to cpu 5
> igb0: Bound queue 6 to cpu 6
> igb0: Bound queue 7 to cpu 7
> igb1: <Intel(R) PRO/1000 Network Connection version - 2.4.0> port 0xe000-=
0xe01f mem 0xf7100000-0xf717ffff,0xf7280000-0xf7283fff irq 17 at device 0.1=
 on pci1
> igb1: Using MSIX interrupts with 9 vectors
> igb1: Ethernet address: 00:25:90:c3:a5:3d
> igb1: Bound queue 0 to cpu 0
> igb1: Bound queue 1 to cpu 1
> igb1: Bound queue 2 to cpu 2
> igb1: Bound queue 3 to cpu 3
> igb1: Bound queue 4 to cpu 4
> igb1: Bound queue 5 to cpu 5
> igb1: Bound queue 6 to cpu 6
> igb1: Bound queue 7 to cpu 7
> pci0: <simple comms> at device 22.0 (no driver attached)
> pci0: <simple comms> at device 22.1 (no driver attached)
> ehci0: <Intel Lynx Point USB 2.0 controller USB-B> mem 0xf7304000-0xf7304=
3ff irq 16 at device 26.0 on pci0
> usbus0: EHCI version 1.0
> usbus0 on ehci0
> pcib2: <ACPI PCI-PCI bridge> irq 16 at device 28.0 on pci0
> pci2: <ACPI PCI bus> on pcib2
> pcib3: <ACPI PCI-PCI bridge> at device 0.0 on pci2
> pci3: <ACPI PCI bus> on pcib3
> vgapci0: <VGA-compatible display> port 0xd000-0xd07f mem 0xf6000000-0xf6f=
fffff,0xf7000000-0xf701ffff irq 16 at device 0.0 on pci3
> vgapci0: Boot video device
> ehci1: <Intel Lynx Point USB 2.0 controller USB-A> mem 0xf7303000-0xf7303=
3ff irq 23 at device 29.0 on pci0
> usbus1: EHCI version 1.0
> usbus1 on ehci1
> isab0: <PCI-ISA bridge> at device 31.0 on pci0
> isa0: <ISA bus> on isab0
> ahci0: <Intel Patsburg (RAID) AHCI SATA controller> port 0xf050-0xf057,0x=
f040-0xf043,0xf030-0xf037,0xf020-0xf023,0xf000-0xf01f mem 0xf7302000-0xf730=
27ff irq 19 at device 31.2 on pci0
> ahci0: AHCI v1.30 with 6 6Gbps ports, Port Multiplier not supported
> ahcich0: <AHCI channel> at channel 0 on ahci0
> ahcich1: <AHCI channel> at channel 1 on ahci0
> ahcich2: <AHCI channel> at channel 2 on ahci0
> ahcich3: <AHCI channel> at channel 3 on ahci0
> ahcich4: <AHCI channel> at channel 4 on ahci0
> ahcich5: <AHCI channel> at channel 5 on ahci0
> ahciem0: <AHCI enclosure management bridge> on ahci0
> acpi_button0: <Power Button> on acpi0
> acpi_tz0: <Thermal Zone> on acpi0
> acpi_tz1: <Thermal Zone> on acpi0
> uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
> uart2: <16550 or compatible> port 0x3e8-0x3ef irq 7 on acpi0
> orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xd0800-0xd17ff,0xd1800-=
0xd27ff on isa0
> sc0: <System console> at flags 0x100 on isa0
> sc0: CGA <16 virtual consoles, flags=3D0x300>
> vga0: <Generic ISA VGA> at port 0x3d0-0x3db iomem 0xb8000-0xbffff on isa0
> ppc0: cannot reserve I/O port range
> est0: <Enhanced SpeedStep Frequency Control> on cpu0
> est1: <Enhanced SpeedStep Frequency Control> on cpu1
> est2: <Enhanced SpeedStep Frequency Control> on cpu2
> est3: <Enhanced SpeedStep Frequency Control> on cpu3
> est4: <Enhanced SpeedStep Frequency Control> on cpu4
> est5: <Enhanced SpeedStep Frequency Control> on cpu5
> est6: <Enhanced SpeedStep Frequency Control> on cpu6
> est7: <Enhanced SpeedStep Frequency Control> on cpu7
> random: unblocking device.
> usbus0: 480Mbps High Speed USB v2.0
> Timecounters tick every 1.000 msec
> usbus1: 480Mbps High Speed USB v2.0
> ugen0.1: <Intel> at usbus0
> uhub0: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus0
> ugen1.1: <Intel> at usbus1
> uhub1: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus1
> ses0 at ahciem0 bus 0 scbus6 target 0 lun 0
> ses0: <AHCI SGPIO Enclosure 1.00 0001> SEMB S-E-S 2.00 device
> ses0: SEMB SES Device
> ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
> ada0: <ST1000NM0033-9ZM173 SN03> ACS-2 ATA SATA 3.x device
> ada0: Serial Number xxxxxxxx
> ada0: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
> ada0: Command Queueing enabled
> ada0: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C)
> ada0: Previously was known as ad4
> ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
> ada1: <ST1000NM0033-9ZM173 SN03> ACS-2 ATA SATA 3.x device
> ada1: Serial Number xxxxxxxx
> ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
> ada1: Command Queueing enabled
> ada1: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C)
> ada1: Previously was known as ad6
> GEOM_RAID: Intel-fbcfc6e1: Array Intel-fbcfc6e1 created.
> GEOM_RAID: Intel-fbcfc6e1: Disk ada0 state changed from NONE to ACTIVE.
> GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:0-ada0 state changed from NONE to =
STALE.
> GEOM_RAID: Intel-fbcfc6e1: Disk ada1 state changed from NONE to ACTIVE.
> GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 state changed from NONE to =
STALE.
> GEOM_RAID: Intel-fbcfc6e1: Array started.
> GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:0-ada0 state changed from STALE to=
 ACTIVE.
> GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 state changed from STALE to=
 RESYNC.
> GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 rebuild start at 0.
> GEOM_RAID: Intel-fbcfc6e1: Volume gm0 state changed from STARTING to SUBO=
PTIMAL.
> GEOM_RAID: Intel-fbcfc6e1: Provider raid/r0 for volume gm0 created.
> SMP: AP CPU #1 Launched!
> SMP: AP CPU #2 Launched!
> SMP: AP CPU #3 Launched!
> SMP: AP CPU #7 Launched!
> SMP: AP CPU #5 Launched!
> SMP: AP CPU #4 Launched!
> SMP: AP CPU #6 Launched!
> Timecounter "TSC-low" frequency 1696109722 Hz quality 1000
> uhub1: 2 ports with 2 removable, self powered
> uhub0: 2 ports with 2 removable, self powered
> Root mount waiting for: usbus1 usbus0
> ugen1.2: <vendor 0x8087> at usbus1
> uhub2: <vendor 0x8087 product 0x8000, class 9/0, rev 2.00/0.05, addr 2> o=
n usbus1
> ugen0.2: <vendor 0x8087> at usbus0
> uhub3: <vendor 0x8087 product 0x8008, class 9/0, rev 2.00/0.05, addr 2> o=
n usbus0
> Root mount waiting for: usbus1 usbus0
> uhub2: 6 ports with 6 removable, self powered
> uhub3: 6 ports with 6 removable, self powered
> ugen0.3: <vendor 0x0000> at usbus0
> uhub4: <vendor 0x0000 product 0x0001, class 9/0, rev 2.00/0.00, addr 3> o=
n usbus0
> uhub4: 4 ports with 3 removable, self powered
> Root mount waiting for: usbus0
> ugen0.4: <vendor 0x0557> at usbus0
> ukbd0: <vendor 0x0557 product 0x2419, class 0/0, rev 1.10/1.00, addr 4> o=
n usbus0
> kbd0 at ukbd0
> Trying to mount root from ufs:/dev/raid/r0p3 [rw]...
> WARNING: / was not properly dismounted
> ums0: <vendor 0x0557 product 0x2419, class 0/0, rev 1.10/1.00, addr 4> on=
 usbus0
> ums0: 3 buttons and [Z] coordinates ID=3D0

> _______________________________________________
> freebsd-questions@freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-questions
> To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.o=
rg"


--=20
Julien Cigar
Belgian Biodiversity Platform (http://www.biodiversity.be)
PGP fingerprint: EEF9 F697 4B68 D275 7B11  6A25 B2BB 3710 A204 23C0
No trees were killed in the creation of this message.
However, many electrons were terribly inconvenienced.

--O2izrSG9ltmUPm45
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2

iQIcBAABCgAGBQJWSaVCAAoJEAi2KiTKQR5pOswP/3AUlrdX2hHr5rP9bjhyCSM4
Y2lQbCHs07cXQAstgKsCOCX5ztAW+MWlKszyZ+T4gBh16qv2idmyim4X+T+avbg3
agASjlCnEyuNwv+hyyXBQG2LCVytxatZu+noLbCi2PLheJDHYCoV9ysZXRdmplrw
WefnEpq2YsvgEixJYFMuqkZoL5WrdGOFJ54YNwZEVz1EQK1tn147oSKPMWQBLsTx
cSoQZwEuoBLR1srCCwTCE/OEzYf84ABAV1jd3wPkBwVRMYm1qayGEdPp1sziIQKu
vz5i6nfMlL1VGjYptl27TGVYrKBFj5RrpB6XRXcEVrEDuTjl/JnWP/i0k+LX/WZq
nSxYfsP0t4+i6FBI4BDpurrA35+AP7+TC1TR8JIAJ7iwqaM10rtIvlAlcGmg6KXH
3lDXDi95AyFekTuy2VB5SP9JFerV43YBFSDnWcz8FTfGkE/49ah5MkqFHNF63DcF
znGrkefna4VGcl/F/Hoy/fWLNlVOZTXSZ+WhLcOT6OJafgWl7hBZGFVsSKer+Xou
d0kPyHbXpqFLfdy5Pn6nbLgCwfhv/W8gDd5rNW50tlTfxFysICcb1TZPlXA4YEcs
KBWBlRvEm7PTWzTla2WZp7mCBYo7o7ruzcRmT8qQVb+mRmg/XfR4J5YAaq9c94SD
9EHbd9mFaLlTYK6oMRsQ
=3o4Z
-----END PGP SIGNATURE-----

--O2izrSG9ltmUPm45--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20151116094334.GS2604>