From owner-freebsd-questions@freebsd.org Mon Nov 16 09:44:52 2015 Return-Path: Delivered-To: freebsd-questions@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 73372A2E434 for ; Mon, 16 Nov 2015 09:44:52 +0000 (UTC) (envelope-from jcigar@ulb.ac.be) Received: from relaygateway01.edpnet.net (relaygateway01.edpnet.net [212.71.1.210]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (Client CN "*.edpnet.net", Issuer "Go Daddy Secure Certificate Authority - G2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 6DB081225 for ; Mon, 16 Nov 2015 09:44:50 +0000 (UTC) (envelope-from jcigar@ulb.ac.be) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AvUGANKjSVZNbWc1/2dsb2JhbABegztTb8BHFwqCPoJnSgKBMk0BAQEBAQGBC4Q0AQEBAwEBAQEgKwgYCwULCxgJEw4CAg8FEwEJCCQIAgUEARwEiAUMCalZi22ECwEBAQEBBQEBAQEBARQJgi+EJYR+hCAbAQEBBAmCcDoTgTEFh0EDhwmHO0CFHYUggmIIgVuSFohUY4IRHYFXPTQHg3wHF4EqAQEB X-IPAS-Result: AvUGANKjSVZNbWc1/2dsb2JhbABegztTb8BHFwqCPoJnSgKBMk0BAQEBAQGBC4Q0AQEBAwEBAQEgKwgYCwULCxgJEw4CAg8FEwEJCCQIAgUEARwEiAUMCalZi22ECwEBAQEBBQEBAQEBARQJgi+EJYR+hCAbAQEBBAmCcDoTgTEFh0EDhwmHO0CFHYUggmIIgVuSFohUY4IRHYFXPTQHg3wHF4EqAQEB X-IronPort-AV: E=Sophos;i="5.20,301,1444687200"; d="asc'?scan'208";a="361413774" Received: from 77.109.103.53.adsl.dyn.edpnet.net (HELO mordor.lan) ([77.109.103.53]) by relaygateway01.edpnet.net with ESMTP/TLS/DHE-RSA-AES256-SHA; 16 Nov 2015 10:43:35 +0100 Date: Mon, 16 Nov 2015 10:43:34 +0100 From: Julien Cigar To: Gerhard Schmidt Cc: freebsd-questions@freebsd.org Subject: Re: Random Lockup with FreeBSD 10.2 on SuperMicro Boards Message-ID: <20151116094334.GS2604@mordor.lan> References: <56498205.3060806@ze.tum.de> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="O2izrSG9ltmUPm45" Content-Disposition: inline In-Reply-To: <56498205.3060806@ze.tum.de> User-Agent: Mutt/1.5.24 (2015-08-30) X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 Nov 2015 09:44:52 -0000 --O2izrSG9ltmUPm45 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Mon, Nov 16, 2015 at 08:13:09AM +0100, Gerhard Schmidt wrote: > Hi, Hello, >=20 > I'm running quiet a few FreeBSD servers on SuperMicro Boards. I'm in the > process of upgrading from 10.1 to 10.2. On the machines running 10.2 I'm > experiencing so random lockups. >=20 > The server running fine bit sometimes (about 2-3 month apart the /var > filesystem just locks. Other filesystems on the same drive (mirror-raid) > still working, only when accessing anything on /var blocks the process. >=20 > The same machines running with 10.1 don't have this Problem. >=20 > All Filesystems are UFS journaled soft-updates. >=20 try to disable SU+J (tunefs -j disable), I had random lockups with HP Proliant servers too and problem. Problem went away when I turned off SU+J. > There is no message on the console or in the log files (as expected as > the log files are on /var) >=20 > Here is the output from mount. >=20 > /dev/raid/r0p3 on / (ufs, local) > devfs on /dev (devfs, local, multilabel) > /dev/raid/r0p4 on /var (ufs, local, journaled soft-updates) > /dev/raid/r0p5 on /usr (ufs, local, journaled soft-updates) > /dev/raid/r0p6 on /data (ufs, local, journaled soft-updates) > fdescfs on /dev/fd (fdescfs) > procfs on /proc (procfs, local) > /dev/md0 on /tmp (ufs, local) >=20 > I've updated so far three server to 10.2. Two of them by using > freebsd-update from 10.1 and one was fresh installed. All of them failed > once or twice since update to 10.2 and never before (running 10.1). >=20 > I've attached the dmesg.boot from the last server to fail (fresh installe= d) >=20 > Regards > Estartu >=20 > --=20 > ------------------------------------------------- > Gerhard Schmidt | E-Mail: schmidt@ze.tum.de > TU-M=C3=BCnchen | Jabber: estartu@ze.tum.de > WWW & Online Services | > Tel: 089/289-25270 | > Fax: 089/289-25257 | PGP-Publickey auf Anfrage >=20 > Copyright (c) 1992-2015 The FreeBSD Project. > Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 > The Regents of the University of California. All rights reserved. > FreeBSD is a registered trademark of The FreeBSD Foundation. > FreeBSD 10.2-RELEASE-p7 #0: Mon Nov 2 14:19:39 UTC 2015 > root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64 > FreeBSD clang version 3.4.1 (tags/RELEASE_34/dot1-final 208032) 20140512 > CPU: Intel(R) Xeon(R) CPU E3-1240 v3 @ 3.40GHz (3392.22-MHz K8-class CPU) > Origin=3D"GenuineIntel" Id=3D0x306c3 Family=3D0x6 Model=3D0x3c Step= ping=3D3 > Features=3D0xbfebfbff > Features2=3D0x7ffafbff,FMA,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,MOVBE,POPCNT,TSCD= LT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND> > AMD Features=3D0x2c100800 > AMD Features2=3D0x21 > Structured Extended Features=3D0x2fbb > XSAVE Features=3D0x1 > VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID > TSC: P-state invariant, performance statistics > real memory =3D 34376515584 (32784 MB) > avail memory =3D 33266585600 (31725 MB) > Event timer "LAPIC" quality 600 > ACPI APIC Table: > FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs > FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 SMT threads > cpu0 (BSP): APIC ID: 0 > cpu1 (AP): APIC ID: 1 > cpu2 (AP): APIC ID: 2 > cpu3 (AP): APIC ID: 3 > cpu4 (AP): APIC ID: 4 > cpu5 (AP): APIC ID: 5 > cpu6 (AP): APIC ID: 6 > cpu7 (AP): APIC ID: 7 > ioapic0 irqs 0-23 on motherboard > random: initialized > module_register_init: MOD_LOAD (vesa, 0xffffffff80db8e60, 0) error 19 > kbd1 at kbdmux0 > acpi0: on motherboard > acpi0: Power Button (fixed) > cpu0: on acpi0 > cpu1: on acpi0 > cpu2: on acpi0 > cpu3: on acpi0 > cpu4: on acpi0 > cpu5: on acpi0 > cpu6: on acpi0 > cpu7: on acpi0 > hpet0: iomem 0xfed00000-0xfed003ff on acpi0 > Timecounter "HPET" frequency 14318180 Hz quality 950 > Event timer "HPET" frequency 14318180 Hz quality 550 > atrtc0: port 0x70-0x77 irq 8 on acpi0 > atrtc0: Warning: Couldn't map I/O. > Event timer "RTC" frequency 32768 Hz quality 0 > attimer0: port 0x40-0x43,0x50-0x53 irq 0 on acpi0 > Timecounter "i8254" frequency 1193182 Hz quality 0 > Event timer "i8254" frequency 1193182 Hz quality 100 > Timecounter "ACPI-fast" frequency 3579545 Hz quality 900 > acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1808-0x180b on acpi0 > pcib0: port 0xcf8-0xcff on acpi0 > pci0: on pcib0 > pcib1: irq 16 at device 1.0 on pci0 > pci1: on pcib1 > igb0: port 0xe020-= 0xe03f mem 0xf7180000-0xf71fffff,0xf7284000-0xf7287fff irq 16 at device 0.0= on pci1 > igb0: Using MSIX interrupts with 9 vectors > igb0: Ethernet address: 00:25:90:c3:a5:3c > igb0: Bound queue 0 to cpu 0 > igb0: Bound queue 1 to cpu 1 > igb0: Bound queue 2 to cpu 2 > igb0: Bound queue 3 to cpu 3 > igb0: Bound queue 4 to cpu 4 > igb0: Bound queue 5 to cpu 5 > igb0: Bound queue 6 to cpu 6 > igb0: Bound queue 7 to cpu 7 > igb1: port 0xe000-= 0xe01f mem 0xf7100000-0xf717ffff,0xf7280000-0xf7283fff irq 17 at device 0.1= on pci1 > igb1: Using MSIX interrupts with 9 vectors > igb1: Ethernet address: 00:25:90:c3:a5:3d > igb1: Bound queue 0 to cpu 0 > igb1: Bound queue 1 to cpu 1 > igb1: Bound queue 2 to cpu 2 > igb1: Bound queue 3 to cpu 3 > igb1: Bound queue 4 to cpu 4 > igb1: Bound queue 5 to cpu 5 > igb1: Bound queue 6 to cpu 6 > igb1: Bound queue 7 to cpu 7 > pci0: at device 22.0 (no driver attached) > pci0: at device 22.1 (no driver attached) > ehci0: mem 0xf7304000-0xf7304= 3ff irq 16 at device 26.0 on pci0 > usbus0: EHCI version 1.0 > usbus0 on ehci0 > pcib2: irq 16 at device 28.0 on pci0 > pci2: on pcib2 > pcib3: at device 0.0 on pci2 > pci3: on pcib3 > vgapci0: port 0xd000-0xd07f mem 0xf6000000-0xf6f= fffff,0xf7000000-0xf701ffff irq 16 at device 0.0 on pci3 > vgapci0: Boot video device > ehci1: mem 0xf7303000-0xf7303= 3ff irq 23 at device 29.0 on pci0 > usbus1: EHCI version 1.0 > usbus1 on ehci1 > isab0: at device 31.0 on pci0 > isa0: on isab0 > ahci0: port 0xf050-0xf057,0x= f040-0xf043,0xf030-0xf037,0xf020-0xf023,0xf000-0xf01f mem 0xf7302000-0xf730= 27ff irq 19 at device 31.2 on pci0 > ahci0: AHCI v1.30 with 6 6Gbps ports, Port Multiplier not supported > ahcich0: at channel 0 on ahci0 > ahcich1: at channel 1 on ahci0 > ahcich2: at channel 2 on ahci0 > ahcich3: at channel 3 on ahci0 > ahcich4: at channel 4 on ahci0 > ahcich5: at channel 5 on ahci0 > ahciem0: on ahci0 > acpi_button0: on acpi0 > acpi_tz0: on acpi0 > acpi_tz1: on acpi0 > uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 > uart2: <16550 or compatible> port 0x3e8-0x3ef irq 7 on acpi0 > orm0: at iomem 0xc0000-0xc7fff,0xd0800-0xd17ff,0xd1800-= 0xd27ff on isa0 > sc0: at flags 0x100 on isa0 > sc0: CGA <16 virtual consoles, flags=3D0x300> > vga0: at port 0x3d0-0x3db iomem 0xb8000-0xbffff on isa0 > ppc0: cannot reserve I/O port range > est0: on cpu0 > est1: on cpu1 > est2: on cpu2 > est3: on cpu3 > est4: on cpu4 > est5: on cpu5 > est6: on cpu6 > est7: on cpu7 > random: unblocking device. > usbus0: 480Mbps High Speed USB v2.0 > Timecounters tick every 1.000 msec > usbus1: 480Mbps High Speed USB v2.0 > ugen0.1: at usbus0 > uhub0: on usbus0 > ugen1.1: at usbus1 > uhub1: on usbus1 > ses0 at ahciem0 bus 0 scbus6 target 0 lun 0 > ses0: SEMB S-E-S 2.00 device > ses0: SEMB SES Device > ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 > ada0: ACS-2 ATA SATA 3.x device > ada0: Serial Number xxxxxxxx > ada0: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes) > ada0: Command Queueing enabled > ada0: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) > ada0: Previously was known as ad4 > ada1 at ahcich1 bus 0 scbus1 target 0 lun 0 > ada1: ACS-2 ATA SATA 3.x device > ada1: Serial Number xxxxxxxx > ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes) > ada1: Command Queueing enabled > ada1: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) > ada1: Previously was known as ad6 > GEOM_RAID: Intel-fbcfc6e1: Array Intel-fbcfc6e1 created. > GEOM_RAID: Intel-fbcfc6e1: Disk ada0 state changed from NONE to ACTIVE. > GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:0-ada0 state changed from NONE to = STALE. > GEOM_RAID: Intel-fbcfc6e1: Disk ada1 state changed from NONE to ACTIVE. > GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 state changed from NONE to = STALE. > GEOM_RAID: Intel-fbcfc6e1: Array started. > GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:0-ada0 state changed from STALE to= ACTIVE. > GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 state changed from STALE to= RESYNC. > GEOM_RAID: Intel-fbcfc6e1: Subdisk gm0:1-ada1 rebuild start at 0. > GEOM_RAID: Intel-fbcfc6e1: Volume gm0 state changed from STARTING to SUBO= PTIMAL. > GEOM_RAID: Intel-fbcfc6e1: Provider raid/r0 for volume gm0 created. > SMP: AP CPU #1 Launched! > SMP: AP CPU #2 Launched! > SMP: AP CPU #3 Launched! > SMP: AP CPU #7 Launched! > SMP: AP CPU #5 Launched! > SMP: AP CPU #4 Launched! > SMP: AP CPU #6 Launched! > Timecounter "TSC-low" frequency 1696109722 Hz quality 1000 > uhub1: 2 ports with 2 removable, self powered > uhub0: 2 ports with 2 removable, self powered > Root mount waiting for: usbus1 usbus0 > ugen1.2: at usbus1 > uhub2: o= n usbus1 > ugen0.2: at usbus0 > uhub3: o= n usbus0 > Root mount waiting for: usbus1 usbus0 > uhub2: 6 ports with 6 removable, self powered > uhub3: 6 ports with 6 removable, self powered > ugen0.3: at usbus0 > uhub4: o= n usbus0 > uhub4: 4 ports with 3 removable, self powered > Root mount waiting for: usbus0 > ugen0.4: at usbus0 > ukbd0: o= n usbus0 > kbd0 at ukbd0 > Trying to mount root from ufs:/dev/raid/r0p3 [rw]... > WARNING: / was not properly dismounted > ums0: on= usbus0 > ums0: 3 buttons and [Z] coordinates ID=3D0 > _______________________________________________ > freebsd-questions@freebsd.org mailing list > https://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.o= rg" --=20 Julien Cigar Belgian Biodiversity Platform (http://www.biodiversity.be) PGP fingerprint: EEF9 F697 4B68 D275 7B11 6A25 B2BB 3710 A204 23C0 No trees were killed in the creation of this message. However, many electrons were terribly inconvenienced. --O2izrSG9ltmUPm45 Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAABCgAGBQJWSaVCAAoJEAi2KiTKQR5pOswP/3AUlrdX2hHr5rP9bjhyCSM4 Y2lQbCHs07cXQAstgKsCOCX5ztAW+MWlKszyZ+T4gBh16qv2idmyim4X+T+avbg3 agASjlCnEyuNwv+hyyXBQG2LCVytxatZu+noLbCi2PLheJDHYCoV9ysZXRdmplrw WefnEpq2YsvgEixJYFMuqkZoL5WrdGOFJ54YNwZEVz1EQK1tn147oSKPMWQBLsTx cSoQZwEuoBLR1srCCwTCE/OEzYf84ABAV1jd3wPkBwVRMYm1qayGEdPp1sziIQKu vz5i6nfMlL1VGjYptl27TGVYrKBFj5RrpB6XRXcEVrEDuTjl/JnWP/i0k+LX/WZq nSxYfsP0t4+i6FBI4BDpurrA35+AP7+TC1TR8JIAJ7iwqaM10rtIvlAlcGmg6KXH 3lDXDi95AyFekTuy2VB5SP9JFerV43YBFSDnWcz8FTfGkE/49ah5MkqFHNF63DcF znGrkefna4VGcl/F/Hoy/fWLNlVOZTXSZ+WhLcOT6OJafgWl7hBZGFVsSKer+Xou d0kPyHbXpqFLfdy5Pn6nbLgCwfhv/W8gDd5rNW50tlTfxFysICcb1TZPlXA4YEcs KBWBlRvEm7PTWzTla2WZp7mCBYo7o7ruzcRmT8qQVb+mRmg/XfR4J5YAaq9c94SD 9EHbd9mFaLlTYK6oMRsQ =3o4Z -----END PGP SIGNATURE----- --O2izrSG9ltmUPm45--