From owner-freebsd-hackers@freebsd.org Mon Aug 22 15:54:45 2016 Return-Path: Delivered-To: freebsd-hackers@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id D2D0BBC2D1D for ; Mon, 22 Aug 2016 15:54:45 +0000 (UTC) (envelope-from matthew@FreeBSD.org) Received: from smtp.infracaninophile.co.uk (smtp.infracaninophile.co.uk [81.2.117.100]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "smtp.infracaninophile.co.uk", Issuer "infracaninophile.co.uk" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 7506E1C67 for ; Mon, 22 Aug 2016 15:54:45 +0000 (UTC) (envelope-from matthew@FreeBSD.org) Received: from ox-dell39.ox.adestra.com (unknown [85.199.232.226]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: m.seaman@infracaninophile.co.uk) by smtp.infracaninophile.co.uk (Postfix) with ESMTPSA id 91888EC21 for ; Mon, 22 Aug 2016 15:54:34 +0000 (UTC) Authentication-Results: smtp.infracaninophile.co.uk; dmarc=none header.from=FreeBSD.org Authentication-Results: smtp.infracaninophile.co.uk/91888EC21; dkim=none; dkim-atps=neutral To: freebsd-hackers@freebsd.org From: Matthew Seaman Subject: How do I debug this? Message-ID: Date: Mon, 22 Aug 2016 16:54:22 +0100 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:45.0) Gecko/20100101 Thunderbird/45.2.0 MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="Me6iBDatIGq3QgwvnhRJpKmV5vCBhNWCN" X-Spam-Status: No, score=0.1 required=5.0 tests=BAYES_00,RDNS_NONE, SPF_SOFTFAIL,URI_NOVOWEL autolearn=no autolearn_force=no version=3.4.1 X-Spam-Checker-Version: SpamAssassin 3.4.1 (2015-04-28) on smtp.infracaninophile.co.uk X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Aug 2016 15:54:45 -0000 This is an OpenPGP/MIME signed message (RFC 4880 and 3156) --Me6iBDatIGq3QgwvnhRJpKmV5vCBhNWCN Content-Type: multipart/mixed; boundary="NhVCCx3Kx3lcXvukKeO7a5CEGcqkHe8si" From: Matthew Seaman To: freebsd-hackers@freebsd.org Message-ID: Subject: How do I debug this? --NhVCCx3Kx3lcXvukKeO7a5CEGcqkHe8si Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable We've got an intermittent crash bug with postgresql92-server-9.2.17_2 which seems to be occurring in process startup: # gdb -c /usr/local/pgsql/data/postgres.core /usr/local/bin/postgres GNU gdb 6.1.1 [FreeBSD] Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you = are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for detail= s. This GDB was configured as "amd64-marcel-freebsd"... Core was generated by `postgres'. Program terminated with signal 11, Segmentation fault. Reading symbols from /lib/libthr.so.3...done. Loaded symbols for /lib/libthr.so.3 Reading symbols from /usr/local/lib/libxml2.so.2...done. Loaded symbols for /usr/local/lib/libxml2.so.2 Reading symbols from /usr/local/lib/libicuuc.so.55...done. Loaded symbols for /usr/local/lib/libicuuc.so.55 Reading symbols from /usr/local/lib/libicui18n.so.55...done. Loaded symbols for /usr/local/lib/libicui18n.so.55 Reading symbols from /usr/local/lib/libssl.so.8...done. Loaded symbols for /usr/local/lib/libssl.so.8 Reading symbols from /usr/local/lib/libcrypto.so.8...done. Loaded symbols for /usr/local/lib/libcrypto.so.8 Reading symbols from /lib/libm.so.5...done. Loaded symbols for /lib/libm.so.5 Reading symbols from /lib/libc.so.7...done. Loaded symbols for /lib/libc.so.7 Reading symbols from /lib/libz.so.6...done. Loaded symbols for /lib/libz.so.6 Reading symbols from /usr/lib/liblzma.so.5...done. Loaded symbols for /usr/lib/liblzma.so.5 Reading symbols from /usr/local/lib/libicudata.so.55...done. Loaded symbols for /usr/local/lib/libicudata.so.55 Reading symbols from /usr/lib/libc++.so.1...done. Loaded symbols for /usr/lib/libc++.so.1 Reading symbols from /lib/libcxxrt.so.1...done. Loaded symbols for /lib/libcxxrt.so.1 Reading symbols from /lib/libgcc_s.so.1...done. Loaded symbols for /lib/libgcc_s.so.1 Reading symbols from /usr/local/lib/postgresql/auto_explain.so...done. Loaded symbols for /usr/local/lib/postgresql/auto_explain.so Reading symbols from /usr/local/lib/postgresql/plpgsql.so...done. Loaded symbols for /usr/local/lib/postgresql/plpgsql.so Reading symbols from /libexec/ld-elf.so.1...done. Loaded symbols for /libexec/ld-elf.so.1 #0 0x0000000800d1cd0a in _r_debug_postinit () from /libexec/ld-elf.so.1 [New Thread 805806400 (LWP 100362/)] (gdb) bt full #0 0x0000000800d1cd0a in _r_debug_postinit () from /libexec/ld-elf.so.1 No symbol table info available. #1 0x0000000800d1a46d in .text () from /libexec/ld-elf.so.1 No symbol table info available. #2 0x0000000804ffe174 in ?? () from /lib/libgcc_s.so.1 No symbol table info available. #3 0x00007fffffffe410 in ?? () No symbol table info available. #4 0x00000008050078f1 in ?? () from /lib/libgcc_s.so.1 No symbol table info available. #5 0x00007fffffffe410 in ?? () No symbol table info available. #6 0x0000000800d1d9a8 in dlclose () from /libexec/ld-elf.so.1 No symbol table info available. Previous frame inner to this frame (corrupt stack?) Application is: postgresql92-server-9.2.17_2 On: % freebsd-version -ku 10.3-RELEASE-p4 10.3-RELEASE-p5 This is apparently a postgres child process crashing on startup somewhere before where main() gets called. Doesn't happen often -- only when the server is reasonably busy and then it will take somewhere between a day and a week between incidents. We can't correlate it with any particular SQL being run. This was a production server so we've had to move the customers assigned to it back to some slightly older but very hardware but running the same version of postgresql (same package compiled on our own pkgrepo) and same 10.3-RELEASE-p4/p5 -- where they are running quite happily. Matthew --NhVCCx3Kx3lcXvukKeO7a5CEGcqkHe8si-- --Me6iBDatIGq3QgwvnhRJpKmV5vCBhNWCN Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- iQIcBAEBCgAGBQJXuyA0AAoJEABRPxDgqeTnNGoP/0b+v0jyRHvUGKn6tPtlL41i k+xOuyqt/qqyUrGUJguHwOYh60HvxeWw6bWWd7xTJHZiPA2YTGja876orBiZvqPF OQ29YBD5TCvh8MPYBotAPiiHjmqLXXoLm/JyLmRsq2ExlECmAb2Pf6ybbHgZ55yT WwH5TKCiFSFRGCIyQRsKlj2Eq2rIV66PWh8f5PmgZVCPdjdFJwPWqH2D3X5Zcd0G HZJW+2T3G1ijrayNKC6uQ5Fil9YaE0InjjBnxtyj+D1f0r+0gwuyR25kMF2wSJMd mTu/VisReEkof8pCaRYtlNsge5cgMX3nlaDrT/Ww12iY+L8MiTC1NiOZYovO4jcu oMehegoafD+F7xnbtghAjpzDqPR2LZPYVZRYItiUXBkGfoXNU+fC6OO+0f/Ge1zi OxZgWnIYZjrxMKW5GbJK4NmLxHQckkphm9zhP1Ej4HoffwIi9szjTljxVGu4ot2A s0It4DqvsQu3VNBwKtdXdXMDaNTP5h7B01GaMGX7j+Q7TjkJFn+Di/FNylPiPH9h 1PPfNFdDfPpxJXMojey/g7Mlc5uGzTeB1cEC/FGXRll7ktJcLn8H8FA1mPaLjMRg 0375+QXrzMCeEGzUjr86ag/+ncS5c2p1lWs1yPoQGtyizAh6Ypo7h92qvLbSd5mW 9CvXwqiAoVzfRm7upa4B =eNzk -----END PGP SIGNATURE----- --Me6iBDatIGq3QgwvnhRJpKmV5vCBhNWCN--