Date: Mon, 4 Feb 2008 18:19:20 +0200 From: Kostik Belousov <kostikbel@gmail.com> To: Primeroz lists <primeroz.lists@googlemail.com> Cc: freebsd-stable@freebsd.org Subject: Re: Crashing repeatedly: 6.2-RELEASE-p5 and MySQL 5.0.41 Message-ID: <20080204161920.GN57756@deviant.kiev.zoral.com.ua> In-Reply-To: <55b8c6fe0802040450r7ca3e739s931be2d38f499fc2@mail.gmail.com> References: <55b8c6fe0802040450r7ca3e739s931be2d38f499fc2@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
--ZOzza+p1BO565g7i Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Mon, Feb 04, 2008 at 12:50:32PM +0000, Primeroz lists wrote: > Hi all, >=20 > we are experiencing repeated crash on a Dell PowerEdge 2950 (rev 1 or 2). >=20 > FBSD release is 6.2-RELEASE-p5 , AMD64. 2xXeon QuadCore and 8G of Ram. >=20 > MySQL Version is 5.0.41 with following configuration settings: >=20 > set-variable =3D key_buffer=3D768M > set-variable =3D table_cache=3D800 > set-variable =3D sort_buffer=3D24M > set-variable =3D myisam_sort_buffer_size=3D256M > set-variable =3D record_buffer=3D16M > set-variable =3D max_allowed_packet=3D10M > set-variable =3D thread_stack=3D128K > set-variable =3D join_buffer=3D512M > set-variable =3D max_heap_table_size=3D256M > set-variable =3D max_connections=3D300 > set-variable =3D tmp_table_size=3D384M > set-variable =3D query_cache_size=3D402653184 > set-variable =3D query_cache_limit=3D134217728 > set-variable =3D read_rnd_buffer_size=3D10M > set-variable =3D ft_min_word_len=3D1 > pid-file =3D /var/db/mysqld.pid > tmpdir =3D /var/tmp > ft_stopword_file =3D '' > set-variable =3D thread_cache_size=3D80 > set-variable =3D myisam_stats_method=3Dnulls_equal >=20 >=20 > The system is crashing repeatedly and from the graphs we collect on the b= ox > i can see that every time before the crash we have an intensive usage of > *InnoDB* related resources, i collected several vmcore dump and attached = is > what i've been able to extract. >=20 > I'm not sure how much the *InnoDB* usage is related to the crash, btw i'm > quite sure that it is triggering the crash. >=20 > I've looked on the various CVS and releases to see if anything related to= my > crash has been updated in the last period but i did not find anything > specifically related so i'm wondering if anybody else had experience of t= his > kind of problems before proceding to a blind upgrade or any other blind > solution. >=20 >=20 > > $ sudo kgdb /usr/obj/usr/src/sys/PE2950/kernel.debug vmcore.2 > Password: > [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.s= o: > Undefined symbol "ps_pglobal_lookup"] > GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you = are > welcome to change it and/or distribute copies of it under certain > conditions. > Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for detail= s. > This GDB was configured as "amd64-marcel-freebsd". >=20 > Unread portion of the kernel message buffer: >=20 >=20 > Fatal trap 12: page fault while in kernel mode > cpuid =3D 5; apic id =3D 05 > fault virtual address =3D 0x100166887ad > fault code =3D supervisor read, page not present > instruction pointer =3D 0x8:0xffffffff803fa290 > stack pointer =3D 0x10:0xffffffffba0a9980 > frame pointer =3D 0x10:0x2 > code segment =3D base 0x0, limit 0xfffff, type 0x1b > =3D DPL 0, pres 1, long 1, def32 0, gran 1 > processor eflags =3D interrupt enabled, resume, IOPL =3D 0 > current process =3D 1038 (mysqld) > trap number =3D 12 > panic: page fault > cpuid =3D 5 > Uptime: 1d4h37m54s > Dumping 8191 MB (3 chunks) > chunk 0: 1MB (156 pages) ... ok > chunk 1: 3327MB (851624 pages) 3311 3295 3279 3263 3247 3231 3215 3199 > 3183 3167 3151 3135 3119 3103 3087 3071 3055 3039 3023 3007 2991 2975 2959 > 2943 2927 2911 2895 2879 2863 2847 2831 2815 2799 2783 2767 2751 2735 2719 > 2703 2687 2671 2655 2639 2623 2607 2591 2575 2559 2543 2527 2511 2495 2479 > 2463 2447 2431 2415 2399 2383 2367 2351 2335 2319 2303 2287 2271 2255 2239 > 2223 2207 2191 2175 2159 2143 2127 2111 2095 2079 2063 2047 2031 2015 1999 > 1983 1967 1951 1935 1919 1903 1887 1871 1855 1839 1823 1807 1791 1775 1759 > 1743 1727 1711 1695 1679 1663 1647 1631 1615 1599 1583 1567 1551 1535 1519 > 1503 1487 1471 1455 1439 1423 1407 1391 1375 1359 1343 1327 1311 1295 1279 > 1263 1247 1231 1215 1199 1183 1167 1151 1135 1119 1103 1087 1071 1055 1039 > 1023 1007 991 975 959 943 927 911 895 879 863 847 831 815 799 783 767 751 > 735 719 703 687 671 655 639 623 607 591 575 559 543 527 511 495 479 463 4= 47 > 431 415 399 383 367 351 335 319 303 287 271 255 239 223 207 191 175 159 1= 43 > 127 111 95 79 63 47 31 15 ... ok > chunk 2: 4864MB (1245184 pages) 4849 4833 4817 4801 4785 4769 4753 4737 > 4721 4705 4689 4673 4657 4641 4625 4609 4593 4577 4561 4545 4529 4513 4497 > 4481 4465 4449 4433 4417 4401 4385 4369 4353 4337 4321 4305 4289 4273 4257 > 4241 4225 4209 4193 4177 4161 4145 4129 4113 4097 4081 4065 4049 4033 4017 > 4001 3985 3969 3953 3937 3921 3905 3889 3873 3857 3841 3825 3809 3793 3777 > 3761 3745 3729 3713 3697 3681 3665 3649 3633 3617 3601 3585 3569 3553 3537 > 3521 3505 3489 3473 3457 3441 3425 3409 3393 3377 3361 3345 3329 3313 3297 > 3281 3265 3249 3233 3217 3201 3185 3169 3153 3137 3121 3105 3089 3073 3057 > 3041 3025 3009 2993 2977 2961 2945 2929 2913 2897 2881 2865 2849 2833 2817 > 2801 2785 2769 2753 2737 2721 2705 2689 2673 2657 2641 2625 2609 2593 2577 > 2561 2545 2529 2513 2497 2481 2465 2449 2433 2417 2401 2385 2369 2353 2337 > 2321 2305 2289 2273 2257 2241 2225 2209 2193 2177 2161 2145 2129 2113 2097 > 2081 2065 2049 2033 2017 2001 1985 1969 1953 1937 1921 1905 1889 1873 1857 > 1841 1825 1809 1793 1777 1761 1745 1729 1713 1697 1681 1665 1649 1633 1617 > 1601 1585 1569 1553 1537 1521 1505 1489 1473 1457 1441 1425 1409 1393 1377 > 1361 1345 1329 1313 1297 1281 1265 1249 1233 1217 1201 1185 1169 1153 1137 > 1121 1105 1089 1073 1057 1041 1025 1009 993 977 961 945 929 913 897 881 8= 65 > 849 833 817 801 785 769 753 737 721 705 689 673 657 641 625 609 593 577 5= 61 > 545 529 513 497 481 465 449 433 417 401 385 369 353 337 321 305 289 273 2= 57 > 241 225 209 193 177 161 145 129 113 97 81 65 49 33 17 1 >=20 > #0 doadump () at pcpu.h:172 > 172 pcpu.h: No such file or directory. > in pcpu.h > (kgdb) bt > #0 doadump () at pcpu.h:172 > #1 0x0000000000000004 in ?? () > #2 0xffffffff802a7d67 in boot (howto=3D260) at > /usr/src/sys/kern/kern_shutdown.c:409 > #3 0xffffffff802a8401 in panic (fmt=3D0xffffff0036f9c720 > "???\206C???\001???????????????%\\\001?????????\200i??????") > at /usr/src/sys/kern/kern_shutdown.c:565 > #4 0xffffffff80425f7f in trap_fatal (frame=3D0xffffff0036f9c720, > eva=3D18446742981617878704) > at /usr/src/sys/amd64/amd64/trap.c:660 > #5 0xffffffff8042629f in trap_pfault (frame=3D0xffffffffba0a98d0, usermo= de=3D0) > at /usr/src/sys/amd64/amd64/trap.c:573 > #6 0xffffffff80426553 in trap (frame=3D > {tf_rdi =3D 1099887576672, tf_rsi =3D 0, tf_rdx =3D 0, tf_rcx =3D -= 1173710312, > tf_r8 =3D -1093564261992, tf_r9 =3D -1173710304, tf_rax =3D -1173710293, = tf_rbx =3D > -1093564262000, tf_rbp =3D 2, tf_r10 =3D -1098589288672, tf_r11 =3D 43583= 6558, > tf_r12 =3D 1099887576672, tf_r13 =3D -1093564262000, tf_r14 =3D 435835520= , tf_r15 > =3D -1173710312, tf_trapno =3D 12, tf_addr =3D 1099887577005, tf_flags =3D > -2144607018, tf_err =3D 0, tf_rip =3D -2143313264, tf_cs =3D 8, tf_rflags= =3D 66118, > tf_rsp =3D -1173710440, tf_ss =3D 16}) > at /usr/src/sys/amd64/amd64/trap.c:352 > #7 0xffffffff8041173b in calltrap () at > /usr/src/sys/amd64/amd64/exception.S:168 > #8 0xffffffff803fa290 in _vm_map_unlock () at /usr/src/sys/vm/vm_map.c:4= 43 > #9 0xffffffff803fdecc in vm_map_lookup (var_map=3D0xffffffffba0a9a10, > vaddr=3D435835520, fault_typea=3D2 '\002', > out_entry=3D0xffffffffba0a9a18, object=3D0xffffff01627d9998, > pindex=3D0xffffffffba0a9a20, out_prot=3D0xffffffffba0a9a2b "", > wired=3D0xffffffffba0a9a2c) at /usr/src/sys/vm/vm_map.c:3074 The vm_map.c does not contain a call to the vm_map_unlock() at the line 3074. Please, rebuild you kernel from scratch. In case this does not help, I ask you to show the backtrace from the ddb. Also, to speed up the conversation, could you, please, for each <function>+<offset> from the ddb output, do the list *(<function>+<offset>) in the kgdb ? > #10 0xffffffff802b845e in umtx_key_get (td=3D0xffffff0036f9c720, > umtx=3D0x19fa5280, key=3D0xffffff01627d9990) > at /usr/src/sys/kern/kern_umtx.c:312 > #11 0xffffffff802b8578 in _do_lock (td=3D0xffffff0036f9c720, umtx=3D0x19f= a5280, > id=3D100582, timo=3D0) > at /usr/src/sys/kern/kern_umtx.c:362 > #12 0xffffffff802b99e9 in _umtx_op (td=3D0xffffff0036f9c720, uap=3D0x188e= 6) at > /usr/src/sys/kern/kern_umtx.c:545 > #13 0xffffffff80426dd1 in syscall (frame=3D > {tf_rdi =3D 435835520, tf_rsi =3D 0, tf_rdx =3D 100582, tf_rcx =3D = 0, tf_r8 =3D > 0, tf_r9 =3D 140737452053060, tf_rax =3D 454, tf_rbx =3D 100582, tf_rbp = =3D > 435835520, tf_r10 =3D 1, tf_r11 =3D 582, tf_r12 =3D 9982128, tf_r13 =3D 1= 024, tf_r14 > =3D 0, tf_r15 =3D 0, tf_trapno =3D 12, tf_addr =3D 1387466752, tf_flags = =3D 0, tf_err > =3D 2, tf_rip =3D 34378206780, tf_cs =3D 43, tf_rflags =3D 582, tf_rsp =3D > 140737452052808, tf_ss =3D 35}) at /usr/src/sys/amd64/amd64/trap.c:792 > #14 0xffffffff804118d8 in Xfast_syscall () at > /usr/src/sys/amd64/amd64/exception.S:270 > #15 0x000000080119ce3c in ?? () > Previous frame inner to this frame (corrupt stack?) > (kgdb) >=20 > (kgdb) up 6 > #6 0xffffffff80426553 in trap (frame=3D > {tf_rdi =3D 1099887576672, tf_rsi =3D 0, tf_rdx =3D 0, tf_rcx =3D -= 1173710312, > tf_r8 =3D -1093564261992, tf_r9 =3D -1173710304, tf_rax =3D -1173710293, = tf_rbx =3D > -1093564262000, tf_rbp =3D 2, tf_r10 =3D -1098589288672, tf_r11 =3D 43583= 6558, > tf_r12 =3D 1099887576672, tf_r13 =3D -1093564262000, tf_r14 =3D 435835520= , tf_r15 > =3D -1173710312, tf_trapno =3D 12, tf_addr =3D 1099887577005, tf_flags =3D > -2144607018, tf_err =3D 0, tf_rip =3D -2143313264, tf_cs =3D 8, tf_rflags= =3D 66118, > tf_rsp =3D -1173710440, tf_ss =3D 16}) > at /usr/src/sys/amd64/amd64/trap.c:352 > 352 (void) trap_pfault(&frame, FALSE); >=20 > (kgdb) up > #7 0xffffffff8041173b in calltrap () at > /usr/src/sys/amd64/amd64/exception.S:168 > 168 call trap > Current language: auto; currently asm > (kgdb) up > #8 0xffffffff803fa290 in _vm_map_unlock () at /usr/src/sys/vm/vm_map.c:4= 43 > 443 _sx_xunlock(&map->lock, file, line); > Current language: auto; currently c > (kgdb) up > #9 0xffffffff803fdecc in vm_map_lookup (var_map=3D0xffffffffba0a9a10, > vaddr=3D435835520, fault_typea=3D2 '\002', > out_entry=3D0xffffffffba0a9a18, object=3D0xffffff01627d9998, > pindex=3D0xffffffffba0a9a20, out_prot=3D0xffffffffba0a9a2b "", > wired=3D0xffffffffba0a9a2c) at /usr/src/sys/vm/vm_map.c:3074 > 3074 vm_map_lock_read(map); > (kgdb) list > 3069 RetryLookup:; > 3070 /* > 3071 * Lookup the faulting address. > 3072 */ > 3073 > 3074 vm_map_lock_read(map); > 3075 #define RETURN(why) \ > 3076 { \ > 3077 vm_map_unlock_read(map); \ > 3078 return (why); \ > (kgdb) p map > $1 =3D 0x10016688660 > (kgdb) down > #8 0xffffffff803fa290 in _vm_map_unlock () at /usr/src/sys/vm/vm_map.c:4= 43 > 443 _sx_xunlock(&map->lock, file, line); > (kgdb) list > 438 { > 439 > 440 if (map->system_map) > 441 _mtx_unlock_flags(&map->system_mtx, 0, file, line= ); > 442 else > 443 _sx_xunlock(&map->lock, file, line); > 444 } > 445 > 446 void > 447 _vm_map_lock_read(vm_map_t map, const char *file, int line) >=20 >=20 > Thanks, > Francesco Ciocchetti > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" --ZOzza+p1BO565g7i Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.8 (FreeBSD) iEYEARECAAYFAkenOwcACgkQC3+MBN1Mb4iQAwCeKSO9l/L7HOdVkJp8SLzL7xf0 KNMAnAns0NUrA1UVKxuLND9mhPkBEfPN =rYMJ -----END PGP SIGNATURE----- --ZOzza+p1BO565g7i--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080204161920.GN57756>