FreeBSD Mail Archives

Date:      Mon, 02 Jun 2014 08:49:07 -0700
From:      Mike Carlson <mike@bayphoto.com>
To:        Steven Hartland <killing@multiplay.co.uk>, freebsd-fs@freebsd.org
Subject:   Re: ZFS Kernel Panic on 10.0-RELEASE
Message-ID:  <538C9CF3.6070208@bayphoto.com>
In-Reply-To: <782C34792E95484DBA631A96FE3BEF20@multiplay.co.uk>
References:  <5388D64D.4030400@bayphoto.com> <EC2EA442-56FC-46B4-A1E2-97523029B7B3@mail.turbofuzz.com> <5388E5B4.3030002@bayphoto.com> <538BBEB7.4070008@bayphoto.com> <782C34792E95484DBA631A96FE3BEF20@multiplay.co.uk>

index | next in thread | previous in thread | raw e-mail


[-- Attachment #1 --]
On 6/2/2014 2:12 AM, Steven Hartland wrote:
> ----- Original Message ----- From: "Mike Carlson" <mike@bayphoto.com>
>
>> On 5/30/2014 1:10 PM, Mike Carlson wrote:
>> > On 5/30/2014 12:48 PM, Jordan Hubbard wrote:
>> >> On May 30, 2014, at 12:04 PM, Mike Carlson <mike@bayphoto.com> wrote:
>> >>
>> >>> Over the weekend, we had upgraded one of our servers from 
>> 9.1-RELEASE to 10.0-RELEASE, and then the zpool was upgraded (from 
>> >>> 28 to 5000)
>> >>>
>> >>> Tuesday afternoon, the server suddenly rebooted (kernel panic), 
>> and as soon as it tried to remount all of its ZFS volumes, >>> it 
>> panic'd again.
>> >> What�s the panic text?  That�s pretty crucial in figuring out 
>> whether this is recoverable (e.g. if it�s spacemap corruption >> 
>> related, probably not).
>> >>
>> >> - Jordan
>> >>
>> >>
>> >>
>> > I had linked the pictures I took of the console, but here is my 
>> manual reproduction:
>> >
>> >    Fatal trap 12: page fault while in kernel mode
>> >    cpuid = 7; apic id = 07
>> >    fault virtual address    = 0x4a0
>> >    fault code               = supervisor read data, page not present
>> >    instruction pointer      = 0x20:0xffffffff81a7f39f
>> >    stack pointer            = 0x28:0xfffffe1834789570
>> >    frame pointer            = 0x28:0xfffffe18347895b0
>> >    code segment             = base 0x0, limit 0xfffff, type 0x1b
>> >                              = DPL 0, pres 1, long 1, def32 0, gran 1
>> >    processor eflags         = interrupt enabled, resume, IOPL = 0
>> >    current process          = 1849 (txg_thread_enter)
>> >    trap number              = 12
>> >    panic: page fault
>> >    cpuid = 7
>> >    KDB: stack backtrace:
>> >    #0 0xffffffff808e7dd0 at kdb_backtrace+0x60
>> >    #1 0xffffffff808af8b5 at panic+0x155
>> >    #2 0xffffffff80c8e629 at trap_fatal+0x3a2
>> >    #3 0xffffffff80c8e969 at trap_pfault+0x2c9
>> >    #4 0xffffffff80c8e0f6 at trap+0x5e6
>> >    #5 0xffffffff80c75392 at calltrap+0x8
>> >    #6 0xffffffff81a53b5a at dsl_dataset_block_kill+0x3a
>> >    #7 0xffffffff81a50967 at dnode_sync+0x237
>> >    #8 0xffffffff81a48fcb at dmu_objset_sync_dnodes+0x2b
>> >    #9 0xffffffff81a48e4d at dmo_objset_sync+0x1ed
>> >    #10 0xffffffff81a5d29a at dsl_pool_sync+0xca
>> >    #11 0xffffffff81a78a4e at spa_sync+0x52e
>> >    #12 0xffffffff81a81925 at txg_sync_thread+0x375
>> >    #13 0xffffffff8088198a at fork_exit+0x9a
>> >    #14 0xffffffff80c758ce at fork_trampoline+0xe
>> >    uptime: 46s
>> >    Automatic reboot in 15 seconds - press a key on the console to 
>> abort
>> >
>> This just happened again to another server. We upgraded two servers 
>> on the same morning, and now both of them exhibit this corrupted zfs 
>> volume and panic behavior.
>>
>> Out of all the volumes, one of them is causing the panic, and the 
>> panic message is nearly identical.
>>
>> I have 4 snapshots over the last 24 hours, so hopefully a snapshot 
>> from noon today can be sent to a new volume ( zfs send | zfs recv )
>>
>> I guess I can now rule out it being a hardware issue, this is clearly 
>> problem related to the upgrade (freebsd-update  was used). I first 
>> thought the first system had a bad upgrade, perhaps a mix and match 
>> of 9.2 binaries running on a 10 kernel, but I used the 
>> 'freebsd-update IDS' command to verify the integrity of the install, 
>> and it looked good, the only differences were config files in /etc/ 
>> that we manage.
>>
>
> Do you have a kernel crash dump from this?
>
> Also can you confirm if your amd64 or just i386?
>
>    Regards
>    Steve
>
>

I dont have a crash dump, and this is on amd64

I might be able to get a crash dump on one of them, the other is back up 
and running. It is a little challenging because the system I can do this 
on has zfs on root, but I have a spare drive I can use as the swap volume.

Mike C


[-- Attachment #2 --]
0�	*�H��
��0�10	+0�	*�H��
��"0�0��e��3�v=�0
	*�H��
0K10
URootCA10U

Bay Photo Lab10U
California10	UUS0
121023173218Z
271023173218Z0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUS0�"0
	*�H��
�0�
�����;T��ą��u���y�K~Zz��2��M�'4���
��EiTj��)yL5��"k���v�7Ur�n \�!SgP;��z���h�>�ˊj�����\V�o�v��X�<�L�g��fxxkL1�C�dY�\S�;z(��5TO[)5�bu���\��mBj��*
�n��Uh&�`Qί;��Z�x���Ȝ��F���
��ԧ@�}�)��8}4#��d��z�w�&��P�^=Ad�T}*�4 q��S�^E)�̈c��A$XDS]Z��/�_��5M�`~ӻ���Ro'�����F��tw�\�e.��G.3�@�m�"\��,���c�{'��Gid���v���(T��QY����9z�b�p9���c���#Y³�����V��s�|I���f�	�ew�7�I����%G�rau07��h��f.;�{J�ʾx��/R1.L���T}�Կ!k�b�
��o8H��	]��=}�S�Έ��퉃�����0��0U-��r�fb����b,v����0U�0�0U#0��F�Nq�i�$��x'{(W+0��U��0��0����}�{http://bayca.bayhoto.local/ejbca/publicweb/webdist/certdist?cmd=crl&issuer=CN=RootCA,O=Bay%20Photo%20Lab,ST=California,C=US0U��0
	*�H��
�JM���U��Z�>�7��gm[z }��/��.~^J;ө�Ɖ-��Q_\Όh޲#���Ծ�XL�7p�h�(�@�`�+8��W��&���i��b�!���Q��j+�����ȡ���1i�T�(�#��^( ��giZ9�c�<��R�꼓��e��.�ݘ�VѬ���峿ۅ8Dh�$�~m���m�啠����~'��\����ET�&	� a}rM���K�L0u%�HY�L
�l=�`�Υ3k[؝Y}$ s��s8?~IXK��d����a<�==���m��L�[�R��Ҡs��H���B��R/�*�`JfUz�A)�'0��Jk��A�r�vp#�e-{�]�U��
Z�����`�#��2Ϡ�v�~.#l7���"�D�=&t^�-Q���_�9�Mi�
��u��Ԓn{Z���n!�U%r�3J;Q׼����Di�@��P�Ng]&;���yw�|9B*.L=��Ij�-�)��/]����'���g�^��U�����0�#0��b������=0
	*�H��
0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUS0
121023180003Z
141023180003Z0`10
	�&���,d1306910UMike Carlson10	UIT10U

Bay Photo Lab10	UUS0�"0
	*�H��
�0�
��<ȼ^�|=e�9�K��t����Ф-j���I�_��	%[���߲'O%3��;=*�n��(���(���R��T	�����͐C�/��\W�U@����HC�j�rI���U-��i�E�˼��|p��aҨm-4����݈��a��mƵbK��$"UEk���Ezd
���w.
w�G� ����u:�B�'�9!?��t�dk��%%��̞N����.8��C1�ަί[�
B��������j���F�����0���)�{C�����9�&p��X�nĉ���Z�u��X�"�)3z��sS\��\�D���:��L����׏���1�Q��}1Gzz(d�#���V3�fRo���ш��^��C���LfQ�@���S/�������S��t�X
d�5�Y3�M���0ՙ�Q�5�ō�;�p��I��d�V]�&d�#26��z����sg����M�}r�#i�М��|3)m�d�:}��뚁�����0��0R+F0D0B+0�6http://bayca.bayhoto.local/ejbca/publicweb/status/ocsp0U��(�}a��w���J״(�#0U�00U#0�-��r�fb����b,v����0U�00U%0
+0U0�mike@bayphoto.com0
	*�H��
�9�|&�V,*�Hd�	�ƏA~���6�f������Fg'�^y�
I�'�y��y,�v���}��Z��	@ᔘ7�\F���5QA3���7��*�����L�T�4VS�t�T�e �.�Dӧ=n}=�L\�E��	�{�
�z��7��k�Y�s#R�O}��E�`��O�nL�'1M0�`Dۋ�
��r�v���V�uX?���s=�	��+O�0:y��E�?�B�A̡5��|���Ʀ�pp�*�<F���L�A36����k����덝�j�9b�=�&)KJSm�ʐXo�@�g��;�V4�@��uj����kX9��	@���W��h�#n�l\��Y)A
r����FGj�����qtv�h�u�.��ճ�K)�L����}������@�41A�K��z����&ȴ��ztÈ6͢��j=0�*���+@��;x��nc�-
�W�ƣ����L�G9X�)�=��
y����%����]��Q��@��BW�
��,��Άu����t0��0���M�iC���0
	*�H��
0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUS0
121023175745Z
141023175745Z0`10
	�&���,d1306910UMike Carlson10	UIT10U

Bay Photo Lab10	UUS0�"0
	*�H��
�0�
��@�v�����ɌA�ļ��VA��W5:�e�h��$����n>b��%�k��7�Pwޡ���=�^���CBv�2U�L���L�qn6�+>��A:��P��#�=��ѕ����[�8Z<|&w��b��(�x椉
�i��Ғ��x�����9�H?���~���Ɔ�-�y]���jN崡���1g�eA�ˇw���H�4��w��?h!�/�^P��ؕ�f�a5-+��%<����*�/��+��`�Z�BCƀ����n��o|6'zo���e!��)����@H藱$�����z�ѩ+���
���SXDz�(~Bݬe?V
\��j;.���P���,��銉[J����ݦkj�Y�����*n�ȡ�5]�
h�lkz�3.�Wme�/�t�ɧ#�	�8��L%
Ũ�%z�p��	_p)���ڜ(C�=�MY����e�3S���>Tf���ρ�=@	����]ڑ�a��v������&��0�ۗ���;�.��j'�Yk_���0��0R+F0D0B+0�6http://bayca.bayhoto.local/ejbca/publicweb/status/ocsp0UFO�+Rd�b�`�?6��0U�00U#0�-��r�fb����b,v����0��U��0��0�����������http://bayca.bayhoto.local/ejbca/publicweb/webdist/certdist?cmd=crl&issuer=CN=Bay%20Photo%20People%20CA,O=Bay%20Photo%20Lab,ST=California,C=US0U��0U%0++0U0�mike@bayphoto.com0
	*�H��
�/��ungf��sy�������@KLw����.cM&����6�?-Y��4�����++I��JY�D	�C�£�S��_�2$e��ڏ�PU�(�(̖S~a�����M�����0�r�i�~j�k2Ւ�[�n�9���rn&Bz�(��M�ݼ�����Iܪ�*ȱI��mu5�lr[Q`3��͈�;��l{Z0��7�h$>a�t)���q��o�\]pJW7�*[c����%��

y1��FB)����������p2͞[�~=?��!��Wd�9�XY5�.bOKU�DV���[Z���9��8��E
^�������X�9��n<��H��i����@�C?�H�+jl�ۗc�݌&yq���Q���<I�i��/�
ɣ��*�B��!��f<.Re���-�������=Y���*?��-4;|��vj1��@��+I��ܑ=��J��7�%'jM�mr�S��M@�G���V����|:C'�ݮ��_���L���k�t��61�F0�B0d0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUSM�iC���0	+���0	*�H��
	1	*�H��
0	*�H��
	1
140602154907Z0#	*�H��
	14���FD�+�(L5r"��}�0l	*�H��
	1_0]0	`�He*0	`�He0
*�H��
0*�H��
�0
*�H��
@0+0
*�H��
(0s	+�71f0d0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUSb������=0u*�H��
	1f�d0X10UBay Photo People CA10U

Bay Photo Lab10U
California10	UUSb������=0
	*�H��
�W�E�q@D�K,p��vv���3ʾ~����瀿���|��zR}���U�"$e��\*-�7J'�����2��s��3Y�B�S]ř�ӛ�tݘi��&Ԟ����Բ���L�#����Ȩ���~~Ȳz�Ć�m���#Xm
l|aM�����
���%T���Ê]�`��\=����PyC�&�|t�u����X�����(��[t���ȳ��C	��1Q����H/�)�j�3b��L���uL4��L/8�P��
N_�)�ս
D0��9�����a#b��t8�h^"�
�6���t&մ*n('��ͩ�$=�.�A�Uӗ��מ	d��s����x�*Ʒv���ͮ0��P�����@��n03���O1P���9zD�q=�
j�����C�)Q/!7�㵥z9����}1w���	(9Xmp�H�&�A+
w��;F!���\Ws��=��8��k`�J��������S�d[�K�98����ec�^}+ɬ9)�(K:{��(�.

home | help

Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?538C9CF3.6070208>

Header And Logo

Peripheral Links

Site Navigation

Header And Logo

Peripheral Links

Search

Site Navigation