From owner-freebsd-bugs@freebsd.org Wed Jun 8 06:32:40 2016
From: bugzilla-noreply@freebsd.org
To: freebsd-bugs@FreeBSD.org
Subject: [Bug 209099] ata2: already running! panic: bad link elm 0xfffff80003b7e6a0 prev->next != elm
Date: Wed, 08 Jun 2016 06:32:40 +0000
X-Bugzilla-Product: Base System
X-Bugzilla-Component: kern
X-Bugzilla-Version: 11.0-CURRENT
X-Bugzilla-Severity: Affects Only Me
X-Bugzilla-Who: daryl@ci.com.au
X-Bugzilla-Status: New
X-Bugzilla-Assigned-To: freebsd-bugs@FreeBSD.org
X-Bugzilla-Changed-Fields: cc
X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=209099

daryl@ci.com.au changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |daryl@ci.com.au

--- Comment #5 from daryl@ci.com.au ---
I have an Asus P8B-X motherboard with 16G of memory and an Intel 530 Series
120G SSD as the boot drive. We are running FreeBSD 10.3-STABLE #3 r300822.

I believe I have this same issue. I was using salt minion to update this
machine and it hung (several times). Prior to the hang I would always see the
console message:

ata2: already running!

The machine would continue to run for a short period of time and then become
unresponsive. A few minutes later I might get a panic; otherwise it would just
be in a hung state.

I was able to pull apart the minion and found it was camcontrol causing the
issue. I created a script that looped over the command:

camcontrol identify ada0

and I am able to cause the system to either hang or panic regularly.
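The script itself is not included in this comment. A minimal sketch of such a loop, assuming plain /bin/sh, might look like the following. The function name run_loop and the iteration cap are hypothetical and used only for illustration; the only command taken from the report is "camcontrol identify ada0".

```sh
#!/bin/sh
# Hypothetical reproduction loop; the reporter's actual script was not posted.
# run_loop COUNT CMD [ARGS...]
#   Runs CMD COUNT times, discards its output, and prints the number of
#   runs that exited with a non-zero status.
run_loop() {
    count=$1
    shift
    i=0
    failures=0
    while [ "$i" -lt "$count" ]; do
        "$@" >/dev/null 2>&1 || failures=$((failures + 1))
        i=$((i + 1))
    done
    echo "$failures"
}

# Assumed invocation on the affected machine. Per the report, repeating this
# command can hang or panic that system, so it is left commented out here:
#   run_loop 1000 camcontrol identify ada0
```

Capping the number of iterations, rather than looping forever, is a choice made for this sketch so that the loop terminates on machines that do not show the problem.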
Using the kernel debugger I was able to glean this information:

spin lock 0xffffffff816f17e0 (sleepq chain) held by 0xfffff800g
panic: spin lock held too long
cpuid = 6
KDB: stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2b/frame 0xfffffe046562e800
kdb_backtrace() at kdb_backtrace+0x39/frame 0xfffffe046562e8b0
vpanic() at vpanic+0x126/frame 0xfffffe046562e8f0
panic() at panic+0x43/frame 0xfffffe046562e950
_mtx_lock_spin_cookie() at _mtx_lock_spin_cookie+0x287/frame 0xfffffe046562e9c0
wakeup() at wakeup+0xf/frame 0xfffffe046562e9e0
vnlru_proc() at vnlru_proc+0x157/frame 0xfffffe046562ea70
fork_exit() at fork_exit+0x9a/frame 0xfffffe046562eab0
fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe046562eab0
--- trap 0, rip = 0, rsp = 0, rbp = 0 ---
KDB: enter: panic
[ thread pid 20 tid 100078 ]
Stopped at      kdb_enter+0x3e: movq    $0,kdb_why

If salt is disabled and I don't run camcontrol commands, this machine is
stable. We have several other machines with different motherboards using the
same kernel that don't have this problem.

-- 
You are receiving this mail because:
You are the assignee for the bug.