From owner-freebsd-stable@FreeBSD.ORG Sat Nov 23 05:08:33 2013 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 9EF9BF10 for ; Sat, 23 Nov 2013 05:08:33 +0000 (UTC) Received: from mail-qa0-f44.google.com (mail-qa0-f44.google.com [209.85.216.44]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 62BD724BC for ; Sat, 23 Nov 2013 05:08:32 +0000 (UTC) Received: by mail-qa0-f44.google.com with SMTP id i13so1145045qae.17 for ; Fri, 22 Nov 2013 21:08:32 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:message-id:date:from:user-agent:mime-version:to :subject:content-type:content-transfer-encoding; bh=2qM6Vglf8b0oQwSmsoAoWI4nKGtOxe6DhNiMmvt510M=; b=YtVVXEf0hCxzvSjXjnUoxer/yUwnqwoO3UZAJpOAIyPp+UhLzNvOfEHQb7TPFTqlDH zUvINDEQfsxv0sfk1JQ4vIq9Mxwct2umKwZNjjbQ4wPhbk5zH+HlRqdd1yw7SsPk+c6W JMAVSEvedDAj1ZURzxznpFS5u7Ooc5lVCMkWmeLtso+8cS/N874Ob2abcTI4kL6orBfq ENPJf7CsZV8IsAzR0RRoK0PsINDS+ijqYSCYVJrtdXeVSf+CdbcNY8yrWwP5yImmimWx +9JuEugXFS0XI7nqudy1vmOB6mV7VgIzvA+hrvCzDaOIbcJ4umwKKtqHn9TFo9HfNFx5 ZhMg== X-Gm-Message-State: ALoCoQlcz0nmifQfhtuICdsgI7Q6qC0W5v9RXo6Q3H5ollBsCCoMgDTjpX/jy9moUyIGbJK1WcbK X-Received: by 10.224.112.134 with SMTP id w6mr27541330qap.21.1385183312143; Fri, 22 Nov 2013 21:08:32 -0800 (PST) Received: from [192.168.1.4] (pool-74-98-165-156.nrflva.fios.verizon.net. [74.98.165.156]) by mx.google.com with ESMTPSA id 3sm36415031qej.0.2013.11.22.21.08.31 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 22 Nov 2013 21:08:31 -0800 (PST) Message-ID: <5290384E.30603@ohlste.in> Date: Sat, 23 Nov 2013 00:08:30 -0500 From: Jim Ohlstein User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:24.0) Gecko/20100101 Thunderbird/24.1.0 MIME-Version: 1.0 To: stable@freebsd.org Subject: SSD becomes detached 9.2 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.16 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 23 Nov 2013 05:08:33 -0000 Hello, I am setting up a new (remote) server. I initially installed 9.2 RC4 amd64 because that's what the data center put in the drive for me. Shortly thereafter I downloaded 9.2-STABLE sources and compiled world and a generic kernel. While doing so the system became unreachable by SSH The SSH sessions appeared to connect but there was never any data returned. I could telnet to port 22 but I could not log in from a terminal. I could ping the server as well. I had the server rebooted. I did install an updated kernel and world (9.2-STABLE amd64 r258426) and it happened again just now. The OS is installed on a 120 GB SSD with root on ZFS. There is also another SSD for L2ARC and there are two 3TB SATA drives in a separate ZFS mirror pool. All drives passed cursory testing with smartmontools. CPU is an AMD-8120 (8 core Zambezi). Very little is running on the server as it is not yet in production (thankfully). Here is the relevant part of dmesg: ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000008 cs 00000000 ss 00000000 rs f8000000 tfd 40 serr 00000000 cmd 00047f17 (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Command timeout (ada0:ahcich0:0:0:0): Retrying command ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 ada0: s/n S1D5NSAD915803Y detached ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000001 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17 (ada0:ahcich0:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Command timeout (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked ahcich0: Timeout on slot 31 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 00000000 cmd 00047f17 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked ahcich0: Timeout on slot 3 port 0 ahcich0: is 00000008 cs 00000000 ss 00000000 rs 8000000f tfd 40 serr 00000000 cmd 00046317 (ada0:ahcich0:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00 (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Command timeout (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 ff a6 40 01 00 00 00 00 00 (ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request (ada0:ahcich0:0:0:0): Error 5, Periph was invalidated (ada0:ahcich0:0:0:0): Periph destroyed ahcich0: Timeout on slot 3 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 00000000 cmd 00046317 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked ahcich0: Timeout on slot 3 port 0 ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 00000000 cmd 00046317 (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00 (aprobe0:ahcich0:0:0:0): CAM status: Command timeout (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked After this event, gpart show lists only ada1, ada2, and ada3. The boot drive is ada0. The entire dmesg can be seen at http://pastebin.com/RqR8LiSb. -- Jim Ohlstein