From owner-freebsd-hackers@freebsd.org Fri May 4 04:28:47 2018 Return-Path: Delivered-To: freebsd-hackers@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id CF34CFC6CF6 for ; Fri, 4 May 2018 04:28:47 +0000 (UTC) (envelope-from leres@freebsd.org) Received: from xse.com (xse.com [IPv6:2607:f2f8:abb8::3]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client CN "xse.com", Issuer "Let's Encrypt Authority X3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 46939843F1 for ; Fri, 4 May 2018 04:28:47 +0000 (UTC) (envelope-from leres@freebsd.org) Received-SPF: pass (dot.xse.com: authenticated connection) receiver=dot.xse.com; client-ip=76.103.75.166; helo=[172.16.1.60]; envelope-from=leres@freebsd.org; x-software=spfmilter 2.001 http://www.acme.com/software/spfmilter/ with libspf2-1.2.10; Received: from [172.16.1.60] (ice.xse.com [76.103.75.166]) (authenticated bits=0) by dot.xse.com (8.15.2/8.15.2) with ESMTPSA id w444ShRH039806 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NO); Thu, 3 May 2018 21:28:45 -0700 (PDT) (envelope-from leres@freebsd.org) X-Authentication-Warning: dot.xse.com: Host ice.xse.com [76.103.75.166] claimed to be [172.16.1.60] Subject: Re: nvme0: async event occurred (log page id=0x2) To: Warner Losh References: <960be682-9991-f8c6-0253-7d6f782d4cbe@freebsd.org> Cc: "freebsd-hackers@freebsd.org" From: Craig Leres Message-ID: <8b1eadc2-8c9d-3f11-b877-b9a0a57512ec@freebsd.org> Date: Thu, 3 May 2018 21:28:42 -0700 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit X-Virus-Scanned: clamav-milter 0.100.0 at dot.xse.com X-Virus-Status: Clean X-GBUdb-Analysis: 0, 76.103.75.166, Ugly c=0.071429 p=0 Source Normal X-MessageSniffer-Rules: 0-0-0-3656-c X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.25 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 04 May 2018 04:28:48 -0000 On 5/3/2018 9:07 PM, Warner Losh wrote: > Async events are 'something went wrong' messages. Log page 2 is the > smart log page. > > what does 'nvmecontrol logpage -p 2 nvme0' tell you right after this > happens.  My guess is that it's overheating. Interesting. I try to run smartd anywhere it's supported and have appended the last few entries before things went sideways; 60° C/140° F is a bit toasty! This system is a couple of years old, might be time to blow the dust out with compressed air and see if the bios has more aggressive fan settings. Is the Raw_Read_Error_Rate changed a problem? (Thanks!) Craig May 3 13:59:22 tiny smartd[770]: Device: /dev/ada0, SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 59 to 60 May 3 13:59:22 tiny smartd[770]: Device: /dev/ada0, SMART Usage Attribute: 194 Temperature_Celsius changed from 41 to 40 May 3 14:59:23 tiny smartd[770]: Device: /dev/ada0, SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 60 to 58 May 3 14:59:23 tiny smartd[770]: Device: /dev/ada0, SMART Usage Attribute: 194 Temperature_Celsius changed from 40 to 42 May 3 17:29:23 tiny smartd[770]: Device: /dev/ada0, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 75 to 76