From: Kashyap Desai
Date: Tue, 18 Oct 2016 14:50:06 +0530
Subject: RE: MRSAS: SATA drives are getting deleted and then readded after controller reset
To: Sumit Saxena, freebsd-scsi@freebsd.org, scott4long@yahoo.com, ken@freebsd.org
Cc: Seema Kumashikar

I have updated the BZ. This issue is fixed by the patch below. Please review
and let me know whether this is a correct fix.

Root cause: the checksum is computed from two different forms of the serial
number, one with the extra spaces removed and one with the additional spaces
still present. Because of that, any rescan of an ATA disk is detected as a
different ATA drive, so the drive is removed and re-added later.

Index: scsi_xpt.c
===================================================================
--- scsi_xpt.c (revision 307137)
+++ scsi_xpt.c (working copy)
@@ -1600,8 +1600,8 @@
     sizeof(struct scsi_inquiry_data));
 
  if (have_serialnum)
-  MD5Update(&context, serial_buf->serial_num,
-     serial_buf->length);
+  MD5Update(&context, path->device->serial_num,
+     path->device->serial_num_len);
 
  MD5Final(digest, &context);
  if (bcmp(softc->digest, digest, 16) == 0)

Please review and let us know whether this needs a fix in the kernel, or
whether there is a workaround, such as a tunable, to disable this
checksum-related code in CAM.

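The root cause above can be illustrated outside the kernel. The following is
only a user-space sketch in Python, not the CAM code: the helper names
(device_digest, same_device), the 36-byte placeholder inquiry data, and the
20-byte space padding are assumptions made for illustration. Only the serial
number 9XE02AR2 is taken from the dmesg log quoted below. It shows that an MD5
digest over the inquiry data plus the padded serial number never matches a
digest over the same inquiry data plus the trimmed serial number, which is why
the rescan treats the same drive as a new one.

```python
import hashlib


def device_digest(inquiry_data: bytes, serial: bytes) -> bytes:
    """Return the MD5 digest of the inquiry data followed by the serial number."""
    ctx = hashlib.md5()
    ctx.update(inquiry_data)
    ctx.update(serial)
    return ctx.digest()


# Placeholder inquiry bytes (hypothetical); real inquiry data would come from the drive.
INQUIRY = b"\x00" * 36

# Assumed: the serial number as first read, space-padded to 20 bytes.
RAW_SERIAL = b"9XE02AR2".rjust(20)

# Assumed: the serial number as stored after the extra spaces are removed.
STORED_SERIAL = RAW_SERIAL.strip()


def same_device(stored: bytes, probed: bytes) -> bool:
    """Compare digests the way a rescan would decide whether the device changed."""
    return device_digest(INQUIRY, stored) == device_digest(INQUIRY, probed)


if __name__ == "__main__":
    # Mixed forms: digests differ, so the drive looks like a different device.
    print(same_device(STORED_SERIAL, RAW_SERIAL))
    # Same form on both sides: digests match, so the drive is recognized.
    print(same_device(STORED_SERIAL, STORED_SERIAL))
```

Using the same form of the serial number on both sides of the comparison, as
the patch does with path->device->serial_num, makes the digests agree.
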
~ Kashyap

> -----Original Message-----
> From: Kashyap Desai [mailto:kashyap.desai@broadcom.com]
> Sent: Wednesday, October 12, 2016 7:33 PM
> To: Sumit Saxena; 'freebsd-scsi@freebsd.org'; 'scott4long@yahoo.com';
> 'ken@freebsd.org'
> Cc: Seema Kumashikar
> Subject: RE: MRSAS: SATA drives are getting deleted and then readded after
> controller reset
>
> Hi -
>
> Any update/pointer on this ? Issue happen only with SATA driver attached
> via CAM layer.
>
> Do we need to address this in driver or will there be any fix in CAM layer ?
>
> ~ Kashyap
>
> > -----Original Message-----
> > From: Kashyap Desai [mailto:kashyap.desai@broadcom.com]
> > Sent: Friday, September 23, 2016 8:55 AM
> > To: Sumit Saxena; 'freebsd-scsi@freebsd.org'; 'scott4long@yahoo.com';
> > 'ken@freebsd.org'
> > Cc: Seema Kumashikar
> > Subject: RE: MRSAS: SATA drives are getting deleted and then readded
> > after controller reset
> >
> > Hi -
> >
> > I have posted new BZ/defect.
> >
> > https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=212914
> >
> > ~ Kashyap
> >
> > > -----Original Message-----
> > > From: Sumit Saxena [mailto:sumit.saxena@broadcom.com]
> > > Sent: Thursday, September 22, 2016 7:54 PM
> > > To: freebsd-scsi@freebsd.org; scott4long@yahoo.com; ken@freebsd.org
> > > Cc: Kashyap Desai; Seema Kumashikar
> > > Subject: MRSAS: SATA drives are getting deleted and then readded
> > > after controller reset
> > >
> > > Ken/Scott,
> > >
> > > On FreeBSD11.0 RC1, we are facing an issue where SATA drives
> > > connected behind LSI's MegaRAID controller getting deleted and added
> > > back after controller reset.
> > > I am using Broadcom/Avago/LSI's MegaRAID Invader controller(device
> > > ID- 0x005d). The point to note here is- this behavior is not
> > > observed with SAS drives on FreeBSD11.0-RC1.
> > > Also on FreeBSD10.3 this behavior is not at all observed on SATA as
> > > well.
> > > We are debugging the issue but it would be much helpful if we can
> > > get quick inputs/pointers.
> > >
> > > Please find below the detailed information-
> > >
> > > OS: FreeBSD 11.0 RC1
> > > Controller: LSI's MegaRAID invader controller
> > >
> > > Connected devices list:
> > >
> > > root@freeBSD11:~ # camcontrol devlist
> > > at scbus5 target 0 lun 0 (pass0,ada0)
> > > at scbus6 target 0 lun 0 (ses0,pass1)
> > > at scbus8 target 51 lun 0 (da9,pass11)
> > > -----> this is SATA drive which is getting deleted and re-added
> > > post controller reset
> > > at scbus8 target 163 lun 0 (da8,pass10)
> > > at scbus9 target 0 lun 0 (da6,pass8)
> > > at scbus9 target 1 lun 0 (da2,pass4)
> > > at scbus9 target 2 lun 0 (da0,pass2)
> > > at scbus9 target 3 lun 0 (da7,pass9)
> > > at scbus9 target 4 lun 0 (da3,pass5)
> > > at scbus9 target 5 lun 0 (da1,pass3)
> > > at scbus10 target 48 lun 0 (da4,pass6)
> > > at scbus10 target 54 lun 0 (da5,pass7)
> > >
> > >
> > > Relevant dmesg logs snippet(da9 is SATA drive which is getting
> > > deleted and added back):
> > >
> > > ================================
> > > mrsas0: Initiaiting OCR because of FW fault!
> > > mrsas0: Waiting for FW to come to ready state
> > > mrsas0: Jbod map is supported
> > > mrsas0: Reset successful
> > > da9 at mrsas0 bus 1 scbus8 target 51 lun 0
> > > da9: s/n 9XE02AR2 detached
> > > (da9:mrsas0:1:51:0): Periph destroyed
> > > (da9:mrsas0:1:51:0): UNMAPPED
> > > (da9:mrsas0:1:51:0): fatal error, could not acquire reference count
> > > g_access(918): provider da9 has error
> > > g_access(918): provider da9 has error
> > > g_access(918): provider da9 has error
> > > (da9:mrsas0:1:51:0): UNMAPPED
> > > da9 at mrsas0 bus 1 scbus8 target 51 lun 0
> > > da9: Fixed Direct Access SPC-4 SCSI device
> > > da9: Serial Number 9XE02AR2
> > > da9: 150.000MB/s transfers
> > > da9: 238475MB (488397168 512 byte sectors)
> > > =================================
> > >
> > > Thanks,
> > > Sumit