Subject: Re: Panic on mlx5en.
To: Santiago Martinez, FreeBSD Current
From: Hans Petter Selasky
Date: Mon, 15 Jun 2020 13:26:35 +0200
Message-ID: <37677b81-0984-d6f4-9a6d-ebe94957ce47@selasky.org>
In-Reply-To: <1bf2ffc5-386a-c0fa-4cc1-246892da3a38@codenetworks.net>
List-Id: Discussions about the use of FreeBSD-current

On 2020-06-15 11:12, Santiago Martinez wrote:
> Hi everyone, while doing some tests for an MRSAS panic I hit another one
> on mlx5en.
>
> The device is a Lenovo SR655 with 2x Mellanox NIC.
>
> 1 - Mellanox CX-4 Lx 10/25GbE SFP28 2-port OCP Ethernet Adapter
> 2 - Mellanox CX-4 Lx 10/25GbE SFP28 2-port PCIe Ethernet Adapter
>
> This only happens with current; I tried different snapshots and got the
> same result in all of them.
> On 12.1 it works without a problem.
>
> Trace (please note that it is OCR + manual corrections):
>
> Tracing pid 2288 tid 182178 td 0xfffffeB385fe6500
> kdb_enter() at kdb_enter+0x37/frame 0xfffffe0386030ba0
> vpanic() at vpanic+0x19e/frame 0xfffffe0386030bf0
> panic() at panic+0x43/frame 0xfffffe038630c50
> trap_fatal() at trap_fatal+0x387/frame 0xfffffe0386030cb0
> trap() at trap+0x80/frame 0xfffffe0386030dc0
> calltrap() at calltrap+0x80/frame 0xfffffed386830dc0
> --- trap 0x9, rip = 0xfffffff8275c060, rsp = 0xfffffe0386030e90, rbp = 0xfffffe0386030e90 ---
> linux_root_RB_INSERT_COLOR() at linux_root_RB_INSERT_COLOR+0x40/frame 0xfffffe0386030f60
> give_pages() at give_pages+0x163/frame 0xfffffe0386830f20
> mlx5_satisfy_startup_pages() at mlx5_satisfy_startup_pages+0x76/frame 0xfffffe0386030f60
> mlx5_load_one() at mlx5_load_one+0x6b7/frame 0xfffffe0386031080
> init_one() at init_one+0x12d5/frame 0xfffffe03860310f0
> linux_pci_attach_device() at linux_pci_attach_device+0x573/frame 0xfffffe0386031150
> device_attach() at device_attach+0x3ca/frame 0xfffffe0386031190
> device_probe_and_attach() at device_probe_and_attach+0x70/frame 0xfffffe03860311c0
> pci_driver_added() at pci_driver_added+0xf6/frame 0xfffffe0386031200
> devclass_driver_added() at devclass_driver_added+0x39/frame 0xfffffe0386031240
> devclass_add_driver() at devclass_add_driver+0x147/frame 0xfffffe0386031280
> _linux_pci_register_driver() at _linux_pci_register_driver+0xc9/frame 0xfffffe03860312a0
>

Are you using the latest version of the kernel and mlx5en as of today?
There was a regression in the rbtree.h implementation which was recently
fixed.

--HPS