Date: Mon, 12 May 2025 12:29:57 -0700 From: Pete Wright <pete@nomadlogic.org> To: Colin Percival <cperciva@tarsnap.com>, freebsd-cloud@FreeBSD.org Cc: Arthur Kiyanovski <akiyano@FreeBSD.org> Subject: Re: ena(4) tx timeout messages in dmesg Message-ID: <a5ef1194-a020-417c-b8a9-b82badfa3ca0@nomadlogic.org> In-Reply-To: <01000196c5db82dc-cfa5bf54-9758-4125-bdca-f1794b76ac9f-000000@email.amazonses.com> References: <fec4cb4f-2a36-4a3d-bf02-539fd1a1273c@nomadlogic.org> <01000196c5b6fa5f-b8ed430e-23ca-47fd-9dd9-374a1de9c67c-000000@email.amazonses.com> <527aa929-4083-4935-8147-e59b6416c3bf@nomadlogic.org> <01000196c5db82dc-cfa5bf54-9758-4125-bdca-f1794b76ac9f-000000@email.amazonses.com>
index | next in thread | previous in thread | raw e-mail
On 5/12/25 11:56, Colin Percival wrote: > On 5/12/25 11:25, Pete Wright wrote: >> On 5/12/25 11:17, Colin Percival wrote: >>> On 5/12/25 11:04, Pete Wright wrote: >>>> hey there - i have an ec2 instance that i'm using as a nfs server >>>> and have noticed the following messages in my dmesg buffer: >>>> [...] >>>> ena0: Found a Tx that wasn't completed on time, qid 3, index 998. 1 >>>> msecs have passed since last cleanup. Missing Tx timeout value 5000 >>>> msecs. >>>> >>> I've heard that this can be caused by a thread being starved for CPU, >>> possibly >>> due to FreeBSD kernel scheduler issues, but that was on a far more >>> heavily >>> loaded system. What instance type are you running on? >> >> oh of course, forgot to provide useful info: >> >> # uname -ar >> FreeBSD airflow-nfs.q0.ringdna.net 14.2-RELEASE-p1 FreeBSD 14.2- >> RELEASE-p1 GENERIC amd64 >> >> Instance type: >> t3a.xlarge >> >> I also verified I have plenty of available "burstable credit" >> available since this is a t class system (current balance is steady at >> 2,300 credits). > > Ah, this won't necessarily help you -- T family instances are on shared > hardware so even if you have burstable credits it's possible that you'll > be unlucky with "noisy neighbours" and the sibling instances will all want > CPU at the same time as you. But I think there's probably something else > going on as well. > oh that's a good point, since this is a pre-prod system that is less of a concern as we want to limit spend when possible. i'll be spinning up production systems in the following week or so that will be on a "c" class system, i'll keep an eye out to see if see similar messages in that environment. -pete -- Pete Wright pete@nomadlogic.orghome | help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?a5ef1194-a020-417c-b8a9-b82badfa3ca0>
