Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 15 Dec 2021 14:24:31 +0000
From:      bugzilla-noreply@freebsd.org
To:        ports-bugs@FreeBSD.org
Subject:   [Bug 260438] dns/bind-tools: dig SIGABRT under high load
Message-ID:  <bug-260438-7788@https.bugs.freebsd.org/bugzilla/>

next in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D260438

            Bug ID: 260438
           Summary: dns/bind-tools: dig SIGABRT under high load
           Product: Ports & Packages
           Version: Latest
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Some People
          Priority: ---
         Component: Individual Port(s)
          Assignee: mat@FreeBSD.org
          Reporter: david@isnic.is
             Flags: maintainer-feedback?(mat@FreeBSD.org)
          Assignee: mat@FreeBSD.org

We have some shell scripts to set up zones on new slaves. Since we have a l=
arge
number (tens of thousands) of zones, we do this in parallel, querying the s=
lave
to see if the zone is set up and if not send an `rndc addzone`.

When doing a bulk provisioning like this, we see dig very occasionally die =
with
SIGABRT:
```
[root@hfp-master /usr/home/ansible]# dmesg
pid 72286 (dig), jid 0, uid 0: exited on signal 6 (core dumped)
```

Error ouput when this happens:
```
dighost.c:2628: REQUIRE((__builtin_expect(!!(((query)) !=3D ((void *)0)), 1=
) &&
__builtin_expect(!!(((const isc__magic_t *)((query)))->magic =3D=3D ((('D')=
 << 24 |
('i') << 16 | ('g') << 8 | ('q')))), 1))) failed, back trace
#0 0x4359ba in ??
#1 0x43594a in ??
#2 0x2bc814 in ??
#3 0x462a33 in ??
#4 0x44ade8 in ??
#5 0x447225 in ??
#6 0x800a39ada in ??
#7 0x800a4ac1b in ??
#8 0x800a3a051 in ??
#9 0x44735b in ??
#10 0x465135 in ??
Abort trap (core dumped)
```

This only seems to happen under load, running multiple dig commands in para=
llel
in a tight loop.

I've tried to create a concise repro case here without sharing our whole DNS
deployment script set, but don't have it ready. I'll add it later if I get =
it
working.

This looks extremely similar to the following upstream bugs:
https://gitlab.isc.org/isc-projects/bind9/-/issues/1981
https://gitlab.isc.org/isc-projects/bind9/-/issues/1971
https://gitlab.isc.org/isc-projects/bind9/-/issues/1956

Based on comments there, this should be fixed in this MR:
https://gitlab.isc.org/isc-projects/bind9/-/merge_requests/3721

However it seems this still happens on FreeBSD.

We are running bind-tools-9.16.23.

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-260438-7788>