Date: Mon, 22 Jun 2020 09:37:15 +0300 From: Andriy Gapon <avg@FreeBSD.org> To: Konstantin Belousov <kostikbel@gmail.com> Cc: src-committers@freebsd.org, svn-src-all@freebsd.org, svn-src-head@freebsd.org Subject: Re: svn commit: r362294 - head/sys/dev/sound/pci/hda Message-ID: <df3a1e48-328d-c216-623f-eb429babfad3@FreeBSD.org> In-Reply-To: <20200621215945.GA2057@kib.kiev.ua> References: <202006180612.05I6C6EE013037@repo.freebsd.org> <20200621215945.GA2057@kib.kiev.ua>
next in thread | previous in thread | raw e-mail | index | archive | help
On 22/06/2020 00:59, Konstantin Belousov wrote: > This commit (or rather, a merge of this commit to stable/12) causes an issue > on my Apollo Lake machine. Look: > hdac0@pci0:0:14:0: class=0x040300 card=0xa1821458 chip=0x5a988086 rev=0x0b hdr=0x00 > vendor = 'Intel Corporation' > device = 'Celeron N3350/Pentium N4200/Atom E3900 Series Audio Cluster' > class = multimedia > subclass = HDA > bar [10] = type Memory, range 64, base 0x91410000, size 16384, enabled > bar [20] = type Memory, range 64, base 0x91000000, size 1048576, enabled > cap 01[50] = powerspec 3 supports D0 D3 current D0 > cap 09[80] = vendor (length 20) Intel cap 15 version 0 > cap 05[60] = MSI supports 1 message, 64 bit > > DMAP base is 0xfffff80000000000. > tom% sudo /usr/local/bin/kgdb ~/work/DEV/svn/RELENG_12/sys/amd64/compile/TOM/kernel.full /dev/mem > GNU gdb (GDB) 9.2 [GDB v9.2 for FreeBSD] > Reading symbols from /usr/home/kostik/work/DEV/svn/RELENG_12/sys/amd64/compile/TOM/kernel.full... > ... > sched_switch (td=0xfffff8029d516000, newtd=0xfffff800025f4000, > flags=<optimized out>) at ../../../kern/sched_ule.c:2143 > 2143 cpuid = PCPU_GET(cpuid); > (kgdb) p/x *(unsigned int *)(0xfffff80000000000+0x91410000+0x24) INTSTS > $1 = 0xc0000000 > This causes the first interrupt handler to loop forever. > > And this is why: > (kgdb) p/x *(unsigned char *)(0xfffff80000000000+0x91410000+0x5d) RIRBSTS > $3 = 0x0 > (kgdb) p/x *(unsigned char *)(0xfffff80000000000+0x91410000+0x4d) CORBSTS > $4 = 0x0 > (kgdb) p/x *(unsigned short *)(0xfffff80000000000+0x91410000+0xe) WAKESTS > $5 = 0x5 > So the SDIN State Change status bit is set, and nothing in the driver > clears it, it seems. > > More, the EDS specifies that another reason for INTSTS CIS bit set may > be the CORB Memory Error Indication (CMEI), and again it seems that driver > never clears the reason. > > So as is the change perhaps helps for some situations, but also > practically makes some systems to loose core in tight loop with interrupt > priority, which causes weird machine misbehavior like unkillable processes > and never finishing epochs. Kostik, thank you for the report and the debugging! It seems that the driver currently does do not much with the register 0xe (referred to by you as WAKESTS, by the code and the HDA specification as STATESTS). I think that we can just clear that register (documented as RW1C) More, I am surprised that the driver does not clear the register in hdac_attach2() when probing codecs. So, maybe something like a quick patch below. Also, I see that the driver never explicitly manages WAKEEN register. I think that it should. If we do not need or do not expect any wake events then we should clear the corresponding bits. So, maybe it would be better to just zero WAKEEN in attach and resume methods. Although, WAKEEN is documented as zero after reset, the specification has this note: These bits are only cleared by a power-on reset. Software must make no assumptions about how these bits are set and set them appropriately. Regarding CMEI, we never enable it by setting CMEIE and it should be zero after reset. So, we should never get that interrupt. The same applies to another potential interrupt source, RIRBOIS / RIRBOIC. Index: sys/dev/sound/pci/hda/hdac.c =================================================================== --- sys/dev/sound/pci/hda/hdac.c (revision 362383) +++ sys/dev/sound/pci/hda/hdac.c (working copy) @@ -312,6 +312,10 @@ hdac_one_intr(struct hdac_softc *sc, uint32_t ints /* Was this a controller interrupt? */ if (intsts & HDAC_INTSTS_CIS) { + /* XXX just clear SDIWAKE bits. */ + HDAC_WRITE_2(&sc->mem, HDAC_STATESTS, + HDAC_STATESTS_SDIWAKE_MASK); + rirbsts = HDAC_READ_1(&sc->mem, HDAC_RIRBSTS); /* Get as many responses that we can */ while (rirbsts & HDAC_RIRBSTS_RINTFL) { @@ -1530,6 +1534,7 @@ hdac_attach2(void *arg) device_printf(sc->dev, "Scanning HDA codecs ...\n"); ); statests = HDAC_READ_2(&sc->mem, HDAC_STATESTS); + HDAC_WRITE_2(&sc->mem, HDAC_STATESTS, statests); hdac_unlock(sc); for (i = 0; i < HDAC_CODEC_MAX; i++) { if (HDAC_STATESTS_SDIWAKE(statests, i)) { -- Andriy Gapon
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?df3a1e48-328d-c216-623f-eb429babfad3>