Date: Wed, 23 Mar 2016 19:15:12 +0200 From: Konstantin Belousov <kostikbel@gmail.com> To: John Baldwin <jhb@freebsd.org> Cc: src-committers@freebsd.org, svn-src-all@freebsd.org, svn-src-head@freebsd.org Subject: Re: svn commit: r297039 - head/sys/x86/x86 Message-ID: <20160323171512.GA1741@kib.kiev.ua> In-Reply-To: <20160323075842.GX1741@kib.kiev.ua> References: <201603181948.u2IJmndg063765@repo.freebsd.org> <20160319032216.GZ1741@kib.kiev.ua> <1866602.Bp7VFd5f42@ralph.baldwin.cx> <20160323075842.GX1741@kib.kiev.ua>
next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Mar 23, 2016 at 09:58:42AM +0200, Konstantin Belousov wrote: > If TSC has the behaviour you described, i.e. suddenly jumping random > steps on single CPU, from the point of view of kernel, then the system > is seriosly misbehaving. The timekeeping stuff would be badly broken > regardless of the ipi_wait(). I do not see why should we worry about > that in ipi_wait(). > > I proposed slightly different thing, i.e. using timekeep code to indirect > to TSC if configured so. Below is the proof of concept patch, use of > nanouptime() may be too naive, and binuptime() would cause tiny bit less > overhead, but I do not want to think about arithmetic. > > diff --git a/sys/x86/x86/local_apic.c b/sys/x86/x86/local_apic.c > index 7e5087b..fdc12a4 100644 > --- a/sys/x86/x86/local_apic.c > +++ b/sys/x86/x86/local_apic.c > @@ -1621,31 +1621,37 @@ SYSINIT(apic_setup_io, SI_SUB_INTR, SI_ORDER_THIRD, apic_setup_io, NULL); > * private to the MD code. The public interface for the rest of the > * kernel is defined in mp_machdep.c. > */ > + > +/* > + * Wait delay microseconds for IPI to be sent. If delay is -1, we > + * wait forever. > + */ > static int > native_lapic_ipi_wait(int delay) > { > - int x; > + struct timespec end, x; > > /* LAPIC_ICR.APIC_DELSTAT_MASK is undefined in x2APIC mode */ > if (x2apic_mode) > return (1); > > - /* > - * Wait delay microseconds for IPI to be sent. If delay is > - * -1, we wait forever. > - */ > - if (delay == -1) { > - while ((lapic_read_icr_lo() & APIC_DELSTAT_MASK) != > - APIC_DELSTAT_IDLE) > - ia32_pause(); > - return (1); > + if (delay != -1) { > + KASSERT(delay > 0, ("wrong delay %d", delay)); > + x.tv_sec = delay / 1000000; > + x.tv_nsec = (delay % 1000000) * 1000; > + nanouptime(&end); > + timespecadd(&end, &x); > } > - > - for (x = 0; x < delay; x++) { > + for (;;) { > if ((lapic_read_icr_lo() & APIC_DELSTAT_MASK) == > APIC_DELSTAT_IDLE) > return (1); > - DELAY(1); > + ia32_pause(); > + if (delay != -1) { > + nanouptime(&x); > + if (timespeccmp(&x, &end, >)) > + break; > + } > } > return (0); > } Of course this cannot work if timecounter takes any locks, even spinlock. Only TSC or similar lock-less mechanisms are allowed there. Meh.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20160323171512.GA1741>