Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 23 Nov 2015 14:18:43 +0100
From:      Svatopluk Kraus <onwahe@gmail.com>
To:        Konstantin Belousov <kostikbel@gmail.com>
Cc:        FreeBSD Arch <freebsd-arch@freebsd.org>
Subject:   Re: a question about BUS_DMA_MIN_ALLOC_COMP flag meaning
Message-ID:  <CAFHCsPWy=-Q9MQZyGjYevEASoN1Q7Gv5f9Aj8N3pi1gV3Q6-Yw@mail.gmail.com>
In-Reply-To: <20151120144544.GB58629@kib.kiev.ua>
References:  <CAFHCsPVNntJVs051PvRuaY2S17Jf6kqbPPWq4tZpA-=Y-S32YA@mail.gmail.com> <20151120144544.GB58629@kib.kiev.ua>

next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Nov 20, 2015 at 3:45 PM, Konstantin Belousov
<kostikbel@gmail.com> wrote:
> On Wed, Nov 18, 2015 at 05:00:49PM +0100, Svatopluk Kraus wrote:
>> Hi,
>>
>> I have fallen to some problem with inconsistent use of
>> BUS_DMA_MIN_ALLOC_COMP flag. This flag was introduced in x86 MD code
>> very very long ago and so, the problem covers all archs which came out
>> from it.
>>
>> However, it's only about bus_dma_tag_t with BUS_DMA_COULD_BOUNCE flag set.
>>
>> (1) When bus_dma_tag_t is being created with BUS_DMA_ALLOCNOW flag
>> specified, some bounce pages could be allocated in advance and
>> BUS_DMA_MIN_ALLOC_COMP flag is set to the tag. The bounce pages are
>> allocated only if the tag's maxsize property is higher than size of
>> all bounce pages already allocated in a bounce zone.
>>
>> (2) When bus_dmamap_t is being created, then if BUS_DMA_MIN_ALLOC_COMP
>> is not set on associated tag, some bounce pages are ALWAYS allocated
>> and BUS_DMA_MIN_ALLOC_COMP is set afterwards,
>>
>> (3) else some bounce pages could be allocated if there is not enough
>> pages in a bounce zone and BUS_DMA_MIN_ALLOC_COMP is set afterwards.
>>
>> The problem is the following. Due to case (2), the number of pages in
>> bounce zone can grow infinitely, as bounce pages once allocated are
>> never freed. It can happen when a big number of bus_dma_tag_t together
>> with bus_dmamap_t are created, or they are created dynamically either
>> because of a loadable module or by design.
>>
>> The inconsistency is that when bus_dma_tag_t is being created, there
>> is no limit for how much pages could be allocated. On the other hand,
>> when bus_dmamap_t is being created, there is MAX_BPAGES limitation.
>>
>> I think that fix for case (2) presented as x86 fix is the following:
>>
>> diff --git a/sys/x86/x86/busdma_bounce.c b/sys/x86/x86/busdma_bounce.c
>> index 4826a2b..a15139f 100644
>> --- a/sys/x86/x86/busdma_bounce.c
>> +++ b/sys/x86/x86/busdma_bounce.c
>> @@ -308,7 +308,7 @@ bounce_bus_dmamap_create(bus_dma_tag_t dmat, int
>> flags, bus_dmamap_t *mapp)
>>          else
>>              maxpages = MIN(MAX_BPAGES, Maxmem -
>>                  atop(dmat->common.lowaddr));
>> -        if ((dmat->bounce_flags & BUS_DMA_MIN_ALLOC_COMP) == 0 ||
>> +        if ((dmat->bounce_flags & BUS_DMA_MIN_ALLOC_COMP) == 0 &&
>>              (bz->map_count > 0 && bz->total_bpages < maxpages)) {
>>              pages = MAX(atop(dmat->common.maxsize), 1);
>>              pages = MIN(maxpages - bz->total_bpages, pages);
>>
>>
>> IMO, it also fixes logic by making it same as in bus_dma_tag_t case.
> I think that this patch is correct.

So, with r291142 and r291193 intermezzo, the question is: what is right?

In fact, there were two possibilities:
(1) to keep BUS_DMA_MIN_ALLOC_COMP flag and make it consistent, so
bounce pages are allocated only once for a tag, or
(2) to remove it, so bounce pages are allocated for every created map
which needs them, up to some sane limit.

It turned out that (1) is not good for some driver. I'm not sure why,
but only thing I can imagine now is that a tag was created with
BUS_DMA_ALLOCNOW flag and then, a consumer breaks tag's maxsize
property. Or, there is another inconsitency in the if statement:
bz->map_count > 0. Bounce zone map count is incremented later, so when
bounce zone is without map, the test fails. Not mention that this map
count is not atomic.

Anyhow, what is right, (1) or (2) ?

>
>>
>> The next question is, if case (1) should be limited by MAX_BPAGES as
>> in case (3) or maybe better if there should be some internal
>> limitation for bounce zone itself.
> Could we apply e.g. MAX_BPAGES limit to bounce zones, or should we allow
> the limit to change based on the tag ?  But I am not sure if there is
> any reasonable way to formulate the limit.

Neither me.

>
> MAX_BPAGES looks like some arbitrary sanity limit, e.g. we could have
> unlimited maxsize, but also have an alignment constraints, and then
> tag requires bouncing. I am not sure that hard-coded values, esp. the
> amd64 32MB limit, makes much sense, or that basing the limit on the
> tag constraints makes more sense. Might be, we should allow some total
> percentage of the physical memory on machine to be consumed by all
> bounce zones altogether ?

IMO, some global limit should be there in any case. A tunable sounds
good, with some warning if a box owner wants too much.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAFHCsPWy=-Q9MQZyGjYevEASoN1Q7Gv5f9Aj8N3pi1gV3Q6-Yw>