Date: Mon, 15 Sep 2008 09:55:25 +0300 From: Kostik Belousov <kostikbel@gmail.com> To: Robert Watson <rwatson@freebsd.org> Cc: rnoland@freebsd.org, current@freebsd.org, "Yair K." <cesium2@gmail.com>, vehemens <vehemens@verizon.net> Subject: Re: cdevpriv and mmap(2) Message-ID: <20080915065525.GD39652@deviant.kiev.zoral.com.ua> In-Reply-To: <alpine.BSF.1.10.0809142004480.5662@fledge.watson.org> References: <20080914174801.GC39652@deviant.kiev.zoral.com.ua> <alpine.BSF.1.10.0809142004480.5662@fledge.watson.org>
next in thread | previous in thread | raw e-mail | index | archive | help
--SlNwLc1tujQOaj7L Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sun, Sep 14, 2008 at 08:15:04PM +0100, Robert Watson wrote: >=20 > On Sun, 14 Sep 2008, Kostik Belousov wrote: >=20 > >When implementing cdevpriv, I completeley missed the support for d_mmap(= )=20 > >driver method. This method is called by the kernel (at least) twice: one= =20 > >time to validate the mmaped region and protection mode (see=20 > >vm/device_pager.c:dev_pager_alloc()), and second time to obtain the=20 > >physical address when serving page fault (see dev_pager_getpages()). > > > >Support for cdevpriv for the first call(s) is trivial, and is implemente= d=20 > >by the patch below. Second calls are much harder and essentially require= =20 > >attaching cdevpriv bookkeeping data to the struct vm_map_entry. In fact,= I=20 > >am not sure whether this support for the second time calls is needed at= =20 > >all in real usage. > > > >I Cc:ed people that pointed me to the issue, please evaluate the patch= =20 > >against your needs. I think I will commit it shortly after your feedback= .=20 > >On the other hand, I would think about implementing full support for=20 > >d_mmap only if actually needed. >=20 > I'm not really sure that these changes "make sense" given the way our=20 > device pager works right now. My understanding is that most consumers of= =20 > cdevpriv use it in order to provide session-centric device nodes as a=20 > cleaner alternative to cloning. However, even with your change, it's not= =20 > possible to support session-centric memory mapping of devices as the memo= ry=20 > map and device pager code assumes there is one VM object for each device,= =20 > and hence all pages mapped independently from different file descriptors = on=20 > the same device are from the same set of pages (and hence in the same VM= =20 > object page cache). In order to implement session-centric semantics, I= =20 > think it's actually quite a bit more complicated than just adding=20 > vm_map_entry book-keeping -- we also need to have a different VM object f= or=20 > each session. >=20 > And, arguably, we need a more mature device_pager that understands that= =20 > pages might someday no longer be associated with a device due to that=20 > device being removed... The issues you raised are obviously important, but IMHO they are ortogonal to the cdevpriv KPI working in for _any_ pager. Pager' getpages method is usually called from the context where kernel does not have naturally supplied filedescriptor. For instance, most typical caller if vm_fault(). Thus, whatever pager is used, we have to provide a way for kernel to somehow associate struct file with faulted page (region), and make that file available to the pager. [Hmm, because kernel is allowed to fault too, vm_fault() should save/restore td_fpop.] Another point is that we have important consumers of the existing device pager interface that already want to use cdevpriv. Their usage ATM is limited to authentification, whatever it means. I assume it means checking that the caller was given some token the validation step. The code should be structured approx. like this: dri_mmap(...) { some_dri_data *p; int error; error =3D devfs_get_cdevpriv(&p); if (error =3D=3D 0) { /* authenticate; */ } /* * Auth successfull or error =3D=3D EBADF * Do whatever needed to return phys address */ ... } And, the last issue you raised, the need for the new pager is actually real for GEM/TTM or whatever the newest DRI interface is called. I have an intent to play with it, but more smart thing would be to wait some time more. Hopefully, then DRI folks will finally decide on the (more) stable interface. I am sure that it would be changed several dozen times in the future, but have a hope that it will not be designed from scratch. --SlNwLc1tujQOaj7L Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (FreeBSD) iEYEARECAAYFAkjOBtwACgkQC3+MBN1Mb4iaRwCglRl560YvTHhWavJeI7Ih0C7C MJcAnjX9IubccQVwoYmUamaJ0oPBP3wO =p2bF -----END PGP SIGNATURE----- --SlNwLc1tujQOaj7L--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080915065525.GD39652>