Date: Sat, 8 Jan 2011 22:20:28 +0200 From: Kostik Belousov <kostikbel@gmail.com> To: Lev Serebryakov <lev@freebsd.org> Cc: freebsd-stable@freebsd.org Subject: Re: 8.2-PRERELEASE: live deadlock, almost all processes in "pfault" state Message-ID: <20110108202028.GY12599@deviant.kiev.zoral.com.ua> In-Reply-To: <1544327450.20110108231021@serebryakov.spb.ru> References: <204344488.20110108214457@serebryakov.spb.ru> <20110108190232.GU12599@deviant.kiev.zoral.com.ua> <1792026896.20110108222909@serebryakov.spb.ru> <20110108195613.GW12599@deviant.kiev.zoral.com.ua> <1544327450.20110108231021@serebryakov.spb.ru>
next in thread | previous in thread | raw e-mail | index | archive | help
--WzyiqVXNYYkrrY2o Content-Type: text/plain; charset=koi8-r Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sat, Jan 08, 2011 at 11:10:21PM +0300, Lev Serebryakov wrote: > Hello, Kostik. > You wrote 8 =D1=CE=D7=C1=D2=D1 2011 =C7., 22:56:13: >=20 >=20 > >> And, if it is "classic deadlock" is here any "classical" solution to > >> it? > > Do not allocate during bio processing. > So, if GEOM need some cache, it needs pre-allocate it and implements > custom allocator over allocated chunk? :( >=20 > And what is "bio processing" in this context? geom_raid5 puts all bio processing =3D=3D whole time needed to finish pageout. Pageout is often performed to clean the page to lower the page shortage. If pageout requires more free pages to finish during the shortage, then we get the deadlock. Also, it seems that you allocate not only bios (small objects, not every request cause page allocation), but also the huge buffers, that require free pages each time. > bios into the (private, internal) queue and geom_start() exits > immediately, and bio could spend rather long time in queue (if it is > write request) before it will be sent to underlying provider. And, > yes, it could be combined with other bios to form new one (why > allocation of new bio is needed). >=20 > So, is "bio processing" a whole time before bio is complete, or only > geom_start() call or what? >=20 > Also, RAID5 needs to read data (other stripes) and write data (new > checksum) when "write" bio is processed. BTW, "system" geom_raid3 and > geom_vinum (with raid5 volume) need to do the same to maintain > checksums, so they could deadlock (in theory) too, if problem is > "allocate memory during bio processing". And geom_mirror needs > allocate bio for second (third, ...) component on every write... >=20 > --=20 > // Black Lion AKA Lev Serebryakov <lev@FreeBSD.org> >=20 --WzyiqVXNYYkrrY2o Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (FreeBSD) iEYEARECAAYFAk0oxwwACgkQC3+MBN1Mb4gPTQCgiL9vRWFvfd1a17Rssv9jmGt6 1xAAoI4StIuJ6/eCiriinVyGrzA3si9a =Fw1y -----END PGP SIGNATURE----- --WzyiqVXNYYkrrY2o--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20110108202028.GY12599>