From: Dmitry Sivachenko
Date: Sat, 5 Mar 2016 15:32:04 +0300
Subject: Re: nfs_getpages: error 4
To: Eugene Grosbein
Cc: FreeBSD Stable ML
Message-Id: <550ADE4F-9F60-44FB-BF07-A1384A6B7B1A@gmail.com>
In-Reply-To: <56DACD4E.3070905@grosbein.net>

> On 05 Mar 2016, at 15:13, Eugene Grosbein wrote:
>
> 05.03.2016 18:21, Dmitry Sivachenko wrote:
>> Hello,
>>
>> I am running a number of machines with /home mounted via nfs (FreeBSD 10.3-PRERELEASE #0 r294799, rw,bg,intr,soft).
>>
>> Sometimes I get the following messages in syslog:
>>
>> nfs_getpages: error 4
>> vm_fault: pager read error, pid NNN (myprog)
>>
>> After that I see a lot of processes stuck in "pfault" state (these are computational processes which use some files from NFS mount), they use 0% of CPU after that.
>>
>> On NFS server machine I see nothing strange in logs. procstat -kk for such stuck processes shows:
>>   PID    TID COMM    TDNAME  KSTACK
>> 85274 102056 myprog  -       mi_switch+0xbe sleepq_wait+0x3a _sleep+0x287 vm_waitpfault+0x8a vm_fault_hold+0xdd0 vm_fault+0x77 trap_pfault+0x180 trap+0x52c calltrap+0x8
>>
>>
>> What can be the reason of this?
>
> For example, if some processes running on NFS server box modify some files "in-place"
> and these files are opened by processes running on NFS client, that could be the reason.
> If so, change this so processes updating such files create new temporary versions of them first
> and then rename them atomically.
>

This should not be the case: users are working only on NFS clients.
Moreover, the nature of the computations is such that each process uses its own set of files.

(Forgot to mention in my previous e-mail that these processes can't be stopped even with kill -9)
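
For reference, the write-then-rename approach suggested in the quoted reply could look roughly like the sketch below. It is only an illustration: the helper name atomic_write and the ".tmp-" prefix are made up here, and nothing in this thread says the affected programs are written in Python or update their files this way.

```python
import os
import tempfile


def atomic_write(path, data):
    """Replace the file at `path` with `data` (bytes) without editing it in place.

    Hypothetical helper for illustration; not taken from the thread.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temporary file in the same directory, so that it is on the
    # same filesystem as the target and the final rename stays a pure rename.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".tmp-")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())  # flush the new contents before renaming
        # os.replace() uses rename(2) on POSIX systems: the name `path`
        # switches from the old file to the new one in a single step.
        os.replace(tmp_path, path)
    except BaseException:
        # Remove the temporary file if anything failed before the rename.
        try:
            os.unlink(tmp_path)
        except FileNotFoundError:
            pass
        raise
```

Note that tempfile.mkstemp() creates the file with mode 0600, so a real updater would probably also need to restore the original permissions and ownership before the rename.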