Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 11 Oct 2017 14:56:30 +0000 (UTC)
From:      Paul Pathiakis <pathiaki2@yahoo.com>
To:        Adam Vande More <amvandemore@gmail.com>, Kate Dawson <k4t@3msg.es>
Cc:        FreeBSD Questions <freebsd-questions@freebsd.org>
Subject:   Re: FreeBSD ZFS file server with SSD HDD
Message-ID:  <447544877.450092.1507733790531@mail.yahoo.com>
In-Reply-To: <CA%2BtpaK3Cga3SKmbKnRts_SSp=D4qk9p%2BaTzNBZeEqDuvQGVd9A@mail.gmail.com>
References:  <20171011130512.GE24374@apple.rat.burntout.org> <CA%2BtpaK3Cga3SKmbKnRts_SSp=D4qk9p%2BaTzNBZeEqDuvQGVd9A@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
Hi,
This is sort of a complementary answer. Seems like a cool setup.=C2=A0=C2=
=A0

One thing as an SA is to do detailed operational awareness with history for=
 things.=C2=A0 (aka a solid SNMP monitoring tool with graphing so that you =
can see what is happening at exactly that time regarding network status, I/=
O status, CPU + resources as well as other detail. )=C2=A0 There's a good a=
mount like Zabbix, Nagios, Munin, Ganglia, etc in ports/pkgs.=C2=A0 - My fa=
vorite is OpenNMS which I was going to do the port for but ran out of time.=
=C2=A0=C2=A0

They support point-in-time graphing.=C2=A0 Once you resolve this issue, get=
 some tool up and running so you can 'see' status on everything at that tim=
e. (an you can give read-only access to management so they can ooo and ahhh=
 at it.)=C2=A0 It is very useful to see where the bottleneck is... if there=
 is one...=C2=A0 otherwise, tuning parameters for everything (ZFS, NFS, NIC=
, etc) will have to tried as you can't easily narrow down where the problem=
 is.=C2=A0

Just trying to help in the long term.

P.

      From: Adam Vande More <amvandemore@gmail.com>
 To: Kate Dawson <k4t@3msg.es>=20
Cc: FreeBSD Questions <freebsd-questions@freebsd.org>
 Sent: Wednesday, October 11, 2017 10:41 AM
 Subject: Re: FreeBSD ZFS file server with SSD HDD
  =20
On Wed, Oct 11, 2017 at 8:05 AM, Kate Dawson <k4t@3msg.es> wrote:

> Hi,
>
> Currently running a FreeBSD NFS server with a zpool comprising
>
> 12 x 1TB hard disk drives are arranged as pairs of mirrors in a strip set
> ( RAID 10 )
>
> An additional 2x 960GB SSD added. These two SSD are partitioned with a
> small partition begin used for a ZIL log, and larger partion arranged for
> L2ARC cache.
>
> Additionally the host has 64GB RAM and 16 CPU cores (AMD Opteron 2Ghz)
>
> A dataset from the pool is exported via NFS to a number of Debian
> Gnu/Linux hosts running a xen hypervisor. These run several disk image
> based virtual machines
>
> In general use, the FreeBSD NFS host sees very little read IO, which is t=
o
> expected
> as the RAM cache=C2=A0 and L2ARC are designed to minimise the amount of r=
ead
> load
> on the disks.
>
> However we're starting to see high load ( mostly IO WAIT ) on the Linux
> virtualisation hosts, and virtual machines - with kernel timeouts
> occurring resulting in crashes and instability.
>
> I believe this may be due to the limited number of random write IOPS
> available
> on the zpool NFS export.
>
> I can get sequential writes and reads to and from the NFS server at
> speeds that approach the maximum the network provides ( currently 1Gb/s
> + Jumbo Frames, and I could increase this by bonding multiple interfaces
> together. )
>
> However day to day usage does not show network utilisation anywhere near
> this maximum.
>
> If I look at the output of `zpool iostat -v tank 1 ` I see that every
> five seconds or so, the numner of write operation go to > 2k
>
> I think this shows that the I'm hitting the limit that the spinning disk
> can provide in this workload.
>

I doubt that is the cause.=C2=A0 It is more likely you have

vfs.zfs.txg.timeout

set to the default.=C2=A0 Have you tried any other zfs or nfs tuning?=C2=A0=
 If so,
please share those details.

Does gstat reveal anything useful?


--=20
Adam
_______________________________________________
freebsd-questions@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org=
"


  =20
From owner-freebsd-questions@freebsd.org  Wed Oct 11 15:07:53 2017
Return-Path: <owner-freebsd-questions@freebsd.org>
Delivered-To: freebsd-questions@mailman.ysv.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org
 [IPv6:2001:1900:2254:206a::19:1])
 by mailman.ysv.freebsd.org (Postfix) with ESMTP id 835C3E2EBCB
 for <freebsd-questions@mailman.ysv.freebsd.org>;
 Wed, 11 Oct 2017 15:07:53 +0000 (UTC)
 (envelope-from tech-lists@zyxst.net)
Received: from out1-smtp.messagingengine.com (out1-smtp.messagingengine.com
 [66.111.4.25])
 (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
 (Client did not present a certificate)
 by mx1.freebsd.org (Postfix) with ESMTPS id 525206B889
 for <freebsd-questions@freebsd.org>; Wed, 11 Oct 2017 15:07:52 +0000 (UTC)
 (envelope-from tech-lists@zyxst.net)
Received: from compute4.internal (compute4.nyi.internal [10.202.2.44])
 by mailout.nyi.internal (Postfix) with ESMTP id 6D91A213D5
 for <freebsd-questions@freebsd.org>; Wed, 11 Oct 2017 11:07:46 -0400 (EDT)
Received: from frontend2 ([10.202.2.161])
 by compute4.internal (MEProxy); Wed, 11 Oct 2017 11:07:46 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=zyxst.net; h=
 content-type:date:from:in-reply-to:message-id:mime-version
 :references:subject:to:x-me-sender:x-me-sender:x-sasl-enc; s=
 fm1; bh=y9HO1wvtTfcKKuYmFC0rG0ab+YPr21ZTM6hVWpSOSgw=; b=YodnTMIx
 qLwNf9DZlmWqLuLwzeSyOE24L2vWiDOl0NXiFc+jHIzIjyx6poYIgkqnpCM78/cQ
 I9mph/Lm9fNCtTqKgssJhpx9uLNrgkQvOxT+V/b3YTnLxXdvQ7P/D3OlaXtCvGLp
 nBeWu5l0L4FEHSWRnC+I0sP/I6TTtAniMwyL8/u4CWbmq/VfFlQg/iIEJspP6OlO
 f/50mYnbjbyPZCTgxDEnMZxgHkfEuJLDLPF85XRBmrqzstewoLRXro1GOh/etPze
 sdJWApxq+Vu2YsuGzzwLEAeTeicZo9zXMgK/2F4peg3nBMs24UB+4/HpE1x6E6gj
 9WF/fUJMA1ikCg==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=content-type:date:from:in-reply-to
 :message-id:mime-version:references:subject:to:x-me-sender
 :x-me-sender:x-sasl-enc; s=fm1; bh=y9HO1wvtTfcKKuYmFC0rG0ab+YPr2
 1ZTM6hVWpSOSgw=; b=LO7xrg3627lAMzuZDoKiO4W7hsbnWao9Br1vGyCJr0RsW
 KLZ0GB6SUcqQh8znM33XVMuilgWyHmwkoZG5RYTrgNePfXkvqtyPVb3m2T54of15
 FHl+qX2TDrKZLT5d78Dj//1bM2w0Mb6fBYvDF4NkF2sAqTItOCZIRqDKUpt24TLA
 3nx8DsKw+8GBwvhH8r7D1H6W+9B/mnKdTuht8Im6HM6o26nPaHE31/Dl5L2ea7GO
 jr/mqUbxtaPD/KVZ5LMLq69ZMjQWunTJSKzrXZCMAO/xPwVD6NnXOo2IGZbliZ8j
 UZ0pOZdtVMDZtsgKtt3Su7bA49Xv9LhUh14z9i6fw==
X-ME-Sender: <xms:wjPeWRVcaiTANmXKHBPIBxq6uH4EgTYq4qxD2yOlfB0wonCFBIeCaQ>
Received: from acer.zyxst.net (parsley.growveg.org [82.70.91.97])
 by mail.messagingengine.com (Postfix) with ESMTPA id DE4A1247F0
 for <freebsd-questions@freebsd.org>; Wed, 11 Oct 2017 11:07:45 -0400 (EDT)
Date: Wed, 11 Oct 2017 16:07:44 +0100
From: tech-lists <tech-lists@zyxst.net>
To: freebsd-questions@freebsd.org
Subject: Re: problems with pkg: Operation timed out
Message-ID: <20171011150743.GA23107@acer.zyxst.net>
Mail-Followup-To: freebsd-questions@freebsd.org
References: <CAJ5UdcPVYnF3GN5EyHZSbLjUiPq4icujvFSJSp7hwH0gLAykxQ@mail.gmail.com>
 <VI1PR02MB1200A355A0975AB589DD6784F6740@VI1PR02MB1200.eurprd02.prod.outlook.com>
 <CAJ5UdcMkRv_cd7n-fo9DL-H+_MVuzkFUH4eub1g_2AxYdsKVPg@mail.gmail.com>
 <CAJ5UdcP+CU0=CMe953Mf+cW-N6wh24uMiz6cpu-MRsXftd6VkQ@mail.gmail.com>
 <CA+4G5KZ+e=FXRjRYrLb+T0cd4bGg4fHV3MCBTR+06uH6dR0RUA@mail.gmail.com>
 <CAJ5UdcP-DAE01jwOZ_eVm=oHJLc=4u_Lzykj+nQn3gR4BxxSJg@mail.gmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii; format=flowed
Content-Disposition: inline
In-Reply-To: <CAJ5UdcP-DAE01jwOZ_eVm=oHJLc=4u_Lzykj+nQn3gR4BxxSJg@mail.gmail.com>
User-Agent: Mutt/1.9.1 (2017-09-22)
X-BeenThere: freebsd-questions@freebsd.org
X-Mailman-Version: 2.1.23
Precedence: list
List-Id: User questions <freebsd-questions.freebsd.org>
List-Unsubscribe: <https://lists.freebsd.org/mailman/options/freebsd-questions>, 
 <mailto:freebsd-questions-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-questions/>;
List-Post: <mailto:freebsd-questions@freebsd.org>
List-Help: <mailto:freebsd-questions-request@freebsd.org?subject=help>
List-Subscribe: <https://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
 <mailto:freebsd-questions-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 11 Oct 2017 15:07:53 -0000

Hi,

On Wed, Oct 11, 2017 at 08:07:39AM -0500, Antonio Olivares wrote:

>root@aceraspire:~ # traceroute pkg.freebsd.org
>traceroute to pkgmir.geo.freebsd.org (96.47.72.71), 64 hops max, 40 byte packets
> 1  10.155.142.1 (10.155.142.1)  0.914 ms  9.633 ms  0.952 ms
> 2  172.17.0.1 (172.17.0.1)  4.262 ms  2.012 ms  1.911 ms
> 3  * * *
> 4  * * *
> 5  * * *
> 6  * *^C

That indicates a problem with your internet connection (or whatever is after 
172.17.0.1) rather than pkg.freebsd.org
-- 
J.



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?447544877.450092.1507733790531>