FreeBSD Mail Archives

Date:      Thu, 7 Jun 2012 08:26:59 -0600
From:      Scott Long <scottl@samsco.org>
To:        Daniel Kalchev <daniel@digsys.bg>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: Netflix's New Peering Appliance Uses FreeBSD
Message-ID:  <6833ED24-9638-43E7-AE35-289CEB3E06C2@samsco.org>
In-Reply-To: <4FD06FD7.2000708@digsys.bg>
References:  <CAMYW4Zi4y16EL1=%2Bsfz1XATc9ZnQpocUD_Xf9Jg=LR=c1AgaKA@mail.gmail.com> <3CEF3B39-BE1E-4FC4-81F3-D26049C83313@netflix.com> <4FD06FD7.2000708@digsys.bg>

index | next in thread | previous in thread | raw e-mail

On Jun 7, 2012, at 3:09 AM, Daniel Kalchev wrote:

> 
> 
> On 06.06.12 03:16, Scott Long wrote:
> 
> [...]
>> Each disk has its own UFS+J filesystem, except for
>> the SSDs that are mirrored together with gmirror.  The SSDs hold the OS image
>> and cache some of the busiest content.  The other disks hold nothing but the
>> audio and video files for our content streams.
> 
> Could you please explain the rationale of using UFS+J for this large storage. Your published documentation states that you have reasonable redundancy in case of multiple disk failure and I wonder how you handle this with "plain" UFS. Things like avoiding hangs and panics when an disk is going to die.

Redundancy happens by allowing the streaming clients to choose multiple other sources for their stream, and buffer enough of the stream to make a switchover appear seamless.  That other source might be a peer node on the same network, or might be a node that is upstream or on a different network.  The point of the caches is to hold as much content as possible, and we've found that it's more effective to maximize capacity but allow drives to fail in place than to significantly reduce capacity with hardware or software RAID.  When a disk starts having problems that affect its ability to deliver data on time, any clients affected by it simply switch to a different source.  When the disk does finally die, it is removed from the available pool and content is reshuffled on the other drives during the next daily content update.  Once enough disks fail that the cache is no longer effective, it gets replaced.

Scott

home | help

Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?6833ED24-9638-43E7-AE35-289CEB3E06C2>

Header And Logo

Peripheral Links

Site Navigation

Header And Logo

Peripheral Links

Search

Site Navigation