From owner-freebsd-questions@freebsd.org Mon Jan 25 18:03:45 2016 Return-Path: Delivered-To: freebsd-questions@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 1C648702F for ; Mon, 25 Jan 2016 18:03:45 +0000 (UTC) (envelope-from solene@bsd.zplay.eu) Received: from bsd.zplay.eu (bsd.zplay.eu [62.210.240.224]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "bsd.zplay.eu", Issuer "StartCom Class 1 Primary Intermediate Server CA" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 895A0EBC for ; Mon, 25 Jan 2016 18:03:44 +0000 (UTC) (envelope-from solene@bsd.zplay.eu) Received: from localhost (bsd.zplay.eu [local]) by bsd.zplay.eu (OpenSMTPD) with ESMTPA id 01e32272 for ; Mon, 25 Jan 2016 18:56:38 +0100 (CET) To: freebsd-questions@freebsd.org Subject: HAST primary role hang when copying data X-PHP-Originating-Script: 0:rcube.php MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII; format=flowed Content-Transfer-Encoding: 7bit Date: Mon, 25 Jan 2016 18:56:38 +0100 From: =?UTF-8?Q?Sol=C3=A8ne_Rapenne?= Message-ID: X-Sender: solene@bsd.zplay.eu User-Agent: Roundcube Webmail/1.1.4 X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 25 Jan 2016 18:03:45 -0000 Hello, I am trying to use HAST between 2 FreeBSD-10.2 servers which are in 2 differents DC with a 100/100 mbit/s network and 14ms of ping between them. I used both (not at the same time) OpenVPN and a SSH Tunnel to transport the data for security. In both tries, hastctl tells me that the sync is complete, I can mount it on one one, then I start to cp some files inside and then the primary system hang after a few seconds when I do something disk related, only a hard reboot can fix this. I can reproduce it anytime on both nodes. When starting to write on the primary, the second node lose synchronization and the primary hang like if it has a nfs mounted on something disconnected (if you know that case). I tried some hast.conf "tweaks" from the man to try to make it more "vpn friendly" without success resource essai { replication async compression lzf on BBB { local /home/shared remote 10.8.0.6 metaflush off } on AAA { local /home/shared remote 10.8.0.1 metaflush off } } /home/shared is a file made with dd with zeroes, maybe this is the problem ? Am I doing something wrong ? Kind regards