From owner-freebsd-www@FreeBSD.ORG Tue May 9 18:11:18 2006 Return-Path: X-Original-To: www@FreeBSD.org Delivered-To: freebsd-www@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id CB9DF16A901; Tue, 9 May 2006 18:11:18 +0000 (UTC) (envelope-from murray@freebsdmall.com) Received: from mail.freebsdmall.com (69.50.233.168.ip.nectartech.com [69.50.233.168]) by mx1.FreeBSD.org (Postfix) with ESMTP id 3860043D4C; Tue, 9 May 2006 18:11:16 +0000 (GMT) (envelope-from murray@freebsdmall.com) Received: by mail.freebsdmall.com (Postfix, from userid 2074) id DA9281D6F9CF; Tue, 9 May 2006 11:11:15 -0700 (PDT) Date: Tue, 9 May 2006 11:11:15 -0700 From: Murray Stokely To: Pav Lucistnik Message-ID: <20060509181115.GZ83847@freebsdmall.com> References: <1147173626.68599.49.camel@pav.hide.vol.cz> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1147173626.68599.49.camel@pav.hide.vol.cz> X-GPG-Key-ID: 1024D/0E451F7D X-GPG-Key-Fingerprint: E2CA 411D DD44 53FD BB4B 3CB5 B4D7 10A2 0E45 1F7D User-Agent: Mutt/1.5.11 Cc: www@FreeBSD.org Subject: Re: website statistics X-BeenThere: freebsd-www@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: FreeBSD Project Webmasters List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 May 2006 18:11:23 -0000 On Tue, May 09, 2006 at 01:20:26PM +0200, Pav Lucistnik wrote: > Simon Nielsen was kind enough to provide me with log files from > www.FreeBSD.org, nine days between Friday 28 April and Saturday 6 May. > > BASIC STATISTICS > > All data are cleared from search engine crawlers, RSS clients and > automated downloads of files for ports infrastructure. Very interesting. Thanks for doing this analysis. > My take: Ports are immensely popular, I think they should get their own > entry on the horizontal grey navbar. On the other hand, Community is > rarely visited and could be collapsed into Support. I think that is a good idea well supported by your findings. We should make sure to remove the 'Ports' from the 'Shortcuts' list when it is added to the gray top navbar. > Couldn't find /smp/ referenced from /projects/ page, should it be there? > Couldn't find a link to CVSweb interface, where is it referenced? cvsweb : www.freebsd.org -> developers -> cvs -> web interface Seems easy enough to find. The SMP page is out of date, so I'm not particularly worried about it being referenced more. > SEARCHBOTS > > On top of the numbers above, search engine crawlers generated another > 43,778 hits/day. Googlebot alone is responsible for 7875 hits/day. Google claims to have 1,680,000 documents from www.freebsd.org indexed (+site:www.freebsd.org query), and 3 times as much if you count people.freebsd.org and the various ftpN and other domains. If that is anywhere near correct, then 7875 hits / day means the average page is only refreshed every 213 days. I hope it crawls faster than 7875 hits / day on other days. Does Apache log a page hit if it was not returned because of an If-Modified-Since header? The current cached version of www.freebsd.org/index.html on Google is from May 8, just before our new logo update, while the cached version on Yahoo is from mid April, and on MSN is from May 2. - Murray