Date: Wed, 12 Oct 2005 11:21:55 +0100 From: Ceri Davies <ceri@submonkey.net> To: Murray Stokely <murray@freebsd.org> Cc: freebsd-www@FreeBSD.org, Wolfram Schneider <wosch@FreeBSD.org> Subject: Re: Using Yahoo! or Google search bar instead of search.cgi Message-ID: <20051012102155.GE99170@submonkey.net> In-Reply-To: <20051011224812.GC73646@freebsdmall.com> References: <20051008182448.GH6076@freebsdmall.com> <434C1B5D.303@FreeBSD.org> <20051011224812.GC73646@freebsdmall.com>
next in thread | previous in thread | raw e-mail | index | archive | help
--ANwsuk3Muel3MOM0 Content-Type: multipart/mixed; boundary="zIy/tJHoBBdASg6V" Content-Disposition: inline --zIy/tJHoBBdASg6V Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Oct 11, 2005 at 03:48:12PM -0700, Murray Stokely wrote: > On Tue, Oct 11, 2005 at 10:06:53PM +0200, Wolfram Schneider wrote: > > you know that search engines like yahoo or google index > > only 1/5 - 1/2 of a typical web site?! If a problem occurs, > > you have to wait until the next update - your are > > offline for at least 30 days. >=20 > There are sitemap APIs that allow you to get full coverage and much > quicker updates, e.g. >=20 > https://www.google.com/webmasters/sitemaps/docs/en/about.html >=20 > Still, your points are well taken and it makes it clear that trying to > get the best of both worlds (relevant results from a commercial search > engine and the breadth and immediacy given by search.cgi) would not be > trivial. Hmm, I don't know. Attached is a port of the sitemap generator and a sample config file for it. Looks pretty simple. Ceri --=20 Only two things are infinite, the universe and human stupidity, and I'm not sure about the former. -- Einstein (attrib.) --zIy/tJHoBBdASg6V Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename="sitemapgen-port.diff" Index: www/Makefile =================================================================== RCS file: /home/pcvs/ports/www/Makefile,v retrieving revision 1.1196 diff -u -r1.1196 Makefile --- www/Makefile 12 Oct 2005 05:54:42 -0000 1.1196 +++ www/Makefile 12 Oct 2005 10:17:31 -0000 @@ -769,6 +769,7 @@ SUBDIR += simplog SUBDIR += sitebar SUBDIR += sitecopy + SUBDIR += sitemapgen SUBDIR += skytemplate SUBDIR += slash SUBDIR += smarty Index: www/sitemapgen/Makefile =================================================================== RCS file: www/sitemapgen/Makefile diff -N www/sitemapgen/Makefile --- /dev/null 1 Jan 1970 00:00:00 -0000 +++ www/sitemapgen/Makefile 12 Oct 2005 10:17:31 -0000 @@ -0,0 +1,35 @@ +# New ports collection makefile for: sitemapgen +# Date Created: 12 Oct 2005 +# Whom: ceri +# +# $FreeBSD$ +# + +PORTNAME= sitemapgen +PORTVERSION= 1.3 +CATEGORIES= www +MASTER_SITES= ${MASTER_SITE_SOURCEFORGE} +MASTER_SITE_SUBDIR= goog-sitemapgen +DISTNAME= sitemap_gen-${PORTVERSION} + +MAINTAINER= ceri@FreeBSD.org +COMMENT= A Google sitemap generator + +USE_PYTHON= yes + +NOMAN= yes + +do-build: + +do-configure: + +do-install: + ${INSTALL_SCRIPT} ${WRKSRC}/sitemap_gen.py ${PREFIX}/bin/sitemap_gen +.if !defined(NOPORTDOCS) + ${MKDIR} ${DOCSDIR} +.for DOC in AUTHORS COPYING ChangeLog PKG-INFO README example_config.xml example_urllist.txt + ${INSTALL} ${WRKSRC}/${DOC} ${DOCSDIR} +.endfor +.endif + +.include <bsd.port.mk> Index: www/sitemapgen/distinfo =================================================================== RCS file: www/sitemapgen/distinfo diff -N www/sitemapgen/distinfo --- /dev/null 1 Jan 1970 00:00:00 -0000 +++ www/sitemapgen/distinfo 12 Oct 2005 10:17:31 -0000 @@ -0,0 +1,2 @@ +MD5 (sitemap_gen-1.3.tar.gz) = 641e1b4d9bff39b4fb599a267727b605 +SIZE (sitemap_gen-1.3.tar.gz) = 29975 Index: www/sitemapgen/pkg-descr =================================================================== RCS file: www/sitemapgen/pkg-descr diff -N www/sitemapgen/pkg-descr --- /dev/null 1 Jan 1970 00:00:00 -0000 +++ www/sitemapgen/pkg-descr 12 Oct 2005 10:17:31 -0000 @@ -0,0 +1,8 @@ +The sitemap_gen.py script analyzes your web server and generates one or +more Sitemap files. These files are XML listings of content you make +available on your web server. The files can then be directly submitted +to Google. + +Written by opensource@google.com. + +WWW: https://www.google.com/webmasters/sitemaps/docs/en/about.html Index: www/sitemapgen/pkg-plist =================================================================== RCS file: www/sitemapgen/pkg-plist diff -N www/sitemapgen/pkg-plist --- /dev/null 1 Jan 1970 00:00:00 -0000 +++ www/sitemapgen/pkg-plist 12 Oct 2005 10:17:31 -0000 @@ -0,0 +1,9 @@ +bin/sitemap_gen +%%PORTDOCS%%%%DOCSDIR%%/AUTHORS +%%PORTDOCS%%%%DOCSDIR%%/COPYING +%%PORTDOCS%%%%DOCSDIR%%/ChangeLog +%%PORTDOCS%%%%DOCSDIR%%/PKG-INFO +%%PORTDOCS%%%%DOCSDIR%%/README +%%PORTDOCS%%%%DOCSDIR%%/example_config.xml +%%PORTDOCS%%%%DOCSDIR%%/example_urllist.txt +%%PORTDOCS%%@dirrm %%DOCSDIR%% --zIy/tJHoBBdASg6V-- --ANwsuk3Muel3MOM0 Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.2 (FreeBSD) iD8DBQFDTOPDocfcwTS3JF8RAvGVAKDKWvPpLvPutwz36/9epjroYXVLlgCgioJa 5XH2oS2obsAcZ8SV0jEn7PU= =I7Bg -----END PGP SIGNATURE----- --ANwsuk3Muel3MOM0--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20051012102155.GE99170>