Date: Sat, 18 Jul 2009 23:11:58 -0700 (PDT) From: "Joe R. Jah" <jjah@cloud.ccsf.cc.ca.us> To: Andrew Brampton <brampton+freebsd@gmail.com> Cc: freebsd-questions@freebsd.org Subject: Re: OT: wget bug Message-ID: <20090718230601.F10250@cloud.ccsf.cc.ca.us> In-Reply-To: <d41814900907181009q1d4f4b62k9818607a65421419@mail.gmail.com> References: <20090717144049.C35992@cloud.ccsf.cc.ca.us> <d41814900907180452p29244911nd2570909e7274791@mail.gmail.com> <20090718093237.Y19472@cloud.ccsf.cc.ca.us> <d41814900907181009q1d4f4b62k9818607a65421419@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
This message is in MIME format. The first part should be readable text,
while the remaining parts are likely unreadable without MIME-aware tools.
--0-306400255-1247983918=:10250
Content-Type: TEXT/PLAIN; charset=UTF-8
Content-Transfer-Encoding: QUOTED-PRINTABLE
On Sat, 18 Jul 2009, Andrew Brampton wrote:
> Date: Sat, 18 Jul 2009 18:09:54 +0100
> From: Andrew Brampton <brampton+freebsd@gmail.com>
> To: Joe R. Jah <jjah@cloud.ccsf.cc.ca.us>
> Cc: freebsd-questions@freebsd.org
> Subject: Re: OT: wget bug
>
> 2009/7/18 Joe R. Jah <jjah@cloud.ccsf.cc.ca.us>:
> > Thank you Andrew. =C2=A0Yes the server is truly returning 401. =C2=A0I =
have already
> > reconfigured wget to download everything regardless of their timestamp,
> > but it's a waste of bandwidth, because most of the site is unchanged.
> >
> > Do you know of any workaround in wget, or an alternative tool to ONLY
> > download newer files by http?
> >
>
> Joe,
> There are two ways to check if the file has been changed. One, read
> the time the file was last changed, or two, read the file and compare
> it to a old copy. Wget was obviously trying to do option 1 but this is
> denied by the remote server. You most likely could get it to do option
> 2, however by doing so you are wasting bandwidth downloading unchanged
> files just to check if they had been changed.
>
> If you have control over the remote webserver, then the simplest way
> to solve this problem is to configure the webserver not to return 401
> when wget sends the If-Modified-Since HTTP header. A better solution,
> again assuming you have control of the remote server, is to use
> "rsync" as it is designed for this kind of task.
>
> If you don't have control over the remote server, then you are stuck
> with your current solution.
>
> Andrew
Thank you Andrew.
Regards,
Joe
--=20
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah jjah@cloud.ccsf.cc.ca.us
--0-306400255-1247983918=:10250--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20090718230601.F10250>
