Date:      Sat, 18 Jul 2009 23:11:58 -0700 (PDT)
From:      "Joe R. Jah" <jjah@cloud.ccsf.cc.ca.us>
To:        Andrew Brampton <brampton+freebsd@gmail.com>
Cc:        freebsd-questions@freebsd.org
Subject:   Re: OT: wget bug
Message-ID:  <20090718230601.F10250@cloud.ccsf.cc.ca.us>
In-Reply-To: <d41814900907181009q1d4f4b62k9818607a65421419@mail.gmail.com>
References:  <20090717144049.C35992@cloud.ccsf.cc.ca.us>  <d41814900907180452p29244911nd2570909e7274791@mail.gmail.com>  <20090718093237.Y19472@cloud.ccsf.cc.ca.us> <d41814900907181009q1d4f4b62k9818607a65421419@mail.gmail.com>

On Sat, 18 Jul 2009, Andrew Brampton wrote:

> Date: Sat, 18 Jul 2009 18:09:54 +0100
> From: Andrew Brampton <brampton+freebsd@gmail.com>
> To: Joe R. Jah <jjah@cloud.ccsf.cc.ca.us>
> Cc: freebsd-questions@freebsd.org
> Subject: Re: OT: wget bug
>
> 2009/7/18 Joe R. Jah <jjah@cloud.ccsf.cc.ca.us>:
> > Thank you Andrew.  Yes the server is truly returning 401.  I have already
> > reconfigured wget to download everything regardless of their timestamp,
> > but it's a waste of bandwidth, because most of the site is unchanged.
> >
> > Do you know of any workaround in wget, or an alternative tool to ONLY
> > download newer files by http?
> >
>
> Joe,
> There are two ways to check if the file has been changed. One, read
> the time the file was last changed, or two, read the file and compare
> it to an old copy. Wget was obviously trying to do option 1, but this is
> denied by the remote server. You most likely could get it to do option
> 2; however, by doing so you are wasting bandwidth downloading unchanged
> files just to check whether they have changed.
>
> If you have control over the remote webserver, then the simplest way
> to solve this problem is to configure the webserver not to return 401
> when wget sends the If-Modified-Since HTTP header. A better solution,
> again assuming you have control of the remote server, is to use
> "rsync" as it is designed for this kind of task.
>
> If you don't have control over the remote server, then you are stuck
> with your current solution.
>
> Andrew
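
For the archive, here is a minimal Python sketch of "option 1" as Andrew describes it: send the local file's modification time in an If-Modified-Since header and download only when the server reports a newer copy. The function names (conditional_headers, action_for_status, fetch_if_newer) are made up for illustration, and treating a 401 as "fall back to a full download" is an assumption about how one might handle this server, not something wget does.

```python
import os
import urllib.error
import urllib.request
from email.utils import formatdate


def conditional_headers(mtime):
    """Build the header asking for the file only if it changed after mtime."""
    # usegmt=True produces the HTTP date form, e.g. "... 00:00:00 GMT".
    return {"If-Modified-Since": formatdate(mtime, usegmt=True)}


def action_for_status(status):
    """Map an HTTP status code to what the mirror script should do."""
    if status == 304:
        return "skip"        # server says the local copy is current
    if status == 200:
        return "download"    # server sent a newer (or first) copy
    return "fallback"        # e.g. 401: the conditional check was refused


def fetch_if_newer(url, local_path):
    """Fetch url into local_path only when the server reports a change."""
    headers = {}
    if os.path.exists(local_path):
        headers = conditional_headers(os.path.getmtime(local_path))
    request = urllib.request.Request(url, headers=headers)
    try:
        with urllib.request.urlopen(request) as response:
            status, body = response.status, response.read()
    except urllib.error.HTTPError as err:
        # urllib reports 304 and 401 as HTTPError; keep only the code.
        status, body = err.code, None
    action = action_for_status(status)
    if action == "download":
        with open(local_path, "wb") as out:
            out.write(body)
    return action
```

A result of "fallback" is the situation in this thread: the server refuses the conditional request, so the only remaining choice is an unconditional download.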

Thank you Andrew.

Regards,

Joe
-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        jjah@cloud.ccsf.cc.ca.us