Re: Downloading HTML files
|
Joined: Apr 2003
Posts: 16,441
shareholder in the making
|
shareholder in the making
Joined: Apr 2003
Posts: 16,441 |
To avoid any sort of browser "infection", you could try curl for Windows. It's built in on most UNIX/linux OSes and very easy to use. E.g., from the command prompt: curl http://www.somesite.com/stuff.html -o somesite-stuff.html This would download the stuff.html file from somesite.com and save it locally as somesite-stuff.html. Since it's a command-line utility, you could batch a bunch of sites together.
|
|
|
Re: Downloading HTML files
|
Joined: Oct 2006
Posts: 484
devotee
|
OP
devotee
Joined: Oct 2006
Posts: 484 |
The plot thickens. The fellow who approached me with this in the first place maintains that he was using IE8 on both a Win 7 box and an XP box. Good, usable results files downloaded with the XP system, and bloated unmanageable files on the Win 7 box.
He bought a Win 7 box to progress with his project, but instead it's stopped the project dead in its tracks. He's still got the XP box, but the future is with Win 7, so that's what he'd prefer to use. Understandable, I believe.
|
|
|
Re: Downloading HTML files
|
Joined: Oct 2006
Posts: 484
devotee
|
OP
devotee
Joined: Oct 2006
Posts: 484 |
That sounds like a great plan Peter. Thanks very much for that information. Really top notch.
I left my AIX/Unix days behind me in the '90s, and it's easy to forget just how useful utilities built for those OS's are.
I'll pass that along and will try to post back his feedback here this week.
|
|
|
Re: Downloading HTML files
|
Joined: Feb 2009
Posts: 3,466
connoisseur
|
connoisseur
Joined: Feb 2009
Posts: 3,466 |
wget may be easier to use than curl. It's my tool of choice.
Pioneer PDP-5020FD, Marantz SR6011 Axiom M5HP, VP160HP, QS8 Sony PS4, surround backs -Chris
|
|
|
Re: Downloading HTML files
|
Joined: Apr 2003
Posts: 16,441
shareholder in the making
|
shareholder in the making
Joined: Apr 2003
Posts: 16,441 |
True. wget is a bit more powerful and I'd use it instead if you need to grab a bunch of different files from a web server and want to filter out anything other than .htm or .html files. Here's an example I used recently:
I have a server that holds install and configuration files for the linux desktops I deploy and manage. In one of my automated installs, I need to grab the latest NVIDIA driver from my install server. The name of this file is not constant, but it always ends with a .run extension, so I use the following command to grab it:
wget -r -nH -np -nd -A run http://yum1:8080/nvidia/
The options basically say "look at all the files in the nvidia directory on that web server but only grab the ones that have a '.run' file extension." This works since I only ever keep one in there.
|
|
|
Re: Downloading HTML files
|
Joined: Feb 2009
Posts: 3,466
connoisseur
|
connoisseur
Joined: Feb 2009
Posts: 3,466 |
wget can also be as simple as:
wget "http://www.axiomaudio.com/"
That'll create a file named "index.html" in your current directory. So that is easier than curl for getting a single document. You have at least tell curl what name to save the file with, or it'll just write to the screen.
Of course you can tell wget to save with a different name by just giving it the "-o filename.html" option too.
Pioneer PDP-5020FD, Marantz SR6011 Axiom M5HP, VP160HP, QS8 Sony PS4, surround backs -Chris
|
|
|
Re: Downloading HTML files
|
Joined: Apr 2003
Posts: 16,441
shareholder in the making
|
shareholder in the making
Joined: Apr 2003
Posts: 16,441 |
curl -O http://the.url.com will also save an index.html (or whatever default file the server gives you) in your current directory.
|
|
|
Re: Downloading HTML files
|
Joined: Feb 2009
Posts: 3,466
connoisseur
|
connoisseur
Joined: Feb 2009
Posts: 3,466 |
Still too much typing.
Pioneer PDP-5020FD, Marantz SR6011 Axiom M5HP, VP160HP, QS8 Sony PS4, surround backs -Chris
|
|
|
Re: Downloading HTML files
|
Joined: Apr 2003
Posts: 16,441
shareholder in the making
|
shareholder in the making
Joined: Apr 2003
Posts: 16,441 |
A spurious criticism for a board regular to make.
|
|
|
Re: Downloading HTML files
|
Joined: Feb 2009
Posts: 3,466
connoisseur
|
connoisseur
Joined: Feb 2009
Posts: 3,466 |
If I was typing -O every time I wanted to save a file, how would I have time to spend here?
Pioneer PDP-5020FD, Marantz SR6011 Axiom M5HP, VP160HP, QS8 Sony PS4, surround backs -Chris
|
|
|
Forums16
Topics24,964
Posts442,603
Members15,631
|
Most Online2,699 Aug 8th, 2024
|
|
0 members (),
1,030
guests, and
3
robots. |
Key:
Admin,
Global Mod,
Mod
|
|
|
|