Automatic Downloading of files (PDF or HTML) using Excel

Hari Prasadh

Hi,

Strange requests..

a) I want to download all the conference papers from a particular SAS user
group's proceedings (SAS is statistics software). The SAS website
(http://support.sas.com/usergroups/sugi/proceedings/index.html) says that the
papers are intended for viewing through a browser only, and that if they are
downloaded the hyperlinks will not work. I'm not worried about the hyperlinks
not being available.

Each conference paper is a separate PDF file (see
http://www2.sas.com/proceedings/sugi29/toc.html ). For this particular
conference, 270 papers were published. The links for all papers follow a
regular pattern: the first paper is
http://www2.sas.com/proceedings/sugi29/001-29.pdf , the second paper is
http://www2.sas.com/proceedings/sugi29/002-29.pdf .... and the 270th paper
is http://www2.sas.com/proceedings/sugi29/270-29.pdf .

Using Excel, can one download all the papers to a specified directory on my
computer? I plan to list all the paper links in column A of a worksheet
(listing them would be easy using Excel's CONCATENATE function). Can a macro
read the link in each cell, one by one, and download the file? How does one
go about it?
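For the "outside of Excel" route, the pattern in point a) can be sketched in a few lines of Python. This is only a rough sketch, not a tested downloader: the function names `build_paper_urls` and `download_all` are made up for illustration, and it assumes the site still serves the files at the URLs quoted above.

```python
import os
from urllib.request import urlretrieve


def build_paper_urls(conference=29, count=270):
    """Build the paper links from the pattern NNN-CC.pdf described above."""
    base = f"http://www2.sas.com/proceedings/sugi{conference}"
    return [f"{base}/{n:03d}-{conference}.pdf" for n in range(1, count + 1)]


def download_all(urls, target_dir):
    """Save each URL into target_dir, skipping links that fail to download."""
    os.makedirs(target_dir, exist_ok=True)
    saved = []
    for url in urls:
        # Use the last path segment (e.g. 001-29.pdf) as the local file name.
        filename = os.path.join(target_dir, url.rsplit("/", 1)[-1])
        try:
            urlretrieve(url, filename)
            saved.append(filename)
        except OSError as err:  # URLError and HTTPError are OSError subclasses
            print(f"skipped {url}: {err}")
    return saved
```

Calling `download_all(build_paper_urls(29, 270), "papers")` would then attempt all 270 files; the folder name `papers` is just a placeholder.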

b) In point a) all the links followed a specific pattern, so listing
them in an Excel worksheet is no sweat. But suppose they did not follow any
pattern; how would one approach the problem then? Is it possible for an
Excel macro to READ all the web links in a particular HTML file and list
them one by one in an Excel workbook? This method would be handy if I
have to download everything linked from a particular web page when the
names of those links do not follow a specific pattern.
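Outside of Excel, reading all the links out of an HTML page as asked in point b) might look roughly like the Python sketch below. It uses only the standard library; `LinkCollector` and `extract_links` are illustrative names, and the page text is assumed to have been loaded into a string already.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against the page URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links such as "001-29.pdf" into full URLs.
                absolute = urljoin(self.base_url, value)
                if absolute not in self.links:
                    self.links.append(absolute)


def extract_links(html_text, base_url):
    """Return the unique links found in html_text, in page order."""
    collector = LinkCollector(base_url)
    collector.feed(html_text)
    return collector.links
```

The returned list could be written out one link per line and pasted into column A, or passed straight to a download loop.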

c) I want to download all the html pages WITHIN www.solver.com in a
specified directory on my computer. If I give the PARENT link in a
particular cell in EXCEL, is it possible for a macro to download all the web
pages under the SOLVER domain?
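For point c), the usual approach is a small crawler: fetch the parent page, collect its links, and keep following only those on the same host. The Python sketch below is a simplified illustration under several assumptions: the names `crawl_site` and `fetch_html` are invented here, pages are assumed to decode as UTF-8, the `max_pages` cap is arbitrary, and it ignores robots.txt and rate limiting, which a real crawl of someone's site should respect.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class _LinkParser(HTMLParser):
    """Collect raw href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch_html(url):
    """Download one page and return its text (assumes UTF-8)."""
    with urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl_site(start_url, fetch=fetch_html, max_pages=50):
    """Breadth-first crawl limited to the host of start_url."""
    host = urlparse(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        try:
            html = fetch(url)
        except OSError:
            continue  # unreachable page: skip it and carry on
        pages[url] = html
        parser = _LinkParser()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop "#fragment" parts.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append(link)
    return pages
```

The result maps each visited URL to its HTML text, so saving to a directory is a matter of writing each value to a file. The offline browsers suggested elsewhere in this thread do the same job with far more care.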

I want to be able to do this using Excel (would help me in understanding
Excel's INTERACTION with NON-Office applications.)

(As an aside, if there are smarter ways to do this outside of excel, would
be glad to know.)

Thanks a lot,
Hari
India
 
Amedee Van Gasse

Hari Prasadh shared this with us in microsoft.public.excel.programming:
> Hi,
>
> Strange requests..

Not really...

*snip*
> (As an aside, if there are smarter ways to do this outside of excel,
> would be glad to know.)

http://www.tenmax.com/teleport/pro/home.htm
--> I used this one several years ago, it did the job just fine
http://www.bluesquirrel.com/products/webwhacker/
--> Tried it once, looked good.
http://www.spidersoft.com/webzip/default.asp
--> Never tried it but heard good comments.

Google for "offline browser" to find others.

--
Amedee Van Gasse using XanaNews 1.17.3.1
If it has an "X" in the name, it must be Linux?

How To Ask Questions The Smart Way

How to Report Bugs Effectively
http://www.chiark.greenend.org.uk/~sgtatham/bugs.html
Only ask questions with yes/no answers if you want "yes" or "no" as the
answer.
http://homepages.tesco.net/~J.deBoynePollard/FGA/questions-with-yes-or-no-answers.html
 
Hari Prasadh

Hi Amedee,

Thanks a ton for your links; I will go through them.

Additionally, would anybody have some input on how to accomplish the same
using Excel? I searched some posts in Google (search string "web browser
excel" within the programming group) but couldn't get far. Would anybody
know of past threads that have done something similar?

Please guide me.

Thanks a lot,
Hari
India
 
Hari Prasadh

Hi,

I refined my search string ("download excel") and came across this
interesting post by Tim W in the thread "Bulk Download of Excel Files":
http://groups-beta.google.com/group/microsoft.public.excel.programming/browse_thread/thread/1161e802e90d035d/03a9b27a93e87d02?q=download+excel&rnum=9#03a9b27a93e87d02

In the post Tim suggests adding 5 references to the VBA project, and of
those I am not able to find the following 2: "ms ado" and "ms msxml". I
searched the references list in alphabetical order.

I looked for both "Microsoft ADO" and "MS ADO" but to no avail. I can,
however, see Microsoft ADO Ext. 2.7 for DDL and Security and many versions
of Microsoft ActiveX Data Objects (2.0 through 2.8 Library) on my computer.

Similarly, I can see Microsoft XML, version 2.0 through 3.0, but not
"ms msxml".

(I assume "ms HTML object library" and "Microsoft HTML library" are the same.)

Do I have to install anything extra to get the above 2 object libraries?

Please guide me.

I have Office 2002 (Win XP SP2)

Thanks a lot,
Hari
India
 
Tim Williams

Hari Prasadh said:
> Hi,
> I tried both Microsoft ADO and MS ADO but to no avail. But I can see
> Microsoft ADO Ext. 2.7 for DDL and security and many versions of
> Microsoft Active X data Objects (2.0 thru 2.8 library) in my
> computer.

That's it (Active X data Objects).

> Similarly I can see Microsoft XML, Version 2.0 thru 3.0 but not ms
> msxml.

msxml = microsoft XML

tim.
 
