1

I'm putting together a small personal project and need to download a page for offline viewing. Is there a library that will download a given page and its embedded images, and edit the img tags to point to the local copies of the images?

I know there are a lot of website downloaders out there, but I can't find one that I can use directly in my code.

I already have some basic scripts written in Python, so Python is very welcome, but pretty much any language will do.

Midhat

4 Answers

1

You're looking for BeautifulSoup.

SLaks
1

How about pywebcrawler, a Python web crawler? http://code.google.com/p/pywebcrawler/

Or Anemone (Ruby)? http://anemone.rubyforge.org/

geffchang
1

Yes: BeautifulSoup plus Python's urllib module.
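
A rough sketch of how the two could fit together, assuming Python 3 and the bs4 package. The names fetch_bytes and save_page, the images/ subdirectory, and the index.html file name are made up for illustration; this only handles img tags (no CSS, scripts, or srcset) and does no error handling.

```python
import os
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen

from bs4 import BeautifulSoup


def fetch_bytes(url):
    """Download a URL and return the raw response body."""
    with urlopen(url) as response:
        return response.read()


def save_page(page_url, out_dir, fetch=fetch_bytes):
    """Save page_url and its <img> files under out_dir; return the HTML path.

    Hypothetical helper: `fetch` can be swapped out, e.g. for testing offline.
    """
    os.makedirs(os.path.join(out_dir, "images"), exist_ok=True)

    soup = BeautifulSoup(fetch(page_url), "html.parser")
    saved = {}  # absolute image URL -> path relative to out_dir
    for img in soup.find_all("img", src=True):
        if img["src"].startswith("data:"):
            continue  # inline images are already embedded in the page
        abs_url = urljoin(page_url, img["src"])
        if abs_url not in saved:
            # Prefix a counter so images sharing a file name do not collide.
            name = os.path.basename(urlparse(abs_url).path) or "image"
            rel_path = "images/{}_{}".format(len(saved), name)
            with open(os.path.join(out_dir, rel_path), "wb") as f:
                f.write(fetch(abs_url))
            saved[abs_url] = rel_path
        # Point the tag at the local copy.
        img["src"] = saved[abs_url]

    html_path = os.path.join(out_dir, "index.html")
    with open(html_path, "w", encoding="utf-8") as f:
        f.write(str(soup))
    return html_path
```

Something like save_page("http://example.com/", "offline_copy") would then leave offline_copy/index.html with its img tags pointing into offline_copy/images/.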

Dingo
0

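The simplest solution I can think of is wget with -p (--page-requisites), which also fetches the images and other files the page needs:

wget -p example.com

Adding -k (--convert-links) rewrites the links in the saved page to point at the local copies.

EricR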