(sorry for my bad english)
recently, a friend of mine told me about how easy it is to have a bakup from his own and his customers websites. he showed me then a little bit of this program and i was very impressed about the vary of settings.
I then asked him to do me a favour and "misuse" the program to download some images from a website i'd like :-)
Unfortunately, he did'nt managed to download images from this website. (We sat there for maybe 5 hours trying all settings and options we thought, that might be solving that issue, reading this forum too, but failed. Needles to say, that we caused that website some traffic, which i did'nt not want to squander. So we stopped then trying our luck.)
He told me, that some websites protect their images and content from being "crawled and spidered", so he cannot do anything about this.
The website i'm refering to is http://www. celebscentral. net/ => please be aware, that this site may contain some (harmless) nudity, so i inserted spaces to not click acidentially on that website.
My question is now - for educational purpurse :-) - is this true that some websites can not be processed with this program?
And is any of you experienced users able to elucidate how this is, or could be done?
Any explanation on this thread is very appreceated.
Thanks a lot,
All images (big-size) linked to the starting page were loaded. The links on this site are pretty standard and plain.
To load the whole site with all people, you should choose to download from the starting domain - URL Filters - Server section. Set Level to some high value to follow to the desired depth of links.
but it seems, that anything is not configured the same way like in your program,
because it does not work here (at my friends place, where i showed him your mail)
We did the following, as you told us:
"I chose http://sasha-cohen.celebscentral.net/ as a starting address, created a
new project with this URL, Level=1, allowed download from the starting server
using URL Filters. That's all. All images (big-size) linked to the starting page
Please see the screencast, because of the settings I did while trying to follow your instructions:
the result, as you can see in the video, shows a directory with only small images.
I made a screenshot, size-sorted, to let you see, what files are downloaded:
I also saved a Project:
Stream 1.2 File
So, it is good to know, that it is theoreticially possible, but can you please point to the mistake, we do?
sorry for the late reply, but thanks too!
with your help, we have learned some new things and i finally got the requested images without beeing to bandwithconsuming.