Hello, I want to know, how to download some images from a site.

Author Message
Hajo Schmidt 02/17/2011 01:56 pm
Hello dear MetaProducts Support and MetaProducts Users,
(sorry for my bad english)

recently, a friend of mine told me about how easy it is to have a bakup from his own and his customers websites. he showed me then a little bit of this program and i was very impressed about the vary of settings.

I then asked him to do me a favour and "misuse" the program to download some images from a website i'd like :-)

Unfortunately, he did'nt managed to download images from this website. (We sat there for maybe 5 hours trying all settings and options we thought, that might be solving that issue, reading this forum too, but failed. Needles to say, that we caused that website some traffic, which i did'nt not want to squander. So we stopped then trying our luck.)

He told me, that some websites protect their images and content from being "crawled and spidered", so he cannot do anything about this.

The website i'm refering to is http://www. celebscentral. net/ => please be aware, that this site may contain some (harmless) nudity, so i inserted spaces to not click acidentially on that website.

My question is now - for educational purpurse :-) - is this true that some websites can not be processed with this program?
And is any of you experienced users able to elucidate how this is, or could be done?

Any explanation on this thread is very appreceated.

Thanks a lot,
H.S.
Oleg Chernavin 02/17/2011 01:59 pm
Actually, I tried and had no problems with the images on this site. For example, I chose http://sasha-cohen.celebscentral.net/ as a starting address, created a new project with this URL, Level=1, allowed download from the starting server using URL Filters. That's all.

All images (big-size) linked to the starting page were loaded. The links on this site are pretty standard and plain.

To load the whole site with all people, you should choose to download from the starting domain - URL Filters - Server section. Set Level to some high value to follow to the desired depth of links.

Best regards,
Oleg Chernavin
MP Staff
Hajo Schmidt 02/18/2011 01:57 pm
Hello and thank you for that short how-to-steps,
but it seems, that anything is not configured the same way like in your program,
because it does not work here (at my friends place, where i showed him your mail)

We did the following, as you told us:

"I chose http://sasha-cohen.celebscentral.net/ as a starting address, created a
new project with this URL, Level=1, allowed download from the starting server
using URL Filters. That's all. All images (big-size) linked to the starting page
were loaded."

Please see the screencast, because of the settings I did while trying to follow your instructions:
http://www.screencast.com/users/user3454363/folders/Jing/media/196b5eec-03b4-44ef-846e-62113d3ef2eb

the result, as you can see in the video, shows a directory with only small images.
I made a screenshot, size-sorted, to let you see, what files are downloaded:
http://www.bilder-hochladen.net/files/hcyg-1-png.html

I also saved a Project:
[code]
Stream 1.2 File
[Object]
OEVersion=Enterprise 5.9.0.3284
Type=0
IID=7011
Caption=http://sasha-cohen.celebscentral.net/
URL=http://sasha-cohen.celebscentral.net/
Lev=1
Weekday=257
LimTSize=10000
LimNumber=5000
LimTime=100
LTMethod=2
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwfwebp
FTVideo.Exts=mpgavianimpegmovflvfliflcvivrmramrvasfasxwmvm1vm2vvobsmilmp4m4v
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaapeoggm4a
FTArchive.Exts=ziparcgzzarjlhalayleirarcabtarpakacejarpdftgzexe
FTUDef.Exts=jsaxdcssssivbsdtdxslswfclassent
FTText.B=ooxooo
FTImages.B=ooxooo
FTVideo.B=ooxooo
FTAudio.B=ooxooo
FTArchive.B=ooxooo
FTUDef.B=ooxooo
FTOther.B=ooxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,3,0
NotIgnoreLogout=False
RSrvsBx=1
RProt=255
LastStart=167:102:193:63:25:210:227:64:
LastEnd=68:157:145:65:25:210:227:64:
LastStarted=18.02.2011 18:56:12
LastEnded=18.02.2011 18:56:31
S200=65
SPar=37
SSav=65
SLast=200
SSiz=932143
SMdf=65
SHTML=30
SSuccDowns=1
LFiles=65
LSize=872533
Flags=1
ImgDim=-20,-1000,-20,-1000
PrevURL=http://sasha-cohen.celebscentral.net/
ConvertRSS=True
IPAddr=371323454
LIndexed=False
IndexFiles=False
[/code]

So, it is good to know, that it is theoreticially possible, but can you please point to the mistake, we do?
Oleg Chernavin 02/18/2011 01:57 pm
Just two changes in the Project Properties dialog - Links Translation - choose Offline Links Translation. File Filters - Images - choose "Load from any site" in the Location box.

Oleg.
Hajo Schmidt 03/26/2011 01:21 pm
Hello,

sorry for the late reply, but thanks too!
with your help, we have learned some new things and i finally got the requested images without beeing to bandwithconsuming.
thank you.
Oleg Chernavin 03/29/2011 09:44 am
You are welcome! Great that it is helpful!

Oleg.