Siterip of pictures where direct access to picture content is forbidden - You don't have permission to access /images on this server.

Author Message
Mr Smith 06/28/2016 01:22 pm
I am trying to do a rip of the photos from:
http://www.comicartshop.com
However, the photos are not hosted at the above site. They are hosted at: http://art.cafimg.com/images/

The siterip cannot begin at http://art.cafimg.com/images/ as I "don't have permission to access /images on this server."

What settings should I use to download the images and not the whole internet?

Here is a sample URL:
http://art.cafimg.com/images/Category_1306/subcat_94257/chenstripKANG.jpg
Oleg Chernavin 06/28/2016 07:26 pm
It is quite easy to do. You need to download from the starting server or site only and allow images to be downloaded from any site.

This is simple to set in the File - Tasks wizard. If you have already created a Project, select it, click the Properties button, open the File Filters - Images section and select "Load from any site" in the Location box.

Best regards,
Oleg Chernavin
MP Staff
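
For readers who want to see the idea behind these two settings, here is a minimal Python sketch of the rule "pages only from the starting server, images from any site". It is only an illustration, not how Offline Explorer works internally. The function name `should_download` and the short extension list are assumptions made for this example.

```python
from urllib.parse import urlparse

# Illustrative subset of image extensions (an assumption for this sketch).
IMAGE_EXTENSIONS = (".gif", ".jpg", ".jpeg", ".png")


def should_download(url: str, start_url: str) -> bool:
    """Return True if the URL passes the 'images from anywhere, pages from start host' rule."""
    parsed = urlparse(url)
    # Images are accepted regardless of which server hosts them.
    if parsed.path.lower().endswith(IMAGE_EXTENSIONS):
        return True
    # Everything else must live on the same host as the starting URL.
    return parsed.hostname == urlparse(start_url).hostname


start = "http://www.comicartshop.com/"
print(should_download(
    "http://art.cafimg.com/images/Category_1306/subcat_94257/chenstripKANG.jpg", start))
print(should_download("http://example.org/page.html", start))
```

Under this rule the image on art.cafimg.com is accepted while a page on an unrelated host is not.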
Mr Smith 06/29/2016 08:11 am
The problem is that this site links to many other sites, and my previous attempts resulted in millions of queued links. My download directory has 497 folders of links/URLs linking from that site. The site "art.cafimg.com" is among them, but only a small number of its images were downloaded. Could you take a look at "http://www.comicartshop.com"?

I have the starting web address as:
http://www.comicartshop.com/

Level limit set as disabled

Images as:
download from any website

download from the starting server or domain is disabled

Maybe I could restrict the downloads via the "url filters/servers/included keywords" function?
Oleg Chernavin 06/29/2016 08:20 am
Yes, this is also possible. Let's do it this way. Allow downloading from all servers in the URL Filters - Server section.

URL Filters - Directory - add to the Included list:

http://www.comicartshop.com/*
http://art.cafimg.com/images/*

All File Filters categories - select "Load using URL Filters" in their Location boxes.

If you need

Oleg.
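
As a rough illustration of what an Included directory list like the one above does, here is a small Python sketch using shell-style wildcards. It assumes `*` matches any run of characters, including slashes. Offline Explorer's real matching rules may differ, and the function name `is_included` is made up for this example.

```python
from fnmatch import fnmatchcase

# The two patterns from the Included list in the post above.
INCLUDED = [
    "http://www.comicartshop.com/*",
    "http://art.cafimg.com/images/*",
]


def is_included(url: str, patterns=INCLUDED) -> bool:
    """Return True if the URL matches at least one included pattern."""
    return any(fnmatchcase(url, pattern) for pattern in patterns)


print(is_included("http://art.cafimg.com/images/Category_1306/subcat_94257/chenstripKANG.jpg"))
print(is_included("http://i.imgur.com/0q06Y3R.png"))
```

With this kind of filter, links to the hundreds of unrelated sites never enter the queue, because only the two listed prefixes pass.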
Mr Smith 07/11/2016 12:59 pm
Could you take a look at this image and tell me if these numbers seem correct?
http://i.imgur.com/0q06Y3R.png


Also, is it possible to exclude "searchresults.asp" from being "processed"?
For example:
http://www.comicartshop.com/SearchResult.asp?PF=63&PC=15&txtSearch=Men

Maybe the software has been unable to get past "searchresults.asp". It is a big website, so maybe there is no problem and I just need to keep waiting.
Oleg Chernavin 07/11/2016 09:35 pm
Yes, it is easy to disable them. Add this keyword to the Excluded filename keywords list in the URL Filters - Filename section:

searchresults.asp

Oleg.
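
A keyword exclusion of this kind can be pictured with the short Python sketch below. It assumes plain case-insensitive substring matching, which is only a guess at how the program treats keywords, and `is_excluded` is a hypothetical helper. Note that the sample URL earlier in the thread is spelled "SearchResult.asp" (no trailing "s"), so under this assumption the exact spelling of the keyword matters.

```python
def is_excluded(url: str, keywords) -> bool:
    """Return True if any keyword occurs in the URL, ignoring case."""
    lowered = url.lower()
    return any(keyword.lower() in lowered for keyword in keywords)


sample = "http://www.comicartshop.com/SearchResult.asp?PF=63&PC=15&txtSearch=Men"

print(is_excluded(sample, ["searchresults.asp"]))  # False: the URL has no "s" before ".asp"
print(is_excluded(sample, ["searchresult.asp"]))   # True
```

If URLs still show up after adding a keyword, comparing the keyword character by character with a real URL from the queue is a quick check.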
Mr Smith 07/12/2016 07:25 am
Is this correct:
http://i.imgur.com/scIjW7H.png

Also, should I pause the project and then resume it for the changes to take effect? It seems that "searchresults.asp" is still being processed.
Oleg Chernavin 07/12/2016 08:29 am
Yes, this is correct. To remove them from the Download Queue, switch to it, click the Select By Mask button, enter this keyword and abort all found items.

New URLs like this will not be added any more.

Oleg.
Mr Smith 07/12/2016 10:06 am
I went to "queue", pressed "select all" and then pressed "abort and disable". I then entered the "custom mask", which resulted in the remaining queued links being aborted and the project completing. However, zero images were downloaded, which perhaps means I made a mistake. This is just a guess, but it seemed that the software did not link/process/find the image URLs at "http://art.cafimg.com/images/" from the original URL of "http://www.comicartshop.com".

Could you attempt to download a single image from "http://art.cafimg.com/images" via "http://www.comicartshop.com"
and then tell me what settings you used?
Oleg Chernavin 07/12/2016 04:43 pm
OK. It looks like the following Project settings work. Select the whole text starting from the [Object] line, copy to clipboard, switch to Offline Explorer and press Ctrl+V on keyboard:

[Object]
OEVersion=Pro 7.2.4515
Type=0
IID=62805
Caption=http://www.comicartshop.com/ComicArtShops.asp
URL=http://www.comicartshop.com/ComicArtShops.asp
MVer=5
Lev=10
Weekday=257
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwfwebp
FTVideo.Exts=mpgavianimpegmovflvfliflcvivrmramrvasfasxwmvm1vm2vvobsmilmp4m4v
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaapeoggm4aaif
FTArchive.Exts=7zziparcgzzarjlhalayleirarcabtarpakacejarpdftgzexeiso
FTUDef.Exts=jsaxdcssssivbsdtdxslswfclassent
FTText.B=ooxooo
FTImages.B=ooxooo
FTVideo.B=xoxooo
FTAudio.B=xoxooo
FTArchive.B=xoxooo
FTUDef.B=ooxooo
FTOther.B=xoxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,3,0,0,0,0,0,0,0,0
NotIgnoreLogout=False
RFileIn=comicartshopsbycat.aspgallerypiece.asphttp://*cafimg.com/images/*/*classifiedclick.asp xxxx
RProt=255
LastStart=95:231:146:138:127:200:228:64:
LastEnd=121:194:96:155:127:200:228:64:
PrjStart=6:214:172:141:218:252:227:64:
LastStarted=12.07.2016 23:39:21
LastEnded=12.07.2016 23:42:18
S200=212
S304=36
SAbr=6891
SPar=214
SSav=212
SLast=302
SSiz=11703301
SMdf=208
SHTML=210
LFiles=248
LSize=11703301
Stopped=True
Flags=1
Descr=e1xydGYxXGFuc2lcZGVmZjB7XGZvbnR0Ymx7XGYwXGZuaWwgTVMgU2FucyBTZXJpZjt9fQ0KXHZpZXdraW5kNFx1YzFccGFyZFxsYW5nMTA0OVxmMFxmczE2IA0KXHBhciB9DQo=
ImgDim=0,0,0,0
PrevURL=http://www.comicartshop.com/ComicArtShops.asp
ConvertRSS=True
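
For anyone who wants to inspect a settings dump like the one above outside the program, here is a minimal Python sketch that reads the `key=value` lines into a dictionary. It assumes one setting per line and splits on the first `=` only, so values that themselves contain `=` or `://` stay intact. The helper name `parse_project` is hypothetical and this is not an official format description.

```python
def parse_project(text: str) -> dict:
    """Parse 'key=value' lines, skipping blank lines and [Section] headers."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("["):
            continue
        # Split on the first '=' only; the rest of the line is the value.
        key, separator, value = line.partition("=")
        if separator:
            settings[key] = value
    return settings


# A few lines copied from the project settings above.
sample = """[Object]
URL=http://www.comicartshop.com/ComicArtShops.asp
Lev=10
Stopped=True
"""

project = parse_project(sample)
print(project["URL"], project["Lev"])
```

This makes it easy to check values such as `Lev` (the level limit) before pasting the text into Offline Explorer.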
Mr Smith 07/15/2016 12:13 am
The settings you provided were a success. This is excellent software and I have received excellent customer support. You are an exceptional credit to MetaProducts®.
Oleg Chernavin 07/25/2016 07:38 pm
It is very nice to hear! Thank you very much!

Oleg.
Mr Smith 08/04/2016 06:59 pm
Could you tell me how you decided to set the depth of the rip to ten?
Lev=10
From the results of the rip, I'm quite sure this was not an arbitrary decision.
Oleg Chernavin 08/04/2016 07:23 pm
I looked at some typical artists to see how many clicks on links might be necessary to get to the deepest picture.

However, I didn't notice that some authors have over 500 images. Since a page contains 18 pictures, it would require up to level=20 to get them all.

Oleg.
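
The page count behind an estimate like this can be worked out with a couple of lines of Python. The sketch below only computes how many gallery pages a given number of images fills at 18 per page. How many levels those pages cost depends on how the site's pagination links are laid out, which the thread does not state. `gallery_pages` is a name made up for this example.

```python
import math


def gallery_pages(image_count: int, per_page: int = 18) -> int:
    """Number of gallery pages needed to show image_count images."""
    return math.ceil(image_count / per_page)


print(gallery_pages(500))  # 28
```

If each page linked only to the next one, every extra page would add one level. Pagination bars that link to several pages at once reduce the depth needed.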