Directory wildcard name--is it possible?

anova
01/05/2005 06:05 am
Let's say I want to download photo files from a site with very simple structure:

http://www.sitename.com/members/content/<rootdirectory>/

OK, within <rootdirectory> there are 100 subdirectories with all sorts of names. These subdirectories contain the photos I would like to download.

My question is: Do I have to *specifically* include the exact subdirectory names in the URL Filters Directory section?

I would like to be able to do something like: "/*/", but that doesn't seem to work. If I include the exact directory name, it does work. But that is too much work for me, even if I can define a Notepad file with the 100 different directory names.

Thanks for any help you can give me.
Oleg Chernavin
01/05/2005 09:13 am
Wildcards are surely supported. Can you give me more examples (a list of a few directories that should be allowed for downloading)?

Thank you!

Best regards,
Oleg Chernavin
MP Staff
anova
01/05/2005 10:46 am
For example, I would like to be able to grab "index.php" files listing photo files from subdirectories:

http://www.sitename.com/members/content/photos/apple/


http://www.sitename.com/members/content/photos/orange/


http://www.sitename.com/members/content/photos/method/

http://www.sitename.com/members/content/photos/louann/

These subdirectory names are really quite random, with no repeating characters that I could target with URL macros of the type {:abc}.

I was hoping I could just put in something like:
http://www.sitename.com/members/content/photos/*/
But OE Enterprise (ver. 3.4) doesn't work with the * wildcard directory.

Thanks for any help.
Oleg Chernavin
01/05/2005 10:48 am
The following Included Directory keyword should work here:

/members/content/photos/*
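
To illustrate what a trailing wildcard like this is meant to cover, here is a minimal Python sketch. It is not how Offline Explorer itself evaluates keywords; it only assumes shell-style matching, where * stands for any run of characters, including slashes. The function name directory_allowed is made up for this example.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlparse

# Keyword copied from the post above; the matching rule itself is an assumption.
INCLUDED_DIRECTORY = "/members/content/photos/*"


def directory_allowed(url: str, pattern: str = INCLUDED_DIRECTORY) -> bool:
    """Return True if the directory part of the URL matches the wildcard pattern."""
    path = urlparse(url).path
    # Keep everything up to and including the last slash (the directory part).
    directory = path[: path.rfind("/") + 1]
    # fnmatchcase lets "*" match any characters, slashes included.
    return fnmatchcase(directory, pattern)


if __name__ == "__main__":
    for url in (
        "http://www.sitename.com/members/content/photos/apple/",
        "http://www.sitename.com/members/content/photos/orange/index.php",
        "http://www.sitename.com/members/content/videos/apple/index.php",
    ):
        print(url, directory_allowed(url))
```

Under that assumption, every subdirectory below /members/content/photos/ passes without being listed by name, while sibling directories such as a hypothetical /members/content/videos/ are rejected.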

Oleg.
anova
01/05/2005 11:31 am
This does not work, by the way. It seems that I have to actually spell out the subdirectory name containing the files.

I don't know what else to do. I also notice that I have to put the actual subdirectory name in the project address URL. I was hoping I could use the parent directory (e.g., http://www.sitename.com/members/content/photos/) so that I could grab the subdirectories: apple, orange, method, etc. But no go. I have to individually spell out in the project URL each subdirectory I wish to download from. My directory levels are set to 5, by the way.

I'm trying to automate the downloading of this site, so that I don't have to sit there and feed it subdirectory names in the project URL to download.

Thanks for your help.
anova
01/05/2005 01:56 pm
OK, here's what I have done:

Project address url: http://www.sitename.com/members/content/photos/
Referer=http://www.sitename.com/members/content/

file filter:
text (load only from starting directory): php, html, . , htm
images (load using url filter settings): jpg

directory keyword:
/*/*

filename keyword:
php
jpg
html
htm

With these settings, it appears that OE is now actually downloading the PHP files within each subdirectory, which point to the links of the photos that I want. Right now, it's downloading all the PHP files (1 PHP file per photo file); hopefully, when it's finished with the PHP files, it will start pulling down the actual photos.
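
For anyone trying to reason about how these two keyword lists combine, here is a rough Python sketch. It is only a model, not OE's real logic: it assumes the directory keyword is matched shell-style against the directory part of the URL, and that a filename keyword matches when it appears anywhere in the file name. The helper names and the sample file names (photo01.jpg, readme.txt, logo.jpg) are invented for the example.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlparse

# Values copied from the settings listed above.
DIRECTORY_KEYWORD = "/*/*"
FILENAME_KEYWORDS = ("php", "jpg", "html", "htm")


def split_url(url: str) -> tuple[str, str]:
    """Split a URL path into (directory part, file name)."""
    path = urlparse(url).path
    cut = path.rfind("/") + 1
    return path[:cut], path[cut:]


def would_download(url: str) -> bool:
    """Model of the filter: the directory must match AND the file name must hit a keyword."""
    directory, filename = split_url(url)
    if not fnmatchcase(directory, DIRECTORY_KEYWORD):
        return False
    # Assumption: a keyword matches when it occurs anywhere in the file name.
    return any(keyword in filename.lower() for keyword in FILENAME_KEYWORDS)


if __name__ == "__main__":
    base = "http://www.sitename.com/members/content/photos/apple/"
    for url in (base + "index.php", base + "photo01.jpg", base + "readme.txt"):
        print(url, would_download(url))
```

In this model, "/*/*" accepts any directory at least one level below the site root, which would explain why the apple, orange, and other subdirectories no longer need to be named one by one.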

Thanks, Oleg.
Oleg Chernavin
01/06/2005 01:45 am
You can examine the Queue to see if the photos are going to be loaded. I would only suggest that you press F9 before clicking the Queue tab. This will make it work faster, with fewer redraws. Press F9 again when you are back in the Projects tab.

Oleg.
anova
01/06/2005 08:38 pm
Oleg,

It all worked out well. It grabbed all the photos that I wanted automatically.

Thanks for all your help.
Oleg Chernavin
01/09/2005 05:29 am
You are welcome!

Oleg.