Filename filtering not working

Author Message
Ziyad 07/22/2004 02:07 am
I`m trying to download a particular section on a forum
Forum (IPB) has 4 section
Link of the four sections is
http://www.entityparadigm.com/forums/index.php?showforum=2
http://www.entityparadigm.com/forums/index.php?showforum=1
http://www.entityparadigm.com/forums/index.php?showforum=4
http://www.entityparadigm.com/forums/index.php?showforum=3

I only want to get showforum=3 I put showforum=4 and the rest in the filename exclude list, but it isn`t working
plus i also DON`t wnat to get the user profile page which has "showuser=" in the filename. but nothing the project just gets one file and completes. when i tell it to get all the files it works fine.

Any suggestion on how to correctly use the exclude filename filter
Oleg Chernavin 07/22/2004 06:56 am
I would suggest you to put:

showforum=3
showtopic

to the Included list. It is easier. Only URLs with these keywords will be loaded. The rest - skipped.

Best regards,
Oleg Chernavin
MP Staff
ziyad 07/22/2004 10:25 am
oven pal it worked. thanks pal
one more question. i want to exclude these files
showtopic=8498&view=getlastpost

now since showtopic is in my imclude list. these files (view=getlastpost) are also downloaded.
how can i exclude these.

i want showtopic, but not if it also contains view=getlastpost

regards ziyad
Oleg Chernavin 07/22/2004 11:00 am
Simply add:

view=getlastpost

to the Excluded filename keywords list.

Oleg.
David 09/15/2004 07:39 am
Hi,
I am downloading from http://www.themarker.co.il all the pages 2 level depth.
I want to exclude index pages from the project, but I need to download the links from
those pages , without downloading the index pages.
I have identiffied a unique tag (article bold>) in those index pages and tried to filter it
with the content filter, it didn`t work.
What do you think?

Regards David
Oleg Chernavin 09/15/2004 07:54 am
Yes, it will not work, because the Contents Filter looks inside the page text, not inside tags. This code removes all tags from a page and searches only in the visible page texts.

Oleg.