I''m trying to follow and download html pages that (for instance) have ''jane'' in the server name OR filename:
I am also excluding some server keywords like ''comments'' in http://comments.theserver/*/*/janepage.html and some keyword exclusions in directories and filenames.
What would be great is the functional equivalent of [''jane'' and not (''comments'' or ''journals'')] anywhere in the url.
I''ve never understood the relationship between included and excluded keywords. For instance - what if the URL contains some excluded keywords AND some included keywords? Does one over-ride the other? And if I''m using both included and excluded what happens if a url does not contain any included or excluded keywords? Will such a url be downloaded or not? Perhaps the answer is not to use both included and excluded at the same time?
Thanks for your help,
(any server with comments in server name, any directory /*/ and any filename * after last slash.
(two keywords in the Included keywords list - the last one means any file with jane in filename).
Exclude overrides include. The general rule is the following - if any filter or limit doesn''t allow a file to be downloaded, it will be skipped.
Just to clarify on one last point. If I have zero include filters then all url''s are allowed (subject to exclude filters etc.). If I have one or more include filters then it will only allow urls that pass the include filter criteria. Is that correct?
Thanks yet again - you are a wonderful resource.
i want to filter all url contain "type=5" like http://www.example.com/Default.aspx?type=5&year=1393&month=8&day=17
i add *type=15* filter into url filters-> filenames but dont work .
why? please help me
(type=15 looks a mistake to me).