|Peter Harkness||02/04/2011 04:12 am|
|If I untick "text" in the project properties, I know that text files (html etc.) are still retrieved to be parsed but are not saved. However, if text is unticked it looks like the URL filters are not applied.
I want to download from a single server and not follow links to other servers, BUT I want to leave "text" unticked, i.e. NOT save text files. However, I find that in this case the project accesses every server it can see by following links.
Is there a way to restrict a project to ONE server whilst "text" is unticked in the properties?
|Oleg Chernavin||02/04/2011 05:51 am|
|Please check the File Filters - Text category and set its Location field to either "Load using URL Filters" or "Load from the starting server". Then uncheck the Text category again.
|Peter Harkness||02/04/2011 09:48 am|
|That's what I did. I came in this morning to find google, bbc and about 30 other sites under the project folder!
|Oleg Chernavin||02/04/2011 09:53 am|
|Perhaps they are scripts or styles handled by the File Filters - User Defined category?
|Peter Harkness||02/04/2011 10:34 am|
|All categories are set to use URL filters. The "server" URL filter is set to the starting domain only.|
|Oleg Chernavin||02/06/2011 02:42 pm|
|Can you post the settings of the Project here (press Ctrl+C on it and paste into the message)? I will try the download myself and see what happens.
|Peter Harkness||02/09/2011 01:23 pm|
|The link is internal, but I have found the following:
For a project URL of http://www.mycompany.co.uk
In the "Test URL against URL Filters" option something weird is happening. The URL filter is set to be restricted to the starting domain. If I type "google" in the test box it says "The URL is rejected reason: URL filters | server"
However, if I type in "http://www.google.co.uk" it says "The URL will be downloaded". The same happens as follows:
"bbc" - not downloaded
"http://www.bbc.co.uk" - downloaded
And I tried this:
"rubbish" - not downloaded
"rubbish.co.uk" - downloaded.
OE Pro seems to think that the starting domain is ".co.uk", NOT "mycompany.co.uk". If I make it "rubbish.org.uk", the test says that it won't get downloaded. It looks like a problem where the starting URL has a domain name with 3 parts or more.
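To illustrate the suspected cause, here is a minimal Python sketch. It is not Offline Explorer's actual code: it only assumes that the filter compares the last two labels of each host name, which reproduces the test results above. The function names and the tiny suffix set are hypothetical; a real implementation would consult the full Public Suffix List.

```python
# Hypothetical, deliberately incomplete list of two-label public suffixes.
MULTI_LABEL_SUFFIXES = {"co.uk", "org.uk", "ac.uk", "gov.uk"}


def naive_domain(host):
    """Assumed buggy rule: the domain is always the last two labels."""
    return ".".join(host.lower().rstrip(".").split(".")[-2:])


def registrable_domain(host):
    """Suffix-aware rule: keep three labels when the suffix itself has two."""
    labels = host.lower().rstrip(".").split(".")
    if len(labels) >= 3 and ".".join(labels[-2:]) in MULTI_LABEL_SUFFIXES:
        return ".".join(labels[-3:])
    return ".".join(labels[-2:])


def same_domain(start_host, candidate_host, domain_of=registrable_domain):
    """True if both hosts fall under the same domain according to domain_of."""
    return domain_of(start_host) == domain_of(candidate_host)


if __name__ == "__main__":
    start = "www.mycompany.co.uk"
    for candidate in ("www.google.co.uk", "rubbish.co.uk", "rubbish.org.uk"):
        print(
            candidate,
            "naive:", same_domain(start, candidate, naive_domain),
            "suffix-aware:", same_domain(start, candidate),
        )
```

With the naive rule, "www.mycompany.co.uk" and "www.google.co.uk" both reduce to "co.uk" and so appear to share a domain, while "rubbish.org.uk" does not. The suffix-aware rule keeps "mycompany.co.uk" and rejects all three.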
Project pasted below.
|Oleg Chernavin||02/10/2011 08:05 am|
|Thank you for the settings! I found and fixed the error. Here is the updated oe.exe file:
|Peter Harkness||02/11/2011 04:10 am|
|That worked great, thanks!|
|Oleg Chernavin||02/11/2011 05:48 am|
|You are welcome!
|Peter Harkness||02/15/2011 06:26 am|
|There is still a problem. The changes you made in the 5.9.3321 Service Release 4 file you gave me have broken other server URL filters. Try the project below which is for a start URL of :
In version 5.9.3321 SR4, typing www.google.com into the test box gives a result that the URL WILL be downloaded, which is wrong as this project is restricted to servers in the starting domain.
However, if I reinstall the old 5.9.3318 Service Release 3 binary, the URL filter says "The URL is rejected. Reason: URL Filters | Server".
So version 5.9.3318 works for xxx.com domain filters but not for xxx.co.uk domains.
Version 5.9.3321 works for xxx.co.uk domains but not for xxx.com domains!
|Oleg Chernavin||02/18/2011 04:59 pm|
|I fixed that. Sorry for the silly bug!