i`ve always wondered - if i include say word "home" in both include filter list, and exclude filter list,
which one will be used?
also, what`s this IgnoreLogoutLinks that i`ve seen somewhere in this forum?
ParsingMax in registry doesnt seem to work? =/
What`s about Ctrl+Alt+Login that i`ve read?
are those some kind of tips that are listed somewhere i can`t find?
auto suspending to file command doesn`t seem to work for me? :/
I have it like this:
Also, how do i continue downloading, if i don`t have file suspended, and the session on site timed out? i can relogin to it, but how do i continue it so that existing files are not downloaded? if i select download only new files, then it doesnt work, because OE tries to download media files without opening the page they belong to, and the site kicks me out because of that :(
I want to stop downloading as soon as "User Signin" appears in any page. So I enter
that text in Content Filters page, and check "stop download .." check box. And now all of
sudden NO PAGES WHATSOEVER are being saved! Log says rejected cause of file filters. WHY? :(
Furthermore, if I select to download only missing files, then is it enough to put file in the download directory and it wont be redownloaded, or does it also check those wd3 etc files? How do those work anyway?
would be nice to see all the links in project that were not downloaded (eg scan all the files in proper, and report all the files linking to the online site, so that i can see if any content is missing).
sometimes media files are served by php scrip, for example:
when it`s this way, OE doesnt know this is media file, and "skip media files" fails (they are redownloaded).
i think it shouldnt just judge by extension, but also by the way it`s used..
So I found another problem in exported CHM file.
The images on my project are being served like this:
and they download like that filename aswell, which is fine.
Upon export to CHM, AFAIK they get converted to "gallerypage.show_imageid.XXX", (eg "," gets removed). And that works fine. However, in SOME pages (i have no idea what makes it wrong) the "," does NOT get replaced, and thus the image doesnt load. even more, after entering such page, i can no longer navigate around - i get IE errors =/
IgnoreLogoutLinks is a special filter that disallows to follow any link that has the following in the URL:
and other words.
ParsingMax still works, but it may add few extra files to the parsing queue, but not more than the ParsingMax + Number of connections.
Ctrl+Alt+Login and many other tricks can be found in the program help or in the Welcome page - Tutorials section.
SuspendToFileEvery - you need to have several lines:
Content Filters - please check another box to allow pages with no such words to be saved.
To skip a file download, it is enough to have it in the Download Directory. No other file is necessary.
>would be nice to see all the links in project that were not downloaded (eg scan all the files in
>proper, and report all the files linking to the online site, so that i can see if any content is
You can do this easily - Ctrl+F5, F9 - then you will have all missing links in the Queue tab.
I am not sure about this, because such can also lead to Web pages that have redirects, messages that there no such link, etc.
Regarding CHM export - can you please give me an example of a page with such links?
but i will need to wait till it parses all the files no? and then it`ll auto start so i need to manage to quick f9 quickly.. besides, i might not even have the access to the site anymore :/
> >img src="showimage.php?id=1"
> I am not sure about this, because such can also lead to Web pages that have redirects, messages that there no such link, etc.
but so can image.jpg - it can have redirect, error message, etc..?
> Regarding CHM export - can you please give me an example of a page with such links?
sorry, i cant - my account expired, and it was private site anyway.. i can only send you some example downloaded pages?
Yes, you can send me the saved HTML page to firstname.lastname@example.org
> Yes, you can send me the saved HTML page to email@example.com
But there are 20k or so files, it`ll take a long time to wait for that, and i`ll have to sit by and monitor :(
i`ll try to make a sample project with just few files and send it to you
yes, it`s a working address :)
And i`ve replied - I can also send you exported chm from those few files that still exhibits the problem..