I do not want files ending in .htm, .aspx, and other web files. I find that if I uncheck .aspx, the download stops at one file. It seems as if the filters are applied to the crawling process and not just the download process. I also tried URL filters using keywords. The same basic problem occurs.
How can I avoid downloading all of the web junk files (aspx, htm...)?
John
Some of the sites I am downloading are SharePoint sites... I wonder if that is a problem.
I seem to get all of the desired documents but I get all the web junk that I don't want.
John
This will allow downloading all web pages, but they will not be stored on your disk. If you don't download the web pages (ASPX, HTM files), how could Offline Explorer find links to the files you need? It has to load them and follow the links.
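
To illustrate the idea, here is a rough sketch in Python. It is not Offline Explorer's actual code: the extension list, the `fetch` callback, and the example.test URLs are assumptions made up for this example. Pages are loaded and parsed for links, but only the non-page files are kept.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
import posixpath

# Hypothetical list of extensions treated as "web pages": loaded for their
# links, but never stored. "" covers URLs with no extension at all.
PAGE_EXTENSIONS = {".htm", ".html", ".aspx", ""}


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def is_page(url):
    """True if the URL's extension marks it as a web page."""
    ext = posixpath.splitext(urlparse(url).path)[1].lower()
    return ext in PAGE_EXTENSIONS


def crawl(start_url, fetch):
    """Crawl from start_url; fetch(url) must return the body as bytes.

    Returns {url: body} for the documents only. Pages are fetched and
    parsed so their links can be followed, then discarded.
    """
    stored = {}
    seen = {start_url}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        body = fetch(url)
        if is_page(url):
            parser = LinkCollector()
            parser.feed(body.decode("utf-8", errors="replace"))
            for href in parser.links:
                target = urljoin(url, href)
                if target not in seen:
                    seen.add(target)
                    queue.append(target)
        else:
            stored[url] = body  # a real tool would write this to disk
    return stored


# A tiny in-memory "site" stands in for real HTTP requests.
site = {
    "http://example.test/index.aspx":
        b'<a href="docs/list.htm">Docs</a> <a href="a.pdf">A</a>',
    "http://example.test/docs/list.htm":
        b'<a href="report.docx">Report</a> <a href="../index.aspx">Home</a>',
    "http://example.test/a.pdf": b"%PDF-",
    "http://example.test/docs/report.docx": b"PK",
}
stored = crawl("http://example.test/index.aspx", site.__getitem__)
```

If .aspx were filtered out before loading, the crawl would stop at the first page, because the links to the documents are only found inside it. That matches the "stops at one file" behavior described above.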
Best regards,
Oleg Chernavin
MP Staff
Thanks. I will try that.
My primary purpose is to download the desired documents for indexing by an advanced search tool called dtSearch.
I do not need to browse offline as I will be using dtSearch to review my documents.
I selected Online Translation and Mark online links as nofollow... in case I find a need to link back to the original source.
Are you aware of any issues in downloading files stored in SharePoint sites?
John
SharePoint sites are not easy to download. They use a special kind of linking, the so-called doPostBack. Offline Explorer supports it, but the mechanism changes with new versions of the SharePoint engine, so I often add improvements to handle such links.
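
For readers unfamiliar with doPostBack: in ASP.NET, such a link does not point at a URL. It calls `__doPostBack(target, argument)`, which puts the two values into the hidden form fields `__EVENTTARGET` and `__EVENTARGUMENT` and submits the page's form. Below is a minimal sketch in Python of how a crawler might turn such a link into form data. It is not how Offline Explorer does it; the helper names and the sample control ID are hypothetical, and real SharePoint pages may need more than this.

```python
import re

# Matches __doPostBack('target','argument') with single or double quotes.
POSTBACK_RE = re.compile(
    r"""__doPostBack\(\s*(['"])(.*?)\1\s*,\s*(['"])(.*?)\3\s*\)"""
)


def parse_postback(href):
    """Return (event_target, event_argument), or None for an ordinary link."""
    match = POSTBACK_RE.search(href)
    if match is None:
        return None
    return match.group(2), match.group(4)


def build_postback_form(href, hidden_fields):
    """Build the form data that the postback link would submit.

    hidden_fields holds the hidden inputs copied from the page
    (for example __VIEWSTATE); they must be sent back unchanged.
    """
    parsed = parse_postback(href)
    if parsed is None:
        return None
    target, argument = parsed
    form = dict(hidden_fields)
    form["__EVENTTARGET"] = target
    form["__EVENTARGUMENT"] = argument
    return form


# Hypothetical link as it might appear in a paged document list.
href = "javascript:__doPostBack('ctl00$grid$lnkNext','Page$2')"
form = build_postback_form(href, {"__VIEWSTATE": "abc123"})
```

The resulting dictionary would be sent as a POST request to the page's form action, which is why these links cannot be followed like normal URLs.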
If some link is not followed, please let me know. It will not be easy to solve (because you work with an internal site), but I will do my best.
Oleg.