Offline Explorer and the Internet Archive
|Terrance Mcrae||09/29/2011 01:12 am|
|Hello. I know that this issue has been tackled before, but I am having trouble retrieving messages from the Wayback Machine. I've successfully used the information here: http://forum.metaproducts.com/Post.aspx?ID=5381 to copy two websites from the wayback machine, but now I am working on the last one for this project and it's not working with this particular site.
If I use the default settings, of course I just get the first page. If I use any other settings, such as the ones in the older thread or variations, I get what looks like a squatter's page and nothing else. If I watch the output at the bottom the screen, it only seems to follow six links right off the main page. The domain in that feedback is the right domain, but the information I am retrieving is not.
P.S. I know that is long and confusing, so if there is something I need to clarify, just ask.
|Oleg Chernavin||09/29/2011 02:43 am|
|Can you please give me the details on the site you are downloading to reproduce this?
|Terrance Mcrae||09/29/2011 12:37 pm|
|Sure, the site is: http://web.archive.org/web/20061205022300/http://realmsofevil.net/|
|Oleg Chernavin||10/01/2011 10:07 am|
|I downloaded it with Level=2 using the latest Offline Explorer Pro:
There are several problems with the site:
1. Many links from the first page lead to another site, like:
Perhaps, you will need to add greyhawkonline.com to the URL Filters - Directory - Included list.
2. Most of the Left-side menu links show the following error:
The server encountered an internal error and was unable to complete your request.
Could not connect to JRun Server.
It looks like some problem on the original server, which was retrieved and saved by web.archive.org site.
|Terrance Mcrae||10/01/2011 01:58 pm|
|I've recently installed the 6.0 beta, and I'll try again using those settings and a different save point to hopefully work around the JRun problem. Thanks|
|Oleg Chernavin||10/01/2011 02:20 pm|
|No, JRun problem cannot be avoided - it was recorded this way by archive.org site, unfortunately.