I have succeeded in downloading 203,770 files with 4.014 GB size.
1. When I try to view past the link from the fist page in Offline Explorer Pro browser, it launches IE (current version) and cannot find the files offline (these links are set to be opened in a new window in the html). I have checked the links, and they all reside on my harddrive but cannot be displayed. IE just shows 170.... and keeps churning away with no error message.
2. I have tried to follow the recommendations from Oleg to client Naomi in this forum of 12/5/2010 entitled "Please help me with re-constructing a site from Wayback Machine!!! garage-door-specialists.co.uk"
3. Here are the settings I have made for this download in Offline Explorer Pro:
(This site, http://www.ciadvertising.org, was downloaded to Internet Archive from 2001-2009 so there are many copies there)
checked load only within this server
unchecked Load files only from starting directory and below
nothing done here--used default values
setup rule to remove numbers and and unchecked to apply to files
(I did not do this quite correctly (will re-run) as the numbers were not replaced. Did the test on this rule and it works to remove numbers (dates of download on wayback machine) from files:
I greatly appreciate your help as I thought this site was lost forever and represents my life work as an academician (let alone my students' work). As you may know, the recommended download program by Internet Archive site no longer works with the changed wayback machine for downloading, and they indicate it will not work until after August 2011.
BTW, why when I attempt to open a .gif, e.g., from offline downloaded content in Photoshop, to verify it is on my haddrive, I get an message saying it cannot open the format?
I will do the download and try to see what is wrong.
Regarding GIF files. Yes, the site uses lots of redirects when you request a URL, it points you to another timed version of a file. So, many of the downloaded files are small HTML pages with redirections.
You may open them to see the exact location of the GIF and other such files.
I am trying to do the same thing. Is there anyway to get it so when I export the files they go into one directory instead of all into to date stamped folders?
Then redownload the project and export it.
Thanks that worked. Only problems is I'm getting 1000's of files and pages from years that I don't want. One site is archived for 2007 and I have this in URL exceptions:
but it still downloads pages from older years than 2007.
Many thanks for your help.
Exported=28/10/2011 19:10:24 - D:\directory\domain\