My organization is trying to archive a large number of Op/Ed pieces from various sources.
For some reason, Washington Post articles never download properly.
My project source URL uses a macro and tries to download all the existing OpEds from a Yahoo! archive with level limit 1:
http://story.news.yahoo.com/fc?tmpl=fc&cid=34&lp=2&ll=b2&pg={1..95}&mod=opinion___editorials&in=world&cat=mideast_conflict_archive
The Washington Post articles on these pages resolve to URLs of the form:
http://us.rd.yahoo.com/dailynews/fc/World/mideast_conflict_archive/opinion___editorials/SIG=125ts1bo8/*http://www.washingtonpost.com/wp-dyn/articles/A47947-2004Apr27.html
If you enter this URL directly into a browser, it reloads with a URL of:
http://www.washingtonpost.com/ac2/wp-dyn?pagename=article&contentId=A47947-2004Apr27¬Found=true
Try as I might, I CANNOT get OE 3.4 to download these pages.
Oleg once gave me a solution before, but I can no longer find it.
Please advise.
Regards,
Marc
Best regards,
Oleg Chernavin
MP Staff
Regards,
Marc
> Maybe it is because the URL Macro is entered incorrectly: {1..95}, while it should be {:1..95}. Does this help?
>
> Best regards,
> Oleg Chernavin
> MP Staff
Oleg.
> > Oleg.
I thought it was answered in the forum, but I don`t see it anywhere. I believe it was sometime between Aug `02 and Aug `03. If we corresponded only through email, it was probably with hallmarc@fastmail.fm or chiamarc@sbcglobal.net. Thanks again.
Marc
http://www.metaproducts.com/forum.asp?id=5004
Oleg.
Regards,
Marc
> Is it the following post?
>
> http://www.metaproducts.com/forum.asp?id=5004
>
> Oleg.
Oleg.
Thanks for referring me to the post, but sadly, the solution doesn`t seem to work any more. Would you be kind enough to investigate again, using the same problem description? Thanks.
Regards,
Marc
> We had some problems with the search and we had to disable this feature for a while. I hope we will fix them soon.
> > Oleg.
URL:
http://story.news.yahoo.com/fc?tmpl=fc&cid=34&lp=2&ll=b2&pg={:1..95}&mod=opinion___editorials&in=world&cat=mideast_conflict_archive
Level=2
URL Filters | Server and Directory sections - Load from all...
This loaded all pages well. With Level=1, some Washington Post pages were really not loaded.
Oleg.