Strange url that led to non-stopping downloading

Author Message
Steven 11/17/2010 03:03 am
When downloading the project (settings are given below), at the end of downloading when a the project is about to be finished, there would always be two links in the downloading list that POB is downloading all the time, something like:

http://www.foreignpolicy.com/articles/2010/10/11/fp%20-%20article%20-%20fp%20-%20article%20-%20fp%20-%20article%20-%20fp%20-%20article%20-%20fp%20-%20article%20-%20spacer.gif?print=no&hidecomments=yes&page=full

I tried open it online, but it was a link that does not exist. I wonder why that happens.

Project setting:


[Object]
OEVersion= 5.9.0.3284
Type=0
IID=7060
Caption=Foreign Policy
URL=http://www.foreignpolicy.com/issues/182/contents
MVer=5
Lev=1
Weekday=257
LimTSize=10000
LimNumber=5000
LimTime=100
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwf
FTVideo.Exts=mpgavianimpegmovflvfliflcvivrmramrvasfasxwmvm1vm2vvobsmilmp4
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaape
FTArchive.Exts=ziparcgzzarjlhalayleirarcabtarpakacejarpdftgzexe
FTUDef.Exts=jscssssivbsdtdxslswfclassent
FTText.B=ooxooo
FTImages.B=ooxooo
FTVideo.B=xoxooo
FTAudio.B=xoxooo
FTArchive.B=xoxooo
FTUDef.B=ooxooo
FTOther.B=ooxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,3,0,3,0
NotIgnoreLogout=False
RSrvsBx=1
RPathIn=articles2010 xx
RProt=255
LastStart=138:65:57:151:118:198:227:64:
LastEnd=109:9:171:156:118:198:227:64:
LastStarted=2010-11-17 16:56:34
LastEnded=2010-11-17 16:57:32
S200=457
SAbr=2
SPar=147
SSav=457
SLast=200
SSiz=38059642
SMdf=457
SHTML=102
SSuccDowns=5
LFiles=457
LSize=37852904
Stopped=True
Flags=1
SubstsB=aHR0cDovL3d3dy5mb3JlaWducG9saWN5LmNvbS9hcnRpY2xlcy8qCSoJKj9wcmludD1ubyZoaWRlY29tbWVudHM9eWVzJnBhZ2U9ZnVsbA0K
ApplyAllSubsts=True
ImgDim=0,0,0,0
PrevURL=http://www.foreignpolicy.com/issues/182/contents
SkipURLs=
ConvertRSS=True
LIndexed=False
IndexFiles=False
Oleg Chernavin 11/17/2010 07:03 am
This goes from some script. The program finds anything that looks like a link and tries to get it. Please set a limited number of attempts in the Options dialog - like 10. After trying 10 times such links will be abandoned.

Best regards,
Oleg Chernavin
MP Staff
Steven 11/17/2010 06:29 pm
But it seems that POB WAS actually downloading the link, just it's too large. (I mean the size of the files is increasing all the time when the last two links were being downloaded).
Oleg Chernavin 11/18/2010 08:33 am
Yes, I see now. The page has script that refers to this spacer.gif image again and again and the URL length was increasing in every page it loads. I will think how to add a workaround for such cases.

Oleg.
Steven 11/18/2010 10:09 am
Thank you! Looking forward to your reply.