I experienced sometimes this (big) problem.
I download a site, the main folder has lots of subdiredtory, I browse the site downloaded and everything works great.
But (obviously) I wanna Export:
- without subdirectory
- with MS-DOS 8+3 filenames
It happens that two (or more) files - that were in different directory before the Export - aren't renamed well: it happens that two (or more) links lead to the same page.
Only that one.
If I check the Export directory I find only that HTM file, the program lost the other pages and kept only the last one exported.
If I export keeping every subdirectory I don't experience that problem
http://www.metaproducts.com/
http://www.metaproducts.com/mp/
I set Level to 0 and downloaded it. Export created two files - default.htm and default0.htm - just as expected.
How I can reproduce your issue?
Best regards,
Oleg Chernavin
MP Staff
http://www.network54.com/Forum/79950/
Just start this project (we care only about htm files)
- URLs:
http://www.network54.com/Forum/79950/page-14
http://www.network54.com/Forum/79950/page-167
- Level Limit:
1
- URLs substitution:
>first
URL: *
Replace: %*.
With: .
>second
URL: *
Replace: %*
With:
Both applied to FILENAMES
Browsing offline everything works
at page 14) Harry "O' guaglione" leads to Harry "O' guaglione", FILE: Harry++
at page 167) Harry's surname leads to Harry's surname, FILE: Harry
Fine
Now EXPORT, without subdirectories and MS-Dos filenames and you get the problem
at page 14) Harry "O' guaglione" leads to Harry "O' guaglione", FILE: harry.htm, OK :-)
at page 167) Harry's surname leads to Harry "O' guaglione", FILE: harry.htm, WRONG :-(
It lost a link, no matter if it would have the right HTM file in the export directory.
file:///D:/export/harryq~1.htm
file:///D:/export/harry_~1.htm
This is what I got with your Project setup. There should be ~1.htm in filenames surely.
My Project settings:
[Object]
OEVersion=Pro 6.4.0.3839
Type=0
IID=62386
Caption=http://www.network54.com/Forum/79950/page-14
URL=http://www.network54.com/Forum/79950/page-14http://www.network54.com/Forum/79950/page-167
Lev=1
Weekday=257
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwfwebp
FTVideo.Exts=mpgavianimpegmovflvfliflcvivrmramrvasfasxwmvm1vm2vvobsmilmp4m4v
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaapeoggm4aaif
FTArchive.Exts=7zziparcgzzarjlhalayleirarcabtarpakacejarpdftgzexeiso
FTUDef.Exts=jsaxdcssssivbsdtdxslswfclassent
FTText.B=ooxooo
FTImages.B=xoxooo
FTVideo.B=ooxooo
FTAudio.B=ooxooo
FTArchive.B=ooxooo
FTUDef.B=ooxooo
FTOther.B=ooxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,3,0,3,0,0,0,0,0,0,0,0
NotIgnoreLogout=False
RSrvsBx=1
RPathBx=1
RProt=255
LastStart=62:255:21:154:178:25:228:64:
LastEnd=239:88:167:156:178:25:228:64:
PrjStart=6:214:172:141:218:252:227:64:
LastStarted=13.09.2012 13:57:05
LastEnded=13.09.2012 13:57:32
S200=76
SPar=76
SSav=76
SLast=200
SSiz=1553073
SMdf=76
SHTML=74
SSuccDowns=1
LFiles=76
LSize=1553073
SubstsB=KgklKi4JLglYDQoqCSUqCQlYDQo=
ImgDim=0,0,0,0
PrevURL=http://www.network54.com/Forum/79950/page-14
ConvertRSS=True
Exported=13.09.2012 13:58:04 - d:\export\
harry.htm --> Harry's surname
harry0.htm --> Harry's surname
harry1.htm --> Harry "O' guaglione"
But both links at page 14 and 167 lead to the same file, harry.htm
:-(
I get the ~1 if I do not substitute every %*
When I substitute every %* with nothing, filenames often are very short
With short filenames I get this problem
Oleg.