Parsing question

Coriolis
12/19/2008 01:33 pm
What does it mean when OEP says it is parsing a downloaded JPG or PNG file? Is it just checking that the file exists or is it actually looking through the file for something?
Oleg Chernavin
12/19/2008 01:43 pm
Parsing means extracting links from a file and changing them for offline browsing. However image files should not be parsed. Unless they have .png/.jpg extension, but in fact are HTML pages.

How did you see that it parses exactly that file? From logs?

Best regards,
Oleg Chernavin
MP Staff
Coriolis
12/19/2008 02:20 pm
I see them commonly when I turn on "Queue Messages" in the log. For example:

QUEUE 12/19/2008 2:05:48 PM Parsing files added.
QUEUE 12/19/2008 2:05:48 PM Parsing (1513). http://zip.4chan.org/w/res/608831.html
QUEUE 12/19/2008 2:05:53 PM Parsing end.
QUEUE 12/19/2008 2:05:53 PM Parsing files added.
QUEUE 12/19/2008 2:05:53 PM Parsing (1557). http://zip.4chan.org/w/src/1229314475416.jpg
QUEUE 12/19/2008 2:05:54 PM Parsing end.
QUEUE 12/19/2008 2:05:54 PM Parsing files added.
QUEUE 12/19/2008 2:05:54 PM Parsing (1556). http://zip.4chan.org/w/src/1229316875964.jpg
QUEUE 12/19/2008 2:05:54 PM Parsing end.
QUEUE 12/19/2008 2:05:54 PM Parsing files added.

These are not HTML stubs but valid images.

On a related note, adding "skipparsingfiles=jpg" to the project results in long streams of errors like this:

QUEUE 12/19/2008 2:16:23 PM Parsing files added.
QUEUE 12/19/2008 2:16:23 PM Parsing (11). http://zip.4chan.org/w/src/1229709374056.jpg
ERROR 12/19/2008 2:16:23 PM Error reading from file: Error code=00000003 Path not found
QUEUE 12/19/2008 2:16:23 PM Parsing end.
QUEUE 12/19/2008 2:16:23 PM Parsing files added.
QUEUE 12/19/2008 2:16:23 PM Parsing (10). http://zip.4chan.org/w/src/1229584295737.jpg
ERROR 12/19/2008 2:16:23 PM Error reading from file: Error code=00000003 Path not found
QUEUE 12/19/2008 2:16:23 PM Parsing end.
QUEUE 12/19/2008 2:16:23 PM Parsing files added.
QUEUE 12/19/2008 2:16:23 PM Parsing (9). http://zip.4chan.org/w/src/1229586025839.jpg
ERROR 12/19/2008 2:16:24 PM Error reading from file: Error code=00000003 Path not found
QUEUE 12/19/2008 2:16:24 PM Parsing end.

I also hit this when using "skipparsingfiles=swf", so it''s not restricted to a specific file type. This has happened for several releases but as it doesn''t hurt anything I haven''t worried about it.
Oleg Chernavin
12/19/2008 04:28 pm
Really strange. I tested on the same URL and my Offline Explorer Pro didn''t parse the image. Can you give me more information on how to reproduce this? Does it happen on other sites?

Oleg.