I have dug in the forums... and i''m still lost-
I have a site.. many thousands of pages even when set to download only two levels- I badly, however, need to do a third level but need to have the filter made tighter at this point to avoid hitting the 10^25 number of pages!!
Is there a way I can download to two levels, then change the filters such that OE skips through the existing files and only downloads the next level where the URL contains a certain word? For example: "image", "audio", "video" ??
Thanking you in anticipation
Could data mining be used here? -- How about downloading to 2 levels, then parsing the downloaded content and saving all the URL''s contained into a single list (mining if I understand correctly), then downloading this extensive list by an additional level with the tighter filters??
Please please help..
This will load the links from c:\file.txt. Links should be one per line in this file.
I''m battling with TextPipe to extract the URL''s as the downloaded files made by OE as don''t have traditional URLs... I.E. Textpipe is not picking them up-
Can you advise me on a way to mine the vast numbers of URL''s in my OE download folder?
Thanks yet again... (This problem is almost solved)
It is hard to tell without looking at particular sites.
My problem with 3 levels is the vast number of pages that are downloaded... 2 levels keeps the downloads relevant to topic and allows for rapid updates. Also, the images are on html pages and not links directly to the image files...
I really need to figure out how to parse out the URL''s and download only further URL''s containing "image:"
Thanks (again again) again
Hi Oleg... back again..
I went a figured out TextPipe... Amazingly usefull program: Managed to generate a google sitemap, extract the urls, then filter to select URL''s containing image files only...
Anywho.. now I have a 1MB text file with URL''s pointing to images only. So my question now..
If I set OE to download these images, will they placed into the propper folder structure that already exists such that links from the exported site will actually point to the images?
If you want to export them all together - select the Project that downloaded the site and the images Project with Ctrl+click and then do the export.