URL Filters

Mike D. 03/28/2011 08:14 pm
Hello,
I had almost finished downloading the site, but then I suspended the download to a file. After resuming, OE did not recognize the URL filters and downloaded files that the filter list prohibits. What can I do? I need to skip these files (more than 10,000 of them).

Thank you !
Oleg Chernavin 03/29/2011 09:17 am
Resuming from a file should remove the URLs you excluded in URL Filters. Could you please give me a few examples of the URLs that were not removed, along with your settings?

Thank you!

Best regards,
Oleg Chernavin
MP Staff
Mike D. 03/29/2011 12:57 pm
The URLs were not removed from the filter list; they are simply not recognized!
None of the URLs I wrote in the list are recognized!
For example:
http://www.thewebsite.com/forum-*
http://www.thewebsite.com/membres-*

Project settings:
[Object]
OEVersion=Enterprise 5.9.0.3318
Type=0
IID=7019
Caption=myCustomCaption
URL=http://www.thewebsite.com/
Lev=1000001
Weekday=257
LimTSize=10000
LimNumber=5000
LimTime=100
SkipMedia=True
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwfwebp
FTVideo.Exts=mpgavianimpegmovflvfliflcvivrmramrvasfasxwmvm1vm2vvobsmilmp4m4v
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaapeoggm4aaif
FTArchive.Exts=7zziparcgzzarjlhalayleirarcabtarpakacejarpdftgzexeiso
FTUDef.Exts=jsaxdcssssivbsdtdxslswfclassent
FTText.B=ooxooo
FTImages.B=ooxooo
FTVideo.B=xoxooo
FTAudio.B=xoxooo
FTArchive.B=xoxooo
FTUDef.B=ooxooo
FTOther.B=ooxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,3,0,3,0
NotIgnoreLogout=False
RSrvsBx=1
RPathBx=1
RProt=255
LastStart=11:133:225:201:225:214:227:64:
LastEnd=57:150:40:225:225:214:227:64:
LastStarted=29/03/2011 01:20:29
LastEnded=29/03/2011 01:24:34
SAbr=13587
LFiles=66486
Stopped=True
Search=True
ImgDim=0,0,0,0
PrevURL=http://www.thewebsite.com/
SkipURLs=http://www.thewebsite.com/boutique-*http://www.thewebsite.com/emploi-*http://www.thewebsite.com/etudes-*http://www.thewebsite.com/evenements-*http://www.thewebsite.com/forum-*http://www.thewebsite.com/membres-*http://www.thewebsite.com/news-*
ConvertRSS=True
Icon=12
LIndexed=False
IndexFiles=False


----
Thank you !
Oleg Chernavin 03/29/2011 12:59 pm
Could you also open the .wdq file in Notepad and find the entries with URLs that should have been removed? They are sections that begin with an [Object] line, like the settings above. Please paste a few examples of such objects here.

Oleg.
Mike D. 03/29/2011 05:39 pm
Hello,
I don't really understand your request.
You can download the whole file here: http://www.mediafire.com/?3ucwu8d0gn25lvn

PS: I want to skip (not download) these URLs: http://www.thewebsite.com/membres*, forum*, evenements*, news*, emploi*!

Thank you for your help !
Oleg Chernavin 03/30/2011 07:57 am
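I used your Project settings (replacing www.thewebsite.com with siteduzero.com). I loaded the .wdq file and found no /news- links. There were /membres- links, but none that begin with http://siteduzero.com/membres-....

See this example, where membres- appears only in the file name at the end of a longer path, so the filter http://www.thewebsite.com/membres-* does not match it: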
http://www.siteduzero.com/tutoriel-3-351765-debuter-sur-adobe-photoshop.html/Templates/images/designs/2/tutos/membres-292.html

To skip such URLs, use URL Filters - Filename - Excluded list:

membres-*
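
For illustration only, here is a minimal Python sketch of why the full-URL pattern misses the example above while the file-name pattern catches it. It assumes the filters behave like simple glob wildcards, which is an assumption and not a description of how Offline Explorer actually matches; the function names matches_url_pattern and matches_filename_pattern are hypothetical.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlsplit
import posixpath


def matches_url_pattern(url, pattern):
    """Glob-match the pattern against the whole URL (assumed filter semantics)."""
    return fnmatchcase(url, pattern)


def matches_filename_pattern(url, pattern):
    """Glob-match the pattern against only the last path segment of the URL."""
    filename = posixpath.basename(urlsplit(url).path)
    return fnmatchcase(filename, pattern)


# The example URL from this thread: "membres-" appears only in the file name.
example_url = (
    "http://www.siteduzero.com/tutoriel-3-351765-debuter-sur-adobe-photoshop.html"
    "/Templates/images/designs/2/tutos/membres-292.html"
)

# The full-URL pattern expects "membres-" right after the host, so it misses.
print(matches_url_pattern(example_url, "http://www.siteduzero.com/membres-*"))  # False

# The file-name pattern looks only at "membres-292.html", so it matches.
print(matches_filename_pattern(example_url, "membres-*"))  # True
```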

Oleg.
Mike D. 03/30/2011 09:15 am
Hello,
Thank you again,
But it is still not working correctly: it downloads all the files whose URLs I don't want to download.
As you said, I added membres-* ... in URL Filters | Files | Exclude!

But these URLs are already in the queue, and I think this is the reason. Must I re-download (upload? which settings in the Project Properties dialog?) to clear the queue?

Thank you !
Oleg Chernavin 03/30/2011 09:24 am
Yes, after adding that keyword, you can Resume From File and such URLs will not be in the queue anymore.
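
As a rough sketch of the idea (not Offline Explorer's actual code), pruning an already saved queue against a file-name exclusion list could look like the Python below. The prune_queue function, the glob-style matching, and the sample queue are all assumptions made for illustration.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlsplit
import posixpath


def prune_queue(queue, excluded_filenames):
    """Return the queued URLs whose file name matches none of the excluded patterns."""
    kept = []
    for url in queue:
        filename = posixpath.basename(urlsplit(url).path)
        if any(fnmatchcase(filename, pattern) for pattern in excluded_filenames):
            continue  # excluded by the file-name filter, drop it from the queue
        kept.append(url)
    return kept


# Hypothetical saved queue with one wanted page and one excluded page.
saved_queue = [
    "http://www.siteduzero.com/tutoriel-3-351765-debuter-sur-adobe-photoshop.html",
    "http://www.siteduzero.com/tutoriel-3-351765-debuter-sur-adobe-photoshop.html"
    "/Templates/images/designs/2/tutos/membres-292.html",
]

remaining = prune_queue(saved_queue, ["membres-*"])
print(len(remaining))  # 1
```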

Oleg.
Mike D. 03/30/2011 11:10 am
Yes,
It works well now! Love this software!

Thank you very, very much!

Best regards,
Mike.
Oleg Chernavin 03/30/2011 01:09 pm
You are welcome!

Oleg.