OE doesn't seem to use Filters

Andres Kohlbach
11/24/2005 08:51 am
Hi,

I am trying to download our own website (www.amaxa.com) for archival purposes.
To prevent loading the same page multiple times due to cross-referencing, I added keywords like "authors" as URL filename filters. I also selected "Load using URL filters settings" in File filters -> Location.
Nonetheless, when I started the download, after more than 12 hours OE was still downloading, with over 600000 files in the queue. Among the downloaded files was "http://www.amaxa.com/citations.html?tx_mwaxcitations_pi1%5Bauthors%5D=Aluigi+M&no_cache=1", so it doesn't seem to respect the filter settings.
Any ideas why?
Thanks!
Oleg Chernavin
11/24/2005 01:31 pm
Can you please post your Project settings here? Please select the Project, click the Copy button on the toolbar and then paste the result into a forum message. I will see what is wrong with it. Thank you!

Best regards,
Oleg Chernavin
MP Staff
Andres Kohlbach
11/28/2005 06:38 am
Here they are:

[Object]
OEVersion=Standard 3.9.0.2143
Type=0
IID=7015
Caption=amaxa Website
URL=http://www.amaxa.com/
Lev=1000001
Weekday=257
User=ako@amaxa.com
Psw=te5TuS§r
LimTSize=10000
LimNumber=5000
LimTime=100
EnableForms=True
LTMethod=1
pswMethod=4
FTText.Exts=htmlhtmaspaspxjspstmstmlidcshtmlhtxtxttextxspxmlrxmlcfmwmlphpphp3
FTImages.Exts=gifjpgjpegtiftiffxbmfifbmppngipxjp2j2cj2kwbmplwf
FTVideo.Exts=mpgavianimpegmovfliflcvivrmramrvasfasxwmvm1vm2vvob
FTAudio.Exts=wavriffmp3midmp2m3uravocwmaape
FTArchive.Exts=ziparcgzzarjlhalayleirarcabtarpakacejarpdf
FTUDef.Exts=jscssssivbsdtdxslswfclass
FTText.B=ooxooo
FTImages.B=ooxooo
FTVideo.B=ooxooo
FTAudio.B=ooxooo
FTArchive.B=ooxooo
FTUDef.B=ooxooo
FTOther.B=ooxooo
FTSizes=0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
RSrvsBx=3
RFileBx=2
RFileEx=advancedsearchauthorssource xxx
RProt=127
LastStart=31:225:102:228:246:226:226:64:
LastEnd=71:78:136:205:12:227:226:64:
S200=86146
S304=703
S400=3
SAbr=6109056
SPar=86237
SSav=86146
SLast=200
SSiz=2002566420
SMdf=68399
LFiles=86851
LSize=2002567917
Stopped=True
ImgDim=0,0,0,0
PrevURL=http://www.amaxa.com/
SkipURLs=
ParseComplexScripts=True
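
For anyone who wants to inspect a pasted Project like this outside OE, the dump can be read as INI-style text. The following is only a sketch: it uses a short excerpt of the settings above, and it makes no assumption about what keys such as RFileBx or RFileEx mean inside Offline Explorer. It simply prints the stored value so that the exact characters (including the space before "xxx") are visible.

```python
import configparser

# Excerpt of the Project settings pasted above (not the full dump).
PROJECT_TEXT = """\
[Object]
Caption=amaxa Website
URL=http://www.amaxa.com/
RSrvsBx=3
RFileBx=2
RFileEx=advancedsearchauthorssource xxx
"""

# interpolation=None keeps any '%' in values literal;
# optionxform=str preserves the original capitalisation of the keys.
parser = configparser.ConfigParser(interpolation=None)
parser.optionxform = str
parser.read_string(PROJECT_TEXT)

settings = dict(parser["Object"])

# repr() makes whitespace inside the value easy to spot.
print("RFileEx =", repr(settings["RFileEx"]))
```

In the pasted form the excluded keywords appear run together on one line, so the individual entries cannot be told apart from this value alone.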

Thanks

Andres
Oleg Chernavin
11/28/2005 09:24 am
Can you please tell me the URL of a page that has a link to http://www.amaxa.com/citations.html?tx_mwaxcitations_pi1%5Bauthors%5D=Aluigi+M&no_cache=1 or to another unwanted page?

Oleg.
Andres Kohlbach
11/29/2005 03:43 am
Hi,

http://www.amaxa.com/citations.html has a link to http://www.amaxa.com/citation_adv_search.html?&no_cache=1 and a lot of links to pages like http://www.amaxa.com/citation_details.html?tx_mwaxcitations_pi2%5Bid%5D=614&tx_mwaxcitations_pi2%5Bsearch%5D=&tx_mwaxcitations_pi2%5Bpage%5D=1&no_cache=1&tx_mwaxcitations_pi2%5Badvancedsearch%5D=&tx_mwaxcitations_pi2%5Bcell_type_operator%5D=&tx_mwaxcitations_pi2%5Bcell_type%5D=&tx_mwaxcitations_pi2%5Bauthors_operator%5D=&tx_mwaxcitations_pi2%5Bauthors%5D=&tx_mwaxcitations_pi2%5Bcitation_source_operator%5D=&tx_mwaxcitations_pi2%5Bcitation_source%5D=&tx_mwaxcitations_pi2%5Btitle_operator%5D=&tx_mwaxcitations_pi2%5Btitle%5D=&tx_mwaxcitations_pi2%5Babstract_operator%5D=&tx_mwaxcitations_pi2%5Babstract%5D=&tx_mwaxcitations_pi2%5Bkeywords_operator%5D=&tx_mwaxcitations_pi2%5Bkeywords%5D=&tx_mwaxcitations_pi2%5Bproduct_operator%5D=&tx_mwaxcitations_pi2%5Bproduct%5D=&tx_mwaxcitations_pi2%5Bsource%5D=&tx_mwaxcitations_pi2%5Bpdf%5D=&tx_mwaxcitations_pi2%5Bppt%5D=.

All of these links contain the excluded keyword "search".
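
A quick way to see where a keyword sits in each URL is to split the URL into its path and query string and search both after percent-decoding. This is a minimal sketch, not a description of how Offline Explorer applies its filters; the keyword list and the helper name keyword_hits are assumptions made for illustration.

```python
from urllib.parse import urlsplit, unquote

# Keywords taken from this thread; the list is an assumption,
# not something read from Offline Explorer itself.
EXCLUDED_KEYWORDS = ["authors", "search"]

def keyword_hits(url, keywords=EXCLUDED_KEYWORDS):
    """Return {keyword: [URL parts containing it]} for the path and query."""
    parts = urlsplit(url)
    fields = {
        # unquote() turns %5B / %5D back into [ and ]
        "path": unquote(parts.path).lower(),
        "query": unquote(parts.query).lower(),
    }
    hits = {}
    for kw in keywords:
        found = [name for name, text in fields.items() if kw.lower() in text]
        if found:
            hits[kw] = found
    return hits

urls = [
    "http://www.amaxa.com/citations.html?tx_mwaxcitations_pi1%5Bauthors%5D=Aluigi+M&no_cache=1",
    "http://www.amaxa.com/citation_adv_search.html?&no_cache=1",
]
for u in urls:
    print(u, "->", keyword_hits(u))
```

For the first URL the keyword "authors" is found only in the query string, while for the second "search" is part of the file name. Whether a URL filename filter in OE also looks at the query string is not stated in this thread.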

Andres
Oleg Chernavin
11/29/2005 04:20 am
Strange. I used your Project and replaced the starting URL with the one above (http://www.amaxa.com/citations.html). Then I started the download, and no excluded link was loaded.

I am using Offline Explorer Pro 4.0 Beta 2.

Oleg.