URL Substitutes

Author Message
Anand Chida 08/04/2005 12:22 pm
I am trying to download xls files from a website.
The xls files are located on this page at various links, as follows:
http://www.bba.org.uk/content/1/c4/50/20/Jan05.xls
http://www.bba.org.uk/content/1/c4/30/90/Feb05.xls
http://www.bba.org.uk/content/1/c4/70/100/Mar05.xls

I would like to download all the xls files so that they are saved as:
http://www.bba.org.uk/Jan05.xls
http://www.bba.org.uk/Feb05.xls
http://www.bba.org.uk/Mar05.xls

I have tried various combinations of URL substitutions. Can anyone offer some suggestions?

Thanks


Oleg Chernavin 08/04/2005 12:55 pm
What if you create 3 rules:

1. Replace /content/1/c4/ with /
2. Replace /*/ with /
3. Replace /*/ with /

And allow all rules to be applied. Does this work?

Best regards,
Oleg Chernavin
MP Staff
Anand Chida 08/04/2005 02:36 pm
Thanks for your quick reply. I am still having the issue.
The URL I am trying to work with is:
http://www.bba.org.uk/bba/jsp/polopoly.jsp?d=141&a=627
I am trying to download all the Excel spreadsheets on this page.

I tried the following in the substitute dialog box.

1st Rule
URL - http://www.bba.org.uk/content/1/c4/*
Replace - /content/1/c4/*/
With - /

2nd Rule
URL - http://www.bba.org.uk/*/
Replace - /*/
With - /

The above combination leaves files in
www.bba*/12/abc.xls

Thanks for your assistance.
Oleg Chernavin 08/04/2005 03:08 pm
The correct set is:

URL:
http://www.bba.org.uk/content/1/c4/*.xls
Replace:
/content/1/c4/*/
With:
/

URL:
http://www.bba.org.uk/*.xls
Replace:
http://www.bba.org.uk/*/
With:
http://www.bba.org.uk/

Oleg.
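
The two-rule set above can be sketched in Python to see why both rules need to run in sequence. This is only an illustrative model, not the program's actual matching code: it assumes that `*` in the Replace field matches a single path segment (no slashes), that `*` in the URL field matches anything, and that each rule replaces the first match only. The names `wildcard_to_regex`, `apply_rules`, and `RULES` are hypothetical.

```python
import re
from fnmatch import fnmatchcase

def wildcard_to_regex(pattern):
    # Assumption: '*' in a Replace pattern matches one path segment
    # (one or more characters other than '/'); everything else is literal.
    return re.compile("[^/]+".join(re.escape(p) for p in pattern.split("*")))

def apply_rules(url, rules, apply_all=True):
    # Each rule is (URL mask, Replace pattern, With text), as in the dialog.
    for url_mask, replace, with_text in rules:
        if not fnmatchcase(url, url_mask):
            continue  # rule does not apply to this URL
        url = wildcard_to_regex(replace).sub(lambda m: with_text, url, count=1)
        if not apply_all:
            break  # models stopping after the first matching rule
    return url

RULES = [
    ("http://www.bba.org.uk/content/1/c4/*.xls", "/content/1/c4/*/", "/"),
    ("http://www.bba.org.uk/*.xls", "http://www.bba.org.uk/*/", "http://www.bba.org.uk/"),
]

src = "http://www.bba.org.uk/content/1/c4/50/20/Jan05.xls"
print(apply_rules(src, RULES))                   # http://www.bba.org.uk/Jan05.xls
print(apply_rules(src, RULES, apply_all=False))  # http://www.bba.org.uk/20/Jan05.xls
```

Under these assumptions, the first rule strips only one of the two numbered directories, so a leftover directory remains unless the second rule is also applied to the result.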
Anand Chida 08/04/2005 05:34 pm
I tried the following, as you suggested, but I still have the same issue: directories are being created, as in
http://bb.com/11/abc.xls
Am I missing something?

URL:
http://www.bba.org.uk/content/1/c4/*.xls
Replace:
/content/1/c4/*/
With:
/

URL:
http://www.bba.org.uk/*.xls
Replace:
http://www.bba.org.uk/*/
With:
http://www.bba.org.uk/

Regards,
Anand Chida

Oleg Chernavin 08/05/2005 06:54 am
Yes, this is correct. Do you have the "Apply all matching rules" box checked?

Oleg.
Anand Chida 08/05/2005 10:35 am
Hi Oleg,
Yes, that was it. Thanks, it worked.

I am running into another issue while downloading this file:

https://portal.gs.com/gs/portal/products/fixedincome/commodities/?tableFunction=Q29tbW9kIFdlYjo6RW5lcmd5IFN3YXBzIGFuZCBDcmFja3MgUGFnZQ%3D%3D&tableArgs=Q2F0ZWdvcnk%3D&tableArgs=Q29udGFpbmVy&tableArgs=SGlkZGVu&Category=QnJlbnQgYW5kIE5hdHVyYWwgR2FzIFN3YXBz&Container=Q29tbW9kOiBDcmFjayBDb250YWluZXI%3D&Hidden=QlJUIFN3YXBz&portal.page.printable=true

I can download the file, but it is saved as
tablefunctionxxxxxx without an extension, and I am trying to rename it to abc.htm.

I am using URL substitution for this:
URL - https://portal.gs.com/*
Replace - @tableFunction=*
With - abc.htm
Also, I have made sure to uncheck the box so that URL substitution is applied only for downloaded files.
Please let me know if you need anything more on this.
Thanks in advance.
Oleg Chernavin 08/05/2005 10:49 am
Just one correction:

Replace:
?tableFunction=*

Oleg.
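
One plausible reading of the correction above is that the Replace text must contain the literal `?` that precedes `tableFunction` in the URL, so a pattern starting with `@` never matches. The Python sketch below illustrates that reading only. It assumes `*` matches a run of characters without `/` and that all other characters are literal; the `substitute` helper and the shortened URL are hypothetical stand-ins, not the program's own logic or the full portal URL.

```python
import re

def substitute(url, replace, with_text):
    # Assumption: '*' matches one or more characters other than '/';
    # every other character, including '?', is taken literally.
    regex = "[^/]+".join(re.escape(part) for part in replace.split("*"))
    return re.sub(regex, lambda m: with_text, url, count=1)

# Hypothetical, shortened stand-in for the long portal URL in the thread.
url = ("https://portal.gs.com/gs/portal/products/fixedincome/commodities/"
       "?tableFunction=abc&Hidden=xyz")

print(substitute(url, "?tableFunction=*", "abc.htm"))
# https://portal.gs.com/gs/portal/products/fixedincome/commodities/abc.htm

print(substitute(url, "@tableFunction=*", "abc.htm") == url)
# True: the '@' pattern matches nothing, so the URL is left unchanged
```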