Trouble downloading a password protekted site

Author Message
kevork kevorkian 04/11/2004 03:21 pm
I have trouble downloading
http://cardiology.accessmedicine.com/hurst/public/co_contents/toc.html
I`m trying Offline Explorer Enterprise 3.1. In the former versions the only possible way was to log in from within the internal browser and after that to create a standard project. All other methods were unsuccessful. Now I keep receiving the login prompt page anywhere.
My user name is:
kevork
Password:
kevorkian
Hope it is possible to download it:)
Oleg Chernavin 04/12/2004 05:36 am
This is simple. When Offline Explorer starts loading the site it follows the LogOut link. To exclude it from downloading, use the Project URL Filters | Filename | Custom Configuration. Add the following line to the Excluded filename keywords:

expire

This should help.

Best regards,
Oleg Chernavin
MP Staff
kevork kevorkian 04/14/2004 05:55 am
Thank you for your cooperation, Oleg!
I`m going to follow your advice. I assumed that logout link could be causing the problem, but it did not interfere with my previouis downloads.
Kind regards:
Kevork
> This is simple. When Offline Explorer starts loading the site it follows the LogOut link. To exclude it from downloading, use the Project URL Filters | Filename | Custom Configuration. Add the following line to the Excluded filename keywords:
>
> expire
>
> This should help.
>
> Best regards,
> Oleg Chernavin
> MP Staff
Oleg Chernavin 04/14/2004 07:36 am
Well, they could have added the logout link just recently or changed it somehow, so Offline Explorer started following it.

Please let me know if this works for you.

Oleg.
kevork kevorkian 04/15/2004 05:34 pm
Yes Oleg, it worked, and your software has made a magnificent job for me! Thank you very much for helping! I remember I had problems downloading the javascript figure and table windows in the previous versions, but now Offline Explorer seems to be perfect:)
Just one more amateur question:) Is it possible to run the cgi scripts on the downloaded pages, so the search form fill return the requested info?
Best regards:
Kevork
> Well, they could have added the logout link just recently or changed it somehow, so Offline Explorer started following it.
>
> Please let me know if this works for you.
>
> Oleg.
Oleg Chernavin 04/16/2004 01:14 am
> Yes Oleg, it worked, and your software has made a magnificent job for me! Thank you very much for helping!

Thank you for your kind words!

> I remember I had problems downloading the javascript figure and table windows in the previous versions, but now Offline Explorer seems to be perfect:)

Yes, we improve scripts support with each new version.

> Just one more amateur question:) Is it possible to run the cgi scripts on the downloaded pages, so the search form fill return the requested info?

No, because CGI applications keep their actual code on the server only and it is possible to download only HTML output of them. Anyway, if you need only search, you can easily use the Find Contents feature in the Edit menu to find words in downloaded files. Sometimes it works even better than the search on live sites and it also highlights found words with colors when you browse Web pages.

Oleg.
Raman 07/13/2004 01:05 pm
Hi

I am trying to download some questions from a site

http://harrisons.accessmedicine.com/server-java/Arknoid/amed/harrisons/ox_pretests/pretests_toc.html

the questions download fine using the above mentioned way
but i am having trouble with answers

None of them is downloaded

Maybe it is because of the SUBMIT button
some "cgi" thing again...........
I don`t know what to do
please help
can`t something be done to download the answers

my username and password are

username: julkadoctor
password : rjulka
Oleg Chernavin 07/13/2004 03:12 pm
It is not easy, because Offline Explorer would not know which answer is correct and which of them should be submitted. Offline Explorer can try all possible combinations if you turn on HTML Forms Processing option in the Properties dialog | Advanced section. But this will be too many combinations of all possible answers to submit.

Oleg.
Raman 07/14/2004 08:38 am
i have downloaded the project.
but as said it was without the answers
Now as advised by you i have enabled the HTML forms and script calculations button
And then tried to download it again with enabling "Do not download existing files"
It parses, but downloads nothing

Cant something be done
If you check the answers pages online, you will find that the cgi pages that open after pressing the SUBMIT button contain links to the pages with the correct answer, and all these pages have "qae" word included in their names, but as the cgi pages are not downloaded , i am helpless, cant figure anything
please help if possible.
Oleg Chernavin 07/14/2004 10:25 am
It looks like there is a way to load all these pages.

Please logon the site in the Internal browser. Now create a new Project and place the following 5 lines to the URLs field:

http://harrisons.accessmedicine.com/cgi-bin/pretests
POST=ACTION=score&BOOK_TEMPS=%2FNEW%2FHARRISONS&BOOK=harrisons&SECTION={:1..3}&SET=1&MAXSET=4&KEY=1+C%3A2+E%3A3+A%3A4+C%3A5+A&QLIST={:1..26|5#1}+{:#1+1}+{:#1+2}+{:#1+3}+{:#1+4}&TLIST=0+0+0+0+0
IgnoreLogOutLinks
Referer=http://harrisons.accessmedicine.com/server-java/Arknoid/amed/harrisons/ox_pretests/questions/1/qset1.html
Additional=ConvertPOSTToFileName

Then set Level to 1, go to URL Filters | Directory and enable loading from all directories. Click OK button to save the Project and download it.

Oleg.
Raman 07/15/2004 10:30 am
Hi,

Thanks for taking so much pains.

I tried what you advised, lots of files (about 782 files) are downloaded, but there are following problems

1. When I try to run the above mentioned project, it says Document not found

When I tried to use the map or WIndows Explorer to locate the files in the cgi-bin, I found that there are only 18 files in the cgi bin
I tried to locate the files containing answers, the ones with "qae" in their name, only 90 files are downloaded which can`t be accessed by double clicking, as it says, File not found

The only way to see the answers is by directly accessing (double clicking) some of the files downloaded in cgi-bin, and then following the links for the answers, but that way only a few answers are downloaded. Most of the files in the cgi-bin, which say 5 out of 5 answers right, their links don`t work.

I apologize for bothering and pestering you so much.

Please help if possible
Oleg Chernavin 07/15/2004 11:10 am
Frankly, I don`t know if this helps, but it is the last chance to try. Please change the POST= line to:

POST=ACTION=score&BOOK_TEMPS=%2FNEW%2FHARRISONS&BOOK=harrisons&SECTION={:1..10}&SET={:1..4}&MAXSET=4&KEY={:1..26|5#1}+C%3A{:#1+1}+E%3A{:#1+2}+A%3A{:#1+3}+C%3A{:#1+4}+A&QLIST={#1}+{:#1+1}+{:#1+2}+{:#1+3}+{:#1+4}&TLIST=0+0+0+0+0

Maybe this will help you to get some more pages.

Oleg.
Raman 07/15/2004 12:12 pm
Hi,

Thanks again.

I tried it.

It downloads many files. Now 54 files in cgi-bin
Infact it downloads a lot of files with "qae" that is the answer files, but when i open them whether using cgi-bin links or even directly, it says FILE NOT FOUND

:-(

Anyway, Thanks for taking so much trouble

My congratulations for doing such a great job.
God bless you.
Oleg Chernavin 07/15/2004 12:13 pm
What if you open the .qae files from the Project Map?

Oleg.
Raman 07/16/2004 09:01 am
They are not .qae files, but qae*.htm files
When i open them from project map, most of them say..File not found
And a very few..only 2 or 3 out of the total show the answer page
Oleg Chernavin 07/16/2004 09:06 am
Yes, it is so, because the server generates the URLs its own way. I just got another idea. What if you load only these qae*.html files:

http://harrisons.accessmedicine.com/server-java/Arknoid/amed/harrisons/ox_pretests/questions/{:1..14}/set_{:1..5}/qae{:1..25}.html

This will be a lot of files and many of them will be invalid, but the above should get all possible correct files as well.

Oleg.
Raman 07/16/2004 09:22 am
What should be the level limit and other parameters for this new project
Raman 07/16/2004 09:35 am
I think You have struck the chord.

Lots of files are being downloaded........for each chapter 5 sets and for each set 25 qae*.html files
This way probably all right ones will also be downloaded.

I hope so

thanks a tonne

This download had made me very sad.i really wanted to downoad the answers and carry on with my preparations. Thanks again. I owe you one ( no not just one.but a lot)


This download has now lit hope that i would probably also be able to download another question site

http://www.manbit.com/mcq/mcqinit.asp

Is there a way to download all the 1024 questions. which have been selected by default?
Oleg Chernavin 07/16/2004 11:00 am
I can`t load them. I tried several parameters, but the server returns not the ones questions as I requested. Sorry.

Oleg.
Raman 07/16/2004 01:33 pm
Hi

Thanks

The accessmedicine qae*.html files porject had a little problem too
Only 25 answers were being downloaded for each section 1 to 14, even though some of them had as high as 121 answers

But I managed to download the rest. i took the long way. I took the long way. I made 10 additional projects and altered the url you gave so thatonly one chapter at a time was loaded with as many answers as i wanted.
Thanks to you.

God bless you, and never mind the manbit mcq download. I am sure one day your great software will be able to do it easily.
All the best and keep up the good (nay GREAT) work
Oleg Chernavin 07/16/2004 04:21 pm
Thank you for your kind words!

Oleg.
Raman 07/24/2004 02:48 pm
Hi Oleg,

Thanks for the previous help.

Now i was trying to download from
http://www.blackwellusmle.com/test.asp

u/n: ******************
p/w : *****************

i can download all questions, but not all answers. Only the questions i have answered, and so are in my status page.
The answers are in the page answer.asp?intQuestionID=
Please delete the u/n and p/w like before.
Thanks
please help
Raman 07/25/2004 04:00 am
Hi Oleg,

I tried to download the answers by first logging in the internal browser and then starting the following project

www.blackwellusmle.com/answer.asp@intQuestionID={:1..350}

It downloads all files, but only 4040. File not found.

I also used www.blackwellusmle.com as referrer

http://www.blackwellusmle.com/answer.asp@intQuestionID={:1..350}
Referer=http://www.blackwellusmle.com/status.asp

but no use
kindly advise how to download the answers

Oleg Chernavin 07/26/2004 04:30 am
Please use the following link instead:

http://www.blackwellusmle.com/answer.asp?intQuestionID={:1..350}
Referer=http://www.blackwellusmle.com/status.asp

Oleg.
Meghna 07/30/2004 05:42 pm
Hi,

I am trying to download this book from a site
I just want to download the greenberg book
And not so many other pages available through the pages
I tried giving included directory keyword, but all has failed
Kindly help
The url: http://accesslange.accessmedicine.com/greenberg/public/co_contents/toc.html
Username: *******************
password : **************
Kindly delete the passwords from the forum
Oleg Chernavin 07/31/2004 11:45 am
I would suggest you the following settings:

URL:

http://accesslange.accessmedicine.com/greenberg/public/co_contents/toc.html
Level=2
Then go to the URL Filters | Directory, select Custom Configuration and add the following to the Included directory keywords list:

server-java/Arknoid/amed/greenberg/co_chapters/

Click OK to save the Project. Now browse to the site in the Internal browser, logon there and you can start downloading the site.

Oleg.