How can I export a txt or Excel file...

Author Message
Dennis P 01/27/2013 07:48 pm
I want to create a text or Excel file containing just the e-mails, or words that contain "@". How can I do this?

Thanks
Dennis P 01/27/2013 07:52 pm
Sorry, wrong forum.

I'm referring to Offline Explorer Enterprise 6.5
Oleg Chernavin 01/28/2013 05:19 am
If you mean the ability to collect all E-mail addresses in a text file, you may add the following line to the Project's URLs field:

Additional=CollectEMails=c:\dir\file.txt

Best regards,
Oleg Chernavin
MP Staff
Dennis P 01/29/2013 12:04 pm
Thanks for the reply, but I'm a little confused as to where I would insert that line...

I right-click on the project > Properties > Project tab. Then, in the URL box, do I paste it in after the URL like this?

(http://www.bbb.org/us/business-reviews/matched/Dentist+-+Orthodontist/92121/Additional=CollectEMails=c:\dir\file.txt)

Thanks for the help Oleg!
-Dennis
Dennis P 01/29/2013 10:39 pm
I just tried that, and it didn't work...

There are a lot of options and settings which make it confusing to navigate and set it up properly.

I want to be able to put in a URL and have the software crawl through the links and various pages of the main domain and extract e-mails to a text or excel file.

Can you please give me detailed instructions on how I could accomplish this?
Oleg Chernavin 01/30/2013 04:53 am
You need to put that on two separate lines in the URLs field:

http://www.bbb.org/us/business-reviews/matched/Dentist+-+Orthodontist/92121/
Additional=CollectEMails=c:\dir\file.txt

Oleg.
Dennis P 01/31/2013 01:31 am
I ran the project on Level Limit 0 and then Level Limit 2 using this link:

http://www.bbb.org/us/Find-Business-Reviews/type/pest+control+services/pest-control-services/92121/bbb-accredited/

but the text file showed up empty...

Then I ran a Project on this link, going inside the listed business' page:

http://www.bbb.org/san-diego/business-reviews/termite-control/hungry-termite-in-san-diego-ca-171999653

Level limit 0

But it only picked up:

mailto:pestboard@dca.ca.gov

and not:

mailto:info@hungrytermite.com

^ this is the kind of e-mail I want it to pick up...

Any suggestions? Any settings I should check to make sure it's crawling through the links/hyperlinks properly?

Thanks,
Dennis
Dennis P 01/31/2013 11:50 am
I ran another project overnight and received results. BUT I got a lot of unwanted lines in my text doc like:

"info@sandiego.bbb.org?subject=We Got-Ya Pest Control Inc (19005603)"

and

".gov" e-mails

How can I exclude ".gov" and ".bbb.org" from the e-mail list?
Oleg Chernavin 02/01/2013 04:14 am
Regarding the first page: I am sorry, but they use a complex script to encrypt E-mail addresses. Offline Explorer doesn't yet support decoding it.

Filtering of unwanted E-mail addresses is not possible yet. It is easier to load the resulting text file into Excel and use its sorting and filtering features.
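
For anyone who would rather script this cleanup than do it in Excel, here is a minimal Python sketch. It is not an Offline Explorer feature. It assumes the collected file has one address per line, possibly wrapped in quotes, prefixed with "mailto:", or followed by a "?subject=..." part as in the examples above. The function names and the exclusion list are hypothetical and only mirror the domains mentioned in this thread.

```python
# Hypothetical exclusion list: the domains mentioned in this thread.
EXCLUDED_DOMAINS = ("gov", "bbb.org")


def clean_address(line):
    """Strip quotes, a leading 'mailto:', and any '?subject=...' part."""
    text = line.strip().strip('"')
    if text.lower().startswith("mailto:"):
        text = text[len("mailto:"):]
    # Everything after the first '?' is a mailto query, not the address.
    text = text.split("?", 1)[0]
    return text.strip().lower()


def is_wanted(address, excluded=EXCLUDED_DOMAINS):
    """Return True if the address looks valid and its domain is not excluded."""
    if address.count("@") != 1:
        return False
    local, domain = address.split("@")
    if not local or not domain:
        return False
    # Exclude the domain itself and any of its subdomains.
    return not any(domain == d or domain.endswith("." + d) for d in excluded)


def filter_addresses(lines, excluded=EXCLUDED_DOMAINS):
    """Clean, de-duplicate, and filter lines, keeping their original order."""
    seen = set()
    result = []
    for line in lines:
        address = clean_address(line)
        if address and address not in seen and is_wanted(address, excluded):
            seen.add(address)
            result.append(address)
    return result


def filter_file(src_path, dst_path, excluded=EXCLUDED_DOMAINS):
    """Read the collected file and write the filtered list to a new file."""
    # The file's encoding is an assumption; undecodable bytes are replaced.
    with open(src_path, encoding="utf-8", errors="replace") as src:
        kept = filter_addresses(src, excluded)
    with open(dst_path, "w", encoding="utf-8") as dst:
        dst.write("\n".join(kept) + "\n")
    return kept
```

A call such as filter_file(r"c:\dir\file.txt", r"c:\dir\filtered.txt") would write the cleaned list to a second file (the output path is just an example), and that file can still be opened in Excel afterwards.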

Oleg.