Getting links from one site but d/l from another
|Cap||05/29/2007 05:09 pm|
|Can you tell me if this is possible.
I'd like to pull links from a search engine (yahoo) concerning a specific website that I want to download. I can't put in the default 'home' page because the site has a lot of information not linked directly that shows up in a yahoo search.
search yahoo for 'goped tweaks' on the website 'www.gopednation.com'
you can do this either through 'advanced' or by typing this syntax: goped tweaks site:www.gopednation.com
Now in that yahoo page I'll have a list of pages that are on www.gopednation.com's website but that might not be directly linked anywhere. I can put in the search results URL: http://search.yahoo.com/search?p=goped+tweaks+site%3Awww.gopednation.com&adult_done=http%3A%2F%2Fsearch.yahoo.com%2Fsearch&adult_cancel=http%3A%2F%2Fsearch.yahoo.com%2Fweb%2Fadvanced&_adv_prop=web&ei=UTF-8&vf=all&vm=p&fl=0&n=10&fr=my-vert-web-top&_bcrumb=9ad59b43d6faeb4b1829398b81977626%2C1180472675 and everything seems cool.. but here's where I'm wanting to change things.
I only want to download actual information from gopednation.com (only using this as an example btw). I don't want all the sub yahoo pages, etc.. I will probably have to go 3 levels or so deep to get everything that I want, so if I don't filter some stuff out I'm going to be downloading junk I don't need for a long, long time.
Is there a way to say it's ok to read/dl any search.yahoo.com pages and any gopednation.com pages but nothing else? I'm trying out the latest version of this product (OE).
|Oleg Chernavin||05/31/2007 12:12 pm|
|I can offer you two ways:
1. Set the Project level to 0 and in URL Filters - Server allow 3 levels from other servers.
2. To use URL Filters - Server - Custom to exclude:
or other sites. Or even keep Included list empty to allow all sites except *.yahoo.*