Tuesday, March 11, 2008

my link to Pipes

Hi,

This is my link to Pipes
http://pipes.yahoo.com/rct tm suicide prevention/dbdb5ecc35ee1835053b23aac2262e95

I found this is very interesting. I have put Google, Yahoo, Wikipedia and nlm.nih.gov databases/directories/search engines to search the term “Randomised Controlled Trials (RCT) in Telemedicine and Suicide prevention”.
It’s fantastic; I have received 21 items which are related to the search term. However I have received a message saying that “Can't fetch pages that robots.txt disallow” and I don’t know what it is?

I am still not sure it how it can be best use in literature search. But I think if this works, it will be a good relief for me to make a pipe and waiting for results rather than searching each and every search engines separately.

Any idea!

Thanks!
Rohana

1 comment:

Kingo said...

Hi Rohana,

Robots.txt is a file that website owners can place on their site that tells web "robots" (such as Google's search engine, Yahoo's Pipe system) from traversing a specific area of their website.

I would say that the site you tried to fetch some items off has a disallow rule in their robots.txt file, and unfortunately, this stops most spiders/robots from accessing it.

For a more technical-slanted description, see Wikipedia's article on the matter.

I hope this helps you!

Sam