Did you know that Google has a bot which ignores robots.txt and does so defiantly? It's true. Google's RSS grabber, Feedfetcher, ignores robots.txt as Google reasons a human decided to publish the feed and a human has decided to request the feed. It's all explained over at the webmaster help center. I actually think Google's made the right call here, although it means you can't slam the brakes on an RSS by slapping up a robots.txt block and I'm beginning with this just to set the precedent.
I like to think one of my real scoops this year was when I noticed that Google seemed to be blocking Yahoo Pipes. Only yesterday I noted that I was disappointed that Sphinn didn't like the story but pleased that Wired writer Betsy Schiffman had.
I do believe that this blockage was temporary and accidental. Google have said complementary things about Yahoo Pipes before and you can use Yahoo Pipes to take data from Google Base. In fact, Yahoo Pipes and Google Base have been a featured project on Google Code.
In a quirk of timing, bigmouthmedia colleague and Wonga World blogger, Chris Cathcart pointed out that Google's Feedburner is also blocking Yahoo Pipes.
This time the blockage is certainly not an accident but is a human controlled decision. Why would Feedburners want to keep their RSS out of Yahoo Pipes? One possible answer is that although the publisher is happy to distribute content (or teasers) in a feed they don't want that content to be sliced, diced and mixed up with other content. One of the ways I use Yahoo Pipes is to monitor dozens of feeds but only alert me when a story is gaining a critical mass, this means I don't need to manually review all those feeds nor even look at any adverts inside them.
Here's the plug for Wonga World! Chris is our Senior Strategist in the Finance vertical. He's years of experience working in banks and digital marketing. In fact, he spoke at SMX London this year. Wonga World is written with that savvy financial sector bias which is why he gave me this 'search only' lead. What a nice man.