Archive for the ‘analysis’ Category

Splunk: Fastest way to get a web operations dashboard running

This is a cross-post from my personal blog. A few weeks ago I asked a question on Quora about log aggregation. I was surprised to find that no open-source solution came close to what I wanted, but I got a lot of suggestions to try out Splunk. So I did. What I wanted was an aggregation tool [...]

Read the rest of this entry »

S4: Distributed Stream Computing Platform

A few weeks ago I mentioned that Yahoo! Labs was working on something called S4 for real-time data analysis. Yesterday they released an 8-page paper with a detailed description of how and why they built it. Here is the abstract from the paper. It's interesting to note that the authors compared S4 with MapReduce and explained [...]

Read the rest of this entry »