Posted in August 27, 2010 ¬ 6:28 pmh.Royans
My updates have been slow recently due to other things I’m involved in. If you need more updates around what I’m reading, please feel free to follow me on twitter or buzz. Here are some of the big ones I have mentioned on my twitter/buzz feeds. Tools: Real-time Relationship Analytics from large scale graph processing [...]
Read the rest of this entry »
Posted in June 1, 2010 ¬ 6:32 pmh.Royans
Most of us who deal with traditional databases take auto-increments for granted. While auto-increments are simple on consistent clusters, it can become a challenge in a cluster of independent nodes which don’t use the same source for the unique-ids. Even bigger challenge is to do it in such a way so that they are roughly [...]
Read the rest of this entry »
Posted in May 31, 2010 ¬ 12:07 pmh.Royans
A few of us joined in at the new Twitter office in downtown SF (right next to Moscone Center) and were for the first time shown what Twitter is doing about “Twitter Annotations”. We probably created the first set of 3rd party applications around this new API. During the Hackathon I spent some time to [...]
Read the rest of this entry »
Posted in May 21, 2010 ¬ 2:22 amh.Royans
Yesterday Google formally announced Google Storage to a few (5000?) of us at Google I/O. Here is the gist of this as I see it from the various discussions/talks I attended. To begin with, I have to point out that there is almost nothing new in what Google has proposed to provide. Amazon has been [...]
Read the rest of this entry »
Posted in March 18, 2010 ¬ 10:09 pmh.Royans
MapReduce, Bigtable and Pregel have their origins in Google and they all deal with “large systems”. But all of them may be dwarfed in size and complexity by a new project Google is working on, which was mentioned briefly (may be un-intentionally) at an event last year. Instead of caching data closer to user, it [...]
Read the rest of this entry »
datastore, eventually consistent, framework, google, mapreduce, replication, scalabilitydatastore, eventually consistent, google, mapreduce, replication, scalability
Posted in February 26, 2010 ¬ 8:12 amh.Royans
This is a very interesting talk by Jonathan Ellis on database scalability. He designed and implemented multi-petabyte storage for Mozy and is currently the project chair for Apache Cassandra. What every developer should know about database scalability, PyCon 2010 View more presentations from jbellis. Scalability is not improving latency, but increasing throughput But overall performance [...]
Read the rest of this entry »
Posted in February 14, 2010 ¬ 3:33 pmh.Royans
Large distributed systems run into a problem which smaller systems don’t usually have to worry about. “Brewers CAP Theorem” [ Ref 1] [ Ref 2] [ Ref 3] defines this problem in a very simple way. It states, that though its desirable to have Consistency, High-Availability and Partition-tolerance in every system, unfortunately no system can [...]
Read the rest of this entry »
Posted in February 3, 2010 ¬ 11:32 pmh.Royans
While PHP is very popular, it unfortunately doesn’t perform as some of its competitors. One of the ways to make things faster is to write PHP Extensions in C++. In this post we will describe two different ways developers can solve this problem and the milage you might get from either model may vary. Since [...]
Read the rest of this entry »
Posted in February 1, 2010 ¬ 8:24 pmh.Royans
Windows Azure is an application platform provided by Microsoft to allow others to run applications on Microsoft’s “cloud” infrastructure. Its finally open for business (as of Feb 1, 2010). Below are some links about Azure for those who are still catching up. Wikipedia: Windows Azure has three core components: Compute, Storage and Fabric. As the [...]
Read the rest of this entry »
Posted in January 31, 2010 ¬ 5:29 pmh.Royans
While “private clouds may not be the future” they are definitely needed today. Here are some of the top issues bothering some organizations which have been thinking about going into the cloud. Some of issues were based on Craig Bolding’s talk on “Guide to cloud security”. Unlike your own data center, you will never know [...]
Read the rest of this entry »