Posted on February 10, 2010 at 10:30 pm by Royans
Lots of interesting updates today, but I would like to first mention the fantastic work the cloud computing group at UCSB is doing to make the App Engine framework more open. They have done significant work on making AppScale “work” with different kinds of data sources, including HBase, Cassandra, Voldemort, MongoDB, Hypertable, MySQL, and MemcacheDB. AppScale is actively [...]
Posted on February 1, 2010 at 8:24 pm by Royans
Windows Azure is an application platform provided by Microsoft that lets others run applications on Microsoft’s “cloud” infrastructure. It’s finally open for business (as of Feb 1, 2010). Below are some links about Azure for those who are still catching up. Wikipedia: Windows Azure has three core components: Compute, Storage and Fabric. As the [...]
Posted on January 25, 2010 at 11:45 pm by Royans
Hive is a data warehouse infrastructure built over Hadoop. It provides tools to enable easy data ETL, a mechanism to put structures on the data, and the capability to query and analyze large data sets stored in Hadoop files. Hive defines a simple SQL-like query language, called QL, that enables users familiar with SQL [...]
hadoop, highavailability, hive, mapreduce, product, scalability, scaledatabase, datawarehouse, facebook
Posted on December 21, 2009 at 12:48 am by Royans
My last post was about my struggle to find a good distributed ESB/service-discovery solution, built on open source tools, that is simple to use and maintain. Thanks to reader comments (Dan especially) and some other email exchanges, it seems that building a custom solution is unavoidable if I really want to keep things simple. Dan [...]
Posted on May 27, 2009 at 11:39 pm by Royans
For the last couple of weekends I’ve been playing with Google App Engine (Java edition) and was pleasantly surprised at the direction it has taken. I was also fortunate enough to see some Google engineers talk on this subject at Google I/O, which helped me a lot in compiling all this information. But before I [...]
Posted on October 25, 2007 at 12:56 am by Royans
Indexing 100 million blogs with over 10 billion objects, and with a user base that is doubling every six months, Technorati has an edge over most blog search engines. But they are much more than search, and any Technorati user can tell you that. I recommend you read John Newton’s interview with David Sifry which [...]
Posted on September 9, 2007 at 12:34 pm by Royans
I’ve been hearing this word “sharding” more and more often, and it’s spreading like wildfire. Theo Schlossnagle, the author of “Scalable Internet Architectures”, argues that federation is a form of partitioning, and that sharding is nothing but a form of partitioning and federation. In fact, according to him, sharding has already been in use for a [...]
Posted on August 25, 2007 at 12:50 pm by Royans
TypePad was, and probably still is, one of the first and largest paid blogging services in the world. In a presentation at OSCON 2007, Lisa Phillips and Garth Webb spoke about TypePad’s problems in 2005. Since this is a common problem with any successful company, I found it interesting enough to research a little more. [...]
Posted on August 11, 2007 at 10:58 pm by Royans
Link: “Introduction to MySQL Cluster: The NDB storage engine (MySQL Cluster) is a high-availability storage engine for MySQL. It provides synchronous replication between storage nodes and many MySQL servers having a consistent view of the database. In 4.1 and 5.0 it’s a main memory database, but in 5.1 non-indexed attributes can be stored on disk. [...]