Archive for January, 2010

The real concerns about Cloud infrastructure (as it is today)

While “private clouds may not be the future” they are definitely needed today. Here are some of the top issues bothering some organizations which have been thinking about going into the cloud. Some of issues were based on Craig Bolding’s talk on “Guide to cloud security”.

Unlike your own data center, you will never know [...]

Read the rest of this entry »

Fixing GSLB (Global Server load balancing)

Standard DNS protocol allows DNS servers to respond with multiple addresses in the replies for simple DNS lookup queries. This, and the way that the order of records is changed in every reply is collectively known as the “Round Robin DNS” technique to load balance across a set of servers.
Though a lot of organizations are [...]

Read the rest of this entry »

Cloud computing in 1963 ( actually Timesharing )

Found this on Feld Thoughts. Its not really about cloud computing, but they are interested in making efficient use of computational resources, which is one of the goals of today’s “cloud computing”  as well.
This magnificent video is from 1963 Timesharing: A Solution to Computer Bottlenecks where MIT Professor Fernando Corbato explains how timesharing works to [...]

Read the rest of this entry »

AppScale, an OpenSource GAE implementation

If you don’t like EC2 you have an option to move your app to a new vendor. But if you don’t like GAE  (Google app engine) there aren’t any solutions which can replace GAE easily.
AppScale might change that.
AppScale is an open-source implementation of the Google AppEngine (GAE) cloud computing interface from the RACELab at UC [...]

Read the rest of this entry »

Videos on scalable web architectures

If you are like me, you are already following all the talks and presentations published on YouTube. But if you have not been, nothing stops you from starting now. A new “Videos” page has been added to this blog to list the latest YouTube videos related to scalable web architectures.
Videos related to scalable web [...]

Read the rest of this entry »

Scalability Updates for Jan 26th 2010

A few interesting updates for today

Derrick Harris made an insightful observation that Cloud providers are pairing up with CDNs . I won’t be surprised if some consolidations happen in this arena
Cassandra 0.5.0 is released. I can’t wait to try it out.
Paper: Keyspace: A consistently replicated, highly-available key-value store
Paper: PaxosLease: Diskless Paxos for leases
Cloudkick: A cross [...]

Read the rest of this entry »

Hive @Facebook

Hive is a data warehouse infrastructure built over Hadoop. It provides tools to enable easy data ETL, a mechanism to put structures on the data, and the capability to querying and analysis of large data sets stored in Hadoop files. Hive defines a simple SQL-like query language, called QL, that enables users familiar with SQL [...]

Read the rest of this entry »

Scalability Killers (The art of scalability)

Top 10 scalability killers from The Art of scalability: Scalable Web Architecture, Processes, and Organizations for Modern Enterprise

Thinking Scalability is just about technology
Overuse of Synchronous calls
Failure to weed or seed soon enough
Inappropriate use of databases
Cesspools instead of swim lanes
Reliance on Vertical scale
Failure to Learn from History
Changing Development methodologies to fix problems
Too little caching, too late
Overreliance [...]

Read the rest of this entry »

Private clouds not the future ?

James Hamilton is one of the leaders in this industry and has written a very thought provoking post about private clouds not being the future. This is what he said about private clouds when compared to existing not-cloud solutions.

A fix, Not the future (reference to an InformationWeek post)
Runs at lower utilization levels
Consumes more [...]

Read the rest of this entry »

HAProxy : Load balancing

Designing any scalable web architecture would be incomplete without investigating “load balancers”.  There used to be a time when selecting and installing load balancers was an art by itself. Not anymore.
A lot of organizations today, use Apache web servers as a proxy server (and also as a load balancer) for the backend application clusters. Though [...]

Read the rest of this entry »