I posted a few weeks ago that I was going to look into this Hadoop stuff. Well, I actually looked into the “Cassandra” project… and managed to get it going on a LAMP stack.
Cassandra is “A scalable multi-master database with no single points of failure”.
On Ubuntu 10.04 I extracted Cassandra to a directory and then installed phpcassa. It took a while, but I was able to run the tutorial script described here:
http://thobbs.github.com/phpcassa/tutorial.html
I guess now it’s time to change my mindset from a relational (MySQL) database to this new way of storing data.
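
To make that shift a bit more concrete for myself, here is a rough sketch in plain Python. It is not phpcassa and not the real Cassandra API, just nested dictionaries standing in for a column family. The `Users` data and the `insert` and `get` helpers are made up for illustration. The idea it tries to show: instead of rows that all share one fixed schema, each row key maps to its own set of column names and values.

```python
# Relational view: a fixed schema, so every row carries the same columns.
relational_rows = [
    {"id": 1, "name": "alice", "email": "alice@example.com"},
    {"id": 2, "name": "bob", "email": None},
]

# Column-family view (hypothetical "Users" column family):
# row key -> {column name -> value}. Rows do not have to share columns.
users_cf = {
    "alice": {"email": "alice@example.com", "city": "Austin"},
    "bob": {"twitter": "@bob"},
}


def insert(column_family, row_key, columns):
    """Add or overwrite columns under row_key, creating the row if needed."""
    column_family.setdefault(row_key, {}).update(columns)


def get(column_family, row_key, columns=None):
    """Return the columns stored under row_key, optionally only a subset."""
    row = column_family.get(row_key, {})
    if columns is None:
        return dict(row)
    return {name: row[name] for name in columns if name in row}
```

Calling `insert(users_cf, "bob", {"email": "bob@example.com"})` adds a column to just that one row, with no schema change for anyone else, which is roughly the part that feels different coming from MySQL.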