It seems Big Data is all anyone is talking about these days, so I'm going to start my posts on the subject by covering some basics about the platform:
- Main Apache website: http://zookeeper.apache.org/
- Insightful link: http://blog.cloudera.com/blog/2013/02/how-to-use-apache-zookeeper-to-build-distributed-apps-and-why/
- Main Apache website: http://hbase.apache.org/
- There are many articles and blog posts about why HBase is a great tool and how it outperforms relational databases. I've included a couple of links below to get things started:
http://hstack.org/why-were-using-hbase-part-1/
http://www.stumbleupon.com/blog/why-we-love-hbase/
- A couple of tools, listed below, bundle all of the above (and more) and make installation, configuration, and management easier. Their main advantage is avoiding the painful task of managing each server and configuration file by hand: DBAs update the configuration once through the UI, and it is deployed to all servers.
Cloudera
Hortonworks