Apache SystemML is a declarative, large-scale machine learning platform that provides automatic optimisation for custom machine learning a..
Apache Flume is a scalable, high-volume data ingestion system that allows users to load streaming data into HDFS. Typical use cases for Fl..
In the previous post, I wrote about the importance of the Open Data Platform initiative. In this tutorial, I will go over the steps of insta..
If you are planning to run Hadoop on a 64-bit OS, you might want to compile it from source instead of using the pre-built 32-bit i386-Linux..