Installing Apache Hadoop on a Single/Multi Node Setup.

Posted: October 11, 2012 in Big Data

Apache Hadoop is an open source framework that supports data-intensive distributed applications and is licensed under the Apache v2 license. It enables applications to work with thousands of computationally independent computers and petabytes of data. Hadoop was derived from Google’s MapReduce and Google File System (GFS) papers.

First we need to set up Java 6. Since I am using Ubuntu 12.04 for my setup, I used https://github.com/flexiondotorg/oab-java6 for reference.

Once Java 6 is set up, just follow the instructions at the URLs provided below, skipping their Java 6 setup part.
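
Before moving on to the Hadoop tutorials, it can help to confirm which Java version is actually on the PATH. The following is only a minimal sketch in Python, not part of either tutorial: the helper names parse_java_major and installed_java_major are made up for illustration, and it assumes the usual `java -version` output format, such as java version "1.6.0_45".

```python
import re
import shutil
import subprocess


def parse_java_major(version_output):
    """Return the major Java version found in `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    first = int(match.group(1))
    second = match.group(2)
    # Legacy scheme: "1.6.0_45" means Java 6; newer scheme: "11.0.2" means Java 11.
    if first == 1 and second is not None:
        return int(second)
    return first


def installed_java_major():
    """Run `java -version` and parse its output; None if java is not on PATH."""
    if shutil.which("java") is None:
        return None
    result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    # `java -version` conventionally prints to stderr, so check both streams.
    return parse_java_major(result.stderr + result.stdout)


if __name__ == "__main__":
    major = installed_java_major()
    if major is None:
        print("No Java runtime found on PATH")
    else:
        print("Detected Java major version:", major)
```

If the script reports a major version other than 6, the Java setup step is worth revisiting before starting the Hadoop installation.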

Running Hadoop On Ubuntu Linux (Single-Node Cluster): http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/

Running Hadoop On Ubuntu Linux (Multi-Node Cluster): http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
