Install Hadoop on Windows

Posted on June 29, 2016 by By admin, in Big Data | 2

Introduction :

This blog enables Hadoop users to install Hadoop on windows. As Hadoop is usually built and run on LINUX, windows installation in relatively new. The following Blog contains steps to download Hadoop and its prerequisites, install YARN based Hadoop 2.5 and above.

Prerequisites :

Oracle JDK versions 1.7 and 1.6 have been tested by the Hadoop developers and are known to work.
Make sure that JAVA_HOME is set in your environment and does not contain any spaces. If your default Java installation directory has spaces then you must use the Windows 8.3 Path name instead e.g. c:\Progra~1\Java\… instead of c:\Program Files\Java\….

Downloading Hadoop sources :

From the ASF Hadoop download page or a mirror.
Subversion URL: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2.5

Build and Copy Binary Packages :

Command to install binary package directly from command prompt “mvn package -Pdist,native-win -DskipTests -Dtar”.

Installation :

Pick Target Directory for installation. Here target directory used is c:\Hadoop, and Extract the tar.gz file (e.g.hadoop-2.5.0.tar.gz) under c:\Hadoop.

After installing the folder structure would look like this in command prompt.

Starting a Single Node (pseudo-distributed) Cluster

Example HDFS Configuration

Before you can start the Hadoop Daemons you will need to make a few edits to configuration files. The configuration file templates will all be found in c:\Hadoop\etc\hadoop, assuming your installation directory is c:\Hadoop.

First edit the file hadoop-env.cmd to add the following lines near the end of the file.

Edit or create the file core-site.xml and make sure it has the following configuration key:

Edit or create the file hdfs-site.xml and add the following configuration key:

Finally, edit or create the file slaves and make sure it has the following entry :– localhost

The default configuration puts the HDFS metadata and data files under \tmp on the current drive. In the above example this would be c:\tmp. For your first test setup you can just leave it at the default.

Example YARN Configuration :

Edit or create mapred-site.xml under %HADOOP_PREFIX%\etc\hadoop and add the following entries, replacing %USERNAME% with your Windows user name.

Finally, edit or create yarn-site.xml and add the following entries:

Initialize Environment Variables

Run c:\Hadoop\etc\hadoop\hadoop-env.cmd to setup environment variables that will be used by the startup scripts and the daemons.

Format the filesystem with the following command:

%HADOOP_PREFIX%\bin\hdfs namenode -format

Start HDFS Daemons

Run the following command to start the NameNode and DataNode on localhost.

%HADOOP_PREFIX%\sbin\start-dfs.cmd

Start YARN Daemons :

%HADOOP_PREFIX%\sbin\start-yarn.cmd

Courtesy :

https://wiki.apache.org

http://hadoop.apache.org/

Best Open Source Business Intelligence Software Helical Insight is Here

A Business Intelligence Framework

Big Data big data analytics Hadoop installation windows

0 0 votes

Article Rating

2 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

Credo Systemz

7 years ago

Great blog.. Installation procedure are very clear and step by step so easy to understand..

Ananthi

7 years ago

After reading this blog i very strong in this topics and this blog really helpful to all… explanation are very clear so very easy to understand… thanks a lot for sharing this blog

You might also like..

Business Intelligence

Installation of Firebird db

By admin

Steps to install firebird db 1. Go to google and type firebird in search box and then click on first link. License aggrement 2. Click on downloads and then install Firebird latest version(5.0.0). 3. It will navigate to the below...

Software Testing

Defect Life Cycle

By admin

This blog explains about the complete life cycle of a bug and different status of bug from the stage it was identified,fixed,retest and close. What is Defect life cycle? Defect life cycle is the life cycle of a defect or...

Software Testing

Different Levels of Testing in Software Testing

By admin

What are the Levels of Software Testing? In this blog,we are going to understand the various levels of software testing In Software Testing,we have four different levels of testing,which are as mentioned below: Unit Testing Integration Testing System Testing Acceptance...

About Helical IT Solutions Pvt Ltd

Location

Contact Us

Search what you are looking for..

Install Hadoop on Windows

Posted on June 29, 2016 by By admin, in Big Data | 2

Installation :

Starting a Single Node (pseudo-distributed) Cluster

Example HDFS Configuration

Example YARN Configuration :

Initialize Environment Variables

Start YARN Daemons :

A Business Intelligence Framework

You might also like..

Business Intelligence

Installation of Firebird db

By admin

Software Testing

Defect Life Cycle

By admin

Software Testing

Different Levels of Testing in Software Testing

By admin

Contact Form