November 22, 2014

No Comments

Hadoop Confiuration

Hadoop Configuration
I have to do in the following layers.

HDFS Layer
- NameNode-Master
- DataNode-Store Data(Actual Storage)
MapReduce Layer
- JobTracker
- TaskTracker
Secondary Namenode– storing backup of NameNode it will not work as an alternate namenode, it just stored namenode metadata

Types of Hadoop Configurations

Standalone Mode
- All processes runs as single process
- Preferred in development
Pseudo Cluster Mode
- All processes run in different process but on a single machine
- Simulate cluster
Fully Cluster Mode
- All processes running on different boxes
- Preferred in production Mode

What are important files to be configure

hadoop-env.sh (set java environment and logging file)
core-site.xml (configure namenode)
hdfs-site.xml (configure datanode)
mapred-site.xml (map reduce here taking responsibility of configuring jobTracker and taskTracker)
yarn-site.xml
master (file configured on each datanodes telling about its namenode)
slave (file configured on namenode telling what all slave of datanode it has to manage)

About The Author

Dinesh Rajput

Dinesh Rajput is the chief editor of a website Dineshonjava, a technical blog dedicated to the Spring and Java technologies. It has a series of articles related to Java technologies. Dinesh has been a Spring enthusiast since 2008 and is a Pivotal Certified Spring Professional, an author of a book Spring 5 Design Pattern, and a blogger. He has more than 10 years of experience with different aspects of Spring and Java design and development. His core expertise lies in the latest version of Spring Framework, Spring Boot, Spring Security, creating REST APIs, Microservice Architecture, Reactive Pattern, Spring AOP, Design Patterns, Struts, Hibernate, Web Services, Spring Batch, Cassandra, MongoDB, and Web Application Design and Architecture. He is currently working as a technology manager at a leading product and web development company. He worked as a developer and tech lead at the Bennett, Coleman & Co. Ltd and was the first developer in his previous company, Paytm. Dinesh is passionate about the latest Java technologies and loves to write technical blogs related to it. He is a very active member of the Java and Spring community on different forums. When it comes to the Spring Framework and Java, Dinesh tops the list!