DFS (Distributed File System)
A DFS ties physical machines in different locations into one logical machine: every physical machine shares a common file system.
- A system that permanently stores data
- Data is divided into logical units (files, shards, chunks, blocks, etc.)
- A file path joins file and directory names into a relative or absolute address that identifies a file
- Supports access to files on remote servers
- Supports concurrency
- Supports distribution
- Supports replication (see the client sketch after this list)
- NFS, GPFS, Hadoop DFS, GlusterFS, MogileFS…
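To make the list concrete, here is a minimal sketch of a client talking to one such DFS (Hadoop's HDFS) through its Java FileSystem API. The NameNode address hdfs://namenode:9000, the path /user/demo/hello.txt, and the class name HdfsPathDemo are placeholder assumptions; the API calls themselves are the standard Hadoop client interface:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsPathDemo {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // fs.defaultFS points at the NameNode; this address is a placeholder.
        conf.set("fs.defaultFS", "hdfs://namenode:9000");
        FileSystem fs = FileSystem.get(conf);

        // An absolute path in the DFS namespace, independent of any physical machine.
        Path file = new Path("/user/demo/hello.txt");

        // Write: the client sees one file; HDFS splits it into blocks and replicates them.
        try (FSDataOutputStream out = fs.create(file)) {
            out.writeUTF("hello from one logical file system");
        }

        // Read it back from whichever DataNodes hold the replicas.
        try (FSDataInputStream in = fs.open(file)) {
            System.out.println(in.readUTF());
        }

        // Replication is per-file metadata handled by the DFS, not the client.
        System.out.println("replication = " + fs.getFileStatus(file).getReplication());
    }
}
```

The point of the sketch: the client names files by path in a single namespace; where the blocks physically live, and how many copies exist, is the file system's concern.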
Why DFS?
Because data sets outgrow a single machine: a DFS provides scalable storage, fault tolerance through replication, and concurrent access from many clients.
What is Hadoop?
Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of commodity computers using a simple programming model.
It is designed to scale up from a single server to thousands of machines, each offering local computation and storage.
At its core, Apache Hadoop is a library built in Java with the objective of managing huge amounts of data. It does this through components that understand the data, provide the right storage, and supply the right algorithms for analysing it.
Open Source Software + Commodity Hardware = IT cost reduction
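The "simple programming model" is MapReduce: you write a map function and a reduce function, and the framework handles distribution, scheduling, and fault tolerance across the cluster. Below is a sketch of the classic word-count job against Hadoop's Java MapReduce API; input and output paths are assumed to arrive as command-line arguments:

```java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

    // Map: emit (word, 1) for every word in this node's slice of the input.
    public static class TokenizerMapper
            extends Mapper<Object, Text, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text word = new Text();

        @Override
        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(value.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, ONE);
            }
        }
    }

    // Reduce: sum the counts gathered for each word across all mappers.
    public static class IntSumReducer
            extends Reducer<Text, IntWritable, Text, IntWritable> {
        private final IntWritable result = new IntWritable();

        @Override
        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            result.set(sum);
            context.write(key, result);
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        job.setCombinerClass(IntSumReducer.class); // local pre-aggregation on each node
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Note how little the code says about distribution: the framework splits the input across the cluster, runs the map tasks where the data lives (local computation and storage, as above), and shuffles intermediate pairs to the reducers.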
What is Hadoop used for?
- Searching
- Log Processing
- Recommendation systems
- Analytics
- Video and Image Analysis
- Data Retention
Companies Using Hadoop:
- Yahoo
- Amazon
- AOL
- IBM
- and many more
http://wiki.apache.org/hadoop/PoweredBy