Outline
- Introduction
- Goals for this Module
- Distributed File System Basics
- Configuring HDFS
- Interacting With HDFS
- Common Example Operations
- HDFS Command Reference
- DFSAdmin Command Reference
- Using HDFS in MapReduce
- Using HDFS Programmatically
- HDFS Permissions and Security
- Additional HDFS Tasks
- Rebalancing Blocks
- Copying Large Sets of Files
- Decommissioning Nodes
- Verifying File System Health
- Rack Awareness
- HDFS Web Interface
- References
Introduction
HDFS, the Hadoop Distributed File System, is a distributed file system designed to hold very large amounts of data (terabytes or even petabytes), and provide high-throughput access to this information. Files are stored in a redundant fashion across multiple machines to ensure their durability to failure and high availability to very parallel applications. This module introduces the design of this distributed file system and instructions on how to operate it.
Goals for this Module:
- Understand the basic design of HDFS and how it relates to basic distributed file system concepts
- Learn how to set up and use HDFS from the command line
- Learn how to use HDFS in your applications
Excellent information i found here thanks to all
ReplyDeleteHadoop online Training
We also provide SAP Success Factors, SAP HR,SAP FICO,SAP ABAP Training in Chennai.
ReplyDeleteWhich is the Best SAP MM Training Institute in Chennai?
Who can provide Realtime SAP Training in Chennai?
Best SAP MM Training institues in Chennai?
For Free Live Demo @ Call to 8122241286.
www.thecreatingexperts.com
SAP HR
SAP SF