Thursday 4 October 2012

Hadoop Distributed File System - Hadoop Online Training


Outline

  1. Introduction
  2. Goals for this Module
  3. Distributed File System Basics
  4. Configuring HDFS
  5. Interacting With HDFS
    1. Common Example Operations
    2. HDFS Command Reference
    3. DFSAdmin Command Reference
  6. Using HDFS in MapReduce
  7. Using HDFS Programmatically
  8. HDFS Permissions and Security
  9. Additional HDFS Tasks
    1. Rebalancing Blocks
    2. Copying Large Sets of Files
    3. Decommissioning Nodes
    4. Verifying File System Health
    5. Rack Awareness
  10. HDFS Web Interface
  11. References

Introduction

HDFS, the Hadoop Distributed File System, is a distributed file system designed to hold very large amounts of data (terabytes or even petabytes), and provide high-throughput access to this information. Files are stored in a redundant fashion across multiple machines to ensure their durability to failure and high availability to very parallel applications. This module introduces the design of this distributed file system and instructions on how to operate it.

Goals for this Module:

  • Understand the basic design of HDFS and how it relates to basic distributed file system concepts
  • Learn how to set up and use HDFS from the command line
  • Learn how to use HDFS in your applications

2 comments:

  1. Excellent information i found here thanks to all
    Hadoop online Training

    ReplyDelete
  2. We also provide SAP Success Factors, SAP HR,SAP FICO,SAP ABAP Training in Chennai.
    Which is the Best SAP MM Training Institute in Chennai?
    Who can provide Realtime SAP Training in Chennai?
    Best SAP MM Training institues in Chennai?
    For Free Live Demo @ Call to 8122241286.
    www.thecreatingexperts.com
    SAP HR
    SAP SF

    ReplyDelete