Useful Hadoop Shell Commands


HDFS stands for ‘Hadoop Distributed File System’. HDFS is a sub-project of the Apache Hadoop project, which is run by the Apache Software Foundation, and it provides a fault-tolerant file system designed to run on commodity hardware. HDFS is accessed through a set of shell commands, which are discussed in this post.

A short note before starting: All the Hadoop Shell commands are invoked by the bin/hadoop script.

User Commands:

Run the DFS file system checking utility:

Usage: hadoop fsck /

Check version of Hadoop:

Usage: hadoop version
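As a rough illustration of how the file system check might be used in a script, the sketch below wraps fsck in a small function. The function name check_hdfs_health is hypothetical, and the sketch assumes that the fsck summary contains the word HEALTHY when no problems are found, which should be verified against the output of your Hadoop version.

```shell
# Sketch: report whether fsck considers a path healthy.
# Assumption: the fsck summary contains the word HEALTHY for a healthy
# file system; check this against your own cluster's output.
check_hdfs_health() {
  # Default to the root path when no argument is given.
  if hadoop fsck "${1:-/}" 2>/dev/null | grep -q 'HEALTHY'; then
    echo "healthy: ${1:-/}"
  else
    echo "needs attention: ${1:-/}"
    return 1
  fi
}
```

Calling check_hdfs_health /user would limit the check to that subtree, and the non-zero return value lets a calling script stop when the check does not report a healthy state.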

FS Shell Commands:

The hadoop fs command runs a generic filesystem user client that interacts with HDFS and the other file systems that Hadoop supports.

View file listings:

Usage: hadoop fs -ls hdfs:/

Check free disk space:

Usage: hadoop fs -df hdfs:/

Count of Directories, Files and Bytes in specified path and file pattern:

Usage: hadoop fs -count hdfs:/
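The -count output is a single line of numbers followed by the path, which can be hard to read at a glance. The sketch below, using a hypothetical helper named count_summary, labels the columns. It assumes the default column order of directory count, file count, content size in bytes, and path name, and it assumes the path contains no spaces.

```shell
# Sketch: label the columns printed by `hadoop fs -count`.
# Assumed column order: directories, files, bytes, path.
# Paths containing spaces are not handled by this simple version.
count_summary() {
  hadoop fs -count "$1" |
    awk '{ printf "%s: %s dirs, %s files, %s bytes\n", $4, $1, $2, $3 }'
}
```

For a -count line such as `3 10 123456 /data`, count_summary /data would print `/data: 3 dirs, 10 files, 123456 bytes`.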

Move file from one location to another:

Usage: hadoop fs -mv <src> <dst>

Copy file from source to destination:

Usage: hadoop fs -cp <src> <dst>

Delete file:

Usage: hadoop fs -rm <path>

Put file from the local file system to Hadoop Distributed File System:

Usage: hadoop fs -put <localsrc> … <dst>

Copy file from local to HDFS (similar to -put, except that the source must be on the local file system):

Usage: hadoop fs -copyFromLocal <localsrc> … <dst>

View file in Hadoop Distributed File System:

Usage: hadoop fs -cat <src>
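To show how these file commands fit together, here is a minimal sketch that uploads a local file, prints it back from HDFS, renames it and then deletes it. The function name hdfs_roundtrip and the paths in the usage note are hypothetical, and the HDFS directory is assumed to exist already.

```shell
# Sketch: upload a local file, read it back, rename it, then remove it.
# The body runs in a subshell ( ... ) so its variables do not leak
# into the calling shell. Each step runs only if the previous one succeeded.
hdfs_roundtrip() (
  src="$1"                          # local file to upload
  target="$2/$(basename "$src")"    # destination path inside HDFS
  hadoop fs -put "$src" "$target" &&
  hadoop fs -cat "$target" &&
  hadoop fs -mv "$target" "${target}.bak" &&
  hadoop fs -rm "${target}.bak"
)
```

A call such as hdfs_roundtrip /tmp/notes.txt /user/demo would run the four fs commands in order against /user/demo/notes.txt.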

Administration Commands:

Format the namenode:

Usage: hadoop namenode -format

Starting Secondary namenode:

Usage: hadoop secondarynamenode

Run namenode :

Usage: hadoop namenode

Run data node:

Usage: hadoop datanode

Cluster Balancing:

Usage: hadoop balancer
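The balancer also accepts a -threshold option, given as a percentage of disk capacity. The sketch below wraps it in a hypothetical run_balancer function. Restricting the value to whole numbers and defaulting to 10 are simplifications made for this example, so check the balancer documentation for the range your version accepts.

```shell
# Sketch: run the balancer with an explicit threshold (percent of disk capacity).
# Only whole numbers are accepted here, which is a simplification of this example.
run_balancer() {
  case "${1:-10}" in
    *[!0-9]*)
      echo "threshold must be a whole number" >&2
      return 2
      ;;
  esac
  hadoop balancer -threshold "${1:-10}"
}
```

For example, run_balancer 5 would ask for a tighter balance than the value of 10 that this sketch uses when no argument is given.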

Run MapReduce job tracker node:

Usage: hadoop jobtracker

Run MapReduce task tracker node:

Usage: hadoop tasktracker

Got a question for us? Please mention it in the comments section and we will get back to you.
