Big Data and Hadoop (170 Blogs) Become a Certified Professional

Apache Falcon: New Data Management Platform For The Hadoop Ecosystem

Last updated on May 22,2019 2.7K Views

Awanish
Awanish is a Sr. Research Analyst at Edureka. He has rich expertise... Awanish is a Sr. Research Analyst at Edureka. He has rich expertise in Big Data technologies like Hadoop, Spark, Storm, Kafka, Flink. Awanish also...

Apache Falcon is a framework for managing data life cycle in Hadoop clusters. It establishes relationship between various data and processing elements on a Hadoop environment, and also provides feed management services such as feed retention, replications across clusters, archival etc.

Let us first discuss how to setup Apache Falcon. Run the below given command to download git repository of Falcon:

Command: git clone https://git-wip-us.apache.org/repos/asf/falcon.git falcon

git-command-apache-falcon

To run falcon, you need to build it first.

Command: cd falcon

Command: export MAVEN_OPTS=”-Xmx1024m -XX:MaxPermSize=256m -noverify” && mvn clean install -DskipTests

Command: mvn clean assembly:assembly -DskipTests -DskipITs

Run-Apache-Falcon

Once you have built falcon, you will find a falcon package inside /falcon/distro/target/ directory.

Directory-Apache-Falcon

The commands for building falcon looks very easy, but you will face a lot issues before you see the Build Success message. I faced a lot of issues while building it for Hadoop-2.2.0

So to skip the pain of building Falcon , I am giving you a successfully built falcon package, which you can download using the below link.

https://edureka.wistia.com/medias/xw5cfzqmho/download?media_file_id=124642564

Unzip the file to get falcon-0.10 directory.

Command: unzip falcon-0.10-SNAPSHOT.zip

Set flacon environment variables in .bashrc file.

Command: sudo gedit .bashrc

Unzip-Apache-Falcon

Command: source .bashrc

You can go to falcon directory and see the files and directories inside it.

Command: cd falcon-0.10-SNAPSHOT/

Command: ls

View-directory-Apache-Falcon

You can find falcon scripts inside bin directory.

Scripts-Apache-Falcon

Run below command to start Falcon.

Command: ./bin/falcon-start

start-Apache-Falcon

You’ll see a new daemon FalconServer running now.

Command: jps

Falconserver-Apache-Falcon

Command: ./bin/falcon admin -version

Admin-version-Apache-Falcon

Open your browser, and go to localhost:15000. You can see Falcon web ui.

Falcon-webui-Apache-Falcon

Got a question for us? Mention them in the comment section and we will get back to you. 

Related Posts:

Get Started with Big Data and Hadoop

Drilling Down on Apache Drill

Comments
1 Comment

Join the discussion

Browse Categories

webinar REGISTER FOR FREE WEBINAR
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP

Subscribe to our Newsletter, and get personalized recommendations.

image not found!
image not found!

Apache Falcon: New Data Management Platform For The Hadoop Ecosystem

edureka.co