Published on May 04,2016
809 Views
Email Post

Apache Falcon is a framework for managing data life cycle in Hadoop clusters. It establishes relationship between various data and processing elements on a Hadoop environment, and also provides feed management services such as feed retention, replications across clusters, archival etc.

Let us first discuss how to setup Apache Falcon. Run the below given command to download git repository of Falcon:

Command: git clone https://git-wip-us.apache.org/repos/asf/falcon.git falcon

git-command-apache-falcon

To run falcon, you need to build it first.

Command: cd falcon

Command: export MAVEN_OPTS=”-Xmx1024m -XX:MaxPermSize=256m -noverify” && mvn clean install -DskipTests

Command: mvn clean assembly:assembly -DskipTests -DskipITs

Run-Apache-Falcon

Once you have built falcon, you will find a falcon package inside /falcon/distro/target/ directory.

Directory-Apache-Falcon

The commands for building falcon looks very easy, but you will face a lot issues before you see the Build Success message. I faced a lot of issues while building it for Hadoop-2.2.0

So to skip the pain of building Falcon , I am giving you a successfully built falcon package, which you can download using the below link.

https://edureka.wistia.com/medias/xw5cfzqmho/download?media_file_id=124642564

Unzip the file to get falcon-0.10 directory.

Command: unzip falcon-0.10-SNAPSHOT.zip

Set flacon environment variables in .bashrc file.

Command: sudo gedit .bashrc

Unzip-Apache-Falcon

Command: source .bashrc

You can go to falcon directory and see the files and directories inside it.

Command: cd falcon-0.10-SNAPSHOT/

Command: ls

View-directory-Apache-Falcon

You can find falcon scripts inside bin directory.

Scripts-Apache-Falcon

Run below command to start Falcon.

Command: ./bin/falcon-start

start-Apache-Falcon

You’ll see a new daemon FalconServer running now.

Command: jps

Falconserver-Apache-Falcon

Command: ./bin/falcon admin -version

Admin-version-Apache-Falcon

Open your browser, and go to localhost:15000. You can see Falcon web ui.

Falcon-webui-Apache-Falcon

Got a question for us? Mention them in the comment section and we will get back to you. 

Related Posts:

Get Started with Big Data and Hadoop

Drilling Down on Apache Drill

About Author
Awanish
Published on May 04,2016
Awanish is a Sr. Research Analyst at Edureka. He has rich expertise in Big Data technologies like Hadoop, Spark, Storm, Kafka, Flink. Awanish also holds good knowledge on Devops tools like Git, Jenkins, Docker, Puppet, Nagios. He is passionate about new technologies which provide realistic solutions.

Share on

Browse Categories

Comments
1 Comment