What is Kafka, and what is its importance in Big Data?

0 votes
I have set up a cluster using Apache Kafka and ZooKeeper, but I still don't understand what they are used for in the cluster. When are both required, and for what purpose?
Apr 11, 2019 in Big Data Hadoop by nitinrawat895
• 10,950 points
508 views

1 answer to this question.

0 votes

Apache Kafka is a distributed platform for streaming real-time data.

Kafka consists of:

  • Producer
  • Consumer
  • Broker
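  • ZooKeeper
A producer publishes messages (records) to a topic.
A consumer subscribes to a topic and reads the messages published to it.
The messages published to a topic are stored on the brokers, the servers that make up the Kafka cluster.
ZooKeeper coordinates the cluster: it tracks which brokers are alive, stores cluster metadata such as topic configuration, and is used to elect the controller broker. Newer Kafka releases can also run without ZooKeeper in KRaft mode, so ZooKeeper is required only on clusters that still rely on it for coordination.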
The image below depicts the working architecture of ZooKeeper.
[Image: ZooKeeper architecture diagram]
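To make the producer, consumer, and broker roles described above concrete, here is a minimal in-memory sketch. It is a toy model and not the Kafka client API: the classes `ToyBroker`, `ToyProducer`, and `ToyConsumer`, the topic name `clicks`, and the sample messages are all hypothetical names chosen for illustration. It assumes a single broker and a single partition per topic, and it leaves out ZooKeeper, replication, and networking entirely.

```python
from collections import defaultdict


class ToyBroker:
    """In-memory stand-in for a broker: each topic is an append-only log."""

    def __init__(self):
        self.logs = defaultdict(list)

    def append(self, topic, message):
        # Store the message at the end of the topic's log and return its offset.
        self.logs[topic].append(message)
        return len(self.logs[topic]) - 1

    def read(self, topic, offset):
        # Return every message in the topic from the given offset onward.
        return self.logs[topic][offset:]


class ToyProducer:
    """Publishes messages to a topic on the broker."""

    def __init__(self, broker):
        self.broker = broker

    def send(self, topic, message):
        return self.broker.append(topic, message)


class ToyConsumer:
    """Reads a topic and remembers its own position (offset) in the log."""

    def __init__(self, broker, topic):
        self.broker = broker
        self.topic = topic
        self.offset = 0

    def poll(self):
        # Fetch only the messages this consumer has not seen yet.
        records = self.broker.read(self.topic, self.offset)
        self.offset += len(records)
        return records


broker = ToyBroker()
producer = ToyProducer(broker)
dashboard = ToyConsumer(broker, "clicks")  # hypothetical real-time reader
archiver = ToyConsumer(broker, "clicks")   # hypothetical batch reader

producer.send("clicks", "user1:/home")
producer.send("clicks", "user2:/cart")

first_batch = dashboard.poll()   # the two messages sent so far
producer.send("clicks", "user1:/checkout")
second_batch = dashboard.poll()  # only the message sent since the last poll
archived = archiver.poll()       # all three, because it tracks its own offset
```

The point of the sketch is that the broker keeps the messages after they are read, so several independent consumers can each process the same stream at their own pace. That decoupling between the systems that produce data and the systems that process it is the main reason Kafka is used in Big Data pipelines.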
answered Apr 11, 2019 by ravikiran
• 4,600 points
