Why should we use "distinct" keyword in pig script?

0 votes

Hi,

I am new to Apache Pig and started working with the fundamentals. There I came through a keyword "Distinct" but did not understand why to use it. Can anyone tell the use of this keyword?

May 3 in Big Data Hadoop by disha
16 views

1 answer to this question.

Your answer

Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
0 votes

Hey,

The "distinct" statement is very simple. It removes duplicate records. It works only on entire records, not on individual fields.

answered May 3 by Gitika
• 8,620 points

Related Questions In Big Data Hadoop

0 votes
1 answer

Why should we use "extends Mapper" for Mapreduce code?

The Mapper class belongs to package org.apache.hadoop.mapreduce ...READ MORE

answered Feb 8 in Big Data Hadoop by Omkar
• 66,050 points
40 views
0 votes
1 answer

Why we use --split by command in Sqoop?

The command --split-by is used to specify the ...READ MORE

answered Apr 11 in Big Data Hadoop by Gitika
• 8,620 points
48 views
0 votes
0 answers

Why we use 'help' command in Hadoop Sqoop?

Use of help command in Hadoop sqoop. READ MORE

Apr 11 in Big Data Hadoop by amrita
18 views
0 votes
1 answer

Why do we need the FOR EACH operation in Pig Scripts?

The operation FOREACH in Apache Pig is ...READ MORE

answered Apr 30 in Big Data Hadoop by Gitika
• 8,620 points
13 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
120 views
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 9,070 points
1,683 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 9,070 points
132 views
0 votes
10 answers

hadoop fs -put command?

copy command can be used to copy files ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Sujay
8,181 views
0 votes
1 answer

Why we use Relation keyword in pig?

Hey, In pig, Relation represents a complete database. ...READ MORE

answered May 7 in Big Data Hadoop by Gitika
• 8,620 points
9 views
+1 vote
1 answer

Why do we use STORE command in pig?

Hey, We use store command to store the ...READ MORE

answered May 7 in Big Data Hadoop by Gitika
• 8,620 points
8 views

© 2018 Brain4ce Education Solutions Pvt. Ltd. All rights Reserved.
"PMP®","PMI®", "PMI-ACP®" and "PMBOK®" are registered marks of the Project Management Institute, Inc. MongoDB®, Mongo and the leaf logo are the registered trademarks of MongoDB, Inc.