Why should we use "distinct" keyword in pig script?

0 votes

Hi,

I am new to Apache Pig and started working with the fundamentals. There I came through a keyword "Distinct" but did not understand why to use it. Can anyone tell the use of this keyword?

May 3 in Big Data Hadoop by disha
29 views

1 answer to this question.

0 votes

Hey,

The "distinct" statement is very simple. It removes duplicate records. It works only on entire records, not on individual fields.

answered May 3 by Gitika
• 25,340 points

Related Questions In Big Data Hadoop

0 votes
1 answer

Why should we use "extends Mapper" for Mapreduce code?

The Mapper class belongs to package org.apache.hadoop.mapreduce ...READ MORE

answered Feb 8 in Big Data Hadoop by Omkar
• 67,600 points
50 views
0 votes
1 answer

Why we use --split by command in Sqoop?

The command --split-by is used to specify the ...READ MORE

answered Apr 11 in Big Data Hadoop by Gitika
• 25,340 points
793 views
0 votes
0 answers

Why we use 'help' command in Hadoop Sqoop?

Use of help command in Hadoop sqoop. READ MORE

Apr 11 in Big Data Hadoop by amrita
35 views
0 votes
1 answer

Why do we need the FOR EACH operation in Pig Scripts?

The operation FOREACH in Apache Pig is ...READ MORE

answered Apr 30 in Big Data Hadoop by Gitika
• 25,340 points
28 views
0 votes
1 answer

What do we exactly mean by “Hadoop” – the definition of Hadoop?

The official definition of Apache Hadoop given ...READ MORE

answered Mar 16, 2018 in Big Data Hadoop by Shubham
210 views
0 votes
1 answer

Hadoop Mapreduce word count Program

Firstly you need to understand the concept ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,690 points
3,063 views
0 votes
1 answer

hadoop.mapred vs hadoop.mapreduce?

org.apache.hadoop.mapred is the Old API  org.apache.hadoop.mapreduce is the ...READ MORE

answered Mar 16, 2018 in Data Analytics by nitinrawat895
• 10,690 points
341 views
0 votes
10 answers

hadoop fs -put command?

put syntax: put <localSrc> <dest> copy syntax: copyFr ...READ MORE

answered Dec 7, 2018 in Big Data Hadoop by Aditya
15,047 views
0 votes
1 answer

Why we use Relation keyword in pig?

Hey, In pig, Relation represents a complete database. ...READ MORE

answered May 7 in Big Data Hadoop by Gitika
• 25,340 points
25 views
+1 vote
1 answer

Why do we use STORE command in pig?

Hey, We use store command to store the ...READ MORE

answered May 7 in Big Data Hadoop by Gitika
• 25,340 points
33 views