MapFile in Pig

Question

What is the use of MapFile class in Pig?

Data_Nerd · Answer 1 · Jul 6, 2018

MapFile is a class which serves file-based map from keys to values.

A map is a directory containing two files, the data file, containing all keys and values in the map, and a smaller index file, containing a fraction of the keys. The fraction is determined by MapFile.Writer.getIndexInterval().

The index file is read entirely into memory. Thus, key implementations should try to keep themselves small. Map files are created by adding entries in-order.

answered Jul 6, 2018 by Data_Nerd
• 2,390 points

MapFile in Pig

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Big Data Hadoop

Use of MapReduce in PIG

GROUP and COGROUP in PIG

How to count number of rows in alias in PIG?

Hadoop Pig: How to include external jar file in PIG?

What do we exactly mean by “Hadoop” – the definition of Hadoop?

Hadoop Mapreduce word count Program

hadoop.mapred vs hadoop.mapreduce?

hadoop fs -put command?

Checkpointing in Hadoop

Bucketing vs Partitioning in HIve

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES