Sqoop split-by problem

I set the number of mappers to 10, but Sqoop still writes the data into only 5 files.
sqoop import --connect jdbc:mysql://localhost/retail_db \
  --username root \
  -P --table order_items -m 10 \
  --split-by order_item_quantity

When I run the query select distinct(order_item_quantity) from order_items, the result is 1, 2, 3, 4, 5.

Does this mean the number of mappers is irrelevant in this case?

Dec 19, 2018 in Big Data Hadoop by digger

1 answer to this question.

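Hello. The -m (or --num-mappers) option is only a hint to the engine about the degree of parallelism you want. Sqoop is not required to launch exactly that many tasks, and the actual mapper count can vary with your input data. Sqoop divides the range between the minimum and maximum values of the --split-by column among the mappers, so an integer column that only holds the values 1 to 5 cannot keep 10 mappers busy. The Sqoop client generates the record serialization and deserialization code, sets the input format, and submits the job to be run.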
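
The effect of a narrow split column can be sketched with a little shell arithmetic. This is a simplified illustration, not Sqoop's actual splitter code: useful_splits is a hypothetical helper, and it only assumes that a mapper needs at least one distinct integer value of the split column in order to receive rows.

```shell
# Simplified illustration only; this is not Sqoop's splitter code.
# useful_splits MIN MAX REQUESTED
# Prints an upper bound on how many mappers can receive rows when an
# integer split column only holds values between MIN and MAX.
useful_splits() {
  us_distinct=$(( $2 - $1 + 1 ))   # count of integers in [MIN, MAX]
  if [ "$3" -gt "$us_distinct" ]; then
    echo "$us_distinct"            # fewer distinct values than mappers
  else
    echo "$3"                      # enough values for every mapper
  fi
}

useful_splits 1 5 10      # range seen in the question; prints 5
useful_splits 1 1000 10   # hypothetical wider range; prints 10
```

With only the values 1 to 5 in order_item_quantity, at most 5 mappers can get data no matter how high -m is set. Splitting on a column with a wider range of values, such as a numeric primary key if the table has one, generally lets Sqoop use the requested parallelism.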

answered Dec 19, 2018 by Omkar
