Which is better in terms of speed, Shark or Spark?

0 votes
I am very confused about these two. I know that Shark is like Hive but up to 100x faster, and that it runs on Spark. I want to know the main difference between Spark and Shark. Which is faster?

When should I use Spark, and when should I use Shark?
Jun 25, 2018 in Apache Spark by Shubham
• 13,290 points
29 views

1 answer to this question.

0 votes

Spark is a framework for distributed data processing; you can write your code in Scala, Java, or Python. Shark was the earlier Hive-compatible SQL engine on Spark; it has since been replaced by Spark SQL, the SQL engine on top of Spark: you write SQL queries and they are executed by the Spark framework.

Here's Spark programming guide: https://spark.apache.org/docs/latest/programming-guide.html 

Here's Spark SQL guide: https://spark.apache.org/docs/latest/sql-programming-guide.html

So when you write a Spark SQL query, it is converted into Spark code and then executed. This means that, in general, you can write Spark code by hand that runs at the same speed as the equivalent Spark SQL query, or faster.

Hope this answers your question to some extent.

answered Jun 25, 2018 by nitinrawat895
• 10,490 points
