questions/apache-spark
You can access task information using TaskContext: import org.apache.spark.TaskContext sc.parallelize(Seq[Int](), ...
Hey, Unit is a subtype of scala.AnyVal and ...
Let's first look at mapper side differences Map ...
Hey, Real-time data processing is not possible directly ...
The HDFS path for MyLab is /user/edureka_id. ...
Yes, it is possible and is already ...
scala> val rdd1 = sc.parallelize(List(1,2,3,4,5)) - Creating ...
There are two methods to persist the ...
Hi @akhtar, Yes, Spark Streaming uses checkpoint. Checkpoint is ...
Seems like you have set the configuration ...
Your error is with the version of ...
Minimizing data transfers and avoiding shuffling helps ...
Hi @ritu, To start your Python Spark shell, you ...
Hi, RDD in Spark stands for Resilient Distributed ...
Hi @Srinath, It seems you didn't set Hadoop for ...
Hi, You can do it using map partition ...
Hey, We can append a Scala array to ...
Hi! I found 2 links on GitHub where ...
Hi @ritu, AWS has lots of services. For Spark ...
Firstly, it's the in-memory computation, if the file ...
You have set Zookeeper as the recovery ...
Though there is nothing wrong with the ...
You can increase the memory dynamically by ...
Can anyone suggest when we create an ...
I found the following solution to be ...
Hi, If you execute a bunch of programs, ...
Option a) List(5,100,10) The take method returns the first n elements in an ...
In the above statement, x(2) is specifying an array ...
Hey, Parquet is a columnar format file supported ...
Did you find any documents or example ...
Ganglia looks like a good option for ...
Hi @Shllpa, In general, we get the 401 status code ...
Hey, When we try to compare two instances ...
Hi, You can use a simple mathematical calculation ...
This is because the maximum number of ...
By default, the node or executor is ...
I suggest you check 2 things: that jquery.sparkline.js is actually ...
Hey, Scala executes a val when we define ...
In Hadoop MapReduce the input data is ...
You can set the property to directly ...
By default, only one core is used for ...
Option d) Run time error.
Hi @Neha, You can find all the job status ...
You can get the configuration details through ...
Hi, You can compute the average using this ...
Hi @asif, Please share with us the application ...
By default, a partition is created for ...
Speculation is enabled when a fraction of ...
Probably the spill is because you have ...
Hey, We have four kinds of identifiers in ...