questions/big-data-hadoop
You need to sort RDD and take ...
Hey Nafeesa, it seems that Hive is not able ...
For some reason, the jar hadoop-aws-[version].jar which contains the ...
One problem with the Hadoop system is ...
Writable is an interface in Hadoop and types ...
You can assign more memory by editing ...
We have to cache the values from ...
Yes, your approach is correct - you ...
Define the HADOOP_CONF_DIR environment variable to your Hadoop configuration ...
Use Parquet. I'm not sure about CSV ...
Make sure you've started YARN. Use this ...
The decision to choose a particular file ...
You can solve this by adding the below ...
Yes, Hadoop has a whole ecosystem of ...
You can check here. From the archives. In particular, ...
In order to forcefully let the namenode ...
You could pass the URI when getting ...
spark-csv is part of core Spark functionality ...
Hey there, instead of doing the file ...
For Ubuntu hosts file and other configuration for Hadoop ...
The total number of files in the ...
Hadoop is not designed for records about ...
MongoDB isn't built to work on top ...
If the replica stays out of the ...
Even though both are used for real-time ...
Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...
You could probably best use Hive's built-in sampling ...
Doc on Hadoop Streaming: http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...
Well, it's so easy. Just enter the below ...
One of the big features of Hadoop/map-reduce ...
Yes, now I have the whole idea ...
Hey, try this code: import java.io.IOException; import java.util.Iterator; import java.util.StringTokenizer; import ...
If you are simply looking to distribute ...
Hadoop provides redundancy by storing multiple replicas ...
HDFS supports fsck command to check for ...
If you have one avro file and ...
I suggest spending some time with Apache ...
For each MapReduce job, Hadoop stores the ...
You need to configure the client to ...
Ganglia is an open-source, scalable and distributed ...
You need to tune io.sort.mb value until ...
You can refer to the link below to ...
FileSystem.get(conf) may return the local file system where ...
Actually it comes in two ways: One ...
I had this problem as well. But when ...
Don't think that in Hadoop the same ...
The differences are as follows: Hadoop's MR can be ...
A few considerations to take into account: If ...
If you are trying to access your ...
It is because the parent directories do ...