I have recently started working on Hadoop and I have a curious question.
Are there some other distributed and scalable solutions as an alternatives to hadoop? Bascially, I am referring to implementions that are similar to HDFS that uses commodity hardwares and provide fault tolerant storag. Also, it should have processing engine on top of it to perform batch and real-time processing. I have heard about spark as an alternative but, I want a data warehousing solution which is distributed, fault tolerant and scalable. Suggestions are welcomed. Thanks :)